Query lcl|NC_020488.1_cdsid_YP_007518354.1 [gene=I904_gp12] [protein=hypothetical protein] [protein_id=YP_007518354.1] [location=complement(18880..20946)]
Match_columns 688
No_of_seqs 175 out of 277
Neff 9.5
Searched_HMMs 1612
Date Thu Nov 7 16:14:21 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:108295 Length: 711 100.0 1E-171 8E-175 957.6 74.4 685 1-688 13-708 (711)
2 protein:vir:100920 Length: 725 100.0 4E-156 3E-159 872.4 67.8 655 11-688 1-687 (725)
3 protein:vir:77597 Length: 725 100.0 4E-156 2E-159 872.7 67.0 657 13-688 1-687 (725)
4 protein:vir:172 Length: 708 # 100.0 2E-155 1E-158 868.5 68.9 657 12-688 1-691 (708)
5 protein:vir:9263 Length: 725 # 100.0 2E-155 1E-158 868.5 67.3 655 11-688 1-687 (725)
6 protein:vir:105429 Length: 708 100.0 2E-154 1E-157 863.0 70.1 662 12-688 1-701 (708)
7 protein:vir:3520 Length: 720 # 100.0 2E-154 1E-157 863.1 68.5 659 12-688 1-697 (720)
8 protein:vir:105520 Length: 706 100.0 1E-153 8E-157 858.7 67.8 658 12-688 1-693 (706)
9 protein:vir:105619 Length: 772 100.0 1E-153 9E-157 858.5 67.2 649 1-688 1-693 (772)
10 protein:vir:3296 Length: 714 # 100.0 1E-149 6E-153 837.4 71.6 654 1-688 1-711 (714)
11 protein:vir:2764 Length: 714 # 100.0 1E-149 6E-153 837.4 71.6 654 1-688 1-711 (714)
12 protein:vir:10117 Length: 714 100.0 1E-149 6E-153 837.4 71.6 654 1-688 1-711 (714)
13 protein:vir:817 Length: 714 # 100.0 1E-149 6E-153 837.4 71.6 654 1-688 1-711 (714)
14 protein:vir:9950 Length: 714 # 100.0 1E-149 6E-153 837.4 71.6 654 1-688 1-711 (714)
15 protein:vir:104437 Length: 714 100.0 9E-148 6E-151 826.8 69.8 654 1-688 1-711 (714)
16 protein:vir:93630 Length: 776 100.0 2E-146 1E-149 820.0 64.0 651 1-688 22-711 (776)
17 protein:vir:95821 Length: 763 100.0 2.1E-87 1.3E-90 495.8 58.8 600 1-688 8-721 (763)
18 protein:vir:8846 Length: 705 # 100.0 5.5E-85 3.4E-88 482.5 60.7 585 2-688 1-693 (705)
19 protein:vir:80165 Length: 651 100.0 1.4E-62 8.6E-66 359.7 53.9 573 1-683 2-651 (651)
20 protein:vir:345 Length: 663 # 100.0 2.4E-57 1.5E-60 331.0 44.6 602 4-688 1-660 (663)
21 protein:vir:95449 Length: 584 100.0 1.2E-47 7.4E-51 277.8 38.8 528 8-638 1-584 (584)
22 protein:vir:94599 Length: 641 100.0 2E-44 1.2E-47 260.2 43.8 568 1-681 4-641 (641)
23 protein:vir:3139 Length: 599 # 100.0 1.3E-38 7.9E-42 228.3 35.8 551 8-645 1-599 (599)
24 protein:vir:3361 Length: 535 # 99.9 3.4E-22 2.1E-25 138.3 44.9 521 7-688 1-534 (535)
25 protein:vir:95315 Length: 559 99.9 2.4E-22 1.5E-25 139.1 43.5 545 15-685 1-559 (559)
26 protein:vir:1538 Length: 535 # 99.9 5.1E-22 3.2E-25 137.3 45.0 520 7-688 1-534 (535)
27 protein:vir:103765 Length: 549 99.9 4.5E-22 2.8E-25 137.6 44.3 529 8-651 1-549 (549)
28 protein:vir:10447 Length: 536 99.9 2E-22 1.2E-25 139.5 42.1 522 8-669 1-536 (536)
29 protein:vir:2198 Length: 536 # 99.9 2.5E-22 1.6E-25 139.0 41.7 520 8-669 1-536 (536)
30 protein:vir:7321 Length: 556 # 99.9 2.3E-21 1.4E-24 133.7 45.0 541 8-674 1-556 (556)
31 protein:vir:107404 Length: 555 99.9 1.7E-21 1.1E-24 134.4 43.2 537 8-688 1-552 (555)
32 protein:vir:107822 Length: 555 99.9 1.7E-21 1.1E-24 134.4 43.2 537 8-688 1-552 (555)
33 protein:vir:98506 Length: 555 99.9 1.7E-21 1.1E-24 134.4 43.2 537 8-688 1-552 (555)
34 protein:vir:96494 Length: 501 99.9 1.1E-22 7E-26 140.9 33.6 467 1-635 19-501 (501)
35 protein:vir:1785 Length: 555 # 99.9 2E-21 1.2E-24 134.1 38.3 528 16-688 1-555 (555)
36 protein:vir:102668 Length: 547 99.9 1E-20 6.5E-24 130.1 42.2 536 18-663 1-547 (547)
37 protein:vir:94709 Length: 522 99.9 8.5E-20 5.3E-23 125.1 44.0 514 8-676 1-522 (522)
38 protein:vir:2732 Length: 501 # 99.9 9.2E-22 5.7E-25 135.9 32.3 464 1-645 19-501 (501)
39 protein:vir:94572 Length: 535 99.9 1.1E-19 6.7E-23 124.6 42.3 527 7-687 1-535 (535)
40 protein:vir:8883 Length: 543 # 99.9 1.8E-19 1.1E-22 123.3 43.1 533 7-681 1-543 (543)
41 protein:vir:97171 Length: 512 99.9 1.6E-20 9.7E-24 129.2 34.9 475 1-645 20-512 (512)
42 protein:vir:3609 Length: 452 # 99.9 2.8E-20 1.7E-23 127.8 35.2 437 1-649 10-452 (452)
43 protein:vir:95806 Length: 440 99.9 2.3E-20 1.5E-23 128.2 34.3 427 19-610 1-440 (440)
44 protein:vir:99672 Length: 532 99.8 7.9E-19 4.9E-22 119.8 41.5 520 7-656 1-532 (532)
45 protein:vir:102950 Length: 471 99.8 8.8E-20 5.4E-23 125.1 36.1 452 14-612 1-471 (471)
46 protein:vir:99522 Length: 470 99.8 9.8E-20 6.1E-23 124.8 36.2 444 1-612 18-470 (470)
47 protein:vir:4898 Length: 502 # 99.8 1.4E-20 8.6E-24 129.4 31.4 456 1-610 20-502 (502)
48 protein:vir:105461 Length: 470 99.8 5.9E-20 3.7E-23 126.0 34.7 455 18-611 1-470 (470)
49 protein:vir:78805 Length: 511 99.8 1.4E-19 8.4E-23 124.0 35.1 468 1-645 30-511 (511)
50 protein:vir:96366 Length: 511 99.8 1.4E-19 8.4E-23 124.0 35.1 468 1-645 30-511 (511)
51 protein:vir:3964 Length: 453 # 99.8 1.2E-19 7.2E-23 124.4 33.8 433 1-610 10-453 (453)
52 protein:vir:96240 Length: 511 99.8 2.6E-19 1.6E-22 122.5 35.6 465 1-645 30-511 (511)
53 protein:vir:105292 Length: 478 99.8 1.4E-19 8.4E-23 124.0 33.5 459 1-610 5-478 (478)
54 protein:vir:9871 Length: 429 # 99.8 2.8E-19 1.7E-22 122.3 35.2 424 15-610 1-429 (429)
55 protein:vir:93747 Length: 472 99.8 2.1E-19 1.3E-22 122.9 34.5 455 1-612 1-472 (472)
56 protein:vir:78696 Length: 542 99.8 2.2E-19 1.3E-22 122.9 34.2 524 19-688 1-538 (542)
57 protein:vir:100039 Length: 522 99.8 1.3E-18 8.3E-22 118.6 38.4 509 21-668 1-522 (522)
58 protein:vir:95113 Length: 474 99.8 7.7E-20 4.8E-23 125.4 31.5 443 1-610 6-474 (474)
59 protein:vir:733 Length: 453 # 99.8 6.1E-19 3.8E-22 120.4 35.9 438 1-641 3-453 (453)
60 protein:vir:95899 Length: 474 99.8 1E-19 6.3E-23 124.7 31.6 459 1-645 6-474 (474)
61 protein:vir:96266 Length: 474 99.8 1E-19 6.3E-23 124.7 31.6 459 1-645 6-474 (474)
62 protein:vir:103951 Length: 511 99.8 2.6E-19 1.6E-22 122.5 33.8 468 1-645 30-511 (511)
63 protein:vir:96988 Length: 516 99.8 1.3E-18 7.9E-22 118.7 37.5 505 8-646 1-516 (516)
64 protein:vir:9306 Length: 511 # 99.8 3.9E-19 2.4E-22 121.5 33.9 468 1-645 30-511 (511)
65 protein:vir:80680 Length: 441 99.8 1E-18 6.3E-22 119.2 36.1 434 12-637 1-441 (441)
66 protein:vir:103330 Length: 517 99.8 3.8E-18 2.4E-21 116.1 39.1 508 12-663 1-517 (517)
67 protein:vir:96179 Length: 468 99.8 4.6E-19 2.8E-22 121.1 34.0 447 1-608 5-468 (468)
68 protein:vir:102330 Length: 451 99.8 4.9E-19 3.1E-22 121.0 34.0 438 15-608 1-451 (451)
69 protein:vir:99781 Length: 511 99.8 2.1E-19 1.3E-22 123.0 31.7 468 1-645 30-511 (511)
70 protein:vir:94101 Length: 474 99.8 1.3E-18 8.3E-22 118.6 36.0 449 1-610 3-474 (474)
71 protein:vir:105889 Length: 474 99.8 1.3E-18 8.3E-22 118.6 36.0 449 1-610 3-474 (474)
72 protein:vir:106639 Length: 481 99.8 1.3E-18 8E-22 118.7 35.7 447 1-613 22-481 (481)
73 protein:vir:94498 Length: 474 99.8 6.9E-19 4.3E-22 120.2 33.2 459 1-628 6-474 (474)
74 protein:vir:97447 Length: 474 99.8 6.9E-19 4.3E-22 120.2 33.2 459 1-628 6-474 (474)
75 protein:vir:5961 Length: 503 # 99.8 2.6E-19 1.6E-22 122.5 30.7 473 1-621 4-503 (503)
76 protein:vir:79043 Length: 479 99.8 3.2E-18 2E-21 116.5 36.4 462 1-637 6-479 (479)
77 protein:vir:107112 Length: 478 99.8 2.2E-18 1.4E-21 117.4 33.9 456 1-610 1-478 (478)
78 protein:vir:78942 Length: 510 99.8 3E-17 1.9E-20 111.1 40.1 500 16-657 1-510 (510)
79 protein:vir:1236 Length: 483 # 99.8 2.9E-18 1.8E-21 116.7 33.9 452 1-612 12-483 (483)
80 protein:vir:9922 Length: 489 # 99.8 8.2E-18 5.1E-21 114.3 35.7 456 2-607 1-489 (489)
81 protein:vir:38 Length: 496 # N 99.8 3E-17 1.8E-20 111.2 38.6 457 15-610 1-496 (496)
82 protein:vir:97336 Length: 492 99.8 1.6E-17 1E-20 112.6 35.7 457 1-619 21-492 (492)
83 protein:vir:94805 Length: 492 99.8 8.9E-18 5.5E-21 114.1 34.0 457 1-647 21-492 (492)
84 protein:vir:96839 Length: 474 99.8 4E-18 2.5E-21 115.9 32.0 458 1-649 5-474 (474)
85 protein:vir:94546 Length: 506 99.8 8.5E-18 5.3E-21 114.2 33.8 457 1-616 14-506 (506)
86 protein:vir:7017 Length: 515 # 99.8 1.1E-16 7.1E-20 108.0 39.8 499 11-646 1-515 (515)
87 protein:vir:1587 Length: 508 # 99.8 3.7E-17 2.3E-20 110.6 37.0 470 17-610 1-508 (508)
88 protein:vir:105641 Length: 516 99.8 2.9E-16 1.8E-19 105.8 41.1 503 8-646 1-516 (516)
89 protein:vir:6322 Length: 510 # 99.8 2E-16 1.2E-19 106.7 39.8 502 19-657 1-510 (510)
90 protein:vir:106571 Length: 499 99.8 1.2E-17 7.8E-21 113.3 32.8 466 8-642 1-499 (499)
91 protein:vir:80959 Length: 499 99.7 1.9E-16 1.2E-19 106.8 35.4 462 15-610 1-499 (499)
92 protein:vir:80211 Length: 514 99.7 2.6E-15 1.6E-18 100.5 41.7 495 23-639 1-514 (514)
93 protein:vir:2427 Length: 485 # 99.7 7.4E-17 4.6E-20 109.0 33.1 465 1-637 4-485 (485)
94 protein:vir:78537 Length: 480 99.7 7.8E-18 4.8E-21 114.4 27.5 469 8-642 1-480 (480)
95 protein:vir:9751 Length: 422 # 99.7 2.4E-16 1.5E-19 106.2 35.5 407 11-590 1-422 (422)
96 protein:vir:79703 Length: 505 99.7 2.7E-16 1.7E-19 106.0 35.5 460 17-600 1-505 (505)
97 protein:vir:94742 Length: 409 99.7 2.2E-16 1.4E-19 106.4 35.0 396 15-577 1-409 (409)
98 protein:vir:2341 Length: 488 # 99.7 1.9E-16 1.2E-19 106.8 33.3 463 8-614 1-488 (488)
99 protein:vir:78083 Length: 537 99.7 1.3E-15 8.2E-19 102.1 37.4 513 1-648 1-537 (537)
100 protein:vir:98883 Length: 517 99.7 1.2E-16 7.5E-20 107.9 31.3 471 17-608 1-517 (517)
101 protein:vir:78227 Length: 480 99.7 2.6E-17 1.6E-20 111.5 27.4 466 8-642 1-480 (480)
102 protein:vir:78907 Length: 518 99.7 1.6E-15 1E-18 101.7 36.3 484 17-607 1-518 (518)
103 protein:vir:3028 Length: 500 # 99.7 1.1E-15 7E-19 102.5 34.5 454 17-608 1-500 (500)
104 protein:vir:9815 Length: 500 # 99.7 1.1E-15 7E-19 102.5 34.5 454 17-608 1-500 (500)
105 protein:vir:4223 Length: 486 # 99.7 8.2E-16 5.1E-19 103.3 33.0 458 1-618 4-486 (486)
106 protein:vir:9568 Length: 410 # 99.7 1.5E-15 9.2E-19 101.9 34.0 397 34-592 1-410 (410)
107 protein:vir:7768 Length: 484 # 99.7 2E-16 1.2E-19 106.7 28.8 459 1-617 4-484 (484)
108 protein:vir:1634 Length: 409 # 99.7 1.3E-15 8.1E-19 102.2 33.2 397 15-577 1-409 (409)
109 protein:vir:104082 Length: 485 99.7 6.7E-16 4.1E-19 103.8 29.7 473 4-665 1-485 (485)
110 protein:vir:102602 Length: 456 99.7 6.6E-15 4.1E-18 98.3 34.4 437 11-638 1-456 (456)
111 protein:vir:105819 Length: 456 99.7 6.6E-15 4.1E-18 98.3 34.4 437 11-638 1-456 (456)
112 protein:vir:8184 Length: 474 # 99.7 7.1E-15 4.4E-18 98.2 34.0 450 1-640 1-474 (474)
113 protein:vir:7987 Length: 456 # 99.6 9.1E-15 5.7E-18 97.6 32.9 442 11-638 1-456 (456)
114 protein:vir:99916 Length: 504 99.6 8.2E-14 5.1E-17 92.3 36.3 451 1-613 1-504 (504)
115 protein:vir:99072 Length: 479 99.6 4.6E-15 2.9E-18 99.2 29.3 469 7-655 1-479 (479)
116 protein:vir:2500 Length: 501 # 99.6 8.9E-15 5.5E-18 97.6 30.5 479 1-639 10-501 (501)
117 protein:vir:7430 Length: 563 # 99.6 1.1E-14 7.1E-18 97.0 30.0 508 2-609 1-563 (563)
118 protein:vir:4782 Length: 522 # 99.6 3.5E-14 2.2E-17 94.3 32.3 485 17-609 1-522 (522)
119 protein:vir:101494 Length: 527 99.6 1.4E-13 8.5E-17 91.1 32.4 493 2-608 1-527 (527)
120 protein:vir:102239 Length: 527 99.6 1.4E-13 8.9E-17 91.0 32.4 493 2-608 1-527 (527)
121 protein:vir:98444 Length: 434 99.5 3.9E-14 2.4E-17 94.1 26.1 422 47-641 1-434 (434)
122 protein:vir:3520 Length: 720 # 98.9 2.3E-08 1.4E-11 62.4 31.2 605 16-688 1-692 (720)
123 protein:vir:9263 Length: 725 # 98.6 3E-07 1.9E-10 56.3 34.9 618 20-688 1-683 (725)
124 protein:vir:77597 Length: 725 98.5 3.9E-07 2.4E-10 55.7 34.7 619 20-688 1-683 (725)
125 protein:vir:100920 Length: 725 98.5 4.8E-07 3E-10 55.2 31.4 621 20-688 1-683 (725)
126 protein:vir:78393 Length: 489 98.3 1.9E-06 1.2E-09 51.9 28.3 472 1-606 1-489 (489)
127 protein:vir:94956 Length: 452 98.1 5.6E-06 3.5E-09 49.4 28.1 446 2-611 1-452 (452)
128 protein:vir:95014 Length: 491 98.0 6.6E-06 4.1E-09 49.0 30.2 474 1-610 1-491 (491)
129 protein:vir:103385 Length: 666 97.7 9.2E-06 5.7E-09 48.2 15.7 571 1-661 3-666 (666)
130 protein:vir:96403 Length: 666 97.7 7E-06 4.3E-09 48.9 14.5 571 1-661 3-666 (666)
131 protein:vir:8846 Length: 705 # 97.4 8.1E-05 5E-08 43.0 31.1 573 8-688 1-674 (705)
132 protein:vir:95149 Length: 501 97.2 0.00012 7.6E-08 42.0 28.3 475 2-610 1-501 (501)
133 protein:vir:105429 Length: 708 97.2 0.00014 8.8E-08 41.7 31.3 616 16-688 1-691 (708)
134 protein:vir:105520 Length: 706 97.1 0.00016 9.6E-08 41.5 35.7 599 8-688 1-698 (706)
135 protein:vir:80453 Length: 535 96.9 0.00028 1.7E-07 40.1 32.1 478 1-616 17-535 (535)
136 protein:vir:95821 Length: 763 96.8 0.00035 2.2E-07 39.5 31.0 584 1-688 63-724 (763)
137 protein:vir:97265 Length: 513 96.3 0.00081 5.1E-07 37.5 29.2 462 2-611 1-513 (513)
138 protein:vir:172 Length: 708 # 96.0 0.0011 6.9E-07 36.8 36.5 609 8-688 1-701 (708)
139 protein:vir:96783 Length: 488 95.7 0.0016 9.6E-07 36.0 30.5 449 1-594 13-488 (488)
140 protein:vir:108295 Length: 711 93.7 0.0066 4.1E-06 32.5 33.4 577 1-688 1-701 (711)
141 protein:vir:10117 Length: 714 46.3 0.74 0.00046 21.3 35.3 546 1-682 95-714 (714)
142 protein:vir:817 Length: 714 # 46.3 0.74 0.00046 21.3 35.3 546 1-682 95-714 (714)
143 protein:vir:3296 Length: 714 # 46.3 0.74 0.00046 21.3 35.3 546 1-682 95-714 (714)
144 protein:vir:9950 Length: 714 # 46.3 0.74 0.00046 21.3 35.3 546 1-682 95-714 (714)
145 protein:vir:2764 Length: 714 # 46.3 0.74 0.00046 21.3 35.3 546 1-682 95-714 (714)
146 protein:vir:78641 Length: 278 42.0 0.9 0.00056 20.8 17.8 267 173-510 1-278 (278)
147 protein:vir:104437 Length: 714 36.6 1.2 0.00072 20.2 33.0 539 1-682 95-714 (714)
148 protein:vir:80165 Length: 651 28.9 1.7 0.0011 19.3 26.2 527 1-657 72-651 (651)
149 protein:vir:80128 Length: 466 23.8 2.3 0.0014 18.6 10.1 94 581-688 1-108 (466)

No 1
>protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811
Probab=100.00 E-value=1.2e-171 Score=957.58 Aligned_cols=685 Identities=52% Similarity=0.874 Sum_probs=603.7

Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH
Q
lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ 80 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~ 80 (688) ++.++.+...+++++.+.++++++++|+++++++++||.++.++++||+|+||++++++.|+.+||||++||+|+|+|++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~ 92 (711) T protein:vir:10 13 LYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQ 92 (711) T ss_pred hhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCcEEEcchHHHHHH Confidence 66677777888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhCCcceEEEeCCcccc-------ccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 81 VLGDQRQNRPAIQVHPVEANAT-------KDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 81 i~g~~~~~r~~~~v~pr~~~~~-------~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) |+|++++||++++|+||+++.. ..+...+.+.+.+|.++|++||++++|+++.|+++++++++|+++++||+| T Consensus 93 v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G 172 (711) T protein:vir:10 93 VLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMG 172 (711) T ss_pred HhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcc Confidence 9999999999999999985422 233556677888999999999999999999999999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccC Q lcl|NC_020488. 
154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWW 233 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~ 233 (688) |++|++||..+++++++|+|.+|++|++|||||+|+++|+|||+|+|+++|||+++|+++||+++..++...+...++.| T Consensus 173 ~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~ 252 (711) T protein:vir:10 173 YLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTW 252 (711) T ss_pred eEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999888887777888889 Q ss_pred CCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCC Q lcl|NC_020488. 234 TNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGS 313 (688) Q Consensus 234 ~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~ 313 (688) +++++|||+|||+++++..+++.+.+|++++.+..++.+..+...|...+..+.+++++|+|++++|+++|++++||||+ T Consensus 253 ~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~ 332 (711) T protein:vir:10 253 FTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPST 332 (711) T ss_pred cCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec Q lcl|NC_020488. 
314 TIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY 393 (688) Q Consensus 314 ~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 393 (688) +||||||||++.+++++.+++|+||.|+|+|+++|+++|+++|+++++++++|++++|++++.++.|.+.+..+++++++ T Consensus 333 ~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~ 412 (711) T protein:vir:10 333 TIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTY 412 (711) T ss_pred cccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEe Confidence 99999999999999999999999999999999999999999999999999999999999999999888776666666655 Q ss_pred Ccc-cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 NAI-PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRR 472 (688) Q Consensus 394 ~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~ 472 (688) +++ .+..++++.+++++|+++++|++++.++|+++|||+++++|..+|++||+||++++++|++++++++|||++++++ T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~ 492 (711) T protein:vir:10 413 IPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRR 492 (711) T ss_pred cccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554 4556889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 
473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +|+++|+||++|||++|+|||+|++++.+||.||..++++.+|.+++.||+++|+|||+|+++|+++|+|++.+..|+++ T Consensus 493 ~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql 572 (711) T protein:vir:10 493 VGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQF 572 (711) T ss_pred HHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhh---hhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGI---EPPQPSPEQQANMAQAQADMEKAKADT 629 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~q~~~~~~q~~~~~~q~e~ 629 (688) ++.+|++++++++++++++++|+++++.+++++..+++........+.++ +++++..+++.+++++|++..+++++. T Consensus 573 ~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~ 652 (711) T protein:vir:10 573 AQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADT 652 (711) T ss_pred HhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999988877655433332222 222222333445556666666777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 630 AKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 630 ~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) +++++++++++++..+.+.+...+..+++.+ ++..+++...+++++ .+++..||+. 
T Consensus 653 ~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~--~~~~qq~~~~l~~~q-aelq~~q~~~ 708 (711) T protein:vir:10 653 AQAQADMLKAQLETEEAQKQLAMIEDMAQGG--DVVYQQVRELVAQAL-AEITASQANV 708 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHh Confidence 7777777666666655555544433332221 222223222222222 2333344444 No 2 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=4.2e-156 Score=872.43 Aligned_cols=655 Identities=23% Similarity=0.298 Sum_probs=522.7 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCc Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRP 90 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~ 90 (688) .. +.+.+|.+++.+|+++++++++||.++.++++||+|+||++++++.|+.+|+| +||+|+|+|++|+|++++||+ T Consensus 1 m~--d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp--~~N~i~~~v~~v~g~e~~nr~ 76 (725) T protein:vir:10 1 MA--DNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CC--chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCc Confidence 11 23568999999999999999999999999999999999999999999999998 579999999999999999999 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 
91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) +++|+|++++ |.++|++||++++|+++.|+++++++++|+++++||+||++|++||.++++++++ T Consensus 77 d~~v~p~~~~---------------d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~ 141 (725) T protein:vir:10 77 DVLYRPKDGA---------------SPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNN 141 (725) T ss_pred ceEEecCCcc---------------hHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCc Confidence 9999999763 8999999999999999999999999999999999999999999999998999988 Q ss_pred eeEEEe--c-ccceEEeCCcccccccccCceEEEEecCCHH---HHHHhcCCccchhccccc-ccccccCCCCCEEEEEE Q lcl|NC_020488. 171 LCIKSI--H-NRFAVLMDPDATEPDYSDANWCFISERMSKA---EFNKRYPGKAVGDLSDAE-RGEYSWWTNEEGVRVSE 243 (688) Q Consensus 171 ~~~~~v--~-~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~---e~~~~~p~~~~~~~~~~~-~~~~~~~~~~~~v~v~e 243 (688) ++|..+ + +|.+|||||+|+++|+|||+|+|+++||+++ +|.+.||+.+........ ...++.|+++++|||+| T Consensus 142 ~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E 221 (725) T protein:vir:10 142 QVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAE 221 (725) T ss_pred eeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEE Confidence 877654 3 5667999999999999999999999999975 566788876544322222 12234588899999999 Q ss_pred EEeeeecceeeeeccC---Cceeccc--ccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceE Q lcl|NC_020488. 
244 YFYREPVTRKLLLLSD---GRTVWED--EVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVA 318 (688) Q Consensus 244 ~~~~~~~~~~~~~~~~---g~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~v 318 (688) ||++.++...++.+.+ |..+.++ .....+..+...|..++..+.++++||+|+.++|+++|++|+||||++|||| T Consensus 222 ~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~v 301 (725) T protein:vir:10 222 FYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIV 301 (725) T ss_pred EEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEE Confidence 9999999888775543 5554433 3344566778889999999999999999999999999999999999999999 Q ss_pred EEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceee------ Q lcl|NC_020488. 319 PVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLR------ 392 (688) Q Consensus 319 p~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~------ 392 (688) ||||++.+++|++|++|+||.|+|+|+++|+++|+++|+++++++++++++.+++++.++.|...+.. +++. T Consensus 302 P~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~--~~~~~~~~~~ 379 (725) T protein:vir:10 302 PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLLNRTDE 379 (725) T ss_pred EEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCc--eeeecccccc Confidence 99999999999999999999999999999999999999999999999999999999887777644332 2222 Q ss_pred cCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 393 YNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRR 472 (688) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~ 472 (688) .++.....++.+.+++++|+++++|++.+.++|+++|||+++++|..+|++||+||++++++|++.+++++|||+++++. 
T Consensus 380 ~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~ 459 (725) T protein:vir:10 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) T ss_pred cCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333455788889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +|+++|+||++|||++|++||+|++|..++|.||..+.++.+|..++.||+ +|+|||+|+++|+++|+|++++..|+++ T Consensus 460 ~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi-~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:10 460 DGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDI-RGRYECYTDVGPSFQSMKQQNRSEILEL 538 (725) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhcc-ccceeEEEeeccCcHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999 5899999999999999999999999999 Q ss_pred HHhhHHHH---HHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhh----hhhhhhHHHHHHHH-------HH Q lcl|NC_020488. 553 VQAVPAAG---GVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGI----EPPQPSPEQQANMA-------QA 618 (688) Q Consensus 553 ~q~~~~~~---~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~q~~~~-------~~ 618 (688) ++++|+.. ..++..+++++++|+++++.++++++.+++....+..++.++ ++...++++..++. +. 
T Consensus 539 l~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~ 618 (725) T protein:vir:10 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQG 618 (725) T ss_pred HHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHH Confidence 99987543 345566788999999999999999887765443322111111 11111111111222 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 619 QADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 619 q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++++++++++..+++++..+.+.++...+++.+++..++...+ .+..+.+...++.++.+.....+++| T Consensus 619 qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q-~~~~~~~~~~~~~~q~~~~~~~~~~a 687 (725) T protein:vir:10 619 QAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSK-QSEFREFLKTVASFQQDRSEDARANA 687 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 3333333333333333333333333322222222211111111 11111111122223333333333333 No 3 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=3.8e-156 Score=872.66 Aligned_cols=657 Identities=22% Similarity=0.283 Sum_probs=523.9 Q ss_pred ccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCcce Q lcl|NC_020488. 
13 DDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAI 92 (688) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~ 92 (688) -.+.+.+|.+++.+|+++++++++||.++.++++||+|+||++++++.|+.+|+| ++|+|+|+|++|+|++++||+++ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~~~~nr~d~ 78 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ccccHHHHHHHHHhhHHhCCcce Confidence 1123568999999999999999999999999999999999999999999999998 57999999999999999999999 Q ss_pred EEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCccee Q lcl|NC_020488. 93 QVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLC 172 (688) Q Consensus 93 ~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~ 172 (688) +|+|++++ |.++|++||++++|+++.|+++++++++|+++++||+||++|++||..+++|+++++ T Consensus 79 ~v~P~~~~---------------d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:77 79 LYRPKDGA---------------RPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred EEecCCcc---------------HHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCcee Confidence 99999763 899999999999999999999999999999999999999999999999999999888 Q ss_pred EEEe--c-ccceEEeCCcccccccccCceEEEEecCCHHHHHHhcC---Cccchhccc-ccccccccCCCCCEEEEEEEE Q lcl|NC_020488. 173 IKSI--H-NRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYP---GKAVGDLSD-AERGEYSWWTNEEGVRVSEYF 245 (688) Q Consensus 173 ~~~v--~-~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p---~~~~~~~~~-~~~~~~~~~~~~~~v~v~e~~ 245 (688) |+++ + +|.+|||||+|+++|+|||+|+|+++|||+++++.+|| ..+...... 
.....++.|.++++|||+||| T Consensus 144 i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~ 223 (725) T protein:vir:77 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEE Confidence 7654 2 67789999999999999999999999999997776554 333222111 112224457788999999999 Q ss_pred eeeecceeeeeccC---Cce--ecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEE Q lcl|NC_020488. 246 YREPVTRKLLLLSD---GRT--VWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPV 320 (688) Q Consensus 246 ~~~~~~~~~~~~~~---g~~--~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~ 320 (688) |+.++...++...+ |.. ++.......+..+...|..++..+.++++||+|++++|+++|++|+||||++|||||| T Consensus 224 ~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:77 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEE Confidence 99999888876554 333 3334444556677888999999999999999999999999999999999999999999 Q ss_pred eeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCc----eeecCcc Q lcl|NC_020488. 321 LGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQS----VLRYNAI 396 (688) Q Consensus 321 ~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~----~~~~~~~ 396 (688) ||++.+++|++|++|+||.|+|+|+++|+++|+++|+++++++.++++.++++++.++.|...+..+.. +...++. 
T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCc Confidence 999999999999999999999999999999999999999999999999999999998888876654432 2223333 Q ss_pred cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 397 PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQI 476 (688) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~ 476 (688) ....++.+.+++++|+++++|++.+..+|+++|||+++++|..+|++||+||++++++|++.+++++|||+++++.+|++ T Consensus 384 ~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:77 384 LPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred ccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44457788899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 
477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) +|+||++|||++|++||+|+++..+++.||..+.++.+|..++.||++ |+|||+|+++|+++|+|+++++.|+++++++ T Consensus 464 lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~-g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~ 542 (725) T protein:vir:77 464 YQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhc-cceeeEEeeccchHHHHHHHHHHHHHHHHhc Confidence 999999999999999999999999999999999999999999999995 8999999999999999999999999999998 Q ss_pred HHHHH---HHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhh----hhhhhhHHHHHH-------HHHHHHHH Q lcl|NC_020488. 557 PAAGG---VVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGI----EPPQPSPEQQAN-------MAQAQADM 622 (688) Q Consensus 557 ~~~~~---~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~q~~-------~~~~q~~~ 622 (688) |+..+ .++..+++++++|+++++.++++++.++.....+.....++ ++...+.+++.+ .+++|+++ T Consensus 543 ~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~ 622 (725) T protein:vir:77 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAEL 622 (725) T ss_pred cccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 76443 45566678899999999999999877765443322211111 111111111112 22333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 623 EKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 623 ~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++++++..+++.+..+++++++..+++.+++..++...+. 
+..+++...++.++.++...++++| T Consensus 623 ~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~-a~~~~~~~~~~~~q~~~~~~~~~~a 687 (725) T protein:vir:77 623 AKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQ-SEFREFLKTVASFQQDRSEDARANA 687 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3334443333333333333333333332222222111111 1111111122222223333333333 No 4 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=2.2e-155 Score=868.52 Aligned_cols=657 Identities=24% Similarity=0.354 Sum_probs=526.2 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHH--HhhCCCCCCHHHHHHHHhcC----CCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDI--SFLAGEQWPESVRKEREDEG----RPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~--~~~~G~Qw~~~~~~~~~~~g----~p~~~~N~i~~~i~~i~g~~ 85 (688) =.++..++|++++.+|.+++++++++|.++.++. .||+|+||+++++++|+++| |||+|||+|+|+|++|+|++ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 3344456999999999999999999999998886 57999999999999998764 79999999999999999999 Q ss_pred HhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCC Q lcl|NC_020488. 
86 RQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDD 165 (688) Q Consensus 86 ~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~ 165 (688) ++||++++|+|++.++ |.++|++||++++|+++.|+++++++++|+++++||+||++|+++|..++ T Consensus 81 ~~nr~d~~v~p~~~~~--------------d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~ 146 (708) T protein:vir:17 81 RNNRITVKFRPGDREA--------------SEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) T ss_pred hhCCcceEEecCCCcc--------------hHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccC Confidence 9999999999997643 78999999999999999999999999999999999999999999997764 Q ss_pred ---CCCcceeEEEeccc-ceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhc--ccccccccccCCCCCEE Q lcl|NC_020488. 166 ---AFDLDLCIKSIHNR-FAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDL--SDAERGEYSWWTNEEGV 239 (688) Q Consensus 166 ---~~~~~~~~~~v~~~-~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~--~~~~~~~~~~~~~~~~v 239 (688) .+..++.+.++.+| ++|||||+|+++|+|||+|+|+++|||+++++++||+++.... +....++ +.|++.++| T Consensus 147 d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~-~~~~~~d~v 225 (708) T protein:vir:17 147 DPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWE-YDWFDADVI 225 (708) T ss_pred CCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhcccc-ccccCCCeE Confidence 35667777777544 7999999999999999999999999999999999999875433 2333333 347788999 Q ss_pred EEEEEEeeeecceeeeeccC---Cc--eecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCc Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSD---GR--TVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGST 314 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~---g~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~ 314 (688) ||+|||++.++...++.+.+ |. 
++..+....+...+...|...+..+.+++++|+|+.++|+.+|++++|+||++ T Consensus 226 rv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~ 305 (708) T protein:vir:17 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH 305 (708) T ss_pred EEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCc Confidence 99999999999988877654 33 34445556667778888888899999999999999999999999999999999 Q ss_pred cceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecC Q lcl|NC_020488. 315 IPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYN 394 (688) Q Consensus 315 ~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 394 (688) ||||||||++.+++|+++++|+||.|+|+|+++|+++|+++|+++++++.+++++.+++.+....|...+..+.++...+ T Consensus 306 fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~ 385 (708) T protein:vir:17 306 IPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLR 385 (708) T ss_pred cceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhh Confidence 99999999999999999989999999999999999999999999999999999999999998888877766555444333 Q ss_pred cc--------cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 395 AI--------PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 395 ~~--------~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) .. 
.+..++.+++++++|+++++|++.+..+|+++|||+++++|+.+| +||+||++++++|++.++.++||+ T Consensus 386 ~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl 464 (708) T protein:vir:17 386 EVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNM 464 (708) T ss_pred ccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc-hHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 223456678899999999999999999999999999999998766 899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) +++++++|+++|+||++|||++|++||+|++|+.+++.+|..+.+..+|..++.|||++|+|||+|+++|+++|+|++.+ T Consensus 465 ~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~ 544 (708) T protein:vir:17 465 AKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATV 544 (708) T ss_pred HHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhHH---HHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhh----HHHHHHHHHHH Q lcl|NC_020488. 
547 DSLMQFVQAVPA---AGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS----PEQQANMAQAQ 619 (688) Q Consensus 547 ~~l~~~~q~~~~---~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~q~~~~~~q 619 (688) +.|+++++.+++ +.+.+++++++++++|+++++.+++++..++.....+..++.+++..+++ .+++.++.+++ T Consensus 545 ~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaq 624 (708) T protein:vir:17 545 SVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQ 624 (708) T ss_pred HHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999998764 45667888999999999999999999887765554433322222211111 11112222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 620 ADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAA--MMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 620 ~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a--~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++..++++++++++++..+.+++..+.+....+.+..+ ..++.+. +......+..+++...|..+ T Consensus 625 a~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~----~~~~~~~~~~~~l~~~q~~q 691 (708) T protein:vir:17 625 AQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARN----IDDKAVMEAIRLLKDVAESQ 691 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhhhhhhH Confidence 33334444444443333333222221111111111111 0111000 00000111112222212111 No 5 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=2.2e-155 Score=868.47 Aligned_cols=655 Identities=23% Similarity=0.302 Sum_probs=518.1 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCc Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRP 90 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~ 90 (688) .. 
+.+.+|.+++.+|+++++++++||.++.++++||+|+||++++++.|+.+|+| +||+|+|+|++|+|++++||+ T Consensus 1 m~--d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~e~~nr~ 76 (725) T protein:vir:92 1 MA--DNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CC--chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCc Confidence 11 23558999999999999999999999999999999999999999999999998 579999999999999999999 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) +++|+|++++ |.++|++||++++|+++.|+++++++++|+++++||+||++|++||..+++|+++ T Consensus 77 d~~v~P~~~~---------------d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~ 141 (725) T protein:vir:92 77 DVLYRPKDGA---------------SPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNN 141 (725) T ss_pred ceEEecCCcc---------------HHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCc Confidence 9999999763 8999999999999999999999999999999999999999999999998999988 Q ss_pred eeEEE--eccc-ceEEeCCcccccccccCceEEEEecCCHHHHHH---hcCCccchhccccc-ccccccCCCCCEEEEEE Q lcl|NC_020488. 171 LCIKS--IHNR-FAVLMDPDATEPDYSDANWCFISERMSKAEFNK---RYPGKAVGDLSDAE-RGEYSWWTNEEGVRVSE 243 (688) Q Consensus 171 ~~~~~--v~~~-~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~---~~p~~~~~~~~~~~-~~~~~~~~~~~~v~v~e 243 (688) ++|.+ |++| .+|||||+|+++|+|||+|+|+++|||+++++. .||..+.+...... 
....+.|+++++|||+| T Consensus 142 ~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e 221 (725) T protein:vir:92 142 QVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAE 221 (725) T ss_pred eeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEE Confidence 87764 4444 469999999999999999999999999986655 56544333221111 11234477889999999 Q ss_pred EEeeeecceeeeeccC---Cceeccc--ccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceE Q lcl|NC_020488. 244 YFYREPVTRKLLLLSD---GRTVWED--EVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVA 318 (688) Q Consensus 244 ~~~~~~~~~~~~~~~~---g~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~v 318 (688) ||++.++...++...+ |..+.++ .....+..+...|..++..+.++++||+|+.++|+++|++|+||||++|||| T Consensus 222 ~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~v 301 (725) T protein:vir:92 222 FYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIV 301 (725) T ss_pred EEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeE Confidence 9999999888876543 5554433 3345566778889999999999999999999999999999999999999999 Q ss_pred EEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec----- Q lcl|NC_020488. 319 PVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY----- 393 (688) Q Consensus 319 p~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~----- 393 (688) ||||++.+++|++|++|+||.|+|+|+++|+++|+++|+++++++++++++++++++.+..|...+..+ +++. 
T Consensus 302 P~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~ 379 (725) T protein:vir:92 302 PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYP--YYLLNRTDE 379 (725) T ss_pred EEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccc--eeecccccc Confidence 999999999999999999999999999999999999999999999999999999998877776544332 2222 Q ss_pred -CcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 -NAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRR 472 (688) Q Consensus 394 -~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~ 472 (688) ++.....++.+.+++++|+++++|++.+.++|+++|||+++++|..+|++||+||++++++|++.+++++|||+++++. T Consensus 380 ~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~ 459 (725) T protein:vir:92 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) T ss_pred ccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2233445788889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 
473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +|+++|+||++|||++|++||+|++|..+++.||..+.++.+|..++.||++ |+|||+|+++|+++|+|++++..|+++ T Consensus 460 ~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~-g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:92 460 DGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) T ss_pred HHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccc-cceeeEEeeccChHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999994 899999999999999999999999999 Q ss_pred HHhhHHHHH---HHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhh----hhhhhhHHH-------HHHHHHH Q lcl|NC_020488. 553 VQAVPAAGG---VVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGI----EPPQPSPEQ-------QANMAQA 618 (688) Q Consensus 553 ~q~~~~~~~---~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-------q~~~~~~ 618 (688) ++.+|++.+ .++..+++++++|+++++.++++++.++.....+...+.++ ++....+++ +++.++. T Consensus 539 ~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~ 618 (725) T protein:vir:92 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQG 618 (725) T ss_pred HHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHH Confidence 999886544 34556678999999999999999877665433221111110 001111111 1222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
619 QADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 619 q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) |+++++++++..+++++.++++++.+..+++.+++..++...+....++ +...++.++.++...+.+.| T Consensus 619 qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~-~~~~~~~~q~~~~~~a~~~a 687 (725) T protein:vir:92 619 QAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFRE-FLKTVASFQQDRSEDARANA 687 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhc Confidence 4444444444444444443333333333222222211111111111111 11111111112222222222 No 6 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=2.2e-154 Score=863.02 Aligned_cols=662 Identities=25% Similarity=0.344 Sum_probs=528.7 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh--CCCCCCHHHHHHHHhc----CCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL--AGEQWPESVRKEREDE----GRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~--~G~Qw~~~~~~~~~~~----g~p~~~~N~i~~~i~~i~g~~ 85 (688) -.++.+++|++++++|.++++++++||+++.+|++|| +|+||+++++++|+++ ||||+|+|+|+|+|++|+|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 4556678999999999999999999999999999887 5999999999999876 689999999999999999999 Q ss_pred HhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCC Q lcl|NC_020488. 
86 RQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDD 165 (688) Q Consensus 86 ~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~ 165 (688) ++||++++|+|++.++ |.++|++||++++|+++.|+++++++++|+++++||+||++|+++|..++ T Consensus 81 ~~nr~d~~v~P~~~~~--------------d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~ 146 (708) T protein:vir:10 81 RNNRITVKFRPGDREA--------------SEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEY 146 (708) T ss_pred HhCCcceEEEcCCCCc--------------hHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeecccccc Confidence 9999999999997643 78999999999999999999999999999999999999999999997653 Q ss_pred ---CCCcceeEEEeccc-ceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc-ccccccCCCCCEEE Q lcl|NC_020488. 166 ---AFDLDLCIKSIHNR-FAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE-RGEYSWWTNEEGVR 240 (688) Q Consensus 166 ---~~~~~~~~~~v~~~-~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~-~~~~~~~~~~~~v~ 240 (688) .++.++.++.+++| ++|||||.|+++|+|||+|+|+++|||+++++++||+++........ .+..+.|.+.+.|+ T Consensus 147 d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~ 226 (708) T protein:vir:10 147 DPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY 226 (708) T ss_pred CCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceE Confidence 46677788887776 68999999999999999999999999999999999998764322211 12234577889999 Q ss_pred EEEEEeeeecceeeeeccC---Cce--ecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 
241 VSEYFYREPVTRKLLLLSD---GRT--VWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 241 v~e~~~~~~~~~~~~~~~~---g~~--~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) |+|||++.++...++.+.+ |.+ +..+.....+..+...|...+..+.+++++|+|+.++|+.+|++++||||++| T Consensus 227 v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~f 306 (708) T protein:vir:10 227 IAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHI 306 (708) T ss_pred EEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCce Confidence 9999999988887765433 433 44455566778888889888999999999999999999999999999999999 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA 395 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 395 (688) |||||||++.+++|+++++|+||.|+|+|+++|+++|+++++++++++..++++.+++.+.+..|...+..+.+++..++ T Consensus 307 P~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 386 (708) T protein:vir:10 307 PLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) T ss_pred eeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999998888887777666554433 Q ss_pred c--------cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 396 I--------PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 396 ~--------~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) . 
.+..++.+++++++|+++++|++.+..+|+++||+|++++|+.+| +||+||++++++|++.+++++|||+ T Consensus 387 ~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn-~SG~aI~~rq~qg~~~l~~~~Dnl~ 465 (708) T protein:vir:10 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMA 465 (708) T ss_pred ccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc-hHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 123356777899999999999999999999999999999998665 8999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) ++++++|+++|+||++|||++|++||+|++|+++++.+|..+.++.+|..++.|||++|+|||+|+++|+++|+|+++++ T Consensus 466 ~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~ 545 (708) T protein:vir:10 466 KSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVS 545 (708) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhHH---HHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhh----HHHHHHHHHHHH Q lcl|NC_020488. 
548 SLMQFVQAVPA---AGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS----PEQQANMAQAQA 620 (688) Q Consensus 548 ~l~~~~q~~~~---~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~q~~~~~~q~ 620 (688) .|+++++.+|+ +.+.+++++++++|+|+++++.+++++..++..+..+..++.+++..+++ .+++.++.++++ T Consensus 546 ~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa 625 (708) T protein:vir:10 546 VLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQA 625 (708) T ss_pred HHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998875 45667889999999999999999999887765554433222222111111 111122223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCC Q lcl|NC_020488. 621 DMEKAKADTAKAQADMAMAQAKTAEA-------QAKLAEIEQAAMMAGPGSLEETVRNLVA-EAMAELMAQSQGNA 688 (688) Q Consensus 621 ~~~~~q~e~~~~q~e~~~~q~~~~~~-------~a~~~~~~~~a~~~~~~~~~~~~~~~~~-~a~~~~~~~~q~~~ 688 (688) ...++++++++++++.++.++...+. +++.+++-.++...+..+..+.++.... +..+++...++.+. T Consensus 626 ~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~~~~~p~~ 701 (708) T protein:vir:10 626 QMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQS 701 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccC Confidence 33444444444333333322222221 1111111111000000000000000000 00111111111111 No 7 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=2.1e-154 Score=863.09 Aligned_cols=659 Identities=23% Similarity=0.329 Sum_probs=533.0 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh--CCCCCCHHHHH----HHHhcCCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 
12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL--AGEQWPESVRK----EREDEGRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~--~G~Qw~~~~~~----~~~~~g~p~~~~N~i~~~i~~i~g~~ 85 (688) =.++..++|.+++.+|+++++++++||+++.+|++|| +|+||++++++ .++.+|+||++||+|+|+|++|+|++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 4456678999999999999999999999999999998 59999999987 56678999999999999999999999 Q ss_pred HhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCC Q lcl|NC_020488. 86 RQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDD 165 (688) Q Consensus 86 ~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~ 165 (688) ++||++++|+|++.+ +|.++|++||++++|+++.|+++++++++|+++++||+||++|+++|+.++ T Consensus 81 ~~nr~d~~v~P~~~~--------------~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~ 146 (720) T protein:vir:35 81 RHNRITVKFRPGDKT--------------ASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNAL 146 (720) T ss_pred HhCCCceEEEcCCCc--------------chHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccC Confidence 999999999999764 389999999999999999999999999999999999999999999998764 Q ss_pred C---CCcceeEEEeccc-ceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEE Q lcl|NC_020488. 166 A---FDLDLCIKSIHNR-FAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRV 241 (688) Q Consensus 166 ~---~~~~~~~~~v~~~-~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 241 (688) . 
+.+++.+++|++| ++|||||+|+++|+|||+|+|+++|||+++++++||+++.........+.++.|++++.|++ T Consensus 147 d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i 226 (720) T protein:vir:35 147 DPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYI 226 (720) T ss_pred CCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEE Confidence 3 3456777777554 78999999999999999999999999999999999999988877777788888999999999 Q ss_pred EEEEeeeecceeeeeccC---Cceeccc--ccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 242 SEYFYREPVTRKLLLLSD---GRTVWED--EVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 242 ~e~~~~~~~~~~~~~~~~---g~~~~~~--~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +|||+++++...++.+.+ |..+.++ .....+..+...|......+.+++++|+|+.++|+.+|++++|+||++|| T Consensus 227 ~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP 306 (720) T protein:vir:35 227 AKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIP 306 (720) T ss_pred EEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccc Confidence 999999998877765443 4444433 33345666677777777888889999999999999999999999999999 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCC---ceeec Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQ---SVLRY 393 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~---~~~~~ 393 (688) ||||||++.+++|+++++|+||.|+|+||++|+++|+++++++++ +.+++.|++++.+.+.++++.++. 
+++.+ T Consensus 307 ~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~---~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~ 383 (720) T protein:vir:35 307 LIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQD---TGSIPIVGKSQIKTLEKYWANRNKNRPAFLPL 383 (720) T ss_pred eEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcC---CccccccCcchHHHHHHHhhcccccccccccc Confidence 999999999999999999999999999999999999999999876 456777777776666666555433 33333 Q ss_pred Cccc--------ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 NAIP--------GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDN 465 (688) Q Consensus 394 ~~~~--------~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn 465 (688) ++.. ...++.+.+++++|+++++|++.+..+|+++||||++++|..+| +||+||++++++|++.+.+++|| T Consensus 384 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~~~Dn 462 (720) T protein:vir:35 384 NEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-IAKETVNHLMHRSDMSSFIYLDN 462 (720) T ss_pred ccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-hHHHHHHHHHHHHHHHHHHHHHH Confidence 3221 12366788899999999999999999999999999999999887 89999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHH Q lcl|NC_020488. 
466 LSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEA 545 (688) Q Consensus 466 ~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~ 545 (688) |+++++++|+++|+||++|||++|++||+|++|.++++.+|..+.++.+|.++++|||++|+|||+|+++|+++|+|+++ T Consensus 463 l~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~ 542 (720) T protein:vir:35 463 MAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDAT 542 (720) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhHH---HHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhh---hhhhhHHHHHHHHHHH Q lcl|NC_020488. 546 ADSLMQFVQAVPA---AGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIE---PPQPSPEQQANMAQAQ 619 (688) Q Consensus 546 ~~~l~~~~q~~~~---~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~q~~~~~~q 619 (688) ++.|+++++.+|+ ++.++++++++++++|+++++.+++++..+++....+...+.++. ..+..++++.+++++| T Consensus 543 ~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aq 622 (720) T protein:vir:35 543 VSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQ 622 (720) T ss_pred HHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHH Confidence 9999999987654 566788999999999999999999998876654433322222111 1122223334455555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhcCC Q lcl|NC_020488. 620 ADMEKAKADTAKAQADMAMAQAKTAEAQ--AKLAEIEQAAMMAGPGSLEETVRN----LVAEAMAELMAQSQGNA 688 (688) Q Consensus 620 ~~~~~~q~e~~~~q~e~~~~q~~~~~~~--a~~~~~~~~a~~~~~~~~~~~~~~----~~~~a~~~~~~~~q~~~ 688 (688) +++.+++++.+++++++...+++...++ +++.+.+..+.+++.++..+.... 
..+..+.++...+|++| T Consensus 623 a~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~ 697 (720) T protein:vir:35 623 GVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADA 697 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHH Confidence 5555556555555554444333332222 222222322223332222211110 01111222333333333 No 8 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=1.3e-153 Score=858.67 Aligned_cols=658 Identities=23% Similarity=0.349 Sum_probs=529.1 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh--CCCCCCHHHHHHHHhc----CCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL--AGEQWPESVRKEREDE----GRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~--~G~Qw~~~~~~~~~~~----g~p~~~~N~i~~~i~~i~g~~ 85 (688) =.+++.+++.+++.+|.++++++++||+++.+|++|| +|+||+++++++|+++ ||||++||+|+|+|++|+|++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 3347788999999999999999999999999999998 6999999999999865 689999999999999999999 Q ss_pred HhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccC- Q lcl|NC_020488. 86 RQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTD- 164 (688) Q Consensus 86 ~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~- 164 (688) ++||++++|+|+++. 
.|.++|++||++++|+++.|+++++++++|+++++||+||++|+++|+.+ T Consensus 81 ~~nr~~~~v~P~~~~--------------~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~ 146 (706) T protein:vir:10 81 RNNRISVKFRPGDNA--------------ASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEY 146 (706) T ss_pred HhCCCceEEecCCCC--------------chHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeecccccc Confidence 999999999997653 38999999999999999999999999999999999999999999999875 Q ss_pred --CCCCcceeEEEecccc-eEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEE Q lcl|NC_020488. 165 --DAFDLDLCIKSIHNRF-AVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRV 241 (688) Q Consensus 165 --~~~~~~~~~~~v~~~~-~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 241 (688) ..++++|.+++|++|. +|||||+|+++|+|||+|+++++|||+++++++||+++.+.....+.+.+.+|...+.+++ T Consensus 147 d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~ 226 (706) T protein:vir:10 147 DPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYI 226 (706) T ss_pred CCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCccee Confidence 3567889899998886 7999999999999999999999999999999999999877666666666777888999999 Q ss_pred EEEEeeeecceeeeec-----cCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 
242 SEYFYREPVTRKLLLL-----SDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 242 ~e~~~~~~~~~~~~~~-----~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) .|||.++++...++.+ .++.+++.......++.+...|......+.+++++|+|+.++|+.+|++++||||++|| T Consensus 227 ~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P 306 (706) T protein:vir:10 227 AKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIP 306 (706) T ss_pred cccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccc Confidence 9999888766555432 23445555555566667777788888889999999999999999999999999999999 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcc Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAI 396 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 396 (688) ||||||++.+++++..++|+||.|+|+||++|+++|+++++++++++...++..+.+++.+..|...+.....++.+++. T Consensus 307 ~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~ 386 (706) T protein:vir:10 307 LIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTV 386 (706) T ss_pred eEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccc Confidence 99999999999999999999999999999999999999999999988777777777776766776655443333322211 Q ss_pred --------cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
397 --------PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR 468 (688) Q Consensus 397 --------~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~ 468 (688) .....+.+..++++|+++++|++.+..+|+++||||++++|+.+| +||+||++++++|++.+++++|||++ T Consensus 387 ~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~ 465 (706) T protein:vir:10 387 TDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-VARETVNSLLNRSDMASFIYLDNMAK 465 (706) T ss_pred cCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112344566788999999999999999999999999999998776 89999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHH Q lcl|NC_020488. 469 AIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADS 548 (688) Q Consensus 469 ~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~ 548 (688) +++++|+++|+||++|||++|+|||+|++++.+++.||..+.++.+|..++.|||++|+|||+|+++|+++|+|+++++. T Consensus 466 ~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~ 545 (706) T protein:vir:10 466 SLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNA 545 (706) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhHH---HHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 549 LMQFVQAVPA---AGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKA 625 (688) Q Consensus 549 l~~~~q~~~~---~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~ 625 (688) |+++++.+++ +.+.++++++++|++|+++++.++++++.+++.......++.++... 
+.++++++++++++.++ T Consensus 546 m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~---q~qq~q~~q~~~~~~~~ 622 (706) T protein:vir:10 546 LTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQ---QAQQAQATQPDPNMLLA 622 (706) T ss_pred HHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHH---HHHHHHHHHHHHHHHHH Confidence 9999997653 45567889999999999999999999888766554433322222211 11222223333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHhhcCC Q lcl|NC_020488. 626 KADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNL---------VAEAMAELMAQSQGNA 688 (688) Q Consensus 626 q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~---------~~~a~~~~~~~~q~~~ 688 (688) +++..++|+++.++++++.+.+.+..+.+.+++.++.++.+...... ..++.+. +...|+++ T Consensus 623 ~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~-~~a~q~~~ 693 (706) T protein:vir:10 623 QAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKE-VAASQQQT 693 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhccCC Confidence 44444444444444444443333333323332222222222111110 0011111 11111222 No 9 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=1.4e-153 Score=858.53 Aligned_cols=649 Identities=16% Similarity=0.178 Sum_probs=531.4 Q ss_pred CCC---CCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHH Q lcl|NC_020488. 
1 MLP---GNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQY 77 (688) Q Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~ 77 (688) |-- -+++++...++....++.+.+.+|.++++.+.+||.++.++++||+|+||+++++++|+++||||++||+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~ 80 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPA 80 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHH Confidence 111 12245555566667788889999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 78 VDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 78 i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) |++|+|++++||++++|+||++. .|.++|++||++++|+++.|+++++++++|+++++||+||+++ T Consensus 81 v~~v~g~~~~nr~d~~v~Pr~~~--------------~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~ 146 (772) T protein:vir:10 81 LLSLQGYEAVTRTDWRVTPNGDV--------------GGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEV 146 (772) T ss_pred HHHHHHHHHhcCcceEEecCCCc--------------hHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEe Confidence 99999999999999999999643 3899999999999999999999999999999999999999998 Q ss_pred EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc----------- Q lcl|NC_020488. 
158 LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE----------- 226 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~----------- 226 (688) +++ +++++++|.|.+| ||++|||||+|++ |+|||+|+|+++|||+++++++||+++.......+ T Consensus 147 ~~~---~d~~~~~i~i~~v-~p~~v~~Dp~a~~-D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~ 221 (772) T protein:vir:10 147 SRE---SDPFKFPYRCRPI-RRDEIHWDMKCGD-DWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDL 221 (772) T ss_pred ccc---cCCCCCCeEEEee-CcccceecCCCCC-CHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccc Confidence 765 4668889999998 8999999999976 99999999999999999999999997643211100 Q ss_pred ---------------------ccccc--cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhh Q lcl|NC_020488. 227 ---------------------RGEYS--WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVT 283 (688) Q Consensus 227 ---------------------~~~~~--~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~ 283 (688) ....+ +..+.++|||+||||+.++..+++.+++|+++.++..++.+...+..|.... T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~ 301 (772) T protein:vir:10 222 GMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISP 301 (772) T ss_pred cccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccch Confidence 00011 1234588999999999999999999999999999998888888888886655 Q ss_pred heeeeeEEEEEEEEEchhhhcc-cCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020488. 284 RERRVKTYKVKWMKVTAYDVLE-GPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAP 362 (688) Q Consensus 284 ~~~~~~~~~v~~~~~~~~~ile-~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~ 362 (688) ... .+++|+|++++|+++|+ +++||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++|+++++. 
T Consensus 302 ~~~--~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l~~~~ 377 (772) T protein:vir:10 302 KKV--TVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGMSVAR 377 (772) T ss_pred hee--eeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHHhccc Confidence 443 45679999999999997 799999999999999999998899887 9999999999999999999999998874 Q ss_pred CCceeechhhhcchH-HHHhhcccCCCceeecCccc--ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCC Q lcl|NC_020488. 363 KAPWVAPAESIEGYE-EEWNQANRKNQSVLRYNAIP--GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQ 439 (688) Q Consensus 363 ~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~ 439 (688) +++++|++++.+ ++.++.+++++++++..+.. .+.++.+.+++++|+++++|++.+.++|+++|||+++++|.. T Consensus 378 ---~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~ 454 (772) T protein:vir:10 378 ---VERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRK 454 (772) T ss_pred ---ccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCC Confidence 789999999876 46677777776666554322 235677888999999999999999999999999999999999 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCC--Ccceeeechhhhcccccce Q lcl|NC_020488. 440 GNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDG--EGDWVQINQMVMDEETQKP 517 (688) Q Consensus 440 ~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~--~~~~v~~n~~~~~~~~~~~ 517 (688) +|++||+||++++++|++.+++++|||+++++++|+++|+||++|||++|++||+++++ .++++.||..+.++.+|.. 
T Consensus 455 ~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~ 534 (772) T protein:vir:10 455 GTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAA 534 (772) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceeccccccc Confidence 99999999999999999999999999999999999999999999999999999999874 5899999999999999999 Q ss_pred eeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh-HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhh Q lcl|NC_020488. 518 VLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV-PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDE 596 (688) Q Consensus 518 ~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~-~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~ 596 (688) ++.|||++|+|||+|+++|+++|+|+++++.|+++++.+ |.+++.+++++++++++|+++++.+++++..++..+.+.+ T Consensus 535 ~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~ 614 (772) T protein:vir:10 535 YLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQ 614 (772) T ss_pred ceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHH Confidence 999999999999999999999999999999999999764 5677888999999999999999999999876554332211 Q ss_pred HHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 597 MEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEA 676 (688) Q Consensus 597 ~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a 676 (688) .+++++.+.++++++++++. .+++++..+.+++.++.+++..+...+++..++++.+....+++... 
T Consensus 615 --------~~~~q~~qq~~~~~~~el~~-----~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~ 681 (772) T protein:vir:10 615 --------QQIDQAVQDALAKAGNDIKL-----RELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAP 681 (772) T ss_pred --------HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhH Confidence 11111111122222222221 11222222233333333444444444455556665555545444445 Q ss_pred HHHHHHHhhcCC Q lcl|NC_020488. 677 MAELMAQSQGNA 688 (688) Q Consensus 677 ~~~~~~~~q~~~ 688 (688) .++++++++|-. T Consensus 682 ~ad~~l~~~g~~ 693 (772) T protein:vir:10 682 IADAVMQSAGYQ 693 (772) T ss_pred HHHHHHHhcccc Confidence 556666666643 No 10 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=1e-149 Score=837.36 Aligned_cols=654 Identities=20% Similarity=0.250 Sum_probs=508.6 Q ss_pred CCCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |--.-..++.+ +++...+++.+++.+|..+.+++.+||.++.++++||+|+||+++++++|+++||||+|||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 21111122222 45667789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||++++ ++.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~p~~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:32 81 GVLGMEAKTRTDLVVMSDEPDD-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHhHHHhCCcceEEecCCCCc-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 9999999999999999998752 367899999999999999999999999999999999999999987 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) ++ ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++|+++||+++...-.+... T Consensus 148 ~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:32 148 NS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 74 567899999999 89999999999999999999999999999999999999976432211100 Q ss_pred ----------------cccccCC--CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 228 ----------------GEYSWWT--NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~~--~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.|. 
+.++|+|+||||+.++...++...+|++++++..+..+...+..|...+..+.++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:32 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0112232 3468999999999999999999999999999988888877777888777777766 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) ++ ++++++|.++| ++|+||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++++++ ++.. ++ T Consensus 304 rv--~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~-~~ 376 (714) T protein:vir:32 304 RI--REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQ--AKRV-IM 376 (714) T ss_pred eE--EEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHhhc--CCce-ee Confidence 54 44556777777 5789999999999999999999999977 9999999999999999999999764 4544 56 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++++++++++..... 
.....++++.+++++|+++++|++++.+.|+++||||++++|..+|++ T Consensus 377 ~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:32 377 DEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccch Confidence 67777654 5678888888876665322 222356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCc---ceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEG---DWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~---~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+|+++.. +++.+| +.+|.+.+. T Consensus 457 SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in-----~~~~~~~~~ 531 (714) T protein:vir:32 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec-----cccCcceec Confidence 999999999999999999999999999999999999999999999999999886654 466665 467888999 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP-AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~-~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++|+|++.++.|+++++.+| ..+.++++++++++++|+++++.+++++..+++.......++ T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:32 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999998764 456678889999999999999999999988776543333222 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH------------HH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQA----KLAEIEQAAMMAG------------PG 663 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a----~~~~~~~~a~~~~------------~~ 663 (688) +++.++++ ++.+.+++++++.+++++.++.+++.+++++++..... .++..+.+...++ .+ T Consensus 612 ~q~~~~~~---q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~ 688 (714) T protein:vir:32 612 EQEVAAQQ---QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 22111111 11111122222223333333333333332222211111 1111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++... 
.++.+..+...++-+| T Consensus 689 ~~~~~~~--~~~~q~~q~~~~~~~~ 711 (714) T protein:vir:32 689 NMEQEQD--VLQQQMLYTLQQRMNE 711 (714) T ss_pred hhhhhhH--HHHHHHHHHHHHHHHh Confidence 0111111 1111223344445555 No 11 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=1e-149 Score=837.36 Aligned_cols=654 Identities=20% Similarity=0.250 Sum_probs=508.6 Q ss_pred CCCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |--.-..++.+ +++...+++.+++.+|..+.+++.+||.++.++++||+|+||+++++++|+++||||+|||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 21111122222 45667789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||++++ ++.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~p~~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:27 81 GVLGMEAKTRTDLVVMSDEPDD-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHhHHHhCCcceEEecCCCCc-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 9999999999999999998752 367899999999999999999999999999999999999999987 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) ++ ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++|+++||+++...-.+... T Consensus 148 ~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:27 148 NS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 74 567899999999 89999999999999999999999999999999999999976432211100 Q ss_pred ----------------cccccCC--CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 228 ----------------GEYSWWT--NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~~--~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.|. 
+.++|+|+||||+.++...++...+|++++++..+..+...+..|...+..+.++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:27 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0112232 3468999999999999999999999999999988888877777888777777766 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) ++ ++++++|.++| ++|+||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++++++ ++.. ++ T Consensus 304 rv--~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~-~~ 376 (714) T protein:vir:27 304 RI--REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQ--AKRV-IM 376 (714) T ss_pred eE--EEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHhhc--CCce-ee Confidence 54 44556777777 5789999999999999999999999977 9999999999999999999999764 4544 56 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++++++++++..... 
.....++++.+++++|+++++|++++.+.|+++||||++++|..+|++ T Consensus 377 ~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:27 377 DEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccch Confidence 67777654 5678888888876665322 222356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCc---ceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEG---DWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~---~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+|+++.. +++.+| +.+|.+.+. T Consensus 457 SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in-----~~~~~~~~~ 531 (714) T protein:vir:27 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec-----cccCcceec Confidence 999999999999999999999999999999999999999999999999999886654 466665 467888999 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP-AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~-~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++|+|++.++.|+++++.+| ..+.++++++++++++|+++++.+++++..+++.......++ T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:27 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999998764 456678889999999999999999999988776543333222 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH------------HH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQA----KLAEIEQAAMMAG------------PG 663 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a----~~~~~~~~a~~~~------------~~ 663 (688) +++.++++ ++.+.+++++++.+++++.++.+++.+++++++..... .++..+.+...++ .+ T Consensus 612 ~q~~~~~~---q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~ 688 (714) T protein:vir:27 612 EQEVAAQQ---QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 22111111 11111122222223333333333333332222211111 1111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++... 
.++.+..+...++-+| T Consensus 689 ~~~~~~~--~~~~q~~q~~~~~~~~ 711 (714) T protein:vir:27 689 NMEQEQD--VLQQQMLYTLQQRMNE 711 (714) T ss_pred hhhhhhH--HHHHHHHHHHHHHHHh Confidence 0111111 1111223344445555 No 12 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=1e-149 Score=837.36 Aligned_cols=654 Identities=20% Similarity=0.250 Sum_probs=508.6 Q ss_pred CCCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |--.-..++.+ +++...+++.+++.+|..+.+++.+||.++.++++||+|+||+++++++|+++||||+|||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 21111122222 45667789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||++++ ++.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~p~~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:10 81 GVLGMEAKTRTDLVVMSDEPDD-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHhHHHhCCcceEEecCCCCc-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 9999999999999999998752 367899999999999999999999999999999999999999987 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) ++ ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++|+++||+++...-.+... T Consensus 148 ~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:10 148 NS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 74 567899999999 89999999999999999999999999999999999999976432211100 Q ss_pred ----------------cccccCC--CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 228 ----------------GEYSWWT--NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~~--~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.|. 
+.++|+|+||||+.++...++...+|++++++..+..+...+..|...+..+.++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:10 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0112232 3468999999999999999999999999999988888877777888777777766 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) ++ ++++++|.++| ++|+||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++++++ ++.. ++ T Consensus 304 rv--~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~-~~ 376 (714) T protein:vir:10 304 RI--REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQ--AKRV-IM 376 (714) T ss_pred eE--EEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHhhc--CCce-ee Confidence 54 44556777777 5789999999999999999999999977 9999999999999999999999764 4544 56 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++++++++++..... 
.....++++.+++++|+++++|++++.+.|+++||||++++|..+|++ T Consensus 377 ~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:10 377 DEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccch Confidence 67777654 5678888888876665322 222356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCc---ceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEG---DWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~---~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+|+++.. +++.+| +.+|.+.+. T Consensus 457 SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in-----~~~~~~~~~ 531 (714) T protein:vir:10 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec-----cccCcceec Confidence 999999999999999999999999999999999999999999999999999886654 466665 467888999 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP-AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~-~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++|+|++.++.|+++++.+| ..+.++++++++++++|+++++.+++++..+++.......++ T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:10 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999998764 456678889999999999999999999988776543333222 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH------------HH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQA----KLAEIEQAAMMAG------------PG 663 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a----~~~~~~~~a~~~~------------~~ 663 (688) +++.++++ ++.+.+++++++.+++++.++.+++.+++++++..... .++..+.+...++ .+ T Consensus 612 ~q~~~~~~---q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~ 688 (714) T protein:vir:10 612 EQEVAAQQ---QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 22111111 11111122222223333333333333332222211111 1111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++... 
.++.+..+...++-+| T Consensus 689 ~~~~~~~--~~~~q~~q~~~~~~~~ 711 (714) T protein:vir:10 689 NMEQEQD--VLQQQMLYTLQQRMNE 711 (714) T ss_pred hhhhhhH--HHHHHHHHHHHHHHHh Confidence 0111111 1111223344445555 No 13 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=1e-149 Score=837.36 Aligned_cols=654 Identities=20% Similarity=0.250 Sum_probs=508.6 Q ss_pred CCCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |--.-..++.+ +++...+++.+++.+|..+.+++.+||.++.++++||+|+||+++++++|+++||||+|||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 21111122222 45667789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||++++ ++.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~p~~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:81 81 GVLGMEAKTRTDLVVMSDEPDD-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHhHHHhCCcceEEecCCCCc-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 9999999999999999998752 367899999999999999999999999999999999999999987 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) ++ ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++|+++||+++...-.+... T Consensus 148 ~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:81 148 NS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 74 567899999999 89999999999999999999999999999999999999976432211100 Q ss_pred ----------------cccccCC--CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 228 ----------------GEYSWWT--NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~~--~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.|. 
+.++|+|+||||+.++...++...+|++++++..+..+...+..|...+..+.++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:81 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0112232 3468999999999999999999999999999988888877777888777777766 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) ++ ++++++|.++| ++|+||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++++++ ++.. ++ T Consensus 304 rv--~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~-~~ 376 (714) T protein:vir:81 304 RI--REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQ--AKRV-IM 376 (714) T ss_pred eE--EEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHhhc--CCce-ee Confidence 54 44556777777 5789999999999999999999999977 9999999999999999999999764 4544 56 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++++++++++..... 
.....++++.+++++|+++++|++++.+.|+++||||++++|..+|++ T Consensus 377 ~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:81 377 DEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccch Confidence 67777654 5678888888876665322 222356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCc---ceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEG---DWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~---~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+|+++.. +++.+| +.+|.+.+. T Consensus 457 SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in-----~~~~~~~~~ 531 (714) T protein:vir:81 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec-----cccCcceec Confidence 999999999999999999999999999999999999999999999999999886654 466665 467888999 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP-AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~-~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++|+|++.++.|+++++.+| ..+.++++++++++++|+++++.+++++..+++.......++ T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:81 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999998764 456678889999999999999999999988776543333222 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH------------HH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQA----KLAEIEQAAMMAG------------PG 663 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a----~~~~~~~~a~~~~------------~~ 663 (688) +++.++++ ++.+.+++++++.+++++.++.+++.+++++++..... .++..+.+...++ .+ T Consensus 612 ~q~~~~~~---q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~ 688 (714) T protein:vir:81 612 EQEVAAQQ---QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 22111111 11111122222223333333333333332222211111 1111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++... 
.++.+..+...++-+| T Consensus 689 ~~~~~~~--~~~~q~~q~~~~~~~~ 711 (714) T protein:vir:81 689 NMEQEQD--VLQQQMLYTLQQRMNE 711 (714) T ss_pred hhhhhhH--HHHHHHHHHHHHHHHh Confidence 0111111 1111223344445555 No 14 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=1e-149 Score=837.36 Aligned_cols=654 Identities=20% Similarity=0.250 Sum_probs=508.6 Q ss_pred CCCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |--.-..++.+ +++...+++.+++.+|..+.+++.+||.++.++++||+|+||+++++++|+++||||+|||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 21111122222 45667789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||++++ ++.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~p~~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:99 81 GVLGMEAKTRTDLVVMSDEPDD-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHhHHHhCCcceEEecCCCCc-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 9999999999999999998752 367899999999999999999999999999999999999999987 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) ++ ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++|+++||+++...-.+... T Consensus 148 ~~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:99 148 NS---DPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-chhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 74 567899999999 89999999999999999999999999999999999999976432211100 Q ss_pred ----------------cccccCC--CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 228 ----------------GEYSWWT--NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~~--~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.|. 
+.++|+|+||||+.++...++...+|++++++..+..+...+..|...+..+.++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:99 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0112232 3468999999999999999999999999999988888877777888777777766 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) ++ ++++++|.++| ++|+||||++||||||||++..++|.+| |+||.|+|+||++|+++|+++++++ ++.. ++ T Consensus 304 rv--~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~-~~ 376 (714) T protein:vir:99 304 RI--REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQ--AKRV-IM 376 (714) T ss_pred eE--EEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHhhc--CCce-ee Confidence 54 44556777777 5789999999999999999999999977 9999999999999999999999764 4544 56 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++++++++++..... 
.....++++.+++++|+++++|++++.+.|+++||||++++|..+|++ T Consensus 377 ~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:99 377 DEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccch Confidence 67777654 5678888888876665322 222356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCc---ceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEG---DWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~---~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+|+++.. +++.+| +.+|.+.+. T Consensus 457 SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in-----~~~~~~~~~ 531 (714) T protein:vir:99 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeec-----cccCcceec Confidence 999999999999999999999999999999999999999999999999999886654 466665 467888999 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP-AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~-~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++|+|++.++.|+++++.+| ..+.++++++++++++|+++++.+++++..+++.......++ T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:99 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchh Confidence 9999999999999999999999999999999998764 456678889999999999999999999988776543333222 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH------------HH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQA----KLAEIEQAAMMAG------------PG 663 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a----~~~~~~~~a~~~~------------~~ 663 (688) +++.++++ ++.+.+++++++.+++++.++.+++.+++++++..... .++..+.+...++ .+ T Consensus 612 ~q~~~~~~---q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~ 688 (714) T protein:vir:99 612 EQEVAAQQ---QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 22111111 11111122222223333333333333332222211111 1111111110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++... 
.++.+..+...++-+| T Consensus 689 ~~~~~~~--~~~~q~~q~~~~~~~~ 711 (714) T protein:vir:99 689 NMEQEQD--VLQQQMLYTLQQRMNE 711 (714) T ss_pred hhhhhhH--HHHHHHHHHHHHHHHh Confidence 0111111 1111223344445555 No 15 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=8.9e-148 Score=826.76 Aligned_cols=654 Identities=20% Similarity=0.244 Sum_probs=511.9 Q ss_pred CCCC-CCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPG-NEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |-=- +++.-.+++.++..++.+++.+|.++++..++||.++.++++||+|+||+++++++|+++||||++||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 2211 113344456677778999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +|+|++++||++++|+||+++. 
+|.++|++|+++++|+++.|+++++++++|+++++||+||+++++ T Consensus 81 ~v~g~~~~nr~~~~v~pr~~~~-------------~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:10 81 GVLGMEAKTRTDLIVMSDDPND-------------ETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred HHHHHHHhCCcceEEecCCCCh-------------hhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeee Confidence 9999999999999999998752 367899999999999999999999999999999999999999999 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccc------------ Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAER------------ 227 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~------------ 227 (688) +| ++++++|.|++| ||++|||||+|+++|+|||+|+++++|||+++++++||+++......... T Consensus 148 d~---d~~~~~i~i~~v-~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~ 223 (714) T protein:vir:10 148 NS---EPFGPEFKVSTV-SRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred cc---CCCCCCeEEEec-ChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhh Confidence 87 457889999999 89999999999999999999999999999999999999976432211110 Q ss_pred ----------------cccccC--CCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 
228 ----------------GEYSWW--TNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 228 ----------------~~~~~~--~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) ..++.| .+.++|+|+||||+.++...++.+.+|++++++..+..+...+..|...+..+.+ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~- 302 (714) T protein:vir:10 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRV- 302 (714) T ss_pred hcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccce- Confidence 011123 3356899999999999999999999999999988888777777777766655554 Q ss_pred EEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 290 TYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 290 ~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) .||+|+++.|+++| ++++||||++||||||||++..++|.+| |+||.|+|+|+++|+++|+++++++. + ++++ T Consensus 303 -~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~l~~--~-~~~~ 376 (714) T protein:vir:10 303 -SRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWLLQA--K-RVIM 376 (714) T ss_pred -eeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHHHhC--C-ceee Confidence 46888889998888 5689999999999999999998899877 99999999999999999999998753 3 4678 Q ss_pred chhhhcch-HHHHhhcccCCCceeecCc----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 369 PAESIEGY-EEEWNQANRKNQSVLRYNA----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 369 ~~~~i~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .+|++++. ++++++.+++++++.+... 
.....+++..+++++|+++++|++.+.+.|+++|||+++++|+.+|++ T Consensus 377 ~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~ 456 (714) T protein:vir:10 377 DEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGAT 456 (714) T ss_pred ccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchh Confidence 88888774 4688888888877665332 122356788889999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCC---cceeeechhhhcccccceeee Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGE---GDWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~---~~~v~~n~~~~~~~~~~~~~~ 520 (688) ||+||++++++|++++++++|||+++++.+|+++|+||++|||++|++||+++++. ..++.+| ..++.+.+. T Consensus 457 SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n-----~~~~~~~~~ 531 (714) T protein:vir:10 457 SGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLN-----AEGDNGELT 531 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeec-----cccCCcccc Confidence 99999999999999999999999999999999999999999999999999988654 4566665 446778899 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh-HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 
521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV-PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~-~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) |||++|+|||+|+++|+++++|++.++.|+++++.+ |.++.++++++++++++|+++++.+++++.++++.......++ T Consensus 532 nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e 611 (714) T protein:vir:10 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPE 611 (714) T ss_pred ccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcc Confidence 999999999999999999999999999999999876 5567788899999999999999999999988766543332222 Q ss_pred hhhhhhhhhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHH Q lcl|NC_020488. 600 AGIEPPQPSP----EQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAM------------MAGPG 663 (688) Q Consensus 600 ~~~~~~~~~~----~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~------------~~~~~ 663 (688) +++.++++++ +++.++.+++++.++++++.+++++.+. ....+++..++.++.+.. ++..+ T Consensus 612 ~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~---~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~ 688 (714) T protein:vir:10 612 EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQ---RDNASAQREVALTQGQRYVDALNQAHTAEIITGVQ 688 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211111111 1112223333333334433333222111 111111111111111110 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 664 SLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 664 ~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..++..... 
+ .+..+...+|-+| T Consensus 689 ~~~q~~~~~-~-q~~~q~~~~~~~~ 711 (714) T protein:vir:10 689 NMEQEQDVL-Q-QQMLYTLQQRMNE 711 (714) T ss_pred hhhhhHHHH-H-HHHHHHHHHHHHh Confidence 111111111 1 1123344455555 No 16 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=1.6e-146 Score=819.96 Aligned_cols=651 Identities=21% Similarity=0.261 Sum_probs=505.7 Q ss_pred CCCCCC------CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhH Q lcl|NC_020488. 1 MLPGNE------PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKL 74 (688) Q Consensus 1 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i 74 (688) |+|+.+ +....++++..+++.+++.+|+++++.+.+||.++.++++||+|+||+++++++|+++|+||+|+|+| T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 101 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPTVYNVI 101 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceEEecch Confidence 656554 44455678888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCce Q lcl|NC_020488. 
75 PQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGW 154 (688) Q Consensus 75 ~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~ 154 (688) +++|++|+|++++||++++|+|++++ |.++|++||++++|+++.|+++.+++++|+|+++||+|| T Consensus 102 ~~~i~~v~g~~~~nr~~~~~~p~~~~---------------d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~ 166 (776) T protein:vir:93 102 SQSVNWIIGSEKRGRSDFKVLPRRKD---------------GGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGW 166 (776) T ss_pred HHHHHHHHHHHHhCCcceEEecCChh---------------HHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcce Confidence 99999999999999999999998753 899999999999999999999999999999999999999 Q ss_pred EEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc-------c Q lcl|NC_020488. 155 LRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE-------R 227 (688) Q Consensus 155 ~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~-------~ 227 (688) ++|+++|+. ++++++.++.+|++|||||+|+++|++||+|||+++|||+++|+++||+++........ . T Consensus 167 ~~v~~d~~~----~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 242 (776) T protein:vir:93 167 LESQVQDEN----DGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGT 242 (776) T ss_pred EEEEeeccC----CCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccch Confidence 999998754 35566677779999999999999999999999999999999999999987643211100 0 Q ss_pred ---------------------cccccCCCCCEEEEEEEEeeeecceeeeec--cCCceecccccchHHHHHHHhhhhhhh Q lcl|NC_020488. 228 ---------------------GEYSWWTNEEGVRVSEYFYREPVTRKLLLL--SDGRTVWEDEVKDVLDELRDLGTTVTR 284 (688) Q Consensus 228 ---------------------~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~--~~g~~~~~~~~~~~~~~~~~~g~~~~~ 284 (688) .....+.++++|||+|||||.++..+++.+ +++..+..+.....+...+..|...+. 
T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~ 322 (776) T protein:vir:93 243 DDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLA 322 (776) T ss_pred hcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeeh Confidence 001123456899999999999998877754 667777777777777777777766555 Q ss_pred eeeeeEEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020488. 285 ERRVKTYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPK 363 (688) Q Consensus 285 ~~~~~~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~ 363 (688) .+. +.+|+|+++.|.++| ++++||+|++||||||||++. +++++|+|+++.|+|+|+++|+++|+++|+|+ + T Consensus 323 ~~~--~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~--~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~---~ 395 (776) T protein:vir:93 323 VSP--MMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRR--ARDGMPYGVIRFMRGMQDDVNKRLSKALYILS---T 395 (776) T ss_pred hee--eeeeEEEEEecchhhhccCCCCCCCccceEEecCcee--cccccccchHHhhhHHHHHHHHHHHHHHHhhc---C Confidence 554 456777777777776 568999999999999999976 56677889999999999999999999999985 3 Q ss_pred CceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchh Q lcl|NC_020488. 364 APWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQ 443 (688) Q Consensus 364 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~ 443 (688) .++++++|++++.++++++.+++++++.++++.. 
..+.+.+.++++++++++++++.++|+++|||+++++|..+|++ T Consensus 396 ~~~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~ 473 (776) T protein:vir:93 396 NKVLMEEGAVDDIDEFRREAARPDAVMTVKNGKL--GAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAV 473 (776) T ss_pred CceeeccccccchHHHHHhcccCCceeeeCCccc--cccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchh Confidence 5799999999999999999999888887766543 24556677889999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccc Q lcl|NC_020488. 444 SGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDI 523 (688) Q Consensus 444 sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi 523 (688) ||+||++++++|++++.+++|||+++++++|+++++||.+|||++|+|||+|++++.+||.||.. ++.||+ T Consensus 474 Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~---------~~~nd~ 544 (776) T protein:vir:93 474 SGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDG---------LPENDI 544 (776) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEeccc---------chhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999963 346999 Q ss_pred eeeeEEEEEecccCcHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhh Q lcl|NC_020488. 524 AAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA-VPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGI 602 (688) Q Consensus 524 ~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~-~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~ 602 (688) ++|+|||+|++|++++++|+++++.|+++++. 
.|.+++.+.+.+++++++|+.+++.+++++..++..+.+......++ T Consensus 545 ~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~ 624 (776) T protein:vir:93 545 TRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEI 624 (776) T ss_pred ccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHH Confidence 99999999999999999999999999999875 46678888999999999999999999998877655444333222222 Q ss_pred hhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 603 EPPQPSP-EQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELM 681 (688) Q Consensus 603 ~~~~~~~-~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~ 681 (688) ++.+.++ .++.+.+.+++.+.+.+++..+++++..+++++....+.+......+++.++.++.++....+.....++.. T Consensus 625 ~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~ 704 (776) T protein:vir:93 625 AREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGI 704 (776) T ss_pred HHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Confidence 2111111 111111112222222222222222222222222111111111111222222222222211111111111222 Q ss_pred HHhhcCC Q lcl|NC_020488. 682 AQSQGNA 688 (688) Q Consensus 682 ~~~q~~~ 688 (688) .+.++-. T Consensus 705 ~~~a~~~ 711 (776) T protein:vir:93 705 LRESGWD 711 (776) T ss_pred hcccccc Confidence 2111111 No 17 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=2.1e-87 Score=495.77 Aligned_cols=600 Identities=14% Similarity=0.149 Sum_probs=397.7 Q ss_pred CCCCCCCcC-CC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--hCCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 
1 MLPGNEPIK-TR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISF--LAGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~--~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) ++|.++|.+ ++ .+.++.++++.++.++..+..+....+.++...++| |.|+.- ..+..||+.++.+.|+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~grs~vv~~~v~~ 81 (763) T protein:vir:95 8 MVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK------PPKVKGRSQVQPKLVRR 81 (763) T ss_pred cCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc------ccccCCCccccCHHHHH Confidence 888887543 33 689999999999999999999988888887766665 455442 22457999999999999 Q ss_pred HHHHHHHHHHh---CCcce-EEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 77 YVDQVLGDQRQ---NRPAI-QVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 77 ~i~~i~g~~~~---~r~~~-~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~G 151 (688) .|+|+++.+.. ...++ .|.|++. +|.+.|++.|.+++|+ ..+|+.....+++|+++|++| T Consensus 82 ~ve~~~~~l~~~f~~~~~~~~~~P~~~---------------~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~ 146 (763) T protein:vir:95 82 QAEWRYSALTEPFLGSNKLFKVTPVTW---------------EDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDG 146 (763) T ss_pred HHHHHHHHHHHhhcCCCcEEEEecCCc---------------chHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcC Confidence 99999999998 44444 7777765 5999999999999996 567888899999999999999 Q ss_pred CceEEEEEeeccCC------------------------------------------C----------------------- Q lcl|NC_020488. 152 FGWLRVLTKYSTDD------------------------------------------A----------------------- 166 (688) Q Consensus 152 ~G~~~v~~~~~~~~------------------------------------------~----------------------- 166 (688) +|+++|+|+.+... . 
T Consensus 147 ~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (763) T protein:vir:95 147 TGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTT 226 (763) T ss_pred cceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccce Confidence 99999988632100 0 Q ss_pred -------CCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHh-cCCccchhcc-----cccccc---- Q lcl|NC_020488. 167 -------FDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKR-YPGKAVGDLS-----DAERGE---- 229 (688) Q Consensus 167 -------~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~-~p~~~~~~~~-----~~~~~~---- 229 (688) ..+.++++.| +|++|||||++++ |++||+||+++.++|+++|..+ |+......+. ...... T Consensus 227 ~~~~~~~~k~~p~ie~V-~p~d~~iDp~a~s-D~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 304 (763) T protein:vir:95 227 TTEVEVPLANHPTVEML-NPENIIIDPSCQG-DINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATT 304 (763) T ss_pred eEEEEEEecCceEEEee-cHHHheecCCCCC-chhhCceEeeEEeccHHHHHhccCCccccchhcchhcccccccccccc Confidence 0123455556 8999999999887 8999999999999999999886 2211111110 000000 Q ss_pred -----cccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhc Q lcl|NC_020488. 230 -----YSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVL 304 (688) Q Consensus 230 -----~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~il 304 (688) ...+...++|.|.|||.+.... +||. .+.+...++|..+| T Consensus 305 ~~~~~~~~d~~~~~V~v~E~y~~~d~~------gdg~-----------------------------~~~~~v~~~g~~iL 349 (763) T protein:vir:95 305 TPQEFQISDPMRKRVVAYEYWGFWDIE------GNGV-----------------------------LEPIVATWIGSTLI 349 (763) T ss_pred chhhccCCCcccceEEEEEeeeeeccC------Ccce-----------------------------eEEEEEEEEcCeee Confidence 0011224688889998864321 1110 12233344556555 Q ss_pred c-cCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhc Q lcl|NC_020488. 
305 E-GPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQA 383 (688) Q Consensus 305 e-~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 383 (688) + +++||+|++|||++++++ +++|++||+|+++.++|+|+++|+++|+++|+++++++++|++++|+++..+... T Consensus 350 ~~~~~p~~~~~~PFv~~~~~--p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~d~~~--- 424 (763) T protein:vir:95 350 RLEKNPYPDGKLPFVLIPYM--PVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDALNSRR--- 424 (763) T ss_pred ecccccccCCCcCEEEecce--eecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccchhhhc--- Confidence 5 679999999999987775 4689999999999999999999999999999999999999999999987654332 Q ss_pred ccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch--hhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 384 NRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNE--QSGKAILARQRQGDRGTFA 461 (688) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~--~sg~ai~~~~~~~~~~~~~ 461 (688) +++++++.+..+......+....++++++..+.++++....++++|||++.++|..++. .++.+++.+++++++++.. T Consensus 425 ~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~ 504 (763) T protein:vir:95 425 YREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMA 504 (763) T ss_pred ccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHH Confidence 45555555544444445667778888999999999999999999999999999987653 4667789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHH Q lcl|NC_020488. 462 YIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQ 541 (688) Q Consensus 462 ~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~ 541 (688) +++||+++++++|+++++||++|||++++|||+|+ +|+.+++.. ..|+|||+|+++++ +. 
T Consensus 505 ~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~----e~v~v~~~~--------------~~~~~DV~V~~~~a--s~ 564 (763) T protein:vir:95 505 ILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNE----EFVTIKRED--------------LKGNFDLEVDISTA--EV 564 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCC----ccccccHHH--------------hcCCcceEEecccc--hH Confidence 99999999999999999999999999999999986 577766533 25789999999864 56 Q ss_pred HHHHHHHHHHHHHhhHH-HHHHHHHH-HHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHH Q lcl|NC_020488. 542 RMEAADSLMQFVQAVPA-AGGVVLDL-IAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQ 619 (688) Q Consensus 542 r~~~~~~l~~~~q~~~~-~~~~~~~~-~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q 619 (688) +++..+.++++++.+++ +.+.+... +.+.+++....++.+.++...++..+. ++++++.+..+++.+++...+ + T Consensus 565 ~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~--~q~qaqle~~~~q~e~~~~~a--k 640 (763) T protein:vir:95 565 DNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPV--QEQLKQLAVEKAQLENEELRS--K 640 (763) T ss_pred HHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccch--hhhHHHHHHHHHHHHHHHHHH--H Confidence 66667777777776533 22223232 345566666667777777655443221 111112222222222222221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------hhc Q lcl|NC_020488. 620 ADMEKAKADTAKAQADMAMAQAKTAEAQAKLA------EIEQAAMMAGPGSLEETVRNLVAEAMAELMAQ-------SQG 686 (688) Q Consensus 620 ~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~------~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~-------~q~ 686 (688) +++.++++....++++++++++..++.+.+.. +.+.+++. +..++..+++..........+.. ..| T Consensus 641 aq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~-~l~~~~a~~~~~~ea~~~~~~~~~~~~~~~~~~ 719 (763) T protein:vir:95 641 IRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQ-QLEITKALTKPRKEGELPPNLSAAIGYNALTNG 719 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccChhHHHhhhhcccccc Confidence 22222222222222222222222211111110 00000000 00000000000000000000000 000 Q ss_pred CC Q lcl|NC_020488. 
687 NA 688 (688) Q Consensus 687 ~~ 688 (688) .+ T Consensus 720 ~~ 721 (763) T protein:vir:95 720 ED 721 (763) T ss_pred cC Confidence 11 No 18 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=5.5e-85 Score=482.52 Aligned_cols=585 Identities=16% Similarity=0.172 Sum_probs=375.4 Q ss_pred CCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHH-HHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH Q lcl|NC_020488. 2 LPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHN-FDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ 80 (688) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~ 80 (688) |.|..+. ...+.++++..+...++.|.++.... ..++.++++||+|++|+. ...|+++++.|.|..+|++ T Consensus 1 ~~k~~~~---~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~------~~~~~s~~~~~~v~~~v~~ 71 (705) T protein:vir:88 1 MAKRRKI---KPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN------ERPGKSGIVSRDVQETVDW 71 (705) T ss_pred CCccccc---ccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc------ccCCCCccccHHHHHHHHH Confidence 4444433 34445669999999999999987744 458899999999999975 3579999999999999999 Q ss_pred HHHHHHh----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHcCCceE Q lcl|NC_020488. 81 VLGDQRQ----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEGGFGWL 155 (688) Q Consensus 81 i~g~~~~----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~G~G~~ 155 (688) +++++.. +..-+.|.|+.. 
+|.+.|++++.+++|+ .+.|+....++++|+|+++||+||+ T Consensus 72 ~~~~l~~~~~~~~~~~~~~p~~~---------------~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~ 136 (705) T protein:vir:88 72 IMPSLMKVFTSGGQVVKYEPDTA---------------EDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVV 136 (705) T ss_pred HHHHHHHhhcCCCceEEEeeCCh---------------hHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEE Confidence 9999986 455578888875 4899999999999996 6788889999999999999999999 Q ss_pred EEEEeeccCCC------------------------------------------CCcceeEEEecccceEEeCCccccccc Q lcl|NC_020488. 156 RVLTKYSTDDA------------------------------------------FDLDLCIKSIHNRFAVLMDPDATEPDY 193 (688) Q Consensus 156 ~v~~~~~~~~~------------------------------------------~~~~~~~~~v~~~~~v~~Dp~a~~~d~ 193 (688) +|+|+...... ..+.++++.| +|++|+|||+|++ + T Consensus 137 kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V-~p~d~~~dp~a~~--~ 213 (705) T protein:vir:88 137 KVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCV-KPENFLVDRLATC--I 213 (705) T ss_pred EeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeec-cHHHceecCCCCC--c Confidence 99986431110 0156788888 6999999999874 7 Q ss_pred ccCceEEEEecCCHHHHHHhcCCccch-hccccc------------cccc-----------ccCCCCCEEEEEEEEeeee Q lcl|NC_020488. 194 SDANWCFISERMSKAEFNKRYPGKAVG-DLSDAE------------RGEY-----------SWWTNEEGVRVSEYFYREP 249 (688) Q Consensus 194 ~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~------------~~~~-----------~~~~~~~~v~v~e~~~~~~ 249 (688) +|++|++++.++|+++|.+++++.+.. .+...+ .+.. .+..+...|.+.|||.+.. T Consensus 214 ~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d 293 (705) T protein:vir:88 214 DDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLD 293 (705) T ss_pred ccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEec Confidence 799999999999999999886554321 111110 0000 0011223466666666532 Q ss_pred cceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCC Q lcl|NC_020488. 
250 VTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGD 329 (688) Q Consensus 250 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~ 329 (688) .. +|| ..+.++..++|++|++.+ |++++||+.+.+ .++++ T Consensus 294 ~~------~d~-----------------------------~~~~~~~~~~g~~il~~~---~~~~~PF~~~~~--~p~~~ 333 (705) T protein:vir:88 294 VD------GDG-----------------------------ISELRRILYVGDYIISNE---PWDCRPFADLNA--YRIAH 333 (705) T ss_pred cc------CCc-----------------------------ceeeEEEEEeCccccccc---cCCCCCEEEecc--eeecC Confidence 11 111 123345556788888754 457899997544 46789 Q ss_pred cccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCc Q lcl|NC_020488. 330 KTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPAS 409 (688) Q Consensus 330 ~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 409 (688) ++||+|+++.++|+|+.+|+++|+++|+++++++|++++++|+++..+.+. ++++ .++.+++ ...+.++++++ T Consensus 334 ~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~~d~~~---~~pg-~vv~~~~---~~~i~~~~~~~ 406 (705) T protein:vir:88 334 KFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLT---NEAA-GIVRVKS---MNSITPLETPQ 406 (705) T ss_pred ccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCcccccc---cCCC-eeEEecC---CCccccccCCc Confidence 999999999999999999999999999999999999999999986444322 3444 4444443 24578889999 Q ss_pred chHHHHHHHHHHHHHHHHHhCcChHHcCCCcc----hhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
410 MPAAELQLALSATDEMKATIGLYDASVGAQGN----EQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRV 484 (688) Q Consensus 410 ~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~----~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~ 484 (688) +|+++++|+++..+.++++|||+++++|.+++ +.|+.+++++.++|++++..+++||+ ++++++|++++.||.+| T Consensus 407 ~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~ 486 (705) T protein:vir:88 407 LSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKY 486 (705) T ss_pred CcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999997643 56899999999999999999999997 68999999999999999 Q ss_pred cCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHH--HHHHHHHHHHHHHhhH---HH Q lcl|NC_020488. 485 YDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQ--RMEAADSLMQFVQAVP---AA 559 (688) Q Consensus 485 ~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~--r~~~~~~l~~~~q~~~---~~ 559 (688) |++++++||+| .|+.+++..+ .++|||.|+++++..++ +.+.+..++++.+.+. .+ T Consensus 487 ~~~~~~~ri~g-----~~v~v~~~~~--------------~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~ 547 (705) T protein:vir:88 487 QNQEEVFQLRG-----KWVAVNPANW--------------RERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGL 547 (705) T ss_pred CCCceEEeecc-----chhccchHhh--------------ccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccch Confidence 99999999998 5777775433 36788888887776554 4445555554443322 11 Q ss_pred HHH--------HHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 560 GGV--------VLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAK 631 (688) Q Consensus 560 ~~~--------~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~ 631 (688) .+. +...+++.+++....++....... +..+.+.++.+...++..+..++|+++++++++++. 
T Consensus 548 ~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~---------e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~ 618 (705) T protein:vir:88 548 GVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSP---------EALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALA 618 (705) T ss_pred hhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhH---------HHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 111 112233444444433333211100 000011111111111111222222222222222222 Q ss_pred HHHHHHHHHH----HHHHHHHH-------HHHHHH---HHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhcCC Q lcl|NC_020488. 632 AQADMAMAQA----KTAEAQAK-------LAEIEQ---AAMMAGPGSLEETVRN----LVAEAMAELMAQSQGNA 688 (688) Q Consensus 632 ~q~e~~~~q~----~~~~~~a~-------~~~~~~---~a~~~~~~~~~~~~~~----~~~~a~~~~~~~~q~~~ 688 (688) +++++.+.+. .+++.+.+ ..+.+. +...++.+...++... .+...+.++.....++. T Consensus 619 ~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~ 693 (705) T protein:vir:88 619 KQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKV 693 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH Confidence 2111111111 11111100 000000 0000000000000000 00000000000000000 No 19 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=1.4e-62 Score=359.70 Aligned_cols=573 Identities=12% Similarity=0.128 Sum_probs=347.7 Q ss_pred CCCCCCCcC--CCCccchHHHHHHHHHHHHHHHHhhhHHHHHHH----------HHHHhhCCCCCCHHHHHHHHhcCCCc Q lcl|NC_020488. 1 MLPGNEPIK--TRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQ----------EDISFLAGEQWPESVRKEREDEGRPC 68 (688) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~----------~~~~~~~G~Qw~~~~~~~~~~~g~p~ 68 (688) -+... ++. ...-.+.+.++..++.+++.+.+....+..+|. ++++||.|..+... ...+..||+. 
T Consensus 2 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~--~~~~~~~rs~ 78 (651) T protein:vir:80 2 KLATT-TTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSV--GDVNADWRHK 78 (651) T ss_pred ccccc-ccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccccc--CCCCCCCCcc Confidence 00000 111 123445567899999999999998876666664 45677877666332 2224468999 Q ss_pred eeehhHHHHHHHHHHHHHhC-Ccc---eEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQN-RPA---IQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAF 144 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~-r~~---~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~ 144 (688) +++|.++..|+++++++... .|. ++|.|.+.. +.+.+.+++++.++.+.+.++++...++.++ T Consensus 79 ~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~-------------d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 145 (651) T protein:vir:80 79 ITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPG-------------QDNLLVSRLIKRYVQDKLTEGKFRAAYANFL 145 (651) T ss_pred ccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCch-------------hHHHHHHHHHHHHHHHHhhccCcHHHHHHHH Confidence 99999999999999999985 222 455554321 1123345556666665566899999999999 Q ss_pred HHHHHcCCceEEEEEeeccCC-------------------------CCCcceeEEEecccceEEeCCcccccccccCceE Q lcl|NC_020488. 145 QHAVEGGFGWLRVLTKYSTDD-------------------------AFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWC 199 (688) Q Consensus 145 ~d~~~~G~G~~~v~~~~~~~~-------------------------~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~ 199 (688) +|+++.|.||++|+|+...+. ...+.++++.| +|.+|||||.++ ++.|+.|| T Consensus 146 ~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v-~p~~~~~dp~a~--~~~d~~~v 222 (651) T protein:vir:80 146 RQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVL-DMFDCFYDPNVT--DPNRGAFI 222 (651) T ss_pred HhhcccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEe-cHHHeeecCCCc--Ccccccee Confidence 999999999999988633110 01245778888 799999999886 47799999 Q ss_pred EEEecCCHHHHHHhc----C-Cccchh-cc---------------cccccccccCCCCCEEEEEEEEeeeecceeeeecc Q lcl|NC_020488. 
200 FISERMSKAEFNKRY----P-GKAVGD-LS---------------DAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLS 258 (688) Q Consensus 200 ~~~~~~~~~e~~~~~----p-~~~~~~-~~---------------~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~ 258 (688) +++.+ ++.++..+. + +..... .. .....+.......++|.|.|||++... T Consensus 223 ~~~~~-t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~-------- 293 (651) T protein:vir:80 223 RKLTK-TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHL-------- 293 (651) T ss_pred eeeee-eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeec-------- Confidence 98865 555554431 1 110000 00 000000011123467899999976311 Q ss_pred CCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcc-cCCCCCCCccceEEEeeeeeccCCcccccchH Q lcl|NC_020488. 259 DGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLE-GPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLI 337 (688) Q Consensus 259 ~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile-~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v 337 (688) +|..+ +.++ ..+.|..++. ..+|+++ .+||+++.+ .+++|++||+|++ T Consensus 294 e~~~~---------------------------~~~~-v~~~g~~il~~~~~~~~~-~~Pf~~~~~--~~~~~~~yG~g~~ 342 (651) T protein:vir:80 294 ENKTY---------------------------HDVV-VTIMGNEVLRFEQNPYWC-GRPFVIGTY--IPTARQPYAMGAL 342 (651) T ss_pred cCCce---------------------------EEEE-EEEcCcEEecccccCCCC-CCCeeeecc--eecCccccCCChH Confidence 11111 1122 2334455553 4566655 459997655 4679999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceec-CCCcchHHHHH Q lcl|NC_020488. 338 RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRD-MPASMPAAELQ 416 (688) Q Consensus 338 ~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 416 (688) +.+.|.|+.+|++.++++++++++++++|+++++++.+.+++. +.+++++ +.+.. 
..+.++ +.++.+...++ T Consensus 343 ~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~---~~pg~vi-~~~~~---~~~~~l~~~~~~~~~~~~ 415 (651) T protein:vir:80 343 QPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVY---TEPGKVF-LVSDH---GDLQPLANQSSNFSITYQ 415 (651) T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhh---cCCCceE-EecCC---CCceeeccCcccchhHHH Confidence 9999999999999999999999999999999999988887653 3455554 43322 223333 33456778899 Q ss_pred HHHHHHHHHHHHhCcChHHcCCCc---chhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCcceEEE Q lcl|NC_020488. 417 LALSATDEMKATIGLYDASVGAQG---NEQSGKAILARQRQGDRGTFAYIDNLSR-AIRRVGQILIELIPRVYDSDRVLR 492 (688) Q Consensus 417 ll~~~~~~~~~~tGv~d~~~G~~~---~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~~~li~~~~~~~r~~r 492 (688) ++++..+.++++|||++.++|..+ .+.|+.+|..+++++.+++..++++|.+ +++.++++++.|+.+||+.++++| T Consensus 416 ~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~r 495 (651) T protein:vir:80 416 ESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVR 495 (651) T ss_pred HHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccee Confidence 999999999999999999999754 3468999999999999999999999986 899999999999999999999999 Q ss_pred EeccCC-CcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh---HHHHHH-----H Q lcl|NC_020488. 493 LRFQDG-EGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV---PAAGGV-----V 563 (688) Q Consensus 493 i~~~~~-~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~---~~~~~~-----~ 563 (688) |++++. ...++.++ .+|+. ++||++ ..|+.....|.+..+.+.++++.. |++.+. 
+ T Consensus 496 i~~~~~~~~~~~~i~-------------~~dl~-~~~~iv-~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~ 560 (651) T protein:vir:80 496 VAGDEAGAYEYYELD-------------VEDLQ-KEVRLV-PIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRI 560 (651) T ss_pred ecccccccccccccC-------------cccee-eeeeee-eccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHH Confidence 998763 33444443 24554 577763 456555555666666666666543 332221 2 Q ss_pred HHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 564 LDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKT 643 (688) Q Consensus 564 ~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~ 643 (688) +..+++.+++++.+.++....++ .++++++....+++....+++...++....+.+ +.+.++++ T Consensus 561 ~~~l~~~~g~~~~~~~l~~~~q~--------------~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~--~~~~~~~~ 624 (651) T protein:vir:80 561 LVDLLQHWGFEEPEAYLKQQDQQ--------------APANPQEALLSQAKDVGGQAMSNMLQNQLQADG--GTQMMSEM 624 (651) T ss_pred HHHHHHHcCCCCcHHhcCCCccc--------------hhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH Confidence 34466677777665544211100 000001111111111111111111111100000 00001111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 644 AEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQ 683 (688) Q Consensus 644 ~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~ 683 (688) .. ..++++.+++..+ ..+. 
..++.+.+ T Consensus 625 ~~---~~~~~~~~~~~~~---------~~~~-l~~~~~~~ 651 (651) T protein:vir:80 625 YG---TPNADQMQQELMA---------TTPN-VSEQQLTQ 651 (651) T ss_pred HH---HHHHHHHHHHHHH---------HHHH-HHHhhccC Confidence 11 1111111111111 0000 11111111 No 20 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=2.4e-57 Score=331.01 Aligned_cols=602 Identities=17% Similarity=0.128 Sum_probs=367.3 Q ss_pred CCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHH Q lcl|NC_020488. 4 GNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLG 83 (688) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g 83 (688) =++...+...+..+.+..+|..++.+++.++.+|++..+...+-|.|..|+.+.... -||+|+..|..+++ T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~~---------r~nl~~sni~~i~P 71 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAET---------RWNLFSTNIQTQMA 71 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCcccc---------ccchhhhhHHHHhh Confidence 122222344444555788999999999999999999999999999999997765322 38999999999999 Q ss_pred HHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHH--HhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 84 DQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIE--YTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 84 ~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~--~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) ......|.|.|+||..+++.+. -...+|+++++++..+ +..+++..+..+..|+++||+|+++|+|+. 
T Consensus 72 ~iYar~P~p~V~~rf~d~d~~~----------~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~ 141 (663) T protein:vir:34 72 SLYGQTPKVSVSRRFADADDDV----------ARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEV 141 (663) T ss_pred hhhcCCCcceeeecccCcccch----------hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeec Confidence 9999999999999998754222 3456788888887665 557799999999999999999999999975 Q ss_pred ccC----------CCCC---------------cceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCC Q lcl|NC_020488. 162 STD----------DAFD---------------LDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPG 216 (688) Q Consensus 162 ~~~----------~~~~---------------~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~ 216 (688) +.+ +..+ ..++|.+| ++.+|++|| |+. |+++.|++.+.||++++++++|.. T Consensus 142 ~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v-~~~dfl~~p-Ar~--W~ev~wva~r~~mtk~e~~~rf~~ 217 (663) T protein:vir:34 142 EWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYL-HWQDVLWSP-ARV--WHEVRWLAFRNLLDMREFNARFDA 217 (663) T ss_pred ccchhccccccCCCccccchhcccccchhhcccceeeeee-chhhcccch-hhc--cccccceeeeccCCHHHHHHhhcC Confidence 432 1111 24778888 588999999 554 789999999999999999999944 Q ss_pred ccc----hhccccc----ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeee Q lcl|NC_020488. 217 KAV----GDLSDAE----RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRV 288 (688) Q Consensus 217 ~~~----~~~~~~~----~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 288 (688) ... ..++... ..+.....+.++.+|+|+|.|... +|+++.+|-.++-++.++ T Consensus 218 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~--~V~w~~eg~~~~L~~~~p------------------ 277 (663) T protein:vir:34 218 DGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGR--KVDWYVEGYSAVLDTQPD------------------ 277 (663) T ss_pred ChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCc--EEEEEEcCcceecccCCC------------------ Confidence 321 1111111 111111233468899999998632 333333332222111111 Q ss_pred eEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceee Q lcl|NC_020488. 
289 KTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVA 368 (688) Q Consensus 289 ~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~ 368 (688) +.-|++..|||..- +++.. .++.+-+-.++ .-.++|+++|.++..+.-+.. ..++++++ T Consensus 278 ------------~lgl~~ffPcPrpl------~~~~~-~ds~ipvpd~~-~y~~~~~E~n~~t~Rin~l~d-~ikv~gvy 336 (663) T protein:vir:34 278 ------------PLGLESFFPCPKPL------LANWT-TDKVVPRPDFV-LAQDLYKEIDLVSTRITLLER-AIRVVGVY 336 (663) T ss_pred ------------CCCCCCCCCCcccc------cceec-CCCeecCCcHH-HHHHHHHHHHHHHHHHHHHHh-hhhhceee Confidence 11122333333333 33322 23333233444 899999999988877665544 46889999 Q ss_pred chhhhcchHHHHhhcc----cCCCceeecCccccc-ccceecCCCcchHHHHHHHHH---HHHHHHHHhCcChHHcCCCc Q lcl|NC_020488. 369 PAESIEGYEEEWNQAN----RKNQSVLRYNAIPGV-DRPQRDMPASMPAAELQLALS---ATDEMKATIGLYDASVGAQG 440 (688) Q Consensus 369 ~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ll~~---~~~~~~~~tGv~d~~~G~~~ 440 (688) +.|+.+++-+...... .+-..+..+....+. ..+.+++-.++.+.+..|.+. .+.++.++||++|+++|... T Consensus 337 ~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~ 416 (663) T protein:vir:34 337 DKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASD 416 (663) T ss_pred ccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccC Confidence 8777754443222111 111111111111222 456677777777777777665 55567788999999999998 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeee Q lcl|NC_020488. 441 NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLV 520 (688) Q Consensus 441 ~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ 520 (688) .+.|+.|.+.+++.|+.|++.+.+.+.++.++++++..+.|.+.|+.+.+-+++|..-.. -++|-+ .+..+. 
T Consensus 417 a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~-~~ei~~-------~~~~L~ 488 (663) T protein:vir:34 417 PRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTF-DKELAP-------KAAELI 488 (663) T ss_pred cchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCc-ccchhH-------HHHHHh Confidence 889999999999999999999999999999999999999999999999888888753221 222211 233345 Q ss_pred ccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHH----------HHHHhc-----CCccHHHHHHHHHh Q lcl|NC_020488. 521 NDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLD----------LIAKNM-----DWPGAQDIARRLQK 585 (688) Q Consensus 521 ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~----------~~~e~~-----~~~~~~ei~~~~~~ 585 (688) ||-. ..|.|.|..+........+..+.++++++.+.++.+.+.+ ++.+++ .|....++...+.+ T Consensus 489 n~~~-r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~ 567 (663) T protein:vir:34 489 KSRF-SMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDK 567 (663) T ss_pred cCCC-cceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHH Confidence 5543 5688888887776666666666677776665554444333 233322 22222222222211 Q ss_pred hccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 586 TLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSL 665 (688) Q Consensus 586 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~ 665 (688) ... ..++...++.++++.++....+..+++.+++.+.+++|.+ +|.+....+.+.++.+..++.++.+.. T Consensus 568 ~~~-------~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e---~q~~~~~~ql~~~~~~~k~~~~a~~~~ 637 (663) T protein:vir:34 568 AIA-------AAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAE---VQGDLLRIQAETQANETKERQQAEWNV 637 (663) T ss_pred HHh-------hhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 0011111112233333333333333333444444433322 222233333333333333222222222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
666 EETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 666 ~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) +......+.....+.+..++-|. T Consensus 638 ~~a~q~~~~~~~~r~~~~~a~~~ 660 (663) T protein:vir:34 638 REAAQKNLISQAARAMNPQARNG 660 (663) T ss_pred HHHHHhhHHHHHHHhhchhhhcC Confidence 33333333334444444444444 No 21 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=1.2e-47 Score=277.82 Aligned_cols=528 Identities=13% Similarity=0.090 Sum_probs=331.2 Q ss_pred cCCCCccc-----hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDS-----QEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 8 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) |.++..+. .+.+...+...|+.+.++.+.+..+|.+-++||.+ +-.. -..-.+...+..+++|++...+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a-~~~~-~~~~~~~~~r~~~~~~k~~~~~~~i~ 78 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFA-TDTT-TTSNQGLPWKNSTTLPKLCQIRDNLH 78 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHh-hhhh-hhhhcccccccccchhHHHHHHHHHH Confidence 33322211 23345778888999999999999999999999987 2221 22244567788999999999999999 Q ss_pred HHHHhC----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 83 GDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 83 g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) .++... +-=+++.+..++ ..+...+++++.++..-+.++++..++...|+|+++.|.|++++. 
T Consensus 79 ~~l~~~~Fp~~~w~~~v~~~~~-------------~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~ 145 (584) T protein:vir:95 79 SNYFSSLFPNDDWLRWVGYGKG-------------DSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVS 145 (584) T ss_pred HHHHHhhcCccceeeeecCCCc-------------hhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEe Confidence 988764 222334333332 124445899999999888999999999999999999999999998 Q ss_pred EeeccCCCC-------CcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhc-----CCccchhccc-- Q lcl|NC_020488. 159 TKYSTDDAF-------DLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRY-----PGKAVGDLSD-- 224 (688) Q Consensus 159 ~~~~~~~~~-------~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~-----p~~~~~~~~~-- 224 (688) |....+... -.++++++| +|.+|||||++++ ++|+.||+ +..+|++++.++- |.-..+.+.. T Consensus 146 ~~~~~~e~~e~~~v~~~~~prieri-SP~d~~~Dpsa~~--i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~ 221 (584) T protein:vir:95 146 FEAKYKEMTDGTLVPDYIGPRLVRI-SPLDIVFNPLATS--ISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRRE 221 (584) T ss_pred EeecceeeeccccccccccceEEee-ChhheeecCCCCC--ccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHH Confidence 765432211 236889999 6889999999965 77999999 6668999998763 1111111100 Q ss_pred --------ccccccc------------c--CCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhh Q lcl|NC_020488. 225 --------AERGEYS------------W--WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTV 282 (688) Q Consensus 225 --------~~~~~~~------------~--~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~ 282 (688) .....++ . ......|.|.|+|-. .+ +..+. T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~--------------~~--~~~~~------------ 273 (584) T protein:vir:95 222 EICRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGD--------------YH--DKETG------------ 273 (584) T ss_pred HhccCCCCCcccccccccccccccccccccccCCceeEEEeeccc--------------cc--ccccC------------ Confidence 0000000 0 011234555555411 00 00000 Q ss_pred hheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020488. 
283 TRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAP 362 (688) Q Consensus 283 ~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~ 362 (688) ....+.++..+++++.|.-..+|+|++.+||+ ++.+.+.++++||+|+.+.+.|.|+++|.+++.++|++.++. T Consensus 274 ----e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~--~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~ 347 (584) T protein:vir:95 274 ----ELQTNRIITVVDRSTEVRNESIPTWFGSAPIY--HVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLII 347 (584) T ss_pred ----CCcccceEEEEeccEEEEeeecCCCCCCCCEE--EEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhc Confidence 00112233334445555445789999999998 444578899999999999999999999999999999999999 Q ss_pred CCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCc-chHHHHHHHHHHHHHHHHHhCcChHHcCCCcc Q lcl|NC_020488. 363 KAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPAS-MPAAELQLALSATDEMKATIGLYDASVGAQGN 441 (688) Q Consensus 363 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~ 441 (688) +|.+ +..++. ++. ..+++.++..-. . ...+++.++. .....++.|++....+++.|||+..++|.+++ T Consensus 348 ~pv~---k~~~~~-~~~---~~~pg~~~~~~~--~--~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~ 416 (584) T protein:vir:95 348 QPPL---KIIGEV-EEF---VWGPGAEIHLDQ--G--GDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTP 416 (584) T ss_pred Ccce---eecccc-chh---cccCCceeecCC--C--CCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccc Confidence 9832 222222 221 123343333211 1 1344454432 22345567899999999999999999997654 Q ss_pred -hhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHcCcceEEEEeccC-CCcceeeechhhhccccccee Q lcl|NC_020488. 442 -EQSGKAILARQRQGDRGTFAYIDNLSRAI-RRVGQILIELIPRVYDSDRVLRLRFQD-GEGDWVQINQMVMDEETQKPV 518 (688) Q Consensus 442 -~~sg~ai~~~~~~~~~~~~~~~dn~~~~~-~~~~~~~~~li~~~~~~~r~~ri~~~~-~~~~~v~~n~~~~~~~~~~~~ 518 (688) +.|+..++++.++++..+.++.+.+...+ ++++..+++...++.+...++|++++. 
+...|+.|.+ T Consensus 417 ~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r----------- 485 (584) T protein:vir:95 417 GEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTR----------- 485 (584) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccCh----------- Confidence 56888899999999999999999997755 888999999988999999999999876 5667777643 Q ss_pred eeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh-h-----HHH-HHHHHHHHHHhcCCccHHHHHHHHHhhccccc Q lcl|NC_020488. 519 LVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA-V-----PAA-GGVVLDLIAKNMDWPGAQDIARRLQKTLPPGI 591 (688) Q Consensus 519 ~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~-~-----~~~-~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~ 591 (688) +|+. |+|+++..-.... ..|++..+.+.++++. + |.+ +..+...+.+++++|.-. ++. T Consensus 486 --~Dl~-g~~~~va~Ga~~~-~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~-~~~---------- 550 (584) T protein:vir:95 486 --EDIT-ANGKIRPIGARHF-GKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYE-IFR---------- 550 (584) T ss_pred --hhhc-cCeeEEeehhhHH-HHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCccc-ccC---------- Confidence 5554 7888776554333 3577777777777763 1 111 111222344444444311 110 Q ss_pred cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 592 LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM 638 (688) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~ 638 (688) ... ..++|++.|+.+.+.|. 
..++++++ .++.+- T Consensus 551 ---~~~------~~~~Q~~~q~~~~~~q~-~~~~~~~~---~~~~~~ 584 (584) T protein:vir:95 551 ---PNV------AVAEQAETQSLVAQAQE-DLQLQAQM---PAEGAI 584 (584) T ss_pred ---CCc------ccchhHHHHhhhHHHHH-HHHHHHhh---hhccCC Confidence 000 00011111111111110 00111111 011000 No 22 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=2e-44 Score=260.21 Aligned_cols=568 Identities=12% Similarity=0.112 Sum_probs=324.5 Q ss_pred CCC-----CCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCC-----CCCHHHH---HHHHhcCCC Q lcl|NC_020488. 1 MLP-----GNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGE-----QWPESVR---KEREDEGRP 67 (688) Q Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~-----Qw~~~~~---~~~~~~g~p 67 (688) =+| -+|+.+ .....+++...++.+|+.+.+..+.|...|++.++||... ....... ......+|. T Consensus 4 ~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 81 (641) T protein:vir:94 4 EMPTPIIEDKESAK--RKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRH 81 (641) T ss_pred CCCcccccCCcchh--hcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccc Confidence 222 233333 3344566999999999999999999999999999888541 1111000 011234567 Q ss_pred ceeehhHHHHHHHHHHHHHhC----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHH Q lcl|NC_020488. 68 CLTLNKLPQYVDQVLGDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNA 143 (688) Q Consensus 68 ~~~~N~i~~~i~~i~g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~ 143 (688) .++.+.+...++++++.+... +.=+++.|++. +|.+.|++++.++++.+.+|++..+++.. 
T Consensus 82 ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~---------------ed~~~A~~~~~~~~~~l~~~~~~~~~~~~ 146 (641) T protein:vir:94 82 RINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVP---------------ELADAARVVKQLTKTKLEAASIRDIFETY 146 (641) T ss_pred cccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCC---------------ChHHHHHHHHHHHHHHHhhcchHHHHHHH Confidence 889999999999999888774 22235556554 48889999999999999999999999999 Q ss_pred HHHHHHcCCceEEEEEeeccC-----------CCC-----------CcceeEEEecccceEEeCCcccccccccCceEEE Q lcl|NC_020488. 144 FQHAVEGGFGWLRVLTKYSTD-----------DAF-----------DLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFI 201 (688) Q Consensus 144 ~~d~~~~G~G~~~v~~~~~~~-----------~~~-----------~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~ 201 (688) +++++..|.|++++.|+.... +.+ ...++++.| +|.+|||||.++. .+..|+++ T Consensus 147 ~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v-~~~di~~dps~~~---~~~~f~~~ 222 (641) T protein:vir:94 147 VRNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPL-SPYDVWLDTSGGK---NTGTFVRL 222 (641) T ss_pred HHHHhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEec-chhheeecCCCCc---ccccceeh Confidence 999999999999988764321 111 122456666 7889999998753 24455433 Q ss_pred E-ecCCHHHHHHh--cCCccchhccccccc--ccc-----cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchH Q lcl|NC_020488. 202 S-ERMSKAEFNKR--YPGKAVGDLSDAERG--EYS-----WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDV 271 (688) Q Consensus 202 ~-~~~~~~e~~~~--~p~~~~~~~~~~~~~--~~~-----~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 271 (688) + ..++..++... |+............. ..+ ...+..+.++.|+|... ..+| T Consensus 223 r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~--------~~d~----------- 283 (641) T protein:vir:94 223 RHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPL--------LVEG----------- 283 (641) T ss_pred hhhHHHHHHHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeee--------ccCC----------- Confidence 2 34444555443 322211111111000 000 00111122233333100 0011 Q ss_pred HHHHHHhhhhhhheeeeeEEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHH Q lcl|NC_020488. 
272 LDELRDLGTTVTRERRVKTYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYW 350 (688) Q Consensus 272 ~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~ 350 (688) ...+.+++.+ +|..|| ++.+++ ++.+||+.+ .+.+++|.+||.|++..+.+.|+.+|++ T Consensus 284 ----------------~~~~~~~~~~-~g~~il~~~~~~~-~d~~Pf~~~--r~~~~~~~~YG~gp~~~~l~dqk~ln~l 343 (641) T protein:vir:94 284 ----------------VQFWCVHAVF-YGKQLIRLSDSKY-WCGSPFVTT--TLLPDRDSVYGMSVLHPNLGALHVLNVL 343 (641) T ss_pred ----------------CceeeEEEEE-eCCEEeecccccc-cCcCCeEEe--cceecCCcccCCChHHHHHHHHHHHHHH Confidence 1123344333 445555 444543 456799854 4457899999999999999999999999 Q ss_pred HHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCc-chHHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 351 MTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPAS-MPAAELQLALSATDEMKATI 429 (688) Q Consensus 351 ~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ll~~~~~~~~~~t 429 (688) .+.+++++.++++|++++..+++.+.+++. ..+++ ++..+.. ..++++.+.. ......+++++....+++.+ T Consensus 344 ~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~---~~PG~-ii~~~~~---~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~ 416 (641) T protein:vir:94 344 TNGRLDNLVLHINKMWTLVEDGILKREDVK---AKPGA-VFKVAQH---GSLQPIDMGRQDFVVTYQEAQVQESSVYRNT 416 (641) T ss_pred HHHHHHHHHHHhCCeeeeccccccccceee---ccCCc-ceeeCCC---CcceeecCCccccchhHHHHHHHHHHHHHhh Confidence 999999999999999999888876654331 23444 4443322 2244443322 22344677888888999999 Q ss_pred CcChHHcCCCc---chhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCcceEEEEeccCCC-cceee Q lcl|NC_020488. 430 GLYDASVGAQG---NEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYDSDRVLRLRFQDGE-GDWVQ 504 (688) Q Consensus 430 Gv~d~~~G~~~---~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~-~~~v~ 504 (688) |+...++|..+ ++.|++.+..+.++++.++..+.++|. .+++.+++.++.++.++++.+.++|+.+.... ..++. 
T Consensus 417 ~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~ 496 (641) T protein:vir:94 417 STGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFE 496 (641) T ss_pred hhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCC Confidence 99888777643 356999999999999999999999997 69999999999999999999999999986421 12333 Q ss_pred echhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHH---HHHH-----HHHHHHHhcCCccH Q lcl|NC_020488. 505 INQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPA---AGGV-----VLDLIAKNMDWPGA 576 (688) Q Consensus 505 ~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~---~~~~-----~~~~~~e~~~~~~~ 576 (688) +. .+++ .|+|++ +..+.+....+.+..+.|+++++.... +... ++..+++.++++.. T Consensus 497 ~~-------------p~~L-~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p 561 (641) T protein:vir:94 497 VS-------------PEYL-HYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDP 561 (641) T ss_pred CC-------------ccce-eeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCc Confidence 22 2344 367887 567767777777777777777664322 1111 11222233333222 Q ss_pred HHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 577 QDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQA 656 (688) Q Consensus 577 ~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~ 656 (688) ..+. +. +.+++++.+..+++++ ++.+.+++.-... ..+++ .++....+.+ T Consensus 562 ~~~i---r~---------------~~~~~~~~~~~~~~~q--~~~~~~a~~~~~~-~~~~a---------~~~~~~~~~~ 611 (641) T protein:vir:94 562 MRYI---KK---------------AEAPPAAPPIAPAEPG--ALPPEMMNSVGGG-LNDQA---------IAGMTPEDVS 611 (641) T ss_pred hhhc---cC---------------ccCchhHHHHHHHHHH--HHHHHHHHHHHhh-hHHHH---------HHHhhHHHHH Confidence 2111 10 0011111111111111 1111111110000 00000 0010011111 Q ss_pred HHHHH------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
657 AMMAG------PGSLEETVRNLVAEAMAELM 681 (688) Q Consensus 657 a~~~~------~~~~~~~~~~~~~~a~~~~~ 681 (688) ++..+ ..+-++.+ +...+...+++ T Consensus 612 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 641 (641) T protein:vir:94 612 DLASRIGIDTSDVAPEAMA-AATQQITSGAL 641 (641) T ss_pred HHHHhhcCCchhhhHHHHh-cccccccccCC Confidence 11000 00001111 00000001111 No 23 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=1.3e-38 Score=228.34 Aligned_cols=551 Identities=12% Similarity=0.071 Sum_probs=321.3 Q ss_pred cCCCC---------ccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHH---HhcCCCceeehhHH Q lcl|NC_020488. 8 IKTRD---------DDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKER---EDEGRPCLTLNKLP 75 (688) Q Consensus 8 ~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~---~~~g~p~~~~N~i~ 75 (688) |++.- -++...+..++...|+...++++....+|.+-++|.+- .+..+. +...+-..+.|++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~-----~~tr~t~~~~~~w~~s~t~~k~~ 75 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDA-----TDTRKTSNSKLPFKNSTTINKLA 75 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhh-----hcccccccCCCCcccccchHHHH Confidence 33221 23334566777778888888888888888888888642 111111 11335578899999 Q ss_pred HHHHHHHHHHHhC----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 76 QYVDQVLGDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 76 ~~i~~i~g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) .++++++.+.... +-=++|.+-.++. 
.-...++++..++..-+.++++..+++..+.|.++.| T Consensus 76 ~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~-------------~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G 142 (599) T protein:vir:31 76 HLHLMITTSYMEHLLPNRNWVDFVGFDNDS-------------VNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRG 142 (599) T ss_pred HHHHHHHHHHHhhhcCCccceEeeecCCch-------------hHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccC Confidence 9999999988753 3333454444321 1344577888888888889999999999999999999 Q ss_pred CceEEEEEe-----eccCCCC--CcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCC-----ccc Q lcl|NC_020488. 152 FGWLRVLTK-----YSTDDAF--DLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPG-----KAV 219 (688) Q Consensus 152 ~G~~~v~~~-----~~~~~~~--~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~-----~~~ 219 (688) ..+..+.++ +++...+ -..+++++| +|++|||||+|.+ ++|+.||+ +...|+.+|..+-.+ ... T Consensus 143 ~~vat~~~er~~~~~~d~~v~~~~~~P~~erv-sP~Di~~Dp~A~s--i~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~ 218 (599) T protein:vir:31 143 FCVAHTRHVKRMTVTAENQVIKNYSGTVTERL-SPSDVFWDVTADS--LPKAAKCI-RQLYTLGSLKREIEEGTFPLMSM 218 (599) T ss_pred ceeEeeeEEEcceeecccccccccccceEEee-cccceeeCCCCCC--CCcceeee-ehhhhHHHHHHHhccCCccccch Confidence 887665543 2222111 234778888 7999999999975 66998888 888889999885422 111 Q ss_pred hh----------ccccccccccc--CCC----CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhh Q lcl|NC_020488. 220 GD----------LSDAERGEYSW--WTN----EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVT 283 (688) Q Consensus 220 ~~----------~~~~~~~~~~~--~~~----~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~ 283 (688) .. ......+.+.. +.+ +..-.+.||+--.++. +..+|.+... 
T Consensus 219 d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~Ve--------vLeywGd~yd-------------- 276 (599) T protein:vir:31 219 EDFQKLREERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVE--------VLTFMGDFYD-------------- 276 (599) T ss_pred HHHHHHHhhccCCCccccchhhhhhhccccccccccchhhhcccchhh--------hhhhhhhhhc-------------- Confidence 10 00111111111 111 1111222222111110 0011100000 Q ss_pred heeeeeEEEEEEEEEchhhh-cccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020488. 284 RERRVKTYKVKWMKVTAYDV-LEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAP 362 (688) Q Consensus 284 ~~~~~~~~~v~~~~~~~~~i-le~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~ 362 (688) ......-..+...+++...| .-+.+|+|++++||+ ++.+.|..+++||+|+...+.+.|..+|.+.+.+++++.+.. T Consensus 277 ee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyv--v~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l 354 (599) T protein:vir:31 277 EENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLH--IAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFL 354 (599) T ss_pred ccCCccccceEEEEecCcEEeecccCCCCCCCCCeE--EEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhh Confidence 00000011123334443344 456789999999999 555578899999999999999999999999999999998887 Q ss_pred CCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc- Q lcl|NC_020488. 363 KAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN- 441 (688) Q Consensus 363 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~- 441 (688) .+ .+...+.+...+-.| .| +.++.... ....+++.|++-.......+++....+++.||++.++.|.++. 
T Consensus 355 ~p-~l~~~~dl~~eD~~~----~P-~~v~~~~d---~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag 425 (599) T protein:vir:31 355 HP-SLKKVGDVREKGMRG----GP-NHVFEVEE---TGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAG 425 (599) T ss_pred cc-cccccccccccCccC----CC-CcceeecC---CCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccc Confidence 65 333333333322112 23 33333221 1234555665545556667888899999999999999997664 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCcceEEEEeccC-CCcceeeechhhhcccccceee Q lcl|NC_020488. 442 EQSGKAILARQRQGDRGTFAYIDNLSR-AIRRVGQILIELIPRVYDSDRVLRLRFQD-GEGDWVQINQMVMDEETQKPVL 519 (688) Q Consensus 442 ~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~~~li~~~~~~~r~~ri~~~~-~~~~~v~~n~~~~~~~~~~~~~ 519 (688) ..|+..++++.++++.+...+.+.+.+ +.+.+.+.++++.++|+|++.++||++++ |...|+.|.+ T Consensus 426 ~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~r------------ 493 (599) T protein:vir:31 426 EKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITA------------ 493 (599) T ss_pred hhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeeh------------ Confidence 469999999999999999999999976 56779999999999999999999999976 7888999865 Q ss_pred eccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh Q lcl|NC_020488. 520 VNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE 599 (688) Q Consensus 520 ~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~ 599 (688) +|+. +.+++ +..|...-..|++..+.+.++++. ++.+.+.|.+... -...+.+.+..+...+.... 
T Consensus 494 -edl~-~~~~~-v~~Ga~~v~ere~~~q~l~~il~~--~~~q~~~P~~~~k----~l~~~l~~~~~l~~~~~~~~----- 559 (599) T protein:vir:31 494 -DDLN-LNGQM-VAQGATLFAEKANTLQNLNAILGG--PLGAALAPHMSRT----KLFNAVEYLGDLDAYGIFTF----- 559 (599) T ss_pred -hhhh-CCeee-eechhhHHHHHHHHHHHHHHHhcc--cCCCccchhhHHH----HHHHHHHHHHhccccccCCC----- Confidence 3443 56776 666666556677777777777752 1111111111100 00111111111111110000 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 600 AGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 600 ~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~ 645 (688) .-...++|.+.+.+|+++++- -+.+..+.+-..-.+.+.+ T Consensus 560 -----~va~~eqq~~~~m~Q~~lq~~-~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 560 -----GIGVQEDQQLARMAQKSTQQT-EETALTQEEVGGPTTDTGQ 599 (599) T ss_pred -----chhHHHHHHHHHHHHHHHHHh-HhhhhhhhhcCCCCcccCC Confidence 000011111111111111100 0000000000000000000 No 24 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.92 E-value=3.4e-22 Score=138.30 Aligned_cols=521 Identities=11% Similarity=0.026 Sum_probs=269.2 Q ss_pred CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCC----CCCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 7 PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAG----EQWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G----~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) +++++.+-..+ +.++.+|+...+..+.|...|++..+|..- +.+...- .....+.-..-...+++.. 
T Consensus 1 m~~~~~~~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~------~~~~~~~dst~~~a~~~La 71 (535) T protein:vir:33 1 MADSKRTGLGE---DGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNES------TDYTTPWQAVGARGLNNLA 71 (535) T ss_pred CChhhhhccCh---hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc------ccccccccccHHHHHHHHH Confidence 44444333333 345667777788888899999999998632 2221100 0111122233333444444 Q ss_pred HHHHhC----CcceEEEeCCccccccccccccccChhhHHHH---HHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceE Q lcl|NC_020488. 83 GDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLA---EVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWL 155 (688) Q Consensus 83 g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~ 155 (688) +.+... ++=+++.+.+... ......+.+-.+.. +..+..+...+..|++..+...++.+.+..|.|++ T Consensus 72 a~l~~~ltP~~~WF~l~~~d~~~-----~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:33 72 SKLMLALFPMQSWMKLTISEYEA-----KQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHHhhcCCCcccccccChHHH-----hccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 444432 2212222222110 00000000011122 34444555556789999999999999999999987 Q ss_pred EEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 156 RVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 156 ~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) .+. . + .++.+.+..+ +..++++..++.- ...-++++..+|..++-+.|+...... ....+ . 
T Consensus 147 ~~~--~---~-~~~~~~f~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~~~---~~~k~-----~ 207 (535) T protein:vir:33 147 YLP--E---P-EGSYNPMKLY-RLSSYVVQRDAYG----NVLQIVTRDQIAFGALPEDVRSAVEKS---GGEKK-----M 207 (535) T ss_pred Eee--c---C-CCCceeeEEE-EcCeeEEeeCCCC----CeeEEEeeEeecHHHHHHHhhhhhccc---ccccc-----c Confidence 653 1 1 1233445555 4566777655431 234488999999999999887643221 11111 1 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) .+.+.++.+.++... ++. +.++ ....|..+....+.|+++.+ T Consensus 208 ~~~~~v~~~v~~~~~--------~~~-----------------------------~~~~-~~~~~~~~~~~~~~~~~~~~ 249 (535) T protein:vir:33 208 DEMVDVYTHVYLDEE--------SGD-----------------------------YLKY-EEVEDVEIDGSDATYPTDAM 249 (535) T ss_pred ccCCeEEEEEEeeCC--------CCc-----------------------------EEEE-EEEeCccccccccccccccC Confidence 123444444333211 111 1111 12223333333355677899 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA 395 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 395 (688) ||+++.+ ..++|..||.|.+....+-.+.+|.+....+.......+++++++.+.+.+..++.. ..+|.++... 
T Consensus 250 P~i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~---~~~g~~v~g~- 323 (535) T protein:vir:33 250 PYIPVRM--VRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTK---AQTGDFVPGR- 323 (535) T ss_pred Cceeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhccc---CCceeeecCC- Confidence 9996544 567999999999999999999999999999999999999999999888877655432 1223333221 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_020488. 396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVG 474 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~ 474 (688) .+.-.+.......-.....+.++...+.|.... ..+.+...++...|++-|..+.+.....|...+.+|. .++..+. T Consensus 324 -~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (535) T protein:vir:33 324 -REDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (535) T ss_pred -cccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH Confidence 121122222233344556777888888887754 3333333556678999999999999999999999985 5777777 Q ss_pred HHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH Q lcl|NC_020488. 475 QILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ 554 (688) Q Consensus 475 ~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q 554 (688) +..+.++.+.. . 
+.+ + + .+.+.+.+..+ -....|....+.++++++ T Consensus 402 ~r~~~il~r~g---------------~---lP~--~-p------------~~~v~~~yis~-La~aqr~~~~~~l~~~~~ 447 (535) T protein:vir:33 402 RVLLKQLQATS---------------Q---IPE--L-P------------KEAVEPTISTG-LEAIGRGQDLDKLERCIS 447 (535) T ss_pred HHHHHHHHhcC---------------C---CCC--C-C------------ccceeEEEecH-HHHHHHHHHHHHHHHHHH Confidence 77777765421 0 000 0 0 01234444433 334567777777777776 Q ss_pred hhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_020488. 555 AVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADT-AKAQ 633 (688) Q Consensus 555 ~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~-~~~q 633 (688) .+.++.+ ..+. ..-+.+++...+....+-........+++.++..+ ++.+++++++ ++.+ -... T Consensus 448 ~la~~~P----~~~d--~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~----q~~~~~~~~~-----~~~~~g~~~ 512 (535) T protein:vir:33 448 AWAALAP----MQGD--PDINLAVIKLRIANAIGIDTSGILLTDEQKQALMM----QDAAQTGVEN-----AAAAGGAGV 512 (535) T ss_pred HHHhhCh----hhhh--ccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHH----HHHHHHHHHH-----HHHhhhhhh Confidence 5443322 2111 11255677766665554321111110000000000 0000000000 0000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 634 ADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 634 ~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) +..++.. 
..+++..+...=-|+ T Consensus 513 ~~~~~~~---------------------------------~~~~~~~~~~~g~~~ 534 (535) T protein:vir:33 513 GALATSS---------------------------------PEAMQGAAAKAGLNA 534 (535) T ss_pred cchhhcC---------------------------------ChhHHHHHHhccCCC Confidence 0000000 000001111111111 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.91 E-value=2.4e-22 Score=139.15 Aligned_cols=545 Identities=12% Similarity=0.047 Sum_probs=275.5 Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC--CCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh----- Q lcl|NC_020488. 15 SQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLA--GEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ----- 87 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~--G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~----- 87 (688) =.+...++++.+|+...+..+.|...|++..+|.. ...+..++...- ....+.+.-+.....++++.+.+.. T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~-~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRN-DRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcc-cccccccccchHHHHHHHHHHHHHHhhcCC Confidence 12235678899999999999999999999999962 212222211111 1122333444455555555555444 Q ss_pred CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 88 NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 88 ~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) +++=++..+.+..... ..+- .+--+..++.+...+..+++..+...++.+.+..|.|++.+.. +. 
T Consensus 80 ~~~WF~l~~~d~~~~e--------~~~v-~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~--d~---- 144 (559) T protein:vir:95 80 ARPWFRLATPDPEMMD--------YGPV-KLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD--DD---- 144 (559) T ss_pred CCcccccccCCccccc--------hHHH-HHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec--CC---- Confidence 3344444443322100 0011 1122344555666677899999999999999999999976532 11 Q ss_pred CcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccccCCCCCEEEEEEEEe Q lcl|NC_020488. 168 DLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYSWWTNEEGVRVSEYFY 246 (688) Q Consensus 168 ~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~v~v~e~~~ 246 (688) +..+++..+ +..++++..++.- ...-|+++..||..++.++|+..... ..... +....+.+.+.|+.+-| T Consensus 145 ~~~~r~~~~-~l~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~----~~~~~~~~~v~v~~~V~ 215 (559) T protein:vir:95 145 EDIIRTMPF-PIGSYYLANSPRG----SVDTCFRKFSMTVRQLVQEFGLNNVSESVKSM----WESGTYEKWIEVMHSVY 215 (559) T ss_pred CceeEEEEe-ecCeEEEeeCCCC----CeEEEEEeEecCHHHHHHHcCcccCCHHHHHH----HhcCCCCCeEEEEEEEe Confidence 233566666 5678888776532 23347888999999999999875432 21111 11122234455554433 Q ss_pred eeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEch-hhhcccCCCCCCCccceEEEeeeee Q lcl|NC_020488. 247 REPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTA-YDVLEGPVDWPGSTIPVAPVLGKEM 325 (688) Q Consensus 247 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~-~~ile~~~p~~~~~~P~vp~~~~~~ 325 (688) -+... ..+ ....+..+...+||...+. .+++.. +.| .++||+++.+ . 
T Consensus 216 pr~~~------~~~---------------------~~~~~~~pf~s~~~e~~~~~~~~l~e-sg~--~e~P~~~~Rw--~ 263 (559) T protein:vir:95 216 PNIDR------DTS---------------------KLDSKNKPFKSVYYEVGGDNDKLLRE-SGF--DEFPIMAPRW--E 263 (559) T ss_pred ccccc------ccc---------------------ccccccceEEEEEEEecCCCceeeec-CCc--ccCCccceee--e Confidence 21100 000 0011112233444443322 344543 223 6799996544 5 Q ss_pred ccCCcccccc-hHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccccccccee Q lcl|NC_020488. 326 VIGDKTYYRG-LIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQR 404 (688) Q Consensus 326 ~~~~~~~g~g-~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (688) ..+|..||.| .+....+-.+.+|.+....+..+.+..+|+++++.+...... +..+|++...+...+.+.+.. T Consensus 264 ~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~------~l~pgg~~~~~~~~~~~~i~p 337 (559) T protein:vir:95 264 VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRA------SLLPGDITYIDQITGQDGFRP 337 (559) T ss_pred ecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccce------eeeccceeeeCCCCCccccee Confidence 6799999999 599999999999999999999999999999999877653221 122333333333333334433 Q ss_pred cCC-CcchHHHHHHHHHHHHHHHHHhCcCh-HHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_020488. 405 DMP-ASMPAAELQLALSATDEMKATIGLYD-ASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIEL 480 (688) Q Consensus 405 ~~~-~~~~~~~~~ll~~~~~~~~~~tGv~d-~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~l 480 (688) ... ..-...+...++...+.|....-.+- .+++ .++...|++.|..+.+.....|.....+|. 
.+...+....+.+ T Consensus 338 ~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~i 417 (559) T protein:vir:95 338 AYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSM 417 (559) T ss_pred ecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 322 11223333445666677766553322 1233 344567999999999999999999998884 5777777777777 Q ss_pred HHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHH Q lcl|NC_020488. 481 IPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAG 560 (688) Q Consensus 481 i~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~ 560 (688) +.+.- .|.+.. ..+....++|..- ++-...++......+.++++.+..+. T Consensus 418 l~r~g------------------~lP~~p-----------~~l~~~~i~v~~i-s~La~aqk~~~~~~i~~~~~~~~~la 467 (559) T protein:vir:95 418 MVRKN------------------MLPPPP-----------DVMEGMPLKVEYI-SVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) T ss_pred HHhcC------------------CCCCCc-----------ccccCcceEEEee-cHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66541 010000 0010112333332 22333455666666666666555544 Q ss_pred HHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 561 GVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQ 640 (688) Q Consensus 561 ~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q 640 (688) ++ .|.++.. -+.+++...+....+-.. ......++..+..++.+++|+++++ + +... ++ ++ T Consensus 468 q~-~Pevld~---id~d~~~~~~a~~~Gvp~-~~irs~~ev~~~rqqr~~~qq~~q~--~-----~~~~---~a----a~ 528 (559) T protein:vir:95 468 QV-KPEALDK---LNVDQAIDAFADMSGVSP-TVIVPQEQVEQARQQRAQQQQQQQM--M-----AMGM---AA----AQ 528 (559) T ss_pred cc-Chhhhhc---CCHHHHHHHHHHHhCCch-hhcCCHHHHHHHHHHHHHHHHHHHH--H-----HHHH---HH----HH Confidence 42 3444443 345666666655443321 1100000000000000000000000 0 0000 00 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 
641 AKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQ 685 (688) Q Consensus 641 ~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q 685 (688) .-....+++.. ...++.... -+..-...++| T Consensus 529 ~~~~~~~~~~~---------~~~~l~~~~-----~~~~~~~~~~~ 559 (559) T protein:vir:95 529 GVKTLSEAKTS---------DPSVLSAMA-----NAVSGQGGQSQ 559 (559) T ss_pred hhhccccccCC---------ChhHHHHHH-----HhhcCccccCC Confidence 00000000000 000000000 00000000000 No 26 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.91 E-value=5.1e-22 Score=137.31 Aligned_cols=520 Identities=11% Similarity=0.015 Sum_probs=270.4 Q ss_pred CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCC----CCCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 7 PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAG----EQWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G----~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) +++++.+...+ +.++.+|+...+.++.|...|++..+|..- +.+... ..+ ...+.-..-...+++.. T Consensus 1 m~~~~~~~~~~---~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~-----~~~-~~~~~dst~~~a~~~La 71 (535) T protein:vir:15 1 MADSKRTGLGE---DGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNE-----STD-YTTPWQAVGARGLNNLA 71 (535) T ss_pred CCccchhccch---HHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcc-----ccc-ccccccccHHHHHHHHH Confidence 44444333333 345667777778888899999999998632 222211 001 11122233334444444 Q ss_pred HHHHhC----CcceEEEeCCccccccccccccccChhhHHHH---HHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceE Q lcl|NC_020488. 83 GDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLA---EVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWL 155 (688) Q Consensus 83 g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~A---e~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~ 155 (688) +.+... ++=+++.+.+... ........+-.+.. 
+..+..+...+..|++..+...++.+.+..|.|++ T Consensus 72 a~l~~~ltP~~~WF~l~~~d~~~-----~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:15 72 SKLMLALFPMQSWMKLTISEYEA-----KQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHHhhcCCCcccccccChHHH-----hccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 444432 2222222222110 00000000011122 23444455556789999999999999999999986 Q ss_pred EEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 156 RVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 156 ~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) .+. . + .++.+.+..+ +..++++..++.- ...-++++..||.+++-+.|+...... ... ... T Consensus 147 ~~~--~---~-~~~~~~f~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~~~~-------~~~-~~~ 207 (535) T protein:vir:15 147 YLP--E---P-EGSYNPMKLY-RLSSYVVQRDAYG----NVLQIVTRDQIAFGALPEDVRSAVEKA-------GGE-KKM 207 (535) T ss_pred Eee--c---C-CCCceeeEEE-EcCeeEEeeCCCC----CeeEEEEeEeecHHHHHHHHhHhhhcc-------ccc-cCC Confidence 653 1 1 1233455555 4567787765432 344588999999999988876543211 111 112 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) .+.|.|+++.+++... +. +.++. 
...|..+.-..+.|+++.+ T Consensus 208 ~~~v~v~~~v~~~~~~--------~~-----------------------------~~~~~-e~~g~~~~~~~~~~~~~~~ 249 (535) T protein:vir:15 208 DEMVDVYTHVYLDEES--------GD-----------------------------YLKYE-EVEDVEIDGSDATYPTDAM 249 (535) T ss_pred CCceeEEEEEEEecCC--------Cc-----------------------------EEEEE-EeeCccccccccccccccC Confidence 3456777666543211 11 11111 1223222212345677899 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA 395 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 395 (688) ||+++.+ ..++|..||.|.+....+-.+.+|.+....+.......+++++++.+.+.+..++.. ..+|.++... T Consensus 250 P~i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~---~~~g~~v~g~- 323 (535) T protein:vir:15 250 PYIPVRM--VRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTK---AQTGDFVPGR- 323 (535) T ss_pred Cceeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhccc---CCceeeecCC- Confidence 9996544 567999999999999999999999999999999999999999998888877654432 1223333221 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_020488. 396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVG 474 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~ 474 (688) .+.-.+.......-.....+.++...+.|.... ..+.+...++...|++-|..+.+.....|...+.+|. .++..+. 
T Consensus 324 -~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (535) T protein:vir:15 324 -REDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (535) T ss_pred -cccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH Confidence 121122222233344556777888888887754 3333333556678999999999999999999999985 5777777 Q ss_pred HHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH Q lcl|NC_020488. 475 QILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ 554 (688) Q Consensus 475 ~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q 554 (688) +..+.++.+.. . +.+ + + .+.+.+.+..+ -....|....+.++++++ T Consensus 402 ~r~~~il~r~g---------------~---lP~--~-p------------~~~v~~~yis~-La~aqr~~~~~~l~~~~~ 447 (535) T protein:vir:15 402 RVLLKQLQATS---------------Q---IPE--L-P------------KEAVEPTISTG-LEAIGRGQDLDKLERCIS 447 (535) T ss_pred HHHHHHHHhcC---------------C---CCC--C-C------------ccceeEEEecH-HHHHHHHHHHHHHHHHHH Confidence 77777765421 0 000 0 0 01134444433 334567777777777776 Q ss_pred hhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhcccccc--chhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 555 AVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGIL--DQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKA 632 (688) Q Consensus 555 ~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~ 632 (688) .+.++.+ ..+. ..-+.+++...+....+-... -..+.+.++..+ +++++ + .+.+++.++ T Consensus 448 ~la~~~P----~~ld--~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~-----q~~~~-----~-~~~~~a~~~-- 508 (535) T protein:vir:15 448 AWAALAP----MQGD--PDINLAVIKLRIANAIGIDTSGILLTDEQKQALMM-----QDAAQ-----T-GIENAAATG-- 508 (535) T ss_pred HHHhcCh----hhhh--ccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHH-----HHHHH-----H-HHHHHHHHH-- Confidence 5433322 1111 112556777666655443211 111110000000 00000 0 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
633 QADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 633 q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ....+ +. +.. .-+.+ +.+....=.+| T Consensus 509 g~~~~-~~-------~~~-------~p~~~---------------~~~~~~~g~~~ 534 (535) T protein:vir:15 509 GAGVG-AL-------ATS-------SPEAM---------------QGAAAQAGLDA 534 (535) T ss_pred Hhhcc-ch-------hcc-------ChHHH---------------HHHHhccCCCC Confidence 00000 00 000 00000 00111111111 No 27 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.91 E-value=4.5e-22 Score=137.59 Aligned_cols=529 Identities=12% Similarity=0.073 Sum_probs=273.9 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC---CC--CCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLA---GE--QWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~---G~--Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) |+. +...++++++.+|+...+..+.|...|++..+|.. |. -++..+...-..+. +.+.-..-...+++.. T Consensus 1 m~~----d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~-~~~~dstg~~a~~~LA 75 (549) T protein:vir:10 1 MTN----DDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERS-QKMFDSTAPLALRNFV 75 (549) T ss_pred CCc----chHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccc-cccccchHHHHHHHHH Confidence 444 33679999999999999999999999999999963 21 12211111000010 1122222333344444 Q ss_pred HHHHh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHH---HHHHHHH--HhcChHHHHHHHHHHHHHcCC Q lcl|NC_020488. 83 GDQRQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYE---SLIRNIE--YTSNAEAHYDNAFQHAVEGGF 152 (688) Q Consensus 83 g~~~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~---~~i~~~~--~~~~~~~~~~~~~~d~~~~G~ 152 (688) +.+.. +++=+++.+.+.. ..+..+..+.|. ..+..++ ..+++..+...++.+.+..|. 
T Consensus 76 s~l~~~ltpp~~~wF~l~~~~~~------------~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gt 143 (549) T protein:vir:10 76 AAMDSMITPATQLWHRLKTGNDA------------LNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGP 143 (549) T ss_pred HHHHhhccCCCCccccccCCccc------------hhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcc Confidence 44333 2333344333221 011112222233 3333322 368899999999999999999 Q ss_pred ceEEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccc Q lcl|NC_020488. 153 GWLRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYS 231 (688) Q Consensus 153 G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~ 231 (688) |++.+. . ++ +..+.+..+ +..++++..++. . ...-++++..||..++.++||..+.. ........ T Consensus 144 a~l~~~--~---~~-~~~~~f~~~-pl~~~~v~~d~~-G---~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~--- 209 (549) T protein:vir:10 144 GALMIE--H---DV-GKGIVYRNV-PMQRLWFAENNS-G---LIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEK--- 209 (549) T ss_pred eeeEEe--e---cC-CCeeEEEEE-EcCeEEEeeCCC-C---CeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhc--- Confidence 997763 2 11 223455555 456777766543 1 22337889999999999999875432 21111111 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) . ..+.+.|+.+=|.+.... ... ...+ .+.+..+|+..++..++... . 
T Consensus 210 -~-~~~~~~v~~~V~pr~~~~--~~~-------------------------~~~~-~~pf~sv~~e~~~~~il~es-g-- 256 (549) T protein:vir:10 210 -D-PEKSAIFYHAVEPRADRD--PRK-------------------------LDGR-NMQFASYWLDEGRDRIVQNS-G-- 256 (549) T ss_pred -C-CCceEEEEEEeecCCCCC--ccc-------------------------cccc-cCceEEEEEEecCCEeeccC-C-- Confidence 1 124455543322211000 000 0001 12234445555666666542 2 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) +.++||+|..+ ...+|..||.|.+....+-.+.+|.+....+.......+|+++++.+.+.+..+. . +|++. T Consensus 257 ~~e~P~~~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~l-----~-pgg~~ 328 (549) T protein:vir:10 257 FRTFPFAIGRF--YVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFDL-----R-SGALN 328 (549) T ss_pred cccCCcceeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccccee-----c-cCCcc Confidence 35789996544 5679999999999999999999999999999999999999999988766554332 2 22322 Q ss_pred ecCcc-cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_020488. 392 RYNAI-PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RA 469 (688) Q Consensus 392 ~~~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~ 469 (688) ....+ .+...+..+....--......++...+.|....=++...+-.++...|++.|..+.+.....|.....++. 
.+ T Consensus 329 ~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~ 408 (549) T protein:vir:10 329 WGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSEL 408 (549) T ss_pred ccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 11111 12223333333323344556677777777776533332333456678999999999999999999998885 67 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccce-ee-eEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIA-AG-KFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~-~~-~~dv~v~~~~~~~s~r~~~~~ 547 (688) ...+....+.++.+. | . |.+. ..++. .| .++|.. +++=...++..... T Consensus 409 l~Pli~R~~~il~r~----------g---~-----lP~~-----------p~~l~~~~~~~~i~y-is~La~aq~~~~~~ 458 (549) T protein:vir:10 409 LGPMIAREVDILAEA----------G---Q-----LPDM-----------PQELIDAGADVDVEY-DSPLNKAMRAGEGA 458 (549) T ss_pred HHHHHHHHHHHHHhc----------C---C-----CCCC-----------ChhhhcCCceeEEEe-ecHHHHHHHHHHHH Confidence 777777777766552 1 0 0000 00110 01 223332 23334455666666 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKA 627 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 627 (688) .+.++++.+..+.+ +.|.++... +.+++...+....+-...-....++.++. . ++.++++++.+++++ + T Consensus 459 ~i~~~~~~~~~laq-~~Pe~ld~i---d~d~~~~~~a~~~Gvp~~~irs~eev~~~-r--~~~~~qqq~~~~~~~----a 527 (549) T protein:vir:10 459 AILQWLQQLGIVSQ-FDPAAAKVP---NGARIARLLADYGGVPVEAMSTDEELQAQ-Q--AAEAQAAQMQQMLAA----A 527 (549) T ss_pred HHHHHHHHHHHHhc-cChhHHhcC---CHHHHHHHHHHhcCCCccccCCHHHHHHH-H--HHHHHHHHHHHHHHH----H Confidence 67777766555544 344444443 44666666655544321100010000000 0 000000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
628 DTAKAQADMAMAQAKTAEAQAKLA 651 (688) Q Consensus 628 e~~~~q~e~~~~q~~~~~~~a~~~ 651 (688) . ..+++....+++.++ ++..+. T Consensus 528 ~-~a~~~a~~~~~~~ta-~~~~~~ 549 (549) T protein:vir:10 528 P-VAAGAIKDLSDAQTA-AQTARV 549 (549) T ss_pred H-HHHHHHHhhhhhcCC-CcccCC Confidence 0 000000000111111 000000 No 28 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.91 E-value=2e-22 Score=139.54 Aligned_cols=522 Identities=12% Similarity=0.052 Sum_probs=272.7 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~ 87 (688) |+. .+.....+.++.+|+...+.++.|...|++..+|..-.-.+.+. . ........+.-+.-...+++..+.+.. T Consensus 1 m~~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~-~-~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) T protein:vir:10 1 MAE---KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS-D-NASTDYQTPWQAVGARGLNNLASKLML 75 (536) T ss_pred Ccc---hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCC-C-cccccccccccccHHHHHHHHHHHHHh Confidence 222 22233456777888888888889999999999987432111100 0 011111223334444455555554444 Q ss_pred CC----cceEEEeCCccccccccccccccChhhHHH------HHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 88 NR----PAIQVHPVEANATKDTSKVPNVAGTSDYSL------AEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 88 ~r----~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~------Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) .- |=+++.+.+.... .....+.+. 
-+..++.+...+..|++..+...++.+.+..|.|++.+ T Consensus 76 ~ltP~~~WFrl~~~d~~~~--------~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~ 147 (536) T protein:vir:10 76 ALFPMQTWMRLTISEYEAK--------QLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL 147 (536) T ss_pred hhcCCCcccccccChhhhh--------ccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEE Confidence 32 2112222221100 000011111 23345556666778999999999999999999998654 Q ss_pred EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) + +++..+...++.+ +..++++..++.- ...-++++..||..++.+.||+..... ....+ ..+ T Consensus 148 --~---e~~~~~~~~~~~~-pl~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~fg~~~~~~---~~~~~-----~~~ 209 (536) T protein:vir:10 148 --P---EPEGSNYNPMKLY-RLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVEGQ---GGEKK-----ADE 209 (536) T ss_pred --e---eCCCCceeeEEEE-EcCeEEEeeCCCC----CeeEEeeeeeccHHHHHHhhhhhhccc---ccccC-----ccc Confidence 2 2222233333344 4566776654321 334478999999999999998643221 11111 124 Q ss_pred EEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccce Q lcl|NC_020488. 238 GVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~ 317 (688) .|.|+++-+.+... +. +. 
++....|..++.....++...+|| T Consensus 210 ~v~v~~~V~~~~~~--------~~-----------------------------~~-~~~e~~g~~v~~~~g~~~f~~~P~ 251 (536) T protein:vir:10 210 TIDVYTHIYLDEAS--------GE-----------------------------YL-RYEEVEGMEVQGSDGTYPKEACPY 251 (536) T ss_pred ceEEEEEEEEecCC--------Cc-----------------------------EE-EEEeecCccccccccccccccCCc Confidence 56666554443211 11 11 122345555555556677889999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +++.+ ...+|..||.|.+....+-.+.+|++....+.......++.++++.+.+.+...+.. ..+|.++.-. . T Consensus 252 i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~---~~~g~~v~g~--~ 324 (536) T protein:vir:10 252 IPIRM--VRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK---AQTGDFVTGR--P 324 (536) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhcc---CCCcceecCC--c Confidence 96554 467999999999999999999999999999999999999999999888877665432 2234443311 1 Q ss_pred ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_020488. 398 GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQI 476 (688) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~ 476 (688) +.-.+.......--+...+.++...+.|.+..=+ +.+.-.++...|++-|..+.+.....|...+.+|. .++..+.+. 
T Consensus 325 ~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r 403 (536) T protein:vir:10 325 EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRV 403 (536) T ss_pred ccceeeeccccccchHHHHHHHHHHHHHHHHHhh-hhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 2112222333334455677788888888876632 22222456678999999999999999999998885 466667666 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) .+.++... | .+-.+ ..++ +.+.+.++. ....|.+.++.++.+++.+ T Consensus 404 ~~~il~r~-------------g--~lP~~--------------p~~~----v~~~~vs~l-~~l~r~~~~~~l~~~~~~l 449 (536) T protein:vir:10 404 LLKQLQAT-------------Q--QIPEL--------------PKEA----VEPTISTGL-EAIGRGQDLDKLERCVTAW 449 (536) T ss_pred HHHHHHhC-------------C--CCCCC--------------Chhh----ccceEEecH-HHHHHHHHHHHHHHHHHHH Confidence 66666432 1 00000 0111 122332332 2455666777777777655 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD-TAKAQ 633 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e-~~~~q 633 (688) .++.+ ..+. ..-+.+++...+....+-.+ .-..+.+.++.. ++++ +++. .++++. ..+.+ T Consensus 450 a~~~P----~~ld--~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r--------~q~~--~~~~-~~~~a~~~~~~~ 512 (536) T protein:vir:10 450 AALAP----MRDD--PDINLAMIKLRIANAIGIDTSGILLTEEQKQQKM--------AQQS--MQMG-MDNGAAALAQGM 512 (536) T ss_pred Hhhch----hhhc--ccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHH--------HHHH--HHHH-HHHHHHHHHHHH Confidence 43332 2221 12255677776665544311 111111000000 0000 0000 000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
634 ADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETV 669 (688) Q Consensus 634 ~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~ 669 (688) +.++..-.+....+. .....++.+ T Consensus 513 ~~~~~~~~~~~~~~~------------~~~g~~~~~ 536 (536) T protein:vir:10 513 AAQATASPEAMAAAA------------DSVGLQPGI 536 (536) T ss_pred HHHHhcCchhHHhhh------------hccccCCCC Confidence 000000000000000 000000100 No 29 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.91 E-value=2.5e-22 Score=138.99 Aligned_cols=520 Identities=12% Similarity=0.034 Sum_probs=271.0 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC--CHHHHHHHHhcCCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW--PESVRKEREDEGRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw--~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~ 85 (688) |+ +.+.....+.++.+|+...+..+.|...|++..+|..-.-. +..... .....+.-+.-...+++..+.+ T Consensus 1 m~---~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~----~~~~~~~dst~~~a~~~Laa~l 73 (536) T protein:vir:21 1 MA---EKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS----TDYQTPWQAVGARGLNNLASKL 73 (536) T ss_pred Cc---chhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc----ccccccccccHHHHHHHHHHHH Confidence 22 22323345677888888888888999999999998743211 111101 1112233344444455555444 Q ss_pred HhCC----cceEEEeCCccccccccccccccChhhHHH------HHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceE Q lcl|NC_020488. 86 RQNR----PAIQVHPVEANATKDTSKVPNVAGTSDYSL------AEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWL 155 (688) Q Consensus 86 ~~~r----~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~------Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~ 155 (688) ...- |=+++.+.+.... .....+.+. 
-+..++.+...+..|++..+...++.+.+..|.|++ T Consensus 74 ~~~ltP~~~WFrl~~~d~~~~--------~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 145 (536) T protein:vir:21 74 MLALFPMQTWMRLTISEYEAK--------QLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL 145 (536) T ss_pred HHhhcCCCcccccccChhhhh--------ccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeE Confidence 4432 2112222221100 000011111 233455666667789999999999999999999986 Q ss_pred EEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 156 RVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 156 ~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) .+ + +++..+...++.+ +..++++..++. . ...-++++..||..++.+.||+..... .... .. T Consensus 146 y~--~---e~~~~~~~~f~~~-pl~~~~v~~d~~---G-~vd~i~r~~~~t~~~l~~~fg~~~~~~---~~~~-----~~ 207 (536) T protein:vir:21 146 YL--P---EPEGSNYNPMKLY-RLSSYVVQRDAF---G-NVLQMVTRDQIAFGALPEDIRKAVEGQ---GGEK-----KA 207 (536) T ss_pred EE--e---eCCCCceeeEEEE-EcCeEEEeeCCC---C-CeeEEeeeeeccHHHHHHhhhhhhccc---cccc-----cc Confidence 54 2 2222233333344 456677665432 1 344588999999999999998743321 1111 12 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) .+.|.|+.+-++++.. +. 
+.+| ....|..++.....|+...+ T Consensus 208 ~~~v~v~~~v~~~~~~--------~~-----------------------------~~~~-~e~~g~~v~~~~g~~~f~~~ 249 (536) T protein:vir:21 208 DETIDVYTHIYLDEDS--------GE-----------------------------YLRY-EEVEGMEVQGSDGTYPKEAC 249 (536) T ss_pred ccceeEEEEEEEecCC--------Cc-----------------------------EEEE-eccCCeeeccccCccccccC Confidence 2455555544433211 11 1111 12334445444556778899 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA 395 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 395 (688) ||+++.+ ...+|..||.|.+....+-.+.+|++....+.......++.++++.+.+.+...+.. ..+|.++.-. T Consensus 250 P~i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~---~~~g~~v~g~- 323 (536) T protein:vir:21 250 PYIPIRM--VRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK---AQTGDFVTGR- 323 (536) T ss_pred Ceeeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhcc---CCCcceecCC- Confidence 9996554 467999999999999999999999999999999999999999999888877665432 2234443311 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_020488. 396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVG 474 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~ 474 (688) .+.-.+.......--+...+.++...+.|.+..=+ +.+.-.++...|++-|..+.+.....|...+.+|. .++..+. 
T Consensus 324 -~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli 401 (536) T protein:vir:21 324 -PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (536) T ss_pred -cccceeeeccccccchHHHHHHHHHHHHHHHHHhh-hhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH Confidence 12112222333334455677788888888776632 22222456678999999999999999999998885 4666676 Q ss_pred HHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH Q lcl|NC_020488. 475 QILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ 554 (688) Q Consensus 475 ~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q 554 (688) +..+.++... | .+-.+. .++ +.+.+.++. ....|.+.++.++.+++ T Consensus 402 ~r~~~il~r~-------------g--~lP~~p--------------~~~----v~~~~vs~l-~~l~r~~~~~~l~~~~~ 447 (536) T protein:vir:21 402 RVLLKQLQAT-------------Q--QIPELP--------------KEA----VEPTISTGL-EAIGRGQDLDKLERCVT 447 (536) T ss_pred HHHHHHHHhC-------------C--CCCCCC--------------hhh----ccceEEecH-HHHHHHHHHHHHHHHHH Confidence 6666666432 1 000000 011 122332332 24556667777777776 Q ss_pred hhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_020488. 555 AVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD-TAK 631 (688) Q Consensus 555 ~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e-~~~ 631 (688) .+.++.+ ..+. ..-+.+++...+....+-.+ .-..+.+.++.. +++++ +++ .++++. ..+ T Consensus 448 ~la~~~P----e~ld--~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r--------~q~~~--~~~-~~~~a~~~~~ 510 (536) T protein:vir:21 448 AWAALAP----MRDD--PDINLAMIKLRIANAIGIDTSGILLTEEQKQQKM--------AQQSM--QMG-MDNGAAALAQ 510 (536) T ss_pred HHHhhch----hhhc--ccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHH--------HHHHH--HHH-HHHHHHHHHH Confidence 5433332 2221 11255667766655544311 111110000000 00000 000 000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
632 AQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETV 669 (688) Q Consensus 632 ~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~ 669 (688) .++. ++... .+..++.......++.+ T Consensus 511 ~~~~-----------~~~~~-~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 511 GMAA-----------QATAS-PEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHH-----------HHhcC-hhhHHhhhhccccCCCC Confidence 0000 00000 00000000000001100 No 30 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.90 E-value=2.3e-21 Score=133.72 Aligned_cols=541 Identities=11% Similarity=0.024 Sum_probs=273.9 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC--CCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLA--GEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~--G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~ 85 (688) |. +...++++.+|+...+..+.|...|++..+|.. ..-+...+...-. +..+.+.-+.....++++.+.+ T Consensus 1 m~-------~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~-~~~~~~~dst~~~a~~~Las~l 72 (556) T protein:vir:73 1 MA-------ETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDD-RRNTKIVDPTGSMAQRILSSGM 72 (556) T ss_pred CC-------hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcch-hhcCccccchHHHHHHHHHHHH Confidence 11 123466888999999999999999999999962 2223322211111 1123344444455555555554 Q ss_pred Hh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEe Q lcl|NC_020488. 86 RQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTK 160 (688) Q Consensus 86 ~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~ 160 (688) .. +++=+++.+.+.+... ..+ -.+--+..++.+...+..|++..+...++.+.+..|.|++.+. 
T Consensus 73 ~~~ltpp~~~WF~l~~~d~~~~~--------~~~-v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~-- 141 (556) T protein:vir:73 73 MSGITSPARPWFKLATPDPDMMD--------YGP-VKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVM-- 141 (556) T ss_pred HHhhcCCCCcccccccCcccccc--------hHH-HHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeee-- Confidence 44 3444455544332100 001 1122234566666677789999999999999999999997553 Q ss_pred eccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccc-hhcccccccccccCCCCCEE Q lcl|NC_020488. 161 YSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAV-GDLSDAERGEYSWWTNEEGV 239 (688) Q Consensus 161 ~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~-~~~~~~~~~~~~~~~~~~~v 239 (688) .+. ++-+++..+ +..+++++.++.- ...-|+++..|+..++.++|+..+. ...... +.....+..+ T Consensus 142 ~~~----~~~~r~~~~-~l~~~~~~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~----~~~~~~~~~~ 208 (556) T protein:vir:73 142 EDD----QDVIRTMPF-PIGSYYLANSPRG----SVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGM----WENGTYETWV 208 (556) T ss_pred ecC----CceEEEEEe-ecceeEEeeCCCC----CeEEEEEEEeccHHHHHHHcCcccCCHHHHHH----HhcCCccceE Confidence 211 233566666 5678888876542 2233788899999999999986542 222211 1111122345 Q ss_pred EEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEE-chhhhcccCCCCCCCccceE Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKV-TAYDVLEGPVDWPGSTIPVA 318 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~~~~ile~~~p~~~~~~P~v 318 (688) .|+.+=|.+.. ...+ ....+......+||... .+.+++.. 
+.| .++||+ T Consensus 209 ~v~~~V~pr~~------~~~~---------------------~~~~~~~p~~s~~~~~~~~~~~vl~e-sg~--~e~P~~ 258 (556) T protein:vir:73 209 EVNHCITPNVN------RDSG---------------------KMDSKNKPYRSVYFESGGDSDKLLRE-SGF--DEFPIL 258 (556) T ss_pred EEEEEEecccc------cccc---------------------ccCcccceEEEEEEEecCCCceeccc-CCc--ccCCce Confidence 55433221100 0000 00111122233444432 23445533 233 678999 Q ss_pred EEeeeeeccCCcccccch-HHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 319 PVLGKEMVIGDKTYYRGL-IRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 319 p~~~~~~~~~~~~~g~g~-v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) ++.+ ...+|..||.|. +....+-.+.+|.+....+....+..+|+++++.+......+ ..+++.+....+ . T Consensus 259 ~~Rw--~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~-----~~pgg~~~~~~~-~ 330 (556) T protein:vir:73 259 APRW--EVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQRVS-----LLPGDVTYLDVI-S 330 (556) T ss_pred eeee--eecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceecccccccccee-----eccCccccccCC-C Confidence 6554 457999999994 999999999999999999999999999999998875432111 223333322222 1 Q ss_pred ccccceecC--CCcchHHHHHHHHHHHHHHHHHhCcCh-HHcCC-CcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020488. 398 GVDRPQRDM--PASMPAAELQLALSATDEMKATIGLYD-ASVGA-QGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRR 472 (688) Q Consensus 398 ~~~~~~~~~--~~~~~~~~~~ll~~~~~~~~~~tGv~d-~~~G~-~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~ 472 (688) +.+.+.++. .+. -..+.+.++...+.|....-++- .+++. ++...|++.|..+.+.....|.....+|. .+... 
T Consensus 331 ~~~~i~p~~~~~~d-~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~P 409 (556) T protein:vir:73 331 GQDGFKPAYLVNPN-TADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNP 409 (556) T ss_pred Cccceeeecccccc-HHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 223333332 122 34445566777777766553332 13343 44467999999999999999999998884 57777 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +....+.++.+.- .|.+.. ..+....++|.... +-...++......+.++ T Consensus 410 li~r~~~il~r~g------------------~lP~~P-----------~~l~~~~i~v~yis-~La~aqk~~~~~~i~~~ 459 (556) T protein:vir:73 410 LIDRVFSIMARKN------------------MLPEPP-----------DVLQGMPLRIEYIS-VMAQAQKSIGLTSLSQT 459 (556) T ss_pred HHHHHHHHHHhcC------------------CCCCCc-----------hhhcCceeEEEeec-HHHHHHHHHHHHHHHHH Confidence 7777777666531 011000 01111223333322 33344555555566666 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKA 632 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~ 632 (688) ++.+..+.++ .|.++.. -+.+++...+....+-.. ......++.++. .+++++++|++.+ + + T Consensus 460 ~~~~~~laq~-~Pe~~d~---id~d~~~~~~a~~~Gvp~-~~irs~eev~~~------rq~r~~~qq~~~~-~------~ 521 (556) T protein:vir:73 460 VGFIGQLAQF-KPEALDK---LDVDQAIDAFSEMSGVSP-TVIVPQEQVQGI------REERAKQAQAAQA-M------A 521 (556) T ss_pred HHHHHHHhcc-ChhhHhc---CCHHHHHHHHHHHcCCCh-hhcCCHHHHHHH------HHHHHHHHHHHHH-H------H Confidence 6655444442 3444443 344666666655443321 110000000000 0000000000000 0 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
633 QADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVA 674 (688) Q Consensus 633 q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~ 674 (688) +..++ ++..+.++..... ...+++..-..+.++++ T Consensus 522 ~~~~a------~~~~~~~~~~~~~-~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 522 MGQAA------AQGAKTLSETQTS-DPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHH------HHHHHHhhhccCC-CHHHHHHHHHhhcCCCC Confidence 00000 0000011110000 00000000000000000 No 31 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.90 E-value=1.7e-21 Score=134.41 Aligned_cols=537 Identities=12% Similarity=0.044 Sum_probs=273.0 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh---CCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL---AGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGD 84 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~---~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~ 84 (688) |.++. ..+.+..+|+...+..+.|...|++..+|. .|.=|..+. ..-..+ .+.+.-..-...+++..+. T Consensus 1 M~~~~------~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~-~~~~~~-~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQT------ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDR-NRGEKR-HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc------cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCC-Ccchhc-ccccccccHHHHHHHHHHH Confidence 33333 346788889999999999999999999997 343332211 110111 1223333444444444444 Q ss_pred HHh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 85 QRQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 85 ~~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +.. +++=+++.+.+.+.. ...+-...+ +..+..+...+..|++..+...++.+.+..|.|++.+. 
T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~--------e~~~v~~~L-~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~- 142 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELD--------ESAAVKAWL-ANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVL- 142 (555) T ss_pred HHHhhcCCCCcccccccCccccc--------chHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEe- Confidence 443 344445555433211 001112222 34455566667789999999999999999999996553 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccccCCCCCE Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYSWWTNEEG 238 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~ 238 (688) .+. ++.+++..+ +..++++..++. . ...-++++..||..++.++||..+.. ..... +.....+.. T Consensus 143 -~d~----~~~~rf~~~-pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~----~~~~~~~~~ 208 (555) T protein:vir:10 143 -PDF----DAVVYHHSL-TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQSL----FDRGALEQW 208 (555) T ss_pred -cCC----CceEEEEEe-ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHHH----HhcCCCCce Confidence 211 234556666 466788765443 2 23446788899999999999875532 21111 111222345 Q ss_pred EEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEE-chhhhcccCCCCCCCccce Q lcl|NC_020488. 239 VRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKV-TAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 239 v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~~~~ile~~~p~~~~~~P~ 317 (688) +.|+.+.|-...... ... ..+..+...++|... .+..++.. 
+.| ..+|| T Consensus 209 v~v~~~V~pr~~~~~--~~~-------------------------~~~~~p~~s~~~~~~~d~~~vl~e-sgy--~e~P~ 258 (555) T protein:vir:10 209 VTVIHAIEPRADRDP--SKR-------------------------DDRNMAWKSVYFEPGADETRTLRE-SGY--RSFRA 258 (555) T ss_pred EEEEEEEeeccCcCc--CCC-------------------------CccccceEEEEEEeccCCcccccc-CCc--ccCCc Confidence 777776653321100 000 001111122333322 23445533 223 57999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +|+.+ ...+|..||.|.+....+-.+.+|++....+..+....++++.++.+.....- ...+++...+ ..+. T Consensus 259 i~~Rw--~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~-----~~~pgg~~~v-~~g~ 330 (555) T protein:vir:10 259 LCPRW--ALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI-----STVPGGLSYV-DAAA 330 (555) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc-----eecccccccc-ccCC Confidence 96544 56799999999999999999999999999999999999999999887643221 1223333222 2222 Q ss_pred ccccceec-CCCcchHHHHHHHHHHHHHHHHHhCcCh--HHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020488. 398 GVDRPQRD-MPASMPAAELQLALSATDEMKATIGLYD--ASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRR 472 (688) Q Consensus 398 ~~~~~~~~-~~~~~~~~~~~ll~~~~~~~~~~tGv~d--~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~ 472 (688) +.+.+... .....-+...+.++...+.|.... ..+ .+++ .++...|++.|..+.+.....|...+.++. .+... 
T Consensus 331 ~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~P 409 (555) T protein:vir:10 331 PNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDP 409 (555) T ss_pred CCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 22222222 222223455666777777777654 444 2233 344468999999999999999999998884 56667 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +.+..+.++.+.- .+.+.. ..+....++|..-.. =...++......+.++ T Consensus 410 li~r~~~il~r~g------------------~lP~~P-----------~~l~~~~i~v~yis~-La~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 410 LIELTFQRMVEAN------------------ILPPPP-----------QEMQGVDLNVEFVSM-LAQAQRAIATNSVDRF 459 (555) T ss_pred HHHHHHHHHHhcC------------------CCCCCc-----------hhhcCceeEEEeccH-HHHHHHHHHHHHHHHH Confidence 6666666665531 000000 001001123332222 2334555555556666 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKA 632 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~ 632 (688) ++.+..+.++ .|.++.. -+.+++...+....+-... .....++ .+++.++++++++++.++ T Consensus 460 l~~i~~laq~-~P~vld~---id~d~~~~~~a~~~Gvp~~-~irs~ee-----------v~~~r~qr~~~~q~~~~a--- 520 (555) T protein:vir:10 460 VGNLGAVAGI-KPEVLDK---FDADRWADTYADMLGIDPE-LIVPGNQ-----------VALIRKQRADQQQAAQQA--- 520 (555) T ss_pred HHHHHHHhcC-Chhhhhc---CCHHHHHHHHHHHhCCCcc-ccCCHHH-----------HHHHHHHHHHHHHHHHHH--- Confidence 6555444332 3333333 3446666666554432211 0000000 000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
633 QADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 633 q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++. .+.....+.+.... +.-...+....+|.+ T Consensus 521 ~~~-----~q~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~ 552 (555) T protein:vir:10 521 ALL-----NQGADTAAKLGSVD-------------------TSKQNALTDVTRAFS 552 (555) T ss_pred HHH-----HHHHHHHHHhcccc-------------------cCcchhHHHHHhhhc Confidence 000 00000000000000 000011111111112 No 32 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.90 E-value=1.7e-21 Score=134.41 Aligned_cols=537 Identities=12% Similarity=0.044 Sum_probs=273.0 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh---CCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL---AGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGD 84 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~---~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~ 84 (688) |.++. ..+.+..+|+...+..+.|...|++..+|. .|.=|..+. ..-..+ .+.+.-..-...+++..+. T Consensus 1 M~~~~------~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~-~~~~~~-~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQT------ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDR-NRGEKR-HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc------cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCC-Ccchhc-ccccccccHHHHHHHHHHH Confidence 33333 346788889999999999999999999997 343332211 110111 1223333444444444444 Q ss_pred HHh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 85 QRQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 85 ~~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +.. +++=+++.+.+.+.. ...+-...+ +..+..+...+..|++..+...++.+.+..|.|++.+. 
T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~--------e~~~v~~~L-~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~- 142 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELD--------ESAAVKAWL-ANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVL- 142 (555) T ss_pred HHHhhcCCCCcccccccCccccc--------chHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEe- Confidence 443 344445555433211 001112222 34455566667789999999999999999999996553 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccccCCCCCE Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYSWWTNEEG 238 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~ 238 (688) .+. ++.+++..+ +..++++..++. . ...-++++..||..++.++||..+.. ..... +.....+.. T Consensus 143 -~d~----~~~~rf~~~-pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~----~~~~~~~~~ 208 (555) T protein:vir:10 143 -PDF----DAVVYHHSL-TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQSL----FDRGALEQW 208 (555) T ss_pred -cCC----CceEEEEEe-ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHHH----HhcCCCCce Confidence 211 234556666 466788765443 2 23446788899999999999875532 21111 111222345 Q ss_pred EEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEE-chhhhcccCCCCCCCccce Q lcl|NC_020488. 239 VRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKV-TAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 239 v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~~~~ile~~~p~~~~~~P~ 317 (688) +.|+.+.|-...... ... ..+..+...++|... .+..++.. 
+.| ..+|| T Consensus 209 v~v~~~V~pr~~~~~--~~~-------------------------~~~~~p~~s~~~~~~~d~~~vl~e-sgy--~e~P~ 258 (555) T protein:vir:10 209 VTVIHAIEPRADRDP--SKR-------------------------DDRNMAWKSVYFEPGADETRTLRE-SGY--RSFRA 258 (555) T ss_pred EEEEEEEeeccCcCc--CCC-------------------------CccccceEEEEEEeccCCcccccc-CCc--ccCCc Confidence 777776653321100 000 001111122333322 23445533 223 57999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +|+.+ ...+|..||.|.+....+-.+.+|++....+..+....++++.++.+.....- ...+++...+ ..+. T Consensus 259 i~~Rw--~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~-----~~~pgg~~~v-~~g~ 330 (555) T protein:vir:10 259 LCPRW--ALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI-----STVPGGLSYV-DAAA 330 (555) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc-----eecccccccc-ccCC Confidence 96544 56799999999999999999999999999999999999999999887643221 1223333222 2222 Q ss_pred ccccceec-CCCcchHHHHHHHHHHHHHHHHHhCcCh--HHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020488. 398 GVDRPQRD-MPASMPAAELQLALSATDEMKATIGLYD--ASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRR 472 (688) Q Consensus 398 ~~~~~~~~-~~~~~~~~~~~ll~~~~~~~~~~tGv~d--~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~ 472 (688) +.+.+... .....-+...+.++...+.|.... ..+ .+++ .++...|++.|..+.+.....|...+.++. .+... 
T Consensus 331 ~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~P 409 (555) T protein:vir:10 331 PNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDP 409 (555) T ss_pred CCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 22222222 222223455666777777777654 444 2233 344468999999999999999999998884 56667 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +.+..+.++.+.- .+.+.. ..+....++|..-.. =...++......+.++ T Consensus 410 li~r~~~il~r~g------------------~lP~~P-----------~~l~~~~i~v~yis~-La~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 410 LIELTFQRMVEAN------------------ILPPPP-----------QEMQGVDLNVEFVSM-LAQAQRAIATNSVDRF 459 (555) T ss_pred HHHHHHHHHHhcC------------------CCCCCc-----------hhhcCceeEEEeccH-HHHHHHHHHHHHHHHH Confidence 6666666665531 000000 001001123332222 2334555555556666 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKA 632 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~ 632 (688) ++.+..+.++ .|.++.. -+.+++...+....+-... .....++ .+++.++++++++++.++ T Consensus 460 l~~i~~laq~-~P~vld~---id~d~~~~~~a~~~Gvp~~-~irs~ee-----------v~~~r~qr~~~~q~~~~a--- 520 (555) T protein:vir:10 460 VGNLGAVAGI-KPEVLDK---FDADRWADTYADMLGIDPE-LIVPGNQ-----------VALIRKQRADQQQAAQQA--- 520 (555) T ss_pred HHHHHHHhcC-Chhhhhc---CCHHHHHHHHHHHhCCCcc-ccCCHHH-----------HHHHHHHHHHHHHHHHHH--- Confidence 6555444332 3333333 3446666666554432211 0000000 000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
633 QADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 633 q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++. .+.....+.+.... +.-...+....+|.+ T Consensus 521 ~~~-----~q~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~ 552 (555) T protein:vir:10 521 ALL-----NQGADTAAKLGSVD-------------------TSKQNALTDVTRAFS 552 (555) T ss_pred HHH-----HHHHHHHHHhcccc-------------------cCcchhHHHHHhhhc Confidence 000 00000000000000 000011111111112 No 33 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.90 E-value=1.7e-21 Score=134.41 Aligned_cols=537 Identities=12% Similarity=0.044 Sum_probs=273.0 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh---CCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL---AGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGD 84 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~---~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~ 84 (688) |.++. ..+.+..+|+...+..+.|...|++..+|. .|.=|..+. ..-..+ .+.+.-..-...+++..+. T Consensus 1 M~~~~------~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~-~~~~~~-~~~~~dst~~~a~~~LAa~ 72 (555) T protein:vir:98 1 MAEQT------ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDR-NRGEKR-HNNILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcc------cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCC-Ccchhc-ccccccccHHHHHHHHHHH Confidence 33333 346788889999999999999999999997 343332211 110111 1223333444444444444 Q ss_pred HHh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 85 QRQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 85 ~~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) +.. +++=+++.+.+.+.. ...+-...+ +..+..+...+..|++..+...++.+.+..|.|++.+. 
T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~--------e~~~v~~~L-~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~- 142 (555) T protein:vir:98 73 MMAGMTSPARPWFRLTTSIPELD--------ESAAVKAWL-ANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVL- 142 (555) T ss_pred HHHhhcCCCCcccccccCccccc--------chHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEe- Confidence 443 344445555433211 001112222 34455566667789999999999999999999996553 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccccCCCCCE Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYSWWTNEEG 238 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~ 238 (688) .+. ++.+++..+ +..++++..++. . ...-++++..||..++.++||..+.. ..... +.....+.. T Consensus 143 -~d~----~~~~rf~~~-pl~~~~v~~d~~---G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~----~~~~~~~~~ 208 (555) T protein:vir:98 143 -PDF----DAVVYHHSL-TAGEYAIAADNQ---G-RVNTLYREFQITVAQMVREFGKDKCSTTVQSL----FDRGALEQW 208 (555) T ss_pred -cCC----CceEEEEEe-ecceeEEeeCCC---C-CEEEEEEEEeccHHHHHHhcCcccCCHHHHHH----HhcCCCCce Confidence 211 234556666 466788765443 2 23446788899999999999875532 21111 111222345 Q ss_pred EEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEE-chhhhcccCCCCCCCccce Q lcl|NC_020488. 239 VRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKV-TAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 239 v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~~~~ile~~~p~~~~~~P~ 317 (688) +.|+.+.|-...... ... ..+..+...++|... .+..++.. 
+.| ..+|| T Consensus 209 v~v~~~V~pr~~~~~--~~~-------------------------~~~~~p~~s~~~~~~~d~~~vl~e-sgy--~e~P~ 258 (555) T protein:vir:98 209 VTVIHAIEPRADRDP--SKR-------------------------DDRNMAWKSVYFEPGADETRTLRE-SGY--RSFRA 258 (555) T ss_pred EEEEEEEeeccCcCc--CCC-------------------------CccccceEEEEEEeccCCcccccc-CCc--ccCCc Confidence 777776653321100 000 001111122333322 23445533 223 57999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +|+.+ ...+|..||.|.+....+-.+.+|++....+..+....++++.++.+.....- ...+++...+ ..+. T Consensus 259 i~~Rw--~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~-----~~~pgg~~~v-~~g~ 330 (555) T protein:vir:98 259 LCPRW--ALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI-----STVPGGLSYV-DAAA 330 (555) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc-----eecccccccc-ccCC Confidence 96544 56799999999999999999999999999999999999999999887643221 1223333222 2222 Q ss_pred ccccceec-CCCcchHHHHHHHHHHHHHHHHHhCcCh--HHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020488. 398 GVDRPQRD-MPASMPAAELQLALSATDEMKATIGLYD--ASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRR 472 (688) Q Consensus 398 ~~~~~~~~-~~~~~~~~~~~ll~~~~~~~~~~tGv~d--~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~ 472 (688) +.+.+... .....-+...+.++...+.|.... ..+ .+++ .++...|++.|..+.+.....|...+.++. .+... 
T Consensus 331 ~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~P 409 (555) T protein:vir:98 331 PNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDP 409 (555) T ss_pred CCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 22222222 222223455666777777777654 444 2233 344468999999999999999999998884 56667 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) +.+..+.++.+.- .+.+.. ..+....++|..-.. =...++......+.++ T Consensus 410 li~r~~~il~r~g------------------~lP~~P-----------~~l~~~~i~v~yis~-La~aq~~~~~~~i~~~ 459 (555) T protein:vir:98 410 LIELTFQRMVEAN------------------ILPPPP-----------QEMQGVDLNVEFVSM-LAQAQRAIATNSVDRF 459 (555) T ss_pred HHHHHHHHHHhcC------------------CCCCCc-----------hhhcCceeEEEeccH-HHHHHHHHHHHHHHHH Confidence 6666666665531 000000 001001123332222 2334555555556666 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKA 632 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~ 632 (688) ++.+..+.++ .|.++.. -+.+++...+....+-... .....++ .+++.++++++++++.++ T Consensus 460 l~~i~~laq~-~P~vld~---id~d~~~~~~a~~~Gvp~~-~irs~ee-----------v~~~r~qr~~~~q~~~~a--- 520 (555) T protein:vir:98 460 VGNLGAVAGI-KPEVLDK---FDADRWADTYADMLGIDPE-LIVPGNQ-----------VALIRKQRADQQQAAQQA--- 520 (555) T ss_pred HHHHHHHhcC-Chhhhhc---CCHHHHHHHHHHHhCCCcc-ccCCHHH-----------HHHHHHHHHHHHHHHHHH--- Confidence 6555444332 3333333 3446666666554432211 0000000 000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
633 QADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 633 q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ++. .+.....+.+.... +.-...+....+|.+ T Consensus 521 ~~~-----~q~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~ 552 (555) T protein:vir:98 521 ALL-----NQGADTAAKLGSVD-------------------TSKQNALTDVTRAFS 552 (555) T ss_pred HHH-----HHHHHHHHHhcccc-------------------cCcchhHHHHHhhhc Confidence 000 00000000000000 000011111111112 No 34 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.89 E-value=1.1e-22 Score=140.89 Aligned_cols=467 Identities=9% Similarity=-0.004 Sum_probs=241.4 Q ss_pred CCCCC---CCcCCCCc---cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeeh Q lcl|NC_020488. 1 MLPGN---EPIKTRDD---DSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLN 72 (688) Q Consensus 1 ~~~~~---~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N 72 (688) -.++. +......+ ....+++.++...+. ..-.....+..+||.|+++.-.........++| .++.| T Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~------~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n 92 (501) T protein:vir:96 19 RFHRESRIRYRADNLEELMVNNWELLKNFINHHK------LRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHN 92 (501) T ss_pred ccchhHHhhhcccccccccCChHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecc Confidence 00000 01111111 111222333222221 122335566789999988743222222233444 58899 Q ss_pred hHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCC Q lcl|NC_020488. 73 KLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGF 152 (688) Q Consensus 73 ~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~ 152 (688) ..+-+|+..+|+...+++.+.+.. ....+.+...+..++..|+++.....+..++++.|. 
T Consensus 93 ~~k~Ivd~~~~yl~g~p~~~~~~~--------------------~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ 152 (501) T protein:vir:96 93 YGRMISKFKTGYLAGNPIRVEYDD--------------------NDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGR 152 (501) T ss_pred hHHHHHHHHhhhhcccCeeEeeCC--------------------ccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCe Confidence 999999999999999988776531 112345666777788899999999999999999999 Q ss_pred ceEEEEEeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccc Q lcl|NC_020488. 153 GWLRVLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEY 230 (688) Q Consensus 153 G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 230 (688) ||..++.+. ++.+.+..+ +|..+ +||+.... +.. ++.+.|... T Consensus 153 a~~~v~~de------dg~~~i~~~-~p~~~~~v~d~~~~~----~~~-~~v~~~~~~----------------------- 197 (501) T protein:vir:96 153 AYEVIYRSE------YDETRIKRL-SPLETFVIYDNSLED----NSI-AAVRYYNRG----------------------- 197 (501) T ss_pred EEEEEEEcC------CCceEEEEE-ccceeEEEEcCCCCC----ceE-EEEEEEEee----------------------- Confidence 998886542 356777666 67765 46653211 111 122222100 Q ss_pred ccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCC Q lcl|NC_020488. 231 SWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDW 310 (688) Q Consensus 231 ~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~ 310 (688) .....+..+++|..... +.+...+...+.+..|. T Consensus 198 ---~~~~~~~~~~vyt~~~i-------------------------------------------~~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:96 198 ---TLQSAKDVVEIYTDEHI-------------------------------------------YTLDASDDFNEISVTTH 231 (501) T ss_pred ---cCCCcEEEEEEEcCCcE-------------------------------------------EEEeeCCCceecccccc Confidence 00112334455533211 11111111122223344 Q ss_pred CCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCce Q lcl|NC_020488. 
311 PGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSV 390 (688) Q Consensus 311 ~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 390 (688) +.+.+|+|+|. ..+.|.|.+..++++++.+|..+|.+...+....++.+++.........+..... ...+.+ T Consensus 232 ~~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~-~~~~~~ 303 (501) T protein:vir:96 232 AFGTVPITEYL-------NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDM-KRTRLM 303 (501) T ss_pred CCCccceEEec-------CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhh-hhcCee Confidence 45677777642 2456889999999999999999999999998888877766443333222221211 112222 Q ss_pred eecCc-----ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 391 LRYNA-----IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDN 465 (688) Q Consensus 391 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn 465 (688) ..... ......+.++..+.-...+...++.+...|-.+|++.+.+.|..+++.||.|+..+-...........+. T Consensus 304 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~ 383 (501) T protein:vir:96 304 QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQ 383 (501) T ss_pred eecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 22111 1112234455444445667788899999999999999988887667789999988877666667777777 Q ss_pred HHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHH Q lcl|NC_020488. 466 LSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEA 545 (688) Q Consensus 466 ~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~ 545 (688) |..++++++++++.++.... +....++ .+|.|.=.+..+....+. 
T Consensus 384 ~~~~l~~~~~li~~~~~~~~----------~~~~~d~-------------------------~~i~i~f~~~~p~n~~e~ 428 (501) T protein:vir:96 384 FTKGLKRRYRLAARIGSLVN----------EFKDFDE-------------------------SLLKITFTPNLPKSLNEQ 428 (501) T ss_pred HHHHHHHHHHHHHHHHHhcc----------ccccccc-------------------------ccceEEeCCCCCcCHHHH Confidence 77788777777766553321 1000000 122233334444445555 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_020488. 546 ADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEK 624 (688) Q Consensus 546 ~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~ 624 (688) .+.++.+... +....+++++++ ...+.-.+++.+.................... ...+... T Consensus 429 ad~~~kl~g~------iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~------------~~~~~~e 490 (501) T protein:vir:96 429 VSILTGLGGQ------VSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGK------------YTDEVKE 490 (501) T ss_pred HHHHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccc------------cCCcCCC Confidence 5555555432 223445555543 23333334443221110000000000000000 0000000 Q ss_pred HHHHHHHHHHH Q lcl|NC_020488. 625 AKADTAKAQAD 635 (688) Q Consensus 625 ~q~e~~~~q~e 635 (688) ..++..+.-.+ T Consensus 491 ~~~d~~e~~~~ 501 (501) T protein:vir:96 491 THTDDFEREYE 501 (501) T ss_pred CCCCccccccC Confidence 00000000000 No 35 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.89 E-value=2e-21 Score=134.07 Aligned_cols=528 Identities=14% Similarity=0.115 Sum_probs=255.7 Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh-----CCc Q lcl|NC_020488. 
16 QEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ-----NRP 90 (688) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~-----~r~ 90 (688) ++ ..++.+|+...+..+.|...|++..+|..-.-...+. .. .....+.+.-+.-...+++..+.+.. +++ T Consensus 1 m~---~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~-~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 75 (555) T protein:vir:17 1 MK---HSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEG-HV-QGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTS 75 (555) T ss_pred Ch---hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCC-Cc-ccccccccccccHHHHHHHHHHHHHHhhcCCCCc Confidence 23 3355677777777788999999999987432111100 00 01111223334444445555444444 333 Q ss_pred ceEEEeCCccccccccccccccChhh-HHHHH---HHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCC Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSD-YSLAE---VYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDA 166 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d-~~~Ae---~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~ 166 (688) =+++.+.+...... ..+++. .++.+ ..+..+...+..|++..+...++.+.+..|+|++.+ .+++ T Consensus 76 WF~l~~~d~~~~~~------~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~-----~~~~ 144 (555) T protein:vir:17 76 FFKLQINDAEIDNL------GMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQ-----GKKN 144 (555) T ss_pred ccccccCHHHHhhc------cCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEe-----cCCc Confidence 34444443221000 000001 11222 244556666778999999999999999999998543 2222 Q ss_pred CCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchh-cccccccccc------------cC Q lcl|NC_020488. 167 FDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGD-LSDAERGEYS------------WW 233 (688) Q Consensus 167 ~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~-~~~~~~~~~~------------~~ 233 (688) + +.+ +..++++..++. -...-++++..||..++.+.|++....+ .........+ .. 
T Consensus 145 ~------~~~-pl~~y~v~~d~~----G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~ 213 (555) T protein:vir:17 145 L------KLY-PLDRFVVSRDGE----GNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDK 213 (555) T ss_pred e------eEE-EcCeEEEeeCCC----cCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhccccc Confidence 2 222 334566655432 1345588999999999999997643211 1110000000 00 Q ss_pred CCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEc-hhhhcccCCCCCC Q lcl|NC_020488. 234 TNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVT-AYDVLEGPVDWPG 312 (688) Q Consensus 234 ~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~-~~~ile~~~p~~~ 312 (688) .....+.+..++.+. ++ +++|+... +..+-...+-+++ T Consensus 214 ~~~~~~~v~t~~~~~----------~~-------------------------------~~~~~~e~~~~~v~~~l~e~g~ 252 (555) T protein:vir:17 214 GKSNDALVYTYVCRK----------DG-------------------------------QVKWHQECDGKVIPGSNSSAPY 252 (555) T ss_pred CCCcceeEeeccccc----------CC-------------------------------eeEEEEecCceeccccccccCc Confidence 011111111111111 01 12222222 2222111123445 Q ss_pred CccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceee Q lcl|NC_020488. 313 STIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLR 392 (688) Q Consensus 313 ~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 392 (688) ..+||+++.+ ..++|..||.|.+....+-.+.+|.+....+..+....+++++++.+.+.+..++.. ..++.++. T Consensus 253 ~e~P~i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~---~~~g~v~~ 327 (555) T protein:vir:17 253 THNPWIPLRF--NIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLAL---AANGAIIQ 327 (555) T ss_pred ccCCeeeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcceeec---CCCceeec Confidence 7899996544 567999999999999999999999999999999999999999998888776654432 12233332 Q ss_pred cCcccccccceecC--CCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_020488. 
393 YNAIPGVDRPQRDM--PASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RA 469 (688) Q Consensus 393 ~~~~~~~~~~~~~~--~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~ 469 (688) +.. +.+..++ .+.--+.....++...+.|.+..-+. .-.++...|++.|..+.+.....|...+.+|. .+ T Consensus 328 --g~~--~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~---~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~ 400 (555) T protein:vir:17 328 --GRP--DDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML---QVRQSERTTATEVQATVQELNEQIGGIYSNLTTEL 400 (555) T ss_pred --CCc--ccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc---CCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 111 2233332 22223445666677777776654321 12345568999999999999999999999995 67 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) +..+.+..+.++.+.-- +.+.. .++ ..+.+.++ -.+..|++..+.+ T Consensus 401 L~Pli~R~~~il~r~g~------------------lP~~p-----------~~~----v~~~i~~~-l~~l~r~~~~~~l 446 (555) T protein:vir:17 401 LQPYLARKLHLLQKQRK------------------LPQLP-----------KDL----VQPTVVAG-LWGVGRGQDKQQL 446 (555) T ss_pred HHHHHHHHHHHHHhCCC------------------CCCCC-----------Hhh----hccceeeh-HHHHHHHHHHHHH Confidence 77788877777765410 00000 011 11222222 2344567777777 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADT 629 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~ 629 (688) +++++.+.++.. 
.+.++ +.-+.+++.+.+....+-.+.......++..+..+ ++.+++ +|+++..+.+.+ T Consensus 447 ~~~~~~laq~~~--~p~~~---d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq----~~~~~~-~q~~~~~qa~~~ 516 (555) T protein:vir:17 447 MEFITTLAQTMG--PEIAM---KYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGD----QQKQDM-VQASLINQAGQL 516 (555) T ss_pred HHHHHHHHhhcC--chhHh---hcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHH----HHHHHH-HHHHHHHHHHHH Confidence 777765543322 12333 33444666666655443211111000000000000 000000 000000000000 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 630 AK-AQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 630 ~~-~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) +. ...+++..+. .+.....++.. .+.. +.+-.+ +++.| T Consensus 517 ~~~~~~~~~~~~~----~~~~~~a~~~~------~a~~-------~~~~~~----~~~~~ 555 (555) T protein:vir:17 517 AKTPMAEQAMQLI----QQQQEGAQDAG------AAES-------ETSSAE----AQAGA 555 (555) T ss_pred HhhhhhhhHHhcc----ccchhhhhHHH------HHHh-------hcCCcc----cccCC Confidence 00 0000000000 00000000000 0000 000000 01111 No 36 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.89 E-value=1e-20 Score=130.13 Aligned_cols=536 Identities=9% Similarity=0.001 Sum_probs=269.9 Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC---CCCCCHHHHHHHHh-cCCCceeehhHHHHHHHHHHHHHh-----C Q lcl|NC_020488. 18 AILQEIRERAAHAVTCWKHNFDAAQEDISFLA---GEQWPESVRKERED-EGRPCLTLNKLPQYVDQVLGDQRQ-----N 88 (688) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~---G~Qw~~~~~~~~~~-~g~p~~~~N~i~~~i~~i~g~~~~-----~ 88 (688) =-.+++..+|+...+..+.|...|++..+|.. +.-+.+........ +....+.-+.-...++++.+.+.. 
+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 12366778888889999999999999998873 32222211000000 001122333444445554444444 3 Q ss_pred CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCC Q lcl|NC_020488. 89 RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFD 168 (688) Q Consensus 89 r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~ 168 (688) ++=++..+.+... ....+-...+ +..+..+...+..+++..+...++.+.+..|.|++.+.-+ +... T Consensus 81 ~~WF~l~~~d~~~--------~~~~~v~~~L-~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d----~~~~ 147 (547) T protein:vir:10 81 TKWFELAFRDKEL--------NSDDECRKWL-ENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEED----EDEE 147 (547) T ss_pred CcccccccCCccc--------cchHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccC----CCCC Confidence 3444444433221 1111222223 3345566666778999999999999999999998776532 1123 Q ss_pred cceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccch-hcccccccccccCCCCCEEEEEEEEee Q lcl|NC_020488. 169 LDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG-DLSDAERGEYSWWTNEEGVRVSEYFYR 247 (688) Q Consensus 169 ~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~v~v~e~~~~ 247 (688) +.+.+..+ +..++++..++.- ...-|+++..|+..++.++||..+.. .+...- +.+.......+.++.+.+. 
T Consensus 148 ~~~r~~~~-pl~~~~v~~d~~G----~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~--~~~~~~~~~~~~v~~~v~~ 220 (547) T protein:vir:10 148 GSVVFQSS-PIQDSYFEEDSRG----QVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKA--KEASNQAALKQEVVMCVFT 220 (547) T ss_pred CceeEEEe-ecceEEEeeCCCc----CeeeeeeeeeccHHHHHHhcCcccCCHHHHHHH--hcCCCcccceEEEEEEEee Confidence 45666666 5677888766532 22336788999999999999876532 111111 0111111224555555444 Q ss_pred eecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeecc Q lcl|NC_020488. 248 EPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVI 327 (688) Q Consensus 248 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~ 327 (688) +..... ...+++.+. ........+|+-..++..++... -| .++||+++.+ ... T Consensus 221 ~~~~~~--~~~~~~~~~--------------------~~~~p~~s~~~e~~~~~~~l~es-g~--~e~P~~~~Rw--~~~ 273 (547) T protein:vir:10 221 RYDKKQ--NRNAGTVLA--------------------PTERPFGKKWILKEGAVQLGEEG-GY--YEMPAYAIRW--RKS 273 (547) T ss_pred ccCCCC--Cccccceee--------------------ccccceeEEEEEecCceeeeecC-Cc--ccCCeeeeee--eec Confidence 321100 000000000 00111123444444445555432 23 5789996544 467 Q ss_pred CCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCC Q lcl|NC_020488. 328 GDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMP 407 (688) Q Consensus 328 ~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (688) +|..||.|.+....+-.+.+|.+....+..+.+..+++++++.+.+.+.. +..+|+++..++. +.++++.. 
T Consensus 274 ~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~------~~~pgg~~~~~~~---~~v~pl~~ 344 (547) T protein:vir:10 274 AGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISDI------DLGASGLTVVRDM---ESMKPFES 344 (547) T ss_pred CCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccccc------eecCCeeeecCCc---ccceeeec Confidence 99999999999999999999999999999999999999999877665431 2234444444332 33443433 Q ss_pred CcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcC Q lcl|NC_020488. 408 ASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYD 486 (688) Q Consensus 408 ~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~ 486 (688) ..--......++...+.|....= .+.++=.++...|++-|..+.+.....|......|. .+...+....+.++.+.- T Consensus 345 ~~~~~~~~~~i~~~~~rI~~af~-~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g- 422 (547) T protein:vir:10 345 RARFDVSSIQLTDLRSAVRRIYY-VDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAG- 422 (547) T ss_pred ccchHHHHHHHHHHHHHHHHHhh-hhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 33333445666777777766431 222221244568999999999999999999998885 577777777776665431 Q ss_pred cceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_020488. 487 SDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDL 566 (688) Q Consensus 487 ~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~ 566 (688) . +-.+.+....+ ....++|+.-. +-...++......+.++++.+.++.++ .|. 
T Consensus 423 ---------~-----lP~~p~~l~~~-----------~~~~~~v~~is-~Laraq~~~~~~~i~~~~~~v~~laq~-~P~ 475 (547) T protein:vir:10 423 ---------K-----LGELPSKLLES-----------GKAAMDIVYTG-PLSRAQKIDQAASIERWAGSTAQLAEI-NPE 475 (547) T ss_pred ---------C-----CCCCchhhhcc-----------CcceEEEEecc-HHHHHHHHHHHHHHHHHHHHHHHhhcc-Chh Confidence 0 00000000000 01123333222 222233444455555555554444332 233 Q ss_pred HHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 567 IAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEA 646 (688) Q Consensus 567 ~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~ 646 (688) ++.. -+.+++...+....+-...-....++.++. .++. ++++++++|+.+..+..+ .+.... .. T Consensus 476 vld~---id~d~~~~~~a~~~Gvp~~~irs~eev~~~-r~qr--~~~~q~~~qaa~~~~~g~-------~m~~~~---~~ 539 (547) T protein:vir:10 476 VLDI---PDWDEMVRMLGSLLGAPQTLMRPKAKVTSI-RKNR--SQTQQKAEQAAIAEAEGN-------AMEAQG---KG 539 (547) T ss_pred hhhc---CCHHHHHHHHHHHhCCChhccCCHHHHHHH-HHHH--HHHHHHHHHHHHHHHHHH-------HHHhhc---Cc Confidence 3333 344666666655443221100000000000 0000 000000001110000000 000000 00 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_020488. 647 QAKLAEIEQAAMMAGPG 663 (688) Q Consensus 647 ~a~~~~~~~~a~~~~~~ 663 (688) .+.+ ++.+ T Consensus 540 ~a~~---------~~~~ 547 (547) T protein:vir:10 540 QAAL---------KENQ 547 (547) T ss_pred ccch---------hccC Confidence 0000 0000 No 37 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.88 E-value=8.5e-20 Score=125.13 Aligned_cols=514 Identities=11% Similarity=0.014 Sum_probs=264.7 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC--CHHHHHHHHhcCCCceeehhHHHHHHHHHHHH Q lcl|NC_020488. 
8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW--PESVRKEREDEGRPCLTLNKLPQYVDQVLGDQ 85 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw--~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~ 85 (688) |+.... .-.+.++.+|+...+.++.|...|++..+|..-.-. +..... ..+..+.-+.-...+++..+.+ T Consensus 1 ~~~~~~----~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~----~~~~~~~dst~~~a~~~Las~l 72 (522) T protein:vir:94 1 MAEREG----FAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSS----TEYTTPWQAVGARCLNNLAAKL 72 (522) T ss_pred Ccccch----hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc----ccccccccccHHHHHHHHHHHH Confidence 333222 224557778888888888999999999998743211 111111 1111233344444555555554 Q ss_pred HhCC----cceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 86 RQNR----PAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 86 ~~~r----~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) ...- |=++..+.+....... .......+-...+ +..+..+...+..|++..+...++.+.+..|.|++.+. . T Consensus 73 ~~~ltP~~~WFrl~~~d~~~~~~~-~~~~~~~~v~~~L-~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~--~ 148 (522) T protein:vir:94 73 MLALFPQSPWMRLTVSEYEAKTLS-QDSEAAARVDEGL-AMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIP--E 148 (522) T ss_pred HhhcCCCCcccccccchhhhhccC-cccchhHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeee--c Confidence 4432 2112222211100000 0000000111122 34555555666789999999999999999999986543 1 Q ss_pred ccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEE Q lcl|NC_020488. 162 STDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRV 241 (688) Q Consensus 162 ~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 241 (688) +.-+....+..+ +..++++..++. . ...-++++..++.+.+-+.++.... 
.+.+ .-.+.|.| T Consensus 149 ---~~~~~~~~~~~~-pl~~y~v~~d~~---G-~vd~i~r~~~~~~~~l~~~~~~~~~-------~~~~---~p~~~v~v 210 (522) T protein:vir:94 149 ---PEQGTYSPMRMY-RLVSYVVQRDAF---G-NILQIVTIDKVAFSALPEDVKSQLN-------ADDY---EPDTELEV 210 (522) T ss_pred ---cCCCceeeEEEE-EcceEEEeeCCC---c-CeEEEeeeeeccHHhcchHHHHHHh-------cccC---CccceEEE Confidence 111122233333 345666654432 1 2334677788888776555433221 0111 11356777 Q ss_pred EEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEe Q lcl|NC_020488. 242 SEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVL 321 (688) Q Consensus 242 ~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~ 321 (688) +++.+++... .. ++....|..+....+-|+...+||+++. T Consensus 211 ~~~v~~~~~~----------~~------------------------------~~~~~~g~~~~~~~~~~~~~e~P~~~~R 250 (522) T protein:vir:94 211 YTHIYRQDDE----------YL------------------------------RYEEVEGIEVTGTDGSYPLTACPYIPVR 250 (522) T ss_pred EEEEEeeCCc----------ee------------------------------EEeeccCceecccCCCCccccCCceeee Confidence 7776654221 10 1111122222222344667889999654 Q ss_pred eeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccccccc Q lcl|NC_020488. 322 GKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDR 401 (688) Q Consensus 322 ~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 401 (688) + ..++|..||.|.+....+-.+.+|.+....+.......+|+++++++.+.+...+.. ..+|.++.. ..+.-. 
T Consensus 251 w--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~---~~~g~~v~g--~~~~v~ 323 (522) T protein:vir:94 251 M--VRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNK---AATGEFVAG--RVEDIN 323 (522) T ss_pred e--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheec---cCCceeecC--Ccccce Confidence 4 567999999999999999999999999999999999999999999888877765422 223344331 111112 Q ss_pred ceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_020488. 402 PQRDMPASMPAAELQLALSATDEMKATIGLYDASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIE 479 (688) Q Consensus 402 ~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~ 479 (688) +.......--....+.++...+.|.+..-+. +++ .++...|++-|..+.+.....+...+.+|. .++..+.+..+. T Consensus 324 ~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~ 401 (522) T protein:vir:94 324 FLQLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMN 401 (522) T ss_pred eeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 2222223334556777888888888876443 344 345568999999999999999999998885 577777777777 Q ss_pred HHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHH Q lcl|NC_020488. 480 LIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAA 559 (688) Q Consensus 480 li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~ 559 (688) ++.+.-- +.+. 
..+ .+.+.+.+ +-....|.+..+.+.++++.+.++ T Consensus 402 il~r~g~------------------lP~~-----------p~~----~v~v~~~s-~La~~qr~~~~~~l~~~~~~ia~l 447 (522) T protein:vir:94 402 QLQSAGM------------------IPDL-----------PKE----AVEPTVST-GLEALGRGQDLEKLTQAVNMMTGL 447 (522) T ss_pred HHHhcCC------------------CCCC-----------Ccc----cEEeeEec-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 6654310 0000 001 12333333 233456677777777777765444 Q ss_pred HHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 560 GGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMA 639 (688) Q Consensus 560 ~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~ 639 (688) .+..+ ++.-+.+++...+....+-.........++.++..+ ++++ ++..++++..+ T Consensus 448 ~P~~~------~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~----q~~~-----~~~~~~~~~~~--------- 503 (522) T protein:vir:94 448 QPLSQ------DPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMA----EQSS-----QQAVVQGASAA--------- 503 (522) T ss_pred cchhh------hhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHH----HHHH-----HHHHHHHHHHH--------- Confidence 33221 112245777777666554311111100000000000 0000 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 640 QAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEA 676 (688) Q Consensus 640 q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a 676 (688) .+...+... 
.+....+ .+| T Consensus 504 ---~~~~~a~~~-~~~~~~~--------------~~~ 522 (522) T protein:vir:94 504 ---GANMGAAVG-QGAGEDM--------------AQA 522 (522) T ss_pred ---HHHhhhhhh-cccchhh--------------hcC Confidence 000000000 0000000 000 No 38 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.87 E-value=9.2e-22 Score=135.90 Aligned_cols=464 Identities=9% Similarity=-0.001 Sum_probs=241.9 Q ss_pred CCCCCC---CcCCCCccchHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhH Q lcl|NC_020488. 1 MLPGNE---PIKTRDDDSQEAILQEIRERAAHAVTCW-KHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKL 74 (688) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i 74 (688) ..+.+. .+....++......+.++.. .+.+ .....+..+-.+||.|++..-......+..++| .++.|.. T Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~----i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~ 94 (501) T protein:vir:27 19 RFHRESRIRYRADNLEELMVNNWELLKNF----INHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYG 94 (501) T ss_pred ccChhHHHhhccccccccccccHHHHHHH----HHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchH Confidence 111111 11111222211111222222 2222 223345667789999987642222222333444 5788999 Q ss_pred HHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCce Q lcl|NC_020488. 75 PQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGW 154 (688) Q Consensus 75 ~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~ 154 (688) +.+|+..+|+...+++.+.+... 
...+.+...+..++..|+++.....+..++++.|.+| T Consensus 95 k~Ivd~~~~yl~g~p~~~~~~d~--------------------~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~ 154 (501) T protein:vir:27 95 RMISKFKTGYLAGNPIRVEYDDN--------------------DNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAY 154 (501) T ss_pred HHHHHHHhhhhcccCeeEecCCc--------------------cchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEE Confidence 99999999999999887765321 2234556667777888999999999999999999999 Q ss_pred EEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccccc Q lcl|NC_020488. 155 LRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSW 232 (688) Q Consensus 155 ~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 232 (688) ..|+.+. ++.+.+..+ +|..++ ||+.... +.. .+++.|... T Consensus 155 ~~vy~de------d~~~~i~~~-~p~~~~~v~d~~~~~----~~~-~~ir~~~~~------------------------- 197 (501) T protein:vir:27 155 EVIYRNE------YDETRIKRL-NPLETFVIYDNSLED----NSI-AAVRYYNRG------------------------- 197 (501) T ss_pred EEEEeCC------CCceEEEEE-ccceeEEEecCCCCC----ceE-EEEEEEEee------------------------- Confidence 8887542 356777666 677754 5653211 111 222222110 Q ss_pred CCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCC Q lcl|NC_020488. 233 WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPG 312 (688) Q Consensus 233 ~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~ 312 (688) ...+.+..+++|....+. .+...+...+.+..|.+. T Consensus 198 -~~~~~~~~~~vyt~~~v~-------------------------------------------~~~~~~~~~~~~~~~~~~ 233 (501) T protein:vir:27 198 -TLQNAKDVVEIYTNEHIY-------------------------------------------TLDASDDFNEISVTTHAF 233 (501) T ss_pred -ecCCcEEEEEEEeCCeEE-------------------------------------------EEEeCCceeeccccccCC Confidence 011223445555443211 111111111222334444 Q ss_pred CccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceee Q lcl|NC_020488. 
313 STIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLR 392 (688) Q Consensus 313 ~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 392 (688) +.+|+|+|. ....|.|.+..++++++.+|..+|.+...+....++.+++.........+..... ...+.+.+ T Consensus 234 g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~-~~~~~~~~ 305 (501) T protein:vir:27 234 GTVPITEFL-------NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDM-KRTRLMQL 305 (501) T ss_pred CcccEEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhh-hhcCceee Confidence 667776542 2456889999999999999999999999998887777665433332222222211 11223332 Q ss_pred cCcc-----cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 393 YNAI-----PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 393 ~~~~-----~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) ..+. .+...+.++..+.-..++...++.+...|-.+|++.+.+.|.-+++.||+|+..+-...........+.|. T Consensus 306 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~ 385 (501) T protein:vir:27 306 KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFT 385 (501) T ss_pred cccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211 11223455555444566777889999999999999988887766667999998887666666777777777 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .+++++.++++.++.... 
.....++ .+|.|.=.+..+....+..+ T Consensus 386 ~~l~~~~~li~~~~~~~~----------~~~~~d~-------------------------~~i~v~f~~~~p~n~~e~ad 430 (501) T protein:vir:27 386 QGLKRRYRLAARIGSLVN----------EFKDFDE-------------------------SLLKITFTPNLPKSLNEQVS 430 (501) T ss_pred HHHHHHHHHHHHHHhhcc----------ccccccc-------------------------ccceEEeCCCCCcCHHHHHH Confidence 778777777766543211 1000011 12233333444544555555 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHH-----hhhhhhhhhhHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEME-----EAGIEPPQPSPEQQANMAQAQAD 621 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~q~~~~~~q~~ 621 (688) .++.+... +....+++++++ ...++-.+++++............. ...............+ T Consensus 431 ~~~kl~g~------iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e------- 497 (501) T protein:vir:27 431 ILTGLGGQ------VSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFE------- 497 (501) T ss_pred HHHHHhcc------CcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCcccccc------- Confidence 55554322 223445555443 2333334444332111000000000 0000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 622 MEKAKADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 622 ~~~~q~e~~~~q~e~~~~q~~~~~ 645 (688) -+. + T Consensus 498 ----------~~~----------~ 501 (501) T protein:vir:27 498 ----------RAY----------E 501 (501) T ss_pred ----------ccC----------C Confidence 000 0 No 39 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.87 E-value=1.1e-19 Score=124.58 Aligned_cols=527 Identities=12% Similarity=0.056 Sum_probs=263.1 Q ss_pred CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHH Q lcl|NC_020488. 
7 PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQR 86 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~ 86 (688) ++.....+ ..-.+.++.+|+...+.++.|...|++..+|..-.-.+.+. . ......+.+.-..-...+++..+.+. T Consensus 1 ~~~~~~~~--~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~-~-~~~~~~~~~~dst~~~a~~~Laa~l~ 76 (535) T protein:vir:94 1 MASSQKRE--GFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDS-D-NASTDYTTPWQAVGARGLNNLASKLM 76 (535) T ss_pred CCchhhhh--hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCC-C-ccccccCCcccccHHHHHHHHHHHHH Confidence 22221211 12234477888888888889999999999987421111000 0 01111222333333344444444443 Q ss_pred h----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeec Q lcl|NC_020488. 87 Q----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYS 162 (688) Q Consensus 87 ~----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~ 162 (688) . +++=+++.+.+....... .......+-...+ +..+..+...+..|++..+...++.+.+..|.|++.+.- T Consensus 77 ~~ltP~~~WF~l~~~d~~~~~~~-~~~~~~~~v~~~L-~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~--- 151 (535) T protein:vir:94 77 LALFPMQTWMKLTISEFEAKQLV-AQPAELAKVEEGL-SMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPE--- 151 (535) T ss_pred hhhcCCCCccccccChhhhhccc-cchhHHHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeecc--- Confidence 3 233223333221100000 0000000011112 233344445567899999999999999999999876532 Q ss_pred cCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEE Q lcl|NC_020488. 163 TDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVS 242 (688) Q Consensus 163 ~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~ 242 (688) ++..+ +.+..+ +..++++..++.- ...-|+++..++.+.+-..|++..... . 
+ ....+.+.|+ T Consensus 152 --~~~~~-~~f~~~-pl~~y~v~~d~~G----~vd~i~r~~~~~~~~l~~~~~~~~~~~----~--~---~~~~~~v~v~ 214 (535) T protein:vir:94 152 --PEGTY-NPMKLY-RLSSYVVQRDAFG----TVLQIVTLDKTAYAALPEDVRNSMDSS----Q--E---HKGDEMIDVY 214 (535) T ss_pred --CcCcc-cceEEE-EcCeEEEeeCCCC----CeEEEEeeeeccHHHhhHHHHHHHHhc----c--c---cCCCceeEEE Confidence 11111 233334 3455666544321 234567888999999887776532110 0 0 1123456666 Q ss_pred EEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEee Q lcl|NC_020488. 243 EYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLG 322 (688) Q Consensus 243 e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~ 322 (688) ++-+++... + .+.++ +...|..+.-..+.+++..+||+++.+ T Consensus 215 ~~v~~~~~~--------~-----------------------------~~~~~-~e~~g~~~~~~~~~~g~~~~P~~~~Rw 256 (535) T protein:vir:94 215 THIYLDEES--------G-----------------------------EYLKY-EEIDGVEVEGTDASYPVDACPYIPVRM 256 (535) T ss_pred EEEEeeCCC--------C-----------------------------cEEEE-EEecCeeeccccccCccccCCceeeee Confidence 654433211 1 11122 223343332223456778999996654 Q ss_pred eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccc Q lcl|NC_020488. 323 KEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRP 402 (688) Q Consensus 323 ~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (688) ...+|..||.|.+....+-.+.+|++....+.......++.++++.+.+.+...+.. ..+|.++. 
+ ..+.-.+ T Consensus 257 --~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~---~~~g~~v~-g-~~~~v~~ 329 (535) T protein:vir:94 257 --VRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTK---AQTGDFVS-G-RPEDISF 329 (535) T ss_pred --eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhccc---CCCceeec-C-Cccccee Confidence 467999999999999999999999999999999999999999999888877665432 22344433 2 1111122 Q ss_pred eecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_020488. 403 QRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELI 481 (688) Q Consensus 403 ~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li 481 (688) .......--+....+++...+.|.... ..+.+...++...|++-|..+.+.....|...+.+|. .++..+.+..+.++ T Consensus 330 ~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il 408 (535) T protein:vir:94 330 LQLEKAADFSVARAVSEQIEGRLSYAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQL 408 (535) T ss_pred eecccccchhHHHHHHHHHHHHHHHHH-hHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 223333344556677788888887655 2222223456678999999999999999999998885 56777777777766 Q ss_pred HHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHH Q lcl|NC_020488. 482 PRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGG 561 (688) Q Consensus 482 ~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~ 561 (688) .+.- . +.+. 
..++ +++.+.+ +-....|.+.++.++++++.+.++.+ T Consensus 409 ~r~g---------------~---lP~~-----------p~~~----v~~~~vs-~la~l~r~~~~~~l~~~~~~laq~~P 454 (535) T protein:vir:94 409 QATN---------------Q---IPEL-----------PKEA----VEPTIST-GMEALGRGQDLDKLERCIAAWSALAP 454 (535) T ss_pred HhCC---------------C---CCCC-----------Chhh----ccceEee-hHHHHHHHHHHHHHHHHHHHHHhhCh Confidence 5431 0 0000 0111 1233322 23345566777777777765443332 Q ss_pred HHHHHHHHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_020488. 562 VVLDLIAKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAK-AQADMAM 638 (688) Q Consensus 562 ~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~-~q~e~~~ 638 (688) ..+. ..-+.+++...+....+-.. .-..+.+.++ ..+++++ ++++.+++.++. +...+++ T Consensus 455 ----~~ld--~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~--------~~~q~~~---~~~~~~~~~~~g~~~~~~~~ 517 (535) T protein:vir:94 455 ----MQGD--PDINIATIKLRIANAIGIDTSGILKTPEEKQQ--------EMAEAAQ---GTAMQNAAASAGAGAGTMAT 517 (535) T ss_pred ----HHhh--hcCCHHHHHHHHHHHhCCChhhhcCCHHHHHH--------HHHHHHH---HHHHHHHHHHHHHhhhcccc Confidence 2221 12345666666655544221 1111111000 0000000 000000000000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020488. 639 AQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGN 687 (688) Q Consensus 639 ~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~ 687 (688) ...+.+.... 
.....+-| T Consensus 518 ~~~~~~~~~~-------------------------------~~~g~~~~ 535 (535) T protein:vir:94 518 ASPENMKAAA-------------------------------AQAGMAPN 535 (535) T ss_pred cChHHHHHHH-------------------------------HHhccCCC Confidence 0000000000 00001111 No 40 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.87 E-value=1.8e-19 Score=123.34 Aligned_cols=533 Identities=11% Similarity=0.040 Sum_probs=263.9 Q ss_pred CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCC----CCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 7 PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGE----QWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~----Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) ++.++.+.... +.++.+|+...+..+.|...|++..+|..-. +++..- .+ ...+.-..-...+++.. T Consensus 1 ~~~~~~~~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~-----~~-~~~~~dst~~~a~~~La 71 (543) T protein:vir:88 1 MAETKREGLAE---EGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSS-----TD-YTTPWQAVGARGLNNLS 71 (543) T ss_pred CcccccCcchH---HHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCccc-----cc-ccccccchHHHHHHHHH Confidence 55555444444 3456677777788889999999999888532 222110 01 11122333334444444 Q ss_pred HHHHhC----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 83 GDQRQN----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 83 g~~~~~----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +.+... ++=+++.+.+....... .......+-+..+ +..++.+...+..|++..+...++.+.+..|.|++.+. 
T Consensus 72 a~l~~~ltP~~~WF~l~~~d~~~~~~~-~~~~~~~~v~~~L-~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~ 149 (543) T protein:vir:88 72 AKVMLALFPLQSWMKLKVSEWQAKQLV-SDPSQLAVVEQGL-GMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLP 149 (543) T ss_pred HHHHHhhcCCCcccccccChHHHhccc-CChhhHHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeec Confidence 444433 22112222221100000 0000000111112 23344555566789999999999999999999986542 Q ss_pred EeeccCCCCCc-ceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 159 TKYSTDDAFDL-DLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 159 ~~~~~~~~~~~-~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) +++..+ ......+.+..++++..++. -...-++++..++..++-..||+..... . ... ..+ T Consensus 150 -----~~~~~~~~~~~~~~~pl~~y~v~~d~~----G~v~~i~r~~~~~~~~l~~~~~~~v~~~----~----~~~-p~~ 211 (543) T protein:vir:88 150 -----PPDASSNSYNPMKLYTLHNHVVQRDAF----GNVLQIVTLDKVAYAALPEDVRNSLSGG----Q----EYK-PEQ 211 (543) T ss_pred -----cCccccceecceEEeEcceEEEeeCCC----CCeeeeeeeeeccHHHHhHHhhHHHHHH----h----hcC-Ccc Confidence 121111 11111122233455443321 1345577888999999988776532111 1 111 123 Q ss_pred EEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccce Q lcl|NC_020488. 238 GVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~ 317 (688) ++.|+++=+.++.. +...+ +..+.+..+.-..+.|+.+.+|| T Consensus 212 ~~~v~~~V~pr~~~--------~~~~~------------------------------~~~~~~~~v~~~~~~~~~~e~P~ 253 (543) T protein:vir:88 212 ELEVYTHIYIDDES--------GDFLS------------------------------YQEIEGVEVDGSDGQYPQDALPW 253 (543) T ss_pred ceEEEEEEEeecCC--------Ccccc------------------------------cccccCeeeecCCCccccccCCc Confidence 56666553332211 11000 00111222211234456678999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 
318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +++.+ ...+|..||.|.+....+-.+.+|.+....+..+....+++++++.+.+.+..++.. ..++.++. +.. T Consensus 254 i~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~---~~~g~~v~--g~~ 326 (543) T protein:vir:88 254 IAVRW--TKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVK---AQTGDFVA--GRK 326 (543) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhccc---CCCceeec--CCC Confidence 96544 567999999999999999999999999999999999999999998888877655432 12233322 112 Q ss_pred ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_020488. 398 GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQI 476 (688) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~ 476 (688) +.-.+.......--....+.++...+.|.+.. ..+.+...++...|++-|..+.+.....|...+.+|. .++..+.+. T Consensus 327 ~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 405 (543) T protein:vir:88 327 ADIEFLQLEKTADFTVAKSVADAIEARLSYVF-MLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRV 405 (543) T ss_pred CcceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 22122222333334557777888888888766 2333333566678999999999999999999999985 577777777 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) .+.++.+.-- +.+. ..+ .+.+.+.. 
+-.+..|.+..+.++.+++.+ T Consensus 406 ~~~il~r~g~------------------lP~~-----------p~~----~v~~~~vs-~l~~l~r~~~~~~l~~~~~~v 451 (543) T protein:vir:88 406 LLNQLQATQQ------------------IPNL-----------PQE----AVEPTVTT-GAEALGRGQDLDKLTQFLNAV 451 (543) T ss_pred HHHHHHhcCC------------------CCCC-----------chh----ceeeeEEe-cHHHHHHHHHHHHHHHHHHHH Confidence 7776655310 0000 001 12223222 223455777777777777765 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) ..+.+ +.++ +.-+.+++...+....+-........+++.++..+++.++++++ .++.++......+. T Consensus 452 ~~~~~---p~vl---d~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~-------~~~~~~~~~~~~~~ 518 (543) T protein:vir:88 452 ATVSQ---LNGD---PDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGL-------NAAAGIGSGVAAQA 518 (543) T ss_pred Hhccc---hhhh---ccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHH-------HHHHHHhhchhhhh Confidence 44332 3333 33455777766665554321111111110000000000000000 00000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 637 AMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELM 681 (688) Q Consensus 637 ~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~ 681 (688) .+. - ++ ++.....+..+ .....-|. 
T Consensus 519 ~~~-~---~~------~~~~~~~~~~~----------~~p~~~~~ 543 (543) T protein:vir:88 519 TAS-P---EA------MESAMDTAGVQ----------PGPIATQV 543 (543) T ss_pred ccC-h---HH------HHHHhhhcCCC----------CCCCCCCC Confidence 000 0 00 00000000000 00000000 No 41 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.86 E-value=1.6e-20 Score=129.17 Aligned_cols=475 Identities=10% Similarity=0.005 Sum_probs=239.7 Q ss_pred CCCCCCCcC---CCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHH Q lcl|NC_020488. 1 MLPGNEPIK---TRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLP 75 (688) Q Consensus 1 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~ 75 (688) ..|+..-.- ...+.........+.....+... ..+....+-.+||.|.|..-.........++ ..++.|..+ T Consensus 20 ~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~---~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k 96 (512) T protein:vir:97 20 LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMD---YQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYAS 96 (512) T ss_pred eeccccccccccCchhhhhhhhHHHHHHHHHHHHH---hhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHH Confidence 445444221 11111111112222222222111 2233455667899998763211111122233 357889999 Q ss_pred HHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceE Q lcl|NC_020488. 76 QYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWL 155 (688) Q Consensus 76 ~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~ 155 (688) .+|+..+|+...+.+.+.. - |. -....+..++..|+++.....+..+++++|.+|. 
T Consensus 97 ~Ivd~~~~yl~g~p~~~~~--~------------------d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~ 152 (512) T protein:vir:97 97 YISDFINGYFLGNPIQCQD--D------------------DK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) T ss_pred HHHHHHhhhhcccCceecc--C------------------Ch----HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEE Confidence 9999999999998877643 0 11 2345577778889999999999999999999998 Q ss_pred EEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccC Q lcl|NC_020488. 156 RVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWW 233 (688) Q Consensus 156 ~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~ 233 (688) .++.+. ++.+.+..+ +|..+| ||+.... -...+++.|... .... T Consensus 153 ~vy~de------d~~~~i~~~-~p~~~~~iyd~~~~~-----~~~~~vr~~~~~----------------------~~~~ 198 (512) T protein:vir:97 153 LMIRNQ------DDETRLYKS-DAMSTFVIYDNTIER-----NSIAGVRYLRTK----------------------PIDK 198 (512) T ss_pred EEEeCC------CCceEEEEE-cccceEEEEcCCCCC-----ceEEEEEEEEee----------------------eccc Confidence 876542 356777666 677754 6654321 122333333211 0000 Q ss_pred CCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCC Q lcl|NC_020488. 234 TNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGS 313 (688) Q Consensus 234 ~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~ 313 (688) .+.+.+..+++|.......+. ..++..... ......+.|.+.+ T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~--~~~~~~~~~-----------------------------------~~~~~~~~~~~~g 241 (512) T protein:vir:97 199 TDEDEVFTVDLFTSHGVYRYL--TSRTNGLKL-----------------------------------TPRENGFESHSFE 241 (512) T ss_pred cccceEEEEEEEeCCcEEEEE--ecCCCcccc-----------------------------------cccccccccccCc Confidence 112334455666544321111 111111000 0011233455567 Q ss_pred ccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec Q lcl|NC_020488. 
314 TIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY 393 (688) Q Consensus 314 ~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 393 (688) .+|+|+|. ....|.|.+..++++++.+|...|.+.+.+....++.+++......+.++... ....+.+... T Consensus 242 ~vPvv~~~-------nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~--~~~~~~~~~~ 312 (512) T protein:vir:97 242 RMPITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLE 312 (512) T ss_pred ccceEeec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhh--hhhccccccc Confidence 77777642 23457799999999999999999999999988887776654322222222111 1111111111 Q ss_pred C----------cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 N----------AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYI 463 (688) Q Consensus 394 ~----------~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~ 463 (688) . +..++..+.++..+.-..+....++.+...|-.+|++.+.+.|.-+++.||.|+..+-........... T Consensus 313 ~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~ 392 (512) T protein:vir:97 313 PTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 392 (512) T ss_pred ccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHH Confidence 0 111223345555444456677888899999999999999888866666899999988777777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHH Q lcl|NC_020488. 464 DNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRM 543 (688) Q Consensus 464 dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~ 543 (688) +.|..++++++++++.++...-. .+...++ .+|.+.=.+..+.... 
T Consensus 393 ~~f~~~l~~~~~li~~~~~~~~~---------~~~~~d~-------------------------~~i~~~f~~~~p~~~~ 438 (512) T protein:vir:97 393 GLFTKGLRRRAKLLETILKNTRS---------IDANKDF-------------------------NTVRYVYNRNLPKSLI 438 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhcCC---------ccccccc-------------------------ccceEEeCCCCCcCHH Confidence 77778887777777665532210 0000111 0122222334444444 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_020488. 544 EAADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADM 622 (688) Q Consensus 544 ~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~ 622 (688) +..+.+..+... +....+++++++ ...++-.+++.+.................. ...... T Consensus 439 e~~~~~~kl~gi------iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~--~~~~~~----------- 499 (512) T protein:vir:97 439 EELKAYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR--DINDDE----------- 499 (512) T ss_pred HHHHHHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCC--CCCCCC----------- Confidence 444545544322 223444555443 333333444433211100000000000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 623 EKAKADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 623 ~~~q~e~~~~q~e~~~~q~~~~~ 645 (688) +.+...-...+.+ T Consensus 500 ----------~~~~~~~~~~~~~ 512 (512) T protein:vir:97 500 ----------QDDDTKDTVDKKE 512 (512) T ss_pred ----------CCCCccccccccC Confidence 0000000000000 No 42 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.85 E-value=2.8e-20 Score=127.76 Aligned_cols=437 Identities=11% Similarity=0.049 Sum_probs=232.2 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i 78 (688) .||+.+.+ ..+.+.++.+. +...+....+..+||.|.|= -.....+..++| .+++|..+.+| T Consensus 10 ~~~~~~~~-------~~~~i~~~i~~-------~~~~~~r~~~~~~Yy~g~~~--i~~~~~~~~~~~~~ki~~n~~~~iv 73 (452) T protein:vir:36 10 TFSKDEPI-------TVEVVTKFMEK-------HKLEVARYEYLKNMYLGIMA--IDDEPAKDSWKPDNRLAVNFTKYIV 73 (452) T ss_pred EcCCccCC-------CHHHHHHHHHH-------HHHHHHHHHHHHHHhccccc--cccCccccccCccceeecchHHHHH Confidence 45555533 22334443332 23344455677899999761 111222333444 47789999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+++...+.+.+.+ - |. .....++.++..|+++.....+..+++++|.||..++ T Consensus 74 d~~~~~l~g~~~~~~~--~------------------d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 129 (452) T protein:vir:36 74 DTFTGYFNGIPVKKSH--S------------------DK----EILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLY 129 (452) T ss_pred HHHhhhhcccCceeec--C------------------Ch----hHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEE Confidence 9999999988866543 1 11 1234577777889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+ +||+... +-...+++.|.+. 
T Consensus 130 ~d~------~g~~~i~~~-~p~~~~~v~d~~~~-----~~~~~~i~~~~~~----------------------------- 168 (452) T protein:vir:36 130 QDE------DTQTNVVYN-SPENMFMVYDDTVK-----QEPLFAVRYGVDE----------------------------- 168 (452) T ss_pred ecC------CCeeEEEEE-cccceEEEEcCCCC-----CceEEEEEEEEec----------------------------- Confidence 542 356666666 67765 4564321 1112222333211 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +....+++|..... ..+...+ +...+....|.+.+.+| T Consensus 169 ~~~~~~~vyt~~~i--~~~~~~~----------------------------------------~~~~~~~~~~~~~g~iP 206 (452) T protein:vir:36 169 DKKLQGEVYTLLET--IKISGEN----------------------------------------DEISFGEGTYNPYPDLP 206 (452) T ss_pred CceEEEEEEecCeE--EEEEEcC----------------------------------------CceEEecceeccCCccc Confidence 11122344432211 0011111 11112223344445666 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA- 395 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~- 395 (688) +|+|. +...|.|.+..++++++.+|..+|.+...+....++.+++.....+. +.. .. .+..+.+..... T Consensus 207 vv~~~-------n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~-~~~-~~-~~~~~~~~~~~~~ 276 (452) T protein:vir:36 207 VVEFY-------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE-EDL-KN-IRSNRVINYYADG 276 (452) T ss_pred EEEec-------CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc-hhh-hh-hhhcceEEecCCC Confidence 66542 23457799999999999999999999999988888877765433322 111 11 122233332221 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQ 475 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 475 (688) ......+.++..+.-..++...++.+...|-..|++.+.+.+..+ +.||.|+..+-.............|..+++.+++ T Consensus 277 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 355 (452) T protein:vir:36 277 EGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFG-SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYK 355 (452) T ss_pred CccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122234555554444566777888999999999999887776654 4699999888777777777777777778877777 Q ss_pred HHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 476 ILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 476 ~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~ 555 (688) +++.+.... +. ..++ .+|.|.=.+..+....+..+.++++... T Consensus 356 li~~~~~~~----------~~--~~~~-------------------------~~i~i~f~~~~p~d~~~~a~~~~k~~g~ 398 (452) T protein:vir:36 356 LFCELSTNV----------SN--KDSW-------------------------KDIEYTFTRNEPKDIKEQAETANILMGI 398 (452) T ss_pred HHHHHHhcc----------CC--cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhcc Confidence 776654321 10 1111 1222322334444344444555544321 Q ss_pred hHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQA 634 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~ 634 (688) +....+++++++ ...++-.+++++............ .. ... 
T Consensus 399 ------iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~-----~~--~~~------------------------- 440 (452) T protein:vir:36 399 ------TSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDK-----QP--SEK------------------------- 440 (452) T ss_pred ------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhc-----cC--CCC------------------------- Confidence 223444555543 233333344432211100000000 00 000 Q ss_pred HHHHHHHHHHHHHHH Q lcl|NC_020488. 635 DMAMAQAKTAEAQAK 649 (688) Q Consensus 635 e~~~~q~~~~~~~a~ 649 (688) ..+.+.-+.+.+ T Consensus 441 ---~~~~~~~~~~~e 452 (452) T protein:vir:36 441 ---GTDTVVSETNEE 452 (452) T ss_pred ---cccccCccccCC Confidence 000000000000 No 43 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.85 E-value=2.3e-20 Score=128.20 Aligned_cols=427 Identities=10% Similarity=0.025 Sum_probs=224.6 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHHHHHHHHHHhCCcceEEEe Q lcl|NC_020488. 19 ILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYVDQVLGDQRQNRPAIQVHP 96 (688) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i~~i~g~~~~~r~~~~v~p 96 (688) ++.. | ....+....+..+||.|+|+.-.........+++ .+++|..+.+|+..+|+...+.+.+.+.. T Consensus 1 ~~~~----~------~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~ 70 (440) T protein:vir:95 1 MLAA----F------LGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVME 70 (440) T ss_pred Chhh----H------HHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCC Confidence 1111 1 1223445566679999998743222222334444 57889999999999999999988875422 Q ss_pred CCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEEe Q lcl|NC_020488. 
97 VEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSI 176 (688) Q Consensus 97 r~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v 176 (688) . .+.+. + ..+..++..|+++.....+..+++++|.||+.++.+ .++.+.+..+ T Consensus 71 ~-----------------~~~~~---~-~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d------~~~~~~i~~~ 123 (440) T protein:vir:95 71 G-----------------GSADQ---L-STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRD------KDKVDRVVLI 123 (440) T ss_pred C-----------------ccHHH---H-HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEec------CCCceEEEEE Confidence 1 12222 2 235566788999999999999999999999888754 1356677666 Q ss_pred cccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceee Q lcl|NC_020488. 177 HNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKL 254 (688) Q Consensus 177 ~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~ 254 (688) +|.+++ ||+.... . ...+++.|...+ ....++|.......+. T Consensus 124 -~p~~~~~~~d~~~~~----~-~~~~i~~~~~~~------------------------------~~~~~vyt~~~~~~~~ 167 (440) T protein:vir:95 124 -SPLEMFVIRDLTVEQ----N-IIAAVHLPIYAD------------------------------KVNMTVYTKDKVITYK 167 (440) T ss_pred -cccceEEEEcCCCCC----c-eEEEEEEEEecC------------------------------ceEEEEEeCCeEEEEE Confidence 677654 6664321 1 222233332110 0012333322111100 Q ss_pred eeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCccccc Q lcl|NC_020488. 255 LLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYR 334 (688) Q Consensus 255 ~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~ 334 (688) .. .++ .+...+.++.|.+.+.+|+|+|.. ...|. 
T Consensus 168 ~~-~~~--------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n-------~~~g~ 201 (440) T protein:vir:95 168 PY-SNN--------------------------------------SVRLVVDDVKKHSYNDVPVVEWWN-------NRFRM 201 (440) T ss_pred Ee-cCC--------------------------------------ccceeecceeeccCceeeEEEeeC-------CCCCC Confidence 00 000 001112233344445566665422 34577 Q ss_pred chHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhh--hcchHHHHhhcccCCCceeecCc-----ccccccceecCC Q lcl|NC_020488. 335 GLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAES--IEGYEEEWNQANRKNQSVLRYNA-----IPGVDRPQRDMP 407 (688) Q Consensus 335 g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~--i~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 407 (688) |.+..++++++.+|..+|.+...+.....+.+++.... .....+.... .+..+.+..... ......++++.. T Consensus 202 sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~lt~ 280 (440) T protein:vir:95 202 GDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAK-MKDANMLFLKTGISTTGQQTTADASYIYK 280 (440) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhh-hhhccceecccccccccCCCCcceeEEee Confidence 99999999999999999999999988777766543211 1001111110 111122221111 111223455544 Q ss_pred CcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCc Q lcl|NC_020488. 408 ASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDS 487 (688) Q Consensus 408 ~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~ 487 (688) +.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-.............|..+++++++++..++..... 
T Consensus 281 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~- 359 (440) T protein:vir:95 281 QYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING- 359 (440) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC- Confidence 4445667788999999999999999988877666689999988876666677777777777777777776655432210 Q ss_pred ceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_020488. 488 DRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLI 567 (688) Q Consensus 488 ~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~ 567 (688) . ..+ ..++.|.=.+..+....+..+.+..+... +....+ T Consensus 360 --------~--~~~-------------------------~~~v~i~f~~~~p~~~~~~ad~~~kl~g~------iS~et~ 398 (440) T protein:vir:95 360 --------P--VIE-------------------------ANKLTFTFHPNIPQDVWTEIKAYIEAGGE------ISQETL 398 (440) T ss_pred --------c--ccc-------------------------cccceEEeCCCCCCCHHHHHHHHHHHhcc------CcHHHH Confidence 0 000 12333333344444444455555544321 223344 Q ss_pred HHhcCCccHHHHHHHHHhhccccccchhhHHhh--hhhhhhhhHH Q lcl|NC_020488. 568 AKNMDWPGAQDIARRLQKTLPPGILDQDEMEEA--GIEPPQPSPE 610 (688) Q Consensus 568 ~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 610 (688) ++++++-..+.-.+++.+..... ..+.... 
.....+..++ T Consensus 399 ~~~l~~~d~~~E~~ri~~E~~~~---~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 399 MENASFTDYKTEHSRILKQGGSS---DLEIGQIVGDADVGQADTE 440 (440) T ss_pred HHhCCCCCcHHHHHHHHHHHHHh---hhhHHhhccCCCCCCcCCC Confidence 55554432221122222111000 0000000 0000000000 No 44 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.85 E-value=7.9e-19 Score=119.84 Aligned_cols=520 Identities=10% Similarity=0.041 Sum_probs=258.9 Q ss_pred CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCC----CCCHHHHHHHHhcCCCceeehhHHHHHHHHH Q lcl|NC_020488. 7 PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGE----QWPESVRKEREDEGRPCLTLNKLPQYVDQVL 82 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~----Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~ 82 (688) ++.++.+-.. .+.++.+|+...+..+.|...|++..+|..-. .+..... .. ..+.-..-...+++.. T Consensus 1 m~~~~~~~~~---~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~----~~--~~~~dst~~~a~~~LA 71 (532) T protein:vir:99 1 MAEVEKTGFA---ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGST----SY--TTPWQSIGARGLNNLA 71 (532) T ss_pred Ccchhhcccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchh----hc--cccccchHHHHHHHHH Confidence 3443333222 25566778877788888999999988887432 2221110 00 1122233334444444 Q ss_pred HHHHh-----CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 83 GDQRQ-----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 83 g~~~~-----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) +.+.. +++=+++.+.+........ 
......+-+..++ .++..+...+..|++..+...++.+.+..|.|++.+ T Consensus 72 a~L~~~ltpp~~~WF~l~~~d~~l~~~~~-~~~~~~~v~~~L~-~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~ 149 (532) T protein:vir:99 72 SKLMLALFPVGSSFFKLNVSELEVKQSIT-SPEELTEIATGLA-MVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI 149 (532) T ss_pred HHHHHhhcCCCCccccccCCHHHHhccCC-ChhhHHHHHHHHH-HHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEe Confidence 44443 2333344443321100000 0000011122222 345555666778999999999999999999998765 Q ss_pred EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) ..+... . .....+..+ +..++++..++. . ...-++++..++.+.+-+.++... ....+. ..... T Consensus 150 ~~~~~~-~--~~~~~f~~~-pl~~y~v~~d~~-G---~v~~ivrr~~~~~~~l~e~~~~~~-------~~~~~~-~~p~~ 213 (532) T protein:vir:99 150 PSTEQV-E--GQSNAPKLY-KLHNFVVERDAY-D---NVLQIVTEDKIARAALPEDVRKSL-------EDAQGD-QNPSE 213 (532) T ss_pred cccccc-c--CcccceEEE-EcCeEEEeeCCC-C---CeeeEeeeeeecHHhcChHHHHHh-------hccccc-cCCCc Confidence 422111 1 122334444 345677665432 1 123356677788777644333221 111111 12234 Q ss_pred EEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccce Q lcl|NC_020488. 238 GVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~ 317 (688) +|.|+++.++++... .+.+|. 
...|..+....+-|+...+|| T Consensus 214 ~v~v~~~v~~~~~~~-------------------------------------~~~~~~-~~~g~~~~~~~~~~~~~e~P~ 255 (532) T protein:vir:99 214 EVTIYTHVYRDPEAM-------------------------------------VFRSYQ-EIDGEIVAGTEGEYPLDSCPW 255 (532) T ss_pred ceEEEEEEEecCCCC-------------------------------------eeEEEE-eecCceecccccccccccCCc Confidence 577777666543210 011221 123333322234566788999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) ++..+ ...+|..||.|.+....+-.+.+|.+....+.......++.++++.+.+.+...+.. ..+|.++.- .. T Consensus 256 ~~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~---~~~g~~v~g--~~ 328 (532) T protein:vir:99 256 IPVRL--IKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK---ANTGDFVAG--RK 328 (532) T ss_pred eeeee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhcc---CCCcceecC--Cc Confidence 96544 567999999999999999999999999999999999999999999888877765432 223444331 11 Q ss_pred ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_020488. 398 GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQI 476 (688) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~ 476 (688) +.-.+.......--+.....++...+.|.+.. ..+.+...++...|++.|..+.+.....|...+.+|. .++..+.+. 
T Consensus 329 ~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) T protein:vir:99 329 QDVEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) T ss_pred ccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 11112222233334556677777788887754 3332333456678999999999999999999998885 577777777 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) .+.++.+.- . +.+. ..++. .-++++-+ +.- -|.+..+.++.+++.+ T Consensus 408 ~~~il~r~g---------------~---lP~~-----------p~~~~--~~~iv~~i--s~L-araq~~~~l~~~~~~l 453 (532) T protein:vir:99 408 LLKELQATS---------------K---IPNL-----------PKEAV--EPAIATGL--EAL-GRGHDLNKLNVFIDYM 453 (532) T ss_pred HHHHHHhcC---------------C---CCCC-----------Chhhc--ccceeecc--hHH-HHHHHHHHHHHHHHHH Confidence 777665421 0 0000 00110 11222211 211 2333445555555443 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQA 634 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~ 634 (688) .+.. +.++ +.-+.+++.+.+....+-.. .-..+.+.++..+ ++++++.++++..++.+.+ T Consensus 454 aq~~----p~~~---d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~-----------q~~~~~~~~~a~~~~~~~~ 515 (532) T protein:vir:99 454 IKLA----GLQD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMA-----------EASTAAGMVTAGQQMGAAG 515 (532) T ss_pred Hhhc----chhh---hhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHH Confidence 2222 2222 33455667766655543211 1111111110000 0000000000111100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
635 DMAMAQAKTAEAQAKLAEIEQA 656 (688) Q Consensus 635 e~~~~q~~~~~~~a~~~~~~~~ 656 (688) .++.+ ...+++..+..+ T Consensus 516 ~~~~~-----~~~~~~~~~~~~ 532 (532) T protein:vir:99 516 GQAAA-----AMMQQQAGMPTQ 532 (532) T ss_pred HHhcc-----hhHHhhcCCCCC Confidence 00000 000000000000 No 45 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.85 E-value=8.8e-20 Score=125.06 Aligned_cols=452 Identities=11% Similarity=0.033 Sum_probs=235.3 Q ss_pred cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCH------HHH-------HHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 14 DSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPE------SVR-------KEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~------~~~-------~~~~~~g~p--~~~~N~i~~~i 78 (688) .+ -+.+.++...+. ....+.+....+..+||.|+|=-. ... .....+++| .++.|..+.+| T Consensus 1 ~~-~e~~~~~i~~~~---~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 76 (471) T protein:vir:10 1 ME-IEVIKKIISSQM---VKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLL 76 (471) T ss_pred CC-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHH Confidence 22 223333333332 333455667788899999975100 000 000001222 47889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+.+.+.+ .|.+..+ .+..+. 
.|+++.....+..++.+.|.||..++ T Consensus 77 d~~~~yl~G~p~~~~~--------------------~~~~~~~----~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~ 131 (471) T protein:vir:10 77 DQKKAYALTYPPTFDV--------------------DDKKVND----MIVDVL-GDDYERISKQLCVNAGNAGIAWLHVW 131 (471) T ss_pred HhhhhhhcccCceecc--------------------CChHHHH----HHHHHH-hcCHHHHHHHHHHHHhhCCeEEEEEE Confidence 9999999998776533 1222233 344444 47899999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) ++.. ++.+.+..+ +|..+ +||+... +-...+++.|...+. .+. T Consensus 132 ~d~~-----~g~~~~~~~-~p~~~~~i~d~~~~-----~~~~~~ir~~~~~~~------------------------~~~ 176 (471) T protein:vir:10 132 KDAS-----DNSFRYACV-DSKEVIPIYSKSLD-----KKSIGVLRVYSSIDE------------------------TDG 176 (471) T ss_pred eeCC-----CCeeEEEEE-cccceEEEEcCCCC-----CceEEEEEEEEeecc------------------------CCC Confidence 6522 356777777 67775 4554321 113333444432110 012 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) ..+..+++|.......+ ...++............ .......+.....++.|.+.+.+| T Consensus 177 ~~~~~~~vy~~~~~~~y--~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~g~iP 234 (471) T protein:vir:10 177 KNYTVYEYWNDKECSFY--RHEKEKPLEELETFQAI--------------------SLIDTMNGDRSSDNSFKHDFGLVP 234 (471) T ss_pred ceeEEEEEEeCCcEEEE--EecCCcccccccccccc--------------------cccccccccccccccccCCCCcee Confidence 23444566644322221 11122111111000000 000111233444455555556677 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcc Q lcl|NC_020488. 
317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAI 396 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 396 (688) +|+|.. ...|.|.+..++++++.+|.++|.+.+.+....++.+++.........+..... +..+.+.....+ T Consensus 235 vv~~~n-------~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~-~~~~~i~~~~~~ 306 (471) T protein:vir:10 235 FIPFKN-------NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDL-KRYKMIKMDNDG 306 (471) T ss_pred EEEecc-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHh-hcCCeEEecCCC Confidence 765422 344679999999999999999999999998888886665443233333333222 223333332221 Q ss_pred -cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 397 -PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQ 475 (688) Q Consensus 397 -~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 475 (688) .....+.++....-..++...++.+.+.|-..|+..+...+..+ +.||+|+..+-.............|..+++++.+ T Consensus 307 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 385 (471) T protein:vir:10 307 MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLG-NSSGVALKFLYSLLELKAGNMETQFRSGYATLVK 385 (471) T ss_pred CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22234566665555677888889999999999988777666544 4699999888766666666666666677766666 Q ss_pred HHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 476 ILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 476 ~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~ 555 (688) +++.++..+ ++ .+|.|.-.+..+....+..+.++.+.. 
T Consensus 386 li~~~~~~~----------------d~-------------------------~~i~i~f~~~~p~n~~e~~~~~~kl~g- 423 (471) T protein:vir:10 386 MILKHLGLS----------------DK-------------------------LKIKQTWTRNSINNDTEMAQVVSTLAT- 423 (471) T ss_pred HHHHHhccC----------------CC-------------------------ceeEEEeCCCCCCCHHHHHHHHHHHhc- Confidence 555543110 01 112222223333333344444444322 Q ss_pred hHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHH Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQ 612 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q 612 (688) .+....+++++++ .+.+.-.+++++......... ........+.+.+ T Consensus 424 -----~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~-----~~~~~~~~~~e~~ 471 (471) T protein:vir:10 424 -----ITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKL-----YDMEEVEHESEVE 471 (471) T ss_pred -----cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc-----cccCCCCCccccC Confidence 1333444555433 233333444433211100000 0000000001000 No 46 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.84 E-value=9.8e-20 Score=124.80 Aligned_cols=444 Identities=11% Similarity=0.027 Sum_probs=230.1 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i 78 (688) ++|..+.+- .+ . |+..+.. 
+....+....+-.+||.|+|= ........++| .++.|..+.+| T Consensus 18 ~~~~~~~~~------~~-~---i~~~i~~---~~~~~~~~~~~l~~Yy~g~~~---i~~~~~~~~~~~~ki~~n~~~~Iv 81 (470) T protein:vir:99 18 IFPKGEKLT------SN-E---LLGFIAY---NETVLKPRYRENMKLYLGKHK---ILTAPEKETGADNRIVVNSAKYVV 81 (470) T ss_pred EeCCCCCcC------HH-H---HHHHHHH---HHHhhHHHHHHHHHHhccccc---cccCcccccCCcceeecchHHHHH Confidence 788666222 12 2 2222221 122333455667799999751 11111122333 57889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+.+.+.+. + |.+..+ .+..++..|+++.....+..+++++|.+|..++ T Consensus 82 d~~~~~l~g~p~~~~~~--~-----------------d~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 138 (470) T protein:vir:99 82 DVYNGYFCGIEPKLALL--N-----------------DSSKID----EIARWNRQENFFDTINEISKQCDIFGRSIASIY 138 (470) T ss_pred HHHhhhhccCCeeEeeC--C-----------------chhHHH----HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEE Confidence 99999999887776541 1 222222 344566789999999999999999999988776 Q ss_pred EeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+ +||+.... . ..++++.|.... +. 
T Consensus 139 ~d~------dg~~~i~~~-~p~~~~~i~d~~~~~----~-~~~~vr~~~~~~--------------------------~~ 180 (470) T protein:vir:99 139 QGE------DARPHLMYS-SPNHAFIIYDDTVQR----Q-PLAFVHYQIDNS--------------------------NN 180 (470) T ss_pred eCC------CCeEEEEEE-ccceeEEEEcCCCCc----c-eEEEEEEEEEec--------------------------CC Confidence 531 356666666 67775 46653221 1 112222222100 01 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) ......++|+.... +.+...++ +......+..|.+.+.+| T Consensus 181 ~~~~~~~~~~~~~~--~~~~~~~~--------------------------------------~~~~~~~~~~~~~~g~vP 220 (470) T protein:vir:99 181 WTDAYGVIQYADKF--YKFKGYDI--------------------------------------EEDTNAAGYAINPYGLVP 220 (470) T ss_pred eeEEEEEEEecCeE--EEEEeccc--------------------------------------ccccccccccccCCCccc Confidence 11112222221100 00000000 000111122333445666 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHH--HHhhcccCCCceeecC Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEE--EWNQANRKNQSVLRYN 394 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~~~~~~ 394 (688) +|+|. +..+|.|.+..++++++.+|..+|.+...+....++.+++........+. ... ......++... T Consensus 221 vv~~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~--~~~~~~~~~~~ 291 (470) T protein:vir:99 221 AVEFF-------ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKF--DFKNNRVLYVS 291 (470) T ss_pred eEeec-------CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhh--hhhhcceeeec Confidence 66542 24567899999999999999999999999988888887765544332211 111 11112222222 Q ss_pred cc--cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
395 AI--PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRR 472 (688) Q Consensus 395 ~~--~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~ 472 (688) +. .....+.++..+.....+...++.+...|-..||+.+.+.+..+++.||.|+..+-.............|..++++ T Consensus 292 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 371 (470) T protein:vir:99 292 QLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQ 371 (470) T ss_pred CCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 1223455665555556677788999999999999998877776666799999988777777777777777788888 Q ss_pred HHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHH Q lcl|NC_020488. 473 VGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQF 552 (688) Q Consensus 473 ~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~ 552 (688) ++++++.++...... ..++ .+|.+.=.+..+....+..+.+..+ T Consensus 372 ~~~li~~~~~~~~~~-----------~~~~-------------------------~~i~v~f~~~~p~~~~e~a~~~~kl 415 (470) T protein:vir:99 372 LYRIVLATLFNNKQD-----------QELW-------------------------SELDFKFTRNLPEDMASAIDNAKNA 415 (470) T ss_pred HHHHHHHHHhccCCc-----------cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHH Confidence 777776654332110 0000 1222333344444344444444443 Q ss_pred HHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHh-hhhhhhhhhHHHH Q lcl|NC_020488. 553 VQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEE-AGIEPPQPSPEQQ 612 (688) Q Consensus 553 ~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~q 612 (688) ... +....+++++++-..+.-.+++.+..........+... 
.......+..+.+ T Consensus 416 ~gi------is~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 416 EGI------VSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred hcc------CCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 321 22234445544433333233333221100000000000 0000000000000 No 47 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.84 E-value=1.4e-20 Score=129.45 Aligned_cols=456 Identities=9% Similarity=0.001 Sum_probs=234.9 Q ss_pred CCCCCC---CcCCCCccchHHHHHHHHHHHHHHHHhhh-HHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhH Q lcl|NC_020488. 1 MLPGNE---PIKTRDDDSQEAILQEIRERAAHAVTCWK-HNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKL 74 (688) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i 74 (688) -.+++. ......++......+.+ ....+.+. .......+-.+||.|+++.-.........+++ .++.|.. T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~ 95 (502) T protein:vir:48 20 RFHRESRIRYRADNLEELMVNNWELL----KNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYG 95 (502) T ss_pred ccChhHHhhhcccchhhhccccHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchH Confidence 111111 01111111111111222 22222222 22334456789999987642211222233443 6888999 Q ss_pred HHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCce Q lcl|NC_020488. 75 PQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGW 154 (688) Q Consensus 75 ~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~ 154 (688) +-+|+..+|++..+++.+.+.. 
....+.+...+..++..|+++.....+..+++++|.|| T Consensus 96 k~Ivd~~~~yl~g~p~~~~~~d--------------------~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~ 155 (502) T protein:vir:48 96 RMISKFKTGYLAGNPIRVEYDD--------------------NEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAY 155 (502) T ss_pred HHHHHHHhhhhcccCeeEecCC--------------------ccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEE Confidence 9999999999999988876521 11224455566777788999999999999999999999 Q ss_pred EEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccccc Q lcl|NC_020488. 155 LRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSW 232 (688) Q Consensus 155 ~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 232 (688) +.++.+. ++.+.+..+ +|..++ ||+... .+ ..++++.|... T Consensus 156 ~~v~~de------dg~~~i~~~-~p~~~~~vydd~~~----~~-~~~~ir~~~~~------------------------- 198 (502) T protein:vir:48 156 EVIYRSE------YDETRIKRL-SPLETFVIYDNSLE----DN-SIAAVRYYNRG------------------------- 198 (502) T ss_pred EEEEeCC------CCceEEEEE-cccceEEEEcCCCC----Cc-eEEEEEEEEEe------------------------- Confidence 8876542 356667666 677654 554321 11 12222222110 Q ss_pred CCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCC Q lcl|NC_020488. 233 WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPG 312 (688) Q Consensus 233 ~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~ 312 (688) ...+.+.++++|..... + .+...+...+....|.+. T Consensus 199 -~~~~~~~~~~iyt~~~i----~---------------------------------------~~~~~~~~~~~~~~~~~~ 234 (502) T protein:vir:48 199 -TLQNAKDVVEIYTNQHI----Y---------------------------------------TLDASDSFNEISVTPHAF 234 (502) T ss_pred -ecCCcEEEEEEEeCCeE----E---------------------------------------EEEeCCceeeccceecCC Confidence 00112334455543211 0 111111112223334445 Q ss_pred CccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceee Q lcl|NC_020488. 
313 STIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLR 392 (688) Q Consensus 313 ~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 392 (688) +.+|+|+|. +...|.|.+..++++++.+|..+|.+...+...+.+.+++.........+.... .+..+.+.. T Consensus 235 g~vPvv~~~-------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-~~~~~~~~~ 306 (502) T protein:vir:48 235 GTVPITEFL-------NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASD-MKRTRLMQL 306 (502) T ss_pred CccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhh-hhhcceeec Confidence 667776542 235678999999999999999999999999888877766544332222111111 111122211 Q ss_pred cC-----cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 393 YN-----AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 393 ~~-----~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) .. +..+...+.++..+.-..++...++.+...|-..|++.+.+.|.-+++.||.|+..+............+.|. T Consensus 307 ~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~ 386 (502) T protein:vir:48 307 KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFT 386 (502) T ss_pred cccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1112224445544444466777889999999999999988887766667999999887666666677777777 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .+++++.++++.++... 
+.....++ .+|.|.=.+..+....+.++ T Consensus 387 ~~l~~~~~li~~~~~~~----------~~~~~~d~-------------------------~~i~i~f~~~~p~d~~e~a~ 431 (502) T protein:vir:48 387 QGLKRRYRLAARIGSLV----------NEFKDFDE-------------------------SRLKITFTPNLPKSLYEQVS 431 (502) T ss_pred HHHHHHHHHHHHHHhhc----------cccccccc-------------------------ccceEEeCCCCCcCHHHHHH Confidence 77777776666654321 11001111 01222223344444444455 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhH-------------HhhhhhhhhhhHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEM-------------EEAGIEPPQPSPE 610 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~ 610 (688) .+.++... +....+++++++ ...++-.+++.+...+........ .+......+.-.+ T Consensus 432 ~~~kl~g~------iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 432 ILNDLGGQ------VSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred HHHHHhcc------CcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 55544322 223455555544 333333344432211100000000 0000000000000 No 48 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.84 E-value=5.9e-20 Score=125.99 Aligned_cols=455 Identities=10% Similarity=0.003 Sum_probs=232.2 Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC--CCHHHHH-------HHHhcCCC--ceeehhHHHHHHHHHHHHH Q lcl|NC_020488. 18 AILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ--WPESVRK-------EREDEGRP--CLTLNKLPQYVDQVLGDQR 86 (688) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q--w~~~~~~-------~~~~~g~p--~~~~N~i~~~i~~i~g~~~ 86 (688) =.+..+.+.................+..+||.|++ |...... .....++| .+++|..+.+|+..+|+.. 
T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 01222333333334444555667778889999975 1111100 01112232 5889999999999999999 Q ss_pred hCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCC Q lcl|NC_020488. 87 QNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDA 166 (688) Q Consensus 87 ~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~ 166 (688) .+++.+.+- |.+..+.+..++ . +++.........+++++|.+|..++++. T Consensus 81 G~p~~~~~~--------------------d~~~~~~l~~~~----~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~----- 130 (470) T protein:vir:10 81 SVFPDIDVG--------------------KDADNKKIIDVL----G-DDRALTLNGLLVDSSNAGRAWLHYWIDE----- 130 (470) T ss_pred ccceeeecC--------------------chHHHHHHHHHH----h-hhHHHHHHHHHHHHhhcCeeEEEEEecC----- Confidence 998776441 233334444433 2 3567777888899999999998887642 Q ss_pred CCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEE Q lcl|NC_020488. 167 FDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEY 244 (688) Q Consensus 167 ~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~ 244 (688) ++.+.+..+ +|..+| ||+... .+. .++++.|...+ ..+...+..+|+ T Consensus 131 -~~~~~~~~~-~p~~~~~v~d~~~~----~~~-~a~ir~y~~~~------------------------~~~~~~~~~~e~ 179 (470) T protein:vir:10 131 -DGNFRYGII-QPDQITPIYATTLD----NKL-LGILRSYKQLD------------------------PDSGKYFTVHEY 179 (470) T ss_pred -CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEEeee------------------------cCCceEEEEEEE Confidence 245666665 677754 454221 112 22333332110 001223445566 Q ss_pred EeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeee Q lcl|NC_020488. 
245 FYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKE 324 (688) Q Consensus 245 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~ 324 (688) |.......+.........+........ .-....+.....+..+.+.+.+|+|+|.. T Consensus 180 yt~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-- 235 (470) T protein:vir:10 180 WTDKEAQFFRTNATDSTVIEPYNIITS----------------------YDLSAGYETGQSNTLKHNFGRVPFIEFSK-- 235 (470) T ss_pred EcCCcEEEEEeecCcceeccccccccc----------------------cccccccccccccccccCCCeeeEEEeec-- Confidence 653332222111111111100000000 00000011111122233334555554322 Q ss_pred eccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc-ccccccce Q lcl|NC_020488. 325 MVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA-IPGVDRPQ 403 (688) Q Consensus 325 ~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 403 (688) ...|.|.+..++++++.+|.++|.+.+.+....++.+++.....++..+..... ...+.+.+... ......+. T Consensus 236 -----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~ 309 (470) T protein:vir:10 236 -----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDL-RKYKSIKINNTGNGDNSGVD 309 (470) T ss_pred -----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhh-hhcCeEeccCCCCCcCceeE Confidence 345789999999999999999999999999888888777654444444444332 22333333221 11223456 Q ss_pred ecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 404 RDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPR 483 (688) Q Consensus 404 ~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~ 483 (688) ++....-..+....++.+...|-..|++.+.+.+.. 
++.||+|+..+-.............|..+++++.++++.++ T Consensus 310 ~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l-- 386 (470) T protein:vir:10 310 KLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYL-- 386 (470) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 666555567788888999999999999888776654 45899999988877777777777777777777666655433 Q ss_pred HcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_020488. 484 VYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVV 563 (688) Q Consensus 484 ~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~ 563 (688) . +.+ .++ .+|.|.-.+..+....+..+.++.+.. .+. T Consensus 387 --~------~~~----~d~-------------------------~~i~i~f~~~~p~d~~e~~~~~~~~~g------~iS 423 (470) T protein:vir:10 387 --N------FSD----ADK-------------------------RHISQHWTRTKVEDSLTKAQIVSTVAN------YSS 423 (470) T ss_pred --c------ccC----ccc-------------------------ceeeEEeccCCCCCHHHHHHHHHHHhc------cCc Confidence 1 100 011 122222223333333333343333321 233 Q ss_pred HHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHH Q lcl|NC_020488. 564 LDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQ 611 (688) Q Consensus 564 ~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 611 (688) ...+++++++ ...++-.+++++..........+... 
.......-++ T Consensus 424 ~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~--~~~~~~dde~ 470 (470) T protein:vir:10 424 KEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADE--LNGKGVNDEQ 470 (470) T ss_pred HHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccccc--cCCCCCCCCC Confidence 3444555542 34444444444321111000000000 0000000000 No 49 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.84 E-value=1.4e-19 Score=124.02 Aligned_cols=468 Identities=10% Similarity=0.012 Sum_probs=235.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~~~i 78 (688) .+|+.+...+.. ..++...+.+... ..+....+..+||.|.|.--.........++ ..+++|..+.+| T Consensus 30 ~~~~~e~~~~~~-------~~~i~~~i~~~~~---~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:78 30 TYDGTESDLLQN-------VNEVSKYIEHHMD---YQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred cccchhhhhhcC-------HHHHHHHHHHHHH---hhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHH Confidence 333333221111 1222222222122 2233445567899998763221111222222 367889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+++.+.+- |.+ ....+..++..|+++.....+..++++.|.+|..++ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy 155 (511) T protein:vir:78 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (511) T ss_pred HHHhhhhcccCceeecC--------------------chH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEE Confidence 99999999988776431 222 334577777889999999999999999999988776 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+| ||+.... . ...+++.|.... ....+. T Consensus 156 ~d~------dg~~~i~~~-~p~~~~~v~dd~~~~----~-~~~~vr~~~~~~----------------------~~~~~~ 201 (511) T protein:vir:78 156 RNQ------DDETRLYKS-DAMSTFIIYDNTVER----N-SIAGVRYLRTKP----------------------IDKTDE 201 (511) T ss_pred eCC------CCceEEEEE-cccceEEEEcCCCCC----c-eEEEEEEEEeee----------------------cccccc Confidence 531 356777666 677764 5543321 1 223333332110 000112 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+. ...+..... 
.....++.|.+.+.+| T Consensus 202 ~~~~~~~vyt~~~i~~~~--~~~~~~~~~-----------------------------------~~~~~~~~~~~~g~vP 244 (511) T protein:vir:78 202 DEVFTVDLFTSHGVYRYL--TNRTNGLKL-----------------------------------TPRENSFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEE--ecCCCcccc-----------------------------------cccccccccCcCcccc Confidence 233445555443221111 111110000 0001233455556677 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec--- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY--- 393 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--- 393 (688) +|+|. ....|.|.+..++++++.+|...|.+.+.+....++.+++......+.++... ....+.++.. T Consensus 245 vv~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:78 245 ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred eEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcc--cccccceeccccc Confidence 76542 23457799999999999999999999999987777766554322222221111 1111111111 Q ss_pred ----Cc--ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 ----NA--IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 394 ----~~--~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) .+ ..+...+.++..+.-..++...++.+...|-.+|++.+.+.+.-+++.||.|+..+-.............|. 
T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~ 395 (511) T protein:vir:78 316 YVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred eeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111223444444444566777888899999999999988887765667999999887777777777777778 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .++++++++++.++...-.. ....++ .++.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~~~~~~~~---------~~~~~~-------------------------~~i~~~f~~~~p~n~~e~~d 441 (511) T protein:vir:78 396 KGLRRRAKLLETILKNTRSI---------DANKDF-------------------------NTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhcCCC---------cccccc-------------------------ccceEEeCCCCCcCHHHHHH Confidence 88888777777655322100 000111 12222323344444444455 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAK 626 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q 626 (688) .++.+... +....+++++++ ...++-.+++.+................... .... T Consensus 442 ~~~kl~G~------iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~--~~~~---------------- 497 (511) T protein:vir:78 442 AYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD--INDD---------------- 497 (511) T ss_pred HHHHHhcc------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCC--CCCC---------------- Confidence 55544322 223344455443 3333444444332111000000000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
627 ADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 627 ~e~~~~q~e~~~~q~~~~~ 645 (688) .+.....-.....+ T Consensus 498 -----~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 498 -----EQDDDTKDTVDKKE 511 (511) T ss_pred -----CCCCCccCcccccC Confidence 00000000000000 No 50 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.84 E-value=1.4e-19 Score=124.02 Aligned_cols=468 Identities=10% Similarity=0.012 Sum_probs=235.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~~~i 78 (688) .+|+.+...+.. ..++...+.+... ..+....+..+||.|.|.--.........++ ..+++|..+.+| T Consensus 30 ~~~~~e~~~~~~-------~~~i~~~i~~~~~---~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:96 30 TYDGTESDLLQN-------VNEVSKYIEHHMD---YQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred cccchhhhhhcC-------HHHHHHHHHHHHH---hhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHH Confidence 333333221111 1222222222122 2233445567899998763221111222222 367889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+++.+.+- |.+ ....+..++..|+++.....+..++++.|.+|..++ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy 155 (511) T protein:vir:96 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (511) T ss_pred HHHhhhhcccCceeecC--------------------chH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEE Confidence 99999999988776431 222 334577777889999999999999999999988776 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+| ||+.... . ...+++.|.... ....+. T Consensus 156 ~d~------dg~~~i~~~-~p~~~~~v~dd~~~~----~-~~~~vr~~~~~~----------------------~~~~~~ 201 (511) T protein:vir:96 156 RNQ------DDETRLYKS-DAMSTFIIYDNTVER----N-SIAGVRYLRTKP----------------------IDKTDE 201 (511) T ss_pred eCC------CCceEEEEE-cccceEEEEcCCCCC----c-eEEEEEEEEeee----------------------cccccc Confidence 531 356777666 677764 5543321 1 223333332110 000112 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+. ...+..... 
.....++.|.+.+.+| T Consensus 202 ~~~~~~~vyt~~~i~~~~--~~~~~~~~~-----------------------------------~~~~~~~~~~~~g~vP 244 (511) T protein:vir:96 202 DEVFTVDLFTSHGVYRYL--TNRTNGLKL-----------------------------------TPRENSFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEE--ecCCCcccc-----------------------------------cccccccccCcCcccc Confidence 233445555443221111 111110000 0001233455556677 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec--- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY--- 393 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--- 393 (688) +|+|. ....|.|.+..++++++.+|...|.+.+.+....++.+++......+.++... ....+.++.. T Consensus 245 vv~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:96 245 ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred eEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcc--cccccceeccccc Confidence 76542 23457799999999999999999999999987777766554322222221111 1111111111 Q ss_pred ----Cc--ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 ----NA--IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 394 ----~~--~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) .+ ..+...+.++..+.-..++...++.+...|-.+|++.+.+.+.-+++.||.|+..+-.............|. 
T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~ 395 (511) T protein:vir:96 316 YVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred eeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111223444444444566777888899999999999988887765667999999887777777777777778 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .++++++++++.++...-.. ....++ .++.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~~~~~~~~---------~~~~~~-------------------------~~i~~~f~~~~p~n~~e~~d 441 (511) T protein:vir:96 396 KGLRRRAKLLETILKNTRSI---------DANKDF-------------------------NTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhcCCC---------cccccc-------------------------ccceEEeCCCCCcCHHHHHH Confidence 88888777777655322100 000111 12222323344444444455 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAK 626 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q 626 (688) .++.+... +....+++++++ ...++-.+++.+................... .... T Consensus 442 ~~~kl~G~------iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~--~~~~---------------- 497 (511) T protein:vir:96 442 AYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD--INDD---------------- 497 (511) T ss_pred HHHHHhcc------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCC--CCCC---------------- Confidence 55544322 223344455443 3333444444332111000000000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
627 ADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 627 ~e~~~~q~e~~~~q~~~~~ 645 (688) .+.....-.....+ T Consensus 498 -----~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 498 -----EQDDDTKDTVDKKE 511 (511) T ss_pred -----CCCCCccCcccccC Confidence 00000000000000 No 51 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.83 E-value=1.2e-19 Score=124.39 Aligned_cols=433 Identities=12% Similarity=0.074 Sum_probs=230.1 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i 78 (688) .||..+. ...+++.++...+ ...+....+..+||.|.| +-.....+..+++ .+++|..+-+| T Consensus 10 ~~p~d~~-------~~~~~l~~~i~~~-------~~~~~r~~~~~~yy~g~~--~i~~~~~~~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:39 10 TFPKDEP-------ITNEVVTKFMEKH-------RLEVARYEYLKNMYRGIM--AIDAEPTKDLWKPDNRLTVNFTKYIV 73 (453) T ss_pred EcCCCCC-------CCHHHHHHHHHHH-------HHHHHHHHHHHHHhhccC--chhcCCCccccCccceeecchHHHHH Confidence 5666552 2334445544332 334455667788999975 1111111222332 57789999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+.+.+.+- |.+ ....+..++..|+++.....+..+++++|.||+.|+ T Consensus 74 d~~~~~l~g~~~~~~~~--------------------d~~----~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~ 129 (453) T protein:vir:39 74 DTFTGYFNGIPVKKSHS--------------------DKE----TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLY 129 (453) T ss_pred HHHhhhhcccCceeccC--------------------ChH----HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEE Confidence 99999998887665321 111 234577777889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+ +||+.... ...++ ++.+.. . T Consensus 130 ~d~------~g~~~i~~~-~p~~~~~v~d~~~~~----~~~~~-ir~~~~-----------------------------~ 168 (453) T protein:vir:39 130 QNE------ETQTNVIYN-TPENMFMVYDDTIKQ----EPLFA-VRYGYD-----------------------------D 168 (453) T ss_pred ecC------CCceEEEEE-cccceEEEecCCCCC----eEEEE-EEEEEe-----------------------------C Confidence 542 356666665 67765 46543221 12222 222210 1 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +....+++|...... .+... 
++...++++.|.+.+.+| T Consensus 169 ~~~~~~~~yt~~~i~--~~~~~----------------------------------------~~~~~~~~~~~~~~g~vP 206 (453) T protein:vir:39 169 DYKLYGEVYTKETTY--ALNGT----------------------------------------MGFYNMTEQAPNPFDDLP 206 (453) T ss_pred CeEEEEEEEeCCeEE--EEEec----------------------------------------CCceeeecccccCCCcee Confidence 122334555432110 00000 011112233344446666 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcc Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAI 396 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 396 (688) +|+|. ...+|.|.+..++++++.+|+++|.+...+.....+.+++....++... ... ....+.+...++. T Consensus 207 vv~~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~--~~~-~~~~~~~~~~~~~ 276 (453) T protein:vir:39 207 VVEFY-------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEED--LKN-IRSNRVINYYGES 276 (453) T ss_pred EEEec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchh--hhh-hhhcceeeecCCC Confidence 66542 2346789999999999999999999999998888887776543333211 111 1222333322221 Q ss_pred c--ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 397 P--GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVG 474 (688) Q Consensus 397 ~--~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~ 474 (688) . 
....+.++..+.-..++...++.+...|-.+|++.+.+.+..+ +.||.|+..+-.............|..++++++ T Consensus 277 ~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~ 355 (453) T protein:vir:39 277 SEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFG-SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRY 355 (453) T ss_pred CCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1223444444434466777788888888899988876665544 469999988776666666666777777777776 Q ss_pred HHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH Q lcl|NC_020488. 475 QILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ 554 (688) Q Consensus 475 ~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q 554 (688) ++++.+.... +. ..++ .+|.|.=.++.+....+..+.++.+.. T Consensus 356 ~li~~~~~~~----------~~--~~~~-------------------------~~i~v~f~~~~p~~~~~~a~~~~kl~g 398 (453) T protein:vir:39 356 KLYCELSTNV----------SN--KEAW-------------------------KDIEYTFTRNEPKDIKEQAETANILMG 398 (453) T ss_pred HHHHHHHhcc----------CC--cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhc Confidence 6666543211 11 0011 122333334444444555555555433 Q ss_pred hhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhh----hhhhhhhhHH Q lcl|NC_020488. 555 AVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEA----GIEPPQPSPE 610 (688) Q Consensus 555 ~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 610 (688) . +....+++++++ ...++-.+++++............... 
+...++-..+ T Consensus 399 ~------is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 399 I------TSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred c------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 2 223444555543 333333444433222111100000000 0000000000 No 52 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.83 E-value=2.6e-19 Score=122.48 Aligned_cols=465 Identities=10% Similarity=0.017 Sum_probs=237.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~~~i 78 (688) .+|..+..... ...++...+.+.. ...+....+..+||.|.|.--.........++ ..+++|..+.+| T Consensus 30 ~~~~~e~~~~~-------~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:96 30 TYDGTESDLLQ-------NVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred ccchhhhhhhc-------cHHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHH Confidence 33333311111 1112222222111 22344556678999998763221112222333 367889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+++.+.+- |.+ ....+..++..|+++.....+..++++.|.+|..++ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy 155 (511) T protein:vir:96 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (511) T ss_pred HHHHhhhccCCceeecC--------------------chH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEE Confidence 99999999998887531 122 334577778889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+| ||.... .-...+++.|... .....+. T Consensus 156 ~de------d~~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~~~~----------------------~~d~~~~ 201 (511) T protein:vir:96 156 RNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYLRTK----------------------PIDKTDE 201 (511) T ss_pred eCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEEEee----------------------ecccccc Confidence 542 356777665 677765 554321 1122333333210 0000112 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+. ...+..... 
......+.|.+.+.+| T Consensus 202 ~~~~~~~iyt~~~i~~~~--~~~~~~~~~-----------------------------------~~~~~~~~~~~~~~vP 244 (511) T protein:vir:96 202 DEVFTVDLFTSHGVYRYL--TSRTNGLKL-----------------------------------TPRENGFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEE--ecCCCcccc-----------------------------------cccccccccccCCcee Confidence 234445555443221111 111111000 0001123444556667 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec--- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY--- 393 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--- 393 (688) +|+|. ....|.|.+..++++++.+|...|.+.+.+...+++.+++......+..+... ...+..+... T Consensus 245 vv~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:96 245 ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred eEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcc--cccccceeccccc Confidence 66542 12357799999999999999999999999987777766654323222222111 1111111110 Q ss_pred ----Cc--ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 ----NA--IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 394 ----~~--~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) .+ ..+...+.++..+.-..++...++.+...|..+|++.+.+.+.-+++.||+|+..+-.............|. 
T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 395 (511) T protein:vir:96 316 YADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 111223455555444567788889999999999999998887655667999999888777777777777788 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .++++++++++.++..... .+...++. ++.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~~~~~~~---------~~~~~d~~-------------------------~i~~~f~~~~p~n~~e~~~ 441 (511) T protein:vir:96 396 KGLRRRAKLLETILKNTWS---------IDANKDFN-------------------------TVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhhcC---------cccccccc-------------------------cceEEeCCCCCCCHHHHHH Confidence 8888877777765432210 00111111 2222223444444444455 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhh---hhhHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPP---QPSPEQQANMAQAQADME 623 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~q~~~~~~q~~~~ 623 (688) .++.+... +....+++++++ ...++-.+++.+.................... +...+.+ T Consensus 442 ~~~kl~G~------iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 504 (511) T protein:vir:96 442 AYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK----------- 504 (511) T ss_pred HHHHHhcc------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCccc----------- Confidence 44444321 223444555543 33344444444322110000000000000000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
624 KAKADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 624 ~~q~e~~~~q~e~~~~q~~~~~ 645 (688) -..++. + T Consensus 505 -~~~~~~--------------~ 511 (511) T protein:vir:96 505 -DTVDKK--------------E 511 (511) T ss_pred -cccccc--------------C Confidence 000000 0 No 53 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.83 E-value=1.4e-19 Score=124.03 Aligned_cols=459 Identities=12% Similarity=0.061 Sum_probs=231.1 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-----HHHHhcCCC--ceeehh Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-----KEREDEGRP--CLTLNK 73 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-----~~~~~~g~p--~~~~N~ 73 (688) =+|=.+|-..+--+..+..-....+.+.+..+.+........+..+||.|.|=--.-. ......++| .+++|. T Consensus 5 ~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 84 (478) T protein:vir:10 5 NWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNY 84 (478) T ss_pred cCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccch Confidence 1222222111100000000000122233333444455667788899999975210000 001112233 478899 Q ss_pred HHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 74 LPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 74 i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) .+-+|+..+|+...+.+.+.+ .|.+..+.+.. 
++ .|+++.....+..+++++|.| T Consensus 85 ~~~ivd~~~~~l~g~~~~~~~--------------------~~d~~~~~l~~----~~-~n~~~~~~~~~~~~~~~~G~~ 139 (478) T protein:vir:10 85 HQNLVDQKVAYAVANPVTFGV--------------------DNDKALKQIQH----TL-NHKWDDKLVDILTAASNKGIE 139 (478) T ss_pred HHHHHHHHHhhhccCCeeeec--------------------CChHHHHHHHH----HH-hcCHHHHHHHHHHHHHhcCeE Confidence 999999999999988877643 12233333333 33 368999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccc Q lcl|NC_020488. 154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYS 231 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 231 (688) |+.++++. ++.+.+..+ +|..++ ||+... .+..+ +++.|-.. + T Consensus 140 ~~~~~~d~------~g~~~~~~~-~p~~~~~i~d~~~~----~~~~~-~v~~~~~~--------~--------------- 184 (478) T protein:vir:10 140 WVQPYVDE------EGEFKTFRV-PAEQAVPIWTNKER----DELQA-FIRVYELD--------G--------------- 184 (478) T ss_pred EEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec--------C--------------- Confidence 98887642 345666666 677664 554321 12232 33333100 0 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) ...+++|.......+ ...++...+...... .. .....+....|.+ T Consensus 185 -------~~~~~~y~~~~i~~~--~~~~~~~~~~~~~~~------------~~--------------~~~~~~~~~~~~~ 229 (478) T protein:vir:10 185 -------AERVEYWTKDDVTYY--ELKEGQLIPDFYRSD------------DH--------------IQPHYYQGNKLMS 229 (478) T ss_pred -------ceEEEEEeCCeEEEE--EEcCCeeeccccccc------------cc--------------cccceeccccccc Confidence 001233322211111 111121111100000 00 0001122334555 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 
312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) .+.+|+|+|. ...+|.|.+..++++++.+|...|.+...+.....+.+++.....++..+.... ....+.+. T Consensus 230 ~~~vPvv~~~-------n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~-~~~~~~~~ 301 (478) T protein:vir:10 230 WGRVPFIPFK-------NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHN-LKYYKAIS 301 (478) T ss_pred CCccceEEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhh-hhhcceEE Confidence 6777777652 346788999999999999999999999999888888766543322222222111 12223333 Q ss_pred ecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 392 RYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) ..+.. +..+.++....-.......++.+...|-..|++.+.+.+..+++.||.|+..+-.............|..+++ T Consensus 302 -~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 379 (478) T protein:vir:10 302 -VAGES-GSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQ 379 (478) T ss_pred -ecCCC-CCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32222 2345555544445667788899999999999998887776666789999988776666666666666667776 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++. ++. . ..++ .+|.|.=.+..+....+..+.++. 
T Consensus 380 ~~~~li~~----~~g---------~--~~~~-------------------------~~i~i~f~~~~p~d~~e~a~~~~k 419 (478) T protein:vir:10 380 ELLQYIID----FYR---------L--DVKV-------------------------QDIEITFNFNVMVNELENSQIAMN 419 (478) T ss_pred HHHHHHHH----HhC---------C--Cccc-------------------------ccceEEecCCCCCCHHHHHHHHHH Confidence 66655554 331 0 0011 112222233333333444444444 Q ss_pred HHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchh-----hHHhhhhhhhhhhHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQD-----EMEEAGIEPPQPSPE 610 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~ 610 (688) +... +....+++++++ ...++-.+++++.......... .....+.+....+++ T Consensus 420 l~g~------iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 420 STGL------LSKETILSNHAWVEDPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred HhCC------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCCCC Confidence 4221 233455566553 4444445555432211000000 000000000000111 No 54 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.83 E-value=2.8e-19 Score=122.29 Aligned_cols=424 Identities=11% Similarity=0.055 Sum_probs=226.4 Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHHHHHHHHHHhCCcce Q lcl|NC_020488. 
15 SQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYVDQVLGDQRQNRPAI 92 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i~~i~g~~~~~r~~~ 92 (688) ...+++.++...+ ........+-.+||.|+| +-.....+..+++ .+++|..+.+|+..++++..+.+.+ T Consensus 1 l~~~~l~~~i~~~-------~~~~~r~~~l~~yy~g~~--~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 71 (429) T protein:vir:98 1 MTKDLLSELIQKH-------RSFNLSYSAYKQLYEGDH--AILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQT 71 (429) T ss_pred CCHHHHHHHHHHH-------HHHHHHHHHHHHHhcccc--ccccccccccCCCcceeecchHHHHHHHHhhhhcccCcee Confidence 3344444444433 233455556678999986 1111122333333 5889999999999999999887665 Q ss_pred EEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCccee Q lcl|NC_020488. 93 QVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLC 172 (688) Q Consensus 93 ~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~ 172 (688) .+ - + +.....+..+++.|+++.....+..+++++|.||+.++.+. +|.+. T Consensus 72 ~~--~------------------~----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------~g~~~ 121 (429) T protein:vir:98 72 SH--E------------------N----KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDE------NAEAG 121 (429) T ss_pred ec--C------------------C----hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecC------CCcEE Confidence 43 1 1 12344567777789999999999999999999998886541 35666 Q ss_pred EEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeec Q lcl|NC_020488. 173 IKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPV 250 (688) Q Consensus 173 ~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~ 250 (688) +..+ +|..+ +||.... .-...+++.+.. ++.+...++|..+.. 
T Consensus 122 ~~~~-~p~~~~~v~dd~~~-----~~~~~~i~~~~~-----------------------------~~~~~~~~~~~~~~~ 166 (429) T protein:vir:98 122 ITYL-TPLEAFIVYDDSIR-----QKPLFAVRYFYN-----------------------------KGGVLEGSYSDASNI 166 (429) T ss_pred EEEE-cccceEEEEeCCCC-----CceEEEEEEEEe-----------------------------cCceEEEEEEeCceE Confidence 6655 67665 3543211 112222333211 112233344432211 Q ss_pred ceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCc Q lcl|NC_020488. 251 TRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDK 330 (688) Q Consensus 251 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~ 330 (688) .. +.... +...+.++.|.+.+.+|+|+|. .. T Consensus 167 ~~--~~~~~----------------------------------------~~~~~~~~~~~~~g~vPvv~~~-------n~ 197 (429) T protein:vir:98 167 TY--FKDGE----------------------------------------KGIEIGESEPHPFDGVPMIEYV-------EN 197 (429) T ss_pred EE--EEecC----------------------------------------CceEecccccccCCccceEEec-------CC Confidence 10 00000 0111223344455667776542 23 Q ss_pred ccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCcc Q lcl|NC_020488. 331 TYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASM 410 (688) Q Consensus 331 ~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 410 (688) .+|.|.+..++++++.+|.+.|.+.+.+.....+.+++.....+. +..... ...+.+....+......+.++..+.- T Consensus 198 ~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~ 274 (429) T protein:vir:98 198 EERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELDD--ETLKSL-RDTRIINLKDTDAQQLTVEFLQKPDA 274 (429) T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCc--chhhhH-hhCceeeccCCCCCCcceeEEeecCC Confidence 568899999999999999999999999988888876654332221 121211 11223322222111223455544444 Q ss_pred hHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceE Q lcl|NC_020488. 
411 PAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRV 490 (688) Q Consensus 411 ~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~ 490 (688) ..++...++.+.+.|-..|++.+.+.+..+ +.||.|+..+-.............|..+++++.++++.++.. T Consensus 275 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~------- 346 (429) T protein:vir:98 275 DATQEHLLDRLENLIFRTAMVANISDESFG-TASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTS------- 346 (429) T ss_pred HHHHHHHHHHHHHHHHHHhCccccCccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------- Confidence 566777789999999999998877666544 469999988776666666666677777777766666554321 Q ss_pred EEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHh Q lcl|NC_020488. 491 LRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKN 570 (688) Q Consensus 491 ~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~ 570 (688) .+. ..++ .+|.|.=.+..+....+..+.++++... +....++++ T Consensus 347 ---~~~--~~d~-------------------------~~i~v~f~~~~p~~~~~~a~~~~kl~g~------is~et~~~~ 390 (429) T protein:vir:98 347 ---KIG--PKDW-------------------------IGIKYKFTRNLPANLLEESQIAGNLAGI------VSEETQVGV 390 (429) T ss_pred ---CCC--cccc-------------------------ccceEEeCCCCCcCHHHHHHHHHHHhcc------CchHHHHHh Confidence 110 0011 1233333344444444555555554322 223444555 Q ss_pred cCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHH Q lcl|NC_020488. 571 MDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPE 610 (688) Q Consensus 571 ~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (688) +++ ...+.-.+++++........ 
++.....+......+ T Consensus 391 l~~v~d~~~E~~ri~~E~~~~~~~--~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 391 LSIVENPQKEIERKNSDKSTLISR--QAGGLNGQNTTTILE 429 (429) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHH--HHhhhcCCCCCCCCC Confidence 543 33333344443321110000 000000000000000 No 55 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.83 E-value=2.1e-19 Score=122.95 Aligned_cols=455 Identities=11% Similarity=0.033 Sum_probs=232.9 Q ss_pred CCCCCCCcCCCCccch--HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC--CCHHHH---HHHHhcCCC--ceee Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQ--EAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ--WPESVR---KEREDEGRP--CLTL 71 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q--w~~~~~---~~~~~~g~p--~~~~ 71 (688) |+|...-.+....+.. ......+.+.+....+.....+....+..+||.|+| |..... .......++ .+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 8887664433222111 101112223334444555666778888899999975 111100 001112222 4678 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 
72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) |..+.+|+..+++...+.+.+.+ .|.+..+.+ +.++ .|+++.....+..+++++| T Consensus 81 n~~~~ivd~~~~~l~g~~~~~~~--------------------~d~~~~~~l----~~~~-~n~~~~~~~~~~~~~~~~G 135 (472) T protein:vir:93 81 NFHANLVDQKVSYIVGKPIAFKH--------------------TDDEVVKRI----DEVL-GNRFDDKLHSVLTGASNKG 135 (472) T ss_pred chHHHHHHHHhhhhcccCeeecc--------------------CChHHHHHH----HHHH-hccHHHHHHHHHHHHhhcC Confidence 99999999999999988766532 133333333 3333 4789999999999999999 Q ss_pred CceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccc Q lcl|NC_020488. 152 FGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGE 229 (688) Q Consensus 152 ~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~ 229 (688) .||+.|+.+. ++.+.+..+ +|..++ ||+... .+.. .+++.|...++ T Consensus 136 ~~~~~v~~d~------d~~~~i~~~-~p~~~~~i~d~~~~----~~~~-~~ir~~~~~~~-------------------- 183 (472) T protein:vir:93 136 IEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH----EELE-AFIRMYKLENE-------------------- 183 (472) T ss_pred eEEEEEEECC------CCceEEEEE-cccceEEEEcCCCC----CceE-EEEEEEEeecc-------------------- Confidence 9998877532 345677666 677654 554321 1222 23333321100 Q ss_pred cccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCC Q lcl|NC_020488. 230 YSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVD 309 (688) Q Consensus 230 ~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p 309 (688) . .+++|........ ....+...... ... 
.........| T Consensus 184 -------~---~~~~~~~~~~~~~--~~~~~~~~~~~-~~~-----------------------------~~~~~~~~~~ 221 (472) T protein:vir:93 184 -------T---KVEYWDKVTVNYY--VYENGSLIPDY-SNN-----------------------------LENSKTHFST 221 (472) T ss_pred -------e---eEEEEecCeEEEE--EEecCeeeecc-ccc-----------------------------cccccccccc Confidence 0 0223322211111 11111111100 000 0000111223 Q ss_pred CCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCc Q lcl|NC_020488. 310 WPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQS 389 (688) Q Consensus 310 ~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 389 (688) .+.+.+|+|+|.. ..+|.|.+..++++++.+|.++|.+...+.....+.+++.........+... ..+..++ T Consensus 222 ~~~~~vPvv~~~n-------n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~ 293 (472) T protein:vir:93 222 GSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR-LLRYYGA 293 (472) T ss_pred CCCCCcceEEecC-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHH-HHhhccc Confidence 3446667665422 3468899999999999999999999999988888877664333222222222 1222222 Q ss_pred eeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 390 VLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRA 469 (688) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 469 (688) +.. .. 
...+.++..+.-..++...++.+...|-..||+.+.+.+.-+++.||.|+..+-.............|..+ T Consensus 294 ~~~-~~---~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~ 369 (472) T protein:vir:93 294 IKV-SD---NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVA 369 (472) T ss_pred ccc-CC---CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 21 12344444333446677888999999999999998887776677899999877666666666666666777 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) ++.+.++++.++-.. .++ .++.|.=.+..+....+..+.+ T Consensus 370 l~~~~~li~~~~~~~---------------~~~-------------------------~~i~v~f~~~~p~~~~~~~~~~ 409 (472) T protein:vir:93 370 IQELLWFVFEHFDIK---------------GEH-------------------------KDVDISFNYNKVANTELQVQTA 409 (472) T ss_pred HHHHHHHHHHHhCCC---------------ccc-------------------------ceeeEEeCCCCCCCHHHHHHHH Confidence 776666655543110 011 1222222344444444445555 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHh-----hhhhhhhhhHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEE-----AGIEPPQPSPEQQ 612 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~q 612 (688) +.+... +....+++++++ .+.+.-.+++++.............. 
........+.+++ T Consensus 410 ~k~~gi------is~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 410 QQSMGI------VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 544332 223344555443 34444444443321100000000000 0000000000000 No 56 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.83 E-value=2.2e-19 Score=122.90 Aligned_cols=524 Identities=12% Similarity=0.044 Sum_probs=261.5 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhC-----CcceE Q lcl|NC_020488. 19 ILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQN-----RPAIQ 93 (688) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~-----r~~~~ 93 (688) ....++.+|+...+..+.|...|++..+|..-.-...+. .....+. ..+.-+.-...+++..+.+... ++=++ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~-~~~~~~~-~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 78 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDG-HASGGRL-QQPYQSLGSKGVNALSSKLMLSLFPIQTSFFK 78 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCC-Ccccccc-cccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 223356677777788889999999999986421111000 0001111 1122333344444444444442 33334 Q ss_pred EEeCCccccccccccccccCh-----hhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCC Q lcl|NC_020488. 94 VHPVEANATKDTSKVPNVAGT-----SDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFD 168 (688) Q Consensus 94 v~pr~~~~~~~~~~~~~~~~~-----~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~ 168 (688) ..+.+..... 
....++ -...++ ..+..+...+..|++..+...++.+.+..|.|++.+ + +++ T Consensus 79 l~~~d~~l~~-----~~~~~~~~~~~v~~~L~-~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~---~~~-- 145 (542) T protein:vir:78 79 LQINDAEIAS-----VPELTPEVRSEIDMNLS-KMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFA--G---KKT-- 145 (542) T ss_pred ccCCHHHHHh-----hccCChhhHHHHHHHHH-HHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEe--c---CCC-- Confidence 4333211000 000011 111122 235566666778999999999999999999997644 2 222 Q ss_pred cceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchh-cccccccccccCCCCCEEEEEEEEee Q lcl|NC_020488. 169 LDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGD-LSDAERGEYSWWTNEEGVRVSEYFYR 247 (688) Q Consensus 169 ~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~-~~~~~~~~~~~~~~~~~v~v~e~~~~ 247 (688) ++.+ +-.++++..++.- ...-++++..||..++.+.|++..... ....... .....+.+++.++. T Consensus 146 ----~~~~-pl~~y~v~~d~~G----~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~-----~~~~~~~v~~~v~p 211 (542) T protein:vir:78 146 ----LKVY-PLDRYVIERDGDG----NVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVG-----EDGPKFGVAQGKGG 211 (542) T ss_pred ----ceEE-ecceeEEeeCCCC----CeEEEeeeeecCHHHHHHhhccccCchHHHhhccc-----cCCCeEEEEEEeec Confidence 2222 3345665544321 223388999999999999998654321 1111111 11233444444433 Q ss_pred eecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeecc Q lcl|NC_020488. 248 EPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVI 327 (688) Q Consensus 248 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~ 327 (688) ... +..+.. ..+....+.+|+-. .|..+....+-+++..+||++..+ ... 
T Consensus 212 r~~---------~~~~~~------------------~~~~~~~~s~~~e~-~g~~v~~~~~e~g~~~~P~i~~Rw--~~~ 261 (542) T protein:vir:78 212 RND---------AEVFTC------------------CKLVDGQHRWHQEC-DGKEIKGSRSSSPLKHSPWLPLRF--NVV 261 (542) T ss_pred ccC---------Cccccc------------------cccCCCeEEEEEEe-ccccccccccccccccCCceeeee--eec Confidence 211 100000 00111222333222 233321112334568899996544 567 Q ss_pred CCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCC Q lcl|NC_020488. 328 GDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMP 407 (688) Q Consensus 328 ~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (688) +|..||.|.+....+-.+.+|.+....+.......++.++++.+.+.+...+.. ..+|.++... .+.-.+..... T Consensus 262 ~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~---~~~g~iv~g~--~~~v~~~~~~~ 336 (542) T protein:vir:78 262 DGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLAR---AGTGAIIQGR--AEDVSVVQANK 336 (542) T ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhccc---CCCceeecCC--ccceeeeeccc Confidence 999999999999999999999999999999999999999998888776654432 2234443221 11111222223 Q ss_pred CcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcC Q lcl|NC_020488. 408 ASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYD 486 (688) Q Consensus 408 ~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~ 486 (688) +.--....+.++...+.|.+..-+. .-.++...|++.|..+.+.....|...+.+|. 
.++..+.+..+.++.+.-- T Consensus 337 ~~~~~~~~~~i~~~~~rI~~aFl~~---~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~ 413 (542) T protein:vir:78 337 GADFRTVQEMIRDLSQRISDAFLIL---NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQ 413 (542) T ss_pred ccchhHHHHHHHHHHHHHHHHhccc---ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 3334556777888888887765332 12345567999999999999999999999984 5777777777776655310 Q ss_pred cceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_020488. 487 SDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDL 566 (688) Q Consensus 487 ~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~ 566 (688) +.+ +..++ +.+.+.++. ....|.+..+.+.++++.+.+... .+. T Consensus 414 ------------------lP~-----------~p~~l----v~~~~~s~L-a~~~r~~~~~~l~~~~~~i~~~~~--p~~ 457 (542) T protein:vir:78 414 ------------------LPS-----------LPKGL----VMPTVVAGL-GGVGRGEDRAALIEFMQTVGQAMG--PEA 457 (542) T ss_pred ------------------CCC-----------Cchhc----eeeeeechH-HHHHHHHHHHHHHHHHHHHHHhcC--Chh Confidence 000 00111 234444433 345667777777777776543311 122 Q ss_pred HHHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 567 IAKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTA 644 (688) Q Consensus 567 ~~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~ 644 (688) +.+ .-+.+++...+....+-.. .-..+...++.+ ++++++++|+++.++....+. . T Consensus 458 l~~---~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~-------~q~q~~~~~~al~~~a~~~a~-------~----- 515 (542) T protein:vir:78 458 LQQ---FIDPTEFLKRLAAASGIDTLNLVKSPETMANEA-------QQAQQQQMTASLMGQAGQLAK-------S----- 515 (542) T ss_pred HHh---cCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHH-------HHHHHHHHHHHHHHhhhhccc-------c----- Confidence 222 3345666666655544321 111111000000 000000000000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
645 EAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 645 ~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..... ..++.+ +.. +......++ T Consensus 516 ----~~~~~----~~~~~~----------a~~---~~~~~~~~~ 538 (542) T protein:vir:78 516 ----PIGEK----MMQQIN----------APG---QEAPAGPQT 538 (542) T ss_pred ----ccccc----hhhhcC----------CCC---cCCCCCCcc Confidence 00000 000000 000 000111111 No 57 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.83 E-value=1.3e-18 Score=118.58 Aligned_cols=509 Identities=14% Similarity=0.101 Sum_probs=254.2 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHhhC---CCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh-----CCcce Q lcl|NC_020488. 21 QEIRERAAHAVTCWKHNFDAAQEDISFLA---GEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ-----NRPAI 92 (688) Q Consensus 21 ~~~~~~~~~~~~~~~~~r~~~~~~~~~~~---G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~-----~r~~~ 92 (688) -+++.+|+.-.+..++|...|++..+|.. +.-....... .+ + ...+.-..-...+++..+.+.. +++=+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~-~~-~-~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 77 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPN-HK-S-LTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFF 77 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcc-cc-c-ccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 44667888888888899999999999884 2211111000 00 0 0112222333334444433333 23333 Q ss_pred EEEeCCccccccccccccccChhh----HHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCC Q lcl|NC_020488. 93 QVHPVEANATKDTSKVPNVAGTSD----YSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFD 168 (688) Q Consensus 93 ~v~pr~~~~~~~~~~~~~~~~~~d----~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~ 168 (688) +..+.+.... ...+++. 
.+..+..+..+...+..|++..+...++.+.+..|.|++.+ + ++++ T Consensus 78 ~l~~~d~~l~-------~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~---~~~~- 144 (522) T protein:vir:10 78 KLQVRDDKLG-------EELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFM--G---KDGL- 144 (522) T ss_pred cccCChHHHh-------hhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEE--c---CCCc- Confidence 4444332110 0011111 12234455666666778999999999999999999998543 2 2322 Q ss_pred cceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeee Q lcl|NC_020488. 169 LDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYRE 248 (688) Q Consensus 169 ~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~ 248 (688) +.+ +-.++++..++.- ...-++++.+||..++.+.||......... .. ....+.+.|+.+.+.+ T Consensus 145 -----~~~-pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~---~~---~~~~~~v~v~~~v~p~ 208 (522) T protein:vir:10 145 -----KTF-PLTRYVINRDGDG----NVLEIVTKELISRKVLDIELPEPKPNTGID---ES---STTNDDVTIYTYVKLD 208 (522) T ss_pred -----eEE-EcceEEEeeCCCC----CeeEEEeeeeccHHHHHHhcchhccchhhh---cc---cCCCCceEEEEEEEee Confidence 222 3445776654321 344588999999999999998765322111 11 1223457777766544 Q ss_pred ecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccC Q lcl|NC_020488. 249 PVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIG 328 (688) Q Consensus 249 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~ 328 (688) ... 
+...+ + ....+..+....+-+++.++||++..+ ...+ T Consensus 209 ~~~--------~~~~~-----------------------------~-~~~~~~~~~~~~s~~g~~~~P~~~~Rw--~~~~ 248 (522) T protein:vir:10 209 KSS--------GRWVW-----------------------------H-QEAFDKIIPDSRSTAPKNASPWLPLRF--NTVD 248 (522) T ss_pred ccC--------CceEE-----------------------------E-EccCCccccccccccccccCCceeeee--eecC Confidence 221 11111 0 001122222223445678899996544 4679 Q ss_pred CcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCC Q lcl|NC_020488. 329 DKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPA 408 (688) Q Consensus 329 ~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (688) |..||.|.+....+-.+.+|.+....+.......++.++++.+.+.+...+.. ..++.++. +..+.-.+...... T Consensus 249 ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~---~~~~~~v~--g~~~~v~~~~~~~~ 323 (522) T protein:vir:10 249 GEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAK---AGNGAIVQ--GRPEDVAVIQVGKT 323 (522) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccC---CCCcceec--CCCccceeeccccc Confidence 99999999999999999999999999999999999999998888776654322 22233322 11111111111222 Q ss_pred cchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCc Q lcl|NC_020488. 409 SMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYDS 487 (688) Q Consensus 409 ~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~~ 487 (688) .--+.....++...+.|.+..-+ +...++...|++.|..+.+.....|...+.+|. 
.+...+.+..+.++.+- T Consensus 324 ~d~~~~~~~i~~~~~ri~~aFl~---~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~--- 397 (522) T protein:vir:10 324 ADFSTAANMATAIEKRLLEAFLV---MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRS--- 397 (522) T ss_pred ccchHHHHHHHHHHHHHHHHHhh---ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc--- Confidence 23344566677777777765311 112344567999999999999999999998884 56666766666665431 Q ss_pred ceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_020488. 488 DRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLI 567 (688) Q Consensus 488 ~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~ 567 (688) | .+.+ +..++. +-++ -++.+.- -|.+..+.++.+++.+.+... .+.+ T Consensus 398 ----------g-----~lP~-----------~p~~~~--~~~~--v~~is~L-araq~~~~l~~~~~~i~~~~~--p~~~ 444 (522) T protein:vir:10 398 ----------N-----QIPK-----------LPKDIV--RPTI--VAGVNAL-GRGQDRESLTAFVGTIAQTLG--PEAL 444 (522) T ss_pred ----------C-----CCCC-----------CCcccc--cccc--ccchhHH-HHHHHHHHHHHHHHHHHHhhC--chhh Confidence 0 0000 001110 1111 1122222 244455666666554322210 1222 Q ss_pred HHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 568 AKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQ 647 (688) Q Consensus 568 ~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~ 647 (688) . +.-+.+++...+....+-........+++-.+..+ ++++++ +++ +.++++.+........ -+ T Consensus 445 ~---~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q----~~q~~~-~~~----~~~~~a~~~~~~~~~~-----~~ 507 (522) T protein:vir:10 445 M---QYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQ----AAQQQA-AQQ----SLVDQAGQMTGSPLMD-----PT 507 (522) T ss_pred h---hcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHH----HHHHHH-HHH----HHHHHHHHHhcccccC-----cc Confidence 2 22355666666655544211111000000000000 000000 000 0000000000000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
648 AKLAEIEQAAMMAGPGSLEET 668 (688) Q Consensus 648 a~~~~~~~~a~~~~~~~~~~~ 668 (688) +. .+ .+.++++..+. T Consensus 508 ~~---~~---~~~~~~~~~~~ 522 (522) T protein:vir:10 508 KN---PQ---LMDEEQPPMEE 522 (522) T ss_pred cc---HH---HHHHhCCCCCC Confidence 00 00 00111111110 No 58 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.83 E-value=7.7e-20 Score=125.36 Aligned_cols=443 Identities=11% Similarity=0.056 Sum_probs=227.7 Q ss_pred CCCCCCCcCC-------CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHH-------HHHhcCC Q lcl|NC_020488. 1 MLPGNEPIKT-------RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRK-------EREDEGR 66 (688) Q Consensus 1 ~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~-------~~~~~g~ 66 (688) =+|--+|-.. +..+...+++.++. +.....+....+..+||.|.| + -... .....++ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i-------~~~~~~~~~~~~~~~Yy~g~~-~-i~~r~~~~~~~~~~~~~~ 76 (474) T protein:vir:95 6 RMPWDKPYGEEVVEQLKPQFETQEEMIIRLI-------DDHRKQLDKITVGQRYYDKDN-D-IVKQMKKVDVYGNIDYDK 76 (474) T ss_pred ecCCCCchhhHHHHhhhhccCChHHHHHHHH-------HHHHHHHHHHHHHHHHhcccC-c-hhcccccccccccccccc Confidence 3444432221 11122222333332 233344555667788999976 1 1100 1112233 Q ss_pred C--ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHH Q lcl|NC_020488. 67 P--CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAF 144 (688) Q Consensus 67 p--~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~ 144 (688) | .+++|..+.+|+..++++..+.+.+.+ .|.+.. ..++.+++ |+++.....+. 
T Consensus 77 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~--------------------~d~~~~----~~l~~~~~-n~~~~~~~e~~ 131 (474) T protein:vir:95 77 PDWRITTNFHQNLVDQKVSYVASKPVTYSC--------------------EDESVL----KIIHDVLD-TRWDNKLIDIL 131 (474) T ss_pred ccceeccchHHHHHHHHHhhhccCCceecc--------------------CchHHH----HHHHHHHh-ccHHHHHHHHH Confidence 3 467999999999999999998877643 133333 34444444 67999999999 Q ss_pred HHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhc Q lcl|NC_020488. 145 QHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDL 222 (688) Q Consensus 145 ~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~ 222 (688) .++.++|.||..++.+. ++.+.+..+ +|..++ ||+... .+.. .+++.|...+ T Consensus 132 ~~~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~-~~i~~~~~~~-------------- 185 (474) T protein:vir:95 132 TATSNKGIDWLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELK-SFIRYYKFNN-------------- 185 (474) T ss_pred HHHhhcCcEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----CceE-EEEEEEEEcC-------------- Confidence 99999999998876531 356777666 677765 554321 1222 2233321100 Q ss_pred ccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhh Q lcl|NC_020488. 223 SDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYD 302 (688) Q Consensus 223 ~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ 302 (688) ...+++|.......+. ...+.+...... .... T Consensus 186 ----------------~~~~~~y~~~~~~~~~--~~~~~~~~~~~~------------------------------~~~~ 217 (474) T protein:vir:95 186 ----------------EEKVEFWTDTTVTYYV--LENGGLIPDYYY------------------------------GANH 217 (474) T ss_pred ----------------eeEEEEEeCCeEEEEE--EcCCcccccccc------------------------------Cccc Confidence 0012334332221111 111111110000 0111 Q ss_pred hcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhh Q lcl|NC_020488. 
303 VLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQ 382 (688) Q Consensus 303 ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 382 (688) +.....+.+.+.+|+|+|.. ...|.|.+..++++++.+|.++|.+.+.+.....+.+++.....++..+.... T Consensus 218 ~~~~~~~~~~g~iPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~ 290 (474) T protein:vir:95 218 IQSHFSNGNWGRVPFIAFKN-------NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRG 290 (474) T ss_pred ccccccccCCCccceEeecC-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhh Confidence 11222344456777776533 34578999999999999999999999999888888777655444433333222 Q ss_pred cccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 383 ANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAY 462 (688) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~ 462 (688) ....+ ++...+. ..+.++..+.-..++...++.+...|-..+++.+.+.+.-+++.||.|+..+-.......... T Consensus 291 -~~~~~-~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k 365 (474) T protein:vir:95 291 -LKYYK-AINVDGD---GGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKL 365 (474) T ss_pred -hhccc-eeeccCC---CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHH Confidence 12222 3322222 234555544445667778888999999999998877776666789999988876666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHH Q lcl|NC_020488. 463 IDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQR 542 (688) Q Consensus 463 ~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r 542 (688) ...|..++++++++++++. . . ..++ .++.|.=.++.+... 
T Consensus 366 ~~~~~~~l~~~~~li~~~~----g---------~--~~d~-------------------------~~i~v~f~~~~p~d~ 405 (474) T protein:vir:95 366 KNKATVAIQELIGFIIDFN----N---------L--KMDV-------------------------KDIEISFNFNRMMND 405 (474) T ss_pred HHHHHHHHHHHHHHHHHHh----C---------C--Cccc-------------------------ceeeEEeccCCCcCH Confidence 6667777777666655432 1 0 0011 111122122333322 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHhcC-CccHHHHHHHHHhhccccccchh-------hHHhhhhhhhhhhHH Q lcl|NC_020488. 543 MEAADSLMQFVQAVPAAGGVVLDLIAKNMD-WPGAQDIARRLQKTLPPGILDQD-------EMEEAGIEPPQPSPE 610 (688) Q Consensus 543 ~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~-~~~~~ei~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~ 610 (688) .+..+.+.++ +-+....++..++ ....++-.+++.+.......... ....+..+....+++ T Consensus 406 ~e~a~~~~~~-------g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 406 AEQSQIIAQS-------QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHHHHhc-------CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 3333333321 2233344444444 34444444444432211000000 000000000000111 No 59 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.82 E-value=6.1e-19 Score=120.43 Aligned_cols=438 Identities=10% Similarity=0.014 Sum_probs=226.9 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i 78 (688) |+|.+=-.-..+++...+.+.++...+ ...+..-.+..+||.|.|.- .....+..+++ .+++|..+.+| T Consensus 3 ~~~~~~~~~~~~~~~~~~~i~~~i~~~-------~~~~~r~~~~~~yy~g~~~i--~~~~~~~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:73 3 LKPIKLMTYSRDEEITDKVVNDFMKKH-------QEEVERYEYLGNMYKGIMEI--SSQKAKDSWKPDNRLTNNFAKYIV 73 (453) T ss_pred cccceeeeccccccCCHHHHHHHHHHH-------HHHHHHHHHHHHHhccccch--hcCCCCCccCccceeecchHHHHH Confidence 777766544444444444444443322 22334445567899998742 11222223443 57899999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+.+.+.. - |. .....+..++..|+++.....+..+++++|.||..++ T Consensus 74 d~~~~~l~g~~~~~~~--~------------------d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 129 (453) T protein:vir:73 74 DTFVGYFNGIPIKKTH--D------------------DK----SVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMY 129 (453) T ss_pred HHhhhhhcccCceeec--C------------------Ch----HHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEE Confidence 9999999988766532 1 11 2334566777889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEE-ecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFIS-ERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~-~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) .+. ++.+.+..+ +|..+ +||+... + ..++.. .+.+. 
T Consensus 130 ~d~------~~~~~i~~~-~p~~~~~v~dd~~~-----~-~~~~~i~~~~~~---------------------------- 168 (453) T protein:vir:73 130 QNE------STESEVIYC-SPLNVFMVYDDSIK-----Q-KPLFAVYYGFDE---------------------------- 168 (453) T ss_pred eCC------CCceEEEEE-cccceEEEEeCCCC-----c-eeEEEEEEEEec---------------------------- Confidence 542 345666655 67665 4554322 1 122222 22110 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) +.....++|....... +... ++...+..+.|.+.+.+ T Consensus 169 -~~~~~~~vyt~~~i~~--~~~~----------------------------------------~~~~~~~~~~~~~~g~v 205 (453) T protein:vir:73 169 -EGNLSGTVYTLLETIS--ITGK----------------------------------------AGEVKFGESTYNVYSDL 205 (453) T ss_pred -CceEEEEEEeCCeEEE--EEec----------------------------------------CCceEEccceeccCCce Confidence 0111234444321100 0000 01111122334445666 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecC- Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYN- 394 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~- 394 (688) |+|+|. +...|.|.+..++++++.+|..+|.+.+.+....++.+++....++.. ..... .....+.... T Consensus 206 Pvv~~~-------n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~--~~~~~-~~~~~~~~~~~ 275 (453) T protein:vir:73 206 PIVEYN-------FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEE--DAKNI-KDNRLINFFDK 275 (453) T ss_pred eEEEec-------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch--hhhcc-ccccccccccc Confidence 766542 234678999999999999999999999999888888776643222211 11110 0111111000 Q ss_pred ------cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
395 ------AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR 468 (688) Q Consensus 395 ------~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~ 468 (688) .......+.++..+.-...+...++.+...|-..|++.+.+.+..+ +.||.|+..+-.............|.. T Consensus 276 ~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~ 354 (453) T protein:vir:73 276 NSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFG-NSSGVALAYKLQAMSNLALSFQRKFQS 354 (453) T ss_pred ccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-CccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011122444444444566777788889999999998876666543 469999988766666666666666667 Q ss_pred HHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHH Q lcl|NC_020488. 469 AIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADS 548 (688) Q Consensus 469 ~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~ 548 (688) +++++.++++.+... .+ ...++ .+|.|.=.+..+....+..+. T Consensus 355 ~l~~~~~li~~~~~~----------~~--~~~~~-------------------------~~i~v~f~~~~p~~~~~~a~~ 397 (453) T protein:vir:73 355 ALNRRYSLWSSLSTN----------AS--NKDAW-------------------------KDIEYTFTRNEPKDIKEQAET 397 (453) T ss_pred HHHHHHHHHHHHHhc----------cC--Ccccc-------------------------ccceEEeCCCCCCCHHHHHHH Confidence 777666665543211 01 01011 122222234444444445555 Q ss_pred HHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 549 LMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKA 627 (688) Q Consensus 549 l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 627 (688) ++.+... +....+++.+++ ...++-.+++++...... ..++.....++.+.. 
T Consensus 398 ~~k~~gi------is~et~~~~~~~~~d~~~E~~ri~~E~~~~~-------~~~~~~~~~~~~~~~-------------- 450 (453) T protein:vir:73 398 ANILKGI------TSEETALSVISVIPDVQAEMEKIKKKKLLQL-------SLTRTSNLVRMKQMR-------------- 450 (453) T ss_pred HHHHhcc------CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-------HHHHhccCCcchhhh-------------- Confidence 5544321 222334444443 223333333332111000 000000000000000 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_020488. 628 DTAKAQADMAMAQA 641 (688) Q Consensus 628 e~~~~q~e~~~~q~ 641 (688) -.. T Consensus 451 -----------~~~ 453 (453) T protein:vir:73 451 -----------GNL 453 (453) T ss_pred -----------cCC Confidence 000 No 60 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.82 E-value=1e-19 Score=124.72 Aligned_cols=459 Identities=11% Similarity=0.062 Sum_probs=232.5 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCH-----HHHHHHHhcCCC--ceeehh Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPE-----SVRKEREDEGRP--CLTLNK 73 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~-----~~~~~~~~~g~p--~~~~N~ 73 (688) -+|--++-.....+..+.......+......+.....+....+..+||.|+|=-. ..........+| .+++|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:95 6 RMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred cCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 4566665554444433333334444455555555555667778889999986100 001111112233 478999 Q ss_pred HHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 
74 LPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 74 i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) .+.+|+..+|++..+.+.+.+ .|.+..+. +..+. .|+++.....+..+++++|.| T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~--------------------~~~~~~~~----l~~~~-~n~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:95 86 HQNLVDQKVSYVAGKPVTYAH--------------------DDDKVLDV----IHQVL-DTRWDNKLIDILTAASNKGID 140 (474) T ss_pred HHHHHHhhhhhhcccCceecc--------------------CChHHHHH----HHHHH-hccHHHHHHHHHHHHhhCCeE Confidence 999999999999998877643 12222333 33333 378999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccc Q lcl|NC_020488. 154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYS 231 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 231 (688) |..++.+. ++.+.+..+ +|.++| ||+... .+. ..+++.|... T Consensus 141 ~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~a~ir~~~~~------------------------ 184 (474) T protein:vir:95 141 WLQVYINE------DGELKLFRV-PAEQAIPIWTDKER----EQL-NAFIRIFTFN------------------------ 184 (474) T ss_pred EEEeeeCC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec------------------------ Confidence 98877542 345666666 677764 554321 122 2334444210 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) ....+++|.......+ ...++...... 
............|.+ T Consensus 185 ------~~~~~~vy~~~~i~~~--~~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~ 226 (474) T protein:vir:95 185 ------GETKVEYWTAETVTYY--VYENGGLIPDF------------------------------YYGDEHIQTHFSTGS 226 (474) T ss_pred ------CeeEEEEEeCCeEEEE--EEcCCceeecc------------------------------ccccccccCcccccC Confidence 0011233433221111 11111111100 001111112223334 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) .+.+|+|+|.. ...|.|.+..++++++.+|.+.|.+.+.+.....+.+++.....++..+.... .+..+++. T Consensus 227 ~~~vPvv~~~n-------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~-~~~~~~i~ 298 (474) T protein:vir:95 227 WERVPFIAFKN-------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEG-LKYYKAIN 298 (474) T ss_pred CCccceEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhh-hhccceee Confidence 45566665422 34577999999999999999999999999888887665533222222222221 11222222 Q ss_pred ecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 392 RYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) .+. 
...+.++..+.-..+....++.+...|-..|++.+.+.+..+++.||.|+..+-.............|..+++ T Consensus 299 -~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:95 299 -VSS---DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred -ccC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 2345555555556778888999999999999998877766666789999988876666666666666777777 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++. +.. .. .++. +|.+.=.+..+....+..+.+.+ T Consensus 375 ~~~~~i~~----~~g---------~~--~d~~-------------------------~i~i~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:95 375 ELMQFILD----FNK---------IK--LDAK-------------------------EIEITFNFNVMVNDLEQSQIGAQ 414 (474) T ss_pred HHHHHHHH----HhC---------CC--cccc-------------------------eeeEEecCCCccCHHHHHHHHHH Confidence 66665554 321 00 0110 11111122222222222332221 Q ss_pred HHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTA 630 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~ 630 (688) .+-+....+++.+++ ...++-.+++++.....................+..+ T Consensus 415 -------~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~-------------------- 467 (474) T protein:vir:95 415 -------SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQ-------------------- 467 (474) T ss_pred -------cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCC-------------------- Confidence 112223344444443 3333334444322110000000000000000000000 Q ss_pred HHHHHHHHHHHHHHH Q lcl|NC_020488. 
631 KAQADMAMAQAKTAE 645 (688) Q Consensus 631 ~~q~e~~~~q~~~~~ 645 (688) .++.+.+ T Consensus 468 --------~~~~e~~ 474 (474) T protein:vir:95 468 --------SENNQSK 474 (474) T ss_pred --------CCccccC Confidence 0000000 No 61 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.82 E-value=1e-19 Score=124.72 Aligned_cols=459 Identities=11% Similarity=0.062 Sum_probs=232.5 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCH-----HHHHHHHhcCCC--ceeehh Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPE-----SVRKEREDEGRP--CLTLNK 73 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~-----~~~~~~~~~g~p--~~~~N~ 73 (688) -+|--++-.....+..+.......+......+.....+....+..+||.|+|=-. ..........+| .+++|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:96 6 RMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred cCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 4566665554444433333334444455555555555667778889999986100 001111112233 478999 Q ss_pred HHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 74 LPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 74 i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) .+.+|+..+|++..+.+.+.+ .|.+..+. +..+. 
.|+++.....+..+++++|.| T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~--------------------~~~~~~~~----l~~~~-~n~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:96 86 HQNLVDQKVSYVAGKPVTYAH--------------------DDDKVLDV----IHQVL-DTRWDNKLIDILTAASNKGID 140 (474) T ss_pred HHHHHHhhhhhhcccCceecc--------------------CChHHHHH----HHHHH-hccHHHHHHHHHHHHhhCCeE Confidence 999999999999998877643 12222333 33333 378999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccc Q lcl|NC_020488. 154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYS 231 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 231 (688) |..++.+. ++.+.+..+ +|.++| ||+... .+. ..+++.|... T Consensus 141 ~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~a~ir~~~~~------------------------ 184 (474) T protein:vir:96 141 WLQVYINE------DGELKLFRV-PAEQAIPIWTDKER----EQL-NAFIRIFTFN------------------------ 184 (474) T ss_pred EEEeeeCC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec------------------------ Confidence 98877542 345666666 677764 554321 122 2334444210 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) ....+++|.......+ ...++...... ............|.+ T Consensus 185 ------~~~~~~vy~~~~i~~~--~~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~ 226 (474) T protein:vir:96 185 ------GETKVEYWTAETVTYY--VYENGGLIPDF------------------------------YYGDEHIQTHFSTGS 226 (474) T ss_pred ------CeeEEEEEeCCeEEEE--EEcCCceeecc------------------------------ccccccccCcccccC Confidence 0011233433221111 11111111100 001111112223334 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 
312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) .+.+|+|+|.. ...|.|.+..++++++.+|.+.|.+.+.+.....+.+++.....++..+.... .+..+++. T Consensus 227 ~~~vPvv~~~n-------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~-~~~~~~i~ 298 (474) T protein:vir:96 227 WERVPFIAFKN-------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEG-LKYYKAIN 298 (474) T ss_pred CCccceEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhh-hhccceee Confidence 45566665422 34577999999999999999999999999888887665533222222222221 11222222 Q ss_pred ecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 392 RYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) .+. ...+.++..+.-..+....++.+...|-..|++.+.+.+..+++.||.|+..+-.............|..+++ T Consensus 299 -~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:96 299 -VSS---DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred -ccC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 2345555555556778888999999999999998877766666789999988876666666666666777777 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++. +.. .. .++. 
+|.+.=.+..+....+..+.+.+ T Consensus 375 ~~~~~i~~----~~g---------~~--~d~~-------------------------~i~i~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:96 375 ELMQFILD----FNK---------IK--LDAK-------------------------EIEITFNFNVMVNDLEQSQIGAQ 414 (474) T ss_pred HHHHHHHH----HhC---------CC--cccc-------------------------eeeEEecCCCccCHHHHHHHHHH Confidence 66665554 321 00 0110 11111122222222222332221 Q ss_pred HHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTA 630 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~ 630 (688) .+-+....+++.+++ ...++-.+++++.....................+..+ T Consensus 415 -------~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~-------------------- 467 (474) T protein:vir:96 415 -------SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQ-------------------- 467 (474) T ss_pred -------cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCC-------------------- Confidence 112223344444443 3333334444322110000000000000000000000 Q ss_pred HHHHHHHHHHHHHHH Q lcl|NC_020488. 631 KAQADMAMAQAKTAE 645 (688) Q Consensus 631 ~~q~e~~~~q~~~~~ 645 (688) .++.+.+ T Consensus 468 --------~~~~e~~ 474 (474) T protein:vir:96 468 --------SENNQSK 474 (474) T ss_pred --------CCccccC Confidence 0000000 No 62 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.82 E-value=2.6e-19 Score=122.46 Aligned_cols=468 Identities=11% Similarity=0.013 Sum_probs=235.2 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHHHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~~~i 78 (688) .+|..+...... -+.+.++...+ ....+....+..+||.|.|.--.........++ ..+++|..+.+| T Consensus 30 ~~~~~~~~~~~~----~~~i~~~i~~~------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:10 30 TYDGTESDLLQN----VNEVSKCIEHH------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred cCchhhhhcccC----HHHHHHHHHHH------HHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHH Confidence 333333211111 11222222111 122244556678999998763211111222233 367889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+++.+.+- |.+ ....+..++..|+++.....+..++++.|.+|..++ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy 155 (511) T protein:vir:10 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMI 155 (511) T ss_pred HHHhhhhcccCceeecC--------------------chH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEE Confidence 99999999988776431 122 335577777889999999999999999999988776 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+ + ++.+.+..+ +|..+| ||+... .-...+++.|... .....+. 
T Consensus 156 ~d---e---dg~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~~~~----------------------~~d~~~~ 201 (511) T protein:vir:10 156 RN---Q---DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYLRTK----------------------PIDKTDE 201 (511) T ss_pred eC---C---CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEEEee----------------------ecccCcc Confidence 53 1 356777666 677754 554321 1122333333210 0001122 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+. ...+..... + .....+.|.+.+.+| T Consensus 202 ~~~~~~~iyt~~~i~~~~--~~~~~~~~~---------------------------------~--~~~~~~~~~~~~~vP 244 (511) T protein:vir:10 202 DEVFTVDLFTSHGVYRYL--TSRTNGLKL---------------------------------T--PRENGFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEE--ecCCCcccc---------------------------------c--ccccccccccCccee Confidence 334445666543221111 111110000 0 001123344556666 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecC-- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYN-- 394 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-- 394 (688) +|+|. ....|.|.+..++++++.+|...|.+.+.+....++.+++......+.++... ....+.+.... T Consensus 245 vv~f~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:10 245 ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred EEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhcc--chhccceeccccc Confidence 66542 12357799999999999999999999999988777766554323222222111 11111111110 Q ss_pred -----c--ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
395 -----A--IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 395 -----~--~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) + ..+...+.++..+.-..++...++.+...|..+|++.+.+.+.-+++.||.|+..+-.............|. T Consensus 316 ~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~ 395 (511) T protein:vir:10 316 YADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 111223445544444566778888899999999999888777655667999999887777777777777777 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .+++++++++..++...-. .+...++ .+|.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~~~~~~~---------~~~~~d~-------------------------~~i~i~f~~~~p~d~~~~~~ 441 (511) T protein:vir:10 396 KGLRRRAKLLETILKNTRS---------IDANKDF-------------------------NTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhhCC---------ccccccc-------------------------ceeeEEeCCCCCcCHHHHHH Confidence 7777777776665432210 0001111 12333334444544555555 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAK 626 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q 626 (688) .++.+... +....+++++++ ...++-.+++.+............... ........ 
T Consensus 442 ~~~kl~G~------iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~--~~~~~~~~---------------- 497 (511) T protein:vir:10 442 AYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK--DPRDINDD---------------- 497 (511) T ss_pred HHHHHhcc------CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhccc--CCCCCCCC---------------- Confidence 55554322 223444555443 333333444433211100000000000 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 627 ADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 627 ~e~~~~q~e~~~~q~~~~~ 645 (688) ...+...-...+.+ T Consensus 498 -----~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 498 -----EQDDDTKDTVDKKE 511 (511) T ss_pred -----CCCCcccCcccccC Confidence 00000000000000 No 63 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.82 E-value=1.3e-18 Score=118.69 Aligned_cols=505 Identities=12% Similarity=0.014 Sum_probs=256.6 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~ 87 (688) |+...+....-...+++.+|+...+..+.|...|++..+|..-.=+++.... ++...+.-..-...+++..+.+.. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~----~~~~~~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDN----ETSQNGWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCc----cccCCcccchHHHHHHHHHHHHHh Confidence 5555555555566889999999999999999999999998853222211100 111112222333334444444433 Q ss_pred C-----CcceEEEeCCccccccccccccccChhhHHH---HHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
88 N-----RPAIQVHPVEANATKDTSKVPNVAGTSDYSL---AEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 88 ~-----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~---Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) . ++=+++.+.+... ......+.+..+. .+..+..+...+..|++..+...++.+.+..|.|++.+ T Consensus 77 ~ltpp~~~WF~L~~~~~~~-----~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-- 149 (516) T protein:vir:96 77 VLFPAQRSFFRVDLTAQGE-----KVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYK-- 149 (516) T ss_pred hhcCCCCcccccccChhHH-----hhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEe-- Confidence 2 3333443332110 0000001111112 23345556666778999999999999999999998544 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEE Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGV 239 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v 239 (688) + ++ +. ++.+ +..++++..++. . ...-++++.+++..+|.+.|+......... . + -..++.+ T Consensus 150 d---~~---~~--~~~~-pl~~y~v~~d~~-G---~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~-~--~---~~~~~~v 210 (516) T protein:vir:96 150 P---SK---GA--ISAI-PMHHYVVNRDTN-G---DLLDIILLQEKALRTFDPATRAVVEVGLKG-K--K---CKEDDSV 210 (516) T ss_pred c---CC---CC--EEEE-EcCeEEEeeCCC-C---CeeeehhhhHhhHHHHHHhhhhhhhhhhhh-h--h---cCCCCce Confidence 2 11 11 2333 345566654432 1 123377888999999988874422111100 0 0 0112334 Q ss_pred EEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEE Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAP 319 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp 319 (688) .|..+-++.+ ++. +.++. ...+..++. 
.+-|+...+||++ T Consensus 211 ~v~~~v~~~~---------~~~-----------------------------~~~~~-~~d~~~~~~-es~~~~~e~P~~~ 250 (516) T protein:vir:96 211 KLYTHAKYLG---------DGF-----------------------------WELKQ-SADDIPVGK-VSKIKSEKLPFIP 250 (516) T ss_pred EEEEeeeeeC---------Cce-----------------------------eEEEE-EeCceeecc-ccccccccCCeee Confidence 4443333322 110 11222 223333322 2345567899996 Q ss_pred EeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccccc Q lcl|NC_020488. 320 VLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGV 399 (688) Q Consensus 320 ~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (688) ..+ ...+|..||.|.+....+--+.+|++...++.......++.++++.+.+.+...+.. ..+|.++. + .. T Consensus 251 ~Rw--~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g~i~~--g-~~- 321 (516) T protein:vir:96 251 LTW--KRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTGEVVT--G-VE- 321 (516) T ss_pred eee--eecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhcc---CCCceeec--C-Cc- Confidence 544 467999999999999999999999999999999999999999998888876654432 22334432 1 11 Q ss_pred ccceecC--CCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_020488. 400 DRPQRDM--PASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR-AIRRVGQI 476 (688) Q Consensus 400 ~~~~~~~--~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~ 476 (688) +.+..++ +..--+.....++...+.|.... ..+.+.-.++...|++.|..+.+.-...|...+.+|.. ++..+.+. 
T Consensus 322 ~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af-~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 400 (516) T protein:vir:96 322 EDIHIVQLGKYADLTPISAVLEVYTRRIGVVF-MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMW 400 (516) T ss_pred ccceeeecCcccchhHHHHHHHHHHHHHHHHH-hhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 2222222 22223555667777777777654 22222223455689999999998888888888888753 44444433 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) ++..+ ++ . + + .+..++++.. +-.+-.|.+..+.+..+++.+ T Consensus 401 ~l~~~-------------~p----~---l------p------------~~~v~~~~vs-~l~~l~r~~~~~~i~~~~~~i 441 (516) T protein:vir:96 401 GLLEA-------------GE----S---F------T------------SDLVDPVIIT-GIEALGRMAELDKLANFAQYM 441 (516) T ss_pred HHHhc-------------CC----C---C------c------------cccccceeec-hHHHHHHHHHHHHHHHHHHHH Confidence 32111 11 0 0 0 0111222222 233445666677777777655 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) ..+.+. .+.++.. -+.+++.+.+....+-...-.... ++..+..+++++++ ..+++++.+ .++.. T Consensus 442 ~~~~~~-~p~v~d~---id~d~~~~~~a~~~Gvp~~~irs~-eev~~~~~~~~~~q---------~~~~~a~~~-~~~~~ 506 (516) T protein:vir:96 442 SLPLQW-PEPVLAA---VKWPDYMDWVRGQISAELPFLKSA-EEMAQEQEAQMQAQ---------QAQMLEEGV-AKAVP 506 (516) T ss_pred HHHhcC-ChhHHhc---CCHHHHHHHHHHHhCCCccccCCH-HHHHHHHHHHHHHH---------HHHHHHHHh-hhhhh Confidence 443322 2333333 344666666655444321110000 00000000000000 000001100 00000 Q ss_pred HHHHHHHHHH Q lcl|NC_020488. 
637 AMAQAKTAEA 646 (688) Q Consensus 637 ~~~q~~~~~~ 646 (688) ..+.++.+++ T Consensus 507 ~~~~~~~~~~ 516 (516) T protein:vir:96 507 GVIQQELKEA 516 (516) T ss_pred HHhhcccccC Confidence 1011111111 No 64 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.82 E-value=3.9e-19 Score=121.52 Aligned_cols=468 Identities=10% Similarity=0.019 Sum_probs=234.6 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC--CceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR--PCLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~--p~~~~N~i~~~i 78 (688) -+|+.+...... ..++...+.+.. ...+....+-.+||.|.|..-.........++ -.++.|..+.+| T Consensus 30 ~~~~~e~~~~~~-------~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:93 30 TYDGTESDLLQN-------VNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred cccchhhhhhcc-------HHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHH Confidence 333333211111 111222121111 12233455667899998753211111122222 357889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+.+.+..- |.+ ....+..++..|+++.....+..+++++|.||..|+ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy 155 (511) T protein:vir:93 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (511) T ss_pred HHHhhhhcccCeeeccC--------------------ChH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEE Confidence 99999999988776430 222 334577777889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+| ||+... .-...+++.|.... ....+. T Consensus 156 ~de------~~~~~i~~~-~p~~~~~vydd~~~-----~~~~~~vr~~~~~~----------------------~~~~~~ 201 (511) T protein:vir:93 156 RNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYLRTKP----------------------IDKTDE 201 (511) T ss_pred eCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEEEeee----------------------cccccc Confidence 542 356777665 677764 665332 11334444443110 000012 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+. ...+..... 
......+.|.+.+.+| T Consensus 202 ~~~~~~~iyt~~~i~~~~--~~~~~~~~~-----------------------------------~~~~~~~~~~~~g~vP 244 (511) T protein:vir:93 202 DEVFTVDLFTSHGVYRYL--TSRTNGLKL-----------------------------------TPRENGFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEE--ecCCCcccc-----------------------------------ccccccccccCCCccc Confidence 234445556543221111 111110000 0011223344456666 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecC-- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYN-- 394 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-- 394 (688) +|+|. ....|.|.+..++++++.+|..+|.+.+.+....++.+++......+.++... ....+.++... T Consensus 245 vv~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:93 245 ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred eEEec-------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcc--cccccceeccccc Confidence 66542 23457799999999999999999999999987777766554322222222111 11111111110 Q ss_pred -------cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 395 -------AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 395 -------~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) ...+...+.++..+.-..++...++.+...|-.+|++.+.+.+.-+++.||.|+..+-.............|. 
T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~ 395 (511) T protein:vir:93 316 YADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111233445544444566778888999999999999988877655667999999888777777777777778 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .++++++++++.++...... +...++. +|.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~l~~~~~~---------~~~~d~~-------------------------~i~~~f~~~~p~n~~e~~~ 441 (511) T protein:vir:93 396 KGLRRRAKLLETILKNTWSI---------DANKDFN-------------------------TVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhccCc---------ccccccc-------------------------cceEEeCCCCCCCHHHHHH Confidence 88888777777655332210 0011110 1222223444444444455 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAK 626 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q 626 (688) .+..+... +....+++++++ ...++-.+++++................... ... T Consensus 442 ~~~kl~g~------iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~--~~~----------------- 496 (511) T protein:vir:93 442 AYIDSGGK------ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD--IND----------------- 496 (511) T ss_pred HHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCC--CCC----------------- Confidence 55544322 223344555443 3333334444332111000000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
627 ADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 627 ~e~~~~q~e~~~~q~~~~~ 645 (688) ....+...-...+++ T Consensus 497 ----~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 497 ----DEQDDDTKDTVDKKE 511 (511) T ss_pred ----CCCCCcccccccccC Confidence 000000000000000 No 65 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.82 E-value=1e-18 Score=119.24 Aligned_cols=434 Identities=11% Similarity=0.041 Sum_probs=211.1 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCCceeehhHHHHHHHHHHHHHhCCc Q lcl|NC_020488. 12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRPCLTLNKLPQYVDQVLGDQRQNRP 90 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~i~~i~g~~~~~r~ 90 (688) -+++..+++..+...+.. .+....+-.+||.|+|.-....... ..-..-.++.|..+-+|+..++...-+ T Consensus 1 ~~~~~~~~i~~l~~~~~~-------~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~-- 71 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQR-------LSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWL-- 71 (441) T ss_pred CCccHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccc-- Confidence 444444556665554432 2334445569999988642211000 000112356788888888877765211 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) . |. .+ |. ..+..++..|+++.....++.++++.|.||..|+.+ .++. 
T Consensus 72 g--~~--~~----------------d~-------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d------~~g~ 118 (441) T protein:vir:80 72 G--WT--NG----------------DG-------YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH------GDGT 118 (441) T ss_pred c--cc--CC----------------Ch-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC------CCCc Confidence 0 10 00 11 124556678999999999999999999999887643 2356 Q ss_pred eeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeee Q lcl|NC_020488. 171 LCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYRE 248 (688) Q Consensus 171 ~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~ 248 (688) +.+..+ +|.++ +||+...... .++.+..+- +++ ....++|+.. T Consensus 119 ~~i~~~-~p~~~~~i~d~~~~~~~----~~~~~~~~~-----------------------------~~~-~~~~~vy~~~ 163 (441) T protein:vir:80 119 VSVRPQ-SPKNCTGKFSADGSRLD----AGLVVQQTC-----------------------------DPE-VVEAELLLPD 163 (441) T ss_pred eEEEEE-ccceEEEEEeCCCCcee----EEEEEEEEe-----------------------------cCc-eEEEEEEecC Confidence 666655 68775 4777543211 111111110 000 1112333221 Q ss_pred ecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccC Q lcl|NC_020488. 249 PVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIG 328 (688) Q Consensus 249 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~ 328 (688) .. .+.. .+|. +..+..++.|.+.+.+|+|||... +.. T Consensus 164 ~~-~~~~--~~~~--------------------------------------~~~~~~~~~~~~~g~vPvv~~~n~--~~~ 200 (441) T protein:vir:80 164 VI-VQVE--RRGS--------------------------------------REWVEVDRIPNVLGAVPLVPIVNR--RRT 200 (441) T ss_pred eE-EEEE--EcCC--------------------------------------cceeeccccccCCCceeEEEeecc--ccC Confidence 10 0000 0100 001112334455678888887543 345 Q ss_pred CcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhh-hcchHHHHhhcccCCCceeecCcccccccceecC Q lcl|NC_020488. 
329 DKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAES-IEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDM 406 (688) Q Consensus 329 ~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 406 (688) +.++|.|-+ +.++++++.+|..+|.+...+...+.+...+. |+ .+...+ .......+.++.......+..+.... T Consensus 201 ~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~~~~~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~ 277 (441) T protein:vir:80 201 SRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GVSADEFSQ--PGWVLSMASVWAVDKDDDGDTPNVGS 277 (441) T ss_pred CccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cCCcccccc--chhhhcccccccCCCCCCCCcceeEe Confidence 778888855 67999999999999999999988887765553 32 211111 11112233444433322333333322 Q ss_pred CC-cchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 407 PA-SMPAAELQLALSATDEMKATIGLYDASVGAQGNE-QSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRV 484 (688) Q Consensus 407 ~~-~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~-~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~ 484 (688) .+ .....+...+......+-.+|++++..+|..+++ .||.|+..+...-........+.|..++++++++++.+ T Consensus 278 ~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~---- 353 (441) T protein:vir:80 278 FPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKA---- 353 (441) T ss_pred cCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 21 2234444445555555556689999899977654 59999998877666666666666777777766655443 Q ss_pred cCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_020488. 485 YDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVL 564 (688) Q Consensus 485 ~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~ 564 (688) +... +. ....+ .++.+.=.+..+....+..+.+.++.+..... +.. 
T Consensus 354 ~~~~------~~-~~~~~-------------------------~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~--~s~ 399 (441) T protein:vir:80 354 LDSR------VD-EADFF-------------------------GDVGLRWRDASTPTRAATADAVTKLVGAGILP--ADS 399 (441) T ss_pred hcCC------Cc-ccccc-------------------------eeeeEEeCCCCCcCHHHHHHHHHHHHhcCccc--ccH Confidence 2210 00 00000 12222223333334445555555555431111 011 Q ss_pred HHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 565 DLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMA 637 (688) Q Consensus 565 ~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~ 637 (688) ..+++.+++.. +++. ++.+... +++.++.+ . ....+.+-++- T Consensus 400 ~~~~~~l~~~~-~e~~-~~~~e~~---------------------e~~~~~~~-------~-~~~~~~~~~~~ 441 (441) T protein:vir:80 400 RTVLEMLGLDD-VQVE-AVMRHRA---------------------ESSDPLAV-------L-AGAISRQTNEV 441 (441) T ss_pred HHHHHhCCCCH-HHHH-HHHHHHH---------------------HHHHHHHH-------H-hhhhhcccccC Confidence 23344444431 2222 1111000 00000000 0 00000000000 No 66 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.82 E-value=3.8e-18 Score=116.08 Aligned_cols=508 Identities=11% Similarity=0.004 Sum_probs=256.8 Q ss_pred CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhC--- Q lcl|NC_020488. 12 DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQN--- 88 (688) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~--- 88 (688) -+...-.-..++..+|+...+..+.|...|++..+|..-.-.+...-... + ..+.-..-...+++..+.+... 
T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~-~~~~dstg~~a~~~LAa~l~~~ltp 76 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLS---S-QNAWQDDGASATNFLSNKLSQVLFP 76 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCcc---c-cccccchHHHHHHHHHHHHHHhhcC Confidence 23333345578889999999999999999999999885322111100000 0 1122223333444444444332 Q ss_pred --CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCC Q lcl|NC_020488. 89 --RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDA 166 (688) Q Consensus 89 --r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~ 166 (688) ++=+++.+.+....... .......+-+..+ +..+..+...+..|++..+...++.+.+..|.|++.+ + +. T Consensus 77 p~~~WF~l~~~~~~l~~~~-~~~~~~~~v~~~L-~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~----~~ 148 (517) T protein:vir:10 77 AQRSFFRIDLTPEGIKQLD-NEAMTQSTAQKLL-SDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--P----DK 148 (517) T ss_pred CCCccccccCCHHHHHhhc-cCcchHHHHHHHH-HHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--e----CC Confidence 22233333221100000 0000000112222 3345666666788999999999999999999987532 1 11 Q ss_pred CCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEe Q lcl|NC_020488. 167 FDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFY 246 (688) Q Consensus 167 ~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~ 246 (688) .. .++.+ +..++++..++. .+ ..-++++..++..++.+.|+...... .. 
......++.+.|+.+-+ T Consensus 149 -~~--~~~~~-pl~~y~v~~d~~-G~---v~~ivrr~~~~~~~l~~~~~~~~~~~----~~--~~~~~~~~~v~v~~~v~ 214 (517) T protein:vir:10 149 -TS--PIQAV-PLHHYCVRRDNN-GT---VLDIVFLQEKALETFEPSIRMAIQAS----RK--GKQYKDKDNVKLYTHAK 214 (517) T ss_pred -CC--cEEEE-EcCeEEEeeCCC-cC---eEEEEeeeeccHHHHHHHhhhhcchh----hh--hhccCCcCceEEEEEEE Confidence 12 23333 345566654432 11 22367888999999999997643211 00 01112234455555433 Q ss_pred eeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeec Q lcl|NC_020488. 247 REPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMV 326 (688) Q Consensus 247 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~ 326 (688) +.+ +|.. +++..+.+..++ ..+-|+...+||+++.+ .. T Consensus 215 ~~~---------~~~~------------------------------~~~~~~d~~~~~-~~s~y~~~e~P~~~~Rw--~~ 252 (517) T protein:vir:10 215 RTK---------DGKY------------------------------LIRQSADDVPVG-KESTVTEDKSPFLILTW--KR 252 (517) T ss_pred EeC---------CCce------------------------------EEEEEeCceeec-cccccccccCCeeeeee--ee Confidence 321 1211 122223333333 23456678999997655 46 Q ss_pred cCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceec- Q lcl|NC_020488. 327 IGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRD- 405 (688) Q Consensus 327 ~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 405 (688) .+|..||.|.+....+--+.+|++....+.....+.+++++++.+.+.+...+.. ..+|.++. +.. 
..+..+ T Consensus 253 ~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~---~~~g~~~~---g~~-~~v~~~~ 325 (517) T protein:vir:10 253 SYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVE---GGSGAVLH---GVE-GDIHIVQ 325 (517) T ss_pred cCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccC---CCcccccc---CCc-ccceeee Confidence 7999999999999999999999999999999999999999999888876654322 12223322 111 122222 Q ss_pred -CCCcchHHHHHHHHHHHHHHHHHhCcChHHcC-CCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_020488. 406 -MPASMPAAELQLALSATDEMKATIGLYDASVG-AQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIP 482 (688) Q Consensus 406 -~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G-~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~ 482 (688) ....-.+...+.++...+.|....= .+. ++ .++...|++.|..+.+.-...|...+.+|. .++..+.+.++..+. T Consensus 326 ~~~~~d~~~~~~~i~~~~~rI~~af~-~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~ 403 (517) T protein:vir:10 326 LGKYADYTPIQAVLNDYRQRIGRVFM-MEA-MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGIS 403 (517) T ss_pred cccccchhHHHHHHHHHHHHHHHHHh-hhh-hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhh Confidence 2333346667778888888887652 222 33 344568999999999998888888888875 455555554444332 Q ss_pred HHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020488. 483 RVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGV 562 (688) Q Consensus 483 ~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~ 562 (688) .-.. + .+ ..+.+.++. ....|.+..+.+.++++.+..+.+. 
T Consensus 404 ~~l~--------~-------------------------~~-----v~~~~~s~l-a~l~r~~~~~~i~~~~~~i~~~a~~ 444 (517) T protein:vir:10 404 SILT--------S-------------------------KN-----VSPTILTGI-EALGRMAELDKLGTFNGYVSMTAQW 444 (517) T ss_pred hhcC--------C-------------------------CC-----ccceeeccH-HHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1110 0 01 112222222 2445666777777777665444332 Q ss_pred HHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 563 VLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAK 642 (688) Q Consensus 563 ~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~ 642 (688) .+.+.+. -+.+++...+....+-...-....++..+ +++++++++++ ++.++.+ ...+.+... T Consensus 445 -~~~~~~~---id~d~~~~~~a~~~Gvp~~~irs~~ev~~-------~~~~~~~~~~~---~~~~~~a---g~~~~~~~~ 507 (517) T protein:vir:10 445 -PEPLQQA---IKWPDFTDWVQGQISANFPFFKTQDELNA-------EAQAQQEQEAT---KYAAEQA---GKAIPDMVK 507 (517) T ss_pred -ChHHHhc---CCHHHHHHHHHHHhCCChhhcCCHHHHHH-------HHHHHHHHHHH---HHHHHHH---HHHHHHHHh Confidence 2222222 24466666665544322110000000000 00000000000 0000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 643 TAEAQAKLAEIEQAAMMAGPG 663 (688) Q Consensus 643 ~~~~~a~~~~~~~~a~~~~~~ 663 (688) .... ..+..| T Consensus 508 -~~~~----------~~~~~~ 517 (517) T protein:vir:10 508 -NGQI----------NPQGGQ 517 (517) T ss_pred -CCCC----------CCCCCC Confidence 0000 000000 No 67 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.82 E-value=4.6e-19 Score=121.13 Aligned_cols=447 Identities=14% Similarity=0.090 Sum_probs=227.3 Q ss_pred CCCCCCCcCCC-------CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC--CHHHHHH---HHhcCCC- Q lcl|NC_020488. 
1 MLPGNEPIKTR-------DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW--PESVRKE---REDEGRP- 67 (688) Q Consensus 1 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw--~~~~~~~---~~~~g~p- 67 (688) -+|..+..-.. ......+++.++ .+...+.+....+..+||.|.|= ....... .....+| T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (468) T protein:vir:96 5 FWPNEKPYHERVVEQIKPQYETQEEMILRL-------ITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPD 77 (468) T ss_pred cCCcCceeehheeecccccccCcHHHHHHH-------HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccc Confidence 35555422211 222223333333 23333445556777899999751 1000000 0112223 Q ss_pred -ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Q lcl|NC_020488. 68 -CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQH 146 (688) Q Consensus 68 -~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d 146 (688) .+++|..+.+|+..+|+...+.+.+.+ .|.+..+.+. .++ .|+++.....+..+ T Consensus 78 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~--------------------~d~~~~~~l~----~~~-~n~~~~~~~~~~~~ 132 (468) T protein:vir:96 78 WRMYTNYHQNLVDQKVAYAVANPVTYGT--------------------EDEKSLKTIQ----EVL-NHKWDDKLVDILTA 132 (468) T ss_pred cccccchHHHHHHHHHhhhccCCceecc--------------------CChHHHHHHH----HHH-hcCHHHHHHHHHHH Confidence 578999999999999999998877643 1233333333 333 36788888999999 Q ss_pred HHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccc Q lcl|NC_020488. 147 AVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSD 224 (688) Q Consensus 147 ~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~ 224 (688) +.++|.||..|+.+. ++.+++..+ +|..++ ||+.. ..+..+ +++.|... + . 
T Consensus 133 ~~~~G~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~~~~~----~~~~~~-~ir~~~~~--------~---~---- 185 (468) T protein:vir:96 133 ASNKGVEWIQPYVDE------QGEFKTFRV-PAEQAIPIWTNKE----RDELKA-FIRLYELD--------G---G---- 185 (468) T ss_pred HhhcCeEEEEEEEcC------CCceEEEEE-cccceEEEEcCCC----CCceEE-EEEEEEec--------C---c---- Confidence 999999998887642 346777666 677765 54322 223222 33333100 0 0 Q ss_pred ccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhc Q lcl|NC_020488. 225 AERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVL 304 (688) Q Consensus 225 ~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~il 304 (688) .-+++|.......+. ..++..+....... .......+ T Consensus 186 ---------------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~--------------------------~~~~~~~~ 222 (468) T protein:vir:96 186 ---------------ERVEYWTANDVTFYE--LKDGQLIPDYYQGE--------------------------EHVQAHYY 222 (468) T ss_pred ---------------eEEEEEeCCeEEEEE--EcCCceeecccccc--------------------------ccccccee Confidence 002333222111111 11111111000000 00001112 Q ss_pred ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcc Q lcl|NC_020488. 305 EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQAN 384 (688) Q Consensus 305 e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 384 (688) ....|.+.+.+|+|+|.. ...|.|.+..++++++.+|...|.+...+....++.+++.....++..++.... T Consensus 223 ~~~~~~~~~~iPvv~~~n-------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~- 294 (468) T protein:vir:96 223 VGNKSMSWNRVPFIPFKN-------NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNL- 294 (468) T ss_pred eccccccCCcccEEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhh- Confidence 233455667778776522 345789999999999999999999999998888887776543333333332211 Q ss_pred cCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
385 RKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYID 464 (688) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~d 464 (688) +..+.+. ..+.. ...++++....-...+...++.+...|-..|++.+.+.+..+++.||.|+..+............. T Consensus 295 ~~~~~i~-~~~d~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~ 372 (468) T protein:vir:96 295 KYYKAIN-VDGDG-SGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKN 372 (468) T ss_pred hcCceEE-ecCCC-CCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 2223333 32222 223566665555677778889999999999999887776666678999998877666666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHH Q lcl|NC_020488. 465 NLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRME 544 (688) Q Consensus 465 n~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~ 544 (688) .|..+++++.++++. ++. .+ .++ .+|.|.=.++.+....+ T Consensus 373 ~~~~~l~~~~~li~~----~~g---------~~--~d~-------------------------~~i~i~f~~~~p~d~~e 412 (468) T protein:vir:96 373 KTLTALQELLQYIID----FYK---------LS--IKV-------------------------QDVEITFNFNVMVNELE 412 (468) T ss_pred HHHHHHHHHHHHHHH----HhC---------CC--ccc-------------------------ceeeEEecCCCCcCHHH Confidence 667777666655554 321 10 011 01111112222222222 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhh Q lcl|NC_020488. 545 AADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS 608 (688) Q Consensus 545 ~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 608 (688) ..+.+.+ .+-+....+++.+++ ...++-.+++++........ +...-.....++. 
T Consensus 413 ~a~~~~~-------~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~--~~~~~~~~~~~~~ 468 (468) T protein:vir:96 413 QSQIGVN-------SQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSI--EEGLNGKENNEPT 468 (468) T ss_pred HHHHHHh-------cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH--hhccCCCCCCCCC Confidence 3332221 122233444555433 23333334443221110000 0000000011111 No 68 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.82 E-value=4.9e-19 Score=120.96 Aligned_cols=438 Identities=8% Similarity=0.010 Sum_probs=226.9 Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-----HHHHhcCCC--ceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 15 SQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-----KEREDEGRP--CLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-----~~~~~~g~p--~~~~N~i~~~i~~i~g~~~~ 87 (688) ...+.+.++.+ .....+....+..+||.|++.-..-. ........| .+++|..+.+|+..+|+... T Consensus 1 l~~~~i~~~i~-------~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G 73 (451) T protein:vir:10 1 MELEKIRAIIS-------ADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT 73 (451) T ss_pred CCHHHHHHHHH-------HHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec Confidence 33334444333 23345566778889999986421100 001112233 57789999999999999999 Q ss_pred CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCC-- Q lcl|NC_020488. 88 NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDD-- 165 (688) Q Consensus 88 ~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~-- 165 (688) +.+.+.+. .|.+..+ .++... .|+++........+++++|.||..++.+-+... 
T Consensus 74 ~p~~~~~~-------------------~~~~~~~----~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~ 129 (451) T protein:vir:10 74 YPVLFDID-------------------NNKELNE----KVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQ 129 (451) T ss_pred ccceeecC-------------------CcHHHHH----HHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCccccc Confidence 88776431 1233333 344433 478999999999999999999988877643322 Q ss_pred CCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEE Q lcl|NC_020488. 166 AFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSE 243 (688) Q Consensus 166 ~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e 243 (688) ...+.+.+..+ +|..++ ||.... .+ ...+.+.|...++-. .-.....+..++ T Consensus 130 ~~~~~~~~~~i-~p~~~~~vydd~~~----~~-~~~~ir~~~~~~~~~--------------------~~~~~~~~~~~e 183 (451) T protein:vir:10 130 VTNQTFKYGVV-NTEEIIPIYRNGIE----RE-LEAVIRYYIQLEDVK--------------------GQIQKQAYTYVE 183 (451) T ss_pred ccccceeEEEE-cccceEEEEcCCCC----Cc-eEEEEEEEEeeeccc--------------------ccccceEEEEEE Confidence 22356677666 677764 553221 12 233344443211100 000112233344 Q ss_pred EEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeee Q lcl|NC_020488. 244 YFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGK 323 (688) Q Consensus 244 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~ 323 (688) +|....... +...++. ..+..++..+.|-+.+.+|+|+|.. T Consensus 184 ~yt~~~~~~--~~~~~~~------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n- 224 (451) T protein:vir:10 184 FWTDKILDK--YKFFGVS------------------------------------CCGSQIEHITVQHRFNSVPFVEFSN- 224 (451) T ss_pred EEeCCeEEE--EEecccC------------------------------------ccccccccccccCCCCeeeEEEecc- Confidence 443321110 0000000 0112223333343445555554322 Q ss_pred eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCc-ccccccc Q lcl|NC_020488. 
324 EMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNA-IPGVDRP 402 (688) Q Consensus 324 ~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 402 (688) ...|.|.+..++++++.+|.+.|.+...+.-..++.+++.....+...+..... +..+.+..... ......+ T Consensus 225 ------n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~ 297 (451) T protein:vir:10 225 ------NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL-KRYKTIKTETDSEGDSGGL 297 (451) T ss_pred ------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH-hhCCeEEecCcCCccCCcc Confidence 234679999999999999999999999998888887766432333333332222 22233333221 1222346 Q ss_pred eecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 403 QRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIP 482 (688) Q Consensus 403 ~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~ 482 (688) .++..+.-..++...++.+...|-..|++.+.+.+..+ +.||.|+..+-.............|..+++++.++++.++. T Consensus 298 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g-n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 376 (451) T protein:vir:10 298 KTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFG-NASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG 376 (451) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 66665555677888899999999999998876555443 47999999887777677777777777777776666665431 Q ss_pred HHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020488. 483 RVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGV 562 (688) Q Consensus 483 ~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~ 562 (688) .+ ++ .+|.+.=.+..+....+..+.++.+.. 
.+ T Consensus 377 ~~----------------d~-------------------------~~i~i~f~~~~p~n~~e~~~~~~kl~g------~i 409 (451) T protein:vir:10 377 VT----------------DY-------------------------KKIQQTYTRNMMSNDLEDADIATKSVG------II 409 (451) T ss_pred CC----------------Cc-------------------------cceeEEecCCCCCCHHHHHHHHHHHhc------cC Confidence 10 01 011122223333333334444443322 12 Q ss_pred HHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhh Q lcl|NC_020488. 563 VLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS 608 (688) Q Consensus 563 ~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 608 (688) ....++.++++ ...++..+++.+..... .. +.+..-..-.. T Consensus 410 S~et~~~~~p~v~d~~~e~~~~~ee~~~~---~~--~~~~~~~~~~~ 451 (451) T protein:vir:10 410 PTKIILRHHPWVDDVEEAEKLYLEEKKIQ---AS--KVSDDYNNFTE 451 (451) T ss_pred chHHHHHhCCCCCCHHHHHHHHHHHHHHH---HH--HHHhhcCCCCC Confidence 22334444433 22222222221110000 00 00000000000 No 69 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.82 E-value=2.1e-19 Score=123.04 Aligned_cols=468 Identities=11% Similarity=0.011 Sum_probs=237.3 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYV 78 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i 78 (688) .+|+.+...... .+.+.++ +.+.. 
...+....+..+||.|.|..-.........++| .++.|..+.+| T Consensus 30 ~~~~~e~~~~~~----~~~i~~~---i~~~~---~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (511) T protein:vir:99 30 TYDGTESDLLQN----VNEVSKY---IEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (511) T ss_pred ccchhhhhhhcc----HHHHHHH---HHHHH---HhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHH Confidence 444433211111 1112222 22111 223445566789999987642221222223333 58889999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|+...+++.+.+- |.+ ....+..++..|+++.....+..++++.|.||..++ T Consensus 100 ~~~~~yl~g~p~~~~~~--------------------d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy 155 (511) T protein:vir:99 100 DFINGYFLGNPIQYQDD--------------------DKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (511) T ss_pred HHHHhhhcccCceeecC--------------------chH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEE Confidence 99999999988776431 222 345677777889999999999999999999998887 Q ss_pred EeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) .+. ++.+.+..+ +|..+| ||+... .-...+++.|... .....+. 
T Consensus 156 ~de------d~~~~i~~~-~p~~~~~vyd~~~~-----~~~~~~vr~~~~~----------------------~~~~~~~ 201 (511) T protein:vir:99 156 RNQ------DDETRLYKS-DAMSTFVIYDNTIE-----RNSIAGVRYLRTK----------------------PIDKTDE 201 (511) T ss_pred eCC------CCceEEEEE-ccceeEEEEcCCCC-----CceEEEEEEEEee----------------------ecccCcc Confidence 542 356777666 787764 565321 1122333333211 0000112 Q ss_pred CEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 237 EGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 237 ~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +.+..+++|.......+.. ..+.... .......+.|.+.+.+| T Consensus 202 ~~~~~~~vyt~~~i~~~~~--~~~~~~~-----------------------------------~~~~~~~~~~~~~g~vP 244 (511) T protein:vir:99 202 DEVFTVDLFTSHGVYRYLT--SRTNGLK-----------------------------------LTPRENGFESHSFERMP 244 (511) T ss_pred ceEEEEEEEeCCcEEEEEe--cCCcccc-----------------------------------ccccccccccCCCCccc Confidence 3344456665432211111 0100000 00001223344556677 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec--- Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY--- 393 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--- 393 (688) +|+|.. ...|.|.+..++++++.+|..+|.+...+....++.+++......+..+... ...++.++.. T Consensus 245 vv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~--~~~~~~~~~~~~~ 315 (511) T protein:vir:99 245 ITEFSN-------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK--QKEANVLFLEPTV 315 (511) T ss_pred eEEecC-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcc--cccccceeccccc Confidence 765422 3457899999999999999999999999887776665543322222221110 1111111110 Q ss_pred ----C--cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
394 ----N--AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 394 ----~--~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) . ...+...+.++..+.-..++...++.+.+.|-.+|++.+.+.+.-+++.||+|+..+-.............|. T Consensus 316 ~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~ 395 (511) T protein:vir:99 316 YADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0111223445544444566777888899999999999988777655667999999888777777777777888 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .++++++++++.++...-.. +...++ .+|.+.=.+..+....+..+ T Consensus 396 ~~l~~~~~li~~~~~~~~~~---------~~~~~~-------------------------~~i~i~f~~~~p~n~~e~~~ 441 (511) T protein:vir:99 396 KGLRRRAKLLETILKNTRSI---------DVSKDF-------------------------NTVRYVYNRNLPKSLIEELK 441 (511) T ss_pred HHHHHHHHHHHHHHHhcCCc---------cccccc-------------------------ccceEEeCCCCCcCHHHHHH Confidence 88888888777765432110 000011 01222223344444444444 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAK 626 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q 626 (688) .++.+.. .+....+++++++ ...++-.+++++................... ....+.. ...+.. 
T Consensus 442 ~~~kl~G------iiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~-------~~~~~~ 506 (511) T protein:vir:99 442 AYIDSGG------KISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRN--INDDEQD-------DSTKDS 506 (511) T ss_pred HHHHHhc------cCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCC--CCCCCCC-------CCCcCc Confidence 4444432 1223344554433 3344444444432211000000000000000 0000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 627 ADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 627 ~e~~~~q~e~~~~q~~~~~ 645 (688) .+.. + T Consensus 507 ~d~~--------------e 511 (511) T protein:vir:99 507 IDKK--------------E 511 (511) T ss_pred cccc--------------C Confidence 0000 0 No 70 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.81 E-value=1.3e-18 Score=118.56 Aligned_cols=449 Identities=10% Similarity=0.023 Sum_probs=235.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC--CCHHH-------------HHHHHhcC Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ--WPESV-------------RKEREDEG 65 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q--w~~~~-------------~~~~~~~g 65 (688) |+===+.++ +.+...+.+.++.+ .....+.+..+..+||.|.+ +..-. ....+..+ T Consensus 3 ~~~~~~~~~--~~~~~~e~i~~~i~-------~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:94 3 LYKLIDDIE--AQGILPKHIEALIE-------SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred hHHHHhhcc--ccCCCHHHHHHHHH-------HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 110001111 11122223333222 23334555556666666531 11000 01112344 Q ss_pred CC--ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHH Q lcl|NC_020488. 
66 RP--CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNA 143 (688) Q Consensus 66 ~p--~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~ 143 (688) +| .+++|..+.+|+..+|+...+.+.+.+.+. ....+.+...+..++..|+++.....+ T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~-------------------~~~~e~~~~~l~~~~~~n~~~~~~~~~ 134 (474) T protein:vir:94 74 SVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDEN-------------------AEKNEKLKKFITNFAIRNSVDDEDSEI 134 (474) T ss_pred CcccccccchHHHHHHhHhhheeccceeEeeCCC-------------------CcchHHHHHHHHHHHhhcCHhHHHHHH Confidence 45 588999999999999999999887766221 122344555666677789999999999 Q ss_pred HHHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchh Q lcl|NC_020488. 144 FQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGD 221 (688) Q Consensus 144 ~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 221 (688) ..+++++|.+|..++.+. ++.+.+..+ +|..++ || +.. +..+ +.+.|...+ T Consensus 135 ~~~~~~~G~a~~~~~~d~------~~~~~~~~i-~p~~~~~v~d-~~~-----~~~~-~i~~~~~~~------------- 187 (474) T protein:vir:94 135 GKMAAICGYGARLAYIDT------NGDIRIKNI-DPYNVIFVGD-NIL-----EPTY-SLRYFYEKD------------- 187 (474) T ss_pred HHHHhhcCeEEEEEEeCC------CCeeEEEEE-cccceEEEEc-CCC-----ceEE-EEEEEEEee------------- Confidence 999999999988776431 345666555 677653 43 111 1122 222221100 Q ss_pred cccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchh Q lcl|NC_020488. 222 LSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAY 301 (688) Q Consensus 222 ~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~ 301 (688) ..+...+..+++|..... +.+. .++ .+. 
T Consensus 188 -----------~~~~~~~~~~~~y~~~~~--~~~~-~~~--------------------------------------~~~ 215 (474) T protein:vir:94 188 -----------DDNGTDYVYAEFYDNAYY--YVFR-GEG--------------------------------------IDA 215 (474) T ss_pred -----------CCCceEEEEEEEEcCceE--EEEe-ecC--------------------------------------CCc Confidence 001111223344432211 1100 000 011 Q ss_pred hhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh Q lcl|NC_020488. 302 DVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN 381 (688) Q Consensus 302 ~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 381 (688) ..+.++.|.+.+.+|+|+|. ...+|.|.+..++++++.+|...|.+...+....++.+++....++ ++... T Consensus 216 ~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~--~~~~~ 286 (474) T protein:vir:94 216 LQEVGRYEHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS--EEMIQ 286 (474) T ss_pred ccccccccCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC--chhhh Confidence 11223334444556666532 3456889999999999999999999999998888777665322221 12211 Q ss_pred hcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 382 QANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFA 461 (688) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~ 461 (688) . ....+.+...... ..+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-......... 
T Consensus 287 ~-~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~ 362 (474) T protein:vir:94 287 E-TQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMT 362 (474) T ss_pred h-hhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHH Confidence 1 2234455443322 34556655555577888889999999999999988877666678999999887777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCC-cceeeechhhhcccccceeeeccceeeeEEEEEecccCcHH Q lcl|NC_020488. 462 YIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGE-GDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQT 540 (688) Q Consensus 462 ~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~-~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s 540 (688) ....|..++++++++++.++..-.. ... .++ .|+.+.=.+..+. T Consensus 363 ~~~~~~~~l~~~~~li~~~l~~~~~----------~~~~~~~-------------------------~~i~~~f~~~~p~ 407 (474) T protein:vir:94 363 FERKMTAMLRYQFKVILSALKRKGY----------NLDDDSY-------------------------LNLIFKFTRNIPV 407 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccC----------CCCcccc-------------------------ccceEEeCCCCCC Confidence 7778888888888887776543211 000 000 1223333344444 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHh--hhhhhhhhhHH Q lcl|NC_020488. 541 QRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEE--AGIEPPQPSPE 610 (688) Q Consensus 541 ~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 610 (688) ...+..+.++.+.. .+....+++++++ .+.+...+++++.............. 
...+..+.+-+ T Consensus 408 d~~e~a~~~~kl~g------~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 408 NKLEESQVLINLKG------QVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred CHHHHHHHHHHHhc------cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 44444555554432 1223445555543 45555555554322110000000000 00000000000 No 71 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.81 E-value=1.3e-18 Score=118.56 Aligned_cols=449 Identities=10% Similarity=0.023 Sum_probs=235.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC--CCHHH-------------HHHHHhcC Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ--WPESV-------------RKEREDEG 65 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q--w~~~~-------------~~~~~~~g 65 (688) |+===+.++ +.+...+.+.++.+ .....+.+..+..+||.|.+ +..-. ....+..+ T Consensus 3 ~~~~~~~~~--~~~~~~e~i~~~i~-------~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:10 3 LYKLIDDIE--AQGILPKHIEALIE-------SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred hHHHHhhcc--ccCCCHHHHHHHHH-------HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 110001111 11122223333222 23334555556666666531 11000 01112344 Q ss_pred CC--ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHH Q lcl|NC_020488. 66 RP--CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNA 143 (688) Q Consensus 66 ~p--~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~ 143 (688) +| .+++|..+.+|+..+|+...+.+.+.+.+. 
....+.+...+..++..|+++.....+ T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~-------------------~~~~e~~~~~l~~~~~~n~~~~~~~~~ 134 (474) T protein:vir:10 74 SVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDEN-------------------AEKNEKLKKFITNFAIRNSVDDEDSEI 134 (474) T ss_pred CcccccccchHHHHHHhHhhheeccceeEeeCCC-------------------CcchHHHHHHHHHHHhhcCHhHHHHHH Confidence 45 588999999999999999999887766221 122344555666677789999999999 Q ss_pred HHHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchh Q lcl|NC_020488. 144 FQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGD 221 (688) Q Consensus 144 ~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~ 221 (688) ..+++++|.+|..++.+. ++.+.+..+ +|..++ || +.. +..+ +.+.|...+ T Consensus 135 ~~~~~~~G~a~~~~~~d~------~~~~~~~~i-~p~~~~~v~d-~~~-----~~~~-~i~~~~~~~------------- 187 (474) T protein:vir:10 135 GKMAAICGYGARLAYIDT------NGDIRIKNI-DPYNVIFVGD-NIL-----EPTY-SLRYFYEKD------------- 187 (474) T ss_pred HHHHhhcCeEEEEEEeCC------CCeeEEEEE-cccceEEEEc-CCC-----ceEE-EEEEEEEee------------- Confidence 999999999988776431 345666555 677653 43 111 1122 222221100 Q ss_pred cccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchh Q lcl|NC_020488. 222 LSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAY 301 (688) Q Consensus 222 ~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~ 301 (688) ..+...+..+++|..... +.+. .++ .+. T Consensus 188 -----------~~~~~~~~~~~~y~~~~~--~~~~-~~~--------------------------------------~~~ 215 (474) T protein:vir:10 188 -----------DDNGTDYVYAEFYDNAYY--YVFR-GEG--------------------------------------IDA 215 (474) T ss_pred -----------CCCceEEEEEEEEcCceE--EEEe-ecC--------------------------------------CCc Confidence 001111223344432211 1100 000 011 Q ss_pred hhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh Q lcl|NC_020488. 
302 DVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN 381 (688) Q Consensus 302 ~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 381 (688) ..+.++.|.+.+.+|+|+|. ...+|.|.+..++++++.+|...|.+...+....++.+++....++ ++... T Consensus 216 ~~~~~~~~~~~g~vPvv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~--~~~~~ 286 (474) T protein:vir:10 216 LQEVGRYEHLFDYNPLFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS--EEMIQ 286 (474) T ss_pred ccccccccCCCCccceEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC--chhhh Confidence 11223334444556666532 3456889999999999999999999999998888777665322221 12211 Q ss_pred hcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 382 QANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFA 461 (688) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~ 461 (688) . ....+.+...... ..+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-......... T Consensus 287 ~-~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~ 362 (474) T protein:vir:10 287 E-TQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMT 362 (474) T ss_pred h-hhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHH Confidence 1 2234455443322 34556655555577888889999999999999988877666678999999887777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCC-cceeeechhhhcccccceeeeccceeeeEEEEEecccCcHH Q lcl|NC_020488. 462 YIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGE-GDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQT 540 (688) Q Consensus 462 ~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~-~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s 540 (688) ....|..++++++++++.++..-.. ... .++ .|+.+.=.+..+. 
T Consensus 363 ~~~~~~~~l~~~~~li~~~l~~~~~----------~~~~~~~-------------------------~~i~~~f~~~~p~ 407 (474) T protein:vir:10 363 FERKMTAMLRYQFKVILSALKRKGY----------NLDDDSY-------------------------LNLIFKFTRNIPV 407 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccC----------CCCcccc-------------------------ccceEEeCCCCCC Confidence 7778888888888887776543211 000 000 1223333344444 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHh--hhhhhhhhhHH Q lcl|NC_020488. 541 QRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEE--AGIEPPQPSPE 610 (688) Q Consensus 541 ~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 610 (688) ...+..+.++.+.. .+....+++++++ .+.+...+++++.............. ...+..+.+-+ T Consensus 408 d~~e~a~~~~kl~g------~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 408 NKLEESQVLINLKG------QVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred CHHHHHHHHHHHhc------cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 44444555554432 1223445555543 45555555554322110000000000 00000000000 No 72 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.81 E-value=1.3e-18 Score=118.67 Aligned_cols=447 Identities=9% Similarity=0.008 Sum_probs=234.4 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCH--HHHHHHHhcCCC--ceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPE--SVRKEREDEGRP--CLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~--~~~~~~~~~g~p--~~~~N~i~~ 76 (688) ++|.. .+....+.+.++...+ ..+.+..+.+..+||.|.+-.- ......+..++| .+..|..+. 
T Consensus 22 ~~~~~------~~~~~~~~i~~~i~~~------~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ 89 (481) T protein:vir:10 22 VVSDL------AELLKEENLRNFISRH------QTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKY 89 (481) T ss_pred eeecc------hhhcCHHHHHHHHHHH------HHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHH Confidence 44432 2333333444444332 2334556777889999986421 111122334444 478899999 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +|+..+|+...+.+.+.+ . |. .....+..++..|+++.....+..+++++|.||+. T Consensus 90 ivd~~~~~l~g~~~~~~~--~------------------d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 145 (481) T protein:vir:10 90 VSRFIVGYLTGNPITITH--Q------------------DN----QTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEI 145 (481) T ss_pred HHHHHHhhhccCCceEec--C------------------Ch----hHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEE Confidence 999999999988766543 1 22 22334566677899999999999999999999988 Q ss_pred EEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCC Q lcl|NC_020488. 157 VLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWT 234 (688) Q Consensus 157 v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~ 234 (688) ++.+. ++.+.+..+ +|..++ ||+... .....+.+.|... + . T Consensus 146 ~~~d~------dg~~~i~~~-~p~~~~~v~d~~~~-----~~~~~~i~~~~~~--------~-----------------~ 188 (481) T protein:vir:10 146 VYRDF------EDRDTFKVL-DPKSTFVVYDQTLD-----KKVVAGVRYFEKQ--------D-----------------K 188 (481) T ss_pred EEeCC------CCeEEEEEE-cccceEEEEcCCCC-----CceEEEEEEEEEe--------e-----------------C Confidence 86542 356776665 677764 554321 1122233322210 0 0 Q ss_pred CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCc Q lcl|NC_020488. 
235 NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGST 314 (688) Q Consensus 235 ~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~ 314 (688) +...+..+++|....... +....+. ..+.++.|.+.+. T Consensus 189 ~~~~~~~~~~y~~~~i~~--~~~~~~~----------------------------------------~~~~~~~~~~~g~ 226 (481) T protein:vir:10 189 DKVPVQHVEVYTTDKIYY--IEIKGGT----------------------------------------YHRVEEVEHYYND 226 (481) T ss_pred CCceEEEEEEEecCeEEE--EEecCCc----------------------------------------eeecccccccCCc Confidence 112334455554322110 1111110 0011233434456 Q ss_pred cceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecC Q lcl|NC_020488. 315 IPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYN 394 (688) Q Consensus 315 ~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 394 (688) +|+|+|.. ..+|.|.+..++++++.+|..+|.+...+.....+.+++......+.+.. . ..+.++.+.... T Consensus 227 vPvv~~~n-------~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~-~-~~~~~~~~~~~~ 297 (481) T protein:vir:10 227 VPIIEYLN-------DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDA-K-AFRDANMIHLEP 297 (481) T ss_pred eeEEEeec-------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccch-h-hhhhccceeccc Confidence 67665422 34577999999999999999999999999888888776643322222111 1 111122221111 Q ss_pred -----cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
395 -----AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRA 469 (688) Q Consensus 395 -----~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 469 (688) +......+.++....-...+...++.+...|-.+|++.+.+.|..+++.||.|+..+..............|..+ T Consensus 298 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 377 (481) T protein:vir:10 298 GTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKG 377 (481) T ss_pred cccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011122344444443346677778889999999999999888876667899999877666666666666666677 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) ++++.++++.++... + ...+ ...+|.+.=.+..+....+..+.+ T Consensus 378 l~~~~~li~~~~~~~----------~---~~~~-----------------------~~~~i~v~f~~~~~~~~~~~a~~~ 421 (481) T protein:vir:10 378 LMKRYKLLLNNVNLT----------G---LKQH-----------------------NYAELTITFTPNLPKSMMESINAF 421 (481) T ss_pred HHHHHHHHHHHHhcc----------C---CCcc-----------------------ccceeeEEeCCCCCcCHHHHHHHH Confidence 776666665554211 1 0000 012334444455555555555555 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhh-hhhHHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPP-QPSPEQQA 613 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~q~ 613 (688) +++... +....+++++++ ...++-.+++++..................+. 
......+- T Consensus 422 ~kl~g~------is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 422 NALSGG------VSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred HHHhcc------CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 554322 222344555543 33333344443322111110000000000000 00000000 No 73 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.81 E-value=6.9e-19 Score=120.17 Aligned_cols=459 Identities=11% Similarity=0.054 Sum_probs=231.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-----HHHHhcCCC--ceeehh Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-----KEREDEGRP--CLTLNK 73 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-----~~~~~~g~p--~~~~N~ 73 (688) =+|-.++..++..+...+......+......+..........+..+||.|+|.--.-. ......++| .+++|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:94 6 RMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred cccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecch Confidence 3566665554433333333333334444444545555566677889999986321100 011123344 378999 Q ss_pred HHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 
74 LPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 74 i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) .+.+|+..+++...+.+.+.+ .|....+ .++.+.+ |+++.....+..+++++|.| T Consensus 86 ~k~Ivd~~~~~l~g~p~~~~~--------------------~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~ 140 (474) T protein:vir:94 86 HQNLVDQKVSYVASKPVTYSC--------------------EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGID 140 (474) T ss_pred HHHHHHHHHhhhhcCCceecc--------------------CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCce Confidence 999999999999998877643 1333333 3444444 78999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccc Q lcl|NC_020488. 154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYS 231 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 231 (688) |..++.+. ++.+.+..+ +|..++ ||+... .+..+ +++.|... + T Consensus 141 ~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~~-~ir~~~~~--------~--------------- 185 (474) T protein:vir:94 141 WLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELKS-FIRYYKFN--------N--------------- 185 (474) T ss_pred EEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec--------C--------------- Confidence 98876531 345666665 677764 554321 12222 33333210 0 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) ...+++|.......+. ..++........ 
....+.....|.+ T Consensus 186 -------~~~~~~yt~~~~~~y~--~~~~~~~~~~~~------------------------------~~~~~~~~~~~~~ 226 (474) T protein:vir:94 186 -------EEKVEFWTDTTVTYYV--LENGGLIPDYYY------------------------------GANHVQSHFSNGN 226 (474) T ss_pred -------eEEEEEEeCCeEEEEE--EcCCcccccccc------------------------------CcCcccccccccC Confidence 0012334332221111 111111110000 0011112223334 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) .+.+|+|+|. +..+|.|.+..++++++.+|...|.+.+.+.....+.+++.....++..++.... . ...++ T Consensus 227 ~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~-~-~~~~i 297 (474) T protein:vir:94 227 WGRVPFIAFK-------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL-K-YYKAI 297 (474) T ss_pred CCccceEEec-------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh-h-cccee Confidence 4566666542 2356889999999999999999999999998888888776654444444433321 1 22223 Q ss_pred ecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
392 RYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) ...+ ...+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-.............|..+++ T Consensus 298 ~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:94 298 NVDG---DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred eccC---CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 2235555555455667778899999999999998877766566789999987766666666666666666666 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++ ++... ..++. +|.|.=.++.+....+..+.+.+ T Consensus 375 ~~~~li~----~~~~~-----------~~d~~-------------------------~i~v~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:94 375 ELISFII----DFNNL-----------KTDVK-------------------------DIEISFNFNRMMNDAEQSQIIAQ 414 (474) T ss_pred HHHHHHH----HHhCC-----------Ccccc-------------------------eeeEEeccCcccCHHHHHHHHHH Confidence 6555544 44321 00110 11111112222222222333222 Q ss_pred HHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD 628 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e 628 (688) . +.+....++.++++ ...+.-.+++++................ ......... .+.. 
+.| T Consensus 415 ~-------g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~--~~~~~~~~~-------~~~~--~~e 474 (474) T protein:vir:94 415 S-------QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG--ADGAQQQEG-------SNNK--ESE 474 (474) T ss_pred c-------CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC--CCCcccCCC-------Cccc--ccC Confidence 1 11223344444432 3333334444322111000000000000 000000000 0000 000 No 74 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.81 E-value=6.9e-19 Score=120.17 Aligned_cols=459 Identities=11% Similarity=0.054 Sum_probs=231.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-----HHHHhcCCC--ceeehh Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-----KEREDEGRP--CLTLNK 73 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-----~~~~~~g~p--~~~~N~ 73 (688) =+|-.++..++..+...+......+......+..........+..+||.|+|.--.-. ......++| .+++|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:97 6 RMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred cccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecch Confidence 3566665554433333333333334444444545555566677889999986321100 011123344 378999 Q ss_pred HHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCc Q lcl|NC_020488. 
74 LPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFG 153 (688) Q Consensus 74 i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G 153 (688) .+.+|+..+++...+.+.+.+ .|....+ .++.+.+ |+++.....+..+++++|.| T Consensus 86 ~k~Ivd~~~~~l~g~p~~~~~--------------------~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~ 140 (474) T protein:vir:97 86 HQNLVDQKVSYVASKPVTYSC--------------------EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGID 140 (474) T ss_pred HHHHHHHHHhhhhcCCceecc--------------------CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCce Confidence 999999999999998877643 1333333 3444444 78999999999999999999 Q ss_pred eEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccc Q lcl|NC_020488. 154 WLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYS 231 (688) Q Consensus 154 ~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~ 231 (688) |..++.+. ++.+.+..+ +|..++ ||+... .+..+ +++.|... + T Consensus 141 ~~~~~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~~~-~ir~~~~~--------~--------------- 185 (474) T protein:vir:97 141 WLQVYINE------NGEMKLFRV-PAEQAIPIWVDKER----EELKS-FIRYYKFN--------N--------------- 185 (474) T ss_pred EEEEEecC------CCeeEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEec--------C--------------- Confidence 98876531 345666665 677764 554321 12222 33333210 0 Q ss_pred cCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCC Q lcl|NC_020488. 232 WWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWP 311 (688) Q Consensus 232 ~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~ 311 (688) ...+++|.......+. ..++........ 
....+.....|.+ T Consensus 186 -------~~~~~~yt~~~~~~y~--~~~~~~~~~~~~------------------------------~~~~~~~~~~~~~ 226 (474) T protein:vir:97 186 -------EEKVEFWTDTTVTYYV--LENGGLIPDYYY------------------------------GANHVQSHFSNGN 226 (474) T ss_pred -------eEEEEEEeCCeEEEEE--EcCCcccccccc------------------------------CcCcccccccccC Confidence 0012334332221111 111111110000 0011112223334 Q ss_pred CCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCcee Q lcl|NC_020488. 312 GSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVL 391 (688) Q Consensus 312 ~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 391 (688) .+.+|+|+|. +..+|.|.+..++++++.+|...|.+.+.+.....+.+++.....++..++.... . ...++ T Consensus 227 ~g~vPvv~~~-------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~-~-~~~~i 297 (474) T protein:vir:97 227 WGRVPFIAFK-------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL-K-YYKAI 297 (474) T ss_pred CCccceEEec-------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh-h-cccee Confidence 4566666542 2356889999999999999999999999998888888776654444444433321 1 22223 Q ss_pred ecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
392 RYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) ...+ ...+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-.............|..+++ T Consensus 298 ~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:97 298 NVDG---DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred eccC---CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 2235555555455667778899999999999998877766566789999987766666666666666666666 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++ ++... ..++. +|.|.=.++.+....+..+.+.+ T Consensus 375 ~~~~li~----~~~~~-----------~~d~~-------------------------~i~v~f~~~~p~~~~e~a~~~~~ 414 (474) T protein:vir:97 375 ELISFII----DFNNL-----------KTDVK-------------------------DIEISFNFNRMMNDAEQSQIIAQ 414 (474) T ss_pred HHHHHHH----HHhCC-----------Ccccc-------------------------eeeEEeccCcccCHHHHHHHHHH Confidence 6555544 44321 00110 11111112222222222333222 Q ss_pred HHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD 628 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e 628 (688) . +.+....++.++++ ...+.-.+++++................ ......... .+.. 
+.| T Consensus 415 ~-------g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~--~~~~~~~~~-------~~~~--~~e 474 (474) T protein:vir:97 415 S-------QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG--ADGAQQQEG-------SNNK--ESE 474 (474) T ss_pred c-------CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC--CCCcccCCC-------Cccc--ccC Confidence 1 11223344444432 3333334444322111000000000000 000000000 0000 000 No 75 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.81 E-value=2.6e-19 Score=122.52 Aligned_cols=473 Identities=11% Similarity=0.059 Sum_probs=239.8 Q ss_pred CCCCCCCcCCC--------CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHH-H-------HHhc Q lcl|NC_020488. 1 MLPGNEPIKTR--------DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRK-E-------REDE 64 (688) Q Consensus 1 ~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~-~-------~~~~ 64 (688) ++|-....-.. ..+..+..... +.+..+.+ .+....+..+||.|+|.-..-.. . .... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 77 (503) T protein:vir:59 4 IYPLGKTHTEELNEIIVESAKEIAEPDTTM----IQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDD 77 (503) T ss_pred cccCChhhHHhHHHhhhhhhhhccchhHHH----HHHHHHhh--cHHHHHHHHHHhccccchhhccchhccccccccccc Confidence 55555422221 11111111122 22222222 23556778899999874211111 0 1112 Q ss_pred CCC--ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHH Q lcl|NC_020488. 65 GRP--CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDN 142 (688) Q Consensus 65 g~p--~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~ 142 (688) ++| .++.|..+.+|+..+++...+.+.+.+ .|.+..+ .++... .|+++..... 
T Consensus 78 ~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~--------------------~d~~~~~----~l~~~~-~n~~~~~~~~ 132 (503) T protein:vir:59 78 TKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTS--------------------DNKTLLE----YVNELA-DDDFDDILNE 132 (503) T ss_pred ccccceeecchHHHHHHHHHhhhhcCCeeecc--------------------CcHHHHH----HHHHHH-hcCHHHHHHH Confidence 233 567899999999999999998876532 1333333 344444 4789999999 Q ss_pred HHHHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccch Q lcl|NC_020488. 143 AFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVG 220 (688) Q Consensus 143 ~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~ 220 (688) +..+++++|.||+.++++. ++.+.+..+ +|..++ ||+... .+. .++++.|... . T Consensus 133 ~~~~~~~~G~~~~~v~~d~------dg~~~i~~~-~p~~~~~i~d~~~~----~~~-~~~ir~~~~~--------~---- 188 (503) T protein:vir:59 133 TVKNMSNKGIEYWHPFVDE------EGEFDYVIF-PAEEMIVVYKDNTR----RDI-LFALRYYSYK--------G---- 188 (503) T ss_pred HHHHHhhCCeEEEEEeecC------CCceEEEEE-ccceeEEEEeCCCC----Cce-EEEEEEEEEe--------c---- Confidence 9999999999998887642 356777766 677764 665321 112 2233333211 0 Q ss_pred hcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEch Q lcl|NC_020488. 221 DLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTA 300 (688) Q Consensus 221 ~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~ 300 (688) .+.+.+..+|+|.......+ ...++.+......... .. . T Consensus 189 -------------~~~~~~~~~evy~~~~i~~~--~~~~~~~~~~~~~~~~--------------~~------------~ 227 (503) T protein:vir:59 189 -------------IMGEETQKAELYTDTHVYYY--EKIDGVYQMDYSYGEN--------------NP------------R 227 (503) T ss_pred -------------CCCceEEEEEEEeCCcEEEE--EEcCCccccccccccc--------------cc------------c Confidence 01123444566655433221 1122211111000000 00 0 Q ss_pred hhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHH Q lcl|NC_020488. 
301 YDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEW 380 (688) Q Consensus 301 ~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 380 (688) ..+.....|.+.+.+|+++|.. ...|.|.+..++++++.+|.++|.+.+.+....++.+++.....++..+.. T Consensus 228 ~~~~~~~~~~~~~~vPiv~~~n-------n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~ 300 (503) T protein:vir:59 228 PHMTKGGQAIGWGRVPIIPFKN-------NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFT 300 (503) T ss_pred cceeecceeccCCccceEEecC-------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhh Confidence 0112233455567777776522 345789999999999999999999999998888887776543333333332 Q ss_pred hhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020488. 381 NQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTF 460 (688) Q Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~ 460 (688) ... ....++..++. ..+.++....-...+...++.+...|...+++.+.+.+.-+++.||.|+..+......... T Consensus 301 ~~~--~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~ 375 (503) T protein:vir:59 301 ANL--RYHSVIKVSGD---GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKAN 375 (503) T ss_pred hhh--hcccceeccCC---CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHH Confidence 221 12223333322 2344444444446677788888899999998888776665667899999888766666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHH Q lcl|NC_020488. 461 AYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQT 540 (688) Q Consensus 461 ~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s 540 (688) .....|..+++++.++++.++...... .+. ...+|.|.=.+..+. 
T Consensus 376 ~~~~~~~~~l~~~~~~i~~~~~~~~~~-------------~~~----------------------~~~~i~i~f~~~~p~ 420 (503) T protein:vir:59 376 MAERKIRAGLRLFFWFFAEYLRNTGKG-------------DFN----------------------PDKELTMTFTRTRIQ 420 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCc-------------ccc----------------------cccceeEEeCCCCCC Confidence 777777777777766666655332210 000 001233333344444 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhcccccc------chhhHHhhhhhhhhhhHHHHH Q lcl|NC_020488. 541 QRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGIL------DQDEMEEAGIEPPQPSPEQQA 613 (688) Q Consensus 541 ~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~q~ 613 (688) ...+..+.++.+.+. +-+....+++++++ .+.++-.+++.+....... ........+. +..+.+.+.. T Consensus 421 d~~~~~~~~~kl~~~----GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 495 (503) T protein:vir:59 421 NDSEIVQSLVQGVTG----GIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLE-EDDPNAGAAE 495 (503) T ss_pred CHHHHHHHHHHHHhC----CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCC-cCCCCCCccc Confidence 455555656555432 11223344455433 3344444444322110000 0000000000 0000000000 Q ss_pred HHHHHHHH Q lcl|NC_020488. 614 NMAQAQAD 621 (688) Q Consensus 614 ~~~~~q~~ 621 (688) .....|+. T Consensus 496 ~~~~g~~~ 503 (503) T protein:vir:59 496 SGGAGQVS 503 (503) T ss_pred CCCCCCcC Confidence 00000000 No 76 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.81 E-value=3.2e-18 Score=116.50 Aligned_cols=462 Identities=13% Similarity=0.058 Sum_probs=234.4 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCH-------HHHHHHHhcCCC--ceee Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPE-------SVRKEREDEGRP--CLTL 71 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~-------~~~~~~~~~g~p--~~~~ 71 (688) ++|...-...-..+....+.+.+..... . .......+..+||.|+|=-. ......+...+| .++. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~----~--~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~ 79 (479) T protein:vir:79 6 ISETDLIKVQLKKESTINLVKVIEHYIL----K--HRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAIN 79 (479) T ss_pred ecccceEeeccccCChhHHHHHHHHHHh----h--hhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeec Confidence 7787774444444444444444433221 1 12345677789999975110 000111122233 5889 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) |..+-+|+..+|+...+.+.+.+ .|.+. ..+++... .|+++.....+..++++.| T Consensus 80 ~~~~~Ivd~~~~~l~g~p~~~~~--------------------~~~~~----~~~~~~~~-~n~~~~~~~~~~~~~~~~G 134 (479) T protein:vir:79 80 NYHKLLVDQKVGYSVGNPIVFNA--------------------DDDNL----TKLLNDLL-GEEFDDTITELYLNASNKG 134 (479) T ss_pred chHHHHHHHHHhhhhcCCceecc--------------------CCHHH----HHHHHHHH-hcCHHHHHHHHHHHHHhcC Confidence 99999999999999998877643 12222 23344433 4799999999999999999 Q ss_pred CceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccc Q lcl|NC_020488. 152 FGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGE 229 (688) Q Consensus 152 ~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~ 229 (688) .||..++++. ++.+++..+ +|..++ ||+... .. 
.-++++.|...+ T Consensus 135 ~~~~~v~~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~-~~~~ir~y~~~~--------------------- 181 (479) T protein:vir:79 135 VEWLHPYINR------KGEFKYVII-PAEEAIPIWDSKRQ----RE-LVAFIRFYYIED--------------------- 181 (479) T ss_pred eEEEEEEeCC------CCceEEEEE-ccceeEEEEeCCCC----Cc-eEEEEEEEEEee--------------------- Confidence 9998887541 355777666 787764 554321 11 222333332110 Q ss_pred cccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCC Q lcl|NC_020488. 230 YSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVD 309 (688) Q Consensus 230 ~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p 309 (688) .+.+.+..+|+|......... ...+........... ...........+..+.| T Consensus 182 ----~~~~~~~~~e~y~~~~i~~~~--~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~ 234 (479) T protein:vir:79 182 ----IDGNKIKRVEYYTENDITYFI--ERGNSFIQEFLYDEY---------------------GKMTDIQEGHFRINNKE 234 (479) T ss_pred ----cCCceEEEEEEEeCCcEEEEE--ecCCccccccccccc---------------------ccccccccccccccccc Confidence 011223334555443222111 111111110000000 00000011112233444 Q ss_pred CCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCc Q lcl|NC_020488. 310 WPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQS 389 (688) Q Consensus 310 ~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 389 (688) .+.+.+|||+|. +..+|.|.+..++++++.+|...|.+.+.+....++.+++.........+.... 
.+..+ T Consensus 235 ~~~~~vPvv~~~-------nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~-~~~~~- 305 (479) T protein:vir:79 235 QGWGKVPFIPFK-------NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDN-IRYYK- 305 (479) T ss_pred cCCCcccEEEec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhh-hhhcc- Confidence 445666666542 234678999999999999999999999999888888766543222222222211 12222 Q ss_pred eeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 390 VLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRA 469 (688) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 469 (688) ++..++ ...+.++..+.-..++...++.+...|-..|++.+...+..+| .||.|+..+..............|..+ T Consensus 306 ~i~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 381 (479) T protein:vir:79 306 SIKVDG---GGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGD-KSGVALKFLYSLLDLKCSKTEKKFKKA 381 (479) T ss_pred ceecCC---CCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 232222 2335555554445667778888999999999988887776544 699999887666666666666666666 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) ++.+.++++.++... +. 
...+ ..++.|.=.+..+....+..+.+ T Consensus 382 l~~~~~li~~~~~~~----------~~-~~~~-------------------------~~~i~i~f~~~~p~~~~~~a~~~ 425 (479) T protein:vir:79 382 IRELLWFVCEYLKIS----------GN-KSYD-------------------------YKTVQITFNHSMIINEAEKIDMA 425 (479) T ss_pred HHHHHHHHHHHHhcc----------CC-Cccc-------------------------cccceEEeCCCCCcCHHHHHHHH Confidence 666666655543211 10 0000 12333333344444344444444 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD 628 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e 628 (688) +.+.. .+....+++++++ ...++-.+++++.............. ..... T Consensus 426 ~kl~g------~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~--~~~~~---------------------- 475 (479) T protein:vir:79 426 AKSTG------IVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPN--NQDGV---------------------- 475 (479) T ss_pred HHHhc------cCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCc--ccCCC---------------------- Confidence 44322 1223344455443 23333333333221100000000000 00000 Q ss_pred HHHHHHHHH Q lcl|NC_020488. 629 TAKAQADMA 637 (688) Q Consensus 629 ~~~~q~e~~ 637 (688) .+.. T Consensus 476 -----~~e~ 479 (479) T protein:vir:79 476 -----IDET 479 (479) T ss_pred -----cCcC Confidence 0000 No 77 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.80 E-value=2.2e-18 Score=117.37 Aligned_cols=456 Identities=12% Similarity=0.061 Sum_probs=230.2 Q ss_pred CCCCCCCcCCCCccchHHHHHH-------HHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHH-----HHHHHHhcCCC- Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQE-------IRERAAHAVTCWKHNFDAAQEDISFLAGEQWPES-----VRKEREDEGRP- 67 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~-----~~~~~~~~g~p- 67 (688) |+--+-|+..+-.. +.+.. ..+.+.+..+.+...+....+..+||.|.|=-.. ........++| T Consensus 1 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (478) T protein:vir:10 1 MISINWPWDKPYHE---QVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPD 77 (478) T ss_pred CccccccCCchhhh---HHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccccccccccc Confidence 44443333332221 11111 1122233333444556667778899999751000 00111123334 Q ss_pred -ceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Q lcl|NC_020488. 68 -CLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQH 146 (688) Q Consensus 68 -~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d 146 (688) .+++|..+.+|+..+|+...+.+.+.+ .|.+..+. +..++ .|+++.....+..+ T Consensus 78 ~ki~~n~~k~ivd~~~~yl~g~p~~~~~--------------------~~~~~~~~----l~~~~-~n~~~~~~~~~~~~ 132 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYAVANPVTFGV--------------------DNDKALKQ----IQHTL-NHKWDDKLVDILTA 132 (478) T ss_pred ceeccchHHHHHHHHhhhhcccCceeec--------------------CChHHHHH----HHHHH-hccHHHHHHHHHHH Confidence 377999999999999999999877643 12233333 33333 37899999999999 Q ss_pred HHHcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccc Q lcl|NC_020488. 147 AVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSD 224 (688) Q Consensus 147 ~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~ 224 (688) +++.|.||..|+++. ++.+.+..+ +|..++ ||+... .+..+ +++.+-... 
T Consensus 133 ~~~~G~~~~~v~~d~------~~~~~~~~~-~p~~~~~v~d~~~~----~~~~~-~ir~~~~~~---------------- 184 (478) T protein:vir:10 133 ASNKGIEWVQPYVDE------EGEFKTFRV-PAEQAVPIWTNKER----DELQA-FIRVYELDG---------------- 184 (478) T ss_pred HhhCCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----CceEE-EEEEEeeeC---------------- Confidence 999999998887652 346777666 677754 554321 12222 222222100 Q ss_pred ccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhc Q lcl|NC_020488. 225 AERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVL 304 (688) Q Consensus 225 ~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~il 304 (688) ...+++|....... +...++..++...... .. .....+ T Consensus 185 --------------~~~~~~y~~~~i~~--~~~~~~~~~~~~~~~~-------------------------~~-~~~~~~ 222 (478) T protein:vir:10 185 --------------AERVEYWTKDDVTF--YELKEGQLIPDFYRSE-------------------------DH-IQPHYY 222 (478) T ss_pred --------------ceEEEEEeCCcEEE--EEecCCeeeccccccc-------------------------cc-ccccee Confidence 00123332221111 1111222111100000 00 001112 Q ss_pred ccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcc Q lcl|NC_020488. 305 EGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQAN 384 (688) Q Consensus 305 e~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 384 (688) .+..|.+.+.+|+|+|.. ...|.|.+..++++++.+|.++|.+.+.+.....+.+++.....++..+..... T Consensus 223 ~~~~~~~~g~vPvv~~~n-------~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~- 294 (478) T protein:vir:10 223 QGNKLMSWGRVPFIPFKN-------NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNL- 294 (478) T ss_pred cccccccCCcceEEEecc-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhh- Confidence 334455667777776532 345789999999999999999999999998888776665432223222222211 Q ss_pred cCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
385 RKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYID 464 (688) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~d 464 (688) ...+++.+ .+.. +..+.++....-..++...++.+.+.|-..|++.+.+.+..+++.||.|+..+-............ T Consensus 295 ~~~~~~~~-~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~ 372 (478) T protein:vir:10 295 KYYKAISV-AGES-GSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKN 372 (478) T ss_pred hhCceeEe-cCCC-CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 12233333 2221 234555555555667778889999999999998887777666678999998887666666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHH Q lcl|NC_020488. 465 NLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRME 544 (688) Q Consensus 465 n~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~ 544 (688) .|..+++++.++++. ++.. ..++ .+|.|.=.+..+....+ T Consensus 373 ~~~~~l~~~~~li~~----~~~~-----------~~d~-------------------------~~i~i~f~~~~p~~~~e 412 (478) T protein:vir:10 373 KTLTALQELLQYIID----FYRL-----------DVRV-------------------------QDIEITFNFNVMVNELE 412 (478) T ss_pred HHHHHHHHHHHHHHH----HhCC-----------Cccc-------------------------ccceEEeCCCCCCCHHH Confidence 666677666555544 4310 0011 11222223344433333 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhH-----HhhhhhhhhhhHH Q lcl|NC_020488. 545 AADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEM-----EEAGIEPPQPSPE 610 (688) Q Consensus 545 ~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 610 (688) ..+.++.+.. .+....+++.+++ ...+.-.+++++............ 
.....+....+++ T Consensus 413 ~~~~~~~~~g------~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 413 NSQIAMNSTG------LLSKETILGNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred HHHHHHHHhC------CCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCcccccccCcCCCCC Confidence 4444443322 1223334444432 334443444432221100000000 0000000000000 No 78 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.80 E-value=3e-17 Score=111.14 Aligned_cols=500 Identities=8% Similarity=-0.018 Sum_probs=246.5 Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhC-----Cc Q lcl|NC_020488. 16 QEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQN-----RP 90 (688) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~-----r~ 90 (688) ++ ..+..+|.+.. .+.|...|++..+|..-.-..+.- ..-..++. ...-..-...+++..+.+... ++ T Consensus 1 mk---~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~-~~~~~~~~-~~~dstg~~a~~~LAa~l~~~ltpp~~~ 73 (510) T protein:vir:78 1 MK---STAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPM-SGSRGVVE-HDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) T ss_pred Ch---hHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCC-Cccccccc-CcccchHHHHHHHHHHHHHHhhcCCCCc Confidence 33 33344444332 557888899888888532111100 00001111 122233334444444444433 22 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) =+++.+.+....... 
.......+- .+.-+..+..+...+..|++..+...++.+.+..|.+++.+ + ++ .+ T Consensus 74 WF~l~~~d~~~~~~~-~~~~~~~~v-~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~---~~--~~- 143 (510) T protein:vir:78 74 FFRSELTDAIRREAD-SRDTDITEV-TAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--N---SD--EA- 143 (510) T ss_pred ccccCCChHHhhhcc-cCcchHHHH-HHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--e---CC--CC- Confidence 233333221100000 000000001 11223445556666778999999999999999889876543 2 11 11 Q ss_pred eeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeec Q lcl|NC_020488. 171 LCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPV 250 (688) Q Consensus 171 ~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~ 250 (688) .++.+ +..++++..++. . ...-++++..||..++.+.||..... .... ....+.|.|+.+.++++. T Consensus 144 -~~~~~-pl~~y~v~~d~~-G---~vd~i~rr~~~t~~~l~~~~~~~~~~---~~~~-----~~~~~~v~v~~~V~~~~~ 209 (510) T protein:vir:78 144 -TVVAW-SLRSYAVRRDAT-G---RWMDIVLKQRYKSKDLDDVYKQDLMR---AGRN-----LSGSGSVDLYTHVQRRKG 209 (510) T ss_pred -eEEEE-EcceeEEeeCCC-c---CeeEEEeeeeccHHHHHHHhhHHhhh---hhhc-----cCCCceEEEEEEEEeecC Confidence 23333 345566554332 1 22347888999999999999864321 1111 112345667666655421 Q ss_pred ceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCc Q lcl|NC_020488. 251 TRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDK 330 (688) Q Consensus 251 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~ 330 (688) . ....+.||+- ..|..++. .+-|++.++||+|+.+ ...+|. 
T Consensus 210 ~-----------------------------------~~~~~sv~~e-~dg~~i~~-~~~~~~~e~P~~~~Rw--~~~~ge 250 (510) T protein:vir:78 210 T-----------------------------------AMDYAEMYHE-IDGVRVGE-TGRWPIHLCPYIVPTW--NLAPGE 250 (510) T ss_pred C-----------------------------------CCcEEEEEEE-ecCeeecc-ccccccccCCeeeeee--eecCCC Confidence 1 0112233332 34444442 3456778899997644 567999 Q ss_pred ccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceec--CCC Q lcl|NC_020488. 331 TYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRD--MPA 408 (688) Q Consensus 331 ~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 408 (688) .||.|.+....+--+.+|++....+.......++.++++.+.+.+++.+.. ..+|.++. + .. +.+..+ ... T Consensus 251 ~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~---~~~g~~v~-g-~~--~~v~~~~~~~~ 323 (510) T protein:vir:78 251 HYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEMGDYVP-G-GA--EAVRAYERGDY 323 (510) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhcc---CCCceeec-C-Cc--ccccccccCcc Confidence 999999999999999999999999999999999999999888776654432 22344432 1 11 223322 222 Q ss_pred cchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCc Q lcl|NC_020488. 409 SMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYDS 487 (688) Q Consensus 409 ~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~~ 487 (688) .--....+.++...+.|.+..=+ + ....++...|++.|..+.+.....|...+.++. .+...+.+..+.++.... 
T Consensus 324 ~d~~~~~~~i~~~~~rI~~aF~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g-- 399 (510) T protein:vir:78 324 NKMAAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL-- 399 (510) T ss_pred cchHHHHHHHHHHHHHHHHHHhh-c-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc-- Confidence 33455567777788888775411 1 111234457999999999999999998888875 566777777766664421 Q ss_pred ceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_020488. 488 DRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLI 567 (688) Q Consensus 488 ~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~ 567 (688) ++-+ + ++ .+ +-.+ -++ -.+--|.+..+.+..+++.+....+. T Consensus 400 --l~p~--------------p---~~--------~~---~~~~--v~~-is~Laraq~~~~l~~~~q~l~~~~~~----- 441 (510) T protein:vir:78 400 --LQGL--------------I---TK--------QH---KPAI--ETG-LPALSRSAAVQSMLNASQVIAGLAPI----- 441 (510) T ss_pred --CCCC--------------C---cc--------cc---ccee--eec-ccHHHHHHHHHHHHHHHHHHHHhcCh----- Confidence 0000 0 00 00 0011 111 11223344444455444443322221 Q ss_pred HHhcCCccHHHHHHHHHhhccccc--cchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 568 AKNMDWPGAQDIARRLQKTLPPGI--LDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAE 645 (688) Q Consensus 568 ~e~~~~~~~~ei~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~ 645 (688) .+..+.-+.+++.+.+....+-.. .-..+.+.++. .+++.+++++++..|+.+... T Consensus 442 ~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~--~~~~~~q~~~~~~~~~a~~~~-------------------- 499 (510) T protein:vir:78 442 AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAE--AEEQRRQAAQAQAAQETLLEG-------------------- 499 (510) T ss_pred hhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHH--HHHHHHHHHHHHHHHHHHHHh-------------------- Confidence 111222356777777665555211 11111100000 000000000000000000000 Q ss_pred HHHHHHHHHHHH Q lcl|NC_020488. 646 AQAKLAEIEQAA 657 (688) Q Consensus 646 ~~a~~~~~~~~a 657 (688) .++++...... 
T Consensus 500 -~~~~~~~~~g~ 510 (510) T protein:vir:78 500 -ASDMTNALAGV 510 (510) T ss_pred -hhhhcccCCCC Confidence 00111111110 No 79 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.79 E-value=2.9e-18 Score=116.75 Aligned_cols=452 Identities=12% Similarity=0.032 Sum_probs=231.1 Q ss_pred CCCCCCCcCCCCc-----cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC--CCHHH---HHHHHhcCCC--c Q lcl|NC_020488. 1 MLPGNEPIKTRDD-----DSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ--WPESV---RKEREDEGRP--C 68 (688) Q Consensus 1 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q--w~~~~---~~~~~~~g~p--~ 68 (688) |+|-++-..+... +...++. .+.+.+..+...+.+....+..+||.|+| |.... ........+| . T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~e~~---~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~k 88 (483) T protein:vir:12 12 LYPSQPTQTEIFDAIVRTNNKPETL---EEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR 88 (483) T ss_pred eecCcchhhhhhhcccccCCchhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc Confidence 5554442221100 0111122 22333444445556677888889999975 11000 0011112222 4 Q ss_pred eeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAV 148 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~ 148 (688) ++.|..+.+|+..+|++..+.+.+.+ .|.+..+. 
++.++ .|+++.....+..+++ T Consensus 89 i~~n~~k~Ivd~~~~~l~G~p~~~~~--------------------~d~~~~~~----l~~~~-~n~~~~~~~~~~~~~~ 143 (483) T protein:vir:12 89 MITNFHANLVDQKVSYIVGKPIAFKH--------------------TDDEVVKR----IDEVL-GNRFDDKLHSVLTGAS 143 (483) T ss_pred cccchHHHHHHHHhhhhcccCceecc--------------------CChHHHHH----HHHHH-hccHHHHHHHHHHHHh Confidence 77999999999999999988766532 13333333 33333 3678999999999999 Q ss_pred HcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE 226 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 226 (688) ++|.||..++.+. ++.+.+..+ +|..++ ||+... .+.. .+++.|...+ T Consensus 144 ~~G~~y~~v~~d~------d~~~~i~~~-~p~~~~~v~d~~~~----~~~~-~~ir~~~~~~------------------ 193 (483) T protein:vir:12 144 NKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH----EELE-AFIRMYKLEN------------------ 193 (483) T ss_pred hCCeEEEEEEEcC------CCceEEEEE-cccceEEEEcCCCC----CceE-EEEEEEEeec------------------ Confidence 9999998887542 355676666 787764 564322 1222 2333332110 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) .. -+++|....... +....+..+.... .. ....... T Consensus 194 ---------~~---~~~~y~~~~v~~--~~~~~~~~~~~~~-~~-----------------------------~~~~~~~ 229 (483) T protein:vir:12 194 ---------ET---KVEYWDKVTVNY--YVYENGSLIPDYS-NN-----------------------------LENSKTH 229 (483) T ss_pred ---------ce---EEEEEecCeEEE--EEEeCCeeeeccc-cc-----------------------------ccccccc Confidence 00 023332221111 1111121111000 00 0000111 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 
307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..|.+.+.+|+|+|.. ...|.|.+..++++++.+|..+|.+.+.+...+.+.+++.....++..+.... .+. T Consensus 230 ~~~~~~g~vPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-~~~ 301 (483) T protein:vir:12 230 FSTGSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRY 301 (483) T ss_pred cccCCCCccceEEecC-------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHh-hhh Confidence 2233445666665422 34577999999999999999999999999888888776644333333332221 222 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) .+++.. .. ...+.++..+.-..++...++.+.+.|-..|++.+.+.+.-+++.||.|+..+-...........+.| T Consensus 302 ~~~~~~-~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f 377 (483) T protein:vir:12 302 YGAIKV-SD---NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 377 (483) T ss_pred cccccc-CC---CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 21 22345555444456677888999999999999988887776667899999888766666666677677 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..+++++.+++++++ .. .+ ++ .++.|.=.+..+....+.. 
T Consensus 378 ~~~l~~~~~li~~~~----~~------~~-----~~-------------------------~~i~v~f~~~~p~~~~~~a 417 (483) T protein:vir:12 378 KVAIQELLWFVFEHF----DI------KG-----EH-------------------------KDVDISFNYNKVANTELQV 417 (483) T ss_pred HHHHHHHHHHHHHHh----cC------CC-----cc-------------------------ceeeEEeCCCCCCCHHHHH Confidence 777777666655543 10 00 11 1222333344444444455 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcC-CccHHHHHHHHHhhccccccchh---hHHh--hhhhhhhhhHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMD-WPGAQDIARRLQKTLPPGILDQD---EMEE--AGIEPPQPSPEQQ 612 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~-~~~~~ei~~~~~~~~~~~~~~~~---~~~~--~~~~~~~~~~~~q 612 (688) +.++.+... +....++++++ ..+.+.-.+++++.......... .... .++.....+.+++ T Consensus 418 ~~~~kl~Gi------iS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 418 QTAQQSMGI------VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 555544322 22334455544 33444444444332211000000 0000 0000000000000 No 80 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.79 E-value=8.2e-18 Score=114.27 Aligned_cols=456 Identities=9% Similarity=-0.033 Sum_probs=231.2 Q ss_pred CCCCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCC--ceeehhHHHH Q lcl|NC_020488. 2 LPGNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRP--CLTLNKLPQY 77 (688) Q Consensus 2 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p--~~~~N~i~~~ 77 (688) |++..-..-+ +.....+.+.++..++. ...+....+-.+||.|+| +-..... 
...++| .++.|..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~------~~~~~r~~~~~~yy~g~~--~i~~~~~~~~~~~~~~ki~~n~~~~i 72 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFK------AEQLERLKELKRYYLGDN--NIKYRPAKTDKYAADNRIASDFAKYI 72 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcccC--ccccccccccccCCcceeecchHHHH Confidence 4444422222 12222333444444332 122344566778999986 1111111 122333 5889999999 Q ss_pred HHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 78 VDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 78 i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) |+..+|+...+.+.+.+ - |. .....+..++..|+++........+++++|.||..+ T Consensus 73 v~~~~~~l~g~~~~~~~--~------------------d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v 128 (489) T protein:vir:99 73 TVFEQGYMLGVPVEYKN--E------------------NK----DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELL 128 (489) T ss_pred HHHHhhhhccCCceeec--C------------------Ch----hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEE Confidence 99999999988776533 1 11 235567777888999999999999999999999877 Q ss_pred EEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) +..... ..++.+.+..+ +|..++ ||+... .+..+ +++.|... ..+ T Consensus 129 ~~~~~~--d~~~~~~i~~~-~p~~~~~v~dd~~~----~~~~~-~i~~~~~~-------------------------~~~ 175 (489) T protein:vir:99 129 TVEKID--DKKTEVKLYQL-PAEQTFVIYDDTYQ----RNSLM-AVHFYDID-------------------------YGS 175 (489) T ss_pred eeccCc--CCCcceEEEEE-cccceEEEEcCCCC----CceEE-EEEEEEEe-------------------------cCC Confidence 654322 23567777766 677764 443321 12222 22222100 000 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 
236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) ......+++|..... +.+...... .+...+..+.|.+.+.+ T Consensus 176 ~~~~~~~~~y~~~~i--~~~~~~~~~-------------------------------------~~~~~~~~~~~~~~g~v 216 (489) T protein:vir:99 176 GKRKQIIKAYTSDTI--YTYEDYNLE-------------------------------------TKGMRLKDYEGHFFKGV 216 (489) T ss_pred CceEEEEEEEeCCcE--EEEEecCCC-------------------------------------cccceecccccccCCce Confidence 122344455533211 000000000 00001223334444666 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch--HHHHhhccc-CCC---- Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY--EEEWNQANR-KNQ---- 388 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~--~~~~~~~~~-~~~---- 388 (688) |+|+|.. ...|.|.+..++++++.+|..+|.+.+.+.....+.+++........ .+....... .++ T Consensus 217 Pvv~~~n-------~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (489) T protein:vir:99 217 PVNEYAN-------NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAI 289 (489) T ss_pred eEEEeec-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhccccccccccc Confidence 7665422 24577999999999999999999999988777766655432222111 111111110 011 Q ss_pred -------ceeecCccc----ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHH Q lcl|NC_020488. 389 -------SVLRYNAIP----GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDR 457 (688) Q Consensus 389 -------~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~ 457 (688) .++...+.. ....+.++....-...+...++.+...|-..||+.+.+.+..+++.||.|+..+...... 
T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ 369 (489) T protein:vir:99 290 SIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDN 369 (489) T ss_pred ccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHH Confidence 111111100 011233344333445666778888899999999888766554456799999887766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccC Q lcl|NC_020488. 458 GTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPS 537 (688) Q Consensus 458 ~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~ 537 (688) ........|..+++.+.++++.++...... .+.... -.+|.|.=.+. T Consensus 370 k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--------~~~~~~-------------------------~~~i~v~f~~~ 416 (489) T protein:vir:99 370 YREKQERLFKKGLMRRLRLAANIWAIKGNE--------ATTYSL-------------------------VNDTSIVFTPN 416 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--------cccccc-------------------------cccceEEeCCC Confidence 666777777777777777776665322100 000000 01233333445 Q ss_pred cHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC---ccHHHHHHHHHhhcccccc-ch-----hhHHhhhhhhhhh Q lcl|NC_020488. 538 YQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW---PGAQDIARRLQKTLPPGIL-DQ-----DEMEEAGIEPPQP 607 (688) Q Consensus 538 ~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~---~~~~ei~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~ 607 (688) .+....+..+.++++... +....+++++++ +.+++-++++++....... .. 
....+.++.+.+| T Consensus 417 ~p~d~~~~~~~~~kl~gi------is~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 417 LPQNDNEIVTAAQNLYGI------VSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred CCcCHHHHHHHHHHHhcc------CCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 555555556655555332 223334444433 2344444444432211100 00 0001111111112 No 81 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.79 E-value=3e-17 Score=111.19 Aligned_cols=457 Identities=10% Similarity=0.032 Sum_probs=234.5 Q ss_pred chHHHHHHHHHHHHHH-H----Hh---------hhHHHHHHHHHHHhhCCC--CCCHHHHHHHHhcCC----CceeehhH Q lcl|NC_020488. 15 SQEAILQEIRERAAHA-V----TC---------WKHNFDAAQEDISFLAGE--QWPESVRKEREDEGR----PCLTLNKL 74 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~-~----~~---------~~~~r~~~~~~~~~~~G~--Qw~~~~~~~~~~~g~----p~~~~N~i 74 (688) -.+.+...++..+.+- . .. ....+....++.+||.|+ .|... .....|. ..++.|.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~---~~~~~~~~~~~~~~~~n~~ 77 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNL---NYEHNGNPVNRRQLSMNLP 77 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcc---hhccCCCccccceeecchH Confidence 3344555555554441 0 00 122334456678999995 34321 1111222 34678999 Q ss_pred HHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCce Q lcl|NC_020488. 75 PQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGW 154 (688) Q Consensus 75 ~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~ 154 (688) +-+++...++.....+.+.+ +|.+.++. 
+..++..+++......++.+++..|.|| T Consensus 78 k~i~~~~a~~l~~~p~~i~~--------------------~d~~~~e~----l~~~~~~n~f~~~~~~~~~~a~~~G~~~ 133 (496) T protein:vir:38 78 KVTAKYMSKLLFNEKVKINI--------------------DDKAAEEF----VLNVLKTNGFTKNMERYIEYGEAMGGFV 133 (496) T ss_pred HHHHHHHhhhhhCCcceEee--------------------CChHHHHH----HHHHHhccCHHHHHHHHHHHHhhhCcEE Confidence 99999999999988888766 13444444 5555667899999999999999999999 Q ss_pred EEEEEeeccCCCCCcceeEEEecccceEEe--CCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccccc Q lcl|NC_020488. 155 LRVLTKYSTDDAFDLDLCIKSIHNRFAVLM--DPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSW 232 (688) Q Consensus 155 ~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~--Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 232 (688) ++++++. .+.+.+..| +|..+|+ +.. .++..+-|+ ..+. T Consensus 134 ~~~~~D~------~~~~~i~~v-~~~~~~P~~~~~---~~~~~~~f~--~~~~--------------------------- 174 (496) T protein:vir:38 134 IKVYHDG------NKNVKVSFA-TADCMYPLSNDS---ENVDECVIA--NSFH--------------------------- 174 (496) T ss_pred EEEEEcC------CCcEEEEEE-cccceEEEEecC---CcEEEEEEE--EEEE--------------------------- Confidence 9998763 255777766 6777762 211 123322222 1110 Q ss_pred CCCCCEEEEEEEEeeeecceee----eeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 233 WTNEEGVRVSEYFYREPVTRKL----LLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 233 ~~~~~~v~v~e~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) .+....+..|+|++.....++ +...++..+ |..+.. ..+ ... ++... T Consensus 175 -~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~---------------g~~v~~------~~~------~~~-~~~~~ 225 (496) T protein:vir:38 175 -KNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNEL---------------GTKVSL------TLL------FDD-IEPVV 225 (496) T ss_pred -eCCeEEEEEEEEEEeCceEEEEEEEEecCCcccc---------------Cccccc------ccc------ccc-cccce Confidence 012345566776654332221 222221110 000000 000 000 00001 Q ss_pred CCC-CCccceEEEeee--eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh-hcc Q lcl|NC_020488. 
309 DWP-GSTIPVAPVLGK--EMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN-QAN 384 (688) Q Consensus 309 p~~-~~~~P~vp~~~~--~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~ 384 (688) .+. ....||+.+.+. .....+.++|.|.+..++++++.+|...|.+.+.+.. ...+++++...+....+... ... T Consensus 226 ~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~ 304 (496) T protein:vir:38 226 PLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQ 304 (496) T ss_pred eecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCcccc Confidence 111 123344432111 1113467889999999999999999999999998876 56778887776643211000 000 Q ss_pred cCC---CceeecCccc--ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHHHHH Q lcl|NC_020488. 385 RKN---QSVLRYNAIP--GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQGDRG 458 (688) Q Consensus 385 ~~~---~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~~~~ 458 (688) .++ ..+....... +...++...+.--.......++.....+...+|+++.+.|..++ ..||.++..+.+..... T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~ 384 (496) T protein:vir:38 305 YFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQT 384 (496) T ss_pred CCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHH Confidence 011 1111111111 11234333333233557778888888888999999999997654 35788888777666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCc Q lcl|NC_020488. 459 TFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSY 538 (688) Q Consensus 459 ~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~ 538 (688) .......++.+++++++.++.+...+-. ..|... ...+++|.=..+. 
T Consensus 385 ~~~~~~~~~~~l~~l~~~il~~~~~~~~------~~g~~~---------------------------~~~~i~v~f~d~i 431 (496) T protein:vir:38 385 KNSHSQLIEQGIKEMIVSILEVGKFIEA------YSGEVV---------------------------ELDTITVDFDDSI 431 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh------hcCCCC---------------------------CccceEEEeCCCC Confidence 6777778889999999998887654321 001000 0112222222233 Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhc-CC--ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHH Q lcl|NC_020488. 539 QTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNM-DW--PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPE 610 (688) Q Consensus 539 ~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~-~~--~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (688) +....+..+.++++... +-+....++... +. +.+++..+++++......+....... ..+.+ T Consensus 432 ~~d~~~~~~~~~~~~~~----GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~------~~~~e 496 (496) T protein:vir:38 432 AQDEDTTINRYTNAKNQ----GMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEMPNNDMNGI------FGEEE 496 (496) T ss_pred CCCHHHHHHHHHHHHhc----CCCCHHHHHHhcCCCChHHHHHHHHHHHHhhhccCccccccCC------CCCCC Confidence 33334444445544321 112222233222 22 12233444444322211110000000 00000 No 82 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.78 E-value=1.6e-17 Score=112.63 Aligned_cols=457 Identities=11% Similarity=0.048 Sum_probs=224.6 Q ss_pred CCCCCCCcCCC-----CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCC-HHHH----HHHHhcCCC--c Q lcl|NC_020488. 1 MLPGNEPIKTR-----DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWP-ESVR----KEREDEGRP--C 68 (688) Q Consensus 1 ~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~-~~~~----~~~~~~g~p--~ 68 (688) |.|=.+-.-.. -.+...++..++ +.+..+.....+.+..+..+||.|++=- .... .......+| . 
T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~r 97 (492) T protein:vir:97 21 LYPSQPTQTEIFDAIVRTNNKPETLEEM---IVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR 97 (492) T ss_pred eeccchhhhhHhhhcccCCCchhhHHHH---HHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccc Confidence 23222210000 000111122222 2223334445566777788999997510 0000 001112222 4 Q ss_pred eeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAV 148 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~ 148 (688) +++|..+.+|+..+|+...+.+.+.. +|.+..+. ++.++ .|+++.....+..+++ T Consensus 98 i~~n~~k~Ivd~~~~yl~g~p~~~~~--------------------~d~~~~~~----l~~~~-~n~~~~~~~~~~~~~~ 152 (492) T protein:vir:97 98 MITNFHANLVDQKVSYIVGKPIAFKH--------------------TDDEVVKR----IDEVL-GNRFDDKLHSVLTGAS 152 (492) T ss_pred cccchHHHHHHHHhhhhcccCceecc--------------------CchHHHHH----HHHHH-hccHHHHHHHHHHHHh Confidence 67899999999999999988766532 12333333 33333 4789999999999999 Q ss_pred HcCCceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE 226 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 226 (688) ++|.||..++.+. ++.+.+..+ +|..++ ||+... . +. ..+++.|-..+ T Consensus 153 ~~G~a~~~v~~d~------dg~~~~~~~-~p~~~~~i~d~~~~-~---~~-~~~vr~~~~~~------------------ 202 (492) T protein:vir:97 153 NKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH-E---EL-EAFIRMYKLEN------------------ 202 (492) T ss_pred hcCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC-C---ce-EEEEEEEeecc------------------ Confidence 9999988776532 356777666 677654 554321 1 22 23333332110 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 
227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) .. .+++|......... ..++....... .. .... ... T Consensus 203 ---------~~---~~~~y~~~~v~~~~--~~~~~~~~~~~-~~---------------------------~~~~--~~~ 238 (492) T protein:vir:97 203 ---------ET---KVEYWDKVTVNYYV--YENGSLIPDYS-NN---------------------------LENS--KTH 238 (492) T ss_pred ---------ce---eEEEEecCeEEEEE--EecCeeeeccc-cc---------------------------cccc--ccc Confidence 00 12333322211111 11111111000 00 0000 111 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..|.+.+..|+|+|.. ...|.|.+..++++++.+|..+|.+...+.....+.+++.....++..+... ..+. T Consensus 239 ~~~~~~g~vPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~-~~~~ 310 (492) T protein:vir:97 239 FSTGSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR-LLRY 310 (492) T ss_pred cccCCCCCcceEEecC-------CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHH-HHhh Confidence 2233445566665422 3457799999999999999999999999988888876654322222222222 1122 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) .+++.. .. 
...+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-...........+.| T Consensus 311 ~~~~~~-~~---~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f 386 (492) T protein:vir:97 311 YGAIKV-SD---NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 386 (492) T ss_pred ccceec-CC---CCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 21 12344554444456677888999999999999888777766667899999887766666666666667 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..+++++++++..++.. .+ ++ .++.|.=.+..+....+.. T Consensus 387 ~~~l~~~~~li~~~~~~----------~~-----~~-------------------------~~i~v~f~~~~p~~~~e~a 426 (492) T protein:vir:97 387 KVAIQELLWFVFEHFDI----------KG-----EH-------------------------KDVDISFNYNKVANTELQV 426 (492) T ss_pred HHHHHHHHHHHHHHhcC----------Cc-----cc-------------------------ceeeEEecCCCCCCHHHHH Confidence 77777766665543311 00 11 1122222334444444445 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQ 619 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q 619 (688) +.++++... +....+++++++ .+.++-.+++++................ 
...............+ T Consensus 427 ~~~~kl~G~------iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 427 QTAQQSMGI------VSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGG--ADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC--CCCCcccccccccccC Confidence 555544322 233444555543 3344444444332110000000000000 0000000000000000 No 83 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.78 E-value=8.9e-18 Score=114.06 Aligned_cols=457 Identities=11% Similarity=0.044 Sum_probs=227.2 Q ss_pred CCCCCCCcCCCCc-----cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC--CHHH---HHHHHhcCCC--c Q lcl|NC_020488. 1 MLPGNEPIKTRDD-----DSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW--PESV---RKEREDEGRP--C 68 (688) Q Consensus 1 ~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw--~~~~---~~~~~~~g~p--~ 68 (688) |+|=++-.-.... +...++..++.. ...+.+.+.+....+..+||.|++= .... ........+| . T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~---~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~r 97 (492) T protein:vir:94 21 LYPSQPTQTEIFDAIVRTNNKPETLEEMIV---RYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR 97 (492) T ss_pred eecCccchhhhhhcccccCCchhhHHHHHH---HHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc Confidence 4444431111100 011122222222 2223344456667788899999751 0000 0001112222 4 Q ss_pred eeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAV 148 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~ 148 (688) +++|..+.+|+..+|+...+++.+.+ .|.+..+.+. 
.++ .|+++.....+..+++ T Consensus 98 i~~n~~k~Ivd~~~~yl~G~p~~~~~--------------------~d~~~~~~l~----~~~-~n~~~~~~~~~~~~a~ 152 (492) T protein:vir:94 98 MITNFHANLVDQKVSYIVGKPIAFKH--------------------TDDEVVKRID----EVL-GNRFDDKLHSVLTGAS 152 (492) T ss_pred cccchHHHHHHHHHhhhcccCceecc--------------------CchHHHHHHH----HHH-hccHHHHHHHHHHHHh Confidence 67899999999999999988766533 1333333333 333 4789999999999999 Q ss_pred HcCCceEEEEEeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE 226 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 226 (688) ++|.||+.++.+. ++.+.+..+ +|..+ +||+... . +.. .+++.|-..+ T Consensus 153 ~~G~a~~~v~~d~------dg~~~~~~~-~p~~~~~v~d~~~~-~---~~~-a~ir~~~~~~------------------ 202 (492) T protein:vir:94 153 NKGIEWLHPYLDE------EGEFKLFRV-PAEQGIPIWTDKEH-E---ELE-AFIRMYKLEN------------------ 202 (492) T ss_pred hCCeEEEEEEecC------CCceEEEEE-cccceEEEEcCCCC-C---ceE-EEEEEEeecc------------------ Confidence 9999998887542 356677666 67765 4664321 1 122 2333332110 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) .. .+++|....... +...++..+..... ....+... T Consensus 203 ---------~~---~~~~y~~~~v~~--~~~~~~~~~~~~~~------------------------------~~~~~~~~ 238 (492) T protein:vir:94 203 ---------ET---KVEYWDKVTVNY--YVYENGSLIPDYSN------------------------------NLENSKTH 238 (492) T ss_pred ---------ce---eEEEEecCeEEE--EEEecCeeeecccc------------------------------cccccccc Confidence 00 123332221111 11111211110000 00001112 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 
307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..|.+.+..|+|+|.. ...|.|.+..++++++.+|.++|.+...+....++.+++.....++..+... ..+. T Consensus 239 ~~~~~~g~vPvv~~~n-------n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~-~~~~ 310 (492) T protein:vir:94 239 FSTGSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR-LLRY 310 (492) T ss_pred ccccCCCccceEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHH-HHhh Confidence 2334446666665422 3457799999999999999999999999988888876654322222222221 1122 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) .+++.. .. ...+.++..+.-..++...++.+...|-..|++.+.+.+.-+++.||.|+..+-...........+.| T Consensus 311 ~~~~~~-~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f 386 (492) T protein:vir:94 311 YGAIKV-SD---NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 386 (492) T ss_pred ccceec-CC---CCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 11 22344554444446677788899999999999988777766667899999888766767777777777 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..+++++.++++.++..- + ++ -+|.|.=.+..+....+.. 
T Consensus 387 ~~~l~~~~~li~~~~~~~----------~-----~~-------------------------~~i~v~f~~~~p~~~~e~~ 426 (492) T protein:vir:94 387 KVAIQELLWFVFEHFDIK----------G-----EH-------------------------KDVDISFNYNKVANTELQV 426 (492) T ss_pred HHHHHHHHHHHHHHhcCC----------c-----cc-------------------------ceeeEEecCCCCCCHHHHH Confidence 777777766665543210 0 00 1122222334444444444 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKA 625 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~ 625 (688) +.+..+... +....+++++++ .+.+.-.+++++.................... .. .... T Consensus 427 ~~~~kl~gi------iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~-~~-~~~~------------ 486 (492) T protein:vir:94 427 QTAQQSMGI------VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADSA-QQ-QERS------------ 486 (492) T ss_pred HHHHHHhcc------CchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCC-cc-ccCC------------ Confidence 444444321 223344555443 34444444443221100000000000000000 00 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 626 KADTAKAQADMAMAQAKTAEAQ 647 (688) Q Consensus 626 q~e~~~~q~e~~~~q~~~~~~~ 647 (688) ...+.+ T Consensus 487 ----------------~~~e~e 492 (492) T protein:vir:94 487 ----------------NNKESE 492 (492) T ss_pred ----------------ccccCC Confidence 000000 No 84 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.78 E-value=4e-18 Score=115.94 Aligned_cols=458 Identities=12% Similarity=0.057 Sum_probs=228.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHH-------hcCCC--ceee Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKERE-------DEGRP--CLTL 71 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~-------~~g~p--~~~~ 71 (688) -+|-.++...+--+..+.......+.+....+..........+..+||.|+| +-.....+ ...+| .++. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~--~i~~~~~~~~~~~~~~~~~~~~ki~~ 82 (474) T protein:vir:96 5 FWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDP--DVLRLAPKLDNKGEIDPLKPDWRMFT 82 (474) T ss_pred ccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCC--cchhccchhcccccccccccchhccc Confidence 2444443332222111111112222233333344445566778889999986 11111111 11233 4778 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) |..+-+|+..+|++..+.+.+.+ .|.+..+.+...+ .+++......+..++.++| T Consensus 83 n~~~~Ivd~~~~~l~g~p~~~~~--------------------~d~~~~~~l~~~~-----~n~~~~~~~~~~~~~~~~G 137 (474) T protein:vir:96 83 NYHQNLVDQKVAYAVANPVTFSS--------------------DDDKSLKTIQEVL-----NHKWDDKLVDILTAASNKG 137 (474) T ss_pred chHHHHHHhhhhhhcccCceeec--------------------CchHHHHHHHHHH-----hcCHHHHHHHHHHHHHhcC Confidence 99999999999999998877643 1333344444333 3678888888999999999 Q ss_pred CceEEEEEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccc Q lcl|NC_020488. 152 FGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGE 229 (688) Q Consensus 152 ~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~ 229 (688) .||..++.+. ++.+.+..+ +|..+| ||+... .+. ..+++.|... + .. 
T Consensus 138 ~~~~~~y~d~------~~~~~i~~~-~p~~~~~v~d~~~~----~~~-~~~vr~~~~~--------~-----------~~ 186 (474) T protein:vir:96 138 IEWLQPYIDE------NGEFKTFRV-PAEQAIPIWTNKER----DTL-KAFIRYYRLD--------G-----------AE 186 (474) T ss_pred eeEEEEEecC------CCceEEEEE-cccceEEEEcCCCC----Cce-EEEEEEEeec--------C-----------ce Confidence 9998887542 356777666 687765 554321 122 2333333110 0 00 Q ss_pred cccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCC Q lcl|NC_020488. 230 YSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVD 309 (688) Q Consensus 230 ~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p 309 (688) -+++|....... +...++.......... . ......+....| T Consensus 187 -----------~~~~yt~~~v~~--~~~~~~~~~~~~~~~~---------------~-----------~~~~~~~~~~~~ 227 (474) T protein:vir:96 187 -----------RVEYWTDSDVTY--YEYQDGILIPDYYHGE---------------E-----------HIQSHYYVGNKR 227 (474) T ss_pred -----------EEEEEeCCeEEE--EEecCCceeecccccc---------------c-----------cccccccccccc Confidence 023332221111 1111121111000000 0 000011223345 Q ss_pred CCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCc Q lcl|NC_020488. 310 WPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQS 389 (688) Q Consensus 310 ~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 389 (688) .+.+.+|+|+|.. ...|.|.+..++++++.+|...|.+...+....++.+++......+..+.... ....++ T Consensus 228 ~~~g~iPvv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-~~~~~~ 299 (474) T protein:vir:96 228 VSWGRVPFIPFKN-------NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRN-LKYYKA 299 (474) T ss_pred cCCCceeEEEecc-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhh-hhcCce Confidence 5667788776532 35678999999999999999999999999888888776544333332222221 122233 Q ss_pred eeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
390 VLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRA 469 (688) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 469 (688) +.. .+. +..+.++..+.-..+....++.+...|-..|++.+.+.+..+++.||+|+..+-.............|..+ T Consensus 300 i~~-~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 376 (474) T protein:vir:96 300 INV-DGD--GSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTA 376 (474) T ss_pred EEe-cCC--CCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 221 22355665554556778888999999999999988877766677899999888766666666777777777 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) ++++.++++.+.-..+ ++. +|.|.=.++.+....+..+.+ T Consensus 377 l~~~~~~i~~~~~~~~---------------~~~-------------------------~i~i~f~~~~p~~~~e~~~~~ 416 (474) T protein:vir:96 377 LQELLQYIIDFYKLNI---------------KVQ-------------------------DVEITFNFNVMVNELEQSQIG 416 (474) T ss_pred HHHHHHHHHHHhCCCc---------------ccc-------------------------eeeEEeccCCCcCHHHHHHHH Confidence 7776666555431111 110 111111222222222222222 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcC-CccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMD-WPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKAD 628 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~-~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e 628 (688) .+ .+.+....++..++ +...+.-.+++++............... .... 
T Consensus 417 ~~-------ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~---~~~~--------------------- 465 (474) T protein:vir:96 417 VQ-------SQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGD---ANGR--------------------- 465 (474) T ss_pred Hh-------cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccc---cccc--------------------- Confidence 11 12233334444443 2333433444432211000000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 629 TAKAQADMAMAQAKTAEAQAK 649 (688) Q Consensus 629 ~~~~q~e~~~~q~~~~~~~a~ 649 (688) .+....+- . T Consensus 466 ----------~~d~~~e~--~ 474 (474) T protein:vir:96 466 ----------AQDNESET--N 474 (474) T ss_pred ----------cCCCcccC--C Confidence 00000000 0 No 85 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.78 E-value=8.5e-18 Score=114.17 Aligned_cols=457 Identities=9% Similarity=0.005 Sum_probs=229.3 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-HHHHhcCCC--ceeehhHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-KEREDEGRP--CLTLNKLPQY 77 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-~~~~~~g~p--~~~~N~i~~~ 77 (688) .+|... .....+.+.++..++. ...+....+..+||.|+|..-..+ ......++| .++.|..+.+ T Consensus 14 ~~~~~~------~~l~~~~i~~li~~~~------~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~I 81 (506) T protein:vir:94 14 IYQESL------ENLTPNKIMKFITHHF------NYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYI 81 (506) T ss_pred ecccch------hcCCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHH Confidence 333221 2222222333333322 223344566778999987532111 122334554 5789999999 Q ss_pred HHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 
78 VDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 78 i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) |+..+|+...+.+.+.+ - |. .....+..+++.|+++.....+..+++++|.+|..+ T Consensus 82 v~~~~~~l~G~p~~~~~--~------------------d~----~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v 137 (506) T protein:vir:94 82 ADFQTSYSVGNPINVKL--P------------------DD----GSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYV 137 (506) T ss_pred HHHhhhhhcccCceeec--C------------------cc----hHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEE Confidence 99999999988766543 1 11 123457777788999999999999999999999888 Q ss_pred EEeeccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) +.+. ++.+.+..+ +|..++ ||+... .....+++.|.....- + .. . T Consensus 138 ~~de------d~~~~i~~~-~p~~~~~v~dd~~~-----~~~~~~v~~~~~~~~~-----~-----------~~-----~ 184 (506) T protein:vir:94 138 YRGE------DNEEHLAKL-DPLDTFVIYSTDVD-----PKPIMAVRYHQIELVD-----D-----------NQ-----V 184 (506) T ss_pred EecC------CCeeEEEEE-cccceEEEecCCCC-----CceEEEEEEEeeeecc-----C-----------Cc-----e Confidence 7642 356777666 677653 444221 1133344444321000 0 00 0 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) .......++|... .+.....+. .+. 
.+....+-+.+.+ T Consensus 185 ~~~~~~~~~yt~~----~~~~~~~~~-------------------------------------~~~-~~~~~~~~~~g~v 222 (506) T protein:vir:94 185 STINYVPETWTAD----TYTLYNPTP-------------------------------------IMG-KMQVDTTKPITTF 222 (506) T ss_pred eEEEEEEEEEeCc----eEEEecccc-------------------------------------Ccc-ceeccccccCCcc Confidence 0011112222111 000000000 000 1112233345667 Q ss_pred ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchH------------------ Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYE------------------ 377 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~------------------ 377 (688) |+|+|.. ...|.|.+..++++++.+|..+|.+.+.+.-..++.+++......... T Consensus 223 Pvv~~~n-------~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (506) T protein:vir:94 223 PVVEFKN-------SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAK 295 (506) T ss_pred ceEEecC-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccc Confidence 7776422 234679999999999999999999998887665555443221110000 Q ss_pred -----HHHhhcccCCCceeecCcc-----cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHH Q lcl|NC_020488. 378 -----EEWNQANRKNQSVLRYNAI-----PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKA 447 (688) Q Consensus 378 -----~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~a 447 (688) .......+..+.+.+..+. 
.....++++..+.-..++...++.+...|-..|++.+...+.-+++.||.| T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 375 (506) T protein:vir:94 296 LAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVA 375 (506) T ss_pred cccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHH Confidence 0000001111122111111 112234555555566778888899999999999999877666556789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeee Q lcl|NC_020488. 448 ILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGK 527 (688) Q Consensus 448 i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~ 527 (688) +..+-.............|..+++++.++++.++..... ....++ T Consensus 376 ik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~----------~~~~d~------------------------- 420 (506) T protein:vir:94 376 MQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHG----------DWTFDP------------------------- 420 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------cccccc------------------------- Confidence 998877777777777777778888877777776543211 001111 Q ss_pred EEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhHHhh--hhhh Q lcl|NC_020488. 528 FDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEMEEA--GIEP 604 (688) Q Consensus 528 ~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~~~~--~~~~ 604 (688) .++.|.=.+..+....+..+.+..+.. .+....+++++++ ...++-.+++++............... ..+. T Consensus 421 ~~i~i~f~~~~p~d~~e~a~~~~kl~g------~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 494 (506) T protein:vir:94 421 QELTFTFRDNLPADNISQIKALVQAGA------TLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT 494 (506) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHhc------cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc Confidence 112222234444444444555554432 1223344444433 333333334433211100000000000 0000 Q ss_pred hhhhHHHHHHHH Q lcl|NC_020488. 
605 PQPSPEQQANMA 616 (688) Q Consensus 605 ~~~~~~~q~~~~ 616 (688) .+.+.+...+.. T Consensus 495 ~~~~~~~~~e~~ 506 (506) T protein:vir:94 495 NTTATQTDEEVR 506 (506) T ss_pred cccccccccCCC Confidence 000000000000 No 86 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.78 E-value=1.1e-16 Score=107.97 Aligned_cols=499 Identities=11% Similarity=0.008 Sum_probs=249.1 Q ss_pred CCccchHH--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh- Q lcl|NC_020488. 11 RDDDSQEA--ILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ- 87 (688) Q Consensus 11 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~- 87 (688) ..+-..+. -.+++..+|+.-.+..+.|...|++..+|..-.-.+++... ++...+.-..-...+++..+.+.. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~----~~~~~~~dstg~~a~~~LAa~l~~~ 76 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDN----ETSQNGWQGVGAQATNHLANKLAQV 76 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCc----ccccccccchHHHHHHHHHHHHHHh Confidence 11111111 34668888998888899999999999998854322221100 111112223333334444444333 Q ss_pred ----CCcceEEEeCCccccccccccccccChhhHHH------HHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 88 ----NRPAIQVHPVEANATKDTSKVPNVAGTSDYSL------AEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 88 ----~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~------Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) +++=+++.+.+.... ..+..+.+. 
-+.++..+...+..|++..+...+|.+.+..|.|++.+ T Consensus 77 ltpp~~~WF~l~~~d~~~~--------~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~ 148 (515) T protein:vir:70 77 LFPAQRSFFRVDLTAKGEK--------VLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK 148 (515) T ss_pred hcCCCCcccccccChhhhh--------ccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE Confidence 233334433322110 001111222 22345556666778999999999999999999998654 Q ss_pred EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) + ++ +.+. .+ +..++++..++.- ...-++++..|+..++.+.|+...... ....+. ..++ T Consensus 149 --d---~~---~~~~--~~-pl~~y~v~~d~~G----~v~~i~rr~~~t~~~l~~~f~~~~~~~---~~~~~~---~~~~ 207 (515) T protein:vir:70 149 --P---SK---GAMS--AV-PMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIEVG---MKGKKC---KEDD 207 (515) T ss_pred --e---CC---CCeE--EE-EcCeEEEeeCCCc----CeeEEEeeeeccHHHHHHhhhhhhhhh---hhhhhc---CCCC Confidence 2 11 1122 23 3455666544321 233378899999999999997533211 111111 1123 Q ss_pred EEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccce Q lcl|NC_020488. 238 GVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~ 317 (688) .|.++.+-++.+ ++. 
+ +++....+..++ .++-||...+|| T Consensus 208 ~v~i~~~v~~~~---------~~~-----------------------------~-~~~~e~d~~~~~-~es~y~~~e~P~ 247 (515) T protein:vir:70 208 NVKLYTHAQYAG---------EGF-----------------------------W-KINQSADDIPVG-KESRIKSEKLPF 247 (515) T ss_pred ceEEEEEEEecC---------CCc-----------------------------e-EEEEecCceeec-cccccccccCCc Confidence 344433222211 111 1 122233343333 335567789999 Q ss_pred EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccc Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIP 397 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 397 (688) +++.+ ...+|..||.|.+....+--+.+|.+....+.....+.++.++++.+.+.+...+.. ..+|.++. +. T Consensus 248 ~~~Rw--~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~---~~~g~iv~-g~-- 319 (515) T protein:vir:70 248 IPLTW--KRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVN---SGTGEVIT-GV-- 319 (515) T ss_pred eeeee--eecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccc---cCCceeec-CC-- Confidence 97644 567999999999999999999999999999999999999999999988876654432 22233332 21 Q ss_pred ccccceecC--CCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_020488. 398 GVDRPQRDM--PASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR-AIRRVG 474 (688) Q Consensus 398 ~~~~~~~~~--~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~ 474 (688) .+.+..++ ...--+.....++...+.|....= .+.+.-.++...|++-|..+.+.-...|...+.+|.. +...+. 
T Consensus 320 -~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~-~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli 397 (515) T protein:vir:70 320 -AEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIA 397 (515) T ss_pred -cccceeeecCcccchhHHHHHHHHHHHHHHHHHh-hhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH Confidence 12222222 222335566677777777776542 2222223444579999999998888888888888753 444432 Q ss_pred HHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH Q lcl|NC_020488. 475 QILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ 554 (688) Q Consensus 475 ~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q 554 (688) ++++. ...+. + + .++ .++.+.+ +-.+-.|.+..+.+..+++ T Consensus 398 ~r~~~-------------~~~p~-------~-P-------------~~~----v~~~~vs-~l~~L~r~q~~~~i~~~~q 438 (515) T protein:vir:70 398 MWGLQ-------------EAGDS-------F-T-------------SEL----VDPVIVT-GIEALGRMAELDKLANFAQ 438 (515) T ss_pred HHHHH-------------hhCCC-------C-C-------------hhh----cccceeh-hHHHHHHHHHHHHHHHHHH Confidence 22110 00100 0 0 000 1222222 2233456666666776666 Q ss_pred hhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 555 AVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQA 634 (688) Q Consensus 555 ~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~ 634 (688) .+....+ ..+.++... +.+++.+.+.........-....++.++. .+++.|++++++.++.. .++ T Consensus 439 ~i~~~~~-~~p~~~~~i---d~d~~~~~~a~~~g~p~~~~rs~eev~~~----------r~q~~~~~~~~~~~~~~-~~a 503 (515) T protein:vir:70 439 YMSLPQT-WPEPAQRAI---RWGDYMDWVRGQISAELPFLKSEEEMQQE----------MAQQAQAQQEAMLNEGV-AKA 503 (515) T ss_pred HHHHHhc-cChhHHhhC---CHHHHHHHHHHHhCCCccccCCHHHHHHH----------HHHHHHHHHHHHHHHhh-hhh Confidence 5432222 223333333 44555554433332211111111000000 00000000000000000 000 Q ss_pred HHHHHHHHHHHH Q lcl|NC_020488. 
635 DMAMAQAKTAEA 646 (688) Q Consensus 635 e~~~~q~~~~~~ 646 (688) ....+....++. T Consensus 504 ~~~~~~~~~~~~ 515 (515) T protein:vir:70 504 VPGVIQQEMKEG 515 (515) T ss_pred cccchhhhhccC Confidence 000000000000 No 87 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.78 E-value=3.7e-17 Score=110.64 Aligned_cols=470 Identities=12% Similarity=0.083 Sum_probs=243.3 Q ss_pred HHHHHHHHHHHHHHHHh------------------hhHHHHHHHHHHHhhCCC-CCCHHHHHH-HHhcCCCceeehhHHH Q lcl|NC_020488. 17 EAILQEIRERAAHAVTC------------------WKHNFDAAQEDISFLAGE-QWPESVRKE-REDEGRPCLTLNKLPQ 76 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~------------------~~~~r~~~~~~~~~~~G~-Qw~~~~~~~-~~~~g~p~~~~N~i~~ 76 (688) -.++++++..|++.... ..+......++.+||.|+ .|-.. ... -....+...+.|+-+. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~-~~~~~~~~~~~~~sln~~~~ 79 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHY-QASDGIKKKRLKNTINMAKT 79 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCccccc-ccCCCCccccceeecchHHH Confidence 44666666666553322 234455567788999996 22110 000 0011233467799999 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +++...+....-.+.+.|.. 
| +..+..+..+++.|++......++++++..|.||++ T Consensus 80 i~~~~A~lv~~e~~~i~v~~-------------------~----~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k 136 (508) T protein:vir:15 80 AARRIASVVFNEKAEIHVKD-------------------N----NEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMR 136 (508) T ss_pred HHHHHHhhhhCCCceEEeCC-------------------c----hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEE Confidence 99988888887777776631 1 123445666777899999999999999999999999 Q ss_pred EEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCC Q lcl|NC_020488. 157 VLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNE 236 (688) Q Consensus 157 v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~ 236 (688) ++++ .+.+.+..| ++..||+= .....+...+-|+......+ .... T Consensus 137 ~~~d-------~~~~~i~~v-~ad~~~P~-~~d~~~~~~~af~~~~~~~~--------------------------~~~~ 181 (508) T protein:vir:15 137 PYID-------GNHIKIAWV-RADQFYPL-QSNTNDISEAAIASRTQRTE--------------------------SNQT 181 (508) T ss_pred EEEe-------CCeeEEEEE-cCCeeEEE-EEcCCCeEEEEEEEEEEeec--------------------------CCCc Confidence 9986 135677777 67777731 11112234443333221100 0011 Q ss_pred CEEEEEEEEeeeec-----ceeeeeccC----CceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccC Q lcl|NC_020488. 237 EGVRVSEYFYREPV-----TRKLLLLSD----GRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGP 307 (688) Q Consensus 237 ~~v~v~e~~~~~~~-----~~~~~~~~~----g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~ 307 (688) +..++.|+..+... ...++...+ |..+..... ..+ .. |+.. T Consensus 182 ~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--------~e~---------------------~~-l~~~ 231 (508) T protein:vir:15 182 KYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTL--------PVY---------------------KE-LAPQ 231 (508) T ss_pred eEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhc--------ccc---------------------cC-CCcc Confidence 12334444433211 111111111 111100000 000 00 0000 Q ss_pred CCCCC-CccceEEEee--eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcc Q lcl|NC_020488. 
308 VDWPG-STIPVAPVLG--KEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQAN 384 (688) Q Consensus 308 ~p~~~-~~~P~vp~~~--~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 384 (688) ..+.+ ...||+.|-. .-....++|+|.|.+..+++.++.+|...|.+.+.+ ..+..++.++++.+....+. .... T Consensus 232 ~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~-~~~~ 309 (508) T protein:vir:15 232 VTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEH-KPTF 309 (508) T ss_pred eEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCC-cccc Confidence 00111 1122322110 001123678999999999999999999999999999 46778899988887532111 0001 Q ss_pred cCCCc-eeecCc-ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch-hhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 385 RKNQS-VLRYNA-IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNE-QSGKAILARQRQGDRGTFA 461 (688) Q Consensus 385 ~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~-~sg~ai~~~~~~~~~~~~~ 461 (688) ..+.- +...+. ...+..++...+.--...+...++.....+....|++....|..++. .||++|....+..-..... T Consensus 310 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~ 389 (508) T protein:vir:15 310 DTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSS 389 (508) T ss_pred CCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHH Confidence 11111 111111 11122344443332334577788888889999999999999976654 5899999888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHH Q lcl|NC_020488. 462 YIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQ 541 (688) Q Consensus 462 ~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~ 541 (688) +...+..+++++.+.++.+..-++-. .+ |.+...-++....++|+|+=+.+.... 
T Consensus 390 ~~~~~~~al~~lv~~il~l~~~~~~~---------~~----------------g~~~~~~~~~~~~~~v~v~f~D~i~~d 444 (508) T protein:vir:15 390 YLTMVEKAIDELCQSIFELANAGALF---------DD----------------GKPLFTLDSASQPLDIECHFDDGVFVN 444 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccc---------cc----------------cccccccccccCCcceEEEeCCCCCCC Confidence 88899999999999998876544210 00 000001111112345555555555555 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHH-HHhcCC--ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHH Q lcl|NC_020488. 542 RMEAADSLMQFVQAVPAAGGVVLDLI-AKNMDW--PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPE 610 (688) Q Consensus 542 r~~~~~~l~~~~q~~~~~~~~~~~~~-~e~~~~--~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (688) .++..+.++++... +-+....+ .+.-++ ..+++..+++++......... ....+. ....-+ T Consensus 445 ~~~~~~~~~~~v~a----Gi~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~---~~~~~~-~g~~ge 508 (508) T protein:vir:15 445 KDKQLEEDAKVLAI----GALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEG---GRSAIL-NGGDGE 508 (508) T ss_pred HHHHHHHHHHHHhc----CCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCccc---cccccC-CCCCCC Confidence 55555555555421 11112222 222233 223334444443322111100 000000 000000 No 88 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.77 E-value=2.9e-16 Score=105.77 Aligned_cols=503 Identities=13% Similarity=0.046 Sum_probs=251.2 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~ 87 (688) |+..-++...-....++.+|+...+..++|...|++..+|..-.=.++..- .++...+.-..-...+++..+.+.. 
T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~----~~~~~~~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGD----NETSQNGWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCC----cccccccccchHHHHHHHHHHHHHh Confidence 666666666667788999999999999999999999999885422211100 0111112222333334444444433 Q ss_pred C-----CcceEEEeCCccccccccccccccChhh---HHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 88 N-----RPAIQVHPVEANATKDTSKVPNVAGTSD---YSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 88 ~-----r~~~~v~pr~~~~~~~~~~~~~~~~~~d---~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) . ++=+++.+.+... .. ....+.+- .+-.+..+..+...+..|++..+...++.+.+..|.|++.+ T Consensus 77 ~ltpp~~~WF~L~~~d~~~--~~---~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-- 149 (516) T protein:vir:10 77 VLFPAQRSFFRVDLTAQGE--KV---LNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYK-- 149 (516) T ss_pred hhcCCCCccccccCChhhH--hh---hhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEe-- Confidence 2 2333333322110 00 00000111 11223455566666778999999999999999999987443 Q ss_pred eeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEE Q lcl|NC_020488. 160 KYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGV 239 (688) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v 239 (688) + ++ +. ++.+ +..++++..++. . ..--++++..++..++.+.|++.......... 
....+.+ T Consensus 150 d---~~---~~--~~~~-pl~~y~v~~d~~-G---~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~------~~~~~~~ 210 (516) T protein:vir:10 150 P---SK---GA--ISAI-PMHHYVVNRDTN-G---DLLDIILLQEKSLRTFDPATRAVVEVGLKGKK------CKEDDSI 210 (516) T ss_pred c---CC---CC--eEEE-EcCeEEEeeCCC-C---CeEEEeeeecccHHHHHHHhhhhhhhhhhhhc------cCCCCce Confidence 2 11 11 2233 344566654432 1 12236778899999999988653221111100 0112234 Q ss_pred EEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEE Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAP 319 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp 319 (688) .++.+=++.+ ++. + +++....+..+. ..+-|+...+||++ T Consensus 211 ~i~t~v~~~~---------~~~-----------------------------~-~~~~~~d~~~~~-~~s~~~~~e~P~~~ 250 (516) T protein:vir:10 211 KLYTHAKYLG---------EGF-----------------------------W-ELKQSADDIPVG-KVSKIKSEKLPFIP 250 (516) T ss_pred EEEEEEEecC---------CCc-----------------------------e-EEEEeeCceeec-cccccccccCCeee Confidence 4433222221 111 1 111112223322 23445667899996 Q ss_pred EeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCccccc Q lcl|NC_020488. 320 VLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGV 399 (688) Q Consensus 320 ~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (688) +.+ ...+|..||.|.+....+--+.+|.+...++.....+.++.++++.+.+.+...+.. ..+|.++. + .. 
T Consensus 251 ~Rw--~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~---~~~g~~~~-g--~~- 321 (516) T protein:vir:10 251 LTW--KRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN---SGTGEVVT-G--VE- 321 (516) T ss_pred eee--eecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhcc---CCCceeec-C--Cc- Confidence 655 457999999999999999999999999999999999999999998888877654432 22233432 1 11 Q ss_pred ccceecC--CCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_020488. 400 DRPQRDM--PASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR-AIRRVGQI 476 (688) Q Consensus 400 ~~~~~~~--~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~-~~~~~~~~ 476 (688) +.+..++ +..--+.....++...+.|....=+ +.+.-.++...|++.|..+.+.-...|...+.+|.. ++..+... T Consensus 322 ~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~-~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 400 (516) T protein:vir:10 322 EDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMM-ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMW 400 (516) T ss_pred ccceeeecCcccchHHHHHHHHHHHHHHHHHHhh-hhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 2222222 2222355566677777777665422 222223445679999999998888888888887753 44443333 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) .+. 
...+ .+- ..+ .++.+.++ -.+-.|.+.++.++.+++.+ T Consensus 401 ~~~-------------~~~p-------~~P--------------~~l----v~~~~v~~-i~~L~raq~~~~i~~~~q~i 441 (516) T protein:vir:10 401 GLL-------------EAGD-------SFT--------------SDL----VDPVIITG-IEALGRMAELDKLANFAQYM 441 (516) T ss_pred HHH-------------hhCC-------CCC--------------hhh----cCcceehh-HHHHHHHHHHHHHHHHHHHH Confidence 211 0011 000 011 11222222 22334555566666666655 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhcccc--ccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPG--ILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQA 634 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~ 634 (688) ....++ .+.++...++ ++..+.+....+.. .....+.- .+..++++ +++...+++..+ .++ T Consensus 442 ~~~~q~-~p~v~d~id~---d~~~~~~a~~~gvp~~~irs~eev---------~~~r~~~~---~~q~~~~~~~~~-~~~ 504 (516) T protein:vir:10 442 SLPLQW-PEPVLAAVKW---PDYMDWVRGQISAELPFLKSAEEM---------EQEQEAQM---QAQQAQMLEEGV-AKA 504 (516) T ss_pred HHHhcC-ChHHHhhcCH---HHHHHHHHHHhCCChhccCCHHHH---------HHHHHHHH---HHHHHHHHHHHh-hhc Confidence 443322 2333333333 34343333332211 11100000 00000000 000001111111 111 Q ss_pred HHHHHHHHHHHH Q lcl|NC_020488. 635 DMAMAQAKTAEA 646 (688) Q Consensus 635 e~~~~q~~~~~~ 646 (688) .......+.++. T Consensus 505 ~~~~~~~~~~~~ 516 (516) T protein:vir:10 505 VPGVIQQELKEA 516 (516) T ss_pred ccchhhhhhhcC Confidence 111111111111 No 89 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.77 E-value=2e-16 Score=106.65 Aligned_cols=502 Identities=8% Similarity=-0.033 Sum_probs=242.7 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhC-----CcceE Q lcl|NC_020488. 
19 ILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQN-----RPAIQ 93 (688) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~-----r~~~~ 93 (688) ...++..+|..-. .+.|...|++..+|..-.-..+..-.......+ ..-..-...+++..+.+... ++=++ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~--~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEH--DFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCC--CccchHHHHHHHHHHHHHhhhcCCCCcccc Confidence 3334555554332 567888898888887532111100000001111 12233333444444444432 22223 Q ss_pred EEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeE Q lcl|NC_020488. 94 VHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCI 173 (688) Q Consensus 94 v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~ 173 (688) +.+.+....... .......+ -.+.-+..+..+...+..|++..+...++.+.+..|++++.+ + ++ + ..+ T Consensus 77 l~~~d~~~~~~~-~~~~~~~~-v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~--~---~~---~-~~~ 145 (510) T protein:vir:63 77 SELTDAIRREAD-SRDTDITE-VTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--D---SD---A-ATV 145 (510) T ss_pred cCCChHHhhccc-ccchhHHH-HHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--c---CC---C-cEE Confidence 333221100000 00000000 111223455556666778999999999999999999987554 2 12 1 123 Q ss_pred EEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeeccee Q lcl|NC_020488. 174 KSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRK 253 (688) Q Consensus 174 ~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~ 253 (688) ..+ +..++++.-++.- ...-++++..|+..+|-+.|+...... ... ....+.|.|+.+-++.+. 
T Consensus 146 ~~~-pl~~y~v~~d~~G----~vd~i~rr~~~t~~~l~e~~~~~~~~~---~~~-----~~~~~~v~v~~~V~~~~~--- 209 (510) T protein:vir:63 146 VAW-SLRSYAVRRDATG----RWMDIVLKQRYKSKDLDEEYKQDLMRA---GRN-----LSGSGSVDLYTHVQRKKG--- 209 (510) T ss_pred EEE-EcceeEEeeCCCc----CeeEEEeeeeccHHHHhHHhhhhhhcc---ccc-----cCCCcceEEEEEEEeecC--- Confidence 333 3455665544321 223378899999999987775432211 100 011233555544443211 Q ss_pred eeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccc Q lcl|NC_020488. 254 LLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYY 333 (688) Q Consensus 254 ~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g 333 (688) +....+.||+-. .|..+.. .+-|++..+||+|+.+ ...+|..|| T Consensus 210 --------------------------------~~~~~~sv~~e~-dg~~~~~-~~~~~~~e~P~~~~Rw--~~~~ge~YG 253 (510) T protein:vir:63 210 --------------------------------TAMEYAELYHEI-DGVRVGK-EGRWPIHLCPYIVPTW--NLAPGEHYG 253 (510) T ss_pred --------------------------------CCceEEEEEEEe-cCceecc-ccccccccCceeeeee--eecCCCccc Confidence 011223333333 3444332 2456678999997644 467999999 Q ss_pred cchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceec--CCCcch Q lcl|NC_020488. 334 RGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRD--MPASMP 411 (688) Q Consensus 334 ~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 411 (688) .|.+....+--+.+|++....+.......++.++++++.+.+.+.+.. ..+|.++. +.. 
+.+..+ .+..-- T Consensus 254 rgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~---~~~g~~v~--g~~--~~v~~~~~~~~~d~ 326 (510) T protein:vir:63 254 RGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEMGDYVP--GGA--EAVRAYERGDYNKM 326 (510) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhcc---CCCceeec--CCc--ccceeeecCcccch Confidence 999999999999999999999999999999999999888876654432 22344432 111 223222 223334 Q ss_pred HHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCcceE Q lcl|NC_020488. 412 AAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRVYDSDRV 490 (688) Q Consensus 412 ~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~~~~~r~ 490 (688) +.....++...+.|....=+ + ....++...|++.|..+.+.....|...+.++. .+...+.+..+.++.... + T Consensus 327 ~~~~~~i~~~~~rI~~af~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g----l 400 (510) T protein:vir:63 327 AAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL----L 400 (510) T ss_pred HHHHHHHHHHHHHHHHHHHh-h-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc----C Confidence 55567778788888776311 1 111234457999999999999899998888875 566777777666654321 0 Q ss_pred EEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHh Q lcl|NC_020488. 491 LRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKN 570 (688) Q Consensus 491 ~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~ 570 (688) + .+-+ .++. -.+ -++ -.+--|.+..+.+..+.+.+....+. .+. 
T Consensus 401 ~------------p~p~-------------~~~~---~~~--v~~-is~Laraq~~~~l~~~~q~l~~~~~~-----aq~ 444 (510) T protein:vir:63 401 Q------------GLIT-------------KQHK---PAI--ETG-LPALSRSAAVQSMLNASQVIAGLAPI-----AQL 444 (510) T ss_pred C------------CCCc-------------hhcc---cce--ecc-hhHHHHHHHHHHHHHHHHHHHHhcCc-----hhh Confidence 0 0000 0000 011 111 11222344444444444443322221 112 Q ss_pred cCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 571 MDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKL 650 (688) Q Consensus 571 ~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~ 650 (688) .+.-+.+++.+.+....+-........+++.++..+++.+++++++++++.+ ..++ .++ T Consensus 445 ~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~--------~~~a-------------~~~ 503 (510) T protein:vir:63 445 DPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETL--------LEGA-------------SDM 503 (510) T ss_pred hccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHH-------------Hhh Confidence 2223567777777655543111110000000000000000000000000000 0000 000 Q ss_pred HHHHHHH Q lcl|NC_020488. 651 AEIEQAA 657 (688) Q Consensus 651 ~~~~~~a 657 (688) ....... T Consensus 504 ~~~~~g~ 510 (510) T protein:vir:63 504 TNALAGV 510 (510) T ss_pred cccccCC Confidence 0111110 No 90 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.77 E-value=1.2e-17 Score=113.25 Aligned_cols=466 Identities=10% Similarity=0.003 Sum_probs=229.5 Q ss_pred cCCCCccchHHHHHHH----HHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ceeehhHHHHHHHH Q lcl|NC_020488. 
8 IKTRDDDSQEAILQEI----RERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP--CLTLNKLPQYVDQV 81 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p--~~~~N~i~~~i~~i 81 (688) |+-- ..++++..+ .+.+....+.+........+-.+||.|.| +-.....+..+++ .+++|..+.+|+.. T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~--~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~ 75 (499) T protein:vir:10 1 MAVV---IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQ--EIEKHEFDNATVEAANVMVNHAKYITDMN 75 (499) T ss_pred Cccc---hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc--chhcCCcCcCCCCcceeecchHHHHHHHH Confidence 1111 111121111 22233333444445556667789999975 1111111222333 56789999999999 Q ss_pred HHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 82 LGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 82 ~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) ++++..+.+.+.+ - |.+..+ .+..++..|+++.....+..+++++|.+|..++.+. T Consensus 76 ~~~l~g~p~~~~~--~------------------~~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~ 131 (499) T protein:vir:10 76 VGFMTGNPVKYVA--E------------------KGKNID----DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKK 131 (499) T ss_pred hhhhcccCceeec--C------------------ChhHHH----HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecc Confidence 9999998876543 1 122222 244566778999999999999999999998887653 Q ss_pred ccCCC-----------CCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccc Q lcl|NC_020488. 162 STDDA-----------FDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERG 228 (688) Q Consensus 162 ~~~~~-----------~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~ 228 (688) +.... ....+++..| +|..+| ||.... .-...+++.+... 
T Consensus 132 ~g~~~~~~~~~~~~~~~~~~~~~~~v-~p~~~~~v~~d~~~-----~~~~~~i~~~~~~--------------------- 184 (499) T protein:vir:10 132 TDPISVRDELGNEKLTPNTELKIEVI-DPRATVVVCDDTVE-----HDPLFAVFTQEKK--------------------- 184 (499) T ss_pred cccccccccccccccccccceEEEEE-cccceEEEecCCCC-----cceEEEEEEEEEe--------------------- Confidence 32111 1122334444 444432 221110 0011222222110 Q ss_pred ccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 229 EYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 229 ~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) .......+..+++|.......+. ...+... .+...+.... T Consensus 185 ---~~~~~~~~~~~~iyt~~~i~~~~--~~~~~~~-----------------------------------~~~~~~~~~~ 224 (499) T protein:vir:10 185 ---DLEGNTNGYSITVYMPQRIVEYR--TKTTMEV-----------------------------------SANDPIVYDG 224 (499) T ss_pred ---ecCCCceEEEEEEEeCCeEEEEE--ecCCccc-----------------------------------cCcceecccc Confidence 00011233344555443221111 1000000 0000011222 Q ss_pred CCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCC Q lcl|NC_020488. 309 DWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQ 388 (688) Q Consensus 309 p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 388 (688) |-+.+.+|+|+|. +...|.|.+..++++++.+|...|.+...+.....+.+++....+....+.... ...+ T Consensus 225 ~~~~g~vPvv~~~-------n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~--~~~~ 295 (499) T protein:vir:10 225 ENLFGAVPIIEFR-------NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQR--LKRG 295 (499) T ss_pred cCCCCccceEEec-------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhh--hhhc Confidence 3334666666542 234577999999999999999999999999888888877654333322221111 1122 Q ss_pred ceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
389 SVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSR 468 (688) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~ 468 (688) .++..... ....+.++..+.-..++...++.+...|...|++.+.+.+.-+++.||+|+..+-.............|.. T Consensus 296 ~~~~~~~~-~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~ 374 (499) T protein:vir:10 296 AIEAPPRE-EGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFD 374 (499) T ss_pred ceeccCCC-CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333222 22345666555556777888899999999999988766665455679999998877777777777777777 Q ss_pred HHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHH Q lcl|NC_020488. 469 AIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADS 548 (688) Q Consensus 469 ~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~ 548 (688) +++++.++++.++. +.+. ..++ .++.|.=.+..+....+..+. T Consensus 375 ~l~~~~~li~~~~~----------~~~~--~~d~-------------------------~~i~i~f~~~~p~n~~e~~~~ 417 (499) T protein:vir:10 375 GLRRRLKLIQTIVN----------IKGA--NDDA-------------------------SGCKISLVANIPSNLSDVVNN 417 (499) T ss_pred HHHHHHHHHHHHHh----------ccCC--cccc-------------------------ccceEEeCCCCCCCHHHHHHH Confidence 77777776665432 1111 1111 122222234444444445555 Q ss_pred HHHHHHhhHHHHHHHHHHHHHhcCC-ccHHHHHHHHHhhccccccchhhH-----------H--hhhhhhhhhhHHHHHH Q lcl|NC_020488. 549 LMQFVQAVPAAGGVVLDLIAKNMDW-PGAQDIARRLQKTLPPGILDQDEM-----------E--EAGIEPPQPSPEQQAN 614 (688) Q Consensus 549 l~~~~q~~~~~~~~~~~~~~e~~~~-~~~~ei~~~~~~~~~~~~~~~~~~-----------~--~~~~~~~~~~~~~q~~ 614 (688) ++++.. .+....+++++++ ...++-.+++++............ . ....++...... 
.+ T Consensus 418 ~~kl~g------~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 489 (499) T protein:vir:10 418 VKNADG------IIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAG--SN 489 (499) T ss_pred HHHHhc------cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCc--cc Confidence 554422 1333444555443 334444455543321100000000 0 000000000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 615 MAQAQADMEKAKADTAKAQADMAMAQAK 642 (688) Q Consensus 615 ~~~~q~~~~~~q~e~~~~q~e~~~~q~~ 642 (688) .++- .+..+. T Consensus 490 ~~~~------------------~~~~~~ 499 (499) T protein:vir:10 490 HNQS------------------HRTRAV 499 (499) T ss_pred cccC------------------CCCCCC Confidence 0000 000000 No 91 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.74 E-value=1.9e-16 Score=106.80 Aligned_cols=462 Identities=10% Similarity=0.043 Sum_probs=233.7 Q ss_pred chHHHHHHHHHHHHH---------HHH-----hhhHHHHHHHHHHHhhCCC--CCCHHHHHH-HHhcCCCceeehhHHHH Q lcl|NC_020488. 15 SQEAILQEIRERAAH---------AVT-----CWKHNFDAAQEDISFLAGE--QWPESVRKE-REDEGRPCLTLNKLPQY 77 (688) Q Consensus 15 ~~~~~~~~~~~~~~~---------~~~-----~~~~~r~~~~~~~~~~~G~--Qw~~~~~~~-~~~~g~p~~~~N~i~~~ 77 (688) -.+.+.+.++..+++ ..+ ...+.+....++.+||.|+ .|....... .....+..++.|+-+-+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 334455555555543 111 1233445556678899985 553211000 00112335778999999 Q ss_pred HHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEE Q lcl|NC_020488. 
78 VDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRV 157 (688) Q Consensus 78 i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v 157 (688) ++...++....++.+.+. |.+.++. ++.+++.|++...+..+.++++..|.||+++ T Consensus 81 v~~~a~~l~~ep~~i~~~--------------------d~~~~e~----l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~ 136 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINID--------------------DETAEEF----VLNVLKTNGFTKNMERYIEYGEAMGGFVIKV 136 (499) T ss_pred HHHHHHhhhCCcceEeeC--------------------CHHHHHH----HHHHHhhccHHHHHHHHHHHHhhcCcEEEEE Confidence 999999999888887661 4444444 5555667899999999999999999999999 Q ss_pred EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 158 LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) +++. ++.+.+..| +|..||+=... ..++..+-|+.... + +.+ T Consensus 137 ~~D~------~~~~~i~~v-~a~~~~Pi~~d-~~~~~~~~f~~~~~---~---------------------------~~~ 178 (499) T protein:vir:80 137 YHDG------NKNVKVSFA-TADCMYPLSND-SENVDECLIANSFH---K---------------------------NNK 178 (499) T ss_pred EECC------CCcEEEEEE-cCCceEEEEec-CCCeEEEEEEEEEe---e---------------------------cCe Confidence 9863 256777777 67777631111 12344444332111 1 011 Q ss_pred EEEEEEEEeeeecce-------eeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCC Q lcl|NC_020488. 238 GVRVSEYFYREPVTR-------KLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDW 310 (688) Q Consensus 238 ~v~v~e~~~~~~~~~-------~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~ 310 (688) ..+..|++++..... .++...++..+ |..+... 
.+ + .-++...++ T Consensus 179 ~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~l---------------G~~v~l~------~~-~------~~~~~~~~~ 230 (499) T protein:vir:80 179 YYKLLEWNEWKGEKEEVYTVTTELYQSDDPNEL---------------GGKVSLK------LL-F------NDIEPVVPL 230 (499) T ss_pred EEEEEEEEEecccceeeEEEEEEEEeccCcccc---------------Ccccchh------hh-c------cCcCCceee Confidence 233445443322111 11111111000 0000000 00 0 000000111 Q ss_pred C-CCccceEEEeee--eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhh----c Q lcl|NC_020488. 311 P-GSTIPVAPVLGK--EMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQ----A 383 (688) Q Consensus 311 ~-~~~~P~vp~~~~--~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~----~ 383 (688) + .+..||+.|-.. .....++|+|.|.+..++++.+.+|...|.+.+.+.. ...++.++++.+....+.-.. . T Consensus 231 ~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~ 309 (499) T protein:vir:80 231 PSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYF 309 (499) T ss_pred cCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-cccceecchhhhhccCCCCCCcccCC Confidence 1 133344432111 1112467899999999999999999999999999876 466777777766422111000 0 Q ss_pred ccCCCceeecCcc--cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch-hhHHHHHHHHHHHHHHHH Q lcl|NC_020488. 384 NRKNQSVLRYNAI--PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNE-QSGKAILARQRQGDRGTF 460 (688) Q Consensus 384 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~-~sg~ai~~~~~~~~~~~~ 460 (688) ......+...... .+...++...+.--...+...++.....+....|++....|..++. .||+++....+....... 
T Consensus 310 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~ 389 (499) T protein:vir:80 310 DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKN 389 (499) T ss_pred CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHH Confidence 0001111111111 1122344444433345577888888889999999999999976543 588888877777766777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHH Q lcl|NC_020488. 461 AYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQT 540 (688) Q Consensus 461 ~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s 540 (688) .+...++.+++++.+.++.+..-+-- ..+.. ....+|+|.=..+... T Consensus 390 ~~~~~~~~~l~~l~~~il~~~~~~~~------~~~~~---------------------------~~~~~v~v~f~d~i~~ 436 (499) T protein:vir:80 390 SHSQLIEQGIKEMIVSILEVGKLIKA------YDGDT---------------------------VELDTITVDFDDSIAQ 436 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc------ccCCC---------------------------CCccceEEEeCCCCCC Confidence 77888888888888888876544320 00000 0112233333333333 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHH-HhcCCcc--HHHHHHHHHhhccccccchhhHHhhhhhhhhhhHH Q lcl|NC_020488. 541 QRMEAADSLMQFVQAVPAAGGVVLDLIA-KNMDWPG--AQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPE 610 (688) Q Consensus 541 ~r~~~~~~l~~~~q~~~~~~~~~~~~~~-e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (688) ...+..+.++++... +-+....++ +..+... +++..+++++......+....... 
..+.+ T Consensus 437 d~~~~~~~~~~~~~~----Gi~S~et~l~~~~~~~d~ea~~el~~i~~E~~~~~~~~d~~g~------~ge~e 499 (499) T protein:vir:80 437 DEDTTINRYTTAKNQ----GMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEIPNNDMTGI------FGEEE 499 (499) T ss_pred CHHHHHHHHHHHHHc----CCCCHHHHHhhcCCCChHHHHHHHHHHHHHhhcCCCCCCcccc------CCCCC Confidence 444444444444321 111112222 2222221 223333333222111111000000 00000 No 92 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.74 E-value=2.6e-15 Score=100.51 Aligned_cols=495 Identities=10% Similarity=0.036 Sum_probs=246.3 Q ss_pred HHHHHHHHHH--hhhHHHHHHHHHHHhhCCC--CCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh-----CCcceE Q lcl|NC_020488. 23 IRERAAHAVT--CWKHNFDAAQEDISFLAGE--QWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ-----NRPAIQ 93 (688) Q Consensus 23 ~~~~~~~~~~--~~~~~r~~~~~~~~~~~G~--Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~-----~r~~~~ 93 (688) ++.++...+. ..+.|...|++..+|.--. ..+.+..... .+. ....-+.-...+++..+.+.. +++=++ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~-~~~-~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQA-EVV-EYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccc-ccc-ccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 3333333322 2456888888888876321 1111110100 111 111122333344444444433 233334 Q ss_pred EEeCCccccccccccccccChhhHHHHH------HHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 94 VHPVEANATKDTSKVPNVAGTSDYSLAE------VYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 94 v~pr~~~~~~~~~~~~~~~~~~d~~~Ae------~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) +.+.+... . .....|.+.++ ..+..+...+..|++..+...++.+.+..|.|++.+ +. + . 
T Consensus 79 l~~~d~~~-----~---~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~---~-~ 144 (514) T protein:vir:80 79 IELDDTLQ-----E---LAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYR--EP---G-T 144 (514) T ss_pred cccCchhh-----h---hccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--ec---C-C Confidence 44332110 0 00111222222 244555566678999999999999999999987554 21 1 1 Q ss_pred CcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEee Q lcl|NC_020488. 168 DLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYR 247 (688) Q Consensus 168 ~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~ 247 (688) +. ++.+ +..++++.-++. . ...-++++.+|+..+|-+.|+...... ... ....+.|.|+.+.++ T Consensus 145 -~~--~~~~-pl~~y~v~~d~~-G---~v~~i~rr~~~~~~~l~~~~~~~~~~~-------~~~-~~~~~~v~v~~~v~~ 208 (514) T protein:vir:80 145 -GK--MLVW-TMQSYTVRRTSH-G---DPAVVVLRQQMPFRELTPEIQADAQAK-------QIA-KRDSDKCDLYTVIEW 208 (514) T ss_pred -Cc--EEEE-EcCeEEEeeCCC-c---CeEEEEeeeeecHHHhhhhhhhhhhhh-------hcc-CCCCCceEEEEEEEe Confidence 12 2333 344566554432 1 122377888999998877664332111 000 112345666666655 Q ss_pred eecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeecc Q lcl|NC_020488. 248 EPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVI 327 (688) Q Consensus 248 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~ 327 (688) .+.. ++ .++ ++|....|..++. .+-|++.++||+++.+ ... T Consensus 209 ~~~~-------~~----------------------------~~~-sv~~e~~g~~i~~-es~y~~~e~P~i~~Rw--~~~ 249 (514) T protein:vir:80 209 QPTP-------NG----------------------------KRC-AVWHELEGKRVGP-ESSYPAHLCPYVPVAW--NVP 249 (514) T ss_pred ecCC-------CC----------------------------eEE-EEEEeccceeecc-cCccccccCCeeeeee--Eec Confidence 4321 00 111 2223334444443 2456678899997644 467 Q ss_pred CCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCC Q lcl|NC_020488. 
328 GDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMP 407 (688) Q Consensus 328 ~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (688) +|..||.|.+....+--+.+|++....+.....+.++.++++.+.+.+...+.. ..+|.++. + . .+.+..++. T Consensus 250 ~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~---~~~g~~v~-g--~-~~~v~~~~~ 322 (514) T protein:vir:80 250 DGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRD---AETGDFVP-G--Q-VGSVASYER 322 (514) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcc---cCCceeec-C--C-Cccceeeec Confidence 999999999999999999999999999999999999999999888777655432 22333432 1 1 122333322 Q ss_pred --CcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_020488. 408 --ASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQILIELIPRV 484 (688) Q Consensus 408 --~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~~~li~~~ 484 (688) ..--+.....++...+.|.+..=+. ..+.++...|++.|..+.+.-...|...+.+|. .++..+.+..+.++... T Consensus 323 ~~~~d~~~~~~~i~~~~~rI~~aFml~--~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~ 400 (514) T protein:vir:80 323 GDYNKIAQASASVESIVMRLNRAFMYT--GQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRG 400 (514) T ss_pred CcccchHHHHHHHHHHHHHHHHHHhhh--ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 2233455667777777776643111 122445557999999999998888888888875 45555555555554321 Q ss_pred cCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_020488. 485 YDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVL 564 (688) Q Consensus 485 ~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~ 564 (688) -. | .+-.+ ..++ +.+.+.++ -....|.+..+.+..+++.+..+.+.. 
T Consensus 401 ~~--------g-----~lP~~--------------p~~l----~~~~~vs~-la~l~r~~~~~~l~~~~~~i~~l~~~~- 447 (514) T protein:vir:80 401 NG--------G-----MLLGI--------------AQGV----YRPSIITG-IPALTRNIETANILRATQEASAIVPAL- 447 (514) T ss_pred cc--------C-----CCCCC--------------Cchh----hcceeeec-HHHHHHHHHHHHHHHHHHHHHHHhccc- Confidence 00 0 00000 0111 22233222 334566777777777777654443322 Q ss_pred HHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_020488. 565 DLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADT-AKAQADMAMA 639 (688) Q Consensus 565 ~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~-~~~q~e~~~~ 639 (688) +. .++.-+.+++.+.+....+-........++..+ +.++.+++++|++++.++... +.+++-..-. T Consensus 448 p~---v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 448 VQ---LSKRFDPEKLVERIFANNSVDLSTLSKDPDVVA------AEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred hh---hhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 22 233445566666665444432111111100000 000000000000000000000 0000000000 No 93 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.74 E-value=7.4e-17 Score=109.02 Aligned_cols=465 Identities=12% Similarity=0.010 Sum_probs=214.6 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHH----HHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESV----RKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~----~~~~~~~g~p~~~~N~i~~ 76 (688) ..||-+ +.+....++.++...+.. -+..-.+-.+||.|+|.-... 
...++ .-.++.|..+- T Consensus 4 ~i~~~~-----~~~~~~~~~~~L~~~~~~-------~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~---~~~~~~n~~~~ 68 (485) T protein:vir:24 4 PLPGQE-----EIADPAIARDEMVSAFED-------QNQNLRSNTSYYEAERRPEAIGVTVPVQMQ---SLLAHVGYPRL 68 (485) T ss_pred CCCCCC-----cccchHHHHHHHHHHHHH-------HHHHHHHHHHHHhccCchhhcCcccchhhh---hhhhccchHHH Confidence 555554 444444444444443321 122333345899999853211 11111 11355799999 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +|+..++.+.-+. + ..+- +.+ ....++.++..|+++.....+..++++.|.+|+. T Consensus 69 ivd~~~~~l~~~g--~-~~~~------------------~~~----~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 123 (485) T protein:vir:24 69 YVDSIAERQAVEG--F-RLGD------------------ADE----ADEELWQWWQANNLDIEAPLGYTDAYVHGRSYIT 123 (485) T ss_pred HHHHHhhhhccCc--e-ecCC------------------Cch----hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 9998888774331 1 1111 111 2233455677899999999999999999999998 Q ss_pred EEEeeccCCC--CCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccccc Q lcl|NC_020488. 157 VLTKYSTDDA--FDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSW 232 (688) Q Consensus 157 v~~~~~~~~~--~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 232 (688) |+.+.+.... ..+.+++..+ +|.++ +||+...+ ...+.+.+-+. T Consensus 124 v~~~~~~~~~~~~~~~~~i~~~-~p~~~~~i~D~~~~~------~~~~~~~~~~~------------------------- 171 (485) T protein:vir:24 124 ISRPDPQIDLGWDPNVPLIRVE-PPTRMYAEIDPRIGR------PAKAIRVAYDA------------------------- 171 (485) T ss_pred EecCCcccccccCCCcceEEEe-ccceeEEEeeCCcCc------eeEEEEEEEee------------------------- Confidence 8876443322 2355666655 67775 57765332 11222221100 Q ss_pred CCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCC Q lcl|NC_020488. 
233 WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPG 312 (688) Q Consensus 233 ~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~ 312 (688) +...+...++|..... +.+...+| ...++...|-+. T Consensus 172 --~~~~~~~~~~y~~~~~--~~~~~~~~----------------------------------------~~~~~~~~~h~~ 207 (485) T protein:vir:24 172 --EGNEIQAATLYTPNET--FGWFRAEG----------------------------------------EWVEWFSDPHGL 207 (485) T ss_pred --cCCeEEEEEEEcCCcE--EEEEecCC----------------------------------------ceEeecccccCC Confidence 0112333334432210 00000111 111222234455 Q ss_pred CccceEEEeeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcc---hHHH-HhhcccCC Q lcl|NC_020488. 313 STIPVAPVLGKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEG---YEEE-WNQANRKN 387 (688) Q Consensus 313 ~~~P~vp~~~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~---~~~~-~~~~~~~~ 387 (688) +.+|+|||... +..+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++.....+. .++- ..-..... T Consensus 208 g~vPvv~f~n~--~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~ 285 (485) T protein:vir:24 208 GAVPVVPLPNR--TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYL 285 (485) T ss_pred CcccEEEeccC--cccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcc Confidence 77888887543 2346678887775 6899999999999999998887777765543111111 1000 00011123 Q ss_pred CceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
388 QSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) +.++...+ +...+...+..+ ...+...+......+-.++++++..+|..+ |..||.|+..+...-..........| T Consensus 286 ~~i~~~~~--~~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f 362 (485) T protein:vir:24 286 ARILAFED--AEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIF 362 (485) T ss_pred cceeccCC--CCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 33443322 112222222222 223444444444444455788888988654 55799999988877777777777778 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..++++++++++.+... - + ...++ .++.|.=.+.......+.. T Consensus 363 ~~~l~~~~~l~~~~~~~-~---------~--~~~d~-------------------------~~i~v~f~~~~~~s~~~~a 405 (485) T protein:vir:24 363 GGAWEEAMRLAYRLMKG-G---------D--VPPDM-------------------------LRMETVWRDPSTPTYAAKA 405 (485) T ss_pred HHHHHHHHHHHHHHhcC-C---------C--Ccccc-------------------------ceeeEEecCCCCCCHHHHH Confidence 88888887777653211 0 0 00010 0111111111112233334 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccc---hhhHHhhhhhhhhhhHHHHHHHHHHHHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILD---QDEMEEAGIEPPQPSPEQQANMAQAQADME 623 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~ 623 (688) +...++.+... ..+....+++++++.. +-.+.+++........ ............+.++..-......++. . 
T Consensus 406 d~~~kl~~~g~--~~~s~et~~~~l~~~~--d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~-~ 480 (485) T protein:vir:24 406 DAATKLYGNGQ--GVIPRERARKDMGYSI--AEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAI-E 480 (485) T ss_pred HHHHHHHhccc--ccCCHHHHHhhCCCCH--hHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCC-C Confidence 44444443211 1122334455655532 1112222111100000 0000000000000000000000000000 0 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_020488. 624 KAKADTAKAQADMA 637 (688) Q Consensus 624 ~~q~e~~~~q~e~~ 637 (688) -.+.| T Consensus 481 ---------~~~~a 485 (485) T protein:vir:24 481 ---------GGDSA 485 (485) T ss_pred ---------CCCCC Confidence 00000 No 94 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.74 E-value=7.8e-18 Score=114.38 Aligned_cols=469 Identities=9% Similarity=0.027 Sum_probs=208.6 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC----CHHHHHHHHhcCCCceeehhHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW----PESVRKEREDEGRPCLTLNKLPQYVDQVLG 83 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw----~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g 83 (688) |.|. .+++.++...+. ..+....+-.+||+|+|= +......++ .-.++.|..+-+|++.++ T Consensus 1 ~~t~-----~d~i~~L~~~~~-------~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~---~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTTY-----HEHVERLQGLLA-------RDLPNLLEAEAYRNGTRRLKTIGIGAPPELA---YLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCCH-----HHHHHHHHHHHH-------HHHHHHHHHHHHHhccccchhcccccchhhh---hhhhhcchHHHHHHHHHh Confidence 3322 335555544332 234444556789999762 111111111 112568999999999888 Q ss_pred HHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeecc Q lcl|NC_020488. 
84 DQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYST 163 (688) Q Consensus 84 ~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~ 163 (688) ++.-+ -.+.+- |.+ .+..+..+++.|+++..+..++.++++.|.+|+.|+-.... T Consensus 66 ~l~~~---g~~~~~------------------d~~----~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~ 120 (480) T protein:vir:78 66 RLDIE---GFRISE------------------DSE----GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE 120 (480) T ss_pred hhccC---ceecCC------------------Cch----hHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccc Confidence 76422 111111 222 23445667788999999999999999999998877532111 Q ss_pred CCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEE Q lcl|NC_020488. 164 DDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRV 241 (688) Q Consensus 164 ~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 241 (688) ....++.+++..+ +|..+ +|||.... ...+. .+.+...+ +...+.. T Consensus 121 ~~d~~~~~~i~~~-~p~~~~~i~D~~~~~----~~~~~-i~~~~~~d--------------------------~~~~~~~ 168 (480) T protein:vir:78 121 SGDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRD--------------------------DVAVPDR 168 (480) T ss_pred cCCCCCeeEEEEE-cccceEEEEcCCCcc----ceEEE-EEEEEeec--------------------------CCcceEE Confidence 1223456666655 77775 57775332 11222 22222110 1112223 Q ss_pred EEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEe Q lcl|NC_020488. 242 SEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVL 321 (688) Q Consensus 242 ~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~ 321 (688) .++|....... +...++.. .+..+..++.|.+.+.+|+|||. 
T Consensus 169 ~~~y~~~~~~~--~~~~~~~~------------------------------------~~~~~~~~~~~~~~g~vPvv~f~ 210 (480) T protein:vir:78 169 ATLYLPDETVP--LRRNGGLN------------------------------------DQWVVDGDVIKHGLGVVPVVPLT 210 (480) T ss_pred EEEEeCCeEEE--EEecCCCc------------------------------------ccccccccccccCCCCcceEEee Confidence 34443221100 00000000 00001112234445677888765 Q ss_pred eeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch-HHHH-hhcccCCCceeecCcccc Q lcl|NC_020488. 322 GKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY-EEEW-NQANRKNQSVLRYNAIPG 398 (688) Q Consensus 322 ~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~-~~~~-~~~~~~~~~~~~~~~~~~ 398 (688) .. ...+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++.....+.. ++.. .......+.++...+ + T Consensus 211 n~--~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 286 (480) T protein:vir:78 211 ND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS--E 286 (480) T ss_pred cc--cccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCCC--C Confidence 43 3456678888875 59999999999999999998877777654432111111 0000 001111223332221 1 Q ss_pred cccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 399 VDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQIL 477 (688) Q Consensus 399 ~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~ 477 (688) ...+...+.. 
....+.+.+......+-.+||+++..+|..+ |..||.|+..+-..-........+.|..+++++++++ T Consensus 287 ~~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~ 365 (480) T protein:vir:78 287 AAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIA 365 (480) T ss_pred CceEEecCcc-CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122222222 2344555566666666667888889998654 4579999988876666666666666667777766655 Q ss_pred HHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhH Q lcl|NC_020488. 478 IELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVP 557 (688) Q Consensus 478 ~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~ 557 (688) +. +... ....++. ++.|.=.+.......+..+.+.++.+... T Consensus 366 ~~----~~~~---------~~~~~~~-------------------------~i~v~w~~~~~~s~~~~ad~~~kl~~~g~ 407 (480) T protein:vir:78 366 MQ----IMGR---------EVTEEYT-------------------------RLETVWRDPSTPTVAAKADAVSKLYANGQ 407 (480) T ss_pred HH----HcCC---------Cccccce-------------------------eeeEEecCCCCCCHHHHHHHHHHHHHhcc Confidence 43 3221 0111111 11111111111112233444454443211 Q ss_pred HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhh-hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 558 AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEA-GIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 558 ~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) . .+-...+++++++. ++-.+.+.+...+........... ..+...+++....-. .. . +++. 
+...+ T Consensus 408 ~--~~s~et~~~~lg~~--~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---~--~~~~--~~~~~ 474 (480) T protein:vir:78 408 G--PIPKEQARIDLGYT--ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE--TK---T--ETQT--SPSGF 474 (480) T ss_pred c--CCCHHHHHhcCCCC--HhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCC--CC---C--ccCC--CcccC Confidence 1 11223445555553 221111211100000000000000 000000000000000 00 0 0000 00000 Q ss_pred HHHHHH Q lcl|NC_020488. 637 AMAQAK 642 (688) Q Consensus 637 ~~~q~~ 642 (688) .+..+. T Consensus 475 ~~~~~~ 480 (480) T protein:vir:78 475 NRTKTR 480 (480) T ss_pred CCcCCC Confidence 000000 No 95 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.74 E-value=2.4e-16 Score=106.21 Aligned_cols=407 Identities=13% Similarity=0.084 Sum_probs=215.0 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHH----HHHHHHhcCCCceeehhHHHHHHHHHHHHH Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPES----VRKEREDEGRPCLTLNKLPQYVDQVLGDQR 86 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~ 86 (688) .+. ..+..+...+.. .+..-.+-.+||.|+|.... .-..++...+ ++.|..+..|+.+.+-.. T Consensus 1 m~~----~~i~~L~~~~~~-------~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vd~~a~rl~ 67 (422) T protein:vir:97 1 MNY----MGMGYLRRKLAL-------FKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYR--SVLEWTAKGVDSLADRII 67 (422) T ss_pred CCh----HHHHHHHHHHHH-------HHHHHHHHHHHHhcCCChhhcCccccHHHHHHHH--hhcchhHHHHHHHHhccc Confidence 111 133333332221 23345556789999876422 2233333333 345888887776654221 Q ss_pred hCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCC Q lcl|NC_020488. 
87 QNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDA 166 (688) Q Consensus 87 ~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~ 166 (688) +.- +.-.|.+ +..+++.|+++.....+..++++.|.+++-|+-+. T Consensus 68 -----~~G-----------------f~~~d~~--------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~----- 112 (422) T protein:vir:97 68 -----FRE-----------------FTNDDFN--------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGA----- 112 (422) T ss_pred -----cce-----------------eeCCchh--------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCC----- Confidence 110 0011222 34567789999999999999999999998886431 Q ss_pred CCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEE Q lcl|NC_020488. 167 FDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEY 244 (688) Q Consensus 167 ~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~ 244 (688) .++.+.+. +.+|..+ +|||..+.+ . +....|- . ..+. ..+.. . T Consensus 113 ~~~~p~i~-~~sp~~~~~i~D~~~~~~-----~-~a~~~~~-~-------------------------~~~~-~~~~~-~ 157 (422) T protein:vir:97 113 EDGLPKMQ-VIEASKATGILDPTTFLL-----T-EGYAILE-S-------------------------DSNG-NPTLE-A 157 (422) T ss_pred CCCeeEEE-EechhhEEEEEeCCCCcc-----e-eeEEEEE-e-------------------------cCCC-cEEEE-E Confidence 12445554 4477764 578753321 1 1111110 0 0000 11111 1 Q ss_pred EeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeee Q lcl|NC_020488. 245 FYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKE 324 (688) Q Consensus 245 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~ 324 (688) |+... .+....++ +. .-..++| .+..|+|||+.. 
T Consensus 158 ~~~~~---~~~~~~~~---------------------------------------~~-~~~~~~~--~g~vPvv~~~n~- 191 (422) T protein:vir:97 158 YFTDK---DIWYYPKK---------------------------------------GK-PYNIKNP--TGHPLLVPIIHR- 191 (422) T ss_pred EEcCc---eEEEEcCC---------------------------------------Cc-cccccCC--CCCcceEEeccc- Confidence 11110 00000000 00 0011333 467899998754 Q ss_pred eccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhc--chHHHHhhcccCCCceeecCccccccc Q lcl|NC_020488. 325 MVIGDKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIE--GYEEEWNQANRKNQSVLRYNAIPGVDR 401 (688) Q Consensus 325 ~~~~~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 401 (688) +..+.++|.|-| +.++++|+.+|+.++.++-.....+.++..+- |.-. ...+.|. ...+.++.......+.. T Consensus 192 -~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~de~~~~ 266 (422) T protein:vir:97 192 -PDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVL-GMDPDAKPMEKWR---ATVSTLLEISKDEDGDK 266 (422) T ss_pred -CCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-ccCcccccCchhh---hhhhhhhccCCCCCCCc Confidence 346778988866 88999999999999999998888777765542 2110 0011111 11123333322222233 Q ss_pred cee--cCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
402 PQR--DMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILI 478 (688) Q Consensus 402 ~~~--~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~ 478 (688) +.+ .+..++ ..+...+......+-.+||+++..+|..++ ..||.||.+....-........+.|..+.++++++++ T Consensus 267 ~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~ 345 (422) T protein:vir:97 267 PTVGQFTTASM-APFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAV 345 (422) T ss_pred ceeeecCCCCh-hHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 333333 345666666667777778999999997665 4799999877766666667777777778888777776 Q ss_pred HHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecc---cCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 479 ELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAG---PSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 479 ~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~---~~~~s~r~~~~~~l~~~~q~ 555 (688) .+.-..-+ ....+ +++.+.=. |.......+..+++.++.+. T Consensus 346 ~~~~~~~~-----------~~~~~-------------------------~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a 389 (422) T protein:vir:97 346 CLRDEFPY-----------LRNQF-------------------------MDTVIKWEPLFEADANMLTLVGDGAIKLNQA 389 (422) T ss_pred HHhcCCcc-----------cchhh-------------------------ccceEEEccCCCCChHHHHHHHHHHHHHHhh Confidence 54422100 00011 11111111 22233345566777777776 Q ss_pred hHHHHHHHHHHHHHhcCCccHHHHHHHHHhhcccc Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPG 590 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~ 590 (688) .|.+.. 
-..+++++++...+.-..++.+....+ T Consensus 390 ~~~~~~--~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 390 IPGFMD--ADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred cccccc--HHHHHHHcCCCchhHHHHHHHhhhccC Confidence 544322 235567778866655555554432222 No 96 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.73 E-value=2.7e-16 Score=105.97 Aligned_cols=460 Identities=10% Similarity=0.026 Sum_probs=240.3 Q ss_pred HHHHHHHHHHHHHH------------------HHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcC----CCceeehhH Q lcl|NC_020488. 17 EAILQEIRERAAHA------------------VTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEG----RPCLTLNKL 74 (688) Q Consensus 17 ~~~~~~~~~~~~~~------------------~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g----~p~~~~N~i 74 (688) -.+...++..|.+- .....+.+....++.++|.|+.+.- ......| +.....|+- T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l---~~~~~~~~~~~~~~~slnl~ 77 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQV---THKNSYGDTQKHELQSVNVT 77 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccc---cccccCCCccccceeecchH Confidence 22333333333331 1112334445566778999864311 1111122 234667888 Q ss_pred HHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCce Q lcl|NC_020488. 75 PQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGW 154 (688) Q Consensus 75 ~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~ 154 (688) +.+++...+......+.+.+. |.+. 
++.++.+++.|++...+..+.++++..|.|+ T Consensus 78 ~~i~~~~A~ll~~e~~~i~~~--------------------d~~~----~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~ 133 (505) T protein:vir:79 78 KLASAKLASLIFNEQCQVTVS--------------------DETA----NDFLDDVFQQNDFYTTFEEKLEEWIALGSGC 133 (505) T ss_pred HHHHHHHHhhhcCCCceeecC--------------------ChHH----HHHHHHHHHhccHHHHHHHHHHHHhhcCCeE Confidence 999999999888887777661 3333 4445666678899999999999999999999 Q ss_pred EEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCC Q lcl|NC_020488. 155 LRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWT 234 (688) Q Consensus 155 ~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~ 234 (688) +++++| .+.+.+..| ++..|++=.. ...+..++-++.+...... . T Consensus 134 ~k~~~D-------~~~~~i~~v-~ad~~~P~~~-d~~~~~~~a~~~~~~~~~~---------~----------------- 178 (505) T protein:vir:79 134 VRPYVD-------SGKIKLAWA-TADQVYPLQA-DTNQVNELAIASRTTEVEN---------H----------------- 178 (505) T ss_pred EEEEEe-------CCceEEEEE-cCCeeEEEEE-cCCCeEEEEEEEEEEEecC---------C----------------- Confidence 999986 135677777 6777763111 1123444444433221110 0 Q ss_pred CCCEEEEEEEEeeeeccee----eeeccC----CceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 235 NEEGVRVSEYFYREPVTRK----LLLLSD----GRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 235 ~~~~v~v~e~~~~~~~~~~----~~~~~~----g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) ...-.++.|++++.....+ ++...+ |..+......+ | .. |+. T Consensus 179 ~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~------------------------~-----~~-l~~ 228 (505) T protein:vir:79 179 RTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQ------------------------Y-----EG-LEP 228 (505) T ss_pred cceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccc------------------------c-----cc-cCc Confidence 0012345565554322221 111111 11111000000 0 00 000 Q ss_pred CCCCC-CCccceEEEee--eeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHH--- Q lcl|NC_020488. 
307 PVDWP-GSTIPVAPVLG--KEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEW--- 380 (688) Q Consensus 307 ~~p~~-~~~~P~vp~~~--~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~--- 380 (688) ...+. .+..+|+.|-. ......++|+|.|++..+++..+.+|...|++.+.+.+ .+.++.+++..+...-.-. T Consensus 229 ~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~ 307 (505) T protein:vir:79 229 QVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGGQA 307 (505) T ss_pred ceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCccc Confidence 00111 11222322100 01112367899999999999999999999999999876 5667788777763221100 Q ss_pred -h-hcccCCCc---eeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch-hhHHHHHHHHHH Q lcl|NC_020488. 381 -N-QANRKNQS---VLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNE-QSGKAILARQRQ 454 (688) Q Consensus 381 -~-~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~-~sg~ai~~~~~~ 454 (688) . ....+++. +.......+...++...+.--...++..++...+.+....|++....|.+++. .||++|....+. T Consensus 308 ~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~ 387 (505) T protein:vir:79 308 SETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQ 387 (505) T ss_pred ccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhH Confidence 0 00011111 11111222233455555443345578888888888889999999999976654 589999888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEec Q lcl|NC_020488. 455 GDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKA 534 (688) Q Consensus 455 ~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~ 534 (688) .......+...+..+++++.+.++.+..-+.-.. .+ ... 
.......++|+|+= T Consensus 388 l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~--------~g-----------------~~~--~~~~~~~~~i~v~f 440 (505) T protein:vir:79 388 TYQTRSSYITQVEKTIKALTYAILELASVPSFYA--------DG-----------------QAR--WTGDVDSLDITINF 440 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------cc-----------------ccc--ccCCCCceeEEEEe Confidence 7778888888888999998888888766553100 00 000 00001235566665 Q ss_pred ccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHH-HHHhcCCcc--HHHHHHHHHhhccccccchhhHHhh Q lcl|NC_020488. 535 GPSYQTQRMEAADSLMQFVQAVPAAGGVVLDL-IAKNMDWPG--AQDIARRLQKTLPPGILDQDEMEEA 600 (688) Q Consensus 535 ~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~-~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~~~ 600 (688) +.+.....++..+.++++.+. +-+.... +.+.-++.. +++..+++++......+........ T Consensus 441 ~d~i~~d~~~~~~~~~~~v~~----Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 441 NDGVFVDQESKRAADLQAVQA----QVMPKKQFLMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCCCCCCHHHHHHHHHHHHHc----CCCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 556555566666656555432 1111122 223333321 2333444433221111111000000 No 97 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.73 E-value=2.2e-16 Score=106.37 Aligned_cols=396 Identities=15% Similarity=0.082 Sum_probs=211.3 Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHH----HHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCc Q lcl|NC_020488. 15 SQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPES----VRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRP 90 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~ 90 (688) -...++.++...+.. .+..-.+-.+||+|+|.-.. 
.-..++..-+ ++.|..+.+|+.+.+...=+- T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~iVds~a~rl~~~G- 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYR--SILGWCAKGVDSLADRLVFRE- 70 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHh--hhcchhHHHHHHhHhhcccCc- Confidence 223344554443322 22334445689999986422 2222222222 456998888887755322110 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) | .-.|. -+..+++.|+++...+.+..++++.|.+++-|+-+ .++. T Consensus 71 ---f------------------~~~d~--------~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~------~dg~ 115 (409) T protein:vir:94 71 ---F------------------ENDDF--------TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG------ENDA 115 (409) T ss_pred ---c------------------cCCch--------HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC------CCCc Confidence 1 01122 14567788999999999999999999998877532 1355 Q ss_pred eeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeee Q lcl|NC_020488. 171 LCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYRE 248 (688) Q Consensus 171 ~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~ 248 (688) +.+..+ +|..+ +|||..+.+ ..+.+.|-. + . ....+ ...+|.. T Consensus 116 ~~i~~~-sp~~~~~i~D~~~~~~------~~a~~~~~~---------d------------~-----~~~~~-~~~~~~~- 160 (409) T protein:vir:94 116 VRLQVI-EAVNATGIIDPITGLL------TEGYAVLER---------D------------E-----NNNVV-LEAHFLP- 160 (409) T ss_pred eEEEEe-ccceEEEEEecCCCce------eeeEEEEEe---------c------------C-----CCceE-EEEEEec- Confidence 666554 67664 678754321 111221110 0 0 00011 1111111 Q ss_pred ecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccC Q lcl|NC_020488. 
249 PVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIG 328 (688) Q Consensus 249 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~ 328 (688) +.+++.. ..+..+...++ +.+..|+|||+.. +.. T Consensus 161 -----------~~~~~~~-------------------------------~~~~~~~~~~n--~~g~vPvV~f~n~--~~~ 194 (409) T protein:vir:94 161 -----------DRTDYYY-------------------------------RDSRNNISIAN--PTGHPLLVPIIHR--PDA 194 (409) T ss_pred -----------CcEEEEE-------------------------------ecCceeEeeeC--CCCCcceEEeccc--ccc Confidence 0111100 00000111233 3467899987653 345 Q ss_pred CcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeec---hhhhcchHHHHhhcccCCCceeecCcccccccce- Q lcl|NC_020488. 329 DKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAP---AESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQ- 403 (688) Q Consensus 329 ~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 403 (688) +.++|.|-| +.++++|+.+|+.++.+.-.....+.++..+- ++. +.. +.|.. ..+.++.......+..+. T Consensus 195 ~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~-~~~-~~~~~---~~~~i~~~~~d~dg~~~~v 269 (409) T protein:vir:94 195 VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA-EPM-ETWKA---TVSSMLQFTKDEDGDKPTL 269 (409) T ss_pred ccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC-ccc-chhhh---hHHHhhcCCCCCCCCCceE Confidence 678888866 78999999999999999998888777765442 211 111 11211 112233332222223333 Q ss_pred -ecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 404 -RDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELI 481 (688) Q Consensus 404 -~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li 481 (688) ..+..++ +.+...+......+-.+||+++..+|..+ |..||.|+.+....-........+.|..+.+.++++++.+. 
T Consensus 270 ~q~~~~~l-~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~ 348 (409) T protein:vir:94 270 GQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLR 348 (409) T ss_pred EecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3344443 44566666666777778899999999755 55899999877655555556666666777788877766654 Q ss_pred HHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHH Q lcl|NC_020488. 482 PRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGG 561 (688) Q Consensus 482 ~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~ 561 (688) -..-. ..+++..+ ++... ..-+.......+..+++.++.+..|.+.. T Consensus 349 ~~~~~-----------~~~~~~~~---------------------~v~W~-p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~ 395 (409) T protein:vir:94 349 DDAPY-----------LREQFRKT---------------------KPKWE-PLFEADASMLSLIGDGAIKLNQAIPEFIN 395 (409) T ss_pred CCCCc-----------cccccccc---------------------eEEec-cCCCcchHHHHHHHHHHHHHHHhcccccc Confidence 32210 00111100 00000 00012233345666778888775443321 Q ss_pred HHHHHHHHhcCCccHH Q lcl|NC_020488. 562 VVLDLIAKNMDWPGAQ 577 (688) Q Consensus 562 ~~~~~~~e~~~~~~~~ 577 (688) -..+.+++++...+ T Consensus 396 --~~~~~~~lG~~~~d 409 (409) T protein:vir:94 396 --KDTIRDLTGIEGGE 409 (409) T ss_pred --hhHHHHHcCCCCCC Confidence 23566777877666 No 98 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.72 E-value=1.9e-16 Score=106.75 Aligned_cols=463 Identities=11% Similarity=0.039 Sum_probs=210.3 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC----CCHHHHHHHHhcCCCceeehhHHHHHHHHHH Q lcl|NC_020488. 
8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ----WPESVRKEREDEGRPCLTLNKLPQYVDQVLG 83 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q----w~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g 83 (688) |......+..+++.++...+.. .+....+-.+||+|+| ++.......+ .-.++.|..+-+|++.+. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~-------~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~---~~~~~~n~~~~ivd~~a~ 70 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFEN-------KQNELKSSKAYYDAERRPDAIGLAVPLDMR---KYLAHVGYPRTYVDAIAE 70 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcccchhhcCcccchhhh---hhhhhcchHHHHHHHHHH Confidence 3333344445566666544432 2233344568999976 2111111111 112567888888887765 Q ss_pred HHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeecc Q lcl|NC_020488. 84 DQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYST 163 (688) Q Consensus 84 ~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~ 163 (688) .+.-+- + +.|.... .......|.+..+. +..++..|+++.....+..+++++|.+|+-|+..... T Consensus 71 ~l~~~G--f-~~~~~~~--------~~~~~~~d~~~~~~----l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~ 135 (488) T protein:vir:23 71 RQELEG--F-RIPSANG--------EEPESGGENDPASE----LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPE 135 (488) T ss_pred hhhccc--e-eccCCcc--------cccccccchhHHHH----HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcc Confidence 443221 1 1111100 00000123333333 4556788999999999999999999999888654322 Q ss_pred CC--CCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEE Q lcl|NC_020488. 164 DD--AFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGV 239 (688) Q Consensus 164 ~~--~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v 239 (688) .+ ..++.+++.. .+|.++ +|||.... ...+.+++-.. 
+...+ T Consensus 136 ~~~~~~~~~~~i~~-~~p~~~~~~~d~~~~~------~~~~~~~~~~~---------------------------~~~~~ 181 (488) T protein:vir:23 136 VDFDVDPEVPLIRV-EPPTALYAEVDPRTRK------VLYAIRAIYGA---------------------------DGNEI 181 (488) T ss_pred cccCCCCCcceEEE-eccceeEEEEecCCCc------eEEEEEEEEec---------------------------CCCcE Confidence 11 2234455544 478765 47764321 22222222100 01112 Q ss_pred EEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEE Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAP 319 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp 319 (688) ...++|..... +.+...+| .-.+++..|.+.+.+|+|| T Consensus 182 ~~~~~y~~~~~--~~~~~~~~----------------------------------------~~~~~~~~~h~~g~vPvv~ 219 (488) T protein:vir:23 182 VSATLYLPDTT--MTWLRAEG----------------------------------------EWEAPTSTPHGLEMVPVIP 219 (488) T ss_pred EEEEEEecCcE--EEEEecCC----------------------------------------ceEeccccccCCCCcceEE Confidence 22233322110 00000111 1112334455667788888 Q ss_pred EeeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchH---H-HHhhcccCCCceeecC Q lcl|NC_020488. 320 VLGKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYE---E-EWNQANRKNQSVLRYN 394 (688) Q Consensus 320 ~~~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~---~-~~~~~~~~~~~~~~~~ 394 (688) |... +..+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++.....+... . ...-.....+.++... T Consensus 220 f~n~--~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 297 (488) T protein:vir:23 220 ISNR--TRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFE 297 (488) T ss_pred eccc--cccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCC Confidence 7543 3456678888774 689999999999999999888776665443211111100 0 0000011122333322 Q ss_pred cccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
395 AIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRV 473 (688) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~ 473 (688) .+. ...+...+..+ ...+...+......+-.+||+++..+|..+ |..||.|+......-..........|..+++++ T Consensus 298 ~g~-~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~ 375 (488) T protein:vir:23 298 GGE-GAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQA 375 (488) T ss_pred CCC-CceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 11222222222 344555555555555567899999998654 557999998887777777777777777777777 Q ss_pred HHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH Q lcl|NC_020488. 474 GQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV 553 (688) Q Consensus 474 ~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~ 553 (688) .++++.+.-. .....++. ++.+.=.+.......+..+.+.++. T Consensus 376 ~~l~~~~~~~------------~~~~~~~~-------------------------~i~v~f~~~~~~s~~~~ada~~kl~ 418 (488) T protein:vir:23 376 MRLAYKMVKG------------GDIPTEYY-------------------------RMETVWRDPSTPTYAAKADAAAKLF 418 (488) T ss_pred HHHHHHHhcC------------CCcchhhc-------------------------cceEEecCCCCCCHHHHHHHHHHHH Confidence 7766643211 00001110 1111111111222333444444444 Q ss_pred HhhHHHHHHHHHHHHHhcCCcc-HHHHHHHHHhhccccccchhhH--------H--hhhhhhhhhhHHHHHH Q lcl|NC_020488. 554 QAVPAAGGVVLDLIAKNMDWPG-AQDIARRLQKTLPPGILDQDEM--------E--EAGIEPPQPSPEQQAN 614 (688) Q Consensus 554 q~~~~~~~~~~~~~~e~~~~~~-~~ei~~~~~~~~~~~~~~~~~~--------~--~~~~~~~~~~~~~q~~ 614 (688) +... ..+....+.+++++-. ..+-.+++++....+....... . 
...+....+.++..++ T Consensus 419 ~~g~--~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 419 ANGA--GLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred hccc--ccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 3211 0122234455554321 1111111111100000000000 0 0000000000000000 No 99 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.72 E-value=1.3e-15 Score=102.14 Aligned_cols=513 Identities=11% Similarity=0.041 Sum_probs=226.3 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHH----------HHHhcCCC--c Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRK----------EREDEGRP--C 68 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~----------~~~~~g~p--~ 68 (688) |-|+- .....+.+-..+...+.... .+..+....+..+||.|+| +-..+ ..+...+| . T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~YY~g~h--~Il~r~~~~~~~~~~~~~d~~~~nnk 70 (537) T protein:vir:78 1 MTSPL------LNKPIDQLGGLLNTEITTYM--ASNHIKWAHIGENYYNQEN--DIEKSRIFYMNDKGQLREDNYASNVK 70 (537) T ss_pred CCccc------ccccHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhcccc--hhhhcccccccccccccccccccccc Confidence 32221 11111112122222222211 1233556677789999985 11111 11122233 5 Q ss_pred eeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAV 148 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~ 148 (688) +++|..+.+|+..+|++..+.+.+..- + ..+.+.. ..++... .++++........++. 
T Consensus 71 i~~nf~k~Ivd~~~~yl~G~Pv~~~~~--d---------------~~~~e~~----~~l~~~~-~~~~~~~~~el~~~~s 128 (537) T protein:vir:78 71 ISHGFFTELVDQLAQYLLSNGVEVKVK--D---------------EDNTQLD----EILQEYF-DEDFQATIDTLVTNAS 128 (537) T ss_pred cccchHHHHHHHHhhhhcccCceeecC--c---------------chhHHHH----HHHHHHh-hccHHHHHHHHHHHHh Confidence 889999999999999999998776531 1 1123333 3344333 3678888888999999 Q ss_pred HcCCceEEEEEeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccc Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAE 226 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 226 (688) ++|.+|..++++. ++.+.+..+ +|..+ +||.. . +...+++.......+.. T Consensus 129 ~~G~ay~~~y~de------~~~~~~~~i-~p~~~~pv~d~~-~-----~~~~~~~~y~~~~~~~~--------------- 180 (537) T protein:vir:78 129 KKGFEGIFARTTS------EGKLKFQTV-DGLTLIPVFDDY-G-----VLKMIIRWYSEIRYSTK--------------- 180 (537) T ss_pred hcCeeEEEeeecC------CCceEEEEE-ccceeEEEEcCC-C-----CceeEEEEEeeeecccc--------------- Confidence 9999998887652 246677666 68775 45542 1 22222222221110000 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) ..+...+..+|+|.......+.. ..+............. +. ....+...........+...... T Consensus 181 ------~~~~~~~~~~evyt~~~i~~y~~--~~~~~~~~~~~~~~~~-----~~---~i~~~~~~~~~~~~~~~~~~~~~ 244 (537) T protein:vir:78 181 ------QQSTETIWHADVWNEEAVCYYIQ--DDEGVSTTYKLDEAYN-----PN---PAPHVLAIEESTDADFEDTDGYQ 244 (537) T ss_pred ------ccCcceEEEEEEEcCCcEEEEEe--cCCccccccccccccc-----cc---ccceeeecccccccccccccccc Confidence 01123344456665443332221 1111111000000000 00 00000000000001111222223 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 
307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) +.|.+.+.+|+++|.. ..+|.|.+..++++++.+|.+.|.+.+.+...+++.+++....+++..+..... +. T Consensus 245 ~~~~~~g~iPvv~f~n-------n~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l-~~ 316 (537) T protein:vir:78 245 VLGRSYSKFPFQLLYN-------NKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNI-KA 316 (537) T ss_pred ccccCCcceeEEEecc-------CccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHH-hh Confidence 3444556666665432 345779999999999999999999999999988887776544444433333322 22 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) .+++.+... ...+.++....-..+....++.+.+.|-..|.+.+..... .++.||+|+..+-..........-.-| T Consensus 317 ~~~i~v~~d---~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~-~gn~SGvAlk~~~~~l~~ka~~ke~~f 392 (537) T protein:vir:78 317 KKMIGVNGD---NAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVG-DGNVTNVVIKSRYTLLAMKARKMETSL 392 (537) T ss_pred cCceeecCC---CCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccc-ccCCcHHHHHHHHhhHHHHHHHHHHHH Confidence 334433221 2235666666566778888999999998887555543332 235799999988766666666666666 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..+++++.++++.++.... . +..++ .+|.+.-.+..+.-..+.. 
T Consensus 393 ~~~l~~~~~~i~~~~~~~~----------~-~~~d~-------------------------~~i~i~f~~~~P~n~~e~a 436 (537) T protein:vir:78 393 RKVLRWCADMVVSDIALRG----------L-GEYDS-------------------------NDICFEIEPHVLANELDIA 436 (537) T ss_pred HHHHHHHHHHHHHHHhhcC----------C-ccccc-------------------------ceeeEEeccCCCCCHHHHH Confidence 7777776666666553211 0 00011 1122222223332222223 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhH-HhhhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEM-EEAGIEPPQPSPEQQANMAQAQADMEKA 625 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~q~~~~~~q~~~~~~ 625 (688) +.++.+.+ .+-+....++.++++-.-.+..+.+.+............ .+++.+.....+..+ ...... T Consensus 437 ~~~~~l~~----~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~ 505 (537) T protein:vir:78 437 TTRKTEAE----TEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQ-------AMLDGL 505 (537) T ss_pred HHHHHHHh----cCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchh-------hhcCCC Confidence 33332221 112223333444333211121111111100000000000 000000000000000 000000 Q ss_pred HHHHHHHHHHHHHHHHH---------HHHHHH Q lcl|NC_020488. 626 KADTAKAQADMAMAQAK---------TAEAQA 648 (688) Q Consensus 626 q~e~~~~q~e~~~~q~~---------~~~~~a 648 (688) ......-..+....... -..-+. T Consensus 506 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 506 PVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred CCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 00000000000000000 000000 No 100 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.72 E-value=1.2e-16 Score=107.85 Aligned_cols=471 Identities=9% Similarity=0.001 Sum_probs=244.2 Q ss_pred HHHHHHHHHHHHHHHHh-----------------hhHHHHHHHHHHHhhCCCCCCHHHHHH-HHhcCCCceeehhHHHHH Q lcl|NC_020488. 
17 EAILQEIRERAAHAVTC-----------------WKHNFDAAQEDISFLAGEQWPESVRKE-REDEGRPCLTLNKLPQYV 78 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~-----------------~~~~r~~~~~~~~~~~G~Qw~~~~~~~-~~~~g~p~~~~N~i~~~i 78 (688) -.++++++.+|++-... ..+.+....++.++|.|++|.=..... -..+.+..++.|+-+.++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 34666666666543211 233444555677899998764221111 122345567889988888 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) ....+...+-.+.+.|...... +.+-...+..++.++-+++.|++......++++++..|-|+++++ T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~-------------~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~ 147 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDE-------------EKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPY 147 (517) T ss_pred HHhhhhhcCCcceEEecccccc-------------cccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEE Confidence 8888777777788877532211 011122233555666667789999999999999999999999999 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCcccc-cccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDATE-PDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~-~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) ++ .+.+.+..| ++..||+ -+.. .+...|-+++..... .. .. .. 
T Consensus 148 ~d-------~~~~~I~~v-~ad~~~P--l~~~~~~v~~~ai~~~~~~~-~~--------~~-----------------~~ 191 (517) T protein:vir:98 148 VD-------NGEIEFSWA-LANAFYP--LRSNSNGISEGVMKSVTTKV-IG--------NK-----------------TV 191 (517) T ss_pred Ee-------CCeeEEEEE-cCCeeEE--EEecCCCeEEEEEEEEEEEe-ec--------CC-----------------ce Confidence 87 245667776 6777773 2111 112222222222111 00 00 00 Q ss_pred EEEEEEEEeeee---------cceeeeeccC----CceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhc Q lcl|NC_020488. 238 GVRVSEYFYREP---------VTRKLLLLSD----GRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVL 304 (688) Q Consensus 238 ~v~v~e~~~~~~---------~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~il 304 (688) ..+..|+..+.. ...+++...+ |..+.- .++.. . | T Consensus 192 ~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L---~~~~e----------------------------~-l 239 (517) T protein:vir:98 192 YYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPL---EELYE----------------------------G-M 239 (517) T ss_pred EEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccc---ccccc----------------------------C-C Confidence 111222211110 0001111000 111100 00000 0 0 Q ss_pred ccCCCCCCCccceEEEeee---eeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh Q lcl|NC_020488. 305 EGPVDWPGSTIPVAPVLGK---EMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN 381 (688) Q Consensus 305 e~~~p~~~~~~P~vp~~~~---~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 381 (688) .....+.+=..|++.++.. -....++|+|.|++..+++..+.+|...+++.+.+.+ ++.+++++++.+....+.-. T Consensus 240 ~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g 318 (517) T protein:vir:98 240 QEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDESG 318 (517) T ss_pred CcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCCC Confidence 0000011111122211111 1112368999999999999999999999999999887 56688888888732211000 Q ss_pred --hcc---cCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcch-hhHHHHHHHHHHH Q lcl|NC_020488. 
382 --QAN---RKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNE-QSGKAILARQRQG 455 (688) Q Consensus 382 --~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~-~sg~ai~~~~~~~ 455 (688) ... .....+.......+...++...+.=-...+++.++...+.+....|++....|.++.. .||++|....+.. T Consensus 319 ~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~ 398 (517) T protein:vir:98 319 MPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLT 398 (517) T ss_pred cccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHH Confidence 000 0011112122222222333333322245788888999999999999999999977653 5889998888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--cCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEe Q lcl|NC_020488. 456 DRGTFAYIDNLSRAIRRVGQILIELIPRV--YDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVK 533 (688) Q Consensus 456 ~~~~~~~~dn~~~~~~~~~~~~~~li~~~--~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~ 533 (688) -.....+...+..+++++.+.++.+..-+ |.. . ....++|+|+ T Consensus 399 ~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~---------~--------------------------~~~~~~v~v~ 443 (517) T protein:vir:98 399 YRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGG---------E--------------------------IPSAEHIGVD 443 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC---------C--------------------------CCCCcceEEE Confidence 88888888889999999999888766543 210 0 0012455555 Q ss_pred cccCcHHHHHHHHHHHHHHHHhhHHHHHHHHH-HHHHhcCCcc--HHHHHHHHHhhccccccchhhHHhhhhhhhhhh Q lcl|NC_020488. 534 AGPSYQTQRMEAADSLMQFVQAVPAAGGVVLD-LIAKNMDWPG--AQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS 608 (688) Q Consensus 534 ~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~-~~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 608 (688) =+.+....+++.++.++++... +-+... 
.+.++.++.- +++...+++.......+....+.+...-....+ T Consensus 444 f~D~i~~D~~~~~~~~~~~v~a----G~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 444 FDDGVFQDRSALLRFYGQAKTF----GFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred cCCCCCCCHHHHHHHHHHHHhc----CCCCHHHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 5555555666666666655432 111122 2233334321 223333333322222111111111100010111 No 101 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.72 E-value=2.6e-17 Score=111.54 Aligned_cols=466 Identities=9% Similarity=0.027 Sum_probs=206.9 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC----CHHHHHHHHhcCCCceeehhHHHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW----PESVRKEREDEGRPCLTLNKLPQYVDQVLG 83 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw----~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g 83 (688) |.|. ++++..+...+. ..+....+-.+||+|.|= +...-..++ .-.++.|..+-+|+..++ T Consensus 1 ~~t~-----~~~i~~L~~~~~-------~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~---~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTTY-----HEHVERLQGLLA-------RDLPNLLEAEAYRNGTRRLKTIGIGAPPELA---YLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCCH-----HHHHHHHHHHHH-------HHHHHHHHHHHHHhccccccccccccchhHh---hhhhhcchHHHHHHHHHh Confidence 3332 335555544332 234445566799999751 111101111 113678999999998888 Q ss_pred HHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeecc Q lcl|NC_020488. 84 DQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYST 163 (688) Q Consensus 84 ~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~ 163 (688) ++.-+ -.+.+- |.+. ...+..+++.|+++..+..++.++++.|.||..|+-.-.. 
T Consensus 66 ~l~~~---g~~~~~------------------d~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~ 120 (480) T protein:vir:78 66 RLDIE---GFRISE------------------DSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVE 120 (480) T ss_pred hhccC---ceecCC------------------Cchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccc Confidence 76322 111111 2222 2345566788999999999999999999998777632111 Q ss_pred CCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEE Q lcl|NC_020488. 164 DDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRV 241 (688) Q Consensus 164 ~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v 241 (688) +...++.+++..+ +|..+ +|||.... ...+. .+.+.+.+ +...+.. T Consensus 121 ~~d~~g~~~i~~~-~p~~~~~~~D~~~~~----~~~~~-i~~~~~~~--------------------------~~~~~~~ 168 (480) T protein:vir:78 121 SGDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRD--------------------------DVAVPDR 168 (480) T ss_pred cCCCCCeeEEEEE-cccceEEEEcCCCcc----ceEEE-EEEEEeec--------------------------CCCceEE Confidence 1223456666655 67765 57775321 11221 22221110 1112223 Q ss_pred EEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEe Q lcl|NC_020488. 242 SEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVL 321 (688) Q Consensus 242 ~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~ 321 (688) .++|....... +....+... ......++.|.+.+.+|+|||. T Consensus 169 ~~~y~~~~~~~--~~~~~~~~~------------------------------------~~~~~~~~~~~~~g~vPvv~f~ 210 (480) T protein:vir:78 169 ATLYLPDETVP--LRRNGGLND------------------------------------QWVVDGDVIKHGLGVVPVVPLT 210 (480) T ss_pred EEEEeCCeEEE--EEecCCCcc------------------------------------ccccccccccCCCCCcceEEee Confidence 34443221100 000000000 0000112233445677888765 Q ss_pred eeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhh-hcchHH-H-HhhcccCCCceeecCccc Q lcl|NC_020488. 
322 GKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAES-IEGYEE-E-WNQANRKNQSVLRYNAIP 397 (688) Q Consensus 322 ~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~~~-~-~~~~~~~~~~~~~~~~~~ 397 (688) .. +..+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++. |. .+...+ . ........+.++...+ T Consensus 211 n~--~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 285 (480) T protein:vir:78 211 ND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDIYYGRILTLAS-- 285 (480) T ss_pred cc--cccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCccccccccccchhhhhhhhhccCCC-- Confidence 42 3456688888886 5899999999999999999887777665442 22 111100 0 0001111233332221 Q ss_pred ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 398 GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQI 476 (688) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~ 476 (688) +...+...+.. ....+.+.++.....+-.+||+++..+|..+ |..||.|+..+-..-..........|..++++++++ T Consensus 286 ~~~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l 364 (480) T protein:vir:78 286 EAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) T ss_pred CCceEEecCcc-CHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122222222 2344555556666666667899999998654 457999998876555555666666666666666665 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) ++. +.. ..+..++. ++.|.=.+.......+..+.+.++.+.. 
T Consensus 365 ~~~----~~g---------~~~~~~~~-------------------------~i~v~f~~~~~~s~~~~ad~~~kl~~~g 406 (480) T protein:vir:78 365 AMQ----IMG---------REVTEEYT-------------------------RLETVWRDPSTPTVAAKADAVSKLYANG 406 (480) T ss_pred HHH----HcC---------CCccccce-------------------------eeeEEecCCCCCCHHHHHHHHHHHHHhc Confidence 543 322 11111111 1111111111111223444445544421 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhh---hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEA---GIEPPQPSPEQQANMAQAQADMEKAKADTAKAQ 633 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q 633 (688) .. .+....+++.+++- ++-.+++.+............... +.....+.+... .. ..+.+.+-.. T Consensus 407 ~~--~~s~et~~~~lg~~--~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--------~~~~~~~~~~ 473 (480) T protein:vir:78 407 QG--PIPKEQARIDLGYT--ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ET--------KTETQTSPSG 473 (480) T ss_pred cc--cCCHHHHHhcCCCC--HhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCC-CC--------CCccccccCC Confidence 11 11223344555443 222222211100000000000000 000000000000 00 0000000000 Q ss_pred HHHHHHHHH Q lcl|NC_020488. 634 ADMAMAQAK 642 (688) Q Consensus 634 ~e~~~~q~~ 642 (688) .-++++. T Consensus 474 --~~~~~~~ 480 (480) T protein:vir:78 474 --FNRTKTR 480 (480) T ss_pred --CCcccCC Confidence 0000000 No 102 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.71 E-value=1.6e-15 Score=101.68 Aligned_cols=484 Identities=10% Similarity=-0.015 Sum_probs=232.4 Q ss_pred HHHHHHHHHHHHHHHHhh---------h-------HHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH Q lcl|NC_020488. 17 EAILQEIRERAAHAVTCW---------K-------HNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ 80 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~~---------~-------~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~ 80 (688) --|...+++.++.-+... . 
..+.++. .++|.|.+|..--...+ .+-.+..|+-+.+++. T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~w~~~~~~~~---~~~~~~~~l~~~i~~~ 75 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWS--KDSYLTSLWAQGYVPTV---HDKLMNSGTGNEIVVV 75 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhh--hhhhhhhhcccCCCCcc---ccccccCChHHHHHHH Confidence 223333333333222111 1 1111111 23455667743211111 1123455667777888 Q ss_pred HHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEe Q lcl|NC_020488. 81 VLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTK 160 (688) Q Consensus 81 i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~ 160 (688) .......-.+.+.|..-+.. -.+.+++.+..+++.|++.......+++++..|.||++++++ T Consensus 76 ~A~ll~~e~~~i~v~~~~~~------------------d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d 137 (518) T protein:vir:78 76 AAEYISGKPLSIDVTGVNGS------------------KDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL 137 (518) T ss_pred HHHhhcCCCceEEecCcccc------------------CcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE Confidence 88888888888877532211 124566677778888999999999999999999999999875 Q ss_pred eccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEE Q lcl|NC_020488. 161 YSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVR 240 (688) Q Consensus 161 ~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~ 240 (688) .+.+.+..| ++..|++.. ..-++..+ +|......-+. ...|-- .+.-......+..|... 
..+ T Consensus 138 -------~~~~~i~~v-~ad~~~P~~--~~g~~~~~--~f~~~~~~~~k-~~~y~~---lE~he~~~~~~~~~~~~-~~~ 200 (518) T protein:vir:78 138 -------NGRPSISVH-SSSQFWIDF--KNNEPFRF--NFFEEIPTSNK-ADIYYL---VESREIKQWDKEGKKLS-GGF 200 (518) T ss_pred -------CCeeEEEEE-cCCeeEEEe--ecCcEEEE--EEEEEeecCCc-ceeEEE---EEeeccccccceeeccc-cee Confidence 245777777 677777542 22233333 33221110000 000000 00000000000000000 011 Q ss_pred EEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEE Q lcl|NC_020488. 241 VSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPV 320 (688) Q Consensus 241 v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~ 320 (688) +....|+ ...+..+......... . .........+.+...-..+...||+++ T Consensus 201 I~n~ly~---------~~~~~~v~~~~~~~~~-~-------------------l~~~~~~~~~~e~~~~~tg~~~~~~~~ 251 (518) T protein:vir:78 201 VTYSVIK---------IDGDKTTPISAERLPE-Q-------------------ITSYLHTNDIQLNHSVSIGLKSMGAYL 251 (518) T ss_pred EEEEEee---------ecCccccccccccccc-c-------------------cccccccccCccceeeccCCccceEEe Confidence 1111111 0001111000000000 0 000000001111111112234566665 Q ss_pred eeee---eccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhh--cccCC---Cceee Q lcl|NC_020488. 321 LGKE---MVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQ--ANRKN---QSVLR 392 (688) Q Consensus 321 ~~~~---~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~--~~~~~---~~~~~ 392 (688) ++.. ....++|+|.|++..+++.++.+|...|++.+.+.. +..++.++++.+.....-... ...++ ..+.. 
T Consensus 252 ~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~ 330 (518) T protein:vir:78 252 INNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQ 330 (518) T ss_pred eccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEE Confidence 4432 112468899999999999999999999999999976 778888988877421110000 00011 11111 Q ss_pred cCccc--c---cccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 393 YNAIP--G---VDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLS 467 (688) Q Consensus 393 ~~~~~--~---~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~ 467 (688) .++.. + .+.++...+.=-...+...++.....+..-.|++...+|..+...||++|....+..-..+......+. T Consensus 331 i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e 410 (518) T protein:vir:78 331 FKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQ 410 (518) T ss_pred ecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 1 112333333222355777788888888888999999999776678999999988887777788888888 Q ss_pred HHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHH Q lcl|NC_020488. 468 RAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAAD 547 (688) Q Consensus 468 ~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~ 547 (688) .+++++-+.++.+..-++.... 
+......++|+|+=+.+......+..+ T Consensus 411 ~al~~l~~~i~~l~~~~~~~~~-------------------------------~~~~~~~~~v~i~f~D~i~~D~~~~~~ 459 (518) T protein:vir:78 411 NVYEQMLWDFLYLLTGGTNNKE-------------------------------KAIMRDEIRVIIEFPDPMSVNLNELSS 459 (518) T ss_pred HHHHHHHHHHHHHHHhhcCccc-------------------------------cccCCCceeEEEEeCCCCCCCHHHHHH Confidence 8888888887777655432100 000112345555555555556666666 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHhc--CCc--cHHHHHHHHHhhcccccc-chhhHHhhhhhhhhh Q lcl|NC_020488. 548 SLMQFVQAVPAAGGVVLDLIAKNM--DWP--GAQDIARRLQKTLPPGIL-DQDEMEEAGIEPPQP 607 (688) Q Consensus 548 ~l~~~~q~~~~~~~~~~~~~~e~~--~~~--~~~ei~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 607 (688) .++++... +-+.....+++. ++. .+++..+++++....... .+....... ..+- T Consensus 460 ~~~~~v~a----GimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~--~~~g 518 (518) T protein:vir:78 460 TLNNMNSA----LAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGME--TKGG 518 (518) T ss_pred HHHHHHhc----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCC--CCCC Confidence 55544331 111122223322 221 122233333332221111 111111111 1111 No 103 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.70 E-value=1.1e-15 Score=102.51 Aligned_cols=454 Identities=12% Similarity=0.060 Sum_probs=238.0 Q ss_pred HHHHHHHHHHHHHHHHh-----------------hhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCCceeehhHHHHH Q lcl|NC_020488. 17 EAILQEIRERAAHAVTC-----------------WKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRPCLTLNKLPQYV 78 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~-----------------~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~i 78 (688) -.+.++++..|+..... 
..+.+....++.+||.|+.+.-.....- ....+...+.|+-+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 34566666666542211 2344556677889999974422110000 01234456779999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +...+....-.+.+.+. |.. +++.++.+++.|++......+++.++..|.|+++++ T Consensus 81 ~~~A~lv~~e~~~i~~~--------------------d~~----~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~ 136 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVD--------------------DDA----ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPY 136 (500) T ss_pred HHHhhhhcCCcceEecC--------------------ChH----HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Confidence 99888888777777661 333 444555666789999999999999999999999999 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCccc-ccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDAT-EPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~-~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) ++. +.+.+..| ++..|++ -.. .-+...+-++++.... . . .... 
T Consensus 137 ~d~-------~~~~I~~v-~ad~~~P--~~~d~~~~~~~a~~~~~~~~-~------------------~-------~~~~ 180 (500) T protein:vir:30 137 VDG-------DKVRVAFV-QAPVFLP--LQSNTQDVSSAAVVIKSVKT-I------------------N-------GKEV 180 (500) T ss_pred EeC-------CceEEEEE-cCCeeEE--EEEcCCCeEEEEEEEEEeee-e------------------c-------CCce Confidence 862 34667766 6777763 111 1112222222211100 0 0 0111 Q ss_pred EEEEEEEEeeeeccee-----eeeccC----CceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 238 GVRVSEYFYREPVTRK-----LLLLSD----GRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~-----~~~~~~----g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) ..+..|+++++....+ ++...+ |..+... .+ | .-|+... T Consensus 181 ~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~-------------------------~~-~------~~l~~~~ 228 (500) T protein:vir:30 181 YYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLS-------------------------EV-Y------KDLKDEA 228 (500) T ss_pred EEEEEEEEEEeCCceeEEEEEEEecccccccCcccccc-------------------------cc-c------CCcCcce Confidence 2344555544322111 111100 1111000 00 0 0011110 Q ss_pred CCC-CCccceEEE---eeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhc- Q lcl|NC_020488. 309 DWP-GSTIPVAPV---LGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQA- 383 (688) Q Consensus 309 p~~-~~~~P~vp~---~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~- 383 (688) .+. ....||+.+ ... ....++|+|.|++..+++..+.+|...|++.+.+.. ...++.++++.+....+-.... T Consensus 229 ~~~~~~~p~f~~~~~~~~N-~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~ 306 (500) T protein:vir:30 229 KVTDVTRPIFTYLKTPGMN-NKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDV 306 (500) T ss_pred EeccCCCccEEEecCCccc-cccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccc Confidence 000 111122211 111 123478999999999999999999999999999976 6678888887764221110000 Q ss_pred ---ccC---CCceeecCcc-cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHH Q lcl|NC_020488. 
384 ---NRK---NQSVLRYNAI-PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQG 455 (688) Q Consensus 384 ---~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~ 455 (688) ... ...+...+.. .+...++...+.=-...+...++.....+....|++....|.+++ ..||++|....+.. T Consensus 307 ~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~ 386 (500) T protein:vir:30 307 VPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDT 386 (500) T ss_pred cCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHH Confidence 000 1112222221 122334444332224557778888888888889999999997654 35899999888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEe Q lcl|NC_020488. 456 DRGTFAYIDNLSRAIRRVGQILIELIPR--VYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVK 533 (688) Q Consensus 456 ~~~~~~~~dn~~~~~~~~~~~~~~li~~--~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~ 533 (688) -.....+...+..+++++.+.++.+..- ++.. .....++|+|+ T Consensus 387 ~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~-----------------------------------~~~~~~~v~v~ 431 (500) T protein:vir:30 387 YQMRNSIVALVEQSLKELVISIFEIAKAYDLYQS-----------------------------------EVPSMDNISIS 431 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-----------------------------------CCCCCcceEEE Confidence 8888888888999999998888876543 2210 00012344455 Q ss_pred cccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHH-HHHhcCCcc--HHHHHHHHHhhccccccchhhHHhhhhhhhhhh Q lcl|NC_020488. 534 AGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDL-IAKNMDWPG--AQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS 608 (688) Q Consensus 534 ~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~-~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 608 (688) =+.+.....++.++.++++... +-+.... +.++-++.. +++..++++............ ... 
.--+ T Consensus 432 f~d~i~~d~~~~~~~~~~~v~a----Gi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~--~~~---~~g~ 500 (500) T protein:vir:30 432 LDDGVFTDRDAELDYWIKVVNA----GFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRT--DTH---LYGE 500 (500) T ss_pred eCCCCCCCHHHHHHHHHHHHHc----CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCc--ccc---ccCC Confidence 4444455555555555555432 1111111 233333322 222333333221111100000 000 0000 No 104 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.70 E-value=1.1e-15 Score=102.51 Aligned_cols=454 Identities=12% Similarity=0.060 Sum_probs=238.0 Q ss_pred HHHHHHHHHHHHHHHHh-----------------hhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCCceeehhHHHHH Q lcl|NC_020488. 17 EAILQEIRERAAHAVTC-----------------WKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRPCLTLNKLPQYV 78 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~-----------------~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~i 78 (688) -.+.++++..|+..... ..+.+....++.+||.|+.+.-.....- ....+...+.|+-+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 34566666666542211 2344556677889999974422110000 01234456779999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +...+....-.+.+.+. |.. 
+++.++.+++.|++......+++.++..|.|+++++ T Consensus 81 ~~~A~lv~~e~~~i~~~--------------------d~~----~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~ 136 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVD--------------------DDA----ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPY 136 (500) T ss_pred HHHhhhhcCCcceEecC--------------------ChH----HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Confidence 99888888777777661 333 444555666789999999999999999999999999 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCccc-ccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDAT-EPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~-~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) ++. +.+.+..| ++..|++ -.. .-+...+-++++.... . . .... T Consensus 137 ~d~-------~~~~I~~v-~ad~~~P--~~~d~~~~~~~a~~~~~~~~-~------------------~-------~~~~ 180 (500) T protein:vir:98 137 VDG-------DKVRVAFV-QAPVFLP--LQSNTQDVSSAAVVIKSVKT-I------------------N-------GKEV 180 (500) T ss_pred EeC-------CceEEEEE-cCCeeEE--EEEcCCCeEEEEEEEEEeee-e------------------c-------CCce Confidence 862 34667766 6777763 111 1112222222211100 0 0 0111 Q ss_pred EEEEEEEEeeeeccee-----eeeccC----CceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 238 GVRVSEYFYREPVTRK-----LLLLSD----GRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~-----~~~~~~----g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) ..+..|+++++....+ ++...+ |..+... .+ | .-|+... T Consensus 181 ~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~-------------------------~~-~------~~l~~~~ 228 (500) T protein:vir:98 181 YYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLS-------------------------EV-Y------KDLKDEA 228 (500) T ss_pred EEEEEEEEEEeCCceeEEEEEEEecccccccCcccccc-------------------------cc-c------CCcCcce Confidence 2344555544322111 111100 1111000 00 0 0011110 Q ss_pred CCC-CCccceEEE---eeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhc- Q lcl|NC_020488. 
309 DWP-GSTIPVAPV---LGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQA- 383 (688) Q Consensus 309 p~~-~~~~P~vp~---~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~- 383 (688) .+. ....||+.+ ... ....++|+|.|++..+++..+.+|...|++.+.+.. ...++.++++.+....+-.... T Consensus 229 ~~~~~~~p~f~~~~~~~~N-~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~ 306 (500) T protein:vir:98 229 KVTDVTRPIFTYLKTPGMN-NKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDV 306 (500) T ss_pred EeccCCCccEEEecCCccc-cccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccc Confidence 000 111122211 111 123478999999999999999999999999999976 6678888887764221110000 Q ss_pred ---ccC---CCceeecCcc-cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHH Q lcl|NC_020488. 384 ---NRK---NQSVLRYNAI-PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQG 455 (688) Q Consensus 384 ---~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~ 455 (688) ... ...+...+.. .+...++...+.=-...+...++.....+....|++....|.+++ ..||++|....+.. T Consensus 307 ~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~ 386 (500) T protein:vir:98 307 VPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDT 386 (500) T ss_pred cCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHH Confidence 000 1112222221 122334444332224557778888888888889999999997654 35899999888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEe Q lcl|NC_020488. 456 DRGTFAYIDNLSRAIRRVGQILIELIPR--VYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVK 533 (688) Q Consensus 456 ~~~~~~~~dn~~~~~~~~~~~~~~li~~--~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~ 533 (688) -.....+...+..+++++.+.++.+..- ++.. 
.....++|+|+ T Consensus 387 ~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~-----------------------------------~~~~~~~v~v~ 431 (500) T protein:vir:98 387 YQMRNSIVALVEQSLKELVISIFEIAKAYDLYQS-----------------------------------EVPSMDNISIS 431 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-----------------------------------CCCCCcceEEE Confidence 8888888888999999998888876543 2210 00012344455 Q ss_pred cccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHH-HHHhcCCcc--HHHHHHHHHhhccccccchhhHHhhhhhhhhhh Q lcl|NC_020488. 534 AGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDL-IAKNMDWPG--AQDIARRLQKTLPPGILDQDEMEEAGIEPPQPS 608 (688) Q Consensus 534 ~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~-~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 608 (688) =+.+.....++.++.++++... +-+.... +.++-++.. +++..++++............ ... .--+ T Consensus 432 f~d~i~~d~~~~~~~~~~~v~a----Gi~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~--~~~---~~g~ 500 (500) T protein:vir:98 432 LDDGVFTDRDAELDYWIKVVNA----GFGTREMAIQKVLNVTEEKAQEIAAEINTGIVDEINQQRT--DTH---LYGE 500 (500) T ss_pred eCCCCCCCHHHHHHHHHHHHHc----CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccCCCCCc--ccc---ccCC Confidence 4444455555555555555432 1111111 233333322 222333333221111100000 000 0000 No 105 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.69 E-value=8.2e-16 Score=103.30 Aligned_cols=458 Identities=14% Similarity=0.050 Sum_probs=210.9 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCC----CCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQ----WPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Q----w~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|+-+ +.+...+++..+...+.. 
-+..-.+-.+||+|++ ++...-..++ .-..+.|..+- T Consensus 4 ~~~~~~-----e~~~~~~~~~~l~~~~~~-------~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~---~~~~v~n~~~~ 68 (486) T protein:vir:42 4 PLPGME-----EIEDPAVVREEMISAFED-------ASKDLASNTSYYDAERRPEAIGVTVPREMQ---QLLAHVGYPRL 68 (486) T ss_pred CCCCCC-----CcccHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcchhcccccchhHh---hhhhccchHHH Confidence 455544 555555677776665533 1223333457999986 1111101111 11346788888 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +|+..++.+.-+ -+ ..|- +.. ....+..++..|+++.....+..++++.|.+|+- T Consensus 69 iVd~~~~~l~~~--g~-~~~~------------------~~~----~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~ 123 (486) T protein:vir:42 69 YVDSVAERQAVE--GF-RLGD------------------ADE----ADEELWQWWQANNLDIEAPLGYTDAYVHGRSFIT 123 (486) T ss_pred HHHHHHhhhccc--ce-ecCC------------------Cch----hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 888888766322 11 1110 111 1233455667899999999999999999999988 Q ss_pred EEEeeccCC--CCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccccccc Q lcl|NC_020488. 157 VLTKYSTDD--AFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSW 232 (688) Q Consensus 157 v~~~~~~~~--~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~ 232 (688) |+.+..... ..++.+++..+ +|.++ +|||.... ..++.+.+-+. 
T Consensus 124 v~~~e~~~~~~~~~~~~~i~~~-~p~~~~~i~d~~~~~------~~~~~~~~~~~------------------------- 171 (486) T protein:vir:42 124 ISKPDPQLDLGWDQNVPIIRVE-PPTRMHAEIDPRINR------VSKAIRVAYDK------------------------- 171 (486) T ss_pred EecCCcccccccCCCeeEEEEe-cccceEEEEeCCCCC------eEEEEEEEEec------------------------- Confidence 876532221 12445555544 77765 57774321 22222222100 Q ss_pred CCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCC Q lcl|NC_020488. 233 WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPG 312 (688) Q Consensus 233 ~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~ 312 (688) +...+...++|.... ...+...+|.+ ..+++.|.+. T Consensus 172 --~~~~~~~~~~y~~~~--~~~~~~~~~~~----------------------------------------~~~~~~~h~~ 207 (486) T protein:vir:42 172 --EGNEIQAATLYTPME--TIGWFRADGEW----------------------------------------AEWFNVPHGL 207 (486) T ss_pred --CCCeEEEEEEEcCCc--EEEEEecCCcE----------------------------------------EeecceecCC Confidence 112233444443221 11111111111 1122334445 Q ss_pred CccceEEEeeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeech---hhhcchHHH-HhhcccCC Q lcl|NC_020488. 313 STIPVAPVLGKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPA---ESIEGYEEE-WNQANRKN 387 (688) Q Consensus 313 ~~~P~vp~~~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~---~~i~~~~~~-~~~~~~~~ 387 (688) +.+|+|||... ...+.++|.|-+. .++++++.+|+.+|.+.......+.+...+.. ..+...++. ........ T Consensus 208 g~vPvv~~~n~--~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~ 285 (486) T protein:vir:42 208 GVVPVVPLPNR--TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYL 285 (486) T ss_pred CCceEEEeccc--cccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhh Confidence 67888876542 2356678888886 58999999999999998888777666554421 111100000 00011112 Q ss_pred CceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
388 QSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) +.++.... +...+...+.. -...+...+......+-.++++++..+|..+ |..||.|+......-..........| T Consensus 286 ~~~~~~~~--~~~~~~q~~~~-~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f 362 (486) T protein:vir:42 286 ARILAFED--AEGKIQQFSAA-ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMF 362 (486) T ss_pred chhcccCC--CCceEEeeccc-CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 23333221 11122222222 2233444444444444455788888888654 55799999988777777777777777 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ..++++++++++.+.-. .....++ .++.|.=.+.......+.. T Consensus 363 ~~~l~~~~~l~~~~~~~------------~~~~~d~-------------------------~~i~v~w~~~~~~s~~~~a 405 (486) T protein:vir:42 363 GGAWEEAMRIAYRIMKG------------GDVPPDM-------------------------LRMETVWRDPSTPTYAAKA 405 (486) T ss_pred HHHHHHHHHHHHHHhcC------------CCccccc-------------------------eeeeEEecCCCCCCHHHHH Confidence 77887777766553210 0000011 1111111112222233344 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCCcc-HHHHHHHHHhhccccccch----------hhHHhhhhhhhhhhHHHHHHH Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNMDWPG-AQDIARRLQKTLPPGILDQ----------DEMEEAGIEPPQPSPEQQANM 615 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~~~-~~ei~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~q~~~ 615 (688) +.+.++.+.... -+....+++++++-. ..+-.+++++......... .+.......++..++...+.. 
T Consensus 406 d~~~kl~~~~~g--~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (486) T protein:vir:42 406 DAATKLYGNGQG--VIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSG 483 (486) T ss_pred HHHHHHHhcccC--CCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCC Confidence 555555432111 111233344444421 1111111111100000000 000000000000000000000 Q ss_pred HHH Q lcl|NC_020488. 616 AQA 618 (688) Q Consensus 616 ~~~ 618 (688) ... T Consensus 484 ~~~ 486 (486) T protein:vir:42 484 GDA 486 (486) T ss_pred CCC Confidence 000 No 106 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.69 E-value=1.5e-15 Score=101.87 Aligned_cols=397 Identities=14% Similarity=0.072 Sum_probs=208.2 Q ss_pred hhHHHHHHHHHHHhhCCCCCC----HHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccc Q lcl|NC_020488. 34 WKHNFDAAQEDISFLAGEQWP----ESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVP 109 (688) Q Consensus 34 ~~~~r~~~~~~~~~~~G~Qw~----~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~ 109 (688) ..-.+..-..-.+||.|+|=- ...-..++..-+ ++.|..+-+|+...+-..=+- |+ T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~~Vds~a~rl~~~G----f~-------------- 60 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQ--AVLGWAAKGVDSLADRLIFRA----FA-------------- 60 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHH--hhcchhHHHHHHhHhhhcccc----cc-------------- Confidence 222334445557899998632 222223332223 456999888888765332111 10 Q ss_pred cccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceE--EeCCc Q lcl|NC_020488. 110 NVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPD 187 (688) Q Consensus 110 ~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~ 187 (688) -.|.. +..+++.|+++.....+..++++.|.+++-|+-+ .++.+++..+ +|..+ +|||. 
T Consensus 61 ----~~d~~--------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~------~d~~~~i~~~-sP~~~~~i~Dp~ 121 (410) T protein:vir:95 61 ----NDDFN--------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKG------EDDEVRLQVI-ESSNATGVIDPI 121 (410) T ss_pred ----CCCch--------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecC------CCCceEEEEE-cccceEEEEeCC Confidence 01221 4556788999999999999999999998887532 1345666544 67664 57874 Q ss_pred ccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccc Q lcl|NC_020488. 188 ATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDE 267 (688) Q Consensus 188 a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~ 267 (688) .+. ..+..+.+-. + +........+|.. +.+++.. T Consensus 122 ~~~------~~~al~~~~~--------------------------~-~~~~~~~~~~~~~------------~~~~~~~- 155 (410) T protein:vir:95 122 TGL------LVEGYAVLAR--------------------------D-DYNRPTLEAYFEP------------NATHFIP- 155 (410) T ss_pred CCc------eEEEEEEEEe--------------------------c-CCCeEEEEEEEeC------------CcEEEEe- Confidence 322 1122211100 0 0011112222221 1111100 Q ss_pred cchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccch-HHHhhHHHHH Q lcl|NC_020488. 268 VKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGL-IRFGKDAQRM 346 (688) Q Consensus 268 ~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~-v~~~~d~Q~~ 346 (688) ..+... . .|-+.+..|+|||+.. +..+.++|.|- .+.++++|+. T Consensus 156 ------------------------------~~~~~~-~--~~~~~g~vPvV~f~n~--~~l~~~~G~s~I~~~v~~l~da 200 (410) T protein:vir:95 156 ------------------------------KDGEPY-S--VTNETGIPLLVPVIHR--PDAVRPFGRSRITRAGMYYQKY 200 (410) T ss_pred ------------------------------eCCccc-c--ccCCCCCcceEEeccc--ccCCccCCccccchhHHHHHHH Confidence 000000 0 1223467788887643 34567888884 4889999999 Q ss_pred HHHHHHHHHHHHHhcCCCceeechhhhc--chHHHHhhcccCCCceeecCccccccccee--cCCCcchHHHHHHHHHHH Q lcl|NC_020488. 
347 HNYWMTAATERVALAPKAPWVAPAESIE--GYEEEWNQANRKNQSVLRYNAIPGVDRPQR--DMPASMPAAELQLALSAT 422 (688) Q Consensus 347 ~N~~~s~~~~~~~~~~~~~~~~~~~~i~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ll~~~~ 422 (688) +|+.++.+.-.....+.++..+- |.-. ...+.|. ...+.++.......+..+.+ .+..++ +.+...+.... T Consensus 201 ~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~---~~~~~i~~~~~~~~~~~~~v~q~~~~~l-~~~~~~l~~l~ 275 (410) T protein:vir:95 201 AKRTLERADITAEFYSWPQKYIL-GLDPDAEPMEKWK---ATVSSLLTISSSDKGVKPSVGQFTTASM-SPFTEQLRTAA 275 (410) T ss_pred HHHHHHHHHHHHHHhcchhheee-ccCCCCCcCchhh---hhhhhheeccCCCCCCcceEEecCCCCh-HHHHHHHHHHH Confidence 99999999998888777765542 2110 0111121 11223444332222233333 444444 34566677777 Q ss_pred HHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ceEEEEeccCCCc Q lcl|NC_020488. 423 DEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDS-DRVLRLRFQDGEG 500 (688) Q Consensus 423 ~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~-~r~~ri~~~~~~~ 500 (688) ..+-.+||++...+|..+ |..||.||.+....-........+.|..+.+.++++.+.+.-.+=.. ....++ .. T Consensus 276 ~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~-----~v 350 (410) T protein:vir:95 276 AGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRT-----AV 350 (410) T ss_pred HHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccccee-----eE Confidence 777778899999999655 45799999877766656666677777778888888777654322100 000000 00 Q ss_pred ceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHH Q lcl|NC_020488. 501 DWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIA 580 (688) Q Consensus 501 ~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~ 580 (688) .|-.+ . .+.+.| ..+..+++.++.+..|.+. .-..+.+++++-..+ +. 
T Consensus 351 ~W~p~---------------~-------------d~~~~s-~a~~aDa~~Kl~~a~~g~~--~~~~~~~~lg~~~~~-~~ 398 (410) T protein:vir:95 351 KWEPL---------------F-------------EADANT-MTMIGDGVVKLNQALPGYI--NAETIRDLTGIAGDM-SA 398 (410) T ss_pred Eeeec---------------C-------------Ccchhh-HHHHHHHHHHHHHhccCCc--cHHHHHHhcCCChHH-HH Confidence 11100 0 123333 3556667777766544321 234466777775433 33 Q ss_pred HHHHhhcccccc Q lcl|NC_020488. 581 RRLQKTLPPGIL 592 (688) Q Consensus 581 ~~~~~~~~~~~~ 592 (688) ..+.+...+.+. T Consensus 399 ~~~~~e~~~~g~ 410 (410) T protein:vir:95 399 KPVVSEGGSNGE 410 (410) T ss_pred HHHHHHHHhCCC Confidence 222211111111 No 107 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.69 E-value=2e-16 Score=106.67 Aligned_cols=459 Identities=11% Similarity=0.058 Sum_probs=207.2 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p~~~~N~i~~~i~ 79 (688) +||..+ ..+.+++++.+...+.. ++..-.+-.+||.|.|.-....... .+-..-..+.|..+-+|+ T Consensus 4 ~~~~~~------~~~~~~~~~~l~~~~~~-------~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 70 (484) T protein:vir:77 4 PLQKQE------NVDPEKAREEMLNLFTE-------RTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYID 70 (484) T ss_pred cccccC------CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHH Confidence 455443 44445566666665542 2223344578999987632210000 000111246798888899 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) ..++.+.-+- + ..| ++.+. ...+..+++.|+++.....+..++++.|.||+.|+. T Consensus 71 ~~~~~l~~~g--~-~~~------------------~~~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~ 125 (484) T protein:vir:77 71 AIAARQELEG--F-RLG------------------GADKA----DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISK 125 (484) T ss_pred HHHhhhccCc--e-ecC------------------Ccchh----HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEec Confidence 8887664221 1 111 11121 233556678899999999999999999999988876 Q ss_pred eeccCCCC--CcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCC Q lcl|NC_020488. 160 KYSTDDAF--DLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTN 235 (688) Q Consensus 160 ~~~~~~~~--~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~ 235 (688) +.+..... ...+++. +.+|..+ +|||..+. . ..+.+.+.+. + T Consensus 126 ~~~~~~~~~~~~~~~i~-~~~p~~~~~~~D~~~~~-----~-~~a~~~~~~~---------------------------~ 171 (484) T protein:vir:77 126 PDPNIDPGVDPEVPIIR-VEPPTNLYAQIDPRTRQ-----V-MRAIRAIEDE---------------------------E 171 (484) T ss_pred CCCCcccccccccceEE-EeccceeEEEecCCCCc-----e-EEEEEEEEee---------------------------c Confidence 64433221 1223443 4477776 47764321 1 1222222110 0 Q ss_pred CCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCcc Q lcl|NC_020488. 236 EEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTI 315 (688) Q Consensus 236 ~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~ 315 (688) ...+...++|..... +.+...+|. 
-.+.+..|-+.+.+ T Consensus 172 ~~~~~~~~~y~~~~~--~~~~~~~~~----------------------------------------~~~~~~~~~~~g~v 209 (484) T protein:vir:77 172 GNEVIGATLYLPNNT--VIWNREDGQ----------------------------------------WVQVANVAHNLEMV 209 (484) T ss_pred CCcEEEEEEEecCeE--EEEEecCCc----------------------------------------eEeeccccCCCCCc Confidence 111222333332110 001111111 11122234455778 Q ss_pred ceEEEeeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch---HHH-HhhcccCCCce Q lcl|NC_020488. 316 PVAPVLGKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY---EEE-WNQANRKNQSV 390 (688) Q Consensus 316 P~vp~~~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~---~~~-~~~~~~~~~~~ 390 (688) |+|||... ...+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++.....+.. ++. ........+.+ T Consensus 210 Pvv~f~N~--~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 287 (484) T protein:vir:77 210 PVIPIPNR--TRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARI 287 (484) T ss_pred ceEEeccc--cccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhh Confidence 88887542 3467788888775 69999999999999999988877666554422111110 000 00000111223 Q ss_pred eecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
391 LRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRA 469 (688) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~ 469 (688) +...+ +...+...+..+ ...+...+......+-.++++++..+|..+ |..||.|+......-..........|..+ T Consensus 288 ~~~~~--~~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 364 (484) T protein:vir:77 288 LAFED--HESKAQQFSAAE-LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGA 364 (484) T ss_pred cccCC--CCceeEeecCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32211 112222222222 233444444444444455788888988654 55799999887665555556666666666 Q ss_pred HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHH Q lcl|NC_020488. 470 IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSL 549 (688) Q Consensus 470 ~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l 549 (688) ++++.++++.+. . +.....++ +++.|.=.+.......+..+.+ T Consensus 365 l~~~~~l~~~~~----~--------~~~~~~~~-------------------------~~i~v~w~~~~~~s~~~~ad~~ 407 (484) T protein:vir:77 365 WEQAMRVAYKVM----N--------GGDIPPEY-------------------------YRMESIWRDPSTPTYAAKADAA 407 (484) T ss_pred HHHHHHHHHHHh----C--------CCCccccc-------------------------ccceEEecCCCCCCHHHHHHHH Confidence 666666554432 1 00001111 1111111112222233444455 Q ss_pred HHHHHhhHHHHHHHHHHHHHhcCCcc-HHHHHHHHHhhccccc---cchhhHHhhh-------hhhhhhhHHHHHHHHH Q lcl|NC_020488. 550 MQFVQAVPAAGGVVLDLIAKNMDWPG-AQDIARRLQKTLPPGI---LDQDEMEEAG-------IEPPQPSPEQQANMAQ 617 (688) Q Consensus 550 ~~~~q~~~~~~~~~~~~~~e~~~~~~-~~ei~~~~~~~~~~~~---~~~~~~~~~~-------~~~~~~~~~~q~~~~~ 617 (688) .++.+.... -+....+++++++-. ..+-.+++++...... .........+ ..+.++.+......+. 
T Consensus 408 ~kl~~~g~g--i~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 408 TKLYNNGQG--VIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPNPAEEAAA 484 (484) T ss_pred HHHHhccCC--CCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCCCccccCC Confidence 554432110 112234455554421 1111112211100000 0000000000 0000000000000000 No 108 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.69 E-value=1.3e-15 Score=102.19 Aligned_cols=397 Identities=14% Similarity=0.080 Sum_probs=211.1 Q ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHH----HHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCc Q lcl|NC_020488. 15 SQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPES----VRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRP 90 (688) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~----~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~ 90 (688) -...++.++...+.. .+....+-.+||.|+|.... .-..++..-+ ++.|..+.+|+.+.+...=+- T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~--~v~nw~~~iVds~a~rl~~~G- 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSV-------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYR--SILGWCAKGVDSLADRLVFRE- 70 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHHhccCchhhcchhhhHHHHHHHh--hhcChhHHHHHHhHhhccccc- Confidence 223355555444432 33445556789999886432 2223322223 456999999988765332110 Q ss_pred ceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcc Q lcl|NC_020488. 91 AIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLD 170 (688) Q Consensus 91 ~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~ 170 (688) | ...|.. +..+++.|+++.....+..++++.|.+++-|+-+ .++. 
T Consensus 71 ---f------------------~~~d~~--------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~------~dg~ 115 (409) T protein:vir:16 71 ---F------------------ENDDFT--------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKG------ENDA 115 (409) T ss_pred ---c------------------cCcchH--------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecC------CCCc Confidence 1 011221 4557788999999999999999999998877632 1355 Q ss_pred eeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeee Q lcl|NC_020488. 171 LCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYRE 248 (688) Q Consensus 171 ~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~ 248 (688) +.+..+ +|..+ +|||..+++. ...+.|-. + .+...+. ..+|.. T Consensus 116 ~~i~~~-sP~~~~~i~D~~~~~~~------~a~~~~~~---------d-----------------~~~~~~~-~~~~~~- 160 (409) T protein:vir:16 116 VRLQVI-EATNATGIIDPITGLLT------EGYAVLER---------D-----------------ENNNVVL-EAHFLP- 160 (409) T ss_pred eEEEEE-cccceEEEeecccccce------eeeEEEEe---------c-----------------CCCceEE-EEEEec- Confidence 666554 67654 5787544321 11111110 0 0000011 111111 Q ss_pred ecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccC Q lcl|NC_020488. 249 PVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIG 328 (688) Q Consensus 249 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~ 328 (688) +.+++. ...+..+...|+ +.+..|+|||+.. +.. T Consensus 161 -----------~~~~~~-------------------------------~~~~~~~~~~~~--~~g~vPvV~f~n~--~~~ 194 (409) T protein:vir:16 161 -----------DRTDYY-------------------------------YRDSRNNISIAN--PTGNPLLVPIIHR--PDA 194 (409) T ss_pred -----------CcEEEE-------------------------------EecCccccceec--CCCCcceEEeccc--ccc Confidence 111110 000000111223 4577899987653 334 Q ss_pred CcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcc--hHHHHhhcccCCCceeecCcccccccce-- Q lcl|NC_020488. 
329 DKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEG--YEEEWNQANRKNQSVLRYNAIPGVDRPQ-- 403 (688) Q Consensus 329 ~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 403 (688) +.++|.|-| +.++++|+.+|+.++.+.-.....+.++..+- |.-.+ ..+.|.. ..+.++.......++.+. T Consensus 195 ~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~~---~~~~i~~~~~d~~g~~~~v~ 270 (409) T protein:vir:16 195 VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDDAEPMETWKA---TVSSMLQFTKDEDGDKPTLG 270 (409) T ss_pred cccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCCCCccchhhh---hhhHhhccCCCCCCCCceEE Confidence 678888755 78999999999999999988888777766552 22110 1111211 112333332222223333 Q ss_pred ecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 404 RDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIP 482 (688) Q Consensus 404 ~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~ 482 (688) ..+..++ ..+...+......+-.+||++...+|..++ ..||.|+.+....-........+.|..+.+.++++++.+.- T Consensus 271 q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~ 349 (409) T protein:vir:16 271 QFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRD 349 (409) T ss_pred ecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 3444333 355666777777777788999999997654 47999998776555555566666667777777777665532 Q ss_pred HHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020488. 483 RVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGV 562 (688) Q Consensus 483 ~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~ 562 (688) .+ +. ..+++..+ .+...|+ .-+++.+ ..+..+++.++.+..|.+.. 
T Consensus 350 ~~-~~----------~~~~~~~~-----------~v~W~~~----------~~~~~~s-~a~~aDa~~Kl~~a~~~~~~- 395 (409) T protein:vir:16 350 DV-PY----------LREQFSKT-----------KPKWEPL----------FEADASM-LSLIGDGAIKLNQAIPEFIN- 395 (409) T ss_pred CC-Cc----------cchhhccc-----------eEEecCC----------CCcchhh-HHHHHHHHHHHHhhcccccc- Confidence 21 00 00011000 0000000 0122222 35667778888776443322 Q ss_pred HHHHHHHhcCCccHH Q lcl|NC_020488. 563 VLDLIAKNMDWPGAQ 577 (688) Q Consensus 563 ~~~~~~e~~~~~~~~ 577 (688) -..+.+++++...+ T Consensus 396 -~~v~~~~~g~~~~d 409 (409) T protein:vir:16 396 -KDTIRDLTGIKGAE 409 (409) T ss_pred -hhHHHHhccCCCCC Confidence 23446677776666 No 109 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.67 E-value=6.7e-16 Score=103.78 Aligned_cols=473 Identities=10% Similarity=0.001 Sum_probs=208.7 Q ss_pred CCCCcCCC-CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhc-CCCceeehhHHHHHHHH Q lcl|NC_020488. 4 GNEPIKTR-DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDE-GRPCLTLNKLPQYVDQV 81 (688) Q Consensus 4 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~-g~p~~~~N~i~~~i~~i 81 (688) -.-++.-. +.+....++..+...+. ..+....+-.+||+|+++-.......... ..-..+.|..+-+|+.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~-------~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~ 73 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFE-------DSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSI 73 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHH Confidence 11122222 33444445555544332 22334455678999998743211100000 01124568888999888 Q ss_pred HHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 
82 LGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 82 ~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) ++.+.-+ - |+.- .|.+.. ..++.++..|+++.....+..++++.|.||+.|+.+. T Consensus 74 ~~~l~~~---g-~~~~-----------------~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e 128 (485) T protein:vir:10 74 AERQAVE---G-FRFG-----------------DADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPD 128 (485) T ss_pred Hhhhccc---c-eecC-----------------CCchhH----HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCC Confidence 8766321 1 2111 112222 3345567789999999999999999999998887653 Q ss_pred ccCC--CCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCC Q lcl|NC_020488. 162 STDD--AFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 162 ~~~~--~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) ...+ ..++.+++..+ +|.++ +|||...+ ..+.++..+. .+.. T Consensus 129 ~~~~~~~~~~~~~i~~~-~p~~~~~~~D~~~~~-----~~~~~~~~~~----------------------------~~~~ 174 (485) T protein:vir:10 129 PQIDLGWDPNTPIIRVE-PPTRMYAEIDPRIGR-----VSKAIRVAYD----------------------------AEGN 174 (485) T ss_pred cccccccCCCeeEEEEE-ccceeEEEEcCCCCc-----eeEEEEEEEe----------------------------eCCC Confidence 3222 13455666554 78775 57774321 1222221110 0011 Q ss_pred EEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccce Q lcl|NC_020488. 238 GVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPV 317 (688) Q Consensus 238 ~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~ 317 (688) .+...++|..... 
..+...++ ........|.+.+.+|+ T Consensus 175 ~~~~~~~y~~~~~--~~~~~~~~----------------------------------------~~~~~~~~~~~~g~vPv 212 (485) T protein:vir:10 175 EIQAATLYTPNDI--FGWYRVEN----------------------------------------EWQEWFNNPHGLGVVPV 212 (485) T ss_pred eEEEEEEEeCCeE--EEEEEcCC----------------------------------------ceEEeccccCCCCcccE Confidence 2333334432211 00000011 11112234455677888 Q ss_pred EEEeeeeeccCCcccccchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch---HH-HHhhcccCCCceee Q lcl|NC_020488. 318 APVLGKEMVIGDKTYYRGLIR-FGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY---EE-EWNQANRKNQSVLR 392 (688) Q Consensus 318 vp~~~~~~~~~~~~~g~g~v~-~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~---~~-~~~~~~~~~~~~~~ 392 (688) |+|... ...+.++|.|-+. .++++++.+|+.+|.+...+...+.+..++.....++. +. -..-.....+.++. T Consensus 213 v~~~n~--~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~ 290 (485) T protein:vir:10 213 VPIPNR--TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILA 290 (485) T ss_pred EEeccc--cccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceec Confidence 886543 3356678888775 68999999999999999988877776654432111110 00 00001111233343 Q ss_pred cCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
393 YNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG-NEQSGKAILARQRQGDRGTFAYIDNLSRAIR 471 (688) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~-~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~ 471 (688) ..+ +...+...+..+ ...+...+......+-.+|++++..+|..+ |..||.|+......-..........|..+++ T Consensus 291 ~~~--~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~ 367 (485) T protein:vir:10 291 FED--AEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWE 367 (485) T ss_pred cCC--CCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 112232222222 233444444444444455788888888654 5579999998877666666677766777777 Q ss_pred HHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHH Q lcl|NC_020488. 472 RVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQ 551 (688) Q Consensus 472 ~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~ 551 (688) ++.++++.+...- ....++ +++.|.=.+..+....+..+.+.+ T Consensus 368 ~~~~l~~~~~~~~------------~~~~~~-------------------------~~i~v~w~~~~~~~~~~~ada~~k 410 (485) T protein:vir:10 368 EAMRLAYRMMKGG------------DVPPDM-------------------------LRMETVWRDPSTPTYAAKADAASK 410 (485) T ss_pred HHHHHHHHHhCCC------------CCcccc-------------------------eeeeEEecCCCCCCHHHHHHHHHH Confidence 7666655432110 000010 111121112222223333444444 Q ss_pred HHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 552 FVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAK 631 (688) Q Consensus 552 ~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~ 631 (688) +.+.-.. -+....+++++++- ++-.+.+++....+... ...+ +..+-.... 
T Consensus 411 l~~ag~~--~~s~et~~~~lg~~--~~~~~~~~~~~ee~~~~----------------------~~~~--~~~~~~~~~- 461 (485) T protein:vir:10 411 LYNGGTG--VIPRERARKDMGYS--IAEREEMRRWDEEEAAM----------------------GLGL--IGTMVDPNP- 461 (485) T ss_pred HHhcccc--CCCHHHHHHhCCCC--HhHHHHHHHHHHHHHHH----------------------HHHH--HHHhhccCC- Confidence 4432100 11122334444442 11112221110000000 0000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 632 AQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSL 665 (688) Q Consensus 632 ~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~ 665 (688) ...-+ .+...+ .......-. ...+ T Consensus 462 -~~~~~-~~~~~~---~~~~~~~~~-----~~~~ 485 (485) T protein:vir:10 462 -TVPGS-PSPAPA---PKPAALESG-----GDAA 485 (485) T ss_pred -CCCCC-CCcccc---ccCcCCCCC-----CCCC Confidence 00000 000000 000000000 0000 No 110 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.66 E-value=6.6e-15 Score=98.32 Aligned_cols=437 Identities=11% Similarity=-0.015 Sum_probs=201.9 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC---ceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP---CLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~~i~~i~g~~~~ 87 (688) -..+..++++.++...+. ..+....+-.+||+|+|.-...-.......+. .++.|..+-+|+..+|+... T Consensus 1 ~~~~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 344445667766655432 23344455678999988432111111112222 36789999999999999876 Q ss_pred CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 
88 NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 88 ~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) +...+ +... |.+..+ .+.-+++.|+++...+.+..++++.|.+|.-|+.+ . T Consensus 74 ~~~~~---~~~~----------------d~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d------~ 124 (456) T protein:vir:10 74 NGITV---GGSA----------------DSDLAL----RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------D 124 (456) T ss_pred CCeec---CCCC----------------CcchHH----HHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC------C Confidence 64332 1111 222222 24445678999999999999999999998766532 1 Q ss_pred CcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEE Q lcl|NC_020488. 168 DLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYF 245 (688) Q Consensus 168 ~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~ 245 (688) ++.+++..+ +|.++ +|||.... ...+++ +.+.+.+ .... -...+.....+.....+ T Consensus 125 ~g~~~i~~~-~p~~~~~i~d~~~~~----~~~~~i-~~~~~~d--------~~~~--------~~~~~~~~~~~~~~~~~ 182 (456) T protein:vir:10 125 DGTATITAD-SPETMVVSVDPLQPW----RIRAAM-RWWRDLD--------AESD--------FAIVWSGDGWQKFARPC 182 (456) T ss_pred CCceEEEEE-ccceeEEEEcCCCCc----ceEEEE-EEEEecC--------Ccee--------EEEEEeccceeEEEEEE Confidence 355666555 67764 57775432 122222 2221110 0000 00000000000000000 Q ss_pred ee-eecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeee Q lcl|NC_020488. 246 YR-EPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKE 324 (688) Q Consensus 246 ~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~ 324 (688) +. ......... 
...+.....+..|...+..|++++ T Consensus 183 ~~~~~~~~~~~~----------------------------------------~~~~~~~~~~~~~~~~~~~pvv~~---- 218 (456) T protein:vir:10 183 FVQSSSRRRLVT----------------------------------------RISDSWVPVGDAVVTGSPPPVVVY---- 218 (456) T ss_pred EEeecccceeee----------------------------------------ecCCceeeccccCCCCCceeEEEe---- Confidence 00 000000000 001111111111222233444432 Q ss_pred eccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhc---chHH------HHhhcccCCCceeecCc Q lcl|NC_020488. 325 MVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIE---GYEE------EWNQANRKNQSVLRYNA 395 (688) Q Consensus 325 ~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~---~~~~------~~~~~~~~~~~~~~~~~ 395 (688) ....|.|-+..++++++.+|+.+|.++......+.+...+. |... ..++ .........+.++..++ T Consensus 219 ----~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~ 293 (456) T protein:vir:10 219 ----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNVDENGNAIDYASIFEAAPGALWELPP 293 (456) T ss_pred ----cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccccccccccchhhhhhhhccccccCCC Confidence 12457799999999999999999987766655544433221 1110 0010 00111112233333222 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQ 475 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 475 (688) ...+...+..+ ...+...++.....+-.+||+++..+|...++.||.|+..+...-..........|..+++++.+ T Consensus 294 ---~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 294 ---GVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred ---CcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333322 34566667777777777889999999976666799999988877777777777888888888777 Q ss_pred HHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 476 ILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 476 ~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~ 555 (688) +++.+ .+. .++ +++.|.=.+..+....+..+.++++.+. T Consensus 370 l~~~~----~g~------------~~~-------------------------~~~~v~w~~~~~~~~~~~ada~~kl~~~ 408 (456) T protein:vir:10 370 KALQI----EGE------------SVE-------------------------DTVDVSFESPDRVTLGEKYSAASLAKAA 408 (456) T ss_pred HHHHh----cCC------------Ccc-------------------------cceeEEecCCCCcCHHHHHHHHHHHHHc Confidence 76532 110 000 0011110011111123334444444332 Q ss_pred hHHHHHHHHHHHHHhcCCccHHHHH----HHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDWPGAQDIA----RRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAK 631 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~~~~~ei~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~ 631 (688) +-+....+.++.++-. +++. ++++++..... ....+. 
T Consensus 409 ----gi~~~~~~~~~lg~~~-~~i~~~e~er~~~e~~~~~-------~~~~~~--------------------------- 449 (456) T protein:vir:10 409 ----GESWASIRRNILNYNA-DQIKQDDLDRAREQITLFA-------GNPVQR--------------------------- 449 (456) T ss_pred ----CCChHHHHHhhCCCCH-HHHHHHHHHHHHHHHHHHh-------hhhhhc--------------------------- Confidence 1111122233333321 1111 11111000000 000000 Q ss_pred HHHHHHH Q lcl|NC_020488. 632 AQADMAM 638 (688) Q Consensus 632 ~q~e~~~ 638 (688) .+-+..+ T Consensus 450 ~~~~~~~ 456 (456) T protein:vir:10 450 PQEDGSR 456 (456) T ss_pred CCCCCCC Confidence 0000000 No 111 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.66 E-value=6.6e-15 Score=98.32 Aligned_cols=437 Identities=11% Similarity=-0.015 Sum_probs=201.9 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC---ceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP---CLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~~i~~i~g~~~~ 87 (688) -..+..++++.++...+. ..+....+-.+||+|+|.-...-.......+. .++.|..+-+|+..+|+... T Consensus 1 ~~~~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 344445667766655432 23344455678999988432111111112222 36789999999999999876 Q ss_pred CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 88 NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 88 ~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) +...+ +... 
|.+..+ .+.-+++.|+++...+.+..++++.|.+|.-|+.+ . T Consensus 74 ~~~~~---~~~~----------------d~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d------~ 124 (456) T protein:vir:10 74 NGITV---GGSA----------------DSDLAL----RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------D 124 (456) T ss_pred CCeec---CCCC----------------CcchHH----HHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC------C Confidence 64332 1111 222222 24445678999999999999999999998766532 1 Q ss_pred CcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEE Q lcl|NC_020488. 168 DLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYF 245 (688) Q Consensus 168 ~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~ 245 (688) ++.+++..+ +|.++ +|||.... ...+++ +.+.+.+ .... -...+.....+.....+ T Consensus 125 ~g~~~i~~~-~p~~~~~i~d~~~~~----~~~~~i-~~~~~~d--------~~~~--------~~~~~~~~~~~~~~~~~ 182 (456) T protein:vir:10 125 DGTATITAD-SPETMVVSVDPLQPW----RIRAAM-RWWRDLD--------AESD--------FAIVWSGDGWQKFARPC 182 (456) T ss_pred CCceEEEEE-ccceeEEEEcCCCCc----ceEEEE-EEEEecC--------Ccee--------EEEEEeccceeEEEEEE Confidence 355666555 67764 57775432 122222 2221110 0000 00000000000000000 Q ss_pred ee-eecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeee Q lcl|NC_020488. 246 YR-EPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKE 324 (688) Q Consensus 246 ~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~ 324 (688) +. ......... ...+.....+..|...+..|++++ T Consensus 183 ~~~~~~~~~~~~----------------------------------------~~~~~~~~~~~~~~~~~~~pvv~~---- 218 (456) T protein:vir:10 183 FVQSSSRRRLVT----------------------------------------RISDSWVPVGDAVVTGSPPPVVVY---- 218 (456) T ss_pred EEeecccceeee----------------------------------------ecCCceeeccccCCCCCceeEEEe---- Confidence 00 000000000 001111111111222233444432 Q ss_pred eccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhc---chHH------HHhhcccCCCceeecCc Q lcl|NC_020488. 
325 MVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIE---GYEE------EWNQANRKNQSVLRYNA 395 (688) Q Consensus 325 ~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~---~~~~------~~~~~~~~~~~~~~~~~ 395 (688) ....|.|-+..++++++.+|+.+|.++......+.+...+. |... ..++ .........+.++..++ T Consensus 219 ----~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~ 293 (456) T protein:vir:10 219 ----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNVDENGNAIDYASIFEAAPGALWELPP 293 (456) T ss_pred ----cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccccccccccchhhhhhhhccccccCCC Confidence 12457799999999999999999987766655544433221 1110 0010 00111112233333222 Q ss_pred ccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 396 IPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQ 475 (688) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 475 (688) ...+...+..+ ...+...++.....+-.+||+++..+|...++.||.|+..+...-..........|..+++++.+ T Consensus 294 ---~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~r 369 (456) T protein:vir:10 294 ---GVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) T ss_pred ---CcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333322 34566667777777777889999999976666799999988877777777777888888888777 Q ss_pred HHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 476 ILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 476 ~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~ 555 (688) +++.+ .+. .++ +++.|.=.+..+....+..+.++++.+. 
T Consensus 370 l~~~~----~g~------------~~~-------------------------~~~~v~w~~~~~~~~~~~ada~~kl~~~ 408 (456) T protein:vir:10 370 KALQI----EGE------------SVE-------------------------DTVDVSFESPDRVTLGEKYSAASLAKAA 408 (456) T ss_pred HHHHh----cCC------------Ccc-------------------------cceeEEecCCCCcCHHHHHHHHHHHHHc Confidence 76532 110 000 0011110011111123334444444332 Q ss_pred hHHHHHHHHHHHHHhcCCccHHHHH----HHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDWPGAQDIA----RRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAK 631 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~~~~~ei~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~ 631 (688) +-+....+.++.++-. +++. ++++++..... ....+. T Consensus 409 ----gi~~~~~~~~~lg~~~-~~i~~~e~er~~~e~~~~~-------~~~~~~--------------------------- 449 (456) T protein:vir:10 409 ----GESWASIRRNILNYNA-DQIKQDDLDRAREQITLFA-------GNPVQR--------------------------- 449 (456) T ss_pred ----CCChHHHHHhhCCCCH-HHHHHHHHHHHHHHHHHHh-------hhhhhc--------------------------- Confidence 1111122233333321 1111 11111000000 000000 Q ss_pred HHHHHHH Q lcl|NC_020488. 632 AQADMAM 638 (688) Q Consensus 632 ~q~e~~~ 638 (688) .+-+..+ T Consensus 450 ~~~~~~~ 456 (456) T protein:vir:10 450 PQEDGSR 456 (456) T ss_pred CCCCCCC Confidence 0000000 No 112 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.66 E-value=7.1e-15 Score=98.15 Aligned_cols=450 Identities=12% Similarity=0.063 Sum_probs=215.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCC----CHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQW----PESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw----~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) |.|.+-..-..-++....++.++...+.. .+..-.+-.+||+|+|= +...-..++.. 
..+.|..+- T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~-------~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~---~~v~nw~~~ 70 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIEN-------LRWKNLLRTSYYENKRTIQYVGTLIPPQYFNL---GLVLGWTGK 70 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHH-------HhhHHHHHHHHhccCCChhhccccccHHHHHH---HhhcChHHH Confidence 99988866666666666677776665442 22234445689999743 21111222211 136788888 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) .|+.......=+ =-+.|.+. .. +..+.-++..|+++.....+..++++.|.+++- T Consensus 71 ~Vd~~a~rl~~~---Gf~~~d~~-----------------~~-----~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~ 125 (474) T protein:vir:81 71 AVDALARRCNLE---GFVWPDGD-----------------LD-----SLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLI 125 (474) T ss_pred HHHHHHhhhccc---ceECCCCC-----------------cc-----chHHHHHHHhcChhHHHHHHHHHHHhhCceeEE Confidence 887765433211 11112111 01 112456678999999999999999999999988 Q ss_pred EEEeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCC Q lcl|NC_020488. 157 VLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWT 234 (688) Q Consensus 157 v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~ 234 (688) |+.. +..++.+.+. +.+|..+ +|||..+.+ ...+.+...+. T Consensus 126 V~~~----~d~~~~~~i~-~~sp~~~~~~~D~~~~~~-----~~al~~~~~~~--------------------------- 168 (474) T protein:vir:81 126 NTVG----EDDEPEALIH-VKDASEATGEWNRRRRGL-----NNLLSIIDKDK--------------------------- 168 (474) T ss_pred EecC----CCCCceeEEE-EeccceEEEEEeCCCCcc-----eeeeEEEEEcC--------------------------- Confidence 7643 2233345554 4477765 488853321 11111111100 Q ss_pred CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCc Q lcl|NC_020488. 
235 NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGST 314 (688) Q Consensus 235 ~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~ 314 (688) +.. .....+|..... +.+...++...| ..+..|.+.+ T Consensus 169 ~g~-~~~~~ly~~~~~--~~~~~~~~~~~w---------------------------------------~~~~~~~~~g- 205 (474) T protein:vir:81 169 EGK-VLSLALYLDNET--VTAQRDKATLKW---------------------------------------QVDRDEHVYG- 205 (474) T ss_pred CCc-EEEEEEEeCCcE--EEEEEcCcccee---------------------------------------eeccCCCCCC- Confidence 000 111112211100 000001111111 1111222334 Q ss_pred cceEEEeeeeeccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch-------HHHHhhcccC Q lcl|NC_020488. 315 IPVAPVLGKEMVIGDKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY-------EEEWNQANRK 386 (688) Q Consensus 315 ~P~vp~~~~~~~~~~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~-------~~~~~~~~~~ 386 (688) .|+|||+.. +.-+.++|.|-| +.++++|+.+|+.++.+.-.....+.++-.+..-..... ...|... T Consensus 206 vPvV~~~n~--~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~--- 280 (474) T protein:vir:81 206 VPAQVLPYK--PAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEAR--- 280 (474) T ss_pred cceEEeccc--ccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhh--- Confidence 688887554 234567777654 799999999999999999988887777655431111100 0111110 Q ss_pred CCceeecCcccccc-------cceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCC--cchhhHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVD-------RPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQ--GNEQSGKAILARQRQGDR 457 (688) Q Consensus 387 ~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~--~~~~sg~ai~~~~~~~~~ 457 (688) .+.++.+.....+. ++...+..+ ...+...+......+-.+||++...+|.. .|..||.||.+....-.. 
T Consensus 281 ~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~ 359 (474) T protein:vir:81 281 LGRIKGLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIA 359 (474) T ss_pred HHHHhcCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHH Confidence 11122222111111 122222222 23455556666666667789999999953 567899999888777667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEe-ccc Q lcl|NC_020488. 458 GTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVK-AGP 536 (688) Q Consensus 458 ~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~-~~~ 536 (688) ......+.|..+.++++++.+.+.-.+--+ + ...++ +++.|. -.| T Consensus 360 kae~k~~~fg~~l~~~~rla~~i~~~~~~~----~-----~~~~~-------------------------~~~~v~W~d~ 405 (474) T protein:vir:81 360 EAEGAVDDFTPALRKAFIRALAMKNKVAID----E-----IPDEW-------------------------KSIDAKWRDP 405 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCcc----c-----cchhh-------------------------ccceeEecCC Confidence 777777778888888888877654222100 0 00011 111111 112 Q ss_pred CcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHH Q lcl|NC_020488. 537 SYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMA 616 (688) Q Consensus 537 ~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~ 616 (688) .+.|. .+..+++.++.+..+.+.. -..+.+++++. .+++........ .+ T Consensus 406 ~~~s~-a~~aDa~~Kl~~a~~~~~~--~~~~~~~lg~t-~~~i~~~~~~~~---------------------------~~ 454 (474) T protein:vir:81 406 RYLSK-SAQADAGMKQLAAVPWLAE--TEVGLELIGLT-PQQARRAMADKR---------------------------RV 454 (474) T ss_pred CccCH-HHHHHHHHHHHhcccCCCc--HHHHHhhcCCC-HHHHHHHHHHHH---------------------------HH Confidence 33332 3445666666654322111 11233344442 122221110000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
617 QAQADMEKAKADTAKAQADMAMAQ 640 (688) Q Consensus 617 ~~q~~~~~~q~e~~~~q~e~~~~q 640 (688) ..+..+..+ .....+.+.+| T Consensus 455 ~~~~~~~~l----~~~~~~~~~aq 474 (474) T protein:vir:81 455 QGRGTLQAL----IDRSNNGATAQ 474 (474) T ss_pred hHHHHHHHH----HhcCCCCCCCC Confidence 000000000 00011111111 No 113 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.64 E-value=9.1e-15 Score=97.56 Aligned_cols=442 Identities=11% Similarity=0.001 Sum_probs=201.1 Q ss_pred CCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCC---CceeehhHHHHHHHHHHHHHh Q lcl|NC_020488. 11 RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGR---PCLTLNKLPQYVDQVLGDQRQ 87 (688) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~---p~~~~N~i~~~i~~i~g~~~~ 87 (688) -..+..++++..+...+. ..+....+-.+||.|++=-...........+ ..++.|..+-+|+..+++... T Consensus 1 ~~~~t~~~~~~~l~~~~~-------~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRID-------DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhcc Confidence 233344556665555332 2233345557899997511100000111111 135689999999999999877 Q ss_pred CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 88 NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 88 ~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) +...+ +.. .|.+..+. 
+.-++..|+++.....+..+++++|.+|+-++.+ + T Consensus 74 ~g~~~---~~~----------------~d~~~~~~----~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~---e--- 124 (456) T protein:vir:79 74 NGITV---GGS----------------ADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---D--- 124 (456) T ss_pred CCeec---CCC----------------CCccHHHH----HHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC---C--- Confidence 74332 111 12222222 3445667899999999999999999998766543 2 Q ss_pred CcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEE Q lcl|NC_020488. 168 DLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYF 245 (688) Q Consensus 168 ~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~ 245 (688) ++.+++..+ +|.++ +|||.... ....+ .+.+-+.++-. .....+.....+....+| T Consensus 125 dg~~~i~~~-~p~~~~~i~d~~~~~----~~~~~-~~~~~~~d~~~----------------~~~~~~~~~~~~~~~~~~ 182 (456) T protein:vir:79 125 DGTATITAD-SPETMVVSVDPLQPW----RIRSA-MRWWRDLDAES----------------DFAIVWSGDGWQKFARPC 182 (456) T ss_pred CCceEEEEe-ccceeEEEEcCCCCC----ceEEE-EEEEEecCCce----------------eEEEEEcCCceEEEEEEE Confidence 345666555 67764 56764432 11112 22221110000 000111112222222222 Q ss_pred eeeecce-eeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeee Q lcl|NC_020488. 246 YREPVTR-KLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKE 324 (688) Q Consensus 246 ~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~ 324 (688) +...... ...... .+........+..++..|++++ T Consensus 183 ~~~~~~~~~~~~~~----------------------------------------~~~~~~~~~~~~~~~~~pvv~~---- 218 (456) T protein:vir:79 183 FVQSSSRRRLVTRI----------------------------------------SDSWVPVGDAVVTGSPPPVVVY---- 218 (456) T ss_pred Eeeccccceeeecc----------------------------------------CCceeecccccCCCCceeEEEe---- Confidence 1110000 000000 0000111112223455565543 Q ss_pred eccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhc--chHH------HHhhcccCCCceeecCcc Q lcl|NC_020488. 
325 MVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIE--GYEE------EWNQANRKNQSVLRYNAI 396 (688) Q Consensus 325 ~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~--~~~~------~~~~~~~~~~~~~~~~~~ 396 (688) ....|.|-+..++++++.+|+.+|.+...+...+.+...+...... ..++ .........+.++..++ T Consensus 219 ----~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~- 293 (456) T protein:vir:79 219 ----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPP- 293 (456) T ss_pred ----cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCC- Confidence 1245679999999999999999998877666655544333211110 0010 00111112233333222 Q ss_pred cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 397 PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQI 476 (688) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~ 476 (688) ...+..++..+ ...+...+......+-..||+++..+|...++.||.|+..+...-..........|..+++++.++ T Consensus 294 --~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 370 (456) T protein:vir:79 294 --GVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) T ss_pred --CcceeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333333 355666777777777788999999999776667999999887777777777777777777777666 Q ss_pred HHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh Q lcl|NC_020488. 477 LIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV 556 (688) Q Consensus 477 ~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~ 556 (688) ++. +.+. .+... +.|+=.. +...+ ..+..+.++++.+. 
T Consensus 371 ~~~----~~g~------------~~~~~-----------------------i~v~w~~-~~~~s-~~~~ada~~kl~~~- 408 (456) T protein:vir:79 371 ALQ----IEGE------------SVEDT-----------------------VDVSFES-PDRVT-LGEKYSAASLAKAA- 408 (456) T ss_pred HHH----hcCC------------Ccccc-----------------------ceEEeCC-CCCcC-HHHHHHHHHHHHhc- Confidence 553 3221 00000 0000000 11111 23333444443321 Q ss_pred HHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) +-+....+.+.+++-. +++.+ +. .+..+.+.......+ .+.-+-+. T Consensus 409 ---G~~~~~~~~~~lg~~~-~~i~~-~e----------------------------~~r~~~e~~~~~~~~-~~~~~~~~ 454 (456) T protein:vir:79 409 ---GESWASIRRNILNYNA-DQIKQ-DD----------------------------LDRAREQITLFAGNP-VQRPQEDG 454 (456) T ss_pred ---CCChHHHHHhcCCCCH-HHHHH-HH----------------------------HHHHHHHHHHHhhhH-hhcCCCCC Confidence 0011112222333311 11110 00 000000000000000 00011111 Q ss_pred HH Q lcl|NC_020488. 637 AM 638 (688) Q Consensus 637 ~~ 638 (688) ++ T Consensus 455 ~~ 456 (456) T protein:vir:79 455 SR 456 (456) T ss_pred CC Confidence 11 No 114 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.62 E-value=8.2e-14 Score=92.34 Aligned_cols=451 Identities=13% Similarity=0.024 Sum_probs=205.3 Q ss_pred CCCCCCCcCCC------CccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHH----HHHHHHhcCCCcee Q lcl|NC_020488. 1 MLPGNEPIKTR------DDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPES----VRKEREDEGRPCLT 70 (688) Q Consensus 1 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~----~~~~~~~~g~p~~~ 70 (688) |-|-.-...+- -++....++.++...+. ..+....+-.+||.|+|.-.. .-..++.. 
..+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~-------~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~---~~v 70 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLV-------DRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRT---ATV 70 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHH-------HHhHHHHHHHHHHhccccchhccccccHHHHHH---hhc Confidence 44433311111 12222234444444332 223444455689999875321 11122111 256 Q ss_pred ehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHc Q lcl|NC_020488. 71 LNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEG 150 (688) Q Consensus 71 ~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~ 150 (688) .|..+-+|+.......-+ -.+.|- +.+. +..+..+++.|+++.....+..++++. T Consensus 71 ~n~~~~iVd~~a~rl~~~---Gf~~~d------------------~~~~----~~~l~~i~~~N~ld~~~~~~~~~a~iy 125 (504) T protein:vir:99 71 LGWSAKAVDTLARRCNLE---SFVWPD------------------GDYG----SIGGPDVWDENFFATKANNAMVSSLIH 125 (504) T ss_pred cCcHHHHHHHHHhhhccc---eeeCCC------------------CChh----hHHHHHHHHhcChhhHHHHHHHHHHhh Confidence 788888888766433211 111111 1111 233566778899999999999999999 Q ss_pred CCceEEEEEeeccCCCCCcceeEEEecccceE--EeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhccccccc Q lcl|NC_020488. 151 GFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERG 228 (688) Q Consensus 151 G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~ 228 (688) |.+++-|+-+ +.....++|..+ +|.++ +|||....+ ...++.... T Consensus 126 G~af~~v~~~----~d~~~~~~I~~~-sP~~~~~iyD~~~~~~-----~~a~~~~~~----------------------- 172 (504) T protein:vir:99 126 GPAFLINTEG----GAGEPDSLIHVK-SAMQATGEWNSRRNAM-----DSLLSITSR----------------------- 172 (504) T ss_pred CceeEEEecC----CCCCceeEEEEe-ccceeEEEEeCCCCce-----eEEEEEEEe----------------------- Confidence 9998777533 112233445444 77765 588753321 111111100 Q ss_pred ccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 
229 EYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 229 ~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) +.+ .....+++|+..... + ...++.+ .+..+.. T Consensus 173 ----d~~-g~~~~~~~y~~~~~~-~--~~~~~~~---------------------------------------~~~~~~~ 205 (504) T protein:vir:99 173 ----DAE-GHPTGIALYEDGVTV-T--ADMDDDG---------------------------------------DWHADVR 205 (504) T ss_pred ----cCC-CeEEEEEEEcCCcEE-E--EEEcCCc---------------------------------------eeeeccc Confidence 000 112223444321110 0 0001110 1111222 Q ss_pred CCCCCccceEEEeeeeeccCCcccccchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhc-c-------hHHH Q lcl|NC_020488. 309 DWPGSTIPVAPVLGKEMVIGDKTYYRGLI-RFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIE-G-------YEEE 379 (688) Q Consensus 309 p~~~~~~P~vp~~~~~~~~~~~~~g~g~v-~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~-~-------~~~~ 379 (688) |.+++ .|+|||+.. +..+.++|.|-+ +.++++++.+|+.++.++-.....+.+...+- |.-. + .... T Consensus 206 ~~~~g-vPvV~~~n~--~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~-G~~~~~~~~~d~~~~~~ 281 (504) T protein:vir:99 206 THKLG-VPVEVLPYK--PREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL-GADAKNFRNKDGSMKPA 281 (504) T ss_pred cCCCC-cceEEeccc--ccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-cCCccccccccccccch Confidence 33344 788887643 234667777754 68999999999999999888877666654432 2110 0 0011 Q ss_pred HhhcccCCCceeecCccc-----cc--ccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc--chhhHHHHHH Q lcl|NC_020488. 380 WNQANRKNQSVLRYNAIP-----GV--DRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG--NEQSGKAILA 450 (688) Q Consensus 380 ~~~~~~~~~~~~~~~~~~-----~~--~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~--~~~sg~ai~~ 450 (688) |.. ..+.++.+.... .. ..+...+..++ ..+...+......+-.+||+++..+|..+ |..||.|+.. 
T Consensus 282 ~~~---~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l-~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~ 357 (504) T protein:vir:99 282 WQI---ALARVFALPDDEDEPDAARARADVKQFPASSP-QPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIA 357 (504) T ss_pred hhh---hhhhhhcCCCccccccccCccceeeecCCCCh-HHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHH Confidence 110 011122111110 01 11222222222 34455555555566666999999999643 5679999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEE Q lcl|NC_020488. 451 RQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDV 530 (688) Q Consensus 451 ~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv 530 (688) ....-........+-|..+.++++++.+.+.... +. ...++.. + T Consensus 358 ~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~-~~----------~~~~~~~-------------------------~ 401 (504) T protein:vir:99 358 SREDLIAEAEGATDDWSPAFRRSMIRALAIKNGL-DR----------IPPEWKT-------------------------I 401 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Cc----------ccccccc-------------------------c Confidence 8776667777777778888888888776654322 10 0011110 1 Q ss_pred EEe-cccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHH---HHHhhccccccch------------ Q lcl|NC_020488. 531 TVK-AGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIAR---RLQKTLPPGILDQ------------ 594 (688) Q Consensus 531 ~v~-~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~---~~~~~~~~~~~~~------------ 594 (688) .|. ..+...+ ..+..+.+.++.+..+.+. ..-..+++++++. .+++.+ ..++......... T Consensus 402 ~v~w~d~~~~s-~a~~aDa~~Kl~~ag~~l~-~~~~~l~~~lg~~-~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~ 478 (504) T protein:vir:99 402 DSKFRSPLYLS-KAAQADAGAKMLGAGPEWL-KETEVGLELLGLT-PQQAKRALAERRRASSVSIIEALNRRQQEAATAG 478 (504) T ss_pred eeEecCCCccC-HHHHHHHHHHHHhhccccc-cchHHHHhhcCCC-HHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCC Confidence 110 0112222 2344555555554322110 0112344555553 222221 1111111000000 Q ss_pred ---hhH--Hhhhhh--hhhhhHHHHH Q lcl|NC_020488. 
595 ---DEM--EEAGIE--PPQPSPEQQA 613 (688) Q Consensus 595 ---~~~--~~~~~~--~~~~~~~~q~ 613 (688) .+. +.+... ..-..|.+.- T Consensus 479 ~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 479 EDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred CCCCcCCCCCCCCCCCccCCCcccCC Confidence 000 000000 0000111111 No 115 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.62 E-value=4.6e-15 Score=99.19 Aligned_cols=469 Identities=10% Similarity=-0.008 Sum_probs=200.8 Q ss_pred CcCCCCccchHHHHHH-HHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH----HHHHhcCCCceeehhHHHHHHHH Q lcl|NC_020488. 7 PIKTRDDDSQEAILQE-IRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR----KEREDEGRPCLTLNKLPQYVDQV 81 (688) Q Consensus 7 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~----~~~~~~g~p~~~~N~i~~~i~~i 81 (688) ++..+.++...+-+.+ +..++-..+. .......+-.+||.|++.-.... ......-.-.++.|..+-+|+.. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~---~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 77 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMN---TECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSF 77 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHH---HHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHH Confidence 7777766444443333 3323332222 23334445568999987521100 00000001113568888888877 Q ss_pred HHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 82 LGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 82 ~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) ++... +.- |..- |.+..+ .+..++..|+++.....+..++++.|.+|+.|+-.. 
T Consensus 78 ~~~l~---~~g-f~~~------------------d~~~~~----~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~ 131 (479) T protein:vir:99 78 AQQLI---VDG-YRKT------------------GTNENA----KGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGI 131 (479) T ss_pred Hhhcc---ccc-ccCC------------------CchhhH----HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC Confidence 66442 111 1110 111122 234566779999999999999999999887665211 Q ss_pred ccCCCCCcceeEEEecccceEE--eCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEE Q lcl|NC_020488. 162 STDDAFDLDLCIKSIHNRFAVL--MDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGV 239 (688) Q Consensus 162 ~~~~~~~~~~~~~~v~~~~~v~--~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v 239 (688) . ....++.+.+..+ +|.+++ ||...+. . ..++. ++. +... T Consensus 132 ~-~~d~~g~~~i~~~-~p~~~~~iydd~~~~----~-~~~~~---~~~---------------------------~~~~- 173 (479) T protein:vir:99 132 S-PLDGTTVARIKCI-DPRDAFAIWEDPYWD----E-WPKYL---LER---------------------------QPNG- 173 (479) T ss_pred C-CcCCCCceEEEEe-chhheEEEecCCccc----c-eeeEE---Eee---------------------------cCce- Confidence 1 1123455666554 677754 4432211 0 00110 000 0000 Q ss_pred EEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEE Q lcl|NC_020488. 240 RVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAP 319 (688) Q Consensus 240 ~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp 319 (688) ...+|.. ....++... +|...++++.|-+.+.+|+|| T Consensus 174 -~~~~~~~--~~~~~~~~~----------------------------------------~~~~~~~~~~~h~~g~vPvv~ 210 (479) T protein:vir:99 174 -QYWWWTE--EDYSIFEFK----------------------------------------QGKFIYRETVSHDYGHIPFVR 210 (479) T ss_pred -eEEEEec--ceEEEEEec----------------------------------------CCceeeccccccCCCCcceEE Confidence 0011100 000000000 111112233344446777777 Q ss_pred EeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHH-HHhhcccCCCceeecCcccc Q lcl|NC_020488. 
320 VLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEE-EWNQANRKNQSVLRYNAIPG 398 (688) Q Consensus 320 ~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~ 398 (688) |.... ....+|.|.+..++++++.+|+.+|.+...+...+.+..++.......... .........+.++...+. T Consensus 211 f~n~~---~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~~~-- 285 (479) T protein:vir:99 211 YVNVM---DLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLISQNE-- 285 (479) T ss_pred eecCC---CcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceeecCC-- Confidence 65432 123578999999999999999999999988887777765543221111110 001111112223322221 Q ss_pred cccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 399 VDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILI 478 (688) Q Consensus 399 ~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~ 478 (688) ...+..++... ...+...++.....+-.+||+++...|..+| .||.|+..+...-........+.|..++++++++++ T Consensus 286 ~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~ 363 (479) T protein:vir:99 286 KASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVN-VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVN 363 (479) T ss_pred CceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222222 3445555565566666668899999997655 699999988766666667777777777777776655 Q ss_pred HHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEec-ccCcHHHHHHHHHHHHHHHHhhH Q lcl|NC_020488. 479 ELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKA-GPSYQTQRMEAADSLMQFVQAVP 557 (688) Q Consensus 479 ~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~-~~~~~s~r~~~~~~l~~~~q~~~ 557 (688) .+. + .....+. 
+++.+.= -+.+.+ ..+..+.+.++.++ T Consensus 364 ~~~----~---------~~~~~~~-------------------------~~i~~~w~~~~~~s-~~~~ad~~~kl~~a-- 402 (479) T protein:vir:99 364 KIE----G---------RTEEATD-------------------------LDFTITWQDVTIQS-LAQFADAWAKMVES-- 402 (479) T ss_pred HHc----C---------CCccccc-------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHhc-- Confidence 432 1 1001111 1122210 111222 22344445554432 Q ss_pred HHHHHHHHHHHHhc-CCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 558 AAGGVVLDLIAKNM-DWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 558 ~~~~~~~~~~~e~~-~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) ..+....+++++ ++.. .++ +++++............... ...+.+..+...-.... .++.+..+ T Consensus 403 --g~is~et~l~~l~gv~~-~~~-e~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-----~~~~~~~~--- 467 (479) T protein:vir:99 403 --LKIPAEGVWDMIPNLDQ-STV-NGWKEIYDREGDFGKYMRKL---QNGPDPAEQRGGPNGAT-----NMQQANNK--- 467 (479) T ss_pred --CCCCHHHHHHhcCCCCH-HHH-HHHHHHHHHHHHHHHHHHHH---hcccCcccccCCCCCCC-----CCCCCCCC--- Confidence 112233444444 2321 221 11111100000000000000 00000000000000000 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 637 AMAQAKTAEAQAKLAEIEQ 655 (688) Q Consensus 637 ~~~q~~~~~~~a~~~~~~~ 655 (688) ...-+.+-.+.. T Consensus 468 -------~~~~~~~~~~~~ 479 (479) T protein:vir:99 468 -------TGEPASLNKSGA 479 (479) T ss_pred -------CcchhccCCCCC Confidence 000000000000 No 116 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.62 E-value=8.9e-15 Score=97.62 Aligned_cols=479 Identities=13% Similarity=0.058 Sum_probs=204.2 Q ss_pred CCCCCCCcCCCCc-cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC---ceeehhHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDD-DSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRP---CLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p---~~~~N~i~~ 76 (688) --|..+ +..+++ .+.+.+...+...+. .+...+..-.+-.+||.|+|.....-.......++ ..+.|..+- T Consensus 10 ~~~~~~-~~~p~~~~~~~~~~~l~~~l~~----~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ 84 (501) T protein:vir:25 10 DAPAAD-VEFPEDSMSREQLGALVADMWR----LHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSL 84 (501) T ss_pred ccCccc-ccCCcccCChHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHH Confidence 233333 222222 233333333333222 22222333444568999998643221111111111 245688888 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +|+...+... ++--..| |.... ..+..+++.|+++.....+..++++.|.||+. T Consensus 85 ivd~~a~~l~---~~gf~~~-------------------d~~~~----~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 138 (501) T protein:vir:25 85 VRDSFAQNLS---VVGYRNA-------------------LAKEN----DPAWEMWQRNRMDARQAEVHRPALTYGASYVT 138 (501) T ss_pred HHHHHHhhhc---ccceecC-------------------Cccch----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 8887776553 1111111 11111 12445678899999999999999999999987 Q ss_pred EEEeeccCCCCCcceeEEEecccceEE--e-CCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccC Q lcl|NC_020488. 157 VLTKYSTDDAFDLDLCIKSIHNRFAVL--M-DPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWW 233 (688) Q Consensus 157 v~~~~~~~~~~~~~~~~~~v~~~~~v~--~-Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~ 233 (688) |+.+ ++ + .++.. .+|.+++ | ||..... 
...+.+.+....+ T Consensus 139 v~~d---e~---~-~~i~~-~sp~~~~~iy~D~~~~~~-----~~~ai~~~~~~~~------------------------ 181 (501) T protein:vir:25 139 VTPT---DE---G-PVFRT-RSPRQILAVYADPSVDAW-----PQYALETWVAQKD------------------------ 181 (501) T ss_pred EecC---CC---C-CeEEE-eccccEEEEEecCCCCcc-----eeEEEEEEeeccc------------------------ Confidence 7643 22 2 23443 4777653 4 5543321 1222223321110 Q ss_pred CCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCC Q lcl|NC_020488. 234 TNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGS 313 (688) Q Consensus 234 ~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~ 313 (688) .+.....++|..+ .++....+..+........ ...... . ....+....+...|-+.+ T Consensus 182 --~~~~~~~~~y~~~----~~~~~~~~~~~~~~~~~~~-----------~~~~~~---~---~~~~~~~~~~~~~~~~~~ 238 (501) T protein:vir:25 182 --AKPHRRGVLYDDT----YMYELDLGEVVLGDAGGGQ-----------ATQQPV---N---VREVTDVIEHGATFEGKP 238 (501) T ss_pred --cCcceeEEEecCe----eEEEEecCceeeeeccccc-----------cccccc---c---ccccccccccccccCCcc Confidence 0001111111110 0000000000000000000 000000 0 000011111122233334 Q ss_pred ccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeec Q lcl|NC_020488. 314 TIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRY 393 (688) Q Consensus 314 ~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 393 (688) ..|+|+|.-. .....+|.|.+..++++++.+|+.++.+.......+.+...+. |.-.+..+.+. ...+.++.. 
T Consensus 239 ~vPiv~f~N~---~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~~~~~---~~~~~i~~~ 311 (501) T protein:vir:25 239 VCPVVRFVNG---RDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS-GWTGSKAEVLK---ASALRVWTF 311 (501) T ss_pred ceeeEeccCc---cccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh-CCCCCccchhh---hcccceecc Confidence 4555554332 2334568899999999999999999999888877766644332 32222212222 223344443 Q ss_pred CcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 394 NAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRV 473 (688) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~ 473 (688) .+. ...+...+... ...+...+......+-..|++++...|...++.||.|+......-........+.|..+++++ T Consensus 312 ~~~--~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~ 388 (501) T protein:vir:25 312 EDP--EVKAQAFPPAS-VEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQL 388 (501) T ss_pred CCC--CceEEEecccC-hHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 12232233222 244556666666667777899999998766667999998887776666777777777777776 Q ss_pred HHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH Q lcl|NC_020488. 474 GQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV 553 (688) Q Consensus 474 ~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~ 553 (688) +++++. +.+.. .... .+++.|.=.+..+....+..+.+.++. 
T Consensus 389 ~rl~~~----~~~~~---------~~~~-------------------------~~~i~v~w~~~~~~s~~~~ada~~kl~ 430 (501) T protein:vir:25 389 LRLAAE----MDDDP---------DTAA-------------------------DSGAEVLWRDTEARSFGAVVDGITKLA 430 (501) T ss_pred HHHHHH----HhCCC---------cccc-------------------------ceeeeEEecCCCCCCHHHHHHHHHHHH Confidence 666543 32210 0000 112222212222222344455555554 Q ss_pred HhhHHHHHHHHHHHHHh-cCCccHHHHHHHHHhhccccccc-hhhHH-hhh---hhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 554 QAVPAAGGVVLDLIAKN-MDWPGAQDIARRLQKTLPPGILD-QDEME-EAG---IEPPQPSPEQQANMAQAQADMEKAKA 627 (688) Q Consensus 554 q~~~~~~~~~~~~~~e~-~~~~~~~ei~~~~~~~~~~~~~~-~~~~~-~~~---~~~~~~~~~~q~~~~~~q~~~~~~q~ 627 (688) +. .+....++.. .++. ..++. ++++....+... ..... ..+ ....+.+...+.. T Consensus 431 ~~-----gis~et~~~~~~g~~-~~~ie-~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 490 (501) T protein:vir:25 431 SA-----GIPIEHLLSMVPGMT-QQTIQ-AIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQAL------------- 490 (501) T ss_pred hc-----CCCHHHHHHHcCCCC-HHHHH-HHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCcccc------------- Confidence 42 0122223332 2332 22221 111111000000 00000 000 0000000000000 Q ss_pred HHHHHHHHHHHH Q lcl|NC_020488. 628 DTAKAQADMAMA 639 (688) Q Consensus 628 e~~~~q~e~~~~ 639 (688) ..........+ T Consensus 491 -~~~~~~~~~g~ 501 (501) T protein:vir:25 491 -NEGGVNGNGGA 501 (501) T ss_pred -ccccCCCCCCC Confidence 00000000000 No 117 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.61 E-value=1.1e-14 Score=97.02 Aligned_cols=508 Identities=9% Similarity=0.018 Sum_probs=247.4 Q ss_pred CCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHH-HHHHHHhhCCCCCCHHHHHHHHhcCCCceeeh--hHHHHH Q lcl|NC_020488. 
2 LPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDA-AQEDISFLAGEQWPESVRKEREDEGRPCLTLN--KLPQYV 78 (688) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N--~i~~~i 78 (688) ||-++....+..--..+-...+ +......|-. .+.-.+||.|+||+-.. .+ +|+-...+| --+-+| T Consensus 1 m~~~~~q~~p~~~~fp~~~a~w-------V~~~D~~RlaaY~ly~d~y~n~~~el~~--il--~G~dr~~~~~ps~r~~V 69 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNI-------VDENDKNRVRAYDLYENIYLNSAETLKL--VL--RGDDSVPILMPSGRKIV 69 (563) T ss_pred CCccccccCCCccccccccccc-------CCHHHHHHHHHHHHHHHhhcCchhhhhh--hc--CCCceeeeccchHHHHH Confidence 4444432222221111111111 1111122222 34456899999996332 23 343344444 566888 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +++. ........+.|-|.+. |+...+.++.+++..++.++....+..+-.++++.|-|+++|. T Consensus 70 ~~~~-~~Lg~~~~~~Ve~~~~----------------de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~ 132 (563) T protein:vir:74 70 EAVH-RFLGVGFDYLVEPDMG----------------DEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIH 132 (563) T ss_pred HHHH-HhcCCCcEEecCcccc----------------CcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 8855 4445555666655443 4444566899999999999999999999999999999999999 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEE---ecCCHHHHHH-hcCCccchhcccccccccccCC Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFIS---ERMSKAEFNK-RYPGKAVGDLSDAERGEYSWWT 234 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~---~~~~~~e~~~-~~p~~~~~~~~~~~~~~~~~~~ 234 (688) ||..+ ...+.+....| ||.-+|. +..+| .+..+..++ .|-.+++.++ .+.=+. .....+... 
++ T Consensus 133 wDp~K--~~g~R~rv~~v-DP~~~fp---~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r~~~--~~~~lndeg---~~ 200 (563) T protein:vir:74 133 ADPNK--KAGERISVDEV-DPRQIFL---IEDGS-TVVGFHMVDIVQDFRSPDDPSKKLARRRT--FRRVRNDEG---MF 200 (563) T ss_pred ecccc--ccCCCceEeec-CCceeee---ccCCC-CcccceeeecccCCCCCcchhccceeeee--eeeeeCCCC---Cc Confidence 87533 23456777677 5654441 22232 122222122 3333333221 000000 000000000 00 Q ss_pred CCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCc Q lcl|NC_020488. 235 NEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGST 314 (688) Q Consensus 235 ~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~ 314 (688) ...-....|.|.... + |++.......... ..+ .+. .-...-+.+-|-|.+. T Consensus 201 ~~~~~~dae~w~lg~----w----d~r~~~~~~~~~~-----~~~-~~~---------------~~~d~e~~~LP~pi~~ 251 (563) T protein:vir:74 201 TGRISSELTHWTLGN----W----DDRGAISDEQARR-----KEQ-VRS---------------AQHDEEEEELPEPISQ 251 (563) T ss_pred cceeeeccchhcccc----c----cccCccchhhhcc-----cch-hhh---------------hhhhchhhhccccccC Confidence 000011122221100 0 1111000000000 000 000 0000011223445678 Q ss_pred cceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhh-cchHHHHhhcccCCCceeec Q lcl|NC_020488. 315 IPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESI-EGYEEEWNQANRKNQSVLRY 393 (688) Q Consensus 315 ~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~ 393 (688) +||+- + .-++.+++.||.|-...+..+.+++|...|-...++..+.+|.++.+..+- ++-.....-++..+|.++.. 
T Consensus 252 iPiv~-~-~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El 329 (563) T protein:vir:74 252 LPLYR-W-RNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEI 329 (563) T ss_pred ccEEE-c-CCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCceeEec Confidence 88874 3 336778999999999999999999999999999999999988777653321 11111222245567778777 Q ss_pred CcccccccceecCC-CcchHHHHHHHHHHHHHHHHHhCcChHHcCC--CcchhhHHHHHHHHHHHHHHH---H-HHHHHH Q lcl|NC_020488. 394 NAIPGVDRPQRDMP-ASMPAAELQLALSATDEMKATIGLYDASVGA--QGNEQSGKAILARQRQGDRGT---F-AYIDNL 466 (688) Q Consensus 394 ~~~~~~~~~~~~~~-~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~--~~~~~sg~ai~~~~~~~~~~~---~-~~~dn~ 466 (688) .+......+..+.. +++..-..-|=......+.+++|+..+++|. .+...||.|.....+--..+. . .+...+ T Consensus 330 ~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~m 409 (563) T protein:vir:74 330 AGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVM 409 (563) T ss_pred cCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHH Confidence 65544445555544 2222211111122333667889999999994 556789998776543322211 2 255556 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) ++++.+.-.++|.+.+..|-. | ...+|+-+. 
| .-++.-|+|.=+|-.++.+++.+ T Consensus 410 r~~r~~~~~~lL~~~erl~~~-------g--~~~~~~g~~---------------~-~~~~~~v~ivf~p~~P~d~~~vv 464 (563) T protein:vir:74 410 DQFLHDWMTMWLPAYESDFQE-------Q--DGSRPFASA---------------D-LLNECSVVCIFADPMPVNKTQVT 464 (563) T ss_pred HHHHHHHHHHHHHHHHhHhhh-------h--ccccccccc---------------c-cCCceEEEEEeCCCCCccHHHHH Confidence 677777778888777765421 1 111222211 0 11344566777888888888888 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhc---CCcc--HHH---------HHHHHHhhcccccc-----------c-------- Q lcl|NC_020488. 547 DSLMQFVQAVPAAGGVVLDLIAKNM---DWPG--AQD---------IARRLQKTLPPGIL-----------D-------- 593 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~---~~~~--~~e---------i~~~~~~~~~~~~~-----------~-------- 593 (688) +....+.+. .-+.....+.++ +++- ++. |...+-.+...... . T Consensus 465 ~~~~tl~~a----GiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g 540 (563) T protein:vir:74 465 QDTLLLQQA----HLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQG 540 (563) T ss_pred HHHHHHHHc----CchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccC Confidence 877655542 111122221111 3322 222 22211111100000 0 Q ss_pred hhhHHhhhh-------hhhhhhH Q lcl|NC_020488. 594 QDEMEEAGI-------EPPQPSP 609 (688) Q Consensus 594 ~~~~~~~~~-------~~~~~~~ 609 (688) ++-.+.-.+ .+.+..+ T Consensus 541 ~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 541 NPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred CchhHcCCcccCCccccccCCCC Confidence 000000000 0001111 No 118 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.60 E-value=3.5e-14 Score=94.32 Aligned_cols=485 Identities=11% Similarity=0.063 Sum_probs=235.5 Q ss_pred HHHHHHHHHHHHHHHHh-----------------hhHHHHHHHHHHHhhCCCCCCHHHHHH-HHhcCCCceeehhHHHHH Q lcl|NC_020488. 
17 EAILQEIRERAAHAVTC-----------------WKHNFDAAQEDISFLAGEQWPESVRKE-REDEGRPCLTLNKLPQYV 78 (688) Q Consensus 17 ~~~~~~~~~~~~~~~~~-----------------~~~~r~~~~~~~~~~~G~Qw~~~~~~~-~~~~g~p~~~~N~i~~~i 78 (688) -.++.+++..|+.-... ..+.+....++..||.|+.+.-.-... -.-..+.....|+-+.++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 33555555555532211 234445566778899997552110000 000122356678888888 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +.......+-.+.+.+. |.. +++.+..+++.|++.......++.++..|-|+++++ T Consensus 81 ~~~A~lv~~e~~~i~v~--------------------d~~----~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~ 136 (522) T protein:vir:47 81 KKIASLVYNEQATITTK--------------------NEI----LQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPY 136 (522) T ss_pred HHHhhhhcCCcceeecC--------------------ChH----HHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEE Confidence 88888887777776651 333 444555566689999999999999999999999999 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCE Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEG 238 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 238 (688) ++. +.+.+..| ++..|++= .....+...|-++.+........- ..|---..+++................ 
T Consensus 137 ~d~-------~~~~i~~v-~ad~~~P~-~~~~~~~~e~a~~~~~~~~~~~~~-~~yt~lE~he~~~~~~~~~~~~~~~~~ 206 (522) T protein:vir:47 137 IDG-------DKVRVAFI-QAPVFFPL-ESNTQDVSSAAILTKTIKSEGRKN-VYYTLVEFHEWVTADGQETGSTNDKKY 206 (522) T ss_pred EcC-------CceEEEEE-cCCceEEE-EEcCCceEEEEEEEEEEeecccce-eEEEEEEEeeecccccccccccccCCc Confidence 862 35677777 56666631 111111223333332222111000 000000000000000000000000111 Q ss_pred EEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCC-Cccce Q lcl|NC_020488. 239 VRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPG-STIPV 317 (688) Q Consensus 239 v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~-~~~P~ 317 (688) .++...+|+...... -|..+..... ....-|+....+++ .+.+| T Consensus 207 ~~I~n~ly~~~~~~~-----lG~~v~l~~~------------------------------~e~~~l~~~~~~~~~~~Plf 251 (522) T protein:vir:47 207 YRITNELYRSDVNDV-----LGQRVNLSEL------------------------------DKYKNLEPVTVFENLSRPLF 251 (522) T ss_pred eEEEEEEeecCCCcc-----cCcccccccc------------------------------ccccCCCCceEeCCCCcceE Confidence 111111221100000 0111100000 00000111111112 12222 Q ss_pred EEE---eeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHH----hhcccCC--- Q lcl|NC_020488. 318 APV---LGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEW----NQANRKN--- 387 (688) Q Consensus 318 vp~---~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~--- 387 (688) +.| .+. ....++|+|.|++..+++..+.+|...+.+.+.+.+. +.++++++..+....... 
......+ T Consensus 252 ~y~~~~~~N-~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~ 329 (522) T protein:vir:47 252 TYLKTPGMN-NKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMG-QRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQ 329 (522) T ss_pred EEecCCccc-ccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhc-cceeecchHHhccCCCCCCcccccccccCccc Confidence 221 111 1234789999999999999999999999999998764 457788777764321100 0000111 Q ss_pred CceeecCcc-cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc-hhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 388 QSVLRYNAI-PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN-EQSGKAILARQRQGDRGTFAYIDN 465 (688) Q Consensus 388 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~-~~sg~ai~~~~~~~~~~~~~~~dn 465 (688) ..+...+.. .+...+....+.--...+...++.....+....|++....|.++. ..||++|....+..-.....+... T Consensus 330 ~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~ 409 (522) T protein:vir:47 330 NVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVAL 409 (522) T ss_pred ceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 112212211 122345444433233456777888888888889999999987654 468999998888888888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHH Q lcl|NC_020488. 466 LSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEA 545 (688) Q Consensus 466 ~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~ 545 (688) +..+++++.+.++.+...+ +- . +......++|+|+=+.+....+++. 
T Consensus 410 ~~~al~~lv~~i~~l~~~~-~~------~--------------------------~~~~~~~~~i~v~f~D~i~~D~~~~ 456 (522) T protein:vir:47 410 VEQSIKELCVSMCELGKAV-GV------Y--------------------------SGEIPELDDISVNLDDGVFTDRHAE 456 (522) T ss_pred HHHHHHHHHHHHHHHHhhh-hh------c--------------------------cCCCCCcceeEEEcCCCCCCCHHHH Confidence 9999999999988877432 10 0 0000123445555555555555666 Q ss_pred HHHHHHHHHhhHHHHHHHHHH-HHHhcCCcc--HHHHHHHHHhhccccccchh---hHHhhhhhhhhhhH Q lcl|NC_020488. 546 ADSLMQFVQAVPAAGGVVLDL-IAKNMDWPG--AQDIARRLQKTLPPGILDQD---EMEEAGIEPPQPSP 609 (688) Q Consensus 546 ~~~l~~~~q~~~~~~~~~~~~-~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 609 (688) ++.++++... +-+.... +.++-++.- +.+..+++++......+... ....++.+..-..- T Consensus 457 ~~~~~~~v~a----G~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 457 LDYWAKMVAA----GFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHHHHHHhc----CCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 6666655432 1111222 233334322 33344444433222111100 00000000000000 No 119 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.57 E-value=1.4e-13 Score=91.11 Aligned_cols=493 Identities=11% Similarity=0.024 Sum_probs=235.3 Q ss_pred CCCCC-------CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHH-HHHHHHhhCCC--CCCHHHHHHHHhcCCCceee Q lcl|NC_020488. 2 LPGNE-------PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDA-AQEDISFLAGE--QWPESVRKEREDEGRPCLTL 71 (688) Q Consensus 2 ~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~G~--Qw~~~~~~~~~~~g~p~~~~ 71 (688) ||-++ |....+..-.+- +....+.|-. .+.-.+||.|+ +|.. 
.+..-.++++-++.+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~------------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~-~lrg~~~~~~r~~~~ 67 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNA------------VTDFDKARLASYRLYEDMYLTNTSDYQV-ILRGGDEGDQRPIYV 67 (527) T ss_pred CCccccccCCCcCcCCccccCccc------------CCHHHHHHHHHHHHHHHHhcCchhheee-ecCCccccccceeee Confidence 33332 111111111000 1112222222 34456789886 7742 222222344555655 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) +-. .+++|... .+.+-+-+. .+...++-...+++..++.++....+..+-.++++.| T Consensus 68 ps~----~~~~~~~~----~~~~~g~~~---------------~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlG 124 (527) T protein:vir:10 68 PNG----EKLIEAKM----RFLGQGLKW---------------EFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRG 124 (527) T ss_pred hhh----HHhhCCcc----eeeccCccc---------------cccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhc Confidence 433 44444332 233322221 1233455567778888899999999999999999999 Q ss_pred CceEEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEe----cCCHHHHHHhcC-Cccchhccccc Q lcl|NC_020488. 152 FGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISE----RMSKAEFNKRYP-GKAVGDLSDAE 226 (688) Q Consensus 152 ~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~----~~~~~e~~~~~p-~~~~~~~~~~~ 226 (688) -|++++.||..+. 
-.+.+....| ||.-+| |- .++| +.+++..++ |-.+++-++-+- -+--......+ T Consensus 125 Dg~f~l~wD~~k~--~~~R~~v~~~-DP~~~f--~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~ 196 (527) T protein:vir:10 125 DYVLLLIGDDEKD--EGSRLSLHEV-DPSTYF--PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLD 196 (527) T ss_pred ceeEEEeeccCCC--cCCCceEeec-Ccceee--ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhhcC Confidence 9999999875332 2345666555 776444 21 3333 566665554 333333222110 00000000000 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) ++.........+.....|.-. .+ .+.....++ ..-. + ...++.++ . T Consensus 197 -~~g~~~~~G~~~yt~~~w~lg------------~w--~d~~e~p~~---~~~~-----------~---~~~~~~~l--~ 242 (527) T protein:vir:10 197 -DDGKPVPGGAIKYTEELYEPG------------KW--DDRPESPLE---PDDI-----------K---KLSTLTEE--E 242 (527) T ss_pred -cccccccCcceeeeeceeecc------------cc--ccccccccc---hhhh-----------h---hhcCceee--e Confidence 000001112222222233211 00 000000000 0000 0 00112222 2 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..|.|.+.+|+|.| . -.+.++..+|.|-+.+++++++.+|+.+|-...++..+++|.+....-...+......-+... 
T Consensus 243 ~lp~pi~fiPvV~~-~-t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~Vg 320 (527) T protein:vir:10 243 PLPEQITTLPVFHF-R-GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTIS 320 (527) T ss_pred cccCCCCccceEee-c-CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccC Confidence 34556688888876 3 356788999999999999999999999999999999998887666433222211111222344 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCC--CcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGA--QGNEQSGKAILARQRQGDRGTFAYID 464 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~--~~~~~sg~ai~~~~~~~~~~~~~~~d 464 (688) +|.+|.... ..++..+.....-..+...+..+...|.+++|+..+++|. .++..||.|+....+.-..+....- T Consensus 321 PG~iweL~e---~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~- 396 (527) T protein:vir:10 321 PLGMVEHGQ---NNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQE- 396 (527) T ss_pred CceeEecCC---CcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHH- Confidence 666765543 2455555554444556777788888999999999999994 3567799988766543321111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHH Q lcl|NC_020488. 465 NLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRME 544 (688) Q Consensus 465 n~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~ 544 (688) -+-.++.+.|.. +++..+... 
.+.++ ..|+ ...+++.|.=++-.++.+.+ T Consensus 397 L~~~~vqrq~~~--~~~~~~L~a---ye~v~------------------------~~d~-~~~~~v~ivf~p~lP~D~~a 446 (527) T protein:vir:10 397 LELKSVLKQFFY--NLVTQWLPA---YEGVG------------------------IDDA-DKKLTVTITFRDPKPVNSEK 446 (527) T ss_pred HHHHHHHHHhhh--hhHHHHHHH---hhhcc------------------------cCCC-ccccceEEEecccCCCCHHH Confidence 001111111110 011111100 00011 1111 12356777788888999998 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccch-----------------hhHHhhhhhhhhh Q lcl|NC_020488. 545 AADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQ-----------------DEMEEAGIEPPQP 607 (688) Q Consensus 545 ~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~ 607 (688) ..+.+..+.+.---.-..++..+.+.+.+...+.=++++.....++.... ...+..++-.+++ T Consensus 447 vie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~ 526 (527) T protein:vir:10 447 RFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQP 526 (527) T ss_pred HHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCC Confidence 88888776653111112223333334444444333222221111100000 0000000001111 Q ss_pred h Q lcl|NC_020488. 608 S 608 (688) Q Consensus 608 ~ 608 (688) - T Consensus 527 ~ 527 (527) T protein:vir:10 527 L 527 (527) T ss_pred C Confidence 1 No 120 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.57 E-value=1.4e-13 Score=91.00 Aligned_cols=493 Identities=11% Similarity=0.024 Sum_probs=235.2 Q ss_pred CCCCC-------CcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHH-HHHHHHhhCCC--CCCHHHHHHHHhcCCCceee Q lcl|NC_020488. 2 LPGNE-------PIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDA-AQEDISFLAGE--QWPESVRKEREDEGRPCLTL 71 (688) Q Consensus 2 ~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~G~--Qw~~~~~~~~~~~g~p~~~~ 71 (688) ||-++ |....+..-.+- +....+.|-. .+.-.+||.|+ +|.. 
.+..-.++++-++.+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~------------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~-~lrg~~~~~~r~~~~ 67 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNA------------VTDFDKARLASYRLYEDMYLTNTSDYQV-ILRGGDEGDQRPIYV 67 (527) T ss_pred CCccccccCCCcCcCCccccCccc------------CCHHHHHHHHHHHHHHHHhcCchhheee-ecCCccccccceeee Confidence 33332 111111111000 1112222222 34456789886 7742 222222344555655 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcC Q lcl|NC_020488. 72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGG 151 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G 151 (688) +-. .+++|... .+.+-+-+. .+...++-...+++..++.++....+..+-.++++.| T Consensus 68 ps~----~~~~~~~~----~~~~~g~~~---------------~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlG 124 (527) T protein:vir:10 68 PNG----EKLIEAKM----RFLGQGLKW---------------EFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRG 124 (527) T ss_pred hhh----HHhhCCcc----eeeccCccc---------------cccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhc Confidence 433 44444332 233322221 1233455567778888899999999999999999999 Q ss_pred CceEEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEe----cCCHHHHHHhcC-Cccchhccccc Q lcl|NC_020488. 152 FGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISE----RMSKAEFNKRYP-GKAVGDLSDAE 226 (688) Q Consensus 152 ~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~----~~~~~e~~~~~p-~~~~~~~~~~~ 226 (688) -|+++|.||..+. 
-.+.+....| ||.-+| |- .++| +.+++..++ |-.+++-++-+- -+--......+ T Consensus 125 Dg~f~l~wD~~k~--~~~R~~v~~~-DP~~~f--~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~ 196 (527) T protein:vir:10 125 DYVLLLIGDDEKD--EGSRLSLHEV-DPSTYF--PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLD 196 (527) T ss_pred ceeEEEeeccCCC--cCCCceEeec-Ccceee--ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhhcC Confidence 9999999875332 2345666555 776444 21 3333 566665554 333333222110 00000000000 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) ++.........+.....|.-. .+ .+.....++ ..-. + ...++.++ . T Consensus 197 -~~g~~~~~G~~~yt~~~w~lg------------~w--~d~~e~p~~---~~~~-----------~---~~~~~~~l--~ 242 (527) T protein:vir:10 197 -DDGKPVPGGAIKYTEELYEPG------------KW--DDRPESPLE---PDDI-----------K---KLSTLTEE--E 242 (527) T ss_pred -cccccccCcceeeeeceeecc------------cc--ccccccccc---hhhh-----------h---hhcCceee--e Confidence 000001112222222233211 00 000000000 0000 0 00112222 2 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..|.|.+.+|+|.| . -.+.++..+|.|-+.+++++++.+|+.+|-...++..+++|.+....-...+......-+... 
T Consensus 243 ~lp~pi~fiPvV~~-~-t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~Vg 320 (527) T protein:vir:10 243 PLPEQITTLPVFHF-R-GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTIS 320 (527) T ss_pred cccCCCCccceEee-c-CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccC Confidence 34556688888876 3 356788999999999999999999999999999999998887666433222211111223344 Q ss_pred CCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCC--CcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGA--QGNEQSGKAILARQRQGDRGTFAYID 464 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~--~~~~~sg~ai~~~~~~~~~~~~~~~d 464 (688) +|.+|.... ..++..+.....-..+...++.+...|.+++|+..+++|. .++..||.|+....+.-..+....- T Consensus 321 PG~iweL~e---~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~- 396 (527) T protein:vir:10 321 PLGMVEHGQ---NNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQE- 396 (527) T ss_pred CceeEecCC---CcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHH- Confidence 666765543 2455555554444556777888888999999999999994 3567799988766543321111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHH Q lcl|NC_020488. 465 NLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRME 544 (688) Q Consensus 465 n~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~ 544 (688) -+-.++.+.|.. +++..+... 
.+.++ ..|+ ...+++.|.=++-.++.+.+ T Consensus 397 L~~~~Vqrq~~~--~~~~~~L~a---ye~v~------------------------~~d~-~~~~~v~ivf~p~lP~D~~a 446 (527) T protein:vir:10 397 LELKSVLKQFFY--NLVTQWLPA---YEGVG------------------------IDDA-DKKLTVTITFRDPKPVNNEK 446 (527) T ss_pred HHHHHHHHHhhh--hhHHHHHHH---hhhcc------------------------cCCC-ccccceEEEecccCCCCHHH Confidence 001111111110 011111100 00011 1111 12356777788888999998 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccch-----------------hhHHhhhhhhhhh Q lcl|NC_020488. 545 AADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQ-----------------DEMEEAGIEPPQP 607 (688) Q Consensus 545 ~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~ 607 (688) ..+.+..+.+.---.-..++..+.+.+.+...+.=++++.....++.... ...+..++-.+++ T Consensus 447 vie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~ 526 (527) T protein:vir:10 447 RFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQP 526 (527) T ss_pred HHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCC Confidence 88888776653111112223333333444444322222221111100000 0000000011111 Q ss_pred h Q lcl|NC_020488. 608 S 608 (688) Q Consensus 608 ~ 608 (688) - T Consensus 527 ~ 527 (527) T protein:vir:10 527 L 527 (527) T ss_pred C Confidence 1 No 121 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.53 E-value=3.9e-14 Score=94.11 Aligned_cols=422 Identities=10% Similarity=-0.014 Sum_probs=190.7 Q ss_pred hhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHH Q lcl|NC_020488. 47 FLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESL 126 (688) Q Consensus 47 ~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~ 126 (688) |. +.......+. 
.+-.++.|..+-+|+...+...-+- +.+ | |.+. +.. T Consensus 1 ~l-----~~~~~~~~~~-~~~~~v~n~~~~ivd~~~~~l~~~g--f~~-~-------------------d~~~----~~~ 48 (434) T protein:vir:98 1 ML-----PKNAEQAFLD-FQRKARTNFCGLIANASVHRLLALG--VTG-P-------------------DGEP----DTR 48 (434) T ss_pred CC-----CCCccHHHHH-hhhhhhccchHHHHHHHHhhhccCc--eec-C-------------------CCch----HHH Confidence 22 1111122221 1112467999999998887553221 110 1 1111 122 Q ss_pred HHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccC-CCCCcceeEEEecccceE--EeCCcccccccccCceEEEEe Q lcl|NC_020488. 127 IRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTD-DAFDLDLCIKSIHNRFAV--LMDPDATEPDYSDANWCFISE 203 (688) Q Consensus 127 i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~-~~~~~~~~~~~v~~~~~v--~~Dp~a~~~d~~Da~~~~~~~ 203 (688) +..+++.|+++.....+..++++.|.||+.|+.+.+.. ...+..+.|..+ +|..+ +|||.... + .+.+ +. T Consensus 49 ~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~-~p~~~~~i~D~~~~~--~---~~ai-~~ 121 (434) T protein:vir:98 49 ASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITME-HPSECIVEYDPETGE--P---LVGL-KV 121 (434) T ss_pred HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEe-ccceeEEEEeCCCCc--e---EEEE-EE Confidence 34467789999999999999999999998887643221 122344555544 77764 57775432 1 1222 22 Q ss_pred cCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeec-ceeeeeccCCceecccccchHHHHHHHhhhhh Q lcl|NC_020488. 204 RMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPV-TRKLLLLSDGRTVWEDEVKDVLDELRDLGTTV 282 (688) Q Consensus 204 ~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~ 282 (688) |-... +..... .+|+.... ..+......+.+.+... 
T Consensus 122 ~~~~~---------------------------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 158 (434) T protein:vir:98 122 WHNDI---------------------------DGFGYA-RVFFDDTSFPYRTRERTGARLPWGPD--------------- 158 (434) T ss_pred EEecc---------------------------CCceEE-EEEEeCcEEEEEEeeccccccccccc--------------- Confidence 21000 000000 11111000 00000000000000000 Q ss_pred hheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020488. 283 TRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAP 362 (688) Q Consensus 283 ~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~ 362 (688) .|.. ........|-+.+..|+|||+-.. ..+. +|.|-++.++++++.+|+.+|.+.......+ T Consensus 159 -----------~~~~---~~~~~~~~~h~~g~vPvv~f~N~~--~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a 221 (434) T protein:vir:98 159 -----------SWVY---TGTADSGDVHDLGGMQLVEFARMP--DLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSG 221 (434) T ss_pred -----------ccee---cccccccccCCCCccceEEeccCC--CcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhc Confidence 0000 001122233345677777764432 1222 5889999999999999999999999888777 Q ss_pred CCceeechhhhcch-H------HHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHH Q lcl|NC_020488. 363 KAPWVAPAESIEGY-E------EEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDAS 435 (688) Q Consensus 363 ~~~~~~~~~~i~~~-~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~ 435 (688) .+..++........ + ..+.......+.++...+ +...+...+.. ....+...+......+-.+|++++.. 
T Consensus 222 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~ 298 (434) T protein:vir:98 222 FRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEG--ENTQFGQLDAT-DLSGFLKEHASDVRDMLTISQTPTYL 298 (434) T ss_pred chhhhhcCCCcccccccccccchhhhhhhccccccccCCC--CCceEEEecCc-chHHHHHHHHHHHHHHhcccCCCHHH Confidence 76554432111110 0 011111111222332221 11122222222 23445555666666666778999999 Q ss_pred cCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhccccc Q lcl|NC_020488. 436 VGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQ 515 (688) Q Consensus 436 ~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~ 515 (688) .|...++.||.|+..+...-........+.|..++++++++++.+. |. ..++ T Consensus 299 ~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-------------g~--~~~~------------- 350 (434) T protein:vir:98 299 YATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA-------------GV--PEDY------------- 350 (434) T ss_pred hccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------CC--Chhh------------- Confidence 9976677899999988777777777777777778877777665431 11 0011 Q ss_pred ceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccc-h Q lcl|NC_020488. 516 KPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILD-Q 594 (688) Q Consensus 516 ~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~-~ 594 (688) +++.|.=.+..+....+..+.+.++.+.. +....+++++++.. +++.+ +.+....+... . T Consensus 351 ------------~~~~v~w~~~~~~s~~~~ada~~kl~~~g-----~~~e~~~~~lg~~~-~e~~r-~~~e~~~~~~~~~ 411 (434) T protein:vir:98 351 ------------TEAEVRWANPAHVTMAVKADAATKLKSIG-----YPLDVIAEELDESP-ARVRR-IVAGAASQALLAA 411 (434) T ss_pred ------------eeeeEEecCCCCCCHHHHHHHHHHHHhcC-----CcHHHHHHhCCCCH-HHHHH-HHHHHHHHHHHHH Confidence 11122111122222334445555554421 12234556666542 23322 21110000000 0 Q ss_pred hhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
595 DEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQA 641 (688) Q Consensus 595 ~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~ 641 (688) ....++........+..... ..- T Consensus 412 ~~~~~~~~~~~g~~~~~~~~------------------------~dg 434 (434) T protein:vir:98 412 SLLPAPGAPSAGNVPDSGGA------------------------VDG 434 (434) T ss_pred hhhccCCCCCCCCCCcccCC------------------------CCC Confidence 00000000000000000000 000 No 122 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.87 E-value=2.3e-08 Score=62.44 Aligned_cols=605 Identities=11% Similarity=0.031 Sum_probs=168.0 Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH--HHHHHHhCCcceE Q lcl|NC_020488. 16 QEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ--VLGDQRQNRPAIQ 93 (688) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~--i~g~~~~~r~~~~ 93 (688) +.+.+.+++.++..-+....++..+|++... ++.....-.| |...+.+.+ -++...+.+|-+. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~---------~d~~f~~~~G------~QW~~~~~~~~~~~l~~~~~P~~~ 65 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCL---------EATRFARVPG------GQWEGATAAGSELGKHFEKYPKFE 65 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHH---------HHHhhhccCC------CCCCHHHHHHHHHHHhhCCCCeEE Confidence 7777777776666555555566655554222 1111111012 233333333 2334555666432 Q ss_pred EEeCCc---c----ccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCC Q lcl|NC_020488. 94 VHPVEA---N----ATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDA 166 (688) Q Consensus 94 v~pr~~---~----~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~ 166 (688) | ++-. + ....+.... .+.|.+.+.-+.+.+++..++..--.......++.++..+++.+=--+++...+.. 
T Consensus 66 ~-N~i~~~v~~v~g~~~~nr~d~-~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~ 143 (720) T protein:vir:35 66 I-NKISTELNRIISEYRHNRITV-KFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLV 143 (720) T ss_pred E-ccHHHHHHHHHhHHHhCCCce-EEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeeccc Confidence 2 1110 0 000001111 12233444335666677776655544555556666666666543122222211111 Q ss_pred CCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhc------CCccchhcccccccccccCCCCCEEE Q lcl|NC_020488. 167 FDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRY------PGKAVGDLSDAERGEYSWWTNEEGVR 240 (688) Q Consensus 167 ~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~------p~~~~~~~~~~~~~~~~~~~~~~~v~ 240 (688) .+.++.. .+..+.+.|- ..++...-|-...+..+.++.+-.| .+.....++. .....+.+ T Consensus 144 ~~~d~~~----~~~~i~i~~v--~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~---~a~~~~~~----- 209 (720) T protein:vir:35 144 NALDPMD----ERQRICLEPI--YDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNK---DPATLMSG----- 209 (720) T ss_pred ccCCCCc----ccceeeEecc--cCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCC---cccccccc----- Confidence 1111100 0011111110 0001111111122233333332111 0001111111 00000100 Q ss_pred EEEEEeeeecceeeeeccCCceecccc-----c-ch----H----HHHHHHhhhhhhheee--eeEEEEEEEEEchhhhc Q lcl|NC_020488. 241 VSEYFYREPVTRKLLLLSDGRTVWEDE-----V-KD----V----LDELRDLGTTVTRERR--VKTYKVKWMKVTAYDVL 304 (688) Q Consensus 241 v~e~~~~~~~~~~~~~~~~g~~~~~~~-----~-~~----~----~~~~~~~g~~~~~~~~--~~~~~v~~~~~~~~~il 304 (688) ....|+.++.....+...+..+..... . .+ . -+.............. ..++.|... 
...-.++ T Consensus 210 ~~~~~~~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~-~v~~~~~ 288 (720) T protein:vir:35 210 IERSWDYDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRR-RVYVSVV 288 (720) T ss_pred ccccccccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEE-EEEEEee Confidence 001111111111111111110000000 0 00 0 0000000000000000 000000000 0000000 Q ss_pred ccCCCCCC-Cc--cceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh Q lcl|NC_020488. 305 EGPVDWPG-ST--IPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN 381 (688) Q Consensus 305 e~~~p~~~-~~--~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 381 (688) .+..-..+ +. +.++||+|++-.... ..|......++..=+-.=.+.+.. ++. ... T Consensus 289 ~g~~~l~~~~~~p~~~fP~vP~~g~r~~-~d~~~~~~G~vr~~kd~Q~~~N~~---~s~------------------~~~ 346 (720) T protein:vir:35 289 DGEGFLEKAQRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDAQRLYNLQ---VSM------------------LAD 346 (720) T ss_pred ccchhcccCCCCCCCccceEEEEeeeec-cCCCcccceeeecchhHHHHHHHH---HHH------------------HHH Confidence 00000000 11 222344433211100 011111111111111111111110 000 000 Q ss_pred hcccCCCceeecCccc------------ccccceecCC--------------Cc-chHHHHHHHHHHHHHHHHHhCcChH Q lcl|NC_020488. 382 QANRKNQSVLRYNAIP------------GVDRPQRDMP--------------AS-MPAAELQLALSATDEMKATIGLYDA 434 (688) Q Consensus 382 ~~~~~~~~~~~~~~~~------------~~~~~~~~~~--------------~~-~~~~~~~ll~~~~~~~~~~tGv~d~ 434 (688) ..+ ..+.+....+.. +...+.+++. .. ......++.+...++++.....-.. 
T Consensus 347 ~~~-~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~ 425 (720) T protein:vir:35 347 SAT-QDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQE 425 (720) T ss_pred HHH-cCCccccccCcchHHHHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHH Confidence 000 111111111100 0111111100 00 0000123333333433332222222 Q ss_pred HcCCCcc---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhc Q lcl|NC_020488. 435 SVGAQGN---EQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMD 511 (688) Q Consensus 435 ~~G~~~~---~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~ 511 (688) ..|.... ..|+++-.+....-.......+. |-..++.-.+.+.+++..+.. .+. ..++.+.|-... T Consensus 426 vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~lI~-----~~y---~~er~~RI~~ed-- 494 (720) T protein:vir:35 426 VTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFI-YLDNMAKSLKRAGEVWLSMAR-----EVY---GSDRQVRIVNAD-- 494 (720) T ss_pred HhCCChHHcCcccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC-- Confidence 3443321 23432111111111111111111 112233333344444443321 111 223444442211 Q ss_pred ccccceeeec----cceee----eEEEEE---ecccCcHHHHHHHHH-HHHHHHHhhHHHHHHH--HHHHHHh-cCCccH Q lcl|NC_020488. 512 EETQKPVLVN----DIAAG----KFDVTV---KAGPSYQTQRMEAAD-SLMQFVQAVPAAGGVV--LDLIAKN-MDWPGA 576 (688) Q Consensus 512 ~~~~~~~~~n----di~~~----~~dv~v---~~~~~~~s~r~~~~~-~l~~~~q~~~~~~~~~--~~~~~e~-~~~~~~ 576 (688) .+...+.+| |-..| .-|+++ ++..+.........+ .+..+.+.++.+.+.. ...++.. +..-.. 
T Consensus 495 -~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~ 573 (720) T protein:vir:35 495 -GTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEG 573 (720) T ss_pred -CCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCc Confidence 011122222 22222 235543 333333333333333 3333333332222211 0001100 000000 Q ss_pred HHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 577 QDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQA 656 (688) Q Consensus 577 ~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~ 656 (688) ... +.+.+......+.+......+++..+..++++++.++.+++++++++++.++|++.++++++....+++..+.+.. T Consensus 574 p~~-~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~ 652 (720) T protein:vir:35 574 EGL-DEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTE 652 (720) T ss_pred hhH-HHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 0111111011111111111122222222333334444555566666666666666666666666666665555555 Q ss_pred HHHHHHHHHHHHHHHH--HHHHHH------HHHHHhhcCC Q lcl|NC_020488. 657 AMMAGPGSLEETVRNL--VAEAMA------ELMAQSQGNA 688 (688) Q Consensus 657 a~~~~~~~~~~~~~~~--~~~a~~------~~~~~~q~~~ 688 (688) +++++.++.+....+. +...+. 
+.+++.|+.+ T Consensus 653 a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~ 692 (720) T protein:vir:35 653 ARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDA 692 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchH Confidence 5555444433222222 111111 1123455555 No 123 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.55 E-value=3e-07 Score=56.32 Aligned_cols=618 Identities=11% Similarity=0.007 Sum_probs=184.8 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-HHHHhcCCCceeehhHHHHHHHHHHHHHhCCcceEEEeCC Q lcl|NC_020488. 20 LQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-KEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVE 98 (688) Q Consensus 20 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~ 98 (688) ..+....+..+..+........ ..|..+-. ...-..|. ...+.+..++ ..+.||.++..+.. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~---------~~~r~~a~~d~~fy~G~------Qw~~~~~~~l--~~q~rp~~N~i~~~ 63 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTAS---------DEARREAKNDLFFSRIS------QWDDWLSQYT--TLQYRGQFDVVRPV 63 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhh---------HHHHHHHHHHHHhhcCC------CCCHHHHHHH--HhcCCCcccchHHH Confidence 1111222222222111111100 11111111 11112343 2222333332 22445533332211 Q ss_pred ccc---cccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEE Q lcl|NC_020488. 99 ANA---TKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKS 175 (688) Q Consensus 99 ~~~---~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~ 175 (688) -+. .......--.+.|.|. .-+.+.+++..++..--...-...++.++..+++.+ -++|- +++... 
T Consensus 64 i~~v~g~e~~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~-G~G~~---------ev~~d~ 132 (725) T protein:vir:92 64 VRKLVSEMRQNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIES-GVGAW---------RLVTDY 132 (725) T ss_pred HHHHHhhHHhCCcceEEecCCc-cHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhc-Cccee---------eeeecc Confidence 100 0000000111223444 334566666666555444555556666666666542 11110 011000 Q ss_pred -ecccce--EEeCCcccccccccCceEEEEecCCHHHHHHh----c-CCc-cchhcccccccccccCCCCCEEEEEEEEe Q lcl|NC_020488. 176 -IHNRFA--VLMDPDATEPDYSDANWCFISERMSKAEFNKR----Y-PGK-AVGDLSDAERGEYSWWTNEEGVRVSEYFY 246 (688) Q Consensus 176 -v~~~~~--v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~----~-p~~-~~~~~~~~~~~~~~~~~~~~~v~v~e~~~ 246 (688) -.||++ +.+......-++.+.-|-...+..+.++.+-. + +.. .....+..... ...|..... ..-|+ T Consensus 133 ~~~d~~~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~~~---~~~~~ 208 (725) T protein:vir:92 133 EDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLD-ADDIPSFQN---PNDWV 208 (725) T ss_pred cCCCCCCCceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcc-hhhhhhccc---CCccc Confidence 011211 10000000001111222222233333333311 1 110 00111111100 001110000 00111 Q ss_pred eeecceeeeeccCCc-------eec--ccccc-hH----HHHHHHhhhhhhheee--eeEEEEEEEEEchhhhcccCCCC Q lcl|NC_020488. 247 REPVTRKLLLLSDGR-------TVW--EDEVK-DV----LDELRDLGTTVTRERR--VKTYKVKWMKVTAYDVLEGPVDW 310 (688) Q Consensus 247 ~~~~~~~~~~~~~g~-------~~~--~~~~~-~~----~~~~~~~g~~~~~~~~--~~~~~v~~~~~~~~~ile~~~p~ 310 (688) ..+.+...+...... .+. .+... +. ...+............ +..+++....+--.. 
+-+..-+ T Consensus 209 ~~~~~~d~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~-~~g~~~l 287 (725) T protein:vir:92 209 FPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVL 287 (725) T ss_pred ccccCCCeEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeee-ecchhhh Confidence 111111111111100 000 00000 00 0000000000000000 111111111100000 0111111 Q ss_pred CC-Cc--cceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHH-HHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 311 PG-ST--IPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAAT-ERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 311 ~~-~~--~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) .. .. +.++||+|++-..... .|......++..=+-.=.+.+... -.+...+..+-....+....++++...+.++ T Consensus 288 ~~~~~~~~~~~P~vP~~g~r~~~-~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 366 (725) T protein:vir:92 288 KDKQLIAGEHIPIVPVFGEWGFV-EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGN 366 (725) T ss_pred cCCCCCCCCceeeEEEEeeeecc-CCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhcc Confidence 11 12 2347887765432221 222222244433333333333322 2223333333333333444444444433210 Q ss_pred CCceeecCcccccccc---------eecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAIPGVDRP---------QRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDR 457 (688) Q Consensus 387 ~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~ 457 (688) ... +....+.+ ..+...+.++-....++........+.=++...-...|..+++++-.+..+.-.. 
T Consensus 367 --~~~---~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~q 441 (725) T protein:vir:92 367 --DDY---PYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) T ss_pred --Ccc---ceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHH Confidence 010 00000000 0111111222222344444444444333222222222333333222222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccc----eeeeE----E Q lcl|NC_020488. 458 GTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDI----AAGKF----D 529 (688) Q Consensus 458 ~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi----~~~~~----d 529 (688) ....+.. |-..++.-.+.+.+++..+.. .+. ..++.+.|-... .+...+.+|.- ..|+. | T Consensus 442 g~~~l~~-~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~~r~~RI~~ed---g~~~~v~in~~~~~~~~G~~~~~Nd 509 (725) T protein:vir:92 442 ADLETYV-FQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLED---GSEKEVQLMAEVVDLATGERQVLND 509 (725) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----Hhc---CCCcEEEEecCC---CCcceEEeccccccccccchhhhhc Confidence 2222222 222333333444444444321 111 122334442111 01122333321 11221 2 Q ss_pred EE--EecccCcHHHHHHHHH-HHHHHHHhhHHHHHHHHH---HHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhh Q lcl|NC_020488. 530 VT--VKAGPSYQTQRMEAAD-SLMQFVQAVPAAGGVVLD---LIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIE 603 (688) Q Consensus 530 v~--v~~~~~~~s~r~~~~~-~l~~~~q~~~~~~~~~~~---~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~ 603 (688) ++ .++..+.+.......+ .+..+.+.++.+.+.... 
.+...++++......+.......+..+.....+..+.+ T Consensus 510 i~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~ 589 (725) T protein:vir:92 510 IRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEE 589 (725) T ss_pred cccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhh Confidence 21 2222222222222222 222333332222221111 12223333333333333322222222221222222322 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHH Q lcl|NC_020488. 604 PPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSL---------EETVRNLVA 674 (688) Q Consensus 604 ~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~---------~~~~~~~~~ 674 (688) ++...+.++++.+++++++.++++..+++++++++++++..+.+++.+..+.+++.++.... .+...+..+ T Consensus 590 ~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~ 669 (725) T protein:vir:92 590 QQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) T ss_pred hHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 33334444555666666666777777777777777777776666655555555544433222 222222234 Q ss_pred HHHHHHHHHhhcCC Q lcl|NC_020488. 675 EAMAELMAQSQGNA 688 (688) Q Consensus 675 ~a~~~~~~~~q~~~ 688 (688) +..++...+.+++| T Consensus 670 ~~~~~~q~~~~~~a 683 (725) T protein:vir:92 670 KTVASFQQDRSEDA 683 (725) T ss_pred HHHHHHHHHHHHHH Confidence 45555566666666 No 124 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.51 E-value=3.9e-07 Score=55.69 Aligned_cols=619 Identities=11% Similarity=0.002 Sum_probs=190.3 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-HHHHhcCCCceeehhHHHHHHHHHHHHHhCCcceEEEeCC Q lcl|NC_020488. 
20 LQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-KEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVE 98 (688) Q Consensus 20 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~ 98 (688) ..+.+..+..+..+........ ..|..+-. ...-..|. ...+.+..++ ..+.||.++..... T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~---------~~~r~~a~~d~~fy~G~------Qw~~~~~~~l--~~q~rp~~N~i~~~ 63 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTAS---------DEARREAKNDLFFSRVS------QWDDWLSQYT--TLQYRGQFDVVRPV 63 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhh---------HHHHHHHHHHHHhhCCC------CCCHHHHHHH--HhcCCCccccHHHH Confidence 2222222222222211111100 01111111 11112343 2223333332 22445533222111 Q ss_pred cc---ccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEE Q lcl|NC_020488. 99 AN---ATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKS 175 (688) Q Consensus 99 ~~---~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~ 175 (688) -+ +.......--.+.|.|+ .-+.+.+++..++..--...-+..++.++..+++.+ -++|- +++... T Consensus 64 i~~v~g~~~~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~-G~G~~---------ev~~d~ 132 (725) T protein:vir:77 64 VRKLVSEMRQNPIDVLYRPKDG-ARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEA-GVGAW---------RLVTDY 132 (725) T ss_pred HHHHHhhHHhCCcceEEecCCc-cHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhc-Cccee---------eeeecc Confidence 00 00000000111223444 333456666666555444444555556666555532 11110 011000 Q ss_pred -ecccce--EEeCCcccccccccCceEEEEecCCHHHHHHhc-----CC-ccchhcccccccccccCCCCCEEEEEEEEe Q lcl|NC_020488. 176 -IHNRFA--VLMDPDATEPDYSDANWCFISERMSKAEFNKRY-----PG-KAVGDLSDAERGEYSWWTNEEGVRVSEYFY 246 (688) Q Consensus 176 -v~~~~~--v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~-----p~-~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~ 246 (688) -.|+++ +.+-..+...|+.+.-|-...+..+.++.+-.| +. ......+.... ....|..... 
...|+ T Consensus 133 ~~~d~~~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~-~~~~~~~~~~---~~~~~ 208 (725) T protein:vir:77 133 EDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDL-DADDIPSFQN---PNDWV 208 (725) T ss_pred cCCCCCCCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCc-chhhcccccc---ccccc Confidence 001111 000000000011111121222223333332211 10 00011111100 0001110000 00111 Q ss_pred eeecceeeeeccCCce-------ec--ccccc-hH----HHHHHHhhhhhhheee--eeEEEEEEEEEchhhhcccCCCC Q lcl|NC_020488. 247 REPVTRKLLLLSDGRT-------VW--EDEVK-DV----LDELRDLGTTVTRERR--VKTYKVKWMKVTAYDVLEGPVDW 310 (688) Q Consensus 247 ~~~~~~~~~~~~~g~~-------~~--~~~~~-~~----~~~~~~~g~~~~~~~~--~~~~~v~~~~~~~~~ile~~~p~ 310 (688) ..+.+...+....... +. .+... +. ...+............ +..+++..+.+.-.. +-+...+ T Consensus 209 ~~~~~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~-~~g~~~l 287 (725) T protein:vir:77 209 FPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVL 287 (725) T ss_pred ccccCCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEee-ecCceee Confidence 1111111111100000 00 00000 00 0000000000000000 000011000000000 0011111 Q ss_pred CC-Ccc--ceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHH-HHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 311 PG-STI--PVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAAT-ERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 311 ~~-~~~--P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) .. ..+ .++||+|++-.... ..|......++..=+-.-.+.+... 
-.+...+..+.....+....++.....+..+ T Consensus 288 ~~~~~~~~~~~P~vP~~g~r~~-~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 366 (725) T protein:vir:77 288 KDKQLIAGEHIPIVPVFGEWGF-VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGN 366 (725) T ss_pred ccCCcCCCCccceEEEeeeeec-cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhc Confidence 11 112 24677765432211 1222223344443333334444433 3333444445445555566677777777777 Q ss_pred CCceeecCcc---ccccc-ceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCc----chhhHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAI---PGVDR-PQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQG----NEQSGKAILARQRQGDRG 458 (688) Q Consensus 387 ~~~~~~~~~~---~~~~~-~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~----~~~sg~ai~~~~~~~~~~ 458 (688) ++..+..... .++.. ...+...+.++-.-..++........+ ....|... ..+++++-.+....-... T Consensus 367 ~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i----~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg 442 (725) T protein:vir:77 367 DDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAV----KEVATLGVDTEAVNGGQVAFDTVNQLNMRA 442 (725) T ss_pred cCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHH----HHHhCCCHHHhCCCchhhHHHHHHHHHHHH Confidence 6655442211 11111 112222233322233444444444443 33445432 222322111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccce----eeeE----EE Q lcl|NC_020488. 459 TFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIA----AGKF----DV 530 (688) Q Consensus 459 ~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~----~~~~----dv 530 (688) ....+ .|-..++.-.+.+.+++..+.. .+. ..++.+.|-... .+-..+++|.-. .|.. 
|+ T Consensus 443 ~~~~~-~~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~~rv~RI~~ed---~~~~~v~in~~~~~~~~G~~~~~NDi 510 (725) T protein:vir:77 443 DLETY-VFQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLED---GSEKDVQLMAEVVDLATGEKQVLNDI 510 (725) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC---CCcceeeecccccccccchhHhhhhh Confidence 22222 2223333444444444444321 111 122334432111 011223333211 1111 11 Q ss_pred E--EecccCcHHHHHH-HHHHHHHHHHhhHHHHHHHHH---HHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhh Q lcl|NC_020488. 531 T--VKAGPSYQTQRME-AADSLMQFVQAVPAAGGVVLD---LIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEP 604 (688) Q Consensus 531 ~--v~~~~~~~s~r~~-~~~~l~~~~q~~~~~~~~~~~---~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~ 604 (688) + .++..+....... ..+.+..+.+.++.+.+.... .+...++++......+.......+..+........+.++ T Consensus 511 ~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~ 590 (725) T protein:vir:77 511 RGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQ 590 (725) T ss_pred ccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhH Confidence 0 1122222212222 222222333322222221111 122334444444444444333222222222222222223 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHH Q lcl|NC_020488. 605 PQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGS---------LEETVRNLVAE 675 (688) Q Consensus 605 ~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~---------~~~~~~~~~~~ 675 (688) +..++.++++++++++++.++++..+++++++++++++..+++++....+.+++.++.+. .+....+..++ T Consensus 591 q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~ 670 (725) T protein:vir:77 591 QWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLK 670 (725) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH Confidence 333344444555566666666666667777777777776666555555454444333322 22222222334 Q ss_pred HHHHHHHHhhcCC Q lcl|NC_020488. 
676 AMAELMAQSQGNA 688 (688) Q Consensus 676 a~~~~~~~~q~~~ 688 (688) +.++..++.++++ T Consensus 671 ~~~~~q~~~~~~~ 683 (725) T protein:vir:77 671 TVASFQQDRSEDA 683 (725) T ss_pred HHHHHHHHHHHHH Confidence 4455555555555 No 125 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.48 E-value=4.8e-07 Score=55.23 Aligned_cols=621 Identities=11% Similarity=0.018 Sum_probs=182.8 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHH-HHHHhcCCCceeehhHHHHHHHHHHHHHhCCcceEEEeCC Q lcl|NC_020488. 20 LQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVR-KEREDEGRPCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVE 98 (688) Q Consensus 20 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~-~~~~~~g~p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~ 98 (688) ..+.+..+..+..+........ ..|..+-. ...-..|. ...+.+..++ ..+.||.++..+.. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~---------~~~R~~a~~d~~fy~G~------QW~~~~~~~l--~~q~rp~~N~i~~~ 63 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTAS---------DEARREAKNDLFFSRVS------QWDDWLSQYT--TLQYRGQFDVVRPV 63 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhh---------HHHHHHHHHHHHhhcCC------CCCHHHHHHH--HhcCCCcccchHHH Confidence 1222222222222211111100 01111111 11112343 2233334333 33455533332211 Q ss_pred cc---ccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEE Q lcl|NC_020488. 99 AN---ATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKS 175 (688) Q Consensus 99 ~~---~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~ 175 (688) -+ +.......--.+.|.|. .-+.+.++++.++...-...-+..++.++..+++.|=--+++-..+-. 
T Consensus 64 v~~v~g~e~~nr~d~~v~p~~~-~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~--------- 133 (725) T protein:vir:10 64 VRKLVSEMRQNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE--------- 133 (725) T ss_pred HHHHHhhHHhCCcceEEecCCc-chHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeecccc--------- Confidence 00 00000000011123343 334555555555544433444444555555554432011111000000 Q ss_pred ecccce--EEeCCcccccccccCceEEEEecCCHHHHH----HhcCCcc-chhcccccccccccCCCCCEEEE-EEEEee Q lcl|NC_020488. 176 IHNRFA--VLMDPDATEPDYSDANWCFISERMSKAEFN----KRYPGKA-VGDLSDAERGEYSWWTNEEGVRV-SEYFYR 247 (688) Q Consensus 176 v~~~~~--v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~----~~~p~~~-~~~~~~~~~~~~~~~~~~~~v~v-~e~~~~ 247 (688) -.||++ +.+-..+...++.+..|=...+..+.++.+ .++-... ...+...-......+.+.....- ..-|+. T Consensus 134 ~~d~~~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 213 (725) T protein:vir:10 134 DQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLT 213 (725) T ss_pred CCCCCCCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccC Confidence 001111 000000000011111111112222223222 1111110 00110000001111111000000 000111 Q ss_pred eecceeeeec----cCCceec--ccccc-hH----HHHHHHhhhhhhhee--eeeEEEEEEEEEchhhhcccCCCCCC-- Q lcl|NC_020488. 248 EPVTRKLLLL----SDGRTVW--EDEVK-DV----LDELRDLGTTVTRER--RVKTYKVKWMKVTAYDVLEGPVDWPG-- 312 (688) Q Consensus 248 ~~~~~~~~~~----~~g~~~~--~~~~~-~~----~~~~~~~g~~~~~~~--~~~~~~v~~~~~~~~~ile~~~p~~~-- 312 (688) . ....+..+ .....+. .+... +. ...+........... .+..+++..+.+--.. .-+..-+.. 
T Consensus 214 ~-~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~-~~g~~~l~~~~ 291 (725) T protein:vir:10 214 Q-DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVLKDKQ 291 (725) T ss_pred C-CeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEe-ecchhhhcCCC Confidence 1 00111100 0000000 00000 00 000000000000000 0011111111100000 011111111 Q ss_pred -CccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHH-HHHHHHhcCCCceeechhhhcchHHHHhhcccCCCce Q lcl|NC_020488. 313 -STIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTA-ATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSV 390 (688) Q Consensus 313 -~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 390 (688) -.+.++||+|++-..... .|...+..++..=+-.=.+.+. ..-.+...+..+-....+....++.....+.+ ... T Consensus 292 ~~~~~~fP~vP~~g~r~~~-~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~--~~~ 368 (725) T protein:vir:10 292 LIAGEHIPIVPVFGEWGFV-EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDG--NDD 368 (725) T ss_pred CCCCCceeEEEEEeeeecc-CCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhc--cCC Confidence 112347887765432221 2222222444333333333333 33333344444444445555555555444321 111 Q ss_pred eecCcc-----cccccc-eecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcc--hhhHHHHHHHHHHH--HHHHH Q lcl|NC_020488. 391 LRYNAI-----PGVDRP-QRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGN--EQSGKAILARQRQG--DRGTF 460 (688) Q Consensus 391 ~~~~~~-----~~~~~~-~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~--~~sg~ai~~~~~~~--~~~~~ 460 (688) ..+--. .++..+ ..+...+.++-....++........+ ....|.... +..+.+.+...-++ ..... 
T Consensus 369 ~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i----~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~ 444 (725) T protein:vir:10 369 YPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAV----KEVATLGVDAEAVNGGQVAYDTVNQLNMRADL 444 (725) T ss_pred ceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHH----HHHhCCCHHHhCcCchhhHHHHHHHHHHHHHH Confidence 100000 000000 01111122222223334333333333 344454321 22222322222111 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccc----eeeeE----EEE- Q lcl|NC_020488. 461 AYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDI----AAGKF----DVT- 531 (688) Q Consensus 461 ~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi----~~~~~----dv~- 531 (688) .... |-..++.-.+.+.+++..+.. .+. ..++.+.|-... .+...+.+|.- ..|+. |++ T Consensus 445 ~l~~-~~Dnl~~~~~~~g~~lL~lI~-----~~~---~~er~~RI~~ed---g~~~~v~in~~~~d~~~G~~v~~Ndi~g 512 (725) T protein:vir:10 445 ETYV-FQDNLATAMRRDGEIYQSIVN-----DIY---DVPRNVTITLED---GSEKEVQLMAEVVDLATGERQVLNDIRG 512 (725) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCC---CCcceeEeccccccccccchhhhhcccc Confidence 2222 222233333333444443321 111 122344442111 01123333322 11221 221 Q ss_pred -EecccCcHHHHHHHH-HHHHHHHHhhHHHHHHHH---HHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhh Q lcl|NC_020488. 532 -VKAGPSYQTQRMEAA-DSLMQFVQAVPAAGGVVL---DLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQ 606 (688) Q Consensus 532 -v~~~~~~~s~r~~~~-~~l~~~~q~~~~~~~~~~---~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~ 606 (688) .++..+.+....... +.+..+.+.++.+.+... 
..++.+++++......+.......+..+.....+..+.++++ T Consensus 513 ~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~ 592 (725) T protein:vir:10 513 RYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQW 592 (725) T ss_pred ceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHH Confidence 233333332222222 233334443333332221 123344444544444443333332222222222223333333 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH-------HHHHHHHH Q lcl|NC_020488. 607 PSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEE--TV-------RNLVAEAM 677 (688) Q Consensus 607 ~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~--~~-------~~~~~~a~ 677 (688) .++.++++++++++++.++++..+++++++.+++++..+++++..+.+.+++..+.+..+. ++ .+...++. T Consensus 593 ~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~ 672 (725) T protein:vir:10 593 LVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTV 672 (725) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH Confidence 4444555566666666666666667777777777776666655555555554444332221 11 11122334 Q ss_pred HHHHHHhhcCC Q lcl|NC_020488. 678 AELMAQSQGNA 688 (688) Q Consensus 678 ~~~~~~~q~~~ 688 (688) +++.++.++++ T Consensus 673 ~~~q~~~~~~~ 683 (725) T protein:vir:10 673 ASFQQDRSEDA 683 (725) T ss_pred HHHHHHHHHHH Confidence 44444444455 No 126 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=98.26 E-value=1.9e-06 Score=51.92 Aligned_cols=472 Identities=12% Similarity=0.044 Sum_probs=174.8 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ 80 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~ 80 (688) |+-.+-+. .+-.........++..++...+...--+..+......-...+|..+. .+.....| ...+|.++.+++. T Consensus 1 ~~~~~~~~--~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~-~Y~~rl~r-A~~~n~~~~tl~~ 76 (489) T protein:vir:78 1 MLTENGQG--SGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEA-RQAEYEAG-GIVYNFTRRTLSG 76 (489) T ss_pred CccCCCcc--CCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChH-HHHHHHhc-cccCChHHHHHHH Confidence 66655321 12222222333333333322222211111111111111223454332 23333333 4468999999999 Q ss_pred HHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 81 VLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 81 i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) .+|......|.+.+ | +.|..++..+ .+-++++.-+..++..++.+|.+++=| T Consensus 77 l~G~vfrk~p~~~~-p------------------------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV-- 129 (489) T protein:vir:78 77 MVGSVMRKEPEINI-P------------------------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLV-- 129 (489) T ss_pred HhchhhcCCcceec-c------------------------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE-- Confidence 99999988776543 1 2344444444 345778899999999999999887544 Q ss_pred eeccCCCC--------CcceeEEEecccceEEeCCcccccccc-cCceEEEEecCCHHHHHHhcCCccchhccccccccc Q lcl|NC_020488. 160 KYSTDDAF--------DLDLCIKSIHNRFAVLMDPDATEPDYS-DANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEY 230 (688) Q Consensus 160 ~~~~~~~~--------~~~~~~~~v~~~~~v~~Dp~a~~~d~~-Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~ 230 (688) ||-.+... .-.+-+..+ .|.+|+ ++.....+.. ...++..+...... +..+. 
T Consensus 130 D~P~~~~~T~ade~~~~~rPy~~~~-~~~~Ii-nW~~~~v~G~~~Lt~v~lrE~~~~~-----------------d~~~~ 190 (489) T protein:vir:78 130 DAPETGAATAAEQNAGLLNPTIAFY-TTENIV-NWRLTRVGSVNRVTMVVLRETWEYN-----------------EPGNE 190 (489) T ss_pred eeCCCCCcCHHHHHHhcCCcEEEEe-chhhhc-CceeeeeCCccceeEEEEEEeEEee-----------------cCCCC Confidence 54322110 012223333 333332 2222111110 11222221111000 00001 Q ss_pred ccCCCCCEEEEEEEEeeeecceeee-eccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCC Q lcl|NC_020488. 231 SWWTNEEGVRVSEYFYREPVTRKLL-LLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVD 309 (688) Q Consensus 231 ~~~~~~~~v~v~e~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p 309 (688) ..+....++||.+.-........++ ...+|...... +... ...+.. T Consensus 191 f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~--------------------------~~~~------~~~g~~- 237 (489) T protein:vir:78 191 FETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDV--------------------------VEIY------PDLGES- 237 (489) T ss_pred ccceeEEEEEEEecCCCcceEEEEEEeecCCccccee--------------------------eEEe------ccCCCC- Confidence 1111112233322110000000000 01111110000 0000 001111 Q ss_pred CCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCc Q lcl|NC_020488. 310 WPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQS 389 (688) Q Consensus 310 ~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 389 (688) +.+.+|||++.... .+...+....-.+-.+...+=...|-..+++..+..|...+. |. ++..+.+.....+.+. 
T Consensus 238 -~l~~IPfv~~~~~~---~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~-d~~~~~~~~~~~~~~i 311 (489) T protein:vir:78 238 -LRGVIPFTFIGATN---NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PG-ENLTPQAFKEANPNGI 311 (489) T ss_pred -ccCeeeEEEEecCC---CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cC-ccCCcccccccCccce Confidence 23566777654422 222223333445555544333334455666666666665543 32 1112222222222332 Q ss_pred eeecCcccc---cccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 390 VLRYNAIPG---VDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNL 466 (688) Q Consensus 390 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~ 466 (688) ++-.+.... .....++.+....- .-+.|....+.|.. .|.. .+ ..+.+.|+.+......+..-.|..+..|+ T Consensus 312 ~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~-lGa~--l~-~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (489) T protein:vir:78 312 KFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGAQ--LI-TPTQQITAQSARIQRGADTSVMATIARNV 386 (489) T ss_pred eeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHH-Hhhh--hc-cCCcchhHHHHHHHHHHhhHHHHHHHHHH Confidence 221111100 11122333322211 12223222222322 2322 12 22335788888887777777788888888 Q ss_pred HHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHH Q lcl|NC_020488. 467 SRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAA 546 (688) Q Consensus 467 ~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~ 546 (688) +.++.. ++.++..|.... ++..--+.+|. +|++ .....+.. T Consensus 387 e~al~~----~l~~~a~w~G~~--------~~~~~~i~~n~-------------------dF~~--------~~~d~~~~ 427 (489) T protein:vir:78 387 SQAYTD----ALRWVAVMLGKP--------EDTEVEFRLNM-------------------DFFL--------EPMTAQDR 427 (489) T ss_pred HHHHHH----HHHHHHHHcCCC--------CCCceEEEeec-------------------ccCc--------ccCCHHHH Confidence 777655 566666664311 01111112221 1211 11111123 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHhcCC--ccHHHHHHHHHhhccccccchh-hHHhhhhhhhh Q lcl|NC_020488. 
547 DSLMQFVQAVPAAGGVVLDLIAKNMDW--PGAQDIARRLQKTLPPGILDQD-EMEEAGIEPPQ 606 (688) Q Consensus 547 ~~l~~~~q~~~~~~~~~~~~~~e~~~~--~~~~ei~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 606 (688) ++|+.+.+. ..+....+...++..++ +..+++...+.....+...... ...+..+++.+ T Consensus 428 ~al~~~~~~-G~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 428 AAWMADINA-GLLPATAYYAALRKAGVTDWTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHHHhc-CCCCHHHHHHHHHhCCCCCccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 333333321 00111111111222222 2345555665543221111000 00000000000 No 127 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.06 E-value=5.6e-06 Score=49.36 Aligned_cols=446 Identities=9% Similarity=-0.046 Sum_probs=181.7 Q ss_pred CCCCCCcCCCCccchHHHHHHHHHH---HHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHH Q lcl|NC_020488. 2 LPGNEPIKTRDDDSQEAILQEIRER---AAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYV 78 (688) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i 78 (688) ||.+- ... .....+.. +..++.....+|....+.+-.+.|..+.. .+.+-.-..-+|.++.++ T Consensus 1 m~V~~-----~hp----~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~-----Y~~rl~rA~~~n~~~~t~ 66 (452) T protein:vir:94 1 MPIET-----KHP----EYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDM-----YNAYKQRALFYSITSKTL 66 (452) T ss_pred CCCCC-----cCH----HHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHH-----HHHHHhhccCCchHHHHH Confidence 44333 111 22222222 22233333333322222233334444422 233322244579999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) +..+|......|.+.+ | . 
.+..+ +...+-++++.-...++..++..|.+++-| T Consensus 67 ~~~~G~vf~k~p~~~~-p--------------------~----~l~~~-~~D~~G~~L~~~~~~~~~~~l~~G~~~ilV- 119 (452) T protein:vir:94 67 SALSGMVLDQPPVITH-P--------------------D----AMSKY-FEDQSGIQFYEVFTRAVEETLLMGRVGVFI- 119 (452) T ss_pred HHHhchhhcCCceecc-c--------------------H----HHHHH-HhcccCCCHHHHHHHHHHHHHhcCeEEEEE- Confidence 9999999888776533 1 1 12222 223456888999999999999999877555 Q ss_pred EeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCE Q lcl|NC_020488. 159 TKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEG 238 (688) Q Consensus 159 ~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 238 (688) ||.. ....+-+..+ +|.+|+ |+.... +.. ..++..+..... . +.......+. T Consensus 120 -D~p~---~g~rPy~~~~-~~~~Ii-~W~~~~-~g~-l~~v~lre~~~~------------------~--d~~d~f~~~~ 171 (452) T protein:vir:94 120 -DRPL---TGGDPYISVY-TTENIL-NWEEDE-DGR-LLMVVLREFYTV------------------R--DTADRYVQNI 171 (452) T ss_pred -eecc---CCCceEEEEe-chhhhc-Cccccc-cCC-eeEEEEEEEEEE------------------e--cCCCccccee Confidence 5532 2345555555 455544 332111 110 111111100000 0 0000001111 Q ss_pred EEEEEEEeeeeccee--eeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccc Q lcl|NC_020488. 239 VRVSEYFYREPVTRK--LLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIP 316 (688) Q Consensus 239 v~v~e~~~~~~~~~~--~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P 316 (688) +..+.+|...+-... ++...++. .+. .........+.. 
+.+.+| T Consensus 172 ~~~yRvL~l~~g~~~v~~~~~~~~~-~~~-------------------------------~~~~~~~~~~~~--~l~~IP 217 (452) T protein:vir:94 172 RVRYRCLELVDGLLQITVHETQDGK-VWE-------------------------------LAKTSTIQNVGV--TMDYIP 217 (452) T ss_pred EEEEEEEEEeCCeEEEEEEEccCCc-eee-------------------------------eccceeecCCCc--ccceeE Confidence 222222211111000 00111111 000 000000111222 346677 Q ss_pred eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcc Q lcl|NC_020488. 317 VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAI 396 (688) Q Consensus 317 ~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 396 (688) ||++.+.. .+...+.+..-++-+++..+....|-..+++..+..|..++... ++.+.- ..-.+.++.+.. T Consensus 218 ~v~~~~~~---~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~-----~~~~~i-~iG~~~~~~lpe- 287 (452) T protein:vir:94 218 FFCITPSG---LSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGA-----ESQSTM-HIGSTKAWVIPE- 287 (452) T ss_pred EEEEcCCC---CCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecC-----cCCCce-EecccccccCCC- Confidence 77654432 23344556677888888888888888899999888886665422 111111 111122333321 Q ss_pred cccccceecCCCcch-HHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 397 PGVDRPQRDMPASMP-AAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQ 475 (688) Q Consensus 397 ~~~~~~~~~~~~~~~-~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~ 475 (688) .+ ..+.++.+..-+ .....-|+...+.|..+ |. ....+.....+|+.|...........|..+..+++.+.. 
T Consensus 288 ~~-~~~~yie~~g~~i~~~~~~l~~le~~m~~~-Ga-~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~---- 360 (452) T protein:vir:94 288 VA-AKVGFLEFTGQGLQSLEKALSEKQAQLASL-SA-RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLN---- 360 (452) T ss_pred CC-CcceEEccCchhHHHHHHHHHHHHHHHHHH-HH-HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHH---- Confidence 12 235555543222 22334444444444332 43 233344444567776655555555677788888877764 Q ss_pred HHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 476 ILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQA 555 (688) Q Consensus 476 ~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~ 555 (688) .++.++..|.... .. --|.+|. +|.. ..-.+. ..+++.++.+. T Consensus 361 ~~l~~~a~w~g~~---------~~-~~v~~n~-------------------dF~~----~~~~~~----~~~al~~~~~~ 403 (452) T protein:vir:94 361 KAYSCIMDMESMG---------GT-LNIKLNS-------------------AFLD----SKLTAA----ELKAWVEAYLS 403 (452) T ss_pred HHHHHHHHHcCCC---------Cc-eEEEecc-------------------cccc----ccCCHH----HHHHHHHHHhc Confidence 5566666665421 11 1122221 1110 001122 22222322221 Q ss_pred hHHHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHH Q lcl|NC_020488. 556 VPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQ 611 (688) Q Consensus 556 ~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 611 (688) .......+...++..++...+.-.+++....+.+.+... 
-.+..+.-++ T Consensus 404 -G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~------~~~~~~~~~~ 452 (452) T protein:vir:94 404 -GGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPS------NTPPNPSSKA 452 (452) T ss_pred -CCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccC------CCCCCCccCC Confidence 001111111122222322222111222211111111000 0000000000 No 128 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=98.03 E-value=6.6e-06 Score=48.99 Aligned_cols=474 Identities=12% Similarity=0.028 Sum_probs=173.7 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC-CCCCCHHHHHHHHhcCCCceeehhHHHHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLA-GEQWPESVRKEREDEGRPCLTLNKLPQYVD 79 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~-G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~ 79 (688) |+-.+-.. .+-.........++..++...+...--+..+.+. .|.. ...+..+ ...+.+-.-...+|.++.+++ T Consensus 1 ~~~~~~~~--~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~-~yl~~~~~~~~e--~~Y~~rl~rA~~~n~~~~tl~ 75 (491) T protein:vir:95 1 MLTANGQG--SGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRN-VGLNEPDKAYGE--ARQAEYEAGGIVYNFTRRTLS 75 (491) T ss_pred CcccCCcc--CCCCccCHHHHHHHHHHHHHHHHhcCcchhhccc-CCCcCCCCCCCH--HHHHHHHhcccCCChHHHHHH Confidence 66665321 1222222233333333332222221111111110 1110 1123322 223333333556899999999 Q ss_pred HHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHcCCceEEEE Q lcl|NC_020488. 
80 QVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEGGFGWLRVL 158 (688) Q Consensus 80 ~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~G~G~~~v~ 158 (688) ..+|......|.+.+ | +.|..++..+ .+-++++.-+..++..++.+|.+++=| T Consensus 76 ~l~G~vfrk~p~~~~-p------------------------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV- 129 (491) T protein:vir:95 76 GMVGSVMRKEPEINI-P------------------------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLV- 129 (491) T ss_pred HHhchhhcCCceeec-c------------------------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEE- Confidence 999999988776542 1 2244444444 345778899999999999999887544 Q ss_pred EeeccCCCC--------CcceeEEEecccceEEeCCccccccc-ccCceEEEEecCCHHHHHHhcCCccchhcccccccc Q lcl|NC_020488. 159 TKYSTDDAF--------DLDLCIKSIHNRFAVLMDPDATEPDY-SDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGE 229 (688) Q Consensus 159 ~~~~~~~~~--------~~~~~~~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~ 229 (688) ||-..... .-.+-+..+ .|.+|+ ++.....+. ....++..+..... .+..+ T Consensus 130 -D~P~~~~~T~Ade~~~~~rPy~~~~-~~~~Ii-nW~~~~v~g~~~L~~v~l~E~~~~-----------------~d~~~ 189 (491) T protein:vir:95 130 -DAPETAAATAAEQNAGLLNPTIAFY-TTENIV-NWRLTRVGSVNRVTMVVLRETWEY-----------------HEPGN 189 (491) T ss_pred -ecCCCcccCHHHHHHhcCCcEEEEe-chhhhc-CceeeeeCCceeeeEEEEEEeEEe-----------------ecCCC Confidence 54322110 011222223 333332 222111110 01122221111000 00000 Q ss_pred cccCCCCCEEEEEEEEeeeecceeeee-ccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCC Q lcl|NC_020488. 230 YSWWTNEEGVRVSEYFYREPVTRKLLL-LSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPV 308 (688) Q Consensus 230 ~~~~~~~~~v~v~e~~~~~~~~~~~~~-~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~ 308 (688) .........+||.+.+....-...++. ..+|...... +.... -.+.. 
T Consensus 190 ~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~--------------------------~~~~~------~~g~~ 237 (491) T protein:vir:95 190 EFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEV--------------------------VEIYP------DLGES 237 (491) T ss_pred CcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeee--------------------------eeeee------cCCCc Confidence 111111223444332210000000110 1111110000 00000 01111 Q ss_pred CCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCC Q lcl|NC_020488. 309 DWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQ 388 (688) Q Consensus 309 p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 388 (688) +.+.+|||++.+.. .+...+....-.+-.+...+=...|-..+++..+..|...+. |.-+ ..+.+.....+++ T Consensus 238 --~l~~IPfv~~~~~~---~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~-G~d~-~~~~~~~~~~~~~ 310 (491) T protein:vir:95 238 --LRGVIPFTFIGATN---NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGDN-LTPQSFKEANPNG 310 (491) T ss_pred --ccCeeEEEEEecCC---CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCcc-cCcchhhccCcce Confidence 23556666554322 222333333445555543333333445556666666655442 2111 1111222222222 Q ss_pred ceeecCccc---ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 389 SVLRYNAIP---GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDN 465 (688) Q Consensus 389 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn 465 (688) ..+-.+... ......++.+....- ..+.|......|.. .|.. 
+...+.+.||++......+..-.|..+..| T Consensus 311 i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~-~Ga~---l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~ 385 (491) T protein:vir:95 311 IKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGAQ---LITPSQQITAESARIQRGADTSVMATIARN 385 (491) T ss_pred eEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHH-HHHH---hccCCcchhHHHHHHHHHHhhHHHHHHHHH Confidence 221111110 011223333322111 12223333333322 2332 222334578888888888887788888888 Q ss_pred HHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHH Q lcl|NC_020488. 466 LSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEA 545 (688) Q Consensus 466 ~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~ 545 (688) ++.++.. ++.++..+.... ++..--+.+|. +|++. +-.+. . T Consensus 386 ~e~al~~----~l~~~a~w~G~~--------~~~~v~i~~n~-------------------dF~~~----~~~~~----~ 426 (491) T protein:vir:95 386 VSQAYTD----ALRWVAMMLGKP--------EDSEVEFQLNM-------------------DFFLQ----PMTAQ----D 426 (491) T ss_pred HHHHHHH----HHHHHHHHcCCC--------CCCceEEEeec-------------------ccccc----cCCHH----H Confidence 8877755 556666664311 01110112221 11100 01111 2 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHhcCCc--cHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHH Q lcl|NC_020488. 546 ADSLMQFVQAVPAAGGVVLDLIAKNMDWP--GAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPE 610 (688) Q Consensus 546 ~~~l~~~~q~~~~~~~~~~~~~~e~~~~~--~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (688) .++++.+.+. ..+-...+...++..++. ..+++.+.+................ 
-++..+...+ T Consensus 427 ~~all~~~~~-G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~-~~~~~~~~~~ 491 (491) T protein:vir:95 427 RAAWMADINA-GLLPATAYYAALRKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGE-IPQAAQQQQE 491 (491) T ss_pred HHHHHHHHhc-CCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCCCCCcccccccc-chhhhhhccC Confidence 2333333321 001111111112222222 2355555554433222111110000 0000000000 No 129 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=97.73 E-value=9.2e-06 Score=48.20 Aligned_cols=571 Identities=15% Similarity=0.097 Sum_probs=219.0 Q ss_pred CCCCCCCcCC-----CCccchHHHHHHHHHHHHHHHHhhhHHHHHHH---HHHHhh----CCCCCCHHHHHHHHhcCCCc Q lcl|NC_020488. 1 MLPGNEPIKT-----RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQ---EDISFL----AGEQWPESVRKEREDEGRPC 68 (688) Q Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---~~~~~~----~G~Qw~~~~~~~~~~~g~p~ 68 (688) +-|..+-+.. .-++.....+.++++- .......+..+.. .++.-| .++-+-.. .......-|| T Consensus 3 ispsepninsfvytqrvdellkahlkkildf---sktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~--~~~~~A~V~C 77 (666) T protein:vir:10 3 ISPSEPNINSFVYTQRVDELLKAHLKKILDF---SKTNKANYIQKMDLIDKAYARYITAQENNELLGY--NQNIAAKVRC 77 (666) T ss_pred cCCCCCcchhhhhHHHHHHHHHHHHHHHhhh---hccchhhHHHHhhhHHHhHHhhhhccCCCceeee--cccccccCcc Confidence 2222221110 1122222222232221 1111222333321 122212 23222111 1122344566 Q ss_pred eeeh------hHHHHHHHHHHHHHhC-----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChH Q lcl|NC_020488. 
69 LTLN------KLPQYVDQVLGDQRQN-----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAE 137 (688) Q Consensus 69 ~~~N------~i~~~i~~i~g~~~~~-----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~ 137 (688) -++| ++-+.|..++|++..- ..-|.|+ .|+..+-||.|++++..-.....+- T Consensus 78 ~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-----------------~P~~K~~AE~LE~ii~DH~t~~~~~ 140 (666) T protein:vir:10 78 QVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-----------------TPDKKEQAEALEGIIQDHMTMTSSI 140 (666) T ss_pred eeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-----------------CCchhHHHHHHHHHHHhhhhhhhhH Confidence 6665 3456678888887642 1222222 2446788999999998766655555 Q ss_pred HHHHHHHHHHHHcCCceEEEEEe----ecc----CCCCCcc----------eeEEEecccceEEeCCcccccccc-cCce Q lcl|NC_020488. 138 AHYDNAFQHAVEGGFGWLRVLTK----YST----DDAFDLD----------LCIKSIHNRFAVLMDPDATEPDYS-DANW 198 (688) Q Consensus 138 ~~~~~~~~d~~~~G~G~~~v~~~----~~~----~~~~~~~----------~~~~~v~~~~~v~~Dp~a~~~d~~-Da~~ 198 (688) ...--+++|+++..+.-|+.-|- |+. ++...+. -+|++. ||+++||||..--+|.. ...| T Consensus 141 ~~LiL~L~D~~KYN~~~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL-N~RN~~~D~~~~~~~VA~~G~~ 219 (666) T protein:vir:10 141 PELILCLQDAAKYNLVGWETEWSHIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL-NLRNVHWDPIPDIPNVATEGSF 219 (666) T ss_pred HHHHHHHhhhhhcceeeeeeccccccccchhhhhhcCCCceeecccchhhhhhhhcc-ccccccccCCCCCCchhhhhhh Confidence 55666778888777644443221 110 1111111 234566 88999999964434422 3478 Q ss_pred EEEEecCCHHHHHHhcC----Ccc------c-hh-cccccccccccCCCCCE---------EEEEEEEeeeecceeeeec Q lcl|NC_020488. 199 CFISERMSKAEFNKRYP----GKA------V-GD-LSDAERGEYSWWTNEEG---------VRVSEYFYREPVTRKLLLL 257 (688) Q Consensus 199 ~~~~~~~~~~e~~~~~p----~~~------~-~~-~~~~~~~~~~~~~~~~~---------v~v~e~~~~~~~~~~~~~~ 257 (688) +.....+++-.+++... +++ . .- -+++.. .+|.+.+.. +--+.|..+..+..+. . 
T Consensus 220 ~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~--sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~--s 295 (666) T protein:vir:10 220 LGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQG--SDWTDNPQISPVYQEMEMASDINWDRFGGFETET--S 295 (666) T ss_pred hhHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccc--cccccCCccCccccccchhhccchhhcCcccccc--c Confidence 88888888777765321 110 0 00 001110 111111100 0000000000000000 0 Q ss_pred cCCceecccc--cch---HHHHHH--HhhhhhhheeeeeEEEEEEEEEchhhhcc-cCCCCCCCccceEEEeeeeeccCC Q lcl|NC_020488. 258 SDGRTVWEDE--VKD---VLDELR--DLGTTVTRERRVKTYKVKWMKVTAYDVLE-GPVDWPGSTIPVAPVLGKEMVIGD 329 (688) Q Consensus 258 ~~g~~~~~~~--~~~---~~~~~~--~~g~~~~~~~~~~~~~v~~~~~~~~~ile-~~~p~~~~~~P~vp~~~~~~~~~~ 329 (688) +-|+.+...+ ... .+.++. +....+.-...+..|+ ..+++++.++. ++.--.+++||+- +++- ..+| T Consensus 296 S~~~rvpvneqg~Y~k~~~Y~RI~PSDF~~~~P~~N~~QIWK--~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~-LEDG 370 (666) T protein:vir:10 296 STNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWK--AVMINRDAIISFEPYIGAYGSFGMG--LAFA-LEDG 370 (666) T ss_pred ccccccccccccceeeeeeeeeeccccceecCCCCCcceeee--eeeeccceeEeeehhhhccchhhhh--hhhh-hhhc Confidence 0011000000 000 000000 0000111111222333 23345665553 2322245666654 3432 3455 Q ss_pred ccc-ccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCce-eecC--cc---cccccc Q lcl|NC_020488. 330 KTY-YRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSV-LRYN--AI---PGVDRP 402 (688) Q Consensus 330 ~~~-g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~--~~---~~~~~~ 402 (688) ..+ ..|+.+..++.|+...++++..+-...+....+.++++..+.. ...+.+...+ ++.. .. 
.-++.- T Consensus 371 ~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a-----~~iNSP~~~~KIP~~~~sL~N~~~~~~Y 445 (666) T protein:vir:10 371 MGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA-----NDINSPIPQIKIPVVPQSLVNGTMDQAY 445 (666) T ss_pred cccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhhhh-----hcccCCCCCcccceeehhhcccchhhhh Confidence 433 3467778899999999887776666555555566666555432 1122221111 1100 00 001111 Q ss_pred eecCCCcchHH---HHHHHHHHHHHHHHHhCcChHHcCC-CcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_020488. 403 QRDMPASMPAA---ELQLALSATDEMKATIGLYDASVGA-QGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQIL 477 (688) Q Consensus 403 ~~~~~~~~~~~---~~~ll~~~~~~~~~~tGv~d~~~G~-~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~ 477 (688) ..+ |-..-+ .++-.....+.-++++|++...+|+ ...+.|-+.-...+-.+..|+....=-++ +.+..+-+++ T Consensus 446 ~~I--PFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~~E~~~~MG~a~NR~RLPALiLEH~~F~~iK~~L 523 (666) T protein:vir:10 446 RQI--PFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQL 523 (666) T ss_pred ccC--CccccchhHHHhhhHHHHhhHHHhhccCCcccccccccCcceeehhhhcCCcccceehhhHHhhhhhhhhHHHHH Confidence 111 111122 3333445566678899999999996 33333322222222233444444333332 2333333444 Q ss_pred HHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh- Q lcl|NC_020488. 478 IELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV- 556 (688) Q Consensus 478 ~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~- 556 (688) .--|-+|=++-.|+.-.. 
|..--+.++ .+.-....+.+..|..-.+ +.+-.+.+..++|.+ T Consensus 524 ~LNl~~YG~DT~ViS~Rt--G~~~~vDi~---------------~L~~~~L~F~~~DG~TP~S-K~ASs~~lT~~LQMI~ 585 (666) T protein:vir:10 524 KLNLLMYGEDTEVISPRT--GKGVRVDIK---------------ELQDLGLKFELGDGLTPAS-KLASSDFLTALLQMIM 585 (666) T ss_pred hhhhhhccccchhccccc--CceeeeeHH---------------HHhhhhheeeeccCCCchh-hhhhhHHHHHHHHHHh Confidence 333445444333332211 111011111 1111124456666654444 444444444444432 Q ss_pred ------HHHHHH---HHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 ------PAAGGV---VLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKA 627 (688) Q Consensus 557 ------~~~~~~---~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 627 (688) ..++.. +...++++.+..+.++.... .+++-.+...-+ |+.++.-+|...+ T Consensus 586 sS~~~~~A~G~~~P~M~AH~~QLGGVRG~E~Y~da---alP~~~~~~~~~-------------Q~LQ~~~LQ~~~Q---- 645 (666) T protein:vir:10 586 SSETTLQAFGTQVPGMIAHLAQLGGVRGFEKYADA---ALPQWQITYGMQ-------------QQLQQMLLQLQQQ---- 645 (666) T ss_pred hhhhhHhhhcccchHHHHHHHHhccccchhhhhhc---cCCccccccchh-------------HHHHHHHHHHhhh---- Confidence 112222 33344555555555554432 222222211100 0111110110000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 628 DTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAG 661 (688) Q Consensus 628 e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~ 661 (688) ...|.++. |.+....|.. 
-.+ T Consensus 646 --SA~Q~~A~--Q~~L~~~Q~~---------PSq 666 (666) T protein:vir:10 646 --SAMQLQAR--QGELSNDQSQ---------PSQ 666 (666) T ss_pred --hhcccccc--cccCcccccC---------CCC Confidence 00000000 0000000000 000 No 130 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=97.68 E-value=7e-06 Score=48.85 Aligned_cols=571 Identities=15% Similarity=0.102 Sum_probs=216.2 Q ss_pred CCCCCCCcCC-----CCccchHHHHHHHHHHHHHHHHhhhHHHHHHH---HHHHhh----CCCCCCHHHHHHHHhcCCCc Q lcl|NC_020488. 1 MLPGNEPIKT-----RDDDSQEAILQEIRERAAHAVTCWKHNFDAAQ---EDISFL----AGEQWPESVRKEREDEGRPC 68 (688) Q Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---~~~~~~----~G~Qw~~~~~~~~~~~g~p~ 68 (688) +-|..+-+.. .-++.....+.++++- .......+..+.. .++.-| .++-+-.. .......-|| T Consensus 3 ispsepninsfvytqrvdellkahlkkildf---sktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~--~~~~~A~V~C 77 (666) T protein:vir:96 3 ISPSEPNINSFVYTQRVDELLKAHLKKILDF---SKTNKANYIQKMDLIDKAYARYITAQENNELLGY--NQNIAAKVRC 77 (666) T ss_pred cCCCCCcchhhhhHHHHHHHHHHHHHHHhhh---hccchhhHHHHhhHHHHhHHhhhhccCCCceeee--cccccccccc Confidence 2222221110 1122222222232221 1111222333322 122212 22222111 1122344566 Q ss_pred eeeh------hHHHHHHHHHHHHHhC-----CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChH Q lcl|NC_020488. 
69 LTLN------KLPQYVDQVLGDQRQN-----RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAE 137 (688) Q Consensus 69 ~~~N------~i~~~i~~i~g~~~~~-----r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~ 137 (688) -++| ++-+.|+.++|++..- +.-|.|+ .|+..+-||.|++++..-.....+- T Consensus 78 ~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-----------------~P~~K~~AE~LE~ii~DH~t~~~~~ 140 (666) T protein:vir:96 78 QVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-----------------TPDKKEQAEALEGIIQDHMTMTSSI 140 (666) T ss_pred eeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-----------------CCchhHHHHHHHHHHHhhhhhhhhH Confidence 6665 3456678888887642 1222221 2446788999999998766655555 Q ss_pred HHHHHHHHHHHHcCCceEEEEEe----ecc----CCCCCcc----------eeEEEecccceEEeCCcccccccc-cCce Q lcl|NC_020488. 138 AHYDNAFQHAVEGGFGWLRVLTK----YST----DDAFDLD----------LCIKSIHNRFAVLMDPDATEPDYS-DANW 198 (688) Q Consensus 138 ~~~~~~~~d~~~~G~G~~~v~~~----~~~----~~~~~~~----------~~~~~v~~~~~v~~Dp~a~~~d~~-Da~~ 198 (688) ...--+++|+++..+.-|+.-|- |+. ++...+. -+|++. ||+++||||..--+|.. ...| T Consensus 141 ~~LiL~L~D~~KYN~~~~ET~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRL-N~RN~~~D~~~~~~~VA~~G~~ 219 (666) T protein:vir:96 141 PELILCLQDAAKYNLVGWETEWSNIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRL-NLRNVHWDPIPDIPNVATEGSF 219 (666) T ss_pred HHHHHHHhhhhhcceeeeeeccccccccchhhhhhcCCCceeeeccchhhhhhhhcc-ccccccccCCCCCCchhhhhhh Confidence 55666678888777644443221 110 1111111 234566 78999999964434422 3578 Q ss_pred EEEEecCCHHHHHHhc----CCccc-------hh-cccccccccccCCCCCEE-------EEEEE-Ee-eeecceeeeec Q lcl|NC_020488. 199 CFISERMSKAEFNKRY----PGKAV-------GD-LSDAERGEYSWWTNEEGV-------RVSEY-FY-REPVTRKLLLL 257 (688) Q Consensus 199 ~~~~~~~~~~e~~~~~----p~~~~-------~~-~~~~~~~~~~~~~~~~~v-------~v~e~-~~-~~~~~~~~~~~ 257 (688) +.....+++-.+++.. .+++. .- -+++.. .+|.+.+... ....+ |. +..+..+. . 
T Consensus 220 ~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~--sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~--s 295 (666) T protein:vir:96 220 LGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQG--SDWTDNPQISPVYQEMEMASDINWDRFGGFETET--S 295 (666) T ss_pred hhhHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccc--cccccCCcccccccccchhhccchhhcCcccccc--c Confidence 8888888877776532 11100 00 001110 1111111000 00000 10 00000000 0 Q ss_pred cCCceecccc--cch---HHHHHH--HhhhhhhheeeeeEEEEEEEEEchhhhcc-cCCCCCCCccceEEEeeeeeccCC Q lcl|NC_020488. 258 SDGRTVWEDE--VKD---VLDELR--DLGTTVTRERRVKTYKVKWMKVTAYDVLE-GPVDWPGSTIPVAPVLGKEMVIGD 329 (688) Q Consensus 258 ~~g~~~~~~~--~~~---~~~~~~--~~g~~~~~~~~~~~~~v~~~~~~~~~ile-~~~p~~~~~~P~vp~~~~~~~~~~ 329 (688) +-|+.+...+ ... .+.++. +....+.-...+..|+ ...++++.++. ++.--.+++||+- +++- ..+| T Consensus 296 S~~~rvpvneqg~Y~k~~mY~RI~PSDF~~~~P~~N~~QIWK--~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~-LEDG 370 (666) T protein:vir:96 296 STNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWK--AVMINRDAIISFEPYIGAYGSFGMG--LAFA-LEDG 370 (666) T ss_pred ccccccccccccceeeeeeeeeeccccceecCCCCCcceeee--eeeeccceeEeeehhhcccchhhhh--hhhh-hhhc Confidence 0011000000 000 000000 0000111111222333 22345665553 2322245666654 3332 3455 Q ss_pred ccc-ccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCce-eecCc--c---cccccc Q lcl|NC_020488. 330 KTY-YRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSV-LRYNA--I---PGVDRP 402 (688) Q Consensus 330 ~~~-g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~~--~---~~~~~~ 402 (688) ..+ ..|+.+..++.|+...++++..+-...+....+.++++..+.. ...+.+...+ ++... . 
.-++.- T Consensus 371 mG~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a-----~~iNSP~~~~KIP~~~~sL~N~~m~~~Y 445 (666) T protein:vir:96 371 MGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA-----NDINSPIPQIKIPVVPQSLVNGTMDQAY 445 (666) T ss_pred cccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhhh-----hcccCCCCCcccceeehhhhccchhhhh Confidence 433 3467778899999999888777666555555566665555432 1122221111 11000 0 001111 Q ss_pred eecCCCcchHH---HHHHHHHHHHHHHHHhCcChHHcCC-CcchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_020488. 403 QRDMPASMPAA---ELQLALSATDEMKATIGLYDASVGA-QGNEQSGKAILARQRQGDRGTFAYIDNLS-RAIRRVGQIL 477 (688) Q Consensus 403 ~~~~~~~~~~~---~~~ll~~~~~~~~~~tGv~d~~~G~-~~~~~sg~ai~~~~~~~~~~~~~~~dn~~-~~~~~~~~~~ 477 (688) ..+ |-..-+ .++-.....+.-++++|++...+|+ ...+.|-+.-...+-.+..|+....=-++ +.+..+-+++ T Consensus 446 ~~I--PFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~~E~~~~MG~a~NRmRLPALiLEH~~F~~iK~~L 523 (666) T protein:vir:96 446 RQI--PFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQL 523 (666) T ss_pred ccC--CccccchhHHHhhhHHHhhhHHHhhccCCcccccccccCcceeehhhhcCCcccceehhhHHHhhhhhhhHHHHH Confidence 111 111122 3333445566678899999999996 33333322222222233445444333332 2333333333 Q ss_pred HHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhh- Q lcl|NC_020488. 478 IELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAV- 556 (688) Q Consensus 478 ~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~- 556 (688) .--|-+|=++-.|+.-.. 
|..--+.++ .+.-....+.+..|..-.+ +.+-.+.+..++|.+ T Consensus 524 ~LNl~~YG~DT~ViS~Rt--G~~~~vDi~---------------~L~~~~L~F~~~DGlTP~S-KlASs~~lT~~LQMI~ 585 (666) T protein:vir:96 524 KLNLLMYGEDTEVISPRT--GKGVRVDIK---------------ELQDLGLKFELGDGLTPAS-KLASSDFLTALLQMIM 585 (666) T ss_pred hhhhhhccccchhccccc--CceeeeeHH---------------HHhhhhheeeeccCCCchh-hhhhhHHHHHHHHHHh Confidence 333444433333332111 111011111 1111124456666654444 444444455444432 Q ss_pred ------HHHHHHH---HHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 ------PAAGGVV---LDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKA 627 (688) Q Consensus 557 ------~~~~~~~---~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~ 627 (688) ..++..+ ...++++.+..+.++. ...++++-...-- .+ |+.++.-+|...+ T Consensus 586 sS~~~~~A~G~~~P~M~AHl~QLGGVRG~E~Y---~~~ALPqwqityg--m~-----------Q~LQ~~~LQ~~~Q---- 645 (666) T protein:vir:96 586 SSETTLQAFGTQVPGMIAHLAQLGGVRGFEKY---ANAALPQWQITYG--MQ-----------QQLQQMLLQLQQQ---- 645 (666) T ss_pred cchhhHhhhcccchHHHHHHHHhccccchhhc---ccccCcchhhhhh--hh-----------HHHHHHHHHHhhh---- Confidence 1222222 2333444444444443 2222222111100 00 0000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 628 DTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAG 661 (688) Q Consensus 628 e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~ 661 (688) ...|.++. |.+....|.. 
..+ T Consensus 646 --SA~Q~~A~--Q~~L~~~Q~~---------PSq 666 (666) T protein:vir:96 646 --SAMQLQAR--QGELSNDQSQ---------PSQ 666 (666) T ss_pred --hccccccc--cccCcccccC---------CCC Confidence 00000000 0000000000 000 No 131 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=97.38 E-value=8.1e-05 Score=43.01 Aligned_cols=573 Identities=13% Similarity=0.031 Sum_probs=122.3 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHH-HhcCCCc-----eeehhHHHHHHHH Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKER-EDEGRPC-----LTLNKLPQYVDQV 81 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~-~~~g~p~-----~~~N~i~~~i~~i 81 (688) |++....+.... .++. +-...+...+..|++|. +..+....+ ...|.+. -...++.+.|... T Consensus 1 ~~k~~~~~~~~~-~~~~----------~~~~~~~~~a~~~~~~~-~~~~~~~~~~~y~g~~~~~~~~~~s~~~~~~v~~~ 68 (705) T protein:vir:88 1 MAKRRKIKPMDD-EQVL----------RHLDQLVNDALDFNSSE-LSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQET 68 (705) T ss_pred CCcccccccCCH-HHHH----------HHHHHHHHHHHhhhhhH-HHHHHHHHHHHHhCCCCCcccCCCCccccHHHHHH Confidence 333322221111 1122 22344555666777763 222222222 1234332 1122222333332 Q ss_pred HHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEee Q lcl|NC_020488. 82 LGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKY 161 (688) Q Consensus 82 ~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~ 161 (688) +......-..+-+ +.+ . 
.-.+.|--..-+++..-.-.++.+...-......++.+.+..++ T Consensus 69 v~~~~~~l~~~~~-~~~-----~----~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal--------- 129 (705) T protein:vir:88 69 VDWIMPSLMKVFT-SGG-----Q----VVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTL--------- 129 (705) T ss_pred HHHHHHHHHHhhc-CCC-----c----eEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHh--------- Confidence 2222222222111 000 0 00011122222222111111111111111111122222222221 Q ss_pred ccCCCCCcceeEEEecccc-------------------eEEeCCcccccccccCceEE--------------EEecCCHH Q lcl|NC_020488. 162 STDDAFDLDLCIKSIHNRF-------------------AVLMDPDATEPDYSDANWCF--------------ISERMSKA 208 (688) Q Consensus 162 ~~~~~~~~~~~~~~v~~~~-------------------~v~~Dp~a~~~d~~Da~~~~--------------~~~~~~~~ 208 (688) .+...|..| .|. .++.||.+.-.+-++-.+.. +...++.. T Consensus 130 ------~~g~gi~kv-~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~ 202 (705) T protein:vir:88 130 ------MMKTGVVKV-YVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPE 202 (705) T ss_pred ------hcCCeEEEe-ccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHH Confidence 111111122 010 11234433221111111111 11111211 Q ss_pred HHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceee----------eeccCCce-------ecccccchH Q lcl|NC_020488. 209 EFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKL----------LLLSDGRT-------VWEDEVKDV 271 (688) Q Consensus 209 e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~----------~~~~~g~~-------~~~~~~~~~ 271 (688) +|. ++-. ...+.+-. .++...+.......- +..++... +..+..... T Consensus 203 d~~---~dp~-----------a~~~~d~~--~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~ 266 (705) T protein:vir:88 203 NFL---VDRL-----------ATCIDDAR--FLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMT 266 (705) T ss_pred Hce---ecCC-----------CCCcccCc--EEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccc Confidence 111 0000 00011100 010100100000000 00000000 000000000 Q ss_pred HHHHHHhhhhhhheeeeeEEEEEEEEE-chhhhccc-CCCCCCCccceEEEeeeeecc--CCcccccchH-HHhhHHHHH Q lcl|NC_020488. 
272 LDELRDLGTTVTRERRVKTYKVKWMKV-TAYDVLEG-PVDWPGSTIPVAPVLGKEMVI--GDKTYYRGLI-RFGKDAQRM 346 (688) Q Consensus 272 ~~~~~~~g~~~~~~~~~~~~~v~~~~~-~~~~ile~-~~p~~~~~~P~vp~~~~~~~~--~~~~~g~g~v-~~~~d~Q~~ 346 (688) .......+......+.+..+.+|..+. .|+.+.+- ...|.+++..-++.++.+.++ +-.+...+++ ..+.+.-.- T Consensus 267 ~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d 346 (705) T protein:vir:88 267 GQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRD 346 (705) T ss_pred cccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccccccCCCCCEEEecceeecCccccCChHHHHhH Confidence 000000001111122232333322211 11111100 001122222111111111111 1111111111 223333333 Q ss_pred HHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCceeecCcc--------cccccceecCCCcc-hHHHHHH Q lcl|NC_020488. 347 HNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAI--------PGVDRPQRDMPASM-PAAELQL 417 (688) Q Consensus 347 ~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~-~~~~~~l 417 (688) +...++.+...+ + +.....+.+ +.++..+.. .++..+.+.++..+ +-..-++ T Consensus 347 ~Q~~~n~~~~~~--------------~----d~~~~~~~~-~~~~~~g~v~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~ 407 (705) T protein:vir:88 347 IQEIRSVLMRNI--------------M----DNIYRTNQG-RSVVLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQL 407 (705) T ss_pred HHHHHHHHHHHH--------------H----HHHHhccCC-ceeccccccCcccccccCCCeeEEecCCCccccccCCcC Confidence 333333322211 1 111111111 111111100 11111111111110 0001112 Q ss_pred HHHHHHHHHHHhCcChHHcCCCcc--hhhHHHHHHHHHHHHHHHHHHHHHH----HHHHHHH----HHHHHHHHHHHcCc Q lcl|NC_020488. 418 ALSATDEMKATIGLYDASVGAQGN--EQSGKAILARQRQGDRGTFAYIDNL----SRAIRRV----GQILIELIPRVYDS 487 (688) Q Consensus 418 l~~~~~~~~~~tGv~d~~~G~~~~--~~sg~ai~~~~~~~~~~~~~~~dn~----~~~~~~~----~~~~~~li~~~~~~ 487 (688) -+....+++.+...-....|...- +.++.+.....-++ .+..+...- ....+.+ .+.+..++.. 
T Consensus 408 ~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~--~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~---- 481 (705) T protein:vir:88 408 SGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAM--SVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHD---- 481 (705) T ss_pred cHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 222333333333333445554322 22333333222221 112222111 1111111 1222222222 Q ss_pred ceEEEEeccCC-CcceeeechhhhcccccceeeeccceeeeEEEEEe---------cccCcHHHHHHHHHHHHHHHHhhH Q lcl|NC_020488. 488 DRVLRLRFQDG-EGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVK---------AGPSYQTQRMEAADSLMQFVQAVP 557 (688) Q Consensus 488 ~r~~ri~~~~~-~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~---------~~~~~~s~r~~~~~~l~~~~q~~~ 557 (688) ++-... .+..+.+. |.| +.|+ +.+..+..-...-+.+..+...+ T Consensus 482 -----li~~~~~~~~~~ri~-------------------g~~-v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll- 535 (705) T protein:vir:88 482 -----HAIKYQNQEEVFQLR-------------------GKW-VAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIW- 535 (705) T ss_pred -----HHHHhCCCceEEeec-------------------cch-hccchHhhccCCceEEeeccccchHHHHHHHHHHHH- Confidence 222111 11222221 111 1221 11111111111111122221111 Q ss_pred HHHHHHHHHHHHhcCCccHHHHHHHHHhhccccccchhhHHhhh---hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 558 AAGGVVLDLIAKNMDWPGAQDIARRLQKTLPPGILDQDEMEEAG---IEPPQPSPEQQANMAQAQADMEKAKADTAKAQA 634 (688) Q Consensus 558 ~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~ 634 (688) .+.+.+.. ..++..+.....+.+.+++................ .++.+++.++.+...+++++..+++++++++++ T Consensus 536 ~~~q~l~~-~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~ 614 (705) T protein:vir:88 536 EMAQAVVG-GGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQS 614 (705) T ss_pred HHHHHhhc-ccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHH Confidence 11111111 01111222222222222222221111111111111 111111111111222223333345566666666 Q ss_pred HHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhhcCC Q lcl|NC_020488. 
635 DMAMAQAKT----AEAQAKLAEIEQAAMMAGPGSLEETVRNLVA----EAMAELMAQSQGNA 688 (688) Q Consensus 635 e~~~~q~~~----~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~----~a~~~~~~~~q~~~ 688 (688) ++.+++++. .+.++++++++.+... ..+.+.......+ +..+....+++..| T Consensus 615 e~~~~q~e~q~~q~E~q~~q~e~e~~~~~--~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~ 674 (705) T protein:vir:88 615 DALAKQAEAQMKQVEAQIRLAEIELKKQE--AVLQQREMALKEAELQLERDRFTWERARNEA 674 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654443332 2333333333222111 1111110000000 00000111111111 No 132 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.23 E-value=0.00012 Score=42.03 Aligned_cols=475 Identities=13% Similarity=0.060 Sum_probs=179.0 Q ss_pred CC-CCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHH Q lcl|NC_020488. 2 LP-GNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQ 80 (688) Q Consensus 2 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~ 80 (688) || .+- + -....+.... +..+..+......+|......+-...+.+++.+-.+..+.+-.-...+|.++.+++. T Consensus 1 m~~V~~----~-hp~y~~~~~~-W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~ 74 (501) T protein:vir:95 1 MPNVSF----I-RPELGKLLPL-YYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFG 74 (501) T ss_pred CCCCCC----C-CHHHHHHHHH-HHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHH Confidence 55 111 1 1111222222 222444444445555544433333456677766555555554446779999999999 Q ss_pred HHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHcCCceEEEEE Q lcl|NC_020488. 
81 VLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEGGFGWLRVLT 159 (688) Q Consensus 81 i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~G~G~~~v~~ 159 (688) .+|......|.+.+ -..|..++..+ .+-++++..+..++..++..|.+++=| T Consensus 75 l~G~vf~k~p~~~~-------------------------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV-- 127 (501) T protein:vir:95 75 LVGQVFMRDPVVKV-------------------------PALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLV-- 127 (501) T ss_pred HhhhhhcCCcceeC-------------------------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE-- Confidence 99999987665532 12344444444 344688899999999999999887544 Q ss_pred eeccCCCCCc------------ceeEEEecccceEEeCCccccccc-ccCceEEEEecCCHHHHHHhcCCccchhccccc Q lcl|NC_020488. 160 KYSTDDAFDL------------DLCIKSIHNRFAVLMDPDATEPDY-SDANWCFISERMSKAEFNKRYPGKAVGDLSDAE 226 (688) Q Consensus 160 ~~~~~~~~~~------------~~~~~~v~~~~~v~~Dp~a~~~d~-~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~ 226 (688) ||-... .++ .+-+..+ .|.+|+ ++.....+. ....++..+......+ -.|..+....+... T Consensus 128 D~P~~~-~~~~~t~a~~~~~~~rPy~~~~-~~~~Ii-nW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~~~~~~q~RvL- 201 (501) T protein:vir:95 128 DYPTTE-AEGGASIADLEAGRIRPTLYVY-SPTEII-NWRTTDRGAEEVLSLVVLFETWCAAD--DGFEMKTSGQFRVL- 201 (501) T ss_pred eecCCC-CcccccHHHHHhccCCcEEEEe-cHhhhc-CcceeccCCceeeeEEEEEEEEeecC--CCcccceeEEEEEE- Confidence 553211 111 1223333 343332 222111110 1122222221111000 00000000000000 Q ss_pred ccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhccc Q lcl|NC_020488. 227 RGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEG 306 (688) Q Consensus 227 ~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~ 306 (688) ..+.+...++++|....... .+|..+...... ....|. +.. 
T Consensus 202 ------~~~~~g~~~~~v~r~~~~~~-----~~~~~~~~~~~~---------------------~~~~~~-------~~~ 242 (501) T protein:vir:95 202 ------RLDEEGYYVHEIWREPQPTK-----ADGSKIPKGNYQ---------------------QYVVYK-------PTD 242 (501) T ss_pred ------eeCCCceEEEEEEEecCCcc-----cCcceecCCccc---------------------ccceee-------eec Confidence 00011112233333221100 001000000000 000000 001 Q ss_pred CCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccC Q lcl|NC_020488. 307 PVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRK 386 (688) Q Consensus 307 ~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 386 (688) ..--+.+.+|||++... ..+-..+....-.+-++...+=...|-..+++..+..|...+ .|. ++.+...... T Consensus 243 ~g~~~l~~IPfv~~~~~---~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i-~G~----~~~~~~~~~~ 314 (501) T protein:vir:95 243 AQGKRLTEIPFMFIGSE---NNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVL-IGL----TEEWVTNVLK 314 (501) T ss_pred cCCCcCCeeeEEEEecC---CCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeee-eCC----cccccccCCC Confidence 11112355666644222 122222222233444443332112233555666666665544 222 2222211111 Q ss_pred CCceeecCcc-----cccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 387 NQSVLRYNAI-----PGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFA 461 (688) Q Consensus 387 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~ 461 (688) ++ +..++. +.+..+.++.+....- ..+.|+...+.|..+ |.. .+...+.+.||++......+....|.. 
T Consensus 315 ~~--i~~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~-Ga~--ll~~~~~~~Ta~~~~~~~~~~~S~L~~ 388 (501) T protein:vir:95 315 GS--VNFGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVAL-GAK--LVEQKEVQRTATEAELEAASEGSTLSS 388 (501) T ss_pred Cc--eeecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHH-HHh--hccCCccchhHHHHHHHHHHHhHHHHH Confidence 11 111111 0111234444322111 133344444444333 432 223333457888888887777778888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHH Q lcl|NC_020488. 462 YIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQ 541 (688) Q Consensus 462 ~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~ 541 (688) +..|++.++.. ++.++..|.. .+...--|.+|+ +|+.. .-.+. T Consensus 389 ~a~~le~al~~----~l~~~a~w~g---------~~~~~~~v~i~~-------------------df~~~----~~~~~- 431 (501) T protein:vir:95 389 ATKNVSAAFEW----ALKWAARWVG---------QADSGVKFELNT-------------------DFDIA----RMTPD- 431 (501) T ss_pred HHHHHHHHHHH----HHHHHHHHcC---------CCCCceEEEEec-------------------ccccc----cCCHH- Confidence 88888877655 5556666642 111110123322 11100 00111 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCcc--HHHHHHHHHhhccccccchhhHH----hhhhhhhhhhHH Q lcl|NC_020488. 542 RMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPG--AQDIARRLQKTLPPGILDQDEME----EAGIEPPQPSPE 610 (688) Q Consensus 542 r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~--~~ei~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 610 (688) ..++|.++.+. ..+....+...+...++.. .+...+++............... ........ 
..+ T Consensus 432 ---~~~al~~~~~~-G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~-~~~ 501 (501) T protein:vir:95 432 ---ERRSLVEEWQK-GAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVG-NSE 501 (501) T ss_pred ---HHHHHHHHHhC-CCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCccccccc-CCC Confidence 12333333221 0111111122233334443 23333333322111111000000 00000000 000 No 133 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=97.17 E-value=0.00014 Score=41.67 Aligned_cols=616 Identities=12% Similarity=0.011 Sum_probs=167.5 Q ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHHHHHHHHHHh--CCcceE Q lcl|NC_020488. 16 QEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYVDQVLGDQRQ--NRPAIQ 93 (688) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i~~i~g~~~~--~r~~~~ 93 (688) +.+.+.+++.++........++..+|++... ++....-..| |...+-+.+++-...+ .||-+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---------~D~~f~~~~G------~QW~~~~~~~l~~~~q~~grP~~~ 65 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCI---------EATRFARVPG------GQWEGATAAGTKLDEQFEKYPKFE 65 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhhcCCC------CCCCHHHHHHHHHhhhhcCCCceE Confidence 7778888877777666666666666654322 1111111123 2333333444332222 233221 Q ss_pred --EEeCCcc----ccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCC Q lcl|NC_020488. 94 --VHPVEAN----ATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAF 167 (688) Q Consensus 94 --v~pr~~~----~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~ 167 (688) ..+..-+ ....+.. --.+.|.+.+.-.-+.++++.++..---......+..++..+++.+=.-+++...+... 
T Consensus 66 ~N~i~~~v~~v~g~~~~nr~-d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~ 144 (708) T protein:vir:10 66 INKVATELNRIIAEYRNNRI-TVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVN 144 (708) T ss_pred EcchHHHHHHHHHHHHhCCc-ceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeecccc Confidence 1111000 0000000 11122334442244667777766555455556667777777776431112221111111 Q ss_pred CcceeEEEecccceE----EeCCcccccccccCceEEEEecCCHHHHHHhc------CCccchhcccccccccccCCCCC Q lcl|NC_020488. 168 DLDLCIKSIHNRFAV----LMDPDATEPDYSDANWCFISERMSKAEFNKRY------PGKAVGDLSDAERGEYSWWTNEE 237 (688) Q Consensus 168 ~~~~~~~~v~~~~~v----~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~~------p~~~~~~~~~~~~~~~~~~~~~~ 237 (688) +.++. .++..+ .+||. ...-|=...+..+.++.+-.| .+.....++......+++.... T Consensus 145 e~d~~----~~~~~i~i~~~~~p~------~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~- 213 (708) T protein:vir:10 145 EYDPM----DDRQRIAIEPIYDPS------RSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMT- 213 (708) T ss_pred ccCCC----CCccccceEEeecch------hhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCC- Confidence 11110 011111 12220 000010111112333322111 0111111111111111111100 Q ss_pred EEEEEEEEeeee---cceeeeeccCCc-----eecc-c--------ccchHHHHHHHhhhhhhheeeeeEEEEEEEEEch Q lcl|NC_020488. 238 GVRVSEYFYREP---VTRKLLLLSDGR-----TVWE-D--------EVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTA 300 (688) Q Consensus 238 ~v~v~e~~~~~~---~~~~~~~~~~g~-----~~~~-~--------~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~ 300 (688) -|+..+ ....+..+=..+ .++. + +...........-. 
......+.++++..+.+.- T Consensus 214 ------~~~~~~~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~-~~g~~~~~~r~~~r~~v~~ 286 (708) T protein:vir:10 214 ------SWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELA-IAGFHEVARRSVKRRRVYV 286 (708) T ss_pred ------CccccccCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHH-hcccchhheeeeeeEEEEE Confidence 011111 111111000000 0000 0 00000000000000 0000011111111111111 Q ss_pred hhhcccCCCC-CCCccc--eEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHH-HHhcCCCceeechhhhcch Q lcl|NC_020488. 301 YDVLEGPVDW-PGSTIP--VAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATER-VALAPKAPWVAPAESIEGY 376 (688) Q Consensus 301 ~~ile~~~p~-~~~~~P--~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~-~~~~~~~~~~~~~~~i~~~ 376 (688) ..+. +..-. ..+.+| ++|+++++-.... ..|......++..=+-.=.+.++..-- +...+..+....-...... T Consensus 287 ~~~~-g~~~le~~~~~p~~~fP~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i 364 (708) T protein:vir:10 287 SVVD-GDGFLEKPRRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQI 364 (708) T ss_pred Eeec-chhhhccCCCCCCCceeeEEEeeeeec-cCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhh Confidence 1111 00000 111222 3344433211100 011111111111101111111110000 0000000000000000111 Q ss_pred HHHHhhcccCCCceeecCccccccc-ceecCCC-------cchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHH Q lcl|NC_020488. 377 EEEWNQANRKNQSVLRYNAIPGVDR-PQRDMPA-------SMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAI 448 (688) Q Consensus 377 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-------~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai 448 (688) .....++...+...-.+-....... ...+... 
+.+.-...+++.......++.-++..+.+..+ ..|++ T Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG-~~sn~-- 441 (708) T protein:vir:10 365 RGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPSNI-- 441 (708) T ss_pred hhHHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHcc-Cccch-- Confidence 1111111111110000000000000 0000000 11122223555555555555544333222222 23332 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeec Q lcl|NC_020488. 449 LARQRQGDRGTFAYIDNLSRA-------IRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVN 521 (688) Q Consensus 449 ~~~~~~~~~~~~~~~dn~~~~-------~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~n 521 (688) |...+....+.-... ++.-.+.+.+++..+.. .+. ..++.+.|-...- +-..+.+| T Consensus 442 ------SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~-----~~y---~~er~~RI~~edg---~~~~v~in 504 (708) T protein:vir:10 442 ------AQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR-----EVY---GSEREVRIVNEDG---SDDIAVLS 504 (708) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEEEecCCC---CcceEEec Confidence 122222222222222 22222333334433321 111 2233444421110 01222222 Q ss_pred ----cceee----eEEEEE---ecccCcHHHH-HHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC----ccHHHHHHHHHh Q lcl|NC_020488. 522 ----DIAAG----KFDVTV---KAGPSYQTQR-MEAADSLMQFVQAVPAAGGVVLDLIAKNMDW----PGAQDIARRLQK 585 (688) Q Consensus 522 ----di~~~----~~dv~v---~~~~~~~s~r-~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~----~~~~ei~~~~~~ 585 (688) |...| ..|+++ ++..+..... ....+.+..+++.++.+.+. 
.+.....+++ -.....-+.+.+ T Consensus 505 ~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~-~~~~~~~~~~~l~~~D~p~~~ei~er 583 (708) T protein:vir:10 505 AQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPT-DPMRPAIQGIILDNIDGEGLDDFKEY 583 (708) T ss_pred ceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCC-chhhHHHHHHHHHhcCCcChHHHHHH Confidence 33333 355544 4443333333 33333333333333333221 0111111110 000010111111 Q ss_pred hccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 586 TLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSL 665 (688) Q Consensus 586 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~ 665 (688) ......+........+.++++.++.+++++++.++++.+++++..+.|+++.++++++.+.+++..+.+..+...+.++. T Consensus 584 ir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~ 663 (708) T protein:vir:10 584 NRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTV 663 (708) T ss_pred HHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111112222222223333333334444444445555555566555555555554444443333333222222 Q ss_pred HHHHHH-----HHHHHHHHHHHHhhcCC Q lcl|NC_020488. 
666 EETVRN-----LVAEAMAELMAQSQGNA 688 (688) Q Consensus 666 ~~~~~~-----~~~~a~~~~~~~~q~~~ 688 (688) +....+ ....+.++.+...|+.+ T Consensus 664 q~~~~a~~~~~~~~~~~~q~l~~~q~~q 691 (708) T protein:vir:10 664 YKLAQARNIDDKAVMEAIRLLKDVAESQ 691 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 111110 11112233333333333 No 134 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=97.14 E-value=0.00016 Score=41.47 Aligned_cols=599 Identities=14% Similarity=0.063 Sum_probs=178.4 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHH----------H-------HHHHHhhCC------CCCCHHHHHHH--H Q lcl|NC_020488. 8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDA----------A-------QEDISFLAG------EQWPESVRKER--E 62 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~----------~-------~~~~~~~~G------~Qw~~~~~~~~--~ 62 (688) |+.+..+..++++..+........+++..++.. | .+...=..| |.-.......+ + T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 778888888888887776555444443333221 1 111110112 11112222222 2 Q ss_pred hcCCCceee----hh----HHHHHHHHHHHHHhC-CcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHh Q lcl|NC_020488. 63 DEGRPCLTL----NK----LPQYVDQVLGDQRQN-RPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYT 133 (688) Q Consensus 63 ~~g~p~~~~----N~----i~~~i~~i~g~~~~~-r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~ 133 (688) .+.|+-+.+ +. +.-.++.++.+.... +. +....++....+. 
T Consensus 81 ~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~-------------------------~~a~s~Af~d~i~----- 130 (706) T protein:vir:10 81 RNNRISVKFRPGDNAASEELANKLNGLFRADYEETDG-------------------------GEACDNAFDDAAT----- 130 (706) T ss_pred HhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCc-------------------------hHHHHHHHHHHhh----- Confidence 233332221 11 222222222222110 00 1111111111111 Q ss_pred cChHHHHHHHHHHHH-----HcCCceE----------EEEEeeccCCC--CCcce-eEEEecccce---EEeCCcccccc Q lcl|NC_020488. 134 SNAEAHYDNAFQHAV-----EGGFGWL----------RVLTKYSTDDA--FDLDL-CIKSIHNRFA---VLMDPDATEPD 192 (688) Q Consensus 134 ~~~~~~~~~~~~d~~-----~~G~G~~----------~v~~~~~~~~~--~~~~~-~~~~v~~~~~---v~~Dp~a~~~d 192 (688) +... +.+...|.. ..+.+-+ .|++|+..... -+-.. .+.+..+... +|++..+...+ T Consensus 131 ~G~G--~~ev~~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~ 208 (706) T protein:vir:10 131 GGFG--CFRLTTSFVNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDR 208 (706) T ss_pred cCcc--eEEeeeccccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhh Confidence 0000 000100000 0111111 13333221100 01011 0111111111 13221110000 Q ss_pred cccCceEEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHH Q lcl|NC_020488. 193 YSDANWCFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVL 272 (688) Q Consensus 193 ~~Da~~~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 272 (688) ..++.|. ..|.+. ...-..+ ++....+.+..+||+.+++..+.....+...+....... T Consensus 209 ~~~~~~~--~d~~~~---------------d~~~~~e---yy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~- 267 (706) T protein:vir:10 209 VGSVSWQ--YDWFTP---------------DVVYIAK---YYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQ- 267 (706) T ss_pred hcccccc--ccccCC---------------Ccceecc---cccccceeEEEEEeeccccCCceeeccchhhhhHHHHhh- Confidence 0011110 011110 0111111 223344555667999888776655444333222111000 Q ss_pred HHHHHhhhhhhheeeeeEEEEEE--EEEchhhhcccCCCCCCCccceEEEeeeeec---cCCcccccchHHHhhHHHHHH Q lcl|NC_020488. 
273 DELRDLGTTVTRERRVKTYKVKW--MKVTAYDVLEGPVDWPGSTIPVAPVLGKEMV---IGDKTYYRGLIRFGKDAQRMH 347 (688) Q Consensus 273 ~~~~~~g~~~~~~~~~~~~~v~~--~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~---~~~~~~g~g~v~~~~d~Q~~~ 347 (688) ..........+..+++....+.- ..........+..|| +||+.+..+... ..|-+. .+.+.=...-... T Consensus 268 ~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~----vP~~g~r~~~d~~~~~~G~vr--~~~d~Q~~~N~~~ 341 (706) T protein:vir:10 268 AGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPL----IPVYGKRWFIDDVERVEGHIA--KAMDPQRLYNLQV 341 (706) T ss_pred CCchhhhhcccceeeEEEEeeccccccccCCCCCCCccce----EEEeeccccccccCcccceec--cchhhHHHHHHHH Confidence 01111111122223332221111 011112222332232 233333221110 011111 2221111111112 Q ss_pred HHHHHHHHHHHHhcCCCceeechhh---hc-----chH-HHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHH Q lcl|NC_020488. 348 NYWMTAATERVALAPKAPWVAPAES---IE-----GYE-EEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLA 418 (688) Q Consensus 348 N~~~s~~~~~~~~~~~~~~~~~~~~---i~-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll 418 (688) ++++..+...-.....+..-.-++. .. ... -.++.....+|.++......+ .+.....++-...++++. T Consensus 342 s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~--~~~~~~~~~~~~~l~~~~ 419 (706) T protein:vir:10 342 SMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAG--YTQAPVLNQALAALLQQT 419 (706) T ss_pred HHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccccCCCCcccccccccc--cCCCcchHHHHHHHHHHH Confidence 2221111111000000000000000 00 000 012222333444433332221 222223333334444444 Q ss_pred HHHHHHH----HHHhCcChHHcCCCcch---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHH Q lcl|NC_020488. 419 LSATDEM----KATIGLYDASVGAQGNE---QSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIEL--------IPR 483 (688) Q Consensus 419 ~~~~~~~----~~~tGv~d~~~G~~~~~---~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~l--------i~~ 483 (688) ....+.+ ....|...-..|..-+. .+.++...... .+....++.-+.+-.++.+. |.. 
T Consensus 420 ~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~D-------nl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ 492 (706) T protein:vir:10 420 SADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYLD-------NMAKSLKRAGEIWLSMAREIYGSDREVRIVH 492 (706) T ss_pred HHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 4433333 33445543333432211 11111111111 11111111111111222211 111 Q ss_pred HcCcceEEEEecc---CCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH----Hhh Q lcl|NC_020488. 484 VYDSDRVLRLRFQ---DGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV----QAV 556 (688) Q Consensus 484 ~~~~~r~~ri~~~---~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~----q~~ 556 (688) --..++++.|... ..+...+.+|.. ....-||.+ +.. ...++.....-+.+..|.+.+ +.. T Consensus 493 ed~~~~~v~in~~~~d~~~G~~~~~nDi--------~~g~yDv~i---~~~-p~~~t~r~~~~~~m~el~~~~~p~~~~~ 560 (706) T protein:vir:10 493 EDGTDDIALMNAAVLDNQTGRVVALNDL--------STGRYDVSV---DVG-PSYSARRDATVNALTQLLQGMLPQDPMR 560 (706) T ss_pred CCCCccceeeccceeccccCceeeeecc--------eeeeEEEEE---ecc-cCcchHHHHHHHHHHHHHHhcCCcchhh Confidence 1234566666431 223334444432 122235542 222 233454444444444444322 234 Q ss_pred HHHHHHHHHHHHHhcCCc-cHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHH-------HHHHHHHHHHHHHHHHH Q lcl|NC_020488. 557 PAAGGVVLDLIAKNMDWP-GAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQ-------QANMAQAQADMEKAKAD 628 (688) Q Consensus 557 ~~~~~~~~~~~~e~~~~~-~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------q~~~~~~q~~~~~~q~e 628 (688) +.+...++.. +..-+.. 
.++.+...+..+....+..+.+++..+++++.++.++ +++..+.|+++++++++ T Consensus 561 ~~l~~~~~~~-~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a~ 639 (706) T protein:vir:10 561 PALMGIIIDN-MEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQNE 639 (706) T ss_pred HHHHHHHHhh-cCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4443333321 1111111 1122332232222222222222222211111111222 23333444555555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_020488. 629 TAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEAMAELMAQSQGNA 688 (688) Q Consensus 629 ~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~~~q~~~ 688 (688) ..+.+.+..+++.+..+.++.......++.....++..+.++. ++...+.+.+++-+.+ T Consensus 640 ~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~-l~~~~a~q~~~~~~~~ 698 (706) T protein:vir:10 640 TVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRL-LKEVAASQQQTIPSPP 698 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhccCCCCCCC Confidence 5555555544444444444443333333333333344444433 3344444445555555 No 135 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.89 E-value=0.00028 Score=40.10 Aligned_cols=478 Identities=12% Similarity=0.039 Sum_probs=170.0 Q ss_pred CCCCCCCcCCCCccchH------HHHHHHHHH---HHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceee Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQE------AILQEIRER---AAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTL 71 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~---~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~ 71 (688) +.|++.|--...-.++. ..+..++.. 
+..+......++....+.+-...+.+-+.+-.+..+.+-.-...+ T Consensus 17 ~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~ 96 (535) T protein:vir:80 17 LIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFY 96 (535) T ss_pred ccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCC Confidence 66666552222222222 112222222 222222222222222211111122222222222233333335668 Q ss_pred hhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHHHHc Q lcl|NC_020488. 72 NKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHAVEG 150 (688) Q Consensus 72 N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~~~~ 150 (688) |.++.+++..+|......|.+.+ -..|..++..+ .+-++++..+..++..++.+ T Consensus 97 n~~~~tl~~l~G~vfrk~p~~~~-------------------------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~ 151 (535) T protein:vir:80 97 NVTARTLDGMMGQVFSRDPIRQL-------------------------PPALEAIVEDIDGEGVSLDQQAKKALGYTMGF 151 (535) T ss_pred ChhHHHHHHHhchhhcCCcceec-------------------------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhc Confidence 99999999999998877655432 12344444443 34467889999999999999 Q ss_pred CCceEEEEEeeccCCCC---------CcceeEEEecccceEEeCCcccccc-cccCceEEEEecCCHHHHHHhcCCccch Q lcl|NC_020488. 151 GFGWLRVLTKYSTDDAF---------DLDLCIKSIHNRFAVLMDPDATEPD-YSDANWCFISERMSKAEFNKRYPGKAVG 220 (688) Q Consensus 151 G~G~~~v~~~~~~~~~~---------~~~~~~~~v~~~~~v~~Dp~a~~~d-~~Da~~~~~~~~~~~~e~~~~~p~~~~~ 220 (688) |.+++=| ||-..... .-.+-+..+ .|.+|+ ++.....+ .....++..+..... 
T Consensus 152 G~~~iLV--D~P~~~~~~t~ade~~~~~rPy~~~y-~ae~Ii-nW~~~~v~G~~~Lt~v~lrE~~~~------------- 214 (535) T protein:vir:80 152 GRAAIFT--DYPNVGRPVTVLEQKLGLYRPTITLV-HPTSII-NWRTKLVGGKSVISLVVIQENVLA------------- 214 (535) T ss_pred CeEEEEE--eecCCCCcccHHHHHhcCCCcEEEEe-chhhcc-CccccccCCccceeEEEEEEEEEe------------- Confidence 9887555 44221110 011223333 344332 22222111 111222222221100 Q ss_pred hcccccccccccCCCCCEEEEEEE----EeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEE Q lcl|NC_020488. 221 DLSDAERGEYSWWTNEEGVRVSEY----FYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWM 296 (688) Q Consensus 221 ~~~~~~~~~~~~~~~~~~v~v~e~----~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 296 (688) ..+.+. ......+||.+. +|.. ..+.....++...... .++ T Consensus 215 -----~dd~f~-~~~~~q~RvL~~~~~G~y~v--~~~~~~~~~~~~~~~~---------------------------~~~ 259 (535) T protein:vir:80 215 -----QDDGFE-TTYVQQWRVLQLNAEGNYQV--ERWRRETQEEMYYSYS---------------------------KHV 259 (535) T ss_pred -----cCCCcc-cceeEEEEEEEecCCceEEE--EEEEeecCCccccccc---------------------------eee Confidence 000110 001112333222 0110 0000000000000000 000 Q ss_pred EEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcch Q lcl|NC_020488. 297 KVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGY 376 (688) Q Consensus 297 ~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~ 376 (688) ....+.. +.+.+|||+|.. ...+...+......+..++..+=...|-..+++..+..|...+. |. T Consensus 260 -----~~~~g~~--~l~~IPfv~~~~---~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~-G~---- 324 (535) T protein:vir:80 260 -----PTDGNGN--PFKEIPFQFIGP---LDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFT-GL---- 324 (535) T ss_pred -----cccCCCc--ccCeeEEEEeec---CCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeee-cC---- Confidence 0001111 235556654322 22233344444567777766655555666677777776655443 22 Q ss_pred HHHHhhcccCC-------CceeecCcccccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHH Q lcl|NC_020488. 
377 EEEWNQANRKN-------QSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAIL 449 (688) Q Consensus 377 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~ 449 (688) ++.|......+ ...+.+.. .+..++..+.+..++.. .++...+.|.. .|..- +...+.+.|+.+.. T Consensus 325 ~~~~~~~~~~~~~i~iG~~~~~~lP~-~~~~~~~e~~~~~~a~~---~l~~~e~qM~~-lGa~l--l~~~~~~~Ta~~a~ 397 (535) T protein:vir:80 325 TKDWVEDVFKDFKVHLGSRAIIPLPQ-GATAGILQITPNSVPFE---AMTHKESQMIA-MGANL--LVKSGGNRTFGEAQ 397 (535) T ss_pred chhhhhcCCCCcceEecCcccccCCC-CCCcceeeeccchhHHH---HHHHHHHHHHH-HHHHh--hccCcccccHHHHH Confidence 22221111111 11222222 12223344444455433 23333344433 23322 22223334444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEE Q lcl|NC_020488. 450 ARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFD 529 (688) Q Consensus 450 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~d 529 (688) ....+..-.|..+..|++.++.. ++.++..|... .. ++..--+.+|. | |. T Consensus 398 ~~~~~~~S~L~~~a~~le~al~~----aL~~~A~w~G~-----~~--~~~~~~i~~n~--------------d-----F~ 447 (535) T protein:vir:80 398 QEEASEQSILSACTKNVSMAFRK----ALRWANQFQTG-----IV--NDETVEYNLNT--------------D-----FP 447 (535) T ss_pred HHHHHHhHHHHHHHHHHHHHHHH----HHHHHHHHcCC-----cc--CCCceEEEecc--------------c-----cc Confidence 44444445577777777777655 55566666421 00 00000012221 1 10 Q ss_pred EEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCc--c--HHHHHHHHHhh----ccccccchhhHHhhh Q lcl|NC_020488. 530 VTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWP--G--AQDIARRLQKT----LPPGILDQDEMEEAG 601 (688) Q Consensus 530 v~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~--~--~~ei~~~~~~~----~~~~~~~~~~~~~~~ 601 (688) ......+..+++.++.+. ..+....+...++..++. . .++...++... 
....+.........+ T Consensus 448 --------~~~ld~~~~~all~~~~~-G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~ 518 (535) T protein:vir:80 448 --------AARLTPNERAELILEWQQ-GAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGT 518 (535) T ss_pred --------cccCCHHHHHHHHHHHhc-CCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCC Confidence 001111122223333221 001111111112222321 1 13333333221 111111110000000 Q ss_pred hhhh-h-hhHHHHHHHH Q lcl|NC_020488. 602 IEPP-Q-PSPEQQANMA 616 (688) Q Consensus 602 ~~~~-~-~~~~~q~~~~ 616 (688) ...+ - ...-..+.-. T Consensus 519 ~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 519 NKAKLNNGNGGGNQAGN 535 (535) T ss_pred CcCcccCCccccccCCC Confidence 0000 0 0000000000 No 136 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=96.77 E-value=0.00035 Score=39.54 Aligned_cols=584 Identities=12% Similarity=0.037 Sum_probs=158.6 Q ss_pred CCCCCC----CcCCCCccch-HHHHHHHHHHHHHHHHh------hhHHHHHHHHHHHhhC---CCCCCHHHHHHHHhcCC Q lcl|NC_020488. 1 MLPGNE----PIKTRDDDSQ-EAILQEIRERAAHAVTC------WKHNFDAAQEDISFLA---GEQWPESVRKEREDEGR 66 (688) Q Consensus 1 ~~~~~~----~~~~~~~~~~-~~~~~~~~~~~~~~~~~------~~~~r~~~~~~~~~~~---G~Qw~~~~~~~~~~~g~ 66 (688) .-|+.. +.-.++--.. +-++..+.+-|...-+. ..+--+.|.....|++ -.+- .|. T Consensus 63 ~~~~~~~grs~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~----------~~~ 132 (763) T protein:vir:95 63 AKPPKVKGRSQVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKL----------NRV 132 (763) T ss_pred CcccccCCCccccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcC----------chh Confidence 222222 1111110000 00111111111110000 0011111221111111 0000 010 Q ss_pred CceeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HH----hcChHHHHH Q lcl|NC_020488. 
67 PCLTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EY----TSNAEAHYD 141 (688) Q Consensus 67 p~~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~----~~~~~~~~~ 141 (688) -++.|.++..+..-+|..+ .-|.+.-+.... ....+..+...-.+.+...-...... .+ .-+.+.... T Consensus 133 -~~~~~~~~~~l~~~~gv~k---~~W~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (763) T protein:vir:95 133 -SFIDNYVRSVVDDGTGIVR---VGWNREIRKEKQ---EVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIK 205 (763) T ss_pred -hHHHHHHHHHhhcCcceEE---Eeeeeeeeeeee---eehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhh Confidence 0112222222222222111 001100000000 00000000000011111111111111 11 112233333 Q ss_pred HHHHHHHHcCCc---------eEEEEEe--------eccCCCCCcceeEE-EecccceEEeCCcccccccccCceEEEEe Q lcl|NC_020488. 142 NAFQHAVEGGFG---------WLRVLTK--------YSTDDAFDLDLCIK-SIHNRFAVLMDPDATEPDYSDANWCFISE 203 (688) Q Consensus 142 ~~~~d~~~~G~G---------~~~v~~~--------~~~~~~~~~~~~~~-~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~ 203 (688) .+.......|.+ .+.+... ..+...+-.++.+. ++.+..-+++.-.-+.-|+-+..|-. . T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y--~ 283 (763) T protein:vir:95 206 ESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRY--H 283 (763) T ss_pred hhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCc--c Confidence 333333343433 2222111 11111111111111 12222111221111111221111111 1 Q ss_pred cCCH---H--HHHHh---c---CCc-cchh--ccccccccccc--CCCCCEEEEEEEEeeeecceeeeeccCCceecccc Q lcl|NC_020488. 204 RMSK---A--EFNKR---Y---PGK-AVGD--LSDAERGEYSW--WTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDE 267 (688) Q Consensus 204 ~~~~---~--e~~~~---~---p~~-~~~~--~~~~~~~~~~~--~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~ 267 (688) +++. + ..... + |+. ...+ .......++.. ..+.+.+ .+ ||+ ++.. .+.++.... 
T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~--~~-~~~------v~~~-g~~iL~~~~ 353 (763) T protein:vir:95 284 NLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGV--LE-PIV------ATWI-GSTLIRLEK 353 (763) T ss_pred ccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccCCcce--eE-EEE------EEEE-cCeeeeccc Confidence 1000 0 00000 0 000 0000 00011111111 1122221 11 111 1111 122222111 Q ss_pred cchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHH Q lcl|NC_020488. 268 VKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMH 347 (688) Q Consensus 268 ~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~ 347 (688) .+. ..+..| +-.||++|.... ..|..++.-+.+.=...-... T Consensus 354 ~p~---------------------------------~~~~~P--Fv~~~~~p~~~~---~~G~gi~~~~~d~Qr~~N~~~ 395 (763) T protein:vir:95 354 NPY---------------------------------PDGKLP--FVLIPYMPVKRD---MYGEPDAELLGDNQAVLGAVM 395 (763) T ss_pred ccc---------------------------------cCCCcC--EEEecceeecCc---ccCCchHHHhhHHHHHHHHHH Confidence 110 001122 233455554332 234444444444333333333 Q ss_pred HHHHHHHHHHHH---hcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHH Q lcl|NC_020488. 348 NYWMTAATERVA---LAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDE 424 (688) Q Consensus 348 N~~~s~~~~~~~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~ 424 (688) |..+..+.-..+ ...++.+ ...+.+...-..... .+ ++...... . ......+.++-+...+++++...+. 
T Consensus 396 ~~~~d~l~~~~~~~~~v~~gav-~~~d~~~~~pg~v~~-v~-~g~~~~~~-~---~~~~~p~~~~~~~~~l~~~~~~~e~ 468 (763) T protein:vir:95 396 RGMIDLLGRSANGQRGMPKGML-DALNSRRYREGEDYE-YN-PTQNPAQM-I---IEHKFPELPQSALTMATLQNQEAES 468 (763) T ss_pred HHHHHHHHhhcCCcEEeecccc-cchhhhcccCCceEE-ee-CCCChhhh-c---ccccCCCCcchHHHHHHHHHHHHHH Confidence 433333222111 1122222 222222110000000 01 11111110 1 1111124466677778877777776 Q ss_pred HHHHh----CcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCcceEEEEecc Q lcl|NC_020488. 425 MKATI----GLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRV----YDSDRVLRLRFQ 496 (688) Q Consensus 425 ~~~~t----Gv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~----~~~~r~~ri~~~ 496 (688) +..++ |++....|...++.++. .++...+...-+..+.+.++...+.+..++.....+- .+.+..+.|..+ T Consensus 469 ~TGv~~~~~G~~~~~~~~tat~v~~l-~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~ 547 (763) T protein:vir:95 469 LTGVKAFAGGVTGESYGDVAAGIRGV-LDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKRE 547 (763) T ss_pred hhCcchhhcCcCcccccchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHH Confidence 65544 88877788777777775 4444555666677888888877788888887753321 111233444433 Q ss_pred CCCcce-eeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-Hhh-HHHHHHHHHHHHHhcCC Q lcl|NC_020488. 497 DGEGDW-VQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAV-PAAGGVVLDLIAKNMDW 573 (688) Q Consensus 497 ~~~~~~-v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~-~~~~~~~~~~~~e~~~~ 573 (688) +-..+| |.+.. + ... .. 
..+.+.+..|.+++ +.+ +.+...++..+.++.++ T Consensus 548 ~~~~~~DV~V~~------------------~-----~as-~~--~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~ 601 (763) T protein:vir:95 548 DLKGNFDLEVDI------------------S-----TAE-VD--NQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRM 601 (763) T ss_pred HhcCCcceEEec------------------c-----cch-HH--HHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhch Confidence 222222 11110 0 000 01 12444445454443 222 33334455555555555 Q ss_pred ccHHHHHHHHHhhccccccchhhHHhhhhhhhhhhHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHH--HH Q lcl|NC_020488. 574 PGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANM-------AQAQADMEKAKADTAKAQADMAMAQAK--TA 644 (688) Q Consensus 574 ~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~-------~~~q~~~~~~q~e~~~~q~e~~~~q~~--~~ 644 (688) +...+-.+.......+....+.+.+..+++........++++ ...|++..+++++.++.+..+ ..+.+ .. T Consensus 602 ~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~-~~e~~~~~~ 680 (763) T protein:vir:95 602 PKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKH-ARDLEKMKA 680 (763) T ss_pred hhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 544333332222111111111111111111111111111111 111222222222211111111 01111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhcCC Q lcl|NC_020488. 645 EAQAKLAEIEQAAMMAGPGSLEETVRNLVAEA--MAELMAQSQGNA 688 (688) Q Consensus 645 ~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a--~~~~~~~~q~~~ 688 (688) +.+++++.....++.++...+.-.. .++.+ ....-....++. 
T Consensus 681 ~~eaq~~l~~~~a~~~~~~ea~~~~--~~~~~~~~~~~~~~~~~~~ 724 (763) T protein:vir:95 681 QSQGNQQLEITKALTKPRKEGELPP--NLSAAIGYNALTNGEDTGI 724 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCh--hHHHhhhhcccccccCCCc Confidence 2222221111222222111111000 00100 001111111111 No 137 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=96.26 E-value=0.00081 Score=37.52 Aligned_cols=462 Identities=11% Similarity=0.038 Sum_probs=176.2 Q ss_pred CCCCCCcCCCCccchHHHHHHHHHHH---HHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeehhHHHHH Q lcl|NC_020488. 2 LPGNEPIKTRDDDSQEAILQEIRERA---AHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLNKLPQYV 78 (688) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N~i~~~i 78 (688) ||-.++.- -... -......+..+ ..+.......|.... .|.. +|+.+.....+.+-.-...+|.++.++ T Consensus 1 m~~~~~~~-v~~~--h~~y~a~~~~W~~ird~~~G~~~~r~~g~---~YLP--k~~~E~~~~Y~~rl~rA~~~n~~~~tl 72 (513) T protein:vir:97 1 MADKDPKS-PATT--SGAYDQMLPRWHVIETLLGGTEAMREAGE---TYLP--RHQEETDKGYQERLASAVLLNMVEQTL 72 (513) T ss_pred CCCCCCCC-CCcC--CHHHHHHHHHHHHHHHHhcChHHHHhhcc---cCCC--CCCCCCHHHHHHHHhcccCCChHHHHH Confidence 33222110 0000 01111111111 112211122221111 1221 333333344444444456689999999 Q ss_pred HHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHH-HHHHHH-HHhcChHHHHHHHHHHHHHcCCceEE Q lcl|NC_020488. 79 DQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYE-SLIRNI-EYTSNAEAHYDNAFQHAVEGGFGWLR 156 (688) Q Consensus 79 ~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~-~~i~~~-~~~~~~~~~~~~~~~d~~~~G~G~~~ 156 (688) +..+|......|.+. ......+. 
.++..+ .+-++++.-+..++..++..|.+++= T Consensus 73 ~~l~G~vf~k~p~~~-----------------------~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~il 129 (513) T protein:vir:97 73 DTLSGKPFSEPIKLN-----------------------EDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVL 129 (513) T ss_pred HHHhhhhhhcCcccC-----------------------cCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEE Confidence 999999987655431 01112222 233332 34577889999999999999987644 Q ss_pred EEEeeccCCCC-Ccc-------------eeEEEecccceEEeCCcccccccc-cCceEEEEecCCHHHHHHhcCCccchh Q lcl|NC_020488. 157 VLTKYSTDDAF-DLD-------------LCIKSIHNRFAVLMDPDATEPDYS-DANWCFISERMSKAEFNKRYPGKAVGD 221 (688) Q Consensus 157 v~~~~~~~~~~-~~~-------------~~~~~v~~~~~v~~Dp~a~~~d~~-Da~~~~~~~~~~~~e~~~~~p~~~~~~ 221 (688) .||-..... +++ +-+..+ .|.+|+ ++.....+.. ...++..+.... T Consensus 130 --VD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~-~~e~Ii-nW~~~~v~G~~~L~~v~l~E~~~--------------- 190 (513) T protein:vir:97 130 --IDMPRPAPREDGQPRTLADDRREGLRPYWVMI-KPECLL-FARSEVINGVEVLQHVRIIEHYM--------------- 190 (513) T ss_pred --EecCCCCCccchhHHhHHHHHhhccCceEEEe-cHhhhc-CcceeccCcceeeeeEEEEEEEe--------------- Confidence 455332111 111 112222 233322 2222111100 111111111000 Q ss_pred cccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchh Q lcl|NC_020488. 222 LSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAY 301 (688) Q Consensus 222 ~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~ 301 (688) ..+. ...+.+..+.+|. +-...+.+...+.....+ .|. T Consensus 191 ----~~Dg----f~~~~~~q~rvL~--~g~~~v~r~~~~~~~~~~---------------------------e~~----- 228 (513) T protein:vir:97 191 ----EQDG----FAEVCKRRIRVLE--PGLVQLWEPVKKSNAQKE---------------------------EWA----- 228 (513) T ss_pred ----ecCC----CcceEEEEEEEEe--CceEEEEEeecCCCcccc---------------------------ceE----- Confidence 0000 0111111111111 000011110000000000 000 Q ss_pred hhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHh Q lcl|NC_020488. 
302 DVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWN 381 (688) Q Consensus 302 ~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 381 (688) .+.++.. +.+.+|||++.... .+...+....-.+-.+...+=...|-+.+++..+..|...+. |.. +.+. T Consensus 229 ~~~~g~~--~l~~IP~v~~~~~~---~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~-G~~----~~~~ 298 (513) T protein:vir:97 229 LADEWAT--GLNYVPLVTFYADR---QGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACS-GAS----GEDS 298 (513) T ss_pred EecCCCC--cCCceeEEEEecCC---CCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeee-cCC----cCCC Confidence 0011222 24677887765432 233444455667777766655566777777887777766653 221 1111 Q ss_pred h-cccCCCceeecCcccccccceecCCCcch-HHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020488. 382 Q-ANRKNQSVLRYNAIPGVDRPQRDMPASMP-AAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGT 459 (688) Q Consensus 382 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~ 459 (688) . ...-+..++.+. ..+ ..+.++.+..-+ .....-|....+.|. ..|.. .+...+.+.||++......+....| T Consensus 299 ~~i~iG~~~~~~lp-e~~-~~~~yie~~g~~i~~~~~~l~~le~qm~-~~Ga~--ll~~~~~~~Ta~a~~~~~~~~~S~L 373 (513) T protein:vir:97 299 DPVVVGPNKVLYNP-DPA-GRFYYVEHTGQAIAAGRTDLKDLEEQMA-GYGAE--FLKRKTGGQTATARALDSAEATSDL 373 (513) T ss_pred CceEeeccccccCC-CCC-CcceeeccCchhHHHHHHHHHHHHHHHH-HHHHH--hhccCCccccHHHHHHHHHHHHHHH Confidence 1 111112223222 112 235555553222 223344555555553 34543 2233333588898888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcH Q lcl|NC_020488. 460 FAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQ 539 (688) Q Consensus 460 ~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~ 539 (688) ..+..|+..++.. ++.++..|... +...--|.+|+ +|+... 
-.+ T Consensus 374 ~~~a~~le~al~~----~l~~~a~wlg~---------~~~~~~v~in~-------------------dF~~~~----~~~ 417 (513) T protein:vir:97 374 SAMTGLFEDALAQ----ALDITADWLRL---------GPNGGTVELVK-------------------DYDLEE----MDA 417 (513) T ss_pred HHHHHHHHHHHHH----HHHHHHHHhCC---------CCCccEEEecc-------------------ccCccc----CCH Confidence 8888888777755 55555566431 11111133332 121110 011 Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCc----c----HHHHHHHHHhhccccccchhhHHhhhh--------- Q lcl|NC_020488. 540 TQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWP----G----AQDIARRLQKTLPPGILDQDEMEEAGI--------- 602 (688) Q Consensus 540 s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~----~----~~ei~~~~~~~~~~~~~~~~~~~~~~~--------- 602 (688) . ..++|+++.+. +.+....+...++..++- . .+++.+++.+.......+.....+.+. T Consensus 418 ~----~~~al~~a~~~-G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 492 (513) T protein:vir:97 418 P----GLQALQVAREK-RDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEG 492 (513) T ss_pred H----HHHHHHHHHhC-CCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCC Confidence 1 12233333221 000000111111111110 1 133444443332221111000000000 Q ss_pred -------hh-----hhhhHHH Q lcl|NC_020488. 603 -------EP-----PQPSPEQ 611 (688) Q Consensus 603 -------~~-----~~~~~~~ 611 (688) ++ ..+--+. T Consensus 493 ~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 493 EGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCCCCCCCCccccCCCCCCCC Confidence 00 0000000 No 138 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=96.02 E-value=0.0011 Score=36.79 Aligned_cols=609 Identities=14% Similarity=0.055 Sum_probs=186.8 Q ss_pred cCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHH-----------HHHHhh--CC----------CCCCHHHHHHH--H Q lcl|NC_020488. 
8 IKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQ-----------EDISFL--AG----------EQWPESVRKER--E 62 (688) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~-----------~~~~~~--~G----------~Qw~~~~~~~~--~ 62 (688) |+.......++++.++...+....++..+++...+ ++.... .| |.-.......+ + T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 77777788899999988888777666666654421 122222 11 22222232322 2 Q ss_pred hcCCCceee----h----hHHHHHHHHHHHHHh-CCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHH----H Q lcl|NC_020488. 63 DEGRPCLTL----N----KLPQYVDQVLGDQRQ-NRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIR----N 129 (688) Q Consensus 63 ~~g~p~~~~----N----~i~~~i~~i~g~~~~-~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~----~ 129 (688) .+.|+-+.+ + .+.-.++.++.+... |+.+ ..-.++....+. | T Consensus 81 ~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~-------------------------~~~s~Af~~~i~~G~G~ 135 (708) T protein:vir:17 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGG-------------------------EACDNAFDDAATGGFGC 135 (708) T ss_pred hhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCch-------------------------hHHhHHHHHhhhcccce Confidence 233332111 1 122233333322222 1111 111111111111 1 Q ss_pred HHHhcChHHH---HHHHHHHHHHcCC-ceEEEEEeeccCCCCCcce---eEEEecccc---eEEeCCcccccccc-cCce Q lcl|NC_020488. 130 IEYTSNAEAH---YDNAFQHAVEGGF-GWLRVLTKYSTDDAFDLDL---CIKSIHNRF---AVLMDPDATEPDYS-DANW 198 (688) Q Consensus 130 ~~~~~~~~~~---~~~~~~d~~~~G~-G~~~v~~~~~~~~~~~~~~---~~~~v~~~~---~v~~Dp~a~~~d~~-Da~~ 198 (688) +.-..++... .+.-..-.+.+.. -+..|+||+.....-.-+. .+.+..++. .+|++-.....+.. 
.+.| T Consensus 136 ~~~~~d~~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~ 215 (708) T protein:vir:17 136 FRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSW 215 (708) T ss_pred eeeeecccccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccc Confidence 0000000000 0000000011110 1123444433211100011 001111111 12332211111100 0000 Q ss_pred EEEEecCCHHHHHHhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccC--CceecccccchHHHHHH Q lcl|NC_020488. 199 CFISERMSKAEFNKRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSD--GRTVWEDEVKDVLDELR 276 (688) Q Consensus 199 ~~~~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~ 276 (688) ...|.+.+.++ ..+ +|. +..+.+.-+++..+.+..++.... ...+....... ... T Consensus 216 --~~~~~~~d~vr---------------v~e--~~~-r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~---g~~ 272 (708) T protein:vir:17 216 --EYDWFDADVIY---------------IAK--YYE-VRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIA---GFQ 272 (708) T ss_pred --cccccCCCeEE---------------EEE--EEE-EeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhc---ccc Confidence 01111111000 000 111 111222222333333332222111 00000000000 000 Q ss_pred HhhhhhhheeeeeEEEEEEEEEchhhhc-ccCCCCCCCccceEEEeeeeeccC---CcccccchHHHhhHHHH-HHH--- Q lcl|NC_020488. 277 DLGTTVTRERRVKTYKVKWMKVTAYDVL-EGPVDWPGSTIPVAPVLGKEMVIG---DKTYYRGLIRFGKDAQR-MHN--- 348 (688) Q Consensus 277 ~~g~~~~~~~~~~~~~v~~~~~~~~~il-e~~~p~~~~~~P~vp~~~~~~~~~---~~~~g~g~v~~~~d~Q~-~~N--- 348 (688) .........+++.. ..+.......-. .-|.. .+...||..+..++...+ |-+. .+ ++..+.-. ... 
T Consensus 273 ~~~~r~~~r~~v~~--~~~~g~~~l~~~~~~p~~-~fP~vP~~g~r~~~d~~~~~yG~vr--~~-kd~Q~~~N~~~S~~~ 346 (708) T protein:vir:17 273 EVARRSVKRRRVYV--SVVDGDGFLEKPRRIPGE-HIPLIPVYGKRWFIDDIERVEGHIA--KA-MDPQRLYNLQVSMLA 346 (708) T ss_pred cceeeeeeEEEEEE--EeecccccccCCCCCCCC-ccceEEEecccccccCCCcccchhh--hc-hhHHHHHHHHHHHHH Confidence 00000111111111 111111000000 00000 012233332222222222 2221 22 12222111 111 Q ss_pred ---HHHHHHHHHHHhcCCCceeechhhhcchHHHHhhccc-CCCceeecCcccccccceecCCCcchHHHHHHHHHHHHH Q lcl|NC_020488. 349 ---YWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANR-KNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDE 424 (688) Q Consensus 349 ---~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~ 424 (688) .+.+.-.++++.++..++....+.....+......+. .+.+..+.++.....+++....++....++++.....+. T Consensus 347 ~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~ 426 (708) T protein:vir:17 347 DTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQE 426 (708) T ss_pred HHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHH Confidence 2222233555555555554333222222222222222 122222223322334444555666666666666555555 Q ss_pred HH----HHhCcChHHcCCCcc---hhhHHHHHHHHHH---HHHHH-HHHHHHHHHHH--HHHHHHHHHHHHHHcCcceEE Q lcl|NC_020488. 425 MK----ATIGLYDASVGAQGN---EQSGKAILARQRQ---GDRGT-FAYIDNLSRAI--RRVGQILIELIPRVYDSDRVL 491 (688) Q Consensus 425 ~~----~~tGv~d~~~G~~~~---~~sg~ai~~~~~~---~~~~~-~~~~dn~~~~~--~~~~~~~~~li~~~~~~~r~~ 491 (688) +. ...|......|..-+ ..+.+++...... +..++ ..+++.+..+. 
.++++++ -+ =..++.+ T Consensus 427 ~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~----~e-dg~~~~v 501 (708) T protein:vir:17 427 VTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIV----NE-DGSDDIA 501 (708) T ss_pred hcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEe----cC-CCCccee Confidence 43 334654433443221 1111222211111 11111 11111111111 1111111 11 1234556 Q ss_pred EEecc---CCCcceeeechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHH----hhHHHHHHHH Q lcl|NC_020488. 492 RLRFQ---DGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQ----AVPAAGGVVL 564 (688) Q Consensus 492 ri~~~---~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q----~~~~~~~~~~ 564 (688) .|.+. .....++.+|... ...-||.+ +..-.. ++........+..+...+. ..+.+..+++ T Consensus 502 ~in~~~~d~~~g~~~~~nDi~--------~g~~Dv~v---~~~p~~-~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l 569 (708) T protein:vir:17 502 VLSAQVVDRQTGAVVALNDLS--------VGRYDVTV---DVGPSY-TARRDATVSVLTNVLSSMLPADPMRPAIQGIIL 569 (708) T ss_pred eecceeccCCCccceeeccce--------eeeeeEEE---ecccCc-hhHHHHHHHHHHHHHHhcCCccchhHHHHHHHH Confidence 55432 1233555555321 12234442 322232 3444333333333322221 1222222222 Q ss_pred HHHHHhcCCc-cHHHHHHHHHhhcccc---ccc----hhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 565 DLIAKNMDWP-GAQDIARRLQKTLPPG---ILD----QDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADM 636 (688) Q Consensus 565 ~~~~e~~~~~-~~~ei~~~~~~~~~~~---~~~----~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~ 636 (688) .. ...-+.. -++.+...+....... ... ...++.++.++.....+++++..++|+++++++++..+++++. 
T Consensus 570 ~~-~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a 648 (708) T protein:vir:17 570 DN-IDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKA 648 (708) T ss_pred Hh-cCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1000000 0122221111111100 000 1111111222222333456667788889989998888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhcCC Q lcl|NC_020488. 637 AMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAE-AMAELMAQSQGNA 688 (688) Q Consensus 637 ~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~-a~~~~~~~~q~~~ 688 (688) .+++.+..+.+++.++.-.++......+..+........ ..+++...++.+. T Consensus 649 ~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~p~~ 701 (708) T protein:vir:17 649 FTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQS 701 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccC Confidence 777777666666655544433332332222222221111 1122222222222 No 139 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=95.73 E-value=0.0016 Score=35.98 Aligned_cols=449 Identities=9% Similarity=0.000 Sum_probs=158.4 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhh-hHHHHHH---------HH-HHHhhCCCCCCHHHHHHHH-hcCCCc Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCW-KHNFDAA---------QE-DISFLAGEQWPESVRKERE-DEGRPC 68 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~r~~~---------~~-~~~~~~G~Qw~~~~~~~~~-~~g~p~ 68 (688) .||.+-+-.. =....+....+++-+..+.... ..+.-++ .. ...|+.. ..+..+ .+.+-. 
T Consensus 13 ~m~V~~~hp~--y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~------~~~~y~~~~~~rA 84 (488) T protein:vir:96 13 FMLTPIYHPD--YLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAK------IEKDWEDLTWRLA 84 (488) T ss_pred eecccccCHH--HHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhcc------chhhhHhhhhhcc Confidence 3333321111 1111111122222232211100 0000000 00 0000000 000000 011123 Q ss_pred eeehhHHHHHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHH-HHhcChHHHHHHHHHHH Q lcl|NC_020488. 69 LTLNKLPQYVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNI-EYTSNAEAHYDNAFQHA 147 (688) Q Consensus 69 ~~~N~i~~~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~-~~~~~~~~~~~~~~~d~ 147 (688) ..+|..+.+++.++|......|.+.. |. ...|..++..+ .+-++++.-+..++..+ T Consensus 85 ~~~n~~~~tl~~l~G~vfrk~p~~~~-~~----------------------~~~l~~l~~d~D~~G~~L~~f~~~~~~~~ 141 (488) T protein:vir:96 85 NYVNIVNPTMNAITGAVMRREPEFDT-MD----------------------NPVLIGLRDNIDGKGNGIDQECKQALNAL 141 (488) T ss_pred ccCchhHHHHHHhcchhhccCceecc-CC----------------------cHHHHHHHhccCCCCCCHHHHHHHHHHHH Confidence 45799999999999999887766542 10 01244445444 34577889999999999 Q ss_pred HHcCCceEEEEEeeccCCCC-------CcceeEEEecccceEEeCCcccccccc-cCceEEEEecCCHHHHHHhcCCccc Q lcl|NC_020488. 148 VEGGFGWLRVLTKYSTDDAF-------DLDLCIKSIHNRFAVLMDPDATEPDYS-DANWCFISERMSKAEFNKRYPGKAV 219 (688) Q Consensus 148 ~~~G~G~~~v~~~~~~~~~~-------~~~~~~~~v~~~~~v~~Dp~a~~~d~~-Da~~~~~~~~~~~~e~~~~~p~~~~ 219 (688) +..|.+++=| ||-.+... .-.+-+..+ .|.+|+ ++.....+.. ...++..+.-.+ T Consensus 142 l~~G~~~ilV--D~P~~~~T~ade~~~~~rPy~~~~-~a~~Ii-nW~~~~v~G~~~L~~v~lrE~~~------------- 204 (488) T protein:vir:96 142 QWGSRCGWLV--RSHPESATMADWNKGKKLPTAAFY-DALHII-DWEVEYIDGEEKLTYLSLLEDYQ------------- 204 (488) T ss_pred HhcCeEEEEE--ecCCCcCCHHHHHHhcCCcEEEEe-chhhhc-CcceeccCCceeeEEEEEEEEEE------------- Confidence 9999887544 54321110 011223333 344332 2222211110 111222111110 Q ss_pred hhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEc Q lcl|NC_020488. 
220 GDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVT 299 (688) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~ 299 (688) ..+. .+ ..+...+++.. ..+-...+....++...+ . |.. T Consensus 205 ----~~D~--~~-~~~~~~~~~~~---l~~g~~~v~~~~~~~~~~---------------------------e--~~~-- 243 (488) T protein:vir:96 205 ----ERDG--GT-YVSKQRLINHR---LVDGLCEFQEVTDDEYSD---------------------------E--WTP-- 243 (488) T ss_pred ----eccC--CC-cccceEEEEEE---EECcEEEEEEEecCCccc---------------------------c--eEe-- Confidence 0000 00 11112222211 111000111111110000 0 000 Q ss_pred hhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHH Q lcl|NC_020488. 300 AYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEE 379 (688) Q Consensus 300 ~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~ 379 (688) ..++.. +.+.+|||++... ..+...+....-++-.++..+=...|-..+++..+.-+.++..-+... .. T Consensus 244 ---~~~g~~--~l~~IP~v~~~~~---~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~---~~ 312 (488) T protein:vir:96 244 ---VLINSK--QSDTIPFFLASSQ---SNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMN---KT 312 (488) T ss_pred ---ecCCCc--ccCeeEEEEEecC---CCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCC---cc Confidence 001111 2355666654332 223233334344555555443334444556666655555554211111 11 Q ss_pred HhhcccCCCceeec-Cccc-ccccceecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHH Q lcl|NC_020488. 380 WNQANRKNQSVLRY-NAIP-GVDRPQRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDR 457 (688) Q Consensus 380 ~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~ 457 (688) +.....+.++.+-. .+.. +.....+.. ++.++-..+-|+...+.|.. 
.|..- ...+.+.||++......+..- T Consensus 313 ~~~~~~~~g~~~~~~~~~~~~~g~~~~~e-~~~~~l~~~~l~~l~~qm~~-~Ga~l---~~~~~~~Ta~~~~~~~~~~~S 387 (488) T protein:vir:96 313 MASEMNPLGFTLAGRMPYYVKNGDVKVIQ-AQFSPETENKVEKLFEQAVK-VGASL---FTQQSNETATGAAIRSGSSTA 387 (488) T ss_pred cccccccceeeecccccccccCCceeecC-CchhHHHHHHHHHHHHHHHH-HhHhh---ccCCCcchHHHHHHHHHHhhH Confidence 11111111111100 0000 001122222 22211123334444444432 34322 222334688888888877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccceeeeEEEEEecccC Q lcl|NC_020488. 458 GTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPS 537 (688) Q Consensus 458 ~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~ 537 (688) .|..+..|++.+... ++.++..|.... + .+.. + .+..|.|+.... T Consensus 388 ~L~~~a~~le~al~~----~l~~~A~w~g~~------~-~~~~---------------------~---~~~~~~in~dF~ 432 (488) T protein:vir:96 388 SMATLGNNVEDTVRN----MLRFIMRYFEGT------N-LYVN---------------------P---DELVFKLNRDYF 432 (488) T ss_pred HHHHHHHHHHHHHHH----HHHHHHHHcCCC------C-CCcC---------------------c---cceEEEeccCCC Confidence 888888888877755 555666665311 0 0000 0 001111111111 Q ss_pred cHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC--c--cHHHHHHHHHhhccccccch Q lcl|NC_020488. 538 YQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDW--P--GAQDIARRLQKTLPPGILDQ 594 (688) Q Consensus 538 ~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~--~--~~~ei~~~~~~~~~~~~~~~ 594 (688) ......+..++++++.+. ..+....+...++..++ | ..+++.+++... +... 
T Consensus 433 ~~~ld~~~~~al~~~~~~-G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~----g~~~ 488 (488) T protein:vir:96 433 DVEVNPQMLQVAYAAMME-GNLPQVSWFELLKRARVVRGDMSKEEFDEHIAEL----GFGM 488 (488) T ss_pred CccCCHHHHHHHHHHHhc-CCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhc----CCCC Confidence 111112223333333321 00001111111222222 1 224444444321 1111 No 140 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=577 Identities=13% Similarity=0.058 Sum_probs=142.5 Q ss_pred CCCCCCCc--------CCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCceeeh Q lcl|NC_020488. 1 MLPGNEPI--------KTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLAGEQWPESVRKEREDEGRPCLTLN 72 (688) Q Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~~~N 72 (688) |.-|+... |.+...+++...+.+++++........++..+|+...+ ++... ..|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~---------~d~~f--y~G~Q----- 64 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAE---------DDLKF--LGGEQ----- 64 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHH---------HHHHH--hCCCC----- Confidence 33332211 12233333334444444443333333333333332211 11121 24532 Q ss_pred hHHHHHHHHHHHHHhCCc--ceEEEeCCcc------cccccccccccc--Ch-----------------hhHHHHHHHHH Q lcl|NC_020488. 73 KLPQYVDQVLGDQRQNRP--AIQVHPVEAN------ATKDTSKVPNVA--GT-----------------SDYSLAEVYES 125 (688) Q Consensus 73 ~i~~~i~~i~g~~~~~r~--~~~v~pr~~~------~~~~~~~~~~~~--~~-----------------~d~~~Ae~l~~ 125 (688) ..+.+..++- .+.+| .++..+..-+ .-.-+...+.++ ++ .+... 
+-+.+ T Consensus 65 -w~~~~~~~l~--~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~Ae 140 (711) T protein:vir:10 65 -WPSQVRTERE--LEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND-YELAE 140 (711) T ss_pred -CCHHHHHHHH--hcCCCcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhH-HHHHH Confidence 2233333322 22333 2222111100 000000001111 11 11111 23344 Q ss_pred HHHHHHHhcChHHHHHHHHHHHHHcCCceEEEEEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEE---- Q lcl|NC_020488. 126 LIRNIEYTSNAEAHYDNAFQHAVEGGFGWLRVLTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFI---- 201 (688) Q Consensus 126 ~i~~~~~~~~~~~~~~~~~~d~~~~G~G~~~v~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~---- 201 (688) +++.+....-.......++.++...++.+ .+.+ .++++|+.+. |.-+-.+++. T Consensus 141 ~l~~~~~~~~~~~~~~~~~s~af~d~~~~---------------G~G~------~ev~~d~~~~--d~~~~e~~i~~v~~ 197 (711) T protein:vir:10 141 VFTGLIKNIEYNCDAETEYDIAFQGAVES---------------GMGY------LRVRSDYLAD--DSFEQDLIIEAIQN 197 (711) T ss_pred HHHHHHHHHHHhcChhHHHHHHHHHhhhc---------------Ccce------EEEEecccCC--CCCCCCeEEeeecC Confidence 44444433222223333444444444321 1222 1233333221 1111122211 Q ss_pred ---------EecCCHHHHHHhc-----C-CccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceeccc Q lcl|NC_020488. 202 ---------SERMSKAEFNKRY-----P-GKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWED 266 (688) Q Consensus 202 ---------~~~~~~~e~~~~~-----p-~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~ 266 (688) .+..+.++..-.| + +.....++......... ..+.....|+.. ....+.. +|+ T Consensus 198 p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~----~~~~~~~~~~~~-~~vrv~E------~~~- 265 (711) T protein:vir:10 198 QFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYE----DSVADYDTWFTE-KSVRVSE------YFT- 265 (711) T ss_pred hhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhc----ccccccCcccCc-ceeeEEE------EEe- Confidence 1112222222111 0 00000011100000000 000000112211 0001100 010 Q ss_pred ccchHHHHHHHhhhhhhheeeeeEEEEEEEEEc------------------hhh---------------hcccCCCCCC- Q lcl|NC_020488. 
267 EVKDVLDELRDLGTTVTRERRVKTYKVKWMKVT------------------AYD---------------VLEGPVDWPG- 312 (688) Q Consensus 267 ~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~------------------~~~---------------ile~~~p~~~- 312 (688) +....+++..+..+ |.. +.-+..-+.. T Consensus 266 -------------------r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~ 326 (711) T protein:vir:10 266 -------------------REPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGP 326 (711) T ss_pred -------------------eeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCC Confidence 00001111111000 000 0011111111 Q ss_pred Cccce--EEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeechhhhcchHHHHhhcccCCCce Q lcl|NC_020488. 313 STIPV--APVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPAESIEGYEEEWNQANRKNQSV 390 (688) Q Consensus 313 ~~~P~--vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 390 (688) ..||+ +||+|++... .-..+.|....+...=+-.=.+.+...-.+. +.....+. ++.+ T Consensus 327 ~p~~~~~~P~vp~~g~r-~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~------------------~~l~~~~~-~~~~ 386 (711) T protein:vir:10 327 VEIPSTTIPVIPVWGKS-LIIKKKEIFRSIIRHSKDAQRMANYWDSAAT------------------ETVALAPK-APFI 386 (711) T ss_pred CCCCCCcccEEEEeeee-eccccccccchhhhhhhhhHHHHHHHHHHHH------------------HHHHhcCC-Ccee Confidence 12333 5665543211 0012334344433322222222222211111 11111110 1111 Q ss_pred eecCcccc-----------cccceecCCC----------cchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHH Q lcl|NC_020488. 391 LRYNAIPG-----------VDRPQRDMPA----------SMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAIL 449 (688) Q Consensus 391 ~~~~~~~~-----------~~~~~~~~~~----------~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~ 449 (688) ...++..+ ...+....+. +.++-....++........+.-++..+....|..+++++-. 
T Consensus 387 ~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ 466 (711) T protein:vir:10 387 GSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGR 466 (711) T ss_pred ecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHH Confidence 11111110 0111111111 11122223444444444444433222222233333332222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcceEEEEeccCCCcceeeechhhhcccccceeeeccc----ee Q lcl|NC_020488. 450 ARQRQGDRGTFAYIDNLSRAIRRVGQILIELIPRVYDSDRVLRLRFQDGEGDWVQINQMVMDEETQKPVLVNDI----AA 525 (688) Q Consensus 450 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~~~~~~~r~~ri~~~~~~~~~v~~n~~~~~~~~~~~~~~ndi----~~ 525 (688) +....-......+. .+-+.+++..+.+.+++..+.- .+. ..++.+.|-...- +-..+.+|.- .. T Consensus 467 ai~~~q~qg~~~l~-~~~dn~~~~~~~~g~~ll~li~-----~~~---~~er~~rI~ged~---~~~~v~ln~~~~~~~~ 534 (711) T protein:vir:10 467 AIIARQRQGDRGSF-AFIDNLTKSIRRVGKILVEMIP-----HIY---DTERVVRLKFPDE---TEDFVKLNEQIFDEES 534 (711) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----HHc---CCCeEEEEecCCC---CcceEEeccccccccc Confidence 22211111112222 2223334444455555544321 111 1223344421110 1122222221 11 Q ss_pred e----eEEEEE---ecccCc-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHh-cCCccHHHHHHHHHhhccccccchhh Q lcl|NC_020488. 526 G----KFDVTV---KAGPSY-QTQRMEAADSLMQFVQAVPAAGGVVLDLIAKN-MDWPGAQDIARRLQKTLPPGILDQDE 596 (688) Q Consensus 526 ~----~~dv~v---~~~~~~-~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~-~~~~~~~ei~~~~~~~~~~~~~~~~~ 596 (688) | ..|+++ ++.... ++.-....+.+..+.+.++.+ +...+.++.+ +........ +.+.+.+......... 
T Consensus 535 G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~-p~~~~~~~~~il~~~d~p~~-~el~e~lr~~~~~~~~ 612 (711) T protein:vir:10 535 GEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAVMADLIAQNMDWPGA-DVIAERLKKIVPPNVL 612 (711) T ss_pred ccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhc-chhhhHHHHHHHHhcCCCCH-HHHHHHHHhhcCcccC Confidence 1 234443 223233 233333333333333322221 1111111110 000000000 0011111000000000 Q ss_pred HHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 597 MEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAAMMAGPGSLEETVRNLVAEA 676 (688) Q Consensus 597 ~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a~~~~~~~~~~~~~~~~~~a 676 (688) ....+.+..+.+++++++.++++.++++++++.++++++.+++++++.+.+++..+.+ ++.....+..+.....++++ T Consensus 613 ~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q--~q~~~~~~~aq~~~~~~qq~ 690 (711) T protein:vir:10 613 SKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQ--KQLAMIEDMAQGGDVVYQQV 690 (711) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH Confidence 0111111111222222222233333334444444444444444444333333222222 11122222222222222222 Q ss_pred HHHHHHHhhcCC Q lcl|NC_020488. 677 MAELMAQSQGNA 688 (688) Q Consensus 677 ~~~~~~~~q~~~ 688 (688) . .++.+++++. T Consensus 691 ~-~~l~~~qael 701 (711) T protein:vir:10 691 R-ELVAQALAEI 701 (711) T ss_pred H-HHHHHHHHHH Confidence 2 2222233333 No 141 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=46.28 E-value=0.74 Score=21.30 Aligned_cols=546 Identities=13% Similarity=0.071 Sum_probs=113.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh----CCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 
1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL----AGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~----~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|..+ ++. ..++..-+...++..++. ++..+..+..|. .|-=|-+.-... ..-|.+ +.+=.|.| T Consensus 95 v~p~~~-----~~~-~~~~Ae~l~~~~~~~~~~---~~~~~~~s~af~~~~~~G~G~~~~~~~~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:10 95 VMSDEP-----DDE-TEKLAEAINAEFADACRL---GNMNKARSDAYAEQIKAGLSWVEVRRNS-DPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Cch-hHHHHHHHHHHHHHHHHh---hchhHHHHHHHHHhhhcCcceEEecccc-CCCCCC-eEEEecch Confidence 566433 222 112222222223222221 122222222222 233342211000 011211 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHH--------HHHHHHHH Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHY--------DNAFQHAV 148 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~--------~~~~~d~~ 148 (688) + ++.+-|.... . |..- .+|++..-...... .+.+..+. T Consensus 164 --~-----------~v~~Dp~a~~-----------~---D~sD-------ar~~~~~~~~~~~~~~~~fP~~a~~i~~~~ 209 (714) T protein:vir:10 164 --N-----------EVFWDWLSRE-----------A---DLSD-------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred --h-----------heeecccccc-----------C---Chhh-------ccceeeeecCCHHHHHHhcCCchhhhhhhh Confidence 1 1111111100 0 1000 11111110000000 01111111 Q ss_pred HcCCceEEEEEeeccCCCCCcceeE-EEecccceEEeCCcccccccccCceEEEE----------------ecCCHHHHH Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCI-KSIHNRFAVLMDPDATEPDYSDANWCFIS----------------ERMSKAEFN 211 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~-~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~----------------~~~~~~e~~ 211 (688) .++.|+.....+............. ........-+.+++.+..-+.+| |.-+. ..-+...+. 
T Consensus 210 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~-w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:10 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVV-YYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhccccccccccccccccccchhhhccccccccccccccccEEEEEEE-EEEEEEEEEeeccCCCceEEeCccCHHHHH Confidence 1222221111100000000000000 00000000011121111111111 10000 000111111 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) +...+.. .....+ +..++. .+..++. .++....+-.+ .. T Consensus 289 ~~~~g~~--------------~~~~~~--~~rv~~-------~~~~g~~-~L~~~~~p~p~-------------~~---- 327 (714) T protein:vir:10 289 AVASGRV--------------QVKVGR--VSRIRE-------AWFVGPH-FIVDRPCSAPQ-------------GM---- 327 (714) T ss_pred HHhhcch--------------hhhccc--cceEEE-------EEEecCc-ccccCCCCCCC-------------Cc---- Confidence 1000000 000000 111111 1111111 11111111000 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeech- Q lcl|NC_020488. 292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPA- 370 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~- 370 (688) ..+.-+.|...-.... ||-.+-+...+.+-..++.+ +.+-.+.-+......+.+.... T Consensus 328 -fp~vp~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s-------------~~~~~l~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:10 328 -FPLVPFWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRI-------------KLTWLLQAKRVIMDEDATQLSDN 386 (714) T ss_pred -eeEEEEeeeeeeccCc-------eeehhhhchhHHHHHHHHHH-------------HHHHhhcCCceeeecCcccccHH Confidence 0000001111000011 11111111111000111111 1110000000111111111111 Q ss_pred ---------hhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcC Q lcl|NC_020488. 
371 ---------ESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVG 437 (688) Q Consensus 371 ---------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G 437 (688) |++..+..-....+..+..+-+. + ..+.++-.-.+++......+.+- ...|..... T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~------~---~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na-- 455 (714) T protein:vir:10 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVE------Q---DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA-- 455 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCcccccc------C---CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc-- Confidence 11110000000001111111111 1 11223333444444444444443 344543211 Q ss_pred CCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------HHH--cCcceEEEEeccCCCcceeee Q lcl|NC_020488. 438 AQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIELI---------PRV--YDSDRVLRLRFQDGEGDWVQI 505 (688) Q Consensus 438 ~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~li---------~~~--~~~~r~~ri~~~~~~~~~v~~ 505 (688) .++.+.++.- ..-..+...+. .+....++--+.+..++...+ -+. .+..+.+.|..+.+. -+.+ T Consensus 456 ~SGvAi~~rq--~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~--~~~~ 531 (714) T protein:vir:10 456 TSGVAISNLV--EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GELT 531 (714) T ss_pred hhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCc--ceec Confidence 1221111111 11111111111 111111122222222222221 111 112245544433222 1222 Q ss_pred chhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-HhhH-HHHHHHHHHHHHhcCCc-cHHHHHHH Q lcl|NC_020488. 506 NQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAVP-AAGGVVLDLIAKNMDWP-GAQDIARR 582 (688) Q Consensus 506 n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~~-~~~~~~~~~~~e~~~~~-~~~ei~~~ 582 (688) |. .....-||.. +. ....++......+.+..+++.+ .... ....+++ .++..-+.. -.+.+.+. 
T Consensus 532 nD--------i~~~~~Dv~i---~~-~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l-~~~d~p~~~el~~~ir~~ 598 (714) T protein:vir:10 532 ND--------ISRLNTHIAL---AP-VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV-NLLDVPQKQEFVERIRAA 598 (714) T ss_pred cc--------ceeeeEEEEE---ee-ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHH-HhcCCCCHHHHHHHHHHH Confidence 21 1122345543 22 2344666666666655555432 1111 1111111 111110000 01222222 Q ss_pred HHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|NC_020488. 583 LQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM-----------AQAK---TAEAQA 648 (688) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~-----------~q~~---~~~~~a 648 (688) +..........+.++..+++++..++++++.++.++++.+++++++.+++++.+.+ ++.+ .+..++ T Consensus 599 ~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a 678 (714) T protein:vir:10 599 LGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQA 678 (714) T ss_pred cCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111111111111111111112222233333344433333333322222211 1111 111112 Q ss_pred HHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
649 KLAEI-EQAAMMA-GPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 649 ~~~~~-~~~a~~~-~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ..+++ ...+..+ .....+++..+.+++.+..+-+ T Consensus 679 ~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 679 HTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 22211 1122222 2222333332222222222222 No 142 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=46.28 E-value=0.74 Score=21.30 Aligned_cols=546 Identities=13% Similarity=0.071 Sum_probs=113.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh----CCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL----AGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~----~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|..+ ++. ..++..-+...++..++. ++..+..+..|. .|-=|-+.-... ..-|.+ +.+=.|.| T Consensus 95 v~p~~~-----~~~-~~~~Ae~l~~~~~~~~~~---~~~~~~~s~af~~~~~~G~G~~~~~~~~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:81 95 VMSDEP-----DDE-TEKLAEAINAEFADACRL---GNMNKARSDAYAEQIKAGLSWVEVRRNS-DPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Cch-hHHHHHHHHHHHHHHHHh---hchhHHHHHHHHHhhhcCcceEEecccc-CCCCCC-eEEEecch Confidence 566433 222 112222222223222221 122222222222 233342211000 011211 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHH--------HHHHHHHH Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHY--------DNAFQHAV 148 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~--------~~~~~d~~ 148 (688) + ++.+-|.... . |..- .+|++..-...... .+.+..+. 
T Consensus 164 --~-----------~v~~Dp~a~~-----------~---D~sD-------ar~~~~~~~~~~~~~~~~fP~~a~~i~~~~ 209 (714) T protein:vir:81 164 --N-----------EVFWDWLSRE-----------A---DLSD-------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred --h-----------heeecccccc-----------C---Chhh-------ccceeeeecCCHHHHHHhcCCchhhhhhhh Confidence 1 1111111100 0 1000 11111110000000 01111111 Q ss_pred HcCCceEEEEEeeccCCCCCcceeE-EEecccceEEeCCcccccccccCceEEEE----------------ecCCHHHHH Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCI-KSIHNRFAVLMDPDATEPDYSDANWCFIS----------------ERMSKAEFN 211 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~-~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~----------------~~~~~~e~~ 211 (688) .++.|+.....+............. ........-+.+++.+..-+.+| |.-+. ..-+...+. T Consensus 210 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~-w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:81 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVV-YYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhccccccccccccccccccchhhhccccccccccccccccEEEEEEE-EEEEEEEEEeeccCCCceEEeCccCHHHHH Confidence 1222221111100000000000000 00000000011121111111111 10000 000111111 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) +...+.. .....+ +..++. .+..++. .++....+-.+ .. T Consensus 289 ~~~~g~~--------------~~~~~~--~~rv~~-------~~~~g~~-~L~~~~~p~p~-------------~~---- 327 (714) T protein:vir:81 289 AVASGRV--------------QVKVGR--VSRIRE-------AWFVGPH-FIVDRPCSAPQ-------------GM---- 327 (714) T ss_pred HHhhcch--------------hhhccc--cceEEE-------EEEecCc-ccccCCCCCCC-------------Cc---- Confidence 1000000 000000 111111 1111111 11111111000 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeech- Q lcl|NC_020488. 
292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPA- 370 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~- 370 (688) ..+.-+.|...-.... ||-.+-+...+.+-..++.+ +.+-.+.-+......+.+.... T Consensus 328 -fp~vp~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s-------------~~~~~l~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:81 328 -FPLVPFWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRI-------------KLTWLLQAKRVIMDEDATQLSDN 386 (714) T ss_pred -eeEEEEeeeeeeccCc-------eeehhhhchhHHHHHHHHHH-------------HHHHhhcCCceeeecCcccccHH Confidence 0000001111000011 11111111111000111111 1110000000111111111111 Q ss_pred ---------hhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcC Q lcl|NC_020488. 371 ---------ESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVG 437 (688) Q Consensus 371 ---------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G 437 (688) |++..+..-....+..+..+-+. + ..+.++-.-.+++......+.+- ...|..... T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~------~---~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na-- 455 (714) T protein:vir:81 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVE------Q---DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA-- 455 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCcccccc------C---CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc-- Confidence 11110000000001111111111 1 11223333444444444444443 344543211 Q ss_pred CCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------HHH--cCcceEEEEeccCCCcceeee Q lcl|NC_020488. 438 AQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIELI---------PRV--YDSDRVLRLRFQDGEGDWVQI 505 (688) Q Consensus 438 ~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~li---------~~~--~~~~r~~ri~~~~~~~~~v~~ 505 (688) .++.+.++.- ..-..+...+. .+....++--+.+..++...+ -+. .+..+.+.|..+.+. 
-+.+ T Consensus 456 ~SGvAi~~rq--~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~--~~~~ 531 (714) T protein:vir:81 456 TSGVAISNLV--EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GELT 531 (714) T ss_pred hhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCc--ceec Confidence 1221111111 11111111111 111111122222222222221 111 112245544433222 1222 Q ss_pred chhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-HhhH-HHHHHHHHHHHHhcCCc-cHHHHHHH Q lcl|NC_020488. 506 NQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAVP-AAGGVVLDLIAKNMDWP-GAQDIARR 582 (688) Q Consensus 506 n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~~-~~~~~~~~~~~e~~~~~-~~~ei~~~ 582 (688) |. .....-||.. +. ....++......+.+..+++.+ .... ....+++ .++..-+.. -.+.+.+. T Consensus 532 nD--------i~~~~~Dv~i---~~-~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l-~~~d~p~~~el~~~ir~~ 598 (714) T protein:vir:81 532 ND--------ISRLNTHIAL---AP-VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV-NLLDVPQKQEFVERIRAA 598 (714) T ss_pred cc--------ceeeeEEEEE---ee-ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHH-HhcCCCCHHHHHHHHHHH Confidence 21 1122345543 22 2344666666666655555432 1111 1111111 111110000 01222222 Q ss_pred HHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|NC_020488. 
583 LQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM-----------AQAK---TAEAQA 648 (688) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~-----------~q~~---~~~~~a 648 (688) +..........+.++..+++++..++++++.++.++++.+++++++.+++++.+.+ ++.+ .+..++ T Consensus 599 ~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a 678 (714) T protein:vir:81 599 LGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQA 678 (714) T ss_pred cCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111111111111111111112222233333344433333333322222211 1111 111112 Q ss_pred HHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 649 KLAEI-EQAAMMA-GPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 649 ~~~~~-~~~a~~~-~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ..+++ ...+..+ .....+++..+.+++.+..+-+ T Consensus 679 ~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:81 679 HTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 22211 1122222 2222333332222222222222 No 143 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=46.28 E-value=0.74 Score=21.30 Aligned_cols=546 Identities=13% Similarity=0.071 Sum_probs=113.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh----CCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL----AGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~----~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|..+ ++. ..++..-+...++..++. ++..+..+..|. .|-=|-+.-... 
..-|.+ +.+=.|.| T Consensus 95 v~p~~~-----~~~-~~~~Ae~l~~~~~~~~~~---~~~~~~~s~af~~~~~~G~G~~~~~~~~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:32 95 VMSDEP-----DDE-TEKLAEAINAEFADACRL---GNMNKARSDAYAEQIKAGLSWVEVRRNS-DPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Cch-hHHHHHHHHHHHHHHHHh---hchhHHHHHHHHHhhhcCcceEEecccc-CCCCCC-eEEEecch Confidence 566433 222 112222222223222221 122222222222 233342211000 011211 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHH--------HHHHHHHH Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHY--------DNAFQHAV 148 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~--------~~~~~d~~ 148 (688) + ++.+-|.... . |..- .+|++..-...... .+.+..+. T Consensus 164 --~-----------~v~~Dp~a~~-----------~---D~sD-------ar~~~~~~~~~~~~~~~~fP~~a~~i~~~~ 209 (714) T protein:vir:32 164 --N-----------EVFWDWLSRE-----------A---DLSD-------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred --h-----------heeecccccc-----------C---Chhh-------ccceeeeecCCHHHHHHhcCCchhhhhhhh Confidence 1 1111111100 0 1000 11111110000000 01111111 Q ss_pred HcCCceEEEEEeeccCCCCCcceeE-EEecccceEEeCCcccccccccCceEEEE----------------ecCCHHHHH Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCI-KSIHNRFAVLMDPDATEPDYSDANWCFIS----------------ERMSKAEFN 211 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~-~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~----------------~~~~~~e~~ 211 (688) .++.|+.....+............. ........-+.+++.+..-+.+| |.-+. ..-+...+. T Consensus 210 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~-w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:32 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVV-YYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhccccccccccccccccccchhhhccccccccccccccccEEEEEEE-EEEEEEEEEeeccCCCceEEeCccCHHHHH Confidence 1222221111100000000000000 00000000011121111111111 10000 000111111 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 
212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) +...+.. .....+ +..++. .+..++. .++....+-.+ .. T Consensus 289 ~~~~g~~--------------~~~~~~--~~rv~~-------~~~~g~~-~L~~~~~p~p~-------------~~---- 327 (714) T protein:vir:32 289 AVASGRV--------------QVKVGR--VSRIRE-------AWFVGPH-FIVDRPCSAPQ-------------GM---- 327 (714) T ss_pred HHhhcch--------------hhhccc--cceEEE-------EEEecCc-ccccCCCCCCC-------------Cc---- Confidence 1000000 000000 111111 1111111 11111111000 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeech- Q lcl|NC_020488. 292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPA- 370 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~- 370 (688) ..+.-+.|...-.... ||-.+-+...+.+-..++.+ +.+-.+.-+......+.+.... T Consensus 328 -fp~vp~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s-------------~~~~~l~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:32 328 -FPLVPFWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRI-------------KLTWLLQAKRVIMDEDATQLSDN 386 (714) T ss_pred -eeEEEEeeeeeeccCc-------eeehhhhchhHHHHHHHHHH-------------HHHHhhcCCceeeecCcccccHH Confidence 0000001111000011 11111111111000111111 1110000000111111111111 Q ss_pred ---------hhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcC Q lcl|NC_020488. 371 ---------ESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVG 437 (688) Q Consensus 371 ---------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G 437 (688) |++..+..-....+..+..+-+. + ..+.++-.-.+++......+.+- ...|..... 
T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~------~---~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na-- 455 (714) T protein:vir:32 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVE------Q---DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA-- 455 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCcccccc------C---CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc-- Confidence 11110000000001111111111 1 11223333444444444444443 344543211 Q ss_pred CCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------HHH--cCcceEEEEeccCCCcceeee Q lcl|NC_020488. 438 AQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIELI---------PRV--YDSDRVLRLRFQDGEGDWVQI 505 (688) Q Consensus 438 ~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~li---------~~~--~~~~r~~ri~~~~~~~~~v~~ 505 (688) .++.+.++.- ..-..+...+. .+....++--+.+..++...+ -+. .+..+.+.|..+.+. -+.+ T Consensus 456 ~SGvAi~~rq--~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~--~~~~ 531 (714) T protein:vir:32 456 TSGVAISNLV--EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GELT 531 (714) T ss_pred hhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCc--ceec Confidence 1221111111 11111111111 111111122222222222221 111 112245544433222 1222 Q ss_pred chhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-HhhH-HHHHHHHHHHHHhcCCc-cHHHHHHH Q lcl|NC_020488. 506 NQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAVP-AAGGVVLDLIAKNMDWP-GAQDIARR 582 (688) Q Consensus 506 n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~~-~~~~~~~~~~~e~~~~~-~~~ei~~~ 582 (688) |. .....-||.. +. ....++......+.+..+++.+ .... ....+++ .++..-+.. -.+.+.+. 
T Consensus 532 nD--------i~~~~~Dv~i---~~-~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l-~~~d~p~~~el~~~ir~~ 598 (714) T protein:vir:32 532 ND--------ISRLNTHIAL---AP-VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV-NLLDVPQKQEFVERIRAA 598 (714) T ss_pred cc--------ceeeeEEEEE---ee-ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHH-HhcCCCCHHHHHHHHHHH Confidence 21 1122345543 22 2344666666666655555432 1111 1111111 111110000 01222222 Q ss_pred HHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|NC_020488. 583 LQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM-----------AQAK---TAEAQA 648 (688) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~-----------~q~~---~~~~~a 648 (688) +..........+.++..+++++..++++++.++.++++.+++++++.+++++.+.+ ++.+ .+..++ T Consensus 599 ~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a 678 (714) T protein:vir:32 599 LGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQA 678 (714) T ss_pred cCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111111111111111111112222233333344433333333322222211 1111 111112 Q ss_pred HHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
649 KLAEI-EQAAMMA-GPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 649 ~~~~~-~~~a~~~-~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ..+++ ...+..+ .....+++..+.+++.+..+-+ T Consensus 679 ~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:32 679 HTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 22211 1122222 2222333332222222222222 No 144 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=46.28 E-value=0.74 Score=21.30 Aligned_cols=546 Identities=13% Similarity=0.071 Sum_probs=113.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh----CCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL----AGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~----~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|..+ ++. ..++..-+...++..++. ++..+..+..|. .|-=|-+.-... ..-|.+ +.+=.|.| T Consensus 95 v~p~~~-----~~~-~~~~Ae~l~~~~~~~~~~---~~~~~~~s~af~~~~~~G~G~~~~~~~~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:99 95 VMSDEP-----DDE-TEKLAEAINAEFADACRL---GNMNKARSDAYAEQIKAGLSWVEVRRNS-DPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Cch-hHHHHHHHHHHHHHHHHh---hchhHHHHHHHHHhhhcCcceEEecccc-CCCCCC-eEEEecch Confidence 566433 222 112222222223222221 122222222222 233342211000 011211 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHH--------HHHHHHHH Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHY--------DNAFQHAV 148 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~--------~~~~~d~~ 148 (688) + ++.+-|.... . |..- .+|++..-...... .+.+..+. 
T Consensus 164 --~-----------~v~~Dp~a~~-----------~---D~sD-------ar~~~~~~~~~~~~~~~~fP~~a~~i~~~~ 209 (714) T protein:vir:99 164 --N-----------EVFWDWLSRE-----------A---DLSD-------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred --h-----------heeecccccc-----------C---Chhh-------ccceeeeecCCHHHHHHhcCCchhhhhhhh Confidence 1 1111111100 0 1000 11111110000000 01111111 Q ss_pred HcCCceEEEEEeeccCCCCCcceeE-EEecccceEEeCCcccccccccCceEEEE----------------ecCCHHHHH Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCI-KSIHNRFAVLMDPDATEPDYSDANWCFIS----------------ERMSKAEFN 211 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~-~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~----------------~~~~~~e~~ 211 (688) .++.|+.....+............. ........-+.+++.+..-+.+| |.-+. ..-+...+. T Consensus 210 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~-w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:99 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVV-YYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhccccccccccccccccccchhhhccccccccccccccccEEEEEEE-EEEEEEEEEeeccCCCceEEeCccCHHHHH Confidence 1222221111100000000000000 00000000011121111111111 10000 000111111 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) +...+.. .....+ +..++. .+..++. .++....+-.+ .. T Consensus 289 ~~~~g~~--------------~~~~~~--~~rv~~-------~~~~g~~-~L~~~~~p~p~-------------~~---- 327 (714) T protein:vir:99 289 AVASGRV--------------QVKVGR--VSRIRE-------AWFVGPH-FIVDRPCSAPQ-------------GM---- 327 (714) T ss_pred HHhhcch--------------hhhccc--cceEEE-------EEEecCc-ccccCCCCCCC-------------Cc---- Confidence 1000000 000000 111111 1111111 11111111000 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeech- Q lcl|NC_020488. 
292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPA- 370 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~- 370 (688) ..+.-+.|...-.... ||-.+-+...+.+-..++.+ +.+-.+.-+......+.+.... T Consensus 328 -fp~vp~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s-------------~~~~~l~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:99 328 -FPLVPFWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRI-------------KLTWLLQAKRVIMDEDATQLSDN 386 (714) T ss_pred -eeEEEEeeeeeeccCc-------eeehhhhchhHHHHHHHHHH-------------HHHHhhcCCceeeecCcccccHH Confidence 0000001111000011 11111111111000111111 1110000000111111111111 Q ss_pred ---------hhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcC Q lcl|NC_020488. 371 ---------ESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVG 437 (688) Q Consensus 371 ---------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G 437 (688) |++..+..-....+..+..+-+. + ..+.++-.-.+++......+.+- ...|..... T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~------~---~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na-- 455 (714) T protein:vir:99 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVE------Q---DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA-- 455 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCcccccc------C---CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc-- Confidence 11110000000001111111111 1 11223333444444444444443 344543211 Q ss_pred CCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------HHH--cCcceEEEEeccCCCcceeee Q lcl|NC_020488. 438 AQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIELI---------PRV--YDSDRVLRLRFQDGEGDWVQI 505 (688) Q Consensus 438 ~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~li---------~~~--~~~~r~~ri~~~~~~~~~v~~ 505 (688) .++.+.++.- ..-..+...+. .+....++--+.+..++...+ -+. .+..+.+.|..+.+. 
-+.+ T Consensus 456 ~SGvAi~~rq--~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~--~~~~ 531 (714) T protein:vir:99 456 TSGVAISNLV--EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GELT 531 (714) T ss_pred hhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCc--ceec Confidence 1221111111 11111111111 111111122222222222221 111 112245544433222 1222 Q ss_pred chhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-HhhH-HHHHHHHHHHHHhcCCc-cHHHHHHH Q lcl|NC_020488. 506 NQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAVP-AAGGVVLDLIAKNMDWP-GAQDIARR 582 (688) Q Consensus 506 n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~~-~~~~~~~~~~~e~~~~~-~~~ei~~~ 582 (688) |. .....-||.. +. ....++......+.+..+++.+ .... ....+++ .++..-+.. -.+.+.+. T Consensus 532 nD--------i~~~~~Dv~i---~~-~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l-~~~d~p~~~el~~~ir~~ 598 (714) T protein:vir:99 532 ND--------ISRLNTHIAL---AP-VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV-NLLDVPQKQEFVERIRAA 598 (714) T ss_pred cc--------ceeeeEEEEE---ee-ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHH-HhcCCCCHHHHHHHHHHH Confidence 21 1122345543 22 2344666666666655555432 1111 1111111 111110000 01222222 Q ss_pred HHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|NC_020488. 
583 LQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM-----------AQAK---TAEAQA 648 (688) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~-----------~q~~---~~~~~a 648 (688) +..........+.++..+++++..++++++.++.++++.+++++++.+++++.+.+ ++.+ .+..++ T Consensus 599 ~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a 678 (714) T protein:vir:99 599 LGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQA 678 (714) T ss_pred cCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111111111111111111112222233333344433333333322222211 1111 111112 Q ss_pred HHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 649 KLAEI-EQAAMMA-GPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 649 ~~~~~-~~~a~~~-~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ..+++ ...+..+ .....+++..+.+++.+..+-+ T Consensus 679 ~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:99 679 HTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 22211 1122222 2222333332222222222222 No 145 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=46.28 E-value=0.74 Score=21.30 Aligned_cols=546 Identities=13% Similarity=0.071 Sum_probs=113.0 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh----CCCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFL----AGEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~----~G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) .+|..+ ++. ..++..-+...++..++. ++..+..+..|. .|-=|-+.-... 
..-|.+ +.+=.|.| T Consensus 95 v~p~~~-----~~~-~~~~Ae~l~~~~~~~~~~---~~~~~~~s~af~~~~~~G~G~~~~~~~~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:27 95 VMSDEP-----DDE-TEKLAEAINAEFADACRL---GNMNKARSDAYAEQIKAGLSWVEVRRNS-DPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Cch-hHHHHHHHHHHHHHHHHh---hchhHHHHHHHHHhhhcCcceEEecccc-CCCCCC-eEEEecch Confidence 566433 222 112222222223222221 122222222222 233342211000 011211 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHH--------HHHHHHHH Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHY--------DNAFQHAV 148 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~--------~~~~~d~~ 148 (688) + ++.+-|.... . |..- .+|++..-...... .+.+..+. T Consensus 164 --~-----------~v~~Dp~a~~-----------~---D~sD-------ar~~~~~~~~~~~~~~~~fP~~a~~i~~~~ 209 (714) T protein:vir:27 164 --N-----------EVFWDWLSRE-----------A---DLSD-------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred --h-----------heeecccccc-----------C---Chhh-------ccceeeeecCCHHHHHHhcCCchhhhhhhh Confidence 1 1111111100 0 1000 11111110000000 01111111 Q ss_pred HcCCceEEEEEeeccCCCCCcceeE-EEecccceEEeCCcccccccccCceEEEE----------------ecCCHHHHH Q lcl|NC_020488. 149 EGGFGWLRVLTKYSTDDAFDLDLCI-KSIHNRFAVLMDPDATEPDYSDANWCFIS----------------ERMSKAEFN 211 (688) Q Consensus 149 ~~G~G~~~v~~~~~~~~~~~~~~~~-~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~----------------~~~~~~e~~ 211 (688) .++.|+.....+............. ........-+.+++.+..-+.+| |.-+. ..-+...+. T Consensus 210 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~-w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:27 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVV-YYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhccccccccccccccccccchhhhccccccccccccccccEEEEEEE-EEEEEEEEEeeccCCCceEEeCccCHHHHH Confidence 1222221111100000000000000 00000000011121111111111 10000 000111111 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 
212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) +...+.. .....+ +..++. .+..++. .++....+-.+ .. T Consensus 289 ~~~~g~~--------------~~~~~~--~~rv~~-------~~~~g~~-~L~~~~~p~p~-------------~~---- 327 (714) T protein:vir:27 289 AVASGRV--------------QVKVGR--VSRIRE-------AWFVGPH-FIVDRPCSAPQ-------------GM---- 327 (714) T ss_pred HHhhcch--------------hhhccc--cceEEE-------EEEecCc-ccccCCCCCCC-------------Cc---- Confidence 1000000 000000 111111 1111111 11111111000 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCceeech- Q lcl|NC_020488. 292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWVAPA- 370 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~- 370 (688) ..+.-+.|...-.... ||-.+-+...+.+-..++.+ +.+-.+.-+......+.+.... T Consensus 328 -fp~vp~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s-------------~~~~~l~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:27 328 -FPLVPFWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRI-------------KLTWLLQAKRVIMDEDATQLSDN 386 (714) T ss_pred -eeEEEEeeeeeeccCc-------eeehhhhchhHHHHHHHHHH-------------HHHHhhcCCceeeecCcccccHH Confidence 0000001111000011 11111111111000111111 1110000000111111111111 Q ss_pred ---------hhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcC Q lcl|NC_020488. 371 ---------ESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVG 437 (688) Q Consensus 371 ---------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G 437 (688) |++..+..-....+..+..+-+. + ..+.++-.-.+++......+.+- ...|..... 
T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~------~---~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na-- 455 (714) T protein:vir:27 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVE------Q---DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA-- 455 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCcccccc------C---CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc-- Confidence 11110000000001111111111 1 11223333444444444444443 344543211 Q ss_pred CCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---------HHH--cCcceEEEEeccCCCcceeee Q lcl|NC_020488. 438 AQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIELI---------PRV--YDSDRVLRLRFQDGEGDWVQI 505 (688) Q Consensus 438 ~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~li---------~~~--~~~~r~~ri~~~~~~~~~v~~ 505 (688) .++.+.++.- ..-..+...+. .+....++--+.+..++...+ -+. .+..+.+.|..+.+. -+.+ T Consensus 456 ~SGvAi~~rq--~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~--~~~~ 531 (714) T protein:vir:27 456 TSGVAISNLV--EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GELT 531 (714) T ss_pred hhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCc--ceec Confidence 1221111111 11111111111 111111122222222222221 111 112245544433222 1222 Q ss_pred chhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-HhhH-HHHHHHHHHHHHhcCCc-cHHHHHHH Q lcl|NC_020488. 506 NQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-QAVP-AAGGVVLDLIAKNMDWP-GAQDIARR 582 (688) Q Consensus 506 n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-q~~~-~~~~~~~~~~~e~~~~~-~~~ei~~~ 582 (688) |. .....-||.. +. ....++......+.+..+++.+ .... ....+++ .++..-+.. -.+.+.+. 
T Consensus 532 nD--------i~~~~~Dv~i---~~-~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l-~~~d~p~~~el~~~ir~~ 598 (714) T protein:vir:27 532 ND--------ISRLNTHIAL---AP-VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV-NLLDVPQKQEFVERIRAA 598 (714) T ss_pred cc--------ceeeeEEEEE---ee-ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHH-HhcCCCCHHHHHHHHHHH Confidence 21 1122345543 22 2344666666666655555432 1111 1111111 111110000 01222222 Q ss_pred HHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|NC_020488. 583 LQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAM-----------AQAK---TAEAQA 648 (688) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~-----------~q~~---~~~~~a 648 (688) +..........+.++..+++++..++++++.++.++++.+++++++.+++++.+.+ ++.+ .+..++ T Consensus 599 ~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a 678 (714) T protein:vir:27 599 LGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQA 678 (714) T ss_pred cCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222111111111111111111112222233333344433333333322222211 1111 111112 Q ss_pred HHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
649 KLAEI-EQAAMMA-GPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 649 ~~~~~-~~~a~~~-~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ..+++ ...+..+ .....+++..+.+++.+..+-+ T Consensus 679 ~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:27 679 HTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 22211 1122222 2222333332222222222222 No 146 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=41.95 E-value=0.9 Score=20.82 Aligned_cols=267 Identities=10% Similarity=0.062 Sum_probs=95.8 Q ss_pred EEEecccceEEeCCcccccccccCceEEE---EecCCHHHHHHhcCCccchhccccccccccc-CCCCCEEEEEEEEeee Q lcl|NC_020488. 173 IKSIHNRFAVLMDPDATEPDYSDANWCFI---SERMSKAEFNKRYPGKAVGDLSDAERGEYSW-WTNEEGVRVSEYFYRE 248 (688) Q Consensus 173 ~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~---~~~~~~~e~~~~~p~~~~~~~~~~~~~~~~~-~~~~~~v~v~e~~~~~ 248 (688) |-.+ |..++=+ . +..+ ++.-++.. -.+||..+|.+..=. .+ ...+..+-. ..+.+ -+++++|.-. T Consensus 1 ia~l--~~~~~~~-~-~~~~-~~l~~lL~~~PN~~~t~~~f~~~~~~----~l-l~~Gna~~~i~r~~~-G~~~~l~~l~ 69 (278) T protein:vir:78 1 MASL--PLKMYED-Y-KVVN-TEVSDLLTVSPNNSLSSFDFINQIET----IR-NEKGNAYVLIERDIY-HQPSKLFLLN 69 (278) T ss_pred Cccc--eeEEEec-C-cccc-cHHHHHHHhcCCCCCCHHHHHHHHHH----HH-hhcCCEEEEEEECCC-CcEEEEEEEC Confidence 1111 1111110 0 0000 11111111 135666666542100 00 000000100 00111 1245666665 Q ss_pred ecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccC Q lcl|NC_020488. 249 PVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIG 328 (688) Q Consensus 249 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~ 328 (688) +....+....+|..+++..... .|. ...+....+++- .. .... 
T Consensus 70 ~~~v~v~~~~~~~~~~y~~~~~-------~g~--------------~~~~~~~evih~---------------~~-~~~~ 112 (278) T protein:vir:78 70 PDVVEMLIENQSRELYYSIHAA-------TGN--------------KLIVHNMDMLHF---------------KH-IVAS 112 (278) T ss_pred CceeEEEEcCCCceEEEEEEcC-------Cce--------------EEEEccccEEEE---------------CC-CCCC Confidence 5555544444443332211000 000 000111111111 10 1123 Q ss_pred CcccccchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcee-echhhh-----cchHHHHhhcccCCCceeecCcccccccc Q lcl|NC_020488. 329 DKTYYRGLIRFGKDAQRMHNYWMTAATERVALAPKAPWV-APAESI-----EGYEEEWNQANRKNQSVLRYNAIPGVDRP 402 (688) Q Consensus 329 ~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~-~~~~~i-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 402 (688) +..+|.|.+..+.+.-...+......+.... +.+..+ ...+.+ ....+.+.......+.+++..+ +-.+ T Consensus 113 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~---g~~~ 187 (278) T protein:vir:78 113 NMVQGISPIDVLKNTTDFDNAVRTFNLTEMQ--KPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEP---GVEI 187 (278) T ss_pred CCeeeccHHHHHHHHHHHHHHHHHHHHHHhc--CCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCC---CceE Confidence 4567778877777665544443333222222 223333 333322 2333344433333444444432 2234 Q ss_pred eecCCCcchHHHHHHHHHHHHHHHHHhCcChHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 403 QRDMPASMPAAELQLALSATDEMKATIGLYDASVGAQGNEQSGKAILARQRQGDRGTFAYIDNLSRAIRRVGQILIELIP 482 (688) Q Consensus 403 ~~~~~~~~~~~~~~ll~~~~~~~~~~tGv~d~~~G~~~~~~sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~~~li~ 482 (688) ..+.....-....+..+.....|-..-||++..+|...+++-+.+-++...--.. .++.+.+.+.+-+. T Consensus 188 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~-----------~l~P~~~~i~~~ln 256 (278) T protein:vir:78 188 EPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQH-----------TLLPIVKQYEEEFN 256 (278) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-----------HHHHHHHHHHHHHH Confidence 4444433444456666677788888899999999976543322222211111122 23333333332222 Q ss_pred -HHcCcceEEEEeccCCCcceeeechhhh Q lcl|NC_020488. 
483 -RVYDSDRVLRLRFQDGEGDWVQINQMVM 510 (688) Q Consensus 483 -~~~~~~r~~ri~~~~~~~~~v~~n~~~~ 510 (688) +.+++... ...-++.+|-.-+ T Consensus 257 ~~L~~~~e~-------~~g~~~~f~~~~l 278 (278) T protein:vir:78 257 RKLLTKTDR-------EKIGILNLTLNLI 278 (278) T ss_pred hhcCChhHh-------cCCceEEEecccC Confidence 23322110 0112333321111 No 147 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=36.64 E-value=1.2 Score=20.23 Aligned_cols=539 Identities=13% Similarity=0.090 Sum_probs=114.1 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhC----CCCCCHHHHHHHHhcCCCceeehhHHH Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNFDAAQEDISFLA----GEQWPESVRKEREDEGRPCLTLNKLPQ 76 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~----G~Qw~~~~~~~~~~~g~p~~~~N~i~~ 76 (688) ..|... +++ ..++..-+...++..++ .++.++..+..|.+ |-=|-..-... ..-+.+ +.+=.|.| T Consensus 95 v~pr~~-----~~~-~~~~Ae~l~~~~~~~~~---~~~~~~~~s~af~~~~~~G~G~~~~~~d~-d~~~~~-i~i~~v~p 163 (714) T protein:vir:10 95 VMSDDP-----NDE-TEKLAEAINAEFADACR---LGNMNKARSDAYAEQIKAGLSWVEVRRNS-EPFGPE-FKVSTVSR 163 (714) T ss_pred EecCCC-----Chh-hHHHHHHHHHHHHHHHH---hhchhHHHHHHHHHhhhcccceEEeeecc-CCCCCC-eEEEecCh Confidence 556433 221 11222222222222222 22233333333332 33343211110 011111 11111111 Q ss_pred HHHHHHHHHHhCCcceEEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHH-HH---HHHHHcCC Q lcl|NC_020488. 77 YVDQVLGDQRQNRPAIQVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDN-AF---QHAVEGGF 152 (688) Q Consensus 77 ~i~~i~g~~~~~r~~~~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~-~~---~d~~~~G~ 152 (688) -++.+-|... ..+.+| .+|++..-..+..... .| .+.+.... 
T Consensus 164 -------------~~v~~Dp~a~-----------~~D~sD----------ar~~~~~~~~~~~~~~~~fp~~a~~i~~~~ 209 (714) T protein:vir:10 164 -------------NEVFWDWLSR-----------EADLSD----------CRWLMRRRWMDTDEAKATFPGMAQVIDYAI 209 (714) T ss_pred -------------hheeeccccc-----------cCChhh----------hhhhhhhccCCHHHHHHhcCCchhhhhccc Confidence 1111111110 001111 1222211111110000 01 00010000 Q ss_pred ceEEEEEeeccCCCCCcceeEEEecccc-eEEeCCcccccccccCceEEEEe-c-------------------CCHHHHH Q lcl|NC_020488. 153 GWLRVLTKYSTDDAFDLDLCIKSIHNRF-AVLMDPDATEPDYSDANWCFISE-R-------------------MSKAEFN 211 (688) Q Consensus 153 G~~~v~~~~~~~~~~~~~~~~~~v~~~~-~v~~Dp~a~~~d~~Da~~~~~~~-~-------------------~~~~e~~ 211 (688) ..+.-..+....+... ... +.++. ...||.......-.|.+.|++.. | .+..++. T Consensus 210 ~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~ 285 (714) T protein:vir:10 210 DDWRGFVDTTVTEGQP-SPL---MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLM 285 (714) T ss_pred hhhcCcccchhhhhhc-ccc---cccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHH Confidence 0000000000000000 000 00000 00111100000001111111100 1 0111110 Q ss_pred HhcCCccchhcccccccccccCCCCCEEEEEEEEeeeecceeeeeccCCceecccccchHHHHHHHhhhhhhheeeeeEE Q lcl|NC_020488. 212 KRYPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVTRKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVKTY 291 (688) Q Consensus 212 ~~~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 291 (688) .. .. ... ...-.......+|...|+-... .++....+-.+ . .+ T Consensus 286 ~~--------~~-~~~-g~~~~~~~~~~rv~~~~~~g~~-----------~L~~~~~p~p~------------~----~f 328 (714) T protein:vir:10 286 QA--------VA-VAS-GRVQVKVGRVSRIREAWFVGPH-----------FIVDRPCSAPQ------------G----MF 328 (714) T ss_pred HH--------HH-HHh-ccceecccceeeEEEEEEecch-----------hhhcCCCCCCC------------C----ce Confidence 00 00 000 0000001111222222322110 00000000000 0 00 Q ss_pred EEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHH------------HHHHHHHHH Q lcl|NC_020488. 
292 KVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYW------------MTAATERVA 359 (688) Q Consensus 292 ~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~------------~s~~~~~~~ 359 (688) .+.. +.|........ ||-.+-+...+.+-..++.|-... ++|.. -+.+.++.+ T Consensus 329 -p~vP-~~g~~~~~~g~-------~~G~vr~~~d~Qr~~N~~~s~~~~------~l~~~~~~~~~gav~~~d~~~~e~~~ 393 (714) T protein:vir:10 329 -PLVP-FWGYRKDKTGE-------PYGLISRAIPAQDEVNFRRIKLTW------LLQAKRVIMDEDATQLSDNDLMEQLE 393 (714) T ss_pred -eeEE-ecceeeeccCc-------cceehhhhhhHHHHHHHHHHHHHH------HHhCCceeeccccccccHHHHHHhcc Confidence 0000 01110000011 111111111111001111111000 11100 011111111 Q ss_pred hcCCCceeechhhhcchHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHH Q lcl|NC_020488. 360 LAPKAPWVAPAESIEGYEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDAS 435 (688) Q Consensus 360 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~ 435 (688) + +++.+.+..+. ......++.+-+ ....+.|+-.-.+++......+.+- ...|..... T Consensus 394 r-p~~vi~~~~~~--------~~~~~~~~~~~~---------~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na 455 (714) T protein:vir:10 394 R-PDGIIKLNPVR--------KNQKSVADVFRV---------EQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA 455 (714) T ss_pred C-CCCeEEecccc--------cccCCccccccc---------cCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcch Confidence 1 11111111100 000001111111 1112333333455555555444444 344554321 Q ss_pred cCCCcchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH---------HHHHc--CcceEEEEeccCCCccee Q lcl|NC_020488. 436 VGAQGNEQSGKAILARQRQGDRGTF-AYIDNLSRAIRRVGQILIEL---------IPRVY--DSDRVLRLRFQDGEGDWV 503 (688) Q Consensus 436 ~G~~~~~~sg~ai~~~~~~~~~~~~-~~~dn~~~~~~~~~~~~~~l---------i~~~~--~~~r~~ri~~~~~~~~~v 503 (688) .++-+.++. ...-..+...+. .+....++-.+.+..++... +-+.- ...+.+.+..+.+. .. 
T Consensus 456 --~SGvAI~~r--~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~--~~ 529 (714) T protein:vir:10 456 --TSGVAISNL--VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDN--GE 529 (714) T ss_pred --hHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCC--cc Confidence 112111111 111111111111 11111111122222222221 11111 12344444322222 12 Q ss_pred eechhhhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHH-----HhhHHHH---------HHHHHHHHH Q lcl|NC_020488. 504 QINQMVMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFV-----QAVPAAG---------GVVLDLIAK 569 (688) Q Consensus 504 ~~n~~~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~-----q~~~~~~---------~~~~~~~~e 569 (688) .+|. .....-||.+ ++. ...++......+.+..++..+ ...+.+. .-+...+.+ T Consensus 530 ~~nD--------i~~~~~dv~i---~~~-p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~ 597 (714) T protein:vir:10 530 LTND--------ISRLNTHIAL---APV-QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA 597 (714) T ss_pred cccc--------ceeeeEEEEE---eec-cCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHH Confidence 2222 1222345542 222 233454555444444333221 1111111 112222333 Q ss_pred hcCCccHHHHHHHHHhhccccccchhhHHhhhhhhhhh-hHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH- Q lcl|NC_020488. 570 NMDWPGAQDIARRLQKTLPPGILDQDEMEEAGIEPPQP-SPEQQANMAQAQADMEKAKADTAK----AQADMAMAQAKT- 643 (688) Q Consensus 570 ~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~q~~~~~~q~~~~~~q~e~~~----~q~e~~~~q~~~- 643 (688) .++.+...+.. .+...+.+.++.+.+.++.+. ..+.++..++.+++++++++...+ ++...+.++.+. T Consensus 598 ~~~~~~~~~~~------~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~ 671 (714) T protein:vir:10 598 ALGTPKSPDEM------TPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY 671 (714) T ss_pred HcCCCCCcccc------CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33433221110 000000011111111111111 122334444555555554443222 222222222211 Q ss_pred --HHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 
644 --AEAQAKLAEI-EQA-AMMAGPGSLEETVRNLVAEAMAELMA 682 (688) Q Consensus 644 --~~~~a~~~~~-~~~-a~~~~~~~~~~~~~~~~~~a~~~~~~ 682 (688) ...++..+.+ +.. .+.+..+..+++..+.+++.+.++-+ T Consensus 672 ~~~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 672 VDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHhcCC Confidence 1112222221 111 12222233333333333333333322 No 148 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=28.94 E-value=1.7 Score=19.32 Aligned_cols=527 Identities=10% Similarity=-0.044 Sum_probs=114.7 Q ss_pred CCCCCCCcCCCCccchHHHHHHHHHHHHHHHHhhhHHH-----------HHHHHHHHhhCCCCCCHHHHHHHHhcCCCce Q lcl|NC_020488. 1 MLPGNEPIKTRDDDSQEAILQEIRERAAHAVTCWKHNF-----------DAAQEDISFLAGEQWPESVRKEREDEGRPCL 69 (688) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r-----------~~~~~~~~~~~G~Qw~~~~~~~~~~~g~p~~ 69 (688) =+||..+. -.-+....+..+...+-.-.-....|. .++.+...++--+|+.. T Consensus 72 ~~~~rs~~---~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~-------------- 134 (651) T protein:vir:80 72 NADWRHKI---TTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTE-------------- 134 (651) T ss_pred CCCCCccc---cChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhc-------------- Confidence 01111111 111111122222211111110111110 00011111110000000 Q ss_pred eehhHHHHHHHHHHHHHhCCcce-EEEeCCccccccccccccccChhhHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Q lcl|NC_020488. 70 TLNKLPQYVDQVLGDQRQNRPAI-QVHPVEANATKDTSKVPNVAGTSDYSLAEVYESLIRNIEYTSNAEAHYDNAFQHAV 148 (688) Q Consensus 70 ~~N~i~~~i~~i~g~~~~~r~~~-~v~pr~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~i~~~~~~~~~~~~~~~~~~d~~ 148 (688) +-....+.+.+-+.....+-+ +|. =+.. ..+..+.. 
.+...+ T Consensus 135 --~~~~~~~~~~~~d~l~~G~~i~kv~-we~~------------~~~~~~~~----------------------~~~~~~ 177 (651) T protein:vir:80 135 --GKFRAAYANFLRQLLITGNSVLALP-WRVE------------TAEVKKKV----------------------QVRTPL 177 (651) T ss_pred --cCcHHHHHHHHHhhcccCceEEEEe-ecce------------eeeeehhe----------------------eccccc Confidence 000111111111111111111 000 0000 00000000 001111 Q ss_pred HcCCceEEE-----------EEeeccCCCCCcceeEEEecccceEEeCCcccccccccCceEEEEecCCHHHHHHh---- Q lcl|NC_020488. 149 EGGFGWLRV-----------LTKYSTDDAFDLDLCIKSIHNRFAVLMDPDATEPDYSDANWCFISERMSKAEFNKR---- 213 (688) Q Consensus 149 ~~G~G~~~v-----------~~~~~~~~~~~~~~~~~~v~~~~~v~~Dp~a~~~d~~Da~~~~~~~~~~~~e~~~~---- 213 (688) ..|.+.+.| .++..+...+-.++.+..+.+..-++.- ..+..++...-.-.+..+....+..+. T Consensus 178 ~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~-~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~ 256 (651) T protein:vir:80 178 FEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKL-TKTKADILNLLSEGYYYGVDPLDVVEHKCKD 256 (651) T ss_pred cccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeee-eeeHHHHHHHHhcccccchhhHHHHhhhccc Confidence 111111111 0111110101111111111111101100 000011111000001112222222211 Q ss_pred ---cCCccchhcccccccccccCCCCCEEEEEEEEeeeecc-eeeeeccCCceecccccchHHHHHHHhhhhhhheeeee Q lcl|NC_020488. 214 ---YPGKAVGDLSDAERGEYSWWTNEEGVRVSEYFYREPVT-RKLLLLSDGRTVWEDEVKDVLDELRDLGTTVTRERRVK 289 (688) Q Consensus 214 ---~p~~~~~~~~~~~~~~~~~~~~~~~v~v~e~~~~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 289 (688) +|..........+...++....-..+.....+..+... ..++....|..+.....++ T Consensus 257 ~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~------------------- 317 (651) T protein:vir:80 257 TSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNP------------------- 317 (651) T ss_pred cccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccC------------------- Confidence 11111122111111111110000000000000000000 0011111222111000000 Q ss_pred EEEEEEEEEchhhhcccCCCCCCCccceEEEeeeeeccCCcccccchHHHhhHHHHHHHHHHHHHHHHHHh---cCCCce Q lcl|NC_020488. 
290 TYKVKWMKVTAYDVLEGPVDWPGSTIPVAPVLGKEMVIGDKTYYRGLIRFGKDAQRMHNYWMTAATERVAL---APKAPW 366 (688) Q Consensus 290 ~~~v~~~~~~~~~ile~~~p~~~~~~P~vp~~~~~~~~~~~~~g~g~v~~~~d~Q~~~N~~~s~~~~~~~~---~~~~~~ 366 (688) .-...||- .+++.|.-+. ..|..++.-+....+-+....|.....+.-..+- ...+. T Consensus 318 --------------~~~~~Pf~--~~~~~~~~~~---~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~- 377 (651) T protein:vir:80 318 --------------YWCGRPFV--IGTYIPTARQ---PYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDG- 377 (651) T ss_pred --------------CCCCCCee--eecceecCcc---ccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCc- Confidence 00112332 2333333332 2455555566666565555555554433322211 11222 Q ss_pred eechhhhcc-hHHHHhhcccCCCceeecCcccccccceecCCCcchHHHHHHHHHHHHHHH----HHhCcChHHcCCCcc Q lcl|NC_020488. 367 VAPAESIEG-YEEEWNQANRKNQSVLRYNAIPGVDRPQRDMPASMPAAELQLALSATDEMK----ATIGLYDASVGAQGN 441 (688) Q Consensus 367 ~~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~----~~tGv~d~~~G~~~~ 441 (688) +...+.+.. +-..+ ..+..+ .+.+.... .+.++.....++.++.....+- -..|++..+.|..+. T Consensus 378 ~~~~~~l~~~pg~vi-~~~~~~-~~~~l~~~--------~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TA 447 (651) T protein:vir:80 378 LLQPEDVYTEPGKVF-LVSDHG-DLQPLANQ--------SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTA 447 (651) T ss_pred cccHHHhhcCCCceE-EecCCC-CceeeccC--------cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccH Confidence 222222211 11111 122222 22222111 1223444555666665555554 346888888887543 Q ss_pred -hhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-----HHHHcCcce----EEEEeccCCC--cceeeechh Q lcl|NC_020488. 442 -EQSGKAILARQRQGDRGTFAYIDNL-SRAIRRVGQILIEL-----IPRVYDSDR----VLRLRFQDGE--GDWVQINQM 508 (688) Q Consensus 442 -~~sg~ai~~~~~~~~~~~~~~~dn~-~~~~~~~~~~~~~l-----i~~~~~~~r----~~ri~~~~~~--~~~v~~n~~ 508 (688) +.+..+....+.-+ .....+.+.+ ...+++++.++... +.++...+- .+.|..++.. .+.+.+... 
T Consensus 448 teI~~~~~~~~~~l~-~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~ 526 (651) T protein:vir:80 448 AEVAAVREAGGNRLS-GIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSD 526 (651) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHH Confidence 44555433333222 2333444444 44556666766653 223333221 1122222211 112222211 Q ss_pred hhcccccceeeeccceeeeEEEEEecccCcHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCccHHHHHHHHHhhcc Q lcl|NC_020488. 509 VMDEETQKPVLVNDIAAGKFDVTVKAGPSYQTQRMEAADSLMQFVQAVPAAGGVVLDLIAKNMDWPGAQDIARRLQKTLP 588 (688) Q Consensus 509 ~~~~~~~~~~~~ndi~~~~~dv~v~~~~~~~s~r~~~~~~l~~~~q~~~~~~~~~~~~~~e~~~~~~~~ei~~~~~~~~~ 588 (688) ........ ..++.. -+- .+...|.... .......+..++..++ ...... ++...+ ... ... .. T Consensus 527 ~~~~r~~~---~~~l~~-~~q-~~~~~p~~~~-~~~~~~~~~~l~~~~g--~~~~~~-~l~~~~-----q~~-~~~-~~- 589 (651) T protein:vir:80 527 HVIERKQY---IEDRLT-FIQ-AVAQVPEMGQ-LVDYKRILVDLLQHWG--FEEPEA-YLKQQD-----QQA-PAN-PQ- 589 (651) T ss_pred HHHHHHHH---HHHHHH-HHH-hhccCCccch-hhhHHHHHHHHHHHcC--CCCcHH-hcCCCc-----cch-hhh-hh- Confidence 10000000 000000 000 0000000000 0001111111221110 000011 111100 000 000 00 Q ss_pred ccccchhhHHhhhhhhhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020488. 589 PGILDQDEMEEAGIEPPQPSPEQQ-ANMAQAQADMEKAKADTAKAQADMAMAQAKTAEAQAKLAEIEQAA 657 (688) Q Consensus 589 ~~~~~~~~~~~~~~~~~~~~~~~q-~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~~~~~~a~~~~~~~~a 657 (688) +... .+++....+++++ ++.+.++.+..++++++.+++++.... 
.+....+..+.+.+..+ T Consensus 590 ~~~~-------~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 590 EALL-------SQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTPNADQMQ-QELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHH-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhccC Confidence 0000 0000000000000 000001111112222222222211100 00001111111111111 No 149 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=23.83 E-value=2.3 Score=18.65 Aligned_cols=94 Identities=13% Similarity=0.194 Sum_probs=8.6 Q ss_pred HHHHhhccccccchhhHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHH Q lcl|NC_020488. 581 RRLQKTLPPGILDQDEMEEAGIEPPQPSPEQQANMAQAQADMEKAKADTAKAQADMAMAQAK-------TAEAQAKLAEI 653 (688) Q Consensus 581 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~q~~~~~~q~e~~~~q~e~~~~q~~-------~~~~~a~~~~~ 653 (688) -.+|+...+....+. .....+..+...++.+...++ ....+.++.+.+ ....++...++ T Consensus 1 ~~~~~~~l~~~~~~~-------------~~~l~el~e~~~~l~k~~~el-~~~l~ea~~~ee~~~~ee~i~~l~~~~~el 66 (466) T protein:vir:80 1 MALRQLMLAKKIEQR-------------KAALAELLEQEKALQKRSEEL-EAAIDEANTDEEIAVVEDEINKLEGEKTEL 66 (466) T ss_pred CchHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHH-HHHHHhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 111111110000000 000000000001111110010 001111111111 01111111111 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHhhcCC Q lcl|NC_020488. 654 EQA--AMMAGPGSLEETVRNLVAEA-----MAELMAQSQGNA 688 (688) Q Consensus 654 ~~~--a~~~~~~~~~~~~~~~~~~a-----~~~~~~~~q~~~ 688 (688) +.+ ........++..+....... ........+.+. T Consensus 67 ~e~~~~l~~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~ 108 (466) T protein:vir:80 67 EEKKSKLEGEIKELENELEQLNNKEPKNNSEPAQVSGARTQQ 108 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhH Confidence 000 00000000010000000000 000000000000 Done!