Query lcl|NC_013059.1_cdsid_YP_003090220.1 [gene=gp1] [protein=Portal protein] [protein_id=YP_003090220.1] [location=1966..4143] Match_columns 725 No_of_seqs 191 out of 245 Neff 8.4 Searched_HMMs 1612 Date Thu Nov 7 13:22:48 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9263 Length: 725 # 100.0 2E-222 1E-225 1236.3 76.8 725 1-725 1-725 (725) 2 protein:vir:100920 Length: 725 100.0 1E-221 6E-225 1232.1 76.6 725 1-725 1-725 (725) 3 protein:vir:77597 Length: 725 100.0 2E-220 9E-224 1225.7 78.6 725 1-725 1-725 (725) 4 protein:vir:3520 Length: 720 # 100.0 3E-194 2E-197 1081.9 72.8 704 1-710 1-720 (720) 5 protein:vir:172 Length: 708 # 100.0 8E-191 5E-194 1062.6 68.7 692 1-723 1-708 (708) 6 protein:vir:105429 Length: 708 100.0 2E-190 1E-193 1060.2 69.6 691 1-725 1-707 (708) 7 protein:vir:105520 Length: 706 100.0 3E-188 2E-191 1048.7 64.9 686 1-725 1-702 (706) 8 protein:vir:108295 Length: 711 100.0 6E-180 3E-183 1003.3 72.1 662 1-700 23-711 (711) 9 protein:vir:10117 Length: 714 100.0 9E-166 6E-169 925.5 71.2 663 1-707 8-714 (714) 10 protein:vir:2764 Length: 714 # 100.0 9E-166 6E-169 925.5 71.2 663 1-707 8-714 (714) 11 protein:vir:3296 Length: 714 # 100.0 9E-166 6E-169 925.5 71.2 663 1-707 8-714 (714) 12 protein:vir:817 Length: 714 # 100.0 9E-166 6E-169 925.5 71.2 663 1-707 8-714 (714) 13 protein:vir:9950 Length: 714 # 100.0 9E-166 6E-169 925.5 71.2 663 1-707 8-714 (714) 14 protein:vir:104437 Length: 714 100.0 8E-165 5E-168 920.2 68.5 663 1-707 1-714 (714) 15 protein:vir:105619 Length: 772 100.0 8E-166 5E-169 925.6 63.0 679 1-725 11-738 (772) 16 protein:vir:93630 Length: 776 100.0 2E-153 1E-156 858.3 53.9 670 1-725 38-745 (776) 17 protein:vir:95821 Length: 763 100.0 2.3E-82 1.5E-85 468.1 59.0 620 1-725 20-746 (763) 18 protein:vir:8846 Length: 705 # 100.0 4.2E-80 2.6E-83 455.8 62.3 615 1-713 1-705 (705) 19 protein:vir:80165 Length: 651 100.0 9.2E-55 5.7E-58 316.8 51.9 577 1-691 15-651 (651) 20 protein:vir:95449 Length: 584 100.0 6.5E-37 4E-40 219.0 38.6 542 1-631 1-584 (584) 21 protein:vir:94599 Length: 641 100.0 1.2E-35 7.5E-39 212.0 38.5 566 1-691 20-641 (641) 22 protein:vir:345 Length: 663 # 100.0 2.3E-34 1.4E-37 205.0 42.6 605 1-692 1-663 (663) 23 protein:vir:3139 Length: 599 # 100.0 3.6E-31 2.2E-34 187.5 35.3 541 1-659 1-599 (599) 24 protein:vir:96494 Length: 501 99.8 6.4E-20 4E-23 125.8 33.2 457 1-635 37-501 (501) 25 protein:vir:2732 Length: 501 # 99.8 8.6E-19 5.3E-22 119.6 32.6 458 1-635 37-501 (501) 26 protein:vir:4898 Length: 502 # 99.8 3.8E-18 2.4E-21 116.1 31.7 464 1-635 31-502 (502) 27 protein:vir:2198 Length: 536 # 99.8 2.3E-16 1.4E-19 106.3 41.0 511 1-669 1-536 (536) 28 protein:vir:10447 Length: 536 99.8 3.3E-16 2.1E-19 105.4 41.4 511 1-669 1-536 (536) 29 protein:vir:1538 Length: 535 # 99.8 4.2E-16 2.6E-19 104.9 40.4 506 1-658 1-535 (535) 30 protein:vir:103765 Length: 549 99.8 1.5E-15 9.5E-19 101.8 42.5 524 1-653 1-549 (549) 31 protein:vir:733 Length: 453 # 99.7 9.9E-17 6.1E-20 108.3 35.1 436 1-625 11-453 (453) 32 protein:vir:102950 Length: 471 99.7 1.3E-16 8.2E-20 107.6 35.6 451 1-623 1-471 (471) 33 protein:vir:3361 Length: 535 # 99.7 1.6E-15 1E-18 101.7 41.0 510 1-660 1-535 (535) 34 protein:vir:96240 Length: 511 99.7 6.3E-17 3.9E-20 109.4 33.1 469 1-631 31-511 (511) 35 protein:vir:97171 Length: 512 99.7 9.5E-17 5.9E-20 108.4 33.7 469 1-635 31-512 (512) 36 protein:vir:3609 Length: 452 # 99.7 2.5E-16 1.6E-19 106.1 35.5 436 1-621 11-452 (452) 37 protein:vir:99522 Length: 470 99.7 9.2E-17 5.7E-20 108.5 32.6 443 1-630 19-470 (470) 38 protein:vir:94546 Length: 506 99.7 1.2E-16 7.4E-20 107.9 33.1 454 1-631 16-506 (506) 39 protein:vir:103951 Length: 511 99.7 1.4E-16 8.5E-20 107.6 33.4 470 1-631 31-511 (511) 40 protein:vir:93747 Length: 472 99.7 2.2E-16 1.4E-19 106.4 34.5 444 1-633 18-472 (472) 41 protein:vir:9871 Length: 429 # 99.7 6E-16 3.7E-19 104.0 36.4 423 1-607 1-429 (429) 42 protein:vir:1236 Length: 483 # 99.7 3.4E-16 2.1E-19 105.4 34.7 443 1-628 29-483 (483) 43 protein:vir:95315 Length: 559 99.7 6.7E-15 4.2E-18 98.3 41.3 532 1-662 1-559 (559) 44 protein:vir:3964 Length: 453 # 99.7 3.3E-16 2.1E-19 105.4 34.1 437 1-607 11-453 (453) 45 protein:vir:97336 Length: 492 99.7 7.3E-16 4.5E-19 103.6 35.7 442 1-628 35-492 (492) 46 protein:vir:7321 Length: 556 # 99.7 1.2E-14 7.7E-18 96.8 43.5 529 1-662 1-556 (556) 47 protein:vir:78805 Length: 511 99.7 2.3E-16 1.4E-19 106.3 32.4 467 1-635 31-511 (511) 48 protein:vir:96366 Length: 511 99.7 2.3E-16 1.4E-19 106.3 32.4 467 1-635 31-511 (511) 49 protein:vir:9922 Length: 489 # 99.7 1.5E-15 9E-19 101.9 36.7 456 1-604 1-489 (489) 50 protein:vir:94805 Length: 492 99.7 7.2E-16 4.4E-19 103.6 34.9 443 1-628 35-492 (492) 51 protein:vir:94101 Length: 474 99.7 1.2E-16 7.3E-20 107.9 30.4 445 1-628 1-474 (474) 52 protein:vir:105889 Length: 474 99.7 1.2E-16 7.3E-20 107.9 30.4 445 1-628 1-474 (474) 53 protein:vir:95806 Length: 440 99.7 1.1E-15 6.6E-19 102.7 35.5 430 11-621 1-440 (440) 54 protein:vir:102330 Length: 451 99.7 5.4E-16 3.3E-19 104.3 33.8 437 3-621 1-451 (451) 55 protein:vir:106639 Length: 481 99.7 7.9E-16 4.9E-19 103.4 34.3 445 1-620 23-481 (481) 56 protein:vir:99781 Length: 511 99.7 3.8E-16 2.3E-19 105.1 32.3 467 1-631 31-511 (511) 57 protein:vir:99672 Length: 532 99.7 2E-14 1.3E-17 95.7 40.4 503 1-644 1-532 (532) 58 protein:vir:105461 Length: 470 99.7 4.5E-16 2.8E-19 104.7 31.1 455 3-624 1-470 (470) 59 protein:vir:9306 Length: 511 # 99.7 6.8E-16 4.2E-19 103.7 31.9 469 1-631 31-511 (511) 60 protein:vir:98506 Length: 555 99.7 3.6E-14 2.2E-17 94.3 42.0 532 1-646 1-555 (555) 61 protein:vir:107822 Length: 555 99.7 3.6E-14 2.2E-17 94.3 42.0 532 1-646 1-555 (555) 62 protein:vir:107404 Length: 555 99.7 3.6E-14 2.2E-17 94.3 42.0 532 1-646 1-555 (555) 63 protein:vir:80680 Length: 441 99.7 3E-15 1.9E-18 100.2 35.0 428 1-630 1-441 (441) 64 protein:vir:107112 Length: 478 99.7 1.3E-15 8.3E-19 102.1 32.8 450 1-631 1-478 (478) 65 protein:vir:5961 Length: 503 # 99.7 8.6E-16 5.3E-19 103.2 31.3 469 1-631 1-503 (503) 66 protein:vir:94709 Length: 522 99.7 4.9E-14 3.1E-17 93.5 42.7 500 1-657 1-522 (522) 67 protein:vir:105292 Length: 478 99.7 2.5E-15 1.6E-18 100.6 33.6 448 1-621 1-478 (478) 68 protein:vir:106571 Length: 499 99.7 2.2E-16 1.3E-19 106.5 27.6 485 1-637 1-499 (499) 69 protein:vir:8883 Length: 543 # 99.7 6.2E-14 3.8E-17 93.0 40.6 514 1-663 1-543 (543) 70 protein:vir:2341 Length: 488 # 99.7 8.1E-16 5.1E-19 103.3 28.7 463 1-641 1-488 (488) 71 protein:vir:96179 Length: 468 99.7 1.8E-14 1.1E-17 95.9 36.0 435 1-607 1-468 (468) 72 protein:vir:1785 Length: 555 # 99.7 1E-13 6.2E-17 91.8 40.7 521 4-659 1-555 (555) 73 protein:vir:96839 Length: 474 99.6 9.1E-15 5.6E-18 97.6 33.6 444 1-628 1-474 (474) 74 protein:vir:95113 Length: 474 99.6 1.6E-14 9.8E-18 96.3 33.7 441 1-607 21-474 (474) 75 protein:vir:78537 Length: 480 99.6 4.1E-15 2.5E-18 99.5 30.1 466 1-635 1-480 (480) 76 protein:vir:102668 Length: 547 99.6 2.3E-13 1.4E-16 89.9 41.4 524 6-642 1-547 (547) 77 protein:vir:2427 Length: 485 # 99.6 8.5E-15 5.3E-18 97.7 30.4 463 1-630 6-485 (485) 78 protein:vir:78227 Length: 480 99.6 1.7E-14 1.1E-17 96.1 32.0 466 1-635 1-480 (480) 79 protein:vir:104082 Length: 485 99.6 1.5E-14 9.5E-18 96.3 30.0 462 1-637 10-485 (485) 80 protein:vir:96266 Length: 474 99.6 4.7E-14 2.9E-17 93.6 31.8 440 1-626 21-474 (474) 81 protein:vir:95899 Length: 474 99.6 4.7E-14 2.9E-17 93.6 31.8 440 1-626 21-474 (474) 82 protein:vir:4223 Length: 486 # 99.6 1.7E-14 1E-17 96.1 29.1 464 1-643 8-486 (486) 83 protein:vir:7768 Length: 484 # 99.6 1E-14 6.4E-18 97.2 27.9 463 1-644 1-484 (484) 84 protein:vir:94498 Length: 474 99.6 9.5E-14 5.9E-17 92.0 32.8 443 1-628 13-474 (474) 85 protein:vir:97447 Length: 474 99.6 9.5E-14 5.9E-17 92.0 32.8 443 1-628 13-474 (474) 86 protein:vir:79043 Length: 479 99.6 3E-14 1.9E-17 94.7 29.9 451 1-625 14-479 (479) 87 protein:vir:94572 Length: 535 99.6 8.1E-13 5E-16 86.9 41.0 509 1-659 1-535 (535) 88 protein:vir:94742 Length: 409 99.5 5.4E-13 3.3E-16 87.8 33.8 392 1-561 1-409 (409) 89 protein:vir:9568 Length: 410 # 99.5 4.5E-13 2.8E-16 88.3 32.6 393 22-603 1-410 (410) 90 protein:vir:9751 Length: 422 # 99.5 3.9E-12 2.4E-15 83.1 35.7 406 1-580 1-422 (422) 91 protein:vir:96988 Length: 516 99.5 5.8E-12 3.6E-15 82.2 37.4 492 1-648 1-516 (516) 92 protein:vir:38 Length: 496 # N 99.5 6E-12 3.7E-15 82.1 38.4 454 1-589 1-496 (496) 93 protein:vir:99072 Length: 479 99.5 8.8E-13 5.5E-16 86.7 30.5 462 1-641 1-479 (479) 94 protein:vir:1634 Length: 409 # 99.5 6.1E-12 3.8E-15 82.1 35.0 394 1-561 1-409 (409) 95 protein:vir:100039 Length: 522 99.5 7.1E-12 4.4E-15 81.7 36.2 497 6-667 1-522 (522) 96 protein:vir:78083 Length: 537 99.5 3.4E-12 2.1E-15 83.5 33.3 500 1-641 1-537 (537) 97 protein:vir:99916 Length: 504 99.5 3.7E-12 2.3E-15 83.3 32.7 450 1-613 18-504 (504) 98 protein:vir:2500 Length: 501 # 99.5 3.8E-12 2.4E-15 83.2 32.0 465 1-641 22-501 (501) 99 protein:vir:105819 Length: 456 99.4 4.8E-12 3E-15 82.7 31.9 437 1-624 1-456 (456) 100 protein:vir:102602 Length: 456 99.4 4.8E-12 3E-15 82.7 31.9 437 1-624 1-456 (456) 101 protein:vir:8184 Length: 474 # 99.4 2.3E-11 1.4E-14 78.9 36.7 442 1-626 12-474 (474) 102 protein:vir:7987 Length: 456 # 99.4 9.2E-12 5.7E-15 81.1 31.4 441 1-624 1-456 (456) 103 protein:vir:78696 Length: 542 99.4 3.8E-11 2.4E-14 77.7 34.9 513 4-659 1-542 (542) 104 protein:vir:78942 Length: 510 99.3 8E-11 4.9E-14 76.0 40.3 487 4-629 1-510 (510) 105 protein:vir:78907 Length: 518 99.3 5.3E-11 3.3E-14 76.9 31.3 488 1-602 1-518 (518) 106 protein:vir:103330 Length: 517 99.3 1E-10 6.3E-14 75.4 39.1 493 1-665 5-517 (517) 107 protein:vir:80959 Length: 499 99.3 1.1E-10 6.6E-14 75.3 35.5 466 1-605 1-499 (499) 108 protein:vir:80211 Length: 514 99.3 1.3E-10 8.1E-14 74.8 40.4 489 4-632 1-514 (514) 109 protein:vir:105641 Length: 516 99.3 1.5E-10 9.3E-14 74.4 37.4 491 1-641 1-516 (516) 110 protein:vir:98444 Length: 434 99.3 5.5E-11 3.4E-14 76.8 28.5 425 40-629 1-434 (434) 111 protein:vir:6322 Length: 510 # 99.3 1.9E-10 1.2E-13 73.8 39.9 488 4-629 1-510 (510) 112 protein:vir:3028 Length: 500 # 99.2 3.7E-10 2.3E-13 72.3 35.0 463 1-595 1-500 (500) 113 protein:vir:9815 Length: 500 # 99.2 3.7E-10 2.3E-13 72.3 35.0 463 1-595 1-500 (500) 114 protein:vir:7017 Length: 515 # 99.2 1E-09 6.5E-13 69.8 38.9 492 1-648 1-515 (515) 115 protein:vir:8846 Length: 705 # 99.1 1.6E-09 9.7E-13 68.9 30.6 643 10-725 1-695 (705) 116 protein:vir:1587 Length: 508 # 99.1 2.1E-09 1.3E-12 68.1 37.0 469 1-588 1-508 (508) 117 protein:vir:98883 Length: 517 99.0 4.1E-09 2.5E-12 66.6 33.9 485 1-607 1-517 (517) 118 protein:vir:79703 Length: 505 99.0 8E-09 5E-12 65.0 39.3 470 1-614 1-505 (505) 119 protein:vir:100920 Length: 725 98.8 3E-08 1.9E-11 61.8 42.3 657 22-725 1-711 (725) 120 protein:vir:101494 Length: 527 98.8 4.2E-08 2.6E-11 61.0 28.7 492 1-607 1-527 (527) 121 protein:vir:102239 Length: 527 98.8 4.9E-08 3E-11 60.7 28.8 494 1-607 1-527 (527) 122 protein:vir:9263 Length: 725 # 98.7 1.2E-07 7.3E-11 58.6 46.3 653 22-725 1-711 (725) 123 protein:vir:7430 Length: 563 # 98.7 1.3E-07 8.3E-11 58.3 31.2 529 1-624 1-563 (563) 124 protein:vir:3520 Length: 720 # 98.1 4.3E-06 2.7E-09 50.0 44.7 654 4-725 1-720 (720) 125 protein:vir:4782 Length: 522 # 98.0 7.8E-06 4.8E-09 48.6 31.7 486 1-609 1-522 (522) 126 protein:vir:108295 Length: 711 97.8 1.6E-05 9.8E-09 46.9 29.9 623 1-715 1-711 (711) 127 protein:vir:77597 Length: 725 97.8 2.1E-05 1.3E-08 46.2 48.0 649 22-725 1-717 (725) 128 protein:vir:3296 Length: 714 # 97.6 3.3E-05 2.1E-08 45.1 33.3 621 1-717 1-714 (714) 129 protein:vir:817 Length: 714 # 97.6 3.3E-05 2.1E-08 45.1 33.3 621 1-717 1-714 (714) 130 protein:vir:2764 Length: 714 # 97.6 3.3E-05 2.1E-08 45.1 33.3 621 1-717 1-714 (714) 131 protein:vir:10117 Length: 714 97.6 3.3E-05 2.1E-08 45.1 33.3 621 1-717 1-714 (714) 132 protein:vir:9950 Length: 714 # 97.6 3.3E-05 2.1E-08 45.1 33.3 621 1-717 1-714 (714) 133 protein:vir:78393 Length: 489 97.6 4.2E-05 2.6E-08 44.6 26.4 460 1-601 1-489 (489) 134 protein:vir:105429 Length: 708 97.2 0.00015 9.4E-08 41.5 41.9 626 4-725 1-700 (708) 135 protein:vir:104437 Length: 714 96.9 0.00026 1.6E-07 40.3 32.5 616 15-717 1-714 (714) 136 protein:vir:172 Length: 708 # 96.9 0.00026 1.6E-07 40.2 32.8 569 1-718 90-708 (708) 137 protein:vir:105520 Length: 706 96.7 0.00042 2.6E-07 39.1 42.4 621 4-725 1-699 (706) 138 protein:vir:93630 Length: 776 96.6 0.00045 2.8E-07 38.9 25.9 624 1-725 22-728 (776) 139 protein:vir:95014 Length: 491 96.4 0.00068 4.2E-07 38.0 31.6 462 1-602 1-491 (491) 140 protein:vir:95149 Length: 501 96.1 0.001 6.4E-07 36.9 29.3 464 1-603 1-501 (501) 141 protein:vir:95821 Length: 763 95.4 0.0022 1.4E-06 35.1 27.3 579 1-725 113-750 (763) 142 protein:vir:96403 Length: 666 94.8 0.0035 2.2E-06 34.0 14.8 550 1-633 1-666 (666) 143 protein:vir:94956 Length: 452 93.7 0.0066 4.1E-06 32.5 33.2 444 1-591 1-452 (452) 144 protein:vir:103385 Length: 666 93.0 0.0092 5.7E-06 31.7 15.5 554 1-633 1-666 (666) 145 protein:vir:1084 Length: 437 # 82.5 0.077 4.7E-05 26.7 15.8 149 560-725 1-165 (437) 146 protein:vir:105619 Length: 772 78.6 0.11 7E-05 25.8 27.4 626 1-725 1-726 (772) 147 protein:vir:1084 Length: 437 # 55.2 0.49 0.0003 22.3 18.7 162 548-725 1-170 (437) 148 protein:vir:97265 Length: 513 21.0 2.7 0.0017 18.2 29.9 467 1-608 1-513 (513) No 1 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=1.8e-222 Score=1236.32 Aligned_cols=725 Identities=99% Similarity=1.426 Sum_probs=710.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCcceEE Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~~~~~ 80 (725) |||++.+|++++.+|+++++++++||++|.+|++||+|+||+++++++|+.+||||||+|+|+|++|+|++++||++++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~e~~nr~d~~v 80 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHHhCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhheeeCC Q lcl|NC_013059. 81 RPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) Q Consensus 81 ~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp 160 (725) +||+++|+++|++||++++|+++.|++++++|+||+++|+||+||++|++||.++|+|++++.|+++++|+|+++||||| T Consensus 81 ~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~V~~Dp 160 (725) T protein:vir:92 81 RPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) T ss_pred ecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhhcccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccc Q lcl|NC_013059. 161 NSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) Q Consensus 161 ~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~ 240 (725) +|+++|+|||+|||+++|||+++++.++|.|+.+..+..++.++++|.++|+++++|||+|||+|+++.++++.+.|++| T Consensus 161 ~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~d~~~ 240 (725) T protein:vir:92 161 NSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) T ss_pred hhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEeecCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhh Q lcl|NC_013059. 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) Q Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~v 320 (725) |++++|++.++.+++..+++.|++++..++++++||+|++++|+++|++++||||++||||||||++.+++|++++||+| T Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~v 320 (725) T protein:vir:92 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) T ss_pred CceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCCCCchHH Q lcl|NC_013059. 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) Q Consensus 321 r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 400 (725) |+||||||++|+++|+++|+++++++++++++++++++++.+|+..+..+++++|++..++|.++..++++++++++|++ T Consensus 321 r~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~ 400 (725) T protein:vir:92 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) T ss_pred ccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccccccccCCcccCCCCchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Q lcl|NC_013059. 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVI 480 (725) Q Consensus 401 ~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI 480 (725) +++||+.+..+|+++||+|++++|+.||++||+||++++++|++.|++|||||+++++++|+++|+||++|||++|+||| T Consensus 401 ~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI 480 (725) T protein:vir:92 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTI 480 (725) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccC Q lcl|NC_013059. 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) Q Consensus 481 ~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~ 560 (725) +|+||+.++|.||++++++.+|.++++|||+|+|||+|++||+++|+|+++++.|++|++++|+.+|+.+.+++.+++++ T Consensus 481 ~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~ 560 (725) T protein:vir:92 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) T ss_pred ecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) Q Consensus 561 d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q 640 (725) |+|++++++++++++..+....++.+++.+++..++++++++++++++.++++..+++++++++++++..+.++++.+.+ T Consensus 561 d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~ 640 (725) T protein:vir:92 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) T ss_pred cchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999998888989999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc Q lcl|NC_013059. 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSV 720 (725) Q Consensus 641 ~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~ 720 (725) +++++++++.+++..++.+.+++.++++.+..++++++++..++..||+.+++.++.++++++.++..++++.+||++++ T Consensus 641 ~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 720 (725) T protein:vir:92 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPSGSV 720 (725) T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhcchhccCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCC Q lcl|NC_013059. 721 AETPQ 725 (725) Q Consensus 721 ~~~~q 725 (725) +++|| T Consensus 721 ~~~~~ 725 (725) T protein:vir:92 721 AETPQ 725 (725) T ss_pred ccCCC Confidence 99999 No 2 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=1e-221 Score=1232.13 Aligned_cols=725 Identities=99% Similarity=1.426 Sum_probs=704.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCcceEE Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~~~~~ 80 (725) |||++.+|++++.+|+++++++++||++|.+|++||+|+||+++++++|+.+||||||+|+|+||+|+|++++||++++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp~~N~i~~~v~~v~g~e~~nr~d~~v 80 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHHhCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhheeeCC Q lcl|NC_013059. 81 RPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) Q Consensus 81 ~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp 160 (725) +||+++|.++|++||++++|+++.|++++++|+||+++|+||+||++|++||+++|+|++++.|+++++|+||++||||| T Consensus 81 ~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v~~Dp 160 (725) T protein:vir:10 81 RPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) T ss_pred ecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHcccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccc Q lcl|NC_013059. 161 NSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) Q Consensus 161 ~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~ 240 (725) +|+++|+|||||||+++|||++.+..+...|+.+..+..++.++++|++.|+++++|||+|||+|++++++++++.|++| T Consensus 161 ~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~d~~~ 240 (725) T protein:vir:10 161 NSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) T ss_pred hhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEEEEeccCCC Confidence 99999999999999999999987766555566677777788889999999999999999999999999999999999999 Q ss_pred cceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhh Q lcl|NC_013059. 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) Q Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~v 320 (725) |++++|++.++.+++..+++.|++++..++++++||+|++++|+++|++++||||+|||||||||++.+++|++++||+| T Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~~~G~v 320 (725) T protein:vir:10 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) T ss_pred CceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcceeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCCCCchHH Q lcl|NC_013059. 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) Q Consensus 321 r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 400 (725) |+||||||++|+++|+++|+++++++++++++++++++++++|+..+..+++++|+++.++|.++..++++++++++|++ T Consensus 321 r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~ 400 (725) T protein:vir:10 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) T ss_pred ccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCcccCCCCchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Q lcl|NC_013059. 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVI 480 (725) Q Consensus 401 ~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI 480 (725) +++||+.+..+|+++||+|++++|+.||++||+||++++++|++.|++|||||+++++++|+++|+||++|||++|+||| T Consensus 401 ~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI 480 (725) T protein:vir:10 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTI 480 (725) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccC Q lcl|NC_013059. 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) Q Consensus 481 ~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~ 560 (725) +|+||+.++|.||++++|+.+|+++++|||+|+|||+|++||+++|+|+++++.|++|++++|+.+|+.+.+++.+++++ T Consensus 481 ~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~ 560 (725) T protein:vir:10 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) T ss_pred ecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) Q Consensus 561 d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q 640 (725) |+|++++++++++++..++...++.+++.+++.+++++++++++++++.+++++.+++++++++++++..+.++++.+.+ T Consensus 561 d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~ 640 (725) T protein:vir:10 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) T ss_pred CchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999988888889899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc Q lcl|NC_013059. 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSV 720 (725) Q Consensus 641 ~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~ 720 (725) +++++++++.+++..++...+.+.++++++..+.++++..+.+++.+|+.++++++.++|+++.+|..++++++||++++ T Consensus 641 ~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~ 720 (725) T protein:vir:10 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPSGSV 720 (725) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhhhhccccccccCCCccc Confidence 99999999999999999888889999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCC Q lcl|NC_013059. 721 AETPQ 725 (725) Q Consensus 721 ~~~~q 725 (725) +++|| T Consensus 721 ~~~~~ 725 (725) T protein:vir:10 721 AETPQ 725 (725) T ss_pred ccCCC Confidence 99999 No 3 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=1.5e-220 Score=1225.70 Aligned_cols=725 Identities=98% Similarity=1.421 Sum_probs=709.2 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCcceEE Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~~~~~ 80 (725) |||++.+|++++.+|+++++++++||++|.+|++||+|+||+++++++|+.+||||||+|+|+|++|+|++++||++++| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~~~~nr~d~~v 80 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCccccHHHHHHHHHhhHHhCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhheeeCC Q lcl|NC_013059. 81 RPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) Q Consensus 81 ~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp 160 (725) +||+++|+++|++||++++|+++.|++++++|+||+++|+||+||++|++||+++|+|++++.|+++++|+||++||||| T Consensus 81 ~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v~~Dp 160 (725) T protein:vir:77 81 RPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDS 160 (725) T ss_pred ecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhceeCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccc Q lcl|NC_013059. 161 NSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) Q Consensus 161 ~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~ 240 (725) +|+++|+|||+|||+++|||+|+++.+||.|+.+..+..++.++.+++++|+++++|||+|||+|++++++++.+.||+| T Consensus 161 ~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~~~~~~~~~t 240 (725) T protein:vir:77 161 NSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVT 240 (725) T ss_pred hhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeEEEEecCCCC Confidence 99999999999999999999999999999999998888999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhh Q lcl|NC_013059. 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) Q Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~v 320 (725) |++++|++.++.+++..+++.|++++..++++++||+|+++.|+++|++++||||++||||||||++.+++|++++||+| T Consensus 241 g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~v 320 (725) T protein:vir:77 241 GEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVV 320 (725) T ss_pred cceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCcccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCCCCchHH Q lcl|NC_013059. 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) Q Consensus 321 r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 400 (725) |+|||+||++|+++|+++|+++++++.++++.++++++++.+|+.++..++.+++.+..++|.++.+++++++++++|++ T Consensus 321 r~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~ 400 (725) T protein:vir:77 321 RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQA 400 (725) T ss_pred hhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCcccccCccccCCCCchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Q lcl|NC_013059. 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVI 480 (725) Q Consensus 401 ~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI 480 (725) +++||+.+..+|+++||+|++++|..||++||+||++++++|++.|++|||||+++++++|+++|+||++|||++|+||| T Consensus 401 ~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI 480 (725) T protein:vir:77 401 NAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTI 480 (725) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccC Q lcl|NC_013059. 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) Q Consensus 481 ~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~ 560 (725) +|++|++++|.||.++.++.+|.++++|||+|+|||+|++||+++|+|+++++.|++|++++|+.+|+.+.+++.+++++ T Consensus 481 ~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~ 560 (725) T protein:vir:77 481 TLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) T ss_pred ecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) Q Consensus 561 d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q 640 (725) |+|++++++++++++..+....++.+++.++..++++++++.+++++++++++...+++++.++++++..+.+.++.+++ T Consensus 561 d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~ 640 (725) T protein:vir:77 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVE 640 (725) T ss_pred cchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999988888888888889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc Q lcl|NC_013059. 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSV 720 (725) Q Consensus 641 ~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~ 720 (725) +++++++++.+++..++..++.+.++++++....++++++..+++.+|+.++++.+.++|+.+.++..++++.+||++++ T Consensus 641 ~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~ 720 (725) T protein:vir:77 641 AQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMDIANILQSQRQNQPSGSV 720 (725) T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHHHHHHHHHHhcCCCcCc Confidence 99999999999999999888889999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCC Q lcl|NC_013059. 721 AETPQ 725 (725) Q Consensus 721 ~~~~q 725 (725) +++|| T Consensus 721 ~~~~~ 725 (725) T protein:vir:77 721 AETPQ 725 (725) T ss_pred ccCCC Confidence 99999 No 4 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=2.6e-194 Score=1081.90 Aligned_cols=704 Identities=31% Similarity=0.512 Sum_probs=646.1 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh--cCCCCCHHHHH----HHhhcCCCc--ccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFS--RVSQWDDWLSQ----YTTLQYRGQ--FDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~--~G~QW~~~~~~----~l~~~grp~--~N~i~~~v~~v~g~~ 71 (725) |||. ..+|.+++.+|+++++++++||++|.+|++|| +|+||++++++ .++.+|+|| ||+|+|+||+|+|++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 9998 78999999999999999999999999999998 59999999987 566789994 699999999999999 Q ss_pred hhCCcceEEecCCc-chHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC-CCCCCceeEEEEee Q lcl|NC_013059. 72 RQNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) Q Consensus 72 ~~nr~~~~~~pr~~-~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~-~~~~~~~~ir~~~~ 149 (725) ++||++++|+|+++ +|+++|++||++++|+++.|+++++||+||+++|+||+||++|++||+++ +++.+...|++.++ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v 160 (720) T protein:vir:35 81 RHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPI 160 (720) T ss_pred HhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecc Confidence 99999999999955 58999999999999999999999999999999999999999999999876 45555667788888 Q ss_pred ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) |+|+.+|||||+|+++|+|||+|||+++|||+++++++||+.+..... ....+++++|+++++|||+|||+++++. T Consensus 161 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~----~~~~~~~~d~~~~~~v~i~E~~~~~~~~ 236 (720) T protein:vir:35 161 YDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMS----GIERSWDYDWYDVDVVYIAKYYEVKKES 236 (720) T ss_pred cCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccc----cccccccccccCCCceEEEEeeEEEEEE Confidence 999999999999999999999999999999999999999997654332 2335677889999999999999999999 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) +.++++.++.||+++.|++.++..++..+.+.|..++..++++++||+||++.|+++|++++|+||+|||||||||+|.| T Consensus 237 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~ 316 (720) T protein:vir:35 237 VDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWF 316 (720) T ss_pred EEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcc--ccccccccccccCccc--c Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDD--YPYYLLNRTDENNGEM--P 385 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~g~~--~ 385 (725) +||++++||+||+|||+||++|+++|+++|+++++++....+..+.+++++++|...+. ..++.+|.+..++|.+ + T Consensus 317 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~ 396 (720) T protein:vir:35 317 IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAP 396 (720) T ss_pred cCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccccccccccccccccCcccccC Confidence 99999999999999999999999999999999999999999999999999999988654 4666778888888886 5 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..++++++++++|+++++||+.+..+|+++||+|++++|+.+| +||+||++++++|++.|++|||||+++++++|+++| T Consensus 397 ~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL 475 (720) T protein:vir:35 397 PTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-IAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWL 475 (720) T ss_pred CCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6788999999999999999999999999999999999999888 899999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~ 544 (725) +||++|||++|+|||+|+||.+++|.||..++|+.+|..+++|||+ |+|||+|++||+++|+|+++++.|+++++++++ T Consensus 476 ~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p 555 (720) T protein:vir:35 476 SMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLP 555 (720) T ss_pred HHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCC Confidence 9999999999999999999999999999999999999999999995 999999999999999999999999999999998 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ..|....++..++.++|+|+++++++++++...++...++.+++.++..+++++ ++++.+.++.++++++.++|++.++ T Consensus 556 ~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq-~~qq~~~e~~~aqa~l~qaqae~~k 634 (720) T protein:vir:35 556 QDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQ-QAQQPNAELVAAQGVLMQGQAEVQK 634 (720) T ss_pred CchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHH-HHHhHhHHHHHHHHHHHHHHHHHHH Confidence 766666666667899999999999999999988887777777766665544433 4456777888889999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEI 704 (725) Q Consensus 625 aqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~ 704 (725) ++++..+.++++.+.++++++++++..++.++....++..+.++++....+++.++..+++.+|+.++..++.++|+++. T Consensus 635 aqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~~~~~~~ 714 (720) T protein:vir:35 635 AKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATDTQHKQNRDA 714 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccchhhhhhHHH Confidence 99999999999999999999999999998888888888888899999999999999999999999999999999999987 Q ss_pred HHHHHH Q lcl|NC_013059. 705 ANILQS 710 (725) Q Consensus 705 ~~~~~~ 710 (725) ++-..- T Consensus 715 ~~~~~~ 720 (720) T protein:vir:35 715 AKNHSI 720 (720) T ss_pred hhccCC Confidence 665433 No 5 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=8.4e-191 Score=1062.63 Aligned_cols=692 Identities=30% Similarity=0.481 Sum_probs=601.8 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHH--HhhcCCCCCHHHHHHHhhcC----CC--cccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDL--FFSRVSQWDDWLSQYTTLQY----RG--QFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~--~f~~G~QW~~~~~~~l~~~g----rp--~~N~i~~~v~~v~g~~ 71 (725) |||. .++|.+++.+|+++++++.+||+++.+|. +||+|+||++++++.|+.+| || +||+|+|+||+|+|++ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 9997 78999999999999999999999999995 57999999999999998765 67 4799999999999999 Q ss_pred hhCCcceEEecCC-cchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC-CCCCCceeEEEEee Q lcl|NC_013059. 72 RQNPIDVLYRPKD-GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) Q Consensus 72 ~~nr~~~~~~pr~-~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~-~~~~~~~~ir~~~~ 149 (725) ++||++++|+||+ ++|.++|++||++++|+++.|+++++||+||+++|+||+|||+|+++|.++ |+.+++..|.+.++ T Consensus 81 ~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:17 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred hhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEee Confidence 9999999999996 568999999999999999999999999999999999999999999999876 45777777888888 Q ss_pred ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++|+++|||||+|+++|+|||||||+++|||+++++++||+++....+. ....+|.+.|++.++|||+|||+|+++. T Consensus 161 ~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~---~~~~~~~~~~~~~d~vrv~e~~~r~~~~ 237 (708) T protein:vir:17 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDV---TSMTSWEYDWFDADVIYIAKYYEVRKES 237 (708) T ss_pred ccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhh---hhhccccccccCCCeEEEEEEEEEeeee Confidence 8999999999999999999999999999999999999999976544333 3334677889999999999999999999 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ++++++.||+||++++|++.++++++..+...|...+..+++++++|+||+|+|+++|++++||||+|||||||||++.+ T Consensus 238 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~ 317 (708) T protein:vir:17 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) T ss_pred eEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccc--cccccccccccCcccc-- Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLLNRTDENNGEMP-- 385 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~g~~~-- 385 (725) +||++++||+||+||||||++|+++|+++|+++++++.+++++.+++.+++.+|+..+.. ++...+.+..+.|.+. T Consensus 318 ~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~ 397 (708) T protein:vir:17 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAG 397 (708) T ss_pred ccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccc Confidence 999999999999999999999999999999999999999999999999999999876644 3444455555555432 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..++.+++++++|+++++||+.+..+|+++||+|++++|+.+| +||+||++++++|++.|++|||||+++++++|+++| T Consensus 398 a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL 476 (708) T protein:vir:17 398 ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) T ss_pred cCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3467788899999999999999999999999999999998776 899999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~ 544 (725) +||++|||++|+|||+|+||+.++|.||..++|+.+|.++++|||+ |+|||+|+++|+++|+|+++++.|+++++++++ T Consensus 477 ~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~ 556 (708) T protein:vir:17 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) T ss_pred HHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCC Confidence 9999999999999999999999999999999999999999999995 899999999999999999999999999999999 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ..|+...++..++.++|+|++++++++++++..+.+..++..++.+++.+++++.+++++++++.+++++..++|++.++ T Consensus 557 ~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~k 636 (708) T protein:vir:17 557 ADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) T ss_pred ccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88877777777889999999999999999999999998999999988888888888888889998999999999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEI 704 (725) Q Consensus 625 aqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~ 704 (725) ++++.++.++++.+++.+.+.++++.++..+++.......+.++.+.+ +..+...+++++ T Consensus 637 a~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l-------------------~~~q~~q~q~~~- 696 (708) T protein:vir:17 637 ATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLL-------------------KDVAESQQQQFQ- 696 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------------hhhhhhHHHHHh- Confidence 888888777777666666555555544433332222211111111111 111111011111 Q ss_pred HHHHHHHHhcCCcccccCC Q lcl|NC_013059. 705 ANILQSQRQNQPSGSVAET 723 (725) Q Consensus 705 ~~~~~~~~~~q~~~~~~~~ 723 (725) +..++| .+.-++ T Consensus 697 ------a~p~~~-~~~~~~ 708 (708) T protein:vir:17 697 ------SPPQSP-ADLMPS 708 (708) T ss_pred ------ccccCc-hhccCC Confidence 111111 111111 No 6 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=2.4e-190 Score=1060.16 Aligned_cols=691 Identities=30% Similarity=0.487 Sum_probs=604.3 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh--cCCCCCHHHHHHHhhc----CCC--cccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFS--RVSQWDDWLSQYTTLQ----YRG--QFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~--~G~QW~~~~~~~l~~~----grp--~~N~i~~~v~~v~g~~ 71 (725) |||. +.+|++++.+|.++++++++||++|.+|++|| +|+||+++++++|+.+ ||| +||+|+|+||+|+|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 9998 78999999999999999999999999998887 5999999999999876 567 4799999999999999 Q ss_pred hhCCcceEEecCCc-chHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC-CCCCCceeEEEEee Q lcl|NC_013059. 72 RQNPIDVLYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) Q Consensus 72 ~~nr~~~~~~pr~~-~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~-~~~~~~~~ir~~~~ 149 (725) ++||++++|+|+++ +|.++|++||++++|++++|+++++||+||+++|+||+|||+|++||+++ |++.++..|.+.++ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:10 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEe Confidence 99999999999964 68999999999999999999999999999999999999999999999875 56777777888888 Q ss_pred ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++|+++|||||.|+++|+|||||||+++|||+++++++||+++.+..+.. +..+|.++|++.++|||+|||+|+++. T Consensus 161 ~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~---~~~~~~~~~~~~d~v~v~ey~~r~~~~ 237 (708) T protein:vir:10 161 YDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVT---SMTSWEYNWFGADVIYIAKYYEVRKES 237 (708) T ss_pred ecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccc---cCCCccccccCCCceEEEEeeeEEEEE Confidence 99999999999999999999999999999999999999999876554433 345678899999999999999999999 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) +.++++.||.||++++|+..+++.++..+...|...+..++++++||+|++++|+++|++++||||+|||||||||++.+ T Consensus 238 ~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~ 317 (708) T protein:vir:10 238 VDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 317 (708) T ss_pred EEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccc--cccccccccccCccc--c Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLLNRTDENNGEM--P 385 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~g~~--~ 385 (725) +||++++||+||+|||+||++|+++|+++++++++++.+.+++.+++.+++.+|+..+.. ++..++.+..+.|.+ . T Consensus 318 ~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~ 397 (708) T protein:vir:10 318 IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAG 397 (708) T ss_pred cCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998887654 444556666666665 3 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) +.++.+++++++|+++++||+.+..+|+++||+|++++|+.+| +||+||++++++|++.|++|||||+++++++|+++| T Consensus 398 ~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn-~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL 476 (708) T protein:vir:10 398 ATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWL 476 (708) T ss_pred cCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4467888999999999999999999999999999999998776 899999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~ 544 (725) +||++|||++|+|||+|+||++++|.||..++|+.+|..+++|||+ |+|||+|+++|+++|+|+++++.|+++++++++ T Consensus 477 ~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p 556 (708) T protein:vir:10 477 SMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLP 556 (708) T ss_pred HHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCC Confidence 9999999999999999999999999999999999999999999996 899999999999999999999999999999999 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ..|+...++..+++++|+|++++++++++++..+....++.+++.+++.++++++++++++.++.+++++..++|+++++ T Consensus 557 ~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~k 636 (708) T protein:vir:10 557 TDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) T ss_pred CchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88877777777789999999999999999999999988898999888888888888888888888889999999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEI 704 (725) Q Consensus 625 aqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~ 704 (725) ++++.++.++++.+++.+...++++.++..+++.......+.++++.+ +..+...+++++ T Consensus 637 a~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l-------------------~~~q~~q~~~~~- 696 (708) T protein:vir:10 637 ATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLL-------------------KDVAESQQQQFQ- 696 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------------hhhhhhHHHHHh- Confidence 888888777777776666665555554443332222222221111111 111100011111 Q ss_pred HHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 705 ANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 705 ~~~~~~~~~~q~~~~~~~~~q 725 (725) ...++| ++.|- T Consensus 697 ------~~p~~~----~~~~p 707 (708) T protein:vir:10 697 ------SPPQSP----ADLMP 707 (708) T ss_pred ------ccccCc----hhccC Confidence 111111 11111 No 7 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=3e-188 Score=1048.69 Aligned_cols=686 Identities=30% Similarity=0.491 Sum_probs=602.6 Q ss_pred CCc-HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh--cCCCCCHHHHHHHhhc----CCC--cccchHHHHHHHHHHH Q lcl|NC_013059. 1 MAD-NKNRLESILSRFDADWTASDEARREAKNDLFFS--RVSQWDDWLSQYTTLQ----YRG--QFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad-~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~--~G~QW~~~~~~~l~~~----grp--~~N~i~~~v~~v~g~~ 71 (725) ||| ++++|++++.+|+++++++++||+++.+|++|| +|+||++++++.|+.+ ||| +||+|+|+|++|+|++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 999 478999999999999999999999999999998 6999999999999866 567 4799999999999999 Q ss_pred hhCCcceEEecC-CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC-CCCCCceeEEEEee Q lcl|NC_013059. 72 RQNPIDVLYRPK-DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTSNNQVIRREPI 149 (725) Q Consensus 72 ~~nr~~~~~~pr-~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~-~~~~~~~~ir~~~~ 149 (725) ++||++++|+|+ +++|.++|++||++++|+++.|++++++|+||+++|+||+||++|++||+++ +++++++.|.+.++ T Consensus 81 ~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v 160 (706) T protein:vir:10 81 RNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPI 160 (706) T ss_pred HhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeee Confidence 999999999994 6789999999999999999999999999999999999999999999999875 56777888888888 Q ss_pred ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) +.|+++|||||+|+++|+|||||||+++|||+++++++||+.+.+..... ..+|.++|.+.++++++|||.++++. T Consensus 161 ~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~----~~~~~~d~~~~d~~~~~eyy~~~~~~ 236 (706) T protein:vir:10 161 YDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVG----SVSWQYDWFTPDVVYIAKYYEVRKES 236 (706) T ss_pred ccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhc----cccccccccCCCcceeccccccccee Confidence 88888999999999999999999999999999999999999776544322 23567789999999999999999988 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) +.++.+.++.+++++.+++..+.+.+..+...|...+..+++++++|+|++++|+++|++++||||++||||||||++.| T Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~ 316 (706) T protein:vir:10 237 VDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF 316 (706) T ss_pred EEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeecccc Confidence 88888899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhc--cccccccccccccCccc--c Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGND--DYPYYLLNRTDENNGEM--P 385 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~~~~~~~g~~--~ 385 (725) +|++++|||+||+|||+||++|+++|+++|+++++++...++..+.+++++.+|...+ ..+++.++.+..++|.+ + T Consensus 317 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~ 396 (706) T protein:vir:10 317 IDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAP 396 (706) T ss_pred ccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccccCCCCccccc Confidence 9999999999999999999999999999999999999999999999999999998776 34667777788888875 4 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) .+++.+++++++|+++++||+.+..+|+++||+|++++|+.+| +||+||++++++|++.|++|||||+++++++|+++| T Consensus 397 ~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL 475 (706) T protein:vir:10 397 ANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-VARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWL 475 (706) T ss_pred ccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5677888899999999999999999999999999999999887 899999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~ 544 (725) +||++|||++|+|||+|+||+.++|.||..++|+.+|..+++|||+ |+|||+|++||+++|+|+++++.|++|+++++| T Consensus 476 ~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p 555 (706) T protein:vir:10 476 SMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLP 555 (706) T ss_pred HHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCC Confidence 9999999999999999999999999999999999999999999995 999999999999999999999999999999998 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ..|+...+++.+++++|+|++++++++++++..++...++.++++++..++++++++++++.++.+++++..+.+++.++ T Consensus 556 ~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k 635 (706) T protein:vir:10 556 QDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQK 635 (706) T ss_pred cchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88877777777789999999999999999999999999999888888888888888888888888888888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEI 704 (725) Q Consensus 625 aqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~ 704 (725) ++++..+.++++.+++.++..+++.......++.........++.+.+ T Consensus 636 ~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l-------------------------------- 683 (706) T protein:vir:10 636 SQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLL-------------------------------- 683 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------------- Confidence 888887777777666655554443332222211111111111110000 Q ss_pred HHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 705 ANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 705 ~~~~~~~~~~q~~~~~~~~~q 725 (725) ++ ..+.+++...+.++.|+ T Consensus 684 ~~--~~a~q~~~~~~~~~~~~ 702 (706) T protein:vir:10 684 KE--VAASQQQTIPSPPSPAD 702 (706) T ss_pred HH--HHHhccCCCCCCCCCcc Confidence 00 11122223334445555 No 8 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=5.6e-180 Score=1003.32 Aligned_cols=662 Identities=20% Similarity=0.286 Sum_probs=549.4 Q ss_pred CC--cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC--cccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MA--DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 ma--d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp--~~N~i~~~v~~v~g~~~~nr~ 76 (725) -. |.+++|++++.+|+++++++++||++|.+|++||+|+||++++++.|+.+|+| +||+|+|+|++|+|++++||+ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~ 102 (711) T protein:vir:10 23 KNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRP 102 (711) T ss_pred cCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHhhhHhhCCc Confidence 11 34679999999999999999999999999999999999999999999999999 479999999999999999999 Q ss_pred ceEEecCC----------------------cchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeecc Q lcl|NC_013059. 77 DVLYRPKD----------------------GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED 134 (725) Q Consensus 77 ~~~~~pr~----------------------~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~ 134 (725) +++|+||+ ++|.++|++||++++|+++.|++++++|+||+++|+||+||++|++||.+ T Consensus 103 ~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~ 182 (711) T protein:vir:10 103 AIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLA 182 (711) T ss_pred ceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEecccC Confidence 99999985 67899999999999999999999999999999999999999999999999 Q ss_pred CCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCC Q lcl|NC_013059. 135 QSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQ 214 (725) Q Consensus 135 ~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (725) +|++++++.|+++ .+|.+|||||+|+++|+|||+|||+++|||+++++++||+.+.+..+ .+...+++.|+++ T Consensus 183 ~d~~~~e~~i~~v---~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~----~~~~~~~~~~~~~ 255 (711) T protein:vir:10 183 DDSFEQDLIIEAI---QNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVY----EDSVADYDTWFTE 255 (711) T ss_pred CCCCCCCeEEeee---cChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhh----cccccccCcccCc Confidence 9999889887653 35678999999999999999999999999999999999987655443 2334556789999 Q ss_pred CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCC Q lcl|NC_013059. 215 DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) Q Consensus 215 ~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p 294 (725) ++|||+|||+|+++..+++.+.+ |..+.++. ..+++..+...|...+..+.++++||+|++|+|+++|++++||| T Consensus 256 ~~vrv~E~~~r~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~ 330 (711) T protein:vir:10 256 KSVRVSEYFTREPVIREIALLSD---GRSFWLDA--LEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIP 330 (711) T ss_pred ceeeEEEEEeeeeeeeEEEeecC---CceeccCc--chhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCC Confidence 99999999999999888888776 55555554 45777888899999999999999999999999999999999999 Q ss_pred CCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc Q lcl|NC_013059. 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) Q Consensus 295 ~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) |++||||||||++.++|++++|||+||+|||+||++|+++|+++|++++++++++++++|++++.++.|...+.+++.+ T Consensus 331 ~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~v- 409 (711) T protein:vir:10 331 STTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSL- 409 (711) T ss_pred CCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCe- Confidence 9999999999999999999999999999999999999999999999999999999999999999998898877776664 Q ss_pred ccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 375 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) +..++|..+..+|++++++++|+++++||+++.++|+++||+|++++|..+|++||+||++++++|++.+++|||||+ T Consensus 410 --i~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~ 487 (711) T protein:vir:10 410 --LTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLT 487 (711) T ss_pred --eEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345567666778999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHH Q lcl|NC_013059. 455 TAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRA 533 (725) Q Consensus 455 ~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~ 533 (725) .+++++|+++|+||++|||++|+|||+|++|+.+||.||.+++++.+|.++++|||+ |+|||+|+++|+++|+|+++++ T Consensus 488 ~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~ 567 (711) T protein:vir:10 488 KSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAE 567 (711) T ss_pred HHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999995 8999999999999999999999 Q ss_pred HHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_013059. 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQG 613 (725) Q Consensus 534 ~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa 613 (725) .|+++++++|+..|... ..++.++|+|+++++++++++...++...++.+++.++.+. +++++..+.+.+++++++ T Consensus 568 ~l~ql~~~~p~~~~~~~---~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~-e~qq~~~~~q~~~~~~q~ 643 (711) T protein:vir:10 568 AMIQFAQAVPSAAAVMA---DLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMP-EQTEPTPEQQVEMAKSQA 643 (711) T ss_pred HHHHHHhhcchhhhHHH---HHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 99999998876555433 33568999999999999999887766655544443333322 222222334444455555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKG 693 (725) Q Consensus 614 ~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~ 693 (725) ...+++++.+++++++.+++.++.+++++....... +++ ...+. ++...+.+..++|+.+. T Consensus 644 ~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~-----aq~------~~~~~-------qq~~~~l~~~qaelq~~- 704 (711) T protein:vir:10 644 DMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDM-----AQG------GDVVY-------QQVRELVAQALAEITAS- 704 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHH------HHHHH-------HHHHHHHHHHHHHHHHH- Confidence 555666666666665555544444333332211110 000 00000 00001111111221111 Q ss_pred HHHHHHH Q lcl|NC_013059. 694 DEQTHKQ 700 (725) Q Consensus 694 ~~q~~~q 700 (725) +.+..+| T Consensus 705 q~~~~q~ 711 (711) T protein:vir:10 705 QANVTEQ 711 (711) T ss_pred HHHhhcC Confidence 1111111 No 9 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=8.9e-166 Score=925.46 Aligned_cols=663 Identities=16% Similarity=0.138 Sum_probs=500.6 Q ss_pred CCcH------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADN------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~ 72 (725) |++. .++|.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|++|+|+++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 6654 3689999999999999999999999999999999999999999999999995 7999999999999999 Q ss_pred hCCcceEEecCCcchH--HHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 73 QNPIDVLYRPKDGASP--DAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~--~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) +||++++|+||+++|+ ++|++||++++|+++.|++++++|+||+++|+||+||++++++ +|++++++.|++ T Consensus 88 ~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~---~d~~~~~i~i~~---- 160 (714) T protein:vir:10 88 KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN---SDPFGPEFKVST---- 160 (714) T ss_pred hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc---cCCCCCCeEEEe---- Confidence 9999999999987654 7999999999999999999999999999999999999999876 468998988875 Q ss_pred cchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh-----------------------hhccc Q lcl|NC_013059. 151 SACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF-----------------------QNPND 206 (725) Q Consensus 151 ~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~-----------------------~~~~~ 206 (725) +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ ...+. T Consensus 161 v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:10 161 VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred cchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 4677899999999999999999999999999999999998653211100 00 00112 Q ss_pred ccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecc Q lcl|NC_013059. 207 WVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) Q Consensus 207 ~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 284 (725) ..+.|+++ ++|||+|||||+++...+ .++.+|++++|++.++.+++.. ..|...+..+++++ |++++|+|+ T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~---~~~~~g~~~~~d~~~~~~~~~~--~~g~~~~~~~~~~r--v~~~~~~g~ 313 (714) T protein:vir:10 241 QQNEWLQRERRRVLLQVVYYRTFERLPV---IELSNGRVVAFDKNNLMQAVAV--ASGRVQVKVGRVSR--IREAWFVGP 313 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEe---eccCCCceEEeCccCHHHHHHH--hhcchhhhccccce--EEEEEEecC Confidence 23456554 568888999998875544 3577899999999998877653 45777777777765 556667899 Q ss_pred ccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHH Q lcl|NC_013059. 285 AVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY 363 (725) Q Consensus 285 ~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 363 (725) ++|++ ++||||++||||||||++.+++|. |||+||+|||+||++|+++|+++|++ +++.. ++.++++...++.+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~-~~~~~a~~~~d~~~ 388 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLL--QAKRV-IMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhh--cCCce-eeecCcccccHHHH Confidence 99965 899999999999999999866665 99999999999999999999999865 45554 56777776554433 Q ss_pred Hhhcccc--ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHH Q lcl|NC_013059. 364 DGNDDYP--YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) Q Consensus 364 ~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q 441 (725) ...-++| .+.+++.. .+|..+..+|++.+++++|+++++||+.+.++|+++||||++++|+.+|++||+||++++++ T Consensus 389 ~e~~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:10 389 MEQIERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred HHhccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 3322333 33333322 45566678899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---cceEEeccccccccCCceeeecccc-ccceEE Q lcl|NC_013059. 442 ADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS---EKEVQLMAEVVDLATGERQVLNDIR-GRYECY 517 (725) Q Consensus 442 ~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~---~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~ 517 (725) |++.|++|||||+++++++|+++|+||++|||++|+|||||+++. .++|.||+ .+|..++.|||+ |+|||+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:10 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceecccceeeeEEEE Confidence 999999999999999999999999999999999999999988654 46888875 467778899995 999999 Q ss_pred EEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHH Q lcl|NC_013059. 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEA 596 (725) Q Consensus 518 v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~ 596 (725) |+++|+++|+|+++++.|++++++++| ..+..++. ++.++|+|++++++++++++..+....++.+++++++.+++ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p---~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~ 619 (714) T protein:vir:10 543 LAPVQQTPAFKAQLAQRMSEVIQGLPP---QVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCc---hhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHH Confidence 999999999999999999999998874 33333333 56899999999999999998776665566665555544444 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 597 QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQ 676 (725) Q Consensus 597 ~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q 676 (725) ++.++++.+.++.+++++..+.+++.+++++...+.+.++....++++ ..+... ...++...+.+.....+ T Consensus 620 q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~-----~~~~~~---~~~~a~~a~~~~~~~~~- 690 (714) T protein:vir:10 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQ-----GQRYVD---ALNQAHTAEIITGVQNM- 690 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHhHhhh- Confidence 444444444444444555555555554444433333332222111111 111000 00011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 677 QDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 677 ~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++..+ ...+.-.|+.+++.++-.+ T Consensus 691 ------~~~~~-~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 691 ------EQEQD-VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred ------hhhhH-HHHHHHHHHHHHHHHhcCC Confidence 11100 0001112222222222111 No 10 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=8.9e-166 Score=925.46 Aligned_cols=663 Identities=16% Similarity=0.138 Sum_probs=500.6 Q ss_pred CCcH------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADN------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~ 72 (725) |++. .++|.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|++|+|+++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 6654 3689999999999999999999999999999999999999999999999995 7999999999999999 Q ss_pred hCCcceEEecCCcchH--HHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 73 QNPIDVLYRPKDGASP--DAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~--~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) +||++++|+||+++|+ ++|++||++++|+++.|++++++|+||+++|+||+||++++++ +|++++++.|++ T Consensus 88 ~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~---~d~~~~~i~i~~---- 160 (714) T protein:vir:27 88 KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN---SDPFGPEFKVST---- 160 (714) T ss_pred hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc---cCCCCCCeEEEe---- Confidence 9999999999987654 7999999999999999999999999999999999999999876 468998988875 Q ss_pred cchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh-----------------------hhccc Q lcl|NC_013059. 151 SACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF-----------------------QNPND 206 (725) Q Consensus 151 ~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~-----------------------~~~~~ 206 (725) +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ ...+. T Consensus 161 v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:27 161 VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred cchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 4677899999999999999999999999999999999998653211100 00 00112 Q ss_pred ccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecc Q lcl|NC_013059. 207 WVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) Q Consensus 207 ~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 284 (725) ..+.|+++ ++|||+|||||+++...+ .++.+|++++|++.++.+++.. ..|...+..+++++ |++++|+|+ T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~---~~~~~g~~~~~d~~~~~~~~~~--~~g~~~~~~~~~~r--v~~~~~~g~ 313 (714) T protein:vir:27 241 QQNEWLQRERRRVLLQVVYYRTFERLPV---IELSNGRVVAFDKNNLMQAVAV--ASGRVQVKVGRVSR--IREAWFVGP 313 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEe---eccCCCceEEeCccCHHHHHHH--hhcchhhhccccce--EEEEEEecC Confidence 23456554 568888999998875544 3577899999999998877653 45777777777765 556667899 Q ss_pred ccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHH Q lcl|NC_013059. 285 AVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY 363 (725) Q Consensus 285 ~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 363 (725) ++|++ ++||||++||||||||++.+++|. |||+||+|||+||++|+++|+++|++ +++.. ++.++++...++.+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~-~~~~~a~~~~d~~~ 388 (714) T protein:vir:27 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLL--QAKRV-IMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhh--cCCce-eeecCcccccHHHH Confidence 99965 899999999999999999866665 99999999999999999999999865 45554 56777776554433 Q ss_pred Hhhcccc--ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHH Q lcl|NC_013059. 364 DGNDDYP--YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) Q Consensus 364 ~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q 441 (725) ...-++| .+.+++.. .+|..+..+|++.+++++|+++++||+.+.++|+++||||++++|+.+|++||+||++++++ T Consensus 389 ~e~~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:27 389 MEQIERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred HHhccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 3322333 33333322 45566678899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---cceEEeccccccccCCceeeecccc-ccceEE Q lcl|NC_013059. 442 ADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS---EKEVQLMAEVVDLATGERQVLNDIR-GRYECY 517 (725) Q Consensus 442 ~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~---~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~ 517 (725) |++.|++|||||+++++++|+++|+||++|||++|+|||||+++. .++|.||+ .+|..++.|||+ |+|||+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:27 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceecccceeeeEEEE Confidence 999999999999999999999999999999999999999988654 46888875 467778899995 999999 Q ss_pred EEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHH Q lcl|NC_013059. 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEA 596 (725) Q Consensus 518 v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~ 596 (725) |+++|+++|+|+++++.|++++++++| ..+..++. ++.++|+|++++++++++++..+....++.+++++++.+++ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p---~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~ 619 (714) T protein:vir:27 543 LAPVQQTPAFKAQLAQRMSEVIQGLPP---QVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCc---hhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHH Confidence 999999999999999999999998874 33333333 56899999999999999998776665566665555544444 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 597 QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQ 676 (725) Q Consensus 597 ~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q 676 (725) ++.++++.+.++.+++++..+.+++.+++++...+.+.++....++++ ..+... ...++...+.+.....+ T Consensus 620 q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~-----~~~~~~---~~~~a~~a~~~~~~~~~- 690 (714) T protein:vir:27 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQ-----GQRYVD---ALNQAHTAEIITGVQNM- 690 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHhHhhh- Confidence 444444444444444555555555554444433333332222111111 111000 00011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 677 QDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 677 ~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++..+ ...+.-.|+.+++.++-.+ T Consensus 691 ------~~~~~-~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:27 691 ------EQEQD-VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred ------hhhhH-HHHHHHHHHHHHHHHhcCC Confidence 11100 0001112222222222111 No 11 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=8.9e-166 Score=925.46 Aligned_cols=663 Identities=16% Similarity=0.138 Sum_probs=500.6 Q ss_pred CCcH------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADN------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~ 72 (725) |++. .++|.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|++|+|+++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 6654 3689999999999999999999999999999999999999999999999995 7999999999999999 Q ss_pred hCCcceEEecCCcchH--HHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 73 QNPIDVLYRPKDGASP--DAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~--~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) +||++++|+||+++|+ ++|++||++++|+++.|++++++|+||+++|+||+||++++++ +|++++++.|++ T Consensus 88 ~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~---~d~~~~~i~i~~---- 160 (714) T protein:vir:32 88 KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN---SDPFGPEFKVST---- 160 (714) T ss_pred hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc---cCCCCCCeEEEe---- Confidence 9999999999987654 7999999999999999999999999999999999999999876 468998988875 Q ss_pred cchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh-----------------------hhccc Q lcl|NC_013059. 151 SACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF-----------------------QNPND 206 (725) Q Consensus 151 ~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~-----------------------~~~~~ 206 (725) +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ ...+. T Consensus 161 v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:32 161 VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred cchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 4677899999999999999999999999999999999998653211100 00 00112 Q ss_pred ccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecc Q lcl|NC_013059. 207 WVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) Q Consensus 207 ~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 284 (725) ..+.|+++ ++|||+|||||+++...+ .++.+|++++|++.++.+++.. ..|...+..+++++ |++++|+|+ T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~---~~~~~g~~~~~d~~~~~~~~~~--~~g~~~~~~~~~~r--v~~~~~~g~ 313 (714) T protein:vir:32 241 QQNEWLQRERRRVLLQVVYYRTFERLPV---IELSNGRVVAFDKNNLMQAVAV--ASGRVQVKVGRVSR--IREAWFVGP 313 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEe---eccCCCceEEeCccCHHHHHHH--hhcchhhhccccce--EEEEEEecC Confidence 23456554 568888999998875544 3577899999999998877653 45777777777765 556667899 Q ss_pred ccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHH Q lcl|NC_013059. 285 AVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY 363 (725) Q Consensus 285 ~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 363 (725) ++|++ ++||||++||||||||++.+++|. |||+||+|||+||++|+++|+++|++ +++.. ++.++++...++.+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~-~~~~~a~~~~d~~~ 388 (714) T protein:vir:32 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLL--QAKRV-IMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhh--cCCce-eeecCcccccHHHH Confidence 99965 899999999999999999866665 99999999999999999999999865 45554 56777776554433 Q ss_pred Hhhcccc--ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHH Q lcl|NC_013059. 364 DGNDDYP--YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) Q Consensus 364 ~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q 441 (725) ...-++| .+.+++.. .+|..+..+|++.+++++|+++++||+.+.++|+++||||++++|+.+|++||+||++++++ T Consensus 389 ~e~~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:32 389 MEQIERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred HHhccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 3322333 33333322 45566678899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---cceEEeccccccccCCceeeecccc-ccceEE Q lcl|NC_013059. 442 ADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS---EKEVQLMAEVVDLATGERQVLNDIR-GRYECY 517 (725) Q Consensus 442 ~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~---~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~ 517 (725) |++.|++|||||+++++++|+++|+||++|||++|+|||||+++. .++|.||+ .+|..++.|||+ |+|||+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:32 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceecccceeeeEEEE Confidence 999999999999999999999999999999999999999988654 46888875 467778899995 999999 Q ss_pred EEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHH Q lcl|NC_013059. 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEA 596 (725) Q Consensus 518 v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~ 596 (725) |+++|+++|+|+++++.|++++++++| ..+..++. ++.++|+|++++++++++++..+....++.+++++++.+++ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p---~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~ 619 (714) T protein:vir:32 543 LAPVQQTPAFKAQLAQRMSEVIQGLPP---QVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCc---hhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHH Confidence 999999999999999999999998874 33333333 56899999999999999998776665566665555544444 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 597 QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQ 676 (725) Q Consensus 597 ~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q 676 (725) ++.++++.+.++.+++++..+.+++.+++++...+.+.++....++++ ..+... ...++...+.+.....+ T Consensus 620 q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~-----~~~~~~---~~~~a~~a~~~~~~~~~- 690 (714) T protein:vir:32 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQ-----GQRYVD---ALNQAHTAEIITGVQNM- 690 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHhHhhh- Confidence 444444444444444555555555554444433333332222111111 111000 00011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 677 QDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 677 ~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++..+ ...+.-.|+.+++.++-.+ T Consensus 691 ------~~~~~-~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:32 691 ------EQEQD-VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred ------hhhhH-HHHHHHHHHHHHHHHhcCC Confidence 11100 0001112222222222111 No 12 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=8.9e-166 Score=925.46 Aligned_cols=663 Identities=16% Similarity=0.138 Sum_probs=500.6 Q ss_pred CCcH------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADN------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~ 72 (725) |++. .++|.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|++|+|+++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 6654 3689999999999999999999999999999999999999999999999995 7999999999999999 Q ss_pred hCCcceEEecCCcchH--HHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 73 QNPIDVLYRPKDGASP--DAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~--~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) +||++++|+||+++|+ ++|++||++++|+++.|++++++|+||+++|+||+||++++++ +|++++++.|++ T Consensus 88 ~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~---~d~~~~~i~i~~---- 160 (714) T protein:vir:81 88 KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN---SDPFGPEFKVST---- 160 (714) T ss_pred hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc---cCCCCCCeEEEe---- Confidence 9999999999987654 7999999999999999999999999999999999999999876 468998988875 Q ss_pred cchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh-----------------------hhccc Q lcl|NC_013059. 151 SACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF-----------------------QNPND 206 (725) Q Consensus 151 ~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~-----------------------~~~~~ 206 (725) +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ ...+. T Consensus 161 v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:81 161 VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred cchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 4677899999999999999999999999999999999998653211100 00 00112 Q ss_pred ccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecc Q lcl|NC_013059. 207 WVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) Q Consensus 207 ~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 284 (725) ..+.|+++ ++|||+|||||+++...+ .++.+|++++|++.++.+++.. ..|...+..+++++ |++++|+|+ T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~---~~~~~g~~~~~d~~~~~~~~~~--~~g~~~~~~~~~~r--v~~~~~~g~ 313 (714) T protein:vir:81 241 QQNEWLQRERRRVLLQVVYYRTFERLPV---IELSNGRVVAFDKNNLMQAVAV--ASGRVQVKVGRVSR--IREAWFVGP 313 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEe---eccCCCceEEeCccCHHHHHHH--hhcchhhhccccce--EEEEEEecC Confidence 23456554 568888999998875544 3577899999999998877653 45777777777765 556667899 Q ss_pred ccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHH Q lcl|NC_013059. 285 AVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY 363 (725) Q Consensus 285 ~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 363 (725) ++|++ ++||||++||||||||++.+++|. |||+||+|||+||++|+++|+++|++ +++.. ++.++++...++.+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~-~~~~~a~~~~d~~~ 388 (714) T protein:vir:81 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLL--QAKRV-IMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhh--cCCce-eeecCcccccHHHH Confidence 99965 899999999999999999866665 99999999999999999999999865 45554 56777776554433 Q ss_pred Hhhcccc--ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHH Q lcl|NC_013059. 364 DGNDDYP--YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) Q Consensus 364 ~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q 441 (725) ...-++| .+.+++.. .+|..+..+|++.+++++|+++++||+.+.++|+++||||++++|+.+|++||+||++++++ T Consensus 389 ~e~~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:81 389 MEQIERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred HHhccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 3322333 33333322 45566678899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---cceEEeccccccccCCceeeecccc-ccceEE Q lcl|NC_013059. 442 ADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS---EKEVQLMAEVVDLATGERQVLNDIR-GRYECY 517 (725) Q Consensus 442 ~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~---~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~ 517 (725) |++.|++|||||+++++++|+++|+||++|||++|+|||||+++. .++|.||+ .+|..++.|||+ |+|||+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:81 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceecccceeeeEEEE Confidence 999999999999999999999999999999999999999988654 46888875 467778899995 999999 Q ss_pred EEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHH Q lcl|NC_013059. 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEA 596 (725) Q Consensus 518 v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~ 596 (725) |+++|+++|+|+++++.|++++++++| ..+..++. ++.++|+|++++++++++++..+....++.+++++++.+++ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p---~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~ 619 (714) T protein:vir:81 543 LAPVQQTPAFKAQLAQRMSEVIQGLPP---QVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCc---hhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHH Confidence 999999999999999999999998874 33333333 56899999999999999998776665566665555544444 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 597 QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQ 676 (725) Q Consensus 597 ~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q 676 (725) ++.++++.+.++.+++++..+.+++.+++++...+.+.++....++++ ..+... ...++...+.+.....+ T Consensus 620 q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~-----~~~~~~---~~~~a~~a~~~~~~~~~- 690 (714) T protein:vir:81 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQ-----GQRYVD---ALNQAHTAEIITGVQNM- 690 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHhHhhh- Confidence 444444444444444555555555554444433333332222111111 111000 00011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 677 QDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 677 ~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++..+ ...+.-.|+.+++.++-.+ T Consensus 691 ------~~~~~-~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:81 691 ------EQEQD-VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred ------hhhhH-HHHHHHHHHHHHHHHhcCC Confidence 11100 0001112222222222111 No 13 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=8.9e-166 Score=925.46 Aligned_cols=663 Identities=16% Similarity=0.138 Sum_probs=500.6 Q ss_pred CCcH------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADN------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~ 72 (725) |++. .++|.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|++|+|+++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 6654 3689999999999999999999999999999999999999999999999995 7999999999999999 Q ss_pred hCCcceEEecCCcchH--HHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 73 QNPIDVLYRPKDGASP--DAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~--~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) +||++++|+||+++|+ ++|++||++++|+++.|++++++|+||+++|+||+||++++++ +|++++++.|++ T Consensus 88 ~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~---~d~~~~~i~i~~---- 160 (714) T protein:vir:99 88 KTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN---SDPFGPEFKVST---- 160 (714) T ss_pred hCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc---cCCCCCCeEEEe---- Confidence 9999999999987654 7999999999999999999999999999999999999999876 468998988875 Q ss_pred cchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh-----------------------hhccc Q lcl|NC_013059. 151 SACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF-----------------------QNPND 206 (725) Q Consensus 151 ~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~-----------------------~~~~~ 206 (725) +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ ...+. T Consensus 161 v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:99 161 VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred cchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 4677899999999999999999999999999999999998653211100 00 00112 Q ss_pred ccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecc Q lcl|NC_013059. 207 WVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) Q Consensus 207 ~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~ 284 (725) ..+.|+++ ++|||+|||||+++...+ .++.+|++++|++.++.+++.. ..|...+..+++++ |++++|+|+ T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~---~~~~~g~~~~~d~~~~~~~~~~--~~g~~~~~~~~~~r--v~~~~~~g~ 313 (714) T protein:vir:99 241 QQNEWLQRERRRVLLQVVYYRTFERLPV---IELSNGRVVAFDKNNLMQAVAV--ASGRVQVKVGRVSR--IREAWFVGP 313 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEe---eccCCCceEEeCccCHHHHHHH--hhcchhhhccccce--EEEEEEecC Confidence 23456554 568888999998875544 3577899999999998877653 45777777777765 556667899 Q ss_pred ccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHH Q lcl|NC_013059. 285 AVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY 363 (725) Q Consensus 285 ~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 363 (725) ++|++ ++||||++||||||||++.+++|. |||+||+|||+||++|+++|+++|++ +++.. ++.++++...++.+ T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~--~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~-~~~~~a~~~~d~~~ 388 (714) T protein:vir:99 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGE--PYGLISRAIPAQDEVNFRRIKLTWLL--QAKRV-IMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCc--eeehhhhchhHHHHHHHHHHHHHHhh--cCCce-eeecCcccccHHHH Confidence 99965 899999999999999999866665 99999999999999999999999865 45554 56777776554433 Q ss_pred Hhhcccc--ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHH Q lcl|NC_013059. 364 DGNDDYP--YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMR 441 (725) Q Consensus 364 ~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q 441 (725) ...-++| .+.+++.. .+|..+..+|++.+++++|+++++||+.+.++|+++||||++++|+.+|++||+||++++++ T Consensus 389 ~e~~arp~~vi~~~p~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:99 389 MEQIERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred HHhccCCCCceeecccc-cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 3322333 33333322 45566678899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC---cceEEeccccccccCCceeeecccc-ccceEE Q lcl|NC_013059. 442 ADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS---EKEVQLMAEVVDLATGERQVLNDIR-GRYECY 517 (725) Q Consensus 442 ~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~---~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~ 517 (725) |++.|++|||||+++++++|+++|+||++|||++|+|||||+++. .++|.||+ .+|..++.|||+ |+|||+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~-----~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:99 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNA-----EGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecc-----ccCcceecccceeeeEEEE Confidence 999999999999999999999999999999999999999988654 46888875 467778899995 999999 Q ss_pred EEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHH Q lcl|NC_013059. 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEA 596 (725) Q Consensus 518 v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~ 596 (725) |+++|+++|+|+++++.|++++++++| ..+..++. ++.++|+|++++++++++++..+....++.+++++++.+++ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p---~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~ 619 (714) T protein:vir:99 543 LAPVQQTPAFKAQLAQRMSEVIQGLPP---QVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQ 619 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCc---hhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHH Confidence 999999999999999999999998874 33333333 56899999999999999998776665566665555544444 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 597 QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQ 676 (725) Q Consensus 597 ~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q 676 (725) ++.++++.+.++.+++++..+.+++.+++++...+.+.++....++++ ..+... ...++...+.+.....+ T Consensus 620 q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~-----~~~~~~---~~~~a~~a~~~~~~~~~- 690 (714) T protein:vir:99 620 QALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQ-----GQRYVD---ALNQAHTAEIITGVQNM- 690 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHhHhhh- Confidence 444444444444444555555555554444433333332222111111 111000 00011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 677 QDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 677 ~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++..+ ...+.-.|+.+++.++-.+ T Consensus 691 ------~~~~~-~~~~q~~q~~~~~~~~~~~ 714 (714) T protein:vir:99 691 ------EQEQD-VLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred ------hhhhH-HHHHHHHHHHHHHHHhcCC Confidence 11100 0001112222222222111 No 14 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=7.9e-165 Score=920.24 Aligned_cols=663 Identities=16% Similarity=0.131 Sum_probs=495.9 Q ss_pred CC-------------cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHH Q lcl|NC_013059. 1 MA-------------DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVR 65 (725) Q Consensus 1 ma-------------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~ 65 (725) |+ ++..++.+++.+|.++++++++||++|.+|++||+|+||+++++++|+.+|+|| ||+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 33 234589999999999999999999999999999999999999999999999995 799999999 Q ss_pred HHHHHHhhCCcceEEecCCcch--HHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCcee Q lcl|NC_013059. 66 KLVSEMRQNPIDVLYRPKDGAS--PDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) Q Consensus 66 ~v~g~~~~nr~~~~~~pr~~~d--~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ 143 (725) +|+|++++||++++|+||+++| .++|++||++++|+++.|++++++|+||+++|+||+||++++++| |+++.++. T Consensus 81 ~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~---d~~~~~i~ 157 (714) T protein:vir:10 81 GVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---EPFGPEFK 157 (714) T ss_pred HHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeecc---CCCCCCeE Confidence 9999999999999999998765 479999999999999999999999999999999999999999886 46788887 Q ss_pred EEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh-hh--------------------- Q lcl|NC_013059. 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP-SF--------------------- 201 (725) Q Consensus 144 ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~--------------------- 201 (725) |+. +||.+|||||+|+++|+|||+|||+++|||+++++++||++++...... .+ T Consensus 158 i~~----v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 158 VST----VSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred EEe----cChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccch Confidence 775 3678899999999999999999999999999999999998653221110 00 Q ss_pred --hhcccccccccCC--CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEE Q lcl|NC_013059. 202 --QNPNDWVFPWLTQ--DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVY 277 (725) Q Consensus 202 --~~~~~~~~~~~~~--~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~ 277 (725) ...+...+.|+++ ++|||+|||||.++.. .+.++.+|++++|++.++.++.. +..|...+..++++ ||+ T Consensus 234 ~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~---~~~~~~~g~~~~~d~~~~~~~~~--~~~g~~~~~~~~~~--rv~ 306 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERL---PVIELSNGRVVAFDKNNLMQAVA--VASGRVQVKVGRVS--RIR 306 (714) T ss_pred hhcccccccccccccCcceEEEEEEEEeEEEEE---EeecCCCCCeeeeCccCHHHHHH--HHhccceeccccee--eEE Confidence 0011123446544 5789999999987643 33467889999999999888765 44677666666654 689 Q ss_pred EEEeeccccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc Q lcl|NC_013059. 278 KSIITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) Q Consensus 278 ~~~~~g~~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i 356 (725) |++|+|+++|++ ++||||++||||||||++.+++| .+||+||+|||+||++|+++|+++|++ +++ ++++.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g--~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~gav 381 (714) T protein:vir:10 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTG--EPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDAT 381 (714) T ss_pred EEEEecchhhhcCCCCCCCCceeeEEecceeeeccC--ccceehhhhhhHHHHHHHHHHHHHHHH--hCC-ceeeccccc Confidence 999999999965 89999999999999999886665 499999999999999999999999975 344 457788888 Q ss_pred chHHHHHHhhccccccc--cccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHH Q lcl|NC_013059. 357 AGFEHMYDGNDDYPYYL--LNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT 434 (725) Q Consensus 357 ~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~a 434 (725) ....+.....-++|..+ +++.. .+|..++.++++.+++++|+++++||+.+..+|+++||||++++|+.||++||+| T Consensus 382 ~~~d~~~~e~~~rp~~vi~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA 460 (714) T protein:vir:10 382 QLSDNDLMEQLERPDGIIKLNPVR-KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCeEEecccc-cccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHH Confidence 66544333333344333 33222 3455667889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCC---CcceEEeccccccccCCceeeecccc Q lcl|NC_013059. 435 VNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDG---SEKEVQLMAEVVDLATGERQVLNDIR 511 (725) Q Consensus 435 i~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~---~~~~v~in~~~~d~~~g~~~~~nDi~ 511 (725) |++++++|++.|++|||||+++++++|+++|+||++|||++|+|||+|+++ ..+++.+|.. .|.+++.|||+ T Consensus 461 I~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~-----~~~~~~~nDi~ 535 (714) T protein:vir:10 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE-----GDNGELTNDIS 535 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccc-----cCCccccccce Confidence 999999999999999999999999999999999999999999999998865 4578888854 45567789995 Q ss_pred -ccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH-hhccCCchhHHHHHHHHhhhhhhhhhhhccchhh Q lcl|NC_013059. 512 -GRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ-YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEE 589 (725) Q Consensus 512 -g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~-~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~ 589 (725) |+|||+|+++|+++|+|+++++.|++++++++|. .+..++. ++.++|+|+++++++++++++......++.++++ T Consensus 536 ~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~---~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~ 612 (714) T protein:vir:10 536 RLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQ---VQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEE 612 (714) T ss_pred eeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCch---hhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcch Confidence 9999999999999999999999999999988754 3333333 5789999999999999999877665555555554 Q ss_pred hHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 590 QQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) Q Consensus 590 ~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~ 669 (725) ++..++.++.++++.+.++++.+++..+.+++++++++...+.+.++....++++...+ . ....++...+.+ T Consensus 613 q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~-----~---~~~~~a~~a~~l 684 (714) T protein:vir:10 613 QEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRY-----V---DALNQAHTAEII 684 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----H---HHHHHHHHHHHH Confidence 44444333333333333344444444444444444433333222222111111111000 0 000011111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 670 KTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 670 ~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) .....+ ++..+.. .+.-.|+-++++++-.+ T Consensus 685 ~~~~~~-------~q~~~~~-~q~~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 685 TGVQNM-------EQEQDVL-QQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHhh-------hhhHHHH-HHHHHHHHHHHHHhcCC Confidence 111111 1111100 01111222222221111 No 15 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=8.2e-166 Score=925.64 Aligned_cols=679 Identities=17% Similarity=0.126 Sum_probs=495.1 Q ss_pred CC--c---HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc--ccchHHHHHHHHHHHhh Q lcl|NC_013059. 1 MA--D---NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ--FDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 ma--d---~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~--~N~i~~~v~~v~g~~~~ 73 (725) |. + ...+|.+++.+|.++++++++||++|.+|++||+|+||++++++.|+.+|+|| ||+|+|+|++|+|++++ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~ 90 (772) T protein:vir:10 11 LNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPALLSLQGYEAV 90 (772) T ss_pred hccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHHHHh Confidence 22 2 34588899999999999999999999999999999999999999999999995 69999999999999999 Q ss_pred CCcceEEecC-CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc Q lcl|NC_013059. 74 NPIDVLYRPK-DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) Q Consensus 74 nr~~~~~~pr-~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~ 152 (725) ||++++|+|| +++|.++|++||++++|+++.|+++++||+||+++|+||+||+++.++ +|+++.+|.|+++ + T Consensus 91 nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~---~d~~~~~i~i~~v----~ 163 (772) T protein:vir:10 91 TRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRE---SDPFKFPYRCRPI----R 163 (772) T ss_pred cCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccc---cCCCCCCeEEEee----C Confidence 9999999998 568999999999999999999999999999999999999999998654 6789999888864 6 Q ss_pred hhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchh----hhhhhhc---------------------ccc Q lcl|NC_013059. 153 CSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADD----IPSFQNP---------------------NDW 207 (725) Q Consensus 153 ~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~----~~~~~~~---------------------~~~ 207 (725) |++|||||+|++ |+|||+|||+++|||+++++++||+++..... ..++... ..| T Consensus 164 p~~v~~Dp~a~~-D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (772) T protein:vir:10 164 RDEIHWDMKCGD-DWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAW 242 (772) T ss_pred cccceecCCCCC-CHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhcc Confidence 778999999976 99999999999999999999999986531110 0110000 000 Q ss_pred ---ccccc--CCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEee Q lcl|NC_013059. 208 ---VFPWL--TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIIT 282 (725) Q Consensus 208 ---~~~~~--~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~ 282 (725) ...|+ +.++|||+|||||+++...++. +.+|+++.|++.++.+.. .++.|...+..+ .++||+|++|+ T Consensus 243 ~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~---~~~g~~~~~~~~~~~~~~--~l~~g~~~~~~~--~~~rv~~~~~~ 315 (772) T protein:vir:10 243 TVQEDHWYNPTSKEICLVELWYRRWVQVHVLK---SPDGRVVEYDPNNLAHNI--ALASGRISPKKV--TVSRVRRSYWL 315 (772) T ss_pred ccccccccccCCceEEEEEEeeeeeeeeeeec---cCCCceEeeCcccHHHHH--HHhhcccchhee--eeeEEEEEEEe Confidence 12343 3578999999999988766554 456999999999877754 555666555443 45679999999 Q ss_pred ccccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHH Q lcl|NC_013059. 283 CTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH 361 (725) Q Consensus 283 g~~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~ 361 (725) |+++|++ ++||||++||||||||++.+++|. +||+||+|||+||++|+++|+++|+++++. .++.+|++++.++ T Consensus 316 g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~--~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~---~~~~~gav~~~d~ 390 (772) T protein:vir:10 316 GPHCLHDGPTPYTHRHFPYVPFFGFREDATGI--PYGYVRGMKYAQDSLNSGVSKLRWGMSVAR---VERTKGAVAMTDA 390 (772) T ss_pred cceeeccCCCCCCCCccceEEEeeeEeccCCc--ccchhhhhhhHHHHHHHHHHHHHHHHhccc---ccccCCCccchhH Confidence 9999985 999999999999999999866655 999999999999999999999999988875 5788999988766 Q ss_pred HHHhhccccccccccccccCcc--ccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHH Q lcl|NC_013059. 362 MYDGNDDYPYYLLNRTDENNGE--MPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLN 439 (725) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q 439 (725) .+.....++..+. ..++|. .++.++++.+++++|+++++||+.+..+|+++||+|++++|..||++||+||.+++ T Consensus 391 ~~~e~~arp~~vi---~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq 467 (772) T protein:vir:10 391 QFRRQIARPDADI---VLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQI 467 (772) T ss_pred HHHHhccCCCCeE---EeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHH Confidence 5555555554432 223333 24667889999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCC--CcceEEeccccccccCCceeeecccc-ccceE Q lcl|NC_013059. 440 MRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDG--SEKEVQLMAEVVDLATGERQVLNDIR-GRYEC 516 (725) Q Consensus 440 ~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~--~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv 516 (725) ++|++.|++|||||+++++++|+++|+||++|||++|+|||+|+|+ .+++|.||+.+.|+.||..++.|||+ |+||| T Consensus 468 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv 547 (772) T protein:vir:10 468 EQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKV 547 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEE Confidence 9999999999999999999999999999999999999999999985 58999999999999999999999995 99999 Q ss_pred EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHH-HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHH Q lcl|NC_013059. 517 YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLL-QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVE 595 (725) Q Consensus 517 ~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~-~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q 595 (725) +|+++|+++|+|+++++.|+++++.++ |+....++ .+++++|+|+.+++++++++...+. .+++++ T Consensus 548 ~i~~~p~~~t~r~~~~~~m~ql~~~~~---P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~------~peq~~---- 614 (772) T protein:vir:10 548 ALEDVPSTNSYRGQQLNAMSEAVKSMP---PQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQ------TPEQIQ---- 614 (772) T ss_pred EeeccccchHHHHHHHHHHHHHHhccC---hhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccC------ChHHHH---- Confidence 999999999999999999999998876 44444433 3578999999999999998754321 111111 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 596 AQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASF 675 (725) Q Consensus 596 ~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~ 675 (725) +++.++.+++.+..++++++.+++++..+.+++..+.++++.....+++..+.+.++...++... +....++ T Consensus 615 ~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~--------a~~ad~~ 686 (772) T protein:vir:10 615 QQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMI--------APIADAV 686 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhh--------hHHHHHH Confidence 11111111112222223333333333333333333333333333333333332222211111100 0000000 Q ss_pred HHHHH-HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH-hcCCcccccCCCC Q lcl|NC_013059. 676 QQDRS-EDARANAEL--LLKGDEQTHKQRMEIANILQSQR-QNQPSGSVAETPQ 725 (725) Q Consensus 676 q~~~~-~~a~~~aE~--~~~~~~q~~~q~~e~~~~~~~~~-~~q~~~~~~~~~q 725 (725) ..+++ +......+. ...+...+ ..... ....+.+. +..|...++.-|| T Consensus 687 l~~~g~~~~~~~~~~~~~p~~~~~a-~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 738 (772) T protein:vir:10 687 MQSAGYQRPNPAGDDPNYPIADQTA-AMNIR-SPYIQGQGPAAEAEAESVSVRR 738 (772) T ss_pred HHhcccccccccccCCCCCCCCCcc-CCCCC-ccCCCCCCCCCccccCCCCCcc Confidence 00000 000000000 00000000 00000 00000000 0000011111112 No 16 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=1.6e-153 Score=858.27 Aligned_cols=670 Identities=15% Similarity=0.119 Sum_probs=473.0 Q ss_pred CCc--HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC--cccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MAD--NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad--~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp--~~N~i~~~v~~v~g~~~~nr~ 76 (725) +.+ ..++|++|+.+|+++++++++||++|.+|++||+|+||+++++++|+.+|+| +||+|+|+|++|+|++++||+ T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~nr~ 117 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKRGRS 117 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHhCCc Confidence 332 3569999999999999999999999999999999999999999999999999 479999999999999999999 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhhe Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV 156 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v 156 (725) +++|+|++++|+++|++||++++|++++|++++++++||+++++||+||++|+++|+ .++.+++++ +++|.+| T Consensus 118 ~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~---~~~~~~~~~----~~~p~~i 190 (776) T protein:vir:93 118 DFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDE---NDGEPIYAG----AESWRNI 190 (776) T ss_pred ceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeecc---CCCCceEee----ccChhhe Confidence 999999999999999999999999999999999999999999999999999999874 345566555 3578889 Q ss_pred eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhh----hh--------------------hhccccccccc Q lcl|NC_013059. 157 IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIP----SF--------------------QNPNDWVFPWL 212 (725) Q Consensus 157 ~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~----~~--------------------~~~~~~~~~~~ 212 (725) ||||+|+++|+|||+|||+++|||+++++++||++.+...... .+ .+..+..+.|. T Consensus 191 ~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (776) T protein:vir:93 191 LWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAY 270 (776) T ss_pred eeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccccccccc Confidence 9999999999999999999999999999999998654321100 00 00111233456 Q ss_pred CCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccC-CC Q lcl|NC_013059. 213 TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKD-KQ 291 (725) Q Consensus 213 ~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~-~~ 291 (725) +.++|||+|||||+++...++.+. ...+..+.++..+. .+...++.|...+..+. .++|+|++|.|+++|++ ++ T Consensus 271 ~~~~v~v~E~~~r~~~~~~~~~~~-~~~~~~~~~d~~~~--~~~~~~~~g~~~~~~~~--~~~v~~~~~~g~~~l~~~~~ 345 (776) T protein:vir:93 271 ARKRVRMIEAWFRMPVRVQRLKGR-NSDFRGEVFDPNDE--RHVLEVESGRAVLAVSP--MMRMHCAIMTTRDLMWAGPS 345 (776) T ss_pred CCCeEEEEEEEEeeeeehhhcccc-cccccceeecccch--HHHHHhhcCceeehhee--eeeeEEEEEecchhhhccCC Confidence 678999999999998866554331 12234555666553 44455777777766654 45788999999999965 89 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPY 371 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 371 (725) ||||++||||||||++. ++++++||+||+|||+||++|+++|+++|+++ +.++++.+|++++.++.++..+ +++ T Consensus 346 p~~~~~~Pfv~~~~~~~--~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~---~~~~~~~~gav~~~d~~~~~~~-rp~ 419 (776) T protein:vir:93 346 PYRHNRYPFTPIWGFRR--ARDGMPYGVIRFMRGMQDDVNKRLSKALYILS---TNKVLMEEGAVDDIDEFRREAA-RPD 419 (776) T ss_pred CCCCCccceEEecCcee--cccccccchHHhhhHHHHHHHHHHHHHHHhhc---CCceeeccccccchHHHHHhcc-cCC Confidence 99999999999999987 55567999999999999999999999999875 3478999999998877776543 333 Q ss_pred cccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 372 YLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 372 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) .+ +..++|... +++..+++++|+++++||+++..+|+++|||+++++|..+|++||+||++++++|++.+.+||| T Consensus 420 ~v---i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~d 494 (776) T protein:vir:93 420 AV---MTVKNGKLG--AVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFD 494 (776) T ss_pred ce---eeeCCcccc--ccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHH Confidence 32 344566543 4666778899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQ 530 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~ 530 (725) ||+++++++|+++|+||++|||++|+|||+|++|+.+||.||... +.||++ |+|||+|++||+++|+|++ T Consensus 495 n~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~---------~~nd~~~~~~dv~v~~~~~~~s~r~~ 565 (776) T protein:vir:93 495 NLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGL---------PENDITRTKADFIIDEAEWRATMRQA 565 (776) T ss_pred HHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccc---------hhhhhccceeeEEEeecccchhHHHH Confidence 999999999999999999999999999999999999999999643 458985 8999999999999999999 Q ss_pred HHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHH Q lcl|NC_013059. 531 NRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQ 610 (725) Q Consensus 531 ~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~ 610 (725) +++.|+++++++++.+ ...++..++.++|+|+.+++++++++...... +.+.+..+..++.++.++.+++.+... T Consensus 566 ~~~~l~ql~~~~~p~~--~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~---p~q~~~~~e~~~~qq~q~~~~q~q~~~ 640 (776) T protein:vir:93 566 AVAELMEVIGKMPPEI--ALTMLDLLVENMDIPNRDELVKRIRAVNGQKD---PDQDEPTPEEIAREQAQQQQQQYNDAL 640 (776) T ss_pred HHHHHHHHHhhcChhh--HHHHHHHHHHhcCccchHHHHHHHHHhhcccc---cchhhcchhHHHHHHHhhHHHHHHHHH Confidence 9999999998876432 12233334678899999999998876543221 112222211111112222111222122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELL 690 (725) Q Consensus 611 ~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~ 690 (725) +++.+.+.+++.. +.++++..++++++..+++.. .........++++.....+.. +.+.. +... T Consensus 641 ~~a~~~~~qa~a~-------~~~aea~~~~aqa~~~~~~a~-------~~~~~a~q~a~qa~~~~~~~~-~~a~~-a~~~ 704 (776) T protein:vir:93 641 AIATLEEQQAKAR-------KAAAEAQVAEAKAKHISRMAI-------REGVGAVKDATDAATAIAFMP-ELAGL-SDGI 704 (776) T ss_pred hhhhhhHhhHHHH-------HHHHHHHHHhhhhhhhhhcch-------hhhhhhhhhhhhhhhhhhhhh-hhhhh-hhhh Confidence 2222112222211 122222222222111111110 000000000000000000000 00000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHhcCCccccc-------CCCC Q lcl|NC_013059. 691 LKGDEQTHKQRMEIANILQS-QRQNQPSGSVA-------ETPQ 725 (725) Q Consensus 691 ~~~~~q~~~q~~e~~~~~~~-~~~~q~~~~~~-------~~~q 725 (725) ++.+.+..+ ......... ...+.|+.... +.|+ T Consensus 705 ~~~a~~~~p--~~p~~~~~~~~~~~~~~~p~~p~~p~~p~~p~ 745 (776) T protein:vir:93 705 LRESGWDDP--NTPQPASAASGMPPAPAQPAQPANPAQPPAPG 745 (776) T ss_pred hcccccccc--ccccccccccCCCCCCCCCCCCCCcCCCCCCC Confidence 000000000 000000000 00000000000 0000 No 17 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=2.3e-82 Score=468.10 Aligned_cols=620 Identities=14% Similarity=0.080 Sum_probs=374.0 Q ss_pred CC--cHHHHHHHHHHHHHHHHhhhHHHHHHH--HHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhh--- Q lcl|NC_013059. 1 MA--DNKNRLESILSRFDADWTASDEARREA--KNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQ--- 73 (725) Q Consensus 1 ma--d~~~~~~~~~~~~~~~~~~~~~~r~~a--~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~--- 73 (725) .- -+..++..|..++..+.....+-+.++ ..|++||.|+.= -...+.+...+.+.|+..|+|+++.... T Consensus 20 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~vv~~~v~~~ve~~~~~l~~~f~ 95 (763) T protein:vir:95 20 LTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK----PPKVKGRSQVQPKLVRRQAEWRYSALTEPFL 95 (763) T ss_pred CCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc----ccccCCCccccCHHHHHHHHHHHHHHHHhhc Confidence 11 133456666666665555444434443 455666777662 2233344455779999999999998887 Q ss_pred CCcce-EEecCCcchHHHHHHHHHHHHH-HHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC---------------- Q lcl|NC_013059. 74 NPIDV-LYRPKDGASPDAADVLMGMYRT-DMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ---------------- 135 (725) Q Consensus 74 nr~~~-~~~pr~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~---------------- 135 (725) ...+| .|.|+.++|.++|+..|.+++| +...|+.....+++|+++|+||+|+++|.|+.+.+ T Consensus 96 ~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~ 175 (763) T protein:vir:95 96 GSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQT 175 (763) T ss_pred CCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccc Confidence 56666 9999999999999999999999 57788888889999999999999977776652110 Q ss_pred ---------------------------C---------CCCCc------------e------eEEEEeeecchhheeeCCC Q lcl|NC_013059. 136 ---------------------------S---------PTSNN------------Q------VIRREPIHSACSHVIWDSN 161 (725) Q Consensus 136 ---------------------------~---------~~~~~------------~------~ir~~~~~~~~~~v~~Dp~ 161 (725) . ..+.+ . .+++. .+|+.++||||. T Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie--~V~p~d~~iDp~ 253 (763) T protein:vir:95 176 QEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVE--MLNPENIIIDPS 253 (763) T ss_pred hhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEE--eecHHHheecCC Confidence 0 00000 0 00111 368888999999 Q ss_pred ccccChhcccceeeeecCCHHHHHHHhhhcCC-cchhhhhhhhc----------ccccccccCCCeEEEEEEEEEeccee Q lcl|NC_013059. 162 SKLMDKSDARHCTVIHSMSQNGWEDFAEKFDL-DADDIPSFQNP----------NDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) Q Consensus 162 a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~vrv~E~w~~~~~~~ 230 (725) +++ |++||+|||++.++|++++.++...|.. +..+..+.... ....+...+.++|+|.|||.+.++. T Consensus 254 a~s-D~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~- 331 (763) T protein:vir:95 254 CQG-DINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIE- 331 (763) T ss_pred CCC-chhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccC- Confidence 988 8899999999999999999887333321 11111111111 1111122235678899999874321 Q ss_pred EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccC-CCCCCCCccceEEEEeeeec Q lcl|NC_013059. 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKD-KQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~-~~~~p~~~~p~vP~~g~~~~ 309 (725) .| | ..++++.+|.|+++|.. .+||||+.||||+|++++. T Consensus 332 -----gd------------------------g----------~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~- 371 (763) T protein:vir:95 332 -----GN------------------------G----------VLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPV- 371 (763) T ss_pred -----Cc------------------------c----------eeEEEEEEEEcCeeeecccccccCCCcCEEEecceee- Confidence 11 1 12345567789999864 7999999999999999875 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCC Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 389 (725) .++.+++|+++.++|+|+++|+++|+++|++++++++++++..++++..+ .....++. .+..++|..+...+ T Consensus 372 -~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~d----~~~~~pg~---v~~v~~g~~~~~~~ 443 (763) T protein:vir:95 372 -KRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDALN----SRRYREGE---DYEYNPTQNPAQMI 443 (763) T ss_pred -cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccchh----hhcccCCc---eEEeeCCCChhhhc Confidence 77788889999999999999999999999999999999999999876432 22233433 34556677777788 Q ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVA--YDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 390 ~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~S--g~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) +...++++|++.+.|+++....++.+|||++.++|..++..+ +.+|..+++++++.+..+++||..+++.+|+++|+| T Consensus 444 ~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~L 523 (763) T protein:vir:95 444 IEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAM 523 (763) T ss_pred ccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888899999999999999999999999999999998765533 245888899999999999999999999999999999 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) |++|||++|+|||+|++ ||++++ .|++|+|||+|+++++ +.++++++.++++++.+++.+| T Consensus 524 i~q~~d~~rviRI~g~e----~v~v~~-------------~~~~~~~DV~V~~~~a--s~~~q~~~~l~~ll~~l~~~~~ 584 (763) T protein:vir:95 524 NAVFLAEHEVVRITNEE----FVTIKR-------------EDLKGNFDLEVDISTA--EVDNQKSQDLGFMLQTIGPNVD 584 (763) T ss_pred HHhhCCCCcEEEEeCCc----cccccH-------------HHhcCCcceEEecccc--hHHHHHHHHHHHHHHHhccccC Confidence 99999999999999863 776663 4688999999999875 5567778888889988877665 Q ss_pred hHH-HHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 548 EYQ-LLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 548 ~~~-~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) ... ..++ ...++.....++++.++.... +.++ .++.+. +.+..+.+.+.+.+++++++.++++....++ T Consensus 585 ~~~~~~il--~~~~d~~~~~~~~~~lr~~q~------~~d~-~~q~qa-qle~~~~q~e~~~~~akaq~~qaqa~~~~aq 654 (763) T protein:vir:95 585 QQITLNIL--AEIADLKRMPKLAHDLRTWQP------QPDP-VQEQLK-QLAVEKAQLENEELRSKIRLNDAQAQKAMAE 654 (763) T ss_pred hHHHHHHH--HHHHhhhchhhhHHHHHhcCC------Cccc-hhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 322 2211 122233333344444433211 1111 111100 0111111111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIAN 706 (725) Q Consensus 627 ae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~ 706 (725) +++++.++.+...+.| +..+.+..+...+.+.+ .+..++.++...++.....++.+.-.+ T Consensus 655 ~e~~~~d~~~~e~~~Q-------------------~~~e~~~~~~~~eaq~~-l~~~~a~~~~~~ea~~~~~~~~~~~~~ 714 (763) T protein:vir:95 655 RDNKNLDYLEQESGTK-------------------HARDLEKMKAQSQGNQQ-LEITKALTKPRKEGELPPNLSAAIGYN 714 (763) T ss_pred HHHHHHHHHHHHHHHH-------------------HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccChhHHHhhhhc Confidence 1111111111000000 00000111111111000 000001111000000000000000000 Q ss_pred H-HHHHH-hcC--CcccccCC---------CC Q lcl|NC_013059. 707 I-LQSQR-QNQ--PSGSVAET---------PQ 725 (725) Q Consensus 707 ~-~~~~~-~~q--~~~~~~~~---------~q 725 (725) . ....- ..+ .+.+.++. |. T Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 746 (763) T protein:vir:95 715 ALTNGEDTGIQSVSERDIAAEANPAYSLGSSQ 746 (763) T ss_pred ccccccCCCccchhhcccCccccccccCCCCC Confidence 0 00000 000 00011111 11 No 18 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=4.2e-80 Score=455.77 Aligned_cols=615 Identities=15% Similarity=0.103 Sum_probs=370.1 Q ss_pred CC--------cHHHHHHHHHHHHHHHHhhhH-HHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC--CcccchHHHHHHHHH Q lcl|NC_013059. 1 MA--------DNKNRLESILSRFDADWTASD-EARREAKNDLFFSRVSQWDDWLSQYTTLQYR--GQFDVVRPVVRKLVS 69 (725) Q Consensus 1 ma--------d~~~~~~~~~~~~~~~~~~~~-~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr--p~~N~i~~~v~~v~g 69 (725) || ++.+++.-+...++.|.++.. ....++.+.++||+|++|+... .|+ .+.|.|...|+++++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~------~~~s~~~~~~v~~~v~~~~~ 74 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNER------PGKSGIVSRDVQETVDWIMP 74 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCccc------CCCCccccHHHHHHHHHHHH Confidence 66 345688888888888888765 4556889999999999997643 355 356899999999988 Q ss_pred HHhh----CCcceEEecCCcchHHHHHHHHHHHHHH-HHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC--------- Q lcl|NC_013059. 70 EMRQ----NPIDVLYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ--------- 135 (725) Q Consensus 70 ~~~~----nr~~~~~~pr~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~--------- 135 (725) +... +..-++|.|+.++|.++|+++|.+++|+ .+.|+....++++|+++|+||+||++|.|+.... T Consensus 75 ~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~ 154 (705) T protein:vir:88 75 SLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGL 154 (705) T ss_pred HHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccC Confidence 6664 6778999999999999999999999995 8899999999999999999999999998753210 Q ss_pred ----------CC--------CCC----c---------eeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHH Q lcl|NC_013059. 136 ----------SP--------TSN----N---------QVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGW 184 (725) Q Consensus 136 ----------~~--------~~~----~---------~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~ 184 (725) |+ ... + -.|++. ++||.+|||||+++. +.||.|++++.+||++++ T Consensus 155 ~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~--~V~p~d~~~dp~a~~--~~d~~~~~~~~~~t~~dl 230 (705) T protein:vir:88 155 SEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVL--CVKPENFLVDRLATC--IDDARFLCHREKYTVSDL 230 (705) T ss_pred ChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeee--eccHHHceecCCCCC--cccCcEEEEEEeccHHHH Confidence 00 000 0 012222 457889999999986 559999999999999999 Q ss_pred HHHhhhcC------Ccchhhh--hhhhccccccc----------cc--CCCeEEEEEEEEEecceeEEEEeeCcccccee Q lcl|NC_013059. 185 EDFAEKFD------LDADDIP--SFQNPNDWVFP----------WL--TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPV 244 (725) Q Consensus 185 ~~~~p~~~------~~~~~~~--~~~~~~~~~~~----------~~--~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~ 244 (725) .++++... .+..+.. ..+......++ |. ....|.|.|||.+.+.. .| T Consensus 231 ~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~------~d------- 297 (705) T protein:vir:88 231 RLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVD------GD------- 297 (705) T ss_pred HhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEeccc------CC------- Confidence 88865321 1110100 00000011111 11 12246667777653221 11 Q ss_pred ecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhhhhhh Q lcl|NC_013059. 245 SYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTK 324 (725) Q Consensus 245 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~k 324 (725) | ..+.++.++.|+++|..+ |.+++||+.+.+++. .++.+++|+++.++ T Consensus 298 -----------------~----------~~~~~~~~~~g~~il~~~---~~~~~PF~~~~~~p~--~~~~~G~g~~~~~~ 345 (705) T protein:vir:88 298 -----------------G----------ISELRRILYVGDYIISNE---PWDCRPFADLNAYRI--AHKFHGMSVYDKIR 345 (705) T ss_pred -----------------c----------ceeeEEEEEeCccccccc---cCCCCCEEEecceee--cCccccCChHHHHh Confidence 1 123455677899888543 346799998766654 78888999999999 Q ss_pred hHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCCCCchHHHHHH Q lcl|NC_013059. 325 DGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYM 404 (725) Q Consensus 325 d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~l 404 (725) |+|+.+|++++++++++++++++++++..|++...+ ..++.|+.+ +..++ .+.+++++++++|++.++| T Consensus 346 d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~~d----~~~~~pg~v---v~~~~----~~~i~~~~~~~~~~~~~~l 414 (705) T protein:vir:88 346 DIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLED----LLTNEAAGI---VRVKS----MNSITPLETPQLSGEVYGM 414 (705) T ss_pred HHHHHHHHHHHHHHHHHHhccCCceeccccccCccc----ccccCCCee---EEecC----CCccccccCCcCcHHHHHH Confidence 999999999999999999999999999998875422 223344433 22222 2457889999999999999 Q ss_pred HHHHHHHHHHHhCCChHHhccCcc----hhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEE Q lcl|NC_013059. 405 LEAATAAVKEVATLGVDAEAVNGG----QVAYDTVNQLNMRADLETYVFQDNLA-TAMRRDGEIYQSIVNDIYDVPRNVV 479 (725) Q Consensus 405 l~~~~~~i~~~tGv~~~~~G~~~n----~~Sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~g~~ll~li~~~y~~~r~ir 479 (725) +++....++++|||++.++|.+++ ..++.+|..+.++|++.+..+++||+ ++++.+|+++++||.+||+++++|| T Consensus 415 l~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~r 494 (705) T protein:vir:88 415 LDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQ 494 (705) T ss_pred HHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEe Confidence 999999999999999999997653 34677899999999999999999997 6799999999999999999999999 Q ss_pred EeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhH--HHHHHHHHHHHHHhcccccchHHHHHHHhh Q lcl|NC_013059. 480 ITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSM--KQQNRAEILELLGKTPQGTPEYQLLLLQYF 557 (725) Q Consensus 480 I~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~--r~~~~~~l~ell~~~~~~~p~~~~~~~~~~ 557 (725) |+| .||.+|+ .++.++|||.|++++++.+. +.+.+..++++.+.+.+. |.....+. T Consensus 495 i~g-----~~v~v~~-------------~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~-~~~~~~~~--- 552 (705) T protein:vir:88 495 LRG-----KWVAVNP-------------ANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGG-GGLGVLVS--- 552 (705) T ss_pred ecc-----chhccch-------------HhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcc-cchhhhcC--- Confidence 998 3677764 35778999999999988773 233344444444333322 11111100 Q ss_pred ccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 558 TLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAA 637 (725) Q Consensus 558 ~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~ 637 (725) .........++.+..+.+.......++...+.++..++..+.+. ++..+..++|+++.+++++.++++++.+..+.++. T Consensus 553 ~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~-~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q 631 (705) T protein:vir:88 553 EQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEA-QPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQ 631 (705) T ss_pred hHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00001122233333322222222222222222211111111111 11112222333444444443333333322222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH----HHHHHH-HHHHHHHH Q lcl|NC_013059. 638 KVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDR--SEDARANAELLLKGDEQ----THKQRM-EIANILQS 710 (725) Q Consensus 638 ~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~--~~~a~~~aE~~~~~~~q----~~~q~~-e~~~~~~~ 710 (725) ..++..+... .+....+. +. +..++..+.++.+ .+.++.+++..++..+. ....+. +..+...+ T Consensus 632 ~~q~e~e~~~-------~~~~~~~~-e~-~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~~~~~k~~~~ 702 (705) T protein:vir:88 632 IRLAEIELKK-------QEAVLQQR-EM-ALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKA 702 (705) T ss_pred HHHHHHHHHH-------HHHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 1111111110 00000000 00 0000011111111 11112222221111110 000000 00111111 Q ss_pred HHh Q lcl|NC_013059. 711 QRQ 713 (725) Q Consensus 711 ~~~ 713 (725) .+. T Consensus 703 ~rr 705 (705) T protein:vir:88 703 VRR 705 (705) T ss_pred hcC Confidence 111 No 19 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=9.2e-55 Score=316.82 Aligned_cols=577 Identities=10% Similarity=0.020 Sum_probs=333.9 Q ss_pred CCcHHHHHHHHHHHHHHHHhhh----HHHHHHH------HHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTAS----DEARREA------KNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSE 70 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~----~~~r~~a------~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~ 70 (725) |.|...+..-++.+|+++.+.. ++|+... .++.+||+|..|....-..+..+.+.+.|.++..|++++.. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~ 94 (651) T protein:vir:80 15 YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHAY 94 (651) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHHH Confidence 7788888888888888887764 3565443 47789999987754433333344455779999999999887 Q ss_pred HhhC----CcceEEecCCcch--HHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeecc---------- Q lcl|NC_013059. 71 MRQN----PIDVLYRPKDGAS--PDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED---------- 134 (725) Q Consensus 71 ~~~n----r~~~~~~pr~~~d--~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~---------- 134 (725) .... ..=++|.|.+.+| ...+++++.++.+-+..+++...++..++++++.|.|+++|.|+... T Consensus 95 l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~ 174 (651) T protein:vir:80 95 LMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVR 174 (651) T ss_pred HHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheecc Confidence 6664 3337777855444 33556677766666678999999999999999999999998876321 Q ss_pred ----CCCCCCceeE-------EEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCc--chhhhh- Q lcl|NC_013059. 135 ----QSPTSNNQVI-------RREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLD--ADDIPS- 200 (725) Q Consensus 135 ----~~~~~~~~~i-------r~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~--~~~~~~- 200 (725) ++.......| ......+||.+|||||.++. +.||.|++++.++..+..+.....+..+ ..+..+ T Consensus 175 ~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~--~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~ 252 (651) T protein:vir:80 175 TPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTD--PNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEH 252 (651) T ss_pred ccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcC--ccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhh Confidence 0100011111 11111357788999999975 5699999998776555333222111100 000000 Q ss_pred -----------hhhccccc--ccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhh Q lcl|NC_013059. 201 -----------FQNPNDWV--FPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIA 267 (725) Q Consensus 201 -----------~~~~~~~~--~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~ 267 (725) ......+. ..+.....|.|.|||.+... + .+. T Consensus 253 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~--------e--~~~------------------------- 297 (651) T protein:vir:80 253 KCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHL--------E--NKT------------------------- 297 (651) T ss_pred hccccccCCccccccccCCCccccccccceEEEEEEEEeec--------c--CCc------------------------- Confidence 00000000 01123467889999986321 1 000 Q ss_pred ccceeEEEEEEEEeecccccc-CCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013059. 268 ERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPK 346 (725) Q Consensus 268 ~~~~~~~~v~~~~~~g~~~l~-~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~ 346 (725) .+.+...+.|.++|. +.+||++. +||+++.+.+ ++|+.++.|+++.+.|.|+.+|+..+.+++++.++++ T Consensus 298 ------~~~~~v~~~g~~il~~~~~~~~~~-~Pf~~~~~~~--~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~ 368 (651) T protein:vir:80 298 ------YHDVVVTIMGNEVLRFEQNPYWCG-RPFVIGTYIP--TARQPYAMGALQPNLGMLHELNIITNQRLDNLELAID 368 (651) T ss_pred ------eEEEEEEEcCcEEecccccCCCCC-CCeeeeccee--cCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhC Confidence 011123344566664 35667665 5999776554 5899999999999999999999999999999999999 Q ss_pred cceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccC-CCCchHHHHHHHHHHHHHHHHHhCCChHHhcc Q lcl|NC_013059. 347 KKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYE-NPEVPQANAYMLEAATAAVKEVATLGVDAEAV 425 (725) Q Consensus 347 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~ 425 (725) +++++.++.+...++.. ..|+.+. .+. ..+ .+.++. .+..+...++++++....+++++|+++.++|. T Consensus 369 ~~~~v~~d~~~~~~~l~----~~pg~vi-~~~-~~~-----~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~ 437 (651) T protein:vir:80 369 QMYTLRSDGLLQPEDVY----TEPGKVF-LVS-DHG-----DLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGAN 437 (651) T ss_pred CcEEecCCccccHHHhh----cCCCceE-Eec-CCC-----CceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCC Confidence 99999888665433321 2232321 111 111 123332 33456788999999999999999999999996 Q ss_pred Ccc---hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcEEEEeccCC-CcceEEecccccccc Q lcl|NC_013059. 426 NGG---QVAYDTVNQLNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVVITLEDG-SEKEVQLMAEVVDLA 500 (725) Q Consensus 426 ~~n---~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~g~~ll~li~~~y~~~r~irI~~~d~-~~~~v~in~~~~d~~ 500 (725) .+. .+++.+|..++.++...+..++++|.. +++.++++++.|+.+||+.++++||+|++. ...++.+++ T Consensus 438 ~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~------ 511 (651) T protein:vir:80 438 AARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDV------ 511 (651) T ss_pred CccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCc------ Confidence 542 245578999999999999999999996 789999999999999999999999999863 334555542 Q ss_pred CCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh Q lcl|NC_013059. 501 TGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG 580 (725) Q Consensus 501 ~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~ 580 (725) +|+.++||++ ..|+.....|.+.++.+.++++.+.+. |.....+. ....+.++.+.++-...... T Consensus 512 -------~dl~~~~~iv-~~g~~~~~~r~~~~~~l~~~~q~~~~~-p~~~~~~~------~~~~~~~l~~~~g~~~~~~~ 576 (651) T protein:vir:80 512 -------EDLQKEVRLV-PIGSDHVIERKQYIEDRLTFIQAVAQV-PEMGQLVD------YKRILVDLLQHWGFEEPEAY 576 (651) T ss_pred -------cceeeeeeee-eccHHHHHHHHHHHHHHHHHHHhhccC-Cccchhhh------HHHHHHHHHHHcCCCCcHHh Confidence 4788888884 566666566777777887777765542 22111110 00112333333332222212 Q ss_pred hhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 581 VKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLS 660 (725) Q Consensus 581 ~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~ 660 (725) +..+.+. ..+..+++...+ ++...++++.+.+++.+++ ..+.+.+.++...+++.+.+ T Consensus 577 l~~~~q~----~~~~~~~~~~~q--~~~~~~~a~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-------------- 634 (651) T protein:vir:80 577 LKQQDQQ----APANPQEALLSQ--AKDVGGQAMSNMLQNQLQA--DGGTQMMSEMYGTPNADQMQ-------------- 634 (651) T ss_pred cCCCccc----hhhhhhHHHHhh--HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------- Confidence 2111111 111111111111 1111111111111100000 00011111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 661 KQSEFREFLKTVASFQQDRSEDARANAELLL 691 (725) Q Consensus 661 ~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~ 691 (725) ++.... +....++|+-. T Consensus 635 ~~~~~~--------------~~~l~~~~~~~ 651 (651) T protein:vir:80 635 QELMAT--------------TPNVSEQQLTQ 651 (651) T ss_pred HHHHHH--------------HHHHHHhhccC Confidence 000000 01111111111 No 20 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=6.5e-37 Score=218.97 Aligned_cols=542 Identities=13% Similarity=0.012 Sum_probs=299.8 Q ss_pred CCcHHHHHHHHH----------HHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESIL----------SRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSE 70 (725) Q Consensus 1 mad~~~~~~~~~----------~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~ 70 (725) |+-...-+.+++ -.|.+..++-..|-.+..+=++||.+-=-...--..+..+...++|+|...+++|+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 874433333333 3333333332222223344566766511111111122223345679999999988765 Q ss_pred Hh----hCCcceEEecCCcchHHH--HHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC---C--CC Q lcl|NC_013059. 71 MR----QNPIDVLYRPKDGASPDA--ADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS---P--TS 139 (725) Q Consensus 71 ~~----~nr~~~~~~pr~~~d~~~--Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~---~--~~ 139 (725) .. .|+-=+++.|..++|.++ +++++.++..=+..+++..+....|+++++.|.|++++.+.-.-+- . .. T Consensus 81 l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~ 160 (584) T protein:vir:95 81 YFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVP 160 (584) T ss_pred HHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecccccc Confidence 44 455568888988877654 8888888888788999999999999999999999998875432110 0 11 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHh-----hhcCCcchhhhhhhhccccccccc-- Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFA-----EKFDLDADDIPSFQNPNDWVFPWL-- 212 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~-----p~~~~~~~~~~~~~~~~~~~~~~~-- 212 (725) .-+..+++. .+|.+|||||.+++ .+|+.||+ +..+|++++.++- |.|..+..............+.+. T Consensus 161 ~~~~prier--iSP~d~~~Dpsa~~--i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~ 235 (584) T protein:vir:95 161 DYIGPRLVR--ISPLDIVFNPLATS--ISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDF 235 (584) T ss_pred ccccceEEe--eChhheeecCCCCC--ccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccc Confidence 111223332 34567999999976 45999999 5557999998876 333332221111111000000000 Q ss_pred -------CCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccc Q lcl|NC_013059. 213 -------TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA 285 (725) Q Consensus 213 -------~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~ 285 (725) .+...-+.||+-..++ .++.+. |.+......+ .....+ ..++.|.+ T Consensus 236 ~~~~~~~~d~~~~~~ey~~~~~V--~vl~~~----g~~~~~~~~e--------------------~~~~~i-v~v~~g~~ 288 (584) T protein:vir:95 236 DKAAGFDVDGFGNLYEYYMSDWV--EILEFY----GDYHDKETGE--------------------LQTNRI-ITVVDRST 288 (584) T ss_pred ccccccccccccccccccCCcee--EEEeec----ccccccccCC--------------------Ccccce-EEEEeccE Confidence 0011112333322111 111110 1110000000 001111 23456777 Q ss_pred cc-cCCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHH Q lcl|NC_013059. 286 VL-KDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD 364 (725) Q Consensus 286 ~l-~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 364 (725) +| ...+|+|++.+||+-+ ++ .+...+.++.|+-..+.|.|+.+|..+..+++++.++.+..+- ...+ .++.. T Consensus 289 iIR~~~np~~~~~~PF~~~-~~-~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k---~~~~-~~~~~- 361 (584) T protein:vir:95 289 EVRNESIPTWFGSAPIYHV-GW-RFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLK---IIGE-VEEFV- 361 (584) T ss_pred EEEeeecCCCCCCCCEEEE-cc-eeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCccee---eccc-cchhc- Confidence 77 4578999999999733 33 4567888999999999999999999999999999998876221 1111 11221 Q ss_pred hhccccccccccccccCccccccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcchh-HHHHHHHHHHHH Q lcl|NC_013059. 365 GNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQV-AYDTVNQLNMRA 442 (725) Q Consensus 365 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~-Sg~ai~~~q~q~ 442 (725) ..|...... ...| ..++++++... .+.++.+++....+.+.|||...+.|..+.+. ++..+.++.+++ T Consensus 362 ---~~pg~~~~~--~~~~-----~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa 431 (584) T protein:vir:95 362 ---WGPGAEIHL--DQGG-----DVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAA 431 (584) T ss_pred ---ccCCceeec--CCCC-----CcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHH Confidence 122222211 1111 23444443211 23456688999999999999999999865331 233578888888 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcEEEEeccC-CCcceEEeccccccccCCceeeeccccccceEEEEe Q lcl|NC_013059. 443 DLETYVFQDNLATAM-RRDGEIYQSIVNDIYDVPRNVVITLED-GSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDV 520 (725) Q Consensus 443 ~~~~~~~~dn~~~~~-~~~g~~ll~li~~~y~~~r~irI~~~d-~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~ 520 (725) +....++.+.+...+ ++++..|+....++++..-+|||+|++ |...|+.|.+ +||.|.||++..- T Consensus 432 ~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r-------------~Dl~g~~~~va~G 498 (584) T protein:vir:95 432 GRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTR-------------EDITANGKIRPIG 498 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccCh-------------hhhccCeeEEeeh Confidence 888889999988775 888888888888888999999999987 5666776643 6899999999877 Q ss_pred ccCchhHHHHHHHHHHHHHHh-ccc-ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHH Q lcl|NC_013059. 521 GPSFQSMKQQNRAEILELLGK-TPQ-GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQ 598 (725) Q Consensus 521 ~p~~~t~r~~~~~~l~ell~~-~~~-~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q 598 (725) +.... .|++..+.+.+++++ +++ ..| .+....+..++....+. +.-.+..+....++|+.. |+ T Consensus 499 a~~~~-~keq~~q~l~~ilq~~~~~~i~p-----------~~~~~~l~~~ladl~~~-p~~~~~~~~~~~~~Q~~~--q~ 563 (584) T protein:vir:95 499 ARHFG-KQAQDLQNLVGIFNSQIGQMILP-----------HTSGKALATFVDDVTGL-QGYEIFRPNVAVAEQAET--QS 563 (584) T ss_pred hhHHH-HHHHHHHHHHHHHHhhhhhhccc-----------cchHHHHHHHHHHHhCC-CcccccCCCcccchhHHH--Hh Confidence 65445 477878888887763 222 223 22222222222111110 011111111111111000 00 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 599 AKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLS 631 (725) Q Consensus 599 ~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k 631 (725) ...++|. ..++|++ +.++-+- T Consensus 564 --------~~~~~q~-~~~~~~~---~~~~~~~ 584 (584) T protein:vir:95 564 --------LVAQAQE-DLQLQAQ---MPAEGAI 584 (584) T ss_pred --------hhHHHHH-HHHHHHh---hhhccCC Confidence 0111110 0011110 0000000 No 21 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=1.2e-35 Score=212.01 Aligned_cols=566 Identities=12% Similarity=0.045 Sum_probs=295.7 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC-----C-------CCCHHHHHHHhhcCCCcc----cchHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRV-----S-------QWDDWLSQYTTLQYRGQF----DVVRPVV 64 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G-----~-------QW~~~~~~~l~~~grp~~----N~i~~~v 64 (725) |++++ +...++.+|..+.+....|=.+..+.++||.. + +|........ +.+.+. +.+..+. T Consensus 20 ~~~~~-~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--r~ki~~~~~~~~~~~l~ 96 (641) T protein:vir:94 20 LSTDR-IGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADW--RHRINTGHTFEVVETLV 96 (641) T ss_pred CCchh-HHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcc--cccccchhHHHHHHHHh Confidence 77665 55556666666655443333333444555432 1 2322221111 122223 4444444 Q ss_pred HHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeecc---------- Q lcl|NC_013059. 65 RKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED---------- 134 (725) Q Consensus 65 ~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~---------- 134 (725) -++.+....++.=+++.|++++|.+.|++++..+++.+..|++...++..|.+++..|.|++++.++... T Consensus 97 s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~ 176 (641) T protein:vir:94 97 AYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVE 176 (641) T ss_pred hHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhccc Confidence 4555555566776799999999999999999999999899999999999999999999999988765321 Q ss_pred -CCCCCCc---------eeEEEEeeecchhheeeCCCccccChhccccee-eeecCCHHHHHHHhhhcCCcchhhhhhhh Q lcl|NC_013059. 135 -QSPTSNN---------QVIRREPIHSACSHVIWDSNSKLMDKSDARHCT-VIHSMSQNGWEDFAEKFDLDADDIPSFQN 203 (725) Q Consensus 135 -~~~~~~~---------~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 203 (725) .+.++.+ -.++++|+ ++.++||||.++..+ ..|++ +...++..+++.. +-++.+..+. .. T Consensus 177 ~~~~~~~~~~~~v~~~~~~~r~~~v--~~~di~~dps~~~~~---~~f~~~r~t~~t~~~l~~e-g~~~~d~v~~---~~ 247 (641) T protein:vir:94 177 TGDIFGGWEDVAVNRQRSELRIEPL--SPYDVWLDTSGGKNT---GTFVRLRHTREELHELVTS-GYYDLDLTQV---EQ 247 (641) T ss_pred chhhcccccccceecccceeeEEec--chhheeecCCCCccc---ccceehhhhHHHHHHHHhc-CCCChhhcch---hh Confidence 1223221 12344543 555799999886533 33333 2233343333322 0011111111 00 Q ss_pred cccccc---------cccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEE Q lcl|NC_013059. 204 PNDWVF---------PWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRR 274 (725) Q Consensus 204 ~~~~~~---------~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 274 (725) ..++.+ ..++..+.++.|||... +. .| ...+ T Consensus 248 ~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~--------------------~~------------d~--------~~~~ 287 (641) T protein:vir:94 248 YVDYKFADPDTPKDVNGTDTSGWDIIEYYGPL--------------------LV------------EG--------VQFW 287 (641) T ss_pred cccccccccccccccccccccccceeeeeeee--------------------cc------------CC--------Ccee Confidence 000000 00111112233333100 00 00 0111 Q ss_pred EEEEEEeeccccccC-CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeech Q lcl|NC_013059. 275 RVYKSIITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWP 353 (725) Q Consensus 275 ~v~~~~~~g~~~l~~-~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~ 353 (725) .++ ..+.|.++|.. .+++ ++.+||+.+... .++++.|+.|.+..+.+.|+.+|+.....++.+.++.++++++.. T Consensus 288 ~~~-~~~~g~~il~~~~~~~-~d~~Pf~~~r~~--~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~ 363 (641) T protein:vir:94 288 CVH-AVFYGKQLIRLSDSKY-WCGSPFVTTTLL--PDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVE 363 (641) T ss_pred eEE-EEEeCCEEeecccccc-cCcCCeEEecce--ecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecc Confidence 222 33467777744 2332 456799855433 468899999999999999999999999999999999999988776 Q ss_pred hhcchHHHHHHhhccccccccccccccCccccccCCcccCCCC-chHHHHHHHHHHHHHHHHHhCCChHHhccCc---ch Q lcl|NC_013059. 354 EQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNG---GQ 429 (725) Q Consensus 354 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~---n~ 429 (725) +.+-.-.+. ...|+.. +....+ ..++++.+.. -.....++++.....+.+.+|+...++|..+ .. T Consensus 364 ~~~~~~~~l----~~~PG~i---i~~~~~----~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 432 (641) T protein:vir:94 364 DGILKREDV----KAKPGAV---FKVAQH----GSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGER 432 (641) T ss_pred cccccccee----eccCCcc---eeeCCC----CcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchh Confidence 543221111 1122221 111111 1233333222 1223456777777889999998776665543 23 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC-cceEEeccccccccCCceeee Q lcl|NC_013059. 430 VAYDTVNQLNMRADLETYVFQDNLA-TAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS-EKEVQLMAEVVDLATGERQVL 507 (725) Q Consensus 430 ~Sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~-~~~v~in~~~~d~~~g~~~~~ 507 (725) +++..|..+.+++...+..+..+|. .+++.+++.++.++.++++.+.++|++|.... ..++++. . T Consensus 433 ~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~-------------p 499 (641) T protein:vir:94 433 VTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVS-------------P 499 (641) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCC-------------c Confidence 4566799999999999999999999 68888999999999999999999999987522 1233332 2 Q ss_pred ccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchh-HHHHHHHHhhhhhhhhhhhccc Q lcl|NC_013059. 508 NDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKG-VEMMRDYANKQLIQMGVKKPET 586 (725) Q Consensus 508 nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~-~~~i~e~~~kq~~~~~~~~~~~ 586 (725) +||.|+||+ |..|.+....+.+.++.|.++++.+.. .|.+ +.+.|+.. +.++++.++--.+...+..+.+ T Consensus 500 ~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~-~P~v-------~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~ 570 (641) T protein:vir:94 500 EYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGR-VPQI-------GQSLDYALILEDLLRQMRFTDPMRYIKKAEA 570 (641) T ss_pred cceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhc-Chhh-------hhcCCHHHHHHHHHHHhCCCCchhhccCccC Confidence 567888888 567767776777777778777766543 2332 23333332 3444443332111222222111 Q ss_pred hhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_013059. 587 PEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIA--EIFNNMDLSKQSE 664 (725) Q Consensus 587 ~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~--~~~~q~~~~~~~~ 664 (725) ++......+++++ +..+.+++... .....++.....+ .+++.+..++..+ ....|+.++.... T Consensus 571 ---~~~~~~~~~~~~q--~~~~~~a~~~~-----~~~~~~a~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 635 (641) T protein:vir:94 571 ---PPAAPPIAPAEPG--ALPPEMMNSVG-----GGLNDQAIAGMTP-----EDVSDLASRIGIDTSDVAPEAMAAATQQ 635 (641) T ss_pred ---chhHHHHHHHHHH--HHHHHHHHHHH-----hhhHHHHHHHhhH-----HHHHHHHHhhcCCchhhhHHHHhccccc Confidence 1111101111110 01111111100 0000111000000 0111111111000 0000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 665 FREFLKTVASFQQDRSEDARANAELLL 691 (725) Q Consensus 665 ~~e~~~~~~~~q~~~~~~a~~~aE~~~ 691 (725) .. +..+ T Consensus 636 ~~---------------------~~~~ 641 (641) T protein:vir:94 636 IT---------------------SGAL 641 (641) T ss_pred cc---------------------ccCC Confidence 00 0000 No 22 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=2.3e-34 Score=205.03 Aligned_cols=605 Identities=13% Similarity=0.125 Sum_probs=310.5 Q ss_pred CCcH---------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADN---------KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~---------~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~ 71 (725) |+|. +.+-++++.++..+..++.+|++......+-|.|..|+..... .-||+|-..|..|+-.- T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~-------~r~nl~~sni~~i~P~i 73 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAE-------TRWNLFSTNIQTQMASL 73 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCccc-------cccchhhhhHHHHhhhh Confidence 9883 2366689999999999999999999999999999888765532 13899999999998877 Q ss_pred hhCCcceEEecCCcc-hH----HHHHHHHHHHHHHH--HhcChhHHHHHHHHHHHhcCcceEEEEeeeccC--------- Q lcl|NC_013059. 72 RQNPIDVLYRPKDGA-SP----DAADVLMGMYRTDM--RHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ--------- 135 (725) Q Consensus 72 ~~nr~~~~~~pr~~~-d~----~~Ae~l~~~~~~~~--~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~--------- 135 (725) =...|.|.|+||-.+ |. -.+|+|+-+++... +.++++...-....+++.||+|.++|+|+.+.+ T Consensus 74 Yar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~ 153 (663) T protein:vir:34 74 YGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAIL 153 (663) T ss_pred hcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccC Confidence 788899999998443 32 34566666665433 667788889999999999999999999865322 Q ss_pred -CCCCCc----------e-eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcch--hhhhh Q lcl|NC_013059. 136 -SPTSNN----------Q-VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDAD--DIPSF 201 (725) Q Consensus 136 -~~~~~~----------~-~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~--~~~~~ 201 (725) ++.+.. + .-+++.-|..|.++++|| |+..+ ++.|++.+.||++.+++++|...++... ..... T Consensus 154 D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~p-Ar~W~--ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~ 230 (663) T protein:vir:34 154 DEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSP-ARVWH--EVRWLAFRNLLDMREFNARFDADGSRNLWASVPKV 230 (663) T ss_pred CCccccchhcccccchhhcccceeeeeechhhcccch-hhccc--cccceeeeccCCHHHHHHhhcCChhhhhhhhccCc Confidence 111110 0 112222367799999999 56654 9999999999999999999953221110 11111 Q ss_pred hhccccc--ccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEE Q lcl|NC_013059. 202 QNPNDWV--FPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS 279 (725) Q Consensus 202 ~~~~~~~--~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 279 (725) ..+.++. ....+.+..+|.|.|=|... ||||+ T Consensus 231 ~~~~~~~~~~~~~~~~~a~VwEIWdK~~~----------------------------------------------~V~w~ 264 (663) T protein:vir:34 231 GKPKDGKDGQSCHPWDRAEVWEIWDKGGR----------------------------------------------KVDWY 264 (663) T ss_pred CCccccCCCCCcchhcCcceeEEEecCCc----------------------------------------------EEEEE Confidence 1112111 12223357889999977422 34444 Q ss_pred EeeccccccC-CCCCCCC-ccce-EEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc Q lcl|NC_013059. 280 IITCTAVLKD-KQLIAGE-HIPI-VPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) Q Consensus 280 ~~~g~~~l~~-~~~~p~~-~~p~-vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i 356 (725) +=.++.+|.. ++|---. +||+ .|.+|+.. .++.+|--.+-..++.|+.+|..+... ..+...-+.+++++.+.. T Consensus 265 ~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~~--~ds~ipvpd~~~y~~~~~E~n~~t~Ri-n~l~d~ikv~gvy~~~~g 341 (663) T protein:vir:34 265 VEGYSAVLDTQPDPLGLESFFPCPKPLLANWT--TDKVVPRPDFVLAQDLYKEIDLVSTRI-TLLERAIRVVGVYDKSSG 341 (663) T ss_pred EcCcceecccCCCCCCCCCCCCCcccccceec--CCCeecCCcHHHHHHHHHHHHHHHHHH-HHHHhhhhhceeeccccc Confidence 3333333322 2211112 2332 03334432 333444333348999999999766554 355566777888887765 Q ss_pred chHHHHH-Hhhcccccccc-c--cccccCccccccCCcccCCCCchHHHHHHHH---HHHHHHHHHhCCChHHhccC-cc Q lcl|NC_013059. 357 AGFEHMY-DGNDDYPYYLL-N--RTDENNGEMPTQPLAYYENPEVPQANAYMLE---AATAAVKEVATLGVDAEAVN-GG 428 (725) Q Consensus 357 ~~~~~~~-~~~~~~~~~~~-~--~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~---~~~~~i~~~tGv~~~~~G~~-~n 428 (725) .+.-... ...++. .++. | .+..++| + .+.|..++-.++-+.+..+.+ ..+.++.++||+.|++-|.. .| T Consensus 342 ~~i~~~l~~a~~n~-lvpV~~~~~~~~~gg-~-~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ 418 (663) T protein:vir:34 342 LTIGRLLSEAAQND-LIPVENWLTFADKGG-L-RGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPR 418 (663) T ss_pred hhHHHHHHHhhCCC-ceecchhhhhhhhcC-c-cchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcc Confidence 4433322 222221 1111 0 1111222 1 134455555555555555544 46677889999999999974 45 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeec Q lcl|NC_013059. 429 QVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLN 508 (725) Q Consensus 429 ~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~n 508 (725) .|.| |-+-.++-|+.++..+-|-+.++.+.+.+..-+.|.+-|+-+.+-+|+|..-. .-+.|.+ ..+.+.| T Consensus 419 ETat-AQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp-~~~ei~~-------~~~~L~n 489 (663) T protein:vir:34 419 ETAM-AQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFT-FDKELAP-------KAAELIK 489 (663) T ss_pred hhhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCC-cccchhH-------HHHHHhc Confidence 6654 34444567999999999999999999999988888888887777778875321 1122211 1123445 Q ss_pred cccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchh Q lcl|NC_013059. 509 DIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPE 588 (725) Q Consensus 509 Di~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e 588 (725) |=...|.|-|..+..........-+.++++++.+.++.-..+-+ ...+ -++.|.+.++....-..........-..++ T Consensus 490 ~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl-~~q~-p~~~p~l~Ellk~~~~~f~~~~qie~ai~~ 567 (663) T protein:vir:34 490 SRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPL-AQQV-PGSAPFLLQMLKWSVSGLRGSSTIEGVLDK 567 (663) T ss_pred CCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHH-HHhh-hhhHHHHHHHHHHHhhcCChhhhHHHHHHH Confidence 53245666555543222222223334444444433322111000 0001 112222222221100000000000000000 Q ss_pred hhHHHHHHHHHH----HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 589 EQQWFVEAQQAK----QGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSE 664 (725) Q Consensus 589 ~~~~~~q~~q~q----q~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~ 664 (725) ... ..+..+.+ +..++.+..++.+++.+++.+.+++|++. |.+..+.+.+++..+.+.. . + ++.. T Consensus 568 ~~~-~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e~---q~~~~~~ql~~~~~~~k~~---~-~---a~~~ 636 (663) T protein:vir:34 568 AIA-AAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAEV---QGDLLRIQAETQANETKER---Q-Q---AEWN 636 (663) T ss_pred HHh-hhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH---H-H---HHHH Confidence 000 00011111 11111222222333334443433333221 1111222222221111110 0 0 0111 Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_013059. 665 FREFLKTVASFQQD--RSEDARANAELLLK 692 (725) Q Consensus 665 ~~e~~~~~~~~q~~--~~~~a~~~aE~~~~ 692 (725) +.++..+....++. +...++..- + . T Consensus 637 ~~~a~q~~~~~~~~r~~~~~a~~~~--~-~ 663 (663) T protein:vir:34 637 VREAAQKNLISQAARAMNPQARNGG--M-P 663 (663) T ss_pred HHHHHHhhHHHHHHHhhchhhhcCC--C-C Confidence 11111111000000 011111110 0 0 No 23 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=3.6e-31 Score=187.49 Aligned_cols=541 Identities=12% Similarity=0.057 Sum_probs=295.5 Q ss_pred CCcHHHHHHHHHHHHHHHHhh----------hHHHHHHHHHHHHhhcCCCCCHHHHHHHhh--------cCCC-----cc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTA----------SDEARREAKNDLFFSRVSQWDDWLSQYTTL--------QYRG-----QF 57 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~----------~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~--------~grp-----~~ 57 (725) |+-+...+++.+.-+....+. +.+.|... -+-| ++...++.. .+=| +. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~--------~~~w-~e~~~yi~~~~tr~t~~~~~~w~~s~t~ 71 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQK--------DRED-KELMDYIDATDTRKTSNSKLPFKNSTTI 71 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhh--------hccc-HHHHHHHhhhcccccccCCCCcccccch Confidence 776555555555533333222 22222221 1233 333333322 2223 56 Q ss_pred cchHHHHHHHHHHHh----hCCcceEEecCCcch--HHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE-- Q lcl|NC_013059. 58 DVVRPVVRKLVSEMR----QNPIDVLYRPKDGAS--PDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV-- 129 (725) Q Consensus 58 N~i~~~v~~v~g~~~----~nr~~~~~~pr~~~d--~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~-- 129 (725) |++..++..|+.+.. .|+-=+.|.|-+++| ...++++..+++.=+..+++..+++.-+.+.|.-|.++..+. T Consensus 72 ~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~e 151 (599) T protein:vir:31 72 NKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHV 151 (599) T ss_pred HHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEE Confidence 888888888876554 466678888877764 367788888888888899999999999999999998875543 Q ss_pred ---eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----hcCCcchhhhhh Q lcl|NC_013059. 130 ---TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----KFDLDADDIPSF 201 (725) Q Consensus 130 ---~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----~~~~~~~~~~~~ 201 (725) ..+++....+..+..+++. .+|.+|||||.|+++| |+.||+ +-..|+.++..+-. -|..+.....+. T Consensus 152 r~~~~~~d~~v~~~~~~P~~er--vsP~Di~~Dp~A~si~--d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~ 226 (599) T protein:vir:31 152 KRMTVTAENQVIKNYSGTVTER--LSPSDVFWDVTADSLP--KAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLRE 226 (599) T ss_pred EcceeecccccccccccceEEe--ecccceeeCCCCCCCC--cceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHh Confidence 2333332333333344444 3455699999997655 998877 66677888887653 333333333221 Q ss_pred hhc------ccccccccCCCe------EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhcc Q lcl|NC_013059. 202 QNP------NDWVFPWLTQDT------IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAER 269 (725) Q Consensus 202 ~~~------~~~~~~~~~~~~------vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 269 (725) .-. -+++.+|...+. -++.||+ .+-.-++++||..- ++. .+. T Consensus 227 ~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~-------------~~~~VevLeywGd~----yde---e~d------ 280 (599) T protein:vir:31 227 ERRTIREALADGYNGRRKFDSLHKKGYGSMMNYI-------------NEGVVEVLTFMGDF----YDE---END------ 280 (599) T ss_pred hccCCCccccchhhhhhhccccccccccchhhhc-------------ccchhhhhhhhhhh----hcc---cCC------ Confidence 111 111222221111 1122222 11122333443211 110 000 Q ss_pred ceeEEEEEEEEeeccccc--cCCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_013059. 270 QIKRRRVYKSIITCTAVL--KDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK 347 (725) Q Consensus 270 ~~~~~~v~~~~~~g~~~l--~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~ 347 (725) +...-....|+|..+| .+..|||.+.+||+-. ++ .+..++.+++|....+.|.|..+|......++++.+.-.. T Consensus 281 --~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~-~~-~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p 356 (599) T protein:vir:31 281 --ELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIA-VY-EFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHP 356 (599) T ss_pred --ccccceEEEEecCcEEeecccCCCCCCCCCeEEE-Ee-eeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcc Confidence 0011113456675444 4578999999999833 33 3557778999999999999999999999998877665432 Q ss_pred ceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc Q lcl|NC_013059. 348 KPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG 427 (725) Q Consensus 348 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~ 427 (725) .+..-+.+.+.+..|. |.... .+ ... ...+++.++.-......+++.....+.+.||+...+.|..+ T Consensus 357 -~l~~~~dl~~eD~~~~-----P~~v~-~~-~d~-----~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ 423 (599) T protein:vir:31 357 -SLKKVGDVREKGMRGG-----PNHVF-EV-EET-----GDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRT 423 (599) T ss_pred -cccccccccccCccCC-----CCcce-ee-cCC-----CccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcc Confidence 1111111111100110 11110 00 011 12244444433334555788888999999999999999876 Q ss_pred ch-hHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCcEEEEeccC-CCcceEEeccccccccCCce Q lcl|NC_013059. 428 GQ-VAYDTVNQLNMRADLETYVFQDNLATA-MRRDGEIYQSIVNDIYDVPRNVVITLED-GSEKEVQLMAEVVDLATGER 504 (725) Q Consensus 428 n~-~Sg~ai~~~q~q~~~~~~~~~dn~~~~-~~~~g~~ll~li~~~y~~~r~irI~~~d-~~~~~v~in~~~~d~~~g~~ 504 (725) .+ -.+..+..+.++++.....+...+... .+.+.+.++++..+|+|++-+|||++++ |...|+.|.+ T Consensus 424 ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~r---------- 493 (599) T protein:vir:31 424 AGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITA---------- 493 (599) T ss_pred cchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeeh---------- Confidence 43 244568888899999999999999866 5559999999999999999999999987 7888998854 Q ss_pred eeeccccccceEEEEeccCchhHHHHHHHHHHHHHHh-cc-cccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhh Q lcl|NC_013059. 505 QVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGK-TP-QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK 582 (725) Q Consensus 505 ~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~-~~-~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~ 582 (725) +||.+.+++ +..|...--.|++..+-+.++++. ++ +..|. +.......+++.+.-+ ...++. T Consensus 494 ---edl~~~~~~-v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~-----------~~~k~l~~~l~~~~~l-~~~~~~ 557 (599) T protein:vir:31 494 ---DDLNLNGQM-VAQGATLFAEKANTLQNLNAILGGPLGAALAPH-----------MSRTKLFNAVEYLGDL-DAYGIF 557 (599) T ss_pred ---hhhhCCeee-eechhhHHHHHHHHHHHHHHHhcccCCCccchh-----------hHHHHHHHHHHHHHhc-cccccC Confidence 688899999 566665444577777777777742 11 12232 1111111111111100 000111 Q ss_pred hccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 583 KPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDL 659 (725) Q Consensus 583 ~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~ 659 (725) ......+++ |.+. +.+|+++ |++-. +|+-.+.... .. ++... T Consensus 558 ------~~~va~~eq-----q~~~--~m~Q~~l---q~~~~--------~~~~~~~~~~-------~~----~~~~~ 599 (599) T protein:vir:31 558 ------TFGIGVQED-----QQLA--RMAQKST---QQTEE--------TALTQEEVGG-------PT----TDTGQ 599 (599) T ss_pred ------CCchhHHHH-----HHHH--HHHHHHH---HHhHh--------hhhhhhhcCC-------CC----cccCC Confidence 111111011 1111 1111111 00000 0000000000 00 00000 No 24 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.84 E-value=6.4e-20 Score=125.82 Aligned_cols=457 Identities=12% Similarity=0.031 Sum_probs=224.8 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC----CcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYR----GQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr----p~~N~i~~~v~~v~g~~~~nr~ 76 (725) |.+..+.+.++...+.. .-+....+-.+||.|+++.-.........++ .++|..+.+|+..+|+...+++ T Consensus 37 ~~~~~~~i~~~i~~~~~------~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~ 110 (501) T protein:vir:96 37 MVNNWELLKNFINHHKL------RQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPI 110 (501) T ss_pred cCChHHHHHHHHHHHHH------HHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCe Confidence 45554445554443331 1122445667999999875422222233333 3679999999999999999887 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+.+. +.+ ..+.+...+..+++.|+++.....+..++++.|.||.-+..+ ++ | .+.+... ++.. T Consensus 111 ~~~~~--~~~---~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---ed--g-~~~i~~~----~p~~~ 175 (501) T protein:vir:96 111 RVEYD--DND---DNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRS---EY--D-ETRIKRL----SPLET 175 (501) T ss_pred eEeeC--Ccc---chhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEc---CC--C-ceEEEEE----cccee Confidence 77553 222 234556667778889999999999999999999999877543 22 2 2333321 2222 Q ss_pred -eeeCCCcc-ccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 -VIWDSNSK-LMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 -v~~Dp~a~-~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++||+... ++. ++++.|-. ......+..+++|.... ++ T Consensus 176 ~~v~d~~~~~~~~------~~v~~~~~------------------------------~~~~~~~~~~~vyt~~~----i~ 215 (501) T protein:vir:96 176 FVIYDNSLEDNSI------AAVRYYNR------------------------------GTLQSAKDVVEIYTDEH----IY 215 (501) T ss_pred EEEEcCCCCCceE------EEEEEEEe------------------------------ecCCCcEEEEEEEcCCc----EE Confidence 34554321 110 11111100 00011233444443311 11 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) .+ ...+...+.+..|.+.+.+|+|+|... T Consensus 216 ~~--------------------------------------------~~~~~~~~~~~~~~~~g~vPvv~~~nn------- 244 (501) T protein:vir:96 216 TL--------------------------------------------DASDDFNEISVTTHAFGTVPITEYLNN------- 244 (501) T ss_pred EE--------------------------------------------eeCCCceeccccccCCCccceEEecCC------- Confidence 11 111111222334445566777766321 Q ss_pred cccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccCCccc Q lcl|NC_013059. 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYY 392 (725) Q Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 392 (725) ..+.|.+..+++.++.+|..+|.+...+...+...+.+ .|.. ...............+.........|......++++ T Consensus 245 ~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 323 (501) T protein:vir:96 245 IDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYL 323 (501) T ss_pred ccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecccccCcccchhhhhhcCeeeecccccccccccCcceeeE Confidence 23568888999999999999999998887766555433 2221 111111111111111111111111122222234444 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_013059. 393 ENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIY 472 (725) Q Consensus 393 ~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y 472 (725) ..+.-..++..+++.....|-.+|++.+.+.|..++..||+|+..............-.-|..+++++.++++.++.... T Consensus 324 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~ 403 (501) T protein:vir:96 324 TKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVN 403 (501) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 43333455666788899999999999888877766667999998876666665666666777777776666665543221 Q ss_pred CCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHH Q lcl|NC_013059. 473 DVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLL 552 (725) Q Consensus 473 ~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~ 552 (725) .. . .. |+ .+|.|.-.|..+.-..+..+.++.+.+.++. .. T Consensus 404 ~~--------~--~~---------------------d~---~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~------et 443 (501) T protein:vir:96 404 EF--------K--DF---------------------DE---SLLKITFTPNLPKSLNEQVSILTGLGGQVSQ------ET 443 (501) T ss_pred cc--------c--cc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccCch------HH Confidence 00 0 00 01 2344555666665555556666665443322 12 Q ss_pred HHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) Q Consensus 553 ~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~ 632 (725) ++..++..+-| +.-++++.+.......... .... .........+... .+++.... T Consensus 444 ~~~~l~~v~D~--~~E~~ri~~E~~~~~~~~~-~~~~---------------~~~~~~~~~~~~e-------~~~d~~e~ 498 (501) T protein:vir:96 444 ALSLSGLVESP--NEELDKINKEMSEIDFKGY-SNDF---------------NEHVGKYTDEVKE-------THTDDFER 498 (501) T ss_pred HHHhCCCCCCH--HHHHHHHHHHHHHhhcccc-ccch---------------hhcccccCCcCCC-------CCCCcccc Confidence 22223322211 2223333221110000000 0000 0000000000000 00000000 Q ss_pred HHH Q lcl|NC_013059. 633 QID 635 (725) Q Consensus 633 q~e 635 (725) -.+ T Consensus 499 ~~~ 501 (501) T protein:vir:96 499 EYE 501 (501) T ss_pred ccC Confidence 000 No 25 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.80 E-value=8.6e-19 Score=119.62 Aligned_cols=458 Identities=12% Similarity=0.045 Sum_probs=219.9 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC----CcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYR----GQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr----p~~N~i~~~v~~v~g~~~~nr~ 76 (725) +.+..+.+.++...+. ..-+.+..+-.+||.|++..-......+..++ .++|..+.+|+..+|+...+++ T Consensus 37 ~~~~~~~l~~~i~~~~------~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~ 110 (501) T protein:vir:27 37 MVNNWELLKNFINHHK------LRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPI 110 (501) T ss_pred ccccHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCe Confidence 4444444444433221 12223445567999999764322222333333 4679999999999999999987 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+.+... ...+.+...+..+++.|+++.....+.+++++.|.+|.-|..+ ++ | .+.++.. ++.+ T Consensus 111 ~~~~~d~-----~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d---ed--~-~~~i~~~----~p~~~ 175 (501) T protein:vir:27 111 RVEYDDN-----DNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRN---EY--D-ETRIKRL----NPLET 175 (501) T ss_pred eEecCCc-----cchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeC---CC--C-ceEEEEE----cccee Confidence 7765332 2223445566777888999999999999999999999877543 22 1 2333321 2222 Q ss_pred -eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 -VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 -v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) ++||+.... .. ..+++.|.. ....+.+..+++|.... ++. T Consensus 176 ~~v~d~~~~~----~~-~~~ir~~~~------------------------------~~~~~~~~~~~vyt~~~----v~~ 216 (501) T protein:vir:27 176 FVIYDNSLED----NS-IAAVRYYNR------------------------------GTLQNAKDVVEIYTNEH----IYT 216 (501) T ss_pred EEEecCCCCC----ce-EEEEEEEEe------------------------------eecCCcEEEEEEEeCCe----EEE Confidence 334543211 00 112221110 00112334455554421 111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +. ..|...+.+..|.+.+.+|+|+|.. .. T Consensus 217 ~~--------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n-------n~ 245 (501) T protein:vir:27 217 LD--------------------------------------------ASDDFNEISVTTHAFGTVPITEFLN-------NV 245 (501) T ss_pred EE--------------------------------------------eCCceeeccccccCCCcccEEEecC-------CC Confidence 11 1111122233344445677766522 12 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccCCcccC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYE 393 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 393 (725) .+.|.+..+++.++.+|..+|.+...+...+...+.+. |.. ....+............+.......|......++++. T Consensus 246 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 324 (501) T protein:vir:27 246 DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIY-GDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLT 324 (501) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCccCCcccchhhhhhcCceeecccccccCCCCCcceeeee Confidence 35688899999999999999999988877665554432 211 1111111111111111111111111111222344444 Q ss_pred CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013059. 394 NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYD 473 (725) Q Consensus 394 ~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~ 473 (725) .+.-..++..+++...+.|-.+|++.+.+.|.-++..||+|+..............-..|..+++++.++++.++.... T Consensus 325 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~- 403 (501) T protein:vir:27 325 KSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN- 403 (501) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc- Confidence 4333345666788889999999998877776655567999998877666666666667777777776666665432111 Q ss_pred CCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHH Q lcl|NC_013059. 474 VPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLL 553 (725) Q Consensus 474 ~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~ 553 (725) ..... |+ .+|.|.-.|..+.-..+..+.++.+.+.++. . .+ T Consensus 404 ---------~~~~~---------------------d~---~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~---e---t~ 444 (501) T protein:vir:27 404 ---------EFKDF---------------------DE---SLLKITFTPNLPKSLNEQVSILTGLGGQVSQ---E---TA 444 (501) T ss_pred ---------ccccc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccCcH---H---HH Confidence 10000 01 2455555666665455555555555433221 1 22 Q ss_pred HHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 554 LQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQ 633 (725) Q Consensus 554 ~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q 633 (725) +..++..+- .+.-++++.+........ ..+...-. ......-+...-..+..+-+ T Consensus 445 l~~l~~v~D--~~~E~eri~~E~~e~~~~--------------------~~~~~~~~---~~~~~~d~~~~~~~d~~e~~ 499 (501) T protein:vir:27 445 LSLSGLVES--PNEELDKINKEVSEIDFK--------------------GYSNDFNE---HVGKYTDEVKETHTDDFERA 499 (501) T ss_pred HHhCCCCCC--HHHHHHHHHHHHHhhhHh--------------------hhcCcccc---ccccccCCCCCCcccccccc Confidence 222222221 112223332211000000 00000000 00000000000000000000 Q ss_pred HH Q lcl|NC_013059. 634 ID 635 (725) Q Consensus 634 ~e 635 (725) .+ T Consensus 500 ~~ 501 (501) T protein:vir:27 500 YE 501 (501) T ss_pred CC Confidence 00 No 26 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.78 E-value=3.8e-18 Score=116.07 Aligned_cols=464 Identities=11% Similarity=0.011 Sum_probs=214.3 Q ss_pred CCcHHHHHHHHHHHHHHHHhhh-HHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTAS-DEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~-~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~~nr 75 (725) |-....+.....+..+..++.. ..-+....+-.+||.|+++.-......+..+ |.++|..+-+|+..+|+...++ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNP 110 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccC Confidence 1111111111111122222221 1222344556899999886432222222333 3467999999999999999998 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) +.+.+.- +++. +.+...+..++..|+++.....+..+++++|.||+-+..+ ++ | .+.++.. ++.. T Consensus 111 ~~~~~~d-~~~~----~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~d---ed--g-~~~i~~~----~p~~ 175 (502) T protein:vir:48 111 IRVEYDD-NEDN----SQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRS---EY--D-ETRIKRL----SPLE 175 (502) T ss_pred eeEecCC-ccch----hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeC---CC--C-ceEEEEE----cccc Confidence 8776532 2222 3345556667888999999999999999999999877543 22 1 2333321 2222 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++||+.... + ..++++.|-.. ...+.+.++++|.... ++ T Consensus 176 ~~~vydd~~~~----~-~~~~ir~~~~~------------------------------~~~~~~~~~~iyt~~~----i~ 216 (502) T protein:vir:48 176 TFVIYDNSLED----N-SIAAVRYYNRG------------------------------TLQNAKDVVEIYTNQH----IY 216 (502) T ss_pred eEEEEcCCCCC----c-eEEEEEEEEEe------------------------------ecCCcEEEEEEEeCCe----EE Confidence 234432210 0 11122211100 0012233445554321 11 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) .+. ..|...+.+..|.+.+.+|+|+|+.. T Consensus 217 ~~~--------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~nn------- 245 (502) T protein:vir:48 217 TLD--------------------------------------------ASDSFNEISVTPHAFGTVPITEFLNN------- 245 (502) T ss_pred EEE--------------------------------------------eCCceeeccceecCCCccceEEecCC------- Confidence 111 11111222334445556777765321 Q ss_pred cccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcch-HHHHHHhhccccccccccccccCccccccCCccc Q lcl|NC_013059. 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAG-FEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYY 392 (725) Q Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 392 (725) ..+.|.+..+++.++.+|..+|.+...+...+...+++ .|.... ...............+.......|......++++ T Consensus 246 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l 324 (502) T protein:vir:48 246 ADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYL 324 (502) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCcccccccchhhhhhcceeeccccccccccccCcceeEe Confidence 23568889999999999999999998887766554433 222111 0000000000011111111111122223334444 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_013059. 393 ENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIY 472 (725) Q Consensus 393 ~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y 472 (725) ..+.-..+....+......|-..|++.+.+.+.-++..||+|+..............-.-|..+++++.++++.++... T Consensus 325 ~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~- 403 (502) T protein:vir:48 325 TKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLV- 403 (502) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc- Confidence 4433335556678888999999999888777765556799999987766666555566666666666666655544311 Q ss_pred CCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHH Q lcl|NC_013059. 473 DVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLL 552 (725) Q Consensus 473 ~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~ 552 (725) +..... |+ .+|.|.-.|..+.-..+..+.+..+.+.+ |. .. T Consensus 404 ---------~~~~~~---------------------d~---~~i~i~f~~~~p~d~~e~a~~~~kl~g~i----S~--et 444 (502) T protein:vir:48 404 ---------NEFKDF---------------------DE---SRLKITFTPNLPKSLYEQVSILNDLGGQV----SQ--ET 444 (502) T ss_pred ---------cccccc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccC----cH--HH Confidence 110000 01 13444445565554445555555554332 21 22 Q ss_pred HHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) Q Consensus 553 ~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~ 632 (725) ++..++..+-+ +.-++++.+.......... .. . .........-+......+.... T Consensus 445 ~l~~l~~v~D~--~~E~~ri~~E~~~~~~~~~-~~---------------~-------~~~~~~~~~d~~~e~~~~~~~~ 499 (502) T protein:vir:48 445 ALSLSGLVENP--TEELDKINEESSKIDFKGY-PS---------------Y-------FYDNVGKYTDEVKETHTDDFER 499 (502) T ss_pred HHHhCCCCCCH--HHHHHHHHHHHHhhhhhcc-cc---------------c-------ccccccccCCCccCCCCcCcCC Confidence 22223322211 1222222221100000000 00 0 0000000000000000000000 Q ss_pred HHH Q lcl|NC_013059. 633 QID 635 (725) Q Consensus 633 q~e 635 (725) -.+ T Consensus 500 ~~~ 502 (502) T protein:vir:48 500 VYE 502 (502) T ss_pred CCC Confidence 000 No 27 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.77 E-value=2.3e-16 Score=106.32 Aligned_cols=511 Identities=11% Similarity=0.024 Sum_probs=244.2 Q ss_pred CCcHHH--HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC--CHHHHHHHhhcCCCcccchHHHHHHHHHHHh---- Q lcl|NC_013059. 1 MADNKN--RLESILSRFDADWTASDEARREAKNDLFFSRVSQW--DDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR---- 72 (725) Q Consensus 1 mad~~~--~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW--~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~---- 72 (725) |||.+. .-+.+..+|+.-.+....|-..+.+..+|..-.=. +..... +...++.-+.....++.+.+... T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~--~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS--TDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc--ccccccccccHHHHHHHHHHHHHHhhc Confidence 999642 34556666666555555666666777777653211 111111 11123322444444444433222 Q ss_pred hCCcceEEecCCcc-------hHHHHHH------HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC Q lcl|NC_013059. 73 QNPIDVLYRPKDGA-------SPDAADV------LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS 139 (725) Q Consensus 73 ~nr~~~~~~pr~~~-------d~~~Ae~------l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~ 139 (725) -+++=+++.+.+++ +...+++ .+..+......|++..+...+|.+.++.|.|+.-+ ++++.+ T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~-----~e~~~~ 153 (536) T protein:vir:21 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL-----PEPEGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEE-----eeCCCC Confidence 23333343333321 1222222 33345555567889999999999999999888532 333333 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .....+..|+ .++++..+..- ...-+||...||...+.+.|++...... .. . ..++.|.| T Consensus 154 ~~~~f~~~pl----~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~fg~~~~~~~------~~-~-----~~~~~v~v 213 (536) T protein:vir:21 154 NYNPMKLYRL----SSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVEGQG------GE-K-----KADETIDV 213 (536) T ss_pred ceeeEEEEEc----CeEEEeeCCCC----CeeEEeeeeeccHHHHHHhhhhhhcccc------cc-c-----ccccceeE Confidence 2223344443 34666555322 2233788899999999998885211100 00 0 01234444 Q ss_pred EEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccc Q lcl|NC_013059. 220 AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIP 299 (725) Q Consensus 220 ~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p 299 (725) ..+.+.++. ++. +.+| .-..|.+++.+.+.||++.+| T Consensus 214 ~~~v~~~~~-----------~~~-------------------------------~~~~-~e~~g~~v~~~~g~~~f~~~P 250 (536) T protein:vir:21 214 YTHIYLDED-----------SGE-------------------------------YLRY-EEVEGMEVQGSDGTYPKEACP 250 (536) T ss_pred EEEEEEecC-----------CCc-------------------------------EEEE-eccCCeeeccccCccccccCC Confidence 443332211 111 1111 123455566666778899999 Q ss_pred eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccc Q lcl|NC_013059. 300 IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDE 379 (725) Q Consensus 300 ~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 379 (725) |+|+-.. .++|..|+.|.+.+..+-.+.+|+.....+.....+.+.++.+.++.+-...... ...+ +..+.. T Consensus 251 ~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~---g~~v~g 322 (536) T protein:vir:21 251 YIPIRMV--RLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---KAQT---GDFVTG 322 (536) T ss_pred eeeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---cCCC---cceecC Confidence 9987554 4689899999999999999999999888888778888888888776553222111 1111 111222 Q ss_pred cCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhc-cCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_013059. 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNLATA-M 457 (725) Q Consensus 380 ~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G-~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~-~ 457 (725) ..+.+. + .......--......++...+.|....=+ + +++ .++..+++.=|..+.+.....|...+.+|..- . T Consensus 323 ~~~~v~--~-~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~-~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell 397 (536) T protein:vir:21 323 RPEDIS--F-LQLEKQADFTVAKAVSDAIEARLSFAFML-N-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) T ss_pred Ccccce--e-eeccccccchHHHHHHHHHHHHHHHHHhh-h-hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 222221 1 11222222344567778888888776632 2 344 34445666678888888888888877777642 2 Q ss_pred HHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHH Q lcl|NC_013059. 458 RRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILE 537 (725) Q Consensus 458 ~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~e 537 (725) .=+.+.++.++ .+.|. +-.+- .++ +.+.+..+ -.+..|.+.++.+.. T Consensus 398 ~Pli~r~~~il-------------~r~g~--lP~~p--------------~~~---v~~~~vs~-l~~l~r~~~~~~l~~ 444 (536) T protein:vir:21 398 LPLVRVLLKQL-------------QATQQ--IPELP--------------KEA---VEPTISTG-LEAIGRGQDLDKLER 444 (536) T ss_pred HHHHHHHHHHH-------------HhCCC--CCCCC--------------hhh---ccceEEec-HHHHHHHHHHHHHHH Confidence 22333333332 22221 00000 011 23333333 334557778888888 Q ss_pred HHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 538 ll~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) +++.+....|+.. . +..|++ .++..+..... +..+... +++.+.+.++++++++. ++++. T Consensus 445 ~~~~la~~~Pe~l---d---~~id~d---~~~~~~a~~~Gv~p~~~irt--~eev~~~r~q~~~~~~~------~~~a~- 506 (536) T protein:vir:21 445 CVTAWAALAPMRD---D---PDINLA---MIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSMQMGM------DNGAA- 506 (536) T ss_pred HHHHHHhhchhhh---c---ccCCHH---HHHHHHHHHcCCChhhhcCC--HHHHHHHHHHHHHHHHH------HHHHH- Confidence 8777655555321 1 123333 33333322221 1222211 11111111100000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~ 669 (725) .+.+..+.++. .+.+. .+++.+ ++-++.+. T Consensus 507 -----~~~~~~~~~~~---~~~~~-~~~~~~---------------~~g~~~~~ 536 (536) T protein:vir:21 507 -----ALAQGMAAQAT---ASPEA-MAAAAD---------------SVGLQPGI 536 (536) T ss_pred -----HHHHHHHHHHh---cChhh-HHhhhh---------------ccccCCCC Confidence 00000000000 00000 000000 00000111 No 28 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.77 E-value=3.3e-16 Score=105.44 Aligned_cols=511 Identities=11% Similarity=0.016 Sum_probs=244.9 Q ss_pred CCcHHH--HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC--CHHHHHHHhhcCCCcccchHHHHHHHHHHHh---- Q lcl|NC_013059. 1 MADNKN--RLESILSRFDADWTASDEARREAKNDLFFSRVSQW--DDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR---- 72 (725) Q Consensus 1 mad~~~--~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW--~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~---- 72 (725) |||.+. .-+.+..+|+.-.+....|-..+.+..+|..-.=. +..... +...++.-+.....++.+.+... T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~--~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS--TDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc--ccccccccccHHHHHHHHHHHHHhhhc Confidence 999642 34456666666555555666666667777653211 111111 11123322444444444433322 Q ss_pred hCCcceEEecCCcc-------hHHHHHH------HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC Q lcl|NC_013059. 73 QNPIDVLYRPKDGA-------SPDAADV------LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS 139 (725) Q Consensus 73 ~nr~~~~~~pr~~~-------d~~~Ae~------l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~ 139 (725) -+++=+++.+.+++ +...+++ .+..+......|++..+...+|.+.++.|.|+.-+ ++++.+ T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~-----~e~~~~ 153 (536) T protein:vir:10 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL-----PEPEGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEE-----eeCCCC Confidence 23333343333321 1222222 33345555567889999999999999999888532 333333 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .....+..|+ .++++..+..- ...-+||+..|+...+.+.|+....... .. . ..++.|.| T Consensus 154 ~~~~~~~~pl----~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~fg~~~~~~~------~~-~-----~~~~~v~v 213 (536) T protein:vir:10 154 NYNPMKLYRL----SSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVEGQG------GE-K-----KADETIDV 213 (536) T ss_pred ceeeEEEEEc----CeEEEeeCCCC----CeeEEeeeeeccHHHHHHhhhhhhcccc------cc-c-----CcccceEE Confidence 2223344444 34666555322 1233788899999999888885211100 00 0 01234555 Q ss_pred EEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccc Q lcl|NC_013059. 220 AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIP 299 (725) Q Consensus 220 ~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p 299 (725) ..+.+.+.. ++. +-+ +.-..|.+++.+.+.||++.+| T Consensus 214 ~~~V~~~~~-----------~~~-------------------------------~~~-~~e~~g~~v~~~~g~~~f~~~P 250 (536) T protein:vir:10 214 YTHIYLDEA-----------SGE-------------------------------YLR-YEEVEGMEVQGSDGTYPKEACP 250 (536) T ss_pred EEEEEEecC-----------CCc-------------------------------EEE-EEeecCccccccccccccccCC Confidence 544333211 111 111 2234566676667788899999 Q ss_pred eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccc Q lcl|NC_013059. 300 IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDE 379 (725) Q Consensus 300 ~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 379 (725) |+|+... .++|..|+.|.+.+..+-.+.+|+.....+.....+.+.++.+.++.+-...... ...+ +..+.. T Consensus 251 ~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~~---g~~v~g 322 (536) T protein:vir:10 251 YIPIRMV--RLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---KAQT---GDFVTG 322 (536) T ss_pred ceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc---cCCC---cceecC Confidence 9987554 4689899999999999999999999888888778888888888776553222111 1111 111222 Q ss_pred cCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhc-cCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_013059. 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNLATA-M 457 (725) Q Consensus 380 ~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G-~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~-~ 457 (725) ..+.+. + .......--......++...+.|....=+ + +++ .++..+++.=|..+.+.....|...+.+|..- . T Consensus 323 ~~~~v~--~-~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~-~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell 397 (536) T protein:vir:10 323 RPEDIS--F-LQLEKQADFTVAKAVSDAIEARLSFAFML-N-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) T ss_pred Ccccce--e-eeccccccchHHHHHHHHHHHHHHHHHhh-h-hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 222221 1 11222222344567778888888776632 2 344 34445666678888888888888877777642 2 Q ss_pred HHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHH Q lcl|NC_013059. 458 RRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILE 537 (725) Q Consensus 458 ~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~e 537 (725) .=+.+.++.++ .+.|. +-.+- .++ +.+.+..+ -.+..|.+.++.+.. T Consensus 398 ~Pli~r~~~il-------------~r~g~--lP~~p--------------~~~---v~~~~vs~-l~~l~r~~~~~~l~~ 444 (536) T protein:vir:10 398 LPLVRVLLKQL-------------QATQQ--IPELP--------------KEA---VEPTISTG-LEAIGRGQDLDKLER 444 (536) T ss_pred HHHHHHHHHHH-------------HhCCC--CCCCC--------------hhh---ccceEEec-HHHHHHHHHHHHHHH Confidence 22333333332 22221 00000 011 23333333 334557788888888 Q ss_pred HHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 538 ll~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) +++.+.+..|+.. . +..|+ +.++..+..... +..+... +++.+.+.++++++++ . ++++. T Consensus 445 ~~~~la~~~P~~l---d---~~id~---d~~~~~~a~~~Gv~p~~~irt--~eev~~~r~q~~~~~~---~---~~~a~- 506 (536) T protein:vir:10 445 CVTAWAALAPMRD---D---PDINL---AMIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSMQMG---M---DNGAA- 506 (536) T ss_pred HHHHHHhhchhhh---c---ccCCH---HHHHHHHHHHcCCCchhhcCC--HHHHHHHHHHHHHHHH---H---HHHHH- Confidence 8777665555321 1 12233 333333322221 1222211 1111111110000000 0 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~ 669 (725) .+.+..+.++.. ..+. .+++.+ ++-++.+. T Consensus 507 -----~~~~~~~~~~~~---~~~~-~~~~~~---------------~~g~~~~~ 536 (536) T protein:vir:10 507 -----ALAQGMAAQATA---SPEA-MAAAAD---------------SVGLQPGI 536 (536) T ss_pred -----HHHHHHHHHHhc---Cchh-HHhhhh---------------ccccCCCC Confidence 000000000000 0000 000000 00000011 No 29 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.76 E-value=4.2e-16 Score=104.87 Aligned_cols=506 Identities=10% Similarity=-0.004 Sum_probs=244.7 Q ss_pred CCcHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC----CCCCHHHHHHHhhcCCCcccchHHHHHHHHH---- Q lcl|NC_013059. 1 MADNKN---RLESILSRFDADWTASDEARREAKNDLFFSRV----SQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS---- 69 (725) Q Consensus 1 mad~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G----~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g---- 69 (725) ||+.+. --+.+..+|..-.+....|-..+.+..+|..- +.+...- ....++.-..-...++.+.+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~----~~~~~~~dst~~~a~~~Laa~l~~ 76 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNES----TDYTTPWQAVGARGLNNLASKLML 76 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc----ccccccccccHHHHHHHHHHHHHH Confidence 997642 23345556666555555566666777777543 3332211 11122311333333443333 Q ss_pred HHhhCCcceEEecCCc-------ch---HHHHHHHH---HHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDG-------AS---PDAADVLM---GMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~-------~d---~~~Ae~l~---~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~ 136 (725) .-.-+++=+++.+.+. ++ .++.+.|. ..+......|++..+...+|.+.+..|.|++-+ .++ T Consensus 77 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~ 151 (535) T protein:vir:15 77 ALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYL-----PEP 151 (535) T ss_pred hhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEe-----ecC Confidence 2223344444444331 11 12333333 333334467889999999999999999997643 222 Q ss_pred CCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCe Q lcl|NC_013059. 137 PTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDT 216 (725) Q Consensus 137 ~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (725) + +..+..+..|+ .++++..++.- ...-+++...||...+.+.|+.... .... .-...+. T Consensus 152 ~-~~~~~f~~~pl----~~~~v~~d~~G----~vd~i~r~~~~t~~~l~~~~~~~~~-----------~~~~-~~~~~~~ 210 (535) T protein:vir:15 152 E-GSYNPMKLYRL----SSYVVQRDAYG----NVLQIVTRDQIAFGALPEDVRSAVE-----------KAGG-EKKMDEM 210 (535) T ss_pred C-CCceeeEEEEc----CeeEEeeCCCC----CeeEEEEeEeecHHHHHHHHhHhhh-----------cccc-ccCCCCc Confidence 2 22333444443 45777766532 1223788889999888776664210 0000 0112344 Q ss_pred EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEE-eeccccccCCCCCCC Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVLKDKQLIAG 295 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~-~~g~~~l~~~~~~p~ 295 (725) |.|..+++.+.. +| ++.|+. ..|..+....+.|++ T Consensus 211 v~v~~~v~~~~~-----------~~---------------------------------~~~~~~e~~g~~~~~~~~~~~~ 246 (535) T protein:vir:15 211 VDVYTHVYLDEE-----------SG---------------------------------DYLKYEEVEDVEIDGSDATYPT 246 (535) T ss_pred eeEEEEEEEecC-----------CC---------------------------------cEEEEEEeeCcccccccccccc Confidence 555555433211 11 111222 224333333467888 Q ss_pred CccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccc Q lcl|NC_013059. 296 EHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) Q Consensus 296 ~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 375 (725) +.+||+|+-.. .++|..|+.|.+.+..+-.+.+|+.....+.....+.+.+++++++.+-...+.. +...+. T Consensus 247 ~~~P~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~------~~~~g~ 318 (535) T protein:vir:15 247 DAMPYIPVRMV--RIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT------KAQTGD 318 (535) T ss_pred ccCCceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcc------cCCcee Confidence 99999977544 4689999999999999999999999999999999999999998776543322211 111111 Q ss_pred cccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhc-cCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 376 RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 376 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G-~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) .+....+.+. +++ .....-.......++...+.|.... ..+ +++ .++..+++.=|..+.+.....+..++.+|. T Consensus 319 ~v~g~~~~v~--~~~-~~~~~~~~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~ 393 (535) T protein:vir:15 319 FVPGRREDID--FLQ-LEKQADFTVAKAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILS 393 (535) T ss_pred eecCCcccce--eee-cccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHH Confidence 2222223221 221 1122223446677777777887665 233 444 444556666788888888888888887776 Q ss_pred H-HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHH Q lcl|NC_013059. 455 T-AMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRA 533 (725) Q Consensus 455 ~-~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~ 533 (725) . ...=+.+.++.++.+ .|.. + ... .+.+.+.+..+ -....|.+.++ T Consensus 394 ~Ell~Pli~r~~~il~r-------------~g~l------P-~~p------------~~~v~~~yis~-La~aqr~~~~~ 440 (535) T protein:vir:15 394 QELQLPLVRVLLKQLQA-------------TSQI------P-ELP------------KEAVEPTISTG-LEAIGRGQDLD 440 (535) T ss_pred HHHHHHHHHHHHHHHHh-------------cCCC------C-CCC------------ccceeEEEecH-HHHHHHHHHHH Confidence 3 333333333333322 2210 0 000 01245555433 34456788888 Q ss_pred HHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh--hhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 534 ~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~--~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .+.++++.+....|.... ...|++ +++..+......+ .+... +++.++.++++++++++++. + . T Consensus 441 ~l~~~~~~la~~~P~~ld------~~id~d---~~~~~~a~~~Gvp~~~i~~~--~eev~~~~~q~~~~~~~~~~--a-~ 506 (535) T protein:vir:15 441 KLERCISAWAALAPMQGD------PDINLA---VIKLRIANAIGIDTSGILLT--DEQKQALMMQDAAQTGIENA--A-A 506 (535) T ss_pred HHHHHHHHHHhcChhhhh------ccCCHH---HHHHHHHHHcCCChhhhcCC--HHHHHHHHHHHHHHHHHHHH--H-H Confidence 888888877665553211 123333 3333332222222 12211 11111111111100000000 0 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMD 658 (725) Q Consensus 612 qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~ 658 (725) + ..+..+..+ .+. -. .+++..++ +-.++. T Consensus 507 ~--~g~~~~~~~------~~~----p~-~~~~~~~~-----~g~~~~ 535 (535) T protein:vir:15 507 T--GGAGVGALA------TSS----PE-AMQGAAAQ-----AGLDAT 535 (535) T ss_pred H--HHhhccchh------ccC----hH-HHHHHHhc-----cCCCCC Confidence 0 000000000 000 00 00000000 000000 No 30 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.75 E-value=1.5e-15 Score=101.81 Aligned_cols=524 Identities=12% Similarity=-0.006 Sum_probs=250.2 Q ss_pred CC-cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC---C--CCCHHHHHHHhhc-CCCcccchHHHHHHHHHH--- Q lcl|NC_013059. 1 MA-DNKNRLESILSRFDADWTASDEARREAKNDLFFSRV---S--QWDDWLSQYTTLQ-YRGQFDVVRPVVRKLVSE--- 70 (725) Q Consensus 1 ma-d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G---~--QW~~~~~~~l~~~-grp~~N~i~~~v~~v~g~--- 70 (725) |+ |...+.+++..+|.........|...+.+..+|..- . -++..+...-..+ .++--..-...++.+.+. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 99 567788889999999988888888888888888652 1 2332221111111 122112233333333332 Q ss_pred -Hh-hCCcceEEecCCcchHH---HHHHHHHHHHHHH-----HhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 71 -MR-QNPIDVLYRPKDGASPD---AADVLMGMYRTDM-----RHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 71 -~~-~nr~~~~~~pr~~~d~~---~Ae~l~~~~~~~~-----~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) -. .+++=+++.+.+.+..+ ..+-|...-+.++ ..+++..+...+|.+.+..|.|++-+. +++.+ T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~-----~~~~~- 154 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIE-----HDVGK- 154 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEe-----ecCCC- Confidence 22 25566666666554332 2333443333332 357888899999999999999986542 23222 Q ss_pred ceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIA 220 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~ 220 (725) .+..+.. |+.++++..++.-. -|+ +|+...||...+.+.|+........ ..... . ...+.+.|+ T Consensus 155 ~~~f~~~----pl~~~~v~~d~~G~--vd~--i~r~~~~t~~ql~~~fg~~~l~~~v-~~~~~--~-----~~~~~~~v~ 218 (549) T protein:vir:10 155 GIVYRNV----PMQRLWFAENNSGL--IDK--THVQWELTLRQAAQRFGRENLSPSM-QSTLE--K-----DPEKSAIFY 218 (549) T ss_pred eeEEEEE----EcCeEEEeeCCCCC--eEE--EEEEeecCHHHHHHhcCcccCCHHH-HHHhh--c-----CCCceEEEE Confidence 2333333 34457776665321 122 7888899999999998863211110 00000 0 012344444 Q ss_pred EEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccce Q lcl|NC_013059. 221 EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPI 300 (725) Q Consensus 221 E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~ 300 (725) .+-+.+... |+.. .....+.+..+|.-..|.++|.+ +-| ..+|| T Consensus 219 ~~V~pr~~~-------~~~~--------------------------~~~~~~pf~sv~~e~~~~~il~e-sg~--~e~P~ 262 (549) T protein:vir:10 219 HAVEPRADR-------DPRK--------------------------LDGRNMQFASYWLDEGRDRIVQN-SGF--RTFPF 262 (549) T ss_pred EEeecCCCC-------Cccc--------------------------cccccCceEEEEEEecCCEeecc-CCc--ccCCc Confidence 332221111 1000 00112233333444456666643 333 67999 Q ss_pred EEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccccccc Q lcl|NC_013059. 301 VPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDEN 380 (725) Q Consensus 301 vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 380 (725) +|+-.. .++|..|+.|.+.+..+-.+.+|......+.....+.+.++.++.+.+-.. .+-.|+......... T Consensus 263 ~~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~------~~l~pgg~~~~~~~~ 334 (549) T protein:vir:10 263 AIGRFY--VGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDG------FDLRSGALNWGGLND 334 (549) T ss_pred ceeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccc------ceeccCCccccccCC Confidence 977544 468989999999999999999999999999999999999999876543221 111222211111111 Q ss_pred CccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_013059. 381 NGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA-TAMRR 459 (725) Q Consensus 381 ~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~-~~~~~ 459 (725) +|.. .+.++....--.....+++...+.|....=++-..+..++..+++.=|..+.+.....|...+.+|. ....= T Consensus 335 ~~~~---~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~P 411 (549) T protein:vir:10 335 KGEE---MVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGP 411 (549) T ss_pred CCcc---ceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 2211 1222222222233455677777777766643222233344446666688888888888888777776 33333 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc--c-cceEEEEeccCchhHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR--G-RYECYTDVGPSFQSMKQQNRAEIL 536 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~--g-~~Dv~v~~~p~~~t~r~~~~~~l~ 536 (725) +-+..++++.+ .|. | ++.. .++. | .++|.. ++|-...+|...+..+. T Consensus 412 li~R~~~il~r-------------~g~-----l-P~~p----------~~l~~~~~~~~i~y-is~La~aq~~~~~~~i~ 461 (549) T protein:vir:10 412 MIAREVDILAE-------------AGQ-----L-PDMP----------QELIDAGADVDVEY-DSPLNKAMRAGEGAAIL 461 (549) T ss_pred HHHHHHHHHHh-------------cCC-----C-CCCC----------hhhhcCCceeEEEe-ecHHHHHHHHHHHHHHH Confidence 33333333332 111 0 0000 1111 1 234443 33444445666666666 Q ss_pred HHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh-hhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 537 ELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG-VKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 537 ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~-~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) ++++.+.+.....+. .++..|+ ++++..+......+. +... +++.++..++.+++++++++.+.+...+. T Consensus 462 ~~~~~~~~laq~~Pe----~ld~id~---d~~~~~~a~~~Gvp~~~irs-~eev~~~r~~~~~qqq~~~~~~~a~~a~~- 532 (549) T protein:vir:10 462 QWLQQLGIVSQFDPA----AAKVPNG---ARIARLLADYGGVPVEAMST-DEELQAQQAAEAQAAQMQQMLAAAPVAAG- 532 (549) T ss_pred HHHHHHHHHhccChh----HHhcCCH---HHHHHHHHHhcCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 666554433221111 1223333 333333322222211 1111 11111111111111111111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEI 653 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~ 653 (725) +.+..++++.+ ++.+.. T Consensus 533 --------------------~a~~~~~~~ta-~~~~~~ 549 (549) T protein:vir:10 533 --------------------AIKDLSDAQTA-AQTARV 549 (549) T ss_pred --------------------HHHhhhhhcCC-CcccCC Confidence 00011111000 000000 No 31 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.75 E-value=9.9e-17 Score=108.34 Aligned_cols=436 Identities=9% Similarity=0.035 Sum_probs=212.0 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC----cccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG----QFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp----~~N~i~~~v~~v~g~~~~nr~ 76 (725) |..++.+..+.+..|...+. ..+....+-.+||.|.|-- .....+..++| ++|..+.+|+..+|+.-.+.+ T Consensus 11 ~~~~~~~~~~~i~~~i~~~~---~~~~r~~~~~~yy~g~~~i--~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~ 85 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQ---EEVERYEYLGNMYKGIMEI--SSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGIPI 85 (453) T ss_pred ccccccCCHHHHHHHHHHHH---HHHHHHHHHHHHhccccch--hcCCCCCccCccceeecchHHHHHHHhhhhhcccCc Confidence 77666666666666655443 3334555568999998741 11122223333 569999999999999988776 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+. +. |. .....+..+.+.|+++...+.+.++++++|.||.-+..+ ++ | ...+++. ++.+ T Consensus 86 ~~~--~~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~--~-~~~i~~~----~p~~~ 146 (453) T protein:vir:73 86 KKT--HD---DK----SVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQN---ES--T-ESEVIYC----SPLNV 146 (453) T ss_pred eee--cC---Ch----HHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeC---CC--C-ceEEEEE----cccce Confidence 553 32 22 233467777888999999999999999999999877543 22 2 2222221 2222 Q ss_pred -eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 -VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 -v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) ++||+.... ..++..+|-. +.+....+++|.... ++. T Consensus 147 ~~v~dd~~~~------~~~~~i~~~~--------------------------------~~~~~~~~~vyt~~~----i~~ 184 (453) T protein:vir:73 147 FMVYDDSIKQ------KPLFAVYYGF--------------------------------DEEGNLSGTVYTLLE----TIS 184 (453) T ss_pred EEEEeCCCCc------eeEEEEEEEE--------------------------------ecCceEEEEEEeCCe----EEE Confidence 344443221 1222222210 001112334444321 111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +.. .++ ...+.++.|.+.+.+|+|||.. + . T Consensus 185 ~~~-~~~------------------------------------------~~~~~~~~~~~~g~vPvv~~~n------~-~ 214 (453) T protein:vir:73 185 ITG-KAG------------------------------------------EVKFGESTYNVYSDLPIVEYNF------N-E 214 (453) T ss_pred EEe-cCC------------------------------------------ceEEccceeccCCceeEEEecC------C-C Confidence 110 000 0111233344455667766532 1 1 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcc-ccccCCcccC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE-MPTQPLAYYE 393 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~ 393 (725) .+.|.+..+++.++.+|..+|.+...+...+....++.-...++. ..-..................+. .....++++. T Consensus 215 ~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~ 293 (453) T protein:vir:73 215 ERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEE-DAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLD 293 (453) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch-hhhcccccccccccccccccccccccCceeEEee Confidence 255788899999999999999999888776665544321111110 00000000000000000011111 1112234444 Q ss_pred CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013059. 394 NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYD 473 (725) Q Consensus 394 ~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~ 473 (725) .+.-..++...++.....|-..|++.+.+.+..+| .||+|+...-..........-..|..+++++.++++.+ .. T Consensus 294 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~----~~ 368 (453) T protein:vir:73 294 KPDSDVQTENLLNRLERSIFQFTMAANISDENFGN-SSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSL----ST 368 (453) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hh Confidence 33333445566788888888899887666554444 69999988766666666666666677776666655543 21 Q ss_pred CCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHH Q lcl|NC_013059. 474 VPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLL 553 (725) Q Consensus 474 ~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~ 553 (725) . .+. . .|. .+|.|.-.|..+.-..+..+.++.+.+.++. ..+ T Consensus 369 ~------~~~--~---------------------~~~---~~i~v~f~~~~p~~~~~~a~~~~k~~giis~------et~ 410 (453) T protein:vir:73 369 N------ASN--K---------------------DAW---KDIEYTFTRNEPKDIKEQAETANILKGITSE------ETA 410 (453) T ss_pred c------cCC--c---------------------ccc---ccceEEeCCCCCCCHHHHHHHHHHHhccCcH------HHH Confidence 1 010 0 011 2344445566665445555555555433221 222 Q ss_pred HHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 554 LQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) Q Consensus 554 ~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~ka 625 (725) +..++..+-+ +.-.+++.++. .+... .++. ..-.++.+ +...+ T Consensus 411 ~~~~~~~~d~--~~E~~ri~~E~-------------~~~~~-~~~~-~~~~~~~~------------~~~~~ 453 (453) T protein:vir:73 411 LSVISVIPDV--QAEMEKIKKKK-------------LLQLS-LTRT-SNLVRMKQ------------MRGNL 453 (453) T ss_pred HHhCCCCCCH--HHHHHHHHHHH-------------HHHHH-HHHh-ccCCcchh------------hhcCC Confidence 2233332211 22222222110 00000 0000 00000000 00000 No 32 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.75 E-value=1.3e-16 Score=107.63 Aligned_cols=451 Identities=9% Similarity=-0.017 Sum_probs=214.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC------CCHHHH-------HHHhhcC----CCcccchHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ------WDDWLS-------QYTTLQY----RGQFDVVRPV 63 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q------W~~~~~-------~~l~~~g----rp~~N~i~~~ 63 (725) |- .+.+.+++..+... ..+.+....+..+||.|.| ...... ...+..+ |.++|..+.+ T Consensus 1 ~~--~e~~~~~i~~~~~~---~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 75 (471) T protein:vir:10 1 ME--IEVIKKIISSQMVK---HGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLL 75 (471) T ss_pred CC--HHHHHHHHHHHHHH---HHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHH Confidence 32 23444444444432 3345667788899999975 100000 0001112 3357999999 Q ss_pred HHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCcee Q lcl|NC_013059. 64 VRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) Q Consensus 64 v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ 143 (725) |+..+|+.-.+.+.+.+ ++.+..+ .+..+.+ |+++...+.+..++.+.|.||.-+.++.. +..+. T Consensus 76 vd~~~~yl~G~p~~~~~-----~~~~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-----~g~~~ 140 (471) T protein:vir:10 76 LDQKKAYALTYPPTFDV-----DDKKVND----MIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS-----DNSFR 140 (471) T ss_pred HHhhhhhhcccCceecc-----CChHHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC-----CCeeE Confidence 99999999887766542 2333333 3444444 78999999999999999999988876522 12333 Q ss_pred EEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEE Q lcl|NC_013059. 144 IRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAE 221 (725) Q Consensus 144 ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E 221 (725) +.+. +|.. ++||.... +-..++++.|...+ ..+.+.+..++ T Consensus 141 ~~~~----~p~~~~~i~d~~~~-----~~~~~~ir~~~~~~----------------------------~~~~~~~~~~~ 183 (471) T protein:vir:10 141 YACV----DSKEVIPIYSKSLD-----KKSIGVLRVYSSID----------------------------ETDGKNYTVYE 183 (471) T ss_pred EEEE----cccceEEEEcCCCC-----CceEEEEEEEEeec----------------------------cCCCceeEEEE Confidence 3321 2222 34554321 11222333332210 01123445566 Q ss_pred EEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceE Q lcl|NC_013059. 222 FYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIV 301 (725) Q Consensus 222 ~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~v 301 (725) +|..... ..+...+......+.... ..+...+..|.....+..+.+.+.+|+| T Consensus 184 vy~~~~~--~~y~~~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~g~iPvv 236 (471) T protein:vir:10 184 YWNDKEC--SFYRHEKEKPLEELETFQ-------------------------AISLIDTMNGDRSSDNSFKHDFGLVPFI 236 (471) T ss_pred EEeCCcE--EEEEecCCcccccccccc-------------------------cccccccccccccccccccCCCCceeEE Confidence 6654322 222222211111110000 0000112234434444444555566666 Q ss_pred EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhcccccccccccccc Q lcl|NC_013059. 302 PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDEN 380 (725) Q Consensus 302 P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~ 380 (725) +|... ..+.|.+..+++.++.+|..+|.+...+...++..+++ .|.. ....+..........+.+ . . T Consensus 237 ~~~n~-------~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~i~~---~-~ 304 (471) T protein:vir:10 237 PFKNN-------EIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVL-TNYGGQDKQEFLEDLKRYKMIKM---D-N 304 (471) T ss_pred EeccC-------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccchhHHHhhcCCeEEe---c-C Confidence 55321 22557788999999999999999998888777654433 2211 111111111111111111 0 0 Q ss_pred CccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 381 NGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) Q Consensus 381 ~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 460 (725) .|.-....+.++..+.-..+....++...+.|-..|++.+.+.+..|| .||+|+..+...........-..|..+++++ T Consensus 305 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~ 383 (471) T protein:vir:10 305 DGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGN-SSGVALKFLYSLLELKAGNMETQFRSGYATL 383 (471) T ss_pred CCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111112234555544444566778888899999999876655554444 5999998877666665555666666666655 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHH Q lcl|NC_013059. 461 GEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLG 540 (725) Q Consensus 461 g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~ 540 (725) .++++. ++.. .+ ..||.|.-.+..+.--.+..+.++.+.+ T Consensus 384 ~~li~~----~~~~------~d------------------------------~~~i~i~f~~~~p~n~~e~~~~~~kl~g 423 (471) T protein:vir:10 384 VKMILK----HLGL------SD------------------------------KLKIKQTWTRNSINNDTEMAQVVSTLAT 423 (471) T ss_pred HHHHHH----Hhcc------CC------------------------------CceeEEEeCCCCCCCHHHHHHHHHHHhc Confidence 555544 4311 00 0234444455555544444555544432 Q ss_pred hcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 541 KTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) Q Consensus 541 ~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qa 620 (725) .+ +. ..++..++..+ ..+.-++++.+.........+... ....+. T Consensus 424 ~i----S~--et~~~~~p~v~--D~~~E~eri~~E~~~~~~~~~~~~-------------~~~~~~-------------- 468 (471) T protein:vir:10 424 IT----SR--ENVAKSNPIVE--DWQDELRLQKAEQEGRSEKLYDME-------------EVEHES-------------- 468 (471) T ss_pred cC----ch--HHHHHhCCCCC--CHHHHHHHHHHHHHHHHhcccccC-------------CCCCcc-------------- Confidence 22 21 12222222221 122223333221111000000000 000000 Q ss_pred HHH Q lcl|NC_013059. 621 ELA 623 (725) Q Consensus 621 e~~ 623 (725) |.+ T Consensus 469 e~~ 471 (471) T protein:vir:10 469 EVE 471 (471) T ss_pred ccC Confidence 000 No 33 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.74 E-value=1.6e-15 Score=101.67 Aligned_cols=510 Identities=10% Similarity=0.006 Sum_probs=243.0 Q ss_pred CCcHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHH----hh Q lcl|NC_013059. 1 MADNKN---RLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEM----RQ 73 (725) Q Consensus 1 mad~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~----~~ 73 (725) ||+-+. --+.+..+|..-.+....|-..+.+..+|..-.=.+.+.-.......++--..-...++.+.+.. .- T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 997542 23345556666555555666666777777643211111000001111221133333444333322 22 Q ss_pred CCcceEEecCCc-------ch---HHHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 74 NPIDVLYRPKDG-------AS---PDAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 74 nr~~~~~~pr~~-------~d---~~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) +++=+++.+.+. .+ .++.+.| +..+......|++..+...+|.+.+..|.|++-+ .+++ +. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~~-~~ 154 (535) T protein:vir:33 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYL-----PEPE-GS 154 (535) T ss_pred CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEe-----ecCC-CC Confidence 344344444331 11 1223333 2333344567889999999999999999998654 2222 22 Q ss_pred ceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIA 220 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~ 220 (725) .+..+..| +.++++..++.- ...-+|++..||...+.+.|+..... +. .... .++.+-|. T Consensus 155 ~~~f~~~p----l~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~~--~~-----~~k~-----~~~~~~v~ 214 (535) T protein:vir:33 155 YNPMKLYR----LSSYVVQRDAYG----NVLQIVTRDQIAFGALPEDVRSAVEK--SG-----GEKK-----MDEMVDVY 214 (535) T ss_pred ceeeEEEE----cCeeEEeeCCCC----CeeEEEeeEeecHHHHHHHhhhhhcc--cc-----cccc-----cccCCeEE Confidence 23334444 345777665432 12237888999999888877752110 00 0000 01222222 Q ss_pred EEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEE-EeeccccccCCCCCCCCccc Q lcl|NC_013059. 221 EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS-IITCTAVLKDKQLIAGEHIP 299 (725) Q Consensus 221 E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~-~~~g~~~l~~~~~~p~~~~p 299 (725) .+.++ +..+|. +.|+ -..|..+....+.|+++.+| T Consensus 215 ~~v~~-----------~~~~~~---------------------------------~~~~~~~~~~~~~~~~~~~~~~~~P 250 (535) T protein:vir:33 215 THVYL-----------DEESGD---------------------------------YLKYEEVEDVEIDGSDATYPTDAMP 250 (535) T ss_pred EEEEe-----------eCCCCc---------------------------------EEEEEEEeCccccccccccccccCC Confidence 22211 111111 1122 22344444455778899999 Q ss_pred eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccc Q lcl|NC_013059. 300 IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDE 379 (725) Q Consensus 300 ~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 379 (725) |+|+-.. .++|..|+.|.+.+..+-.+.+|+.....+.....+.+.+++++++.+-...+.. +...+..+.. T Consensus 251 ~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~------~~~~g~~v~g 322 (535) T protein:vir:33 251 YIPVRMV--RIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT------KAQTGDFVPG 322 (535) T ss_pred ceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc------cCCceeeecC Confidence 9977544 4689999999999999999999999999999999999999998776544322211 1111111222 Q ss_pred cCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhc-cCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_013059. 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNLAT-AM 457 (725) Q Consensus 380 ~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G-~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~ 457 (725) ..+.+. +++ .....-.......++...+.|.... ..+ +++ .++..+++.=|..+.+.....|..++.+|.. .. T Consensus 323 ~~~~v~--~~~-~~~~~~~~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell 397 (535) T protein:vir:33 323 RREDID--FLQ-LEKQADFTVAKAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (535) T ss_pred Ccccce--eee-cccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHH Confidence 223221 221 1122223446677777778887765 233 444 4445566667888888888888888877763 33 Q ss_pred HHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHH Q lcl|NC_013059. 458 RRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILE 537 (725) Q Consensus 458 ~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~e 537 (725) .=+.+.++.++.+ .|.. + ... .+.+.+.+..+ -....|.+.++.+.+ T Consensus 398 ~Pli~r~~~il~r-------------~g~l------P-~~p------------~~~v~~~yis~-La~aqr~~~~~~l~~ 444 (535) T protein:vir:33 398 LPLVRVLLKQLQA-------------TSQI------P-ELP------------KEAVEPTISTG-LEAIGRGQDLDKLER 444 (535) T ss_pred HHHHHHHHHHHHh-------------cCCC------C-CCC------------ccceeEEEecH-HHHHHHHHHHHHHHH Confidence 3333333443322 2210 0 000 01245555433 344567888888888 Q ss_pred HHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh--hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 538 ll~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~--~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) +++.+....|.... ...|++ +++..+......+ .+... +++.+...+++++++++++ ++. .. T Consensus 445 ~~~~la~~~P~~~d------~~id~d---~~~~~~a~~~Gvp~~~i~~~--~ee~~~~~~q~~~~~~~~~--~~~---~~ 508 (535) T protein:vir:33 445 CISAWAALAPMQGD------PDINLA---VIKLRIANAIGIDTSGILLT--DEQKQALMMQDAAQTGVEN--AAA---AG 508 (535) T ss_pred HHHHHHhhChhhhh------ccCCHH---HHHHHHHHHcCCCHhHhcCC--HHHHHHHHHHHHHHHHHHH--HHH---hh Confidence 88877665553211 123333 3333332222221 12211 1111111111111000000 000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLS 660 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~ 660 (725) .+..+..+ +..-+ .+...+..+=+ ++ . T Consensus 509 g~~~~~~~-------~~~~~-------~~~~~~~~~g~--~~--~ 535 (535) T protein:vir:33 509 GAGVGALA-------TSSPE-------AMQGAAAKAGL--NA--T 535 (535) T ss_pred hhhhcchh-------hcCCh-------hHHHHHHhccC--CC--C Confidence 00000000 00000 00000000000 00 0 No 34 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.74 E-value=6.3e-17 Score=109.42 Aligned_cols=469 Identities=10% Similarity=0.037 Sum_probs=219.4 Q ss_pred CC-cHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC----CcccchHHHHHHHHHHHhhC Q lcl|NC_013059. 1 MA-DNKNRLESILSRFDADWTA-SDEARREAKNDLFFSRVSQWDDWLSQYTTLQYR----GQFDVVRPVVRKLVSEMRQN 74 (725) Q Consensus 1 ma-d~~~~~~~~~~~~~~~~~~-~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr----p~~N~i~~~v~~v~g~~~~n 74 (725) |. +...... ........+.. ....+....+-.+||.|.|.--......+..++ .++|..+.+|+..+|+.-.+ T Consensus 31 ~~~~e~~~~~-~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~ 109 (511) T protein:vir:96 31 YDGTESDLLQ-NVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (511) T ss_pred cchhhhhhhc-cHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhccC Confidence 32 2221110 01111112221 122344566678999998764322222233333 36799999999999999998 Q ss_pred CcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh Q lcl|NC_013059. 75 PIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS 154 (725) Q Consensus 75 r~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~ 154 (725) ++.+.+ ++.+ ....+..+.+.|+++......++++++.|.+|..+..+ ++ + .+.++.. ++. T Consensus 110 p~~~~~-----~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~d---ed--~-~~~i~~~----~p~ 170 (511) T protein:vir:96 110 PIQYQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYKS----DAM 170 (511) T ss_pred Cceeec-----CchH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEEE----ccc Confidence 888753 2222 34567788888999999999999999999999877543 22 1 3333321 233 Q ss_pred h--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEE Q lcl|NC_013059. 155 H--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) Q Consensus 155 ~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~ 232 (725) + ++||..... . ..++++.|.... . .....+.+..+++|.... + T Consensus 171 ~~~~vydd~~~~-~----~~~~vr~~~~~~-------------~-------------d~~~~~~~~~~~iyt~~~----i 215 (511) T protein:vir:96 171 STFVIYDNTIER-N----SIAGVRYLRTKP-------------I-------------DKTDEDEVFTVDLFTSHG----V 215 (511) T ss_pred eeEEEEcCCCCC-c----eEEEEEEEEeee-------------c-------------cccccceEEEEEEEeCCc----E Confidence 2 235533210 0 123333331100 0 000122334445554421 1 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCC Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVED 312 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~ 312 (725) +.+....++ ... . ......+.|.|++.+|+|+|. .+ T Consensus 216 ~~~~~~~~~-~~~-----------------------------------~--~~~~~~~~~~~~~~vPvv~~~------nn 251 (511) T protein:vir:96 216 YRYLTSRTN-GLK-----------------------------------L--TPRENGFESHSFERMPITEFS------NN 251 (511) T ss_pred EEEEecCCC-ccc-----------------------------------c--cccccccccccCCceeeEEec------CC Confidence 111110000 000 0 000112334455566666542 11 Q ss_pred ccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc--ccccccC--ccccccC Q lcl|NC_013059. 313 KEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL--NRTDENN--GEMPTQP 388 (725) Q Consensus 313 ~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~--g~~~~~~ 388 (725) ..+.|.+.++++.++.+|...|.+...+...+...+++ .|....-..............+ ....... +...... T Consensus 252 -~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 252 -ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred -CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCccCCchhhcccccccceecccccccccccccCCCCcc Confidence 23578899999999999999999998887666554432 2211110011110000000000 0000000 1111223 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..++...++...+.|-.+|++.+.+.+.-++..||+|+..............-.-|..+++++.++++.++ T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 409 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 409 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44554444456667788888999999999877776655455799999988777777766777777777777777666654 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) ...... +... |+ .+|.|.-.+..+.-..+..+.++.+.+.++. . T Consensus 410 ~~~~~~---------~~~~---------------------d~---~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~---e 453 (511) T protein:vir:96 410 KNTWSI---------DANK---------------------DF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ---T 453 (511) T ss_pred HhhcCc---------cccc---------------------cc---ccceEEeCCCCCCCHHHHHHHHHHHhccCCh---H Confidence 321100 0000 11 2445555566665555555556555433321 1 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQ 628 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae 628 (725) . ++..++..+ ..++-++++.+.... .....+..........--..... + .+-..+ T Consensus 454 t---~l~~l~~v~--D~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~------~-~~~~~~ 508 (511) T protein:vir:96 454 T---LMSLFSFFQ--DPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDD------D-TKDTVD 508 (511) T ss_pred H---HHHhCCCCC--CHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCCC------c-cccccc Confidence 2 222222222 112222333221100 00000000000000000000000 0 000000 Q ss_pred HHH Q lcl|NC_013059. 629 TLS 631 (725) Q Consensus 629 ~~k 631 (725) +.+ T Consensus 509 ~~~ 511 (511) T protein:vir:96 509 KKE 511 (511) T ss_pred ccC Confidence 000 No 35 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.74 E-value=9.5e-17 Score=108.42 Aligned_cols=469 Identities=11% Similarity=0.037 Sum_probs=220.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhhh-HHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTAS-DEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~-~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~~nr 75 (725) |..-...+..........+... ...+....+-.+||.|.|..-......+..+ |.++|..+.+|+..+|+...+. T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccC Confidence 4432222211112222222222 2223345566789999886321111222223 3467999999999999998888 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) +.+.+ +|.+ ....+..+++.|+++.....+.++++++|.+|.-+..+ ++ + .+.+... ++.+ T Consensus 111 ~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~d---ed--~-~~~i~~~----~p~~ 171 (512) T protein:vir:97 111 IQCQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYKS----DAMS 171 (512) T ss_pred ceecc-----CChH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeC---CC--C-ceEEEEE----cccc Confidence 77643 2222 34567778888999999999999999999999877543 22 1 3333322 2332 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++||+.... -...+++.|.... . .....+.+..+++|..... + T Consensus 172 ~~~iyd~~~~~-----~~~~~vr~~~~~~-------------~-------------~~~~~~~~~~~~vyt~~~i----~ 216 (512) T protein:vir:97 172 TFVIYDNTIER-----NSIAGVRYLRTKP-------------I-------------DKTDEDEVFTVDLFTSHGV----Y 216 (512) T ss_pred eEEEEcCCCCC-----ceEEEEEEEEeee-------------c-------------cccccceEEEEEEEeCCcE----E Confidence 456665421 1123333332110 0 0001234445566655321 1 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) .+... ++..... .....++.|.|++.+|+|+|.- + T Consensus 217 ~~~~~-~~~~~~~-------------------------------------~~~~~~~~~~~~g~vPvv~~~n------n- 251 (512) T protein:vir:97 217 RYLTS-RTNGLKL-------------------------------------TPRENGFESHSFERMPITEFSN------N- 251 (512) T ss_pred EEEec-CCCcccc-------------------------------------cccccccccccCcccceEeecC------C- Confidence 11110 0100000 0001133455566677776521 1 Q ss_pred cccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccc-----cccccccCcccccc Q lcl|NC_013059. 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYL-----LNRTDENNGEMPTQ 387 (725) Q Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~-----~~~~~~~~g~~~~~ 387 (725) ..+.|.+..+++.++.+|...|.+...+...+...+++ .|.. ................. .+.... .+..... T Consensus 252 ~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 329 (512) T protein:vir:97 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYENRDTG-IETEGSV 329 (512) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCccCCchhhhhhhhcccccccccchhhcccc-cCCCCCc Confidence 23568899999999999999999998887666554432 2211 11111111000000000 000000 0000112 Q ss_pred CCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 388 PLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 388 ~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) .++++..+.-..++..+++.....|-.+|++.+.+.|.-++..||+|+..............-.-|..+++++.++++.+ T Consensus 330 d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~ 409 (512) T protein:vir:97 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETI 409 (512) T ss_pred ceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23444433334456667888889999999988877776555579999988777666666666677777777766666654 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) +...-.. +.. -|+ .+|.|.-.|..+.-..+..+.+..+.+.++. T Consensus 410 ~~~~~~~---------~~~---------------------~d~---~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~--- 453 (512) T protein:vir:97 410 LKNTRSI---------DAN---------------------KDF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ--- 453 (512) T ss_pred HHhcCCc---------ccc---------------------ccc---ccceEEeCCCCCcCHHHHHHHHHHHhccCch--- Confidence 4211100 000 011 1445555666665455555556555433322 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQN 627 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqa 627 (725) ..++..++..+ ..+..++++.+.... ....................+.. ++ T Consensus 454 ---et~~~~l~~v~--d~~~E~eri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~-----------~~ 504 (512) T protein:vir:97 454 ---TTLMSLFSFFQ--DPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDD-----------DT 504 (512) T ss_pred ---HHHHHhCCCCC--CHHHHHHHHHHHHHH-------------HHHHHhhcccCCCCCCCCCCCCC-----------Cc Confidence 12222232222 122223333221100 00000000000000000000000 00 Q ss_pred HHHHHHHH Q lcl|NC_013059. 628 QTLSLQID 635 (725) Q Consensus 628 e~~k~q~e 635 (725) +....+.+ T Consensus 505 ~~~~~~~~ 512 (512) T protein:vir:97 505 KDTVDKKE 512 (512) T ss_pred cccccccC Confidence 00000000 No 36 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.74 E-value=2.5e-16 Score=106.10 Aligned_cols=436 Identities=10% Similarity=0.062 Sum_probs=210.4 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC----cccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG----QFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp----~~N~i~~~v~~v~g~~~~nr~ 76 (725) +-.+.++..+.+..|-..+ ...+.+..+..+||.|.| +-.....+..++| ++|..+.+|+..+|+.-.+.+ T Consensus 11 ~~~~~~~~~~~i~~~i~~~---~~~~~r~~~~~~Yy~g~~--~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~ 85 (452) T protein:vir:36 11 FSKDEPITVEVVTKFMEKH---KLEVARYEYLKNMYLGIM--AIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNGIPV 85 (452) T ss_pred cCCccCCCHHHHHHHHHHH---HHHHHHHHHHHHHhcccc--ccccCccccccCccceeecchHHHHHHHHhhhhcccCc Confidence 3323223323333333222 222334566789999986 1111222333333 469999999999999988876 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+. +.+. + ....++.+++.|+++...+.+.++++++|.||+.+..+ ++ | .+.+++. ++.+ T Consensus 86 ~~~--~~d~---~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~--g-~~~i~~~----~p~~~ 146 (452) T protein:vir:36 86 KKS--HSDK---E----ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQD---ED--T-QTNVVYN----SPENM 146 (452) T ss_pred eee--cCCh---h----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEec---CC--C-eeEEEEE----cccce Confidence 654 3221 1 24457778888999999999999999999999877543 22 2 2223221 2222 Q ss_pred -eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 -VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 -v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) .+||+.... -...+++.|. ..+....+++|... .++. T Consensus 147 ~~v~d~~~~~-----~~~~~i~~~~---------------------------------~~~~~~~~~vyt~~----~i~~ 184 (452) T protein:vir:36 147 FMVYDDTVKQ-----EPLFAVRYGV---------------------------------DEDKKLQGEVYTLL----ETIK 184 (452) T ss_pred EEEEcCCCCC-----ceEEEEEEEE---------------------------------ecCceEEEEEEecC----eEEE Confidence 234432110 0011111111 01112223344321 1111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +.. -.+...+.+..|.+.+.+|+|+|+.. . T Consensus 185 ~~~-------------------------------------------~~~~~~~~~~~~~~~g~iPvv~~~n~---~---- 214 (452) T protein:vir:36 185 ISG-------------------------------------------ENDEISFGEGTYNPYPDLPVVEFYFN---E---- 214 (452) T ss_pred EEE-------------------------------------------cCCceEEecceeccCCcccEEEecCC---C---- Confidence 110 00111122334445556666655321 1 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 394 (725) .+.|.+..+++.++.+|..+|.+...+...+...+++.-...+. +..... .....+.. ..+|.-....++++.. T Consensus 215 ~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~-~~~~~~---~~~~~~~~--~~~~~~~~~~~~~l~~ 288 (452) T protein:vir:36 215 ERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE-EDLKNI---RSNRVINY--YADGEGKNVDVKFLEK 288 (452) T ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc-hhhhhh---hhcceEEe--cCCCCccCCcceeEee Confidence 24577889999999999999999988877766655443212111 111111 11111110 1111111223444443 Q ss_pred CCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013059. 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV 474 (725) Q Consensus 395 ~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~ 474 (725) +.-..+....++...+.|-..|++.+.+.+..+| .||+|+..+-..........-..|..+++++.++++.+... T Consensus 289 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---- 363 (452) T protein:vir:36 289 PDSDSQTENLLDRLTKLIFQTTMVANISDESFGS-SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTN---- 363 (452) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCccccCcccccC-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Confidence 3334556667888899999999887766665554 59999988776666666667777777777777776665431 Q ss_pred CcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHH Q lcl|NC_013059. 475 PRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLL 554 (725) Q Consensus 475 ~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~ 554 (725) .|. .. |+ .||.|.-.+..+.-..+..+.++.+.+.++. ..++ T Consensus 364 ------~~~--~~---------------------~~---~~i~i~f~~~~p~d~~~~a~~~~k~~g~iS~------et~~ 405 (452) T protein:vir:36 364 ------VSN--KD---------------------SW---KDIEYTFTRNEPKDIKEQAETANILMGITSQ------ETAL 405 (452) T ss_pred ------cCC--cc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHH Confidence 111 00 01 2445555566655444455555554333221 1222 Q ss_pred HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 555 QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 555 ~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) ..++..+ ..+.-++++.+........ .+... . .... .+.+......| T Consensus 406 ~~~~~~~--d~~~E~~ri~~E~~~~~~~--------------~~~~~--~-~~~~-~~~~~~~~~~e 452 (452) T protein:vir:36 406 SVISVIP--DVQAEMEKIKKEEASTAIF--------------DKDKQ--P-SEKG-TDTVVSETNEE 452 (452) T ss_pred HhCCCCC--CHHHHHHHHHHHHHHHHHH--------------Hhhcc--C-CCCc-ccccCccccCC Confidence 2222221 1222233332211000000 00000 0 0000 00000000000 No 37 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.73 E-value=9.2e-17 Score=108.51 Aligned_cols=443 Identities=9% Similarity=-0.015 Sum_probs=209.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC----CcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYR----GQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr----p~~N~i~~~v~~v~g~~~~nr~ 76 (725) |..+..+..+.+..|-.. +....+....+-.+||.|+| ..+...+..++ .++|..+.+|+..+|+...+.+ T Consensus 19 ~~~~~~~~~~~i~~~i~~--~~~~~~~~~~~l~~Yy~g~~---~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~ 93 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAY--NETVLKPRYRENMKLYLGKH---KILTAPEKETGADNRIVVNSAKYVVDVYNGYFCGIEP 93 (470) T ss_pred eCCCCCcCHHHHHHHHHH--HHHhhHHHHHHHHHHhcccc---ccccCcccccCCcceeecchHHHHHHHHhhhhccCCe Confidence 554433333332222221 12333344566689999976 11111222233 3579999999999999998887 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+.+. +|.+..+ .+..+.+.|+++.....++.+++++|.+|.-+..+ ++ | .+.+.. .++.+ T Consensus 94 ~~~~~----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d---~d--g-~~~i~~----~~p~~~ 155 (470) T protein:vir:99 94 KLALL----NDSSKID----EIARWNRQENFFDTINEISKQCDIFGRSIASIYQG---ED--A-RPHLMY----SSPNHA 155 (470) T ss_pred eEeeC----CchhHHH----HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeC---CC--C-eEEEEE----Ecccee Confidence 76552 2222222 24456678999999999999999999999877543 22 2 222322 12332 Q ss_pred -eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 -VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 -v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) ++||+.... . ..++++.|...+ .......+++|+.. .++. T Consensus 156 ~~i~d~~~~~-~----~~~~vr~~~~~~------------------------------~~~~~~~~~~~~~~----~~~~ 196 (470) T protein:vir:99 156 FIIYDDTVQR-Q----PLAFVHYQIDNS------------------------------NNWTDAYGVIQYAD----KFYK 196 (470) T ss_pred EEEEcCCCCc-c----eEEEEEEEEEec------------------------------CCeeEEEEEEEecC----eEEE Confidence 345543211 0 111222221100 00111111222111 1111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +..... .....+.+..+.|.+.+|+|+|... . T Consensus 197 ~~~~~~-----------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n~-------~ 228 (470) T protein:vir:99 197 FKGYDI-----------------------------------------EEDTNAAGYAINPYGLVPAVEFFEN-------E 228 (470) T ss_pred EEeccc-----------------------------------------ccccccccccccCCCccceEeecCC-------C Confidence 110000 0011112334455566777765321 2 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHH---HHHHhhccccccccccccccCccccccCCcc Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE---HMYDGNDDYPYYLLNRTDENNGEMPTQPLAY 391 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 391 (725) .+.|.+..+++.++.+|..+|.+...+...+.....+.-...+..+ .......... ..+. . .+.-....++. T Consensus 229 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~-~~~~---~-~~~~~~~~~~~ 303 (470) T protein:vir:99 229 ERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRV-LYVS---Q-LDPDTNPQIGF 303 (470) T ss_pred CCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcce-eeec---C-CCCCCCCcceE Confidence 3567888999999999999999998887776665544322111110 0100000000 0000 0 00011223445 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013059. 392 YENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDI 471 (725) Q Consensus 392 ~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~ 471 (725) +..+.-...+...++...+.|-..||+.+.+.+..++..||+|+..............-..|..+++++.++++.++..- T Consensus 304 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 383 (470) T protein:vir:99 304 IAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNN 383 (470) T ss_pred EeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 54443334555678888999999999987766665556799999987777666666667777777777766665543321 Q ss_pred cCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHH Q lcl|NC_013059. 472 YDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQL 551 (725) Q Consensus 472 y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~ 551 (725) ... . + + -.+|.|.-.|..+.-..+..+.+..+.+.++. - T Consensus 384 ~~~-----------~--~-------------------~---~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~------e 422 (470) T protein:vir:99 384 KQD-----------Q--E-------------------L---WSELDFKFTRNLPEDMASAIDNAKNAEGIVSK------K 422 (470) T ss_pred CCc-----------c--c-------------------c---cccceEEeCCCCCcCHHHHHHHHHHHhccCCH------H Confidence 100 0 0 0 12455555666665444555555555433221 1 Q ss_pred HHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 552 LLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTL 630 (725) Q Consensus 552 ~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~ 630 (725) .++..++..| .+.-++++.++... . .+..+........ ..+ ....+.. T Consensus 423 t~l~~l~~vd---~~~E~eri~~E~~~-------------~-~~~~~~~~~~~d~-----------~~~---d~~~ee~ 470 (470) T protein:vir:99 423 TQLGMIPDIE---PDAEMKQIAKEKAD-------------A-IKQTQQLSMPIDI-----------LKR---DNNAEEE 470 (470) T ss_pred HHHHhCCCCC---HHHHHHHHHHHHHH-------------H-HHHHHhhcCCCCc-----------CCC---CCCccCC Confidence 2222233222 11222222211100 0 0000000000000 000 0000000 No 38 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.73 E-value=1.2e-16 Score=107.88 Aligned_cols=454 Identities=10% Similarity=0.011 Sum_probs=210.0 Q ss_pred CCcHHHHHHH-HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHhhcCCC----cccchHHHHHHHHHHHhhC Q lcl|NC_013059. 1 MADNKNRLES-ILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS-QYTTLQYRG----QFDVVRPVVRKLVSEMRQN 74 (725) Q Consensus 1 mad~~~~~~~-~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~-~~l~~~grp----~~N~i~~~v~~v~g~~~~n 74 (725) |.+.+.+..+ ++.-..+.. ...+....+-.+||.|.|..-... ..+...++| ++|..+.+|+..+|+...+ T Consensus 16 ~~~~~~l~~~~i~~li~~~~---~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~ 92 (506) T protein:vir:94 16 QESLENLTPNKIMKFITHHF---NYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGN 92 (506) T ss_pred ccchhcCCHHHHHHHHHHHH---HHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhccc Confidence 6554433222 222222211 122334556678999998632111 223344444 5699999999999999888 Q ss_pred CcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh Q lcl|NC_013059. 75 PIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS 154 (725) Q Consensus 75 r~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~ 154 (725) .+.+. +.+. . .+..+..+.+.|+++.....+.++++++|.+|..|..+ ++ + .+.+.+. ++. T Consensus 93 p~~~~--~~d~---~----~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~d---ed--~-~~~i~~~----~p~ 153 (506) T protein:vir:94 93 PINVK--LPDD---G----SNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRG---ED--N-EEHLAKL----DPL 153 (506) T ss_pred Cceee--cCcc---h----HHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEec---CC--C-eeEEEEE----ccc Confidence 76554 3322 1 24567788889999999999999999999999888653 22 2 2333221 222 Q ss_pred h--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeE----EEEEEEEEecc Q lcl|NC_013059. 155 H--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTI----QIAEFYEVVEK 228 (725) Q Consensus 155 ~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~v----rv~E~w~~~~~ 228 (725) . ++||.... +....+++.|...+ ...+.+ ...++|.. T Consensus 154 ~~~~v~dd~~~-----~~~~~~v~~~~~~~-----------------------------~~~~~~~~~~~~~~~yt~--- 196 (506) T protein:vir:94 154 DTFVIYSTDVD-----PKPIMAVRYHQIEL-----------------------------VDDNQVSTINYVPETWTA--- 196 (506) T ss_pred ceEEEecCCCC-----CceEEEEEEEeeee-----------------------------ccCCceeEEEEEEEEEeC--- Confidence 2 23443221 11223343332210 001111 12222221 Q ss_pred eeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeee Q lcl|NC_013059. 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWG 308 (725) Q Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~ 308 (725) ..++.+.. . . +...+.+..+.+.+.+|+|+|... T Consensus 197 -~~~~~~~~----~-------~--------------------------------~~~~~~~~~~~~~g~vPvv~~~n~-- 230 (506) T protein:vir:94 197 -DTYTLYNP----T-------P--------------------------------IMGKMQVDTTKPITTFPVVEFKNS-- 230 (506) T ss_pred -ceEEEecc----c-------c--------------------------------CccceeccccccCCccceEEecCC-- Confidence 11111110 0 0 000112223444566777766321 Q ss_pred ccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcc------------------------hHHHHHH Q lcl|NC_013059. 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIA------------------------GFEHMYD 364 (725) Q Consensus 309 ~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~------------------------~~~~~~~ 364 (725) ..+.|.+..+++.++.+|..+|.+...+...+....++.-.... ....... T Consensus 231 -----~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (506) T protein:vir:94 231 -----NFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIK 305 (506) T ss_pred -----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHh Confidence 12557888999999999999999987765444332221100000 0000000 Q ss_pred hhccccccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHH Q lcl|NC_013059. 365 GNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADL 444 (725) Q Consensus 365 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~ 444 (725) .......+.+.......|.-....++++..+.-..+....++.....|-..|++.+.+.+.-++..||+|+..+...... T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~ 385 (506) T protein:vir:94 306 EMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVE 385 (506) T ss_pred hhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHH Confidence 00000000000000001111122344454444556777788899999999999877655544456799999987766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCc Q lcl|NC_013059. 445 ETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSF 524 (725) Q Consensus 445 ~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~ 524 (725) .....-.-|..+++++.++++.++.... ..... |+ .+|.|.-.+.. T Consensus 386 k~~~k~~~~~~~l~~~~~li~~~~~~~~----------~~~~~---------------------d~---~~i~i~f~~~~ 431 (506) T protein:vir:94 386 LASTKRRMFERGLYARYQIISDIENSIH----------GDWTF---------------------DP---QELTFTFRDNL 431 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcC----------Ccccc---------------------cc---ccceEEeCCCC Confidence 6666666666777766666666554321 10000 01 23445555666 Q ss_pred hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHH-HHHHhh Q lcl|NC_013059. 525 QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQ-QAKQGQ 603 (725) Q Consensus 525 ~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~-q~qq~q 603 (725) +.-..+..+.+..+.+. .|. ..++..++..+ ..+.-++++.++...... ...... -....+ T Consensus 432 p~d~~e~a~~~~kl~g~----iS~--et~~~~lp~v~--d~~~E~~ri~~E~~~~~~----------~~~~~~~~~~~~~ 493 (506) T protein:vir:94 432 PADNISQIKALVQAGAT----LPQ--KYLYQQLPGVT--NPQDIVDMMKEQSANGDY----------SFDQNGVISNDGQ 493 (506) T ss_pred CcCHHHHHHHHHHHhcc----CCh--HHHHHhCCCCC--CHHHHHHHHHHHHHHHhh----------cchhhcCCCcccC Confidence 55444445555544332 222 11122222221 112222222221100000 000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 604 QDPAMVQAQGVLLQGQAELAKAQNQTLS 631 (725) Q Consensus 604 ~q~~~~~~qa~~~k~qae~~kaqae~~k 631 (725) .. +.+....+.-| T Consensus 494 ~~---------------~~~~~~~~e~~ 506 (506) T protein:vir:94 494 TN---------------TTATQTDEEVR 506 (506) T ss_pred cc---------------ccccccccCCC Confidence 00 00000000000 No 39 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.73 E-value=1.4e-16 Score=107.56 Aligned_cols=470 Identities=11% Similarity=0.030 Sum_probs=216.6 Q ss_pred CCcHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTA-SDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~-~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~~nr 75 (725) |.......-.-.......+.. ...-+....+-.+||.|.|.--......+..+ |.++|..+.+|+..+|+...+. T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p 110 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccC Confidence 331111110001112222222 22234455667899999876321111222223 3467999999999999999988 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) +.+.+ ++.+ ....+..+.+.|+++...+....++++.|.+|.-+..+ ++ | .+.++.. ++.+ T Consensus 111 ~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~d---ed--g-~~~i~~~----~p~~ 171 (511) T protein:vir:10 111 IQYQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRN---QD--D-ETRLYKS----DAMS 171 (511) T ss_pred ceeec-----CchH----HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEEE----ccce Confidence 87753 2222 34567778888999999999999999999999776543 22 1 2333321 2222 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++||..... . ..++++.|.... . ...+.+.+..+++|.... ++ T Consensus 172 ~~~vydd~~~~-~----~~~~vr~~~~~~------~--------------------d~~~~~~~~~~~iyt~~~----i~ 216 (511) T protein:vir:10 172 TFVIYDNTIER-N----SIAGVRYLRTKP------I--------------------DKTDEDEVFTVDLFTSHG----VY 216 (511) T ss_pred eEEEEcCCCCC-c----eEEEEEEEEeee------c--------------------ccCccceEEEEEEEeCCc----EE Confidence 335543311 0 122333321100 0 001123344455555431 11 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) .+....++.. . ......++.|.|++.+|+|+|. .+ T Consensus 217 ~~~~~~~~~~-~-------------------------------------~~~~~~~~~~~~~~~vPvv~f~------nn- 251 (511) T protein:vir:10 217 RYLTSRTNGL-K-------------------------------------LTPRENGFESHSFERMPITEFS------NN- 251 (511) T ss_pred EEEecCCCcc-c-------------------------------------ccccccccccccCcceeEEEec------CC- Confidence 1211111100 0 0000112344555556666542 21 Q ss_pred cccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhcccccccc-ccccccC--ccccccCC Q lcl|NC_013059. 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLL-NRTDENN--GEMPTQPL 389 (725) Q Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~-~~~~~~~--g~~~~~~~ 389 (725) ..+.|.+..+++.++.+|...|.+...+...+....++ .|.. .+................ ....... +......+ T Consensus 252 ~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 330 (511) T protein:vir:10 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-eccccCCchhhccchhccceecccccccccccccCCCCcce Confidence 23568899999999999999999998887666554432 2211 111111000000000000 0000000 11112233 Q ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVN 469 (725) Q Consensus 390 ~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~ 469 (725) +++..+.-..++...+......|-.+|++.+.+.+.-++..||+|+..............-.-|..+++++.++++.++. T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 410 (511) T protein:vir:10 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44443333455667788888999999988776666544557999998877666666666666666777666666555432 Q ss_pred HhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchH Q lcl|NC_013059. 470 DIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEY 549 (725) Q Consensus 470 ~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~ 549 (725) ..- +.+.. .|+ .+|.|.-.+..+.-..+..+.++.+.+.++. T Consensus 411 ~~~---------~~~~~---------------------~d~---~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~----- 452 (511) T protein:vir:10 411 NTR---------SIDAN---------------------KDF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ----- 452 (511) T ss_pred hhC---------Ccccc---------------------ccc---ceeeEEeCCCCCcCHHHHHHHHHHHhccCcH----- Confidence 110 00000 011 2556666666666555566666666443322 Q ss_pred HHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 550 QLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 550 ~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) ..++..++..+ ..+.-++++.+.... .....+................. + .+-..++ T Consensus 453 -et~~~~l~~v~--d~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~------~-~~~~~~~ 509 (511) T protein:vir:10 453 -TTLMSLFSFFQ--DPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDD------D-TKDTVDK 509 (511) T ss_pred -HHHHHhCCCCC--CHHHHHHHHHHHHHH-------------HHHHHhhhcccCCCCCCCCCCCC------c-ccCcccc Confidence 12222222222 112222322221100 00000000000000000000000 0 0000000 Q ss_pred HH Q lcl|NC_013059. 630 LS 631 (725) Q Consensus 630 ~k 631 (725) .+ T Consensus 510 ~~ 511 (511) T protein:vir:10 510 KE 511 (511) T ss_pred cC Confidence 00 No 40 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.73 E-value=2.2e-16 Score=106.44 Aligned_cols=444 Identities=9% Similarity=0.035 Sum_probs=208.5 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHHHH-------HHHhhcCCCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ--WDDWLS-------QYTTLQYRGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~~~-------~~l~~~grp~~N~i~~~v~~v~g~~ 71 (725) |=-+.+++.+++..+ ++.....+....+..+||.|+| |..... ...+..-|.++|..+.+|+..+|+. T Consensus 18 ~~~~~~~~~~~i~~~---i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l 94 (472) T protein:vir:93 18 TNNKPETLEEMIVRY---IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 94 (472) T ss_pred ecCchhhHHHHHHHH---HHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhh Confidence 211223444444333 3334455667777889999975 211111 0111112346799999999999999 Q ss_pred hhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeec Q lcl|NC_013059. 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHS 151 (725) Q Consensus 72 ~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~ 151 (725) ..+.+.+.+ +|.+..+.+. .+. .|+++.....++.+++++|.||.-|..+ ++ + .+.+.. . T Consensus 95 ~g~~~~~~~-----~d~~~~~~l~----~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d---~d--~-~~~i~~----~ 154 (472) T protein:vir:93 95 VGKPIAFKH-----TDDEVVKRID----EVL-GNRFDDKLHSVLTGASNKGIEWLHPYLD---EE--G-EFKLFR----V 154 (472) T ss_pred cccCeeecc-----CChHHHHHHH----HHH-hccHHHHHHHHHHHHhhcCeEEEEEEEC---CC--C-ceEEEE----E Confidence 887766532 3444444443 333 3689999999999999999999877543 22 1 233332 1 Q ss_pred chhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 152 ACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 152 ~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++.+ ++||+.... + -..+++.|-..+ .. -.++|....+ T Consensus 155 ~p~~~~~i~d~~~~~----~-~~~~ir~~~~~~-------------------------------~~---~~~~~~~~~~- 194 (472) T protein:vir:93 155 PAEQGIPIWTDKEHE----E-LEAFIRMYKLEN-------------------------------ET---KVEYWDKVTV- 194 (472) T ss_pred cccceEEEEcCCCCC----c-eEEEEEEEEeec-------------------------------ce---eEEEEecCeE- Confidence 2232 445543221 1 112333332110 00 1233322111 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ..+.+.. +..+...... ......+..+.+.+.+|+|+|... T Consensus 195 -~~~~~~~---~~~~~~~~~~--------------------------------~~~~~~~~~~~~~~~vPvv~~~nn--- 235 (472) T protein:vir:93 195 -NYYVYEN---GSLIPDYSNN--------------------------------LENSKTHFSTGSWGKIPFIPFKNN--- 235 (472) T ss_pred -EEEEEec---Ceeeeccccc--------------------------------ccccccccccCCCCCcceEEecCC--- Confidence 1111111 1111110000 000011223445566777766321 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCC Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 389 (725) ..+.|.+..+++.++.+|..+|.+...+...+...+++.-.......+........ .. +....+ ... T Consensus 236 ----~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~--~~---~~~~~~----~~~ 302 (472) T protein:vir:93 236 ----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY--GA---IKVSDN----GGV 302 (472) T ss_pred ----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhc--cc---cccCCC----Ccc Confidence 12568889999999999999999998887776665543211111111111111111 01 111111 123 Q ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVN 469 (725) Q Consensus 390 ~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~ 469 (725) +++..+.-..++...++.....|-..+++.+.+.+.-++..||+|+..............-..|..+++++.++++.++ T Consensus 303 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~- 381 (472) T protein:vir:93 303 DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF- 381 (472) T ss_pred eeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 3333222235556678888899999998877776665566799998877666666666666666777766666655543 Q ss_pred HhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchH Q lcl|NC_013059. 470 DIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEY 549 (725) Q Consensus 470 ~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~ 549 (725) ... + + -.++.|.-.+..+.-..+..+.++.+.+.++. T Consensus 382 ---~~~------~--------------------------~---~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~----- 418 (472) T protein:vir:93 382 ---DIK------G--------------------------E---HKDVDISFNYNKVANTELQVQTAQQSMGIVSH----- 418 (472) T ss_pred ---CCC------c--------------------------c---cceeeEEeCCCCCCCHHHHHHHHHHHhccCch----- Confidence 210 0 0 12344445666665444555555555433221 Q ss_pred HHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 550 QLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 550 ~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) ...+..++.. +..+..++++.+.........+.... ....- ..+.+. ... T Consensus 419 -et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~----------------------~~~d~-~~~~~~----~~~ 468 (472) T protein:vir:93 419 -ETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDD----------------------GGADG-AQQQER----SNN 468 (472) T ss_pred -HHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhccCcCc----------------------ccCCC-CCCCCC----CCc Confidence 1222222222 12223333332211000000000000 00000 000000 000 Q ss_pred HHHH Q lcl|NC_013059. 630 LSLQ 633 (725) Q Consensus 630 ~k~q 633 (725) ...+ T Consensus 469 ~~~e 472 (472) T protein:vir:93 469 KESE 472 (472) T ss_pred ccCC Confidence 0000 No 41 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.73 E-value=6e-16 Score=104.04 Aligned_cols=423 Identities=9% Similarity=0.019 Sum_probs=204.5 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCC----CcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYR----GQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~gr----p~~N~i~~~v~~v~g~~~~nr~ 76 (725) |-.+ .+.++...+. ..+....+-.+||.|+| +-.....+..++ .++|..+.+|+..+|+...+.+ T Consensus 1 l~~~--~l~~~i~~~~-------~~~~r~~~l~~yy~g~~--~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~ 69 (429) T protein:vir:98 1 MTKD--LLSELIQKHR-------SFNLSYSAYKQLYEGDH--AILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPV 69 (429) T ss_pred CCHH--HHHHHHHHHH-------HHHHHHHHHHHHhcccc--ccccccccccCCCcceeecchHHHHHHHHhhhhcccCc Confidence 4333 3444444332 22234455678999987 111122233333 3579999999999999988876 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~- 155 (725) .+.+ . + +.....+..+.+.|+++...+.+.++++++|.||+-+..+ ++ | .+.+++. ++.+ T Consensus 70 ~~~~--~---~----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~--g-~~~~~~~----~p~~~ 130 (429) T protein:vir:98 70 QTSH--E---N----KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFND---EN--A-EAGITYL----TPLEA 130 (429) T ss_pred eeec--C---C----hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEec---CC--C-cEEEEEE----cccce Confidence 6543 2 1 2244467777888999999999999999999999877543 22 2 2223221 2222 Q ss_pred -eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 -VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 -v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) ++||..... ....+++.|. ..+.+...++|.... +.. T Consensus 131 ~~v~dd~~~~-----~~~~~i~~~~---------------------------------~~~~~~~~~~~~~~~----~~~ 168 (429) T protein:vir:98 131 FIVYDDSIRQ-----KPLFAVRYFY---------------------------------NKGGVLEGSYSDASN----ITY 168 (429) T ss_pred EEEEeCCCCC-----ceEEEEEEEE---------------------------------ecCceEEEEEEeCce----EEE Confidence 233321110 0111121111 112233444443211 111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +.+. .+...+.+..|.+.+.+|+|+|.. + . T Consensus 169 ~~~~-------------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n------~-~ 198 (429) T protein:vir:98 169 FKDG-------------------------------------------EKGIEIGESEPHPFDGVPMIEYVE------N-E 198 (429) T ss_pred EEec-------------------------------------------CCceEecccccccCCccceEEecC------C-C Confidence 1110 001112233445556677766521 1 2 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 394 (725) .+.|.+..+++.++.+|+..|.+...+...+....++. |.-.. ++............ +...+|. ...++++.. T Consensus 199 ~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~-g~~~~-~~~~~~~~~~~~~~---~~~~~~~--~~~~~~l~~ 271 (429) T protein:vir:98 199 ERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKIL-GAELD-DETLKSLRDTRIIN---LKDTDAQ--QLTVEFLQK 271 (429) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCCC-cchhhhHhhCceee---ccCCCCC--CcceeEEee Confidence 35688999999999999999999988777766554432 22111 11111111111111 1111111 112344443 Q ss_pred CCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013059. 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV 474 (725) Q Consensus 395 ~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~ 474 (725) +.-..++...++...+.|-..|++.+.+.+..+| .||.|+..............-..|..+.+++.++++. +.. T Consensus 272 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~----~~~- 345 (429) T protein:vir:98 272 PDADATQEHLLDRLENLIFRTAMVANISDESFGT-ASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIAS----YPT- 345 (429) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCccccCcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhc- Confidence 3333445567888899999999887666655444 5999998776655555555566666666665555544 332 Q ss_pred CcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHH Q lcl|NC_013059. 475 PRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLL 554 (725) Q Consensus 475 ~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~ 554 (725) +.+. .. |. .+|.|.-.+..+.-..+..+.++.+.+.+ |. -.++ T Consensus 346 -----~~~~--~~---------------------d~---~~i~v~f~~~~p~~~~~~a~~~~kl~g~i----s~--et~~ 388 (429) T protein:vir:98 346 -----SKIG--PK---------------------DW---IGIKYKFTRNLPANLLEESQIAGNLAGIV----SE--ETQV 388 (429) T ss_pred -----cCCC--cc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccC----ch--HHHH Confidence 1111 00 11 24555556666654445555555543322 22 1122 Q ss_pred HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHH Q lcl|NC_013059. 555 QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPA 607 (725) Q Consensus 555 ~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~ 607 (725) ..++..+ ..+.-.+++.+... +..+.++.....+-....++ T Consensus 389 ~~l~~v~--d~~~E~~ri~~E~~----------~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 389 GVLSIVE--NPQKEIERKNSDKS----------TLISRQAGGLNGQNTTTILE 429 (429) T ss_pred HhCCCCC--CHHHHHHHHHHHHH----------HHHHHHHhhhcCCCCCCCCC Confidence 2222221 11222222221100 00000000000000001111 No 42 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.72 E-value=3.4e-16 Score=105.36 Aligned_cols=443 Identities=9% Similarity=0.055 Sum_probs=209.4 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHHH---HHHHhhc----CCCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ--WDDWL---SQYTTLQ----YRGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~~---~~~l~~~----grp~~N~i~~~v~~v~g~~ 71 (725) +-.+.+.+.+++..|- +.....+....+-.+||.|.| |.... ....... -|.++|..+.+|+..+|+. T Consensus 29 ~~~~~e~~~~~i~~~i---~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l 105 (483) T protein:vir:12 29 TNNKPETLEEMIVRYI---KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 105 (483) T ss_pred cCCchhhHHHHHHHHH---HHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhh Confidence 3333344444444333 333445566777889999985 11110 0011111 2345799999999999999 Q ss_pred hhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeec Q lcl|NC_013059. 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHS 151 (725) Q Consensus 72 ~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~ 151 (725) -.+.+.+.+ +|.+..+.+ +.+.. |+++...+..+.+++++|.||.-|..+ ++ | .+.++. . T Consensus 106 ~G~p~~~~~-----~d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d---~d--~-~~~i~~----~ 165 (483) T protein:vir:12 106 VGKPIAFKH-----TDDEVVKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---EE--G-EFKLFR----V 165 (483) T ss_pred cccCceecc-----CChHHHHHH----HHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEc---CC--C-ceEEEE----E Confidence 887766532 344444443 33333 678999999999999999999877653 22 2 233332 2 Q ss_pred chhh--eeeCCCc-cccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecc Q lcl|NC_013059. 152 ACSH--VIWDSNS-KLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) Q Consensus 152 ~~~~--v~~Dp~a-~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~ 228 (725) +|.+ ++||+.. .++ ..+++.|-..+ .. -.++|....+ T Consensus 166 ~p~~~~~v~d~~~~~~~------~~~ir~~~~~~-------------------------------~~---~~~~y~~~~v 205 (483) T protein:vir:12 166 PAEQGIPIWTDKEHEEL------EAFIRMYKLEN-------------------------------ET---KVEYWDKVTV 205 (483) T ss_pred cccceEEEEcCCCCCce------EEEEEEEEeec-------------------------------ce---EEEEEecCeE Confidence 3333 3455432 221 12233321100 00 1233322111 Q ss_pred eeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeee Q lcl|NC_013059. 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWG 308 (725) Q Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~ 308 (725) ..+.+.+ |..+...... .+ ....+..|.+.+.+|+|+|... T Consensus 206 --~~~~~~~---~~~~~~~~~~-------------------------------~~-~~~~~~~~~~~g~vPvv~~~nn-- 246 (483) T protein:vir:12 206 --NYYVYEN---GSLIPDYSNN-------------------------------LE-NSKTHFSTGSWGKIPFIPFKNN-- 246 (483) T ss_pred --EEEEEeC---Ceeeeccccc-------------------------------cc-ccccccccCCCCccceEEecCC-- Confidence 1111111 2111111000 00 0001223444456666665321 Q ss_pred ccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 309 ~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+.|.+..+++.++.+|...|.+...+...+...+.+.-...+........... +.. +...++ .. T Consensus 247 -----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~--~~~---~~~~~~----~~ 312 (483) T protein:vir:12 247 -----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRY--YGA---IKVSDN----GG 312 (483) T ss_pred -----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhh--ccc---cccCCC----Cc Confidence 1256888999999999999999999888777666554321111111111111111 011 111111 12 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..++...++...+.|-..|++.+.+.+.-++..||+|+..............-..|..+++++.++++++ T Consensus 313 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~- 391 (483) T protein:vir:12 313 VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH- 391 (483) T ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 3444433333555667888888999999887766665555679999987766666666666666667766666655543 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) ... .+ + -.|+.|.-.+..+.-..+..+.++.+.+.++. T Consensus 392 ---~~~---------~~-----------------------~---~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~---- 429 (483) T protein:vir:12 392 ---FDI---------KG-----------------------E---HKDVDISFNYNKVANTELQVQTAQQSMGIVSH---- 429 (483) T ss_pred ---hcC---------CC-----------------------c---cceeeEEeCCCCCCCHHHHHHHHHHHhccCch---- Confidence 221 00 0 12444555666665455555555555433221 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQ 628 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae 628 (725) ...+..++.. +..+.-++++.+.........+........ ... +..+. .++|. + T Consensus 430 --et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d---~~~-~~~~~-------------~~~e~-----e 483 (483) T protein:vir:12 430 --ETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDDGGAD---GAQ-QQERS-------------NNKES-----E 483 (483) T ss_pred --HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcccccccccC---Ccc-cCCCC-------------CcccC-----C Confidence 1122222222 222233333332211100000000000000 000 00000 00000 0 No 43 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.72 E-value=6.7e-15 Score=98.27 Aligned_cols=532 Identities=11% Similarity=-0.034 Sum_probs=240.1 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc--CCCCCHHHHHHH-hhcCCCcccchHHHHHHHHH----HHhh Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSR--VSQWDDWLSQYT-TLQYRGQFDVVRPVVRKLVS----EMRQ 73 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~--G~QW~~~~~~~l-~~~grp~~N~i~~~v~~v~g----~~~~ 73 (725) ||+. ..+++..+|......-..|...+.+..+|.. ..-+..++...- +...++.-+.....++.+.+ .-.- T Consensus 1 m~~~--~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (559) T protein:vir:95 1 MAET--TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) T ss_pred CChh--hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 9986 3556777777777777777777788888863 222332222111 11223322444445554433 2222 Q ss_pred -CCcceEEecCCcch---HHHHHHHH---HHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE Q lcl|NC_013059. 74 -NPIDVLYRPKDGAS---PDAADVLM---GMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) Q Consensus 74 -nr~~~~~~pr~~~d---~~~Ae~l~---~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~ 146 (725) +++=+++.+.+++. .++.+.|. ..+......+++..+...+|.+.++.|.|++-+ ++++.+ .++... T Consensus 79 p~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~-----~~d~~~-~~r~~~ 152 (559) T protein:vir:95 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV-----LDDDED-IIRTMP 152 (559) T ss_pred CCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEe-----ecCCCc-eeEEEE Confidence 56667776655432 23333333 334444557889999999999999999998643 233322 233333 Q ss_pred EeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 147 ~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) . ++.++++..++.-. .--+++...||...+.+.|+.-......... ... ......+.|+.+-|.+ T Consensus 153 ~----~l~~~~v~~d~~G~----vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~---~~~----~~~~~~v~v~~~V~pr 217 (559) T protein:vir:95 153 F----PIGSYYLANSPRGS----VDTCFRKFSMTVRQLVQEFGLNNVSESVKSM---WES----GTYEKWIEVMHSVYPN 217 (559) T ss_pred e----ecCeEEEeeCCCCC----eEEEEEeEecCHHHHHHHcCcccCCHHHHHH---Hhc----CCCCCeEEEEEEEecc Confidence 3 44568887776431 1226888899999999988752211110000 111 1112345555543332 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEe-eccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) ... ++ +. .+..+ ..-..|+|..- .+.++|.+ +-| ..+||+|+-. T Consensus 218 ~~~-------~~--~~---~~~~~--------------------~pf~s~~~e~~~~~~~~l~e-sg~--~e~P~~~~Rw 262 (559) T protein:vir:95 218 IDR-------DT--SK---LDSKN--------------------KPFKSVYYEVGGDNDKLLRE-SGF--DEFPIMAPRW 262 (559) T ss_pred ccc-------cc--cc---ccccc--------------------ceEEEEEEEecCCCceeeec-CCc--ccCCccceee Confidence 111 10 00 01100 00112333322 22345533 333 6699987754 Q ss_pred eeeccCCccccchh-hhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGV-VRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 306 ~~~~~d~~~~~~G~-vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) . -++|..|+.|. +.+..+-.+.+|......+....++.+.++.++.+..... .+..|+.. +.+....|. T Consensus 263 ~--~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~------~~l~pgg~-~~~~~~~~~- 332 (559) T protein:vir:95 263 E--VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR------ASLLPGDI-TYIDQITGQ- 332 (559) T ss_pred e--ecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------eeeeccce-eeeCCCCCc- Confidence 4 35888888885 9999999999999999889999999999988876543211 11122221 111111111 Q ss_pred cccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCCh-HHhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013059. 385 PTQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGV-DAEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLAT-AMRR 459 (725) Q Consensus 385 ~~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~-~~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~ 459 (725) ..+++.. ++.+ ..+...++...+.|....-.+- .+++. ++..+++.=|..+.+.....|..++.+|.. ...= T Consensus 333 --~~i~p~~~~~~~~-~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~P 409 (559) T protein:vir:95 333 --DGFRPAYLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNP 409 (559) T ss_pred --ccceeecccccch-HHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 2222221 2222 2223345666666766664322 12333 333456667888888888888888777753 3333 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +-+..++++.+. |. |.+ ..+ .+. ..++|.+.. |-...+|...+..+.++ T Consensus 410 li~r~~~il~r~-------------g~-----lP~-~p~----------~l~~~~i~v~~is-~La~aqk~~~~~~i~~~ 459 (559) T protein:vir:95 410 LIDRSFSMMVRK-------------NM-----LPP-PPD----------VMEGMPLKVEYIS-VMAQAQKSIGLSSLAST 459 (559) T ss_pred HHHHHHHHHHhc-------------CC-----CCC-Ccc----------cccCcceEEEeec-HHHHHHHHHHHHHHHHH Confidence 333333333332 11 000 000 011 123444422 22333455555555554 Q ss_pred HHhcc---cccchHHHHHHHhhccCCchhHHHHHHHHhhhhhh-hhhhhccchhhhHHH-HHHHHHHHhhHHHHHHHHHH Q lcl|NC_013059. 539 LGKTP---QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ-MGVKKPETPEEQQWF-VEAQQAKQGQQDPAMVQAQG 613 (725) Q Consensus 539 l~~~~---~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~-~~~~~~~~~e~~~~~-~q~~q~qq~q~q~~~~~~qa 613 (725) ++.+. +..|.. ++..|+ ++++..+...... ..+.. ++++.+++ ++.+++++++++.++...-+ T Consensus 460 ~~~~~~laq~~Pev-------ld~id~---d~~~~~~a~~~Gvp~~~ir--s~~ev~~~rqqr~~~qq~~q~~~~~~~aa 527 (559) T protein:vir:95 460 VNFIGQLAQVKPEA-------LDKLNV---DQAIDAFADMSGVSPTVIV--PQEQVEQARQQRAQQQQQQQMMAMGMAAA 527 (559) T ss_pred HHHHHHHhccChhh-------hhcCCH---HHHHHHHHHHhCCchhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44433 333321 233344 3333332222211 11111 11111111 11111111000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQ 662 (725) Q Consensus 614 ~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~ 662 (725) ...+ ....+++.. -++ .+...++. .. .-.++| T Consensus 528 ~~~~---~~~~~~~~~----~~~----l~~~~~~~-----~~-~~~~~~ 559 (559) T protein:vir:95 528 QGVK---TLSEAKTSD----PSV----LSAMANAV-----SG-QGGQSQ 559 (559) T ss_pred Hhhh---ccccccCCC----hhH----HHHHHHhh-----cC-ccccCC Confidence 0000 000000000 000 00000000 00 000000 No 44 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.72 E-value=3.3e-16 Score=105.43 Aligned_cols=437 Identities=10% Similarity=0.051 Sum_probs=211.4 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |..+..+..+++..|-..+. ..+....+-.+||.|.| +......+..+ |.++|..+.+|+..+|+.-.+.+ T Consensus 11 ~p~d~~~~~~~l~~~i~~~~---~~~~r~~~~~~yy~g~~--~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~ 85 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHR---LEVARYEYLKNMYRGIM--AIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNGIPV 85 (453) T ss_pred cCCCCCCCHHHHHHHHHHHH---HHHHHHHHHHHHhhccC--chhcCCCccccCccceeecchHHHHHHHHhhhhcccCc Confidence 76655555555555544432 22334566678999976 11111122223 34579999999999999988775 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh-- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS-- 154 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~-- 154 (725) .+.+ . |.+ ....+..++..|+++.....+.++++++|.||+-|..+ ++ | .+.+++. ++. T Consensus 86 ~~~~--~---d~~----~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d---~~--g-~~~i~~~----~p~~~ 146 (453) T protein:vir:39 86 KKSH--S---DKE----TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQN---EE--T-QTNVIYN----TPENM 146 (453) T ss_pred eecc--C---ChH----HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEec---CC--C-ceEEEEE----cccce Confidence 5542 2 221 23457777888999999999999999999999887653 22 2 2333321 222 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) .++||+.... ...+ +++.+. ..+.+..+++|... .++. T Consensus 147 ~~v~d~~~~~----~~~~-~ir~~~---------------------------------~~~~~~~~~~yt~~----~i~~ 184 (453) T protein:vir:39 147 FMVYDDTIKQ----EPLF-AVRYGY---------------------------------DDDYKLYGEVYTKE----TTYA 184 (453) T ss_pred EEEecCCCCC----eEEE-EEEEEE---------------------------------eCCeEEEEEEEeCC----eEEE Confidence 2445543321 0111 111110 01223344555432 1111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +.. .+ +...+.++.|.+.+.+|+|||... . T Consensus 185 ~~~-~~------------------------------------------~~~~~~~~~~~~~g~vPvv~~~n~-------~ 214 (453) T protein:vir:39 185 LNG-TM------------------------------------------GFYNMTEQAPNPFDDLPVVEFYFN-------E 214 (453) T ss_pred EEe-cC------------------------------------------CceeeecccccCCCceeEEEecCC-------C Confidence 110 00 010112333444456677766321 1 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 394 (725) .+.|.+..+++.++.+|+.+|.+...+...+....++.-..+++. ........... . +....+.-....+..+.. T Consensus 215 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~--~~~~~~~~~~~--~-~~~~~~~~~~~~~~~lt~ 289 (453) T protein:vir:39 215 ERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEE--DLKNIRSNRVI--N-YYGESSEAKNVDVKFLEK 289 (453) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCch--hhhhhhhccee--e-ecCCCCCCCCCceeEEee Confidence 255778899999999999999998888666665544432122211 11111110000 0 000111111222344433 Q ss_pred CCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013059. 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV 474 (725) Q Consensus 395 ~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~ 474 (725) +.-..+....+......|-.+|++.+.+.+..+| .||.|+..............-..|..+++++.++++.+... T Consensus 290 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~---- 364 (453) T protein:vir:39 290 PDSDSQTENLLDRLTKLIFQTTMVANISDESFGS-SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTN---- 364 (453) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccccC-ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Confidence 3223455557788888888888876655554444 59999987766655555566666667777666665554321 Q ss_pred CcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHH Q lcl|NC_013059. 475 PRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLL 554 (725) Q Consensus 475 ~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~ 554 (725) .|.. . |+ .||.|.-.+..+....+..+.++.+.+.++. .... T Consensus 365 ------~~~~--~---------------------~~---~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~---et~l--- 406 (453) T protein:vir:39 365 ------VSNK--E---------------------AW---KDIEYTFTRNEPKDIKEQAETANILMGITSQ---ETAL--- 406 (453) T ss_pred ------cCCc--c---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccCCh---HHHH--- Confidence 1110 0 01 2455555666665445555555555433322 1222 Q ss_pred HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHH Q lcl|NC_013059. 555 QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPA 607 (725) Q Consensus 555 ~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~ 607 (725) ..++..+ ..++-++++.++........+......+ . .+.....-..+ T Consensus 407 ~~l~~v~--D~~~E~~ri~~E~~~~~~~~~~~~~~~~-~---~~~~~~~~~~e 453 (453) T protein:vir:39 407 SVISVIP--DVQAEMEKIKKEEASTAIFDKDKQPSEK-G---TDTVVPETNEE 453 (453) T ss_pred HhCCCCC--CHHHHHHHHHHHHHHHHHHHHhccCCCC-C---CCCCCCCcCCC Confidence 2222221 1222233333221111000000000000 0 00000000000 No 45 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.72 E-value=7.3e-16 Score=103.58 Aligned_cols=442 Identities=9% Similarity=0.045 Sum_probs=206.9 Q ss_pred CC---cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC-CHHHHH--------HHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MA---DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQW-DDWLSQ--------YTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 ma---d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW-~~~~~~--------~l~~~grp~~N~i~~~v~~v~ 68 (725) |. .+.++..+++.+|-. ....-+....+-.+||.|++= ...... ..+..-|.++|..+.+|+..+ T Consensus 35 ~~~~~~~~~~~~~~i~~~i~---~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~ 111 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRYIK---QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKV 111 (492) T ss_pred cccCCCchhhHHHHHHHHHH---HHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHh Confidence 32 333455554444433 333445566777899999751 000001 111112346799999999999 Q ss_pred HHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEe Q lcl|NC_013059. 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) Q Consensus 69 g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~ 148 (725) |+...+.+.+. + +|.+..+.+ +.+.+ |+++...+.+..+++++|.||.-+..+ ++ | .+.+++ T Consensus 112 ~yl~g~p~~~~--~---~d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d---~d--g-~~~~~~-- 173 (492) T protein:vir:97 112 SYIVGKPIAFK--H---TDDEVVKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---EE--G-EFKLFR-- 173 (492) T ss_pred hhhcccCceec--c---CchHHHHHH----HHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEec---CC--C-ceEEEE-- Confidence 99988876653 2 333333433 33433 789999999999999999999877543 22 1 233332 Q ss_pred eecchhh--eeeCCCc-cccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 149 IHSACSH--VIWDSNS-KLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 149 ~~~~~~~--v~~Dp~a-~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) .++.+ ++||+.. +++ ..+++.|-..+ .. ..++|.. T Consensus 174 --~~p~~~~~i~d~~~~~~~------~~~vr~~~~~~-------------------------------~~---~~~~y~~ 211 (492) T protein:vir:97 174 --VPAEQGIPIWTDKEHEEL------EAFIRMYKLEN-------------------------------ET---KVEYWDK 211 (492) T ss_pred --EcccceEEEEcCCCCCce------EEEEEEEeecc-------------------------------ce---eEEEEec Confidence 12332 4455332 221 22333332100 00 1233333 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) ..+ ..+.+.+ |......... .+. ...+..|.+.+.+|+|+|.. T Consensus 212 ~~v--~~~~~~~---~~~~~~~~~~-------------------------------~~~-~~~~~~~~~~g~vPvv~~~n 254 (492) T protein:vir:97 212 VTV--NYYVYEN---GSLIPDYSNN-------------------------------LEN-SKTHFSTGSWGKIPFIPFKN 254 (492) T ss_pred CeE--EEEEEec---Ceeeeccccc-------------------------------ccc-cccccccCCCCCcceEEecC Confidence 211 1122211 1111100000 000 01122344445666666532 Q ss_pred eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 306 ~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) . ..+.|.+..+++.++.+|..+|.+...+...+.....+ .|.. ....+........ . .+....+ T Consensus 255 n-------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~--~---~~~~~~~-- 319 (492) T protein:vir:97 255 N-------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYY--G---AIKVSDN-- 319 (492) T ss_pred C-------CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccchhHHHHHhhc--c---ceecCCC-- Confidence 1 12568888999999999999999998887776665443 2211 1111111111110 0 1111111 Q ss_pred cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIY 464 (725) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~l 464 (725) ...+++..+.-..+....++...+.|-..|++.+.+.+.-++..||+|+..............-..|..+++++.+++ T Consensus 320 --~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li 397 (492) T protein:vir:97 320 --GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFV 397 (492) T ss_pred --CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123344333233455667888889999999887766665556679999887766666665566666666666665555 Q ss_pred HHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 465 QSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 465 l~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~ 544 (725) +.+ ... .+ + -.++.|.-.|..+.-..+..+.++.+.+.++. T Consensus 398 ~~~----~~~------~~--------------------------~---~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~ 438 (492) T protein:vir:97 398 FEH----FDI------KG--------------------------E---HKDVDISFNYNKVANTELQVQTAQQSMGIVSH 438 (492) T ss_pred HHH----hcC------Cc--------------------------c---cceeeEEecCCCCCCHHHHHHHHHHHhccCch Confidence 443 221 00 0 12444455666665444555555555433221 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ...+..++..+ ..+.-++++.+.........+. .......-.+-...... T Consensus 439 ------et~l~~l~~v~--d~~~Eleri~~E~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~ 488 (492) T protein:vir:97 439 ------ETVLENHPFVE--DLQAELERIEQEQTEYNKQLPN----------------------LDDGGADSAQQQERSNN 488 (492) T ss_pred ------HHHHHhCCCCC--CHHHHHHHHHHHHHHHHHhhhc----------------------cccCCCCCCcccccccc Confidence 11222222222 1222233332211100000000 00000000000000000 Q ss_pred HHHH Q lcl|NC_013059. 625 AQNQ 628 (725) Q Consensus 625 aqae 628 (725) .+.+ T Consensus 489 ~~~e 492 (492) T protein:vir:97 489 KESE 492 (492) T ss_pred cccC Confidence 0000 No 46 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.72 E-value=1.2e-14 Score=96.82 Aligned_cols=529 Identities=13% Similarity=-0.009 Sum_probs=242.7 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHhh-cCCCcccchHHHHHHHHHH----Hh- Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSR--VSQWDDWLSQYTTL-QYRGQFDVVRPVVRKLVSE----MR- 72 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~--G~QW~~~~~~~l~~-~grp~~N~i~~~v~~v~g~----~~- 72 (725) ||+.. -.++..+|......-..|...+.+..+|.. ..-+...+...-.. ..++.-+.....++.+.+. -. T Consensus 1 m~~~~--~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (556) T protein:vir:73 1 MAETE--KERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITS 78 (556) T ss_pred CChhh--HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcC Confidence 99842 345666777776667777777788888863 22343333221111 1233224444445544332 22 Q ss_pred hCCcceEEecCCcchHHHHH------HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAAD------VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae------~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~ 146 (725) -+++=+++.+.+++..+.++ ..+..+......+++..+...+|.+.+..|.|++-+ +.++.+ -+ |. T Consensus 79 p~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~~~~-~~--r~ 150 (556) T protein:vir:73 79 PARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAV-----MEDDQD-VI--RT 150 (556) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeee-----eecCCc-eE--EE Confidence 36777777776654322222 244445555667889999999999999999998633 233332 12 33 Q ss_pred EeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 147 ~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) .+ .++.++++..++.-.- |+ +++...|+...+.+.|+.-+.... .. ...... ..+..+.|+.+-|.+ T Consensus 151 ~~--~~l~~~~~~~d~~G~v--d~--i~r~~~~t~~ql~~~fg~~~l~~~-v~--~~~~~~----~~~~~~~v~~~V~pr 217 (556) T protein:vir:73 151 MP--FPIGSYYLANSPRGSV--DT--CIRQFSMTVRQMVQEFGLDNVSTS-VK--GMWENG----TYETWVEVNHCITPN 217 (556) T ss_pred EE--eecceeEEeeCCCCCe--EE--EEEEEeccHHHHHHHcCcccCCHH-HH--HHHhcC----CccceEEEEEEEecc Confidence 22 3455688887764311 22 788889999999998875221111 00 111111 012234454432221 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeE-EEEEEEEe-eccccccCCCCCCCCccceEEEE Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKR-RRVYKSII-TCTAVLKDKQLIAGEHIPIVPVF 304 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~ 304 (725) ... ++.. .+. ..+. ..++|.-. .+.++|.+ +. |..+||+|+- T Consensus 218 ~~~-------~~~~-----~~~---------------------~~~p~~s~~~~~~~~~~~vl~e-sg--~~e~P~~~~R 261 (556) T protein:vir:73 218 VNR-------DSGK-----MDS---------------------KNKPYRSVYFESGGDSDKLLRE-SG--FDEFPILAPR 261 (556) T ss_pred ccc-------cccc-----cCc---------------------ccceEEEEEEEecCCCceeccc-CC--cccCCceeee Confidence 110 1000 000 1111 12333322 34445533 33 4679998775 Q ss_pred eeeeccCCccccchh-hhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcc Q lcl|NC_013059. 305 GEWGFVEDKEVYEGV-VRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) Q Consensus 305 g~~~~~d~~~~~~G~-vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) .. .++|..|+.|. +.+..+-.+.+|+.....+....+..+.++.++.+..... .+..|+.. +.+...++. T Consensus 262 w~--~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~------~~~~pgg~-~~~~~~~~~ 332 (556) T protein:vir:73 262 WE--VNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQR------VSLLPGDV-TYLDVISGQ 332 (556) T ss_pred ee--ecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------eeeccCcc-ccccCCCCc Confidence 44 35888888895 9999999999999999889999999999998876642211 11122111 111111111 Q ss_pred ccccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCCh-HHhccC-cchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013059. 384 MPTQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGV-DAEAVN-GGQVAYDTVNQLNMRADLETYVFQDNLA-TAMR 458 (725) Q Consensus 384 ~~~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~-~~~G~~-~n~~Sg~ai~~~q~q~~~~~~~~~dn~~-~~~~ 458 (725) +.++++. .+.+ ....++++...+.|....-++- .+++.. +..+++.-|..+.+.....|..++.+|. .... T Consensus 333 ---~~i~p~~~~~~d~-~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~ 408 (556) T protein:vir:73 333 ---DGFKPAYLVNPNT-ADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALN 408 (556) T ss_pred ---cceeeeccccccH-HHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 1222221 2222 3334556667777766664322 123433 3335666788888888888888877775 3333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeecccc-ccceEEEEeccCchhHHHHHHHHHHH Q lcl|NC_013059. 459 RDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIR-GRYECYTDVGPSFQSMKQQNRAEILE 537 (725) Q Consensus 459 ~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~e 537 (725) =+-+..++++.+. |. |.+ .. ..+. +.++|.... |-...+|...+..+.+ T Consensus 409 Pli~r~~~il~r~-------------g~-----lP~-~P----------~~l~~~~i~v~yis-~La~aqk~~~~~~i~~ 458 (556) T protein:vir:73 409 PLIDRVFSIMARK-------------NM-----LPE-PP----------DVLQGMPLRIEYIS-VMAQAQKSIGLTSLSQ 458 (556) T ss_pred HHHHHHHHHHHhc-------------CC-----CCC-Cc----------hhhcCceeEEEeec-HHHHHHHHHHHHHHHH Confidence 3333344433331 11 000 00 0121 123444322 3333345555555555 Q ss_pred HHHhcc---cccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh-hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_013059. 538 LLGKTP---QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQG 613 (725) Q Consensus 538 ll~~~~---~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~-~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa 613 (725) +++.+. +..|.. ++..|+ ++++..+......+ .+. .++++.+.+.++.+++|++++ ..+++ T Consensus 459 ~~~~~~~laq~~Pe~-------~d~id~---d~~~~~~a~~~Gvp~~~i--rs~eev~~~rq~r~~~qq~~~---~~~~~ 523 (556) T protein:vir:73 459 TVGFIGQLAQFKPEA-------LDKLDV---DQAIDAFSEMSGVSPTVI--VPQEQVQGIREERAKQAQAAQ---AMAMG 523 (556) T ss_pred HHHHHHHHhccChhh-------HhcCCH---HHHHHHHHHHcCCChhhc--CCHHHHHHHHHHHHHHHHHHH---HHHHH Confidence 444433 333432 233344 33333332222111 111 111111111111111111000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQ 662 (725) Q Consensus 614 ~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~ 662 (725) .. -++.++. .+.+...- ....+...+++ .+. ++ T Consensus 524 ~~---a~~~~~~---~~~~~~~~-~~~l~~~~~~~-------g~~--~~ 556 (556) T protein:vir:73 524 QA---AAQGAKT---LSETQTSD-PSALTAIANAA-------GAP--QQ 556 (556) T ss_pred HH---HHHHHHH---hhhccCCC-HHHHHHHHHhh-------cCC--CC Confidence 00 0011100 00000000 00000000000 000 00 No 47 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.71 E-value=2.3e-16 Score=106.35 Aligned_cols=467 Identities=10% Similarity=0.046 Sum_probs=216.6 Q ss_pred CC-cHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MA-DNKN---RLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 ma-d~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~ 72 (725) |. .... ..+++........ ...+....+-.+||.|.|.--......+..+ |.++|..+.+|+..+|+.- T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHH---HhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 32 1111 1122222222221 2223344556789999986322111222222 3467999999999999999 Q ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~ 152 (725) .+++.+.+ ++.+ ....+..+.+.|+++........++++.|.+|.-+..+ ++ | .+.++. .+ T Consensus 108 g~p~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d---~d--g-~~~i~~----~~ 168 (511) T protein:vir:78 108 GNPIQYQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYK----SD 168 (511) T ss_pred ccCceeec-----CchH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEE----Ec Confidence 88887753 2222 34567778888999999999999999999999877543 22 1 233332 22 Q ss_pred hhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEeccee Q lcl|NC_013059. 153 CSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) Q Consensus 153 ~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~ 230 (725) +.+ ++||..... . ..++++.|.... .+ ..+.+.+..+++|.... T Consensus 169 p~~~~~v~dd~~~~-~----~~~~vr~~~~~~----------~~----------------~~~~~~~~~~~vyt~~~--- 214 (511) T protein:vir:78 169 AMSTFIIYDNTVER-N----SIAGVRYLRTKP----------ID----------------KTDEDEVFTVDLFTSHG--- 214 (511) T ss_pred ccceEEEEcCCCCC-c----eEEEEEEEEeee----------cc----------------ccccceEEEEEEEeCCc--- Confidence 333 335544321 0 123333332110 00 00112333445554431 Q ss_pred EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeecc Q lcl|NC_013059. 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) Q Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~ 310 (725) ++.+....++... . .....+..|.|++.+|+|+|. T Consensus 215 -i~~~~~~~~~~~~------------------------------------~--~~~~~~~~~~~~g~vPvv~~~------ 249 (511) T protein:vir:78 215 -VYRYLTNRTNGLK------------------------------------L--TPRENSFESHSFERMPITEFS------ 249 (511) T ss_pred -EEEEEecCCCccc------------------------------------c--cccccccccCcCcccceEEec------ Confidence 1222111111000 0 000113344555566666552 Q ss_pred CCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc-cc-ccccCc--cccc Q lcl|NC_013059. 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL-NR-TDENNG--EMPT 386 (725) Q Consensus 311 d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~-~~~~~g--~~~~ 386 (725) .+ ..+.|.+..+++.++.+|...|.+.+.+...+...+++ .|...................+ .. .....+ .... T Consensus 250 n~-~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (511) T protein:vir:78 250 NN-ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGS 327 (511) T ss_pred CC-CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhhe-ecCccCCchhhcccccccceeccccceeccccccCCCC Confidence 11 23568899999999999999999998887665554332 2211100011111000000000 00 000011 1112 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..++++..+.-..++...+....+.|-.+|++.+.+.+..++..||+|+..............-.-|..+++++.++++. T Consensus 328 ~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:78 328 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23344443333455666788888899999988777776655557999999877666666666666777777777666666 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) ++...-... .. .|+ .+|.|.-.+..+.-..+..+.++.+.+.++. T Consensus 408 ~~~~~~~~~---------~~---------------------~~~---~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~-- 452 (511) T protein:vir:78 408 ILKNTRSID---------AN---------------------KDF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ-- 452 (511) T ss_pred HHHhcCCCc---------cc---------------------ccc---ccceEEeCCCCCcCHHHHHHHHHHHhccCCh-- Confidence 542211000 00 011 2455555666665455555666655443322 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) . .++..++..+ ..+..++++.+.... ....................+. -+ T Consensus 453 -e---t~l~~l~~v~--d~~~El~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~-----------~~ 502 (511) T protein:vir:78 453 -T---TLMSLFSFFQ--DPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQD-----------DD 502 (511) T ss_pred -H---HHHHhCCCCC--CHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCC-----------CC Confidence 1 1222232222 122223333221100 0000000000000000000000 00 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQID 635 (725) Q Consensus 627 ae~~k~q~e 635 (725) .+....+.+ T Consensus 503 ~~~~~~e~~ 511 (511) T protein:vir:78 503 TKDTVDKKE 511 (511) T ss_pred ccCcccccC Confidence 000000000 No 48 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.71 E-value=2.3e-16 Score=106.35 Aligned_cols=467 Identities=10% Similarity=0.046 Sum_probs=216.6 Q ss_pred CC-cHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MA-DNKN---RLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 ma-d~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~ 72 (725) |. .... ..+++........ ...+....+-.+||.|.|.--......+..+ |.++|..+.+|+..+|+.- T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHH---HhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 32 1111 1122222222221 2223344556789999986322111222222 3467999999999999999 Q ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~ 152 (725) .+++.+.+ ++.+ ....+..+.+.|+++........++++.|.+|.-+..+ ++ | .+.++. .+ T Consensus 108 g~p~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d---~d--g-~~~i~~----~~ 168 (511) T protein:vir:96 108 GNPIQYQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYK----SD 168 (511) T ss_pred ccCceeec-----CchH----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEE----Ec Confidence 88887753 2222 34567778888999999999999999999999877543 22 1 233332 22 Q ss_pred hhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEeccee Q lcl|NC_013059. 153 CSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) Q Consensus 153 ~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~ 230 (725) +.+ ++||..... . ..++++.|.... .+ ..+.+.+..+++|.... T Consensus 169 p~~~~~v~dd~~~~-~----~~~~vr~~~~~~----------~~----------------~~~~~~~~~~~vyt~~~--- 214 (511) T protein:vir:96 169 AMSTFIIYDNTVER-N----SIAGVRYLRTKP----------ID----------------KTDEDEVFTVDLFTSHG--- 214 (511) T ss_pred ccceEEEEcCCCCC-c----eEEEEEEEEeee----------cc----------------ccccceEEEEEEEeCCc--- Confidence 333 335544321 0 123333332110 00 00112333445554431 Q ss_pred EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeecc Q lcl|NC_013059. 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) Q Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~ 310 (725) ++.+....++... . .....+..|.|++.+|+|+|. T Consensus 215 -i~~~~~~~~~~~~------------------------------------~--~~~~~~~~~~~~g~vPvv~~~------ 249 (511) T protein:vir:96 215 -VYRYLTNRTNGLK------------------------------------L--TPRENSFESHSFERMPITEFS------ 249 (511) T ss_pred -EEEEEecCCCccc------------------------------------c--cccccccccCcCcccceEEec------ Confidence 1222111111000 0 000113344555566666552 Q ss_pred CCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc-cc-ccccCc--cccc Q lcl|NC_013059. 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL-NR-TDENNG--EMPT 386 (725) Q Consensus 311 d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~-~~~~~g--~~~~ 386 (725) .+ ..+.|.+..+++.++.+|...|.+.+.+...+...+++ .|...................+ .. .....+ .... T Consensus 250 n~-~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (511) T protein:vir:96 250 NN-ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGS 327 (511) T ss_pred CC-CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhhe-ecCccCCchhhcccccccceeccccceeccccccCCCC Confidence 11 23568899999999999999999998887665554332 2211100011111000000000 00 000011 1112 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..++++..+.-..++...+....+.|-.+|++.+.+.+..++..||+|+..............-.-|..+++++.++++. T Consensus 328 ~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:96 328 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23344443333455666788888899999988777776655557999999877666666666666777777777666666 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) ++...-... .. .|+ .+|.|.-.+..+.-..+..+.++.+.+.++. T Consensus 408 ~~~~~~~~~---------~~---------------------~~~---~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~-- 452 (511) T protein:vir:96 408 ILKNTRSID---------AN---------------------KDF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ-- 452 (511) T ss_pred HHHhcCCCc---------cc---------------------ccc---ccceEEeCCCCCcCHHHHHHHHHHHhccCCh-- Confidence 542211000 00 011 2455555666665455555666655443322 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) . .++..++..+ ..+..++++.+.... ....................+. -+ T Consensus 453 -e---t~l~~l~~v~--d~~~El~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~-----------~~ 502 (511) T protein:vir:96 453 -T---TLMSLFSFFQ--DPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQD-----------DD 502 (511) T ss_pred -H---HHHHhCCCCC--CHHHHHHHHHHHHHH-------------HHHHHhhccccCCCCCCCCCCC-----------CC Confidence 1 1222232222 122223333221100 0000000000000000000000 00 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQID 635 (725) Q Consensus 627 ae~~k~q~e 635 (725) .+....+.+ T Consensus 503 ~~~~~~e~~ 511 (511) T protein:vir:96 503 TKDTVDKKE 511 (511) T ss_pred ccCcccccC Confidence 000000000 No 49 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.71 E-value=1.5e-15 Score=101.93 Aligned_cols=456 Identities=11% Similarity=-0.005 Sum_probs=211.7 Q ss_pred CC------------cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHH Q lcl|NC_013059. 1 MA------------DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVV 64 (725) Q Consensus 1 ma------------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v 64 (725) |+ ...+.+.+++.++... -+....+-.+||.|+| +-..+......+ |.++|..+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~------~~~r~~~~~~yy~g~~-~i~~~~~~~~~~~~~~ki~~n~~~~iv 73 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAE------QLERLKELKRYYLGDN-NIKYRPAKTDKYAADNRIASDFAKYIT 73 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHH------HHHHHHHHHHHhcccC-ccccccccccccCCcceeecchHHHHH Confidence 33 2233455555554321 2233456678999986 111111111223 33579999999 Q ss_pred HHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeE Q lcl|NC_013059. 65 RKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVI 144 (725) Q Consensus 65 ~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~i 144 (725) +..+|+.-.+.+.+.+ . |. .....+..+++.|+++.......++++++|.||.-+...... |. +..+.| T Consensus 74 ~~~~~~l~g~~~~~~~--~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~-d~-~~~~~i 142 (489) T protein:vir:99 74 VFEQGYMLGVPVEYKN--E---NK----DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKID-DK-KTEVKL 142 (489) T ss_pred HHHhhhhccCCceeec--C---Ch----hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCc-CC-CcceEE Confidence 9999999887766543 2 22 245567778888999999999999999999999877654221 11 123333 Q ss_pred EEEeeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 145 RREPIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 145 r~~~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) .. .++.++ +||+.... + ..++++.|.. .+.+...+..+++ T Consensus 143 ~~----~~p~~~~~v~dd~~~~----~-~~~~i~~~~~-----------------------------~~~~~~~~~~~~~ 184 (489) T protein:vir:99 143 YQ----LPAEQTFVIYDDTYQR----N-SLMAVHFYDI-----------------------------DYGSGKRKQIIKA 184 (489) T ss_pred EE----EcccceEEEEcCCCCC----c-eEEEEEEEEE-----------------------------ecCCCceEEEEEE Confidence 32 233332 34433210 1 1122222210 0001123345555 Q ss_pred EEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEE Q lcl|NC_013059. 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVP 302 (725) Q Consensus 223 w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP 302 (725) |.... ++.+... +.. .+...+.++.|.+.+.+|+|+ T Consensus 185 y~~~~----i~~~~~~-~~~---------------------------------------~~~~~~~~~~~~~~g~vPvv~ 220 (489) T protein:vir:99 185 YTSDT----IYTYEDY-NLE---------------------------------------TKGMRLKDYEGHFFKGVPVNE 220 (489) T ss_pred EeCCc----EEEEEec-CCC---------------------------------------cccceecccccccCCceeEEE Confidence 54321 1111110 000 000011233344445666666 Q ss_pred EEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHH---HHHHhhccccc----cccc Q lcl|NC_013059. 303 VFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE---HMYDGNDDYPY----YLLN 375 (725) Q Consensus 303 ~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~~~~~~----~~~~ 375 (725) |.. ...+.|.+..+++.++.+|...|.+...+...+.....+ .|...... .........+. .... T Consensus 221 ~~n-------~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (489) T protein:vir:99 221 YAN-------NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVI-AGNAYTGADENDYLDDGRLNPNGRLAISIG 292 (489) T ss_pred eec-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhh-ccCCcccccchhhhhhcccccccccccccc Confidence 532 112557788999999999999999988776554433222 12110000 00000000000 0000 Q ss_pred -----cccccCccc---cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 376 -----RTDENNGEM---PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETY 447 (725) Q Consensus 376 -----~~~~~~g~~---~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~ 447 (725) .....++.. ....++++..+.-..+....++...+.|-..||+.+.+.+..++..||+|+............ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 372 (489) T protein:vir:99 293 FKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYRE 372 (489) T ss_pred cccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHH Confidence 000000000 011223333323334555577888888888998776554433345699998877655555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhH Q lcl|NC_013059. 448 VFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSM 527 (725) Q Consensus 448 ~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~ 527 (725) ..-..|..+++++.++++.++..... .. ... . .-.||.|.-.+..+.. T Consensus 373 ~k~~~~~~~l~~~~~li~~~~~~~~~---------~~----~~~------------------~-~~~~i~v~f~~~~p~d 420 (489) T protein:vir:99 373 KQERLFKKGLMRRLRLAANIWAIKGN---------EA----TTY------------------S-LVNDTSIVFTPNLPQN 420 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCC---------cc----ccc------------------c-ccccceEEeCCCCCcC Confidence 55566666666666665555432110 00 000 0 0125556667777765 Q ss_pred HHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhH Q lcl|NC_013059. 528 KQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQ 604 (725) Q Consensus 528 r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~ 604 (725) ..+..+.++++.+.++. ...+..++..+-+..++.++++.+.........+.. .....-...+....++ T Consensus 421 ~~~~~~~~~kl~giis~------et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 421 DNEIVTAAQNLYGIVSD------QTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPR--LVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHHHhccCCH------HHHHHhcCCCCchhHHHHHHHHHHHHHHHhcccccc--ccCCCCCCcCCCCCCC Confidence 66666666665443321 222233333333334444444433221111000000 0000000000000011 No 50 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.71 E-value=7.2e-16 Score=103.61 Aligned_cols=443 Identities=9% Similarity=0.052 Sum_probs=209.8 Q ss_pred CC---cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHHH-------HHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MA---DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ--WDDWL-------SQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 ma---d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~~-------~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |. .+.+.+.+++.+|-.. ..+-+....+-.+||.|++ |.... ....+..-|.++|..+.+|+..+ T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~---~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~ 111 (492) T protein:vir:94 35 IVRTNNKPETLEEMIVRYIKQ---HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKV 111 (492) T ss_pred ccccCCchhhHHHHHHHHHHH---HHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHH Confidence 22 3345555555555433 2334456677789999975 11100 00111112346799999999999 Q ss_pred HHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEe Q lcl|NC_013059. 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) Q Consensus 69 g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~ 148 (725) |+.-.+.+.+.+ +|.+..+.|.. +. .|+++...+..+.+++++|.||+-|..+ ++ | .+.+++ T Consensus 112 ~yl~G~p~~~~~-----~d~~~~~~l~~----~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d---~d--g-~~~~~~-- 173 (492) T protein:vir:94 112 SYIVGKPIAFKH-----TDDEVVKRIDE----VL-GNRFDDKLHSVLTGASNKGIEWLHPYLD---EE--G-EFKLFR-- 173 (492) T ss_pred hhhcccCceecc-----CchHHHHHHHH----HH-hccHHHHHHHHHHHHhhCCeEEEEEEec---CC--C-ceEEEE-- Confidence 999887766532 34444444443 33 3789999999999999999999877643 22 2 222222 Q ss_pred eecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 149 IHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 149 ~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) .++.+ ++||+.... + -..+++.|-..+ .. .+++|... T Consensus 174 --~~p~~~~~v~d~~~~~-~----~~a~ir~~~~~~-------------------------------~~---~~~~y~~~ 212 (492) T protein:vir:94 174 --VPAEQGIPIWTDKEHE-E----LEAFIRMYKLEN-------------------------------ET---KVEYWDKV 212 (492) T ss_pred --EcccceEEEEcCCCCC-c----eEEEEEEEeecc-------------------------------ce---eEEEEecC Confidence 23333 445643211 1 112333332100 00 12333322 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEee Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGE 306 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~ 306 (725) .+ ..+.+.+ |..+..... .+.....+..|.+.+.+|+|||... T Consensus 213 ~v--~~~~~~~---~~~~~~~~~--------------------------------~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:94 213 TV--NYYVYEN---GSLIPDYSN--------------------------------NLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred eE--EEEEEec---Ceeeecccc--------------------------------ccccccccccccCCCccceEEecCC Confidence 11 1111111 111100000 0001111234455566777766331 Q ss_pred eeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCcccc Q lcl|NC_013059. 307 WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 307 ~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) ..+.|.+..+++.++.+|..+|.+...+...+.....+ .|.. +...+........ . .+....+ T Consensus 256 -------~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~--~---~~~~~~~--- 319 (492) T protein:vir:94 256 -------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYY--G---AIKVSDN--- 319 (492) T ss_pred -------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcccchhhHHHHhhc--c---ceecCCC--- Confidence 12568889999999999999999998887776665443 2211 1111111111110 0 0111111 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..++++..+.-..+....++...+.|-..+++.+.+.+.-++..||+|+..............-..|..+++++.++++ T Consensus 320 -~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~ 398 (492) T protein:vir:94 320 -GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 398 (492) T ss_pred -CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1234443333334556677888889999998877666655556799999887766666666667777777777666655 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQG 545 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~ 545 (725) .+ ... .++ --||.|.-.+..+.-..+..+.+..+.+.++. T Consensus 399 ~~----~~~------~~~-----------------------------~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~- 438 (492) T protein:vir:94 399 EH----FDI------KGE-----------------------------HKDVDISFNYNKVANTELQVQTAQQSMGIVSH- 438 (492) T ss_pred HH----hcC------Ccc-----------------------------cceeeEEecCCCCCCHHHHHHHHHHHhccCch- Confidence 43 221 000 02344455666665444555555554432221 Q ss_pred cchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 546 TPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) Q Consensus 546 ~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~ka 625 (725) .. ++..++..+ ..+..++++.+.........+....... .-.....+.... T Consensus 439 --et---~~~~l~~v~--d~~~E~eri~~E~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~ 489 (492) T protein:vir:94 439 --ET---VLENHPFVE--DLQAELERIEQEQMEYNKQLPNLDDGGA----------------------DSAQQQERSNNK 489 (492) T ss_pred --HH---HHHhCCCCC--CHHHHHHHHHHHHHHHHhhccccccccC----------------------CCCccccCCccc Confidence 12 222222221 1222333332211000000000000000 000000000000 Q ss_pred HHH Q lcl|NC_013059. 626 QNQ 628 (725) Q Consensus 626 qae 628 (725) +++ T Consensus 490 e~e 492 (492) T protein:vir:94 490 ESE 492 (492) T ss_pred cCC Confidence 000 No 51 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.71 E-value=1.2e-16 Score=107.90 Aligned_cols=445 Identities=11% Similarity=0.039 Sum_probs=213.8 Q ss_pred CCc--------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC-C-CHHHH-------------HHHhhcC---- Q lcl|NC_013059. 1 MAD--------NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ-W-DDWLS-------------QYTTLQY---- 53 (725) Q Consensus 1 mad--------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q-W-~~~~~-------------~~l~~~g---- 53 (725) |-= ...+..+.+ ...++.....|.+..+..+||.|.+ + ..-.+ ...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i---~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHI---EALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHhhccccCCCHHHH---HHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 221 111111222 2222233344566666677776632 1 11000 0112223 Q ss_pred CCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeec Q lcl|NC_013059. 54 RGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE 133 (725) Q Consensus 54 rp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~ 133 (725) |.++|..+.+|+..+|+.-.+.+.+.+.+....+ +.+...+.-+...|+++.....+..+++++|.+|.-+..+ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~----e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d-- 151 (474) T protein:vir:94 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKN----EKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYID-- 151 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCCCcch----HHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeC-- Confidence 3467999999999999999998887775444444 3444566667777999999999999999999999776443 Q ss_pred cCCCCCCceeEEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc Q lcl|NC_013059. 134 DQSPTSNNQVIRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW 211 (725) Q Consensus 134 ~~~~~~~~~~ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 211 (725) ++ + .+.++.. ++.+ ++||- ..+ .- .+++.|...+ . T Consensus 152 -~~--~-~~~~~~i----~p~~~~~v~d~-~~~-----~~-~~i~~~~~~~----------------------------~ 188 (474) T protein:vir:94 152 -TN--G-DIRIKNI----DPYNVIFVGDN-ILE-----PT-YSLRYFYEKD----------------------------D 188 (474) T ss_pred -CC--C-eeEEEEE----cccceEEEEcC-CCc-----eE-EEEEEEEEee----------------------------C Confidence 22 1 2333321 2222 23331 111 11 1222211100 0 Q ss_pred cCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCC Q lcl|NC_013059. 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQ 291 (725) Q Consensus 212 ~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~ 291 (725) .+...+..+++|... .++.+.... .+...+.++. T Consensus 189 ~~~~~~~~~~~y~~~----~~~~~~~~~------------------------------------------~~~~~~~~~~ 222 (474) T protein:vir:94 189 DNGTDYVYAEFYDNA----YYYVFRGEG------------------------------------------IDALQEVGRY 222 (474) T ss_pred CCceEEEEEEEEcCc----eEEEEeecC------------------------------------------CCcccccccc Confidence 001122233444331 111111100 0001112233 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPY 371 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 371 (725) |.+.+.+|+|+|.. ...+.|.+..+++.++.+|...|.+...+...+...+++ .|. ...++......... T Consensus 223 ~~~~g~vPvv~~~n-------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~-~~~~~~~~~~~~~~- 292 (474) T protein:vir:94 223 EHLFDYNPLFGVPN-------NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGM-GMSEEMIQETQKSG- 292 (474) T ss_pred cCCCCccceEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccC-CCCchhhhhhhhcc- Confidence 33444566665421 123568889999999999999999998887666655433 221 11111111111100 Q ss_pred cccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 372 YLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 372 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) . +...++ ...++++..+.-..+....++...+.|-..|++.+.+.+..++..||+|+..+...........-. T Consensus 293 ~----i~~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 365 (474) T protein:vir:94 293 A----FELFDK---DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFER 365 (474) T ss_pred e----eEecCC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHH Confidence 1 111111 123445544433456677888899999999998777766555567999998877766666666777 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~ 531 (725) .|..+++++.++++.++..-... ..+ + ++ .||.+.-.+..+.-..+. T Consensus 366 ~~~~~l~~~~~li~~~l~~~~~~----------~~~-~-------------------~~---~~i~~~f~~~~p~d~~e~ 412 (474) T protein:vir:94 366 KMTAMLRYQFKVILSALKRKGYN----------LDD-D-------------------SY---LNLIFKFTRNIPVNKLEE 412 (474) T ss_pred HHHHHHHHHHHHHHHHHhhccCC----------CCc-c-------------------cc---ccceEEeCCCCCCCHHHH Confidence 77777777777776654432110 000 0 01 245555566666544455 Q ss_pred HHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 532 RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 532 ~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .+.++.+.+.+ |. ..++..++.. +..+..++++.+.........+.. .... T Consensus 413 a~~~~kl~g~i----S~--et~~~~l~~v--~d~~~E~eri~~E~~e~~~~~~~~-------------~~~~-------- 463 (474) T protein:vir:94 413 SQVLINLKGQV----SE--RTRLGQSQLV--DDVDYELDEMEKESLEFNDKLPDI-------------DEGD-------- 463 (474) T ss_pred HHHHHHHhccC----ch--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcccc-------------cCCC-------- Confidence 55555543322 21 2222222222 223333333322111000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQAELAKAQNQ 628 (725) Q Consensus 612 qa~~~k~qae~~kaqae 628 (725) ...+....+++ T Consensus 464 ------~~~~~~~~~s~ 474 (474) T protein:vir:94 464 ------ANDKSQNNQSE 474 (474) T ss_pred ------cCCCCccccCC Confidence 00000000111 No 52 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.71 E-value=1.2e-16 Score=107.90 Aligned_cols=445 Identities=11% Similarity=0.039 Sum_probs=213.8 Q ss_pred CCc--------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC-C-CHHHH-------------HHHhhcC---- Q lcl|NC_013059. 1 MAD--------NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ-W-DDWLS-------------QYTTLQY---- 53 (725) Q Consensus 1 mad--------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q-W-~~~~~-------------~~l~~~g---- 53 (725) |-= ...+..+.+ ...++.....|.+..+..+||.|.+ + ..-.+ ...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i---~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHI---EALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHhhccccCCCHHHH---HHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 221 111111222 2222233344566666677776632 1 11000 0112223 Q ss_pred CCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeec Q lcl|NC_013059. 54 RGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE 133 (725) Q Consensus 54 rp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~ 133 (725) |.++|..+.+|+..+|+.-.+.+.+.+.+....+ +.+...+.-+...|+++.....+..+++++|.+|.-+..+ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~----e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d-- 151 (474) T protein:vir:10 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKN----EKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYID-- 151 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCCCcch----HHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeC-- Confidence 3467999999999999999998887775444444 3444566667777999999999999999999999776443 Q ss_pred cCCCCCCceeEEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc Q lcl|NC_013059. 134 DQSPTSNNQVIRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW 211 (725) Q Consensus 134 ~~~~~~~~~~ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 211 (725) ++ + .+.++.. ++.+ ++||- ..+ .- .+++.|...+ . T Consensus 152 -~~--~-~~~~~~i----~p~~~~~v~d~-~~~-----~~-~~i~~~~~~~----------------------------~ 188 (474) T protein:vir:10 152 -TN--G-DIRIKNI----DPYNVIFVGDN-ILE-----PT-YSLRYFYEKD----------------------------D 188 (474) T ss_pred -CC--C-eeEEEEE----cccceEEEEcC-CCc-----eE-EEEEEEEEee----------------------------C Confidence 22 1 2333321 2222 23331 111 11 1222211100 0 Q ss_pred cCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCC Q lcl|NC_013059. 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQ 291 (725) Q Consensus 212 ~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~ 291 (725) .+...+..+++|... .++.+.... .+...+.++. T Consensus 189 ~~~~~~~~~~~y~~~----~~~~~~~~~------------------------------------------~~~~~~~~~~ 222 (474) T protein:vir:10 189 DNGTDYVYAEFYDNA----YYYVFRGEG------------------------------------------IDALQEVGRY 222 (474) T ss_pred CCceEEEEEEEEcCc----eEEEEeecC------------------------------------------CCcccccccc Confidence 001122233444331 111111100 0001112233 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPY 371 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 371 (725) |.+.+.+|+|+|.. ...+.|.+..+++.++.+|...|.+...+...+...+++ .|. ...++......... T Consensus 223 ~~~~g~vPvv~~~n-------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~-~~~~~~~~~~~~~~- 292 (474) T protein:vir:10 223 EHLFDYNPLFGVPN-------NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGM-GMSEEMIQETQKSG- 292 (474) T ss_pred cCCCCccceEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccC-CCCchhhhhhhhcc- Confidence 33444566665421 123568889999999999999999998887666655433 221 11111111111100 Q ss_pred cccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 372 YLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 372 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) . +...++ ...++++..+.-..+....++...+.|-..|++.+.+.+..++..||+|+..+...........-. T Consensus 293 ~----i~~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 365 (474) T protein:vir:10 293 A----FELFDK---DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFER 365 (474) T ss_pred e----eEecCC---CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHH Confidence 1 111111 123445544433456677888899999999998777766555567999998877766666666777 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~ 531 (725) .|..+++++.++++.++..-... ..+ + ++ .||.+.-.+..+.-..+. T Consensus 366 ~~~~~l~~~~~li~~~l~~~~~~----------~~~-~-------------------~~---~~i~~~f~~~~p~d~~e~ 412 (474) T protein:vir:10 366 KMTAMLRYQFKVILSALKRKGYN----------LDD-D-------------------SY---LNLIFKFTRNIPVNKLEE 412 (474) T ss_pred HHHHHHHHHHHHHHHHHhhccCC----------CCc-c-------------------cc---ccceEEeCCCCCCCHHHH Confidence 77777777777776654432110 000 0 01 245555566666544455 Q ss_pred HHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 532 RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 532 ~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .+.++.+.+.+ |. ..++..++.. +..+..++++.+.........+.. .... T Consensus 413 a~~~~kl~g~i----S~--et~~~~l~~v--~d~~~E~eri~~E~~e~~~~~~~~-------------~~~~-------- 463 (474) T protein:vir:10 413 SQVLINLKGQV----SE--RTRLGQSQLV--DDVDYELDEMEKESLEFNDKLPDI-------------DEGD-------- 463 (474) T ss_pred HHHHHHHhccC----ch--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcccc-------------cCCC-------- Confidence 55555543322 21 2222222222 223333333322111000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQAELAKAQNQ 628 (725) Q Consensus 612 qa~~~k~qae~~kaqae 628 (725) ...+....+++ T Consensus 464 ------~~~~~~~~~s~ 474 (474) T protein:vir:10 464 ------ANDKSQNNQSE 474 (474) T ss_pred ------cCCCCccccCC Confidence 00000000111 No 53 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.71 E-value=1.1e-15 Score=102.68 Aligned_cols=430 Identities=11% Similarity=0.010 Sum_probs=208.4 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC----cccchHHHHHHHHHHHhhCCcceEEecCCcc Q lcl|NC_013059. 11 ILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG----QFDVVRPVVRKLVSEMRQNPIDVLYRPKDGA 86 (725) Q Consensus 11 ~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp----~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~ 86 (725) ++..|.. +-+....+..+||.|+|+........+..++| ++|..+.+|+..+|+...+.+.+.+ .+.+ T Consensus 1 ~~~~~~~------~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~--~~~~ 72 (440) T protein:vir:95 1 MLAAFLG------SQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGV--MEGG 72 (440) T ss_pred ChhhHHH------HHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEee--CCCc Confidence 3332222 22334566679999998853222223333443 5699999999999999888877654 3333 Q ss_pred hHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh--eeeCCCccc Q lcl|NC_013059. 87 SPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH--VIWDSNSKL 164 (725) Q Consensus 87 d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~--v~~Dp~a~~ 164 (725) +.+.. ..+..++..|+++.....+.++++++|.+|.-+..+ ++ | .+.+++. ++.+ ++||+.... T Consensus 73 ~~~~~----~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d---~~--~-~~~i~~~----~p~~~~~~~d~~~~~ 138 (440) T protein:vir:95 73 SADQL----STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRD---KD--K-VDRVVLI----SPLEMFVIRDLTVEQ 138 (440) T ss_pred cHHHH----HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEec---CC--C-ceEEEEE----cccceEEEEcCCCCC Confidence 33322 235667889999999999999999999999887643 22 2 2233321 2222 445664421 Q ss_pred cChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCcccccee Q lcl|NC_013059. 165 MDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPV 244 (725) Q Consensus 165 ~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~ 244 (725) ....+++.|... +. ...++|..... ..+.... +. T Consensus 139 -----~~~~~i~~~~~~---------------------------------~~-~~~~vyt~~~~--~~~~~~~---~~-- 172 (440) T protein:vir:95 139 -----NIIAAVHLPIYA---------------------------------DK-VNMTVYTKDKV--ITYKPYS---NN-- 172 (440) T ss_pred -----ceEEEEEEEEec---------------------------------Cc-eEEEEEeCCeE--EEEEEec---CC-- Confidence 011223322211 00 01233332111 0011100 00 Q ss_pred ecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhhhhhh Q lcl|NC_013059. 245 SYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTK 324 (725) Q Consensus 245 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~k 324 (725) .+...+.+..|.+.+.+|+|+|.. + ..+.|.+..++ T Consensus 173 -------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n------~-~~g~sd~e~v~ 208 (440) T protein:vir:95 173 -------------------------------------SVRLVVDDVKKHSYNDVPVVEWWN------N-RFRMGDYESEI 208 (440) T ss_pred -------------------------------------ccceeecceeeccCceeeEEEeeC------C-CCCCCchhhhH Confidence 001112233344445566665522 1 12568889999 Q ss_pred hHHHHHHHHHHHHHHHHHhcCCcceeechhh---cchHHHHHHhh-ccccccccccccccCccccccCCcccCCCCchHH Q lcl|NC_013059. 325 DGQRLRNMIMSFNADIVARTPKKKPFFWPEQ---IAGFEHMYDGN-DDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQA 400 (725) Q Consensus 325 d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~---i~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 400 (725) +.++.+|..+|.+...+...+....++ .|. .....+..... ........... ...+......++++..+.-..+ T Consensus 209 ~lida~~~~~s~~~~~~~~~~~~~~v~-~g~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~lt~~~~~~~ 286 (440) T protein:vir:95 209 SLIDAYDAGQSDTANYMSDLNDAMLLV-KGDLDGIKLSPEDAAKMKDANMLFLKTGI-STTGQQTTADASYIYKQYDVNG 286 (440) T ss_pred HHHHHHHHHHHHHHHHHHHhhcceeee-ecccccCCCCccchhhhhhccceeccccc-ccccCCCCcceeEEeecCCHHH Confidence 999999999999988887665554332 111 10000100000 00000000000 0001111122344443333355 Q ss_pred HHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE Q lcl|NC_013059. 401 NAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVI 480 (725) Q Consensus 401 ~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI 480 (725) ....++.....|-.+|++.+.+.+.-+++.||+|+.................|..+.+++.++++.++..... T Consensus 287 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~------- 359 (440) T protein:vir:95 287 TEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING------- 359 (440) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC------- Confidence 6678888999999999988777766555679999888766666666666777777777766665554432110 Q ss_pred eccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccC Q lcl|NC_013059. 481 TLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLL 560 (725) Q Consensus 481 ~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~ 560 (725) . ..+ -.++.|.-.+..+.-..+..+.+..+.+.+ |. -.++..++.. T Consensus 360 -----~---------~~~--------------~~~v~i~f~~~~p~~~~~~ad~~~kl~g~i----S~--et~~~~l~~~ 405 (440) T protein:vir:95 360 -----P---------VIE--------------ANKLTFTFHPNIPQDVWTEIKAYIEAGGEI----SQ--ETLMENASFT 405 (440) T ss_pred -----c---------ccc--------------cccceEEeCCCCCCCHHHHHHHHHHHhccC----cH--HHHHHhCCCC Confidence 0 000 135555556666554445555555543322 21 1222223322 Q ss_pred CchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 561 DGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 561 d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) |.+ .++ +++..+.. +..... +.+... ..-....+| T Consensus 406 d~~--~E~-~ri~~E~~----------~~~~~~----~~~~~~---------~~~~~~~~e 440 (440) T protein:vir:95 406 DYK--TEH-SRILKQGG----------SSDLEI----GQIVGD---------ADVGQADTE 440 (440) T ss_pred CcH--HHH-HHHHHHHH----------HhhhhH----HhhccC---------CCCCCcCCC Confidence 211 111 11111000 000000 000000 000000000 No 54 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.71 E-value=5.4e-16 Score=104.31 Aligned_cols=437 Identities=10% Similarity=-0.002 Sum_probs=206.8 Q ss_pred cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH-----HHHhhcC----CCcccchHHHHHHHHHHHhh Q lcl|NC_013059. 3 DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS-----QYTTLQY----RGQFDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 3 d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~-----~~l~~~g----rp~~N~i~~~v~~v~g~~~~ 73 (725) =+.+.+.++. +....-+....+..+||.|++.-..-. ......+ |.++|..+.+|+..+|+.-. T Consensus 1 l~~~~i~~~i-------~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G 73 (451) T protein:vir:10 1 MELEKIRAII-------SADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT 73 (451) T ss_pred CCHHHHHHHH-------HHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec Confidence 1122222222 223334566788899999987421100 0111122 44679999999999999998 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCC--CCceeEEEEeeec Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT--SNNQVIRREPIHS 151 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~--~~~~~ir~~~~~~ 151 (725) +.+.+.+. ++.+..+ +++.+. .|+++.......++++++|.||.-+.++-...+.. ...+.+.+ . T Consensus 74 ~p~~~~~~----~~~~~~~----~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~----i 140 (451) T protein:vir:10 74 YPVLFDID----NNKELNE----KVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGV----V 140 (451) T ss_pred ccceeecC----CcHHHHH----HHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEE----E Confidence 88766542 2333333 344444 48899999999999999999998776542211111 11222221 1 Q ss_pred chhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 152 ACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 152 ~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) +|..+ +||.... +--..+++.|...+. . . .-.....+..+++|.... T Consensus 141 ~p~~~~~vydd~~~-----~~~~~~ir~~~~~~~------~---------------~---~~~~~~~~~~~e~yt~~~-- 189 (451) T protein:vir:10 141 NTEEIIPIYRNGIE-----RELEAVIRYYIQLED------V---------------K---GQIQKQAYTYVEFWTDKI-- 189 (451) T ss_pred cccceEEEEcCCCC-----CceEEEEEEEEeeec------c---------------c---ccccceEEEEEEEEeCCe-- Confidence 23332 3443221 011233333322110 0 0 000012223344443321 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ++.+... ++. ..|...+.++.|.+.+.+|+|+|+. T Consensus 190 --~~~~~~~-~~~--------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n---- 224 (451) T protein:vir:10 190 --LDKYKFF-GVS--------------------------------------CCGSQIEHITVQHRFNSVPFVEFSN---- 224 (451) T ss_pred --EEEEEec-ccC--------------------------------------ccccccccccccCCCCeeeEEEecc---- Confidence 1111100 000 0112222233333344555555432 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhh-cchHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) + ..+.|.+..+++.++.+|..+|.....+...+....++. |. .+...+............+... +.-..+. T Consensus 225 --n-~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~~~i~~~~~----~~~~~~~ 296 (451) T protein:vir:10 225 --N-IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILE-NFGGEDTSEFLKELKRYKTIKTETD----SEGDSGG 296 (451) T ss_pred --C-CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee-cCCcccchhhHHHHhhCCeEEecCc----CCccCCc Confidence 1 125688899999999999999999998887776655432 21 1111111111111111111110 0001123 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..+....++.....|-..|++.+.+.+..|| .||+|+..+-..........-..|..+++++.++++.++ T Consensus 297 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~ 375 (451) T protein:vir:10 297 LKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGN-ASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFL 375 (451) T ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4555544445666778999999999999876554444444 699999988777776666666677777766666655543 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) .. . |. .||.|.-.+..+.--.+..+.++.+.+.+ |. T Consensus 376 ----~~------~---------------------------d~---~~i~i~f~~~~p~n~~e~~~~~~kl~g~i----S~ 411 (451) T protein:vir:10 376 ----GV------T---------------------------DY---KKIQQTYTRNMMSNDLEDADIATKSVGII----PT 411 (451) T ss_pred ----CC------C---------------------------Cc---cceeEEecCCCCCCHHHHHHHHHHHhccC----ch Confidence 11 0 00 13334445555543334445555543322 21 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) -.++..++..+-+ ++..+++.++ +..... +.+..-.... + T Consensus 412 --et~~~~~p~v~d~--~~e~~~~~ee-------------~~~~~~-~~~~~~~~~~---------------~ 451 (451) T protein:vir:10 412 --KIILRHHPWVDDV--EEAEKLYLEE-------------KKIQAS-KVSDDYNNFT---------------E 451 (451) T ss_pred --HHHHHhCCCCCCH--HHHHHHHHHH-------------HHHHHH-HHHhhcCCCC---------------C Confidence 2222223322211 1111111000 000000 0000000000 0 No 55 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.71 E-value=7.9e-16 Score=103.37 Aligned_cols=445 Identities=11% Similarity=-0.008 Sum_probs=211.8 Q ss_pred CCcH-----HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH--HHHHHHhhcCCC----cccchHHHHHHHHH Q lcl|NC_013059. 1 MADN-----KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDD--WLSQYTTLQYRG----QFDVVRPVVRKLVS 69 (725) Q Consensus 1 mad~-----~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~--~~~~~l~~~grp----~~N~i~~~v~~v~g 69 (725) |.+. .+.+.++...+. .+-+....+-.+||.|++-.- ......+..++| ++|..+.+|+..+| T Consensus 23 ~~~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 96 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRHQ------TEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVG 96 (481) T ss_pred eecchhhcCHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHh Confidence 3332 223444444332 223334566678999986432 111222333333 56999999999999 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) +.-.+.+.+.+ . |.+. ...+.-+++.|+++.....+.++++++|.||+-+..+ ++ | .+.+++ T Consensus 97 ~l~g~~~~~~~--~---d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d---~d--g-~~~i~~--- 158 (481) T protein:vir:10 97 YLTGNPITITH--Q---DNQT----NDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRD---FE--D-RDTFKV--- 158 (481) T ss_pred hhccCCceEec--C---ChhH----HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeC---CC--C-eEEEEE--- Confidence 99887765543 2 2222 2345566778999999999999999999999877543 22 2 233332 Q ss_pred ecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEec Q lcl|NC_013059. 150 HSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVE 227 (725) Q Consensus 150 ~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~ 227 (725) .+++.+ +||+.... ...++++.|... ..+...+..+++|.... T Consensus 159 -~~p~~~~~v~d~~~~~-----~~~~~i~~~~~~-----------------------------~~~~~~~~~~~~y~~~~ 203 (481) T protein:vir:10 159 -LDPKSTFVVYDQTLDK-----KVVAGVRYFEKQ-----------------------------DKDKVPVQHVEVYTTDK 203 (481) T ss_pred -EcccceEEEEcCCCCC-----ceEEEEEEEEEe-----------------------------eCCCceEEEEEEEecCe Confidence 133333 34443211 111222222100 00112233445554321 Q ss_pred ceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccc-cccCCCCCCCCccceEEEEee Q lcl|NC_013059. 228 KKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA-VLKDKQLIAGEHIPIVPVFGE 306 (725) Q Consensus 228 ~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~-~l~~~~~~p~~~~p~vP~~g~ 306 (725) . +.+.. .|.. .+.++.|.+.+.+|+|||.- T Consensus 204 i----~~~~~--------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n- 234 (481) T protein:vir:10 204 I----YYIEI--------------------------------------------KGGTYHRVEEVEHYYNDVPIIEYLN- 234 (481) T ss_pred E----EEEEe--------------------------------------------cCCceeecccccccCCceeEEEeec- Confidence 1 11110 0100 11233344445667666532 Q ss_pred eeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccc Q lcl|NC_013059. 307 WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT 386 (725) Q Consensus 307 ~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 386 (725) + ..+.|.+..+++.++.+|...|.+...+...+...+++.-....+.+................... .+.-.. T Consensus 235 -----~-~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 307 (481) T protein:vir:10 235 -----D-QFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNA-NGSEGK 307 (481) T ss_pred -----C-CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccc-cCCCCC Confidence 1 235688899999999999999999988877766655442111111111000000000000000000 000011 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..++++....-..++...++.....|-.+|++.+.+.|..++..||.|+..............-..|..+++++.++++. T Consensus 308 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 387 (481) T protein:vir:10 308 AEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLN 387 (481) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12333333322344566788889999999999887877665667999998776665555555566666666666655555 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) ++.. .+.. . . ...+|.|.-.|..+....+..+.++.+.+.+ T Consensus 388 ~~~~----------~~~~-~-----------------------~-~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~i---- 428 (481) T protein:vir:10 388 NVNL----------TGLK-Q-----------------------H-NYAELTITFTPNLPKSMMESINAFNALSGGV---- 428 (481) T ss_pred HHhc----------cCCC-c-----------------------c-ccceeeEEeCCCCCcCHHHHHHHHHHHhccC---- Confidence 4321 1100 0 0 0135666666777665666666666654332 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qa 620 (725) |. ...+..++..+ ..++-++++.++....... ..+... .++. ......-..+. T Consensus 429 s~--et~~~~l~~i~--d~~~E~~ri~~E~~~~~~~-------------~~~~~~--~~~~--~~~~~~dd~~g 481 (481) T protein:vir:10 429 SE--STRLSLLDFID--NPKEELEKMQEEEAQREKQ-------------ADKRGY--GEAF--ENHLNVDDSNG 481 (481) T ss_pred Ch--HHHHHhCCCCC--CHHHHHHHHHHHHHHHHhh-------------hhhccC--CccC--CCCCCCCCCCC Confidence 21 12222222221 1122233332211000000 000000 0000 00000000000 No 56 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.70 E-value=3.8e-16 Score=105.15 Aligned_cols=467 Identities=11% Similarity=0.036 Sum_probs=218.6 Q ss_pred CC-cHHHH---HHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CCcccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MA-DNKNR---LESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 ma-d~~~~---~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp~~N~i~~~v~~v~g~~~ 72 (725) |. +.... .+++....... ....+....+-.+||.|.|.--......+... |.++|..+.+|+..+|+.. T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~---~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHH---MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cchhhhhhhccHHHHHHHHHHH---HHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhc Confidence 32 22211 12222222211 12234455667899999886432222223233 3467999999999999999 Q ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~ 152 (725) .+.+.+.+ +|.+ +...+..+.+.|+++...+...+++++.|.+|..+..+ ++ + .+.+.. .+ T Consensus 108 g~p~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~d---ed--~-~~~i~~----~~ 168 (511) T protein:vir:99 108 GNPIQYQD-----DDKD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYK----SD 168 (511) T ss_pred ccCceeec-----CchH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEE----Ec Confidence 88887753 2222 34667778888999999999999999999999877543 22 1 233332 23 Q ss_pred hhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEeccee Q lcl|NC_013059. 153 CSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE 230 (725) Q Consensus 153 ~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~ 230 (725) +.++ +||+.... -..++++.|.... . + ..+.+.+..+++|..... T Consensus 169 p~~~~~vyd~~~~~-----~~~~~vr~~~~~~------~----~----------------~~~~~~~~~~~vyt~~~i-- 215 (511) T protein:vir:99 169 AMSTFVIYDNTIER-----NSIAGVRYLRTKP------I----D----------------KTDEDEVFTVDLFTSHGV-- 215 (511) T ss_pred cceeEEEEcCCCCC-----ceEEEEEEEEeee------c----c----------------cCccceEEEEEEEeCCcE-- Confidence 3333 35544311 1123333332110 0 0 001233444556654321 Q ss_pred EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeecc Q lcl|NC_013059. 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) Q Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~ 310 (725) +.+....++.. . ......+..|.|++.+|+|+|.. T Consensus 216 --~~~~~~~~~~~-~-------------------------------------~~~~~~~~~~~~~g~vPvv~~~n----- 250 (511) T protein:vir:99 216 --YRYLTSRTNGL-K-------------------------------------LTPRENGFESHSFERMPITEFSN----- 250 (511) T ss_pred --EEEEecCCccc-c-------------------------------------ccccccccccCCCCccceEEecC----- Confidence 11111111100 0 00001123445556677766532 Q ss_pred CCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc--ccccccCc--cccc Q lcl|NC_013059. 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL--NRTDENNG--EMPT 386 (725) Q Consensus 311 d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~g--~~~~ 386 (725) + ..+.|.+..+++.++.+|..+|.+...+...+...+++ .|....-............... .......+ .... T Consensus 251 -n-~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (511) T protein:vir:99 251 -N-ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGS 327 (511) T ss_pred -C-CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhh-ccCcccCchhhcccccccceecccccccccccccCCCC Confidence 1 23568889999999999999999988776655543332 2211000000000000000000 00000000 1112 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..++++..+.-..++...++...+.|-.+|++.+.+.+.-++..||+|+..+...........-..|..+++++.++++. T Consensus 328 ~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~ 407 (511) T protein:vir:99 328 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23444444333455666788888999999988776666544557999999887777666667777777787777777766 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) ++...-... ... |+ .+|.|.-.+..|.-..+..+.++.+.+.++. T Consensus 408 ~~~~~~~~~---------~~~---------------------~~---~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~-- 452 (511) T protein:vir:99 408 ILKNTRSID---------VSK---------------------DF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ-- 452 (511) T ss_pred HHHhcCCcc---------ccc---------------------cc---ccceEEeCCCCCcCHHHHHHHHHHHhccCCH-- Confidence 553311000 000 01 2344555566665444555555555433221 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) .. ++..++..+ ..+.-++++.++.... ....+...........-..+. +..+.. T Consensus 453 -et---~l~~l~~v~--D~~~E~~ri~~E~~~~-------------~~~~~~~~~~~~~~~~~~~~~-------~~~~~~ 506 (511) T protein:vir:99 453 -TT---LMSLFSFFQ--DPELEVKKIEEDEKES-------------IKKAQKNMYQDPRNINDDEQD-------DSTKDS 506 (511) T ss_pred -HH---HHHhCCCCC--CHHHHHHHHHHHHHHH-------------HHHHhhcccccCCCCCCCCCC-------CCCcCc Confidence 11 222222222 2223333332211100 000000000000000000000 000000 Q ss_pred HHHHH Q lcl|NC_013059. 627 NQTLS 631 (725) Q Consensus 627 ae~~k 631 (725) .+..+ T Consensus 507 ~d~~e 511 (511) T protein:vir:99 507 IDKKE 511 (511) T ss_pred ccccC Confidence 00000 No 57 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.69 E-value=2e-14 Score=95.66 Aligned_cols=503 Identities=8% Similarity=-0.045 Sum_probs=235.5 Q ss_pred CCcHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC----CCCCHHHHHHHhhcCCCcccchHHHHHHHHH---- Q lcl|NC_013059. 1 MADNKN---RLESILSRFDADWTASDEARREAKNDLFFSRV----SQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS---- 69 (725) Q Consensus 1 mad~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G----~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g---- 69 (725) ||+.+. .-+.+..+|..-.+....|-..+.+..+|..- +.+..... .-.++.-..-...++.+.+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~----~~~~~~dst~~~a~~~LAa~L~~ 76 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGST----SYTTPWQSIGARGLNNLASKLML 76 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchh----hccccccchHHHHHHHHHHHHHH Confidence 998653 24556666666555555566666666777753 22322111 1123322333344444333 Q ss_pred HHhh-CCcceEEecCCcc-------h---HHHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC Q lcl|NC_013059. 70 EMRQ-NPIDVLYRPKDGA-------S---PDAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) Q Consensus 70 ~~~~-nr~~~~~~pr~~~-------d---~~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~ 135 (725) .-.. +++=+++.+.+.+ + .++.+.| +..+......|++..+...+|.+.+..|.|+.-+- .. + T Consensus 77 ~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~--~~-~ 153 (532) T protein:vir:99 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP--ST-E 153 (532) T ss_pred hhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEec--cc-c Confidence 2222 4555666665421 1 1223233 33344455678999999999999999999986332 11 1 Q ss_pred CCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCC Q lcl|NC_013059. 136 SPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQD 215 (725) Q Consensus 136 ~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (725) ...+.....+..|+ .++++..++.- ..--++++..++.+.+-+.++... ....+.+...+ T Consensus 154 ~~~~~~~~f~~~pl----~~y~v~~d~~G----~v~~ivrr~~~~~~~l~e~~~~~~------------~~~~~~~~p~~ 213 (532) T protein:vir:99 154 QVEGQSNAPKLYKL----HNFVVERDAYD----NVLQIVTEDKIARAALPEDVRKSL------------EDAQGDQNPSE 213 (532) T ss_pred cccCcccceEEEEc----CeEEEeeCCCC----CeeeEeeeeeecHHhcChHHHHHh------------hccccccCCCc Confidence 11112233444444 34666554421 111255556667665533333210 01111222334 Q ss_pred eEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEe-eccccccCCCCCC Q lcl|NC_013059. 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIA 294 (725) Q Consensus 216 ~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~l~~~~~~p 294 (725) .|.|..+.++++... ...|+.. .|..+....+-|| T Consensus 214 ~v~v~~~v~~~~~~~--------------------------------------------~~~~~~~~~g~~~~~~~~~~~ 249 (532) T protein:vir:99 214 EVTIYTHVYRDPEAM--------------------------------------------VFRSYQEIDGEIVAGTEGEYP 249 (532) T ss_pred ceEEEEEEEecCCCC--------------------------------------------eeEEEEeecCceecccccccc Confidence 566666555432210 0112222 3544444557788 Q ss_pred CCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc Q lcl|NC_013059. 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) Q Consensus 295 ~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) ++.+||+|+... -.+|..|+.|.+.+..+-.+.+|+.....+.....+.+.++++.++.+-....... ..+ + T Consensus 250 ~~e~P~~~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~---~~~---g 321 (532) T protein:vir:99 250 LDSCPWIPVRLI--KMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK---ANT---G 321 (532) T ss_pred cccCCceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhcc---CCC---c Confidence 899999987554 35888999999999999999999998888888888889998888765433221111 111 1 Q ss_pred ccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 375 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) ..+....+.+ .++ ......--.....+++...+.|.... ..+.....++..+++.=|..+.+.....|..++.+|. T Consensus 322 ~~v~g~~~~i--~~~-~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~ 397 (532) T protein:vir:99 322 DFVAGRKQDV--EVF-QLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLS 397 (532) T ss_pred ceecCCcccc--eee-ecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHH Confidence 1111112211 111 11122222445566777777776655 2222222344556666788888888888888777776 Q ss_pred H-HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHH Q lcl|NC_013059. 455 T-AMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRA 533 (725) Q Consensus 455 ~-~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~ 533 (725) . ...=+.+..+.++.+ .|.- ++. ..++. .-++++-.+|- -|.+.++ T Consensus 398 ~E~l~Pli~r~~~il~r-------------~g~l------P~~----------p~~~~-~~~iv~~is~L---araq~~~ 444 (532) T protein:vir:99 398 QELQLPLVKILLKELQA-------------TSKI------PNL----------PKEAV-EPAIATGLEAL---GRGHDLN 444 (532) T ss_pred HHHHHHHHHHHHHHHHh-------------cCCC------CCC----------Chhhc-ccceeecchHH---HHHHHHH Confidence 3 333333333333332 2210 000 01121 12333222222 3555566 Q ss_pred HHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 534 ~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .++.+++.+.+..|.. ++..|++ +++..+..... +..+... +++.+...+++++++ +..++.. T Consensus 445 ~l~~~~~~laq~~p~~-------~d~id~d---~~~~~~a~~~GV~~~~i~r~--~ee~~~~~~q~~~~~---~~~~a~~ 509 (532) T protein:vir:99 445 KLNVFIDYMIKLAGLQ-------DDDINLL---DVKMRLANSLGMDTTGLILT--QQDKQAKMAEASTAA---GMVTAGQ 509 (532) T ss_pred HHHHHHHHHHhhcchh-------hhhCCHH---HHHHHHHHHhCCChhhccCC--HHHHHHHHHHHHHHH---HHHHHHH Confidence 6666666655444432 2233433 33333333221 1222221 111111111111100 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQ 644 (725) Q Consensus 612 qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q 644 (725) ++... +.++.....+.+.....+ T Consensus 510 ~~~~~----------~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 510 QMGAA----------GGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred HHHHH----------HHHhcchhHHhhcCCCCC Confidence 00000 000000000000000001 No 58 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.69 E-value=4.5e-16 Score=104.69 Aligned_cols=455 Identities=8% Similarity=-0.047 Sum_probs=209.8 Q ss_pred cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHHHHHH-------HhhcC----CCcccchHHHHHHHHH Q lcl|NC_013059. 3 DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ--WDDWLSQY-------TTLQY----RGQFDVVRPVVRKLVS 69 (725) Q Consensus 3 d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~~~~~-------l~~~g----rp~~N~i~~~v~~v~g 69 (725) =+.+.+.++..++... .........+-.+||.|++ |....... ....+ |.++|..+.+|+..+| T Consensus 1 ~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~ 77 (470) T protein:vir:10 1 MELDALKKLIQNTSTS---RNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (470) T ss_pred CchHHHHHHHHHHHHH---HHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhh Confidence 1233444444444443 3344456667789999976 22111110 11122 3467999999999999 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) +.-.+.+.+.+ +|.+..+.+..+++ ++++...+....+++++|.+|..+.++ ++ + .+.+.+. T Consensus 78 yl~G~p~~~~~-----~d~~~~~~l~~~~~-----~~~~~~~~~l~~~~~~~G~a~~~~y~d---~~--~-~~~~~~~-- 139 (470) T protein:vir:10 78 YVASVFPDIDV-----GKDADNKKIIDVLG-----DDRALTLNGLLVDSSNAGRAWLHYWID---ED--G-NFRYGII-- 139 (470) T ss_pred heeccceeeec-----CchHHHHHHHHHHh-----hhHHHHHHHHHHHHhhcCeeEEEEEec---CC--C-ceEEEEE-- Confidence 99998877643 23334444544433 356677778889999999999887653 22 1 2332221 Q ss_pred ecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEec Q lcl|NC_013059. 150 HSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVE 227 (725) Q Consensus 150 ~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~ 227 (725) ++.. ++||+.... . ..++++.|...+ ..+...+..+|+|.... T Consensus 140 --~p~~~~~v~d~~~~~----~-~~a~ir~y~~~~----------------------------~~~~~~~~~~e~yt~~~ 184 (470) T protein:vir:10 140 --QPDQITPIYATTLDN----K-LLGILRSYKQLD----------------------------PDSGKYFTVHEYWTDKE 184 (470) T ss_pred --cccceEEEEcCCCCC----c-eEEEEEEEEeee----------------------------cCCceEEEEEEEEcCCc Confidence 2222 334443211 0 112222222110 01122344566665432 Q ss_pred ceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeee Q lcl|NC_013059. 228 KKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEW 307 (725) Q Consensus 228 ~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~ 307 (725) .. .+.... ++.. .......... . -....+.... ...++|. |+.|||+.++ T Consensus 185 ~~--~~~~~~--~~~~-~~~~~~~~~~-------------------~---~~~~~~~~~~--~~~~~~~-~g~vPvv~~~ 234 (470) T protein:vir:10 185 AQ--FFRTNA--TDST-VIEPYNIITS-------------------Y---DLSAGYETGQ--SNTLKHN-FGRVPFIEFS 234 (470) T ss_pred EE--EEEeec--Ccce-eccccccccc-------------------c---cccccccccc--ccccccC-CCeeeEEEee Confidence 21 111111 1100 0000000000 0 0000011111 1223333 3444554443 Q ss_pred eccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccccc Q lcl|NC_013059. 308 GFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQ 387 (725) Q Consensus 308 ~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 387 (725) - + ..+.|.+..+++.++.+|..+|.+...+...+...+++.-...+...+............+.. .|.-... T Consensus 235 n---n-~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~----~~~~~~~ 306 (470) T protein:vir:10 235 K---N-KYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINN----TGNGDNS 306 (470) T ss_pred c---C-CCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccC----CCCCcCc Confidence 2 1 125588889999999999999999988887776655543212222222222111111111111 1111122 Q ss_pred CCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 388 PLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 388 ~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) .++++..+.-..+....++...+.|-..+++.+.+.+..| ..||+|+..+...........-..|..+++++.++++. T Consensus 307 ~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~- 384 (470) T protein:vir:10 307 GVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESS-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMR- 384 (470) T ss_pred eeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 3445554444456677888899999999988776666544 48999999887777777666666666776666665554 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) ++.- .+- | ..||.|.-.+..+.--.+..+.++.+.+. .+ T Consensus 385 ---~l~~------~~~-------------------------d---~~~i~i~f~~~~p~d~~e~~~~~~~~~g~----iS 423 (470) T protein:vir:10 385 ---YLNF------SDA-------------------------D---KRHISQHWTRTKVEDSLTKAQIVSTVANY----SS 423 (470) T ss_pred ---Hhcc------cCc-------------------------c---cceeeEEeccCCCCCHHHHHHHHHHHhcc----Cc Confidence 4321 110 0 12444544555554333333334333222 22 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) . ...+..++..+ ..+...+++.+........ ..+. ... ......-++ T Consensus 424 ~--et~l~~~p~v~--D~~~E~eri~~E~~e~~~~-------------~~~~------~~~-------~~~~~dde~ 470 (470) T protein:vir:10 424 K--EAVAKANPIVD--DWQQELKDLAKDKEENDPY-------------SNQA------DEL-------NGKGVNDEQ 470 (470) T ss_pred H--HHHHHhCCCCC--CHHHHHHHHHHHHHHHHHh-------------hccc------ccc-------CCCCCCCCC Confidence 1 12222222221 1222233332211000000 0000 000 000000000 No 59 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.69 E-value=6.8e-16 Score=103.73 Aligned_cols=469 Identities=10% Similarity=0.025 Sum_probs=216.4 Q ss_pred CC-cHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCCCCCHHHHHHHhhc----CCCcccchHHHHHHHHHHHhhC Q lcl|NC_013059. 1 MA-DNKNRLESILSRFDADWTA-SDEARREAKNDLFFSRVSQWDDWLSQYTTLQ----YRGQFDVVRPVVRKLVSEMRQN 74 (725) Q Consensus 1 ma-d~~~~~~~~~~~~~~~~~~-~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~----grp~~N~i~~~v~~v~g~~~~n 74 (725) |. +..... .........+.. ...-+....+-.+||.|.|.--......+.. .|.++|..+.+|+..+|+...+ T Consensus 31 ~~~~e~~~~-~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~ 109 (511) T protein:vir:93 31 YDGTESDLL-QNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (511) T ss_pred ccchhhhhh-ccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhccc Confidence 33 221111 011111112221 1222344556679999987532111111222 2346799999999999999888 Q ss_pred CcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh Q lcl|NC_013059. 75 PIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS 154 (725) Q Consensus 75 r~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~ 154 (725) .+.+.+ +|.+ ....+..+.+.|+++.......++++++|.+|.-|..+ ++ + .+.+.. .++. T Consensus 110 p~~~~~-----~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~d---e~--~-~~~i~~----~~p~ 170 (511) T protein:vir:93 110 PIQYQD-----DDKD----VLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD--D-ETRLYK----SDAM 170 (511) T ss_pred Ceeecc-----CChH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeC---CC--C-ceEEEE----Eccc Confidence 877642 2222 34567777888999999999999999999999877543 22 2 233332 1233 Q ss_pred h--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEE Q lcl|NC_013059. 155 H--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) Q Consensus 155 ~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~ 232 (725) + ++||..... -..++++.|.... . + ..+.+.+..+++|..... T Consensus 171 ~~~~vydd~~~~-----~~~~~vr~~~~~~------~----~----------------~~~~~~~~~~~iyt~~~i---- 215 (511) T protein:vir:93 171 STFVIYDNTIER-----NSIAGVRYLRTKP------I----D----------------KTDEDEVFTVDLFTSHGV---- 215 (511) T ss_pred eeEEEEcCCCCC-----ceEEEEEEEEeee------c----c----------------ccccceEEEEEEEeCCcE---- Confidence 3 345544321 1223444332210 0 0 001123344555544321 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCC Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVED 312 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~ 312 (725) +.+... ++.... ......++.|.+.+.+|+|+|. .+ T Consensus 216 ~~~~~~-~~~~~~-------------------------------------~~~~~~~~~~~~~g~vPvv~~~------nn 251 (511) T protein:vir:93 216 YRYLTS-RTNGLK-------------------------------------LTPRENGFESHSFERMPITEFS------NN 251 (511) T ss_pred EEEEec-CCCccc-------------------------------------cccccccccccCCCccceEEec------CC Confidence 111110 010000 0000112334444556665542 11 Q ss_pred ccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc--ccccccC--ccccccC Q lcl|NC_013059. 313 KEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL--NRTDENN--GEMPTQP 388 (725) Q Consensus 313 ~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~--g~~~~~~ 388 (725) ..+.|.+..+++.++.+|..+|.+...+...+....++ .|....-..............+ ....... +...... T Consensus 252 -~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:93 252 -ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred -CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceee-ecCcccCchhhcccccccceecccccccccccccCCCCcc Confidence 23568899999999999999999998887666554332 2211100011110000000000 0000000 1111233 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..++...++.....|-.+|++.+.+.+..++..||+|+..............-..|..+++++.++++.++ T Consensus 330 ~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l 409 (511) T protein:vir:93 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 409 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444333345666788888999999999877766654455799999988777666666666777777777777666654 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) ....... ... |+ -+|.|.-.+..+.-..+..+.+..+.+.++. . T Consensus 410 ~~~~~~~---------~~~---------------------d~---~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~---e 453 (511) T protein:vir:93 410 KNTWSID---------ANK---------------------DF---NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ---T 453 (511) T ss_pred HhccCcc---------ccc---------------------cc---ccceEEeCCCCCCCHHHHHHHHHHHhccCch---H Confidence 3222110 000 11 1444555666665455556666655433322 1 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQ 628 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae 628 (725) . ++..++..+ ..++-++++.+..... ........................+- .+. T Consensus 454 t---~~~~l~~v~--d~~~E~~ri~~E~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~ 508 (511) T protein:vir:93 454 T---LMSLFSFFQ--DPELEVKKIEEDEKES-------------IKKAQKGIYKDPRDINDDEQDDDTKD-------TVD 508 (511) T ss_pred H---HHHhCCCCC--CHHHHHHHHHHHHHHH-------------HHHHhhhcccCCCCCCCCCCCCcccc-------ccc Confidence 1 222222222 1122233332211000 00000000000000000000000000 000 Q ss_pred HHH Q lcl|NC_013059. 629 TLS 631 (725) Q Consensus 629 ~~k 631 (725) +.+ T Consensus 509 ~~~ 511 (511) T protein:vir:93 509 KKE 511 (511) T ss_pred ccC Confidence 000 No 60 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.69 E-value=3.6e-14 Score=94.28 Aligned_cols=532 Identities=10% Similarity=-0.060 Sum_probs=248.1 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh---cCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHhh Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFS---RVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMRQ 73 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~---~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~~ 73 (725) |+... ....+..+|+........|-..+.+..+|. .|.=|+...-...+...++.-..-...++.+.+ .-.. T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:98 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcC Confidence 99876 447788888888777777877888888887 344443321111111233322334444444333 2222 Q ss_pred -CCcceEEecCCcchH---HHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE Q lcl|NC_013059. 74 -NPIDVLYRPKDGASP---DAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) Q Consensus 74 -nr~~~~~~pr~~~d~---~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~ 146 (725) +++=+++.+.+++.. .+.+.| +..+......+++..+...+|.+.+..|.|++-+ ..|+.+ .+ +. T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~-----~~d~~~-~~--rf 151 (555) T protein:vir:98 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIV-----LPDFDA-VV--YH 151 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEE-----ecCCCc-eE--EE Confidence 666677777655432 222322 3344445567999999999999999999998633 233322 22 32 Q ss_pred EeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 147 ~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) .+ .++.++++..++.- ..--+||...|+...+.+.|+.-........ .... ...+..+.|+.++|.+ T Consensus 152 ~~--~pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~---~~~~----~~~~~~v~v~~~V~pr 218 (555) T protein:vir:98 152 HS--LTAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKCSTTVQS---LFDR----GALEQWVTVIHAIEPR 218 (555) T ss_pred EE--eecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccCCHHHHH---HHhc----CCCCceEEEEEEEeec Confidence 22 34455777655432 1223567788999999998886221111000 1111 1112357777776654 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEE-EEEEEe-eccccccCCCCCCCCccceEEEE Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR-VYKSII-TCTAVLKDKQLIAGEHIPIVPVF 304 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~ 304 (725) ... ++... +. ..+.+. |+|.-- .|.++|. .+.| ..+||+|+- T Consensus 219 ~~~-------~~~~~-----~~---------------------~~~p~~s~~~~~~~d~~~vl~-esgy--~e~P~i~~R 262 (555) T protein:vir:98 219 ADR-------DPSKR-----DD---------------------RNMAWKSVYFEPGADETRTLR-ESGY--RSFRALCPR 262 (555) T ss_pred cCc-------CcCCC-----Cc---------------------cccceEEEEEEeccCCccccc-cCCc--ccCCceeee Confidence 321 11100 00 011111 233221 2445553 3334 679999875 Q ss_pred eeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 305 g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) .. ..+|..|+.|.+....+-.+.+|+.....+..+....+.++.++.+.... ..+..|+... .+. .|. T Consensus 263 w~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~-~v~--~g~- 330 (555) T protein:vir:98 263 WA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLS-YVD--AAA- 330 (555) T ss_pred ee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccc-ccc--cCC- Confidence 44 45898999999999999999999988888889999999998887764221 1122233221 111 111 Q ss_pred cccCCcc--cCCCCchHHHHHHHHHHHHHHHHHhCCChH--Hhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 PTQPLAY--YENPEVPQANAYMLEAATAAVKEVATLGVD--AEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) Q Consensus 385 ~~~~~~~--~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~--~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~ 459 (725) +.+.+.. -..+.+ ....+.++...+.|.... .++. +++. ++..+++.=|..+.+.....|..++-+|.. T Consensus 331 ~~d~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~---- 404 (555) T protein:vir:98 331 PNGGIRTAFEVNLDL-SHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHN---- 404 (555) T ss_pred CCcceecccccccch-HHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHH---- Confidence 0111111 111222 344566777777776665 3342 2332 223356667888888888888888877764 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccc-cceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRG-RYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g-~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +.+.=||...|. |....|.- ++..+ .+.| .++|....+. ...+|...+..+.++ T Consensus 405 --E~l~Pli~r~~~------il~r~g~l------P~~P~----------~l~~~~i~v~yis~L-a~aq~~~~~~~i~~~ 459 (555) T protein:vir:98 405 --EILDPLIELTFQ------RMVEANIL------PPPPQ----------EMQGVDLNVEFVSML-AQAQRAIATNSVDRF 459 (555) T ss_pred --HHHHHHHHHHHH------HHHhcCCC------CCCch----------hhcCceeEEEeccHH-HHHHHHHHHHHHHHH Confidence 333333322221 22222210 00000 0111 2333333322 333455655555555 Q ss_pred HHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh-hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) Q Consensus 539 l~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~-~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k 617 (725) ++.+.+.+...+. .++..|+ ++++..+......+ .+... +++..+..++.++++++++++.+ +.+... T Consensus 460 l~~i~~laq~~P~----vld~id~---d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~---~~q~~~ 528 (555) T protein:vir:98 460 VGNLGAVAGIKPE----VLDKFDA---DRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAAL---LNQGAD 528 (555) T ss_pred HHHHHHHhcCChh----hhhcCCH---HHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHH---HHHHHH Confidence 5544332221111 1233344 33333332222121 11111 11111111111111111111100 000001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLN 646 (725) Q Consensus 618 ~qae~~kaqae~~k~q~ea~~~q~q~q~~ 646 (725) ..+.+..+...-.-. -....++..-.. T Consensus 529 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 555 (555) T protein:vir:98 529 TAAKLGSVDTSKQNA--LTDVTRAFSGYT 555 (555) T ss_pred HHHHhcccccCcchh--HHHHHhhhccCC Confidence 111111111000000 000000000000 No 61 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.69 E-value=3.6e-14 Score=94.28 Aligned_cols=532 Identities=10% Similarity=-0.060 Sum_probs=248.1 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh---cCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHhh Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFS---RVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMRQ 73 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~---~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~~ 73 (725) |+... ....+..+|+........|-..+.+..+|. .|.=|+...-...+...++.-..-...++.+.+ .-.. T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:10 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcC Confidence 99876 447788888888777777877888888887 344443321111111233322334444444333 2222 Q ss_pred -CCcceEEecCCcchH---HHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE Q lcl|NC_013059. 74 -NPIDVLYRPKDGASP---DAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) Q Consensus 74 -nr~~~~~~pr~~~d~---~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~ 146 (725) +++=+++.+.+++.. .+.+.| +..+......+++..+...+|.+.+..|.|++-+ ..|+.+ .+ +. T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~-----~~d~~~-~~--rf 151 (555) T protein:vir:10 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIV-----LPDFDA-VV--YH 151 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEE-----ecCCCc-eE--EE Confidence 666677777655432 222322 3344445567999999999999999999998633 233322 22 32 Q ss_pred EeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 147 ~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) .+ .++.++++..++.- ..--+||...|+...+.+.|+.-........ .... ...+..+.|+.++|.+ T Consensus 152 ~~--~pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~---~~~~----~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 152 HS--LTAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKCSTTVQS---LFDR----GALEQWVTVIHAIEPR 218 (555) T ss_pred EE--eecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccCCHHHHH---HHhc----CCCCceEEEEEEEeec Confidence 22 34455777655432 1223567788999999998886221111000 1111 1112357777776654 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEE-EEEEEe-eccccccCCCCCCCCccceEEEE Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR-VYKSII-TCTAVLKDKQLIAGEHIPIVPVF 304 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~ 304 (725) ... ++... +. ..+.+. |+|.-- .|.++|. .+.| ..+||+|+- T Consensus 219 ~~~-------~~~~~-----~~---------------------~~~p~~s~~~~~~~d~~~vl~-esgy--~e~P~i~~R 262 (555) T protein:vir:10 219 ADR-------DPSKR-----DD---------------------RNMAWKSVYFEPGADETRTLR-ESGY--RSFRALCPR 262 (555) T ss_pred cCc-------CcCCC-----Cc---------------------cccceEEEEEEeccCCccccc-cCCc--ccCCceeee Confidence 321 11100 00 011111 233221 2445553 3334 679999875 Q ss_pred eeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 305 g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) .. ..+|..|+.|.+....+-.+.+|+.....+..+....+.++.++.+.... ..+..|+... .+. .|. T Consensus 263 w~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~-~v~--~g~- 330 (555) T protein:vir:10 263 WA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLS-YVD--AAA- 330 (555) T ss_pred ee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccc-ccc--cCC- Confidence 44 45898999999999999999999988888889999999998887764221 1122233221 111 111 Q ss_pred cccCCcc--cCCCCchHHHHHHHHHHHHHHHHHhCCChH--Hhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 PTQPLAY--YENPEVPQANAYMLEAATAAVKEVATLGVD--AEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) Q Consensus 385 ~~~~~~~--~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~--~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~ 459 (725) +.+.+.. -..+.+ ....+.++...+.|.... .++. +++. ++..+++.=|..+.+.....|..++-+|.. T Consensus 331 ~~d~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~---- 404 (555) T protein:vir:10 331 PNGGIRTAFEVNLDL-SHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHN---- 404 (555) T ss_pred CCcceecccccccch-HHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHH---- Confidence 0111111 111222 344566777777776665 3342 2332 223356667888888888888888877764 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccc-cceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRG-RYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g-~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +.+.=||...|. |....|.- ++..+ .+.| .++|....+. ...+|...+..+.++ T Consensus 405 --E~l~Pli~r~~~------il~r~g~l------P~~P~----------~l~~~~i~v~yis~L-a~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 405 --EILDPLIELTFQ------RMVEANIL------PPPPQ----------EMQGVDLNVEFVSML-AQAQRAIATNSVDRF 459 (555) T ss_pred --HHHHHHHHHHHH------HHHhcCCC------CCCch----------hhcCceeEEEeccHH-HHHHHHHHHHHHHHH Confidence 333333322221 22222210 00000 0111 2333333322 333455655555555 Q ss_pred HHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh-hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) Q Consensus 539 l~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~-~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k 617 (725) ++.+.+.+...+. .++..|+ ++++..+......+ .+... +++..+..++.++++++++++.+ +.+... T Consensus 460 l~~i~~laq~~P~----vld~id~---d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~---~~q~~~ 528 (555) T protein:vir:10 460 VGNLGAVAGIKPE----VLDKFDA---DRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAAL---LNQGAD 528 (555) T ss_pred HHHHHHHhcCChh----hhhcCCH---HHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHH---HHHHHH Confidence 5544332221111 1233344 33333332222121 11111 11111111111111111111100 000001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLN 646 (725) Q Consensus 618 ~qae~~kaqae~~k~q~ea~~~q~q~q~~ 646 (725) ..+.+..+...-.-. -....++..-.. T Consensus 529 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 555 (555) T protein:vir:10 529 TAAKLGSVDTSKQNA--LTDVTRAFSGYT 555 (555) T ss_pred HHHHhcccccCcchh--HHHHHhhhccCC Confidence 111111111000000 000000000000 No 62 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.69 E-value=3.6e-14 Score=94.28 Aligned_cols=532 Identities=10% Similarity=-0.060 Sum_probs=248.1 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh---cCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHhh Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFS---RVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMRQ 73 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~---~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~~ 73 (725) |+... ....+..+|+........|-..+.+..+|. .|.=|+...-...+...++.-..-...++.+.+ .-.. T Consensus 1 M~~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 79 (555) T protein:vir:10 1 MAEQT-ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTS 79 (555) T ss_pred CCCcc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcC Confidence 99876 447788888888777777877888888887 344443321111111233322334444444333 2222 Q ss_pred -CCcceEEecCCcchH---HHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE Q lcl|NC_013059. 74 -NPIDVLYRPKDGASP---DAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) Q Consensus 74 -nr~~~~~~pr~~~d~---~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~ 146 (725) +++=+++.+.+++.. .+.+.| +..+......+++..+...+|.+.+..|.|++-+ ..|+.+ .+ +. T Consensus 80 p~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~-----~~d~~~-~~--rf 151 (555) T protein:vir:10 80 PARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIV-----LPDFDA-VV--YH 151 (555) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEE-----ecCCCc-eE--EE Confidence 666677777655432 222322 3344445567999999999999999999998633 233322 22 32 Q ss_pred EeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 147 ~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) .+ .++.++++..++.- ..--+||...|+...+.+.|+.-........ .... ...+..+.|+.++|.+ T Consensus 152 ~~--~pl~~~~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~---~~~~----~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 152 HS--LTAGEYAIAADNQG----RVNTLYREFQITVAQMVREFGKDKCSTTVQS---LFDR----GALEQWVTVIHAIEPR 218 (555) T ss_pred EE--eecceeEEeeCCCC----CEEEEEEEEeccHHHHHHhcCcccCCHHHHH---HHhc----CCCCceEEEEEEEeec Confidence 22 34455777655432 1223567788999999998886221111000 1111 1112357777776654 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEE-EEEEEe-eccccccCCCCCCCCccceEEEE Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR-VYKSII-TCTAVLKDKQLIAGEHIPIVPVF 304 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~ 304 (725) ... ++... +. ..+.+. |+|.-- .|.++|. .+.| ..+||+|+- T Consensus 219 ~~~-------~~~~~-----~~---------------------~~~p~~s~~~~~~~d~~~vl~-esgy--~e~P~i~~R 262 (555) T protein:vir:10 219 ADR-------DPSKR-----DD---------------------RNMAWKSVYFEPGADETRTLR-ESGY--RSFRALCPR 262 (555) T ss_pred cCc-------CcCCC-----Cc---------------------cccceEEEEEEeccCCccccc-cCCc--ccCCceeee Confidence 321 11100 00 011111 233221 2445553 3334 679999875 Q ss_pred eeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 305 g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) .. ..+|..|+.|.+....+-.+.+|+.....+..+....+.++.++.+.... ..+..|+... .+. .|. T Consensus 263 w~--~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~------~~~~~pgg~~-~v~--~g~- 330 (555) T protein:vir:10 263 WA--LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQ------DISTVPGGLS-YVD--AAA- 330 (555) T ss_pred ee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc------cceecccccc-ccc--cCC- Confidence 44 45898999999999999999999988888889999999998887764221 1122233221 111 111 Q ss_pred cccCCcc--cCCCCchHHHHHHHHHHHHHHHHHhCCChH--Hhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 PTQPLAY--YENPEVPQANAYMLEAATAAVKEVATLGVD--AEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) Q Consensus 385 ~~~~~~~--~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~--~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~ 459 (725) +.+.+.. -..+.+ ....+.++...+.|.... .++. +++. ++..+++.=|..+.+.....|..++-+|.. T Consensus 331 ~~d~~~~~~~~~~d~-~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~---- 404 (555) T protein:vir:10 331 PNGGIRTAFEVNLDL-SHLLADIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHN---- 404 (555) T ss_pred CCcceecccccccch-HHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHH---- Confidence 0111111 111222 344566777777776665 3342 2332 223356667888888888888888877764 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccc-cceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRG-RYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g-~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +.+.=||...|. |....|.- ++..+ .+.| .++|....+. ...+|...+..+.++ T Consensus 405 --E~l~Pli~r~~~------il~r~g~l------P~~P~----------~l~~~~i~v~yis~L-a~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 405 --EILDPLIELTFQ------RMVEANIL------PPPPQ----------EMQGVDLNVEFVSML-AQAQRAIATNSVDRF 459 (555) T ss_pred --HHHHHHHHHHHH------HHHhcCCC------CCCch----------hhcCceeEEEeccHH-HHHHHHHHHHHHHHH Confidence 333333322221 22222210 00000 0111 2333333322 333455655555555 Q ss_pred HHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh-hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) Q Consensus 539 l~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~-~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k 617 (725) ++.+.+.+...+. .++..|+ ++++..+......+ .+... +++..+..++.++++++++++.+ +.+... T Consensus 460 l~~i~~laq~~P~----vld~id~---d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~---~~q~~~ 528 (555) T protein:vir:10 460 VGNLGAVAGIKPE----VLDKFDA---DRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAAL---LNQGAD 528 (555) T ss_pred HHHHHHHhcCChh----hhhcCCH---HHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHH---HHHHHH Confidence 5544332221111 1233344 33333332222121 11111 11111111111111111111100 000001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLN 646 (725) Q Consensus 618 ~qae~~kaqae~~k~q~ea~~~q~q~q~~ 646 (725) ..+.+..+...-.-. -....++..-.. T Consensus 529 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 555 (555) T protein:vir:10 529 TAAKLGSVDTSKQNA--LTDVTRAFSGYT 555 (555) T ss_pred HHHHhcccccCcchh--HHHHHhhhccCC Confidence 111111111000000 000000000000 No 63 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.68 E-value=3e-15 Score=100.17 Aligned_cols=428 Identities=10% Similarity=0.009 Sum_probs=198.4 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHH----HHHHhhcCCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWL----SQYTTLQYRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~----~~~l~~~grp~~N~i~~~v~~v~g~~~~nr 75 (725) |.++ .+.+.+++..+..- +....+-.+||+|+|.-... ...++ .-+.+.|..+-+|+..+++..-+ T Consensus 1 ~~~~~~~~i~~l~~~~~~~-------~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~-~~k~~~n~~~~ivd~~~~~l~~~- 71 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRL-------SSWHCCIEGYYEGSNRVRDLGVAIPPELQ-RVQTVVSWPGIAVDALEERLDWL- 71 (441) T ss_pred CCccHHHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCcchhcCcccchhhh-hhhhhcchHHHHHHHHHhhhccc- Confidence 7755 55666666655432 22334446999999864221 11111 22456799999999888765211 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) .+ +.+++.+ +..+++.|+++.....++.++++.|.||.-|..+ ++ |. ..++.. ++.+ T Consensus 72 -g~----~~~d~~~--------l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---~~--g~-~~i~~~----~p~~ 128 (441) T protein:vir:80 72 -GW----TNGDGYG--------LDGVYAANRLATASCDVHLDALIFGLSFVAIIPH---GD--GT-VSVRPQ----SPKN 128 (441) T ss_pred -cc----cCCChHH--------HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC---CC--Cc-eEEEEE----ccce Confidence 11 2233322 4556778999999999999999999999877532 22 22 233322 2333 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++||+...... ++.+++-.. .+... -.+.|+. T Consensus 129 ~~~i~d~~~~~~~------~~~~~~~~~-------------------------------~~~~~-~~~vy~~-------- 162 (441) T protein:vir:80 129 CTGKFSADGSRLD------AGLVVQQTC-------------------------------DPEVV-EAELLLP-------- 162 (441) T ss_pred EEEEEeCCCCcee------EEEEEEEEe-------------------------------cCceE-EEEEEec-------- Confidence 45787654322 111111100 00011 1122221 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) +.++.|.... .|..+..++.|.+.+.+|+|||... ... T Consensus 163 -------~~~~~~~~~~-------------------------------~~~~~~~~~~~~~~g~vPvv~~~n~----~~~ 200 (441) T protein:vir:80 163 -------DVIVQVERRG-------------------------------SREWVEVDRIPNVLGAVPLVPIVNR----RRT 200 (441) T ss_pred -------CeEEEEEEcC-------------------------------CcceeeccccccCCCceeEEEeecc----ccC Confidence 1111110000 0000112344555677888887543 222 Q ss_pred cccch---hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHH-HHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 314 EVYEG---VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFE-HMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 314 ~~~~G---~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+|| +.+.+++.++.+|+.+|.+...+...+.....+ .|.. +.+. +.+. ..............|. ... T Consensus 201 ~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i-~G~~~~~~~~~~~~---~~~~~i~~~~~~~~~~--~~~ 274 (441) T protein:vir:80 201 SRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWV-TGVSADEFSQPGWV---LSMASVWAVDKDDDGD--TPN 274 (441) T ss_pred CccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeee-ecCCccccccchhh---hcccccccCCCCCCCC--cce Confidence 33455 456789999999999999887776665543332 2211 1111 1111 0111111100001111 111 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) +...+.. ....+...+......+-.++++.+..+|..++. .||.|+......-.......-.-|..+++++.++++. T Consensus 275 ~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~- 352 (441) T protein:vir:80 275 VGSFPVN-SPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAK- 352 (441) T ss_pred eEecCcc-chHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 2222222 223444455555566666688888889877754 5999999877666666666666667777766665443 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) +++.. +.. . | .-.++.|.=.+..+.-..+..+.++.|.++-....+ T Consensus 353 ---~~~~~------~~~-~----------------------~--~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s 398 (441) T protein:vir:80 353 ---ALDSR------VDE-A----------------------D--FFGDVGLRWRDASTPTRAATADAVTKLVGAGILPAD 398 (441) T ss_pred ---HhcCC------Ccc-c----------------------c--cceeeeEEeCCCCCcCHHHHHHHHHHHHhcCccccc Confidence 33211 000 0 0 012444444545444445566666666654221111 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQN 627 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqa 627 (725) ...+...+...+ +++ +++.+ ..++++.+..+.... ...+. T Consensus 399 --~~~~~~~l~~~~----~e~-~~~~~------------------e~~e~~~~~~~~~~~---------------~~~~~ 438 (441) T protein:vir:80 399 --SRTVLEMLGLDD----VQV-EAVMR------------------HRAESSDPLAVLAGA---------------ISRQT 438 (441) T ss_pred --HHHHHHhCCCCH----HHH-HHHHH------------------HHHHHHHHHHHHhhh---------------hhccc Confidence 112222222211 111 11100 000000000000000 00000 Q ss_pred HHH Q lcl|NC_013059. 628 QTL 630 (725) Q Consensus 628 e~~ 630 (725) ++. T Consensus 439 ~~~ 441 (441) T protein:vir:80 439 NEV 441 (441) T ss_pred ccC Confidence 111 No 64 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.68 E-value=1.3e-15 Score=102.13 Aligned_cols=450 Identities=10% Similarity=0.031 Sum_probs=208.0 Q ss_pred CCcH-----HHHHHHHH-----------HHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHH---HHHHHhhcCCC---- Q lcl|NC_013059. 1 MADN-----KNRLESIL-----------SRFDADWTASDEARREAKNDLFFSRVSQ--WDDW---LSQYTTLQYRG---- 55 (725) Q Consensus 1 mad~-----~~~~~~~~-----------~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~---~~~~l~~~grp---- 55 (725) |+|- ...+.+++ +.+...+......+....+..+||.|.| +... ........++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccccccccccccee Confidence 7763 11222221 1222333333345566777899999976 1100 01111223333 Q ss_pred cccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC Q lcl|NC_013059. 56 QFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) Q Consensus 56 ~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~ 135 (725) ++|..+.+|+..+|+.-.+.+.+.+ ++.+..+.|. .+.+ |+++.....+.+++++.|.||..|.++ + T Consensus 81 ~~n~~k~ivd~~~~yl~g~p~~~~~-----~~~~~~~~l~----~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~ 147 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVANPVTFGV-----DNDKALKQIQ----HTLN-HKWDDKLVDILTAASNKGIEWVQPYVD---E 147 (478) T ss_pred ccchHHHHHHHHhhhhcccCceeec-----CChHHHHHHH----HHHh-ccHHHHHHHHHHHHhhCCeEEEEEEec---C Confidence 4799999999999999998877643 3333334333 3333 789999999999999999999888654 2 Q ss_pred CCCCCceeEEEEeeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccC Q lcl|NC_013059. 136 SPTSNNQVIRREPIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLT 213 (725) Q Consensus 136 ~~~~~~~~ir~~~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 213 (725) + + .+.+.+ .++.++ +||+.... +.. ++++.|-.. T Consensus 148 ~--~-~~~~~~----~~p~~~~~v~d~~~~~----~~~-~~ir~~~~~-------------------------------- 183 (478) T protein:vir:10 148 E--G-EFKTFR----VPAEQAVPIWTNKERD----ELQ-AFIRVYELD-------------------------------- 183 (478) T ss_pred C--C-ceEEEE----EcccceEEEEcCCCCC----ceE-EEEEEEeee-------------------------------- Confidence 2 1 233222 123332 34433211 111 122222110 Q ss_pred CCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCC Q lcl|NC_013059. 214 QDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI 293 (725) Q Consensus 214 ~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~ 293 (725) ...-+++|....+ ..+.+.+ |.+..... ...-......+.+..|. T Consensus 184 --~~~~~~~y~~~~i--~~~~~~~---~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~ 228 (478) T protein:vir:10 184 --GAERVEYWTKDDV--TFYELKE---GQLIPDFY----------------------------RSEDHIQPHYYQGNKLM 228 (478) T ss_pred --CceEEEEEeCCcE--EEEEecC---Ceeecccc----------------------------ccccccccceecccccc Confidence 0011233332211 1111111 11110000 00000112223344566 Q ss_pred CCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhcccccc Q lcl|NC_013059. 294 AGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYY 372 (725) Q Consensus 294 p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~ 372 (725) +.+.+|+|+|... ..+.|.+..+++.++.+|...|.+...+...+...+++ .|.. +...+......... . T Consensus 229 ~~g~vPvv~~~n~-------~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~-~ 299 (478) T protein:vir:10 229 SWGRVPFIPFKNN-------PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKYYK-A 299 (478) T ss_pred cCCcceEEEeccC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceee-ecCCcccccchhhhhhhCc-e Confidence 6677777776331 23567788999999999999999998887666554432 2221 11111111111110 1 Q ss_pred ccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 373 LLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDN 452 (725) Q Consensus 373 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn 452 (725) +... +. .++.++++....-..++...++...+.|-..|++.+.+.+..++..||+|+..+...........-.. T Consensus 300 ----~~~~-~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~ 373 (478) T protein:vir:10 300 ----ISVA-GE-SGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNK 373 (478) T ss_pred ----eEec-CC-CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 11 11224455444444556667888899999999987666665555679999998766666666666666 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHH Q lcl|NC_013059. 453 LATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) Q Consensus 453 ~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~ 532 (725) |..+++++.++++ +++.. + . |+ .||.|.-.+..+.--.+.. T Consensus 374 ~~~~l~~~~~li~----~~~~~---------~--~---------------------d~---~~i~i~f~~~~p~~~~e~~ 414 (478) T protein:vir:10 374 TLTALQELLQYII----DFYRL---------D--V---------------------RV---QDIEITFNFNVMVNELENS 414 (478) T ss_pred HHHHHHHHHHHHH----HHhCC---------C--c---------------------cc---ccceEEeCCCCCCCHHHHH Confidence 6666666555544 44321 0 0 01 2344445566654333344 Q ss_pred HHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_013059. 533 AEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQ 612 (725) Q Consensus 533 ~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~q 612 (725) +.++.+.+ ..|. ...+..++.. ...+..++++.+.........+. ..... T Consensus 415 ~~~~~~~g----~iS~--et~i~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~----------------------~~~~~ 464 (478) T protein:vir:10 415 QIAMNSTG----LLSK--ETILGNHSWV--QDPVAEMERIEQENIELNQQLPD----------------------IEEGL 464 (478) T ss_pred HHHHHHhC----CCCh--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhccc----------------------cCCCC Confidence 44444322 2221 1122222222 11223233332211110000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 613 GVLLQGQAELAKAQNQTLS 631 (725) Q Consensus 613 a~~~k~qae~~kaqae~~k 631 (725) ......+.+ ....+ T Consensus 465 ~d~~~~~~~-----d~~~e 478 (478) T protein:vir:10 465 NDEQQRQSE-----DNQSE 478 (478) T ss_pred cccccccCc-----CCCCC Confidence 000000000 00000 No 65 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.68 E-value=8.6e-16 Score=103.18 Aligned_cols=469 Identities=10% Similarity=0.029 Sum_probs=217.9 Q ss_pred CCcH----HHHHHH---------------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH-HH-------HhhcC Q lcl|NC_013059. 1 MADN----KNRLES---------------ILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS-QY-------TTLQY 53 (725) Q Consensus 1 mad~----~~~~~~---------------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~-~~-------l~~~g 53 (725) |||. +-.+.. ....+...++.. .+....+..+||.|+|.-.... .. ....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDT 78 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccc Confidence 5541 111111 111122222211 2345677789999988421111 11 11222 Q ss_pred C----CcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 54 R----GQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 54 r----p~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) + .++|..+.+|+..+|+...+.+.+. .+|.+..+ +++.+.+ |+++.....+.++++++|.||+.|. T Consensus 79 ~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~-----~~d~~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~ 148 (503) T protein:vir:59 79 KTNNRTSHAWHKLFVDQKTQYLVGEPVTFT-----SDNKTLLE----YVNELAD-DDFDDILNETVKNMSNKGIEYWHPF 148 (503) T ss_pred cccceeecchHHHHHHHHHhhhhcCCeeec-----cCcHHHHH----HHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEe Confidence 3 3579999999999999998887653 23444444 4444444 7899999999999999999998886 Q ss_pred eeeccCCCCCCceeEEEEeeecchhh--eeeCCCc-cccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhccc Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSH--VIWDSNS-KLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPND 206 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~--v~~Dp~a-~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 206 (725) ++ ++ | .+.++. .++.+ .+||+.. .++ .++++.|...+ T Consensus 149 ~d---~d--g-~~~i~~----~~p~~~~~i~d~~~~~~~------~~~ir~~~~~~------------------------ 188 (503) T protein:vir:59 149 VD---EE--G-EFDYVI----FPAEEMIVVYKDNTRRDI------LFALRYYSYKG------------------------ 188 (503) T ss_pred ec---CC--C-ceEEEE----EccceeEEEEeCCCCCce------EEEEEEEEEec------------------------ Confidence 54 22 2 233332 23333 3455543 211 12333332110 Q ss_pred ccccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecccc Q lcl|NC_013059. 207 WVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAV 286 (725) Q Consensus 207 ~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~ 286 (725) .+.+.+..+|+|..... ..+...+ ++..... . .. ... ...++ T Consensus 189 -----~~~~~~~~~evy~~~~i--~~~~~~~--~~~~~~~-~--------------~~--------~~~------~~~~~ 230 (503) T protein:vir:59 189 -----IMGEETQKAELYTDTHV--YYYEKID--GVYQMDY-S--------------YG--------ENN------PRPHM 230 (503) T ss_pred -----CCCceEEEEEEEeCCcE--EEEEEcC--Ccccccc-c--------------cc--------ccc------cccce Confidence 00123345666655322 1122111 1110000 0 00 000 01112 Q ss_pred ccCCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhh Q lcl|NC_013059. 287 LKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGN 366 (725) Q Consensus 287 l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 366 (725) ..+..|.+.+.+|+|+|... ..+.|.+..+++.++.+|..+|.+.+.+...+...+++.-...+...+..... T Consensus 231 ~~~~~~~~~~~vPiv~~~nn-------~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~ 303 (503) T protein:vir:59 231 TKGGQAIGWGRVPIIPFKNN-------EEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANL 303 (503) T ss_pred eecceeccCCccceEEecCC-------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhh Confidence 23445666677888776321 23567888999999999999999999887777766554321111111111111 Q ss_pred ccccccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHH Q lcl|NC_013059. 367 DDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLET 446 (725) Q Consensus 367 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~ 446 (725) .....+ ...++ ..+.++....-..+....++.....|-..+++.+.+.+..++..||.|+........... T Consensus 304 ~~~~~~-----~~~~~----~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~ 374 (503) T protein:vir:59 304 RYHSVI-----KVSGD----GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKA 374 (503) T ss_pred hcccce-----eccCC----CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHH Confidence 111111 11111 123444333223455667888888898888877765555456679999988776666665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchh Q lcl|NC_013059. 447 YVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQS 526 (725) Q Consensus 447 ~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t 526 (725) ...-..|..+++++.++++.++....... .....+|.|.-.+..+. T Consensus 375 ~~~~~~~~~~l~~~~~~i~~~~~~~~~~~----------------------------------~~~~~~i~i~f~~~~p~ 420 (503) T protein:vir:59 375 NMAERKIRAGLRLFFWFFAEYLRNTGKGD----------------------------------FNPDKELTMTFTRTRIQ 420 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcc----------------------------------cccccceeEEeCCCCCC Confidence 66666666666666665555543221110 00013455555566665 Q ss_pred HHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHH Q lcl|NC_013059. 527 MKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDP 606 (725) Q Consensus 527 ~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~ 606 (725) -..+..+.++.+..+ + ..|. ..++..++.. +..+.-++++.+.................... ... +... T Consensus 421 d~~~~~~~~~kl~~~-G-iiS~--et~l~~l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~----~~~~ 489 (503) T protein:vir:59 421 NDSEIVQSLVQGVTG-G-IMSK--ETAVARNPFV--QDPEEELARIEEEMNQYAEMQGNLLDDEGGDD-DLE----EDDP 489 (503) T ss_pred CHHHHHHHHHHHHhC-C-CCch--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhhccccCccCCCC-CCC----cCCC Confidence 555666666666543 1 1221 1222222222 11222233332211100000000000000000 000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 607 AMVQAQGVLLQGQAELAKAQNQTLS 631 (725) Q Consensus 607 ~~~~~qa~~~k~qae~~kaqae~~k 631 (725) .. .+.++..+.++- T Consensus 490 ~~-----------~~~~~~~~g~~~ 503 (503) T protein:vir:59 490 NA-----------GAAESGGAGQVS 503 (503) T ss_pred CC-----------CcccCCCCCCcC Confidence 00 000000000000 No 66 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.68 E-value=4.9e-14 Score=93.54 Aligned_cols=500 Identities=9% Similarity=-0.020 Sum_probs=239.1 Q ss_pred CCcHHHH-HHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhh----CC Q lcl|NC_013059. 1 MADNKNR-LESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQ----NP 75 (725) Q Consensus 1 mad~~~~-~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~----nr 75 (725) ||+-... -+.+..+|..-.+....|-..+.+..+|..-.=.+.+.-..-....++.-+.-...++.+.+.... ++ T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~~ 80 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQS 80 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCCCC Confidence 9985332 334555555544444455555666677764321111100011112234224444444444333322 33 Q ss_pred cceEEecCCc-------chHHHH------HHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCce Q lcl|NC_013059. 76 IDVLYRPKDG-------ASPDAA------DVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQ 142 (725) Q Consensus 76 ~~~~~~pr~~-------~d~~~A------e~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~ 142 (725) +=+++.+.+. ++...+ +..+..+......|++..+...+|.+.+..|.|+.-+ .+++.+... T Consensus 81 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~~~~~~~ 155 (522) T protein:vir:94 81 PWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYI-----PEPEQGTYS 155 (522) T ss_pred cccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEee-----eccCCCcee Confidence 4334443321 112222 2233344445567889999999999999999998533 234444333 Q ss_pred eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 143 VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 143 ~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) ..+..|+ .++++..++.- ..--++++..++.+.+.+.++... +.+ .....+.|.|+.+ T Consensus 156 ~~~~~pl----~~y~v~~d~~G----~vd~i~r~~~~~~~~l~~~~~~~~----------~~~----~~~p~~~v~v~~~ 213 (522) T protein:vir:94 156 PMRMYRL----VSYVVQRDAFG----NILQIVTIDKVAFSALPEDVKSQL----------NAD----DYEPDTELEVYTH 213 (522) T ss_pred eEEEEEc----ceEEEeeCCCc----CeEEEeeeeeccHHhcchHHHHHH----------hcc----cCCccceEEEEEE Confidence 4455554 34666554321 122356666777766544444311 000 0111356667666 Q ss_pred EEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEE Q lcl|NC_013059. 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVP 302 (725) Q Consensus 223 w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP 302 (725) ++++..+ .. .+.-..|..+....+-||++.+||+| T Consensus 214 v~~~~~~-------------~~--------------------------------~~~~~~g~~~~~~~~~~~~~e~P~~~ 248 (522) T protein:vir:94 214 IYRQDDE-------------YL--------------------------------RYEEVEGIEVTGTDGSYPLTACPYIP 248 (522) T ss_pred EEeeCCc-------------ee--------------------------------EEeeccCceecccCCCCccccCCcee Confidence 5553211 00 01112234343344668889999997 Q ss_pred EEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCc Q lcl|NC_013059. 303 VFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) Q Consensus 303 ~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) +-.. .++|..|+.|.+..+.+-.+.+|+.....+.....+.+.+++++++.+-...+.... .+ +..+....+ T Consensus 249 ~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~---~~---g~~v~g~~~ 320 (522) T protein:vir:94 249 VRMV--RLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKA---AT---GEFVAGRVE 320 (522) T ss_pred eeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheecc---CC---ceeecCCcc Confidence 7544 468999999999999999999999999999999999999999987655433222111 11 111111222 Q ss_pred cccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_013059. 383 EMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLAT-AMRRD 460 (725) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~ 460 (725) .+ .++ ....+.--......++...+.|....-+. +++. ++..+++.=|..+.+.....+..++.+|.. ...=+ T Consensus 321 ~v--~~~-~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pl 395 (522) T protein:vir:94 321 DI--NFL-QLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPI 395 (522) T ss_pred cc--eee-ecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH Confidence 21 111 11222222445667777788887776433 4553 334456666888888888888887777663 22222 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHH Q lcl|NC_013059. 461 GEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLG 540 (725) Q Consensus 461 g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~ 540 (725) .+..+.++ .+.|.- ++..+ .-+.+.+.. |-....|.+-++.+.++++ T Consensus 396 i~r~~~il-------------~r~g~l------P~~p~-------------~~v~v~~~s-~La~~qr~~~~~~l~~~~~ 442 (522) T protein:vir:94 396 VRVLMNQL-------------QSAGMI------PDLPK-------------EAVEPTVST-GLEALGRGQDLEKLTQAVN 442 (522) T ss_pred HHHHHHHH-------------HhcCCC------CCCCc-------------ccEEeeEec-HHHHHHHHHHHHHHHHHHH Confidence 23323322 222110 00000 013444433 3334567888888888888 Q ss_pred hcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_013059. 541 KTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQG 618 (725) Q Consensus 541 ~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~ 618 (725) .+....|+... ...|+ ++++..+..... +..+... +++.+++.++++++ +..+..+.+.. T Consensus 443 ~ia~l~P~~~~------~~id~---d~~~~~~a~~~Gv~~~~ivr~--~ee~~~~~~q~~~~----~~~~~~~~~~~--- 504 (522) T protein:vir:94 443 MMTGLQPLSQD------PDINL---PTLKLRLLNALGIDTAGLLLT--QDEKIQRMAEQSSQ----QAVVQGASAAG--- 504 (522) T ss_pred HHHhccchhhh------hcCCH---HHHHHHHHHHcCCChhhccCC--HHHHHHHHHHHHHH----HHHHHHHHHHH--- Confidence 77666564321 12233 344443333322 1222211 11111111111100 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 619 QAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNM 657 (725) Q Consensus 619 qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~ 657 (725) +...+...++.-+. ..++ T Consensus 505 ~~~~a~~~~~~~~~---------------------~~~~ 522 (522) T protein:vir:94 505 ANMGAAVGQGAGED---------------------MAQA 522 (522) T ss_pred HHhhhhhhcccchh---------------------hhcC Confidence 00000000000000 0000 No 67 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.68 E-value=2.5e-15 Score=100.61 Aligned_cols=448 Identities=10% Similarity=0.041 Sum_probs=208.7 Q ss_pred CCcH-----HHHHHHHH-----------HHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHH-------hhcCC--- Q lcl|NC_013059. 1 MADN-----KNRLESIL-----------SRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYT-------TLQYR--- 54 (725) Q Consensus 1 mad~-----~~~~~~~~-----------~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l-------~~~gr--- 54 (725) |+|- +..+.+.. +.+.+.+...........+..+||.|.| +- ..... ...++ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~-~i-~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHP-DI-LDAPPKRDVNGDYDETKPDW 78 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ch-hccccccccccccccccccc Confidence 8874 12222221 2223333333344556777899999975 11 11111 11222 Q ss_pred -CcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeec Q lcl|NC_013059. 55 -GQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE 133 (725) Q Consensus 55 -p~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~ 133 (725) .++|..+-+|+..+|+.-.+.+.+.+ ++.+..+.+..+ .+ |+++.....+.++++++|.||+-+..+ T Consensus 79 ki~~n~~~~ivd~~~~~l~g~~~~~~~-----~~d~~~~~l~~~----~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d-- 146 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYAVANPVTFGV-----DNDKALKQIQHT----LN-HKWDDKLVDILTAASNKGIEWVQPYVD-- 146 (478) T ss_pred eeccchHHHHHHHHHhhhccCCeeeec-----CChHHHHHHHHH----Hh-cCHHHHHHHHHHHHHhcCeEEEEEEec-- Confidence 35799999999999999887777643 333344444433 33 688999999999999999999887654 Q ss_pred cCCCCCCceeEEEEeeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc Q lcl|NC_013059. 134 DQSPTSNNQVIRREPIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW 211 (725) Q Consensus 134 ~~~~~~~~~~ir~~~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 211 (725) ++ | .+.+.+ .++..+ +||+.... +. .++++.|-.. T Consensus 147 -~~--g-~~~~~~----~~p~~~~~i~d~~~~~----~~-~~~v~~~~~~------------------------------ 183 (478) T protein:vir:10 147 -EE--G-EFKTFR----VPAEQAVPIWTNKERD----EL-QAFIRVYELD------------------------------ 183 (478) T ss_pred -CC--C-eeEEEE----EcccceEEEEcCCCCC----ce-EEEEEEEEec------------------------------ Confidence 22 1 233322 133332 34543211 11 1233322100 Q ss_pred cCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCC Q lcl|NC_013059. 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQ 291 (725) Q Consensus 212 ~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~ 291 (725) ....+++|..... ..+.+. +|.+....... ........+.+.. T Consensus 184 ----~~~~~~~y~~~~i--~~~~~~---~~~~~~~~~~~----------------------------~~~~~~~~~~~~~ 226 (478) T protein:vir:10 184 ----GAERVEYWTKDDV--TYYELK---EGQLIPDFYRS----------------------------DDHIQPHYYQGNK 226 (478) T ss_pred ----CceEEEEEeCCeE--EEEEEc---CCeeecccccc----------------------------ccccccceecccc Confidence 0011233332111 111111 11111000000 0000111122344 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhcccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYP 370 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~ 370 (725) |.+.+.+|+|+|.. ...+.|.+..+++.++.+|...|.+...+...+....++ .|.. +...+....... T Consensus 227 ~~~~~~vPvv~~~n-------~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~-~g~~~~~~~~~~~~~~~-- 296 (478) T protein:vir:10 227 LMSWGRVPFIPFKN-------NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKY-- 296 (478) T ss_pred cccCCccceEEecc-------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccchhhhhhhh-- Confidence 56666777776622 223568888999999999999999998887666554432 2221 111111111110 Q ss_pred ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQ 450 (725) Q Consensus 371 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~ 450 (725) ... +.. .|. .++.++++..+.-..+....++...+.|-..+++.+.+.+..++..||+|+..............- T Consensus 297 ~~~---~~~-~~~-~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~ 371 (478) T protein:vir:10 297 YKA---ISV-AGE-SGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLK 371 (478) T ss_pred cce---EEe-cCC-CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHH Confidence 000 111 111 112234444333345566788888999999999877676655566799999987666666656666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHH Q lcl|NC_013059. 451 DNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQ 530 (725) Q Consensus 451 dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~ 530 (725) ..|..+++++.++++. ++.. + . + . .+|.|.-.+..+.-..+ T Consensus 372 ~~~~~~l~~~~~li~~----~~g~---------~--~----------~-----------~---~~i~i~f~~~~p~d~~e 412 (478) T protein:vir:10 372 NKTLTALQELLQYIID----FYRL---------D--V----------K-----------V---QDIEITFNFNVMVNELE 412 (478) T ss_pred HHHHHHHHHHHHHHHH----HhCC---------C--c----------c-----------c---ccceEEecCCCCCCHHH Confidence 6666666665555544 4321 1 0 0 0 23444445555543344 Q ss_pred HHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHH Q lcl|NC_013059. 531 NRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQ 610 (725) Q Consensus 531 ~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~ 610 (725) ..+.++.+.+ ..|. ..++..++.. +..+.-++++.++......... ..........+.+.. T Consensus 413 ~a~~~~kl~g----~iS~--et~~~~l~~v--~D~~~E~~ri~~E~~~~~~~~~---~~~~~~~~~~~~~~~-------- 473 (478) T protein:vir:10 413 NSQIAMNSTG----LLSK--ETILSNHAWV--EDPVAEMERIEQENIELNQQLP---DIEEGLNGEQQRQSE-------- 473 (478) T ss_pred HHHHHHHHhC----CCCh--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcc---ccccccCCCCCCCCC-------- Confidence 4444444422 2232 1222222222 1222333333321110000000 000000000000000 Q ss_pred HHHHHHHHHHH Q lcl|NC_013059. 611 AQGVLLQGQAE 621 (725) Q Consensus 611 ~qa~~~k~qae 621 (725) -.+.| T Consensus 474 ------~~~~~ 478 (478) T protein:vir:10 474 ------NNQPE 478 (478) T ss_pred ------CCCCC Confidence 00000 No 68 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.68 E-value=2.2e-16 Score=106.48 Aligned_cols=485 Identities=10% Similarity=-0.000 Sum_probs=207.0 Q ss_pred CCc--HHHHHHHH----HHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHhhcCCC----cccchHHHHHHHHH Q lcl|NC_013059. 1 MAD--NKNRLESI----LSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS-QYTTLQYRG----QFDVVRPVVRKLVS 69 (725) Q Consensus 1 mad--~~~~~~~~----~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~-~~l~~~grp----~~N~i~~~v~~v~g 69 (725) ||= .++++..+ .......++.....+.+..+-.+||.|.| ..+ ...+..+++ ++|..+.+|+..+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~---~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~ 77 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQ---EIEKHEFDNATVEAANVMVNHAKYITDMNVG 77 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcccc---chhcCCcCcCCCCcceeecchHHHHHHHHhh Confidence 761 22233332 12222233333333445566789999976 222 112223333 56999999999999 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) +.-.+.+.+.+ . +.+..+ .+..+++.|+++.....++.+++++|.+|.-+..+- ++.+..... . T Consensus 78 ~l~g~p~~~~~--~---~~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~--~g~~~~~~~-~---- 141 (499) T protein:vir:10 78 FMTGNPVKYVA--E---KGKNID----DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKK--TDPISVRDE-L---- 141 (499) T ss_pred hhcccCceeec--C---ChhHHH----HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecc--ccccccccc-c---- Confidence 99988766553 2 222222 345567789999999999999999999997775442 121110000 0 Q ss_pred ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc-cCCCeEEEEEEEEEecc Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW-LTQDTIQIAEFYEVVEK 228 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~-~~~~~vrv~E~w~~~~~ 228 (725) .++.. +-.. . ++...+++. ..||-+.+....... .....+.... .....++.+++|.... T Consensus 142 --------~~~~~--~~~~--~--~~~~~v~p~---~~~~v~~d~~~~~~~-~~i~~~~~~~~~~~~~~~~~~iyt~~~- 202 (499) T protein:vir:10 142 --------GNEKL--TPNT--E--LKIEVIDPR---ATVVVCDDTVEHDPL-FAVFTQEKKDLEGNTNGYSITVYMPQR- 202 (499) T ss_pred --------ccccc--cccc--c--eEEEEEccc---ceEEEecCCCCcceE-EEEEEEEEeecCCCceEEEEEEEeCCe- Confidence 00000 0000 0 011111111 011111000000000 0000000000 0112333344444321 Q ss_pred eeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeee Q lcl|NC_013059. 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWG 308 (725) Q Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~ 308 (725) ++.+....++.. .+...+.+..|-+.+.+|+|+|.. T Consensus 203 ---i~~~~~~~~~~~--------------------------------------~~~~~~~~~~~~~~g~vPvv~~~n--- 238 (499) T protein:vir:10 203 ---IVEYRTKTTMEV--------------------------------------SANDPIVYDGENLFGAVPIIEFRN--- 238 (499) T ss_pred ---EEEEEecCCccc--------------------------------------cCcceecccccCCCCccceEEecC--- Confidence 111111100100 000011122333445666665522 Q ss_pred ccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 309 ~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ...+.|.+.++++.++.+|...|.+...+...+....++.-..++........ ...+... .+...+ ... T Consensus 239 ----~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~--~~~~~~~-~~~~~~----~~d 307 (499) T protein:vir:10 239 ----NEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQR--LKRGAIE-APPREE----GAD 307 (499) T ss_pred ----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhh--hhhccee-ccCCCC----CCc Confidence 12356889999999999999999999988777666555432111211111010 0111110 011111 122 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..++...++...+.|-.+|++.+.+.+.-++..||+|+..+...........-..|..+++++.++++.+ T Consensus 308 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~- 386 (499) T protein:vir:10 308 IEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTI- 386 (499) T ss_pred ceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 4455544444566677888889999999876555444344579999988776666665556666666665555555543 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) .. +.|.+ . |+ .+|.|.-.+..+.-..+..+.++.+.+.++. . T Consensus 387 ---~~------~~~~~--~---------------------d~---~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~---e 428 (499) T protein:vir:10 387 ---VN------IKGAN--D---------------------DA---SGCKISLVANIPSNLSDVVNNVKNADGIIPR---K 428 (499) T ss_pred ---Hh------ccCCc--c---------------------cc---ccceEEeCCCCCCCHHHHHHHHHHHhccCCh---H Confidence 32 11211 0 11 2445555666665444555555555332221 1 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLL--QGQAELAKAQ 626 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~--k~qae~~kaq 626 (725) .... .++..+- .+..++++.++.... .+..++.....-... ......+.... ........++ T Consensus 429 t~~~---~l~~v~d--~~~E~~ri~~E~~~~----------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 492 (499) T protein:vir:10 429 YTYS---WLPDVDN--PQDVIDEMNQQDAET----------IKKNQEALRGQDPDR-LELEDKQDDSSENDKEAGSNHNQ 492 (499) T ss_pred HHHH---hCCCCCC--HHHHHHHHHHHHHHH----------HHHHHhhhccCCCCC-CCCCCCCcccCCCCCCCcccccc Confidence 2222 2222221 222233332211000 000000000000000 00000000000 0000000000 Q ss_pred HHHHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQIDAA 637 (725) Q Consensus 627 ae~~k~q~ea~ 637 (725) ..+.++ . T Consensus 493 ~~~~~~----~ 499 (499) T protein:vir:10 493 SHRTRA----V 499 (499) T ss_pred CCCCCC----C Confidence 000000 0 No 69 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.67 E-value=6.2e-14 Score=93.00 Aligned_cols=514 Identities=10% Similarity=0.000 Sum_probs=237.1 Q ss_pred CCcHH---HHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCC----CCCHHHHHHHhhcCCCcccchHHHHHHHHHHHh- Q lcl|NC_013059. 1 MADNK---NRLESILSRFDADWTASDEARREAKNDLFFSRVS----QWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR- 72 (725) Q Consensus 1 mad~~---~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~----QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~- 72 (725) ||+.+ ..-+.+..+|..-.+....|-..+.+..+|..-. +++..- ....++.-..-...++.+.+... T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~----~~~~~~~dst~~~a~~~Laa~l~~ 76 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSS----TDYTTPWQAVGARGLNNLSAKVML 76 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCccc----ccccccccchHHHHHHHHHHHHHH Confidence 99832 2344455566665555556666666777777632 332211 11123322333334443333222 Q ss_pred ---hCCcceEEecCCcc-------h---HHHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC Q lcl|NC_013059. 73 ---QNPIDVLYRPKDGA-------S---PDAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) Q Consensus 73 ---~nr~~~~~~pr~~~-------d---~~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~ 136 (725) -.++=+++.+.+.. . ..+...| +..+......|++..+...+|.+.+..|.|+.-+ .++ T Consensus 77 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~-----~~~ 151 (543) T protein:vir:88 77 ALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYL-----PPP 151 (543) T ss_pred hhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeee-----ccC Confidence 22333344343211 1 1222233 2334444567889999999999999999998632 333 Q ss_pred CCCC-ce-eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCC Q lcl|NC_013059. 137 PTSN-NQ-VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQ 214 (725) Q Consensus 137 ~~~~-~~-~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (725) +... .. .++..|+ .++++..++.- ..--++++..|+...+.+.||+. ... .. ..... T Consensus 152 ~~~~~~~~~~~~~pl----~~y~v~~d~~G----~v~~i~r~~~~~~~~l~~~~~~~------v~~------~~-~~~p~ 210 (543) T protein:vir:88 152 DASSNSYNPMKLYTL----HNHVVQRDAFG----NVLQIVTLDKVAYAALPEDVRNS------LSG------GQ-EYKPE 210 (543) T ss_pred ccccceecceEEeEc----ceEEEeeCCCC----CeeeeeeeeeccHHHHhHHhhHH------HHH------Hh-hcCCc Confidence 2211 11 1233333 33444433321 12346677778888877666531 000 00 00112 Q ss_pred CeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCC Q lcl|NC_013059. 215 DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) Q Consensus 215 ~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p 294 (725) +.+.|+.+-+.++.+ +... + +.-+.|..+..+.+.|| T Consensus 211 ~~~~v~~~V~pr~~~-----------~~~~-------------------------------~-~~~~~~~~v~~~~~~~~ 247 (543) T protein:vir:88 211 QELEVYTHIYIDDES-----------GDFL-------------------------------S-YQEIEGVEVDGSDGQYP 247 (543) T ss_pred cceEEEEEEEeecCC-----------Cccc-------------------------------c-cccccCeeeecCCCccc Confidence 456665543332211 1110 0 00011222223346677 Q ss_pred CCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc Q lcl|NC_013059. 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) Q Consensus 295 ~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) ++.+||+|+-.. .++|..|+.|.+.+..+-.+.+|......+.....+.+.+++++++.+....+.. +...+ T Consensus 248 ~~e~P~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~------~~~~g 319 (543) T protein:vir:88 248 QDALPWIAVRWT--KRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLV------KAQTG 319 (543) T ss_pred cccCCceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc------cCCCc Confidence 788999977544 4689899999999999999999999999999999999999998877554332221 11111 Q ss_pred ccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 375 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) ..+....+.+. +++.-..+. -......++...+.|....= .+.+...++..+++.=|..+.+.....|...+.+|. T Consensus 320 ~~v~g~~~~v~--~~~~~~~~~-~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~ 395 (543) T protein:vir:88 320 DFVAGRKADIE--FLQLEKTAD-FTVAKSVADAIEARLSYVFM-LNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILS 395 (543) T ss_pred eeecCCCCcce--eeecccccc-hhHHHHHHHHHHHHHHHHHh-hhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHH Confidence 12222233221 122222222 34466777888888877662 233323445556677788888888888888877776 Q ss_pred H-HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHH Q lcl|NC_013059. 455 T-AMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRA 533 (725) Q Consensus 455 ~-~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~ 533 (725) . ...=+.+..++++.+ .|.- ++..+ +.+.+.+.. +-.+-.|.+.++ T Consensus 396 ~E~l~Pli~r~~~il~r-------------~g~l------P~~p~-------------~~v~~~~vs-~l~~l~r~~~~~ 442 (543) T protein:vir:88 396 QELQLPIVRVLLNQLQA-------------TQQI------PNLPQ-------------EAVEPTVTT-GAEALGRGQDLD 442 (543) T ss_pred HHHHHHHHHHHHHHHHh-------------cCCC------CCCch-------------hceeeeEEe-cHHHHHHHHHHH Confidence 3 332333333333322 2210 00000 112333322 333445778888 Q ss_pred HHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhh--hhhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ--MGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 534 ~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~--~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .+..+++.+....| +..++..|++ +++..+...... ..+... +++. ++.++++++++..+.++ T Consensus 443 ~l~~~~~~v~~~~~------p~vld~id~d---~~~~~~a~~~Gv~~~~i~r~--~~e~----~~~~~q~~~q~~~~~~~ 507 (543) T protein:vir:88 443 KLTQFLNAVATVSQ------LNGDPDLNVN---NIKLRLANAIGIDTAGLLLT--EAEK----AQAQSQEMLKQGGLNAA 507 (543) T ss_pred HHHHHHHHHHhccc------hhhhccCCHH---HHHHHHHHHhCCChhhhcCC--HHHH----HHHHHHHHHHHHHHHHH Confidence 88887777654443 1123344444 333333222211 122221 1111 11111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQS 663 (725) Q Consensus 612 qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~ 663 (725) .++.... + ....+. -.+..++ ...+ -.+ ..+...+- T Consensus 508 ~~~~~~~-~------~~~~~~----~~~~~~~-~~~~---~~~-~~p~~~~~ 543 (543) T protein:vir:88 508 AGIGSGV-A------AQATAS----PEAMESA-MDTA---GVQ-PGPIATQV 543 (543) T ss_pred HHHhhch-h------hhhccC----hHHHHHH-hhhc---CCC-CCCCCCCC Confidence 1100000 0 000000 0000000 0000 000 00000011 No 70 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.66 E-value=8.1e-16 Score=103.31 Aligned_cols=463 Identities=11% Similarity=0.021 Sum_probs=194.9 Q ss_pred CCcH-----HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHhhcCCCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADN-----KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTLQYRGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~-----~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q----W~~~~~~~l~~~grp~~N~i~~~v~~v~g~~ 71 (725) |+.. .+++.+|...+.. . +..-.+=.+||+|+| ++.......+ .-+.+.|..+-+|+.++... T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~----~---~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~a~~l 72 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFEN----K---QNELKSSKAYYDAERRPDAIGLAVPLDMR-KYLAHVGYPRTYVDAIAERQ 72 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHH----H---HHHHHHHHHHHhcccchhhcCcccchhhh-hhhhhcchHHHHHHHHHHhh Confidence 8743 3334444333322 2 122233368999986 2222112222 12345688888888877533 Q ss_pred hhCC----cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC---CCCCceeE Q lcl|NC_013059. 72 RQNP----IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS---PTSNNQVI 144 (725) Q Consensus 72 ~~nr----~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~---~~~~~~~i 144 (725) .-+- ....+..-..+|.+..+.+ ..+++.|+++.....+..+++++|.+|+-|..+..... ..+.+ .| T Consensus 73 ~~~Gf~~~~~~~~~~~~~~d~~~~~~l----~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~-~i 147 (488) T protein:vir:23 73 ELEGFRIPSANGEEPESGGENDPASEL----WDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP-LI 147 (488) T ss_pred hccceeccCCcccccccccchhHHHHH----HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc-eE Confidence 2210 0011111223455554444 45688999999999999999999999987765422111 11111 22 Q ss_pred EEEeeecchh--heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 145 RREPIHSACS--HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 145 r~~~~~~~~~--~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) +. .++. .++|||.... ...+.+++-+. +...+..+++ T Consensus 148 ~~----~~p~~~~~~~d~~~~~------~~~~~~~~~~~-------------------------------~~~~~~~~~~ 186 (488) T protein:vir:23 148 RV----EPPTALYAEVDPRTRK------VLYAIRAIYGA-------------------------------DGNEIVSATL 186 (488) T ss_pred EE----eccceeEEEEecCCCc------eEEEEEEEEec-------------------------------CCCcEEEEEE Confidence 22 1222 3567775332 22222222110 0111222333 Q ss_pred EEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEE Q lcl|NC_013059. 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVP 302 (725) Q Consensus 223 w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP 302 (725) |.... ++.+.. ..|.-.+.+..|.+.+.+|+|| T Consensus 187 y~~~~----~~~~~~-------------------------------------------~~~~~~~~~~~~h~~g~vPvv~ 219 (488) T protein:vir:23 187 YLPDT----TMTWLR-------------------------------------------AEGEWEAPTSTPHGLEMVPVIP 219 (488) T ss_pred EecCc----EEEEEe-------------------------------------------cCCceEeccccccCCCCcceEE Confidence 33211 001100 0011112234456667788888 Q ss_pred EEeeeeccCCccccchh--h-hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHH-Hhhccccccccccc Q lcl|NC_013059. 303 VFGEWGFVEDKEVYEGV--V-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMY-DGNDDYPYYLLNRT 377 (725) Q Consensus 303 ~~g~~~~~d~~~~~~G~--v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~-~~~~~~~~~~~~~~ 377 (725) |...+ ....+||. + +.+++.++.+|+.+|.+...+...+.....+ .|.. +.+...- .............. T Consensus 220 f~n~~----~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~v~ 294 (488) T protein:vir:23 220 ISNRT----RLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLI-FGAKPEELGINAETGQRMFDAYMARIL 294 (488) T ss_pred ecccc----ccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHH-hCCCcccccccccccchhhhhhhhhhc Confidence 75432 22234553 3 4678999999999999887665444332211 1110 1110000 00000000000001 Q ss_pred cccCccccccCCcccCCCC-chHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 378 DENNGEMPTQPLAYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLAT 455 (725) Q Consensus 378 ~~~~g~~~~~~~~~~~~~~-~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~ 455 (725) ...+|. .+...+.+. -...+...+......+-.++|+++..+|..+ |..||.|+......-.......-..|.. T Consensus 295 ~~~~g~----~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~ 370 (488) T protein:vir:23 295 AFEGGE----GAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGG 370 (488) T ss_pred cCCCCC----CceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111221 122222222 1234455555555556667888889888654 5579999988777766666666677777 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHH Q lcl|NC_013059. 456 AMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEI 535 (725) Q Consensus 456 ~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l 535 (725) +++++.++++.+. +.. ++. .|+ .++.|.=.+..+....+..+.+ T Consensus 371 ~l~~~~~l~~~~~----~~~--------~~~---------------------~~~---~~i~v~f~~~~~~s~~~~ada~ 414 (488) T protein:vir:23 371 AWEQAMRLAYKMV----KGG--------DIP---------------------TEY---YRMETVWRDPSTPTYAAKADAA 414 (488) T ss_pred HHHHHHHHHHHHh----cCC--------Ccc---------------------hhh---ccceEEecCCCCCCHHHHHHHH Confidence 7777777665432 110 000 011 1233333333322244556666 Q ss_pred HHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 536 LELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 536 ~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) .+|.+......|. ..+...++..+-+ .+++ +++.++. ..+. ..+........+.. T Consensus 415 ~kl~~~g~~~~s~--et~~~~l~~~~d~-~~~~-~~~~~~~-------------------~~~~-~~~~~~~~~~~~~~- 469 (488) T protein:vir:23 415 AKLFANGAGLIPR--ERGWVDMGYTIVE-REQM-RQWLEQD-------------------QKQG-LGLIGSLYGASTPE- 469 (488) T ss_pred HHHHhcccccCCH--HHHHHhCCCCchH-HHHH-HHHHHHH-------------------HHHH-HHHHHHHhccCCCc- Confidence 6665543222232 1222222221111 1111 1110000 0000 00000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEA 641 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~ 641 (725) .....+.+........+. | T Consensus 470 ----~~~~~~~~~~~~~~e~~~---a 488 (488) T protein:vir:23 470 ----GKPGEAPVGEPPAPEPDA---A 488 (488) T ss_pred ----ccCCCCCCCCCCCCCCCC---C Confidence 000000000000000000 0 No 71 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.65 E-value=1.8e-14 Score=95.88 Aligned_cols=435 Identities=9% Similarity=0.033 Sum_probs=203.1 Q ss_pred CCc-------------------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCHHHHH---HHhhcC--- Q lcl|NC_013059. 1 MAD-------------------NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ--WDDWLSQ---YTTLQY--- 53 (725) Q Consensus 1 mad-------------------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q--W~~~~~~---~l~~~g--- 53 (725) |++ ...+..+++..+.... ...+....+..+||.|.| +...... ...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~---~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKH---KENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPD 77 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHH---HHHHHHHHHHHHHhcCCCccccccccccccccccccccc Confidence 433 2233334444443332 233445677789999986 1110000 011112 Q ss_pred -CCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeee Q lcl|NC_013059. 54 -RGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDY 132 (725) Q Consensus 54 -rp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~ 132 (725) +.++|..+.+|+..+|+.-.+.+.+.+ +|.+..+.+..+ .+ |+++.....+..++.++|.||..|..+ T Consensus 78 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~-----~d~~~~~~l~~~----~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d- 146 (468) T protein:vir:96 78 WRMYTNYHQNLVDQKVAYAVANPVTYGT-----EDEKSLKTIQEV----LN-HKWDDKLVDILTAASNKGVEWIQPYVD- 146 (468) T ss_pred cccccchHHHHHHHHHhhhccCCceecc-----CChHHHHHHHHH----Hh-cCHHHHHHHHHHHHhhcCeEEEEEEEc- Confidence 345799999999999999888777643 233333444333 33 688888999999999999999887654 Q ss_pred ccCCCCCCceeEEEEeeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhccccccc Q lcl|NC_013059. 133 EDQSPTSNNQVIRREPIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFP 210 (725) Q Consensus 133 ~~~~~~~~~~~ir~~~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 210 (725) ++ + .+.+.. .++.++ +||+.... +. ..+++.|...+ T Consensus 147 --~~--~-~~~i~~----~~p~~~~~v~~~~~~~----~~-~~~ir~~~~~~---------------------------- 184 (468) T protein:vir:96 147 --EQ--G-EFKTFR----VPAEQAIPIWTNKERD----EL-KAFIRLYELDG---------------------------- 184 (468) T ss_pred --CC--C-ceEEEE----EcccceEEEEcCCCCC----ce-EEEEEEEEecC---------------------------- Confidence 22 1 233322 133333 34433211 11 12232221000 Q ss_pred ccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCC Q lcl|NC_013059. 211 WLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDK 290 (725) Q Consensus 211 ~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~ 290 (725) ..-+++|+.... ..+... ++.......... .......+.+. T Consensus 185 ------~~~~~~~~~~~~--~~~~~~---~~~~~~~~~~~~----------------------------~~~~~~~~~~~ 225 (468) T protein:vir:96 185 ------GERVEYWTANDV--TFYELK---DGQLIPDYYQGE----------------------------EHVQAHYYVGN 225 (468) T ss_pred ------ceEEEEEeCCeE--EEEEEc---CCceeecccccc----------------------------cccccceeecc Confidence 001233332111 111111 121111000000 00011122344 Q ss_pred CCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccc Q lcl|NC_013059. 291 QLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDY 369 (725) Q Consensus 291 ~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~ 369 (725) .|.+.+.+|+|+|.. ...+.|.+..+++.++.+|...|.+...+...+...+++. |.. +............ T Consensus 226 ~~~~~~~iPvv~~~n-------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~~~~~~~~~ 297 (468) T protein:vir:96 226 KSMSWNRVPFIPFKN-------NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLK-GYEGEDLEEFMYNLKYY 297 (468) T ss_pred ccccCCcccEEEecC-------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCccccchhhhhhhcC Confidence 566777788887632 1235688889999999999999999988877666554432 221 1111111111111 Q ss_pred cccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 370 PYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVF 449 (725) Q Consensus 370 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~ 449 (725) ..+. +...+ .+.++++..+.-..+....++...+.|-..+++.+.+.+..++..||+|+..+........... T Consensus 298 ~~i~---~~~d~----~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k 370 (468) T protein:vir:96 298 KAIN---VDGDG----SGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKL 370 (468) T ss_pred ceEE---ecCCC----CCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 0010 11111 1224555544444566667888899999999987666555555679999987766666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHH Q lcl|NC_013059. 450 QDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQ 529 (725) Q Consensus 450 ~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~ 529 (725) -..|..+++++.++++. ++.. +. + . .+|.|.-.++.+.-.. T Consensus 371 ~~~~~~~l~~~~~li~~----~~g~---------~~------------d-----------~---~~i~i~f~~~~p~d~~ 411 (468) T protein:vir:96 371 KNKTLTALQELLQYIID----FYKL---------SI------------K-----------V---QDVEITFNFNVMVNEL 411 (468) T ss_pred HHHHHHHHHHHHHHHHH----HhCC---------Cc------------c-----------c---ceeeEEecCCCCcCHH Confidence 66666666666555444 4321 10 0 0 2333333444444332 Q ss_pred HHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHH--HhhHHHH Q lcl|NC_013059. 530 QNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAK--QGQQDPA 607 (725) Q Consensus 530 ~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~q--q~q~q~~ 607 (725) +..+.+.+ .+ ..+. ...+..++..+ ..++-++++.+.... ..+.+..- ...-++- T Consensus 412 e~a~~~~~----~g-~iS~--et~i~~l~~v~--D~~~E~~ri~~E~~~--------------~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 412 EQSQIGVN----SQ-YLSK--ETVVTNHPWVD--DPVAEMERIDQEELA--------------LPSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHh----cC-CCch--HHHHHhCCCCC--CHHHHHHHHHHHHHH--------------HHHHhhccCCCCCCCCC Confidence 33333222 21 1221 11222222221 112222222211000 00000000 0000000 No 72 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.65 E-value=1e-13 Score=91.85 Aligned_cols=521 Identities=10% Similarity=0.012 Sum_probs=235.3 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHhh-CCcce Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMRQ-NPIDV 78 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~~-nr~~~ 78 (725) .++.+...++.++... ..|-..+.+..+|..-.=.+.+.-..-....++.-+.-...++.+.+ .-.. +++=+ T Consensus 1 m~~~~~~r~~~l~~~R---~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF 77 (555) T protein:vir:17 1 MKHSAQAKYMMLRADR---EDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFF 77 (555) T ss_pred ChhHHHHHHHHHHHHh---hHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 5555555555555554 34444455556776432111100000001122322444444444433 2222 56666 Q ss_pred EEecCCcc------h-HH---HHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEE Q lcl|NC_013059. 79 LYRPKDGA------S-PD---AADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) Q Consensus 79 ~~~pr~~~------d-~~---~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir 145 (725) ++.+.+++ + +. +.+. .+..+......|++..+...+|.+.+..|.|++ |.+++++ + T Consensus 78 ~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----y~~~~~~------~ 146 (555) T protein:vir:17 78 KLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL-----YQGKKNL------K 146 (555) T ss_pred ccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE-----EecCCce------e Confidence 77776532 1 11 3333 333455556678899999999999999999985 4445543 4 Q ss_pred EEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccc---------cccCCCe Q lcl|NC_013059. 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVF---------PWLTQDT 216 (725) Q Consensus 146 ~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~---------~~~~~~~ 216 (725) ..|+ .++++..+..- ...-+++..+|+...+.+.|++...............+..+ .+..+.. T Consensus 147 ~~pl----~~y~v~~d~~G----~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 218 (555) T protein:vir:17 147 LYPL----DRFVVSRDGEG----NVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSND 218 (555) T ss_pred EEEc----CeEEEeeCCCc----CeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcc Confidence 4444 33555554432 23348889999999999999864321111000000000000 0000011 Q ss_pred EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccc-cCCCCCCC Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL-KDKQLIAG 295 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l-~~~~~~p~ 295 (725) +.|-.++.+. . -+++|+.-....++ ...+.+++ T Consensus 219 ~~v~t~~~~~--------------------------------------------~--~~~~~~~e~~~~~v~~~l~e~g~ 252 (555) T protein:vir:17 219 ALVYTYVCRK--------------------------------------------D--GQVKWHQECDGKVIPGSNSSAPY 252 (555) T ss_pred eeEeeccccc--------------------------------------------C--CeeEEEEecCceeccccccccCc Confidence 1110000000 0 02333333233232 11356777 Q ss_pred CccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccc Q lcl|NC_013059. 296 EHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) Q Consensus 296 ~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 375 (725) ..+||+|+-.. .++|..|+.|.+.++.+-.+.+|......+.....+.+.+++++++.+....+. .+..... T Consensus 253 ~e~P~i~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l------~~~~~g~ 324 (555) T protein:vir:17 253 THNPWIPLRFN--IVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNL------ALAANGA 324 (555) T ss_pred ccCCeeeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCccee------ecCCCce Confidence 89999977544 468999999999999999999999999999999999999999977654332211 1111111 Q ss_pred cccccCccccccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 376 RTDENNGEMPTQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNL 453 (725) Q Consensus 376 ~~~~~~g~~~~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~ 453 (725) .+....+ .++.++ .+.--.....+++...+.|.+..-+. .-.++..+++.=|..+.+.....|...+.+| T Consensus 325 v~~g~~~-----~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~---~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl 396 (555) T protein:vir:17 325 IIQGRPD-----DVSVVQANKAADFRTVLEMIQKLEQRISDAFLML---QVRQSERTTATEVQATVQELNEQIGGIYSNL 396 (555) T ss_pred eecCCcc-----cceeeeccccchhhHHHHHHHHHHHHHHHHHhhc---CCCCcccchHHHHHHHHHHHHHHHhHHHHHH Confidence 1111111 122222 11111234455666666666554321 1123344566668888888888888888887 Q ss_pred H-HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHH Q lcl|NC_013059. 454 A-TAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) Q Consensus 454 ~-~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~ 532 (725) . ....=+.+..+.++.+. |. |. +.. .++ ..+.+..+. ....|++.+ T Consensus 397 ~~E~L~Pli~R~~~il~r~-------------g~-----lP-~~p----------~~~---v~~~i~~~l-~~l~r~~~~ 443 (555) T protein:vir:17 397 TTELLQPYLARKLHLLQKQ-------------RK-----LP-QLP----------KDL---VQPTVVAGL-WGVGRGQDK 443 (555) T ss_pred HHHHHHHHHHHHHHHHHhC-------------CC-----CC-CCC----------Hhh---hccceeehH-HHHHHHHHH Confidence 6 33333444444444332 11 00 000 011 122333332 334577888 Q ss_pred HHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHH Q lcl|NC_013059. 533 AEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQ 610 (725) Q Consensus 533 ~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~ 610 (725) +.+.++++.+.+..+. ...++..|+ ++++..+..... +..+.. ++++..++.+++++++++ .+... T Consensus 444 ~~l~~~~~~laq~~~~-----p~~~d~id~---d~~~~~~a~~~Gv~p~~ivr--s~eev~~~rq~~~~~~~q--~~~~~ 511 (555) T protein:vir:17 444 QQLMEFITTLAQTMGP-----EIAMKYINP---TEFIKRLAAAQGIDTLQLIN--SPETMKQLGDQQKQDMVQ--ASLIN 511 (555) T ss_pred HHHHHHHHHHHhhcCc-----hhHhhcCCH---HHHHHHHHHHcCCChhhhcC--CHHHHHHHHHHHHHHHHH--HHHHH Confidence 8888888776554321 112233343 333333322221 111211 222222222111111111 10110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_013059. 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAA-RIAEIFNNMDL 659 (725) Q Consensus 611 ~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a-~~~~~~~q~~~ 659 (725) +.+.+.. . +.++++-.+......+++....++ +.+-..+.+.. T Consensus 512 qa~~~~~----~--~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 512 QAGQLAK----T--PMAEQAMQLIQQQQEGAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred HHHHHHh----h--hhhhhHHhccccchhhhhHHHHHHhhcCCcccccCC Confidence 0010000 0 000010010111111111111100 00000000000 No 73 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.65 E-value=9.1e-15 Score=97.56 Aligned_cols=444 Identities=9% Similarity=-0.002 Sum_probs=205.1 Q ss_pred CCcH-----HHHHHHHH-----------HHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHh-------hcC---- Q lcl|NC_013059. 1 MADN-----KNRLESIL-----------SRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTT-------LQY---- 53 (725) Q Consensus 1 mad~-----~~~~~~~~-----------~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~-------~~g---- 53 (725) |||- +..+.++. ..+...++..........+-.+||.|+| +-.....+ ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~--~i~~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDP--DVLRLAPKLDNKGEIDPLKPDW 78 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCC--cchhccchhcccccccccccch Confidence 6653 22222222 1222223322334556677889999986 11111111 112 Q ss_pred CCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeec Q lcl|NC_013059. 54 RGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE 133 (725) Q Consensus 54 rp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~ 133 (725) |.++|..+.+|+..+|+.-.+.+.+.+ +|.+..+.|+.++ + ++..........++.++|.||..+..+ T Consensus 79 ki~~n~~~~Ivd~~~~~l~g~p~~~~~-----~d~~~~~~l~~~~----~-n~~~~~~~~~~~~~~~~G~~~~~~y~d-- 146 (474) T protein:vir:96 79 RMFTNYHQNLVDQKVAYAVANPVTFSS-----DDDKSLKTIQEVL----N-HKWDDKLVDILTAASNKGIEWLQPYID-- 146 (474) T ss_pred hcccchHHHHHHhhhhhhcccCceeec-----CchHHHHHHHHHH----h-cCHHHHHHHHHHHHHhcCeeEEEEEec-- Confidence 235699999999999999888876643 3444445444433 3 678888889999999999999877643 Q ss_pred cCCCCCCceeEEEEeeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc Q lcl|NC_013059. 134 DQSPTSNNQVIRREPIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW 211 (725) Q Consensus 134 ~~~~~~~~~~ir~~~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 211 (725) ++ + .+.+.+ .+++.+ +||+.... + ..++++.|-.. . T Consensus 147 -~~--~-~~~i~~----~~p~~~~~v~d~~~~~----~-~~~~vr~~~~~-----------~------------------ 184 (474) T protein:vir:96 147 -EN--G-EFKTFR----VPAEQAIPIWTNKERD----T-LKAFIRYYRLD-----------G------------------ 184 (474) T ss_pred -CC--C-ceEEEE----EcccceEEEEcCCCCC----c-eEEEEEEEeec-----------C------------------ Confidence 22 1 233322 233333 35543221 1 12333333110 0 Q ss_pred cCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCC Q lcl|NC_013059. 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQ 291 (725) Q Consensus 212 ~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~ 291 (725) ..-.++|....+ ..+.+.+ |.... .... .......+.+.+.. T Consensus 185 -----~~~~~~yt~~~v--~~~~~~~---~~~~~-~~~~---------------------------~~~~~~~~~~~~~~ 226 (474) T protein:vir:96 185 -----AERVEYWTDSDV--TYYEYQD---GILIP-DYYH---------------------------GEEHIQSHYYVGNK 226 (474) T ss_pred -----ceEEEEEeCCeE--EEEEecC---Cceee-cccc---------------------------cccccccccccccc Confidence 001222322111 1111111 11000 0000 00000111222345 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhcccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYP 370 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~ 370 (725) |.+.+.+|+|+|... ..+.|.+..+++.++.+|...|.+...+...+...+++ .|.. ++..+...... . T Consensus 227 ~~~~g~iPvv~~~nn-------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~--~ 296 (474) T protein:vir:96 227 RVSWGRVPFIPFKNN-------PQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYIL-KGYEGQDLDEFMRNLK--Y 296 (474) T ss_pred ccCCCceeEEEeccC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccccchhhhhh--c Confidence 666677888776332 23568888999999999999999998887777665443 2321 11111111100 1 Q ss_pred ccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQ 450 (725) Q Consensus 371 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~ 450 (725) +.. +.. ++ .++.++++..+.-..+....++...+.|-..|++.+.+.+..++..||+|+..+...........- T Consensus 297 ~~~---i~~-~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 370 (474) T protein:vir:96 297 YKA---INV-DG--DGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLK 370 (474) T ss_pred Cce---EEe-cC--CCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 111 111 11 012245555444445666778899999999999877666655566799999887766666656666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHH Q lcl|NC_013059. 451 DNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQ 530 (725) Q Consensus 451 dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~ 530 (725) .-|..+++++.++++. ++. .+.. + .+|.|.-.+..+.--.+ T Consensus 371 ~~~~~~l~~~~~~i~~----~~~---------~~~~--~------------------------~~i~i~f~~~~p~~~~e 411 (474) T protein:vir:96 371 NKTLTALQELLQYIID----FYK---------LNIK--V------------------------QDVEITFNFNVMVNELE 411 (474) T ss_pred HHHHHHHHHHHHHHHH----HhC---------CCcc--c------------------------ceeeEEeccCCCcCHHH Confidence 6666777666655554 331 1100 0 12333334444432222 Q ss_pred HHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHH Q lcl|NC_013059. 531 NRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQ 610 (725) Q Consensus 531 ~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~ 610 (725) ..+. +.+.+ ..+. ..++..++.. +..+.-++++.+....... ...+..... . T Consensus 412 ~~~~----~~~ag-~iS~--et~~~~~~~v--~d~~~E~~ri~~E~~e~~~--~~~~~~~~~---------------~-- 463 (474) T protein:vir:96 412 QSQI----GVQSQ-YLSK--ETVVTNHPWV--DDPVAELERIEQDNIDFNK--QLPPLEGDA---------------N-- 463 (474) T ss_pred HHHH----HHhcC-CCch--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHh--ccccccccc---------------c-- Confidence 2222 22222 1221 1222222221 1222333333221100000 000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 611 AQGVLLQGQAELAKAQNQ 628 (725) Q Consensus 611 ~qa~~~k~qae~~kaqae 628 (725) ....+..+ ++. T Consensus 464 ------~~~~d~~~-e~~ 474 (474) T protein:vir:96 464 ------GRAQDNES-ETN 474 (474) T ss_pred ------cccCCCcc-cCC Confidence 00000000 011 No 74 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.64 E-value=1.6e-14 Score=96.26 Aligned_cols=441 Identities=8% Similarity=0.020 Sum_probs=204.1 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH------HHHhhcC----CCcccchHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS------QYTTLQY----RGQFDVVRPVVRKLVSE 70 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~------~~l~~~g----rp~~N~i~~~v~~v~g~ 70 (725) +-++..+..+++..+..... ..+....+..+||.|.| +-..+ ......+ +.++|..+.+|+..+|+ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~---~~~~~~~~~~~Yy~g~~-~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~ 96 (474) T protein:vir:95 21 LKPQFETQEEMIIRLIDDHR---KQLDKITVGQRYYDKDN-DIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSY 96 (474) T ss_pred hhhccCChHHHHHHHHHHHH---HHHHHHHHHHHHhcccC-chhccccccccccccccccccceeccchHHHHHHHHHhh Confidence 33333333444444433322 23344566789999976 21100 0011122 23579999999999999 Q ss_pred HhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeee Q lcl|NC_013059. 71 MRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) Q Consensus 71 ~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~ 150 (725) .-.+.+.+.+ +|.+.- .+++.+.+ |+++.....+.+++.++|.||..+..+ ++ + .+.+.+ T Consensus 97 l~g~p~~~~~-----~d~~~~----~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d---~~--~-~~~i~~---- 156 (474) T protein:vir:95 97 VASKPVTYSC-----EDESVL----KIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYIN---EN--G-EMKLFR---- 156 (474) T ss_pred hccCCceecc-----CchHHH----HHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEec---CC--C-ceEEEE---- Confidence 9888876642 333333 34455554 679999999999999999999877543 22 1 233332 Q ss_pred cchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecc Q lcl|NC_013059. 151 SACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) Q Consensus 151 ~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~ 228 (725) .++.++ +||+.... + -..+++.|-.. ....+++|....+ T Consensus 157 ~~p~~~~~v~d~~~~~----~-~~~~i~~~~~~----------------------------------~~~~~~~y~~~~~ 197 (474) T protein:vir:95 157 VPAEQAIPIWVDKERE----E-LKSFIRYYKFN----------------------------------NEEKVEFWTDTTV 197 (474) T ss_pred EcccceEEEEcCCCCC----c-eEEEEEEEEEc----------------------------------CeeEEEEEeCCeE Confidence 123333 34443211 1 11222222110 0012334433211 Q ss_pred eeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeee Q lcl|NC_013059. 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWG 308 (725) Q Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~ 308 (725) ..+.+.+ +....... .....+.....+.+.+.+|+|||+.. T Consensus 198 --~~~~~~~---~~~~~~~~--------------------------------~~~~~~~~~~~~~~~g~iPvv~~~nn-- 238 (474) T protein:vir:95 198 --TYYVLEN---GGLIPDYY--------------------------------YGANHIQSHFSNGNWGRVPFIAFKNN-- 238 (474) T ss_pred --EEEEEcC---Cccccccc--------------------------------cCcccccccccccCCCccceEeecCC-- Confidence 1111111 11100000 00111111223445566777776432 Q ss_pred ccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 309 ~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+.|.+..+++.++.+|...|.+...+...+...+++.-...+........... +. .+...++ .. T Consensus 239 -----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~--~~---~i~~~~~----~~ 304 (474) T protein:vir:95 239 -----PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKY--YK---AINVDGD----GG 304 (474) T ss_pred -----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc--cc---eeeccCC----Cc Confidence 1256788899999999999999998888766665544332111111111111111 11 1111111 12 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ++++..+.-..++...++.....|-..+++.+.+.+..+++.||+|+..+...........-..|..+++++.++++++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~- 383 (474) T protein:vir:95 305 VETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDF- 383 (474) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 3444433333555667888889999999987766665556679999988776666666666666777776666665553 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) .. .+ .. -.++.|.-.++.+..-.+..+.+.+ .+ ..|. T Consensus 384 ---~g---------~~--~d------------------------~~~i~v~f~~~~p~d~~e~a~~~~~----~g-~iS~ 420 (474) T protein:vir:95 384 ---NN---------LK--MD------------------------VKDIEISFNFNRMMNDAEQSQIIAQ----SQ-YLSR 420 (474) T ss_pred ---hC---------CC--cc------------------------cceeeEEeccCCCcCHHHHHHHHHh----cC-CCch Confidence 21 10 00 0233333444554432333333333 22 2231 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhh-hHHHHHHHHHHHhhHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEE-QQWFVEAQQAKQGQQDPA 607 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~-~~~~~q~~q~qq~q~q~~ 607 (725) ...+..++.. +..++..+++.+.........+..... ..... +.++-...+.+ T Consensus 421 --et~i~~l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~--~~~~~~~~~~~ 474 (474) T protein:vir:95 421 --ETLVKSSPLV--DDYKAELERIEQEQMEYNKQLPNLDDGGADGAQ--QQERSNDKESE 474 (474) T ss_pred --HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhcccccccccCCCCc--CCCCCccCCCC Confidence 1122222222 112223333322111000000000000 00000 00000000000 No 75 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.63 E-value=4.1e-15 Score=99.46 Aligned_cols=466 Identities=11% Similarity=-0.007 Sum_probs=199.8 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q----W~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |.-..+++.++...+.+ .+.+..+-.+||+|+| ++......++ .-+.+.|..+-+|+..+|+..- T Consensus 1 ~~t~~d~i~~L~~~~~~-------~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l~~--- 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLAR-------DLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDI--- 69 (480) T ss_pred CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccccchhhh-hhhhhcchHHHHHHHHHhhhcc--- Confidence 99888888888776644 2334455578999986 2111111111 1234679999999999987632 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeee-ccCCCCCCceeEEEEeeecchh- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDY-EDQSPTSNNQVIRREPIHSACS- 154 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~-~~~~~~~~~~~ir~~~~~~~~~- 154 (725) +--+.| +|.+. +..+..+++.|+++..+..++.++++.|.||+-|...- ...|.-+ ...|+.. ++. T Consensus 70 ~g~~~~---~d~~~----~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~-~~~i~~~----~p~~ 137 (480) T protein:vir:78 70 EGFRIS---EDSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG-IPLIRVE----SPLY 137 (480) T ss_pred CceecC---CCchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCC-eeEEEEE----cccc Confidence 211222 33332 34456677889999999999999999999998774210 0111122 2223322 222 Q ss_pred -heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 155 -HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 155 -~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) .++|||..... -. ++++.+...+ +.+.+...++|..... ..+ T Consensus 138 ~~~i~D~~~~~~----~~-~~i~~~~~~d------------------------------~~~~~~~~~~y~~~~~--~~~ 180 (480) T protein:vir:78 138 MYAELDPRNTRR----VT-RAVRLYTTRD------------------------------DVAVPDRATLYLPDET--VPL 180 (480) T ss_pred eEEEEcCCCccc----eE-EEEEEEEeec------------------------------CCcceEEEEEEeCCeE--EEE Confidence 36788764321 01 1222221110 1112233444433111 001 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) ...+ +... +..+..++.|.+.+.+|+|||... ... T Consensus 181 ~~~~---~~~~--------------------------------------~~~~~~~~~~~~~g~vPvv~f~n~----~~~ 215 (480) T protein:vir:78 181 RRNG---GLND--------------------------------------QWVVDGDVIKHGLGVVPVVPLTND----PRL 215 (480) T ss_pred EecC---CCcc--------------------------------------cccccccccccCCCCcceEEeecc----ccc Confidence 1000 0000 000011223334456777777433 222 Q ss_pred cccchh--h-hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccCC Q lcl|NC_013059. 314 EVYEGV--V-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) Q Consensus 314 ~~~~G~--v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 389 (725) ..+||. + +.+++.++.+|+.+|.+...+...+.....+ .|.. +.+... ........... .+....| ..+ T Consensus 216 ~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~-~~~~~~~~~~~-~~~~~~~----~~~ 288 (480) T protein:vir:78 216 GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGVTTDELTND-GENTTLDIYYG-RILTLAS----EAA 288 (480) T ss_pred CCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hCCCccccccc-cccchhhhhhh-hhccCCC----CCc Confidence 334543 3 4689999999999999887766554433222 1211 111000 00001111111 1111111 112 Q ss_pred cccCCCC-chHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 390 ~~~~~~~-~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) +..+.+. ....+...+......+-.++++.+..+|..+ |..||+|+......-.......-.-|..+++++.++++ T Consensus 289 ~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~-- 366 (480) T protein:vir:78 289 KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-- 366 (480) T ss_pred eEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 2222222 2234555566666666667888888888654 44799999887766555555555666666666655443 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEecc-CchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP-SFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p-~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) .+.+.. ....+ +++.|.=.+ ..++ ..+..+.+.+|.++..... T Consensus 367 --~~~~~~---------~~~~~------------------------~~i~v~w~~~~~~s-~~~~ad~~~kl~~~g~~~~ 410 (480) T protein:vir:78 367 --QIMGRE---------VTEEY------------------------TRLETVWRDPSTPT-VAAKADAVSKLYANGQGPI 410 (480) T ss_pred --HHcCCC---------ccccc------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHHhcccCC Confidence 333211 00000 122222222 2222 3345666777666543222 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) +. ..++..+...+- -++++.+ +.++....... .+....+ +....++. ........+.+.+. T Consensus 411 s~--et~~~~lg~~~d-~~~e~~~-~~~~~~~~~~~---------~~~~~~~-~~~~~~~~-----~~~~~~~~~~~~~~ 471 (480) T protein:vir:78 411 PK--EQARIDLGYTAT-QREQMRD-WDKQETEDMID---------TLYSTTK-AQADATPK-----PTVTETKTETQTSP 471 (480) T ss_pred CH--HHHHhcCCCCHh-HHHHHHH-HHHHHHHHHHH---------Hhhcccc-CCCccccC-----CCCCCCCCccCCCc Confidence 21 122222222111 1111111 11100000000 0000000 00000000 00000000000000 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQID 635 (725) Q Consensus 627 ae~~k~q~e 635 (725) +..-++... T Consensus 472 ~~~~~~~~~ 480 (480) T protein:vir:78 472 SGFNRTKTR 480 (480) T ss_pred ccCCCcCCC Confidence 000000000 No 76 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.63 E-value=2.3e-13 Score=89.91 Aligned_cols=524 Identities=11% Similarity=-0.028 Sum_probs=233.5 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc---CCCCCHHHHHH---HhhcCCCcccchHHHHHHHHH----HHhh-C Q lcl|NC_013059. 6 NRLESILSRFDADWTASDEARREAKNDLFFSR---VSQWDDWLSQY---TTLQYRGQFDVVRPVVRKLVS----EMRQ-N 74 (725) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~---G~QW~~~~~~~---l~~~grp~~N~i~~~v~~v~g----~~~~-n 74 (725) .-..++..+|+........|-..+.+..+|.. +.-+.+..... .+...++.-+.-...++.+.+ .-.- + T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 23445555666666656666666666677763 33332211100 011122222333344444333 2222 5 Q ss_pred CcceEEecCCcc---hHHHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEe Q lcl|NC_013059. 75 PIDVLYRPKDGA---SPDAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) Q Consensus 75 r~~~~~~pr~~~---d~~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~ 148 (725) ++=+++.+.+.+ ..++.+.| +..+....+.+++..+...+|.+.++.|.|++-+.-+ .+ ....++.+..| T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d---~~-~~~~~r~~~~p 156 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEED---ED-EEGSVVFQSSP 156 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccC---CC-CCCceeEEEee Confidence 666666665432 12233333 3344444567889999999999999999998655322 11 12234344444 Q ss_pred eecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecc Q lcl|NC_013059. 149 IHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) Q Consensus 149 ~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~ 228 (725) +.++++..++.-. .--+||...|+..++.+.|+.......-.... . .+.......+.|+.+.+.+.. T Consensus 157 ----l~~~~v~~d~~G~----v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~-~----~~~~~~~~~~~v~~~v~~~~~ 223 (547) T protein:vir:10 157 ----IQDSYFEEDSRGQ----VVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKA-K----EASNQAALKQEVVMCVFTRYD 223 (547) T ss_pred ----cceEEEeeCCCcC----eeeeeeeeeccHHHHHHhcCcccCCHHHHHHH-h----cCCCcccceEEEEEEEeeccC Confidence 4458887775431 11267888999999999988633211110100 0 001111224556665555433 Q ss_pred eeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEE-EEEEEeeccccccCCCCCCCCccceEEEEeee Q lcl|NC_013059. 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR-VYKSIITCTAVLKDKQLIAGEHIPIVPVFGEW 307 (725) Q Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~ 307 (725) .... ......++.. .+... |||..-.+.+++.+ +-| ..+||+|+-.. T Consensus 224 ~~~~-------~~~~~~~~~~---------------------~~p~~s~~~e~~~~~~~l~e-sg~--~e~P~~~~Rw~- 271 (547) T protein:vir:10 224 KKQN-------RNAGTVLAPT---------------------ERPFGKKWILKEGAVQLGEE-GGY--YEMPAYAIRWR- 271 (547) T ss_pred CCCC-------ccccceeecc---------------------ccceeEEEEEecCceeeeec-CCc--ccCCeeeeeee- Confidence 2110 0000000110 11121 33322222345433 334 57999977544 Q ss_pred eccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccccc Q lcl|NC_013059. 308 GFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQ 387 (725) Q Consensus 308 ~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 387 (725) .++|..|+.|.+....+-.+.+|+.....+..+.++.+.++.++.+.+-.. .+..|+... . .++. . T Consensus 272 -~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~------~~~~pgg~~-~---~~~~---~ 337 (547) T protein:vir:10 272 -KSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD------IDLGASGLT-V---VRDM---E 337 (547) T ss_pred -ecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc------ceecCCeee-e---cCCc---c Confidence 458999999999999999999999999999999999999998876543221 122222211 1 1111 1 Q ss_pred CCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_013059. 388 PLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLAT-AMRRDGEIYQ 465 (725) Q Consensus 388 ~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~g~~ll 465 (725) .++++....--.....+++...+.|....= ...++. ++..+++.=|..+.+.....|...+..|.. ...=+-+..+ T Consensus 338 ~v~pl~~~~~~~~~~~~i~~~~~rI~~af~--~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 415 (547) T protein:vir:10 338 SMKPFESRARFDVSSIQLTDLRSAVRRIYY--VDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTF 415 (547) T ss_pred cceeeecccchHHHHHHHHHHHHHHHHHhh--hhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 223233222222334556666666665542 222233 233355666888888888888887777653 3322323333 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcc-- Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP-- 543 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~-- 543 (725) +++.+ .|.-.- +-+..+++ +| ..++|...... ...++...+..+.++++.+. T Consensus 416 ~il~r-------------~g~lP~--~p~~l~~~-~~---------~~~~v~~is~L-araq~~~~~~~i~~~~~~v~~l 469 (547) T protein:vir:10 416 NIRFR-------------AGKLGE--LPSKLLES-GK---------AAMDIVYTGPL-SRAQKIDQAASIERWAGSTAQL 469 (547) T ss_pred HHHHh-------------cCCCCC--Cchhhhcc-Cc---------ceEEEEeccHH-HHHHHHHHHHHHHHHHHHHHHh Confidence 33222 111000 00000000 00 12334322222 22234455555555554433 Q ss_pred -cccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 544 -QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAEL 622 (725) Q Consensus 544 -~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~ 622 (725) +..|.. ++..|++ +++..+......+.-.--.+++.++..+++++++|.++++.+..+.....+.+ T Consensus 470 aq~~P~v-------ld~id~d---~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~--- 536 (547) T protein:vir:10 470 AEINPEV-------LDIPDWD---EMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQ--- 536 (547) T ss_pred hccChhh-------hhcCCHH---HHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--- Confidence 333321 2233433 33332222221111111111111111111111111111111110000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 623 AKAQNQTLSLQIDAAKVEAQ 642 (725) Q Consensus 623 ~kaqae~~k~q~ea~~~q~q 642 (725) .. ..|..++-| T Consensus 537 ~~---------~~a~~~~~~ 547 (547) T protein:vir:10 537 GK---------GQAALKENQ 547 (547) T ss_pred cC---------cccchhccC Confidence 00 000000000 No 77 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.62 E-value=8.5e-15 Score=97.72 Aligned_cols=463 Identities=10% Similarity=0.012 Sum_probs=193.1 Q ss_pred CCc-----HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHH----HHHHHhhcCCCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MAD-----NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDW----LSQYTTLQYRGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad-----~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~----~~~~l~~~grp~~N~i~~~v~~v~g~~ 71 (725) +-+ ....+.+|...+..- .+. -.+-.+||.|+|.-.. ....++ .-+.+.|..+-+|+..+++. T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~---~~r----~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQ---NQN----LRSNTSYYEAERRPEAIGVTVPVQMQ-SLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHH---HHH----HHHHHHHHhccCchhhcCcccchhhh-hhhhccchHHHHHHHHhhhh Confidence 221 222333344333221 122 2223589999985321 111111 12346799999999988876 Q ss_pred hhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeec Q lcl|NC_013059. 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHS 151 (725) Q Consensus 72 ~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~ 151 (725) .-+ .-..+ ++.... ..++-++..|+++...+.++.++++.|.+|+-|..+.........+-..++.++ T Consensus 78 ~~~---g~~~~---~~~~~~----~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~-- 145 (485) T protein:vir:24 78 AVE---GFRLG---DADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVE-- 145 (485) T ss_pred ccC---ceecC---CCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEe-- Confidence 332 11122 222222 234556778999999999999999999999888655322211111111222222 Q ss_pred chhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 152 ACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 152 ~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++.+ ++||+...++ .++.+++-+. ....+..+++|.... T Consensus 146 ~p~~~~~i~D~~~~~~------~~~~~~~~~~-------------------------------~~~~~~~~~~y~~~~-- 186 (485) T protein:vir:24 146 PPTRMYAEIDPRIGRP------AKAIRVAYDA-------------------------------EGNEIQAATLYTPNE-- 186 (485) T ss_pred ccceeEEEeeCCcCce------eEEEEEEEee-------------------------------cCCeEEEEEEEcCCc-- Confidence 2332 4777765432 1222222100 011223333443311 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ++.+.. .+| ...+.+..|.+.+.+|+|||... T Consensus 187 --~~~~~~-~~~------------------------------------------~~~~~~~~~h~~g~vPvv~f~n~--- 218 (485) T protein:vir:24 187 --TFGWFR-AEG------------------------------------------EWVEWFSDPHGLGAVPVVPLPNR--- 218 (485) T ss_pred --EEEEEe-cCC------------------------------------------ceEeecccccCCCcccEEEeccC--- Confidence 111100 001 11112233445567788877432 Q ss_pred cCCccccchhh---hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhh-cchHHHHHH-hhccccccccccccccCccc Q lcl|NC_013059. 310 VEDKEVYEGVV---RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGFEHMYD-GNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 310 ~d~~~~~~G~v---r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~-~~~~~~~~~~~~~~~~~g~~ 384 (725) .....+||.- +.+++.++.+|+.+|.+..++...+.....+ .|. .+.+...-. ....+...........++ T Consensus 219 -~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-- 294 (485) T protein:vir:24 219 -TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLI-FGIKPEEIGVDPETGQTLFDAYLARILAFEDA-- 294 (485) T ss_pred -cccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh-ccCCccccccccccccchhhhcccceeccCCC-- Confidence 2223356654 3688999999999999877766554433222 111 010100000 000000000000001111 Q ss_pred cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ 463 (725) ...+...+... ...+...+......+-.++++++..+|..+ |..||+|+......-.......-..|..+++++.++ T Consensus 295 -~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l 372 (485) T protein:vir:24 295 -EGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRL 372 (485) T ss_pred -CceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122222211 223444444444444455788888888654 557999999887777777777777777888777777 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcc Q lcl|NC_013059. 464 YQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) Q Consensus 464 ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~ 543 (725) ++.+.. . .+. .. |+ .++.|.=.+..+....+..+.+++|.+... T Consensus 373 ~~~~~~----~------~~~--~~---------------------d~---~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~ 416 (485) T protein:vir:24 373 AYRLMK----G------GDV--PP---------------------DM---LRMETVWRDPSTPTYAAKADAATKLYGNGQ 416 (485) T ss_pred HHHHhc----C------CCC--cc---------------------cc---ceeeEEecCCCCCCHHHHHHHHHHHHhccc Confidence 665321 0 000 00 01 123333332222223445556666655432 Q ss_pred cccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELA 623 (725) Q Consensus 544 ~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~ 623 (725) ...|. -.++..+...+ +-++++ +++.++........ .............+ ......+ ...++. T Consensus 417 ~~~s~--et~~~~l~~~~-d~~~e~-~~~~ee~~~~~~~~------~~~~~~~~~~~~~~--~~~~e~~---~~~~~~-- 479 (485) T protein:vir:24 417 GVIPR--ERARKDMGYSI-AEREEM-RRWDEEEAAMGLGL------LGTMVDADPTVPGS--PNPTPAP---KPQPAI-- 479 (485) T ss_pred ccCCH--HHHHhhCCCCH-hHHHHH-HHHHHHHhhhhhhH------HHhhcccCCCCCCC--CCCCCCC---CCccCC-- Confidence 22221 11122222211 111111 11111100000000 00000000000000 0000000 000000 Q ss_pred HHHHHHH Q lcl|NC_013059. 624 KAQNQTL 630 (725) Q Consensus 624 kaqae~~ 630 (725) .-.+.+ T Consensus 480 -~~~~~a 485 (485) T protein:vir:24 480 -EGGDSA 485 (485) T ss_pred -CCCCCC Confidence 000000 No 78 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.62 E-value=1.7e-14 Score=96.07 Aligned_cols=466 Identities=11% Similarity=-0.006 Sum_probs=196.2 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q----W~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |.=..+.+..|...+.. .+....+-.+||+|.| ++......++ .-+.+.|..+-+|+..+++.. + T Consensus 1 ~~t~~~~i~~L~~~~~~-------~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l~---~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLAR-------DLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLD---I 69 (480) T ss_pred CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccccccccccchhHh-hhhhhcchHHHHHHHHHhhhc---c Confidence 99887788877776543 2334455579999976 2111111111 113467999999999988763 2 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeec-cCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE-DQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~-~~~~~~~~~~ir~~~~~~~~~~ 155 (725) +--+.| +|.+.. ..+..+++.|+++..+..++.++++.|.||.-|...-. ..|.-+ ...++..+- .... T Consensus 70 ~g~~~~---~d~~~~----~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g-~~~i~~~~p--~~~~ 139 (480) T protein:vir:78 70 EGFRIS---EDSEGL----EELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAG-IPLIRVESP--LYMY 139 (480) T ss_pred CceecC---CCchhH----HHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCC-eeEEEEEcc--cceE Confidence 222222 333333 34456778899999999999999999999876643210 111122 222332211 1113 Q ss_pred eeeCCCccc-cChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 156 VIWDSNSKL-MDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 156 v~~Dp~a~~-~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) ++|||.... +- ++++.+.+. .+...+...++|..... ..+. T Consensus 140 ~~~D~~~~~~~~------~~i~~~~~~------------------------------~~~~~~~~~~~y~~~~~--~~~~ 181 (480) T protein:vir:78 140 AELDPRNTRRVT------RAVRLYTTR------------------------------DDVAVPDRATLYLPDET--VPLR 181 (480) T ss_pred EEEcCCCccceE------EEEEEEEee------------------------------cCCCceEEEEEEeCCeE--EEEE Confidence 667876432 11 112212110 01112223344433111 0011 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) ... |... +.....++.|.+.+.+|+|||... .... T Consensus 182 ~~~---~~~~--------------------------------------~~~~~~~~~~~~~g~vPvv~f~n~----~~~~ 216 (480) T protein:vir:78 182 RNG---GLND--------------------------------------QWVVDGDVIKHGLGVVPVVPLTND----PRLG 216 (480) T ss_pred ecC---CCcc--------------------------------------ccccccccccCCCCCcceEEeecc----cccC Confidence 100 0000 000001222334456777776432 2223 Q ss_pred ccch--hh-hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccCCc Q lcl|NC_013059. 315 VYEG--VV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLA 390 (725) Q Consensus 315 ~~~G--~v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 390 (725) .+|| -+ +.+++.++.+|+.+|.+...+...+.....+ .|.. +.+... .....+.... ..+....| ..++ T Consensus 217 ~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~-~~~~~~~~~~-~~~~~~~~----~~~~ 289 (480) T protein:vir:78 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGVTTDELTND-GENTTLDIYY-GRILTLAS----EAAK 289 (480) T ss_pred CccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hcCCccccccc-cccchhhhhh-hhhccCCC----CCce Confidence 3454 34 4589999999999999887766544433222 1211 111000 0000000000 11111111 1122 Q ss_pred ccCCCC-chHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 391 YYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 391 ~~~~~~-~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) ..+.+. -...+...+......+-.++|+++..+|..+ |..||+|+......-.......-.-|..+++++.++++ T Consensus 290 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~--- 366 (480) T protein:vir:78 290 ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM--- 366 (480) T ss_pred EEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 222222 2234555566666666667889999998655 45799999877655555555555555666666555443 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEecc-CchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP-SFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p-~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) .+.+. .+...+ +++.|.=.+ ..++ ..+..+.+.++.++.....+ T Consensus 367 -~~~g~---------~~~~~~------------------------~~i~v~f~~~~~~s-~~~~ad~~~kl~~~g~~~~s 411 (480) T protein:vir:78 367 -QIMGR---------EVTEEY------------------------TRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIP 411 (480) T ss_pred -HHcCC---------Cccccc------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHHhccccCC Confidence 33321 111111 122222222 2233 33556667776665433222 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHH-HhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAK-QGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~q-q~q~q~~~~~~qa~~~k~qae~~kaq 626 (725) . ..++..+...+- .++++.+ ..++...... ..+....... ........ .+ ...+.+.+- T Consensus 412 ~--et~~~~lg~~~d-~~~~~~~-~~~e~~~~~~---------~~~~~~~~~~~~~~~~~~~----~~---~~~~~~~~~ 471 (480) T protein:vir:78 412 K--EQARIDLGYTAT-QREQMRD-WDKQETEDMI---------DTLYSTTKAQADATPKPTV----TE---TKTETQTSP 471 (480) T ss_pred H--HHHHhcCCCCHh-HHHHHHH-HHHHHHHHHH---------HHhhccccccCCCCCCCCC----CC---CCCcccccc Confidence 2 122222221111 1111111 1000000000 0000000000 00000000 00 000000000 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQID 635 (725) Q Consensus 627 ae~~k~q~e 635 (725) ...-+++.. T Consensus 472 ~~~~~~~~~ 480 (480) T protein:vir:78 472 SGFNRTKTR 480 (480) T ss_pred CCCCcccCC Confidence 000000000 No 79 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.60 E-value=1.5e-14 Score=96.32 Aligned_cols=462 Identities=10% Similarity=0.003 Sum_probs=188.4 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHH----HHhhcCCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQ----YTTLQYRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~----~l~~~grp~~N~i~~~v~~v~g~~~~nr 75 (725) =.|. ...++.+...+.. -+..-.+-.+||+|+++-..... .++ .-+.+.|..+-+|+.++++..-+ T Consensus 10 ~~~~~~~~~~~l~~~~~~-------~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l~~~- 80 (485) T protein:vir:10 10 EIEDPAIARDEMVSAFED-------STQNLKTNTSYYEAERRPEAIGVTVPIQMQ-SLLAHVGYPRLYVDSIAERQAVE- 80 (485) T ss_pred CCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCCCCChhhh-hhhhhcCcHHHHHHHHHhhhccc- Confidence 0222 2333334333322 22234555799999997432111 111 11234699999999888765321 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) -|+. +++.+..+ .++.++..|+++.....+..++++.|.||+-|+.+-........+-..++..+ ++.+ T Consensus 81 ---g~~~--~~~~~~~~----~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~--~p~~ 149 (485) T protein:vir:10 81 ---GFRF--GDADEADE----ELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVE--PPTR 149 (485) T ss_pred ---ceec--CCCchhHH----HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEE--ccce Confidence 1222 23333333 34456788999999999999999999999887654221111111111122211 2332 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++|||...++ .++.+++-. . +.+.+...++|..... + T Consensus 150 ~~~~~D~~~~~~------~~~~~~~~~------------------------------~-~~~~~~~~~~y~~~~~----~ 188 (485) T protein:vir:10 150 MYAEIDPRIGRV------SKAIRVAYD------------------------------A-EGNEIQAATLYTPNDI----F 188 (485) T ss_pred eEEEEcCCCCce------eEEEEEEEe------------------------------e-CCCeEEEEEEEeCCeE----E Confidence 4677754321 112221100 0 0122333344433211 1 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) .+.. ..|...+.+..|.+.+.+|+|||... ... T Consensus 189 ~~~~-------------------------------------------~~~~~~~~~~~~~~~g~vPvv~~~n~----~~~ 221 (485) T protein:vir:10 189 GWYR-------------------------------------------VENEWQEWFNNPHGLGVVPVVPIPNR----TRL 221 (485) T ss_pred EEEE-------------------------------------------cCCceEEeccccCCCCcccEEEeccc----ccc Confidence 1110 00011111234455567788777533 222 Q ss_pred cccchhh---hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhh-cchHHHHHH-hhccccccccccccccCccccccC Q lcl|NC_013059. 314 EVYEGVV---RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGFEHMYD-GNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 314 ~~~~G~v---r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+||.- +.+++.++.+|+.+|.+..+....+.....+ .|. .+.+..... ........... +...++. ... T Consensus 222 ~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~-i~~~~~~--d~k 297 (485) T protein:vir:10 222 SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLI-FGIKPEEIGVDPETGQTLFDAYLAR-ILAFEDA--EGK 297 (485) T ss_pred CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHH-hcCCcccccccccccchhhhhcccc-eeccCCC--Cce Confidence 3356654 3688999999999999877665554433221 111 011100000 00000000000 0111110 112 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) +..++... ...+...+......+-.++++++..+|..+ |..||+|+......-.......-..|..+++++.++++.+ T Consensus 298 ~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~ 376 (485) T protein:vir:10 298 IQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRM 376 (485) T ss_pred EEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222211 233444455445555555777888887654 5579999998776666666666666666666666655443 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p 547 (725) .... +. .. |+ +++.|.=.+..+.-..+..+.+..|.+.-....+ T Consensus 377 ----~~~~------~~--~~---------------------~~---~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s 420 (485) T protein:vir:10 377 ----MKGG------DV--PP---------------------DM---LRMETVWRDPSTPTYAAKADAASKLYNGGTGVIP 420 (485) T ss_pred ----hCCC------CC--cc---------------------cc---eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC Confidence 2210 00 00 01 2333333333322233445555665543212222 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK-AQ 626 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k-aq 626 (725) . ..++..+...+ +.++++ +++..+ +..+.......+.... ...-.+.+... .. T Consensus 421 ~--et~~~~lg~~~-~~~~~~-~~~~ee---------------------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 474 (485) T protein:vir:10 421 R--ERARKDMGYSI-AEREEM-RRWDEE---------------------EAAMGLGLIGTMVDPN-PTVPGSPSPAPAPK 474 (485) T ss_pred H--HHHHHhCCCCH-hHHHHH-HHHHHH---------------------HHHHHHHHHHHhhccC-CCCCCCCCcccccc Confidence 1 11111111111 111111 111000 0000000000000000 00000000000 00 Q ss_pred HHHHHHHHHHH Q lcl|NC_013059. 627 NQTLSLQIDAA 637 (725) Q Consensus 627 ae~~k~q~ea~ 637 (725) .-.....-..+ T Consensus 475 ~~~~~~~~~~~ 485 (485) T protein:vir:10 475 PAALESGGDAA 485 (485) T ss_pred CcCCCCCCCCC Confidence 00000000000 No 80 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.59 E-value=4.7e-14 Score=93.64 Aligned_cols=440 Identities=9% Similarity=0.031 Sum_probs=201.3 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC---CCH--HHHHHHhhcC----CCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ---WDD--WLSQYTTLQY----RGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q---W~~--~~~~~l~~~g----rp~~N~i~~~v~~v~g~~ 71 (725) +-.+..+..+++..+.... ..-+....+..+||.|+| ... .......... |.++|..+.+|+..+|+. T Consensus 21 ~~~~~~~~~~~i~~~i~~~---~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl 97 (474) T protein:vir:96 21 MKPKVETQEEMIIRLINNH---KQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYV 97 (474) T ss_pred ccccccchHHHHHHHHHHH---HHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhh Confidence 2222234444444443332 233455677889999986 110 1111111112 336799999999999999 Q ss_pred hhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeec Q lcl|NC_013059. 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHS 151 (725) Q Consensus 72 ~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~ 151 (725) -.+.+.+.+ ++.+..+.++. +.+ |+++.....+.++++++|.||.-+..+ ++ | .+.+++. T Consensus 98 ~g~p~~~~~-----~~~~~~~~l~~----~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d---~~--~-~~~i~~~---- 157 (474) T protein:vir:96 98 AGKPVTYAH-----DDDKVLDVIHQ----VLD-TRWDNKLIDILTAASNKGIDWLQVYIN---ED--G-ELKLFRV---- 157 (474) T ss_pred cccCceecc-----CChHHHHHHHH----HHh-ccHHHHHHHHHHHHhhCCeEEEEeeeC---CC--C-ceEEEEE---- Confidence 988876643 23333344433 333 789999999999999999999877543 22 2 3333322 Q ss_pred chhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 152 ACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 152 ~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++.+ ++||+.... + -..+++.|... ....+++|....+ T Consensus 158 ~p~~~~~v~d~~~~~----~-~~a~ir~~~~~----------------------------------~~~~~~vy~~~~i- 197 (474) T protein:vir:96 158 PAEQAIPIWTDKERE----Q-LNAFIRIFTFN----------------------------------GETKVEYWTAETV- 197 (474) T ss_pred cccceEEEEcCCCCC----c-eEEEEEEEeec----------------------------------CeeEEEEEeCCeE- Confidence 2333 334533221 1 12333333210 0011234433211 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ..+.+.+ |..... ..........+..+.+.+.+|+|+|+.. T Consensus 198 -~~~~~~~---~~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:96 198 -TYYVYEN---GGLIPD--------------------------------FYYGDEHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred -EEEEEcC---Cceeec--------------------------------cccccccccCcccccCCCccceEEecCC--- Confidence 1111111 110000 0000111112233444456666665321 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+.|.+..+++.++.+|...|.+...+...+...+++ .|.. +.......... ... .+...++ .. T Consensus 239 ----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~--~~~---~i~~~~~----~~ 304 (474) T protein:vir:96 239 ----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLK--YYK---AINVSSD----GG 304 (474) T ss_pred ----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhh--ccc---eeeccCC----Cc Confidence 23568888999999999999999998887776654432 3321 11111111100 000 1111111 12 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) +.++..+.-..+....++.....|-..|++.+.+.+..++.+||+|+..+...........-..|..+++++.++++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~--- 381 (474) T protein:vir:96 305 VETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFIL--- 381 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 34444444446667788889999999998876665554556899999887666665555555666666666555544 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) .++.. +.. . .+|.|.-.+..+.--.+..+.+.+ .+ ..+. T Consensus 382 -~~~g~---------~~d-----------------------~---~~i~i~f~~~~p~~~~e~a~~~~~----~g-iiS~ 420 (474) T protein:vir:96 382 -DFNKI---------KLD-----------------------A---KEIEITFNFNVMVNDLEQSQIGAQ----SQ-YLSK 420 (474) T ss_pred -HHhCC---------Ccc-----------------------c---ceeeEEecCCCccCHHHHHHHHHH----cC-CCCh Confidence 44321 100 0 123333344444422233332222 11 2221 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLL--QGQAELAKAQ 626 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~--k~qae~~kaq 626 (725) ..++..++..+ ..+..++++.++..... .+.+. .......-. ..+.+..+.+ T Consensus 421 --et~~~~lp~v~--D~~~E~eri~~E~~~~~---------------------~~~~~-~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 421 --ETLVRHHPWVD--DPKAELERLDEEQLELN---------------------KQLPN-LDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred --HHHHHhCCCCC--CHHHHHHHHHHHHHHHH---------------------hhccc-cccccCCCCCCcCCCCccccC Confidence 11222222221 12222333322110000 00000 000000000 0000000000 No 81 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.59 E-value=4.7e-14 Score=93.64 Aligned_cols=440 Identities=9% Similarity=0.031 Sum_probs=201.3 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC---CCH--HHHHHHhhcC----CCcccchHHHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ---WDD--WLSQYTTLQY----RGQFDVVRPVVRKLVSEM 71 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q---W~~--~~~~~l~~~g----rp~~N~i~~~v~~v~g~~ 71 (725) +-.+..+..+++..+.... ..-+....+..+||.|+| ... .......... |.++|..+.+|+..+|+. T Consensus 21 ~~~~~~~~~~~i~~~i~~~---~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl 97 (474) T protein:vir:95 21 MKPKVETQEEMIIRLINNH---KQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYV 97 (474) T ss_pred ccccccchHHHHHHHHHHH---HHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhh Confidence 2222234444444443332 233455677889999986 110 1111111112 336799999999999999 Q ss_pred hhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeec Q lcl|NC_013059. 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHS 151 (725) Q Consensus 72 ~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~ 151 (725) -.+.+.+.+ ++.+..+.++. +.+ |+++.....+.++++++|.||.-+..+ ++ | .+.+++. T Consensus 98 ~g~p~~~~~-----~~~~~~~~l~~----~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d---~~--~-~~~i~~~---- 157 (474) T protein:vir:95 98 AGKPVTYAH-----DDDKVLDVIHQ----VLD-TRWDNKLIDILTAASNKGIDWLQVYIN---ED--G-ELKLFRV---- 157 (474) T ss_pred cccCceecc-----CChHHHHHHHH----HHh-ccHHHHHHHHHHHHhhCCeEEEEeeeC---CC--C-ceEEEEE---- Confidence 988876643 23333344433 333 789999999999999999999877543 22 2 3333322 Q ss_pred chhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecce Q lcl|NC_013059. 152 ACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) Q Consensus 152 ~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~ 229 (725) ++.+ ++||+.... + -..+++.|... ....+++|....+ T Consensus 158 ~p~~~~~v~d~~~~~----~-~~a~ir~~~~~----------------------------------~~~~~~vy~~~~i- 197 (474) T protein:vir:95 158 PAEQAIPIWTDKERE----Q-LNAFIRIFTFN----------------------------------GETKVEYWTAETV- 197 (474) T ss_pred cccceEEEEcCCCCC----c-eEEEEEEEeec----------------------------------CeeEEEEEeCCeE- Confidence 2333 334533221 1 12333333210 0011234433211 Q ss_pred eEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) ..+.+.+ |..... ..........+..+.+.+.+|+|+|+.. T Consensus 198 -~~~~~~~---~~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~vPvv~~~nn--- 238 (474) T protein:vir:95 198 -TYYVYEN---GGLIPD--------------------------------FYYGDEHIQTHFSTGSWERVPFIAFKNN--- 238 (474) T ss_pred -EEEEEcC---Cceeec--------------------------------cccccccccCcccccCCCccceEEecCC--- Confidence 1111111 110000 0000111112233444456666665321 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) ..+.|.+..+++.++.+|...|.+...+...+...+++ .|.. +.......... ... .+...++ .. T Consensus 239 ----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~--~~~---~i~~~~~----~~ 304 (474) T protein:vir:95 239 ----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLK--YYK---AINVSSD----GG 304 (474) T ss_pred ----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhh--ccc---eeeccCC----Cc Confidence 23568888999999999999999998887776654432 3321 11111111100 000 1111111 12 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) +.++..+.-..+....++.....|-..|++.+.+.+..++.+||+|+..+...........-..|..+++++.++++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~--- 381 (474) T protein:vir:95 305 VETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFIL--- 381 (474) T ss_pred eeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 34444444446667788889999999998876665554556899999887666665555555666666666555544 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccch Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~ 548 (725) .++.. +.. . .+|.|.-.+..+.--.+..+.+.+ .+ ..+. T Consensus 382 -~~~g~---------~~d-----------------------~---~~i~i~f~~~~p~~~~e~a~~~~~----~g-iiS~ 420 (474) T protein:vir:95 382 -DFNKI---------KLD-----------------------A---KEIEITFNFNVMVNDLEQSQIGAQ----SQ-YLSK 420 (474) T ss_pred -HHhCC---------Ccc-----------------------c---ceeeEEecCCCccCHHHHHHHHHH----cC-CCCh Confidence 44321 100 0 123333344444422233332222 11 2221 Q ss_pred HHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_013059. 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLL--QGQAELAKAQ 626 (725) Q Consensus 549 ~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~--k~qae~~kaq 626 (725) ..++..++..+ ..+..++++.++..... .+.+. .......-. ..+.+..+.+ T Consensus 421 --et~~~~lp~v~--D~~~E~eri~~E~~~~~---------------------~~~~~-~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 421 --ETLVRHHPWVD--DPKAELERLDEEQLELN---------------------KQLPN-LDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred --HHHHHhCCCCC--CHHHHHHHHHHHHHHHH---------------------hhccc-cccccCCCCCCcCCCCccccC Confidence 11222222221 12222333322110000 00000 000000000 0000000000 No 82 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.59 E-value=1.7e-14 Score=96.11 Aligned_cols=464 Identities=11% Similarity=0.011 Sum_probs=188.8 Q ss_pred CCcH---HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHhhcCCCcccchHHHHHHHHHHHhh Q lcl|NC_013059. 1 MADN---KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 mad~---~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q----W~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~ 73 (725) |.|. ..++..+...+.. .. ..-.+-.+||+|++ ++......++ .-+.+.|..+-+|+.+++...- T Consensus 8 ~~e~~~~~~~~~~l~~~~~~----~~---~r~~~l~~YY~G~~~i~~~~~~~~~~~~-~~~~v~n~~~~iVd~~~~~l~~ 79 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAFED----AS---KDLASNTSYYDAERRPEAIGVTVPREMQ-QLLAHVGYPRLYVDSVAERQAV 79 (486) T ss_pred CCCcccHHHHHHHHHHHHHH----HH---HHHHHHHHHhcccCcchhcccccchhHh-hhhhccchHHHHHHHHHhhhcc Confidence 5432 3345555444433 21 12223357999987 2111111111 1134569999999988876522 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecch Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~ 153 (725) + - |+. +++.... ..+.-+++.|+++...+.+..++++.|.+|.-|..+.......+.+-.+++.++ ++ T Consensus 80 ~--g--~~~--~~~~~~~----~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~--~p 147 (486) T protein:vir:42 80 E--G--FRL--GDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE--PP 147 (486) T ss_pred c--c--eec--CCCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe--cc Confidence 1 1 221 1222222 224556778999999999999999999999877654211111111111222221 22 Q ss_pred h--heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeE Q lcl|NC_013059. 154 S--HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKET 231 (725) Q Consensus 154 ~--~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~ 231 (725) . .++|||...+ -.++++.|-+. +.+.++..++|.... T Consensus 148 ~~~~~i~d~~~~~------~~~~~~~~~~~-------------------------------~~~~~~~~~~y~~~~---- 186 (486) T protein:vir:42 148 TRMHAEIDPRINR------VSKAIRVAYDK-------------------------------EGNEIQAATLYTPME---- 186 (486) T ss_pred cceEEEEeCCCCC------eEEEEEEEEec-------------------------------CCCeEEEEEEEcCCc---- Confidence 2 3567875432 11233222110 012334444554321 Q ss_pred EEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccC Q lcl|NC_013059. 232 AFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVE 311 (725) Q Consensus 232 ~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d 311 (725) ++.+.. .+| ...+.+..|.+.+.+|+|||... . T Consensus 187 ~~~~~~-~~~------------------------------------------~~~~~~~~~h~~g~vPvv~~~n~----~ 219 (486) T protein:vir:42 187 TIGWFR-ADG------------------------------------------EWAEWFNVPHGLGVVPVVPLPNR----T 219 (486) T ss_pred EEEEEe-cCC------------------------------------------cEEeecceecCCCCceEEEeccc----c Confidence 111100 001 00112233444456777776432 2 Q ss_pred Cccccch--hh-hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHH-hhccccccccccccccCccccc Q lcl|NC_013059. 312 DKEVYEG--VV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYD-GNDDYPYYLLNRTDENNGEMPT 386 (725) Q Consensus 312 ~~~~~~G--~v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~ 386 (725) ....+|| -+ +.+++.++.+|+.+|.+.......+.....+ .|.. +.+...-. ....+...........++ . T Consensus 220 ~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 295 (486) T protein:vir:42 220 RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLI-FGIKPEEIGVDSETGQTLFDAYLARILAFEDA---E 295 (486) T ss_pred ccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHh-hcCCccccccccccccchhhhhhchhcccCCC---C Confidence 2222454 34 3688999999999998876654443322211 1110 00000000 000000000000111111 1 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..+..++.. -...+...+......+-.++++++..+|..+ |..||+|+......-.......-..|..+++++.++++ T Consensus 296 ~~~~q~~~~-~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~ 374 (486) T protein:vir:42 296 GKIQQFSAA-ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAY 374 (486) T ss_pred ceEEeeccc-CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112222211 1233444444444445555788888888654 55799999987776666666666777777777766655 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQG 545 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~ 545 (725) .+ .... .... |+ +++.|.=.+..+.-..+..+.+..|.+..... T Consensus 375 ~~----~~~~--------~~~~---------------------d~---~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~ 418 (486) T protein:vir:42 375 RI----MKGG--------DVPP---------------------DM---LRMETVWRDPSTPTYAAKADAATKLYGNGQGV 418 (486) T ss_pred HH----hcCC--------Cccc---------------------cc---eeeeEEecCCCCCCHHHHHHHHHHHHhcccCC Confidence 43 2110 0000 01 23333333333322445556666666543322 Q ss_pred cchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 546 TPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) Q Consensus 546 ~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~ka 625 (725) .+. ..++..+...+- -++++ +++.++.... .......+..+ ........+ T Consensus 419 ~s~--et~~~~lg~~~d-~~~e~-~~~~~e~~~~---------------------~~~~~~~~~~~-----~~~~~~~~~ 468 (486) T protein:vir:42 419 IPR--ERARIDMGYSVK-EREEM-RRWDEEEAAM---------------------GLGLLGTMVDA-----DPTVPGSPS 468 (486) T ss_pred CCH--HHHHhcCCCChh-HHHHH-HHHHHHHHHH---------------------HHHHHHHhhcC-----CCCCCCCCC Confidence 231 111111211111 11111 1111100000 00000000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 626 QNQTLSLQIDAAKVEAQN 643 (725) Q Consensus 626 qae~~k~q~ea~~~q~q~ 643 (725) .++....+-.+..++... T Consensus 469 ~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 469 PTAPPKPQPAIESSGGDA 486 (486) T ss_pred CCCCCCCCcccCCCCCCC Confidence 000000000000000000 No 83 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.59 E-value=1e-14 Score=97.25 Aligned_cols=463 Identities=10% Similarity=0.030 Sum_probs=185.8 Q ss_pred CCc---------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHH----HHHHhhcCCCcccchHHHHHHH Q lcl|NC_013059. 1 MAD---------NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWL----SQYTTLQYRGQFDVVRPVVRKL 67 (725) Q Consensus 1 mad---------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~----~~~l~~~grp~~N~i~~~v~~v 67 (725) |+. ..++++.+...+..- +..-.+-.+||.|.|.-... ...++. -+.+.|..+-+|+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~-------~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~-~~~~~n~~~~ivd~~ 72 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTER-------TQDLGDNTAYYESERRPDAVGVTVPQQMQK-LLAHVGYPRLYIDAI 72 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHH-------HHHHHHHHHHHhccccchhcccccchhHHh-hhhhcCcHHHHHHHH Confidence 552 233444444444321 12223447899998863211 111111 123569999999988 Q ss_pred HHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEE Q lcl|NC_013059. 68 VSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRRE 147 (725) Q Consensus 68 ~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~ 147 (725) ++...-+- |+. +++.+. ...+.-+++.|+++.....++.++++.|.||+-|+.+-........+...++. T Consensus 73 ~~~l~~~g----~~~--~~~~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~ 142 (484) T protein:vir:77 73 AARQELEG----FRL--GGADKA----DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIR 142 (484) T ss_pred HhhhccCc----eec--CCcchh----HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEE Confidence 87553221 111 222222 23355677889999999999999999999998886542221111112222222 Q ss_pred eeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 148 PIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 148 ~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) ++ ++.. ++||+..+++ .++++.+.+. ....+..+++|+. T Consensus 143 ~~--~p~~~~~~~D~~~~~~------~~a~~~~~~~-------------------------------~~~~~~~~~~y~~ 183 (484) T protein:vir:77 143 VE--PPTNLYAQIDPRTRQV------MRAIRAIEDE-------------------------------EGNEVIGATLYLP 183 (484) T ss_pred Ee--ccceeEEEecCCCCce------EEEEEEEEee-------------------------------cCCcEEEEEEEec Confidence 21 2332 4577654321 1222222110 0011222233322 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) .. .+....+ +| ...+.+..|.+.+.+|+|||.. T Consensus 184 ~~---~~~~~~~--~~------------------------------------------~~~~~~~~~~~~g~vPvv~f~N 216 (484) T protein:vir:77 184 NN---TVIWNRE--DG------------------------------------------QWVQVANVAHNLEMVPVIPIPN 216 (484) T ss_pred Ce---EEEEEec--CC------------------------------------------ceEeeccccCCCCCcceEEecc Confidence 10 0000000 01 1111122344456678887753 Q ss_pred eeeccCCccccchh--h-hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHH-hhcccccccccccccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGV--V-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYD-GNDDYPYYLLNRTDEN 380 (725) Q Consensus 306 ~~~~~d~~~~~~G~--v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~-~~~~~~~~~~~~~~~~ 380 (725) . .....++|. | +.+++.++.+|+.+|.+.......+.....+ .|.. +.+...-. ....+..... .+... T Consensus 217 ~----~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 290 (484) T protein:vir:77 217 R----TRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLL-FGVKGEELGVDPETGQTLFDAYLA-RILAF 290 (484) T ss_pred c----cccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHH-hCCCcchhcccccccchhhhhhhh-hhccc Confidence 2 222335553 3 4688999999999999877665444332221 1111 11100000 0000000000 01111 Q ss_pred CccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 381 NGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) Q Consensus 381 ~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~ 459 (725) .+. ...+..++... ...+...+......+-.++++.+..+|..+ |..||.|+......-.......-.-|..+.++ T Consensus 291 ~~~--~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~ 367 (484) T protein:vir:77 291 EDH--ESKAQQFSAAE-LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQ 367 (484) T ss_pred CCC--CceeEeecCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 11122222222 133444455555555555788888888654 55799999876655444444444555555555 Q ss_pred HHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHH Q lcl|NC_013059. 460 DGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELL 539 (725) Q Consensus 460 ~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell 539 (725) +.++++. +... .+... | -+++.|.=.+..+....+..+.+.+|. T Consensus 368 ~~~l~~~----~~~~--------~~~~~---------------------~---~~~i~v~w~~~~~~s~~~~ad~~~kl~ 411 (484) T protein:vir:77 368 AMRVAYK----VMNG--------GDIPP---------------------E---YYRMESIWRDPSTPTYAAKADAATKLY 411 (484) T ss_pred HHHHHHH----HhCC--------CCccc---------------------c---cccceEEecCCCCCCHHHHHHHHHHHH Confidence 5554443 3211 00000 0 123333333333222445566666665 Q ss_pred HhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_013059. 540 GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQ 619 (725) Q Consensus 540 ~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~q 619 (725) +......+. ..++..+...+-+ ++++ +++..+..... ++..... ...... + T Consensus 412 ~~g~gi~s~--et~~~~l~~~~~~-~~e~-~~~~~ee~~~~---------~~~~~~~------------~~~~~~----~ 462 (484) T protein:vir:77 412 NNGQGVIPK--ERARIDMGYSITE-REEM-RKWDEEEQAQG---------LGLMGTM------------FGTDPS----G 462 (484) T ss_pred hccCCCCCH--HHHHhcCCCChhH-HHHH-HHHHHHHHHHH---------HHHHhhh------------cccccc----C Confidence 442222221 1111122211111 1111 11111000000 0000000 000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 620 AELAKAQNQTLSLQIDAAKVEAQNQ 644 (725) Q Consensus 620 ae~~kaqae~~k~q~ea~~~q~q~q 644 (725) ....... +....+..+....+ . T Consensus 463 ~~~~~~~-~~~~~~~~~~~~~~--~ 484 (484) T protein:vir:77 463 GGNPDNP-ETPEPQPNPAEEAA--A 484 (484) T ss_pred CCCCCCC-CcccccCCCccccC--C Confidence 0000000 00000000000000 0 No 84 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.58 E-value=9.5e-14 Score=91.96 Aligned_cols=443 Identities=8% Similarity=0.008 Sum_probs=199.4 Q ss_pred CC--------cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHH-----HhhcCC----CcccchHHH Q lcl|NC_013059. 1 MA--------DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQY-----TTLQYR----GQFDVVRPV 63 (725) Q Consensus 1 ma--------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~-----l~~~gr----p~~N~i~~~ 63 (725) |+ +..+...+++..+.. ....-+....+-.+||.|+|.--.-..+ ....++ .++|..+.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~---~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~I 89 (474) T protein:vir:94 13 YGEEVVEQLKPQFETQEEMIVRLID---DHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNL 89 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHH---HHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHH Confidence 32 222222333333322 2223345566778999998732110000 112222 357999999 Q ss_pred HHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCcee Q lcl|NC_013059. 64 VRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) Q Consensus 64 v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ 143 (725) |+..+|+.-.+.+.+.+ +|.+..+ +++.+.+ |+++.....+.++++++|.||.-+..+ ++ + .+. T Consensus 90 vd~~~~~l~g~p~~~~~-----~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d---~~--~-~~~ 153 (474) T protein:vir:94 90 VDQKVSYVASKPVTYSC-----EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYIN---EN--G-EMK 153 (474) T ss_pred HHHHHhhhhcCCceecc-----CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEec---CC--C-eeE Confidence 99999999998876643 3333333 4444444 789999999999999999999877543 22 1 233 Q ss_pred EEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEE Q lcl|NC_013059. 144 IRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAE 221 (725) Q Consensus 144 ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E 221 (725) +.+. ++.. ++||+.... + ...+++.|-.. ....++ T Consensus 154 i~~~----~p~~~~~v~d~~~~~----~-~~~~ir~~~~~----------------------------------~~~~~~ 190 (474) T protein:vir:94 154 LFRV----PAEQAIPIWVDKERE----E-LKSFIRYYKFN----------------------------------NEEKVE 190 (474) T ss_pred EEEE----cccceEEEEcCCCCC----c-eEEEEEEEEec----------------------------------CeEEEE Confidence 3321 2332 345543211 1 11223322110 001233 Q ss_pred EEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceE Q lcl|NC_013059. 222 FYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIV 301 (725) Q Consensus 222 ~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~v 301 (725) +|....+ ..+...+ |...... ......+.....+.+.+.+|+| T Consensus 191 ~yt~~~~--~~y~~~~---~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~vPvv 233 (474) T protein:vir:94 191 FWTDTTV--TYYVLEN---GGLIPDY--------------------------------YYGANHVQSHFSNGNWGRVPFI 233 (474) T ss_pred EEeCCeE--EEEEEcC---Ccccccc--------------------------------ccCcCcccccccccCCCccceE Confidence 4433211 1122111 1110000 0001111122233444556666 Q ss_pred EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccC Q lcl|NC_013059. 302 PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENN 381 (725) Q Consensus 302 P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (725) +|... ..+.|.+..+++.++.+|+..|.+...+...+...+++.-...+...+....... + ..+...+ T Consensus 234 ~~~nn-------~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~--~---~~i~~~~ 301 (474) T protein:vir:94 234 AFKNN-------PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKY--Y---KAINVDG 301 (474) T ss_pred EecCC-------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc--c---ceeeccC Confidence 55321 2356778899999999999999999888776665544332111211111111111 1 1111111 Q ss_pred ccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 382 GEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDG 461 (725) Q Consensus 382 g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g 461 (725) + ..++++..+.-..++...++.....|-..+++.+.+.+.-+++.||+|+..+...........-..|..+++++. T Consensus 302 ~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 377 (474) T protein:vir:94 302 D----GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELI 377 (474) T ss_pred C----CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 123444433334555667888889999999887666555445679999887766655555555555666665555 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHh Q lcl|NC_013059. 462 EIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGK 541 (725) Q Consensus 462 ~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~ 541 (725) ++ |.+++... . .+ .+|.|.-.++.+....+..+.+.+ T Consensus 378 ~l----i~~~~~~~------~-----d~------------------------~~i~v~f~~~~p~~~~e~a~~~~~---- 414 (474) T protein:vir:94 378 SF----IIDFNNLK------T-----DV------------------------KDIEISFNFNRMMNDAEQSQIIAQ---- 414 (474) T ss_pred HH----HHHHhCCC------c-----cc------------------------ceeeEEeccCcccCHHHHHHHHHH---- Confidence 44 44444210 0 00 122233344444322233333322 Q ss_pred cccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 542 TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 542 ~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) .+ ..|. ..++..++.. +..+..++++.+.........+.. .........-..+ T Consensus 415 ~g-~iS~--et~l~~l~~v--~D~~~E~eri~~E~~~~~~~~~~~----------------------~~~~~~~~~~~~~ 467 (474) T protein:vir:94 415 SQ-YLSR--ETLVKSSPLV--DDYKAELERIEQEQMEYNKQLPNL----------------------DDGGADGAQQQEG 467 (474) T ss_pred cC-CCCH--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcccc----------------------CCCCCCCcccCCC Confidence 22 2232 1222222222 112222333322111000000000 0000000000000 Q ss_pred HHHHHHH Q lcl|NC_013059. 622 LAKAQNQ 628 (725) Q Consensus 622 ~~kaqae 628 (725) ...-+++ T Consensus 468 ~~~~~~e 474 (474) T protein:vir:94 468 SNNKESE 474 (474) T ss_pred CcccccC Confidence 0000000 No 85 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.58 E-value=9.5e-14 Score=91.96 Aligned_cols=443 Identities=8% Similarity=0.008 Sum_probs=199.4 Q ss_pred CC--------cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHH-----HhhcCC----CcccchHHH Q lcl|NC_013059. 1 MA--------DNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQY-----TTLQYR----GQFDVVRPV 63 (725) Q Consensus 1 ma--------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~-----l~~~gr----p~~N~i~~~ 63 (725) |+ +..+...+++..+.. ....-+....+-.+||.|+|.--.-..+ ....++ .++|..+.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~---~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~I 89 (474) T protein:vir:97 13 YGEEVVEQLKPQFETQEEMIVRLID---DHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNL 89 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHH---HHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHH Confidence 32 222222333333322 2223345566778999998732110000 112222 357999999 Q ss_pred HHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCcee Q lcl|NC_013059. 64 VRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) Q Consensus 64 v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ 143 (725) |+..+|+.-.+.+.+.+ +|.+..+ +++.+.+ |+++.....+.++++++|.||.-+..+ ++ + .+. T Consensus 90 vd~~~~~l~g~p~~~~~-----~d~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d---~~--~-~~~ 153 (474) T protein:vir:97 90 VDQKVSYVASKPVTYSC-----EDENVLK----VIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYIN---EN--G-EMK 153 (474) T ss_pred HHHHHhhhhcCCceecc-----CcHHHHH----HHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEec---CC--C-eeE Confidence 99999999998876643 3333333 4444444 789999999999999999999877543 22 1 233 Q ss_pred EEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEE Q lcl|NC_013059. 144 IRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAE 221 (725) Q Consensus 144 ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E 221 (725) +.+. ++.. ++||+.... + ...+++.|-.. ....++ T Consensus 154 i~~~----~p~~~~~v~d~~~~~----~-~~~~ir~~~~~----------------------------------~~~~~~ 190 (474) T protein:vir:97 154 LFRV----PAEQAIPIWVDKERE----E-LKSFIRYYKFN----------------------------------NEEKVE 190 (474) T ss_pred EEEE----cccceEEEEcCCCCC----c-eEEEEEEEEec----------------------------------CeEEEE Confidence 3321 2332 345543211 1 11223322110 001233 Q ss_pred EEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceE Q lcl|NC_013059. 222 FYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIV 301 (725) Q Consensus 222 ~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~v 301 (725) +|....+ ..+...+ |...... ......+.....+.+.+.+|+| T Consensus 191 ~yt~~~~--~~y~~~~---~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~vPvv 233 (474) T protein:vir:97 191 FWTDTTV--TYYVLEN---GGLIPDY--------------------------------YYGANHVQSHFSNGNWGRVPFI 233 (474) T ss_pred EEeCCeE--EEEEEcC---Ccccccc--------------------------------ccCcCcccccccccCCCccceE Confidence 4433211 1122111 1110000 0001111122233444556666 Q ss_pred EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccC Q lcl|NC_013059. 302 PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENN 381 (725) Q Consensus 302 P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (725) +|... ..+.|.+..+++.++.+|+..|.+...+...+...+++.-...+...+....... + ..+...+ T Consensus 234 ~~~nn-------~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~--~---~~i~~~~ 301 (474) T protein:vir:97 234 AFKNN-------PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKY--Y---KAINVDG 301 (474) T ss_pred EecCC-------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc--c---ceeeccC Confidence 55321 2356778899999999999999999888776665544332111211111111111 1 1111111 Q ss_pred ccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 382 GEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDG 461 (725) Q Consensus 382 g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g 461 (725) + ..++++..+.-..++...++.....|-..+++.+.+.+.-+++.||+|+..+...........-..|..+++++. T Consensus 302 ~----~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 377 (474) T protein:vir:97 302 D----GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELI 377 (474) T ss_pred C----CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 123444433334555667888889999999887666555445679999887766655555555555666665555 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHh Q lcl|NC_013059. 462 EIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGK 541 (725) Q Consensus 462 ~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~ 541 (725) ++ |.+++... . .+ .+|.|.-.++.+....+..+.+.+ T Consensus 378 ~l----i~~~~~~~------~-----d~------------------------~~i~v~f~~~~p~~~~e~a~~~~~---- 414 (474) T protein:vir:97 378 SF----IIDFNNLK------T-----DV------------------------KDIEISFNFNRMMNDAEQSQIIAQ---- 414 (474) T ss_pred HH----HHHHhCCC------c-----cc------------------------ceeeEEeccCcccCHHHHHHHHHH---- Confidence 44 44444210 0 00 122233344444322233333322 Q ss_pred cccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 542 TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 542 ~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) .+ ..|. ..++..++.. +..+..++++.+.........+.. .........-..+ T Consensus 415 ~g-~iS~--et~l~~l~~v--~D~~~E~eri~~E~~~~~~~~~~~----------------------~~~~~~~~~~~~~ 467 (474) T protein:vir:97 415 SQ-YLSR--ETLVKSSPLV--DDYKAELERIEQEQMEYNKQLPNL----------------------DDGGADGAQQQEG 467 (474) T ss_pred cC-CCCH--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHhhcccc----------------------CCCCCCCcccCCC Confidence 22 2232 1222222222 112222333322111000000000 0000000000000 Q ss_pred HHHHHHH Q lcl|NC_013059. 622 LAKAQNQ 628 (725) Q Consensus 622 ~~kaqae 628 (725) ...-+++ T Consensus 468 ~~~~~~e 474 (474) T protein:vir:97 468 SNNKESE 474 (474) T ss_pred CcccccC Confidence 0000000 No 86 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.58 E-value=3e-14 Score=94.73 Aligned_cols=451 Identities=11% Similarity=0.023 Sum_probs=205.8 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC--CHH-----HHHHHhhcC----CCcccchHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQW--DDW-----LSQYTTLQY----RGQFDVVRPVVRKLVS 69 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW--~~~-----~~~~l~~~g----rp~~N~i~~~v~~v~g 69 (725) |.=.......+...+...+... -.....+..+||.|+|= ... .....+... +.++|..+-+|+..+| T Consensus 14 ~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~ 91 (479) T protein:vir:79 14 VQLKKESTINLVKVIEHYILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVG 91 (479) T ss_pred eccccCChhHHHHHHHHHHhhh--hHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHh Confidence 1111111122222222222222 12346677899999761 000 001111122 3357999999999999 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) +...+.+.+.+ ++.+.. .+++.+.+ |+++...+++.+++++.|.||.-+.++ ++ + .+++++. T Consensus 92 ~l~g~p~~~~~-----~~~~~~----~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~--~-~~~i~~~-- 153 (479) T protein:vir:79 92 YSVGNPIVFNA-----DDDNLT----KLLNDLLG-EEFDDTITELYLNASNKGVEWLHPYIN---RK--G-EFKYVII-- 153 (479) T ss_pred hhhcCCceecc-----CCHHHH----HHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEeC---CC--C-ceEEEEE-- Confidence 99988766632 233333 34444444 799999999999999999999887543 22 1 2333321 Q ss_pred ecchhh--eeeCCCcc-ccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEe Q lcl|NC_013059. 150 HSACSH--VIWDSNSK-LMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) Q Consensus 150 ~~~~~~--v~~Dp~a~-~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~ 226 (725) ++.+ .+||+... ++ -++++.|...+ .+.+.+..+|+|... T Consensus 154 --~p~~~~~v~d~~~~~~~------~~~ir~y~~~~-----------------------------~~~~~~~~~e~y~~~ 196 (479) T protein:vir:79 154 --PAEEAIPIWDSKRQREL------VAFIRFYYIED-----------------------------IDGNKIKRVEYYTEN 196 (479) T ss_pred --ccceeEEEEeCCCCCce------EEEEEEEEEee-----------------------------cCCceEEEEEEEeCC Confidence 2332 34554321 21 12232222110 011223345555443 Q ss_pred cceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEee Q lcl|NC_013059. 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGE 306 (725) Q Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~ 306 (725) ... .+...+ +.....-... .. ...........+.+..|.+.+.+|+|+|... T Consensus 197 ~i~--~~~~~~---~~~~~~~~~~----------------------~~-~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn 248 (479) T protein:vir:79 197 DIT--YFIERG---NSFIQEFLYD----------------------EY-GKMTDIQEGHFRINNKEQGWGKVPFIPFKNN 248 (479) T ss_pred cEE--EEEecC---Cccccccccc----------------------cc-ccccccccccccccccccCCCcccEEEecCC Confidence 221 122111 1110000000 00 0000011111222344455556666665321 Q ss_pred eeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHHHhhccccccccccccccCcccc Q lcl|NC_013059. 307 WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 307 ~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) ..+.|.+..+++.++.+|...|.+...+....+..+++ .|.. ....+........ ..+...++ T Consensus 249 -------~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~-----~~i~~~~~--- 312 (479) T protein:vir:79 249 -------EKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVL-KEYPGTSLQEFIDNIRYY-----KSIKVDGG--- 312 (479) T ss_pred -------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccccchhhhhhc-----cceecCCC--- Confidence 12557788999999999999999998888776665443 2211 1111111110000 01111111 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..++++..+.-..+....++...+.|-..|++.+.+.+..|| .||+|+..............-..|..+++++.++++ T Consensus 313 -~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 390 (479) T protein:vir:79 313 -GGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGD-KSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVC 390 (479) T ss_pred -CcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223454433334555667888888998999888777775554 699999887666666555555666666666555555 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQG 545 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~ 545 (725) .++. - .+. . ..+ -.++.|.-.+..+.-..+..+.+..+.+.+ T Consensus 391 ~~~~----~------~~~--~---------~~~--------------~~~i~i~f~~~~p~~~~~~a~~~~kl~g~i--- 432 (479) T protein:vir:79 391 EYLK----I------SGN--K---------SYD--------------YKTVQITFNHSMIINEAEKIDMAAKSTGIV--- 432 (479) T ss_pred HHHh----c------cCC--C---------ccc--------------cccceEEeCCCCCcCHHHHHHHHHHHhccC--- Confidence 4432 1 110 0 000 135556656666654444455555543322 Q ss_pred cchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 546 TPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKA 625 (725) Q Consensus 546 ~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~ka 625 (725) |. ...+..++..+ ..+.-++++.+........ .+..... ... ....+ T Consensus 433 -S~--et~l~~l~~v~--d~~~E~~ri~~E~~~~~~~--------------~~~~~~~-------~~~-------~~~e~ 479 (479) T protein:vir:79 433 -SD--ETIVSNHPWVE--DVNDELERLKKQEDTQKEY--------------DDLIPNN-------QDG-------VIDET 479 (479) T ss_pred -cH--HHHHHhCCCCC--CHHHHHHHHHHHHHHHHHH--------------HhccCcc-------cCC-------CcCcC Confidence 21 11222222211 1112222222211000000 0000000 000 00000 No 87 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.58 E-value=8.1e-13 Score=86.86 Aligned_cols=509 Identities=10% Similarity=0.009 Sum_probs=234.7 Q ss_pred CCcHHHH----HHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHH----HHHh Q lcl|NC_013059. 1 MADNKNR----LESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLV----SEMR 72 (725) Q Consensus 1 mad~~~~----~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~----g~~~ 72 (725) ||....+ -+.+..+|+.-.+....|-..+.+..+|..-.=.+.+.-.......++.-..-...++.+. +.-. T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 80 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALF 80 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhc Confidence 9975322 2335556665555555666666777778652211110000001112221133333333332 2222 Q ss_pred hCCcceEEecCCc-------chH---HHHHHHHH---HHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC Q lcl|NC_013059. 73 QNPIDVLYRPKDG-------ASP---DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS 139 (725) Q Consensus 73 ~nr~~~~~~pr~~-------~d~---~~Ae~l~~---~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~ 139 (725) -+++=+++.+.+. .+. ++.+.|.. .+......|++..+...+|.+.+..|.|++-+ .+++ + T Consensus 81 P~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~~-~ 154 (535) T protein:vir:94 81 PMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYI-----PEPE-G 154 (535) T ss_pred CCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEee-----ccCc-C Confidence 3344445544431 111 23333433 33334468899999999999999999998644 2222 2 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .....+..|+ .++++..++.= ...-++++..|+.+.+-+.|++.- .. ... . ..++.|.| T Consensus 155 ~~~~f~~~pl----~~y~v~~d~~G----~vd~i~r~~~~~~~~l~~~~~~~~------~~--~~~--~---~~~~~v~v 213 (535) T protein:vir:94 155 TYNPMKLYRL----SSYVVQRDAFG----TVLQIVTLDKTAYAALPEDVRNSM------DS--SQE--H---KGDEMIDV 213 (535) T ss_pred cccceEEEEc----CeEEEeeCCCC----CeEEEEeeeeccHHHhhHHHHHHH------Hh--ccc--c---CCCceeEE Confidence 2223344444 34555544321 122356777888888777666410 00 000 0 11234444 Q ss_pred EEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEE-eeccccccCCCCCCCCcc Q lcl|NC_013059. 220 AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVLKDKQLIAGEHI 298 (725) Q Consensus 220 ~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~-~~g~~~l~~~~~~p~~~~ 298 (725) ..+.+++. .++. +. |++ ..|..+....+.|++..+ T Consensus 214 ~~~v~~~~-----------~~~~-------------------------------~~--~~~e~~g~~~~~~~~~~g~~~~ 249 (535) T protein:vir:94 214 YTHIYLDE-----------ESGE-------------------------------YL--KYEEIDGVEVEGTDASYPVDAC 249 (535) T ss_pred EEEEEeeC-----------CCCc-------------------------------EE--EEEEecCeeeccccccCccccC Confidence 44332221 1111 11 222 234444333467888999 Q ss_pred ceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccccc Q lcl|NC_013059. 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) Q Consensus 299 p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) ||+|+-.. ..+|..|+.|.+.+..+-.+.+|+.....+.....+.+.++++.++.+-...... .. ..+..+. T Consensus 250 P~~~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~---~~g~~v~ 321 (535) T protein:vir:94 250 PYIPVRMV--RIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLT---KA---QTGDFVS 321 (535) T ss_pred Cceeeeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcc---cC---CCceeec Confidence 99987544 4689899999999999999999998888888888888888888775443221111 11 1111122 Q ss_pred ccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhc-cCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_013059. 379 ENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNLAT-A 456 (725) Q Consensus 379 ~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G-~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~ 456 (725) ...+.+. +.......--.....+++...+.|.... . -.+++ .++..+++.=|..+.+.....+...+.+|.. . T Consensus 322 g~~~~v~---~~~~~~~~~~~~~~~~i~~~~~rI~~af-~-~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~El 396 (535) T protein:vir:94 322 GRPEDIS---FLQLEKAADFSVARAVSEQIEGRLSYAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (535) T ss_pred CCcccce---eeecccccchhHHHHHHHHHHHHHHHHH-h-HhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHH Confidence 2222221 1112222223445667777777777665 2 22343 3445566777888888888888887777663 2 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHH Q lcl|NC_013059. 457 MRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEIL 536 (725) Q Consensus 457 ~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ 536 (725) ..=+.+..++++. ..|.- ++.. .++ +++.+.. |-.+..|.+.++.+. T Consensus 397 L~Pli~r~~~il~-------------r~g~l------P~~p----------~~~---v~~~~vs-~la~l~r~~~~~~l~ 443 (535) T protein:vir:94 397 QLPMVRVLLKQLQ-------------ATNQI------PELP----------KEA---VEPTIST-GMEALGRGQDLDKLE 443 (535) T ss_pred HHHHHHHHHHHHH-------------hCCCC------CCCC----------hhh---ccceEee-hHHHHHHHHHHHHHH Confidence 2222233333322 11110 0000 011 2344422 334456778888888 Q ss_pred HHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhh--hhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_013059. 537 ELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ--MGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGV 614 (725) Q Consensus 537 ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~--~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~ 614 (725) ++++.+.+..|.... +..|++ +++..+...... ..+... +++.+++.+++++++++ +. ++.+... T Consensus 444 ~~~~~laq~~P~~ld------~~id~d---~~~~~~a~~~Gvp~~~i~rs--~eev~~~~~q~~~~~~~-~~-~~~~~g~ 510 (535) T protein:vir:94 444 RCIAAWSALAPMQGD------PDINIA---TIKLRIANAIGIDTSGILKT--PEEKQQEMAEAAQGTAM-QN-AAASAGA 510 (535) T ss_pred HHHHHHHhhChHHhh------hcCCHH---HHHHHHHHHhCCChhhhcCC--HHHHHHHHHHHHHHHHH-HH-HHHHHHH Confidence 888877665553211 123433 333333222221 122211 11111111111110000 00 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 615 LLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDL 659 (725) Q Consensus 615 ~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~ 659 (725) +.....+ .....+.+.. ....-+++ T Consensus 511 ----------~~~~~~~-------~~~~~~~~~~---~~~g~~~~ 535 (535) T protein:vir:94 511 ----------GAGTMAT-------ASPENMKAAA---AQAGMAPN 535 (535) T ss_pred ----------hhhcccc-------cChHHHHHHH---HHhccCCC Confidence 0000000 0000000000 00011111 No 88 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.54 E-value=5.4e-13 Score=87.84 Aligned_cols=392 Identities=12% Similarity=0.058 Sum_probs=197.0 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH----HHHHHHhhcCCCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDD----WLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~----~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |- .+.+.+|...+..- .+. -.+-.+||+|+|.-. .....++..-+.+.|..+..|+.+.+-.. T Consensus 1 ~~--~~~i~~L~~~~~~~---~~r----~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~---- 67 (409) T protein:vir:94 1 MT--EKGIGYLRFKLSVH---KRR----AEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV---- 67 (409) T ss_pred CC--HHHHHHHHHHHHHH---hHH----HHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc---- Confidence 43 33666665554432 122 233358999998642 23334444456677999999998866321 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh-- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS-- 154 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~-- 154 (725) |..-+.+|.+ +..+++.|+++...+.++.++++.|.+|+-|.- +++ +.+ .|+.. ++. T Consensus 68 ---~~Gf~~~d~~--------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~---~~d--g~~-~i~~~----sp~~~ 126 (409) T protein:vir:94 68 ---FREFENDDFT--------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISK---GEN--DAV-RLQVI----EAVNA 126 (409) T ss_pred ---cCcccCCchH--------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEec---CCC--Cce-EEEEe----ccceE Confidence 1111122322 466889999999999999999999999987742 222 222 23321 222 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) .++|||..+++- ..++.|-+ + . ....++ ..+|.. T Consensus 127 ~~i~D~~~~~~~------~a~~~~~~-------------d----------------~-~~~~~~-~~~~~~--------- 160 (409) T protein:vir:94 127 TGIIDPITGLLT------EGYAVLER-------------D----------------E-NNNVVL-EAHFLP--------- 160 (409) T ss_pred EEEEecCCCcee------eeEEEEEe-------------c----------------C-CCceEE-EEEEec--------- Confidence 367887544321 11221100 0 0 001111 111211 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) ++++.+...+ | .|. ..|.|.++.|+|||+..+. .. T Consensus 161 ------~~~~~~~~~~-----------~--------------~~~----------~~~n~~g~vPvV~f~n~~~----~~ 195 (409) T protein:vir:94 161 ------DRTDYYYRDS-----------R--------------NNI----------SIANPTGHPLLVPIIHRPD----AV 195 (409) T ss_pred ------CcEEEEEecC-----------c--------------eeE----------eeeCCCCCcceEEeccccc----cc Confidence 1111110000 0 000 1234557889999875432 22 Q ss_pred ccch---hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceee---chhhcchHHHHHHhhccccccccccccccCccccccC Q lcl|NC_013059. 315 VYEG---VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFF---WPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) Q Consensus 315 ~~~G---~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) .+|| +.+.+++.|+.+|+.++.++...-..+.....+ +++. .+. +.|.....+ +...-....| .... T Consensus 196 ~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~-~~~-~~~~~~~~~---i~~~~~d~dg--~~~~ 268 (409) T protein:vir:94 196 RPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA-EPM-ETWKATVSS---MLQFTKDEDG--DKPT 268 (409) T ss_pred cccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC-ccc-chhhhhHHH---hhcCCCCCCC--CCce Confidence 3566 336799999999999998876554444432221 1111 111 112111000 0000000011 1123 Q ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) Q Consensus 389 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~l 467 (725) ++.++...+ ..+...+......+-.+||+.+..+|..+ |..||.||.+....-........+-|..+.++++++++.+ T Consensus 269 v~q~~~~~l-~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i 347 (409) T protein:vir:94 269 LGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACL 347 (409) T ss_pred EEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444444444 34566777777777788899999999765 5589999997665555554555566667777777766654 Q ss_pred HHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccC---chhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 468 VNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS---FQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 468 i~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~---~~t~r~~~~~~l~ell~~~~~ 544 (725) .-.+- .. .. ++ +++.|.=.|. ......+..+.+..|.++.+. T Consensus 348 ~~~~~-~~----------~~---------------------~~---~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~ 392 (409) T protein:vir:94 348 RDDAP-YL----------RE---------------------QF---RKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPE 392 (409) T ss_pred hCCCC-cc----------cc---------------------cc---ccceEEeccCCCcchHHHHHHHHHHHHHHHhccc Confidence 32211 00 00 00 1222222222 222235567788888887654 Q ss_pred ccc-hHHHHHHHhhccCC Q lcl|NC_013059. 545 GTP-EYQLLLLQYFTLLD 561 (725) Q Consensus 545 ~~p-~~~~~~~~~~~~~d 561 (725) .++ ......+++ .-.| T Consensus 393 ~~~~~~~~~~lG~-~~~d 409 (409) T protein:vir:94 393 FINKDTIRDLTGI-EGGE 409 (409) T ss_pred ccchhHHHHHcCC-CCCC Confidence 433 222333222 1222 No 89 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.53 E-value=4.5e-13 Score=88.28 Aligned_cols=393 Identities=10% Similarity=0.020 Sum_probs=191.8 Q ss_pred hHHHHHHHHHHHHhhcCCCC----CHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHH Q lcl|NC_013059. 22 SDEARREAKNDLFFSRVSQW----DDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGM 97 (725) Q Consensus 22 ~~~~r~~a~~d~~f~~G~QW----~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~ 97 (725) ..-.+..-..-.+||+|+|= +......++..-+.+.|..+..|+.+.+-..-+ .-..+|.+ T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~-------Gf~~~d~~-------- 65 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFR-------AFANDDFN-------- 65 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccc-------cccCCCch-------- Confidence 11112223344689999873 333334455444567799999999986632211 11122222 Q ss_pred HHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh--heeeCCCccccChhcccceee Q lcl|NC_013059. 98 YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS--HVIWDSNSKLMDKSDARHCTV 175 (725) Q Consensus 98 ~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~--~v~~Dp~a~~~d~sDa~~~~~ 175 (725) +..+++.|+++...+.++.++++.|.+|+-|.-+ ++ +.+ .|+. .+|. .++|||..+++. +.. T Consensus 66 l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~---~d--~~~-~i~~----~sP~~~~~i~Dp~~~~~~------~al 129 (410) T protein:vir:95 66 VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKG---ED--DEV-RLQV----IESSNATGVIDPITGLLV------EGY 129 (410) T ss_pred HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecC---CC--Cce-EEEE----EcccceEEEEeCCCCceE------EEE Confidence 4567889999999999999999999999877422 22 222 2322 1222 367787543221 111 Q ss_pred eecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHH Q lcl|NC_013059. 176 IHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVI 255 (725) Q Consensus 176 ~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~ 255 (725) +.|- .+ +........+|... ..++...+ |. T Consensus 130 ~~~~------------------------------~~-~~~~~~~~~~~~~~---~~~~~~~~---~~------------- 159 (410) T protein:vir:95 130 AVLA------------------------------RD-DYNRPTLEAYFEPN---ATHFIPKD---GE------------- 159 (410) T ss_pred EEEE------------------------------ec-CCCeEEEEEEEeCC---cEEEEeeC---Cc------------- Confidence 1110 00 00111222222210 00000000 00 Q ss_pred HHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccch---hhhhhhhHHHHHHH Q lcl|NC_013059. 256 DDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEG---VVRLTKDGQRLRNM 332 (725) Q Consensus 256 ~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G---~vr~~kd~Q~~~N~ 332 (725) .| ..|.+.+..|+|||+..+ ....+|| +.+.+++.|+.+|+ T Consensus 160 ---------------------~~-----------~~~~~~g~vPvV~f~n~~----~l~~~~G~s~I~~~v~~l~da~~r 203 (410) T protein:vir:95 160 ---------------------PY-----------SVTNETGIPLLVPVIHRP----DAVRPFGRSRITRAGMYYQKYAKR 203 (410) T ss_pred ---------------------cc-----------cccCCCCCcceEEecccc----cCCccCCccccchhHHHHHHHHHH Confidence 01 113344678888886432 2233566 55789999999999 Q ss_pred HHHHHHHHHHhcCCcceeechhhcc-hH-HHHHHhhccccccccccccccCccccccCCcccCCCCchHHHHHHHHHHHH Q lcl|NC_013059. 333 IMSFNADIVARTPKKKPFFWPEQIA-GF-EHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATA 410 (725) Q Consensus 333 ~~s~~~~~~~~~~~~~~~~~~~~i~-~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~ 410 (725) .++.+....-..+.....+ .|... +. .+.|...... +...-....| ....+..++...+ ..+...+..... T Consensus 204 ~~~~~~~~~e~~a~pqr~i-~G~d~d~~~~~~~~~~~~~---i~~~~~~~~~--~~~~v~q~~~~~l-~~~~~~l~~l~~ 276 (410) T protein:vir:95 204 TLERADITAEFYSWPQKYI-LGLDPDAEPMEKWKATVSS---LLTISSSDKG--VKPSVGQFTTASM-SPFTEQLRTAAA 276 (410) T ss_pred HHHHHHHHHHHhcchhhee-eccCCCCCcCchhhhhhhh---heeccCCCCC--CcceEEecCCCCh-HHHHHHHHHHHH Confidence 9998776555444432222 11100 00 0111111000 0000000111 1123444554454 346677777777 Q ss_pred HHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcce Q lcl|NC_013059. 411 AVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKE 489 (725) Q Consensus 411 ~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~ 489 (725) .+-.+||+....+|..+ |..||.||.+....-........+-|..+.++++++.+.+.-.+=..+ ..+ T Consensus 277 ~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~-----------~~~ 345 (410) T protein:vir:95 277 GFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTR-----------SQF 345 (410) T ss_pred HHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcc-----------ccc Confidence 88888899999999755 557999999876665555555666677777777777666542210000 000 Q ss_pred EEeccccccccCCceeeeccccccceEEEEec----cCchhHHHHHHHHHHHHHHhcccccc-hHHHHHHHhhccCCchh Q lcl|NC_013059. 490 VQLMAEVVDLATGERQVLNDIRGRYECYTDVG----PSFQSMKQQNRAEILELLGKTPQGTP-EYQLLLLQYFTLLDGKG 564 (725) Q Consensus 490 v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~----p~~~t~r~~~~~~l~ell~~~~~~~p-~~~~~~~~~~~~~d~~~ 564 (725) +++.|.=. |+++| ..+..+.+..|.++.+...+ .....++.+ .+-+ T Consensus 346 ------------------------~~~~v~W~p~~d~~~~s-~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~---~~~~- 396 (410) T protein:vir:95 346 ------------------------VRTAVKWEPLFEADANT-MTMIGDGVVKLNQALPGYINAETIRDLTGI---AGDM- 396 (410) T ss_pred ------------------------ceeeEEeeecCCcchhh-HHHHHHHHHHHHHhccCCccHHHHHHhcCC---ChHH- Confidence 11111111 23444 34567777777776553332 222222222 1111 Q ss_pred HHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhh Q lcl|NC_013059. 565 VEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQ 603 (725) Q Consensus 565 ~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q 603 (725) +.... ...+.+..+ T Consensus 397 ---~~~~~----------------------~~e~~~~g~ 410 (410) T protein:vir:95 397 ---SAKPV----------------------VSEGGSNGE 410 (410) T ss_pred ---HHHHH----------------------HHHHHhCCC Confidence 11100 000111111 No 90 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.51 E-value=3.9e-12 Score=83.11 Aligned_cols=406 Identities=10% Similarity=0.083 Sum_probs=198.2 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH----HHHHHHhhcCCCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDD----WLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~----~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |= ...++.|+..+..- +..-.+-.+||.|+|... .....++...+.+.|..+..|+.+.+-. T Consensus 1 m~--~~~i~~L~~~~~~~-------~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl----- 66 (422) T protein:vir:97 1 MN--YMGMGYLRRKLALF-------KTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRI----- 66 (422) T ss_pred CC--hHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcc----- Confidence 32 23455555444432 223344579999988632 2334455555666788888888876611 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh-- Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS-- 154 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~-- 154 (725) .|..-+-+|.+ +..+++.|+++...+.++.++++.|.+|+-|..+ ++ .+.+ .|+. .++. T Consensus 67 --~~~Gf~~~d~~--------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~---~~-~~~p-~i~~----~sp~~~ 127 (422) T protein:vir:97 67 --IFREFTNDDFN--------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPG---AE-DGLP-KMQV----IEASKA 127 (422) T ss_pred --ccceeeCCchh--------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeC---CC-CCee-EEEE----echhhE Confidence 11111122322 3557788999999999999999999999887543 21 1212 2332 1232 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) .++|||..+.+. +...+| +. + +....+...||.- ..++. T Consensus 128 ~~i~D~~~~~~~------~a~~~~-~~------------~------------------~~~~~~~~~~~~~----~~~~~ 166 (422) T protein:vir:97 128 TGILDPTTFLLT------EGYAIL-ES------------D------------------SNGNPTLEAYFTD----KDIWY 166 (422) T ss_pred EEEEeCCCCcce------eeEEEE-Ee------------c------------------CCCcEEEEEEEcC----ceEEE Confidence 366787543321 111111 00 0 0001111111110 01111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +.+ .|. .+..|-|.+++|+|||+.++. .. T Consensus 167 ~~~--~~~---------------------------------------------~~~~~~~~g~vPvv~~~n~~~----~~ 195 (422) T protein:vir:97 167 YPK--KGK---------------------------------------------PYNIKNPTGHPLLVPIIHRPD----AV 195 (422) T ss_pred EcC--CCc---------------------------------------------cccccCCCCCcceEEecccCC----Cc Confidence 111 000 011233446789999875432 23 Q ss_pred ccch---hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcc-hH-HHHHHhhccccccccccccccCccccccCC Q lcl|NC_013059. 315 VYEG---VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIA-GF-EHMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) Q Consensus 315 ~~~G---~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~-~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 389 (725) .+|| +.+.+++.|+.+|+.++.++......+.....+ .|... +. .+.|..... .+...-....|. ...+ T Consensus 196 ~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~G~d~d~~~~~~~~~~~~---~i~~~~~de~~~--~~~v 269 (422) T protein:vir:97 196 RPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV-LGMDPDAKPMEKWRATVS---TLLEISKDEDGD--KPTV 269 (422) T ss_pred cccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cccCcccccCchhhhhhh---hhhccCCCCCCC--ccee Confidence 3555 336799999999999998776655544433221 11100 00 011111000 000000001111 1123 Q ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) Q Consensus 390 ~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n-~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li 468 (725) +.++...+ ..+...+......+-.+||+.+..+|..++ ..||.||.+....-........+-|..+.++++++++.+. T Consensus 270 ~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~ 348 (422) T protein:vir:97 270 GQFTTASM-APFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLR 348 (422) T ss_pred eecCCCCh-hHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33444443 345667777777777788999999997664 4799999987666555556666667777777777766543 Q ss_pred HHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCc---hhHHHHHHHHHHHHHHhcccc Q lcl|NC_013059. 469 NDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSF---QSMKQQNRAEILELLGKTPQG 545 (725) Q Consensus 469 ~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~---~t~r~~~~~~l~ell~~~~~~ 545 (725) -..= . . . +++ +++.+.=.|.. .....+..+.+..|+++.+.. T Consensus 349 ~~~~-~---~-------~---------------------~~~---~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~ 393 (422) T protein:vir:97 349 DEFP-Y---L-------R---------------------NQF---MDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGF 393 (422) T ss_pred cCCc-c---c-------c---------------------hhh---ccceEEEccCCCCChHHHHHHHHHHHHHHhhcccc Confidence 1110 0 0 0 001 23333333322 222455677888888876654 Q ss_pred cc-hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh Q lcl|NC_013059. 546 TP-EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG 580 (725) Q Consensus 546 ~p-~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~ 580 (725) ++ .....++. +...+.-..++.+.. +.+ T Consensus 394 ~~~~~~~~~lg------~~~~~~~~~~~~~~~-~d~ 422 (422) T protein:vir:97 394 MDADVIRDLTG------VKGADKPIPAITEVT-TDG 422 (422) T ss_pred ccHHHHHHHcC------CCchhHHHHHHHhhh-ccC Confidence 43 22222222 111122222221110 111 No 91 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.49 E-value=5.8e-12 Score=82.17 Aligned_cols=492 Identities=11% Similarity=-0.007 Sum_probs=228.0 Q ss_pred CCcH-HH----HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HH Q lcl|NC_013059. 1 MADN-KN----RLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EM 71 (725) Q Consensus 1 mad~-~~----~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~ 71 (725) |++. +. .-.++..+|..-.+....|...+.+..+|..-.=++++.-.. + .-+|--..-...++.+.+ .- T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~-~-~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNE-T-SQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCcc-c-cCCcccchHHHHHHHHHHHHHhhh Confidence 8753 22 224566666666555566666777777777643232211110 0 012311333333443322 22 Q ss_pred h-hCCcceEEecCCcc-------hHH---HHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCC Q lcl|NC_013059. 72 R-QNPIDVLYRPKDGA-------SPD---AADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSP 137 (725) Q Consensus 72 ~-~nr~~~~~~pr~~~-------d~~---~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~ 137 (725) . -+++=+++.+.+.. +.+ +.+. .+..+......|++..+...+|.+.+..|.|++-+ ++++ T Consensus 79 tpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-----d~~~ 153 (516) T protein:vir:96 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYK-----PSKG 153 (516) T ss_pred cCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEe-----cCCC Confidence 2 24555666665421 122 2222 34445555667899999999999999999987532 2221 Q ss_pred CCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeE Q lcl|NC_013059. 138 TSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) Q Consensus 138 ~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~v 217 (725) . ++..|+ .++++..++.- ...-++++.+|+..++.+.|+... ... .+... -..++.+ T Consensus 154 ---~--~~~~pl----~~y~v~~d~~G----~v~~i~rr~~~~~~~l~~~~~~~~---~~~---~~~~~----~~~~~~v 210 (516) T protein:vir:96 154 ---A--ISAIPM----HHYVVNRDTNG----DLLDIILLQEKALRTFDPATRAVV---EVG---LKGKK----CKEDDSV 210 (516) T ss_pred ---C--EEEEEc----CeEEEeeCCCC----CeeeehhhhHhhHHHHHHhhhhhh---hhh---hhhhh----cCCCCce Confidence 1 344444 34555544431 112367777888888777664311 100 00000 0011223 Q ss_pred EEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCc Q lcl|NC_013059. 218 QIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEH 297 (725) Q Consensus 218 rv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~ 297 (725) .|..+-++++ + + +..|+.. ..|.++ ...+-||+.. T Consensus 211 ~v~~~v~~~~---------~---~-------------------------------~~~~~~~-~d~~~~-~~es~~~~~e 245 (516) T protein:vir:96 211 KLYTHAKYLG---------D---G-------------------------------FWELKQS-ADDIPV-GKVSKIKSEK 245 (516) T ss_pred EEEEeeeeeC---------C---c-------------------------------eeEEEEE-eCceee-cccccccccc Confidence 3322111111 1 0 0111111 223333 3346788889 Q ss_pred cceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccc Q lcl|NC_013059. 298 IPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRT 377 (725) Q Consensus 298 ~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 377 (725) |||+|+-.. -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.++++.+-...+... ...+..+ T Consensus 246 ~P~~~~Rw~--~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~------~~~g~i~ 317 (516) T protein:vir:96 246 LPFIPLTWK--RSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN------SGTGEVV 317 (516) T ss_pred CCeeeeeee--ecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhcc------CCCceee Confidence 999987554 35898999999999999999999999888888888999999988765533222211 1111111 Q ss_pred cccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 378 DENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAM 457 (725) Q Consensus 378 ~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~ 457 (725) ....+.+ .+++.- +..--......++...+.|....= .+.+.-.++..+++.=|..+.+.-...|...+.+|.. T Consensus 318 ~g~~~~v--~~~q~~-~~~d~~~~~~~i~~~~~rI~~af~-~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~-- 391 (516) T protein:vir:96 318 TGVEEDI--HIVQLG-KYADLTPISAVLEVYTRRIGVVFM-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAT-- 391 (516) T ss_pred cCCcccc--eeeecC-cccchhHHHHHHHHHHHHHHHHHh-hhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHH-- Confidence 1111111 111111 111124445667777777766541 1112222333355666777777777777777776654 Q ss_pred HHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHH Q lcl|NC_013059. 458 RRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILE 537 (725) Q Consensus 458 ~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~e 537 (725) +++.-||...+ .+.++ +. | .+..++.+.. +-.+-.|.+.++.+.. T Consensus 392 ----Ell~Pli~r~l------~~~~p-----------~l--p-----------~~~v~~~~vs-~l~~l~r~~~~~~i~~ 436 (516) T protein:vir:96 392 ----TMQSPVAMWGL------LEAGE-----------SF--T-----------SDLVDPVIIT-GIEALGRMAELDKLAN 436 (516) T ss_pred ----HHHHHHHHHHH------HhcCC-----------CC--c-----------cccccceeec-hHHHHHHHHHHHHHHH Confidence 23333332221 11111 00 0 0112333333 2334457777778887 Q ss_pred HHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh-hhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_013059. 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG-VKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLL 616 (725) Q Consensus 538 ll~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~-~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~ 616 (725) +++.+.+.++... ..++..|++ +++..+......+. +. ..+++..++.++++++++++..+....++. T Consensus 437 ~~~~i~~~~~~~p----~v~d~id~d---~~~~~~a~~~Gvp~~~i--rs~eev~~~~~~~~~~q~~~~~a~~~~~~~-- 505 (516) T protein:vir:96 437 FAQYMSLPLQWPE----PVLAAVKWP---DYMDWVRGQISAELPFL--KSAEEMAQEQEAQMQAQQAQMLEEGVAKAV-- 505 (516) T ss_pred HHHHHHHHhcCCh----hHHhcCCHH---HHHHHHHHHhCCCcccc--CCHHHHHHHHHHHHHHHHHHHHHHHhhhhh-- Confidence 7776654432211 123334443 33333322222221 11 111111111111111111110000001111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 617 QGQAELAKAQNQTLSLQIDAAKVEAQNQLNAA 648 (725) Q Consensus 617 k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a 648 (725) ..+++.|.+. + T Consensus 506 ----------~~~~~~~~~~-----------~ 516 (516) T protein:vir:96 506 ----------PGVIQQELKE-----------A 516 (516) T ss_pred ----------hHHhhccccc-----------C Confidence 1111111100 0 No 92 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.49 E-value=6e-12 Score=82.10 Aligned_cols=454 Identities=9% Similarity=-0.010 Sum_probs=217.1 Q ss_pred CCcH-HHHHHHHHHHH--HHH----H-----hhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHhhcCC------Ccccch Q lcl|NC_013059. 1 MADN-KNRLESILSRF--DAD----W-----TASDEARREAKNDLFFSRVS--QWDDWLSQYTTLQYR------GQFDVV 60 (725) Q Consensus 1 mad~-~~~~~~~~~~~--~~~----~-----~~~~~~r~~a~~d~~f~~G~--QW~~~~~~~l~~~gr------p~~N~i 60 (725) |=|+ ...++....+. ..+ . ...++-.....+..+||.|+ .|.... ....++ .++|.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~---~~~~~~~~~~~~~~~n~~ 77 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLN---YEHNGNPVNRRQLSMNLP 77 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcch---hccCCCccccceeecchH Confidence 6665 22333322221 011 1 11122334456788999985 453211 111222 246989 Q ss_pred HHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 61 ~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) +-+++...++.....+.+.+ +|...++. +..+.+.|++.....+++..+++.|.||+.+.+|. + . T Consensus 78 k~i~~~~a~~l~~~p~~i~~-----~d~~~~e~----l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~---~---~ 142 (496) T protein:vir:38 78 KVTAKYMSKLLFNEKVKINI-----DDKAAEEF----VLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG---N---K 142 (496) T ss_pred HHHHHHHhhhhhCCcceEee-----CChHHHHH----HHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcC---C---C Confidence 99999999999998888777 34444454 45566679999999999999999999999997652 2 1 Q ss_pred ceeEEEEeeecchhhee--eCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVI--WDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQ 218 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~--~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vr 218 (725) .+.+.++ ++..+| |+.. . ++.-+ +|+..+. . +....+ T Consensus 143 ~~~i~~v----~~~~~~P~~~~~-~--~~~~~--~f~~~~~-~-------------------------------~~~~y~ 181 (496) T protein:vir:38 143 NVKVSFA----TADCMYPLSNDS-E--NVDEC--VIANSFH-K-------------------------------NNKYYT 181 (496) T ss_pred cEEEEEE----cccceEEEEecC-C--cEEEE--EEEEEEE-e-------------------------------CCeEEE Confidence 2334432 333443 2211 1 12222 2222110 0 112334 Q ss_pred EEEEEEEeccee----EEEEeeCcc-ccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCC Q lcl|NC_013059. 219 IAEFYEVVEKKE----TAFIYQDPV-TGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI 293 (725) Q Consensus 219 v~E~w~~~~~~~----~~~~~~d~~-~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~ 293 (725) ..|+|+.....- .++...+.. .|..+.+.. +++ -+.....+ T Consensus 182 ~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~-----~~~-----------------------------~~~~~~~~ 227 (496) T protein:vir:38 182 LLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTL-----LFD-----------------------------DIEPVVPL 227 (496) T ss_pred EEEEEEEeCceEEEEEEEEecCCccccCccccccc-----ccc-----------------------------ccccceee Confidence 455555332211 111111111 122221110 000 00000011 Q ss_pred CC-CccceEEEEe--eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccc Q lcl|NC_013059. 294 AG-EHIPIVPVFG--EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYP 370 (725) Q Consensus 294 p~-~~~p~vP~~g--~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 370 (725) ++ ...||++|-. ...-..+.+.+.|.+.++++.++.+|...|.+.+.+-+ ...++.++...+....+. .... .+ T Consensus 228 ~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~-~g~~-~~ 304 (496) T protein:vir:38 228 PDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNL-DGST-TQ 304 (496) T ss_pred cCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCC-CCcc-cc Confidence 11 1122222211 00011334445578999999999999999999988765 344555554433211100 0000 00 Q ss_pred cccc--cc-ccccCcccc-ccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLL--NR-TDENNGEMP-TQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADL 444 (725) Q Consensus 371 ~~~~--~~-~~~~~g~~~-~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~ 444 (725) .... .. ......... ...++.. .+.+. ..+...++.....+...+|++...+|..++. .|+.+|......... T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~ 383 (496) T protein:vir:38 305 YFDSTDEAFFLYQGDQDDNGKAIKDI-SVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQ 383 (496) T ss_pred CCCCccceEEEeecCCCcccccceee-ccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHH Confidence 0000 00 000111111 1123333 33443 4567788888899999999999988865433 467778776665555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEecc Q lcl|NC_013059. 445 ETYVFQDNLATAMRRDGEIYQSIVNDIY--DVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP 522 (725) Q Consensus 445 ~~~~~~dn~~~~~~~~g~~ll~li~~~y--~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p 522 (725) .....-..+..+++++.+.++.+..-+- +.. . + ...++.|+-.. T Consensus 384 ~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~--------~-------------------------~-~~~~i~v~f~d 429 (496) T protein:vir:38 384 TKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGE--------V-------------------------V-ELDTITVDFDD 429 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--------C-------------------------C-CccceEEEeCC Confidence 5556777778888888888887654322 111 0 0 01334444444 Q ss_pred CchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhh----hccchhh Q lcl|NC_013059. 523 SFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK----KPETPEE 589 (725) Q Consensus 523 ~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~----~~~~~e~ 589 (725) +.+.-.++..+.++++..+ + .+|.- ..+...+..+-+.+++.++++.......... ....+++ T Consensus 430 ~i~~d~~~~~~~~~~~~~~-G-iiS~e--t~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 430 SIAQDEDTTINRYTNAKNQ-G-MIPLK--IALQRAWNITEAEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred CCCCCHHHHHHHHHHHHhc-C-CCCHH--HHHHhcCCCChHHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 4444445566666665432 2 22311 1111122223344444444443322111000 0000000 No 93 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.48 E-value=8.8e-13 Score=86.67 Aligned_cols=462 Identities=10% Similarity=0.008 Sum_probs=183.6 Q ss_pred CCc-------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHH----HH--HhhcCCCcccchHHHHHHH Q lcl|NC_013059. 1 MAD-------NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLS----QY--TTLQYRGQFDVVRPVVRKL 67 (725) Q Consensus 1 mad-------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~----~~--l~~~grp~~N~i~~~v~~v 67 (725) |-+ ..++-.-+...+...+.. ......+-.+||.|++.-.... .. .+..-+.+.|..+-+|+.. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~---~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 77 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNT---ECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSF 77 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHH---HhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHH Confidence 432 111222122222222222 1122334468999997521110 00 0111123568888888888 Q ss_pred HHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEE Q lcl|NC_013059. 68 VSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRRE 147 (725) Q Consensus 68 ~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~ 147 (725) ++... +.+-...|.+..+. +..++..|+++...+.+..++++.|.+|+-|.-.....|..+. ..++. T Consensus 78 ~~~l~-------~~gf~~~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~-~~i~~- 144 (479) T protein:vir:99 78 AQQLI-------VDGYRKTGTNENAK----GWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTV-ARIKC- 144 (479) T ss_pred Hhhcc-------cccccCCCchhhHH----HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCc-eEEEE- Confidence 77432 11111222222332 3456678999999999999999999988765421111122222 22221 Q ss_pred eeecchhhe--eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 148 PIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 148 ~~~~~~~~v--~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) .++.++ +||...+.. ..++.. + .+......+|.. T Consensus 145 ---~~p~~~~~iydd~~~~~-----~~~~~~-----------------~-------------------~~~~~~~~~~~~ 180 (479) T protein:vir:99 145 ---IDPRDAFAIWEDPYWDE-----WPKYLL-----------------E-------------------RQPNGQYWWWTE 180 (479) T ss_pred ---echhheEEEecCCcccc-----eeeEEE-----------------e-------------------ecCceeEEEEec Confidence 123322 343221100 000000 0 000001111111 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) . . +..+. ...|...+.++.|-+.+.+|+|||.. T Consensus 181 ~--~--~~~~~-------------------------------------------~~~~~~~~~~~~~h~~g~vPvv~f~n 213 (479) T protein:vir:99 181 E--D--YSIFE-------------------------------------------FKQGKFIYRETVSHDYGHIPFVRYVN 213 (479) T ss_pred c--e--EEEEE-------------------------------------------ecCCceeeccccccCCCCcceEEeec Confidence 0 0 00000 00111112233333345677777654 Q ss_pred eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 306 ~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) .+ +....+.|-+..+++.++.+|+.+|.+...+...+.....+ .|... .+......+.+.......+...++. T Consensus 214 ~~---~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i-~G~~~-~~~~~~~~~~~~~~~~~i~~~~~~~-- 286 (479) T protein:vir:99 214 VM---DLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWA-TGLML-PEGANADQEKMRFAQESMLISQNEK-- 286 (479) T ss_pred CC---CcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhh-cCCCc-ccccccchhccccccccceeecCCC-- Confidence 32 22234678889999999999999998876665544433221 11110 0000000011111111111111111 Q ss_pred ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll 465 (725) ..+..++... ...+...++.....|-.++|+.+..+|..+| .||+|+......-........+-|..+++++.++++ T Consensus 287 -~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~ 363 (479) T protein:vir:99 287 -ASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVN-VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVN 363 (479) T ss_pred -ceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222222 2344555665556666667888899997666 699999987766666666666667777777666654 Q ss_pred HHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEec-cCchhHHHHHHHHHHHHHHhccc Q lcl|NC_013059. 466 SIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVG-PSFQSMKQQNRAEILELLGKTPQ 544 (725) Q Consensus 466 ~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~-p~~~t~r~~~~~~l~ell~~~~~ 544 (725) .+ -+ ...... .+++.|.=. |.+++ ..+..+.+.+|.++ + T Consensus 364 ~~----~~---------~~~~~~------------------------~~~i~~~w~~~~~~s-~~~~ad~~~kl~~a-g- 403 (479) T protein:vir:99 364 KI----EG---------RTEEAT------------------------DLDFTITWQDVTIQS-LAQFADAWAKMVES-L- 403 (479) T ss_pred HH----cC---------CCcccc------------------------ceeeeEEecCCCCCC-HHHHHHHHHHHHhc-C- Confidence 42 21 110000 122332221 22334 33456666666543 1 Q ss_pred ccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 545 ~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) ..|. ..++..++..+-+.++.+.+....+........ .... ... ...+.. ......+..+ T Consensus 404 ~is~--et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~------------~~~~---~~~--~~~~~~-~~~~~~~~~~ 463 (479) T protein:vir:99 404 KIPA--EGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMR------------KLQN---GPD--PAEQRG-GPNGATNMQQ 463 (479) T ss_pred CCCH--HHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHH------------HHhc---ccC--cccccC-CCCCCCCCCC Confidence 2222 222222222232333322211111000000000 0000 000 000000 0000000000 Q ss_pred HHHHH-HHHHHHHHHHHH Q lcl|NC_013059. 625 AQNQT-LSLQIDAAKVEA 641 (725) Q Consensus 625 aqae~-~k~q~ea~~~q~ 641 (725) +...- .-+++ -+..+ T Consensus 464 ~~~~~~~~~~~--~~~~~ 479 (479) T protein:vir:99 464 ANNKTGEPASL--NKSGA 479 (479) T ss_pred CCCCCcchhcc--CCCCC Confidence 00000 00000 00000 No 94 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.48 E-value=6.1e-12 Score=82.06 Aligned_cols=394 Identities=11% Similarity=0.038 Sum_probs=198.8 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH----HHHHHHhhcCCCcccchHHHHHHHHHHHhhCCc Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDD----WLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~----~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~ 76 (725) |- .+.+.+|+..+... . ..-.+-.+||+|+|... .....++..-+.+.|..+..|+.+.+-..-+ T Consensus 1 ~~--~~~i~~L~~~~~~~---~----~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~-- 69 (409) T protein:vir:16 1 MT--EKGIGYLRFKLSVH---K----RRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFR-- 69 (409) T ss_pred CC--HHHHHHHHHHHHHH---h----HHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccc-- Confidence 43 33666666555442 1 22344468999998643 3334454445667799999999986622211 Q ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhhe Q lcl|NC_013059. 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV 156 (725) Q Consensus 77 ~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v 156 (725) .-+.+|.+ +..+++.|+++...+.+..++++.|.+|+-|.- +++ +.+ .|+.. .-....+ T Consensus 70 -----Gf~~~d~~--------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~---~~d--g~~-~i~~~--sP~~~~~ 128 (409) T protein:vir:16 70 -----EFENDDFT--------VNEIFEENNPDIFFDSTVLSALIASCSFTYISK---GEN--DAV-RLQVI--EATNATG 128 (409) T ss_pred -----cccCcchH--------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEec---CCC--Cce-EEEEE--cccceEE Confidence 11123322 456789999999999999999999999987642 222 222 23321 1112236 Q ss_pred eeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEee Q lcl|NC_013059. 157 IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQ 236 (725) Q Consensus 157 ~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~ 236 (725) +|||..+++. ...+.|-+ +. ....++ ..+|.. T Consensus 129 i~D~~~~~~~------~a~~~~~~-------------d~-----------------~~~~~~-~~~~~~----------- 160 (409) T protein:vir:16 129 IIDPITGLLT------EGYAVLER-------------DE-----------------NNNVVL-EAHFLP----------- 160 (409) T ss_pred Eeecccccce------eeeEEEEe-------------cC-----------------CCceEE-EEEEec----------- Confidence 7788655432 11111100 00 001111 111111 Q ss_pred CccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcccc Q lcl|NC_013059. 237 DPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVY 316 (725) Q Consensus 237 d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~ 316 (725) ++++.+... +... ...|-|.+..|+|||+..+ ....+ T Consensus 161 ----~~~~~~~~~---------------------------------~~~~--~~~~~~~g~vPvV~f~n~~----~~~~~ 197 (409) T protein:vir:16 161 ----DRTDYYYRD---------------------------------SRNN--ISIANPTGNPLLVPIIHRP----DAVRP 197 (409) T ss_pred ----CcEEEEEec---------------------------------Cccc--cceecCCCCcceEEecccc----ccccc Confidence 111111000 0000 1123455778999986542 22346 Q ss_pred chhh---hhhhhHHHHHHHHHHHHHHHHHhcCCcc--eeechhhcchHHHHHHhhccccccccccccccCccccccCCcc Q lcl|NC_013059. 317 EGVV---RLTKDGQRLRNMIMSFNADIVARTPKKK--PFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAY 391 (725) Q Consensus 317 ~G~v---r~~kd~Q~~~N~~~s~~~~~~~~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 391 (725) ||.. +.+++.|+.+|+.++.+.......+... ..+..+...+. +.|.....+ +...-....| +...++. T Consensus 198 ~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~-~~~~~~~~~---i~~~~~d~~g--~~~~v~q 271 (409) T protein:vir:16 198 FGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPM-ETWKATVSS---MLQFTKDEDG--DKPTLGQ 271 (409) T ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCcc-chhhhhhhH---hhccCCCCCC--CCceEEe Confidence 7743 6799999999999998776544443332 22221110111 112111000 0000000111 1123444 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 392 YENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVND 470 (725) Q Consensus 392 ~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~-n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~ 470 (725) ++...+ ..+...+......+-.+||+.+..+|..+ |-.||.||.+....-.......-+-|..+.++++++++.+.-. T Consensus 272 ~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~ 350 (409) T protein:vir:16 272 FTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDD 350 (409) T ss_pred cCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 444444 35667777777777788899999999765 4479999997665555555555666667777777766664222 Q ss_pred hcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEe----ccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 471 IYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDV----GPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 471 ~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~----~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) + +.. ...+ +++.|.= -|+++| ..+..+.+..|.++.+..+ T Consensus 351 ~-~~~----------~~~~------------------------~~~~v~W~~~~~~~~~s-~a~~aDa~~Kl~~a~~~~~ 394 (409) T protein:vir:16 351 V-PYL----------REQF------------------------SKTKPKWEPLFEADASM-LSLIGDGAIKLNQAIPEFI 394 (409) T ss_pred C-Ccc----------chhh------------------------ccceEEecCCCCcchhh-HHHHHHHHHHHHhhccccc Confidence 1 000 0000 1111111 123343 3567788888888765544 Q ss_pred c-hHHHHHHHhhccCC Q lcl|NC_013059. 547 P-EYQLLLLQYFTLLD 561 (725) Q Consensus 547 p-~~~~~~~~~~~~~d 561 (725) + .....++.+ .-.| T Consensus 395 ~~~v~~~~~g~-~~~d 409 (409) T protein:vir:16 395 NKDTIRDLTGI-KGAE 409 (409) T ss_pred chhHHHHhccC-CCCC Confidence 3 222222221 2222 No 95 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.48 E-value=7.1e-12 Score=81.72 Aligned_cols=497 Identities=11% Similarity=0.012 Sum_probs=226.9 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc---CCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHh-hCCcc Q lcl|NC_013059. 6 NRLESILSRFDADWTASDEARREAKNDLFFSR---VSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMR-QNPID 77 (725) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~---G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~-~nr~~ 77 (725) .. +..+|..-......|...+.+..+|.. +.=..... ...+.-.+|.-..-...++.+.+ .-. -+++= T Consensus 1 m~---~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 76 (522) T protein:vir:10 1 MK---ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSR-PNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSF 76 (522) T ss_pred Cc---hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCC-cccccccccccchHHHHHHHHHHHHHHhhcCCCCcc Confidence 22 334444444444555555666667774 22111111 11111123322333344443333 222 24555 Q ss_pred eEEecCCcc-----h----HHHH---HHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEE Q lcl|NC_013059. 78 VLYRPKDGA-----S----PDAA---DVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) Q Consensus 78 ~~~~pr~~~-----d----~~~A---e~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir 145 (725) +++.+.+.+ + ..+. +..+..+......|++..+...+|.+.+..|.|++ |.+++++ + T Consensus 77 F~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----y~~~~~~------~ 145 (522) T protein:vir:10 77 FKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALI-----FMGKDGL------K 145 (522) T ss_pred ccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeE-----EEcCCCc------e Confidence 666654421 1 1122 22444455556789999999999999999999985 3445543 3 Q ss_pred EEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 146 ~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) ..|+ .++++..++.- ...-++++.+|+...+.+.|+..... . ...+ ....++.+.|+.+.+. T Consensus 146 ~~pl----~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~fg~~~~~--~----~~~~----~~~~~~~v~v~~~v~p 207 (522) T protein:vir:10 146 TFPL----TRYVINRDGDG----NVLEIVTKELISRKVLDIELPEPKPN--T----GIDE----SSTTNDDVTIYTYVKL 207 (522) T ss_pred EEEc----ceEEEeeCCCC----CeeEEEeeeeccHHHHHHhcchhccc--h----hhhc----ccCCCCceEEEEEEEe Confidence 4444 34666655432 22338899999999999988862211 1 1001 1112345666665554 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEe-eccccccCCCCCCCCccceEEEE Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIAGEHIPIVPVF 304 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~l~~~~~~p~~~~p~vP~~ 304 (725) +... |.. .|+.. .|..+....+.+++..+||+|+. T Consensus 208 ~~~~-----------~~~---------------------------------~~~~~~~~~~~~~~~s~~g~~~~P~~~~R 243 (522) T protein:vir:10 208 DKSS-----------GRW---------------------------------VWHQEAFDKIIPDSRSTAPKNASPWLPLR 243 (522) T ss_pred eccC-----------Cce---------------------------------EEEEccCCccccccccccccccCCceeee Confidence 3211 110 11111 11112122356788999999875 Q ss_pred eeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccc Q lcl|NC_013059. 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) Q Consensus 305 g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) .. -.+|..|+.|.+.++.+-.+.+|......+.....+.+.++.++++.+....+.. +...+..+....+.+ T Consensus 244 w~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~------~~~~~~~v~g~~~~v 315 (522) T protein:vir:10 244 FN--TVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIA------KAGNGAIVQGRPEDV 315 (522) T ss_pred ee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccccc------CCCCcceecCCCccc Confidence 54 3588899999999999999999999999998889999999998776543322111 111111222222222 Q ss_pred cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_013059. 385 PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLAT-AMRRDGEI 463 (725) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~g~~ 463 (725) . +++.- ...--.....+++...+.|.++.-+ ....++..+++.-|..+.+.....|...+.+|.. ...=+-+. T Consensus 316 ~--~~~~~-~~~d~~~~~~~i~~~~~ri~~aFl~---~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 389 (522) T protein:vir:10 316 A--VIQVG-KTADFSTAANMATAIEKRLLEAFLV---MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNR 389 (522) T ss_pred e--eeccc-ccccchHHHHHHHHHHHHHHHHHhh---ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 1 11111 1122233455666666666655321 1122333456667888888888888888777763 22222223 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcc Q lcl|NC_013059. 464 YQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) Q Consensus 464 ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~ 543 (725) .+.++ ...|.-. .+ ..++. +-++ .++.+ +--|.+.++.++.+++.+. T Consensus 390 ~~~il-------------~r~g~lP--~~--------------p~~~~-~~~~--v~~is-~Laraq~~~~l~~~~~~i~ 436 (522) T protein:vir:10 390 TLLVL-------------QRSNQIP--KL--------------PKDIV-RPTI--VAGVN-ALGRGQDRESLTAFVGTIA 436 (522) T ss_pred HHHHH-------------HhcCCCC--CC--------------Ccccc-cccc--ccchh-HHHHHHHHHHHHHHHHHHH Confidence 33322 1122100 00 01121 1111 12222 2236666777777776654 Q ss_pred ccc-chHHHHHHHhhccCCchhHHHHHHHHhhhhhh--hhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 544 QGT-PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ--MGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) Q Consensus 544 ~~~-p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~--~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qa 620 (725) ... |. ..++..|++ +++..+...... ..+.. ++++..++.++.++++++ . +.+..+ + T Consensus 437 ~~~~p~------~~~~~id~d---~~~~~~a~~~Gvp~~~ivr--t~eev~~~~q~~q~~~~~--~----~~~~~a---~ 496 (522) T protein:vir:10 437 QTLGPE------ALMQYLNPL---EAIKRLAAAQGIDVLNLVK--TEQQLAEEQQAAQQQAAQ--Q----SLVDQA---G 496 (522) T ss_pred HhhCch------hhhhcCCHH---HHHHHHHHHhCCChhhhcC--CHHHHHHHHHHHHHHHHH--H----HHHHHH---H Confidence 432 21 112333443 333332222211 11211 111111111111100000 0 000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 621 ELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFRE 667 (725) Q Consensus 621 e~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e 667 (725) .+ +.+- .....+... ...+.++.+.+ T Consensus 497 ~~--~~~~-~~~~~~~~~------------------~~~~~~~~~~~ 522 (522) T protein:vir:10 497 QM--TGSP-LMDPTKNPQ------------------LMDEEQPPMEE 522 (522) T ss_pred HH--hccc-ccCccccHH------------------HHHHhCCCCCC Confidence 00 0000 000000000 00000000000 No 96 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.48 E-value=3.4e-12 Score=83.47 Aligned_cols=500 Identities=9% Similarity=-0.003 Sum_probs=206.2 Q ss_pred CCc-----HHHHHHHHHHHHHHHHhh--hHHHHHHHHHHHHhhcCCCCCHHHHH-----------HHhhcC----CCccc Q lcl|NC_013059. 1 MAD-----NKNRLESILSRFDADWTA--SDEARREAKNDLFFSRVSQWDDWLSQ-----------YTTLQY----RGQFD 58 (725) Q Consensus 1 mad-----~~~~~~~~~~~~~~~~~~--~~~~r~~a~~d~~f~~G~QW~~~~~~-----------~l~~~g----rp~~N 58 (725) |-. ..+.+..+ |..++.. .+.-|..+.+-.+||.|++ ..+. ..+... |.++| T Consensus 1 ~~~~~~~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~YY~g~h---~Il~r~~~~~~~~~~~~~d~~~~nnki~~n 74 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGL---LNTEITTYMASNHIKWAHIGENYYNQEN---DIEKSRIFYMNDKGQLREDNYASNVKISHG 74 (537) T ss_pred CCcccccccHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcccc---hhhhcccccccccccccccccccccccccc Confidence 332 12233332 3332221 1233566777899999986 1111 112223 34679 Q ss_pred chHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCC Q lcl|NC_013059. 59 VVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) Q Consensus 59 ~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~ 138 (725) ..+.+|+..+|+...+.+.+. +.+.++.+.-+.|+ .+.+ ++++........++.++|.+|.-+.++ ++ T Consensus 75 f~k~Ivd~~~~yl~G~Pv~~~--~~d~~~~e~~~~l~----~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~d---e~-- 142 (537) T protein:vir:78 75 FFTELVDQLAQYLLSNGVEVK--VKDEDNTQLDEILQ----EYFD-EDFQATIDTLVTNASKKGFEGIFARTT---SE-- 142 (537) T ss_pred hHHHHHHHHhhhhcccCceee--cCcchhHHHHHHHH----HHhh-ccHHHHHHHHHHHHhhcCeeEEEeeec---CC-- Confidence 999999999999999877654 44444444444444 3333 677888888999999999999877544 22 Q ss_pred CCceeEEEEeeecchhh--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCe Q lcl|NC_013059. 139 SNNQVIRREPIHSACSH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDT 216 (725) Q Consensus 139 ~~~~~ir~~~~~~~~~~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (725) + .+++.+ .++.+ ++||... + ...+++. |.+... . ....+.+. T Consensus 143 ~-~~~~~~----i~p~~~~pv~d~~~-~-----~~~~~~~-y~~~~~------~------------------~~~~~~~~ 186 (537) T protein:vir:78 143 G-KLKFQT----VDGLTLIPVFDDYG-V-----LKMIIRW-YSEIRY------S------------------TKQQSTET 186 (537) T ss_pred C-ceEEEE----EccceeEEEEcCCC-C-----ceeEEEE-Eeeeec------c------------------ccccCcce Confidence 1 233322 12232 3455421 1 1112222 211100 0 00001233 Q ss_pred EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEE-----EeeccccccCCC Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS-----IITCTAVLKDKQ 291 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~-----~~~g~~~l~~~~ 291 (725) +..+|+|....+. .+...+...+....+.. .. ...+ ...+++. ...+........ T Consensus 187 ~~~~evyt~~~i~--~y~~~~~~~~~~~~~~~--------------~~--~~~~--i~~~~~~~~~~~~~~~~~~~~~~~ 246 (537) T protein:vir:78 187 IWHADVWNEEAVC--YYIQDDEGVSTTYKLDE--------------AY--NPNP--APHVLAIEESTDADFEDTDGYQVL 246 (537) T ss_pred EEEEEEEcCCcEE--EEEecCCcccccccccc--------------cc--cccc--cceeeecccccccccccccccccc Confidence 4455666554332 22222211000000000 00 0000 0001110 011122222333 Q ss_pred CCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccc Q lcl|NC_013059. 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPY 371 (725) Q Consensus 292 ~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 371 (725) |.+++.+|+|+|.. + ..+.|.+.++++.++.+|...|.+...+...+...+++.-...+...+.......... T Consensus 247 ~~~~g~iPvv~f~n------n-~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~ 319 (537) T protein:vir:78 247 GRSYSKFPFQLLYN------N-KDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKM 319 (537) T ss_pred ccCCcceeEEEecc------C-ccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCc Confidence 44445555555422 1 1356888999999999999999999999888776554432112221122111111110 Q ss_pred cccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 372 YLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 372 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) + .. .|. .+.+.++..+.-..+....++...+.|-..+.+-+....-.| ..||+|+..+-..........-. T Consensus 320 i-----~v-~~d--~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~g-n~SGvAlk~~~~~l~~ka~~ke~ 390 (537) T protein:vir:78 320 I-----GV-NGD--NAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDG-NVTNVVIKSRYTLLAMKARKMET 390 (537) T ss_pred e-----ee-cCC--CCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccccc-CCcHHHHHHHHhhHHHHHHHHHH Confidence 1 01 110 122445544444456666788888888888744333332223 46999998876665555555455 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~ 531 (725) -|..+++++.++++.++... +.. . + |+ .+|.|.-.+..|.-..+. T Consensus 391 ~f~~~l~~~~~~i~~~~~~~----------~~~-~--~-------------------d~---~~i~i~f~~~~P~n~~e~ 435 (537) T protein:vir:78 391 SLRKVLRWCADMVVSDIALR----------GLG-E--Y-------------------DS---NDICFEIEPHVLANELDI 435 (537) T ss_pred HHHHHHHHHHHHHHHHHhhc----------CCc-c--c-------------------cc---ceeeEEeccCCCCCHHHH Confidence 55556555555555443211 100 0 0 01 244445555555433233 Q ss_pred HHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_013059. 532 RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQA 611 (725) Q Consensus 532 ~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~ 611 (725) .+.++.+.+ .+ ..+ ...++..++..+-+..+++.+ .+. .....+..+...+++. +... .....+. T Consensus 436 a~~~~~l~~-~g-iiS--~eT~l~~~p~vdd~e~ek~~~---ee~------~~~~~~~~~~~~~~~~-~~~~-~~~~~~~ 500 (537) T protein:vir:78 436 ATTRKTEAE-TE-ALK--IGNIMTVAPRIGDDETLKLIA---EEL------DLDYNELKDALAEQDA-QSLD-VSPDVQA 500 (537) T ss_pred HHHHHHHHh-cC-cch--HHHHHHhCCCCCCHHHHHHHH---HHH------Hhhhhhhhhhhhhhcc-cccC-cCcchhh Confidence 333333221 11 111 122222233222221111111 000 0000000000000000 0000 0000000 Q ss_pred HHHHHHHHH-----HHHHHHHH---HHHHHHHHHHHHH Q lcl|NC_013059. 612 QGVLLQGQA-----ELAKAQNQ---TLSLQIDAAKVEA 641 (725) Q Consensus 612 qa~~~k~qa-----e~~kaqae---~~k~q~ea~~~q~ 641 (725) ...-...+. .-..+..+ .-..-- ..--++ T Consensus 501 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~-~~~~~~ 537 (537) T protein:vir:78 501 MLDGLPVNANQPPVDPNQPVADPNVVPPTDP-NAVPQT 537 (537) T ss_pred hcCCCCCCCCCCCCCccCCCCCCCCCCCCCC-ccCCCC Confidence 000000000 00000000 000000 000000 No 97 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.47 E-value=3.7e-12 Score=83.28 Aligned_cols=450 Identities=12% Similarity=0.025 Sum_probs=196.2 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH----HHHHHHhhcCCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDD----WLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~----~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr 75 (725) |-+. ...+.+|+..+... +..-.+-.+||+|+|.-. .....++ .-+.+.|..+-+|+.+.....-+ T Consensus 18 l~~~e~~~i~~L~~~~~~~-------~~r~~~l~~YY~G~~~i~~~~~~~p~~~~-~~~~v~n~~~~iVd~~a~rl~~~- 88 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDR-------TPRNLLRASFYDGKYAIRQIGNLIPPEYL-RTATVLGWSAKAVDTLARRCNLE- 88 (504) T ss_pred CCHHHHHHHHHHHHHHHHH-------hHHHHHHHHHHhccccchhccccccHHHH-HHhhccCcHHHHHHHHHhhhccc- Confidence 6654 45666666655443 222334468999988532 1122222 11356799888888876532211 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCc-eeEEEEeeecchh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNN-QVIRREPIHSACS 154 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~-~~ir~~~~~~~~~ 154 (725) --+.| ++.+.. ..+..+++.|+++...+.+..++++.|.+|+-|.-+ ++ +.+ ..|+. .++. T Consensus 89 --Gf~~~---d~~~~~----~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~---~d--~~~~~~I~~----~sP~ 150 (504) T protein:vir:99 89 --SFVWP---DGDYGS----IGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEG---GA--GEPDSLIHV----KSAM 150 (504) T ss_pred --eeeCC---CCChhh----HHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecC---CC--CCceeEEEE----eccc Confidence 11222 222222 235567899999999999999999999999766422 22 222 22332 2333 Q ss_pred h--eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEE Q lcl|NC_013059. 155 H--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) Q Consensus 155 ~--v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~ 232 (725) + ++|||....+. +..+.+ . .+. .......++|+.... + T Consensus 151 ~~~~iyD~~~~~~~------~a~~~~-~-----------------------------~d~-~g~~~~~~~y~~~~~---~ 190 (504) T protein:vir:99 151 QATGEWNSRRNAMD------SLLSIT-S-----------------------------RDA-EGHPTGIALYEDGVT---V 190 (504) T ss_pred eeEEEEeCCCCcee------EEEEEE-E-----------------------------ecC-CCeEEEEEEEcCCcE---E Confidence 2 46887644322 111111 0 000 111222334432110 0 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCC Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVED 312 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~ 312 (725) ....+ .+| .+..+..|.+++ +|+|||...+ . T Consensus 191 ~~~~~-~~~-------------------------------------------~~~~~~~~~~~g-vPvV~~~n~~----~ 221 (504) T protein:vir:99 191 TADMD-DDG-------------------------------------------DWHADVRTHKLG-VPVEVLPYKP----R 221 (504) T ss_pred EEEEc-CCc-------------------------------------------eeeeccccCCCC-cceEEecccc----c Confidence 01011 000 011233445554 7888885432 2 Q ss_pred ccccch---hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chH-------HHHHHhhccccccccccccccC Q lcl|NC_013059. 313 KEVYEG---VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGF-------EHMYDGNDDYPYYLLNRTDENN 381 (725) Q Consensus 313 ~~~~~G---~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~-------~~~~~~~~~~~~~~~~~~~~~~ 381 (725) ...+|| +.+.+++.++.+|+.++.++...-..+.....+ -|.. +.+ ...|.....+ +...-.... T Consensus 222 ~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i-~G~~~~~~~~~d~~~~~~~~~~~~~---i~~~~~~~~ 297 (504) T protein:vir:99 222 EDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLIL-LGADAKNFRNKDGSMKPAWQIALAR---VFALPDDED 297 (504) T ss_pred CccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-ccCCccccccccccccchhhhhhhh---hhcCCCccc Confidence 233566 445889999999999998775554433322111 1110 000 0011110000 000000001 Q ss_pred ccc---cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc--chhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 382 GEM---PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG--GQVAYDTVNQLNMRADLETYVFQDNLATA 456 (725) Q Consensus 382 g~~---~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~--n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~ 456 (725) +.+ ....+..++...+ ..+...+......|-.+||+.+..+|..+ |..||.||......-.......-+-|..+ T Consensus 298 ~~~~~~~~~~~~q~~~~~l-~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~ 376 (504) T protein:vir:99 298 EPDAARARADVKQFPASSP-QPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPA 376 (504) T ss_pred cccccCccceeeecCCCCh-HHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1112233333333 34555666666666667999999999654 56799999887766666666677777777 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEec-cCchhHHHHHHHHH Q lcl|NC_013059. 457 MRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVG-PSFQSMKQQNRAEI 535 (725) Q Consensus 457 ~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~-p~~~t~r~~~~~~l 535 (725) .+++.++.+.+...+- .. ...+ +++.|.=. |.++| ..+..+.+ T Consensus 377 l~~~~rla~~~~~~~~-~~----------~~~~------------------------~~~~v~w~d~~~~s-~a~~aDa~ 420 (504) T protein:vir:99 377 FRRSMIRALAIKNGLD-RI----------PPEW------------------------KTIDSKFRSPLYLS-KAAQADAG 420 (504) T ss_pred HHHHHHHHHHHhcCCC-cc----------cccc------------------------ccceeEecCCCccC-HHHHHHHH Confidence 7777777665433210 00 0000 11111111 33344 34455666 Q ss_pred HHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhh---hhhhhccc-------hhhhHHHHHHHHHH--Hhh Q lcl|NC_013059. 536 LELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ---MGVKKPET-------PEEQQWFVEAQQAK--QGQ 603 (725) Q Consensus 536 ~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~---~~~~~~~~-------~e~~~~~~q~~q~q--q~q 603 (725) ..|.++.+...+. ...++..+. .+.+.++++.+..+++... ..+..... .+....- +..... ..- T Consensus 421 ~Kl~~ag~~l~~~-~~~l~~~lg-~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-e~a~~~~~~~~ 497 (504) T protein:vir:99 421 AKMLGAGPEWLKE-TEVGLELLG-LTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAG-EPPANEPPAAL 497 (504) T ss_pred HHHHhhccccccc-hHHHHhhcC-CCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCC-CCCCCCCCccC Confidence 6665543221111 112222221 1222222222211111000 00000000 0000000 000000 000 Q ss_pred HHHHHHHHHH Q lcl|NC_013059. 604 QDPAMVQAQG 613 (725) Q Consensus 604 ~q~~~~~~qa 613 (725) ..+.+. . T Consensus 498 ~~p~~~---~ 504 (504) T protein:vir:99 498 GRPTLV---G 504 (504) T ss_pred CCcccC---C Confidence 000000 0 No 98 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.46 E-value=3.8e-12 Score=83.20 Aligned_cols=465 Identities=8% Similarity=-0.019 Sum_probs=192.9 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhc-----CCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQ-----YRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~-----grp~~N~i~~~v~~v~g~~~~nr 75 (725) +-+...+.. +...+-..+... +..-.+=.+||.|+|............ .+.+.|..+-+|+..++... T Consensus 22 ~~~~~~~~~-l~~~l~~~~~~~---~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~--- 94 (501) T protein:vir:25 22 SMSREQLGA-LVADMWRLHISE---RQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLS--- 94 (501) T ss_pred cCChHHHHH-HHHHHHHHHHHH---HHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhc--- Confidence 222222222 222222222221 122233358999998743322211111 23456888999998887542 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) ++--..|.+..+ ..+.-+++.|+++...+.++.++++.|.||+-|+.+ ++ + + .|+.. ++.+ T Consensus 95 ~~gf~~~d~~~~--------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d---e~--~-~-~i~~~----sp~~ 155 (501) T protein:vir:25 95 VVGYRNALAKEN--------DPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPT---DE--G-P-VFRTR----SPRQ 155 (501) T ss_pred ccceecCCccch--------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecC---CC--C-C-eEEEe----cccc Confidence 221112222111 123456789999999999999999999999876532 32 2 2 23321 2333 Q ss_pred e--ee-CCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEE Q lcl|NC_013059. 156 V--IW-DSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) Q Consensus 156 v--~~-Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~ 232 (725) + +| ||.....= . +.++.|.... +.+.....++|... .+ T Consensus 156 ~~~iy~D~~~~~~~----~-~ai~~~~~~~------------------------------~~~~~~~~~~y~~~----~~ 196 (501) T protein:vir:25 156 ILAVYADPSVDAWP----Q-YALETWVAQK------------------------------DAKPHRRGVLYDDT----YM 196 (501) T ss_pred EEEEEecCCCCcce----e-EEEEEEeecc------------------------------ccCcceeEEEecCe----eE Confidence 3 34 56544311 1 1122222110 00111122222111 11 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEE------eeccccccCCCCCCCCccceEEEEee Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI------ITCTAVLKDKQLIAGEHIPIVPVFGE 306 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~------~~g~~~l~~~~~~p~~~~p~vP~~g~ 306 (725) +.+... +.+ .... ....|.. ..++..-++..|-+++.+|+|||.-. T Consensus 197 ~~~~~~--~~~-~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~ 248 (501) T protein:vir:25 197 YELDLG--EVV-LGDA-------------------------GGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNG 248 (501) T ss_pred EEEecC--cee-eeec-------------------------cccccccccccccccccccccccccCCccceeeEeccCc Confidence 111110 000 0000 0000000 00111111222333345566655332 Q ss_pred eeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccc Q lcl|NC_013059. 307 WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT 386 (725) Q Consensus 307 ~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 386 (725) .+..+.+.|-+..+++.++.+|+.+|.+.......+..... ..|........+... ... .+...++. T Consensus 249 ---~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~-i~G~~~~~~~~~~~~---~~~---i~~~~~~~--- 315 (501) T protein:vir:25 249 ---RDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRV-ISGWTGSKAEVLKAS---ALR---VWTFEDPE--- 315 (501) T ss_pred ---cccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHH-HhCCCCCccchhhhc---ccc---eeccCCCC--- Confidence 23233456778899999999999999887665544433211 112111111111111 101 11111111 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..+..++... ...+...+......|-.+|++.+.++|..++..||.|+......-.........-|..+.+++.++++ T Consensus 316 ~~~~q~~~~~-~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~- 393 (501) T protein:vir:25 316 VKAQAFPPAS-VEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAA- 393 (501) T ss_pred ceEEEecccC-hHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 1222232222 24455667777777777889999999866555799999887776666666666666777766666544 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) .+.+.. .... .+++.|.=.+..+....+..+.+..|.+. + . T Consensus 394 ---~~~~~~---------~~~~------------------------~~~i~v~w~~~~~~s~~~~ada~~kl~~~-g--i 434 (501) T protein:vir:25 394 ---EMDDDP---------DTAA------------------------DSGAEVLWRDTEARSFGAVVDGITKLASA-G--I 434 (501) T ss_pred ---HHhCCC---------cccc------------------------ceeeeEEecCCCCCCHHHHHHHHHHHHhc-C--C Confidence 333211 0000 13344443333332234556666666543 1 1 Q ss_pred chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAEL-AKA 625 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~-~ka 625 (725) |. ..++..++..+-+.++++.+....+.. .... ......+... ......+. ... T Consensus 435 s~--et~~~~~~g~~~~~ie~~~~~~~e~~~-~~~~------------~~~~~~~~~~----------~~~~~~~~~~~~ 489 (501) T protein:vir:25 435 PI--EHLLSMVPGMTQQTIQAIKDSLRGGEV-KSLV------------DKLLSNEPAP----------VPPPPPQAAAQA 489 (501) T ss_pred CH--HHHHHHcCCCCHHHHHHHHHHHHHHhH-HHHH------------HHhhccCcCC----------CCCCCCCCCccc Confidence 21 222222333333333332221111100 0000 0000000000 00000000 000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_013059. 626 QNQTLSLQIDAAKVEA 641 (725) Q Consensus 626 qae~~k~q~ea~~~q~ 641 (725) ..+ ..+ .-...+ T Consensus 490 ~~~-~~~---~~~~g~ 501 (501) T protein:vir:25 490 LNE-GGV---NGNGGA 501 (501) T ss_pred ccc-ccC---CCCCCC Confidence 000 000 000000 No 99 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.45 E-value=4.8e-12 Score=82.66 Aligned_cols=437 Identities=11% Similarity=-0.035 Sum_probs=187.4 Q ss_pred CCcH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCC----HHHHHHHhh-cCCCcccchHHHHHHHHHHHhh Q lcl|NC_013059. 1 MADN--KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD----DWLSQYTTL-QYRGQFDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 mad~--~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~----~~~~~~l~~-~grp~~N~i~~~v~~v~g~~~~ 73 (725) |... .+.+.++...+... +....+=.+||+|+|.- ......++. ..+.+.|..+-+|+..+|+... T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-------~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 7633 34555554443321 22334457899999842 222222222 2345679999999999999876 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecch Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~ 153 (725) +...+ +.+ +|.+..+. +.-+++.|+++...+.+..++++.|.+|.-|.. +++ | ...++.. ++ T Consensus 74 ~~~~~---~~~-~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~---d~~--g-~~~i~~~----~p 135 (456) T protein:vir:10 74 NGITV---GGS-ADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD--G-TATITAD----SP 135 (456) T ss_pred CCeec---CCC-CCcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEee---CCC--C-ceEEEEE----cc Confidence 64432 221 22222222 344567899999999999999999999865542 222 2 2222221 22 Q ss_pred h--heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEeccee- Q lcl|NC_013059. 154 S--HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE- 230 (725) Q Consensus 154 ~--~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~- 230 (725) . .++|||..... ..++ ++.|-+.+ .......+|+-...... T Consensus 136 ~~~~~i~d~~~~~~----~~~~-i~~~~~~d-------------------------------~~~~~~~~~~~~~~~~~~ 179 (456) T protein:vir:10 136 ETMVVSVDPLQPWR----IRAA-MRWWRDLD-------------------------------AESDFAIVWSGDGWQKFA 179 (456) T ss_pred ceeEEEEcCCCCcc----eEEE-EEEEEecC-------------------------------CceeEEEEEeccceeEEE Confidence 2 26678765431 1111 12221110 00111111110000000 Q ss_pred -EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 231 -TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 231 -~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) .... .+..+.... ....|..+..+..+...+..|+||+. . T Consensus 180 ~~~~~-~~~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~~pvv~~~----N 220 (456) T protein:vir:10 180 RPCFV-QSSSRRRLV----------------------------------TRISDSWVPVGDAVVTGSPPPVVVYQ----N 220 (456) T ss_pred EEEEE-eecccceee----------------------------------eecCCceeeccccCCCCCceeEEEec----C Confidence 0000 000010000 00111111112222222334444431 1 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCc--ceeechhhcchHHHHHHhhccccccccccccccCccc--- Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK--KPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM--- 384 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~--- 384 (725) .+ +-|-+..+++.++.+|+..|.++......+.. ...+........ +.... +.-..+......|.+ T Consensus 221 ~~----g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~-d~~g~----~~~~~~~~~~~~~~~~~~ 291 (456) T protein:vir:10 221 PD----GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNV-DENGN----AIDYASIFEAAPGALWEL 291 (456) T ss_pred CC----CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccc-ccccc----ccchhhhhhhhccccccC Confidence 12 34668889999999999999876544333221 111110000000 00000 000000001111111 Q ss_pred -cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 -PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) Q Consensus 385 -~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ 463 (725) +...+..++.. ....+...+......|-.+||+.+..+|..++..||+||......-.......-..|..+++++.++ T Consensus 292 ~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl 370 (456) T protein:vir:10 292 PPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) T ss_pred CCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11223333322 2345666777777777788899999998765557999999887776666666777777787777776 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccC-chhHHHHHHHHHHHHHHhc Q lcl|NC_013059. 464 YQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS-FQSMKQQNRAEILELLGKT 542 (725) Q Consensus 464 ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~-~~t~r~~~~~~l~ell~~~ 542 (725) ++.+ .+.. .. +++.|.=.+. .++ ..+..+.++.|.++ T Consensus 371 ~~~~----~g~~------------~~------------------------~~~~v~w~~~~~~~-~~~~ada~~kl~~~- 408 (456) T protein:vir:10 371 ALQI----EGES------------VE------------------------DTVDVSFESPDRVT-LGEKYSAASLAKAA- 408 (456) T ss_pred HHHh----cCCC------------cc------------------------cceeEEecCCCCcC-HHHHHHHHHHHHHc- Confidence 6532 1110 00 1111111111 222 23344555554432 Q ss_pred ccccchHHHHHHHhhccCCchhHH-HHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 543 PQGTPEYQLLLLQYFTLLDGKGVE-MMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 543 ~~~~p~~~~~~~~~~~~~d~~~~~-~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) +.|-.. .....+ ..+-+.++ .-.++++.+..... ....+..+. + T Consensus 409 --gi~~~~-~~~~~l-g~~~~~i~~~e~er~~~e~~~~~---------------~~~~~~~~~----------------~ 453 (456) T protein:vir:10 409 --GESWAS-IRRNIL-NYNADQIKQDDLDRAREQITLFA---------------GNPVQRPQE----------------D 453 (456) T ss_pred --CCChHH-HHHhhC-CCCHHHHHHHHHHHHHHHHHHHh---------------hhhhhcCCC----------------C Confidence 112111 111111 11111111 01111111100000 000000000 0 Q ss_pred HHH Q lcl|NC_013059. 622 LAK 624 (725) Q Consensus 622 ~~k 624 (725) ..+ T Consensus 454 ~~~ 456 (456) T protein:vir:10 454 GSR 456 (456) T ss_pred CCC Confidence 000 No 100 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.45 E-value=4.8e-12 Score=82.66 Aligned_cols=437 Identities=11% Similarity=-0.035 Sum_probs=187.4 Q ss_pred CCcH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCC----HHHHHHHhh-cCCCcccchHHHHHHHHHHHhh Q lcl|NC_013059. 1 MADN--KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD----DWLSQYTTL-QYRGQFDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 mad~--~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~----~~~~~~l~~-~grp~~N~i~~~v~~v~g~~~~ 73 (725) |... .+.+.++...+... +....+=.+||+|+|.- ......++. ..+.+.|..+-+|+..+|+... T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-------~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 7633 34555554443321 22334457899999842 222222222 2345679999999999999876 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecch Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~ 153 (725) +...+ +.+ +|.+..+. +.-+++.|+++...+.+..++++.|.+|.-|.. +++ | ...++.. ++ T Consensus 74 ~~~~~---~~~-~d~~~~~~----~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~---d~~--g-~~~i~~~----~p 135 (456) T protein:vir:10 74 NGITV---GGS-ADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD--G-TATITAD----SP 135 (456) T ss_pred CCeec---CCC-CCcchHHH----HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEee---CCC--C-ceEEEEE----cc Confidence 64432 221 22222222 344567899999999999999999999865542 222 2 2222221 22 Q ss_pred h--heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEeccee- Q lcl|NC_013059. 154 S--HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKE- 230 (725) Q Consensus 154 ~--~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~- 230 (725) . .++|||..... ..++ ++.|-+.+ .......+|+-...... T Consensus 136 ~~~~~i~d~~~~~~----~~~~-i~~~~~~d-------------------------------~~~~~~~~~~~~~~~~~~ 179 (456) T protein:vir:10 136 ETMVVSVDPLQPWR----IRAA-MRWWRDLD-------------------------------AESDFAIVWSGDGWQKFA 179 (456) T ss_pred ceeEEEEcCCCCcc----eEEE-EEEEEecC-------------------------------CceeEEEEEeccceeEEE Confidence 2 26678765431 1111 12221110 00111111110000000 Q ss_pred -EEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeec Q lcl|NC_013059. 231 -TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGF 309 (725) Q Consensus 231 -~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~ 309 (725) .... .+..+.... ....|..+..+..+...+..|+||+. . T Consensus 180 ~~~~~-~~~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~~pvv~~~----N 220 (456) T protein:vir:10 180 RPCFV-QSSSRRRLV----------------------------------TRISDSWVPVGDAVVTGSPPPVVVYQ----N 220 (456) T ss_pred EEEEE-eecccceee----------------------------------eecCCceeeccccCCCCCceeEEEec----C Confidence 0000 000010000 00111111112222222334444431 1 Q ss_pred cCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCc--ceeechhhcchHHHHHHhhccccccccccccccCccc--- Q lcl|NC_013059. 310 VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK--KPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM--- 384 (725) Q Consensus 310 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~--- 384 (725) .+ +-|-+..+++.++.+|+..|.++......+.. ...+........ +.... +.-..+......|.+ T Consensus 221 ~~----g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~-d~~g~----~~~~~~~~~~~~~~~~~~ 291 (456) T protein:vir:10 221 PD----GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNV-DENGN----AIDYASIFEAAPGALWEL 291 (456) T ss_pred CC----CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccc-ccccc----ccchhhhhhhhccccccC Confidence 12 34668889999999999999876544333221 111110000000 00000 000000001111111 Q ss_pred -cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 385 -PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) Q Consensus 385 -~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ 463 (725) +...+..++.. ....+...+......|-.+||+.+..+|..++..||+||......-.......-..|..+++++.++ T Consensus 292 ~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl 370 (456) T protein:vir:10 292 PPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) T ss_pred CCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11223333322 2345666777777777788899999998765557999999887776666666777777787777776 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccC-chhHHHHHHHHHHHHHHhc Q lcl|NC_013059. 464 YQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS-FQSMKQQNRAEILELLGKT 542 (725) Q Consensus 464 ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~-~~t~r~~~~~~l~ell~~~ 542 (725) ++.+ .+.. .. +++.|.=.+. .++ ..+..+.++.|.++ T Consensus 371 ~~~~----~g~~------------~~------------------------~~~~v~w~~~~~~~-~~~~ada~~kl~~~- 408 (456) T protein:vir:10 371 ALQI----EGES------------VE------------------------DTVDVSFESPDRVT-LGEKYSAASLAKAA- 408 (456) T ss_pred HHHh----cCCC------------cc------------------------cceeEEecCCCCcC-HHHHHHHHHHHHHc- Confidence 6532 1110 00 1111111111 222 23344555554432 Q ss_pred ccccchHHHHHHHhhccCCchhHH-HHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 543 PQGTPEYQLLLLQYFTLLDGKGVE-MMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 543 ~~~~p~~~~~~~~~~~~~d~~~~~-~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) +.|-.. .....+ ..+-+.++ .-.++++.+..... ....+..+. + T Consensus 409 --gi~~~~-~~~~~l-g~~~~~i~~~e~er~~~e~~~~~---------------~~~~~~~~~----------------~ 453 (456) T protein:vir:10 409 --GESWAS-IRRNIL-NYNADQIKQDDLDRAREQITLFA---------------GNPVQRPQE----------------D 453 (456) T ss_pred --CCChHH-HHHhhC-CCCHHHHHHHHHHHHHHHHHHHh---------------hhhhhcCCC----------------C Confidence 112111 111111 11111111 01111111100000 000000000 0 Q ss_pred HHH Q lcl|NC_013059. 622 LAK 624 (725) Q Consensus 622 ~~k 624 (725) ..+ T Consensus 454 ~~~ 456 (456) T protein:vir:10 454 GSR 456 (456) T ss_pred CCC Confidence 000 No 101 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.42 E-value=2.3e-11 Score=78.92 Aligned_cols=442 Identities=10% Similarity=0.019 Sum_probs=199.4 Q ss_pred CCcH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC----CHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADN-KNRLESILSRFDADWTASDEARREAKNDLFFSRVSQW----DDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~-~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW----~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr 75 (725) |.|. ...+.+|+..+..-. + .-.+-.+||+|.|= +......++.. +.+.|..+-.|+.+.....-+ T Consensus 12 l~~~~~~~~~~L~~~~~~~~---~----~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~-~~v~nw~~~~Vd~~a~rl~~~- 82 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLR---W----KNLLRTSYYENKRTIQYVGTLIPPQYFNL-GLVLGWTGKAVDALARRCNLE- 82 (474) T ss_pred CChhHHHHHHHHHHHHHHHh---h----HHHHHHHHhccCCChhhccccccHHHHHH-HhhcChHHHHHHHHHhhhccc- Confidence 7765 445555555444431 1 22334589999753 22222222211 235688888888876522211 Q ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhh Q lcl|NC_013059. 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) Q Consensus 76 ~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~ 155 (725) -+ +.|.+..+. .-+.-+++.|+++...+.++.++++.|.+|+-|..+ ++..+ ...|+. .++.+ T Consensus 83 -Gf-~~~d~~~~~-------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~---~d~~~-~~~i~~----~sp~~ 145 (474) T protein:vir:81 83 -GF-VWPDGDLDS-------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVG---EDDEP-EALIHV----KDASE 145 (474) T ss_pred -ce-ECCCCCccc-------hHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecC---CCCCc-eeEEEE----eccce Confidence 12 234322111 114567899999999999999999999999877543 22111 122322 23332 Q ss_pred --eeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEE Q lcl|NC_013059. 156 --VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) Q Consensus 156 --v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~ 233 (725) ++|||..+.+. ..+.+...+ .+.+. +...+|... ..+. T Consensus 146 ~~~~~D~~~~~~~-----~al~~~~~~-------------------------------~~g~~-~~~~ly~~~---~~~~ 185 (474) T protein:vir:81 146 ATGEWNRRRRGLN-----NLLSIIDKD-------------------------------KEGKV-LSLALYLDN---ETVT 185 (474) T ss_pred EEEEEeCCCCcce-----eeeEEEEEc-------------------------------CCCcE-EEEEEEeCC---cEEE Confidence 45887643321 111111000 00111 111122110 0011 Q ss_pred EeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) ...+..++ .|. .+..+.|++ .|+|||+..+. . T Consensus 186 ~~~~~~~~-----------------------------------~w~--------~~~~~~~~g-vPvV~~~n~~~----~ 217 (474) T protein:vir:81 186 AQRDKATL-----------------------------------KWQ--------VDRDEHVYG-VPAQVLPYKPA----P 217 (474) T ss_pred EEEcCccc-----------------------------------eee--------eccCCCCCC-cceEEeccccc----c Confidence 11110000 011 123344444 68888865432 2 Q ss_pred cccch---hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhc-chHHHHH-Hhhccccccc---cccccccCcccc Q lcl|NC_013059. 314 EVYEG---VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMY-DGNDDYPYYL---LNRTDENNGEMP 385 (725) Q Consensus 314 ~~~~G---~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i-~~~~~~~-~~~~~~~~~~---~~~~~~~~g~~~ 385 (725) ..+|| +.+.+++.|+.+|+.++.++...-..+.....+. |.. +.+.+.- ...+.+.... .......+|.++ T Consensus 218 ~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~ 296 (474) T protein:vir:81 218 KRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL-GADESALKNADGTIKSVWEARLGRIKGLPDDADADIP 296 (474) T ss_pred cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-cCChhhcccccccccchhhhhHHHHhcCCCccccccc Confidence 23455 4468999999999999887765544444332221 110 0000000 0000010000 000011112111 Q ss_pred ---ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccC--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 ---TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVN--GGQVAYDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) Q Consensus 386 ---~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~--~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 460 (725) ...+..++...+ ..+...+......+-.+||+....+|.. .|..||.||.+....-........+-|..+.+++ T Consensus 297 ~~~~~~~~q~~~a~l-~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~ 375 (474) T protein:vir:81 297 QLARADVKQFPAASP-DAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKA 375 (474) T ss_pred ccccccccccCCCCh-hHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333443333 3455566666667777889999999953 5668999999877776666666777777888888 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEe-ccCchhHHHHHHHHHHHHH Q lcl|NC_013059. 461 GEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDV-GPSFQSMKQQNRAEILELL 539 (725) Q Consensus 461 g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~-~p~~~t~r~~~~~~l~ell 539 (725) +++.+.+--.+--+ ++..+ -+.+.|.= -|.++| ..+..+.+..+. T Consensus 376 ~rla~~i~~~~~~~----~~~~~-----------------------------~~~~~v~W~d~~~~s-~a~~aDa~~Kl~ 421 (474) T protein:vir:81 376 FIRALAMKNKVAID----EIPDE-----------------------------WKSIDAKWRDPRYLS-KSAQADAGMKQL 421 (474) T ss_pred HHHHHHHhCCCCcc----ccchh-----------------------------hccceeEecCCCccC-HHHHHHHHHHHH Confidence 87776654221100 00000 02222221 145555 345677777777 Q ss_pred HhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_013059. 540 GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQ 619 (725) Q Consensus 540 ~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~q 619 (725) ++.+...+.. ....++ ..+-..++.+....+++ +....-..+..... T Consensus 422 ~a~~~~~~~~--~~~~~l-g~t~~~i~~~~~~~~~~------------------------~~~~~~~~l~~~~~------ 468 (474) T protein:vir:81 422 AAVPWLAETE--VGLELI-GLTPQQARRAMADKRRV------------------------QGRGTLQALIDRSN------ 468 (474) T ss_pred hcccCCCcHH--HHHhhc-CCCHHHHHHHHHHHHHH------------------------hHHHHHHHHHhcCC------ Confidence 7654333211 111111 11111111111111000 00000000000000 Q ss_pred HHHHHHH Q lcl|NC_013059. 620 AELAKAQ 626 (725) Q Consensus 620 ae~~kaq 626 (725) +...+| T Consensus 469 -~~~~aq 474 (474) T protein:vir:81 469 -NGATAQ 474 (474) T ss_pred -CCCCCC Confidence 111111 No 102 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.41 E-value=9.2e-12 Score=81.08 Aligned_cols=441 Identities=10% Similarity=-0.040 Sum_probs=189.6 Q ss_pred CCc--HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHhhc-CCCcccchHHHHHHHHHHHhh Q lcl|NC_013059. 1 MAD--NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTLQ-YRGQFDVVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 mad--~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q----W~~~~~~~l~~~-grp~~N~i~~~v~~v~g~~~~ 73 (725) |-. ..+++.++...+... +....+-.+||.|++ ++......++.. .+.+.|..+-+|+..+|+... T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-------~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-------HHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhcc Confidence 542 334555555543322 223344578999976 211111112221 223569999999999998877 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEE-Eeeecc Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR-EPIHSA 152 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~-~~~~~~ 152 (725) +...+ ...+|.+..+.+ .-+++.|+++...+.+..+++++|.+|.-+.. +++ |.+ .++. .|.+ T Consensus 74 ~g~~~----~~~~d~~~~~~~----~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~---~ed--g~~-~i~~~~p~~-- 137 (456) T protein:vir:79 74 NGITV----GGSADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD--GTA-TITADSPET-- 137 (456) T ss_pred CCeec----CCCCCccHHHHH----HHHHHhcChhHHHHHHHHHHhhcCeeEEEEee---CCC--Cce-EEEEeccce-- Confidence 75332 222333333333 34566789999999999999999999876543 222 222 2222 2221 Q ss_pred hhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEE Q lcl|NC_013059. 153 CSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) Q Consensus 153 ~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~ 232 (725) ..++|||.....= . +..+.|-+.++ .+ .....|.....+....+|+.. T Consensus 138 -~~~i~d~~~~~~~----~-~~~~~~~~~d~----~~----------------~~~~~~~~~~~~~~~~~~~~~------ 185 (456) T protein:vir:79 138 -MVVSVDPLQPWRI----R-SAMRWWRDLDA----ES----------------DFAIVWSGDGWQKFARPCFVQ------ 185 (456) T ss_pred -eEEEEcCCCCCce----E-EEEEEEEecCC----ce----------------eEEEEEcCCceEEEEEEEEee------ Confidence 1256776554311 1 11222211100 00 000112222222222211111 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCC Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVED 312 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~ 312 (725) +..+... .....+.....+..+.+++.+|+|||- ..+ T Consensus 186 ----~~~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~~~pvv~~~----N~~- 222 (456) T protein:vir:79 186 ----SSSRRRL----------------------------------VTRISDSWVPVGDAVVTGSPPPVVVYQ----NPD- 222 (456) T ss_pred ----cccccee----------------------------------eeccCCceeecccccCCCCceeEEEec----CCC- Confidence 0000000 000001111111122333455666541 112 Q ss_pred ccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhh-cchH-HHHHHh----hccccccccccccccCccccc Q lcl|NC_013059. 313 KEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGF-EHMYDG----NDDYPYYLLNRTDENNGEMPT 386 (725) Q Consensus 313 ~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~-i~~~-~~~~~~----~~~~~~~~~~~~~~~~g~~~~ 386 (725) +-|-+..+++.++.+|+.+|.+.......+.....+ .|. .+.. .++... .+.+.......+..++ . T Consensus 223 ---~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~-~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~----~ 294 (456) T protein:vir:79 223 ---GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSSEHRLPKVDENGNAIDYASIFEAAPGALWELPP----G 294 (456) T ss_pred ---CCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHH-hcCCcccccccccccccchhhhhhhhccccccCCC----C Confidence 346788899999999999888765443332221111 110 0000 000000 0000000000011111 1 Q ss_pred cCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 387 QPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) Q Consensus 387 ~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~ 466 (725) ..+..++...+ ..+...+......|-..||+.+..+|..++..||+|+......-.......-..|..+++++.++++. T Consensus 295 ~~~~q~~~~~~-~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~ 373 (456) T protein:vir:79 295 VDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ 373 (456) T ss_pred cceeeecccCh-HHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12333333332 45667788888888888999999998766567999999887776666666667777777777666543 Q ss_pred HHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhccccc Q lcl|NC_013059. 467 IVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT 546 (725) Q Consensus 467 li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~ 546 (725) +.+.. + .. .++|+-.. |..++ ..+..+.++.+.++ +. T Consensus 374 ----~~g~~--------~----~~----------------------~i~v~w~~-~~~~s-~~~~ada~~kl~~~---G~ 410 (456) T protein:vir:79 374 ----IEGES--------V----ED----------------------TVDVSFES-PDRVT-LGEKYSAASLAKAA---GE 410 (456) T ss_pred ----hcCCC--------c----cc----------------------cceEEeCC-CCCcC-HHHHHHHHHHHHhc---CC Confidence 32210 0 00 01222111 22233 23344555554432 11 Q ss_pred chHHHHHHHhhccCCchhHHH-HHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 547 PEYQLLLLQYFTLLDGKGVEM-MRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) Q Consensus 547 p~~~~~~~~~~~~~d~~~~~~-i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~k 624 (725) |-.. ..+..+ ..+.+.++. -.++++.+ ... +. ... ....+.+.++ T Consensus 411 ~~~~-~~~~~l-g~~~~~i~~~e~~r~~~e-------------------------~~~----~~-~~~-~~~~~~~~~~ 456 (456) T protein:vir:79 411 SWAS-IRRNIL-NYNADQIKQDDLDRAREQ-------------------------ITL----FA-GNP-VQRPQEDGSR 456 (456) T ss_pred ChHH-HHHhcC-CCCHHHHHHHHHHHHHHH-------------------------HHH----Hh-hhH-hhcCCCCCCC Confidence 1111 111111 111111100 00011000 000 00 000 0000011111 No 103 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.39 E-value=3.8e-11 Score=77.71 Aligned_cols=513 Identities=12% Similarity=0.023 Sum_probs=234.7 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCC----CCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHhh-C Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVS----QWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMRQ-N 74 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~----QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~~-n 74 (725) .+.+.+...+.++.. ...|-..+.+..+|..-. -++.. . ....++--+.-...++.+.+ .-.. + T Consensus 1 mk~~a~~r~~~l~~~---R~~~e~~w~e~~~y~lP~~~~~~~~~~--~--~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 73 (542) T protein:vir:78 1 MKGLAQARYSAMRAD---REDFLDMARRCAALTLPYLLTEDGHAS--G--GRLQQPYQSLGSKGVNALSSKLMLSLFPIQ 73 (542) T ss_pred ChhHHHHHHHHHHHH---hhHHHHHHHHHHHHhccccCCCCCCcc--c--ccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 443444444444444 344545555566776421 12111 0 11123322334444444433 2222 5 Q ss_pred CcceEEecCCc--------chH---HHHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 75 PIDVLYRPKDG--------ASP---DAADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 75 r~~~~~~pr~~--------~d~---~~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) ++=+++.+.+. +++ ++...| +..+......|++..+...+|.+.+..|.+++ |.++++| T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----~~~~~~~-- 146 (542) T protein:vir:78 74 TSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV-----FAGKKTL-- 146 (542) T ss_pred CccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE-----EecCCCc-- Confidence 55666666542 121 222223 34444556688899999999999999999975 3344543 Q ss_pred ceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIA 220 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~ 220 (725) +..|+ .++++..++.- ...-+|+...||...+.+.|+.-... +.. ...... .....+.|+ T Consensus 147 ----~~~pl----~~y~v~~d~~G----~vd~v~r~~~~t~~ql~~~fg~~~l~--~~~-~~~~~~-----~~~~~~~v~ 206 (542) T protein:vir:78 147 ----KVYPL----DRYVIERDGDG----NVIEIITRELVDRSLLPAEFQKQSLL--EGK-DSNAVG-----EDGPKFGVA 206 (542) T ss_pred ----eEEec----ceeEEeeCCCC----CeEEEeeeeecCHHHHHHhhccccCc--hHH-Hhhccc-----cCCCeEEEE Confidence 34444 33555555422 11227888999999999988752211 100 000000 012344455 Q ss_pred EEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEe-eccccccCCCCCCCCccc Q lcl|NC_013059. 221 EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII-TCTAVLKDKQLIAGEHIP 299 (725) Q Consensus 221 E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~-~g~~~l~~~~~~p~~~~p 299 (725) +.++.+... +.++. .. +.... ..|+.- .|.++-...+.+++..+| T Consensus 207 ~~v~pr~~~-------~~~~~-----~~--------------------~~~~~--~s~~~e~~g~~v~~~~~e~g~~~~P 252 (542) T protein:vir:78 207 QGKGGRNDA-------EVFTC-----CK--------------------LVDGQ--HRWHQECDGKEIKGSRSSSPLKHSP 252 (542) T ss_pred EEeecccCC-------ccccc-----cc--------------------cCCCe--EEEEEEeccccccccccccccccCC Confidence 554443221 11111 00 01111 222322 234332223567889999 Q ss_pred eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccc Q lcl|NC_013059. 300 IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDE 379 (725) Q Consensus 300 ~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 379 (725) |+|+.... .+|..|+.|.+.++.+-.+.+|......+.....+.+.+++++++.+-...+... ..+ +..+.. T Consensus 253 ~i~~Rw~~--~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~---~~~---g~iv~g 324 (542) T protein:vir:78 253 WLPLRFNV--VDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLAR---AGT---GAIIQG 324 (542) T ss_pred ceeeeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhccc---CCC---ceeecC Confidence 99775543 5888999999999999999999999999999999999999987765432221111 111 111222 Q ss_pred cCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013059. 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLA-TAMR 458 (725) Q Consensus 380 ~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~-~~~~ 458 (725) ..+.+ .++. ...+.--......++...+.|.+..-+. .-.++..+++.=|..+.+.....|...+.+|. .... T Consensus 325 ~~~~v--~~~~-~~~~~~~~~~~~~i~~~~~rI~~aFl~~---~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~ 398 (542) T protein:vir:78 325 RAEDV--SVVQ-ANKGADFRTVQEMIRDLSQRISDAFLIL---NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLT 398 (542) T ss_pred Cccce--eeee-cccccchhHHHHHHHHHHHHHHHHhccc---ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 22222 1111 2222222445667777777887765332 12333445666688888888888888777775 3333 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 459 RDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 459 ~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) =+-+..++++.+ .|.-. .+ | .++ +++.+..+. ....|.+.++.+.++ T Consensus 399 Pli~R~~~il~r-------------~g~lP--~~------p--------~~l---v~~~~~s~L-a~~~r~~~~~~l~~~ 445 (542) T protein:vir:78 399 PYLNRKLHLMQR-------------SKQLP--SL------P--------KGL---VMPTVVAGL-GGVGRGEDRAALIEF 445 (542) T ss_pred HHHHHHHHHHHh-------------cCCCC--CC------c--------hhc---eeeeeechH-HHHHHHHHHHHHHHH Confidence 233333333332 11100 00 0 011 344444443 334577777788777 Q ss_pred HHhccccc-chHHHHHHHhhccCCchhHHHHHHHHhhhhhhh--hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGT-PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 539 l~~~~~~~-p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~--~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) ++.+.+.. |. . +....|+ ++++..+......+ .+.. ++++....+ +++++++.+..+. .++ T Consensus 446 ~~~i~~~~~p~---~---l~~~id~---d~~~~~~a~~~Gvp~~~i~~--s~e~~~~~~--~q~q~~~~~~al~-~~a-- 509 (542) T protein:vir:78 446 MQTVGQAMGPE---A---LQQFIDP---TEFLKRLAAASGIDTLNLVK--SPETMANEA--QQAQQQQMTASLM-GQA-- 509 (542) T ss_pred HHHHHHhcCCh---h---HHhcCCH---HHHHHHHHHHcCCCHhhccC--CHHHHHHHH--HHHHHHHHHHHHH-Hhh-- Confidence 77664432 21 1 1123333 33333333222221 1111 111111111 1111111110000 000 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQN-QTLSLQIDAAKVEAQNQLNAARIAEIFNNMDL 659 (725) Q Consensus 616 ~k~qae~~kaqa-e~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~ 659 (725) +.++.... .....+. .+. ++.+-+....-.++ T Consensus 510 ----~~~a~~~~~~~~~~~~-----~a~---~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 510 ----GQLAKSPIGEKMMQQI-----NAP---GQEAPAGPQTGEDL 542 (542) T ss_pred ----hhccccccccchhhhc-----CCC---CcCCCCCCcccccC Confidence 00000000 0000000 000 00000000000011 No 104 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.34 E-value=8e-11 Score=75.95 Aligned_cols=487 Identities=11% Similarity=0.008 Sum_probs=220.1 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHh-----hCCcce Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR-----QNPIDV 78 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~-----~nr~~~ 78 (725) .+.++.....++++. .|-..+.+..+|+.-.=..+.--.......+|.-..-...++.+.+... -+++=+ T Consensus 1 mk~~~~~~~~~lkr~-----~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:78 1 MKSTAAMLWEKLRDG-----SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHhcc-----chHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 566666666655432 2444445555666532221110001111123422333344444333222 244455 Q ss_pred EEecCCcc-------hHH---HHHHH---HHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEE Q lcl|NC_013059. 79 LYRPKDGA-------SPD---AADVL---MGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) Q Consensus 79 ~~~pr~~~-------d~~---~Ae~l---~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir 145 (725) ++.+.+.. +.+ +.+.| +..+......|++..+...+|.+.+..|.+++- ..+++. .++ T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~-----~~~~~~----~~~ 146 (510) T protein:vir:78 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY-----RNSDEA----TVV 146 (510) T ss_pred ccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE-----EeCCCC----eEE Confidence 55554321 122 22222 334444566889999999999999988887642 222211 234 Q ss_pred EEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 146 ~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) ..|+ .++++..++.= ..--++++..|+...+.+.|+....... . .-..++.|-|+.++++ T Consensus 147 ~~pl----~~y~v~~d~~G----~vd~i~rr~~~t~~~l~~~~~~~~~~~~---~---------~~~~~~~v~v~~~V~~ 206 (510) T protein:vir:78 147 AWSL----RSYAVRRDATG----RWMDIVLKQRYKSKDLDDVYKQDLMRAG---R---------NLSGSGSVDLYTHVQR 206 (510) T ss_pred EEEc----ceeEEeeCCCc----CeeEEEeeeeccHHHHHHHhhHHhhhhh---h---------ccCCCceEEEEEEEEe Confidence 4444 33555444321 1123788889999999988876211100 0 0011344556555544 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) ++.+ + ...+.||+. ..|.+++ ..+-||++.+||+|+-. T Consensus 207 ~~~~-----------~-----------------------------~~~~sv~~e-~dg~~i~-~~~~~~~~e~P~~~~Rw 244 (510) T protein:vir:78 207 RKGT-----------A-----------------------------MDYAEMYHE-IDGVRVG-ETGRWPIHLCPYIVPTW 244 (510) T ss_pred ecCC-----------C-----------------------------CcEEEEEEE-ecCeeec-cccccccccCCeeeeee Confidence 3210 0 011222222 2455554 34778889999998855 Q ss_pred eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 306 ~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) . -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.+.++.+-........ ..+..+ +|.. T Consensus 245 ~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~------~~g~~v---~g~~- 312 (510) T protein:vir:78 245 N--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA------EMGDYV---PGGA- 312 (510) T ss_pred e--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccC------CCceee---cCCc- Confidence 4 458989999999999999999999888888877888888888887654322111111 111111 1111 Q ss_pred ccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_013059. 386 TQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLAT-AMRRDGE 462 (725) Q Consensus 386 ~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~-~~~~~g~ 462 (725) +.++.++ ...--.....+++...+.|....=+ + ....++..+++.=|..+.+.....|...+.+|.. ...=+.+ T Consensus 313 -~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~ 389 (510) T protein:vir:78 313 -EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAY 389 (510) T ss_pred -ccccccccCcccchHHHHHHHHHHHHHHHHHHhh-c-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 1122222 1222234456677777777765411 1 1112233355666888887777778777777663 2222223 Q ss_pred HHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhc Q lcl|NC_013059. 463 IYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) Q Consensus 463 ~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~ 542 (725) ..+.++.. .|- .++-+. .+ +-.+ .++ -++--|.+....+..+++.+ T Consensus 390 r~~~il~r-------------~gl---~p~p~~-------------~~--~~~~--v~~-is~Laraq~~~~l~~~~q~l 435 (510) T protein:vir:78 390 VCLSEVDD-------------ALL---QGLITK-------------QH--KPAI--ETG-LPALSRSAAVQSMLNASQVI 435 (510) T ss_pred HHHHHHHh-------------ccC---CCCCcc-------------cc--ccee--eec-ccHHHHHHHHHHHHHHHHHH Confidence 33332221 110 000000 01 1111 112 22333445555555444444 Q ss_pred ccccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) Q Consensus 543 ~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qa 620 (725) ....+..+ + .+-.|+ ++++..+..... +..+.. ++++.+++.+++++++++++. .+++-+ ..-+ T Consensus 436 ~~~~~~~q--~---~~~id~---d~~~~~~a~~~Gv~p~~ivr--s~eev~a~~~~~~~q~~~~~~---~~~a~~-~~~~ 501 (510) T protein:vir:78 436 AGLAPIAQ--L---DPRISL---PKMMDTIWAAFSVDTSQFYK--SADELQAEAEEQRRQAAQAQA---AQETLL-EGAS 501 (510) T ss_pred HHhcChhh--h---hhcCCH---HHHHHHHHHHhCCChhhhcC--CHHHHHHHHHHHHHHHHHHHH---HHHHHH-Hhhh Confidence 33333211 1 112233 334443333322 222222 111111111111111000000 000000 0000 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 621 ELAKAQNQT 629 (725) Q Consensus 621 e~~kaqae~ 629 (725) ....+.+.+ T Consensus 502 ~~~~~~~g~ 510 (510) T protein:vir:78 502 DMTNALAGV 510 (510) T ss_pred hhcccCCCC Confidence 111111111 No 105 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.34 E-value=5.3e-11 Score=76.94 Aligned_cols=488 Identities=7% Similarity=-0.041 Sum_probs=221.5 Q ss_pred CCcHHHHHHHHHHHHH---------------HHH-hhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFD---------------ADW-TASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVV 64 (725) Q Consensus 1 mad~~~~~~~~~~~~~---------------~~~-~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v 64 (725) |-=-..+-+-++.||. +.+ +...+|+ .++|.+.+|..--...+. .-+...|+-+.++ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~w~~~~~~~~~-~~~~~~~l~~~i~ 73 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWS------KDSYLTSLWAQGYVPTVH-DKLMNSGTGNEIV 73 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhh------hhhhhhhhcccCCCCccc-cccccCChHHHHH Confidence 3311112222222221 110 1112222 234556677442111111 1112347778888 Q ss_pred HHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeE Q lcl|NC_013059. 65 RKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVI 144 (725) Q Consensus 65 ~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~i 144 (725) +.+...--.-.+.+.|...+..|. +.++..+..+.+.|++.....+..+.++..|.+|+++.++ ++ .+.| T Consensus 74 ~~~A~ll~~e~~~i~v~~~~~~d~---e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d---~~----~~~i 143 (518) T protein:vir:78 74 VVAAEYISGKPLSIDVTGVNGSKD---ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL---NG----RPSI 143 (518) T ss_pred HHHHHhhcCCCceEEecCccccCc---HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE---CC----eeEE Confidence 888888888888899976554442 3456677778888999999999999999999999998764 21 2334 Q ss_pred EEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhccccc-ccccCCCeEEEEEEE Q lcl|NC_013059. 145 RREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWV-FPWLTQDTIQIAEFY 223 (725) Q Consensus 145 r~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~vrv~E~w 223 (725) .. .+.+.+++.... -++. ..+|.......+ ..+.....+++....+. ..|... .-+|.-.. T Consensus 144 ~~----v~ad~~~P~~~~--g~~~--~~~f~~~~~~~~---------k~~~y~~lE~he~~~~~~~~~~~~-~~~I~n~l 205 (518) T protein:vir:78 144 SV----HSSSQFWIDFKN--NEPF--RFNFFEEIPTSN---------KADIYYLVESREIKQWDKEGKKLS-GGFVTYSV 205 (518) T ss_pred EE----EcCCeeEEEeec--CcEE--EEEEEEEeecCC---------cceeEEEEEeeccccccceeeccc-ceeEEEEE Confidence 43 244445543221 1222 333332111100 00001011111111100 001000 00111011 Q ss_pred EEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCC-CCccceEE Q lcl|NC_013059. 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA-GEHIPIVP 302 (725) Q Consensus 224 ~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p-~~~~p~vP 302 (725) |+. + .|..+....-..... + ..++....+. +...++ +...||++ T Consensus 206 y~~----------~--~~~~v~~~~~~~~~~---l-------------------~~~~~~~~~~-e~~~~~tg~~~~~~~ 250 (518) T protein:vir:78 206 IKI----------D--GDKTTPISAERLPEQ---I-------------------TSYLHTNDIQ-LNHSVSIGLKSMGAY 250 (518) T ss_pred eee----------c--Ccccccccccccccc---c-------------------ccccccccCc-cceeeccCCccceEE Confidence 110 1 011110000000000 0 0000000011 111111 12344444 Q ss_pred EEee---eeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc--ccc Q lcl|NC_013059. 303 VFGE---WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL--NRT 377 (725) Q Consensus 303 ~~g~---~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~ 377 (725) |+.. ..-..+++.+-+.+.+++|.++.+|...|.+.+.+-+ +..+..++++.+.....- ......+.... +.. T Consensus 251 ~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~-~~~~~~~~fd~~~~~y 328 (518) T protein:vir:78 251 LINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNK-STDKEEWSMNVDEDYF 328 (518) T ss_pred eeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCC-CCCccccccCCCCceE Confidence 4321 1111345556678999999999999999999999865 566666665544210000 00000000000 000 Q ss_pred cccCccc-----cccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 378 DENNGEM-----PTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 378 ~~~~g~~-----~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) ..-++.. ....++.+.+ .++ ..+...++.....+..-.|++...+|..+...||.+|.+..+..-........ T Consensus 329 ~~i~~~~~~~~~~~~~i~~~~~-~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~ 407 (518) T protein:vir:78 329 MQFKGTLDAGAKLNDMIQFMQG-DFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKR 407 (518) T ss_pred EEecCcCCCCCccccceeeeec-ccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHH Confidence 0000100 0112333332 333 45677888888899999999998888766667888998888777667677777 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~ 531 (725) .+..+++++-+.++.+..-++...... .....++|+|+-..+-+.-+++. T Consensus 408 ~~e~al~~l~~~i~~l~~~~~~~~~~~------------------------------~~~~~~~v~i~f~D~i~~D~~~~ 457 (518) T protein:vir:78 408 LIQNVYEQMLWDFLYLLTGGTNNKEKA------------------------------IMRDEIRVIIEFPDPMSVNLNEL 457 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcCccccc------------------------------cCCCceeEEEEeCCCCCCCHHHH Confidence 777777776666666554443211000 01124667777777766666666 Q ss_pred HHHHHHHHHhcccccchHHHHHHHhh-ccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHh Q lcl|NC_013059. 532 RAEILELLGKTPQGTPEYQLLLLQYF-TLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQG 602 (725) Q Consensus 532 ~~~l~ell~~~~~~~p~~~~~~~~~~-~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~ 602 (725) .+.++++..+ + .++.- ..+..+ +..+-..+++.++++...... ..+.++...--. .-.+. T Consensus 458 ~~~~~~~v~a-G-imS~e--~~i~~~~~~~~deea~~e~~ri~~E~~~---~~~~~p~~~~g~----~~~~g 518 (518) T protein:vir:78 458 SSTLNNMNSA-L-AMSVE--EKVKLIHPKWEDEEIQAEVKRIYLENAI---GEVPDPEAIGGM----ETKGG 518 (518) T ss_pred HHHHHHHHhc-C-CCCHH--HHHHHhCCCCCHHHHHHHHHHHHHHhcc---cCCCCCccccCC----CCCCC Confidence 7666665432 2 22211 111111 222323333334333322111 111000000000 00000 No 106 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.33 E-value=1e-10 Score=75.38 Aligned_cols=493 Identities=11% Similarity=0.002 Sum_probs=224.0 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHh-----hCC Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR-----QNP 75 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~-----~nr 75 (725) .|.++..+.+.++.++... ..|...+.+..+|..-.=.+...-... ..+|--..-...++.+.+... -++ T Consensus 5 ~~~e~~~l~~r~~~Lk~~R---~~~e~~w~e~~~~~lP~~~~~~~~~~~--~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 79 (517) T protein:vir:10 5 FAGNKSKIPKLYEQLVGKR---SPFLSRAENYSRFTLPYLMADVNDDLS--SQNAWQDDGASATNFLSNKLSQVLFPAQR 79 (517) T ss_pred ccccHHHHHHHHHHHHHhh---hHHHHHHHHHHHHhccccccCCCCCcc--ccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 4556655555555555444 444444555567765321111110000 113322333344444333222 245 Q ss_pred cceEEecCCcc-------hH---HHHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCce Q lcl|NC_013059. 76 IDVLYRPKDGA-------SP---DAADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQ 142 (725) Q Consensus 76 ~~~~~~pr~~~-------d~---~~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~ 142 (725) +=+++.+.+.. .. ++.+. .+..+......|++..+...+|.+.+..|.++. |..++ +. T Consensus 80 ~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----y~~~~--~~-- 150 (517) T protein:vir:10 80 SFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMM-----YHPDK--TS-- 150 (517) T ss_pred ccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE-----EEeCC--CC-- Confidence 55566654321 11 12222 234444556788999999999999999998864 22222 11 Q ss_pred eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 143 VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 143 ~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) .++..|+ .++++..++.- ..--++++.+|+...+.+.|+.... +.... ..+..++.|.|..+ T Consensus 151 ~~~~~pl----~~y~v~~d~~G----~v~~ivrr~~~~~~~l~~~~~~~~~------~~~~~----~~~~~~~~v~v~~~ 212 (517) T protein:vir:10 151 PIQAVPL----HHYCVRRDNNG----TVLDIVFLQEKALETFEPSIRMAIQ------ASRKG----KQYKDKDNVKLYTH 212 (517) T ss_pred cEEEEEc----CeEEEeeCCCc----CeEEEEeeeeccHHHHHHHhhhhcc------hhhhh----hccCCcCceEEEEE Confidence 2355554 34665555431 1112677889999999888875211 10100 11222344444433 Q ss_pred EEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEE Q lcl|NC_013059. 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVP 302 (725) Q Consensus 223 w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP 302 (725) -++.+ | |. + +.|.-..|.++ ...+-||++.+||+| T Consensus 213 v~~~~---------~---~~-------------------------------~-~~~~~~d~~~~-~~~s~y~~~e~P~~~ 247 (517) T protein:vir:10 213 AKRTK---------D---GK-------------------------------Y-LIRQSADDVPV-GKESTVTEDKSPFLI 247 (517) T ss_pred EEEeC---------C---Cc-------------------------------e-EEEEEeCceee-ccccccccccCCeee Confidence 22211 1 10 0 11111223333 334678888999998 Q ss_pred EEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCc Q lcl|NC_013059. 303 VFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) Q Consensus 303 ~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) +-... .+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.++++.+...... .+...+..+....+ T Consensus 248 ~Rw~~--~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l------~~~~~g~~~~g~~~ 319 (517) T protein:vir:10 248 LTWKR--SYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQF------VEGGSGAVLHGVEG 319 (517) T ss_pred eeeee--cCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhc------cCCCccccccCCcc Confidence 75553 58889999999999999999999888888888888889998887654322111 11111111111111 Q ss_pred cccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhcc-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 383 EMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAV-NGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDG 461 (725) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~-~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g 461 (725) .+ .+++ +....-.......++...+.|....=+ + .++. ++..+++.=|..+.+.-...|...+.+|.. T Consensus 320 ~v--~~~~-~~~~~d~~~~~~~i~~~~~rI~~af~~-~-~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~------ 388 (517) T protein:vir:10 320 DI--HIVQ-LGKYADYTPIQAVLNDYRQRIGRVFMM-E-AMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFAT------ 388 (517) T ss_pred cc--eeee-cccccchhHHHHHHHHHHHHHHHHHhh-h-hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHH------ Confidence 11 1111 122222345567777777777776622 2 2332 333455666777777777777777776653 Q ss_pred HHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHh Q lcl|NC_013059. 462 EIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGK 541 (725) Q Consensus 462 ~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~ 541 (725) +++.=||...| .|....+. . .+ ..+.+..+. .+-.|.+.++.+.++++. T Consensus 389 Ell~Pli~r~~------~~l~~~l~-------~-------------~~----v~~~~~s~l-a~l~r~~~~~~i~~~~~~ 437 (517) T protein:vir:10 389 TFQGPLARWFM------NGISSILT-------S-------------KN----VSPTILTGI-EALGRMAELDKLGTFNGY 437 (517) T ss_pred HHHHHHHHHHH------HHhhhhcC-------C-------------CC----ccceeeccH-HHHHHHHHHHHHHHHHHH Confidence 22222322221 12211100 0 01 122232332 334577777777777666 Q ss_pred cccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhh-hhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 542 TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM-GVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) Q Consensus 542 ~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~-~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qa 620 (725) +.+.++.... + .+..|+ ++++..+......+ .+.. ++++..+.++++ +++++.++. + +.-. T Consensus 438 i~~~a~~~~~-~---~~~id~---d~~~~~~a~~~Gvp~~~ir--s~~ev~~~~~~~---~~~~~~~~~-~-----~~ag 499 (517) T protein:vir:10 438 VSMTAQWPEP-L---QQAIKW---PDFTDWVQGQISANFPFFK--TQDELNAEAQAQ---QEQEATKYA-A-----EQAG 499 (517) T ss_pred HHHhhcCChH-H---HhcCCH---HHHHHHHHHHhCCChhhcC--CHHHHHHHHHHH---HHHHHHHHH-H-----HHHH Confidence 5543321111 1 122333 33333332222111 1111 111111111000 000000000 0 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 621 ELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEF 665 (725) Q Consensus 621 e~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~ 665 (725) . .+.+.+.+-| . ..+ .-+ T Consensus 500 ~--~~~~~~~~~~-----------~--------~~~------~~~ 517 (517) T protein:vir:10 500 K--AIPDMVKNGQ-----------I--------NPQ------GGQ 517 (517) T ss_pred H--HHHHHHhCCC-----------C--------CCC------CCC Confidence 0 0000000000 0 000 000 No 107 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.33 E-value=1.1e-10 Score=75.28 Aligned_cols=466 Identities=10% Similarity=0.009 Sum_probs=219.1 Q ss_pred CCcH-HHHHHHHHHH------HHHHH-----hhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHhhcC----CCcccchHH Q lcl|NC_013059. 1 MADN-KNRLESILSR------FDADW-----TASDEARREAKNDLFFSRVS--QWDDWLSQYTTLQY----RGQFDVVRP 62 (725) Q Consensus 1 mad~-~~~~~~~~~~------~~~~~-----~~~~~~r~~a~~d~~f~~G~--QW~~~~~~~l~~~g----rp~~N~i~~ 62 (725) |=|. ...++.++.+ ++... ....+-+....+..+||.|+ .|....-. ....+ +..+|+-+- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-~~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYE-HNGNPVNRRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccc-cCCCccccceeecchHHH Confidence 7665 2333333332 11121 12334455667778999985 66432111 11112 224699999 Q ss_pred HHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCce Q lcl|NC_013059. 63 VVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQ 142 (725) Q Consensus 63 ~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~ 142 (725) +++...++.....+.+.+ +|...++. +.-+.+.|++.....++.+.++..|.||+++.+|. + ..+ T Consensus 80 iv~~~a~~l~~ep~~i~~-----~d~~~~e~----l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~---~---~~~ 144 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINI-----DDETAEEF----VLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG---N---KNV 144 (499) T ss_pred HHHHHHHhhhCCcceEee-----CCHHHHHH----HHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECC---C---CcE Confidence 999999999988888777 34444444 45566679999999999999999999999998762 2 134 Q ss_pred eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 143 VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 143 ~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) .|.. .++..+|+=.... -++..|-|+.... .+ .+..+..|+ T Consensus 145 ~i~~----v~a~~~~Pi~~d~-~~~~~~~f~~~~~---~~-------------------------------~~~y~~lE~ 185 (499) T protein:vir:80 145 KVSF----ATADCMYPLSNDS-ENVDECLIANSFH---KN-------------------------------NKYYKLLEW 185 (499) T ss_pred EEEE----EcCCceEEEEecC-CCeEEEEEEEEEe---ec-------------------------------CeEEEEEEE Confidence 4443 2344444211100 1233343322111 00 011122233 Q ss_pred EEEeccee-------EEEEeeCc-cccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCC Q lcl|NC_013059. 223 YEVVEKKE-------TAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) Q Consensus 223 w~~~~~~~-------~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p 294 (725) ++...... .++...+. ..|..+.... +.. -+.....++ T Consensus 186 h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~-----~~~-----------------------------~~~~~~~~~ 231 (499) T protein:vir:80 186 NEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKL-----LFN-----------------------------DIEPVVPLP 231 (499) T ss_pred EEecccceeeEEEEEEEEeccCccccCcccchhh-----hcc-----------------------------CcCCceeec Confidence 22211110 11111111 1122221110 000 000001111 Q ss_pred C-CccceEEEEe--eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcc-cc Q lcl|NC_013059. 295 G-EHIPIVPVFG--EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDD-YP 370 (725) Q Consensus 295 ~-~~~p~vP~~g--~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~ 370 (725) + ...||+.|-. ...-..+++.+.|++.++++..+.+|...|.+.+.+-.. ..++.++.+.+....+. ..... .. T Consensus 232 ~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~-~g~~~~~~ 309 (499) T protein:vir:80 232 SLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNL-DGSTTQYF 309 (499) T ss_pred CCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceecchhhhhccCCC-CCCcccCC Confidence 1 1122221100 001123444456789999999999999999999888664 44555554443211000 00000 00 Q ss_pred ccccccccccCcccc--ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLLNRTDENNGEMP--TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLETY 447 (725) Q Consensus 371 ~~~~~~~~~~~g~~~--~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~~~ 447 (725) ...........+... ...++...+.-...+++..++.....+....|++...+|-.++. .++.+|............ T Consensus 310 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~ 389 (499) T protein:vir:80 310 DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKN 389 (499) T ss_pred CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHH Confidence 000000000011111 11244343333334567888989999999999999888865433 367777776666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhH Q lcl|NC_013059. 448 VFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSM 527 (725) Q Consensus 448 ~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~ 527 (725) .....++.+++++.+.++.+..-+- ..+ |. ....++|.|+-..+.+.- T Consensus 390 ~~~~~~~~~l~~l~~~il~~~~~~~-------~~~--~~-----------------------~~~~~~v~v~f~d~i~~d 437 (499) T protein:vir:80 390 SHSQLIEQGIKEMIVSILEVGKLIK-------AYD--GD-----------------------TVELDTITVDFDDSIAQD 437 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc-------ccc--CC-----------------------CCCccceEEEeCCCCCCC Confidence 6777777777777777776543322 000 00 001245555555555554 Q ss_pred HHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHH Q lcl|NC_013059. 528 KQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQD 605 (725) Q Consensus 528 r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q 605 (725) +++..+.++++..+ + .++.- ..+.-.+..+-+.+++.++++.+...... +.+...-. ..+.+ T Consensus 438 ~~~~~~~~~~~~~~-G-i~S~e--t~l~~~~~~~d~ea~~el~~i~~E~~~~~---~~~d~~g~---------~ge~e 499 (499) T protein:vir:80 438 EDTTINRYTTAKNQ-G-MIPLK--IALQRAWNITEAEADEWAEMLAKEKQAEI---PNNDMTGI---------FGEEE 499 (499) T ss_pred HHHHHHHHHHHHHc-C-CCCHH--HHHhhcCCCChHHHHHHHHHHHHHhhcCC---CCCCcccc---------CCCCC Confidence 55666666665532 2 22211 11111222233334444444433221100 00000000 00000 No 108 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.31 E-value=1.3e-10 Score=74.77 Aligned_cols=489 Identities=11% Similarity=0.008 Sum_probs=216.4 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHhhcCCCcccch-HHHHHHHHH----HHh-hCC Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRV--SQWDDWLSQYTTLQYRGQFDVV-RPVVRKLVS----EMR-QNP 75 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G--~QW~~~~~~~l~~~grp~~N~i-~~~v~~v~g----~~~-~nr 75 (725) .+++...+..++++ ..|-..+.+..+|..- -..+.+.........+| |+-. ...++.+.+ .-. -++ T Consensus 1 m~~~~~~l~~k~~R-----~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~-~dstg~~a~~~LAa~l~~~ltpp~~ 74 (514) T protein:vir:80 1 MRQQASAMWAEYRD-----STAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYD-FQSAGAFLVNNLTAKLALTLFPPGR 74 (514) T ss_pred CccchHHHHHHhhc-----chHHHHHHHHHHHhcccccCCCCCCcccccccccc-cchhHHHHHHHHHHHHHhhhcCCCC Confidence 33333333332221 1233333444455532 12222211111111233 3222 233343322 222 255 Q ss_pred cceEEecCCc-------chHHHHHH------HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCce Q lcl|NC_013059. 76 IDVLYRPKDG-------ASPDAADV------LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQ 142 (725) Q Consensus 76 ~~~~~~pr~~-------~d~~~Ae~------l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~ 142 (725) +=+++.+.+. .+.+.+++ .+..+......|++..+...+|.+.+..|.+++-+ +++.. + T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~-----~~~~~--~- 146 (514) T protein:vir:80 75 PSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYR-----EPGTG--K- 146 (514) T ss_pred cccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE-----ecCCC--c- Confidence 6666666431 22233333 23334445567899999999999999999987533 23322 2 Q ss_pred eEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEE Q lcl|NC_013059. 143 VIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEF 222 (725) Q Consensus 143 ~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~ 222 (725) ++..|+ .++++..++.- ...-++++.+|+...+-+.|+.. ... .. ......+.|-|..+ T Consensus 147 -~~~~pl----~~y~v~~d~~G----~v~~i~rr~~~~~~~l~~~~~~~---~~~-----~~----~~~~~~~~v~v~~~ 205 (514) T protein:vir:80 147 -MLVWTM----QSYTVRRTSHG----DPAVVVLRQQMPFRELTPEIQAD---AQA-----KQ----IAKRDSDKCDLYTV 205 (514) T ss_pred -EEEEEc----CeEEEeeCCCc----CeEEEEeeeeecHHHhhhhhhhh---hhh-----hh----ccCCCCCceEEEEE Confidence 244443 34555544421 11126778889987665544321 100 00 00112334555444 Q ss_pred EEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEE Q lcl|NC_013059. 223 YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVP 302 (725) Q Consensus 223 w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP 302 (725) .++.+.+ .+ .+.-| |.-..|.+++ ..+-||+..+||+| T Consensus 206 v~~~~~~----------~~------------------------------~~~sv-~~e~~g~~i~-~es~y~~~e~P~i~ 243 (514) T protein:vir:80 206 IEWQPTP----------NG------------------------------KRCAV-WHELEGKRVG-PESSYPAHLCPYVP 243 (514) T ss_pred EEeecCC----------CC------------------------------eEEEE-EEeccceeec-ccCccccccCCeee Confidence 4432211 00 01111 1122345553 34678888899998 Q ss_pred EEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCc Q lcl|NC_013059. 303 VFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) Q Consensus 303 ~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) +-.. -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.++++.+-...... ++..+..+....+ T Consensus 244 ~Rw~--~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~------~~~~g~~v~g~~~ 315 (514) T protein:vir:80 244 VAWN--VPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYR------DAETGDFVPGQVG 315 (514) T ss_pred eeeE--ecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhc------ccCCceeecCCCc Confidence 7544 4589899999999999999999998888888888888889888876543221111 1111111111111 Q ss_pred cccccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 383 EMPTQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) Q Consensus 383 ~~~~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 460 (725) .+..++ ...--.....+++...+.|....=++ ..+.++..+++.-|..+.+.-...|...+.+|.. T Consensus 316 -----~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~--~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~----- 383 (514) T protein:vir:80 316 -----SVASYERGDYNKIAQASASVESIVMRLNRAFMYT--GQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAE----- 383 (514) T ss_pred -----cceeeecCcccchHHHHHHHHHHHHHHHHHHhhh--ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHH----- Confidence 122222 11122334566777777776543111 1223333356666777777777777777766663 Q ss_pred HHHHHHHHHHhcCCCcEEEEecc--CCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 461 GEIYQSIVNDIYDVPRNVVITLE--DGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 461 g~~ll~li~~~y~~~r~irI~~~--d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +++.=||...| .|... .|. +-.+- .++ +.+.+.. +-.+-.|.+..+.+..+ T Consensus 384 -Ell~Pli~r~~------~il~r~~~g~--lP~~p--------------~~l---~~~~~vs-~la~l~r~~~~~~l~~~ 436 (514) T protein:vir:80 384 -TLQAPLAYLTM------YEASRGNGGM--LLGIA--------------QGV---YRPSIIT-GIPALTRNIETANILRA 436 (514) T ss_pred -HHHHHHHHHHH------HHHhhhccCC--CCCCC--------------chh---hcceeee-cHHHHHHHHHHHHHHHH Confidence 22333332222 11110 011 00000 011 2333333 23344566667677766 Q ss_pred HHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQG 618 (725) Q Consensus 539 l~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~ 618 (725) ++.+....|..+. .++..|++ +++..+......+...--.+++..+..+++.++++++++. .++.. . T Consensus 437 ~~~i~~l~~~~p~----v~d~id~d---~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~---~~~~~---~ 503 (514) T protein:vir:80 437 TQEASAIVPALVQ----LSKRFDPE---KLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLD---VASGA---L 503 (514) T ss_pred HHHHHHHhccchh----hhhcCCHH---HHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHH---HHHHH---H Confidence 6655444332211 23334443 3333332222222111111222222211111111111000 00000 0 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_013059. 619 QAELAKAQNQTLSL 632 (725) Q Consensus 619 qae~~kaqae~~k~ 632 (725) .+.++....-. T Consensus 504 ---~~~~~~~~~~~ 514 (514) T protein:vir:80 504 ---AAETSAGVLTS 514 (514) T ss_pred ---HHhhhccccCC Confidence 00000000000 No 109 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.30 E-value=1.5e-10 Score=74.45 Aligned_cols=491 Identities=11% Similarity=0.022 Sum_probs=224.4 Q ss_pred CCc-HHHHHH----HHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHhhcCCCcccchHHHHHHHHHH---- Q lcl|NC_013059. 1 MAD-NKNRLE----SILSRFDADWTASDEARREAKNDLFFSRVSQWDDWL-SQYTTLQYRGQFDVVRPVVRKLVSE---- 70 (725) Q Consensus 1 mad-~~~~~~----~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~-~~~l~~~grp~~N~i~~~v~~v~g~---- 70 (725) |.- .++.+. .+..+|..-......|...+.+..+|..-.=++++. .+.. .+|--..-...++.+.+. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~---~~~~dstg~~a~~~LAa~l~~~ 77 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETS---QNGWQGVGAQATNHLANKLAQV 77 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccc---cccccchHHHHHHHHHHHHHhh Confidence 663 233333 566666666555666777777777887643222111 0111 123112333334433322 Q ss_pred Hh-hCCcceEEecCCcc-------hH---HHHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC Q lcl|NC_013059. 71 MR-QNPIDVLYRPKDGA-------SP---DAADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) Q Consensus 71 ~~-~nr~~~~~~pr~~~-------d~---~~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~ 136 (725) -. -+++=+++.+.+.. +. ++.+. .+.++......|++..+...+|.+.+..|.|++ |.+++ T Consensus 78 ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-----~~d~~ 152 (516) T protein:vir:10 78 LFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML-----YKPSK 152 (516) T ss_pred hcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE-----EecCC Confidence 22 24555565554321 11 23333 334444556788999999999999999999874 22221 Q ss_pred CCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCe Q lcl|NC_013059. 137 PTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDT 216 (725) Q Consensus 137 ~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (725) . . ++..|+ .++++..++.- ..--++++..|+...+.+.|++..... .. .... ..++. T Consensus 153 ~---~--~~~~pl----~~y~v~~d~~G----~v~~ivrr~~~~~~~l~e~~~~~~~~~---~~--~~~~-----~~~~~ 209 (516) T protein:vir:10 153 G---A--ISAIPM----HHYVVNRDTNG----DLLDIILLQEKSLRTFDPATRAVVEVG---LK--GKKC-----KEDDS 209 (516) T ss_pred C---C--eEEEEc----CeEEEeeCCCC----CeEEEeeeecccHHHHHHHhhhhhhhh---hh--hhcc-----CCCCc Confidence 1 1 344444 34555554421 112267777899988888776532110 00 0000 01122 Q ss_pred EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCC Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~ 296 (725) +.|..+-++ ++ .+ . +.++.-.+...+...+-||+. T Consensus 210 ~~i~t~v~~-----------~~-~~-------------------------------~--~~~~~~~d~~~~~~~s~~~~~ 244 (516) T protein:vir:10 210 IKLYTHAKY-----------LG-EG-------------------------------F--WELKQSADDIPVGKVSKIKSE 244 (516) T ss_pred eEEEEEEEe-----------cC-CC-------------------------------c--eEEEEeeCceeeccccccccc Confidence 222111111 10 00 0 111221233333345678888 Q ss_pred ccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccc Q lcl|NC_013059. 297 HIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNR 376 (725) Q Consensus 297 ~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 376 (725) .|||+|+-.. -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.++++.+-...+... ...+.. T Consensus 245 e~P~~~~Rw~--~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~------~~~g~~ 316 (516) T protein:vir:10 245 KLPFIPLTWK--RSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN------SGTGEV 316 (516) T ss_pred cCCeeeeeee--ecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhcc------CCCcee Confidence 9999987554 35898999999999999999999998888888888999999988765543222211 111111 Q ss_pred ccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 377 TDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATA 456 (725) Q Consensus 377 ~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~ 456 (725) +....+.+ .+++.- +..--......++...+.|....=+ +.+.-.++..+++.=|..+.+.-...|...+.+|.. T Consensus 317 ~~g~~~~v--~~~q~~-~~~d~~~~~~~i~~~~~rI~~af~~-~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~- 391 (516) T protein:vir:10 317 VTGVEEDI--HIVQLG-KYADLTPISAVLEVYTRRIGVVFMM-ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAT- 391 (516) T ss_pred ecCCcccc--eeeecC-cccchHHHHHHHHHHHHHHHHHHhh-hhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHH- Confidence 11111111 111111 1111244455666667777655422 212222333355556777777777777777666653 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHH Q lcl|NC_013059. 457 MRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEIL 536 (725) Q Consensus 457 ~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ 536 (725) +++.=||..-+ ..... +.. ..+ .++.+..+. .+-.|.+.++.+. T Consensus 392 -----Ell~Pli~r~~------~~~~p-----------~~P----------~~l---v~~~~v~~i-~~L~raq~~~~i~ 435 (516) T protein:vir:10 392 -----TMQSPVAMWGL------LEAGD-----------SFT----------SDL---VDPVIITGI-EALGRMAELDKLA 435 (516) T ss_pred -----HHHHHHHHHHH------HhhCC-----------CCC----------hhh---cCcceehhH-HHHHHHHHHHHHH Confidence 23333332221 01111 000 111 223332332 3334666677777 Q ss_pred HHHHhcccccchHHHHHHHhhccCCchhH-HHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_013059. 537 ELLGKTPQGTPEYQLLLLQYFTLLDGKGV-EMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVL 615 (725) Q Consensus 537 ell~~~~~~~p~~~~~~~~~~~~~d~~~~-~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~ 615 (725) .+++.+.+.++.. ...++..|+... +.+....+- +..+. ..+++...+.+++++++ +..++.. T Consensus 436 ~~~q~i~~~~q~~----p~v~d~id~d~~~~~~a~~~gv---p~~~i--rs~eev~~~r~~~~~~q---~~~~~~~---- 499 (516) T protein:vir:10 436 NFAQYMSLPLQWP----EPVLAAVKWPDYMDWVRGQISA---ELPFL--KSAEEMEQEQEAQMQAQ---QAQMLEE---- 499 (516) T ss_pred HHHHHHHHHhcCC----hHHHhhcCHHHHHHHHHHHhCC---Chhcc--CCHHHHHHHHHHHHHHH---HHHHHHH---- Confidence 6666554332211 112344444432 223332221 11111 11111111111111111 0100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 616 LQGQAELAKAQNQTLSLQIDAAKVEA 641 (725) Q Consensus 616 ~k~qae~~kaqae~~k~q~ea~~~q~ 641 (725) ...++.....+..+ +++ T Consensus 500 -----~~~~~~~~~~~~~~----~~~ 516 (516) T protein:vir:10 500 -----GVAKAVPGVIQQEL----KEA 516 (516) T ss_pred -----Hhhhcccchhhhhh----hcC Confidence 00111111100000 000 No 110 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.29 E-value=5.5e-11 Score=76.82 Aligned_cols=425 Identities=9% Similarity=-0.073 Sum_probs=178.7 Q ss_pred CCCHHHHHHHhhc-CCCcccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHH Q lcl|NC_013059. 40 QWDDWLSQYTTLQ-YRGQFDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQ 118 (725) Q Consensus 40 QW~~~~~~~l~~~-grp~~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~ 118 (725) =++.......+.. .+.+.|..+-+|+.+++...-+ -++ .|.... +..+.-+++.|+++...+.+..++ T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~-~~d~~~--------~~~~~~i~~~N~~d~~~~~~~~~a 69 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLAL--GVT-GPDGEP--------DTRASRWWQANRLDSRQKLVWRMA 69 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccC--cee-cCCCch--------HHHHHHHHHhcChhHHHHHHHHHH Confidence 1222222222222 2346699999999988854322 121 122111 222345678899999999999999 Q ss_pred HhcCcceEEEEeeeccCCCCCC-ceeEEEEeeecchh--heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcc Q lcl|NC_013059. 119 IEAGVGAWRLVTDYEDQSPTSN-NQVIRREPIHSACS--HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDA 195 (725) Q Consensus 119 ~~~G~G~~~v~~~~~~~~~~~~-~~~ir~~~~~~~~~--~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~ 195 (725) ++.|.||+-|..+.......+. ...|+.. ++. .++||+....+. +.++.|-... T Consensus 70 ~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~----~p~~~~~i~D~~~~~~~------~ai~~~~~~~------------- 126 (434) T protein:vir:98 70 MAQSAGYMLVGAHPTRTEDNGRPSPLITME----HPSECIVEYDPETGEPL------VGLKVWHNDI------------- 126 (434) T ss_pred hhcCceEEEEecCCCcccccCCceeEEEEe----ccceeEEEEeCCCCceE------EEEEEEEecc------------- Confidence 9999999888654211111111 1223221 222 367787654322 1222221000 Q ss_pred hhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEE Q lcl|NC_013059. 196 DDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR 275 (725) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 275 (725) +...+...||+- . ..++......++... +.. T Consensus 127 ------------------~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~-~~~--------------------------- 157 (434) T protein:vir:98 127 ------------------DGFGYARVFFDD-T--SFPYRTRERTGARLP-WGP--------------------------- 157 (434) T ss_pred ------------------CCceEEEEEEeC-c--EEEEEEeeccccccc-ccc--------------------------- Confidence 001111111111 0 011111111111100 000 Q ss_pred EEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhh Q lcl|NC_013059. 276 VYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ 355 (725) Q Consensus 276 v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~ 355 (725) ..|. ....+....|.+.+.+|+|||.-.+ +-...+.|-+..+++.++.+|+.+|.++......+.....+ .|. T Consensus 158 ~~~~---~~~~~~~~~~h~~g~vPvv~f~N~~---~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i-~G~ 230 (434) T protein:vir:98 158 DSWV---YTGTADSGDVHDLGGMQLVEFARMP---DLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWI-KGH 230 (434) T ss_pred ccce---ecccccccccCCCCccceEEeccCC---CcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cCC Confidence 0000 0111222334455667777774432 21123568899999999999999999887665554433222 111 Q ss_pred -cchHHHHHHhhccccccccccccccCccc---cccCCcccCCCC-chHHHHHHHHHHHHHHHHHhCCChHHhccCcchh Q lcl|NC_013059. 356 -IAGFEHMYDGNDDYPYYLLNRTDENNGEM---PTQPLAYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNGGQV 430 (725) Q Consensus 356 -i~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~~~~~~~-~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~ 430 (725) .+.+.+ . +.......+......+.+ +...+...+.+. ....+...+......+-.++++.+..+|...+.. T Consensus 231 ~~~~~~~---~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~ 306 (434) T protein:vir:98 231 KFAKRTD---P-ATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNI 306 (434) T ss_pred Ccccccc---c-ccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccccCCh Confidence 011100 0 000000000001111111 111122222111 2234555566666666677888889998666678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccc Q lcl|NC_013059. 431 AYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDI 510 (725) Q Consensus 431 Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi 510 (725) ||+|+......-..........|..+.+++.++++.+ . |.. . +. T Consensus 307 Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~----~---------g~~--~---------------------~~ 350 (434) T protein:vir:98 307 SADTIGALDILHVAKVREHIASFSEGLESVLALAAAQ----A---------GVP--E---------------------DY 350 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c---------CCC--h---------------------hh Confidence 9999998776666666666777777777776665543 1 110 0 00 Q ss_pred cccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhh Q lcl|NC_013059. 511 RGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQ 590 (725) Q Consensus 511 ~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~ 590 (725) +++.|.=.+..+....+..+.++.|.+.. .|. ..++..+...+ ..++++.+...++ T Consensus 351 ---~~~~v~w~~~~~~s~~~~ada~~kl~~~g---~~~--e~~~~~lg~~~-~e~~r~~~e~~~~--------------- 406 (434) T protein:vir:98 351 ---TEAEVRWANPAHVTMAVKADAATKLKSIG---YPL--DVIAEELDESP-ARVRRIVAGAASQ--------------- 406 (434) T ss_pred ---eeeeEEecCCCCCCHHHHHHHHHHHHhcC---CcH--HHHHHhCCCCH-HHHHHHHHHHHHH--------------- Confidence 23333323333222345555666655431 121 22222222211 1111111111000 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 591 QWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 591 ~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) ....+....+.. ++. + ...-+..-+ ++= T Consensus 407 -~~~~~~~~~~~~-~~~---~-----g~~~~~~~~-~dg 434 (434) T protein:vir:98 407 -ALLAASLLPAPG-APS---A-----GNVPDSGGA-VDG 434 (434) T ss_pred -HHHHHhhhccCC-CCC---C-----CCCCcccCC-CCC Confidence 000000000000 000 0 000000000 000 No 111 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.28 E-value=1.9e-10 Score=73.84 Aligned_cols=488 Identities=11% Similarity=-0.013 Sum_probs=213.6 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHh-----hCCcce Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR-----QNPIDV 78 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~-----~nr~~~ 78 (725) .+.++.....++++ ..|-..+.+..+|..-.=.....-.......++.-..-...++.+.+... -+++=+ T Consensus 1 mk~~~~~~~~~lkR-----~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:63 1 MKTTAAMLWEKLRD-----GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHhc-----cchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 55555555554442 23444445555666532111100000111123322333344444333222 244455 Q ss_pred EEecCCc-------chHH---HHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEE Q lcl|NC_013059. 79 LYRPKDG-------ASPD---AADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) Q Consensus 79 ~~~pr~~-------~d~~---~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir 145 (725) ++.+.+. .+.+ +.+. .+..+......|++..+...+|.+.+..|.+++-+ ++++ ..++ T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~-----~~~~----~~~~ 146 (510) T protein:vir:63 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR-----DSDA----ATVV 146 (510) T ss_pred ccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE-----cCCC----cEEE Confidence 5555432 2222 2222 33444455678899999999999999999886532 3332 2234 Q ss_pred EEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEE Q lcl|NC_013059. 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) Q Consensus 146 ~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~ 225 (725) ..|+ .++++..++.- ...-++++.+|+...+-+.|+....... . .-...+.|.|..+.++ T Consensus 147 ~~pl----~~y~v~~d~~G----~vd~i~rr~~~t~~~l~e~~~~~~~~~~--------~----~~~~~~~v~v~~~V~~ 206 (510) T protein:vir:63 147 AWSL----RSYAVRRDATG----RWMDIVLKQRYKSKDLDEEYKQDLMRAG--------R----NLSGSGSVDLYTHVQR 206 (510) T ss_pred EEEc----ceeEEeeCCCc----CeeEEEeeeeccHHHHhHHhhhhhhccc--------c----ccCCCcceEEEEEEEe Confidence 4444 34555544321 1122678888998877666654211100 0 0011223434433332 Q ss_pred ecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEe Q lcl|NC_013059. 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) Q Consensus 226 ~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g 305 (725) .+. + ......||+. ..|.++. ..+-||+..+||+|+-. T Consensus 207 ~~~------------~----------------------------~~~~~sv~~e-~dg~~~~-~~~~~~~~e~P~~~~Rw 244 (510) T protein:vir:63 207 KKG------------T----------------------------AMEYAELYHE-IDGVRVG-KEGRWPIHLCPYIVPTW 244 (510) T ss_pred ecC------------C----------------------------CceEEEEEEE-ecCceec-cccccccccCceeeeee Confidence 110 0 0111222222 2455543 34678889999998854 Q ss_pred eeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccc Q lcl|NC_013059. 306 EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 306 ~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) . -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.++.+.++.+-....... ...+..+....+ T Consensus 245 ~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~------~~~g~~v~g~~~--- 313 (510) T protein:vir:63 245 N--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD------AEMGDYVPGGAE--- 313 (510) T ss_pred e--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhcc------CCCceeecCCcc--- Confidence 4 45898999999999999999999988888887888888888888765432211111 111111111111 Q ss_pred ccCCcccC--CCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYE--NPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) Q Consensus 386 ~~~~~~~~--~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ 463 (725) .++.++ +..--.....+++...+.|....=+ + ....++..+++.=|..+.+.....+...+.+|.. ++ T Consensus 314 --~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~------E~ 383 (510) T protein:vir:63 314 --AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-G-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAE------NL 383 (510) T ss_pred --cceeeecCcccchHHHHHHHHHHHHHHHHHHHh-h-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHH------HH Confidence 122222 1122234456677777777765411 1 1112233355666778877777777777776653 22 Q ss_pred HHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcc Q lcl|NC_013059. 464 YQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) Q Consensus 464 ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~ 543 (725) +.-||... +.|....|- .++-+.. + +-.++ ++ .++--|.+.+..+..+++.+. T Consensus 384 l~Pli~r~------~~il~r~gl---~p~p~~~-------------~--~~~~v--~~-is~Laraq~~~~l~~~~q~l~ 436 (510) T protein:vir:63 384 QSPLAYVC------LSEVDDALL---QGLITKQ-------------H--KPAIE--TG-LPALSRSAAVQSMLNASQVIA 436 (510) T ss_pred HHHHHHHH------HHHHHhccC---CCCCchh-------------c--cccee--cc-hhHHHHHHHHHHHHHHHHHHH Confidence 33333222 222211110 0110000 1 11111 12 222234444444444443333 Q ss_pred cccchHHHHHHHhhccCCchhHHHHHHHHhhhhh--hhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAE 621 (725) Q Consensus 544 ~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~--~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae 621 (725) ...+..+ + .+-.|+ +++++.+..... +..+... +++.+++.++++++.++ +.++++.+...-+. T Consensus 437 ~~~~~aq--~---~~~id~---d~~~~~~a~~~Gv~p~~ivrs--~eev~a~~~~~~qq~~~----~~~~~~~~~~~a~~ 502 (510) T protein:vir:63 437 GLAPIAQ--L---DPRISL---PKMMDTIWAAFSVDTSQFYKS--ADELQAEAEQQRQQAAQ----AQAAQETLLEGASD 502 (510) T ss_pred HhcCchh--h---hccCCH---HHHHHHHHHHhCCChhHhcCC--HHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHh Confidence 2222211 1 122233 334443333222 1222211 11111111111110000 00000000000001 Q ss_pred HHHHHHHH Q lcl|NC_013059. 622 LAKAQNQT 629 (725) Q Consensus 622 ~~kaqae~ 629 (725) +..+-+.+ T Consensus 503 ~~~~~~g~ 510 (510) T protein:vir:63 503 MTNALAGV 510 (510) T ss_pred hcccccCC Confidence 11111111 No 112 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.24 E-value=3.7e-10 Score=72.30 Aligned_cols=463 Identities=9% Similarity=0.016 Sum_probs=224.9 Q ss_pred CCcHHHHHHHHHHHHHHH-----------------HhhhHHHHHHHHHHHHhhcCCCCCHHHHHH---HhhcCCCcccch Q lcl|NC_013059. 1 MADNKNRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVSQWDDWLSQY---TTLQYRGQFDVV 60 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~---l~~~grp~~N~i 60 (725) |. +.++++.+|+.- +.-.++-.....+..+||.|+.+.-.-... .+.+.+-.+|+- T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~ 76 (500) T protein:vir:30 1 MG----VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIA 76 (500) T ss_pred Cc----hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchH Confidence 43 223333333221 122234445677789999998553211111 111222346999 Q ss_pred HHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 61 ~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) +.+++...+.--.-.+.+.+. |. .++..+.-+.+.|++.....++++.++..|-+|+++.+| .+ T Consensus 77 ~~i~~~~A~lv~~e~~~i~~~-----d~----~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d---~~---- 140 (500) T protein:vir:30 77 RTAAKKIASLVFNEQAEIKVD-----DD----AANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD---GD---- 140 (500) T ss_pred HHHHHHHhhhhcCCcceEecC-----Ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe---CC---- Confidence 999999999888877777772 33 445556667778999999999999999999999999875 12 Q ss_pred ceeEEEEeeecchhheeeCCCccc-cChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVIWDSNSKL-MDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~~Dp~a~~-~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .+.|.. .+++.|| |-... -+...|-++++.. . .... .....+. T Consensus 141 ~~~I~~----v~ad~~~--P~~~d~~~~~~~a~~~~~~--~---------~~~~-------------------~~~~yt~ 184 (500) T protein:vir:30 141 KVRVAF----VQAPVFL--PLQSNTQDVSSAAVVIKSV--K---------TING-------------------KEVYYTL 184 (500) T ss_pred ceEEEE----EcCCeeE--EEEEcCCCeEEEEEEEEEe--e---------eecC-------------------CceEEEE Confidence 233433 2344444 21110 0112222222110 0 0000 0011123 Q ss_pred EEEEEEecce-----eEEEEeeCc-cccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCC Q lcl|NC_013059. 220 AEFYEVVEKK-----ETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI 293 (725) Q Consensus 220 ~E~w~~~~~~-----~~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~ 293 (725) .|+++..... -.++...+. .-|..+.... + |--+.....+.+ + T Consensus 185 lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~-----~-----------------------~~~l~~~~~~~~---~ 233 (500) T protein:vir:30 185 IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE-----V-----------------------YKDLKDEAKVTD---V 233 (500) T ss_pred EEEEEEeCCceeEEEEEEEecccccccCccccccc-----c-----------------------cCCcCcceEecc---C Confidence 3443321110 011111110 0122221110 0 000000000101 1 Q ss_pred CCCccceE--EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhc-ccc Q lcl|NC_013059. 294 AGEHIPIV--PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGND-DYP 370 (725) Q Consensus 294 p~~~~p~v--P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~ 370 (725) +..-|.|+ |+. . .-..+++.+-|++.++++..+.+|...|.+.+.+-+ +..++.++.+.+.....-..... ..+ T Consensus 234 ~~p~f~~~~~~~~-N-~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~ 310 (500) T protein:vir:30 234 TRPIFTYLKTPGM-N-NKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRP 310 (500) T ss_pred CCccEEEecCCcc-c-cccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCc Confidence 11111111 111 1 112355556688999999999999999999998866 45566666554421110000000 000 Q ss_pred ccccc--cccccCcc-ccccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLLN--RTDENNGE-MPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLE 445 (725) Q Consensus 371 ~~~~~--~~~~~~g~-~~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~ 445 (725) ...+. ....-++. -....++.+. |.+. ..+...++.....+....|++...+|-.++. .++.+|.+........ T Consensus 311 ~~d~~~~~~~~~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t 389 (500) T protein:vir:30 311 RFESDQNVYIRMGGRDLDSSAIQDLT-TPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQM 389 (500) T ss_pred ccCCCcceEEEcCCCCCcCcceeEec-cccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHH Confidence 00000 00001111 1112344433 4443 3567788888888988999988888765433 3577888887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccC Q lcl|NC_013059. 446 TYVFQDNLATAMRRDGEIYQSIVN--DIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS 523 (725) Q Consensus 446 ~~~~~dn~~~~~~~~g~~ll~li~--~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~ 523 (725) ...+...+..+++++-+.++.+.. .+|+.. +...++|.|+-..+ T Consensus 390 ~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~----------------------------------~~~~~~v~v~f~d~ 435 (500) T protein:vir:30 390 RNSIVALVEQSLKELVISIFEIAKAYDLYQSE----------------------------------VPSMDNISISLDDG 435 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC----------------------------------CCCCcceEEEeCCC Confidence 778888888998888888887543 233210 11235677777666 Q ss_pred chhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHH Q lcl|NC_013059. 524 FQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVE 595 (725) Q Consensus 524 ~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q 595 (725) ...-+++.++.++++..+ + .+|.-. .+.. ....+-+.+++.+++++....+..-. .. ......-+ T Consensus 436 i~~d~~~~~~~~~~~v~a-G-i~s~~~-~i~~-~~g~~eeea~~~l~~i~~E~~~~~~~-~~--~~~~~~g~ 500 (500) T protein:vir:30 436 VFTDRDAELDYWIKVVNA-G-FGTREM-AIQK-VLNVTEEKAQEIAAEINTGIVDEINQ-QR--TDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHc-C-CCCHHH-HHHh-cCCCCHHHHHHHHHHHHHhccccCCC-CC--ccccccCC Confidence 555566777777776554 2 233221 1112 22233333444444443321111000 00 00000000 No 113 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.24 E-value=3.7e-10 Score=72.30 Aligned_cols=463 Identities=9% Similarity=0.016 Sum_probs=224.9 Q ss_pred CCcHHHHHHHHHHHHHHH-----------------HhhhHHHHHHHHHHHHhhcCCCCCHHHHHH---HhhcCCCcccch Q lcl|NC_013059. 1 MADNKNRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVSQWDDWLSQY---TTLQYRGQFDVV 60 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~---l~~~grp~~N~i 60 (725) |. +.++++.+|+.- +.-.++-.....+..+||.|+.+.-.-... .+.+.+-.+|+- T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~ 76 (500) T protein:vir:98 1 MG----VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIA 76 (500) T ss_pred Cc----hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchH Confidence 43 223333333221 122234445677789999998553211111 111222346999 Q ss_pred HHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 61 ~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) +.+++...+.--.-.+.+.+. |. .++..+.-+.+.|++.....++++.++..|-+|+++.+| .+ T Consensus 77 ~~i~~~~A~lv~~e~~~i~~~-----d~----~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d---~~---- 140 (500) T protein:vir:98 77 RTAAKKIASLVFNEQAEIKVD-----DD----AANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD---GD---- 140 (500) T ss_pred HHHHHHHhhhhcCCcceEecC-----Ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe---CC---- Confidence 999999999888877777772 33 445556667778999999999999999999999999875 12 Q ss_pred ceeEEEEeeecchhheeeCCCccc-cChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVIWDSNSKL-MDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~~Dp~a~~-~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .+.|.. .+++.|| |-... -+...|-++++.. . .... .....+. T Consensus 141 ~~~I~~----v~ad~~~--P~~~d~~~~~~~a~~~~~~--~---------~~~~-------------------~~~~yt~ 184 (500) T protein:vir:98 141 KVRVAF----VQAPVFL--PLQSNTQDVSSAAVVIKSV--K---------TING-------------------KEVYYTL 184 (500) T ss_pred ceEEEE----EcCCeeE--EEEEcCCCeEEEEEEEEEe--e---------eecC-------------------CceEEEE Confidence 233433 2344444 21110 0112222222110 0 0000 0011123 Q ss_pred EEEEEEecce-----eEEEEeeCc-cccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCC Q lcl|NC_013059. 220 AEFYEVVEKK-----ETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI 293 (725) Q Consensus 220 ~E~w~~~~~~-----~~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~ 293 (725) .|+++..... -.++...+. .-|..+.... + |--+.....+.+ + T Consensus 185 lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~-----~-----------------------~~~l~~~~~~~~---~ 233 (500) T protein:vir:98 185 IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE-----V-----------------------YKDLKDEAKVTD---V 233 (500) T ss_pred EEEEEEeCCceeEEEEEEEecccccccCccccccc-----c-----------------------cCCcCcceEecc---C Confidence 3443321110 011111110 0122221110 0 000000000101 1 Q ss_pred CCCccceE--EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhc-ccc Q lcl|NC_013059. 294 AGEHIPIV--PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGND-DYP 370 (725) Q Consensus 294 p~~~~p~v--P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~ 370 (725) +..-|.|+ |+. . .-..+++.+-|++.++++..+.+|...|.+.+.+-+ +..++.++.+.+.....-..... ..+ T Consensus 234 ~~p~f~~~~~~~~-N-~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~ 310 (500) T protein:vir:98 234 TRPIFTYLKTPGM-N-NKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRP 310 (500) T ss_pred CCccEEEecCCcc-c-cccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCc Confidence 11111111 111 1 112355556688999999999999999999998866 45566666554421110000000 000 Q ss_pred ccccc--cccccCcc-ccccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHH Q lcl|NC_013059. 371 YYLLN--RTDENNGE-MPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLE 445 (725) Q Consensus 371 ~~~~~--~~~~~~g~-~~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~ 445 (725) ...+. ....-++. -....++.+. |.+. ..+...++.....+....|++...+|-.++. .++.+|.+........ T Consensus 311 ~~d~~~~~~~~~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t 389 (500) T protein:vir:98 311 RFESDQNVYIRMGGRDLDSSAIQDLT-TPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQM 389 (500) T ss_pred ccCCCcceEEEcCCCCCcCcceeEec-cccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHH Confidence 00000 00001111 1112344433 4443 3567788888888988999988888765433 3577888887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccC Q lcl|NC_013059. 446 TYVFQDNLATAMRRDGEIYQSIVN--DIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS 523 (725) Q Consensus 446 ~~~~~dn~~~~~~~~g~~ll~li~--~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~ 523 (725) ...+...+..+++++-+.++.+.. .+|+.. +...++|.|+-..+ T Consensus 390 ~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~----------------------------------~~~~~~v~v~f~d~ 435 (500) T protein:vir:98 390 RNSIVALVEQSLKELVISIFEIAKAYDLYQSE----------------------------------VPSMDNISISLDDG 435 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC----------------------------------CCCCcceEEEeCCC Confidence 778888888998888888887543 233210 11235677777666 Q ss_pred chhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHH Q lcl|NC_013059. 524 FQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVE 595 (725) Q Consensus 524 ~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q 595 (725) ...-+++.++.++++..+ + .+|.-. .+.. ....+-+.+++.+++++....+..-. .. ......-+ T Consensus 436 i~~d~~~~~~~~~~~v~a-G-i~s~~~-~i~~-~~g~~eeea~~~l~~i~~E~~~~~~~-~~--~~~~~~g~ 500 (500) T protein:vir:98 436 VFTDRDAELDYWIKVVNA-G-FGTREM-AIQK-VLNVTEEKAQEIAAEINTGIVDEINQ-QR--TDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHc-C-CCCHHH-HHHh-cCCCCHHHHHHHHHHHHHhccccCCC-CC--ccccccCC Confidence 555566777777776554 2 233221 1112 22233333444444443321111000 00 00000000 No 114 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.16 E-value=1e-09 Score=69.81 Aligned_cols=492 Identities=10% Similarity=0.006 Sum_probs=221.9 Q ss_pred CCcHHHH----HHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHH----HHh Q lcl|NC_013059. 1 MADNKNR----LESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVS----EMR 72 (725) Q Consensus 1 mad~~~~----~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g----~~~ 72 (725) |-|-..+ ...+..+|..-......|...+.+..+|..-.-++++..... ..+|--..-...++.+.+ .-. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~--~~~~~dstg~~a~~~LAa~l~~~lt 78 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCccc--ccccccchHHHHHHHHHHHHHHhhc Confidence 6553211 123444444444444555555666667776533322211110 112321333334443332 222 Q ss_pred -hCCcceEEecCCcc-------hHH---HHHH---HHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCC Q lcl|NC_013059. 73 -QNPIDVLYRPKDGA-------SPD---AADV---LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) Q Consensus 73 -~nr~~~~~~pr~~~-------d~~---~Ae~---l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~ 138 (725) -+++=+++.+.+.. +.+ +.+. .+..+......|++..+...+|.+.+..|.|++-+ | +. T Consensus 79 pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--d---~~-- 151 (515) T protein:vir:70 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--P---SK-- 151 (515) T ss_pred CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE--e---CC-- Confidence 24555666554421 122 2233 33445555678899999999999999999987533 2 21 Q ss_pred CCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEE Q lcl|NC_013059. 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQ 218 (725) Q Consensus 139 ~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vr 218 (725) + . ++..|+ .++++..++.- ...-++++..|+...+.+.|+... . ...... ....++.|. T Consensus 152 ~-~--~~~~pl----~~y~v~~d~~G----~v~~i~rr~~~t~~~l~~~f~~~~---~---~~~~~~----~~~~~~~v~ 210 (515) T protein:vir:70 152 G-A--MSAVPM----HHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAI---E---VGMKGK----KCKEDDNVK 210 (515) T ss_pred C-C--eEEEEc----CeEEEeeCCCc----CeeEEEeeeeccHHHHHHhhhhhh---h---hhhhhh----hcCCCCceE Confidence 1 1 344444 33555554432 112278888999998888877421 0 000000 011123333 Q ss_pred EEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCcc Q lcl|NC_013059. 219 IAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHI 298 (725) Q Consensus 219 v~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~ 298 (725) |..+-++++ + | .. .+|.-..|.++ ...+-||+..| T Consensus 211 i~~~v~~~~---------~---~-------------------------------~~-~~~~e~d~~~~-~~es~y~~~e~ 245 (515) T protein:vir:70 211 LYTHAQYAG---------E---G-------------------------------FW-KINQSADDIPV-GKESRIKSEKL 245 (515) T ss_pred EEEEEEecC---------C---C-------------------------------ce-EEEEecCceee-ccccccccccC Confidence 322111110 1 0 01 11122233333 34577888999 Q ss_pred ceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccccc Q lcl|NC_013059. 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) Q Consensus 299 p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) ||+|+-.. -.+|..|+.|.+.+..+--+.+|+.....+.....+.+.+++++++.+-..... .+...+..+. T Consensus 246 P~~~~Rw~--~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l------~~~~~g~iv~ 317 (515) T protein:vir:70 246 PFIPLTWK--RSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHF------VNSGTGEVIT 317 (515) T ss_pred Cceeeeee--ecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhc------cccCCceeec Confidence 99987544 458889999999999999999999999999888999999999987654322111 1111111111 Q ss_pred ccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 379 ENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) Q Consensus 379 ~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~ 458 (725) ...+.+ .+++ ..+..--......++...+.|....=++ .+.-.++-.+++.=|..+.+.-...+...+.+|.. T Consensus 318 g~~~~v--~~~~-~~~~~d~~~~~~~i~~~~~rI~~af~~~-~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~--- 390 (515) T protein:vir:70 318 GVAEDI--HIVQ-LGKYADLTPISAVLEVYTRRIGVIFMME-TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAM--- 390 (515) T ss_pred CCcccc--eeee-cCcccchhHHHHHHHHHHHHHHHHHhhh-hhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHH--- Confidence 111111 1111 1111122445566777777776655222 22222222345556777777777777777777653 Q ss_pred HHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 459 RDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 459 ~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +++.-||.... +...+. +. .++ .++.+.. +-.+-.|.+.++.+..+ T Consensus 391 ---Ell~Pli~r~~------~~~~p~--------------~P-------~~~---v~~~~vs-~l~~L~r~q~~~~i~~~ 436 (515) T protein:vir:70 391 ---TMQTPIAMWGL------QEAGDS--------------FT-------SEL---VDPVIVT-GIEALGRMAELDKLANF 436 (515) T ss_pred ---HHHHHHHHHHH------HhhCCC--------------CC-------hhh---cccceeh-hHHHHHHHHHHHHHHHH Confidence 22333332110 000100 00 011 2233322 33334566777777776 Q ss_pred HHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhh-hhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_013059. 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ-MGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) Q Consensus 539 l~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~-~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k 617 (725) ++.+...+..... .+...|++. +++.+...... ..+.. .+++.++..+++++++++ ++..++ T Consensus 437 ~q~i~~~~~~~p~----~~~~id~d~---~~~~~a~~~g~p~~~~r-s~eev~~~r~q~~~~~~~----~~~~~~----- 499 (515) T protein:vir:70 437 AQYMSLPQTWPEP----AQRAIRWGD---YMDWVRGQISAELPFLK-SEEEMQQEMAQQAQAQQE----AMLNEG----- 499 (515) T ss_pred HHHHHHHhccChh----HHhhCCHHH---HHHHHHHHhCCCccccC-CHHHHHHHHHHHHHHHHH----HHHHHh----- Confidence 6655432221111 223334433 22222221111 11111 111111111111111110 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLNAA 648 (725) Q Consensus 618 ~qae~~kaqae~~k~q~ea~~~q~q~q~~~a 648 (725) ..++.....+- ..+++ T Consensus 500 ----~~~a~~~~~~~-----------~~~~~ 515 (515) T protein:vir:70 500 ----VAKAVPGVIQQ-----------EMKEG 515 (515) T ss_pred ----hhhhcccchhh-----------hhccC Confidence 00000000000 00000 No 115 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.13 E-value=1.6e-09 Score=68.86 Aligned_cols=643 Identities=11% Similarity=0.026 Sum_probs=131.1 Q ss_pred HHHHH--HH-HHHhhhHHHHHHHHHHHHhhcCCCC---CHHHHHHHhhcCCC-c-ccchH---HHHHHHHHHHhhCCcce Q lcl|NC_013059. 10 SILSR--FD-ADWTASDEARREAKNDLFFSRVSQW---DDWLSQYTTLQYRG-Q-FDVVR---PVVRKLVSEMRQNPIDV 78 (725) Q Consensus 10 ~~~~~--~~-~~~~~~~~~r~~a~~d~~f~~G~QW---~~~~~~~l~~~grp-~-~N~i~---~~v~~v~g~~~~nr~~~ 78 (725) -+.+. .+ ...+-..-+..+...+..|++|.=| ..+..-|+ ..+.+ . -|+.+ +.|...+..-..+=.++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~-g~~~~~~~~~~s~~~~~~v~~~v~~~~~~l~~~ 79 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYF-GEPFGNERPGKSGIVSRDVQETVDWIMPSLMKV 79 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHh-CCCCCcccCCCCccccHHHHHHHHHHHHHHHHh Confidence 11110 00 0011122344455556677665422 22222222 12222 1 12222 33333333333333332 Q ss_pred EEecC-----CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcc-eEEEEeeeccCCCCCCceeEEEEee-ec Q lcl|NC_013059. 79 LYRPK-----DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVG-AWRLVTDYEDQSPTSNNQVIRREPI-HS 151 (725) Q Consensus 79 ~~~pr-----~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G-~~~v~~~~~~~~~~~~~~~ir~~~~-~~ 151 (725) -|-++ .|-...-+++.--+-.++.....-.....+++.+.+..++= -.-|..=|.+.. ....+. +...+ .. T Consensus 80 ~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~-~~~~~e-~~~~~~~~ 157 (705) T protein:vir:88 80 FTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEV-LKPTFE-RFSGLSED 157 (705) T ss_pred hcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccc-cchhhh-hhccCChh Confidence 22111 12222333322222222222211111122333333333321 111111000000 000000 00000 00 Q ss_pred chhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeE Q lcl|NC_013059. 152 ACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKET 231 (725) Q Consensus 152 ~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~ 231 (725) .+...+.||.+.-.+-|+-.+..+..+++....+...+-...++.+..- +++ ..+|. +...++...+... ..- T Consensus 158 ~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~--dp~--a~~~~--d~~~~~~~~~~t~-~dl 230 (705) T protein:vir:88 158 MVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLV--DRL--ATCID--DARFLCHREKYTV-SDL 230 (705) T ss_pred hhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHcee--cCC--CCCcc--cCcEEEEEEeccH-HHH Confidence 0001223443322222222221111111111000000000000000000 000 00011 1111111111100 000 Q ss_pred EEEeeCccccceeec-chhhhHHHHHHHHhc--------chhhhhccceeEEEE--EEEEee----ccccccC-CCCCCC Q lcl|NC_013059. 232 AFIYQDPVTGEPVSY-FKRDIKDVIDDLADS--------GFIKIAERQIKRRRV--YKSIIT----CTAVLKD-KQLIAG 295 (725) Q Consensus 232 ~~~~~d~~~g~~~~~-~~~~~~~~~~~~~~~--------g~~~~~~~~~~~~~v--~~~~~~----g~~~l~~-~~~~p~ 295 (725) .-.+.+....+-..+ +..........+... ....... ...+++| +-|++. |+.+.+- ...|.+ T Consensus 231 ~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~-~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g 309 (705) T protein:vir:88 231 RLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDD-AEANREVWASECYTLLDVDGDGISELRRILYVG 309 (705) T ss_pred HhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccc-cCCceeEEEEEeeeEecccCCcceeeEEEEEeC Confidence 000000000000000 000000000000000 0000000 0111222 223221 1211100 111334 Q ss_pred CccceEEEEeeeeccCCcccc--ch-hhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccc Q lcl|NC_013059. 296 EHIPIVPVFGEWGFVEDKEVY--EG-VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYY 372 (725) Q Consensus 296 ~~~p~vP~~g~~~~~d~~~~~--~G-~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 372 (725) +++.-++.||.+.|...+..| .. +-..+.+.=..+-...+++..-+..+.+ ...+. .+. T Consensus 310 ~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~-----------------~~~~~-~~~ 371 (705) T protein:vir:88 310 DYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIY-----------------RTNQG-RSV 371 (705) T ss_pred ccccccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHH-----------------hccCC-cee Confidence 444433333333333222111 11 1122233222233333332221111110 00110 000 Q ss_pred ccc-------cc-cccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc--chhHHHHHHHHHHHH Q lcl|NC_013059. 373 LLN-------RT-DENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG--GQVAYDTVNQLNMRA 442 (725) Q Consensus 373 ~~~-------~~-~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~--n~~Sg~ai~~~q~q~ 442 (725) ... .. ..++|.+...+..-+.+.+.|.-...+++ +++.....-....|... .+.+|.+......++ T Consensus 372 ~~~g~v~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~----ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~ 447 (705) T protein:vir:88 372 VLDGQVNLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYG----MLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAM 447 (705) T ss_pred ccccccCcccccccCCCeeEEecCCCccccccCCcCcHHHHH----HHHHHHHHHHHhhCCchHHcCCCcccccchhhHH Confidence 000 00 11222222222233333333333333333 33333333344555432 112223332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCCcEEEEeccCC-CcceEEeccccccccCCceeeeccccc-cceEEEE Q lcl|NC_013059. 443 DLETYVFQDNLATAMRRDGEIYQS-IVNDIYDVPRNVVITLEDG-SEKEVQLMAEVVDLATGERQVLNDIRG-RYECYTD 519 (725) Q Consensus 443 ~~~~~~~~dn~~~~~~~~g~~ll~-li~~~y~~~r~irI~~~d~-~~~~v~in~~~~d~~~g~~~~~nDi~g-~~Dv~v~ 519 (725) .+..+.+.-....+...+.+-. .+...+ ++++.+.-.-. .++.+.| +|..+ .+.. .+--..+ T Consensus 448 --~i~~~~~~~~~r~~~~~r~~a~~~~~~l~--~~~~~li~~~~~~~~~~ri--------~g~~v---~v~~~~~~~~~~ 512 (705) T protein:vir:88 448 --SVNQLMTAAEQQIDLIARMFAETGVKRLF--QLLHDHAIKYQNQEEVFQL--------RGKWV---AVNPANWRERSD 512 (705) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHhCCCceEEee--------ccchh---ccchHhhccCCc Confidence 1112222111111111111100 000000 11222211111 1222322 12110 0110 1111111 Q ss_pred eccCchhHHHHHHHHHHHHHHhcccccchHHHHH--HHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHH Q lcl|NC_013059. 520 VGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLL--LQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQ 597 (725) Q Consensus 520 ~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~--~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~ 597 (725) ..+.++..-...-..+..+...+. +..... ....+..+.....++.. ......+...+......+...+.. T Consensus 513 v~v~v~~~~~~~eq~~a~l~~ll~----~~q~l~~~~~~~~~~~~~~~~~~~~---el~e~~~~k~~~~~~~~~~~~e~~ 585 (705) T protein:vir:88 513 LTVTVGIGNMNKDQQMLHLMRIWE----MAQAVVGGGGLGVLVSEQNLYNILK---EVTENAGYKDPDRFWTNPNSPEAL 585 (705) T ss_pred eEEeeccccchHHHHHHHHHHHHH----HHHHhhcccchhhhcChHHHHHHHH---HHHHhhhhhhHHHHhhhhhhHHHH Confidence 222222211111112222111110 111110 11112222222222211 111112222211111111111111 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 598 QAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQ 677 (725) Q Consensus 598 q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~ 677 (725) +.++++.+.+ .+++.+..++|++.+++++++++. ++.++.++.+.... ++..+....+.++. ..+.+. T Consensus 586 ~~~~~~~q~e-~~~~~~~~~~q~e~~k~q~e~~~~-------q~e~q~~q~E~q~~--q~e~e~~~~~~~~~--~~e~~~ 653 (705) T protein:vir:88 586 QAKAIREQKE-AQPKPEDIKAQADAQRAQSDALAK-------QAEAQMKQVEAQIR--LAEIELKKQEAVLQ--QREMAL 653 (705) T ss_pred HHHHhhhhhh-hhHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHH--HHHHHHHHHHHHHH--HHHHHH Confidence 1111111111 112222333344444444333333 32222222221111 11111111111111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 678 DRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 678 ~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~~~~~q 725 (725) .+++..++..+..++.+.+..+.+.++....+. +...+..|+ T Consensus 654 ~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~------~~~~~~~~~ 695 (705) T protein:vir:88 654 KEAELQLERDRFTWERARNEAEYHLEATQARAA------YIGDGKVPE 695 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHhHHH Confidence 111111111111111111111111111111111 011111111 No 116 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.10 E-value=2.1e-09 Score=68.14 Aligned_cols=469 Identities=12% Similarity=0.010 Sum_probs=229.1 Q ss_pred CCcHHHHHHHHHHHHHHH------------------HhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcC----CC--c Q lcl|NC_013059. 1 MADNKNRLESILSRFDAD------------------WTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RG--Q 56 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~------------------~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~g----rp--~ 56 (725) |. .+++++.+|++. +.-.++-........+||.|++ +. .......| +. . T Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~-~~--~~~~~~~~~~~~~~~~s 73 (508) T protein:vir:15 1 MG----LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKL-QY--IHYQASDGIKKKRLKNT 73 (508) T ss_pred CC----hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCC-cc--cccccCCCCccccceee Confidence 44 233333333221 2223344455777889999862 11 11111112 12 4 Q ss_pred ccchHHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCC Q lcl|NC_013059. 57 FDVVRPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQS 136 (725) Q Consensus 57 ~N~i~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~ 136 (725) +|+-+.+++...+.-..-.+.+.|...+ ..+..+.-+.+.|++......+++.++..|-||+++.+| .+ T Consensus 74 ln~~~~i~~~~A~lv~~e~~~i~v~~~~--------~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d---~~ 142 (508) T protein:vir:15 74 INMAKTAARRIASVVFNEKAEIHVKDNN--------EADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYID---GN 142 (508) T ss_pred cchHHHHHHHHHhhhhCCCceEEeCCch--------HHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEe---CC Confidence 5999999999988888877788875422 233456667778999999999999999999999999875 22 Q ss_pred CCCCceeEEEEeeecchhheee-CCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCC Q lcl|NC_013059. 137 PTSNNQVIRREPIHSACSHVIW-DSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQD 215 (725) Q Consensus 137 ~~~~~~~ir~~~~~~~~~~v~~-Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (725) .+.|..+ +++.+|+ -.+. -+...|-|+......+. . ..+ T Consensus 143 ----~~~i~~v----~ad~~~P~~~d~--~~~~~~af~~~~~~~~~---------~---------------------~~~ 182 (508) T protein:vir:15 143 ----HIKIAWV----RADQFYPLQSNT--NDISEAAIASRTQRTES---------N---------------------QTK 182 (508) T ss_pred ----eeEEEEE----cCCeeEEEEEcC--CCeEEEEEEEEEEeecC---------C---------------------Cce Confidence 3334432 3444441 1111 12333333222211000 0 001 Q ss_pred eEEEEEEEEEecc-----eeEEEEeeCcc-ccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccC Q lcl|NC_013059. 216 TIQIAEFYEVVEK-----KETAFIYQDPV-TGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKD 289 (725) Q Consensus 216 ~vrv~E~w~~~~~-----~~~~~~~~d~~-~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~ 289 (725) ..++.|+++.... .-.++...++. -|..+.... +. +. .+ +.+. ..+.|- + T Consensus 183 ~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~-----~~-e~--~~---l~~~---------~~~~g~----~ 238 (508) T protein:vir:15 183 YYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLST-----LP-VY--KE---LAPQ---------VTISGL----Q 238 (508) T ss_pred EEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhh-----cc-cc--cC---CCcc---------eEecCC----C Confidence 1223343322110 00111111100 122111100 00 00 00 0000 000110 0 Q ss_pred CCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccc Q lcl|NC_013059. 290 KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY 369 (725) Q Consensus 290 ~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 369 (725) .++ +.+||. |+... -..+++.+-|.+.++++.++.+|...|.+.+.+ ..+..++.++++.+..-.+.....+.. T Consensus 239 ~p~--f~y~~~-~~~N~--~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~~ 312 (508) T protein:vir:15 239 RPL--FAYFKT-PGANN--INIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDTE 312 (508) T ss_pred cce--eEEecC-Ccccc--ccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCCC Confidence 000 111111 11111 123455566789999999999999999999988 456666677666553211000000000 Q ss_pred cccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 370 PYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLETYV 448 (725) Q Consensus 370 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~~~~ 448 (725) . -.+ ...+.....+..++.+.+.---..+...++.....+....|++...+|-.++. .++.+|............. T Consensus 313 ~-~~~--~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~ 389 (508) T protein:vir:15 313 Q-NVY--VGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSS 389 (508) T ss_pred C-eeE--EeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHH Confidence 0 000 00111111122344433322234567788988999999999999888866543 3677888888777777778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHH Q lcl|NC_013059. 449 FQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMK 528 (725) Q Consensus 449 ~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r 528 (725) ....+..+++++.+.++.+..-++--. .|. .....+ . ....++|.|+=+.+-..-+ T Consensus 390 ~~~~~~~al~~lv~~il~l~~~~~~~~--------~g~-~~~~~~--~-------------~~~~~~v~v~f~D~i~~d~ 445 (508) T protein:vir:15 390 YLTMVEKAIDELCQSIFELANAGALFD--------DGK-PLFTLD--S-------------ASQPLDIECHFDDGVFVNK 445 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccc--------ccc-cccccc--c-------------ccCCcceEEEeCCCCCCCH Confidence 888889999998888888754333110 000 000000 0 0124567777666666656 Q ss_pred HHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhh-------hhccchh Q lcl|NC_013059. 529 QQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGV-------KKPETPE 588 (725) Q Consensus 529 ~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~-------~~~~~~e 588 (725) ++.++.++++..+ + .+|.- ..+...+..+-+.+++.++++......... ..-...| T Consensus 446 ~~~~~~~~~~v~a-G-i~s~e--~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 446 DKQLEEDAKVLAI-G-ALSKQ--TFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHHHhc-C-CCCHH--HHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 6777777776532 2 22321 111112333334455555554443221100 0001111 No 117 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.04 E-value=4.1e-09 Score=66.58 Aligned_cols=485 Identities=8% Similarity=-0.030 Sum_probs=234.9 Q ss_pred CCcHHHHHHHHHHHHHHH-----------------HhhhHHHHHHHHHHHHhhcCCCCCHHHHHH---HhhcCCCcccch Q lcl|NC_013059. 1 MADNKNRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVSQWDDWLSQY---TTLQYRGQFDVV 60 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~---l~~~grp~~N~i 60 (725) |. ++.+++.+|++- +.-..+-+....+...+|.|++|.=..... .+.+.+..+|+- T Consensus 1 m~----~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~ 76 (517) T protein:vir:98 1 MK----VIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLR 76 (517) T ss_pred Cc----hHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcH Confidence 43 333344444331 222334455666678999998773211111 111122346888 Q ss_pred HHHHHHHHHHHhhCCcceEEecCC--cchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCC Q lcl|NC_013059. 61 RPVVRKLVSEMRQNPIDVLYRPKD--GASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) Q Consensus 61 ~~~v~~v~g~~~~nr~~~~~~pr~--~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~ 138 (725) +.++..+.+.--+-.+.+.|..-+ ..+....+..+..+.-+.+.|++.....++.+.++..|-|++++.+| .+ T Consensus 77 ~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d---~~-- 151 (517) T protein:vir:98 77 KLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD---NG-- 151 (517) T ss_pred HHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe---CC-- Confidence 888888888877777888887532 22233445566777778889999999999999999999999999876 11 Q ss_pred CCceeEEEEeeecchhheeeCCCccc-cChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeE Q lcl|NC_013059. 139 SNNQVIRREPIHSACSHVIWDSNSKL-MDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) Q Consensus 139 ~~~~~ir~~~~~~~~~~v~~Dp~a~~-~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~v 217 (725) .+.|.. .+.+.|| |-... -+.+.|-++++..... + .+.......+++..+.. -.....- T Consensus 152 --~~~I~~----v~ad~~~--Pl~~~~~~v~~~ai~~~~~~~~-~--------~~~~~Yt~lE~H~~~~~---~~~~~~y 211 (517) T protein:vir:98 152 --EIEFSW----ALANAFY--PLRSNSNGISEGVMKSVTTKVI-G--------NKTVYYTLLEFHEWEKT---EEGESLY 211 (517) T ss_pred --eeEEEE----EcCCeeE--EEEecCCCeEEEEEEEEEEEee-c--------CCceEEEEEEEEecCce---eccCCcE Confidence 233332 2344454 32111 1122233333221110 0 00000000011100000 0000111 Q ss_pred EEEEEEEEecceeEEEEeeC-ccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCC Q lcl|NC_013059. 218 QIAEFYEVVEKKETAFIYQD-PVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) Q Consensus 218 rv~E~w~~~~~~~~~~~~~d-~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~ 296 (725) +|.- .++...+ ...|..+.... +++.+. +. .++.| .+.. T Consensus 212 ~I~n---------~ly~s~~~~~lG~~v~L~~-----~~e~l~--------~~---------~~~~g---------~~~P 251 (517) T protein:vir:98 212 VITN---------ELYKSDNEGEIGKRIPLEE-----LYEGMQ--------EK---------TYIQG---------LSRP 251 (517) T ss_pred EEEE---------EEEecCCCccccccccccc-----cccCCC--------cc---------eeECC---------CCcc Confidence 1111 1111111 11233322111 000000 00 01111 0100 Q ss_pred ccceE--EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc Q lcl|NC_013059. 297 HIPIV--PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) Q Consensus 297 ~~p~v--P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) .|-|+ |+. .+ -..+++.+-+++.+++|..+.+|...|.+.+.+-+ +..++.++++.+....+- ......+.... T Consensus 252 lf~y~~~p~~-N~-~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~-~g~~~~~~~d~ 327 (517) T protein:vir:98 252 LFNYLKPSGF-NN-INPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDE-SGMPPPQVFDP 327 (517) T ss_pred eEEEecCCcc-cc-cccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCC-CCcccCCCCCc Confidence 11111 111 11 12356667789999999999999999999998877 445666666655211100 00000000000 Q ss_pred --ccccccCccccccCCcccCCCCc-hHHHHHHHHHHHHHHHHHhCCChHHhccCcchh-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 375 --NRTDENNGEMPTQPLAYYENPEV-PQANAYMLEAATAAVKEVATLGVDAEAVNGGQV-AYDTVNQLNMRADLETYVFQ 450 (725) Q Consensus 375 --~~~~~~~g~~~~~~~~~~~~~~~-~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~-Sg~ai~~~q~q~~~~~~~~~ 450 (725) .....-.+......++.+. |.+ -..++..++...+.|....|++...+|-.+..+ ++.+|.+..+..-.....+. T Consensus 328 ~~~~y~~~~~~~~~~~i~~~~-~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~ 406 (517) T protein:vir:98 328 DVNVYKSIRMGTDEEFVKDVT-HDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHV 406 (517) T ss_pred ccceeeeccCCCCCCceeeec-cccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHH Confidence 0000011111112233322 333 347788899999999999999998888765433 46678777777777777788 Q ss_pred HHHHHHHHHHHHHHHHHHHH--hcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHH Q lcl|NC_013059. 451 DNLATAMRRDGEIYQSIVND--IYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMK 528 (725) Q Consensus 451 dn~~~~~~~~g~~ll~li~~--~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r 528 (725) ..+..+++++-+.++.+..- .|+.. +...++|+|+=+.+...-+ T Consensus 407 ~~~~~aL~~lv~~i~~l~~~~~~~~~~----------------------------------~~~~~~v~v~f~D~i~~D~ 452 (517) T protein:vir:98 407 YEVEQFIKGLVISVLELAKTYKLFGGE----------------------------------IPSAEHIGVDFDDGVFQDR 452 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCC----------------------------------CCCCcceEEEcCCCCCCCH Confidence 88888888888888766543 23211 1124677777777766667 Q ss_pred HHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHH Q lcl|NC_013059. 529 QQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPA 607 (725) Q Consensus 529 ~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~ 607 (725) ++.++.++++..+ + .+|.... +.. ....+-+.+++.+.++....... .+....+ . +....--+.+ T Consensus 453 ~~~~~~~~~~v~a-G-~ms~~~~-i~~-~~g~~eeeA~~e~~~i~~E~~~~---~~~~~~~------~-~~~~~~gd~e 517 (517) T protein:vir:98 453 SALLRFYGQAKTF-G-FIPTVEA-IQR-IFKVPKKTAEQWLEEIRKDQIEL---DPVTISQ------R-AQKRMFGDEE 517 (517) T ss_pred HHHHHHHHHHHhc-C-CCCHHHH-HHH-hCCCChHHHHHHHHHHHHhcccc---CCCCccc------c-ccCCCCCCCC Confidence 7777777776543 2 2332221 111 22334444444444443322111 0000000 0 0000000000 No 118 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.98 E-value=8e-09 Score=64.97 Aligned_cols=470 Identities=10% Similarity=0.003 Sum_probs=227.1 Q ss_pred CCcHH---HHHHHHHHH--HHH---------HHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCC------cccch Q lcl|NC_013059. 1 MADNK---NRLESILSR--FDA---------DWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG------QFDVV 60 (725) Q Consensus 1 mad~~---~~~~~~~~~--~~~---------~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp------~~N~i 60 (725) |.=-. ..+++...+ +.. .+...++-.....++..||.|+... +......|++ .+|+- T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~---l~~~~~~~~~~~~~~~slnl~ 77 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQ---VTHKNSYGDTQKHELQSVNVT 77 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcc---ccccccCCCccccceeecchH Confidence 54221 122221110 000 1222334445556678899886431 1122223333 35888 Q ss_pred HHHHHHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCC Q lcl|NC_013059. 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) Q Consensus 61 ~~~v~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~ 140 (725) +.+++...+.--...+.+.+. |. ..+..+.-+.+.|++.....++.+.++..|-+|+++.+| ++ T Consensus 78 ~~i~~~~A~ll~~e~~~i~~~-----d~----~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D---~~---- 141 (505) T protein:vir:79 78 KLASAKLASLIFNEQCQVTVS-----DE----TANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD---SG---- 141 (505) T ss_pred HHHHHHHHhhhcCCCceeecC-----Ch----HHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe---CC---- Confidence 999999999888888777762 33 345556777788999999999999999999999999875 22 Q ss_pred ceeEEEEeeecchhhee---eCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeE Q lcl|NC_013059. 141 NQVIRREPIHSACSHVI---WDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) Q Consensus 141 ~~~ir~~~~~~~~~~v~---~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~v 217 (725) .+.|..+ +++.+| ||.. +..+|-++. +|...+ + + ...-. T Consensus 142 ~~~i~~v----~ad~~~P~~~d~~----~~~~~a~~~--~~~~~~-------~---~------------------~~~~y 183 (505) T protein:vir:79 142 KIKLAWA----TADQVYPLQADTN----QVNELAIAS--RTTEVE-------N---H------------------RTIYY 183 (505) T ss_pred ceEEEEE----cCCeeEEEEEcCC----CeEEEEEEE--EEEEec-------C---C------------------cceEE Confidence 2334332 334444 3332 233444333 221110 0 0 00112 Q ss_pred EEEEEEEEecce----eEEEEeeCc-cccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCC Q lcl|NC_013059. 218 QIAEFYEVVEKK----ETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) Q Consensus 218 rv~E~w~~~~~~----~~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~ 292 (725) ++.|+|+..... -.+|...+. ..|..+.+.. +.+ |.-+.....+ +. T Consensus 184 t~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~--~~~------------------------~~~l~~~~~~---~g 234 (505) T protein:vir:79 184 TLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNS--LEQ------------------------YEGLEPQVKI---TG 234 (505) T ss_pred EEEEEEEecCceEEEEEEEEecCCCCccCcccchhh--ccc------------------------ccccCcceee---cC Confidence 344544422111 111111111 1122221110 000 0000000000 00 Q ss_pred CCCCccceE--EEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHH--HHhhcc Q lcl|NC_013059. 293 IAGEHIPIV--PVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHM--YDGNDD 368 (725) Q Consensus 293 ~p~~~~p~v--P~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~ 368 (725) ++...|-|+ |+... -.++++.+-|.+.++++..+.+|...|.+.+.+-+. +.+..+++..+.....- ...... T Consensus 235 ~~~p~f~~~~~~~~N~--~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~~~~~~~ 311 (505) T protein:vir:79 235 LKHPLFAFYRNKGANN--KNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKG-QRRLIVPAEWLKTGSSYGGQASETH 311 (505) T ss_pred CCcceEEEecCCcccc--cccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhc-ccceeechHHhcccCCCCccccccc Confidence 111112222 22111 123455567889999999999999999999888654 44555555443211000 000000 Q ss_pred cccccc--ccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcch-hHHHHHHHHHHHHHHH Q lcl|NC_013059. 369 YPYYLL--NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLE 445 (725) Q Consensus 369 ~~~~~~--~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~-~Sg~ai~~~q~q~~~~ 445 (725) .+.... .....-.+......++.+.+.-.-.+++..++...+.|....|++...+|-.++. .++.+|.......... T Consensus 312 ~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t 391 (505) T protein:vir:79 312 PPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQT 391 (505) T ss_pred ccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHH Confidence 010000 0000001111122344454332334567788999999999999998888865543 3677888877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCch Q lcl|NC_013059. 446 TYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQ 525 (725) Q Consensus 446 ~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~ 525 (725) ...+...+..+++.+-+.++.+..-+.-.. .|...+. .+ ...++|+|+=..+-+ T Consensus 392 ~~~~~~~~~~al~~li~~i~~~~~~~~~~~--------~g~~~~~-----------------~~-~~~~~i~v~f~d~i~ 445 (505) T protein:vir:79 392 RSSYITQVEKTIKALTYAILELASVPSFYA--------DGQARWT-----------------GD-VDSLDITINFNDGVF 445 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccccc-----------------CC-CCceeEEEEeCCCCC Confidence 777888888888888888887755544111 0000000 00 124677777777766 Q ss_pred hHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHH Q lcl|NC_013059. 526 SMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQD 605 (725) Q Consensus 526 t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q 605 (725) .-+++..+.++++... + .++.- ..+...+..+-+.+++.++++...... ..+ T Consensus 446 ~d~~~~~~~~~~~v~~-G-i~s~e--~~l~~~~~~~eeea~~el~ri~~E~~~---~~p--------------------- 497 (505) T protein:vir:79 446 VDQESKRAADLQAVQA-Q-VMPKK--QFLMRNYGLDEEEADEWLAQIDAENST---AEP--------------------- 497 (505) T ss_pred CCHHHHHHHHHHHHHc-C-CCCHH--HHHHhcCCCChHHHHHHHHHHHHhccc---cCC--------------------- Confidence 6666777777776543 2 22321 111112223323344444444322110 000 Q ss_pred HHHHHHHHH Q lcl|NC_013059. 606 PAMVQAQGV 614 (725) Q Consensus 606 ~~~~~~qa~ 614 (725) +....-.+ T Consensus 498 -~~~~~gg~ 505 (505) T protein:vir:79 498 -EFNQFGGD 505 (505) T ss_pred -CchhccCC Confidence 00000000 No 119 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.84 E-value=3e-08 Score=61.82 Aligned_cols=657 Identities=9% Similarity=-0.084 Sum_probs=196.0 Q ss_pred hHHHHHHHHHHHHhhc--CCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCCcceEE-----------------ec Q lcl|NC_013059. 22 SDEARREAKNDLFFSR--VSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY-----------------RP 82 (725) Q Consensus 22 ~~~~r~~a~~d~~f~~--G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr~~~~~-----------------~p 82 (725) .++.+....+.+.++. -+.++++=...+++..=.--|.-.+.+..++ +.+.||.+.. .+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l--~~q~rp~~N~i~~~v~~v~g~e~~nr~d~ 78 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYT--TLQYRGQFDVVRPVVRKLVSEMRQNPIDV 78 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHH--HhcCCCcccchHHHHHHHHhhHHhCCcce Confidence 4444444443333332 1222222222222210000022222222222 2233332111 11 Q ss_pred -CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeecc--CCCCC-CceeEEEEeeecchhheee Q lcl|NC_013059. 83 -KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED--QSPTS-NNQVIRREPIHSACSHVIW 158 (725) Q Consensus 83 -r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~--~~~~~-~~~~ir~~~~~~~~~~v~~ 158 (725) ..|.+. ....+..++..+....-...-...++.++..+++.+ ++.|.. .|..+ ++...... | ....|++ T Consensus 79 ~v~p~~~-~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~---G~G~~ev~~d~~~~d~~~~~~~-i--~~~~i~~ 151 (725) T protein:vir:10 79 LYRPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEA---GVGAWRLVTDYEDQSPTSNNQV-I--RREPIHS 151 (725) T ss_pred EEecCCc-chHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhc---CcceeeeeccccCCCCCCCcee-e--eeeeccc Confidence 124343 445566666666666655555555556655555443 111110 01110 00110000 0 0001223 Q ss_pred CCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhh-hhcccccccccCCCeE-EEEEEEEEecceeEEEEee Q lcl|NC_013059. 159 DSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSF-QNPNDWVFPWLTQDTI-QIAEFYEVVEKKETAFIYQ 236 (725) Q Consensus 159 Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~~~~~~v-rv~E~w~~~~~~~~~~~~~ 236 (725) ||...-+|.. .+..+.+++.=.|-..-.+......+ ..+......|.+.... .-..-|+... +.++..+. T Consensus 152 ~~~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~vrv~E~~ 223 (725) T protein:vir:10 152 ACSHVIWDSN-------SKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQD-TIQIAEFY 223 (725) T ss_pred CHhHcccCch-------hhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCC-eEEEEEEE Confidence 3332223322 11122222221111111111100000 0111111112211111 0111233321 22222211 Q ss_pred --Cccccceee-cch--hhhHHHHH-HHHhcchhhhhcc--ceeEEEEEEEEeeccccccCCCC--CC-CCccceEEEEe Q lcl|NC_013059. 237 --DPVTGEPVS-YFK--RDIKDVID-DLADSGFIKIAER--QIKRRRVYKSIITCTAVLKDKQL--IA-GEHIPIVPVFG 305 (725) Q Consensus 237 --d~~~g~~~~-~~~--~~~~~~~~-~~~~~g~~~~~~~--~~~~~~v~~~~~~g~~~l~~~~~--~p-~~~~p~vP~~g 305 (725) .+.+..++. .++ ..+..... .+........... .+..+++.++.+....++ +... .| .-.+.++||++ T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~-g~~~l~~~~~~~~~~fP~vP 302 (725) T protein:vir:10 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIIT-CTAVLKDKQLIAGEHIPIVP 302 (725) T ss_pred EEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeec-chhhhcCCCCCCCCceeEEE Confidence 111111110 111 11111100 0000001111111 123344433332222232 3222 12 12234689988 Q ss_pred eeeccCCccccchhhh--hhhhHHHHHHHHHHHHHH-HHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCc Q lcl|NC_013059. 306 EWGFVEDKEVYEGVVR--LTKDGQRLRNMIMSFNAD-IVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) Q Consensus 306 ~~~~~d~~~~~~G~vr--~~kd~Q~~~N~~~s~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) +..+.... .|.-. .++..=+..=....+... .+-..+..+.....+..+.++...... .++.... .+..+.- T Consensus 303 ~~g~r~~~---~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~-~~~~~~~-~~~~~~~ 377 (725) T protein:vir:10 303 VFGEWGFV---EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY-DGNDDYP-YYLLNRT 377 (725) T ss_pred EEeeeecc---CCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHH-hccCCce-eeecccc Confidence 76544332 33322 444333333333333222 222333444444444444444332221 1221111 0111111 Q ss_pred cccccCC--cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 383 EMPTQPL--AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) Q Consensus 383 ~~~~~~~--~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~ 460 (725) ....+.+ ..+...+.|+-....++......+.+.-++...-...|..+++.+-.+....-..+...++.=| ..+++. T Consensus 378 ~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~-Dnl~~~ 456 (725) T protein:vir:10 378 DENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQ-DNLATA 456 (725) T ss_pred cccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 1111111 2233333434334566665555555544432222223333333222222222222233332222 222222 Q ss_pred HHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccc-----cccc----eEE--EEeccCc-hhHH Q lcl|NC_013059. 461 GEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDI-----RGRY----ECY--TDVGPSF-QSMK 528 (725) Q Consensus 461 g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi-----~g~~----Dv~--v~~~p~~-~t~r 528 (725) -+.+-.++..+.- .+. +.++.+.|-.+. . +-..+++|.- +|+. |++ .++..+. ++.- T Consensus 457 ~~~~g~~lL~lI~-----~~~---~~er~~RI~~ed--g-~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~ 525 (725) T protein:vir:10 457 MRRDGEIYQSIVN-----DIY---DVPRNVTITLED--G-SEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQ 525 (725) T ss_pred HHHHHHHHHHHHH-----HHc---CCCcEEEEecCC--C-CcceeEeccccccccccchhhhhccccceeEEEeeccCcH Confidence 2223333333221 111 234555553221 1 1235566642 3332 332 4555554 3433 Q ss_pred HHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHH Q lcl|NC_013059. 529 QQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAM 608 (725) Q Consensus 529 ~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~ 608 (725) ...-+.+..|++.++...|........++..++.+......+............... ++...+.+++.+++++.++ T Consensus 526 s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~----~~~~~e~~q~~~e~qq~~~ 601 (725) T protein:vir:10 526 SMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK----KPETPEEQQWLVEAQQAKQ 601 (725) T ss_pred HHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccC----CccccchhHHHHHHHHHHH Confidence 333333334444443332322222222223333332222222222211111111111 1111122222222333444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 609 VQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAE 688 (725) Q Consensus 609 ~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE 688 (725) .++++++.++++...+++++.++++.+..+++.++. +...++....+.......++..++....++.++..++ T Consensus 602 ~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~-------~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~ 674 (725) T protein:vir:10 602 GQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAA-------KVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVAS 674 (725) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Confidence 445555555555555555555544444433333322 2222222111111111112222222222222222221 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 689 LLLKGDEQTHKQRMEIA-NILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 689 ~~~~~~~q~~~q~~e~~-~~~~~~~~~q~~~~~~~~~q 725 (725) ...+ ....++..++.- +....+.+++....+.-++| T Consensus 675 ~q~~-~~~~~~~~ae~~~~~~~~~~~~~~~~~~~~~~q 711 (725) T protein:vir:10 675 FQQD-RSEDARANAELLLKGNEQTHKQRMDIANILQSQ 711 (725) T ss_pred HHHH-HHHHHHHhhHHHHHHHHHHHHHHhhhhhccccc Confidence 1111 111111111100 00011111222222222322 No 120 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.80 E-value=4.2e-08 Score=61.03 Aligned_cols=492 Identities=11% Similarity=-0.065 Sum_probs=222.8 Q ss_pred CCcHHHHHHHHHHH------HHHHHhhhHHHHHH-HHHHHHhhcCC--CCCHHHHHHHhhc-CCC-cccchHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSR------FDADWTASDEARRE-AKNDLFFSRVS--QWDDWLSQYTTLQ-YRG-QFDVVRPVVRKLVS 69 (725) Q Consensus 1 mad~~~~~~~~~~~------~~~~~~~~~~~r~~-a~~d~~f~~G~--QW~~~~~~~l~~~-grp-~~N~i~~~v~~v~g 69 (725) |+-++.-.....-. |-..+....+.|-. .+.-.+||.|+ ||..-. ..-..+ .|| .++-. ..|+| T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~l-rg~~~~~~r~~~~ps~----~~~~~ 75 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVIL-RGGDEGDQRPIYVPNG----EKLIE 75 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeec-CCccccccceeeehhh----HHhhC Confidence 44322111000000 00001222222322 23336788886 774211 111122 234 23333 45555 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) ..- .+.+-+-+..+...++.+..+++...+.++....+..+-.++++-|=|+++|.+|.... -+..+ +... T Consensus 76 ~~~----~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~--~~~R~--~v~~- 146 (527) T protein:vir:10 76 AKM----RFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD--EGSRL--SLHE- 146 (527) T ss_pred Ccc----eeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC--cCCCc--eEee- Confidence 332 25555545455566666778888888899999999999999999999999999875331 22223 3221 Q ss_pred ecchhheeeCCCccccChhcccceeeee----cCCHHHHH-----HHhhhcCCcchhhhhhhhcccccccccCCCeEEEE Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIH----SMSQNGWE-----DFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIA 220 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~----~~~~~~~~-----~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~ 220 (725) .||..+| |- ..+| +.+++.+++ |-.+++-+ ++-+.+-.. .++....+.....++.. T Consensus 147 -~DP~~~f--~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~---------l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 147 -VDPSTYF--PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKT---------LDDDGKPVPGGAIKYTE 211 (527) T ss_pred -cCcceee--ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhh---------cCcccccccCcceeeee Confidence 2443222 22 2233 566666664 43333322 111111000 00101111122223323 Q ss_pred EEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccce Q lcl|NC_013059. 221 EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPI 300 (725) Q Consensus 221 E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~ 300 (725) +.|.-.... | -.-...+++.+ -...+...++..|.|.+.+|| T Consensus 212 ~~w~lg~w~-------d---~~e~p~~~~~~----------------------------~~~~~~~~l~~lp~pi~fiPv 253 (527) T protein:vir:10 212 ELYEPGKWD-------D---RPESPLEPDDI----------------------------KKLSTLTEEEPLPEQITTLPV 253 (527) T ss_pred ceeeccccc-------c---ccccccchhhh----------------------------hhhcCceeeecccCCCCccce Confidence 344432110 0 00000111110 000122334567888899999 Q ss_pred EEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccccccc Q lcl|NC_013059. 301 VPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDEN 380 (725) Q Consensus 301 vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 380 (725) |.|-.. ..-+..-+++-..+++++++.+|+.+|-...++..+.+..... . .+...+. ....+ ++.+. T Consensus 254 V~~~t~--p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-t-g~~~vd~-~G~~~--------~~~Vg 320 (527) T protein:vir:10 254 FHFRGH--PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-D-SAPPRDS-RGNMV--------PWTIS 320 (527) T ss_pred EeecCC--CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-c-ccccccc-cCCcC--------ccccC Confidence 877333 2345545677788999999999999999998888876554333 2 2222211 11111 12222 Q ss_pred Ccccc----ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhcc--CcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 381 NGEMP----TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAV--NGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 381 ~g~~~----~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~--~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) +|.+. ..++..+....--..+...+......|.+++|+....+|. .++.-||.|+......- T Consensus 321 PG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PL------------ 388 (527) T protein:vir:10 321 PLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAI------------ 388 (527) T ss_pred CceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHH------------ Confidence 33322 3445555554444556677888888999999999999994 46777998776533221 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHH Q lcl|NC_013059. 455 TAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAE 534 (725) Q Consensus 455 ~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~ 534 (725) .++-.--++++..++.=|--..+.|- ...... +...|.-..++|.|.-+|-.|+-+.+.++. T Consensus 389 lar~~rk~L~~~~vqrq~~~~~~~~~---------L~aye~---------v~~~d~~~~~~v~ivf~p~lP~D~~avie~ 450 (527) T protein:vir:10 389 LSSCAEQELELKSVLKQFFYNLVTQW---------LPAYEG---------VGIDDADKKLTVTITFRDPKPVNSEKRFNQ 450 (527) T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHH---------HHHhhh---------cccCCCccccceEEEecccCCCCHHHHHHH Confidence 11111223344444422211111110 000000 111233335788999999999999888888 Q ss_pred HHHHHHhcccccc-hHHHHHH---HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHH-----HHHHHHHhhHH Q lcl|NC_013059. 535 ILELLGKTPQGTP-EYQLLLL---QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFV-----EAQQAKQGQQD 605 (725) Q Consensus 535 l~ell~~~~~~~p-~~~~~~~---~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~-----q~~q~qq~q~q 605 (725) +..+.+. + ..+ ...+..+ .+.+....+. +++.+..-.+..+..-....-. .++.. ..+--++.-.+ T Consensus 451 v~tL~~a-G-i~S~~tAv~~L~~~~g~eD~E~E~-~~I~~era~~a~a~a~A~~~~~--a~~~~~~g~~~~~~d~~~~~~ 525 (527) T protein:vir:10 451 LLQLWEA-G-LIPAKKLTEELSKIMGFELTEEDF-KQATEDKKTQGIAQAEAADPFG--AQMAAEQGIPDEEDDQALNGQ 525 (527) T ss_pred HHHHHHc-C-chhHHHHHHHHHhccCCCChHHHH-HHHHHHHHHHhHHhhhhcCchh--hhhccccCCCCCCcccccCCC Confidence 8877653 1 111 1111111 0111111111 1121111110000000000000 00000 00000000011 Q ss_pred HH Q lcl|NC_013059. 606 PA 607 (725) Q Consensus 606 ~~ 607 (725) +. T Consensus 526 ~~ 527 (527) T protein:vir:10 526 PL 527 (527) T ss_pred CC Confidence 11 No 121 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.78 E-value=4.9e-08 Score=60.66 Aligned_cols=494 Identities=11% Similarity=-0.062 Sum_probs=223.1 Q ss_pred CCcHHHHHHHHHHH------HHHHHhhhHHHHHH-HHHHHHhhcCC--CCCHHHHHHHhhc-CCC-cccchHHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLESILSR------FDADWTASDEARRE-AKNDLFFSRVS--QWDDWLSQYTTLQ-YRG-QFDVVRPVVRKLVS 69 (725) Q Consensus 1 mad~~~~~~~~~~~------~~~~~~~~~~~r~~-a~~d~~f~~G~--QW~~~~~~~l~~~-grp-~~N~i~~~v~~v~g 69 (725) |+-++.-.....-. |-..+....+.|-. .+.-.+||.|+ ||..-. ..-..+ .|| .++-. ..|+| T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~l-rg~~~~~~r~~~~ps~----~~~~~ 75 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVIL-RGGDEGDQRPIYVPNG----EKLIE 75 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeec-CCccccccceeeehhh----HHhhC Confidence 44322111000000 00001222222322 23336788886 774211 111122 234 23333 45555 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEee Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPI 149 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~ 149 (725) ..- .+.+-+-+..+...++.+..+++...+.++....+..+-.++++-|=|+++|.+|.... -+..+ +... T Consensus 76 ~~~----~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~--~~~R~--~v~~- 146 (527) T protein:vir:10 76 AKM----RFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD--EGSRL--SLHE- 146 (527) T ss_pred Ccc----eeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC--cCCCc--eEee- Confidence 332 25555545455566666778888888899999999999999999999999999875331 22223 3221 Q ss_pred ecchhheeeCCCccccChhcccceeeee----cCCHHHHH-----HHhhhcCCcchhhhhhhhcccccccccCCCeEEEE Q lcl|NC_013059. 150 HSACSHVIWDSNSKLMDKSDARHCTVIH----SMSQNGWE-----DFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIA 220 (725) Q Consensus 150 ~~~~~~v~~Dp~a~~~d~sDa~~~~~~~----~~~~~~~~-----~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~ 220 (725) .||..+| |- ..+| +.+++.+++ |-.+++-+ ++-+.+-.. .++....+.....++.. T Consensus 147 -~DP~~~f--~~-ed~d--~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~---------l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 147 -VDPSTYF--PY-EDPR--YPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKT---------LDDDGKPVPGGAIKYTE 211 (527) T ss_pred -cCcceee--ee-ecCC--CCCceeeEEEeeeccCCccccccceehhhhhhhhh---------cCcccccccCcceeeee Confidence 2443222 22 2233 566666664 43333322 111111000 00101111122223323 Q ss_pred EEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccce Q lcl|NC_013059. 221 EFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPI 300 (725) Q Consensus 221 E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~ 300 (725) +.|.-.... | -.-...+++.+ -...+...++..|.|.+.+|| T Consensus 212 ~~w~lg~w~-------d---~~e~p~~~~~~----------------------------~~~~~~~~l~~lp~pi~fiPv 253 (527) T protein:vir:10 212 ELYEPGKWD-------D---RPESPLEPDDI----------------------------KKLSTLTEEEPLPEQITTLPV 253 (527) T ss_pred ceeeccccc-------c---ccccccchhhh----------------------------hhhcCceeeecccCCCCccce Confidence 344432110 0 00000111110 000122334567888899999 Q ss_pred EEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccccccccc Q lcl|NC_013059. 301 VPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDEN 380 (725) Q Consensus 301 vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 380 (725) |.|-.. ..-+..-+++-..+++++++.+|+.+|-...++..+.+..... . .+...+. ....+ ++.+. T Consensus 254 V~~~t~--p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-t-g~~~vd~-~G~~~--------~~~Vg 320 (527) T protein:vir:10 254 FHFRGH--PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-D-SAPPRDS-RGNMV--------PWTIS 320 (527) T ss_pred EeecCC--CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-c-ccccccc-cCCcC--------ccccC Confidence 877333 2345545677788999999999999999998888876554333 2 2222211 11111 12222 Q ss_pred Ccccc----ccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhcc--CcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 381 NGEMP----TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAV--NGGQVAYDTVNQLNMRADLETYVFQDNLA 454 (725) Q Consensus 381 ~g~~~----~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~--~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~ 454 (725) +|.+. ..++..+....--..+...++.....|.+++|+....+|. .++.-||.|+......- T Consensus 321 PG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PL------------ 388 (527) T protein:vir:10 321 PLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAI------------ 388 (527) T ss_pred CceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHH------------ Confidence 33322 3445555554444556777888888999999999999994 46777998776533221 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHH Q lcl|NC_013059. 455 TAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAE 534 (725) Q Consensus 455 ~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~ 534 (725) .++-.--++++..++.=|--..+.|- ...... +...|.-..++|.|.-+|-.|+-+.+.++. T Consensus 389 lar~~rk~L~~~~Vqrq~~~~~~~~~---------L~aye~---------v~~~d~~~~~~v~ivf~p~lP~D~~avie~ 450 (527) T protein:vir:10 389 LSSCAEQELELKSVLKQFFYNLVTQW---------LPAYEG---------VGIDDADKKLTVTITFRDPKPVNNEKRFAQ 450 (527) T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHH---------HHHhhh---------cccCCCccccceEEEecccCCCCHHHHHHH Confidence 11111223344444422211111110 000000 111233335788999999999999888888 Q ss_pred HHHHHHhcccccc-hHHHHHH---HhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHH---HHHHHHHHHhhHHHH Q lcl|NC_013059. 535 ILELLGKTPQGTP-EYQLLLL---QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQW---FVEAQQAKQGQQDPA 607 (725) Q Consensus 535 l~ell~~~~~~~p-~~~~~~~---~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~---~~q~~q~qq~q~q~~ 607 (725) +..+.+. + ..+ ...+..+ .+.+....+. +++.+..-.+..+..-....-....-. ....+--++.-.++. T Consensus 451 v~tL~~a-G-iiS~etAv~~L~~~~g~eD~E~E~-~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 451 LLELWEA-G-LIPAKKLTEELSKIMGFELTEEDF-RQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHHHHc-C-chhHHHHHHHHHhccCCCchHHHH-HHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 8877653 1 111 1111111 0111111111 112111111100000000000000000 000000000001111 No 122 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.68 E-value=1.2e-07 Score=58.58 Aligned_cols=653 Identities=9% Similarity=-0.074 Sum_probs=208.5 Q ss_pred hHHHHHHHHHHHHhhc--CCCCCHHHHHHHhh----cCC--Cc------ccchHHHHHHHH---HHHhhCCcceEEec-C Q lcl|NC_013059. 22 SDEARREAKNDLFFSR--VSQWDDWLSQYTTL----QYR--GQ------FDVVRPVVRKLV---SEMRQNPIDVLYRP-K 83 (725) Q Consensus 22 ~~~~r~~a~~d~~f~~--G~QW~~~~~~~l~~----~gr--p~------~N~i~~~v~~v~---g~~~~nr~~~~~~p-r 83 (725) .++.++...+.+.++. -+.++++=...+++ .|. +. -+..+|.+|.|. ..-.+.-..-+..+ . T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~e~~nr~d~~v 80 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHHhCCcceEE Confidence 4444444444433332 12222222222222 121 11 023333332221 11111111111111 1 Q ss_pred CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc-hh-------- Q lcl|NC_013059. 84 DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA-CS-------- 154 (725) Q Consensus 84 ~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~-~~-------- 154 (725) .|.+. .-+.+..++..+....-...-...++.++..+++.+ ++|. +.++.+....+ +. T Consensus 81 ~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~-----------G~G~-~ev~~d~~~~d~~~~~~~i~~~ 147 (725) T protein:vir:92 81 RPKDG-ASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIES-----------GVGA-WRLVTDYEDQSPTSNNQVIRRE 147 (725) T ss_pred ecCCc-cHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhc-----------Ccce-eeeeecccCCCCCCCceeeEEe Confidence 24443 445666667776666655555666666666665553 1221 11221110000 00 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhh-hhcccccccccCCCeEE-EEEEEEEecceeEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSF-QNPNDWVFPWLTQDTIQ-IAEFYEVVEKKETA 232 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~~~~~~vr-v~E~w~~~~~~~~~ 232 (725) .|++|+...-+|.. .+..+.+++.-+|-..-.+.+....+ ..+.....+|.+..... -..-|+.+ .+.++ T Consensus 148 ~i~~~~~~V~~Dp~-------a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~vrv 219 (725) T protein:vir:92 148 PIHSACSHVIWDSN-------SKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQ-DTIQI 219 (725) T ss_pred eccCChhhcccCch-------hhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCC-CeEEE Confidence 01111111111211 12233333333332221221111110 11111112222211110 01113322 11222 Q ss_pred EEeeC--cccccee-ecch--hhhHHHHH-HHHhcchhhhhcc--ceeEEEEEEEEeeccccccCCCC--CCC-CccceE Q lcl|NC_013059. 233 FIYQD--PVTGEPV-SYFK--RDIKDVID-DLADSGFIKIAER--QIKRRRVYKSIITCTAVLKDKQL--IAG-EHIPIV 301 (725) Q Consensus 233 ~~~~d--~~~g~~~-~~~~--~~~~~~~~-~~~~~g~~~~~~~--~~~~~~v~~~~~~g~~~l~~~~~--~p~-~~~p~v 301 (725) ..+.- +....++ -.++ ..+..... .+........... .+..+++.++-+....++ +... .|. -.+.++ T Consensus 220 ~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~-g~~~l~~~~~~~~~~~ 298 (725) T protein:vir:92 220 AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIIT-CTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeec-chhhhcCCCCCCCCce Confidence 11110 0010111 0111 11111110 0000000111111 123333433322222222 3221 121 223468 Q ss_pred EEEeeeeccCCccccchhhh--hhhhHHHHHHHHHHHHHH-HHHhcCCcceeechhhcchHHHHHHhhcccccccccccc Q lcl|NC_013059. 302 PVFGEWGFVEDKEVYEGVVR--LTKDGQRLRNMIMSFNAD-IVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) Q Consensus 302 P~~g~~~~~d~~~~~~G~vr--~~kd~Q~~~N~~~s~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) ||+++..++... .|... .++..=+..=....+... .+...+..+.....+..+.++...... ..+.... .+. T Consensus 299 P~vP~~g~r~~~---~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~-~~~ 373 (725) T protein:vir:92 299 PIVPVFGEWGFV---EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY-DGNDDYP-YYL 373 (725) T ss_pred eeEEEEeeeecc---CCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHH-hccCccc-eee Confidence 998876544332 33322 443333222222222221 222223333333333333332222211 1111111 001 Q ss_pred ccCccccccCC--cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 379 ENNGEMPTQPL--AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATA 456 (725) Q Consensus 379 ~~~g~~~~~~~--~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~ 456 (725) .+.-....+.+ +.+...+.|+-....++......+.+--++...-...|..+++.+-.+....-..+...+..=|.. T Consensus 374 ~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dn- 452 (725) T protein:vir:92 374 LNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDN- 452 (725) T ss_pred ccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHH- Confidence 11111111111 223333333334456666666666555544333333333333332223332223332333322222 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccc-----cccc----eEE--EEeccCc- Q lcl|NC_013059. 457 MRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDI-----RGRY----ECY--TDVGPSF- 524 (725) Q Consensus 457 ~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi-----~g~~----Dv~--v~~~p~~- 524 (725) +++--+.+-.++..+.-. +. +.++.+.|-.+ |. +-..+.+|.- +|+. |++ .++..+. T Consensus 453 l~~~~~~~g~~lL~lI~~-----~~---~~~r~~RI~~e--dg-~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~ 521 (725) T protein:vir:92 453 LATAMRRDGEIYQSIVND-----IY---DVPRNVTITLE--DG-SEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVG 521 (725) T ss_pred HHHHHHHHHHHHHHHHHH-----hc---CCCcEEEEecC--CC-CcceEEeccccccccccchhhhhccccceeeEEeec Confidence 222223333333333211 11 23455555322 11 1245566652 3332 432 4444444 Q ss_pred hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhH Q lcl|NC_013059. 525 QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQ 604 (725) Q Consensus 525 ~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~ 604 (725) ++.-...-+.+..|++.++...|........+...++.+......+.......... .....++...+.+++.++++ T Consensus 522 p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~----~~~~~~~~~~e~~q~~~~~q 597 (725) T protein:vir:92 522 PSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI----QMGVKKPETPEEQQWLVEAQ 597 (725) T ss_pred cChHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhc----hhccCCccchhhhHHHHHHH Confidence 34333333333334444333323223222223333333333333332222111111 11111222233333334444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 605 DPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR 684 (725) Q Consensus 605 q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~ 684 (725) +.+..++++++.++++.+++++++.++++.+..+.+.++...+++.....+.. ......+..+.++..++.++ T Consensus 598 qa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~-------~~~~~q~~~~q~~~~~~~~~ 670 (725) T protein:vir:92 598 QAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARI-------AEIFNNMDLSKQSEFREFLK 670 (725) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhhHHHHHHHHHHH Confidence 55555666666677777777777777776666666555444433322111111 11111122222333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 685 ANAELLLKGDEQTHKQRMEI-ANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 685 ~~aE~~~~~~~q~~~q~~e~-~~~~~~~~~~q~~~~~~~~~q 725 (725) ..++...+.+. .++..+|. .+....+.+++-..+.+-++| T Consensus 671 ~~~~~q~~~~~-~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~ 711 (725) T protein:vir:92 671 TVASFQQDRSE-DARANAELLLKGNEQTHKQRMDIANILQSQ 711 (725) T ss_pred HHHHHHHHHHH-HHHHhchHHHHHHHHHHHHHHHHHHHhcch Confidence 33333222111 11112211 111111111112112222222 No 123 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.66 E-value=1.3e-07 Score=58.27 Aligned_cols=529 Identities=9% Similarity=-0.054 Sum_probs=224.6 Q ss_pred CCcHHHHHHHHHHHHHHH----HhhhHHHHHH-HHHHHHhhcCCCCCHHHHHHHhhcCCCccc--chHHHHHHHHHHHhh Q lcl|NC_013059. 1 MADNKNRLESILSRFDAD----WTASDEARRE-AKNDLFFSRVSQWDDWLSQYTTLQYRGQFD--VVRPVVRKLVSEMRQ 73 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~----~~~~~~~r~~-a~~d~~f~~G~QW~~~~~~~l~~~grp~~N--~i~~~v~~v~g~~~~ 73 (725) |+-+...-.--...|-.. +....+.|-. .+.-.+||.|+||+-... |...-+-+|| --+.+|+.+. ..-. T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~i--l~G~dr~~~~~ps~r~~V~~~~-~~Lg 77 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLV--LRGDDSVPILMPSGRKIVEAVH-RFLG 77 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhh--cCCCceeeeccchHHHHHHHHH-HhcC Confidence 553321111111111111 1111222222 233468999999964332 4444444453 5667777744 5555 Q ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecch Q lcl|NC_013059. 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) Q Consensus 74 nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~ 153 (725) .-..+.|-|.+ +|+...+.++.+++...+.++..-.+..+-.++++-|=|+++|.+|.... .+.. +++. ..|| T Consensus 78 ~~~~~~Ve~~~-~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~--~g~R--~rv~--~vDP 150 (563) T protein:vir:74 78 VGFDYLVEPDM-GDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKK--AGER--ISVD--EVDP 150 (563) T ss_pred CCcEEecCccc-cCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccc--cCCC--ceEe--ecCC Confidence 55667665555 45555577899999999999999999999999999999999999875331 2222 3332 2233 Q ss_pred hheeeCCCccccChhcccceeee---ecCCHHHHHH-HhhhcCCcchhhhhhhhcccccccccCCCeE-----EEEEEEE Q lcl|NC_013059. 154 SHVIWDSNSKLMDKSDARHCTVI---HSMSQNGWED-FAEKFDLDADDIPSFQNPNDWVFPWLTQDTI-----QIAEFYE 224 (725) Q Consensus 154 ~~v~~Dp~a~~~d~sDa~~~~~~---~~~~~~~~~~-~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~v-----rv~E~w~ 224 (725) ..+|. +..+| ++..+..+. .|-.+++.+. ++. ...+-+.|-++... ...|.|. T Consensus 151 ~~~fp---~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r--------------~~~~~~~lndeg~~~~~~~~dae~w~ 212 (563) T protein:vir:74 151 RQIFL---IEDGS-TVVGFHMVDIVQDFRSPDDPSKKLAR--------------RRTFRRVRNDEGMFTGRISSELTHWT 212 (563) T ss_pred ceeee---ccCCC-CcccceeeecccCCCCCcchhcccee--------------eeeeeeeeCCCCCccceeeeccchhc Confidence 33321 22222 011111111 1222222111 000 00000000000000 0111121 Q ss_pred EecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecccccc-CCCCCCCCccceEEE Q lcl|NC_013059. 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPV 303 (725) Q Consensus 225 ~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~-~~~~~p~~~~p~vP~ 303 (725) -..- |..+...+.+- ++.--++.-.+.++ +.-|-|.+.+|||-| T Consensus 213 lg~w--------d~r~~~~~~~~---------------------------~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~ 257 (563) T protein:vir:74 213 LGNW--------DDRGAISDEQA---------------------------RRKEQVRSAQHDEEEEELPEPISQLPLYRW 257 (563) T ss_pred cccc--------cccCccchhhh---------------------------cccchhhhhhhhchhhhccccccCccEEEc Confidence 1000 11111000000 00000111111111 233666677887632 Q ss_pred EeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcc Q lcl|NC_013059. 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) Q Consensus 304 ~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) . .....++.-+.+-..++..+.+.+|..+|-.-.++..+.++..+..- +. +.+.-......|....+.+ .+.++. T Consensus 258 -~-tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~-~~-p~d~~~g~~~~w~vgpG~i-~El~~~ 332 (563) T protein:vir:74 258 -R-NKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNA-SA-PVDPNTGELTDWNIGPMQI-VEIAGN 332 (563) T ss_pred -C-CCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecc-cc-ccccccccccccccCCcee-EeccCC Confidence 1 11123333345677889999999999999999998888776555442 11 1111111111222222211 111111 Q ss_pred ccccCCcccCC-CCchHHHHHHHHHHHHHHHHHhCCChHHhc--cCcchhHHHHHHHHHHHHHH---HH-HHHHHHHHHH Q lcl|NC_013059. 384 MPTQPLAYYEN-PEVPQANAYMLEAATAAVKEVATLGVDAEA--VNGGQVAYDTVNQLNMRADL---ET-YVFQDNLATA 456 (725) Q Consensus 384 ~~~~~~~~~~~-~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G--~~~n~~Sg~ai~~~q~q~~~---~~-~~~~dn~~~~ 456 (725) .....+-.+.. +++..-..-|=......|.+++|+....+| -.++.-||.|..-...--.. .- ..+...++.+ T Consensus 333 ~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~ 412 (563) T protein:vir:74 333 RNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQF 412 (563) T ss_pred ccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHH Confidence 11222333322 222222222223334477889999999999 45677899885543222111 11 1255566666 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHH Q lcl|NC_013059. 457 MRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEIL 536 (725) Q Consensus 457 ~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ 536 (725) +-..-.+||.+.+..|-.. + ...|+-+ .|+-++.=|+|.-+|-+|+-+.+.++... T Consensus 413 r~~~~~~lL~~~erl~~~g--------~-~~~~~g~---------------~~~~~~~~v~ivf~p~~P~d~~~vv~~~~ 468 (563) T protein:vir:74 413 LHDWMTMWLPAYESDFQEQ--------D-GSRPFAS---------------ADLLNECSVVCIFADPMPVNKTQVTQDTL 468 (563) T ss_pred HHHHHHHHHHHHHhHhhhh--------c-ccccccc---------------cccCCceEEEEEeCCCCCccHHHHHHHHH Confidence 7777777887777765322 1 1122211 12323556788899999999988888877 Q ss_pred HHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhh---hHHHHHH---HHH--HHhhHHHHH Q lcl|NC_013059. 537 ELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEE---QQWFVEA---QQA--KQGQQDPAM 608 (725) Q Consensus 537 ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~---~~~~~q~---~q~--qq~q~q~~~ 608 (725) .+.++ +-..-.....++.-. --..|-++...+.+........+.++...-. .++.... .++ -+.+.-..- T Consensus 469 tl~~a-GiiSretAv~~L~~~-g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p~~~~ 546 (563) T protein:vir:74 469 LLQQA-HLILRKMAVAKLRSI-GWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNPIDQF 546 (563) T ss_pred HHHHc-CchhHHHHHHHHHhC-CCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCchhHc Confidence 76543 100011111111100 0011222222222222111110000000000 0000000 000 000000000 Q ss_pred HHHHHHH--HHHHHHHHH Q lcl|NC_013059. 609 VQAQGVL--LQGQAELAK 624 (725) Q Consensus 609 ~~~qa~~--~k~qae~~k 624 (725) . ---++ --.|.-+.- T Consensus 547 ~-~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 547 G-NPVEIPPDVTQVPLSP 563 (563) T ss_pred C-CcccCCccccccCCCC Confidence 0 00000 000000000 No 124 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.11 E-value=4.3e-06 Score=50.00 Aligned_cols=654 Identities=8% Similarity=-0.019 Sum_probs=215.2 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHH--HHHHHhhCCcceEE- Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRK--LVSEMRQNPIDVLY- 80 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~--v~g~~~~nr~~~~~- 80 (725) +-+.+.+++.++...+++..+|..+++..+. ++.......| |.-.+.+.. =++.++++||-+.| T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~---------~d~~f~~~~G----~QW~~~~~~~~~~~l~~~~~P~~~~N 67 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCL---------EATRFARVPG----GQWEGATAAGSELGKHFEKYPKFEIN 67 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHH---------HHHhhhccCC----CCCCHHHHHHHHHHHhhCCCCeEEEc Confidence 7777777777777766666666655543321 2222222123 222233322 12344555553331 Q ss_pred ------------------ec-CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCc Q lcl|NC_013059. 81 ------------------RP-KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNN 141 (725) Q Consensus 81 ------------------~p-r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~ 141 (725) .+ ..|.+.+.-+.+..++..+....-.......++.++..+++.+ ++.|. T Consensus 68 ~i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~---G~G~~-------- 136 (720) T protein:vir:35 68 KISTELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTG---GFGCF-------- 136 (720) T ss_pred cHHHHHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhc---cceeE-------- Confidence 11 1245566556777888887777766666677777777766654 11111 Q ss_pred eeEEEEeeec-chhheeeCCCccccChhccccee---eeecCCHHHHHHHhhhcCCcchhhhhhhhccccc-ccccCCCe Q lcl|NC_013059. 142 QVIRREPIHS-ACSHVIWDSNSKLMDKSDARHCT---VIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWV-FPWLTQDT 216 (725) Q Consensus 142 ~~ir~~~~~~-~~~~v~~Dp~a~~~d~sDa~~~~---~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 216 (725) .++++...+ ++. .+++.-..++-.+|+.-++ ..+..+.+++.-.|-..-.+..+.... +.... ..|.+... T Consensus 137 -~v~~d~~~~~d~~-~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~--yp~~a~~~~~~~~~ 212 (720) T protein:vir:35 137 -RLTTNLVNALDPM-DERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAE--YNKDPATLMSGIER 212 (720) T ss_pred -EeeecccccCCCC-cccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHh--CCCccccccccccc Confidence 111110000 000 0010000000001111111 111233333332222211121111110 11000 01111000 Q ss_pred EEEEEEEEEecceeEEEEe--eCccccceeec-ch--hhh----HHHHHHHHhc-chhhhhccceeEEEEEEEEeecccc Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIY--QDPVTGEPVSY-FK--RDI----KDVIDDLADS-GFIKIAERQIKRRRVYKSIITCTAV 286 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~--~d~~~g~~~~~-~~--~~~----~~~~~~~~~~-g~~~~~~~~~~~~~v~~~~~~g~~~ 286 (725) -... -|+.. ...++..| ..+....++.+ ++ .++ .+....+... +.... -.+.++.|.++.+... + T Consensus 213 ~~~~-d~~~~-~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~r~~~~~~v~~~-~ 287 (720) T protein:vir:35 213 SWDY-DWYDV-DVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGF--IEAARRTIKRRRVYVS-V 287 (720) T ss_pred cccc-cccCC-CceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhcc--ccccccceeEEEEEEE-e Confidence 0000 13322 11222211 11111112111 11 111 0101111111 11111 1122232333222221 2 Q ss_pred ccCCCCC-CCCccc--eEEEEeeeeccCCccccchhh---hhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHH Q lcl|NC_013059. 287 LKDKQLI-AGEHIP--IVPVFGEWGFVEDKEVYEGVV---RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE 360 (725) Q Consensus 287 l~~~~~~-p~~~~p--~vP~~g~~~~~d~~~~~~G~v---r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~ 360 (725) +.+...- -...+| .+||++++.+... .-|.. -.+++.-+.-...-.....++...+..+.+...++.++.+ T Consensus 288 ~~g~~~l~~~~~~p~~~fP~vP~~g~r~~---~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~ 364 (720) T protein:vir:35 288 VDGEGFLEKAQRIPGEHIPLIPVYGKRWF---IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIK 364 (720) T ss_pred eccchhcccCCCCCCCccceEEEEeeeec---cCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHH Confidence 2222111 112233 5677777543332 12332 3456666666666666666777788888888888888765 Q ss_pred HHHHhhccccccccccccccCccccccCC----cccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCc---chhHHH Q lcl|NC_013059. 361 HMYDGNDDYPYYLLNRTDENNGEMPTQPL----AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG---GQVAYD 433 (725) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~----~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~---n~~Sg~ 433 (725) .....-.............+......+.+ ..+...+-++-....++........+-.+ .|... +..|+. T Consensus 365 ~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~v----sGi~~~~lG~~sn~ 440 (720) T protein:vir:35 365 TLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEV----TGSSQAMQPMPSNI 440 (720) T ss_pred HHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHH----hCCChHHcCcccch Confidence 55443333222221111111111111111 11112222233333455555555554433 34332 123432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeec----c Q lcl|NC_013059. 434 TVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLN----D 509 (725) Q Consensus 434 ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~n----D 509 (725) +-.+....-..+...++. |-...++.-+.+-.++..+.- .--+.++.+.|..+. + +...+.+| | T Consensus 441 SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~lI~--------~~y~~er~~RI~~ed--~-~~~~v~~n~~~~d 508 (720) T protein:vir:35 441 AKETVNHLMHRSDMSSFI-YLDNMAKSLKRAGEVWLSMAR--------EVYGSDRQVRIVNAD--G-TDDIALMSVVIND 508 (720) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH--------HHcCCCcEEEEecCC--C-CcceEeechhhhc Confidence 111222222222222222 112222222333333333221 111344566554321 1 12233333 2 Q ss_pred -cccc----ceEEE---EeccCc-hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh Q lcl|NC_013059. 510 -IRGR----YECYT---DVGPSF-QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG 580 (725) Q Consensus 510 -i~g~----~Dv~v---~~~p~~-~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~ 580 (725) .+|. -|+.+ ++..+. ++.-...-+.+..+++.++.+.|....... .+ +.+-+.+....+...... T Consensus 509 ~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~~~~-~~-----~~ile~~d~p~~~e~~er 582 (720) T protein:vir:35 509 NQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPMRQV-LQ-----GIILDNMEGEGLDEFKEY 582 (720) T ss_pred cCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchhHHH-HH-----HHHHHhcCchhHHHHHHH Confidence 2342 46653 445554 443333344444455444433332221111 00 111111111112122222 Q ss_pred hhhccchh--hhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 581 VKKPETPE--EQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMD 658 (725) Q Consensus 581 ~~~~~~~e--~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~ 658 (725) +.+...+. ..+... +.+++.++.+.+..++++++.++|+++.+++++.++++.++...++.+... ..++. T Consensus 583 irk~~~~~~~~~~~~~-e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~a-------qa~a~ 654 (720) T protein:vir:35 583 NRKQLLTQGVVKPRNT-EEEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQA-------QTEAR 654 (720) T ss_pred HHhhcchhcccCccCh-hHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 22111111 011111 111111222222233444455555555555555544443333333222221 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 659 LSKQSEFREFLKTVASFQQD-RSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 659 ~~~~~~~~e~~~~~~~~q~~-~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~~~~~q 725 (725) +.... ...+.++.+..++ ....+....+-..+.+...++..++......++.+.|......-++= T Consensus 655 ~~~a~--~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~~~~~~~~~~~~~ 720 (720) T protein:vir:35 655 VAEAK--MVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATDTQHKQNRDAAKNHSI 720 (720) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccchhhhhhHHHhhccCC Confidence 11111 1111111111000 00011111111111122222222222111111111110000000000 No 125 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.99 E-value=7.8e-06 Score=48.58 Aligned_cols=486 Identities=9% Similarity=-0.024 Sum_probs=220.1 Q ss_pred CCcH---HHHHHHHHHHHH-HH---------HhhhHHHHHHHHHHHHhhcCCCCCHHHHH---HHhhcCCCcccchHHHH Q lcl|NC_013059. 1 MADN---KNRLESILSRFD-AD---------WTASDEARREAKNDLFFSRVSQWDDWLSQ---YTTLQYRGQFDVVRPVV 64 (725) Q Consensus 1 mad~---~~~~~~~~~~~~-~~---------~~~~~~~r~~a~~d~~f~~G~QW~~~~~~---~l~~~grp~~N~i~~~v 64 (725) |.=- +..+++...+.- .. +...++-.....+...||.|+.+.-.-.. ....+.+-.+|+-+.++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 4411 222222221110 00 11123444556677889999755311100 01111223458888889 Q ss_pred HHHHHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeE Q lcl|NC_013059. 65 RKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVI 144 (725) Q Consensus 65 ~~v~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~i 144 (725) +...+.--.-.+.+.+. |+ .++..+..+.+.|++.....++.+.++..|-+++++.++ .+ .+.| T Consensus 81 ~~~A~lv~~e~~~i~v~-----d~----~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d---~~----~~~i 144 (522) T protein:vir:47 81 KKIASLVYNEQATITTK-----NE----ILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID---GD----KVRV 144 (522) T ss_pred HHHhhhhcCCcceeecC-----Ch----HHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc---CC----ceEE Confidence 88888887777777762 33 344455666678999999999999999999999999875 22 2333 Q ss_pred EEEeeecchhhee---eCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccc---cccccCC-CeE Q lcl|NC_013059. 145 RREPIHSACSHVI---WDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDW---VFPWLTQ-DTI 217 (725) Q Consensus 145 r~~~~~~~~~~v~---~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~---~~~~~~~-~~v 217 (725) .++ +.+.+| ||+.- ...|-++++........- ..| ...+++....+ .+..... ..- T Consensus 145 ~~v----~ad~~~P~~~~~~~----~~e~a~~~~~~~~~~~~~-~~y--------t~lE~he~~~~~~~~~~~~~~~~~~ 207 (522) T protein:vir:47 145 AFI----QAPVFFPLESNTQD----VSSAAILTKTIKSEGRKN-VYY--------TLVEFHEWVTADGQETGSTNDKKYY 207 (522) T ss_pred EEE----cCCceEEEEEcCCc----eEEEEEEEEEEeecccce-eEE--------EEEEEeeecccccccccccccCCce Confidence 332 222233 33321 122333333322111000 000 00011100000 0000000 011 Q ss_pred EEEEEEEEecceeEEEEeeCc-cccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCC Q lcl|NC_013059. 218 QIAEFYEVVEKKETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) Q Consensus 218 rv~E~w~~~~~~~~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~ 296 (725) +|.-.+|+- .+. ..|..+....- .+ | .-|.+..-+++- T Consensus 208 ~I~n~ly~~---------~~~~~lG~~v~l~~~--~e------------------------~------~~l~~~~~~~~~ 246 (522) T protein:vir:47 208 RITNELYRS---------DVNDVLGQRVNLSEL--DK------------------------Y------KNLEPVTVFENL 246 (522) T ss_pred EEEEEEeec---------CCCcccCcccccccc--cc------------------------c------cCCCCceEeCCC Confidence 111112221 110 11222211110 00 0 000000001111 Q ss_pred ccc-eEEE---EeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHH-hhccccc Q lcl|NC_013059. 297 HIP-IVPV---FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD-GNDDYPY 371 (725) Q Consensus 297 ~~p-~vP~---~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~ 371 (725) .-| |++| ... .-.++++.+-+++.++++..+.+|...|.+.+-+-+... +.+++...+.....--. .....+. T Consensus 247 ~~Plf~y~~~~~~N-~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~~~~ 324 (522) T protein:vir:47 247 SRPLFTYLKTPGMN-NKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQR-RVIVPEHLTQRQYQRPDGTIDFRPR 324 (522) T ss_pred CcceEEEecCCccc-ccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-eeecchHHhccCCCCCCcccccccc Confidence 111 1111 011 112456666788999999999999999999988776544 55555544321000000 0000000 Q ss_pred cc--cccccccCcc-ccccCCcccCCCCchH-HHHHHHHHHHHHHHHHhCCChHHhccCcc-hhHHHHHHHHHHHHHHHH Q lcl|NC_013059. 372 YL--LNRTDENNGE-MPTQPLAYYENPEVPQ-ANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRADLET 446 (725) Q Consensus 372 ~~--~~~~~~~~g~-~~~~~~~~~~~~~~~~-~~~~ll~~~~~~i~~~tGv~~~~~G~~~n-~~Sg~ai~~~q~q~~~~~ 446 (725) .. .+..+.-++. .....++.+. |.+.. .+...++.....|....|++...+|-.++ ..++.+|.+..+...... T Consensus 325 fd~~~~~f~~~~~~~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~ 403 (522) T protein:vir:47 325 FDVEQNVYMQIGGSSMDAGGITDLT-SPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMR 403 (522) T ss_pred cCcccceEeecCCCCCCCCcceeec-cccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHH Confidence 00 0001111111 1122344443 34443 45667888888888889998877776543 246778888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--hcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCc Q lcl|NC_013059. 447 YVFQDNLATAMRRDGEIYQSIVND--IYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSF 524 (725) Q Consensus 447 ~~~~dn~~~~~~~~g~~ll~li~~--~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~ 524 (725) ..+...+..+++++-+.++.+..- +|... ....++|+|+-..+. T Consensus 404 ~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~----------------------------------~~~~~~i~v~f~D~i 449 (522) T protein:vir:47 404 SSIVALVEQSIKELCVSMCELGKAVGVYSGE----------------------------------IPELDDISVNLDDGV 449 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccCC----------------------------------CCCcceeEEEcCCCC Confidence 888888999998888888876632 22110 012356666666665 Q ss_pred hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhH Q lcl|NC_013059. 525 QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQ 604 (725) Q Consensus 525 ~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~ 604 (725) ..-+++.++.++++..+ + .++.-. .+.. .+..+-+.+++.++++.....+.. +.+...-..-. ++.+ .- T Consensus 450 ~~D~~~~~~~~~~~v~a-G-~~s~e~-~i~~-~~g~~eeea~~el~ri~~E~~~~~---~~~~~~~~~~~--~~~~--~~ 518 (522) T protein:vir:47 450 FTDRHAELDYWAKMVAA-G-FSTKKR-AIGK-TLNISGVEAEKELNAINSELLPMN---DAELAIYGMHD--QNEE--KA 518 (522) T ss_pred CCCHHHHHHHHHHHHhc-C-CCCHHH-HHHh-cCCCChHHHHHHHHHHHHhhccCC---CCCCCCCCCCC--cccc--cC Confidence 55566777777776543 2 222211 1111 223333444445555543221110 00000000000 0000 00 Q ss_pred HHHHH Q lcl|NC_013059. 605 DPAMV 609 (725) Q Consensus 605 q~~~~ 609 (725) .. .. T Consensus 519 d~-~~ 522 (522) T protein:vir:47 519 DD-KG 522 (522) T ss_pred CC-CC Confidence 00 00 No 126 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=97.84 E-value=1.6e-05 Score=46.92 Aligned_cols=623 Identities=13% Similarity=0.049 Sum_probs=149.5 Q ss_pred CCcHH--HHHHHHHHHH-HHHHhhhHHHHHHHHHHHHhh-cC-CCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhCC Q lcl|NC_013059. 1 MADNK--NRLESILSRF-DADWTASDEARREAKNDLFFS-RV-SQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNP 75 (725) Q Consensus 1 mad~~--~~~~~~~~~~-~~~~~~~~~~r~~a~~d~~f~-~G-~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~nr 75 (725) ||.++ ..+..+--+- +.......+.+....+-+..+ .. .-|+++-...+++..=.-=|--.+.+.+++ +.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l--~~~g~ 78 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTER--ELEQR 78 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHH--HhcCC Confidence 88653 2222211111 111112223333333332322 22 334444333333211000022223333322 22333 Q ss_pred cceE--EecC------------------CcchH---------------------HHHHHHHHHHHHHHHhcChhHHHHHH Q lcl|NC_013059. 76 IDVL--YRPK------------------DGASP---------------------DAADVLMGMYRTDMRHNTAKIAVNIA 114 (725) Q Consensus 76 ~~~~--~~pr------------------~~~d~---------------------~~Ae~l~~~~~~~~~~~~~~~~~s~a 114 (725) |-+. .++. .|-+. +.-..+..++..+....-.......+ T Consensus 79 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) T protein:vir:10 79 PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) T ss_pred CcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHH Confidence 3222 1111 01110 11223444444444433333333333 Q ss_pred HHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeee-------------ecCCH Q lcl|NC_013059. 115 VREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVI-------------HSMSQ 181 (725) Q Consensus 115 ~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~-------------~~~~~ 181 (725) +.++..+++.+ ++| |-+|++|+.+...- +-..++.+ +-.+. T Consensus 159 ~s~af~d~~~~-----------G~G-------------~~ev~~d~~~~d~~--~~e~~i~~v~~p~~v~~Dp~a~~~D~ 212 (711) T protein:vir:10 159 YDIAFQGAVES-----------GMG-------------YLRVRSDYLADDSF--EQDLIIEAIQNQFSVTIDPDAKKRDR 212 (711) T ss_pred HHHHHHHhhhc-----------Ccc-------------eEEEEecccCCCCC--CCCeEEeeecChhheeeCccccccCh Confidence 33444444322 111 11134444332110 11111110 01111 Q ss_pred HHHHHHhhhcCCcchhhhhhh---hcccccccccCCCeEEEEEEEEEecceeEEEEe--eCccccceeecchhhh--HHH Q lcl|NC_013059. 182 NGWEDFAEKFDLDADDIPSFQ---NPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY--QDPVTGEPVSYFKRDI--KDV 254 (725) Q Consensus 182 ~~~~~~~p~~~~~~~~~~~~~---~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~--~d~~~g~~~~~~~~~~--~~~ 254 (725) +++.-.|-..-.+..+..... ........|... -..|+.+ .+.++..+ ..+.+-+++.+..... ... T Consensus 213 sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~-----~~~~~~~-~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~ 286 (711) T protein:vir:10 213 SDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVAD-----YDTWFTE-KSVRVSEYFTREPVIREIALLSDGRSFWLDA 286 (711) T ss_pred hhhcceeeeecCCHHHHHHhCCchhhhhhhcccccc-----cCcccCc-ceeeEEEEEeeeeeeeEEEeecCCceeccCc Confidence 121111111101111100000 000000001000 0113221 11111111 1111111111100000 000 Q ss_pred HHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCC-Cccc--eEEEEeeeeccCCccccchhhhhhhhHHHHHH Q lcl|NC_013059. 255 IDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAG-EHIP--IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRN 331 (725) Q Consensus 255 ~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~-~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N 331 (725) ............ ...+.++.+..+-..+..++ +....-. .-|| ++||+++..+... ....|....++..=+..= T Consensus 287 ~~~~~~~~~~~g-~~~~~~~~~~~~~v~~~~~~-G~~~L~~~~p~~~~~~P~vp~~g~r~~-~d~~~~~~G~vr~~~d~Q 363 (711) T protein:vir:10 287 LEDIVDELLEAG-ISIVRTRKVKTFKTYWRKIT-GANVLEGPVEIPSTTIPVIPVWGKSLI-IKKKEIFRSIIRHSKDAQ 363 (711) T ss_pred chhHHHHHHhcC-chhhhhhhhceeeEEEEEEe-cceeecCCCCCCCCcccEEEEeeeeec-cccccccchhhhhhhhhH Confidence 000000000000 00011111111111111111 2111100 1122 2455444321110 011222222221111111 Q ss_pred HHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc-ccccccCccccccCCc----ccCCCCchHHHHHHHH Q lcl|NC_013059. 332 MIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL-NRTDENNGEMPTQPLA----YYENPEVPQANAYMLE 406 (725) Q Consensus 332 ~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~----~~~~~~~~~~~~~ll~ 406 (725) ....+.... ..+++...+-..+ -....+.......+ +....++|.+...+.. .+...+.|+-....++ T Consensus 364 r~~N~~~s~------~~~~l~~~~~~~~-~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (711) T protein:vir:10 364 RMANYWDSA------ATETVALAPKAPF-IGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELT 436 (711) T ss_pred HHHHHHHHH------HHHHHHhcCCCce-eecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHH Confidence 111111100 0111100000000 00000000000000 0112233333322221 2333333334455666 Q ss_pred HHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCC Q lcl|NC_013059. 407 AATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGS 486 (725) Q Consensus 407 ~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~ 486 (725) ........+..++..+-...|..+++.+-.+....-..+...+.. +-..+++..+.+-.++-.+.-. .. .. T Consensus 437 ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~-~~dn~~~~~~~~g~~ll~li~~-----~~---~~ 507 (711) T protein:vir:10 437 LGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFA-FIDNLTKSIRRVGKILVEMIPH-----IY---DT 507 (711) T ss_pred HHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----Hc---CC Confidence 666666655554332222233333332222222222222222222 2222333333344444433211 11 23 Q ss_pred cceEEeccccccccCCceeeeccc-----ccc----ceEEE---EeccC-chhHHHHHHHHHHHHHHhcccccchHHHHH Q lcl|NC_013059. 487 EKEVQLMAEVVDLATGERQVLNDI-----RGR----YECYT---DVGPS-FQSMKQQNRAEILELLGKTPQGTPEYQLLL 553 (725) Q Consensus 487 ~~~v~in~~~~d~~~g~~~~~nDi-----~g~----~Dv~v---~~~p~-~~t~r~~~~~~l~ell~~~~~~~p~~~~~~ 553 (725) ++.|.|..+- .+-.++.+|.- +|. .|+++ ++..+ .++.-....+.+..|+. +.+..|. .. T Consensus 508 er~~rI~ged---~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~q-l~~~~p~---~~ 580 (711) T protein:vir:10 508 ERVVRLKFPD---ETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQ-FAQAVPS---AA 580 (711) T ss_pred CeEEEEecCC---CCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHH-HHhhcch---hh Confidence 3555553221 12234555542 232 46653 33333 34444443444444443 2222232 11 Q ss_pred HHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhH-HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 554 LQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQ-WFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) Q Consensus 554 ~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~-~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~ 632 (725) .. -.+.+-+.+....+......+.....+.... ...++.++.+++++....+.+.++++++++.++++++.+++ T Consensus 581 ~~-----~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~A 655 (711) T protein:vir:10 581 AV-----MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQA 655 (711) T ss_pred hH-----HHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1112222222222222222221111111110 01111111111111111112222222222222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 633 QIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQR 712 (725) Q Consensus 633 q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~ 712 (725) ++++ .+.++...+...+....+.. .+. +....+++ ..+.++.+++.+..+. +- T Consensus 656 qae~----------------~qa~~e~~~~q~q~~~~~~~--aq~--~~~~~qq~------~~~l~~~qaelq~~q~-~~ 708 (711) T protein:vir:10 656 QADM----------------LKAQLETEEAQKQLAMIEDM--AQG--GDVVYQQV------RELVAQALAEITASQA-NV 708 (711) T ss_pred HHHH----------------HHHHHHHHHHHHHHHHHHHH--HHH--HHHHHHHH------HHHHHHHHHHHHHHHH-Hh Confidence 1111 11111111111111111111 111 11111111 1122222222222221 11 Q ss_pred hcC Q lcl|NC_013059. 713 QNQ 715 (725) Q Consensus 713 ~~q 715 (725) .+| T Consensus 709 ~q~ 711 (711) T protein:vir:10 709 TEQ 711 (711) T ss_pred hcC Confidence 111 No 127 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=97.77 E-value=2.1e-05 Score=46.25 Aligned_cols=649 Identities=10% Similarity=-0.057 Sum_probs=189.8 Q ss_pred hHHHHHHHHHHHHhhc--CCCCCHHHHHHHhh----cCC--Cc------ccchHHHHHHH---HHHHhhCCcceEEec-C Q lcl|NC_013059. 22 SDEARREAKNDLFFSR--VSQWDDWLSQYTTL----QYR--GQ------FDVVRPVVRKL---VSEMRQNPIDVLYRP-K 83 (725) Q Consensus 22 ~~~~r~~a~~d~~f~~--G~QW~~~~~~~l~~----~gr--p~------~N~i~~~v~~v---~g~~~~nr~~~~~~p-r 83 (725) .++.+....+.+.++. -+.++++=...+++ .|. +. -+..+|.+|.| ++.-.+.-..-+..+ . T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~~~~nr~d~~v 80 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLY 80 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCccccHHHHHHHHHhhHHhCCcceEE Confidence 4444444444433332 12222222222222 121 11 13333333222 111111111111111 1 Q ss_pred CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecc-h--------h Q lcl|NC_013059. 84 DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA-C--------S 154 (725) Q Consensus 84 ~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~-~--------~ 154 (725) .|.+. .-+.+..++..+....-...-...++.++..+++.+ + +|. +.++.+....+ + . T Consensus 81 ~P~~~-~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~---G--------~G~-~ev~~d~~~~d~~~~~~~i~~~ 147 (725) T protein:vir:77 81 RPKDG-ARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEA---G--------VGA-WRLVTDYEDQSPTSNNQVIRRE 147 (725) T ss_pred ecCCc-cHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhc---C--------cce-eeeeecccCCCCCCCceeeEEe Confidence 34444 444566777777666655555666666666665543 1 221 11121110000 0 0 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhh-hhcccccccccCCCeEEE-EEEEEEecceeEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSF-QNPNDWVFPWLTQDTIQI-AEFYEVVEKKETA 232 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~~~~~~vrv-~E~w~~~~~~~~~ 232 (725) .|++||...-+|.. .+..+.+++.-+|-..-.+.++...+ ..+.....+|.......- ..-|+.. .+.++ T Consensus 148 ~~~~~~~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~vrv 219 (725) T protein:vir:77 148 PIHSACSHVIWDSN-------SKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQ-DTIQI 219 (725) T ss_pred ecccChhhceeCch-------hhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCC-CeeEE Confidence 12222222222211 11233333333332222222111111 111111222322111110 0112221 11122 Q ss_pred EEeeC--ccccceee-cch--hhhHHHHH-HHHhcchhhhhcc--ceeEEEEEEEEeeccccccCCCCC-CCCcc--ceE Q lcl|NC_013059. 233 FIYQD--PVTGEPVS-YFK--RDIKDVID-DLADSGFIKIAER--QIKRRRVYKSIITCTAVLKDKQLI-AGEHI--PIV 301 (725) Q Consensus 233 ~~~~d--~~~g~~~~-~~~--~~~~~~~~-~~~~~g~~~~~~~--~~~~~~v~~~~~~g~~~l~~~~~~-p~~~~--p~v 301 (725) ..+.- +.+..++. .++ ..+..... .+........... .+..+++.++-.....+. +.... -...+ .++ T Consensus 220 ~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~-g~~~l~~~~~~~~~~~ 298 (725) T protein:vir:77 220 AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIIT-CTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeec-CceeeccCCcCCCCcc Confidence 11110 11111111 111 01111000 0000000001111 122233332222222222 22211 11122 357 Q ss_pred EEEeeeeccCCccccchhhh--hhhhHHHHHHHHHHHHHHH-HHhcCCcceeechhhcchHHHHHHhhcccccccccccc Q lcl|NC_013059. 302 PVFGEWGFVEDKEVYEGVVR--LTKDGQRLRNMIMSFNADI-VARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) Q Consensus 302 P~~g~~~~~d~~~~~~G~vr--~~kd~Q~~~N~~~s~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) ||+++..++.. ..|... .++..=+..=....+.... +-..+........+..+.++...... ..+.... .+. T Consensus 299 P~vP~~g~r~~---~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~-~~~ 373 (725) T protein:vir:77 299 PIVPVFGEWGF---VEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY-DGNDDYP-YYL 373 (725) T ss_pred ceEEEeeeeec---cCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHH-HhccCCc-eec Confidence 77766543332 123222 3333222222222222111 11222222222222222221111111 1110000 000 Q ss_pred ccCccccccC--CcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccC----cchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 379 ENNGEMPTQP--LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVN----GGQVAYDTVNQLNMRADLETYVFQDN 452 (725) Q Consensus 379 ~~~g~~~~~~--~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~----~n~~Sg~ai~~~q~q~~~~~~~~~dn 452 (725) .+.-....+. .+.+...+.|+-....++........+.- ..|.. |..+++.+-.+....-..+...++. T Consensus 374 ~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~----~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~- 448 (725) T protein:vir:77 374 LNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKE----VATLGVDTEAVNGGQVAFDTVNQLNMRADLETYV- 448 (725) T ss_pred ccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHH----HhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHH- Confidence 0000001111 12222333333333455555555555433 33543 2222222112222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccc-----cccc----eE--EEEec Q lcl|NC_013059. 453 LATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDI-----RGRY----EC--YTDVG 521 (725) Q Consensus 453 ~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi-----~g~~----Dv--~v~~~ 521 (725) |-..+++.-+.+-.++..+... + -+.++.+.|-.+. . +-..+++|.- +|+. |+ ..++. T Consensus 449 ~~Dnl~~~~~~~g~~lL~lI~~-----~---~~~~rv~RI~~ed--~-~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~ 517 (725) T protein:vir:77 449 FQDNLATAMRRDGEIYQSIVND-----I---YDVPRNVTITLED--G-SEKDVQLMAEVVDLATGEKQVLNDIRGRYECY 517 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----H---cCCCcEEEEecCC--C-CcceeeecccccccccchhHhhhhhccceeeE Confidence 2222333333333333333211 1 1234555553221 1 1235555632 3332 22 13444 Q ss_pred cCc-hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHH Q lcl|NC_013059. 522 PSF-QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAK 600 (725) Q Consensus 522 p~~-~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~q 600 (725) .+. ++.-...-+.+..|++.++...|........+....+.+......+...............++ .....++.. T Consensus 518 v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~----~~~~e~q~~ 593 (725) T protein:vir:77 518 TDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKP----ETPEEQQWL 593 (725) T ss_pred EeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCC----CChhhHHHH Confidence 443 343333334444444444433333333333333344444433443333222111111111111 111122222 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 601 QGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRS 680 (725) Q Consensus 601 q~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~ 680 (725) +++++.++.++++++.++|+.+++++++.++++.+..+++.++...+++.. ...........+...+.+...+ T Consensus 594 ~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~-------~~aa~~~~~~~q~~~~q~a~~~ 666 (725) T protein:vir:77 594 VEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQ-------LNAARIAEIFNNMDLSKQSEFR 666 (725) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhHHHHHHH Confidence 333334444555555566666666666666655555444444333322211 1111111111111111122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHhcCCcccccCCCC Q lcl|NC_013059. 681 EDARANAELLLKGDEQTHKQRMEIANILQS-------QRQNQPSGSVAETPQ 725 (725) Q Consensus 681 ~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~-------~~~~q~~~~~~~~~q 725 (725) +.++..+....+.+ ..+++.++.....++ ...+.-.....++|- T Consensus 667 ~~~~~~~~~q~~~~-~~~~~~ae~~~~~~~~~~~q~~~~~~~~~~~~~~~~~ 717 (725) T protein:vir:77 667 EFLKTVASFQQDRS-EDARANAELLLKGDEQTHKQRMDIANILQSQRQNQPS 717 (725) T ss_pred HHHHHHHHHHHHHH-HHHHHHhHHHHHhhhHHHhhHHHHHHHHHHHHhcCCC Confidence 22222221111111 111111111111111 111111111122222 No 128 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=621 Identities=12% Similarity=0.073 Sum_probs=157.8 Q ss_pred CCcHHHHHHH---------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH---HHHHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLES---------ILSRFDADWTASDEARREAKNDLFFSRVSQWDD---WLSQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 mad~~~~~~~---------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~---~~~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |.|....... ++.++...+....+... -|.. .+..+.. |. .-.+.+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~R~~a~~d~~fy~--G~----Qw~~~~~~~l 62 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP------------KWRDAANKACAYYD--GD----QLPPEVLQVL 62 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHhhc--CC----CCCHHHHHHH Confidence 9987543322 22222222111111111 1211 1222222 31 1112222222 Q ss_pred HH-----------------HhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 69 SE-----------------MRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 69 g~-----------------~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) =. -...-..-+..+ ..|.+.+-++ .+..++..+....-.......++.++..+++.+ + T Consensus 63 ~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G 139 (714) T protein:vir:32 63 KDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---G 139 (714) T ss_pred HhcCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---C Confidence 21 111111111112 1244555664 366666666655555555666666666666554 1 Q ss_pred eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhh Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDI 198 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~ 198 (725) +.|.. ...+ .++...++..-..||...-+|.+ .+-.|.+++.=.|- .|+...... T Consensus 140 ~G~~~-~~~~------~d~~~~~i~i~~v~p~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i 205 (714) T protein:vir:32 140 LSWVE-VRRN------SDPFGPEFKVSTVSRNEVFWDWL-------SREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred cceEE-eccc------cCCCCCCeEEEecchhheeeccc-------cccCChhhccceeeeecCCHHHHHHhcCCchhhh Confidence 11110 0000 00110110001112222222211 01111222221111 122111111 Q ss_pred hhhhhcccccc--cc--cCCCeEEEEE-------------EEEEecceeEEEEeeCcccc-cee-ecchhhhHHH--HHH Q lcl|NC_013059. 199 PSFQNPNDWVF--PW--LTQDTIQIAE-------------FYEVVEKKETAFIYQDPVTG-EPV-SYFKRDIKDV--IDD 257 (725) Q Consensus 199 ~~~~~~~~~~~--~~--~~~~~vrv~E-------------~w~~~~~~~~~~~~~d~~~g-~~~-~~~~~~~~~~--~~~ 257 (725) .. ..++|.. +. .+.....+.. .|+... +.++.++...+.- ..+ .+.+.+...+ ... T Consensus 206 ~~--~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~ 282 (714) T protein:vir:32 206 DY--AIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRE-RRRVLLQVVYYRTFERLPVIELSNGRVVAFDKN 282 (714) T ss_pred hh--hhhhhccccccccccccccccccchhhhcccccccccccccc-ccEEEEEEEEEEEEEEEEeeccCCCceEEeCcc Confidence 11 1111110 00 0000111111 122211 1111111111000 000 0000000000 000 Q ss_pred -HHhcchhhhhccceeEEEEEEEEeeccccccCCCCC--CCCccc--eEEEEeeeeccCCccccchhhhhhhh-HHHHHH Q lcl|NC_013059. 258 -LADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI--AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKD-GQRLRN 331 (725) Q Consensus 258 -~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~--p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd-~Q~~~N 331 (725) ....-.+......+..++|.+..+. .++.+.-.. |. -|| .+||++++.+.+. ..|....++. .-+- T Consensus 283 ~~~~~~~~~~g~~~~~~~~~~rv~~~--~~~g~~~L~~~~~-p~p~~~fp~vp~~g~~~~---~~g~~~G~vr~~~d~-- 354 (714) T protein:vir:32 283 NLMQAVAVASGRVQVKVGRVSRIREA--WFVGPHFIVDRPC-SAPQGMFPLVPFWGYRKD---KTGEPYGLISRAIPA-- 354 (714) T ss_pred CHHHHHHHhhcchhhhccccceEEEE--EEecCcccccCCC-CCCCCceeEEEEeeeeee---ccCceeehhhhchhH-- Confidence 0000000001111111222221111 111111111 11 133 3555554433221 2333333221 1111 Q ss_pred HHHHHHHHHHHhc-CCcceeechhhcchHHHHHHhhccccccccccccccCccccc--------cCCcccCCCCchHHHH Q lcl|NC_013059. 332 MIMSFNADIVART-PKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT--------QPLAYYENPEVPQANA 402 (725) Q Consensus 332 ~~~s~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--------~~~~~~~~~~~~~~~~ 402 (725) .+.++.. ++-.++..... .+.. ....+............++|.+.- .++.++...+-++-.. T Consensus 355 ------Qr~~N~~~s~~~~~l~~~~--~~~~-~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~ 425 (714) T protein:vir:32 355 ------QDEVNFRRIKLTWLLQAKR--VIMD-EDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVAS 425 (714) T ss_pred ------HHHHHHHHHHHHHhhcCCc--eeee-cCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccH Confidence 1111111 00111111100 0000 000000000000011122222221 1223344444444556 Q ss_pred HHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEec Q lcl|NC_013059. 403 YMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITL 482 (725) Q Consensus 403 ~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~ 482 (725) ..++......+.+--++...-...|..+++.+=.+....-..+...+.. +-..+++..+.+-.++..+.- .+. T Consensus 426 ~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~-----~~~- 498 (714) T protein:vir:32 426 QQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLLAYLL-----DDL- 498 (714) T ss_pred HHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 6666666666555544322222223333322111222212222222221 112222223333333332221 111 Q ss_pred cCCCcceEEeccccccccCCceeeeccccc----cceE---EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH Q lcl|NC_013059. 483 EDGSEKEVQLMAEVVDLATGERQVLNDIRG----RYEC---YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) Q Consensus 483 ~d~~~~~v~in~~~~d~~~g~~~~~nDi~g----~~Dv---~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~ 555 (725) +.++.+.|..+......-..+.+|.-.| .-|| .+++..+....-....++..+.|..+ +. T Consensus 499 --~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l-----------~~ 565 (714) T protein:vir:32 499 --KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV-----------IQ 565 (714) T ss_pred --CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHH-----------Hh Confidence 2344454432211111123455664432 1243 44555554443333333333222211 11 Q ss_pred hhc-cCCchhHHHHHHH---Hhhhhhhhhhhhccch--hhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 556 YFT-LLDGKGVEMMRDY---ANKQLIQMGVKKPETP--EEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 556 ~~~-~~d~~~~~~i~e~---~~kq~~~~~~~~~~~~--e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) .++ .+..-..+-+++. .++......+.+.... ...+..++++++++++++.+..++++++.+ ++++. T Consensus 566 ~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~-------~~a~~ 638 (714) T protein:vir:32 566 GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE-------MAGRV 638 (714) T ss_pred hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 111 1111111222221 1111122222111000 001111111111111111122222222223 33333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 630 LSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR--ANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 630 ~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~--~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++.++++.+++++++....+......++... +... +..+++.+. ...+- .+......++++ .+. T Consensus 639 ~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~---------~~~~--~~~~a~~a~~~~~~~~-~~~~~~~~~~q~--~q~ 704 (714) T protein:vir:32 639 AKLEADAARAHAAAQRDNASAQREVALTQGQ---------RYVD--ALNQAHTAEIITGVQN-MEQEQDVLQQQM--LYT 704 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHhHhh-hhhhhHHHHHHH--HHH Confidence 3333333333333333332222111111111 0000 001111111 00111 011111112222 112 Q ss_pred HHHHHhcCCc Q lcl|NC_013059. 708 LQSQRQNQPS 717 (725) Q Consensus 708 ~~~~~~~q~~ 717 (725) .+.+..+.+- T Consensus 705 ~~~~~~~~~~ 714 (714) T protein:vir:32 705 LQQRMNEMSL 714 (714) T ss_pred HHHHHHhcCC Confidence 1222222222 No 129 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=621 Identities=12% Similarity=0.073 Sum_probs=157.8 Q ss_pred CCcHHHHHHH---------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH---HHHHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLES---------ILSRFDADWTASDEARREAKNDLFFSRVSQWDD---WLSQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 mad~~~~~~~---------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~---~~~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |.|....... ++.++...+....+... -|.. .+..+.. |. .-.+.+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~R~~a~~d~~fy~--G~----Qw~~~~~~~l 62 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP------------KWRDAANKACAYYD--GD----QLPPEVLQVL 62 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHhhc--CC----CCCHHHHHHH Confidence 9987543322 22222222111111111 1211 1222222 31 1112222222 Q ss_pred HH-----------------HhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 69 SE-----------------MRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 69 g~-----------------~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) =. -...-..-+..+ ..|.+.+-++ .+..++..+....-.......++.++..+++.+ + T Consensus 63 ~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G 139 (714) T protein:vir:81 63 KDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---G 139 (714) T ss_pred HhcCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---C Confidence 21 111111111112 1244555664 366666666655555555666666666666554 1 Q ss_pred eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhh Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDI 198 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~ 198 (725) +.|.. ...+ .++...++..-..||...-+|.+ .+-.|.+++.=.|- .|+...... T Consensus 140 ~G~~~-~~~~------~d~~~~~i~i~~v~p~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i 205 (714) T protein:vir:81 140 LSWVE-VRRN------SDPFGPEFKVSTVSRNEVFWDWL-------SREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred cceEE-eccc------cCCCCCCeEEEecchhheeeccc-------cccCChhhccceeeeecCCHHHHHHhcCCchhhh Confidence 11110 0000 00110110001112222222211 01111222221111 122111111 Q ss_pred hhhhhcccccc--cc--cCCCeEEEEE-------------EEEEecceeEEEEeeCcccc-cee-ecchhhhHHH--HHH Q lcl|NC_013059. 199 PSFQNPNDWVF--PW--LTQDTIQIAE-------------FYEVVEKKETAFIYQDPVTG-EPV-SYFKRDIKDV--IDD 257 (725) Q Consensus 199 ~~~~~~~~~~~--~~--~~~~~vrv~E-------------~w~~~~~~~~~~~~~d~~~g-~~~-~~~~~~~~~~--~~~ 257 (725) .. ..++|.. +. .+.....+.. .|+... +.++.++...+.- ..+ .+.+.+...+ ... T Consensus 206 ~~--~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~ 282 (714) T protein:vir:81 206 DY--AIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRE-RRRVLLQVVYYRTFERLPVIELSNGRVVAFDKN 282 (714) T ss_pred hh--hhhhhccccccccccccccccccchhhhcccccccccccccc-ccEEEEEEEEEEEEEEEEeeccCCCceEEeCcc Confidence 11 1111110 00 0000111111 122211 1111111111000 000 0000000000 000 Q ss_pred -HHhcchhhhhccceeEEEEEEEEeeccccccCCCCC--CCCccc--eEEEEeeeeccCCccccchhhhhhhh-HHHHHH Q lcl|NC_013059. 258 -LADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI--AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKD-GQRLRN 331 (725) Q Consensus 258 -~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~--p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd-~Q~~~N 331 (725) ....-.+......+..++|.+..+. .++.+.-.. |. -|| .+||++++.+.+. ..|....++. .-+- T Consensus 283 ~~~~~~~~~~g~~~~~~~~~~rv~~~--~~~g~~~L~~~~~-p~p~~~fp~vp~~g~~~~---~~g~~~G~vr~~~d~-- 354 (714) T protein:vir:81 283 NLMQAVAVASGRVQVKVGRVSRIREA--WFVGPHFIVDRPC-SAPQGMFPLVPFWGYRKD---KTGEPYGLISRAIPA-- 354 (714) T ss_pred CHHHHHHHhhcchhhhccccceEEEE--EEecCcccccCCC-CCCCCceeEEEEeeeeee---ccCceeehhhhchhH-- Confidence 0000000001111111222221111 111111111 11 133 3555554433221 2333333221 1111 Q ss_pred HHHHHHHHHHHhc-CCcceeechhhcchHHHHHHhhccccccccccccccCccccc--------cCCcccCCCCchHHHH Q lcl|NC_013059. 332 MIMSFNADIVART-PKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT--------QPLAYYENPEVPQANA 402 (725) Q Consensus 332 ~~~s~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--------~~~~~~~~~~~~~~~~ 402 (725) .+.++.. ++-.++..... .+.. ....+............++|.+.- .++.++...+-++-.. T Consensus 355 ------Qr~~N~~~s~~~~~l~~~~--~~~~-~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~ 425 (714) T protein:vir:81 355 ------QDEVNFRRIKLTWLLQAKR--VIMD-EDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVAS 425 (714) T ss_pred ------HHHHHHHHHHHHHhhcCCc--eeee-cCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccH Confidence 1111111 00111111100 0000 000000000000011122222221 1223344444444556 Q ss_pred HHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEec Q lcl|NC_013059. 403 YMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITL 482 (725) Q Consensus 403 ~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~ 482 (725) ..++......+.+--++...-...|..+++.+=.+....-..+...+.. +-..+++..+.+-.++..+.- .+. T Consensus 426 ~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~-----~~~- 498 (714) T protein:vir:81 426 QQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLLAYLL-----DDL- 498 (714) T ss_pred HHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 6666666666555544322222223333322111222212222222221 112222223333333332221 111 Q ss_pred cCCCcceEEeccccccccCCceeeeccccc----cceE---EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH Q lcl|NC_013059. 483 EDGSEKEVQLMAEVVDLATGERQVLNDIRG----RYEC---YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) Q Consensus 483 ~d~~~~~v~in~~~~d~~~g~~~~~nDi~g----~~Dv---~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~ 555 (725) +.++.+.|..+......-..+.+|.-.| .-|| .+++..+....-....++..+.|..+ +. T Consensus 499 --~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l-----------~~ 565 (714) T protein:vir:81 499 --KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV-----------IQ 565 (714) T ss_pred --CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHH-----------Hh Confidence 2344454432211111123455664432 1243 44555554443333333333222211 11 Q ss_pred hhc-cCCchhHHHHHHH---Hhhhhhhhhhhhccch--hhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 556 YFT-LLDGKGVEMMRDY---ANKQLIQMGVKKPETP--EEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 556 ~~~-~~d~~~~~~i~e~---~~kq~~~~~~~~~~~~--e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) .++ .+..-..+-+++. .++......+.+.... ...+..++++++++++++.+..++++++.+ ++++. T Consensus 566 ~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~-------~~a~~ 638 (714) T protein:vir:81 566 GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE-------MAGRV 638 (714) T ss_pred hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 111 1111111222221 1111122222111000 001111111111111111122222222223 33333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 630 LSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR--ANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 630 ~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~--~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++.++++.+++++++....+......++... +... +..+++.+. ...+- .+......++++ .+. T Consensus 639 ~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~---------~~~~--~~~~a~~a~~~~~~~~-~~~~~~~~~~q~--~q~ 704 (714) T protein:vir:81 639 AKLEADAARAHAAAQRDNASAQREVALTQGQ---------RYVD--ALNQAHTAEIITGVQN-MEQEQDVLQQQM--LYT 704 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHhHhh-hhhhhHHHHHHH--HHH Confidence 3333333333333333332222111111111 0000 001111111 00111 011111112222 112 Q ss_pred HHHHHhcCCc Q lcl|NC_013059. 708 LQSQRQNQPS 717 (725) Q Consensus 708 ~~~~~~~q~~ 717 (725) .+.+..+.+- T Consensus 705 ~~~~~~~~~~ 714 (714) T protein:vir:81 705 LQQRMNEMSL 714 (714) T ss_pred HHHHHHhcCC Confidence 1222222222 No 130 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=621 Identities=12% Similarity=0.073 Sum_probs=157.8 Q ss_pred CCcHHHHHHH---------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH---HHHHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLES---------ILSRFDADWTASDEARREAKNDLFFSRVSQWDD---WLSQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 mad~~~~~~~---------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~---~~~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |.|....... ++.++...+....+... -|.. .+..+.. |. .-.+.+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~R~~a~~d~~fy~--G~----Qw~~~~~~~l 62 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP------------KWRDAANKACAYYD--GD----QLPPEVLQVL 62 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHhhc--CC----CCCHHHHHHH Confidence 9987543322 22222222111111111 1211 1222222 31 1112222222 Q ss_pred HH-----------------HhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 69 SE-----------------MRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 69 g~-----------------~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) =. -...-..-+..+ ..|.+.+-++ .+..++..+....-.......++.++..+++.+ + T Consensus 63 ~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G 139 (714) T protein:vir:27 63 KDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---G 139 (714) T ss_pred HhcCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---C Confidence 21 111111111112 1244555664 366666666655555555666666666666554 1 Q ss_pred eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhh Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDI 198 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~ 198 (725) +.|.. ...+ .++...++..-..||...-+|.+ .+-.|.+++.=.|- .|+...... T Consensus 140 ~G~~~-~~~~------~d~~~~~i~i~~v~p~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i 205 (714) T protein:vir:27 140 LSWVE-VRRN------SDPFGPEFKVSTVSRNEVFWDWL-------SREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred cceEE-eccc------cCCCCCCeEEEecchhheeeccc-------cccCChhhccceeeeecCCHHHHHHhcCCchhhh Confidence 11110 0000 00110110001112222222211 01111222221111 122111111 Q ss_pred hhhhhcccccc--cc--cCCCeEEEEE-------------EEEEecceeEEEEeeCcccc-cee-ecchhhhHHH--HHH Q lcl|NC_013059. 199 PSFQNPNDWVF--PW--LTQDTIQIAE-------------FYEVVEKKETAFIYQDPVTG-EPV-SYFKRDIKDV--IDD 257 (725) Q Consensus 199 ~~~~~~~~~~~--~~--~~~~~vrv~E-------------~w~~~~~~~~~~~~~d~~~g-~~~-~~~~~~~~~~--~~~ 257 (725) .. ..++|.. +. .+.....+.. .|+... +.++.++...+.- ..+ .+.+.+...+ ... T Consensus 206 ~~--~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~ 282 (714) T protein:vir:27 206 DY--AIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRE-RRRVLLQVVYYRTFERLPVIELSNGRVVAFDKN 282 (714) T ss_pred hh--hhhhhccccccccccccccccccchhhhcccccccccccccc-ccEEEEEEEEEEEEEEEEeeccCCCceEEeCcc Confidence 11 1111110 00 0000111111 122211 1111111111000 000 0000000000 000 Q ss_pred -HHhcchhhhhccceeEEEEEEEEeeccccccCCCCC--CCCccc--eEEEEeeeeccCCccccchhhhhhhh-HHHHHH Q lcl|NC_013059. 258 -LADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI--AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKD-GQRLRN 331 (725) Q Consensus 258 -~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~--p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd-~Q~~~N 331 (725) ....-.+......+..++|.+..+. .++.+.-.. |. -|| .+||++++.+.+. ..|....++. .-+- T Consensus 283 ~~~~~~~~~~g~~~~~~~~~~rv~~~--~~~g~~~L~~~~~-p~p~~~fp~vp~~g~~~~---~~g~~~G~vr~~~d~-- 354 (714) T protein:vir:27 283 NLMQAVAVASGRVQVKVGRVSRIREA--WFVGPHFIVDRPC-SAPQGMFPLVPFWGYRKD---KTGEPYGLISRAIPA-- 354 (714) T ss_pred CHHHHHHHhhcchhhhccccceEEEE--EEecCcccccCCC-CCCCCceeEEEEeeeeee---ccCceeehhhhchhH-- Confidence 0000000001111111222221111 111111111 11 133 3555554433221 2333333221 1111 Q ss_pred HHHHHHHHHHHhc-CCcceeechhhcchHHHHHHhhccccccccccccccCccccc--------cCCcccCCCCchHHHH Q lcl|NC_013059. 332 MIMSFNADIVART-PKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT--------QPLAYYENPEVPQANA 402 (725) Q Consensus 332 ~~~s~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--------~~~~~~~~~~~~~~~~ 402 (725) .+.++.. ++-.++..... .+.. ....+............++|.+.- .++.++...+-++-.. T Consensus 355 ------Qr~~N~~~s~~~~~l~~~~--~~~~-~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~ 425 (714) T protein:vir:27 355 ------QDEVNFRRIKLTWLLQAKR--VIMD-EDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVAS 425 (714) T ss_pred ------HHHHHHHHHHHHHhhcCCc--eeee-cCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccH Confidence 1111111 00111111100 0000 000000000000011122222221 1223344444444556 Q ss_pred HHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEec Q lcl|NC_013059. 403 YMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITL 482 (725) Q Consensus 403 ~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~ 482 (725) ..++......+.+--++...-...|..+++.+=.+....-..+...+.. +-..+++..+.+-.++..+.- .+. T Consensus 426 ~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~-----~~~- 498 (714) T protein:vir:27 426 QQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLLAYLL-----DDL- 498 (714) T ss_pred HHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 6666666666555544322222223333322111222212222222221 112222223333333332221 111 Q ss_pred cCCCcceEEeccccccccCCceeeeccccc----cceE---EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH Q lcl|NC_013059. 483 EDGSEKEVQLMAEVVDLATGERQVLNDIRG----RYEC---YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) Q Consensus 483 ~d~~~~~v~in~~~~d~~~g~~~~~nDi~g----~~Dv---~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~ 555 (725) +.++.+.|..+......-..+.+|.-.| .-|| .+++..+....-....++..+.|..+ +. T Consensus 499 --~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l-----------~~ 565 (714) T protein:vir:27 499 --KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV-----------IQ 565 (714) T ss_pred --CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHH-----------Hh Confidence 2344454432211111123455664432 1243 44555554443333333333222211 11 Q ss_pred hhc-cCCchhHHHHHHH---Hhhhhhhhhhhhccch--hhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 556 YFT-LLDGKGVEMMRDY---ANKQLIQMGVKKPETP--EEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 556 ~~~-~~d~~~~~~i~e~---~~kq~~~~~~~~~~~~--e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) .++ .+..-..+-+++. .++......+.+.... ...+..++++++++++++.+..++++++.+ ++++. T Consensus 566 ~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~-------~~a~~ 638 (714) T protein:vir:27 566 GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE-------MAGRV 638 (714) T ss_pred hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 111 1111111222221 1111122222111000 001111111111111111122222222223 33333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 630 LSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR--ANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 630 ~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~--~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++.++++.+++++++....+......++... +... +..+++.+. ...+- .+......++++ .+. T Consensus 639 ~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~---------~~~~--~~~~a~~a~~~~~~~~-~~~~~~~~~~q~--~q~ 704 (714) T protein:vir:27 639 AKLEADAARAHAAAQRDNASAQREVALTQGQ---------RYVD--ALNQAHTAEIITGVQN-MEQEQDVLQQQM--LYT 704 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHhHhh-hhhhhHHHHHHH--HHH Confidence 3333333333333333332222111111111 0000 001111111 00111 011111112222 112 Q ss_pred HHHHHhcCCc Q lcl|NC_013059. 708 LQSQRQNQPS 717 (725) Q Consensus 708 ~~~~~~~q~~ 717 (725) .+.+..+.+- T Consensus 705 ~~~~~~~~~~ 714 (714) T protein:vir:27 705 LQQRMNEMSL 714 (714) T ss_pred HHHHHHhcCC Confidence 1222222222 No 131 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=621 Identities=12% Similarity=0.073 Sum_probs=157.8 Q ss_pred CCcHHHHHHH---------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH---HHHHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLES---------ILSRFDADWTASDEARREAKNDLFFSRVSQWDD---WLSQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 mad~~~~~~~---------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~---~~~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |.|....... ++.++...+....+... -|.. .+..+.. |. .-.+.+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~R~~a~~d~~fy~--G~----Qw~~~~~~~l 62 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP------------KWRDAANKACAYYD--GD----QLPPEVLQVL 62 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHhhc--CC----CCCHHHHHHH Confidence 9987543322 22222222111111111 1211 1222222 31 1112222222 Q ss_pred HH-----------------HhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 69 SE-----------------MRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 69 g~-----------------~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) =. -...-..-+..+ ..|.+.+-++ .+..++..+....-.......++.++..+++.+ + T Consensus 63 ~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G 139 (714) T protein:vir:10 63 KDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---G 139 (714) T ss_pred HhcCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---C Confidence 21 111111111112 1244555664 366666666655555555666666666666554 1 Q ss_pred eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhh Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDI 198 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~ 198 (725) +.|.. ...+ .++...++..-..||...-+|.+ .+-.|.+++.=.|- .|+...... T Consensus 140 ~G~~~-~~~~------~d~~~~~i~i~~v~p~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i 205 (714) T protein:vir:10 140 LSWVE-VRRN------SDPFGPEFKVSTVSRNEVFWDWL-------SREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred cceEE-eccc------cCCCCCCeEEEecchhheeeccc-------cccCChhhccceeeeecCCHHHHHHhcCCchhhh Confidence 11110 0000 00110110001112222222211 01111222221111 122111111 Q ss_pred hhhhhcccccc--cc--cCCCeEEEEE-------------EEEEecceeEEEEeeCcccc-cee-ecchhhhHHH--HHH Q lcl|NC_013059. 199 PSFQNPNDWVF--PW--LTQDTIQIAE-------------FYEVVEKKETAFIYQDPVTG-EPV-SYFKRDIKDV--IDD 257 (725) Q Consensus 199 ~~~~~~~~~~~--~~--~~~~~vrv~E-------------~w~~~~~~~~~~~~~d~~~g-~~~-~~~~~~~~~~--~~~ 257 (725) .. ..++|.. +. .+.....+.. .|+... +.++.++...+.- ..+ .+.+.+...+ ... T Consensus 206 ~~--~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~ 282 (714) T protein:vir:10 206 DY--AIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRE-RRRVLLQVVYYRTFERLPVIELSNGRVVAFDKN 282 (714) T ss_pred hh--hhhhhccccccccccccccccccchhhhcccccccccccccc-ccEEEEEEEEEEEEEEEEeeccCCCceEEeCcc Confidence 11 1111110 00 0000111111 122211 1111111111000 000 0000000000 000 Q ss_pred -HHhcchhhhhccceeEEEEEEEEeeccccccCCCCC--CCCccc--eEEEEeeeeccCCccccchhhhhhhh-HHHHHH Q lcl|NC_013059. 258 -LADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI--AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKD-GQRLRN 331 (725) Q Consensus 258 -~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~--p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd-~Q~~~N 331 (725) ....-.+......+..++|.+..+. .++.+.-.. |. -|| .+||++++.+.+. ..|....++. .-+- T Consensus 283 ~~~~~~~~~~g~~~~~~~~~~rv~~~--~~~g~~~L~~~~~-p~p~~~fp~vp~~g~~~~---~~g~~~G~vr~~~d~-- 354 (714) T protein:vir:10 283 NLMQAVAVASGRVQVKVGRVSRIREA--WFVGPHFIVDRPC-SAPQGMFPLVPFWGYRKD---KTGEPYGLISRAIPA-- 354 (714) T ss_pred CHHHHHHHhhcchhhhccccceEEEE--EEecCcccccCCC-CCCCCceeEEEEeeeeee---ccCceeehhhhchhH-- Confidence 0000000001111111222221111 111111111 11 133 3555554433221 2333333221 1111 Q ss_pred HHHHHHHHHHHhc-CCcceeechhhcchHHHHHHhhccccccccccccccCccccc--------cCCcccCCCCchHHHH Q lcl|NC_013059. 332 MIMSFNADIVART-PKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT--------QPLAYYENPEVPQANA 402 (725) Q Consensus 332 ~~~s~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--------~~~~~~~~~~~~~~~~ 402 (725) .+.++.. ++-.++..... .+.. ....+............++|.+.- .++.++...+-++-.. T Consensus 355 ------Qr~~N~~~s~~~~~l~~~~--~~~~-~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~ 425 (714) T protein:vir:10 355 ------QDEVNFRRIKLTWLLQAKR--VIMD-EDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVAS 425 (714) T ss_pred ------HHHHHHHHHHHHHhhcCCc--eeee-cCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccH Confidence 1111111 00111111100 0000 000000000000011122222221 1223344444444556 Q ss_pred HHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEec Q lcl|NC_013059. 403 YMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITL 482 (725) Q Consensus 403 ~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~ 482 (725) ..++......+.+--++...-...|..+++.+=.+....-..+...+.. +-..+++..+.+-.++..+.- .+. T Consensus 426 ~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~-----~~~- 498 (714) T protein:vir:10 426 QQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLLAYLL-----DDL- 498 (714) T ss_pred HHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 6666666666555544322222223333322111222212222222221 112222223333333332221 111 Q ss_pred cCCCcceEEeccccccccCCceeeeccccc----cceE---EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH Q lcl|NC_013059. 483 EDGSEKEVQLMAEVVDLATGERQVLNDIRG----RYEC---YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) Q Consensus 483 ~d~~~~~v~in~~~~d~~~g~~~~~nDi~g----~~Dv---~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~ 555 (725) +.++.+.|..+......-..+.+|.-.| .-|| .+++..+....-....++..+.|..+ +. T Consensus 499 --~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l-----------~~ 565 (714) T protein:vir:10 499 --KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV-----------IQ 565 (714) T ss_pred --CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHH-----------Hh Confidence 2344454432211111123455664432 1243 44555554443333333333222211 11 Q ss_pred hhc-cCCchhHHHHHHH---Hhhhhhhhhhhhccch--hhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 556 YFT-LLDGKGVEMMRDY---ANKQLIQMGVKKPETP--EEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 556 ~~~-~~d~~~~~~i~e~---~~kq~~~~~~~~~~~~--e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) .++ .+..-..+-+++. .++......+.+.... ...+..++++++++++++.+..++++++.+ ++++. T Consensus 566 ~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~-------~~a~~ 638 (714) T protein:vir:10 566 GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE-------MAGRV 638 (714) T ss_pred hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 111 1111111222221 1111122222111000 001111111111111111122222222223 33333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 630 LSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR--ANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 630 ~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~--~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++.++++.+++++++....+......++... +... +..+++.+. ...+- .+......++++ .+. T Consensus 639 ~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~---------~~~~--~~~~a~~a~~~~~~~~-~~~~~~~~~~q~--~q~ 704 (714) T protein:vir:10 639 AKLEADAARAHAAAQRDNASAQREVALTQGQ---------RYVD--ALNQAHTAEIITGVQN-MEQEQDVLQQQM--LYT 704 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHhHhh-hhhhhHHHHHHH--HHH Confidence 3333333333333333332222111111111 0000 001111111 00111 011111112222 112 Q ss_pred HHHHHhcCCc Q lcl|NC_013059. 708 LQSQRQNQPS 717 (725) Q Consensus 708 ~~~~~~~q~~ 717 (725) .+.+..+.+- T Consensus 705 ~~~~~~~~~~ 714 (714) T protein:vir:10 705 LQQRMNEMSL 714 (714) T ss_pred HHHHHHhcCC Confidence 1222222222 No 132 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=621 Identities=12% Similarity=0.073 Sum_probs=157.8 Q ss_pred CCcHHHHHHH---------HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCH---HHHHHHhhcCCCcccchHHHHHHHH Q lcl|NC_013059. 1 MADNKNRLES---------ILSRFDADWTASDEARREAKNDLFFSRVSQWDD---WLSQYTTLQYRGQFDVVRPVVRKLV 68 (725) Q Consensus 1 mad~~~~~~~---------~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~---~~~~~l~~~grp~~N~i~~~v~~v~ 68 (725) |.|....... ++.++...+....+... -|.. .+..+.. |. .-.+.+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~R~~a~~d~~fy~--G~----Qw~~~~~~~l 62 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP------------KWRDAANKACAYYD--GD----QLPPEVLQVL 62 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHhhc--CC----CCCHHHHHHH Confidence 9987543322 22222222111111111 1211 1222222 31 1112222222 Q ss_pred HH-----------------HhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEE Q lcl|NC_013059. 69 SE-----------------MRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLV 129 (725) Q Consensus 69 g~-----------------~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~ 129 (725) =. -...-..-+..+ ..|.+.+-++ .+..++..+....-.......++.++..+++.+ + T Consensus 63 ~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G 139 (714) T protein:vir:99 63 KDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---G 139 (714) T ss_pred HhcCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---C Confidence 21 111111111112 1244555664 366666666655555555666666666666554 1 Q ss_pred eeeccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhh Q lcl|NC_013059. 130 TDYEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDI 198 (725) Q Consensus 130 ~~~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~ 198 (725) +.|.. ...+ .++...++..-..||...-+|.+ .+-.|.+++.=.|- .|+...... T Consensus 140 ~G~~~-~~~~------~d~~~~~i~i~~v~p~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i 205 (714) T protein:vir:99 140 LSWVE-VRRN------SDPFGPEFKVSTVSRNEVFWDWL-------SREADLSDCRWLMRRRWMDTDEAKATFPGMAQVI 205 (714) T ss_pred cceEE-eccc------cCCCCCCeEEEecchhheeeccc-------cccCChhhccceeeeecCCHHHHHHhcCCchhhh Confidence 11110 0000 00110110001112222222211 01111222221111 122111111 Q ss_pred hhhhhcccccc--cc--cCCCeEEEEE-------------EEEEecceeEEEEeeCcccc-cee-ecchhhhHHH--HHH Q lcl|NC_013059. 199 PSFQNPNDWVF--PW--LTQDTIQIAE-------------FYEVVEKKETAFIYQDPVTG-EPV-SYFKRDIKDV--IDD 257 (725) Q Consensus 199 ~~~~~~~~~~~--~~--~~~~~vrv~E-------------~w~~~~~~~~~~~~~d~~~g-~~~-~~~~~~~~~~--~~~ 257 (725) .. ..++|.. +. .+.....+.. .|+... +.++.++...+.- ..+ .+.+.+...+ ... T Consensus 206 ~~--~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~ 282 (714) T protein:vir:99 206 DY--AIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRE-RRRVLLQVVYYRTFERLPVIELSNGRVVAFDKN 282 (714) T ss_pred hh--hhhhhccccccccccccccccccchhhhcccccccccccccc-ccEEEEEEEEEEEEEEEEeeccCCCceEEeCcc Confidence 11 1111110 00 0000111111 122211 1111111111000 000 0000000000 000 Q ss_pred -HHhcchhhhhccceeEEEEEEEEeeccccccCCCCC--CCCccc--eEEEEeeeeccCCccccchhhhhhhh-HHHHHH Q lcl|NC_013059. 258 -LADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI--AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKD-GQRLRN 331 (725) Q Consensus 258 -~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~--p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd-~Q~~~N 331 (725) ....-.+......+..++|.+..+. .++.+.-.. |. -|| .+||++++.+.+. ..|....++. .-+- T Consensus 283 ~~~~~~~~~~g~~~~~~~~~~rv~~~--~~~g~~~L~~~~~-p~p~~~fp~vp~~g~~~~---~~g~~~G~vr~~~d~-- 354 (714) T protein:vir:99 283 NLMQAVAVASGRVQVKVGRVSRIREA--WFVGPHFIVDRPC-SAPQGMFPLVPFWGYRKD---KTGEPYGLISRAIPA-- 354 (714) T ss_pred CHHHHHHHhhcchhhhccccceEEEE--EEecCcccccCCC-CCCCCceeEEEEeeeeee---ccCceeehhhhchhH-- Confidence 0000000001111111222221111 111111111 11 133 3555554433221 2333333221 1111 Q ss_pred HHHHHHHHHHHhc-CCcceeechhhcchHHHHHHhhccccccccccccccCccccc--------cCCcccCCCCchHHHH Q lcl|NC_013059. 332 MIMSFNADIVART-PKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPT--------QPLAYYENPEVPQANA 402 (725) Q Consensus 332 ~~~s~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--------~~~~~~~~~~~~~~~~ 402 (725) .+.++.. ++-.++..... .+.. ....+............++|.+.- .++.++...+-++-.. T Consensus 355 ------Qr~~N~~~s~~~~~l~~~~--~~~~-~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~ 425 (714) T protein:vir:99 355 ------QDEVNFRRIKLTWLLQAKR--VIMD-EDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVAS 425 (714) T ss_pred ------HHHHHHHHHHHHHhhcCCc--eeee-cCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccH Confidence 1111111 00111111100 0000 000000000000011122222221 1223344444444556 Q ss_pred HHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEec Q lcl|NC_013059. 403 YMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITL 482 (725) Q Consensus 403 ~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~ 482 (725) ..++......+.+--++...-...|..+++.+=.+....-..+...+.. +-..+++..+.+-.++..+.- .+. T Consensus 426 ~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~-----~~~- 498 (714) T protein:vir:99 426 QQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE-INDNYQFACQQVGRLLLAYLL-----DDL- 498 (714) T ss_pred HHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-----HHc- Confidence 6666666666555544322222223333322111222212222222221 112222223333333332221 111 Q ss_pred cCCCcceEEeccccccccCCceeeeccccc----cceE---EEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHH Q lcl|NC_013059. 483 EDGSEKEVQLMAEVVDLATGERQVLNDIRG----RYEC---YTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) Q Consensus 483 ~d~~~~~v~in~~~~d~~~g~~~~~nDi~g----~~Dv---~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~ 555 (725) +.++.+.|..+......-..+.+|.-.| .-|| .+++..+....-....++..+.|..+ +. T Consensus 499 --~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l-----------~~ 565 (714) T protein:vir:99 499 --KKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEV-----------IQ 565 (714) T ss_pred --CCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHH-----------Hh Confidence 2344454432211111123455664432 1243 44555554443333333333222211 11 Q ss_pred hhc-cCCchhHHHHHHH---Hhhhhhhhhhhhccch--hhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 556 YFT-LLDGKGVEMMRDY---ANKQLIQMGVKKPETP--EEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) Q Consensus 556 ~~~-~~d~~~~~~i~e~---~~kq~~~~~~~~~~~~--e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~ 629 (725) .++ .+..-..+-+++. .++......+.+.... ...+..++++++++++++.+..++++++.+ ++++. T Consensus 566 ~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~-------~~a~~ 638 (714) T protein:vir:99 566 GLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE-------MAGRV 638 (714) T ss_pred hcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHH-------HHHHH Confidence 111 1111111222221 1111122222111000 001111111111111111122222222223 33333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 630 LSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDAR--ANAELLLKGDEQTHKQRMEIANI 707 (725) Q Consensus 630 ~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~--~~aE~~~~~~~q~~~q~~e~~~~ 707 (725) ++.++++.+++++++....+......++... +... +..+++.+. ...+- .+......++++ .+. T Consensus 639 ~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~---------~~~~--~~~~a~~a~~~~~~~~-~~~~~~~~~~q~--~q~ 704 (714) T protein:vir:99 639 AKLEADAARAHAAAQRDNASAQREVALTQGQ---------RYVD--ALNQAHTAEIITGVQN-MEQEQDVLQQQM--LYT 704 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHH--HHHHHHHHHHHHhHhh-hhhhhHHHHHHH--HHH Confidence 3333333333333333332222111111111 0000 001111111 00111 011111112222 112 Q ss_pred HHHHHhcCCc Q lcl|NC_013059. 708 LQSQRQNQPS 717 (725) Q Consensus 708 ~~~~~~~q~~ 717 (725) .+.+..+.+- T Consensus 705 ~~~~~~~~~~ 714 (714) T protein:vir:99 705 LQQRMNEMSL 714 (714) T ss_pred HHHHHHhcCC Confidence 1222222222 No 133 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.58 E-value=4.2e-05 Score=44.58 Aligned_cols=460 Identities=10% Similarity=0.009 Sum_probs=165.8 Q ss_pred CC-------c---HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc-ccchHHHHHHHHH Q lcl|NC_013059. 1 MA-------D---NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ-FDVVRPVVRKLVS 69 (725) Q Consensus 1 ma-------d---~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~-~N~i~~~v~~v~g 69 (725) |- | ....+...+..|+...+...--+..+......-....|+.+.. +..-..|-+ +|..+.+|+.++| T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~-Y~~rl~rA~~~n~~~~tl~~l~G 79 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEAR-QAEYEAGGIVYNFTRRTLSGMVG 79 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHH-HHHHHhccccCChHHHHHHHHhc Confidence 32 1 1122222222222221111100111111112222345654442 333333443 5999999999999 Q ss_pred HHhhCCcceEEecCCcchHHHHHHHHHHHHHH-HHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC--------- Q lcl|NC_013059. 70 EMRQNPIDVLYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS--------- 139 (725) Q Consensus 70 ~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~--------- 139 (725) .--...|.+.+ -+.|..++..+ .+-++.+.-...+|..++.+|.+|+=| ||-.++... T Consensus 80 ~vfrk~p~~~~----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV--D~P~~~~~T~ade~~~~~ 147 (489) T protein:vir:78 80 SVMRKEPEINI----------PKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLV--DAPETGAATAAEQNAGLL 147 (489) T ss_pred hhhcCCcceec----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE--eeCCCCCcCHHHHHHhcC Confidence 88777766532 22345555554 467788999999999999999999655 443322110 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) .|+..- ..+.+| .|+.....+.. ..+...++.-.. - .....+.|.+.....+|| T Consensus 148 rPy~~~-----~~~~~I-inW~~~~v~G~--~~Lt~v~lrE~~------~------------~~d~~~~f~~~~~~q~Rv 201 (489) T protein:vir:78 148 NPTIAF-----YTTENI-VNWRLTRVGSV--NRVTMVVLRETW------E------------YNEPGNEFETKYGEQYRV 201 (489) T ss_pred CcEEEE-----echhhh-cCceeeeeCCc--cceeEEEEEEeE------E------------eecCCCCccceeEEEEEE Confidence 122221 122233 34444333321 111111111100 0 000001122222223333 Q ss_pred EEE--EEEecceeEEEEe-eCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCC Q lcl|NC_013059. 220 AEF--YEVVEKKETAFIY-QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGE 296 (725) Q Consensus 220 ~E~--w~~~~~~~~~~~~-~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~ 296 (725) -+. |.+- +.+++.. .++.++. ..+ .++.+..--+-+ T Consensus 202 L~~~~~g~~--~~~~~r~~~~g~~~~-------------------------------~~~--------~~~~~~g~~~l~ 240 (489) T protein:vir:78 202 LDIDSDGNY--RQRLFRFDAEGGAQE-------------------------------DVV--------EIYPDLGESLRG 240 (489) T ss_pred EecCCCcce--EEEEEEeecCCcccc-------------------------------eee--------EEeccCCCCccC Confidence 211 1000 0011110 1100000 000 000111111225 Q ss_pred ccceEEEEeeeecc-CCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccc Q lcl|NC_013059. 297 HIPIVPVFGEWGFV-EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) Q Consensus 297 ~~p~vP~~g~~~~~-d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 375 (725) .+|||||.+..... .++...+++... . .-.++.|.-.+.+......+..+..|..+..+......+..+...+ T Consensus 241 ~IPfv~~~~~~~~~~~~~pPLl~LA~l----n-i~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g- 314 (489) T protein:vir:78 241 VIPFTFIGATNNDATIDDAPLLPLAEL----N-IGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFG- 314 (489) T ss_pred eeeEEEEecCCCCCCCCcCchHHHHHH----H-HHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeC- Confidence 67877765432211 122112222222 1 1122222222222222333333333321111122222222211111 Q ss_pred cccccCcc-c-cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 376 RTDENNGE-M-PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNL 453 (725) Q Consensus 376 ~~~~~~g~-~-~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~ 453 (725) ...+. + .....++++.....-+ .+.|....+.|. ..|. ..+- .+.+.|+.+......+....|..+..|+ T Consensus 315 ---~~~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~-~lGa--~l~~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (489) T protein:vir:78 315 ---SRRGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIGA--QLIT-PTQQITAQSARIQRGADTSVMATIARNV 386 (489) T ss_pred ---CcccccCCCCCCcceeccCcchHH-HHHHHHHHHHHH-HHhh--hhcc-CCcchhHHHHHHHHHHhhHHHHHHHHHH Confidence 11110 1 1122344444332222 222322222222 2232 2333 2334667777777767766677777777 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHH Q lcl|NC_013059. 454 ATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRA 533 (725) Q Consensus 454 ~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~ 533 (725) ..+. +.+|.++..|.... ++..--|.+|+ .|++. +-. .+.++ T Consensus 387 e~al----~~~l~~~a~w~G~~--------~~~~~~i~~n~------------------dF~~~----~~d----~~~~~ 428 (489) T protein:vir:78 387 SQAY----TDALRWVAVMLGKP--------EDTEVEFRLNM------------------DFFLE----PMT----AQDRA 428 (489) T ss_pred HHHH----HHHHHHHHHHcCCC--------CCCceEEEeec------------------ccCcc----cCC----HHHHH Confidence 7765 66777888887642 11111222332 12221 111 12344 Q ss_pred HHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhh--hhhccchhhhHHHHHHHHHHH Q lcl|NC_013059. 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMG--VKKPETPEEQQWFVEAQQAKQ 601 (725) Q Consensus 534 ~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~--~~~~~~~e~~~~~~q~~q~qq 601 (725) +|.++..+ +..........+.-...+|. ..+++.+++..+..+.+ ...+.++..|+.. + T Consensus 429 al~~~~~~-G~is~~t~~~~L~~~gv~d~-~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~-------~ 489 (489) T protein:vir:78 429 AWMADINA-GLLPATAYYAALRKAGVTDW-TDADIKDAVADQPLPVATEVQGEIPQSAQQQE-------K 489 (489) T ss_pred HHHHHHhc-CCCCHHHHHHHHHhCCCCCc-cHHHHHHHHhhcCCCcccCCcccCCCCccccc-------C Confidence 44444331 11111111111111112232 23444444433321111 1111111111100 0 No 134 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=97.15 E-value=0.00015 Score=41.53 Aligned_cols=626 Identities=11% Similarity=0.029 Sum_probs=189.0 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHH--------------- Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLV--------------- 68 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~--------------- 68 (725) .-+++++++.++..-++...+|..+.++.+. ++....-..| |.-.+.+.+++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---------~D~~f~~~~G----~QW~~~~~~~l~~~~q~~grP~~~~N 67 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCI---------EATRFARVPG----GQWEGATAAGTKLDEQFEKYPKFEIN 67 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHH---------HHHHhhcCCC----CCCCHHHHHHHHHhhhhcCCCceEEc Confidence 7788888888777766666666555543221 1222222223 12222222222 Q ss_pred ------HHHhhCCcceEEec-CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCc Q lcl|NC_013059. 69 ------SEMRQNPIDVLYRP-KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNN 141 (725) Q Consensus 69 ------g~~~~nr~~~~~~p-r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~ 141 (725) ..-...-..-+..+ ..|.+.+.-..+..++..+....-.......++.++..+++.+ ++.|. T Consensus 68 ~i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~---G~Gw~-------- 136 (708) T protein:vir:10 68 KVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATG---GFGCF-------- 136 (708) T ss_pred chHHHHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhc---cccee-------- Confidence 22222222222222 1356666546678888888877777777777888888888765 11111 Q ss_pred eeEEEEeeec-chh--------heeeCC-CccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhh-hhhccccccc Q lcl|NC_013059. 142 QVIRREPIHS-ACS--------HVIWDS-NSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPS-FQNPNDWVFP 210 (725) Q Consensus 142 ~~ir~~~~~~-~~~--------~v~~Dp-~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~-~~~~~~~~~~ 210 (725) .++.+...+ ++. ...+|| .+.-+|.. .+..+.+++.-.|...-.+..+... +-......++ T Consensus 137 -~~~~d~~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d 208 (708) T protein:vir:10 137 -RLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPD-------AKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLD 208 (708) T ss_pred -eeeeccccccCCCCCccccceEEeecchhhcccCcc-------ccccChhhhhhhhhccCCCHHHHHHhCCCCcccccc Confidence 112111100 000 001111 11111111 0111233332222211111111100 0000000111 Q ss_pred ccCCCeEEEEEEEEEecceeEEEEeeCc--cccceeec-ch--hhhHH----HHHHHHhcchhhhhccceeEEEEEEEEe Q lcl|NC_013059. 211 WLTQDTIQIAEFYEVVEKKETAFIYQDP--VTGEPVSY-FK--RDIKD----VIDDLADSGFIKIAERQIKRRRVYKSII 281 (725) Q Consensus 211 ~~~~~~vrv~E~w~~~~~~~~~~~~~d~--~~g~~~~~-~~--~~~~~----~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 281 (725) |..... -..-|... .+..+..|... ..-.++.+ ++ .++.. ....... .........+.++++.++.+ T Consensus 209 ~~~~~~--~~~~~~~~-d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~-~~~~~g~~~~~~r~~~r~~v 284 (708) T protein:vir:10 209 VTSMTS--WEYNWFGA-DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIED-ELAIAGFHEVARRSVKRRRV 284 (708) T ss_pred cccCCC--ccccccCC-CceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHH-HHHhcccchhheeeeeeEEE Confidence 111100 00012221 12222222110 00111111 11 11111 1111111 11111112334455666555 Q ss_pred eccccccCC-CCCCCC-ccceEEEEeeeeccCCccccchhh--hh-hhhHHHHHHHHHHHHHHH-HHhcCCcceeechhh Q lcl|NC_013059. 282 TCTAVLKDK-QLIAGE-HIPIVPVFGEWGFVEDKEVYEGVV--RL-TKDGQRLRNMIMSFNADI-VARTPKKKPFFWPEQ 355 (725) Q Consensus 282 ~g~~~l~~~-~~~p~~-~~p~vP~~g~~~~~d~~~~~~G~v--r~-~kd~Q~~~N~~~s~~~~~-~~~~~~~~~~~~~~~ 355 (725) ....++... -.+|.. -+.++||++++-+... ..|.- .. +++.-+.-. ...+...- ....+..+....... T Consensus 285 ~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~---~d~~~~~yG~vr~~kd~Q~-~~N~~~S~~~~~~a~~~~~~~i~~ 360 (708) T protein:vir:10 285 YVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWF---IDDIERVEGHIAKAMDPQR-LYNLQVSMLADTAAQDPGQIPIVG 360 (708) T ss_pred EEEeecchhhhccCCCCCCCceeeEEEeeeeec---cCCCcccceeecccchhHH-HHHHHHHHHHHHHHhcCCcccccC Confidence 544444221 123322 2335577665432221 11211 11 122211100 00000000 000000000000000 Q ss_pred cchHHHHHHhhccccccccccccccCcccc----ccCCcccCCC-------CchHHHHHHHHHHHHHHHHHhCCChHHhc Q lcl|NC_013059. 356 IAGFEHMYDGNDDYPYYLLNRTDENNGEMP----TQPLAYYENP-------EVPQANAYMLEAATAAVKEVATLGVDAEA 424 (725) Q Consensus 356 i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~----~~~~~~~~~~-------~~~~~~~~ll~~~~~~i~~~tGv~~~~~G 424 (725) .. ........+... ...+...++ ..+.+.+... +.|.-...+++.....+..+.-++..+.+ T Consensus 361 ~~---~i~~~~~~~~~~----~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~ 433 (708) T protein:vir:10 361 ME---QIRGLEKHWEAR----NKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQA 433 (708) T ss_pred hh---hhhhHHHHHhhc----cccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChh Confidence 00 000000000000 001111110 1111111111 22233344666666666666655333333 Q ss_pred cCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCce Q lcl|NC_013059. 425 VNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGER 504 (725) Q Consensus 425 ~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~ 504 (725) ..| ..|+.+=.+.......+...+.. +-..+++.-+.+-.++..++-. +. +.++.+.|..+. + +-.. T Consensus 434 ~lG-~~sn~SG~aI~~rq~qg~~~l~~-~~Dnl~~~~~~~g~~lL~li~~-----~y---~~er~~RI~~ed--g-~~~~ 500 (708) T protein:vir:10 434 MQQ-MPSNIAQETVNNLMNRADMASFI-YLDNMAKSLKRAGEVWLSMARE-----VY---GSEREVRIVNED--G-SDDI 500 (708) T ss_pred Hcc-CccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-----Hc---CCCcEEEEecCC--C-Ccce Confidence 222 13332111111111111111111 1111222222222333332211 11 234555553221 1 1134 Q ss_pred eeec----cc-cc----cceEEE---EeccCc-hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHH Q lcl|NC_013059. 505 QVLN----DI-RG----RYECYT---DVGPSF-QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDY 571 (725) Q Consensus 505 ~~~n----Di-~g----~~Dv~v---~~~p~~-~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~ 571 (725) +.+| |. +| ..|+++ ++..+. ++.-...-+.+..|++.++...|...... ..+ +.+-+.+.. T Consensus 501 v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~-~~~-----~~~l~~~D~ 574 (708) T protein:vir:10 501 AVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRP-AIQ-----GIILDNIDG 574 (708) T ss_pred EEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhH-HHH-----HHHHHhcCC Confidence 4454 43 23 356644 555554 55555555555555555554444222111 111 111122222 Q ss_pred Hhhhhhhhhhhhcc--chhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 572 ANKQLIQMGVKKPE--TPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAAR 649 (725) Q Consensus 572 ~~kq~~~~~~~~~~--~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~ 649 (725) .++......+.+.. .....+..++.++..++++++++.+++.++.+++++..++++++++++.++.++++++. T Consensus 575 p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~----- 649 (708) T protein:vir:10 575 EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAF----- 649 (708) T ss_pred cChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 22222222222211 11111111222222222223333333333334444444444444333332222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 650 IAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 650 ~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~~~~~q 725 (725) +.+.++ .++..... ...+++........+.+.+-+. ..+..++++ .+ -.|+ T Consensus 650 --q~~~~~--------~~a~~~a~--------q~~~~a~~~~~~~~~~~~q~l~---~~q~~q~~~-~~---~~p~ 700 (708) T protein:vir:10 650 --TAQQDA--------MESQANTV--------YKLAQARNIDDKAVMEAIRLLK---DVAESQQQQ-FQ---SPPQ 700 (708) T ss_pred --HHHHHH--------HHHHHHHH--------HHHHHHHHHHHHHHHHHHHHhh---hhhhhHHHH-Hh---cccc Confidence 100000 00000000 0011111111111111111111 111111111 11 3445 No 135 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=616 Identities=12% Similarity=0.049 Sum_probs=160.1 Q ss_pred HHHHHhhh------HHHHHHHHHHHHhh--cCC---CCCH---HHHHHHhhcC-C-Cc-------ccchHHHHHHHHH-- Q lcl|NC_013059. 15 FDADWTAS------DEARREAKNDLFFS--RVS---QWDD---WLSQYTTLQY-R-GQ-------FDVVRPVVRKLVS-- 69 (725) Q Consensus 15 ~~~~~~~~------~~~r~~a~~d~~f~--~G~---QW~~---~~~~~l~~~g-r-p~-------~N~i~~~v~~v~g-- 69 (725) ...+.+.. .+.+....+.+..+ +-+ -|.. .+..+. .| + +. ..-..|++..+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy--~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~ 78 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYY--DGDQLAPEVIQVLKDRGQPMTIHNLIAPT 78 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhh--cCCCCCHHHHHHHHhcCCCcEEeccHHHH Confidence 11122110 11111111111111 111 1211 111111 12 1 10 0111122111111 Q ss_pred --HHhhCCcceEEec-CCcchHHHHH-HHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEE Q lcl|NC_013059. 70 --EMRQNPIDVLYRP-KDGASPDAAD-VLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) Q Consensus 70 --~~~~nr~~~~~~p-r~~~d~~~Ae-~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir 145 (725) .-...-..-+..+ ..|.+.+-+. .+..++..+....-.......++.++..+|+.+ ++.|. T Consensus 79 v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~---G~G~~------------ 143 (714) T protein:vir:10 79 VDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKA---GLSWV------------ 143 (714) T ss_pred HHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhc---ccceE------------ Confidence 1122111112222 1345666764 466777777666655666677777888887765 11111 Q ss_pred EEeeecchhheeeCCCcc-------ccChhcccceee---eecCCHHHHHHHhhh-----------cCCcchhhhhhhhc Q lcl|NC_013059. 146 REPIHSACSHVIWDSNSK-------LMDKSDARHCTV---IHSMSQNGWEDFAEK-----------FDLDADDIPSFQNP 204 (725) Q Consensus 146 ~~~~~~~~~~v~~Dp~a~-------~~d~sDa~~~~~---~~~~~~~~~~~~~p~-----------~~~~~~~~~~~~~~ 204 (725) ++++|++.. ..|.. -+|. .+-.|.+++.-.|-. |+.+...... .. T Consensus 144 ---------~~~~d~d~~~~~i~i~~v~p~---~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~--~~ 209 (714) T protein:vir:10 144 ---------EVRRNSEPFGPEFKVSTVSRN---EVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDY--AI 209 (714) T ss_pred ---------EeeeccCCCCCCeEEEecChh---heeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhc--cc Confidence 123333211 11110 0110 111222222222211 1111111111 11 Q ss_pred cccccc----ccC-CCeEEE------------EEEEEEecceeEEEEeeCcc-cccee-ecchhh--hHHHHHHHHhcch Q lcl|NC_013059. 205 NDWVFP----WLT-QDTIQI------------AEFYEVVEKKETAFIYQDPV-TGEPV-SYFKRD--IKDVIDDLADSGF 263 (725) Q Consensus 205 ~~~~~~----~~~-~~~vrv------------~E~w~~~~~~~~~~~~~d~~-~g~~~-~~~~~~--~~~~~~~~~~~g~ 263 (725) .+|... +.+ .+...+ ...|+.... +++.++...+ ....+ .+.+.+ .......-..... T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~ 288 (714) T protein:vir:10 210 DDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER-RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAV 288 (714) T ss_pred hhhcCcccchhhhhhcccccccchhhcccccccccccccCc-ceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHH Confidence 111100 000 000000 011211111 1111111100 00000 000000 0000000000000 Q ss_pred -hhhhccceeEEEEEEEEeeccccccCCCCCCC-Cccc--eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_013059. 264 -IKIAERQIKRRRVYKSIITCTAVLKDKQLIAG-EHIP--IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNAD 339 (725) Q Consensus 264 -~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~-~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~ 339 (725) +....-.+.+++|.+.. ...++.+.-.+-. ..|| ++||++++.+.+ ..+|....++..=+..-....+ T Consensus 289 ~~~~g~~~~~~~~~~rv~--~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~---~~~g~~~G~vr~~~d~Qr~~N~--- 360 (714) T protein:vir:10 289 AVASGRVQVKVGRVSRIR--EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---DKTGEPYGLISRAIPAQDEVNF--- 360 (714) T ss_pred HHHhccceecccceeeEE--EEEEecchhhhcCCCCCCCCceeeEEecceee---eccCccceehhhhhhHHHHHHH--- Confidence 00000111112222211 1112211111111 1234 366666543322 2445444433222211111111 Q ss_pred HHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccccc--------CCcccCCCCchHHHHHHHHHHHHH Q lcl|NC_013059. 340 IVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQ--------PLAYYENPEVPQANAYMLEAATAA 411 (725) Q Consensus 340 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~--------~~~~~~~~~~~~~~~~ll~~~~~~ 411 (725) ..+ +..++.....+ +... ...+....-.......++|.+.-. +..++...+.++-....++..... T Consensus 361 --~~s-~~~~~l~~~~~--~~~~-gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~ 434 (714) T protein:vir:10 361 --RRI-KLTWLLQAKRV--IMDE-DATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQES 434 (714) T ss_pred --HHH-HHHHHHhCCce--eecc-ccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHH Confidence 011 11111111110 0000 000000000000111222222211 122344444344445666666666 Q ss_pred HHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEE Q lcl|NC_013059. 412 VKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQ 491 (725) Q Consensus 412 i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~ 491 (725) ...+-.++...-...|..+++.+-.+....-..+...+..-| ..+++.-+.+-.++..++- .+. ..++.+. T Consensus 435 ~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~-dnl~~~~~~~g~~ll~li~-----~~~---~~~rv~R 505 (714) T protein:vir:10 435 EKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEIN-DNYQFACQQVGRLLLAYLL-----DDL---KKRRNHA 505 (714) T ss_pred HHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH-----HHc---CCCcEEE Confidence 666655432222222333332222222222222222222222 2223333333334443331 112 2334444 Q ss_pred eccccccccCCceeeecccc--c--cceEE---EEeccCc-hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCch Q lcl|NC_013059. 492 LMAEVVDLATGERQVLNDIR--G--RYECY---TDVGPSF-QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGK 563 (725) Q Consensus 492 in~~~~d~~~g~~~~~nDi~--g--~~Dv~---v~~~p~~-~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~ 563 (725) |-.+......-..+..|+-. | --||+ .++..+. ++.-...-+.+..|++.++ .+. +.+-.- T Consensus 506 I~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~--------~~~---p~~~~~ 574 (714) T protein:vir:10 506 VVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQ--------GLP---PQVQAV 574 (714) T ss_pred EeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHh--------hcC---chhhhh Confidence 42221111122345555432 1 23443 3444443 3333333333334443221 000 111111 Q ss_pred hHHHHHHHH---hhhhhhhhhhhc----cchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 564 GVEMMRDYA---NKQLIQMGVKKP----ETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDA 636 (725) Q Consensus 564 ~~~~i~e~~---~kq~~~~~~~~~----~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea 636 (725) ..+-+++.. .+......+.+. .+++..+. ++++.++++.+.+..++++ +.++++++.++.++++ T Consensus 575 ~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~--e~q~~q~~~~~~~~~q~~l-------~~~e~~a~~~k~eaea 645 (714) T protein:vir:10 575 VLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTP--EEQEVAAQQQALQQQQAEL-------QMREMAGRVAKLEADA 645 (714) T ss_pred HHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCc--chhHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHH Confidence 112222211 111112221111 11111111 1111111111111112222 2233333334444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013059. 637 AKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQP 716 (725) Q Consensus 637 ~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~ 716 (725) .+++++++....+......++..+... ....+++. + ...+..+-..+ ..+..+|++ ....+.+..+.| T Consensus 646 ~~~~aqa~~~~~~a~~~~~~~~~q~~~----~~~~~a~~----a-~~l~~~~~~~q-~~~~~~q~~--~q~~~~~~~~~~ 713 (714) T protein:vir:10 646 ARAHAAAQRDNASAQREVALTQGQRYV----DALNQAHT----A-EIITGVQNMEQ-EQDVLQQQM--LYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHH----H-HHHHHHHhhhh-hHHHHHHHH--HHHHHHHHHhcC Confidence 444433333322222211111111100 00000000 0 00011111111 111222222 111122222222 Q ss_pred c Q lcl|NC_013059. 717 S 717 (725) Q Consensus 717 ~ 717 (725) - T Consensus 714 ~ 714 (714) T protein:vir:10 714 L 714 (714) T ss_pred C Confidence 2 No 136 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=96.91 E-value=0.00026 Score=40.22 Aligned_cols=569 Identities=12% Similarity=-0.022 Sum_probs=183.2 Q ss_pred CC---cHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc----CCCCCHHHHHHHhhcC------CCcccchHHHHHHH Q lcl|NC_013059. 1 MA---DNKNRLESILSRFDADWTASDEARREAKNDLFFSR----VSQWDDWLSQYTTLQY------RGQFDVVRPVVRKL 67 (725) Q Consensus 1 ma---d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~----G~QW~~~~~~~l~~~g------rp~~N~i~~~v~~v 67 (725) -+ +....+.+++...-+.+..... .+...+..|.+ |-=|-.--..+..+.+ .+++-.+.--.+.| T Consensus 90 ~p~~~~~d~~~Ae~l~~l~~~~~~~~~--~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~~~~~~~v 167 (708) T protein:vir:17 90 RPGDREASEELANKLNGLFRADYEETD--GGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSV 167 (708) T ss_pred ecCCCcchHHHHHHHHHHHHHHHHhcC--chhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEeeccchhhe Confidence 11 1122344444444443332222 22223333333 3224221111111111 11100000000011 Q ss_pred HHHHhhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHH---------HHHHHHHHhcCcceEEEEeeeccCCCC Q lcl|NC_013059. 68 VSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAV---------NIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) Q Consensus 68 ~g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~---------s~a~~~~~~~G~G~~~v~~~~~~~~~~ 138 (725) ..|+..+-.|..| .+|++..--.+... ...+.....+..+ ++|.+. T Consensus 168 -------~~Dp~a~~~D~sD----------ar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~-----~~~~~~--- 222 (708) T protein:vir:17 168 -------WFDPDAKKYDKSD----------ALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWE-----YDWFDA--- 222 (708) T ss_pred -------ecCccccccChhh----------hhhhhhhccCCHHHHHHhCccccchhhhhhhhcccc-----ccccCC--- Confidence 1111111112222 12222111111111 1111110001100 011000 Q ss_pred CCceeEEEEee------ecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhccccccccc Q lcl|NC_013059. 139 SNNQVIRREPI------HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWL 212 (725) Q Consensus 139 ~~~~~ir~~~~------~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 212 (725) .. +|+.-. +... -++.||.+.++ .....-....+...+.. .+.-.-. T Consensus 223 -d~--vrv~e~~~r~~~~~~~-~~~~~~~~g~~--------~~~~~~~~~~~~~~~~~---------------~g~~~~~ 275 (708) T protein:vir:17 223 -DV--IYIAKYYEVRKESVDV-ISYRHPITGEI--------ATYDSDQVEDIEDELAI---------------AGFQEVA 275 (708) T ss_pred -Ce--EEEEEEEEEeeeeeEE-EEEecCccCce--------eeeCccchhhHHHHHHh---------------cccccce Confidence 11 111100 0000 01223332210 00000011111111110 0000001 Q ss_pred CCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCC Q lcl|NC_013059. 213 TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) Q Consensus 213 ~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~ 292 (725) .+...|..++|+..- ...++...+++.|+.. -+.. +.|.....+..| T Consensus 276 ~r~~~r~~v~~~~~~-g~~~l~~~~~~p~~~f-------------------------------P~vP-~~g~r~~~d~~~ 322 (708) T protein:vir:17 276 RRSVKRRRVYVSVVD-GDGFLEKPRRIPGEHI-------------------------------PLIP-VYGKRWFIDDIE 322 (708) T ss_pred eeeeeEEEEEEEeec-ccccccCCCCCCCCcc-------------------------------ceEE-EecccccccCCC Confidence 122233333443311 0111111111111100 0000 111111112222 Q ss_pred CCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccc Q lcl|NC_013059. 293 IAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYY 372 (725) Q Consensus 293 ~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 372 (725) .+++++-.. +| +-=++-...-.-.+.-...+...+++..+...++....+........+...+..+.. T Consensus 323 ~~yG~vr~~--------kd----~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) T protein:vir:17 323 RVEGHIAKA--------MD----PQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) T ss_pred cccchhhhc--------hh----HHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCc Confidence 222222111 11 110111111111111111222234555555444432221111111112222222222 Q ss_pred ccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHH----HHhCCChHHhccCcch--hHH-HHHHHH------- Q lcl|NC_013059. 373 LLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVK----EVATLGVDAEAVNGGQ--VAY-DTVNQL------- 438 (725) Q Consensus 373 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~----~~tGv~~~~~G~~~n~--~Sg-~ai~~~------- 438 (725) .++. ..|+.+..+++..+.++-....+++.....+.+. ...|......|..-++ .+| .++... T Consensus 391 ~g~v---~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~ 467 (708) T protein:vir:17 391 YGNI---IAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKS 467 (708) T ss_pred cccc---ccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 2344445556666666666666666555554443 2446544444532221 111 111111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecc--C-CCcceEEeccccccccCCceeeeccccccc Q lcl|NC_013059. 439 -NMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLE--D-GSEKEVQLMAEVVDLATGERQVLNDIRGRY 514 (725) Q Consensus 439 -q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~--d-~~~~~v~in~~~~d~~~g~~~~~nDi~g~~ 514 (725) +..|...+..+.+-|.. .+++++ +-+. ..++++.|.+. | ....++.+|.- ..|+. |++ . T Consensus 468 ~~~~g~~lL~lI~~~y~~--~R~~RI----~~ed-g~~~~v~in~~~~d~~~g~~~~~nDi----~~g~~----Dv~--v 530 (708) T protein:vir:17 468 LKRAGEVWLSMAREVYGS--EREVRI----VNED-GSDDIAVLSAQVVDRQTGAVVALNDL----SVGRY----DVT--V 530 (708) T ss_pred HHHHHHHHHHHHHHHcCC--CcEEEE----ecCC-CCcceeeecceeccCCCccceeeccc----eeeee----eEE--E Confidence 22222222211111111 111111 1111 23466666432 1 23346666632 12221 221 3 Q ss_pred eEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhc----cchhhh Q lcl|NC_013059. 515 ECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKP----ETPEEQ 590 (725) Q Consensus 515 Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~----~~~e~~ 590 (725) |..-. .++.-..--..+..|...+....+.++.....++.+++.+....+.+.+.....+........+ ...+.+ T Consensus 531 ~~~p~-~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~q 609 (708) T protein:vir:17 531 DVGPS-YTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQ 609 (708) T ss_pred ecccC-chhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHH Confidence 33323 3444444445555666666666666666777777888877776655544432222211111111 122222 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 591 QWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLK 670 (725) Q Consensus 591 ~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~ 670 (725) +..+++.+.++.+.+....++|+++++++++..+++++..+.+.++.+++.++....++ ...+....+.+..+.++ T Consensus 610 q~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q----~~~~~~~~~~~~~~~l~ 685 (708) T protein:vir:17 610 MAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ----ARNIDDKAVMEAIRLLK 685 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHhh Confidence 23333333444455666777788888888888877777777776666665444322222 22222333333444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Q lcl|NC_013059. 671 TVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSG 718 (725) Q Consensus 671 ~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~ 718 (725) .....++.+. ++ ..+ +.++ .||. T Consensus 686 ~~q~~q~q~~-------~a----~p~---~~~~-----------~~~~ 708 (708) T protein:vir:17 686 DVAESQQQQF-------QS----PPQ---SPAD-----------LMPS 708 (708) T ss_pred hhhhhHHHHH-------hc----ccc---Cchh-----------ccCC Confidence 4432221111 00 000 0000 0111 No 137 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=96.67 E-value=0.00042 Score=39.08 Aligned_cols=621 Identities=10% Similarity=-0.016 Sum_probs=178.5 Q ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCcccchHHHHHHHHHHHh--hCCc----- Q lcl|NC_013059. 4 NKNRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMR--QNPI----- 76 (725) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~--~nr~----- 76 (725) +-+.+++++.++...++...+|..++++.+ -++....-..|. .-.+.+..++=... .+|| T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~---------~~d~~f~~~~G~----QW~~~~~~~l~~~~q~~grP~~~~N 67 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKC---------IEATRFVRVPGG----QWEGATVAGTKLDEQFEKYPKFEIN 67 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHhhccCCc----cCCHHHHHHHHhhhhhcCCCceEec Confidence 666666666666666666666655554332 123333322331 11122222221111 1222 Q ss_pred --------------ceEEecC-CcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCc Q lcl|NC_013059. 77 --------------DVLYRPK-DGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNN 141 (725) Q Consensus 77 --------------~~~~~pr-~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~ 141 (725) .-+..+. .|.+.+.-+.+..++..+....-.......++.++..+++.+ ++.|. T Consensus 68 ~i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~---G~G~~-------- 136 (706) T protein:vir:10 68 KVATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATG---GFGCF-------- 136 (706) T ss_pred chHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhc---CcceE-------- Confidence 1111111 244444445577788887777666666667777777776654 11111 Q ss_pred eeEEEEeeec-chh--------heeeCCC-ccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccc Q lcl|NC_013059. 142 QVIRREPIHS-ACS--------HVIWDSN-SKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPW 211 (725) Q Consensus 142 ~~ir~~~~~~-~~~--------~v~~Dp~-a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 211 (725) .++.+...+ ++. ..++||. +.-+|.. .+..+..++.-.|-..-.+....... +-+....+ T Consensus 137 -ev~~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~-------a~~~D~sDar~~~~~~~~~~d~~~~~--fp~~~~~~ 206 (706) T protein:vir:10 137 -RLTTSFVNEYDPMDERQRIAVEPIYDPARSVWFDPD-------AKKYDKSDALWAFCMYSVSLEKYQSE--YDKAPTSL 206 (706) T ss_pred -EeeeccccccCCCCCCccceeeeeccchhceecCch-------hcccChhhcceEeeeecCCHHHHHHh--cCCChhhh Confidence 011110000 000 0011221 1111111 00011111111111100000000000 00000000 Q ss_pred cCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHH----HHHHh------------cchhhhhcc---cee Q lcl|NC_013059. 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVI----DDLAD------------SGFIKIAER---QIK 272 (725) Q Consensus 212 ~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~----~~~~~------------~g~~~~~~~---~~~ 272 (725) -+ ...-+|+..+.... +-.+.+|+........ ..... .....+... .+. T Consensus 207 ~~----~~~~~~~~d~~~~d--------~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 274 (706) T protein:vir:10 207 DR----VGSVSWQYDWFTPD--------VVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIG 274 (706) T ss_pred hh----hccccccccccCCC--------cceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhh Confidence 00 01113433221110 1111122111000000 00000 000000000 000 Q ss_pred EEEEEEEEeeccccccCCCCC-CCCcc--ceEEEEeeeeccCCccccchhhhh-hhhHHHHHHHHHHHHHHHHHhcCCcc Q lcl|NC_013059. 273 RRRVYKSIITCTAVLKDKQLI-AGEHI--PIVPVFGEWGFVEDKEVYEGVVRL-TKDGQRLRNMIMSFNADIVARTPKKK 348 (725) Q Consensus 273 ~~~v~~~~~~g~~~l~~~~~~-p~~~~--p~vP~~g~~~~~d~~~~~~G~vr~-~kd~Q~~~N~~~s~~~~~~~~~~~~~ 348 (725) +++|..+-.....+. +.... -..-| ..+||++++.+.... ..-|.... +.+.-+.--..-..+.-++...+..+ T Consensus 275 ~~~~~~~~v~~~~~~-g~~~l~~~~p~~~~~~P~vP~~g~r~~~-d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~ 352 (706) T protein:vir:10 275 RRSVKRRRIYVAVVD-GDGFLEKPRRIPGEHIPLIPVYGKRWFI-DDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDP 352 (706) T ss_pred hcccceeeEEEEeec-cccccccCCCCCCCccceEEEeeccccc-cccCcccceeccchhhHHHHHHHHHHHHHHHHhcC Confidence 011111100000000 10000 00111 235555544321110 11111112 33444433333334334445555666 Q ss_pred eeechhhcchHHHHHHhhccccccccccc-cccCccccccCCcccCCC---CchHHHHHHHHHHHHHHHHHhCCChHHhc Q lcl|NC_013059. 349 PFFWPEQIAGFEHMYDGNDDYPYYLLNRT-DENNGEMPTQPLAYYENP---EVPQANAYMLEAATAAVKEVATLGVDAEA 424 (725) Q Consensus 349 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~---~~~~~~~~ll~~~~~~i~~~tGv~~~~~G 424 (725) .....+++++.+.....-..........+ .++.|...+..+.+.+.+ +.|.-....++........+- ...| T Consensus 353 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~----~vsG 428 (706) T protein:vir:10 353 GQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQ----EVTG 428 (706) T ss_pred CcccccchhHHHHHHHHhhhcccccccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHH----HHhC Confidence 66666655444333222222221211111 123333333322222111 122222224444444443332 2334 Q ss_pred cCcc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccC-CCcceEEecccccccc Q lcl|NC_013059. 425 VNGG---QVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLED-GSEKEVQLMAEVVDLA 500 (725) Q Consensus 425 ~~~n---~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d-~~~~~v~in~~~~d~~ 500 (725) .... ..|+.+-.+.+..-..+...+.. |-..+++.-+.+-+++..+. -+. +.++.+.|..+. + T Consensus 429 i~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~-~~Dnl~~~~~~~g~~lL~li---------~~~y~~~R~~RI~~ed--~- 495 (706) T protein:vir:10 429 SSQAMQQMPSNVARETVNSLLNRSDMASFI-YLDNMAKSLKRAGEIWLSMA---------REIYGSDREVRIVHED--G- 495 (706) T ss_pred CCHHHcCCccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH---------HHHcCCCcEEEEecCC--C- Confidence 4321 22332111111111111111111 11112222222223333222 111 234555553221 1 Q ss_pred CCceeeecc-----cccc----ceEEE---EeccCc-hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHH Q lcl|NC_013059. 501 TGERQVLND-----IRGR----YECYT---DVGPSF-QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEM 567 (725) Q Consensus 501 ~g~~~~~nD-----i~g~----~Dv~v---~~~p~~-~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~ 567 (725) +-..+++|. .+|. .||++ ++..+. ++.-...-+.+..++..++.+.|...... ..+ +.+-+ T Consensus 496 ~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~-~l~-----~~~~~ 569 (706) T protein:vir:10 496 TDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRP-ALM-----GIIID 569 (706) T ss_pred CccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhH-HHH-----HHHHh Confidence 123455553 4453 45544 444453 44444444455555555554445333222 111 11111 Q ss_pred HHHHHhhhhhhhhhhhccchh--hhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 568 MRDYANKQLIQMGVKKPETPE--EQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) Q Consensus 568 i~e~~~kq~~~~~~~~~~~~e--~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~ 645 (725) .+...++......+.+...+. .++..++.++..+++++.++.+++.++.+++++..++++++++. T Consensus 570 ~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~------------- 636 (706) T protein:vir:10 570 NMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKS------------- 636 (706) T ss_pred hcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------- Confidence 111112222222221111100 01111111111111111111222222222222222222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 646 NAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 646 ~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~~~~~~~~~q 725 (725) +++..+.. .++.+.+.+..+.+.+. ++.+..+.+-..++..++. ......+......+...|+ T Consensus 637 -~a~~~q~~-----------~~a~~a~~qa~~~~~~~----~~~~~~a~~~~~~~~~q~~-q~l~~~~a~q~~~~~~~~~ 699 (706) T protein:vir:10 637 -QNETVQTQ-----------IKAFTAQQDAMESQANT----VYKLAQARNIDDKAVMETL-RLLKEVAASQQQTIPSPPS 699 (706) T ss_pred -HHHHHHHH-----------HHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCCCCC Confidence 11111100 00111111111111100 1111111111111112211 2233333344444444444 No 138 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=96.64 E-value=0.00045 Score=38.94 Aligned_cols=624 Identities=10% Similarity=-0.021 Sum_probs=156.8 Q ss_pred CCcHHHHHHHHHHHHHHHHhhhHH----HHHHHHHHHHhhcCCCCCHHHHHHHhh----cCC--Ccc-------cchHHH Q lcl|NC_013059. 1 MADNKNRLESILSRFDADWTASDE----ARREAKNDLFFSRVSQWDDWLSQYTTL----QYR--GQF-------DVVRPV 63 (725) Q Consensus 1 mad~~~~~~~~~~~~~~~~~~~~~----~r~~a~~d~~f~~G~QW~~~~~~~l~~----~gr--p~~-------N~i~~~ 63 (725) ||++.+...+-+.-...+.....+ .+..+..++ +.+.++-....++ .|. +.. .-..++ T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-----~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~ 96 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQEL-----SRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPT 96 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHH-----hhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceE Confidence 776544333332222221111111 111112222 2333322222221 233 110 111111 Q ss_pred HH----HHHHHHhhCCcceEEec-CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCC Q lcl|NC_013059. 64 VR----KLVSEMRQNPIDVLYRP-KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) Q Consensus 64 v~----~v~g~~~~nr~~~~~~p-r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~ 138 (725) +. .++..-...-..-+..+ ..+.+.. -+.+..++..+....-.......++.++...++.+ ++ T Consensus 97 ~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~-d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~-----------G~ 164 (776) T protein:vir:93 97 VYNVISQSVNWIIGSEKRGRSDFKVLPRRKD-GGKAAERKTALLKYLSDVNHTPFERSMAFEETTKA-----------GI 164 (776) T ss_pred EecchHHHHHHHHHHHHhCCcceEEecCChh-HHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhc-----------Cc Confidence 11 11222222111122222 2344443 33445666666555444444555566666555443 12 Q ss_pred CCceeEEEEeeecchhheeeCCCccccChhcccceeeeec--------------CCHHHHHHHhhh-----------cCC Q lcl|NC_013059. 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHS--------------MSQNGWEDFAEK-----------FDL 193 (725) Q Consensus 139 ~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~--------------~~~~~~~~~~p~-----------~~~ 193 (725) |. -.|+||.+... ..++.+. .+.+++.-+|-. |+. T Consensus 165 G~-------------~~v~~d~~~~~-------~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~ 224 (776) T protein:vir:93 165 GW-------------LESQVQDENDG-------EPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPE 224 (776) T ss_pred ce-------------EEEEeeccCCC-------CceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCC Confidence 10 01344433211 0111122 223333322221 111 Q ss_pred cchhhhhhhhcccccccccCCCeEEEE-------------EEEEEecceeEEEEe----eCccccceeecchhhhHHH-H Q lcl|NC_013059. 194 DADDIPSFQNPNDWVFPWLTQDTIQIA-------------EFYEVVEKKETAFIY----QDPVTGEPVSYFKRDIKDV-I 255 (725) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~vrv~-------------E~w~~~~~~~~~~~~----~d~~~g~~~~~~~~~~~~~-~ 255 (725) ...............+.+.+.+...+. ..|+-.. +.++.++ ..+.+..++.+...+.... + T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~ 303 (776) T protein:vir:93 225 RAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYA-RKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVF 303 (776) T ss_pred chHHHHHhhhhcccccchhcccccccccccccccccccccccccccC-CCeEEEEEEEEeeeeehhhcccccccccceee Confidence 111111111010001111111111111 1111111 1111111 1111122211111111110 0 Q ss_pred HHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCc--cc--eEEEEeeeeccCCcccc-chhhhhhhhHHHHH Q lcl|NC_013059. 256 DDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEH--IP--IVPVFGEWGFVEDKEVY-EGVVRLTKDGQRLR 330 (725) Q Consensus 256 ~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~--~p--~vP~~g~~~~~d~~~~~-~G~vr~~kd~Q~~~ 330 (725) +.....-...+....+..++...+.+-...++ +....-.+. || .+||++.+ +...+ -|+...+.+.=+.. T Consensus 304 d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~-g~~~l~~~~~p~~~~~~Pfv~~~----~~~~~~~~~~~G~v~~~~d~ 378 (776) T protein:vir:93 304 DPNDERHVLEVESGRAVLAVSPMMRMHCAIMT-TRDLMWAGPSPYRHNRYPFTPIW----GFRRARDGMPYGVIRFMRGM 378 (776) T ss_pred cccchHHHHHhhcCceeehheeeeeeEEEEEe-cchhhhccCCCCCCCccceEEec----CceecccccccchHHhhhHH Confidence 10000011111222222221111222222222 222221111 22 44665542 22221 23333333322222 Q ss_pred HHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCcccccc--CCcccCCCCchHHHHHHHHHH Q lcl|NC_013059. 331 NMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQ--PLAYYENPEVPQANAYMLEAA 408 (725) Q Consensus 331 N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~~~~~~ll~~~ 408 (725) -....+.. +. ..+++....+-..+...+.. .-.......+++.+... ....+.....++-...+++.. T Consensus 379 Q~~~N~~~---s~---~~~~l~~~~~~~~~gav~~~----d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (776) T protein:vir:93 379 QDDVNKRL---SK---ALYILSTNKVLMEEGAVDDI----DEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELA 448 (776) T ss_pred HHHHHHHH---HH---HHHhhcCCceeeccccccch----HHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHH Confidence 22222211 11 12222222211100000000 00111112233333222 222333333344445566666 Q ss_pred HHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccC-CCc Q lcl|NC_013059. 409 TAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLED-GSE 487 (725) Q Consensus 409 ~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d-~~~ 487 (725) ......+..++..+-...|..+++.+-.+.......+...+..-++.. .+..+.+..++.... -+. +.+ T Consensus 449 ~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~-~~~~~~~~~~~l~li---------~~~~~~~ 518 (776) T protein:vir:93 449 SRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNL-RLAFQQHGEKELSLI---------EQYMTEE 518 (776) T ss_pred HHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH---------HHhcCcc Confidence 666666666544333434443333322222222222222222222222 222222233222222 111 234 Q ss_pred ceEEeccccccccCCceeeeccccccceEEE---EeccCchhHH-HHHHHHHHHHHHhcccccchHHHHHHHhhccCCch Q lcl|NC_013059. 488 KEVQLMAEVVDLATGERQVLNDIRGRYECYT---DVGPSFQSMK-QQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGK 563 (725) Q Consensus 488 ~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v---~~~p~~~t~r-~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~ 563 (725) +.+.|..+. .+...|.+|+-.-.-|+.+ ++..+..... .+.-+.+..|++.++.. .+.+-.. T Consensus 519 r~~ri~~~~---~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~-----------~p~~~~~ 584 (776) T protein:vir:93 519 KQFRITNSR---GNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKM-----------PPEIALT 584 (776) T ss_pred eEEEEeecC---CCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhc-----------ChhhHHH Confidence 555443211 1224666776433455543 5555554433 33333333333322110 0000001 Q ss_pred hHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 564 GVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQN 643 (725) Q Consensus 564 ~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~ 643 (725) ....+++... -+...+..+.+.+.++..... +......+....+.++..++++.+ ..+.++...++++ T Consensus 585 ~~~~~~e~~d---------~p~~~e~~~~l~~~~~~~~p~-q~~~~~e~~~~qq~q~~~~q~q~~--~~~a~~~~~qa~a 652 (776) T protein:vir:93 585 MLDLLVENMD---------IPNRDELVKRIRAVNGQKDPD-QDEPTPEEIAREQAQQQQQQYNDA--LAIATLEEQQAKA 652 (776) T ss_pred HHHHHHHhcC---------ccchHHHHHHHHHhhcccccc-hhhcchhHHHHHHHhhHHHHHHHH--HhhhhhhHhhHHH Confidence 1111111110 001111111111111000000 000000000000111111111111 1111111112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcCCc--- Q lcl|NC_013059. 644 QLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMEIANILQSQR---QNQPS--- 717 (725) Q Consensus 644 q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q~~~q~~e~~~~~~~~~---~~q~~--- 717 (725) ...+++.....+++... .......+.. .++. +...+++... ..+..+. .....+.+. +..|. T Consensus 653 ~~~~aea~~~~aqa~~~--~~~a~~~~~~--a~q~----a~qa~~~~~~-~~~~a~~---a~~~~~~a~~~~p~~p~~~~ 720 (776) T protein:vir:93 653 RKAAAEAQVAEAKAKHI--SRMAIREGVG--AVKD----ATDAATAIAF-MPELAGL---SDGILRESGWDDPNTPQPAS 720 (776) T ss_pred HHHHHHHHHHhhhhhhh--hhcchhhhhh--hhhh----hhhhhhhhhh-hhhhhhh---hhhhhccccccccccccccc Confidence 22222111111111100 0000000000 0000 0000011000 0000000 001111111 11111 Q ss_pred ccccCCCC Q lcl|NC_013059. 718 GSVAETPQ 725 (725) Q Consensus 718 ~~~~~~~q 725 (725) .+..+.|+ T Consensus 721 ~~~~~~~~ 728 (776) T protein:vir:93 721 AASGMPPA 728 (776) T ss_pred cccCCCCC Confidence 11111111 No 139 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=96.38 E-value=0.00068 Score=37.95 Aligned_cols=462 Identities=10% Similarity=0.003 Sum_probs=165.5 Q ss_pred CC-------c---HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh-cCCCCCHHHHHHHhhcCCCc-ccchHHHHHHHH Q lcl|NC_013059. 1 MA-------D---NKNRLESILSRFDADWTASDEARREAKNDLFFS-RVSQWDDWLSQYTTLQYRGQ-FDVVRPVVRKLV 68 (725) Q Consensus 1 ma-------d---~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~-~G~QW~~~~~~~l~~~grp~-~N~i~~~v~~v~ 68 (725) |- | ....+...+..|+...+...--+..+.+. .|. ....++.+. .+..-..|-+ +|..+.+|+.++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~-~yl~~~~~~~~e~-~Y~~rl~rA~~~n~~~~tl~~l~ 78 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRN-VGLNEPDKAYGEA-RQAEYEAGGIVYNFTRRTLSGMV 78 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhccc-CCCcCCCCCCCHH-HHHHHHhcccCCChHHHHHHHHh Confidence 32 1 11122222222222211111001111111 111 112343333 2333233443 599999999999 Q ss_pred HHHhhCCcceEEecCCcchHHHHHHHHHHHHHH-HHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC-------- Q lcl|NC_013059. 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS-------- 139 (725) Q Consensus 69 g~~~~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~-------- 139 (725) |.--...|.+.+ -+.|..++..+ .+-++.+.-...+|..++.+|.+|+=| ||-...+.. T Consensus 79 G~vfrk~p~~~~----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV--D~P~~~~~T~Ade~~~~ 146 (491) T protein:vir:95 79 GSVMRKEPEINI----------PKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLV--DAPETAAATAAEQNAGL 146 (491) T ss_pred chhhcCCceeec----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEE--ecCCCcccCHHHHHHhc Confidence 988776666532 12245555554 457788999999999999999999655 443222110 Q ss_pred -CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhccc--ccccccCCCe Q lcl|NC_013059. 140 -NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPND--WVFPWLTQDT 216 (725) Q Consensus 140 -~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 216 (725) -|+..- .++.+| .|+.....+.. ..+...++.-. ....+ +.|....... T Consensus 147 ~rPy~~~-----~~~~~I-inW~~~~v~g~--~~L~~v~l~E~--------------------~~~~d~~~~f~~~~~~q 198 (491) T protein:vir:95 147 LNPTIAF-----YTTENI-VNWRLTRVGSV--NRVTMVVLRET--------------------WEYHEPGNEFETKYGEQ 198 (491) T ss_pred CCcEEEE-----echhhh-cCceeeeeCCc--eeeeEEEEEEe--------------------EEeecCCCCcccceEEE Confidence 122221 122223 34443333311 11111111110 00000 1111111223 Q ss_pred EEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEE-eeccccccCCCCCCC Q lcl|NC_013059. 217 IQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI-ITCTAVLKDKQLIAG 295 (725) Q Consensus 217 vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~-~~g~~~l~~~~~~p~ 295 (725) +||-+.+.-..-+.+++.. + ..|.... -+.+.+ -.|. -+- T Consensus 199 yRvL~l~~~g~~~~~v~r~-~-~~g~~~~-----------------------------~~~~~~~~~g~--------~~l 239 (491) T protein:vir:95 199 YRVLDIDTDGNYRQRLFRF-D-AEGGAQE-----------------------------EVVEIYPDLGE--------SLR 239 (491) T ss_pred EEEEeecCCCceEEEEEEE-c-CCCccee-----------------------------eeeeeeecCCC--------ccc Confidence 3332211000000111110 0 0010000 000011 0111 122 Q ss_pred CccceEEEEeeeecc-CCccccchhhhhhhhHHHHHHHHHHHHHH-HHHhcCCcceeechhhcchHHHHHHhhccccccc Q lcl|NC_013059. 296 EHIPIVPVFGEWGFV-EDKEVYEGVVRLTKDGQRLRNMIMSFNAD-IVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYL 373 (725) Q Consensus 296 ~~~p~vP~~g~~~~~-d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 373 (725) +.+|||||.+..... .++...+++... . .-.++.|.-.+ ++... ..+..+..+..+.........+.....+ T Consensus 240 ~~IPfv~~~~~~~~~~~~~pPLl~LA~l----n-i~Hy~~ssd~~~~l~~~-~~P~l~~~G~d~~~~~~~~~~~~~~i~~ 313 (491) T protein:vir:95 240 GVIPFTFIGATNNDATIDDAPLLPLAEL----N-IGHYRNSADNEESSFVV-GQPTLFIYPGDNLTPQSFKEANPNGIKF 313 (491) T ss_pred CeeEEEEEecCCCCCCCCcCchHHHHHH----H-HHHhhhhhHHHHHHHHc-ccceeeeecCcccCcchhhccCcceeEe Confidence 567777665432211 122212222221 1 11222222222 33333 3333332222111111222111111111 Q ss_pred cccccccCcc-c-cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 374 LNRTDENNGE-M-PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQD 451 (725) Q Consensus 374 ~~~~~~~~g~-~-~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~d 451 (725) + .+.+. + ......+++.....- ....|......|. ..|. .+...+.+.|+.+......+....|..+.. T Consensus 314 g----~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~-~~Ga---~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~ 384 (491) T protein:vir:95 314 G----SRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAI-QIGA---QLITPSQQITAESARIQRGADTSVMATIAR 384 (491) T ss_pred c----CcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHH-HHHH---HhccCCcchhHHHHHHHHHHhhHHHHHHHH Confidence 1 11111 1 112334444432222 1222333222222 2332 222233346777777777777777777888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHH Q lcl|NC_013059. 452 NLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) Q Consensus 452 n~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~ 531 (725) |+..+. +.+|.++..|.... ++..--+.+|+ .|++. +-. .+. T Consensus 385 ~~e~al----~~~l~~~a~w~G~~--------~~~~v~i~~n~------------------dF~~~----~~~----~~~ 426 (491) T protein:vir:95 385 NVSQAY----TDALRWVAMMLGKP--------EDSEVEFQLNM------------------DFFLQ----PMT----AQD 426 (491) T ss_pred HHHHHH----HHHHHHHHHHcCCC--------CCCceEEEeec------------------ccccc----cCC----HHH Confidence 887775 56677888887542 11111122332 12211 011 122 Q ss_pred HHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHh Q lcl|NC_013059. 532 RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQG 602 (725) Q Consensus 532 ~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~ 602 (725) +.+|.++..+ +..........+.-...+|.. .+++.+++..+..+.....+...+..+ .++..+. T Consensus 427 ~~all~~~~~-G~is~~t~~~~L~~~~vl~~~-~e~~~~~ie~~~~~~~~~~~~~~~~~~----~~~~~~~ 491 (491) T protein:vir:95 427 RAAWMADINA-GLLPATAYYAALRKAGVTDWT-DEDILNAIEDAPLPSGAVTQVAGEIPQ----AAQQQQE 491 (491) T ss_pred HHHHHHHHhc-CCCCHHHHHHHHHhCCCCCcc-HHHHHHHHHhcCCCCCccccccccchh----hhhhccC Confidence 3344443332 111111111112211223322 345555554443333222222222221 1111111 No 140 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=96.08 E-value=0.001 Score=36.94 Aligned_cols=464 Identities=9% Similarity=-0.043 Sum_probs=170.9 Q ss_pred CCcHHHHH---HHHH---HHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhc-CCCc-ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADNKNRL---ESIL---SRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQ-YRGQ-FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~~~~~---~~~~---~~~~~~~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~-grp~-~N~i~~~v~~v~g~~~ 72 (725) |.|-...| .... ..++.++.....+|.....-+-...+.+++.+..+.-+.+ .|-+ +|.++.+++.++|.-- T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 98732222 2222 2234444555666654432222244667776544433332 3444 5999999999999887 Q ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHH-HHhcChhHHHHHHHHHHHhcCcceEEEEeeeccC-CCCC----------- Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAADVLMGMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ-SPTS----------- 139 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~-~~~~----------- 139 (725) ...|.+. +-..|..++..+ .+-++.+.-...+|..++..|.+|+=| ||-.. ++.. T Consensus 81 ~k~p~~~----------~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV--D~P~~~~~~~~t~a~~~~~~~ 148 (501) T protein:vir:95 81 MRDPVVK----------VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLV--DYPTTEAEGGASIADLEAGRI 148 (501) T ss_pred cCCccee----------CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE--eecCCCCcccccHHHHHhccC Confidence 6665543 123344555544 456688999999999999999999655 55322 1111 Q ss_pred CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEE Q lcl|NC_013059. 140 NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQI 219 (725) Q Consensus 140 ~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv 219 (725) -|+... ..+.+| +|+.....+.. ..+...++. +.+-+ .++.|.......+|| T Consensus 149 rPy~~~-----~~~~~I-inW~~~~v~g~--~~l~~v~l~------E~~~~--------------~d~~f~~~~~~q~Rv 200 (501) T protein:vir:95 149 RPTLYV-----YSPTEI-INWRTTDRGAE--EVLSLVVLF------ETWCA--------------ADDGFEMKTSGQFRV 200 (501) T ss_pred CcEEEE-----ecHhhh-cCcceeccCCc--eeeeEEEEE------EEEee--------------cCCCcccceeEEEEE Confidence 022211 122233 34443333311 111111110 10000 000111111122222 Q ss_pred ----------EEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccC Q lcl|NC_013059. 220 ----------AEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKD 289 (725) Q Consensus 220 ----------~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~ 289 (725) +++|.+...+. .+ .+++.. |.. ..-..|. +.+ T Consensus 201 L~~~~~g~~~~~v~r~~~~~~-----~~--~~~~~~----------------~~~--------~~~~~~~-------~~~ 242 (501) T protein:vir:95 201 LRLDEEGYYVHEIWREPQPTK-----AD--GSKIPK----------------GNY--------QQYVVYK-------PTD 242 (501) T ss_pred EeeCCCceEEEEEEEecCCcc-----cC--cceecC----------------Ccc--------cccceee-------eec Confidence 23333221100 00 000000 000 0000011 111 Q ss_pred CCCCCCCccceEEEEeeeecc-CCccccchhhhhhhhHHHHHHHHHHH-HHHHHHhcCCcceeechhhcchHHHHH-Hhh Q lcl|NC_013059. 290 KQLIAGEHIPIVPVFGEWGFV-EDKEVYEGVVRLTKDGQRLRNMIMSF-NADIVARTPKKKPFFWPEQIAGFEHMY-DGN 366 (725) Q Consensus 290 ~~~~p~~~~p~vP~~g~~~~~-d~~~~~~G~vr~~kd~Q~~~N~~~s~-~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~ 366 (725) ..-.+-+.+|||+|...+... .++...+++. +..- -.++.|. ..+++... ..+..+..|- +..+ +.. T Consensus 243 ~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA----~lni-~hy~~ssd~~~~l~~~-~~P~l~i~G~----~~~~~~~~ 312 (501) T protein:vir:95 243 AQGKRLTEIPFMFIGSENNDSNPDNPNFYDLA----SLNM-AHYRNSADYEESCYIV-GQPTPVLIGL----TEEWVTNV 312 (501) T ss_pred cCCCcCCeeeEEEEecCCCCCCCCccchHHHH----HHHH-HHHhhhhHHHHHHHHc-ccceeeeeCC----cccccccC Confidence 111233557776553332211 1122223332 2221 1122222 22233332 2333333322 1121 111 Q ss_pred cccccccc--ccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHH Q lcl|NC_013059. 367 DDYPYYLL--NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADL 444 (725) Q Consensus 367 ~~~~~~~~--~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~ 444 (725) ...+..++ +.+..+. ...++++++..-.- ....|+...+.|.. .|. ..+...+.+.||.+......+... T Consensus 313 ~~~~i~~G~~~~~~lP~----~~~~~~ie~~~~~i-~~~~l~~l~~~m~~-~Ga--~ll~~~~~~~Ta~~~~~~~~~~~S 384 (501) T protein:vir:95 313 LKGSVNFGSRGGIPLPV----GADAKLLQASENTM-LKEAMDTKERQMVA-LGA--KLVEQKEVQRTATEAELEAASEGS 384 (501) T ss_pred CCCceeecccccccCCC----CCceeEEecChhhH-HHHHHHHHHHHHHH-HHH--hhccCCccchhHHHHHHHHHHHhH Confidence 11111111 1111111 22345555422111 13345544444433 342 233333444677777777777766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCc Q lcl|NC_013059. 445 ETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSF 524 (725) Q Consensus 445 ~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~ 524 (725) .|..+..|+..+. +.+|.++..|.... + +.. -|.||+ .|+... -. T Consensus 385 ~L~~~a~~le~al----~~~l~~~a~w~g~~------~--~~~-~v~i~~------------------df~~~~----~~ 429 (501) T protein:vir:95 385 TLSSATKNVSAAF----EWALKWAARWVGQA------D--SGV-KFELNT------------------DFDIAR----MT 429 (501) T ss_pred HHHHHHHHHHHHH----HHHHHHHHHHcCCC------C--Cce-EEEEec------------------cccccc----CC Confidence 7778888887775 55777788886421 1 111 133332 121110 01 Q ss_pred hhHHHHHHHHHHHHHHhcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhh-hccchhhhHHHHHHHHHHHhh Q lcl|NC_013059. 525 QSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK-KPETPEEQQWFVEAQQAKQGQ 603 (725) Q Consensus 525 ~t~r~~~~~~l~ell~~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~-~~~~~e~~~~~~q~~q~qq~q 603 (725) .+.+++|.++..+ +..........+.-...++. ..+...+++.......... .+........-....- -.+ T Consensus 430 ----~~~~~al~~~~~~-G~is~~t~~~~L~~~~v~~~-~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~--~~~ 501 (501) T protein:vir:95 430 ----PDERRSLVEEWQK-GAITFEEMRTGLRKAGVATE-DDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVG--NSE 501 (501) T ss_pred ----HHHHHHHHHHHhC-CCCcHHHHHHHHHhCCCCCh-hHHHHHHHHHhhhcCcccccccCCCCCCCccccccc--CCC Confidence 1223334443321 11111111111111111111 1122222222111110000 0000000000000000 000 No 141 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=95.36 E-value=0.0022 Score=35.13 Aligned_cols=579 Identities=10% Similarity=-0.016 Sum_probs=151.6 Q ss_pred CCcHHH-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC-----CCHHHHHHHhhcCCCcccchHHHHHHHHHHHhhC Q lcl|NC_013059. 1 MADNKN-RLESILSRFDADWTASDEARREAKNDLFFSRVSQ-----WDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQN 74 (725) Q Consensus 1 mad~~~-~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~G~Q-----W~~~~~~~l~~~grp~~N~i~~~v~~v~g~~~~n 74 (725) |||... .++-++..-........+|-+.+.. .|.. |+. .+ T Consensus 113 ~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~-----~~~gv~k~~W~~-----------------------------~~ 158 (763) T protein:vir:95 113 GARQNELVLNYQFRTKLNRVSFIDNYVRSVVD-----DGTGIVRVGWNR-----------------------------EI 158 (763) T ss_pred HHHHHHHHHHHHHhhcCchhhHHHHHHHHHhh-----cCcceEEEeeee-----------------------------ee Confidence 332211 1111111111111111122211111 1222 221 11 Q ss_pred CcceEEecC-----CcchHH---HHHHHH-HHHHHHHHhcChhHHHHHHHHHHHh---------cCcceEEEEeeeccCC Q lcl|NC_013059. 75 PIDVLYRPK-----DGASPD---AADVLM-GMYRTDMRHNTAKIAVNIAVREQIE---------AGVGAWRLVTDYEDQS 136 (725) Q Consensus 75 r~~~~~~pr-----~~~d~~---~Ae~l~-~~~~~~~~~~~~~~~~s~a~~~~~~---------~G~G~~~v~~~~~~~~ 136 (725) |.+....+. +.-.+. .++.+- ...+..-..-+.+.....++..... .|+.++.+...... . T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~-~ 237 (763) T protein:vir:95 159 RKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLAN-H 237 (763) T ss_pred eeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecC-c Confidence 111111110 000011 111111 1111111111122222233333333 33444444222111 0 Q ss_pred CC---CCceeEEEEee----ecchhheeeCCCccccChhcccceeeeecCCHHHHHH----Hhhhc-CC-----cchhhh Q lcl|NC_013059. 137 PT---SNNQVIRREPI----HSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWED----FAEKF-DL-----DADDIP 199 (725) Q Consensus 137 ~~---~~~~~ir~~~~----~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~----~~p~~-~~-----~~~~~~ 199 (725) |. -.+..+.++|- ..+...+++--.-+.-|+-+..+.+ .|++.-.... ..+.+ .. +..+.. T Consensus 238 p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 315 (763) T protein:vir:95 238 PTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRY--HNLNKIDWQSSAPVNEPDHATTTPQEFQISDPM 315 (763) T ss_pred eEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCc--cccchhcchhccccccccccccchhhccCCCcc Confidence 00 00001111111 0111122222112222332222211 1211100000 00000 00 000000 Q ss_pred hhhhcccccccccCCCeEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEE Q lcl|NC_013059. 200 SFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKS 279 (725) Q Consensus 200 ~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 279 (725) ...-....+|.-++.+...+.++++.... |..+.....+ + T Consensus 316 ~~~V~v~E~y~~~d~~gdg~~~~~~v~~~------------g~~iL~~~~~-------------------p--------- 355 (763) T protein:vir:95 316 RKRVVAYEYWGFWDIEGNGVLEPIVATWI------------GSTLIRLEKN-------------------P--------- 355 (763) T ss_pred cceEEEEEeeeeeccCCcceeEEEEEEEE------------cCeeeecccc-------------------c--------- Confidence 00000001111111122223332211111 1111110000 0 Q ss_pred EeeccccccCCCCCCCCccceEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHHHHHHHH---HhcCCcceeechhhc Q lcl|NC_013059. 280 IITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIV---ARTPKKKPFFWPEQI 356 (725) Q Consensus 280 ~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~---~~~~~~~~~~~~~~i 356 (725) +...-+|+-.||++|..+.. -|.++..-+...=...-.+.|...--+.-.. .....+.+ ...+.. T Consensus 356 --------~~~~~~PFv~~~~~p~~~~~---~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav-~~~d~~ 423 (763) T protein:vir:95 356 --------YPDGKLPFVLIPYMPVKRDM---YGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGML-DALNSR 423 (763) T ss_pred --------ccCCCcCEEEecceeecCcc---cCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccc-cchhhh Confidence 00112344445555553321 1222222221111111111221111110000 00111111 111111 Q ss_pred chHHHHHHhhcc-ccccccccccccCccccccCCcccCCCCchHHHHHHHHHHHHHHHHHh----CCChHHhccCcchhH Q lcl|NC_013059. 357 AGFEHMYDGNDD-YPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVA----TLGVDAEAVNGGQVA 431 (725) Q Consensus 357 ~~~~~~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~t----Gv~~~~~G~~~n~~S 431 (725) . ........ .++. ..... ..+...-+.++-+...+++++.....+..++ |++...+|...+++| T Consensus 424 ~---~~pg~v~~v~~g~---~~~~~-----~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~ 492 (763) T protein:vir:95 424 R---YREGEDYEYNPTQ---NPAQM-----IIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIR 492 (763) T ss_pred c---ccCCceEEeeCCC---Chhhh-----cccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHH Confidence 1 00000000 0111 11111 1112222345666777777776666555444 777777886667788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCcEEEEeccCCCcce-EEeccccccccCCceee Q lcl|NC_013059. 432 YDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDI----YDVPRNVVITLEDGSEKE-VQLMAEVVDLATGERQV 506 (725) Q Consensus 432 g~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~----y~~~r~irI~~~d~~~~~-v~in~~~~d~~~g~~~~ 506 (725) +. +++.+.+...-+..|.+.++...+.+..++..+...- .+.+.++.|+.++-..+| |.+. T Consensus 493 ~l-~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~------------- 558 (763) T protein:vir:95 493 GV-LDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVD------------- 558 (763) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEe------------- Confidence 85 5555566666678888888888888888888753221 012234445433311122 1111 Q ss_pred eccccccceEEEEeccCchhHHHHHHHHHHHHH-Hhccccc-chHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhc Q lcl|NC_013059. 507 LNDIRGRYECYTDVGPSFQSMKQQNRAEILELL-GKTPQGT-PEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKP 584 (725) Q Consensus 507 ~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell-~~~~~~~-p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~ 584 (725) +... .... .+.+.+..|.+++ +.+++.+ +.+...+..+....+. ...+.............. T Consensus 559 ---------~~~a-s~~~--q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~--~~~lr~~q~~~d~~~q~q-- 622 (763) T protein:vir:95 559 ---------ISTA-EVDN--QKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKL--AHDLRTWQPQPDPVQEQL-- 622 (763) T ss_pred ---------cccc-hHHH--HHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhh--HHHHHhcCCCccchhhhH-- Confidence 0000 1111 2344555555544 3334333 2223333333332221 122222111111111110 Q ss_pred cchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_013059. 585 ETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMD-LSKQS 663 (725) Q Consensus 585 ~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~-~~~~~ 663 (725) .+.+..+ ++.+.+..+ .++++.++++.....+++..+++...+..+.+ ..++.+...++.. ...+.+ +..+. T Consensus 623 aqle~~~-~q~e~~~~~--akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q---~~~e~~~~~~~~e-aq~~l~~~~a~~ 695 (763) T protein:vir:95 623 KQLAVEK-AQLENEELR--SKIRLNDAQAQKAMAERDNKNLDYLEQESGTK---HARDLEKMKAQSQ-GNQQLEITKALT 695 (763) T ss_pred HHHHHHH-HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 0111111 111111111 11122222222223333333332222222111 1112111111100 000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-cccccCCCC Q lcl|NC_013059. 664 EFREFLKTVASFQQDRSEDAR-ANAELLLKGDEQTHKQRMEIANILQSQRQNQP-SGSVAETPQ 725 (725) Q Consensus 664 ~~~e~~~~~~~~q~~~~~~a~-~~aE~~~~~~~q~~~q~~e~~~~~~~~~~~q~-~~~~~~~~q 725 (725) .+.+.++...++ +....+.+ ..++.. ..+.. .+. ......++.+ .+++.+.|- T Consensus 696 ~~~~ea~~~~~~-~~~~~~~~~~~~~~~---~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~ 750 (763) T protein:vir:95 696 KPRKEGELPPNL-SAAIGYNALTNGEDT---GIQSV-SER----DIAAEANPAYSLGSSQFDPT 750 (763) T ss_pred HHHHHhccChhH-HHhhhhcccccccCC---Cccch-hhc----ccCccccccccCCCCCCCCC Confidence 000000000000 00000000 001000 00000 000 0111111212 223455555 No 142 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=94.78 E-value=0.0035 Score=34.03 Aligned_cols=550 Identities=14% Similarity=0.106 Sum_probs=202.3 Q ss_pred CCc-----------HHHHHHHHHHH-HHHHHhhhHHHHHH----H-HHHHHhhc--CCCCCHHHHHH--HhhcCCCc--c Q lcl|NC_013059. 1 MAD-----------NKNRLESILSR-FDADWTASDEARRE----A-KNDLFFSR--VSQWDDWLSQY--TTLQYRGQ--F 57 (725) Q Consensus 1 mad-----------~~~~~~~~~~~-~~~~~~~~~~~r~~----a-~~d~~f~~--G~QW~~~~~~~--l~~~grp~--~ 57 (725) ||= ..++.++++.. ++.-.+....+... + ..|..|.+ --|=..+.+-+ ..+-.-|| | T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~ 80 (666) T protein:vir:96 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVV 80 (666) T ss_pred CccCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeecccccccccceee Confidence 551 12334443332 22223333221111 1 12333332 11111122211 11112243 3 Q ss_pred --cchHH----HHHHHHHHHhhC-CcceEEec--CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEE Q lcl|NC_013059. 58 --DVVRP----VVRKLVSEMRQN-PIDVLYRP--KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRL 128 (725) Q Consensus 58 --N~i~~----~v~~v~g~~~~n-r~~~~~~p--r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v 128 (725) |.+.| .|.+++||-..- -.-.-+.| ..|+..+-||.|.+++..-+....+-...-=..+|+++--+..||+ T Consensus 81 ~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET 160 (666) T protein:vir:96 81 NKATVNPIVISQVQSMTAYLTEVFASGYPILPVVSTPDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLVGWET 160 (666) T ss_pred ccccCCchhhhhHHHHHHHHHHHHhcCCccceeecCCchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhcceeeeee Confidence 44444 467788865432 01112233 3677788999999999876665555555555566777766666664 Q ss_pred Eeeecc----------CCCCCCceeEEEEeee------cchhheeeCCCccccChh-cccceeeeecCCHHHHHHHhhhc Q lcl|NC_013059. 129 VTDYED----------QSPTSNNQVIRREPIH------SACSHVIWDSNSKLMDKS-DARHCTVIHSMSQNGWEDFAEKF 191 (725) Q Consensus 129 ~~~~~~----------~~~~~~~~~ir~~~~~------~~~~~v~~Dp~a~~~d~s-Da~~~~~~~~~~~~~~~~~~p~~ 191 (725) ||.. +|..+...+.|+.+.| .+++++||||..--+|.. -..|+..+..+++-+++...--+ T Consensus 161 --~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~L 238 (666) T protein:vir:96 161 --EWSNIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYL 238 (666) T ss_pred --ccccccccchhhhhhcCCCceeeeccchhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHHHhhh Confidence 2321 2444455666666654 367889999987666633 45688888888887776543221 Q ss_pred CCcc------hh----hhhhhhcccccc----------------c---c--------cCCCeEEEEE--EEEEecceeEE Q lcl|NC_013059. 192 DLDA------DD----IPSFQNPNDWVF----------------P---W--------LTQDTIQIAE--FYEVVEKKETA 232 (725) Q Consensus 192 ~~~~------~~----~~~~~~~~~~~~----------------~---~--------~~~~~vrv~E--~w~~~~~~~~~ 232 (725) -++- .. ..++...+..+. + | +...+|-|-| +|.|.- -.+ T Consensus 239 T~EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~--mY~ 316 (666) T protein:vir:96 239 TNEKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHT--MYL 316 (666) T ss_pred hcchhhhHHHHHHHHHhhhccccccccCCcccccccccchhhccchhhcCcccccccccccccccccccceeeee--eee Confidence 1100 00 000000100000 0 1 0011122211 122210 000 Q ss_pred EEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecccccc-CCCCCCCCccceEEEEeeeeccC Q lcl|NC_013059. 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGFVE 311 (725) Q Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~-~~~~~p~~~~p~vP~~g~~~~~d 311 (725) .+++..|. + -......+..+ +..++.|+.++- ++--..|++||+- +|+.+ -| T Consensus 317 RI~PSDF~--~--------------------~~P~~N~~QIW--K~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~L-ED 369 (666) T protein:vir:96 317 RIIPSDFE--M--------------------NVPNRNQVQIW--KAVMINRDAIISFEPYIGAYGSFGMG--LAFAL-ED 369 (666) T ss_pred eeccccce--e--------------------cCCCCCcceee--eeeeeccceeEeeehhhcccchhhhh--hhhhh-hh Confidence 01110000 0 00001111122 234567777763 2333367778764 45543 46 Q ss_pred Ccccc-chhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHH-----hhccccccccccccccCcccc Q lcl|NC_013059. 312 DKEVY-EGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD-----GNDDYPYYLLNRTDENNGEMP 385 (725) Q Consensus 312 ~~~~~-~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-----~~~~~~~~~~~~~~~~~g~~~ 385 (725) |.++- .|+-...++-|+.-.+. +++.-.+..+.+.+..-.+. .+.. .....+.++..+-...+|.+ T Consensus 370 GmG~QTQ~~~E~~~P~Q~A~t~L-----~N~~~~~aRRAV~DRAl~~~--S~i~a~~iNSP~~~~KIP~~~~sL~N~~m- 441 (666) T protein:vir:96 370 GMGLQTQGYGEMAAPLQSATTEL-----WNAYIQGARRAVMDRALYNP--SMIRANDINSPIPQIKIPVVPQSLVNGTM- 441 (666) T ss_pred ccccccccccccccchhhhhhHH-----hhhhhhhhhhhhhhhhhcch--hhhhhhcccCCCCCcccceeehhhhccch- Confidence 66542 57777889999865443 23333333332222111111 1111 11111222211111111111 Q ss_pred ccCCcccCCCCchHHHHHHHHHH---HHHHHHHhCCChHHhcc--CcchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_013059. 386 TQPLAYYENPEVPQANAYMLEAA---TAAVKEVATLGVDAEAV--NGGQVAYDTVNQLNMRADLETYV--FQDNLATAMR 458 (725) Q Consensus 386 ~~~~~~~~~~~~~~~~~~ll~~~---~~~i~~~tGv~~~~~G~--~~n~~Sg~ai~~~q~q~~~~~~~--~~dn~~~~~~ 458 (725) ++.- -+-|-..-|.-..|+.+ .+--++++|.|..-.|+ .||-+- +--...+-.++.++.. ++-..+ .+. T Consensus 442 ~~~Y--~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~-~E~~~~MG~a~NRmRLPALiLEH~-~F~ 517 (666) T protein:vir:96 442 DQAY--RQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTR-AEFDTIMGNAENRMRLPALILEHR-MFT 517 (666) T ss_pred hhhh--ccCCccccchhHHHhhhHHHhhhHHHhhccCCcccccccccCcce-eehhhhcCCcccceehhhHHHhhh-hhh Confidence 1111 11122223333444433 34456788999888886 344321 0001111111222111 111111 111 Q ss_pred HHHHHHHHHHHHhcC-CCcEEEEeccCCCcceEEeccccccccCCcee--eecccc-ccceEEEEeccCchhHHHHHHHH Q lcl|NC_013059. 459 RDGEIYQSIVNDIYD-VPRNVVITLEDGSEKEVQLMAEVVDLATGERQ--VLNDIR-GRYECYTDVGPSFQSMKQQNRAE 534 (725) Q Consensus 459 ~~g~~ll~li~~~y~-~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~--~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~ 534 (725) -+-+ +|.|..=.|. +..||- +.+|+.+ -+..++ -.+.+-+.+|..-. .|.+.-+. T Consensus 518 ~iK~-~L~LNl~~YG~DT~ViS-------------------~RtG~~~~vDi~~L~~~~L~F~~~DGlTP~-SKlASs~~ 576 (666) T protein:vir:96 518 KIKE-QLKLNLLMYGEDTEVIS-------------------PRTGKGVRVDIKELQDLGLKFELGDGLTPA-SKLASSDF 576 (666) T ss_pred hHHH-HHhhhhhhccccchhcc-------------------cccCceeeeeHHHHhhhhheeeeccCCCch-hhhhhhHH Confidence 1112 2333322333 323321 2222221 111122 12344445554333 35444444 Q ss_pred HHHHHHh----------cccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhH Q lcl|NC_013059. 535 ILELLGK----------TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQ 604 (725) Q Consensus 535 l~ell~~----------~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~ 604 (725) ++-+|++ .++-.|-+..-+ +.+. ++..+-+..+..+++=++.--....-++...+.+++-..|. T Consensus 577 lT~~LQMI~sS~~~~~A~G~~~P~M~AHl---~QLG---GVRG~E~Y~~~ALPqwqitygm~Q~LQ~~~LQ~~~QSA~Q~ 650 (666) T protein:vir:96 577 LTALLQMIMSSETTLQAFGTQVPGMIAHL---AQLG---GVRGFEKYANAALPQWQITYGMQQQLQQMLLQLQQQSAMQL 650 (666) T ss_pred HHHHHHHHhcchhhHhhhcccchHHHHHH---HHhc---cccchhhcccccCcchhhhhhhhHHHHHHHHHHhhhhcccc Confidence 4443332 222223222211 2222 23333333332222111111001111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 605 DPAMVQAQGVLLQGQAELAKAQNQTLSLQ 633 (725) Q Consensus 605 q~~~~~~qa~~~k~qae~~kaqae~~k~q 633 (725) ++. |.++--.|+.- .| T Consensus 651 ~A~-----------Q~~L~~~Q~~P--Sq 666 (666) T protein:vir:96 651 QAR-----------QGELSNDQSQP--SQ 666 (666) T ss_pred ccc-----------cccCcccccCC--CC Confidence 000 01110000000 00 No 143 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=93.73 E-value=0.0066 Score=32.55 Aligned_cols=444 Identities=11% Similarity=-0.001 Sum_probs=167.7 Q ss_pred CC--cHHHHHHHHHHHHHHH---HhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHhhcCCCc-ccchHHHHHHHHHHHhhC Q lcl|NC_013059. 1 MA--DNKNRLESILSRFDAD---WTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQ-FDVVRPVVRKLVSEMRQN 74 (725) Q Consensus 1 ma--d~~~~~~~~~~~~~~~---~~~~~~~r~~a~~d~~f~~G~QW~~~~~~~l~~~grp~-~N~i~~~v~~v~g~~~~n 74 (725) |- .....+...+..|+.. +.....+|.....-+-.+.|..+. .+..-..|-+ +|..+.+|+.++|.--.. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~----~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k 76 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDD----MYNAYKQRALFYSITSKTLSALSGMVLDQ 76 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHH----HHHHHHhhccCCchHHHHHHHHhchhhcC Confidence 65 2222333333333222 223333332221112222233321 1222223333 599999999999988776 Q ss_pred CcceEEecCCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCCCceeEEEEeeecchh Q lcl|NC_013059. 75 PIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACS 154 (725) Q Consensus 75 r~~~~~~pr~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~~~~~ir~~~~~~~~~ 154 (725) .+.+.+ | +.|..+ ....+-++.+.-...+|..++..|.+++=| ||-.++ +-|+...+ ++. T Consensus 77 ~p~~~~-p---------~~l~~~-~~D~~G~~L~~~~~~~~~~~l~~G~~~ilV--D~p~~g--~rPy~~~~-----~~~ 136 (452) T protein:vir:94 77 PPVITH-P---------DAMSKY-FEDQSGIQFYEVFTRAVEETLLMGRVGVFI--DRPLTG--GDPYISVY-----TTE 136 (452) T ss_pred Cceecc-c---------HHHHHH-HhcccCCCHHHHHHHHHHHHHhcCeEEEEE--eeccCC--CceEEEEe-----chh Confidence 665532 1 122222 224567889999999999999999999766 553322 22443322 222 Q ss_pred heeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCCeEEEEEEEEEecceeEEEE Q lcl|NC_013059. 155 HVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) Q Consensus 155 ~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~vrv~E~w~~~~~~~~~~~ 234 (725) +| +|+.... + -+..+ +.+ ++. ....+..+.| +.+.+....+|...+-...+.+ T Consensus 137 ~I-i~W~~~~-~---g~l~~-v~l------re~--------------~~~~d~~d~f-~~~~~~~yRvL~l~~g~~~v~~ 189 (452) T protein:vir:94 137 NI-LNWEEDE-D---GRLLM-VVL------REF--------------YTVRDTADRY-VQNIRVRYRCLELVDGLLQITV 189 (452) T ss_pred hh-cCccccc-c---CCeeE-EEE------EEE--------------EEEecCCCcc-cceeEEEEEEEEEeCCeEEEEE Confidence 33 2433221 1 11111 100 000 0000011111 1122222222222111111111 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCCCccceEEEEeeeeccCCcc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~~~~p~vP~~g~~~~~d~~~ 314 (725) +... .|.. |.. +...+.+...-+.+.+|||+|.+.+... . T Consensus 190 ~~~~-~~~~----------------------------------~~~--~~~~~~~~~~~~l~~IP~v~~~~~~~~~---~ 229 (452) T protein:vir:94 190 HETQ-DGKV----------------------------------WEL--AKTSTIQNVGVTMDYIPFFCITPSGLSM---T 229 (452) T ss_pred EEcc-CCce----------------------------------eee--ccceeecCCCcccceeEEEEEcCCCCCC---C Confidence 1110 0000 000 0111112223344678888775543211 1 Q ss_pred ccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhccccccccccccccCccccccCCcccCC Q lcl|NC_013059. 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) Q Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 394 (725) ....-.-++-+....+....|-.-+++..+......+ .+- ++.+.. .....+.+..+. ++..+.++++ T Consensus 230 ~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~-~g~----~~~~~i----~iG~~~~~~lpe---~~~~~~yie~ 297 (452) T protein:vir:94 230 PAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWI-TGA----ESQSTM----HIGSTKAWVIPE---VAAKVGFLEF 297 (452) T ss_pred CCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEe-ecC----cCCCce----EecccccccCCC---CCCcceEEcc Confidence 1111133555556555555555555555554443322 221 111110 000011111111 1223556654 Q ss_pred CCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013059. 395 PEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYD 473 (725) Q Consensus 395 ~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~ 473 (725) ..-+ ..+..-|....+.|. ..|. ....+...+.+|+.|...+..+....|..+..|+..+. ..+|.++..|.+ T Consensus 298 ~g~~i~~~~~~l~~le~~m~-~~Ga-~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al----~~~l~~~a~w~g 371 (452) T protein:vir:94 298 TGQGLQSLEKALSEKQAQLA-SLSA-RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALL----NKAYSCIMDMES 371 (452) T ss_pred CchhHHHHHHHHHHHHHHHH-HHHH-HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHH----HHHHHHHHHHcC Confidence 3322 222333444444443 3333 23333333456776665555444566667777777775 677778888876 Q ss_pred CCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHHHHHHHHHHhcccccchHHHHH Q lcl|NC_013059. 474 VPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLL 553 (725) Q Consensus 474 ~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~~~l~ell~~~~~~~p~~~~~~ 553 (725) ..--+ -|.||. .|... +-.+ +.+.+|.++..+ +.......... T Consensus 372 ~~~~~----------~v~~n~------------------dF~~~----~~~~----~~~~al~~~~~~-G~is~~t~~~~ 414 (452) T protein:vir:94 372 MGGTL----------NIKLNS------------------AFLDS----KLTA----AELKAWVEAYLS-GGISKEIYIHA 414 (452) T ss_pred CCCce----------EEEecc------------------ccccc----cCCH----HHHHHHHHHHhc-CCCcHHHHHHH Confidence 43111 233332 12111 0111 223333333321 11111111111 Q ss_pred HHhhccCCchhHHHHH-HHHhhhhhhhhhhhccchhhhH Q lcl|NC_013059. 554 LQYFTLLDGKGVEMMR-DYANKQLIQMGVKKPETPEEQQ 591 (725) Q Consensus 554 ~~~~~~~d~~~~~~i~-e~~~kq~~~~~~~~~~~~e~~~ 591 (725) +.-...+|.+.-.+.+ .....+.....- .+.++-... T Consensus 415 L~~~gvl~~~~e~~~i~~E~~~~~~~~~~-~~~~~~~~~ 452 (452) T protein:vir:94 415 LKVGKVLPPPGESMGVIPDPPAPEPSPSN-TPPNPSSKA 452 (452) T ss_pred HHhCCCCCCccCHHHHHHHhhccCcccCC-CCCCCccCC Confidence 1111222222211111 111101100000 111111100 No 144 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=92.98 E-value=0.0092 Score=31.74 Aligned_cols=554 Identities=14% Similarity=0.085 Sum_probs=203.1 Q ss_pred CCc-----------HHHHHHHHHHH-HHHHHhhhHHHHHH----H-HHHHHhhc--CCCCCHHHHHH--HhhcCCCc--c Q lcl|NC_013059. 1 MAD-----------NKNRLESILSR-FDADWTASDEARRE----A-KNDLFFSR--VSQWDDWLSQY--TTLQYRGQ--F 57 (725) Q Consensus 1 mad-----------~~~~~~~~~~~-~~~~~~~~~~~r~~----a-~~d~~f~~--G~QW~~~~~~~--l~~~grp~--~ 57 (725) ||= ..++.++++.. ++.-.+....+... + ..|..|.+ --|=..+.+-+ ..+-.-|| | T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~ 80 (666) T protein:vir:10 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVV 80 (666) T ss_pred CCcCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeecccccccCcceee Confidence 551 12334443332 22223332221111 1 12333332 11111122211 11112243 3 Q ss_pred --cchHH----HHHHHHHHHhhC-CcceEEec--CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEE Q lcl|NC_013059. 58 --DVVRP----VVRKLVSEMRQN-PIDVLYRP--KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRL 128 (725) Q Consensus 58 --N~i~~----~v~~v~g~~~~n-r~~~~~~p--r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v 128 (725) |.+.| .|.+++||-..- -.-.-+.| ..|+..+-||.|.+++..-+....+-...-=..+|+++--+..||+ T Consensus 81 ~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL~L~D~~KYN~~~~ET 160 (666) T protein:vir:10 81 NKATVNPIVISQVQSMTAYLTEVFASGYPILPVVSTPDKKEQAEALEGIIQDHMTMTSSIPELILCLQDAAKYNLVGWET 160 (666) T ss_pred ccccCCchhhhhHHHHHHHHHHHHhcCCccceeecCCchhHHHHHHHHHHHhhhhhhhhHHHHHHHHhhhhhcceeeeee Confidence 44444 467788865432 01111233 3677788999999999876666555555555567777766666664 Q ss_pred Eeee--------ccCCCCCCceeEEEEeee------cchhheeeCCCccccChh-cccceeeeecCCHHHHHHHhhhcCC Q lcl|NC_013059. 129 VTDY--------EDQSPTSNNQVIRREPIH------SACSHVIWDSNSKLMDKS-DARHCTVIHSMSQNGWEDFAEKFDL 193 (725) Q Consensus 129 ~~~~--------~~~~~~~~~~~ir~~~~~------~~~~~v~~Dp~a~~~d~s-Da~~~~~~~~~~~~~~~~~~p~~~~ 193 (725) -|-. +-+|..+...+.|+.|.| .+++++||||..--+|.. -..|+..+..+++-+++...--+-+ T Consensus 161 ~Ws~IE~~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~ 240 (666) T protein:vir:10 161 EWSHIETYDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTN 240 (666) T ss_pred ccccccccchhhhhhcCCCceeecccchhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhc Confidence 2211 113444455666666654 367889999987666633 4568888888888777654322111 Q ss_pred cc------hh----hhhhhhcccccc----------------c---c--------cCCCeEEEEE--EEEEecceeEEEE Q lcl|NC_013059. 194 DA------DD----IPSFQNPNDWVF----------------P---W--------LTQDTIQIAE--FYEVVEKKETAFI 234 (725) Q Consensus 194 ~~------~~----~~~~~~~~~~~~----------------~---~--------~~~~~vrv~E--~w~~~~~~~~~~~ 234 (725) +- .. ..++...+..+. + | +...+|-|-| +|.|.- -.+.+ T Consensus 241 EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~--~Y~RI 318 (666) T protein:vir:10 241 EKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHT--MYLRI 318 (666) T ss_pred chhhhHHHHHHHHHhhhccccccccCCccCccccccchhhccchhhcCcccccccccccccccccccceeeee--eeeee Confidence 00 00 000011110000 0 1 0011121211 121110 00001 Q ss_pred eeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeecccccc-CCCCCCCCccceEEEEeeeeccCCc Q lcl|NC_013059. 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) Q Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~-~~~~~p~~~~p~vP~~g~~~~~d~~ 313 (725) ++ .++ +. -......+..+ +..++.|+.++- ++--..|++||+- +|+.+ -||. T Consensus 319 ~P------------SDF----~~------~~P~~N~~QIW--K~v~IN~~~iIS~~~~I~AY~~~~~~--~~~~L-EDG~ 371 (666) T protein:vir:10 319 IP------------SDF----EM------NVPNRNQVQIW--KAVMINRDAIISFEPYIGAYGSFGMG--LAFAL-EDGM 371 (666) T ss_pred cc------------ccc----ee------cCCCCCcceee--eeeeeccceeEeeehhhhccchhhhh--hhhhh-hhcc Confidence 11 000 00 00001111222 234567777763 2333367778764 45543 4666 Q ss_pred ccc-chhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHH---HHHHhhccccccccccccccCccccccCC Q lcl|NC_013059. 314 EVY-EGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE---HMYDGNDDYPYYLLNRTDENNGEMPTQPL 389 (725) Q Consensus 314 ~~~-~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 389 (725) ++- .|+-...++-|+.-.+. |++.-.+..+.+.+..-.+..+ +........+.++..+-...+|.+ ++.- T Consensus 372 G~QTQ~~~E~~~P~Q~A~t~L-----~N~~~~~aRRAV~DRAl~~~S~i~a~~iNSP~~~~KIP~~~~sL~N~~~-~~~Y 445 (666) T protein:vir:10 372 GLQTQGYGEMAAPLQSATTEL-----WNAYIQGARRAVMDRALYNPSMIRANDINSPIPQIKIPVVPQSLVNGTM-DQAY 445 (666) T ss_pred ccccccccccccchhhhhhHH-----hhhhhhhhhhhhhhhhccChhhhhhhcccCCCCCcccceeehhhcccch-hhhh Confidence 542 57777889999865443 3333333333222211111100 011111111222211111111111 1111 Q ss_pred cccCCCCchHHHHHHHHHH---HHHHHHHhCCChHHhcc--CcchhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH Q lcl|NC_013059. 390 AYYENPEVPQANAYMLEAA---TAAVKEVATLGVDAEAV--NGGQVAYDTVNQLNMRADLETYV--FQDNLATAMRRDGE 462 (725) Q Consensus 390 ~~~~~~~~~~~~~~ll~~~---~~~i~~~tGv~~~~~G~--~~n~~Sg~ai~~~q~q~~~~~~~--~~dn~~~~~~~~g~ 462 (725) . +-|-..-|.-..|+.+ .+--++++|.|..-.|+ .||-+- +--...+-.++.++.. ++=..+ .+.-+-+ T Consensus 446 ~--~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKGNKt~-~E~~~~MG~a~NR~RLPALiLEH~-~F~~iK~ 521 (666) T protein:vir:10 446 R--QIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKGNKTR-AEFDTIMGNAENRMRLPALILEHR-MFTKIKE 521 (666) T ss_pred c--cCCccccchhHHHhhhHHHHhhHHHhhccCCcccccccccCcce-eehhhhcCCcccceehhhHHhhhh-hhhhHHH Confidence 1 1122223444444433 34456788999888886 344321 0001111111111111 111111 1111112 Q ss_pred HHHHHHHHhcC-CCcEEEEeccCCCcceEEeccccccccCCcee--eecccc-ccceEEEEeccCchhHHHHHHHHHHHH Q lcl|NC_013059. 463 IYQSIVNDIYD-VPRNVVITLEDGSEKEVQLMAEVVDLATGERQ--VLNDIR-GRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) Q Consensus 463 ~ll~li~~~y~-~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~--~~nDi~-g~~Dv~v~~~p~~~t~r~~~~~~l~el 538 (725) +|.|..=.|. +..||- +.+|+.+ -+..++ -.+.+-+.+|..-. .|.+.-+.++-+ T Consensus 522 -~L~LNl~~YG~DT~ViS-------------------~RtG~~~~vDi~~L~~~~L~F~~~DG~TP~-SK~ASs~~lT~~ 580 (666) T protein:vir:10 522 -QLKLNLLMYGEDTEVIS-------------------PRTGKGVRVDIKELQDLGLKFELGDGLTPA-SKLASSDFLTAL 580 (666) T ss_pred -HHhhhhhhccccchhcc-------------------cccCceeeeeHHHHhhhhheeeeccCCCch-hhhhhhHHHHHH Confidence 2333333333 333321 2222221 111122 12344445554333 344444444433 Q ss_pred HH----------hcccccchHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHH Q lcl|NC_013059. 539 LG----------KTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAM 608 (725) Q Consensus 539 l~----------~~~~~~p~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~ 608 (725) |+ +.++-.|-+..- ++.+.-.-+.++....+--+- ++.-..+..-++...+.+++-..|.++. T Consensus 581 LQMI~sS~~~~~A~G~~~P~M~AH---~~QLGGVRG~E~Y~daalP~~---~~~~~~~Q~LQ~~~LQ~~~QSA~Q~~A~- 653 (666) T protein:vir:10 581 LQMIMSSETTLQAFGTQVPGMIAH---LAQLGGVRGFEKYADAALPQW---QITYGMQQQLQQMLLQLQQQSAMQLQAR- 653 (666) T ss_pred HHHHhhhhhhHhhhcccchHHHHH---HHHhccccchhhhhhccCCcc---ccccchhHHHHHHHHHHhhhhhcccccc- Confidence 33 233323322222 223333344444433211100 0000000111111111111111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 609 VQAQGVLLQGQAELAKAQNQTLSLQ 633 (725) Q Consensus 609 ~~~qa~~~k~qae~~kaqae~~k~q 633 (725) |.++--.|+.- .| T Consensus 654 ----------Q~~L~~~Q~~P--Sq 666 (666) T protein:vir:10 654 ----------QGELSNDQSQP--SQ 666 (666) T ss_pred ----------cccCcccccCC--CC Confidence 11110000000 00 No 145 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=82.48 E-value=0.077 Score=26.70 Aligned_cols=149 Identities=8% Similarity=0.018 Sum_probs=15.5 Q ss_pred CCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 560 LDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKV 639 (725) Q Consensus 560 ~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~q~ea~~~ 639 (725) |.+..+++..+..+.++ .+...++.+.... ..+...+......+....+.++...+ .+++.... T Consensus 1 Mki~elk~el~~~~~el----------~~~~~elr~~~~~-~~~~~~el~~~~~e~~~~~~ei~el~-----~~l~~~~~ 64 (437) T protein:vir:10 1 MKIEKLKKDLATKTAEL----------NTKKAEIRSFTES-EDKTIDEVKAGMTEIKEKEDEIKEIR-----SNIEVLEQ 64 (437) T ss_pred CCHHHHHHHHHHHHHHH----------HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHH Confidence 33222221111111000 0000000000000 00000011111111111111111111 11111111 Q ss_pred HHHHHHHHHHHHHH--HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----H-----HHHHHHHHHH--- Q lcl|NC_013059. 640 EAQNQLNAARIAEI--FNN-MDLSKQSEFREFLKTVASFQQDRSEDARANAELLLK-----G-----DEQTHKQRME--- 703 (725) Q Consensus 640 q~q~q~~~a~~~~~--~~q-~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~-----~-----~~q~~~q~~e--- 703 (725) ..+....+.+.... ... .............+...+... ..+.....+....+ . .........+ T Consensus 65 ~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (437) T protein:vir:10 65 ASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKS-EAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTA 143 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhh Confidence 11111100000000 000 000000000000000000000 00000000000000 0 0000000000 Q ss_pred HHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 704 IANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 704 ~~~~~~~~~~~q~~~~~~~~~q 725 (725) ..+................... T Consensus 144 ~~~~~~~~e~~~~~~~~~~~~g 165 (437) T protein:vir:10 144 FADYLKTGEVRDVTGIALKDGK 165 (437) T ss_pred hHHHHHhhhhhhhhhccccccc Confidence 0000000000011111111111 No 146 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=78.57 E-value=0.11 Score=25.77 Aligned_cols=626 Identities=11% Similarity=0.038 Sum_probs=184.3 Q ss_pred CC--cHHHHHHHHHHHHHHHHhhh------HHHHHHHHHHHHhhcCCCCCHH---HHHHHhhcCCCcccchHHHHHHHHH Q lcl|NC_013059. 1 MA--DNKNRLESILSRFDADWTAS------DEARREAKNDLFFSRVSQWDDW---LSQYTTLQYRGQFDVVRPVVRKLVS 69 (725) Q Consensus 1 ma--d~~~~~~~~~~~~~~~~~~~------~~~r~~a~~d~~f~~G~QW~~~---~~~~l~~~grp~~N~i~~~v~~v~g 69 (725) |- ++++ .+....... .+.+..+..+.+ .-.-|... +..+. .|. .-.+.+..++- T Consensus 1 ~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~q~~~r~~a~~d~~fy--~G~----QW~~~~~~~l~ 65 (772) T protein:vir:10 1 MQITENDR-------QYLNGLPPAGDTPLTVDEYADINYEIE--DQPAWRAVADKEMDYA--DGN----QLDTELLRRQQ 65 (772) T ss_pred CCcchhhH-------HhhccCCcccccccCHHHHHHHHHHHh--ccHHHHHHHHHHHHhh--cCC----CCCHHHHHHHH Confidence 32 2211 111111100 111121211111 00112211 11111 122 22222222221 Q ss_pred -----------------HHhhCCcceEEec-CCcchHHHHHHHHHHHHHHHHhcChhHHHHHHHHHHHhcCcceEEEEee Q lcl|NC_013059. 70 -----------------EMRQNPIDVLYRP-KDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTD 131 (725) Q Consensus 70 -----------------~~~~nr~~~~~~p-r~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~ 131 (725) .-...-..-+..+ ..|.++..-+.+..++..+....-.......++.++..+++.+ ++. T Consensus 66 ~~g~p~~~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~---G~G 142 (772) T protein:vir:10 66 ALGIPPAVEDLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIAC---GIG 142 (772) T ss_pred hcCCCcEEEcchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhc---Cce Confidence 1111111111222 1244443446778888888888777777888888888888876 222 Q ss_pred eccCCCCCCceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhh-----------hcCCcchhhhh Q lcl|NC_013059. 132 YEDQSPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAE-----------KFDLDADDIPS 200 (725) Q Consensus 132 ~~~~~~~~~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p-----------~~~~~~~~~~~ 200 (725) |... ... .++...+...-..||++.-+|.+ |+ .|.++..-+|- .|+++...... T Consensus 143 w~e~-~~~------~d~~~~~i~i~~v~p~~v~~Dp~-a~-------~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~ 207 (772) T protein:vir:10 143 WVEV-SRE------SDPFKFPYRCRPIRRDEIHWDMK-CG-------DDWEACRFLRRQRWLSPDRIALVFPEHAELIGM 207 (772) T ss_pred eEEe-ccc------cCCCCCCeEEEeeCcccceecCC-CC-------CCHHHhhhhhhhccCCHHHHHHhCCCchhHHHh Confidence 3221 111 11211122122346777666643 21 13344333332 23333222222 Q ss_pred hhhcccccc--------cccCCCeEEEE----EEEEEec------ceeEEEEeeCccccc--eeecchhhhH--HHHHHH Q lcl|NC_013059. 201 FQNPNDWVF--------PWLTQDTIQIA----EFYEVVE------KKETAFIYQDPVTGE--PVSYFKRDIK--DVIDDL 258 (725) Q Consensus 201 ~~~~~~~~~--------~~~~~~~vrv~----E~w~~~~------~~~~~~~~~d~~~g~--~~~~~~~~~~--~~~~~~ 258 (725) ..+.....+ .|.+.+.+... ..|.++. .+.+|.++...+.-. ...++..+.. .....- T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~ 287 (772) T protein:vir:10 208 VGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNN 287 (772) T ss_pred hhhhcccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCccc Confidence 222221110 11111111111 1122111 011121111111000 0001111000 000000 Q ss_pred HhcchhhhhccceeEEEEEEEEeeccccccCCCCC-CCCccc--eEEEEeeeeccCCccccchhhhhhhhHHHHHHHHHH Q lcl|NC_013059. 259 ADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLI-AGEHIP--IVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMS 335 (725) Q Consensus 259 ~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~-p~~~~p--~vP~~g~~~~~d~~~~~~G~vr~~kd~Q~~~N~~~s 335 (725) ... ...+....+..++++.+.+-...++.+.-.+ ...-|| .+||++++.+++. ..|....++..=+..=. T Consensus 288 ~~~-~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~---~~g~~~G~vr~~kd~Qr--- 360 (772) T protein:vir:10 288 LAH-NIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFRED---ATGIPYGYVRGMKYAQD--- 360 (772) T ss_pred HHH-HHHHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEec---cCCcccchhhhhhhHHH--- Confidence 000 1223334444444444444445555333222 112244 6788887765543 36655543332222111 Q ss_pred HHHHHHHh-cCCcceeechhhcchHHHHHHhhccc-cccccccccccCccccccC------CcccCCCCchHHHHHHHHH Q lcl|NC_013059. 336 FNADIVAR-TPKKKPFFWPEQIAGFEHMYDGNDDY-PYYLLNRTDENNGEMPTQP------LAYYENPEVPQANAYMLEA 407 (725) Q Consensus 336 ~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~------~~~~~~~~~~~~~~~ll~~ 407 (725) .++. .++..+++..-++- ++..+... .....+.+..+++.+.-.+ ...+...+.|.-....++. T Consensus 361 ----~~N~~~S~~~~~l~~~~~~----~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 432 (772) T protein:vir:10 361 ----SLNSGVSKLRWGMSVARVE----RTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQM 432 (772) T ss_pred ----HHHHHHHHHHHHHhccccc----ccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHH Confidence 1111 11112222221110 11111110 0011122233344433322 1233333444455677788 Q ss_pred HHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeccC-CC Q lcl|NC_013059. 408 ATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVVITLED-GS 486 (725) Q Consensus 408 ~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d-~~ 486 (725) ....+..+.-++.......|..+++..=-+....-..+...+..-|.. +++.-+.+-+++..+. -+. +. T Consensus 433 lq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dn-l~~~~~~~g~~lL~li---------~~~y~~ 502 (772) T protein:vir:10 433 LQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDN-FRAGRTLVGELLLAMI---------VEDIGQ 502 (772) T ss_pred HHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH---------HHHcCC Confidence 777777776654333333333222211112111111111111111111 1222222222222222 222 23 Q ss_pred cceEEeccccccccC-Cceeeeccc-----cccc----eEEEEe---ccC-chhHHHHHHHHHHHHHHhcccccchHHHH Q lcl|NC_013059. 487 EKEVQLMAEVVDLAT-GERQVLNDI-----RGRY----ECYTDV---GPS-FQSMKQQNRAEILELLGKTPQGTPEYQLL 552 (725) Q Consensus 487 ~~~v~in~~~~d~~~-g~~~~~nDi-----~g~~----Dv~v~~---~p~-~~t~r~~~~~~l~ell~~~~~~~p~~~~~ 552 (725) ++.+.|..+ |+.+ ...+.+|.. +|.. ||++.. ..+ .++.-.+.-+.+..+++.+ T Consensus 503 er~~RI~~~--d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~---------- 570 (772) T protein:vir:10 503 ERTEVVIEG--DAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAV---------- 570 (772) T ss_pred CcEEEEecC--CCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHH---------- Confidence 445544322 2222 235666653 3543 444332 223 3443333333333333321 Q ss_pred HHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) Q Consensus 553 ~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k~qae~~kaqae~~k~ 632 (725) ..++ |.+...+- ..+.+....+..+.+.+..++.+.+..+++.+++.. ...+.++++++++... T Consensus 571 -----~~~~-P~~~~~~~--------~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~-q~~qq~~~~~~~el~~- 634 (772) T protein:vir:10 571 -----KSMP-PQYQAAVL--------PFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQID-QAVQDALAKAGNDIKL- 634 (772) T ss_pred -----hccC-hhHHHHHH--------HHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHH-HHHHHHHHHHHHHHHH- Confidence 1111 22111110 000111111112222222222222222222111111 0111111222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_013059. 633 QIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQ-THKQRMEIANILQSQ 711 (725) Q Consensus 633 q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~~~~~~a~~~aE~~~~~~~q-~~~q~~e~~~~~~~~ 711 (725) .+++++.++..++..++++++.......+..+++..... .++...+...++ ++.+-.+..+..... T Consensus 635 ----~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~---------~q~~q~a~~ad~~l~~~g~~~~~~~~~~ 701 (772) T protein:vir:10 635 ----RELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQI---------AQMPMIAPIADAVMQSAGYQRPNPAGDD 701 (772) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhH---------HhhhhhhHHHHHHHHhcccccccccccC Confidence 111222222222222222211111110111111111000 000000010111 000000000000000 Q ss_pred ------Hhc-----CCcccccCCCC Q lcl|NC_013059. 712 ------RQN-----QPSGSVAETPQ 725 (725) Q Consensus 712 ------~~~-----q~~~~~~~~~q 725 (725) .++ ++++-....|+ T Consensus 702 ~~~p~~~~~a~~~~~~~~~~~~~~~ 726 (772) T protein:vir:10 702 PNYPIADQTAAMNIRSPYIQGQGPA 726 (772) T ss_pred CCCCCCCCccCCCCCccCCCCCCCC Confidence 000 00000000011 No 147 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=55.20 E-value=0.49 Score=22.31 Aligned_cols=162 Identities=8% Similarity=0.021 Sum_probs=9.9 Q ss_pred hHHHHHHHhhccCCchhHHHHHHHHhhhhhhhhhhhccchhhhHHHHHHHHHHHhhHHHHHHHHHHHHHH----HHHHHH Q lcl|NC_013059. 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWFVEAQQAKQGQQDPAMVQAQGVLLQ----GQAELA 623 (725) Q Consensus 548 ~~~~~~~~~~~~~d~~~~~~i~e~~~kq~~~~~~~~~~~~e~~~~~~q~~q~qq~q~q~~~~~~qa~~~k----~qae~~ 623 (725) |-...+- ..++...+.+.+..................+. ....+......+....+.++.. .+..+. T Consensus 1 Mki~elk--------~el~~~~~el~~~~~elr~~~~~~~~~~~el~-~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e 71 (437) T protein:vir:10 1 MKIEKLK--------KDLATKTAELNTKKAEIRSFTESEDKTIDEVK-AGMTEIKEKEDEIKEIRSNIEVLEQASALKVE 71 (437) T ss_pred CCHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 01111111111100000000000000000000 0000000000000000000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_013059. 624 KAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQ---DRSEDARANAELLLKG-DEQTHK 699 (725) Q Consensus 624 kaqae~~k~q~ea~~~q~q~q~~~a~~~~~~~q~~~~~~~~~~e~~~~~~~~q~---~~~~~a~~~aE~~~~~-~~q~~~ 699 (725) +.+........+......... ..+..... ...........+....... ................ ...... T Consensus 72 ~~~~~~~~~~~e~~~~~~~~e--~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (437) T protein:vir:10 72 EKRDDSDLVAPELEENSADNE--EDDPEKLK----TETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFA 145 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHH----HHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhH Confidence 000000000000000000000 00000000 0000000000000000000 0000000000000000 000000 Q ss_pred HHHHHHHHHHHHHhcCCcccccCCCC Q lcl|NC_013059. 700 QRMEIANILQSQRQNQPSGSVAETPQ 725 (725) Q Consensus 700 q~~e~~~~~~~~~~~q~~~~~~~~~q 725 (725) ......+. .......++....-.|. T Consensus 146 ~~~~~~e~-~~~~~~~~~~~g~lvp~ 170 (437) T protein:vir:10 146 DYLKTGEV-RDVTGIALKDGKVIIPE 170 (437) T ss_pred HHHHhhhh-hhhhhcccccccccchH Confidence 00000000 00000001111111222 No 148 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=20.96 E-value=2.7 Score=18.24 Aligned_cols=467 Identities=11% Similarity=0.038 Sum_probs=163.4 Q ss_pred CCcHH-HHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhc-CC----CCCHHHHHHHhh-cCCCc-ccchHHHHHHHHHHHh Q lcl|NC_013059. 1 MADNK-NRLESILSRFDADWTASDEARREAKNDLFFSR-VS----QWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMR 72 (725) Q Consensus 1 mad~~-~~~~~~~~~~~~~~~~~~~~r~~a~~d~~f~~-G~----QW~~~~~~~l~~-~grp~-~N~i~~~v~~v~g~~~ 72 (725) |+|.. +-...-+-.|.+....|.-.|.-..-...+-. |. +|+.+.....+. ..|-+ +|..+.+|+.++|.-- T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 99863 22222222222222222222211110111111 11 444433332222 23443 5999999999999776 Q ss_pred hCCcceEEecCCcchHHHHHHHH-HHHHHH-HHhcChhHHHHHHHHHHHhcCcceEEEEeeeccCCCCC----------- Q lcl|NC_013059. 73 QNPIDVLYRPKDGASPDAADVLM-GMYRTD-MRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTS----------- 139 (725) Q Consensus 73 ~nr~~~~~~pr~~~d~~~Ae~l~-~~~~~~-~~~~~~~~~~s~a~~~~~~~G~G~~~v~~~~~~~~~~~----------- 139 (725) ...|. +. .++...+. .++..+ .+-++.+.-...+|..++..|.+|+=| ||-...+.+ T Consensus 81 ~k~p~--~~------~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV--D~P~~~~~~~~~~~T~Ade~ 150 (513) T protein:vir:97 81 SEPIK--LN------EDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLI--DMPRPAPREDGQPRTLADDR 150 (513) T ss_pred hcCcc--cC------cCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE--ecCCCCCccchhHHhHHHHH Confidence 64332 21 11222222 344333 467789999999999999999998654 543222111 Q ss_pred ----CceeEEEEeeecchhheeeCCCccccChhcccceeeeecCCHHHHHHHhhhcCCcchhhhhhhhcccccccccCCC Q lcl|NC_013059. 140 ----NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKFDLDADDIPSFQNPNDWVFPWLTQD 215 (725) Q Consensus 140 ----~~~~ir~~~~~~~~~~v~~Dp~a~~~d~sDa~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (725) .|+..- ..+.+| +|++....+.. ..+...++.. .+-+ .++ +...... T Consensus 151 ~~~~rPy~~~-----~~~e~I-inW~~~~v~G~--~~L~~v~l~E------~~~~--------------~Dg-f~~~~~~ 201 (513) T protein:vir:97 151 REGLRPYWVM-----IKPECL-LFARSEVINGV--EVLQHVRIIE------HYME--------------QDG-FAEVCKR 201 (513) T ss_pred hhccCceEEE-----ecHhhh-cCcceeccCcc--eeeeeEEEEE------EEee--------------cCC-CcceEEE Confidence 022211 122222 24443333321 1111111110 0000 000 0000001 Q ss_pred eEEEEEEEEEecceeEEEEeeCccccceeecchhhhHHHHHHHHhcchhhhhccceeEEEEEEEEeeccccccCCCCCCC Q lcl|NC_013059. 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAG 295 (725) Q Consensus 216 ~vrv~E~w~~~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~g~~~l~~~~~~p~ 295 (725) .+|| |. +....++....+ |.. . .-.|.. .+...-+- T Consensus 202 q~rv---L~--~g~~~v~r~~~~--~~~---------------------~---------~~e~~~-------~~~g~~~l 237 (513) T protein:vir:97 202 RIRV---LE--PGLVQLWEPVKK--SNA---------------------Q---------KEEWAL-------ADEWATGL 237 (513) T ss_pred EEEE---Ee--CceEEEEEeecC--CCc---------------------c---------ccceEE-------ecCCCCcC Confidence 1111 10 000111110000 000 0 000111 11111123 Q ss_pred CccceEEEEeeeeccC-CccccchhhhhhhhHHHHHHHHHHHHHHHHHhcCCcceeechhhcchHHHHHHhhcccccccc Q lcl|NC_013059. 296 EHIPIVPVFGEWGFVE-DKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) Q Consensus 296 ~~~p~vP~~g~~~~~d-~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) +.+|||||+..+.... ++...+ ++-....-+=...|-..+++.... .+..+..|..+ .+ ..+..++ T Consensus 238 ~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~hy~~~Sd~~~il~~~~-~P~l~~~G~~~----~~----~~~i~iG 304 (513) T protein:vir:97 238 NYVPLVTFYADRQGFMMGKPPLL----DLAHLNVAHWQSASDQRHILTVSR-FPILACSGASG----ED----SDPVVVG 304 (513) T ss_pred CceeEEEEecCCCCCCCCccchH----HHHHHHHHHHhhhhhHHHHHHhcc-cceeeeecCCc----CC----CCceEee Confidence 5788887765432111 111112 222222111111222223333332 23333222111 11 0011111 Q ss_pred -ccccccCccccccCCcccCCCCch-HHHHHHHHHHHHHHHHHhCCChHHhccCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013059. 375 -NRTDENNGEMPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDN 452 (725) Q Consensus 375 -~~~~~~~g~~~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGv~~~~~G~~~n~~Sg~ai~~~q~q~~~~~~~~~dn 452 (725) +.....++ ++..+.++++..-+ .....-|....+.| ...|. ..+...+.+.|+.+......+....|..+..| T Consensus 305 ~~~~~~lpe--~~~~~~yie~~g~~i~~~~~~l~~le~qm-~~~Ga--~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~ 379 (513) T protein:vir:97 305 PNKVLYNPD--PAGRFYYVEHTGQAIAAGRTDLKDLEEQM-AGYGA--EFLKRKTGGQTATARALDSAEATSDLSAMTGL 379 (513) T ss_pred ccccccCCC--CCCcceeeccCchhHHHHHHHHHHHHHHH-HHHHH--HhhccCCccccHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 12345666654322 22333444444555 34453 23443343467888877777777777888888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEeccCCCcceEEeccccccccCCceeeeccccccceEEEEeccCchhHHHHHH Q lcl|NC_013059. 453 LATAMRRDGEIYQSIVNDIYDVPRNVVITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) Q Consensus 453 ~~~~~~~~g~~ll~li~~~y~~~r~irI~~~d~~~~~v~in~~~~d~~~g~~~~~nDi~g~~Dv~v~~~p~~~t~r~~~~ 532 (725) +..+. +.+|.++..|...+ ...--|.||+ .|+...- ..+.+ T Consensus 380 le~al----~~~l~~~a~wlg~~---------~~~~~v~in~------------------dF~~~~~--------~~~~~ 420 (513) T protein:vir:97 380 FEDAL----AQALDITADWLRLG---------PNGGTVELVK------------------DYDLEEM--------DAPGL 420 (513) T ss_pred HHHHH----HHHHHHHHHHhCCC---------CCccEEEecc------------------ccCcccC--------CHHHH Confidence 77775 56667777776522 1111234432 2322110 11223 Q ss_pred HHHHHHHHhcccccchHHHHHHH---hh-ccCCch-hHHHHHHHHhhhhhhhhhh----hccch-------hhhHHHHHH Q lcl|NC_013059. 533 AEILELLGKTPQGTPEYQLLLLQ---YF-TLLDGK-GVEMMRDYANKQLIQMGVK----KPETP-------EEQQWFVEA 596 (725) Q Consensus 533 ~~l~ell~~~~~~~p~~~~~~~~---~~-~~~d~~-~~~~i~e~~~kq~~~~~~~----~~~~~-------e~~~~~~q~ 596 (725) ++|.+++.+ +..........+. .+ +..|.. ..+++.+++.........- ...++ +-...-.+- T Consensus 421 ~al~~a~~~-G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) T protein:vir:97 421 QALQVAREK-RDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEG 499 (513) T ss_pred HHHHHHHhC-CCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCC Confidence 344443321 1111111111111 11 111211 2244444443322110000 00000 000000000 Q ss_pred HH--HHHhhHHHHH Q lcl|NC_013059. 597 QQ--AKQGQQDPAM 608 (725) Q Consensus 597 ~q--~qq~q~q~~~ 608 (725) +. ..-.-.--+. T Consensus 500 ~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 500 GEGGEGGGNPGGES 513 (513) T ss_pred CCccccCCCCCCCC Confidence 00 0000000000 Done!