Query lcl|NC_013692.1_cdsid_YP_003358477.1 [gene=PP-LIT1_gp80] [protein=N4 gp59-like protein] [protein_id=YP_003358477.1] [location=complement(63464..65644)] Match_columns 726 No_of_seqs 277 out of 585 Neff 9.1 Searched_HMMs 1612 Date Thu Nov 7 13:57:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_80 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_80_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95821 Length: 763 100.0 7E-169 4E-172 942.6 72.1 700 11-726 1-709 (763) 2 protein:vir:8846 Length: 705 # 100.0 5E-128 3E-131 718.4 70.7 665 17-726 1-704 (705) 3 protein:vir:80165 Length: 651 100.0 2E-95 1.2E-98 539.8 59.0 618 11-714 1-651 (651) 4 protein:vir:93630 Length: 776 100.0 1.1E-94 6.6E-98 535.8 51.4 629 1-726 1-711 (776) 5 protein:vir:108295 Length: 711 100.0 3.5E-88 2.2E-91 500.0 61.0 615 1-726 1-711 (711) 6 protein:vir:3296 Length: 714 # 100.0 2.9E-85 1.8E-88 484.0 60.3 623 11-724 1-714 (714) 7 protein:vir:10117 Length: 714 100.0 2.9E-85 1.8E-88 484.0 60.3 623 11-724 1-714 (714) 8 protein:vir:817 Length: 714 # 100.0 2.9E-85 1.8E-88 484.0 60.3 623 11-724 1-714 (714) 9 protein:vir:9950 Length: 714 # 100.0 2.9E-85 1.8E-88 484.0 60.3 623 11-724 1-714 (714) 10 protein:vir:2764 Length: 714 # 100.0 2.9E-85 1.8E-88 484.0 60.3 623 11-724 1-714 (714) 11 protein:vir:104437 Length: 714 100.0 3.9E-83 2.4E-86 472.4 59.5 621 1-726 1-708 (714) 12 protein:vir:105619 Length: 772 100.0 4.5E-83 2.8E-86 472.0 53.3 613 1-726 1-700 (772) 13 protein:vir:77597 Length: 725 100.0 4E-77 2.5E-80 439.4 57.9 608 1-726 1-712 (725) 14 protein:vir:9263 Length: 725 # 100.0 3.5E-76 2.2E-79 434.2 56.1 606 11-726 1-712 (725) 15 protein:vir:100920 Length: 725 100.0 7.8E-76 4.8E-79 432.4 57.2 606 1-726 1-712 (725) 16 protein:vir:105520 Length: 706 100.0 6E-74 3.7E-77 422.0 58.0 611 23-726 1-693 (706) 17 protein:vir:3520 Length: 720 # 100.0 3E-73 1.9E-76 418.2 57.3 606 26-726 1-713 (720) 18 protein:vir:95449 Length: 584 100.0 2.2E-74 1.3E-77 424.4 42.7 554 1-652 1-584 (584) 19 protein:vir:172 Length: 708 # 100.0 2.3E-71 1.4E-74 407.8 57.7 608 1-726 1-702 (708) 20 protein:vir:94599 Length: 641 100.0 2.2E-73 1.4E-76 418.9 46.1 606 9-712 1-641 (641) 21 protein:vir:105429 Length: 708 100.0 6E-71 3.7E-74 405.6 57.7 604 1-726 1-698 (708) 22 protein:vir:3139 Length: 599 # 100.0 1.2E-68 7.7E-72 392.9 37.1 561 11-669 1-599 (599) 23 protein:vir:345 Length: 663 # 100.0 1.5E-38 9.3E-42 227.9 44.6 609 1-726 1-659 (663) 24 protein:vir:7321 Length: 556 # 99.9 1.5E-24 9.6E-28 151.1 43.6 514 27-689 1-556 (556) 25 protein:vir:95315 Length: 559 99.9 2.6E-24 1.6E-27 149.9 42.4 523 27-704 1-559 (559) 26 protein:vir:102668 Length: 547 99.9 1.6E-23 1E-26 145.6 42.3 502 30-676 1-547 (547) 27 protein:vir:107822 Length: 555 99.9 7.6E-23 4.7E-26 141.9 42.5 520 26-695 1-555 (555) 28 protein:vir:107404 Length: 555 99.9 7.6E-23 4.7E-26 141.9 42.5 520 26-695 1-555 (555) 29 protein:vir:98506 Length: 555 99.9 7.6E-23 4.7E-26 141.9 42.5 520 26-695 1-555 (555) 30 protein:vir:3361 Length: 535 # 99.9 3.6E-22 2.2E-25 138.2 44.1 506 11-680 1-535 (535) 31 protein:vir:103765 Length: 549 99.9 1.2E-21 7.4E-25 135.3 46.5 510 11-675 1-549 (549) 32 protein:vir:1538 Length: 535 # 99.9 8.5E-22 5.3E-25 136.1 43.9 506 11-680 1-535 (535) 33 protein:vir:1785 Length: 555 # 99.9 1.2E-21 7.4E-25 135.3 44.2 520 31-701 1-555 (555) 34 protein:vir:2198 Length: 536 # 99.9 5.5E-21 3.4E-24 131.6 45.1 509 26-686 1-536 (536) 35 protein:vir:10447 Length: 536 99.9 4.9E-21 3E-24 131.9 44.8 509 26-686 1-536 (536) 36 protein:vir:94709 Length: 522 99.9 5.4E-21 3.3E-24 131.7 40.2 494 11-685 1-522 (522) 37 protein:vir:99672 Length: 532 99.9 1.6E-19 9.8E-23 123.6 44.5 503 16-689 1-532 (532) 38 protein:vir:100039 Length: 522 99.9 9.7E-20 6E-23 124.8 41.8 493 33-689 1-522 (522) 39 protein:vir:8883 Length: 543 # 99.9 3.2E-19 2E-22 122.0 43.4 516 11-689 1-543 (543) 40 protein:vir:94572 Length: 535 99.9 1.4E-18 8.7E-22 118.5 44.7 501 26-680 1-535 (535) 41 protein:vir:103330 Length: 517 99.9 6.6E-19 4.1E-22 120.3 42.7 488 14-674 1-517 (517) 42 protein:vir:78942 Length: 510 99.8 1.4E-18 8.5E-22 118.5 43.2 482 31-674 1-510 (510) 43 protein:vir:80211 Length: 514 99.8 3.5E-18 2.2E-21 116.2 45.4 482 35-670 1-514 (514) 44 protein:vir:6322 Length: 510 # 99.8 1.6E-18 9.8E-22 118.2 42.9 480 31-674 1-510 (510) 45 protein:vir:96988 Length: 516 99.8 3E-18 1.9E-21 116.6 39.8 488 11-670 1-516 (516) 46 protein:vir:7017 Length: 515 # 99.8 1.3E-17 8.2E-21 113.1 43.0 492 1-675 1-515 (515) 47 protein:vir:78696 Length: 542 99.8 3.6E-18 2.3E-21 116.2 36.4 500 32-694 1-542 (542) 48 protein:vir:105641 Length: 516 99.8 5E-17 3.1E-20 109.9 41.4 487 11-675 1-516 (516) 49 protein:vir:103385 Length: 666 99.8 1.5E-19 9.1E-23 123.8 20.5 590 8-688 1-666 (666) 50 protein:vir:96403 Length: 666 99.7 3.8E-19 2.4E-22 121.6 20.6 590 8-688 1-666 (666) 51 protein:vir:3964 Length: 453 # 99.7 6.9E-15 4.3E-18 98.2 34.1 445 1-661 1-453 (453) 52 protein:vir:3609 Length: 452 # 99.7 9.7E-15 6E-18 97.4 34.1 444 14-661 1-452 (452) 53 protein:vir:733 Length: 453 # 99.6 1.6E-13 1E-16 90.7 37.9 440 14-668 1-453 (453) 54 protein:vir:9871 Length: 429 # 99.6 7.1E-14 4.4E-17 92.7 34.6 421 27-661 1-429 (429) 55 protein:vir:93747 Length: 472 99.6 5.8E-14 3.6E-17 93.2 31.4 456 11-655 1-472 (472) 56 protein:vir:102950 Length: 471 99.6 5.1E-13 3.1E-16 88.0 34.5 434 30-638 1-471 (471) 57 protein:vir:95806 Length: 440 99.5 7E-13 4.3E-16 87.2 34.4 422 38-664 1-440 (440) 58 protein:vir:96179 Length: 468 99.5 9.3E-13 5.8E-16 86.5 34.1 452 1-658 1-468 (468) 59 protein:vir:1587 Length: 508 # 99.5 1.2E-12 7.2E-16 86.0 34.6 464 14-634 1-508 (508) 60 protein:vir:105292 Length: 478 99.5 9.5E-13 5.9E-16 86.5 33.6 458 1-645 1-478 (478) 61 protein:vir:99522 Length: 470 99.5 2E-12 1.3E-15 84.7 34.9 451 1-662 1-470 (470) 62 protein:vir:80680 Length: 441 99.5 3.8E-12 2.4E-15 83.2 36.6 426 27-668 1-441 (441) 63 protein:vir:106639 Length: 481 99.5 1.3E-12 8.2E-16 85.7 32.5 460 1-651 6-481 (481) 64 protein:vir:79703 Length: 505 99.5 1.7E-12 1E-15 85.1 33.0 460 14-638 1-505 (505) 65 protein:vir:38 Length: 496 # N 99.5 1.2E-12 7.2E-16 86.0 32.1 466 11-629 1-496 (496) 66 protein:vir:80959 Length: 499 99.5 4.7E-13 2.9E-16 88.2 29.7 463 1-629 1-499 (499) 67 protein:vir:9751 Length: 422 # 99.5 9.3E-13 5.7E-16 86.6 31.1 408 27-635 1-422 (422) 68 protein:vir:105461 Length: 470 99.5 3.4E-12 2.1E-15 83.5 34.1 440 30-662 1-470 (470) 69 protein:vir:2732 Length: 501 # 99.5 1E-12 6.2E-16 86.4 31.1 471 1-661 8-501 (501) 70 protein:vir:102330 Length: 451 99.5 6.3E-12 3.9E-15 82.0 36.1 432 30-655 1-451 (451) 71 protein:vir:3028 Length: 500 # 99.5 9.7E-13 6E-16 86.4 29.6 468 14-634 1-500 (500) 72 protein:vir:9815 Length: 500 # 99.5 9.7E-13 6E-16 86.4 29.6 468 14-634 1-500 (500) 73 protein:vir:98883 Length: 517 99.5 8.5E-13 5.3E-16 86.8 29.2 489 14-638 1-517 (517) 74 protein:vir:96839 Length: 474 99.5 6.6E-12 4.1E-15 81.9 34.1 457 1-657 1-474 (474) 75 protein:vir:96494 Length: 501 99.5 4.7E-12 2.9E-15 82.7 33.0 472 1-662 8-501 (501) 76 protein:vir:1236 Length: 483 # 99.5 2.1E-12 1.3E-15 84.6 30.6 462 1-661 1-483 (483) 77 protein:vir:9922 Length: 489 # 99.5 1.2E-11 7.7E-15 80.4 35.7 442 3-630 1-489 (489) 78 protein:vir:107112 Length: 478 99.4 5.2E-12 3.2E-15 82.4 32.0 460 1-669 1-478 (478) 79 protein:vir:96240 Length: 511 99.4 4.2E-12 2.6E-15 82.9 31.3 480 1-676 1-511 (511) 80 protein:vir:94742 Length: 409 99.4 3.9E-12 2.4E-15 83.1 31.0 395 27-608 1-409 (409) 81 protein:vir:105889 Length: 474 99.4 1.5E-11 9.1E-15 80.0 33.7 450 14-645 1-474 (474) 82 protein:vir:94101 Length: 474 99.4 1.5E-11 9.1E-15 80.0 33.7 450 14-645 1-474 (474) 83 protein:vir:2341 Length: 488 # 99.4 5.6E-13 3.5E-16 87.8 25.6 466 11-665 1-488 (488) 84 protein:vir:97336 Length: 492 99.4 6.9E-12 4.3E-15 81.8 31.3 451 1-661 25-492 (492) 85 protein:vir:2427 Length: 485 # 99.4 5.8E-13 3.6E-16 87.7 23.9 462 19-666 1-485 (485) 86 protein:vir:4898 Length: 502 # 99.4 1E-11 6.3E-15 80.9 30.2 474 1-660 1-502 (502) 87 protein:vir:103951 Length: 511 99.4 1.4E-11 8.6E-15 80.1 31.0 477 1-676 1-511 (511) 88 protein:vir:94805 Length: 492 99.4 4.8E-11 3E-14 77.2 32.9 458 1-661 17-492 (492) 89 protein:vir:94498 Length: 474 99.4 5.2E-11 3.2E-14 77.0 35.1 459 1-662 1-474 (474) 90 protein:vir:97447 Length: 474 99.4 5.2E-11 3.2E-14 77.0 35.1 459 1-662 1-474 (474) 91 protein:vir:1634 Length: 409 # 99.4 3E-11 1.8E-14 78.3 31.4 395 27-608 1-409 (409) 92 protein:vir:9568 Length: 410 # 99.4 1.2E-11 7.7E-15 80.4 29.3 393 43-645 1-410 (410) 93 protein:vir:99781 Length: 511 99.4 1.3E-11 7.8E-15 80.3 29.2 481 1-676 1-511 (511) 94 protein:vir:9306 Length: 511 # 99.4 2.3E-11 1.4E-14 78.9 30.6 479 1-676 1-511 (511) 95 protein:vir:97171 Length: 512 99.4 2.3E-11 1.5E-14 78.9 30.4 479 1-676 1-512 (512) 96 protein:vir:78805 Length: 511 99.4 1.5E-11 9.4E-15 79.9 29.3 480 1-676 1-511 (511) 97 protein:vir:96366 Length: 511 99.4 1.5E-11 9.4E-15 79.9 29.3 480 1-676 1-511 (511) 98 protein:vir:95113 Length: 474 99.4 6.3E-11 3.9E-14 76.5 34.1 457 1-664 1-474 (474) 99 protein:vir:78907 Length: 518 99.4 4E-11 2.5E-14 77.6 31.1 477 14-633 1-518 (518) 100 protein:vir:7768 Length: 484 # 99.3 2.6E-12 1.6E-15 84.1 24.3 459 16-666 1-484 (484) 101 protein:vir:106571 Length: 499 99.3 1.5E-11 9.4E-15 79.9 28.2 473 1-659 1-499 (499) 102 protein:vir:79043 Length: 479 99.3 8.8E-11 5.4E-14 75.7 35.3 450 1-656 1-479 (479) 103 protein:vir:96266 Length: 474 99.3 1.1E-10 6.9E-14 75.2 32.0 447 16-669 1-474 (474) 104 protein:vir:95899 Length: 474 99.3 1.1E-10 6.9E-14 75.2 32.0 447 16-669 1-474 (474) 105 protein:vir:3520 Length: 720 # 99.3 1.5E-11 9.4E-15 79.9 26.6 577 82-726 1-702 (720) 106 protein:vir:104082 Length: 485 99.3 4.3E-11 2.7E-14 77.4 28.9 455 16-645 1-485 (485) 107 protein:vir:4223 Length: 486 # 99.3 6.3E-12 3.9E-15 82.0 23.3 465 16-667 1-486 (486) 108 protein:vir:78227 Length: 480 99.2 1.5E-11 9.6E-15 79.9 23.0 452 32-666 1-480 (480) 109 protein:vir:99072 Length: 479 99.2 2.9E-10 1.8E-13 72.9 28.9 451 1-658 1-479 (479) 110 protein:vir:78537 Length: 480 99.2 1.2E-10 7.5E-14 75.0 26.5 453 32-678 1-480 (480) 111 protein:vir:4782 Length: 522 # 99.2 6.9E-10 4.3E-13 70.8 32.2 484 1-633 1-522 (522) 112 protein:vir:5961 Length: 503 # 99.2 7.1E-10 4.4E-13 70.7 36.3 475 1-664 1-503 (503) 113 protein:vir:7430 Length: 563 # 99.2 7.4E-11 4.6E-14 76.1 24.3 501 11-632 1-563 (563) 114 protein:vir:94546 Length: 506 99.2 8.1E-10 5E-13 70.4 31.1 461 1-673 1-506 (506) 115 protein:vir:105429 Length: 708 99.1 1.9E-09 1.2E-12 68.4 29.6 582 82-726 1-694 (708) 116 protein:vir:2500 Length: 501 # 99.1 5.3E-10 3.3E-13 71.4 25.7 472 1-644 1-501 (501) 117 protein:vir:105520 Length: 706 99.1 2E-09 1.2E-12 68.3 30.7 590 78-726 1-690 (706) 118 protein:vir:93630 Length: 776 99.1 1.6E-10 9.7E-14 74.4 22.4 613 11-726 1-706 (776) 119 protein:vir:99916 Length: 504 99.1 2.8E-09 1.8E-12 67.4 29.2 459 11-644 1-504 (504) 120 protein:vir:101494 Length: 527 99.0 5.2E-09 3.2E-12 66.0 32.5 495 32-668 1-527 (527) 121 protein:vir:102239 Length: 527 99.0 5.6E-09 3.5E-12 65.8 32.6 495 32-668 1-527 (527) 122 protein:vir:817 Length: 714 # 98.9 1.4E-09 8.7E-13 69.1 20.7 600 64-726 1-708 (714) 123 protein:vir:9950 Length: 714 # 98.9 1.4E-09 8.7E-13 69.1 20.7 600 64-726 1-708 (714) 124 protein:vir:3296 Length: 714 # 98.9 1.4E-09 8.7E-13 69.1 20.7 600 64-726 1-708 (714) 125 protein:vir:10117 Length: 714 98.9 1.4E-09 8.7E-13 69.1 20.7 600 64-726 1-708 (714) 126 protein:vir:2764 Length: 714 # 98.9 1.4E-09 8.7E-13 69.1 20.7 600 64-726 1-708 (714) 127 protein:vir:8184 Length: 474 # 98.9 2.3E-08 1.5E-11 62.4 34.2 440 16-662 1-474 (474) 128 protein:vir:104437 Length: 714 98.9 2.5E-08 1.5E-11 62.3 28.7 592 64-726 1-703 (714) 129 protein:vir:102602 Length: 456 98.8 4.6E-08 2.9E-11 60.8 29.6 433 23-657 1-456 (456) 130 protein:vir:105819 Length: 456 98.8 4.6E-08 2.9E-11 60.8 29.6 433 23-657 1-456 (456) 131 protein:vir:98444 Length: 434 98.8 1.5E-08 9.3E-12 63.5 22.2 416 60-656 1-434 (434) 132 protein:vir:100920 Length: 725 98.7 1.1E-07 6.9E-11 58.7 34.9 591 39-726 1-701 (725) 133 protein:vir:7987 Length: 456 # 98.7 1.3E-07 8.3E-11 58.3 31.1 431 23-622 1-456 (456) 134 protein:vir:9263 Length: 725 # 98.5 4E-07 2.5E-10 55.7 31.3 585 39-726 1-701 (725) 135 protein:vir:78083 Length: 537 98.5 5E-07 3.1E-10 55.1 33.4 482 23-665 1-537 (537) 136 protein:vir:77597 Length: 725 98.4 7.3E-07 4.5E-10 54.2 32.3 592 39-726 1-701 (725) 137 protein:vir:172 Length: 708 # 98.1 3.7E-06 2.3E-09 50.4 31.2 588 82-726 1-695 (708) 138 protein:vir:8846 Length: 705 # 98.1 5.4E-06 3.4E-09 49.4 32.9 595 26-726 1-692 (705) 139 protein:vir:108295 Length: 711 98.0 7.4E-06 4.6E-09 48.7 32.8 583 41-726 1-708 (711) 140 protein:vir:105619 Length: 772 97.3 9E-05 5.6E-08 42.8 30.9 580 80-726 1-693 (772) 141 protein:vir:80128 Length: 466 89.7 0.025 1.6E-05 29.4 13.0 131 593-726 1-138 (466) 142 protein:vir:94956 Length: 452 81.1 0.089 5.5E-05 26.4 28.0 426 11-630 1-452 (452) 143 protein:vir:1084 Length: 437 # 78.9 0.11 6.8E-05 25.8 18.1 147 570-726 1-155 (437) 144 protein:vir:9704 Length: 394 # 57.9 0.42 0.00026 22.6 9.6 99 625-726 1-109 (394) 145 protein:vir:95376 Length: 425 48.4 0.67 0.00042 21.5 13.3 127 593-726 1-138 (425) 146 protein:vir:80128 Length: 466 44.3 0.81 0.0005 21.1 15.5 141 564-726 1-152 (466) 147 protein:vir:104256 Length: 458 43.2 0.85 0.00053 21.0 15.5 128 588-726 1-146 (458) 148 protein:vir:6212 Length: 434 # 40.0 0.99 0.00061 20.6 7.9 96 624-726 1-99 (434) 149 protein:vir:1084 Length: 437 # 33.9 1.3 0.00082 19.9 20.1 120 602-726 1-145 (437) 150 protein:vir:98339 Length: 415 27.7 1.8 0.0011 19.2 9.6 98 625-726 1-107 (415) 151 protein:vir:79987 Length: 415 27.7 1.8 0.0011 19.2 9.6 98 625-726 1-107 (415) 152 protein:vir:81100 Length: 415 27.7 1.8 0.0011 19.2 9.6 98 625-726 1-107 (415) 153 protein:vir:9410 Length: 415 # 27.4 1.8 0.0011 19.1 9.6 98 621-726 1-107 (415) 154 protein:vir:4600 Length: 415 # 25.6 2 0.0013 18.9 9.4 96 625-726 1-110 (415) 155 protein:vir:4700 Length: 415 # 25.6 2 0.0013 18.9 9.4 96 625-726 1-110 (415) 156 protein:vir:8420 Length: 477 # 21.6 2.6 0.0016 18.3 11.4 107 607-726 1-123 (477) 157 protein:vir:95821 Length: 763 21.4 2.6 0.0016 18.3 34.6 635 1-726 1-715 (763) 158 protein:vir:78393 Length: 489 21.4 2.6 0.0016 18.3 25.4 449 64-630 1-489 (489) 159 protein:vir:104256 Length: 458 21.2 2.6 0.0016 18.3 16.4 145 575-726 1-157 (458) 160 protein:vir:962 Length: 397 # 21.2 2.6 0.0016 18.3 16.9 126 571-726 1-157 (397) 161 protein:vir:1433 Length: 435 # 20.5 2.7 0.0017 18.2 5.2 117 602-726 1-134 (435) 162 protein:vir:93881 Length: 387 20.4 2.8 0.0017 18.2 8.6 95 625-726 1-106 (387) No 1 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=6.6e-169 Score=942.61 Aligned_cols=700 Identities=46% Similarity=0.776 Sum_probs=603.5 Q ss_pred CCCCCCcc-------chhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHH Q lcl|NC_013692. 11 LPNEDGDP-------SKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIR 83 (726) Q Consensus 11 ~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~ 83 (726) |...++.+ ++.+||+|||+++|++|+.+|+.|+++++++++++.+|++||++++++++|+++|||+|||++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~ 80 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVR 80 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHH Confidence 44444433 45558999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ++|||++|+|+++|||+++||+|.|++++|+++|+|+|+||||+|+++|+||+++++||++||++|+||||+||++++++ T Consensus 81 ~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~ 160 (763) T protein:vir:95 81 RQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRK 160 (763) T ss_pred HHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchh-hhHHHHHHhhhhhhhcccceeeccccceeecccceeecccee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYP-EIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTV 242 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i 242 (726) +++.+++++.+++...+.........++..+++.... ..+..+..+.......|.++..+..+....+..++++++|+| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~i 240 (763) T protein:vir:95 161 EKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTV 240 (763) T ss_pred eeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEE Confidence 9999999999999998888877788887776665543 244456666777788899999999998888888999999999 Q ss_pred eeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCC-cchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 243 QVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRY-QNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 243 ~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~-~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) ++|+|++|||||+|++|++||+||||++++|++||+++|++ ++++.+.+........ ..........+++.+.+.++ T Consensus 241 e~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~d~~~~~ 318 (763) T protein:vir:95 241 EMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNE--PDHATTTPQEFQISDPMRKR 318 (763) T ss_pred EeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhcccccc--ccccccchhhccCCCcccce Confidence 99999999999999889999999999999999999999764 4566664433222111 12223345566777888899 Q ss_pred EEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 322 LVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGM 401 (726) Q Consensus 322 v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~ 401 (726) |+|+|||++++++|||++++|+++|+|+++|+++++||+|++|||++++++|++|++||+|+++.++|+|+++|+++|++ T Consensus 319 V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~ 398 (763) T protein:vir:95 319 VVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGM 398 (763) T ss_pred EEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhc Q lcl|NC_013692. 402 IDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAG 481 (726) Q Consensus 402 ~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G 481 (726) +|+++++++|+|++++|++++.|..+++||+++++++|.++..++.+..+|++++.++.++++++..++++|||+++++| T Consensus 399 ~d~l~~~~~~~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G 478 (763) T protein:vir:95 399 IDLLGRSANGQRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGG 478 (763) T ss_pred HHHHHhhcCCcEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeee Q lcl|NC_013692. 482 ISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLD 561 (726) Q Consensus 482 ~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~ 561 (726) +++++++.|+++++++++++++++..+++||+++++.+|+++++||++||+++++|||+|++|++++++++.++|||++. T Consensus 479 ~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~ 558 (763) T protein:vir:95 479 VTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVD 558 (763) T ss_pred cCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 562 ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAER 641 (726) Q Consensus 562 ~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~ 641 (726) ++.++.++++.+++..+++.+++.+++.....++..++++..+.++.+.++..++++++.++++.++++.+++++++..+ T Consensus 559 ~~~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~ 638 (763) T protein:vir:95 559 ISTAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELR 638 (763) T ss_pred cccchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHH Confidence 99988888888888888888888888888888888888888888777777776665555544444333333322222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 642 ARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRLDEATSA 721 (726) Q Consensus 642 aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~~~~~~a 721 (726) ++ ++..++++..+.++.++..+++..+++++++.+++++.+.+++++++++++++++++.+++... T Consensus 639 ak--------------aq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~ 704 (763) T protein:vir:95 639 SK--------------IRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELP 704 (763) T ss_pred HH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 11 1222233333344445555666666777777778888888888888899999888887777666 Q ss_pred HHhcC Q lcl|NC_013692. 722 RTSQK 726 (726) Q Consensus 722 ~~~~q 726 (726) .++.. T Consensus 705 ~~~~~ 709 (763) T protein:vir:95 705 PNLSA 709 (763) T ss_pred hhHHH Confidence 55542 No 2 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=5.2e-128 Score=718.38 Aligned_cols=665 Identities=16% Similarity=0.205 Sum_probs=459.1 Q ss_pred ccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHH Q lcl|NC_013692. 17 DPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSE 95 (726) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~ 95 (726) -++.++....+++..++.|...+.+|++++++.++ ++.+|++||+|++.+ +..+|||+||+++|+++|||++|+|++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~--~~~~~~s~~~~~~v~~~v~~~~~~l~~ 78 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFG--NERPGKSGIVSRDVQETVDWIMPSLMK 78 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCC--cccCCCCccccHHHHHHHHHHHHHHHH Confidence 23334457788999999999999999999999997 689999999998864 567899999999999999999999999 Q ss_pred hhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 96 PFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 96 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) +||+|+++|.|.|++++|+++|++.|.||||+|+++|+|++++++||++||++|+||+|+||+.+++. ..++|++++ T Consensus 79 ~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~---~~e~~~~~~ 155 (705) T protein:vir:88 79 VFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKP---TFERFSGLS 155 (705) T ss_pred hhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccch---hhhhhccCC Confidence 99999999999999999999999999999999999999999999999999999999999999855443 444677777 Q ss_pred cchHHHHHHhhh-hhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCC Q lcl|NC_013692. 176 DSSEELAQIYQT-AAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDP 254 (726) Q Consensus 176 ~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp 254 (726) +... +.+..+ ..++..+ ...|.+...+.++. ....+++.+++|||++|+||| T Consensus 156 ~~~l--~~~~~d~~~~~~~~-------------------~~~~~~~~~~~~~~------~~~~~~i~i~~V~p~d~~~dp 208 (705) T protein:vir:88 156 EDMV--ADILSDPDTSILAQ-------------------SVDDDGTYTIKIRK------DKKKREIKVLCVKPENFLVDR 208 (705) T ss_pred hhhh--hhhhhhhhhhcccc-------------------cccccceeeeEEee------eeecCceeeeeccHHHceecC Confidence 6532 222211 1111111 11122233333322 233567788999999999999 Q ss_pred CCCCchhhCCeEEEEEeccHHHHHhcCCCcch-hhcCcccchh-hcccchhhh-----hcccccc-ccCCcCCceEEEEE Q lcl|NC_013692. 255 SCGSDFSKAKFLIETFESSYAELKADGRYQNL-DKIQVEGQNL-LSEPDYTGP-----SEGVRNF-DFQDKSRKRLVVHE 326 (726) Q Consensus 255 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~-d~~~~~~~~~-~~~~~~~~~-----~~~~~~~-~~~~~~~~~v~v~E 326 (726) +|+ +++||+|++|+.++|+++|+++||+.+. +.+....... ........+ ....... .+.+...++|++|| T Consensus 209 ~a~-~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E 287 (705) T protein:vir:88 209 LAT-CIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASE 287 (705) T ss_pred CCC-CcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEE Confidence 986 5999999999999999999999988764 2332211110 001100000 0011111 12334456799999 Q ss_pred EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 327 YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMA 406 (726) Q Consensus 327 ~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~ 406 (726) ||++++++|||+.++++++++|++||+.+ +++++||++++++|+|+++||+|+++.++|+|+.+|+++|+++|+++ T Consensus 288 ~y~~~d~~~d~~~~~~~~~~~g~~il~~~----~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~ 363 (705) T protein:vir:88 288 CYTLLDVDGDGISELRRILYVGDYIISNE----PWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIY 363 (705) T ss_pred eeeEecccCCcceeeEEEEEeCccccccc----cCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999865 34789999999999999999999999999999999999999999999 Q ss_pred hcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCccc Q lcl|NC_013692. 407 RSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAA 486 (726) Q Consensus 407 ~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~ 486 (726) ++++|++++++|+++..+.++++||++++++++ .+|.+.++|+++++++.|++++.+.++++|||+++++|+++++ T Consensus 364 ~~~~~~~~~~~g~v~~~d~~~~~pg~vv~~~~~----~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~ 439 (705) T protein:vir:88 364 RTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSM----NSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNT 439 (705) T ss_pred hccCCceeccccccCcccccccCCCeeEEecCC----CccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCccc Confidence 999999999999999989999999999999864 3688889999999999999999999999999999999998877 Q ss_pred c--hhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecc Q lcl|NC_013692. 487 L--GDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS 563 (726) Q Consensus 487 ~--~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~ 563 (726) + ++|+++++++++++++++..++++|.. +++++|++++.||++|+++++++||+| +|++++|+++.++|++.+.++ T Consensus 440 ~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g-~~v~v~~~~~~~~~~v~v~v~ 518 (705) T protein:vir:88 440 LHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRG-KWVAVNPANWRERSDLTVTVG 518 (705) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeecc-chhccchHhhccCCceEEeec Confidence 5 579999999999999999999999974 789999999999999999999999998 599999999999999998877 Q ss_pred cchHHHHH----HHHHHHHHHHhhhc------cchhHHHHHHHHHHHhhhhhhhhhhHHH---HHhhhhhhhhhH--HHH Q lcl|NC_013692. 564 TAEEDNAK----VNDLTFMLQTMGPN------MDPMMAQQIMGQIMELKKMPDFAKRIRE---FQPQPDPIAQQK--AQL 628 (726) Q Consensus 564 ~~~~~~~~----~~~l~~l~q~~~~~------~~~~~~~~~~~~~~~~~~~~e~~~~l~~---~~~~~~~~~qq~--~q~ 628 (726) .+...+.+ .+.+..+.+.+.+. ..+.....+..++.+..+..+..+.+.. ....+.++..++ .+. T Consensus 519 ~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~ 598 (705) T protein:vir:88 519 IGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQP 598 (705) T ss_pred cccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhH Confidence 65443322 22333332222221 1112222334444444443332221111 111111111100 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHH Q lcl|NC_013692. 629 ELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQ------QARKRELQQAQSEAQ 702 (726) Q Consensus 629 e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~------~~~e~e~~~~q~~~q 702 (726) +...+++|++.++++++.+.+++..+.+ +.++++++.+.+.++++....+++.+.+ +.++.++++.+.+++ T Consensus 599 ~~~~~~~q~e~~k~q~e~~~~q~e~q~~---q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e 675 (705) T protein:vir:88 599 KPEDIKAQADAQRAQSDALAKQAEAQMK---QVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAE 675 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111222222222222222111111110 1111111111121111111111111110 111122233333333 Q ss_pred HHHHHHHHHHHHHHH-----HHHHHHhcC Q lcl|NC_013692. 703 GKLAMLNSQLKRLDE-----ATSARTSQK 726 (726) Q Consensus 703 ~~~~~l~~~~~~~~~-----~~~a~~~~q 726 (726) .++++.+.+.+...+ .++.....+ T Consensus 676 ~~~e~~q~~~~~~~~~~~~~~~k~~~~~r 704 (705) T protein:vir:88 676 YHLEATQARAAYIGDGKVPETKKPTKAVR 704 (705) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHhc Confidence 334433333322222 222222222 No 3 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=2e-95 Score=539.79 Aligned_cols=618 Identities=15% Similarity=0.195 Sum_probs=404.3 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHhccCCCCC--CCCCCCCCcCC Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEK----------ITQINRWLDYMHVRGEGK--PKTEKGKSAVQ 78 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----------~~~~~~~~~~y~~~~~~~--~~~~~grs~~v 78 (726) |+-.+.+-.|+...-.-.+.+.+.|.+++..++.+.... ...+.++++||++....+ +++..|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~ 80 (651) T protein:vir:80 1 MKLATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKIT 80 (651) T ss_pred CcccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCcccc Confidence 333333333333333455566777777777666654422 123467899999887643 46667999999 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhc---ccchhHHHHHHHHHhhcCCeEEEE Q lcl|NC_013692. 79 PPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTK---LNKQRFIDEYVRAGVDEGTIIVKV 155 (726) Q Consensus 79 ~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~---~~~~~~~~~~~~~~l~~~~~i~k~ 155 (726) +++|+.+|+|++|+|+++||++++||+|.|.+ |++.|++.+.+||+++..+ ++....++.+++++|++|+||+|+ T Consensus 81 ~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~--~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv 158 (651) T protein:vir:80 81 TGKAFEAIETIHAYLMSATFPNKNWFDVVPAK--PGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLAL 158 (651) T ss_pred ChhHHHHHHHHHHHHHHhhcCCCceeEeccCC--chhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEE Confidence 99999999999999999999999999999964 4567899999999998854 444455667789999999999999 Q ss_pred eeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccce Q lcl|NC_013692. 156 GWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREET 235 (726) Q Consensus 156 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 235 (726) ||+.++.+..+.... ...+.. |.+...+.+ .... T Consensus 159 ~we~~~~~~~~~~~~-----------------~~~~~~-----------------------~~~~~~v~~------~~~~ 192 (651) T protein:vir:80 159 PWRVETAEVKKKVQV-----------------RTPLFE-----------------------DEPTFEVVS------EERE 192 (651) T ss_pred eecceeeeeehheec-----------------cccccc-----------------------cccceeeec------ccee Confidence 999776554433210 000000 111112222 2334 Q ss_pred eeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHH---HHhcCCCcchhhcCcccchhhcccchhhhh-ccccc Q lcl|NC_013692. 236 VENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAE---LKADGRYQNLDKIQVEGQNLLSEPDYTGPS-EGVRN 311 (726) Q Consensus 236 ~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~e---l~~~g~~~~~d~~~~~~~~~~~~~~~~~~~-~~~~~ 311 (726) ..++|++++|||++|||||+|+ +++||.||+|+++ |+.+ |+++|+|.+++.............+..... ...+. T Consensus 193 ~~~~~~i~~v~p~~~~~dp~a~-~~~d~~~v~~~~~-t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (651) T protein:vir:80 193 VKSSPDFEVLDMFDCFYDPNVT-DPNRGAFIRKLTK-TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQG 270 (651) T ss_pred eeceeEEEEecHHHeeecCCCc-Cccccceeeeeee-eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccC Confidence 5678999999999999999985 6999999998855 5555 556788877654332221111111111110 11111 Q ss_pred cc-cCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 312 FD-FQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 312 ~~-~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) .+ ....+.++|+|||||++++.+|+++ +++++++.|.+||+.+++||+++ +||++++++++||++||+|+++.+.+. T Consensus 271 ~d~~~~~~~~~v~v~E~~~~~d~e~~~~-~~~~v~~~g~~il~~~~~~~~~~-~Pf~~~~~~~~~~~~yG~g~~~~~~~~ 348 (651) T protein:vir:80 271 VTTSLWSPHQNVELLEYWGDIHLENKTY-HDVVVTIMGNEVLRFEQNPYWCG-RPFVIGTYIPTARQPYAMGALQPNLGM 348 (651) T ss_pred CCccccccccceEEEEEEEEeeccCCce-EEEEEEEcCcEEecccccCCCCC-CCeeeecceecCccccCCChHHHHhHH Confidence 11 1223567899999999999998887 57788889999999999999865 599999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccC-ccchhHHHHHHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTF-PEIPQSAQYMINLQQAEA 469 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~-~~~~~~~~~ll~~~~~~~ 469 (726) |+.+|+++|+++++++++++|+++++.|++...+...+.||++|+++.+.+. .+.++ +..+...+.+++++.+.+ T Consensus 349 q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~~~----~~l~~~~~~~~~~~~~l~~l~~~~ 424 (651) T protein:vir:80 349 LHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHGDL----QPLANQSSNFSITYQESSFLESTI 424 (651) T ss_pred HHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCCCc----eeeccCcccchhHHHHHHHHHHHH Confidence 9999999999999999999999999988876666667899999988765543 33322 335667888999999999 Q ss_pred HHHhchHHHhhccCcccch-hhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecc----- Q lcl|NC_013692. 470 ESMTGVKAFNAGISGAALG-DTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNE----- 542 (726) Q Consensus 470 e~~tGv~~~~~G~~~~~~~-~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~----- 542 (726) +++|||+++++|.++.+++ .||++|+++++++++++..++++|.. +++.++++++.++++|++.++++|++|. T Consensus 425 ~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~ 504 (651) T protein:vir:80 425 DKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAY 504 (651) T ss_pred HHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccc Confidence 9999999999998877654 59999999999999999999999987 7899999999999999999999999986 Q ss_pred cceecchhhcccccceeeecccch--HHHHHHHHHHHHHHHhhhccch---hHHHHHHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 543 HFVDIRRDDLAGNFDLKLDISTAE--EDNAKVNDLTFMLQTMGPNMDP---MMAQQIMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 543 ~~v~v~~~~~~~~~dv~i~~~~~~--~~~~~~~~l~~l~q~~~~~~~~---~~~~~~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) .++.+++.++.+++++.. .+... ......+.+..+++.+++..+. .+...++..+++..++.+....+....++ T Consensus 505 ~~~~i~~~dl~~~~~iv~-~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~ 583 (651) T protein:vir:80 505 EYYELDVEDLQKEVRLVP-IGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQ 583 (651) T ss_pred cccccCccceeeeeeeee-ccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccc Confidence 367788888888888752 33322 2334455566666666553322 12334445566666665443333211111 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQA 697 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~ 697 (726) ....+++ ....|++...++++...++.+. .+.++....+.+..+ +++++.+++ +.++++.+. T Consensus 584 ~~~~~~~-------~~~~q~~~~~~~a~~~~~~~~~--~~~~~~~~~~~~~~~---~~~~~~~~~----~~~~~~~l~-- 645 (651) T protein:vir:80 584 APANPQE-------ALLSQAKDVGGQAMSNMLQNQL--QADGGTQMMSEMYGT---PNADQMQQE----LMATTPNVS-- 645 (651) T ss_pred hhhhhhH-------HHHhhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH---HHHHHHHHH----HHHHHHHHH-- Confidence 1100000 0011111111111111000000 000000000000000 000000000 011111111 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_013692. 698 QSEAQGKLAMLNSQLKR 714 (726) Q Consensus 698 q~~~q~~~~~l~~~~~~ 714 (726) ++++.+ T Consensus 646 -----------~~~~~~ 651 (651) T protein:vir:80 646 -----------EQQLTQ 651 (651) T ss_pred -----------HhhccC Confidence 111111 No 4 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=1.1e-94 Score=535.77 Aligned_cols=629 Identities=15% Similarity=0.155 Sum_probs=409.6 Q ss_pred CCCccchhh-----------cCCCCCCccchhcCCCC-CCch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_013692. 1 MADVDEDYL-----------TLPNEDGDPSKRLQPEW-SNAP---SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGE 65 (726) Q Consensus 1 ~~~~~~~~~-----------~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~ 65 (726) |.|+.+--+ -|.+.+..+.|....+= .++. .++.|...|..+...+...-+...+.++||+|+-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw 80 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW 80 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC Confidence 777655222 23233333333332222 2333 44555555555554444444555678999987642 Q ss_pred CC----CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 66 GK----PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 66 ~~----~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) -+ .....|++.+|-+.|.-+|+|++.+..+ +-.-+.|.|.+++|++.|+..|.++||++ ..+++.....++ T Consensus 81 ~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~----nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~a 155 (776) T protein:vir:93 81 SQDEIDELKERGQAPTVYNVISQSVNWIIGSEKR----GRSDFKVLPRRKDGGKAAERKTALLKYLS-DVNHTPFERSMA 155 (776) T ss_pred CHHHHHHHHhcCCceEEecchHHHHHHHHHHHHh----CCcceEEecCChhHHHHHHHHHHHHHHHH-HhhcHHHHHHHH Confidence 11 1233699999999999999999987755 44458999999999999999999999985 799999999999 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhccccee Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVR 221 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 221 (726) |+++|++|.|+++++|++... |. T Consensus 156 f~d~~~~G~G~~~v~~d~~~~------------------------------------------------------~~--- 178 (776) T protein:vir:93 156 FEETTKAGIGWLESQVQDEND------------------------------------------------------GE--- 178 (776) T ss_pred HHHhhhcCcceEEEEeeccCC------------------------------------------------------CC--- Confidence 999999999999999972100 00 Q ss_pred eccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhc-- Q lcl|NC_013692. 222 AVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLS-- 298 (726) Q Consensus 222 ~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~-- 298 (726) .+.+++|+|++|||||++++ |++||+||||++|||+++|+++ |++..+.+.....+... T Consensus 179 -----------------~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-~p~~~~~~~~~~~~~~~~~ 240 (776) T protein:vir:93 179 -----------------PIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAI-FPERAAQLRAAAVDNFETW 240 (776) T ss_pred -----------------ceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHh-cCCchHHHHHhhhhccccc Confidence 01236789999999998765 9999999999999999999998 55444433211111000 Q ss_pred -c----------cchhhhhccccccccCCcCCceEEEEEEEEEeecC--------C------------------------ Q lcl|NC_013692. 299 -E----------PDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIH--------G------------------------ 335 (726) Q Consensus 299 -~----------~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~--------~------------------------ 335 (726) . ..............+.+..+++|+|+|||+|..+. + T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~ 320 (776) T protein:vir:93 241 GTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAV 320 (776) T ss_pred chhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCcee Confidence 0 00001112233345566778999999999875321 0 Q ss_pred ---CceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_013692. 336 ---DGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQ 412 (726) Q Consensus 336 ---~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~ 412 (726) .+..++++++|+|+++|+.+++||+|++|||+++++++++++++|+|+++.++|+|+++|+++|+++|+|+ +++ T Consensus 321 ~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~---~~~ 397 (776) T protein:vir:93 321 LAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS---TNK 397 (776) T ss_pred ehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc---CCc Confidence 12245688999999999999999999999999999999999999999999999999999999999999874 568 Q ss_pred eEeecccccchhhhh---hcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchh Q lcl|NC_013692. 413 VGVMKGALDVTNRRR---FDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGD 489 (726) Q Consensus 413 ~~~~~gav~~~d~~~---~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ 489 (726) +++++|+|++.+... ++||++|++++|+.. .+.+.+.++++..+++++++..+.++++|||+++++|..+++ . T Consensus 398 ~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~--~ 473 (776) T protein:vir:93 398 VLMEEGAVDDIDEFRREAARPDAVMTVKNGKLG--AVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNA--V 473 (776) T ss_pred eeeccccccchHHHHHhcccCCceeeeCCcccc--ccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcch--h Confidence 999999998877544 689999999998754 345556778899999999999999999999999999988765 5 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecchhh-----cccccceee Q lcl|NC_013692. 490 TATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIRRDD-----LAGNFDLKL 560 (726) Q Consensus 490 ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~~~~-----~~~~~dv~i 560 (726) ++.++++++++|++++..+++||+.+++++|+++|+||.+||++++++||+|+ .||.|+... ..++|||.| T Consensus 474 Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v 553 (776) T protein:vir:93 474 SGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFII 553 (776) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEE Confidence 77789999999999999999999999999999999999999999999999986 588887543 357899999 Q ss_pred ecccchHHH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_013692. 561 DISTAEEDN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIE 638 (726) Q Consensus 561 ~~~~~~~~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e 638 (726) ..++.+..+ .....+.++++.+.+........ .+.+++.+.+..++.+.++.....+.+.+.+..+.+++..+++.+ T Consensus 554 ~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~-~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~ 632 (776) T protein:vir:93 554 DEAEWRATMRQAAVAELMEVIGKMPPEIALTMLD-LLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQ 632 (776) T ss_pred eecccchhHHHHHHHHHHHHHhhcChhhHHHHHH-HHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhH Confidence 988765443 33334444444333333222211 222334445555666666655544433333333222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 639 AERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRLDEA 718 (726) Q Consensus 639 ~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~~~~ 718 (726) .++++.+.+..+...++++....+++++..++++.+...+...+ ..+ +. ..++++..+ .... .+....... T Consensus 633 ~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~--~~~-a~---q~a~qa~~~--~~~~-~~~a~~a~~ 703 (776) T protein:vir:93 633 QQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIRE--GVG-AV---KDATDAATA--IAFM-PELAGLSDG 703 (776) T ss_pred HHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhh--hhh-hh---hhhhhhhhh--hhhh-hhhhhhhhh Confidence 22222222222222222222222222222222222111111000 000 00 000000000 0000 000000000 Q ss_pred HHHHHhcC Q lcl|NC_013692. 719 TSARTSQK 726 (726) Q Consensus 719 ~~a~~~~q 726 (726) .......+ T Consensus 704 ~~~~a~~~ 711 (776) T protein:vir:93 704 ILRESGWD 711 (776) T ss_pred hhcccccc Confidence 00000000 No 5 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=3.5e-88 Score=500.04 Aligned_cols=615 Identities=14% Similarity=0.117 Sum_probs=399.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCC---CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC----CCCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEW---SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK----PKTEKG 73 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~----~~~~~g 73 (726) |+. .-|+.-.+.+-. .|.++..- .++..+..+...|..+..+..+....-.++++||+|.--.+ .-...| T Consensus 1 ~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g 77 (711) T protein:vir:10 1 MAK--KQKKSRVEQLYA-KKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQ 77 (711) T ss_pred CCc--ccccccccchhH-HHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcC Confidence 432 111111112212 22221111 23346777888888888777777766668899998642100 012358 Q ss_pred CCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCC----------------------cchHHHHHHHHHHHHHHHhhc Q lcl|NC_013692. 74 KSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVT----------------------WEDAESARQNGLVLNQQFNTK 131 (726) Q Consensus 74 rs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~----------------------~~D~~~A~q~t~~~n~~~~~~ 131 (726) +.-++-+.|+-.|++++..-.. +-.-+.|.|+. .+|.+.|++.|.+++|+. .. T Consensus 78 ~p~~~~N~i~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~ 152 (711) T protein:vir:10 78 RPCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YN 152 (711) T ss_pred CCcEEEcchHHHHHHHhhhHhh----CCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHH-Hh Confidence 9999999999999999976643 33446888864 789999999999999976 46 Q ss_pred ccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhh Q lcl|NC_013692. 132 LNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLE 211 (726) Q Consensus 132 ~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 211 (726) ++......++|+++|++|.|+++++|++.... + T Consensus 153 ~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d--------------------------------~--------------- 185 (711) T protein:vir:10 153 CDAETEYDIAFQGAVESGMGYLRVRSDYLADD--------------------------------S--------------- 185 (711) T ss_pred cChhHHHHHHHHHhhhcCcceEEEEecccCCC--------------------------------C--------------- Confidence 67777788999999999999999988732110 0 Q ss_pred hhhhcccceeeccccceeecccceeeccceeeee-chhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhc Q lcl|NC_013692. 212 ETEANGIQVRAVPVGSEEEEREETVENHPTVQVC-DYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKI 289 (726) Q Consensus 212 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v-~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~ 289 (726) ..+.|.|.+| +|++|||||.++ .|++||+|||+++|||+++++++ |+...... T Consensus 186 ------------------------~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-yp~~a~~~ 240 (711) T protein:vir:10 186 ------------------------FEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKAL-YPDATAEP 240 (711) T ss_pred ------------------------CCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHh-CCchhhhh Confidence 0122445566 799999999664 59999999999999999999998 54332111 Q ss_pred CcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecC------CCc-------------------------- Q lcl|NC_013692. 290 QVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIH------GDG-------------------------- 337 (726) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~------~~g-------------------------- 337 (726) ..... .. . .......++|+|+|||.+.... ++| T Consensus 241 ~~~~~----~~---------~--~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 305 (711) T protein:vir:10 241 VYEDS----VA---------D--YDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTR 305 (711) T ss_pred hhccc----cc---------c--cCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhh Confidence 00000 00 0 0112235789999999774311 111 Q ss_pred ---eEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_013692. 338 ---VLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQ 412 (726) Q Consensus 338 ---~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~ 412 (726) ..+.++.+|+|.++| .+++||+|++|||+|+++++. +++++++|+++.++|+|+++|+++|+++|+++++++++ T Consensus 306 ~~~~~~v~~~~~~G~~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 384 (711) T protein:vir:10 306 KVKTFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAP 384 (711) T ss_pred hhceeeEEEEEEecceee-cCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCc Confidence 123455678999999 688999999999999999865 78888999999999999999999999999999999999 Q ss_pred eEeecccccchhhh----hhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccch Q lcl|NC_013692. 413 VGVMKGALDVTNRR----RFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALG 488 (726) Q Consensus 413 ~~~~~gav~~~d~~----~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~ 488 (726) +++++|+|++.++. ..+||++++++++.++...+.+.+.|++|++++.|+++..+.++++|||+++++|..+++ T Consensus 385 ~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~-- 462 (711) T protein:vir:10 385 FIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNE-- 462 (711) T ss_pred eeecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccc-- Confidence 99999999876543 368999999999998888899999999999999999999999999999999999988775 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecchh-------------- Q lcl|NC_013692. 489 DTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIRRD-------------- 550 (726) Q Consensus 489 ~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~~~-------------- 550 (726) .|+.+|++++++|.+.+..+++||+.+++++|+++|+||.+||++++++||+|+ +++.++.. T Consensus 463 ~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nD 542 (711) T protein:vir:10 463 TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHD 542 (711) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeec Confidence 688899999999999999999999999999999999999999999999999986 57777643 Q ss_pred hcccccceeeecccchHHHHH--HHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHH Q lcl|NC_013692. 551 DLAGNFDLKLDISTAEEDNAK--VNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQL 628 (726) Q Consensus 551 ~~~~~~dv~i~~~~~~~~~~~--~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~ 628 (726) ...++|||.+..++.+..... ...+.++++.. +...+. ...++.+++++.+..++.+.++....++...+....+. T Consensus 543 i~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~-p~~~~~-~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~ 620 (711) T protein:vir:10 543 LNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAV-MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAI 620 (711) T ss_pred cceeeeEEEEeeccCchhHHHHHHHHHHHHHhhc-chhhhH-HHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHH Confidence 336789999999876654433 33333333332 222222 22233455566666677777766655444333222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH---HHHHHHHH Q lcl|NC_013692. 629 ELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQ-QARKRELQQ---AQSEAQGK 704 (726) Q Consensus 629 e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~-~~~e~e~~~---~q~~~q~~ 704 (726) ++...+.+.+..+.+.+.+.++....++.+...++++.+++.+++....+......... +.....+++ ...+.+++ T Consensus 621 qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qae 700 (711) T protein:vir:10 621 EEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAE 700 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22111111111111111111111111111222222222222111111100000000000 000000110 01111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 705 LAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 705 ~~~l~~~~~~~~~~~~a~~~~q 726 (726) +...++++. +| T Consensus 701 lq~~q~~~~-----------q~ 711 (711) T protein:vir:10 701 ITASQANVT-----------EQ 711 (711) T ss_pred HHHHHHHhh-----------cC Confidence 111111111 11 No 6 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=623 Identities=12% Similarity=0.066 Sum_probs=391.6 Q ss_pred CCCCCCccchhcCCCCCCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA---PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTI 82 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v 82 (726) |..++....-++-.+-+-+ ..++-+..++++ +...-..-.+.++||+|.= -+ .-...|+.-+|-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~R~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDS----QPKWRDAANKACAYYDGDQ-LPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCCcEEeccH Confidence 3333333333332221111 223333333333 2222334567899998632 12 123359999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) +-+|+|++-.-. .+-.=+.|.|++++|+ +.|+..|.+++|++. .++.-....++|.++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:32 76 APTVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 999999997663 3444579999887655 789999999999875 555556777999999999999887766510 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) + ....+ T Consensus 151 -----------------------------------~---------------------------------------~~~~i 156 (714) T protein:vir:32 151 -----------------------------------P---------------------------------------FGPEF 156 (714) T ss_pred -----------------------------------C---------------------------------------CCCCe Confidence 0 01224 Q ss_pred eeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh---------------hcccchhh Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL---------------LSEPDYTG 304 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~---------------~~~~~~~~ 304 (726) +|++|||++|||||+++ .|++||+|+||++|||+++|+++ |++..+.+.....+. ....+... T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:32 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 67899999999999664 59999999999999999999998 554333332111000 00111112 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeec---------------CC------------------CceEEEEEEEEECCEE Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDI---------------HG------------------DGVLHPIVATWVGAVM 351 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~---------------~~------------------~g~~~~~~~~~~g~~~ 351 (726) .+.....+.|.+..+++|+|+|||.|... ++ ..+.++++++|+|+++ T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:32 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 22334455667777899999999987321 11 1235678899999999 Q ss_pred EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh----h Q lcl|NC_013692. 352 IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR----R 427 (726) Q Consensus 352 l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~----~ 427 (726) |+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++| +++ ++++.+|++++.+.. . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:32 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLMEQI 392 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHHhc Confidence 999999999999999999999998888889999999999999999999998865 444 466889998776532 3 Q ss_pred hcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 428 FDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 428 ~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) .+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++++++|.+ T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAi~~rq~qg~~ 470 (714) T protein:vir:32 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISNLVEQGAT 470 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc--hhHHHHHHHHHHHHH Confidence 7899999998753 3345577788899999999999999999999999999999988876 567779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecchhh---------cccccceeeecccchH Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRRDD---------LAGNFDLKLDISTAEE 567 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~~~---------~~~~~dv~i~~~~~~~ 567 (726) .+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|+.. ..++|||.+..++.+. T Consensus 471 ~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:32 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 9999999999999999999999999999999999999752 67776543 4678999999988655 Q ss_pred HH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013692. 568 DN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL-MLLQAQIEAERARA 644 (726) Q Consensus 568 ~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~-q~~qaq~e~~~aq~ 644 (726) .+ +....+.++++.+.+....... .++.+++++.+..++.+++++...+++...+..++.++ +..++++++++++. T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~-~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:32 551 AFKAQLAQRMSEVIQGLPPQVQAVVL-DLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHH-HHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 43 3444455555544433332222 23445666677777888887765544332221111111 11111111111111 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013692. 645 AHYMSGAGLQ--DSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRL-DEAT 719 (726) Q Consensus 645 q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~-~~~~ 719 (726) +....++..+ ++.+...++++.+...++.+ ...+..+...+..++...+........+++...++++.++. +... T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:32 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111111111111110 00000000000000000001111111111122222222111 1111 Q ss_pred HHHHh Q lcl|NC_013692. 720 SARTS 724 (726) Q Consensus 720 ~a~~~ 724 (726) .+-.. T Consensus 710 ~~~~~ 714 (714) T protein:vir:32 710 NEMSL 714 (714) T ss_pred HhcCC Confidence 11111 No 7 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=623 Identities=12% Similarity=0.066 Sum_probs=391.6 Q ss_pred CCCCCCccchhcCCCCCCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA---PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTI 82 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v 82 (726) |..++....-++-.+-+-+ ..++-+..++++ +...-..-.+.++||+|.= -+ .-...|+.-+|-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~R~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDS----QPKWRDAANKACAYYDGDQ-LPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCCcEEeccH Confidence 3333333333332221111 223333333333 2222334567899998632 12 123359999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) +-+|+|++-.-. .+-.=+.|.|++++|+ +.|+..|.+++|++. .++.-....++|.++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:10 76 APTVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 999999997663 3444579999887655 789999999999875 555556777999999999999887766510 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) + ....+ T Consensus 151 -----------------------------------~---------------------------------------~~~~i 156 (714) T protein:vir:10 151 -----------------------------------P---------------------------------------FGPEF 156 (714) T ss_pred -----------------------------------C---------------------------------------CCCCe Confidence 0 01224 Q ss_pred eeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh---------------hcccchhh Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL---------------LSEPDYTG 304 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~---------------~~~~~~~~ 304 (726) +|++|||++|||||+++ .|++||+|+||++|||+++|+++ |++..+.+.....+. ....+... T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:10 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 67899999999999664 59999999999999999999998 554333332111000 00111112 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeec---------------CC------------------CceEEEEEEEEECCEE Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDI---------------HG------------------DGVLHPIVATWVGAVM 351 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~---------------~~------------------~g~~~~~~~~~~g~~~ 351 (726) .+.....+.|.+..+++|+|+|||.|... ++ ..+.++++++|+|+++ T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:10 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 22334455667777899999999987321 11 1235678899999999 Q ss_pred EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh----h Q lcl|NC_013692. 352 IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR----R 427 (726) Q Consensus 352 l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~----~ 427 (726) |+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++| +++ ++++.+|++++.+.. . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:10 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLMEQI 392 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHHhc Confidence 999999999999999999999998888889999999999999999999998865 444 466889998776532 3 Q ss_pred hcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 428 FDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 428 ~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) .+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++++++|.+ T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAi~~rq~qg~~ 470 (714) T protein:vir:10 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISNLVEQGAT 470 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc--hhHHHHHHHHHHHHH Confidence 7899999998753 3345577788899999999999999999999999999999988876 567779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecchhh---------cccccceeeecccchH Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRRDD---------LAGNFDLKLDISTAEE 567 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~~~---------~~~~~dv~i~~~~~~~ 567 (726) .+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|+.. ..++|||.+..++.+. T Consensus 471 ~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:10 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 9999999999999999999999999999999999999752 67776543 4678999999988655 Q ss_pred HH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013692. 568 DN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL-MLLQAQIEAERARA 644 (726) Q Consensus 568 ~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~-q~~qaq~e~~~aq~ 644 (726) .+ +....+.++++.+.+....... .++.+++++.+..++.+++++...+++...+..++.++ +..++++++++++. T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~-~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:10 551 AFKAQLAQRMSEVIQGLPPQVQAVVL-DLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHH-HHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 43 3444455555544433332222 23445666677777888887765544332221111111 11111111111111 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013692. 645 AHYMSGAGLQ--DSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRL-DEAT 719 (726) Q Consensus 645 q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~-~~~~ 719 (726) +....++..+ ++.+...++++.+...++.+ ...+..+...+..++...+........+++...++++.++. +... T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:10 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111111111111110 00000000000000000001111111111122222222111 1111 Q ss_pred HHHHh Q lcl|NC_013692. 720 SARTS 724 (726) Q Consensus 720 ~a~~~ 724 (726) .+-.. T Consensus 710 ~~~~~ 714 (714) T protein:vir:10 710 NEMSL 714 (714) T ss_pred HhcCC Confidence 11111 No 8 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=623 Identities=12% Similarity=0.066 Sum_probs=391.6 Q ss_pred CCCCCCccchhcCCCCCCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA---PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTI 82 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v 82 (726) |..++....-++-.+-+-+ ..++-+..++++ +...-..-.+.++||+|.= -+ .-...|+.-+|-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~R~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDS----QPKWRDAANKACAYYDGDQ-LPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCCcEEeccH Confidence 3333333333332221111 223333333333 2222334567899998632 12 123359999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) +-+|+|++-.-. .+-.=+.|.|++++|+ +.|+..|.+++|++. .++.-....++|.++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:81 76 APTVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 999999997663 3444579999887655 789999999999875 555556777999999999999887766510 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) + ....+ T Consensus 151 -----------------------------------~---------------------------------------~~~~i 156 (714) T protein:vir:81 151 -----------------------------------P---------------------------------------FGPEF 156 (714) T ss_pred -----------------------------------C---------------------------------------CCCCe Confidence 0 01224 Q ss_pred eeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh---------------hcccchhh Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL---------------LSEPDYTG 304 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~---------------~~~~~~~~ 304 (726) +|++|||++|||||+++ .|++||+|+||++|||+++|+++ |++..+.+.....+. ....+... T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:81 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 67899999999999664 59999999999999999999998 554333332111000 00111112 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeec---------------CC------------------CceEEEEEEEEECCEE Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDI---------------HG------------------DGVLHPIVATWVGAVM 351 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~---------------~~------------------~g~~~~~~~~~~g~~~ 351 (726) .+.....+.|.+..+++|+|+|||.|... ++ ..+.++++++|+|+++ T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:81 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 22334455667777899999999987321 11 1235678899999999 Q ss_pred EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh----h Q lcl|NC_013692. 352 IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR----R 427 (726) Q Consensus 352 l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~----~ 427 (726) |+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++| +++ ++++.+|++++.+.. . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:81 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLMEQI 392 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHHhc Confidence 999999999999999999999998888889999999999999999999998865 444 466889998776532 3 Q ss_pred hcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 428 FDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 428 ~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) .+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++++++|.+ T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAi~~rq~qg~~ 470 (714) T protein:vir:81 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISNLVEQGAT 470 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc--hhHHHHHHHHHHHHH Confidence 7899999998753 3345577788899999999999999999999999999999988876 567779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecchhh---------cccccceeeecccchH Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRRDD---------LAGNFDLKLDISTAEE 567 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~~~---------~~~~~dv~i~~~~~~~ 567 (726) .+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|+.. ..++|||.+..++.+. T Consensus 471 ~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:81 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 9999999999999999999999999999999999999752 67776543 4678999999988655 Q ss_pred HH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013692. 568 DN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL-MLLQAQIEAERARA 644 (726) Q Consensus 568 ~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~-q~~qaq~e~~~aq~ 644 (726) .+ +....+.++++.+.+....... .++.+++++.+..++.+++++...+++...+..++.++ +..++++++++++. T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~-~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:81 551 AFKAQLAQRMSEVIQGLPPQVQAVVL-DLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHH-HHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 43 3444455555544433332222 23445666677777888887765544332221111111 11111111111111 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013692. 645 AHYMSGAGLQ--DSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRL-DEAT 719 (726) Q Consensus 645 q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~-~~~~ 719 (726) +....++..+ ++.+...++++.+...++.+ ...+..+...+..++...+........+++...++++.++. +... T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:81 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111111111111110 00000000000000000001111111111122222222111 1111 Q ss_pred HHHHh Q lcl|NC_013692. 720 SARTS 724 (726) Q Consensus 720 ~a~~~ 724 (726) .+-.. T Consensus 710 ~~~~~ 714 (714) T protein:vir:81 710 NEMSL 714 (714) T ss_pred HhcCC Confidence 11111 No 9 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=623 Identities=12% Similarity=0.066 Sum_probs=391.6 Q ss_pred CCCCCCccchhcCCCCCCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA---PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTI 82 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v 82 (726) |..++....-++-.+-+-+ ..++-+..++++ +...-..-.+.++||+|.= -+ .-...|+.-+|-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~R~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDS----QPKWRDAANKACAYYDGDQ-LPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCCcEEeccH Confidence 3333333333332221111 223333333333 2222334567899998632 12 123359999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) +-+|+|++-.-. .+-.=+.|.|++++|+ +.|+..|.+++|++. .++.-....++|.++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:99 76 APTVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 999999997663 3444579999887655 789999999999875 555556777999999999999887766510 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) + ....+ T Consensus 151 -----------------------------------~---------------------------------------~~~~i 156 (714) T protein:vir:99 151 -----------------------------------P---------------------------------------FGPEF 156 (714) T ss_pred -----------------------------------C---------------------------------------CCCCe Confidence 0 01224 Q ss_pred eeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh---------------hcccchhh Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL---------------LSEPDYTG 304 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~---------------~~~~~~~~ 304 (726) +|++|||++|||||+++ .|++||+|+||++|||+++|+++ |++..+.+.....+. ....+... T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:99 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 67899999999999664 59999999999999999999998 554333332111000 00111112 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeec---------------CC------------------CceEEEEEEEEECCEE Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDI---------------HG------------------DGVLHPIVATWVGAVM 351 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~---------------~~------------------~g~~~~~~~~~~g~~~ 351 (726) .+.....+.|.+..+++|+|+|||.|... ++ ..+.++++++|+|+++ T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:99 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 22334455667777899999999987321 11 1235678899999999 Q ss_pred EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh----h Q lcl|NC_013692. 352 IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR----R 427 (726) Q Consensus 352 l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~----~ 427 (726) |+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++| +++ ++++.+|++++.+.. . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:99 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLMEQI 392 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHHhc Confidence 999999999999999999999998888889999999999999999999998865 444 466889998776532 3 Q ss_pred hcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 428 FDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 428 ~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) .+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++++++|.+ T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAi~~rq~qg~~ 470 (714) T protein:vir:99 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISNLVEQGAT 470 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc--hhHHHHHHHHHHHHH Confidence 7899999998753 3345577788899999999999999999999999999999988876 567779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecchhh---------cccccceeeecccchH Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRRDD---------LAGNFDLKLDISTAEE 567 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~~~---------~~~~~dv~i~~~~~~~ 567 (726) .+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|+.. ..++|||.+..++.+. T Consensus 471 ~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:99 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 9999999999999999999999999999999999999752 67776543 4678999999988655 Q ss_pred HH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013692. 568 DN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL-MLLQAQIEAERARA 644 (726) Q Consensus 568 ~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~-q~~qaq~e~~~aq~ 644 (726) .+ +....+.++++.+.+....... .++.+++++.+..++.+++++...+++...+..++.++ +..++++++++++. T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~-~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:99 551 AFKAQLAQRMSEVIQGLPPQVQAVVL-DLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHH-HHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 43 3444455555544433332222 23445666677777888887765544332221111111 11111111111111 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013692. 645 AHYMSGAGLQ--DSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRL-DEAT 719 (726) Q Consensus 645 q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~-~~~~ 719 (726) +....++..+ ++.+...++++.+...++.+ ...+..+...+..++...+........+++...++++.++. +... T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:99 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111111111111110 00000000000000000001111111111122222222111 1111 Q ss_pred HHHHh Q lcl|NC_013692. 720 SARTS 724 (726) Q Consensus 720 ~a~~~ 724 (726) .+-.. T Consensus 710 ~~~~~ 714 (714) T protein:vir:99 710 NEMSL 714 (714) T ss_pred HhcCC Confidence 11111 No 10 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=2.9e-85 Score=484.03 Aligned_cols=623 Identities=12% Similarity=0.066 Sum_probs=391.6 Q ss_pred CCCCCCccchhcCCCCCCc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA---PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTI 82 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v 82 (726) |..++....-++-.+-+-+ ..++-+..++++ +...-..-.+.++||+|.= -+ .-...|+.-+|-..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~R~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDS----QPKWRDAANKACAYYDGDQ-LPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCCcEEeccH Confidence 3333333333332221111 223333333333 2222334567899998632 12 123359999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) +-+|+|++-.-. .+-.=+.|.|++++|+ +.|+..|.+++|++. .++.-....++|.++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~----~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:27 76 APTVDGVLGMEA----KTRTDLVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHH----hCCcceEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 999999997663 3444579999887655 789999999999875 555556777999999999999887766510 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) + ....+ T Consensus 151 -----------------------------------~---------------------------------------~~~~i 156 (714) T protein:vir:27 151 -----------------------------------P---------------------------------------FGPEF 156 (714) T ss_pred -----------------------------------C---------------------------------------CCCCe Confidence 0 01224 Q ss_pred eeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh---------------hcccchhh Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL---------------LSEPDYTG 304 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~---------------~~~~~~~~ 304 (726) +|++|||++|||||+++ .|++||+|+||++|||+++|+++ |++..+.+.....+. ....+... T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:27 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 67899999999999664 59999999999999999999998 554333332111000 00111112 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeec---------------CC------------------CceEEEEEEEEECCEE Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDI---------------HG------------------DGVLHPIVATWVGAVM 351 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~---------------~~------------------~g~~~~~~~~~~g~~~ 351 (726) .+.....+.|.+..+++|+|+|||.|... ++ ..+.++++++|+|+++ T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:27 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 22334455667777899999999987321 11 1235678899999999 Q ss_pred EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh----h Q lcl|NC_013692. 352 IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR----R 427 (726) Q Consensus 352 l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~----~ 427 (726) |+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++| +++ ++++.+|++++.+.. . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~-~~~~~~~a~~~~d~~~~e~~ 392 (714) T protein:vir:27 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK-RVIMDEDATQLSDNDLMEQI 392 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCC-ceeeecCcccccHHHHHHhc Confidence 999999999999999999999998888889999999999999999999998865 444 466889998776532 3 Q ss_pred hcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 428 FDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 428 ~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) .+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++++++|.+ T Consensus 393 arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAi~~rq~qg~~ 470 (714) T protein:vir:27 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISNLVEQGAT 470 (714) T ss_pred cCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccc--hhHHHHHHHHHHHHH Confidence 7899999998753 3345577788899999999999999999999999999999988876 567779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecchhh---------cccccceeeecccchH Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRRDD---------LAGNFDLKLDISTAEE 567 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~~~---------~~~~~dv~i~~~~~~~ 567 (726) .+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|+.. ..++|||.+..++.+. T Consensus 471 ~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:27 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 9999999999999999999999999999999999999752 67776543 4678999999988655 Q ss_pred HH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013692. 568 DN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL-MLLQAQIEAERARA 644 (726) Q Consensus 568 ~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~-q~~qaq~e~~~aq~ 644 (726) .+ +....+.++++.+.+....... .++.+++++.+..++.+++++...+++...+..++.++ +..++++++++++. T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~-~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:27 551 AFKAQLAQRMSEVIQGLPPQVQAVVL-DLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHH-HHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 43 3444455555544433332222 23445666677777888887765544332221111111 11111111111111 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013692. 645 AHYMSGAGLQ--DSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRL-DEAT 719 (726) Q Consensus 645 q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~-~~~~ 719 (726) +....++..+ ++.+...++++.+...++.+ ...+..+...+..++...+........+++...++++.++. +... T Consensus 630 q~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:27 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 1111111111 11111111111111111110 00000000000000000001111111111122222222111 1111 Q ss_pred HHHHh Q lcl|NC_013692. 720 SARTS 724 (726) Q Consensus 720 ~a~~~ 724 (726) .+-.. T Consensus 710 ~~~~~ 714 (714) T protein:vir:27 710 NEMSL 714 (714) T ss_pred HhcCC Confidence 11111 No 11 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=3.9e-83 Score=472.39 Aligned_cols=621 Identities=13% Similarity=0.080 Sum_probs=386.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKS 75 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs 75 (726) |.. |-+...+.+++.. +..+.- ..+..+..++++ +..--..-.++++||+|.= -+ .-...|+. T Consensus 1 ~~~-~~~~~~~~~~~~~-----~~~~~~-~~l~~~~~~~~~----~~~~r~~a~~d~~fy~G~Q-w~~~~~~~l~~~g~p 68 (714) T protein:vir:10 1 MKN-EINTTAMKNDHGS-----TPRFSQ-RQLLSLCSDIDS----QPLWRDAANKACAYYDGDQ-LAPEVIQVLKDRGQP 68 (714) T ss_pred CCc-CcCcccCCCcchh-----hhhhhH-HHHHHHHHHHhh----hHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCC Confidence 433 3333223222221 111111 223333333333 2222244577899998632 11 12235999 Q ss_pred cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchH--HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEE Q lcl|NC_013692. 76 AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDA--ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIV 153 (726) Q Consensus 76 ~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~--~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~ 153 (726) -++-..|+-+|+|++-.-.. +-.=+.|.|++++|+ +.|+..|.+++|+.. .++.-....++|.++|++|.|++ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~----nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~ 143 (714) T protein:vir:10 69 MTIHNLIAPTVDGVLGMEAK----TRTDLIVMSDDPNDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWV 143 (714) T ss_pred cEEeccHHHHHHHHHHHHHh----CCcceEEecCCCChhhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcccceE Confidence 99999999999999976633 444479999987665 689999999999875 55555677799999999999998 Q ss_pred EEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccc Q lcl|NC_013692. 154 KVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEERE 233 (726) Q Consensus 154 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (726) +++|++.. T Consensus 144 ~~~~d~d~------------------------------------------------------------------------ 151 (714) T protein:vir:10 144 EVRRNSEP------------------------------------------------------------------------ 151 (714) T ss_pred EeeeccCC------------------------------------------------------------------------ Confidence 87776210 Q ss_pred ceeeccceeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhc-------------- Q lcl|NC_013692. 234 ETVENHPTVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLS-------------- 298 (726) Q Consensus 234 ~~~~~~p~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~-------------- 298 (726) ....++|++|||++|||||+++ .|++||+|+||++|||+++++++ |++..+.+......... T Consensus 152 --~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~ 228 (714) T protein:vir:10 152 --FGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred --CCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHh-cCCchhhhhccchhhcCcccchhhhhhcccc Confidence 0122467899999999999664 59999999999999999999998 55544443321111000 Q ss_pred -ccchhhhhccccccccCCcCCceEEEEEEEEEeec--------CC-------------------------CceEEEEEE Q lcl|NC_013692. 299 -EPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI--------HG-------------------------DGVLHPIVA 344 (726) Q Consensus 299 -~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~--------~~-------------------------~g~~~~~~~ 344 (726) .............+.|.+..+++|+|+|||.|... +| ....+.+++ T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:10 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred cccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEE Confidence 00111112233445566677889999999977321 11 123466789 Q ss_pred EEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh Q lcl|NC_013692. 345 TWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN 424 (726) Q Consensus 345 ~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d 424 (726) +|+|.++|+.+++||||++|||+|+++++....+.++|+++.++|+|+.+|++.|+++++|+ ..++++++|++++++ T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~---~~~~~~~~gav~~~d 385 (714) T protein:vir:10 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ---AKRVIMDEDATQLSD 385 (714) T ss_pred EEecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHHh---CCceeeccccccccH Confidence 99999999999999999999999999999988888999999999999999999999988763 336788999998765 Q ss_pred hh----hhcCCceEeecCcc----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHH Q lcl|NC_013692. 425 RR----RFDRGENYEFNPGA----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRG 496 (726) Q Consensus 425 ~~----~~~~g~vi~~~~~~----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~ 496 (726) .. ..+||+++.++++. .+...|.+.+.+++|+.++.++++....++++|||+++++|..+++ .++.+|++ T Consensus 386 ~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na--~SGvAI~~ 463 (714) T protein:vir:10 386 NDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGA--TSGVAISN 463 (714) T ss_pred HHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcch--hHHHHHHH Confidence 42 25899999998753 3445678888899999999999999999999999999999998876 57777999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc-------ceecch---------hhcccccceee Q lcl|NC_013692. 497 ALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH-------FVDIRR---------DDLAGNFDLKL 560 (726) Q Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~-------~v~v~~---------~~~~~~~dv~i 560 (726) ++++|.+.+..+++||..+++.+|+++|+||++||++++++||++++ ++.+|. +...++|||.+ T Consensus 464 r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i 543 (714) T protein:vir:10 464 LVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEE Confidence 99999999999999999999999999999999999999999999752 455553 33457899999 Q ss_pred ecccchHHHH--HHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHH-HHHHHHHHHHH Q lcl|NC_013692. 561 DISTAEEDNA--KVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKA-QLELMLLQAQI 637 (726) Q Consensus 561 ~~~~~~~~~~--~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~-q~e~q~~qaq~ 637 (726) ..++.+..+. ....+.++++.+.+..+. ....++.+++++++..++.+++++....+.......+ +.+.+..+.++ T Consensus 544 ~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~-~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~ 622 (714) T protein:vir:10 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQA-VVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQAL 622 (714) T ss_pred eeccCcHHHHHHHHHHHHHHHhhcCchhhh-hHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHH Confidence 9888766543 334455555544332222 2223334555666666777777766544332211110 00111111111 Q ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 638 EAERARAA--HYMSGAGLQDSKVGTEQAKARALASQADM--TDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLK 713 (726) Q Consensus 638 e~~~aq~q--~~~~~~~~~~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~ 713 (726) +.++++.+ +.++..+..++.+...++++.+...++.+ ...+.+.......++...++.+.....+++...+ T Consensus 623 ~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~----- 697 (714) T protein:vir:10 623 QQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVL----- 697 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHH----- Confidence 11111111 11111111111111111111111111111 0000000000000000000000000111111111 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_013692. 714 RLDEATSARTSQK 726 (726) Q Consensus 714 ~~~~~~~a~~~~q 726 (726) +++..+..++| T Consensus 698 --~q~~~q~~~~~ 708 (714) T protein:vir:10 698 --QQQMLYTLQQR 708 (714) T ss_pred --HHHHHHHHHHH Confidence 11111111111 No 12 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=4.5e-83 Score=472.03 Aligned_cols=613 Identities=15% Similarity=0.122 Sum_probs=391.1 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKS 75 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs 75 (726) |--...|- .+.+.|+.-.+.+....+...|......+...-..-.+.++||+|+= -+ .-...|+. T Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~Q-W~~~~~~~l~~~g~p 70 (772) T protein:vir:10 1 MQITENDR---------QYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQ-LDTELLRRQQALGIP 70 (772) T ss_pred CCcchhhH---------HhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCC Confidence 22211121 12222333334444444444444333333333445566889998642 12 12336999 Q ss_pred cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCC-cchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEE Q lcl|NC_013692. 76 AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVT-WEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVK 154 (726) Q Consensus 76 ~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~-~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k 154 (726) -++-..|+-+|+|++-.-. .+-.=+.|.|.+ .+|.+.|+..|.+++|+.. .++.-....++|.++|++|.|.+. T Consensus 71 ~~~~N~i~~~v~~v~g~~~----~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~Gw~e 145 (772) T protein:vir:10 71 PAVEDLIGPALLSLQGYEA----VTRTDWRVTPNGDVGGQEVADALNYRLNTAER-QSGADRACSEAFRPQIACGIGWVE 145 (772) T ss_pred cEEEcchHHHHHHHHHHHH----hcCcceEEecCCCchHHHHHHHHHHHHHHHHH-hcChHHHHHHHHHHhhhcCceeEE Confidence 9999999999999997663 344457999975 6899999999999999874 666666677999999999999776 Q ss_pred EeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccc Q lcl|NC_013692. 155 VGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREE 234 (726) Q Consensus 155 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 234 (726) ++++. ++ T Consensus 146 ~~~~~-----------------------------------d~-------------------------------------- 152 (772) T protein:vir:10 146 VSRES-----------------------------------DP-------------------------------------- 152 (772) T ss_pred ecccc-----------------------------------CC-------------------------------------- Confidence 54430 00 Q ss_pred eeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCccc-------------------ch Q lcl|NC_013692. 235 TVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEG-------------------QN 295 (726) Q Consensus 235 ~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~-------------------~~ 295 (726) ....++|++|+|++|||||+|+.|++||+|+|+.+|||+++++++ |++..+.+.... .. T Consensus 153 -~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~-fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (772) T protein:vir:10 153 -FKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALV-FPEHAELIGMVGKYGSTWWGQPDLGMMEGGTST 230 (772) T ss_pred -CCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHh-CCCchhHHHhhhhhcccccCccccccccccccc Confidence 011246789999999999998779999999999999999999988 544332221100 00 Q ss_pred hhcccchhhhhccccccccCCcCCceEEEEEEEEEeec--------CCCc-------------------------eEEEE Q lcl|NC_013692. 296 LLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI--------HGDG-------------------------VLHPI 342 (726) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~--------~~~g-------------------------~~~~~ 342 (726) .....+...+......+.|.+..+++|+|+|||+|..+ +|.+ ..+++ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~ 310 (772) T protein:vir:10 231 GLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVR 310 (772) T ss_pred ccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEE Confidence 01111222334455566777888999999999988531 1221 23567 Q ss_pred EEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccc Q lcl|NC_013692. 343 VATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDV 422 (726) Q Consensus 343 ~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~ 422 (726) +++|+|.++|+.+++||+|++|||||+++++.+.++.++|+++.++|+|+.+|++.|+++++|+.+ ++++++|+|++ T Consensus 311 ~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~---~~~~~~gav~~ 387 (772) T protein:vir:10 311 RSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVA---RVERTKGAVAM 387 (772) T ss_pred EEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhcc---cccccCCCccc Confidence 789999999999999999999999999999999998899999999999999999999999998766 57899999998 Q ss_pred hhh----hhhcCCceEeecCccc--hhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHH Q lcl|NC_013692. 423 TNR----RRFDRGENYEFNPGAD--PRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRG 496 (726) Q Consensus 423 ~d~----~~~~~g~vi~~~~~~~--~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~ 496 (726) ++. ...+|+++|.++++.. +...+.+.+.|.+|..++.|+++....++++|||+++++|..+++ .|+.+|++ T Consensus 388 ~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na--~SGvAi~~ 465 (772) T protein:vir:10 388 TDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTA--TSGIQEQQ 465 (772) T ss_pred hhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcch--hhHHHHHH Confidence 763 3468999999998754 345667778889999999999999999999999999999987765 67778999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc------ceecch--------------hhccccc Q lcl|NC_013692. 497 ALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH------FVDIRR--------------DDLAGNF 556 (726) Q Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~------~v~v~~--------------~~~~~~~ 556 (726) ++++|++.+..+++||..+++.+|+++|+||++||++++++||+|++ ++.+|. +...++| T Consensus 466 rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~y 545 (772) T protein:vir:10 466 QIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRI 545 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeE Confidence 99999999999999999999999999999999999999999999753 344442 2346789 Q ss_pred ceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHH---HHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHH Q lcl|NC_013692. 557 DLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMG---QIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLL 633 (726) Q Consensus 557 dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~---~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~ 633 (726) ||.+..++....+... .+..+++.++. .++.....++. +.+++.+..++.+.+++....+.+.+++.++ .+.. T Consensus 546 Dv~i~~~p~~~t~r~~-~~~~m~ql~~~-~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~--~q~~ 621 (772) T protein:vir:10 546 KVALEDVPSTNSYRGQ-QLNAMSEAVKS-MPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQI--DQAV 621 (772) T ss_pred EEEeeccccchHHHHH-HHHHHHHHHhc-cChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHH--HHHH Confidence 9999998876554332 22222232222 34444444333 3344555557777777665443322211111 0111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 634 QAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLK 713 (726) Q Consensus 634 qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~ 713 (726) +.+++..+++.+. ....++.+...++++++++++.+...+ .+..+++..+.. .++...+ .. ....+. T Consensus 622 qq~~~~~~~el~~-----~q~~a~~~~~~A~a~~~~aqa~~~~~~--a~~~a~~aa~~~--~q~~q~a--~~--ad~~l~ 688 (772) T protein:vir:10 622 QDALAKAGNDIKL-----RELEIKERKADSEISGLNAKAVQIGVQ--AAFSAMQAGAQI--AQMPMIA--PI--ADAVMQ 688 (772) T ss_pred HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhhhhhhH--Hhhhhhh--HH--HHHHHH Confidence 1111111111110 111111111112222222222221111 111111111110 0000000 00 000110 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_013692. 714 RLDEATSARTSQK 726 (726) Q Consensus 714 ~~~~~~~a~~~~q 726 (726) .. -....+.+-. T Consensus 689 ~~-g~~~~~~~~~ 700 (772) T protein:vir:10 689 SA-GYQRPNPAGD 700 (772) T ss_pred hc-cccccccccc Confidence 00 0000000000 No 13 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=4e-77 Score=439.45 Aligned_cols=608 Identities=14% Similarity=0.104 Sum_probs=372.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-----CCCCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-----KTEKGKS 75 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~~~grs 75 (726) |+ -+..++..+...|..+..+..+--....+.++||+|.-- ++ -...||. T Consensus 1 m~------------------------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~~~~~~~l~~q~rp 55 (725) T protein:vir:77 1 MA------------------------DNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQW-DDWLSQYTTLQYRG 55 (725) T ss_pred CC------------------------chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCC-CHHHHHHHHhcCCC Confidence 11 245567888888877776666655566778999986431 21 1223554 Q ss_pred cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEE Q lcl|NC_013692. 76 AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKV 155 (726) Q Consensus 76 ~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~ 155 (726) +=+.|+-+|+|++-+-.+ +-.-+.|.|..++|.+.|+..|.+++|+.. .++....-.++|+++|++|.|.+.+ T Consensus 56 --~~N~i~~~i~~v~g~~~~----nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev 128 (725) T protein:vir:77 56 --QFDVVRPVVRKLVSEMRQ----NPIDVLYRPKDGARPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRL 128 (725) T ss_pred --ccccHHHHHHHHHhhHHh----CCcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchhHHHHHHHHHhhcCcceeee Confidence 447888888888765543 556689999999999999999999999864 6666677779999999999998887 Q ss_pred eeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccce Q lcl|NC_013692. 156 GWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREET 235 (726) Q Consensus 156 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 235 (726) .+++... +.++ ...+ T Consensus 129 ~~d~~~~---------------------------------d~~~--------------------------------~~~~ 143 (725) T protein:vir:77 129 VTDYEDQ---------------------------------SPTS--------------------------------NNQV 143 (725) T ss_pred eecccCC---------------------------------CCCC--------------------------------Ccee Confidence 6663210 0000 0000 Q ss_pred eeccceeeeechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhc--CCCcchhhcCcccchhhcccchhhhhcccccc Q lcl|NC_013692. 236 VENHPTVQVCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKAD--GRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNF 312 (726) Q Consensus 236 ~~~~p~i~~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~--g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (726) +...+ .+.+|.+|||||.++ .|++||+|+|+++|||++++..+ .|+.+...+.. + . ..... T Consensus 144 i~~~~--~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~--------~----~~~~~ 207 (725) T protein:vir:77 144 IRREP--IHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPS--F--------Q----NPNDW 207 (725) T ss_pred eEEee--cccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhccc--c--------c----ccccc Confidence 00000 134688999999765 49999999999999999976543 22222211110 0 0 01111 Q ss_pred ccCCcCCceEEEEEEEEEeec--------C----------------------CCceE----------EEEEEEEECCEEE Q lcl|NC_013692. 313 DFQDKSRKRLVVHEYWGYYDI--------H----------------------GDGVL----------HPIVATWVGAVMI 352 (726) Q Consensus 313 ~~~~~~~~~v~v~E~w~~~~~--------~----------------------~~g~~----------~~~~~~~~g~~~l 352 (726) .....+.++|+|+|||.|..+ + ..|.. +.+.+++.|.++| T Consensus 208 ~~~~~~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l 287 (725) T protein:vir:77 208 VFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL 287 (725) T ss_pred cccccCCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceee Confidence 112234578999999987532 1 11211 2223345666666 Q ss_pred EeccCCCCCCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcC Q lcl|NC_013692. 353 RMEENPFPDKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDR 430 (726) Q Consensus 353 ~~~~~P~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~ 430 (726) . +++||+|++|||||+++++. ++..+++|+++.++|+|+.+|+++|+++++++++++.++++..|+++..+.....+ T Consensus 288 ~-~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 366 (725) T protein:vir:77 288 K-DKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGN 366 (725) T ss_pred c-cCCcCCCCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhc Confidence 4 68899999999999999965 56677779999999999999999999999999999999999999998776665555 Q ss_pred Cce-------EeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHH Q lcl|NC_013692. 431 GEN-------YEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASK 503 (726) Q Consensus 431 g~v-------i~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~ 503 (726) +++ +..++|..+.+++...+.|++|++++.|+++....++++|||++.++|..+++ .++.++.+++++|.+ T Consensus 367 ~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~--~SG~ai~~rq~qg~~ 444 (725) T protein:vir:77 367 DDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQ--VAFDTVNQLNMRADL 444 (725) T ss_pred cCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchh--hHHHHHHHHHHHHHH Confidence 544 45567777777888888999999999999999999999999999999988875 678889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc----ceecch-------------hhcccccceeeecccch Q lcl|NC_013692. 504 RELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH----FVDIRR-------------DDLAGNFDLKLDISTAE 566 (726) Q Consensus 504 ~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~----~v~v~~-------------~~~~~~~dv~i~~~~~~ 566 (726) .+..+++||..+++.+|+++|+||++||++++++||+|++ ++.+|. .++.++|||.|..++++ T Consensus 445 ~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~ 524 (725) T protein:vir:77 445 ETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSF 524 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccch Confidence 9999999999999999999999999999999999999873 666663 24457899999998765 Q ss_pred HHH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHH---hhhhhhhhhhHHHHHhhhhhhh-----hhHHH---HHHHHH Q lcl|NC_013692. 567 EDN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIME---LKKMPDFAKRIREFQPQPDPIA-----QQKAQ---LELMLL 633 (726) Q Consensus 567 ~~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~---~~~~~e~~~~l~~~~~~~~~~~-----qq~~q---~e~q~~ 633 (726) ... +....+.++++.+++.. +... .++....+ ..++.++.+.++.........+ +++.. .+.+.. T Consensus 525 ~s~r~~~~~~l~qll~~~~~~~-~~~~-~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~ 602 (725) T protein:vir:77 525 QSMKQQNRAEILELLGKTPQGT-PEYQ-LLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQG 602 (725) T ss_pred HHHHHHHHHHHHHHHHhccccc-hhHH-HHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHH Confidence 543 44455556655554322 2222 22222233 3344455554444322211100 00000 000111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH---------H---HHHHHHHHHHHHHHH---H Q lcl|NC_013692. 634 QAQIEAERARAAHYMSGAGLQDSKVGTEQAKAR--ALASQADMTDLNF---------L---EQESGVQQARKRELQ---Q 696 (726) Q Consensus 634 qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~--q~~~q~~~~~~e~---------~---~qe~~~~~~~e~e~~---~ 696 (726) +++.+..++++..+..++..+.++.+.+.++.. +.+.++..+.++. . .++++++.....+.+ . T Consensus 603 q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~ 682 (725) T protein:vir:77 603 QQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSED 682 (725) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111111111111111111111111100 0011100000000 0 000111111111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 697 AQSEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 697 ~q~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) .++.++..++......++..+-.++-++++ T Consensus 683 ~~~~ae~~~~~~~~~~~q~~~~~~~~~~~~ 712 (725) T protein:vir:77 683 ARANAELLLKGDEQTHKQRMDIANILQSQR 712 (725) T ss_pred HHHHhHHHHHhhhHHHhhHHHHHHHHHHHH Confidence 111122222222222222222222222222 No 14 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=3.5e-76 Score=434.23 Aligned_cols=606 Identities=14% Similarity=0.112 Sum_probs=370.0 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-----CCCCCCCcCCCHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-----KTEKGKSAVQPPTIRKQ 85 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~~~grs~~v~~~v~~~ 85 (726) |+ -++..+..+...|..+..+..+.-....+.++||+|.-- ++ -...||. +=+.|+-+ T Consensus 1 m~--------------d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~~~~~~~l~~q~rp--~~N~i~~~ 63 (725) T protein:vir:92 1 MA--------------DNENRLESILSRFDADWTASDEARREAKNDLFFSRISQW-DDWLSQYTTLQYRG--QFDVVRPV 63 (725) T ss_pred CC--------------chHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC-CHHHHHHHHhcCCC--cccchHHH Confidence 11 134567888888877776666655666788999986431 21 1123554 44788888 Q ss_pred HHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEE Q lcl|NC_013692. 86 AEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVK 165 (726) Q Consensus 86 v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~ 165 (726) |+|++-+-.+ +-.-+.|.|..++|.+.|+..|.+++|+.. .++....-.++|+++|++|.|.+.+.+++... T Consensus 64 i~~v~g~e~~----nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~--- 135 (725) T protein:vir:92 64 VRKLVSEMRQ----NPIDVLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNVAVREQIESGVGAWRLVTDYEDQ--- 135 (725) T ss_pred HHHHHhhHHh----CCcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchHHHHHHHHHHhhcCcceeeeeecccCC--- Confidence 8887755433 556689999999999999999999999864 67776777799999999999988776653210 Q ss_pred ecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeee- Q lcl|NC_013692. 166 EQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQV- 244 (726) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~- 244 (726) ++. . .... +++.. T Consensus 136 -----------------------------d~~-~--------------------------------~~~~----i~~~~i 149 (725) T protein:vir:92 136 -----------------------------SPT-S--------------------------------NNQV----IRREPI 149 (725) T ss_pred -----------------------------CCC-C--------------------------------Ccee----eEEeec Confidence 000 0 0000 11111 Q ss_pred -echhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhc--CCCcchhhcCcccchhhcccchhhhhccccccccCCcCCc Q lcl|NC_013692. 245 -CDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKAD--GRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRK 320 (726) Q Consensus 245 -v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~--g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (726) -|+.+|||||.++ .|++||+|+|+++||+++++..+ .|+.+...+.. + . ....+.....+++ T Consensus 150 ~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~-----~-------~~~~~~~~~~~~d 215 (725) T protein:vir:92 150 HSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPS--F-----Q-------NPNDWVFPWLTQD 215 (725) T ss_pred cCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhh--c-----c-------cCCcccccccCCC Confidence 2466799999765 49999999999999999876543 23322211110 0 0 0111111223467 Q ss_pred eEEEEEEEEEeecC-----------C-------------------Cce--------E--EEEEEEEECCEEEEeccCCCC Q lcl|NC_013692. 321 RLVVHEYWGYYDIH-----------G-------------------DGV--------L--HPIVATWVGAVMIRMEENPFP 360 (726) Q Consensus 321 ~v~v~E~w~~~~~~-----------~-------------------~g~--------~--~~~~~~~~g~~~l~~~~~P~~ 360 (726) +|+|+|||.+..+. | .|. . +.+..+++|.++|+ +++||+ T Consensus 216 ~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~-~~~~~~ 294 (725) T protein:vir:92 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLK-DKQLIA 294 (725) T ss_pred eEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhc-CCCCCC Confidence 89999999875321 1 111 1 22233356777665 478999 Q ss_pred CCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceE---- Q lcl|NC_013692. 361 DKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENY---- 434 (726) Q Consensus 361 ~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi---- 434 (726) |++|||+|+++++. ++..+++|+++.++|+|+.+|+++|+++++++++++.+++++.|+++........++.+. T Consensus 295 ~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) T protein:vir:92 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) T ss_pred CCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeec Confidence 99999999999876 566777799999999999999999999999999999999999999977655555555442 Q ss_pred ---eecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 435 ---EFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRR 511 (726) Q Consensus 435 ---~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~ 511 (726) ..++|..+..+|.+.+.|++|++++.|+++..+.++++|||++.++|..+++ .++.++.+++++|.+.+..+++| T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~--~SG~ai~~rq~qg~~~l~~~~Dn 452 (725) T protein:vir:92 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ--VAYDTVNQLNMRADLETYVFQDN 452 (725) T ss_pred cccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchh--hHHHHHHHHHHHHHHHHHHHHHH Confidence 2356666677788888999999999999999999999999999999988765 67888999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecch-------------hhcccccceeeecccchHHH--HHH Q lcl|NC_013692. 512 LSAGIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIRR-------------DDLAGNFDLKLDISTAEEDN--AKV 572 (726) Q Consensus 512 ~~~~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~~-------------~~~~~~~dv~i~~~~~~~~~--~~~ 572 (726) |+.+++.+|+++|+||++||++++++||+|+ .++.+|. .++.++||+.|..++++..+ +.. T Consensus 453 l~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~ 532 (725) T protein:vir:92 453 LATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHH Confidence 9999999999999999999999999999986 3666654 24567999999998765543 334 Q ss_pred HHHHHHHHHhhhccchhHHHHHHHHHH---HhhhhhhhhhhHHHHHhhhh-----hhhhhHHHHHHH---HHHHHHHHHH Q lcl|NC_013692. 573 NDLTFMLQTMGPNMDPMMAQQIMGQIM---ELKKMPDFAKRIREFQPQPD-----PIAQQKAQLELM---LLQAQIEAER 641 (726) Q Consensus 573 ~~l~~l~q~~~~~~~~~~~~~~~~~~~---~~~~~~e~~~~l~~~~~~~~-----~~~qq~~q~e~q---~~qaq~e~~~ 641 (726) ..+.++++.+.+. .+... .++.... +..+..++.+.++....... ..+++.+..+.+ ..+.+.+..+ T Consensus 533 ~~l~ql~~~~~~~-~~~~~-~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~ 610 (725) T protein:vir:92 533 AEILELLGKTPQG-TPEYQ-LLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQ 610 (725) T ss_pred HHHHHHHHhcccc-hhHHH-HHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHH Confidence 4455555544332 22221 1222223 33344555555544322211 111111111110 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH---HHHH---------HHHHHHHHHHHH---HHHHHHHHH Q lcl|NC_013692. 642 ARAAHYMSGAGLQDSKVGTEQAK--ARALASQADMTDLNF---LEQE---------SGVQQARKRELQ---QAQSEAQGK 704 (726) Q Consensus 642 aq~q~~~~~~~~~~~~~~~eqaq--~~q~~~q~~~~~~e~---~~qe---------~~~~~~~e~e~~---~~q~~~q~~ 704 (726) +++..+..++.++.++++.++++ +.+.+.++....++. ..+. ...+...+.+.+ ..+..++.. T Consensus 611 ~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~ 690 (725) T protein:vir:92 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELL 690 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHH Confidence 11111111111111111111111 111111111111110 0000 001111110000 011111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 705 LAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 705 ~~~l~~~~~~~~~~~~a~~~~q 726 (726) ++......++..+..++-++++ T Consensus 691 l~~~~~~~~~~~d~~~~~~~~~ 712 (725) T protein:vir:92 691 LKGNEQTHKQRMDIANILQSQR 712 (725) T ss_pred HHHHHHHHHHHHHHHHHhcchh Confidence 2222222233333333333333 No 15 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=7.8e-76 Score=432.36 Aligned_cols=606 Identities=15% Similarity=0.109 Sum_probs=374.2 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-----CCCCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-----KTEKGKS 75 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~~~grs 75 (726) |+| +...+..+...|..+.....+--..-.+.++||+|.= -++ -...||. T Consensus 1 m~d------------------------~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q-W~~~~~~~l~~q~rp 55 (725) T protein:vir:10 1 MAD------------------------NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQ-WDDWLSQYTTLQYRG 55 (725) T ss_pred CCc------------------------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC-CCHHHHHHHHhcCCC Confidence 111 3445777777777666555544445567899998532 111 1123554 Q ss_pred cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEE Q lcl|NC_013692. 76 AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKV 155 (726) Q Consensus 76 ~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~ 155 (726) +=+.|+-+|+|++-+-.+ +-.=+.|.|..++|.+.|+..|.+++|+. ..++.-..-.++|.++|++|.|.+.+ T Consensus 56 --~~N~i~~~v~~v~g~e~~----nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~~~i~~G~G~~ev 128 (725) T protein:vir:10 56 --QFDVVRPVVRKLVSEMRQ----NPIDVLYRPKDGASPDAADVLMGMYRTDM-RHNTAKIAVNIAVREQIEAGVGAWRL 128 (725) T ss_pred --cccchHHHHHHHHhhHHh----CCcceEEecCCcchHHHHHHHHHHHHHHH-HhcCcchHHhHHHHHHhhcCcceeee Confidence 448889999998865543 33447999999999999999999999985 45555556669999999999999888 Q ss_pred eeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccce Q lcl|NC_013692. 156 GWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREET 235 (726) Q Consensus 156 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 235 (726) .|++... +.+. ...+ T Consensus 129 ~~d~~~~---------------------------------d~~~--------------------------------~~~~ 143 (725) T protein:vir:10 129 VTDYEDQ---------------------------------SPTS--------------------------------NNQV 143 (725) T ss_pred eccccCC---------------------------------CCCC--------------------------------Ccee Confidence 7663210 0000 0000 Q ss_pred eeccceee--eechhheeeCCCCC-CchhhCCeEEEEEeccHHHHHhc--CCCcchhhcCcccchhhcccchhhhhcccc Q lcl|NC_013692. 236 VENHPTVQ--VCDYNNIVIDPSCG-SDFSKAKFLIETFESSYAELKAD--GRYQNLDKIQVEGQNLLSEPDYTGPSEGVR 310 (726) Q Consensus 236 ~~~~p~i~--~v~p~~~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~--g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 310 (726) +++. +.||.+|||||.++ .|++||+|+|+.+||+++.+... .|+.+...+.. + .... T Consensus 144 ----i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~------------~~~~ 205 (725) T protein:vir:10 144 ----IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPS--F------------QNPN 205 (725) T ss_pred ----eeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCccccccc--c------------cccc Confidence 1111 34688899999664 59999999999999998654321 23333221110 0 0011 Q ss_pred ccccCCcCCceEEEEEEEEEeecC-----------C-------------------Cce--------E--EEEEEEEECCE Q lcl|NC_013692. 311 NFDFQDKSRKRLVVHEYWGYYDIH-----------G-------------------DGV--------L--HPIVATWVGAV 350 (726) Q Consensus 311 ~~~~~~~~~~~v~v~E~w~~~~~~-----------~-------------------~g~--------~--~~~~~~~~g~~ 350 (726) ...+.....++|+|+|||.+.++. | .|. . +.+..+++|.+ T Consensus 206 ~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~ 285 (725) T protein:vir:10 206 DWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA 285 (725) T ss_pred cccccccCCCeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchh Confidence 112222346789999999886421 1 111 1 22233456777 Q ss_pred EEEeccCCCCCCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhh Q lcl|NC_013692. 351 MIRMEENPFPDKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRF 428 (726) Q Consensus 351 ~l~~~~~P~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~ 428 (726) +|+ +++||+|++|||+|+++++. ++..+++|+++.++|+|+.+|+++|+++++++++++.+++++.++++....... T Consensus 286 ~l~-~~~~~~~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~ 364 (725) T protein:vir:10 286 VLK-DKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD 364 (725) T ss_pred hhc-CCCCCCCCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHh Confidence 774 58899999999999999876 566777799999999999999999999999999999999999999987666656 Q ss_pred cCCceEee-------cCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHH Q lcl|NC_013692. 429 DRGENYEF-------NPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAA 501 (726) Q Consensus 429 ~~g~vi~~-------~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~ 501 (726) +++.+..+ ++|..+..++.+.+.|++|++++.|+++..+.++++|||++.++|..+++ .++.+|.+++++| T Consensus 365 ~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~--~SG~ai~~rq~qg 442 (725) T protein:vir:10 365 GNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ--VAYDTVNQLNMRA 442 (725) T ss_pred ccCCceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchh--hHHHHHHHHHHHH Confidence 66655333 55666667788888999999999999999999999999999999988765 6778899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc----ceecch-------------hhcccccceeeeccc Q lcl|NC_013692. 502 SKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH----FVDIRR-------------DDLAGNFDLKLDIST 564 (726) Q Consensus 502 ~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~----~v~v~~-------------~~~~~~~dv~i~~~~ 564 (726) .+.+..+++||+.+++.+|+++|+||++||++++++||+|++ ++.+|. .++.++||+.|..++ T Consensus 443 ~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p 522 (725) T protein:vir:10 443 DLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP 522 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeecc Confidence 999999999999999999999999999999999999999873 666664 245679999999987 Q ss_pred chHHH--HHHHHHHHHHHHhhhccchhHHHHHHHHHHH---hhhhhhhhhhHHHHHhhhhh-----hhhhHHHH---HHH Q lcl|NC_013692. 565 AEEDN--AKVNDLTFMLQTMGPNMDPMMAQQIMGQIME---LKKMPDFAKRIREFQPQPDP-----IAQQKAQL---ELM 631 (726) Q Consensus 565 ~~~~~--~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~---~~~~~e~~~~l~~~~~~~~~-----~~qq~~q~---e~q 631 (726) ++..+ +....+.++++.+++.. +.. ..++..+.+ ..+..++.+.++........ .+++++.. +.+ T Consensus 523 ~~~s~r~~~~~~l~qll~~~~~~~-~~~-~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~ 600 (725) T protein:vir:10 523 SFQSMKQQNRSEILELLGKTPQGT-PEY-QLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAK 600 (725) T ss_pred CcHHHHHHHHHHHHHHHHhccccc-hhH-HHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHH Confidence 65543 34445555655554322 222 122222333 34455555555543322110 11100000 011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH------------HHHHHHHHHHHHH---HH Q lcl|NC_013692. 632 LLQAQIEAERARAAHYMSGAGLQDSKVGTEQAK--ARALASQADMTDLNF------------LEQESGVQQARKR---EL 694 (726) Q Consensus 632 ~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq--~~q~~~q~~~~~~e~------------~~qe~~~~~~~e~---e~ 694 (726) +.+++.+..+++++.++.++..+.++++.+.++ +.+.+.++....++. ..++++++..... .. T Consensus 601 ~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~ 680 (725) T protein:vir:10 601 QGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRS 680 (725) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 111111111111111111111111111111111 100011110000000 0111111111111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 695 QQAQSEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 695 ~~~q~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ...+..++..++.+..+.++..+..++-++++ T Consensus 681 ~~~~~~ae~~~~~~~~~~~~~~~~~~~~~~q~ 712 (725) T protein:vir:10 681 EDARANAELLLKGNEQTHKQRMDIANILQSQR 712 (725) T ss_pred HHHHHhhHHHHHHHHHHHHHHhhhhhcccccc Confidence 12223333344444444445445555554444 No 16 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=6e-74 Score=422.01 Aligned_cols=611 Identities=15% Similarity=0.123 Sum_probs=363.0 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC----------CCCCCCcCCCHHHHHHHHHHHHH Q lcl|NC_013692. 23 QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPK----------TEKGKSAVQPPTIRKQAEWRYSS 92 (726) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~----------~~~grs~~v~~~v~~~v~~~~~~ 92 (726) .++ ++...+..+...|+.+..+....-..-....+||+++|+.=++ ...||..+|-+.|+-+|+|++.+ T Consensus 1 m~e-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~ 79 (706) T protein:vir:10 1 MAE-SRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISE 79 (706) T ss_pred CCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhH Confidence 222 5566888899999888877765555555567888877743222 23489999999999999999988 Q ss_pred HHHhhcCCCceEEEecCC-cchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccccc Q lcl|NC_013692. 93 LSEPFLSSPNIFEVNPVT-WEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTY 171 (726) Q Consensus 93 L~~~f~~~~~~~~~~p~~-~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~ 171 (726) ..+.= .=+.|.|.. .+|.+.|+..|.+++|+. ..++.-....++|+++|++|.|.+++.-+++.. T Consensus 80 ~~~nr----~~~~v~P~~~~~d~~~Ae~l~~l~~~~~-~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~--------- 145 (706) T protein:vir:10 80 YRNNR----ISVKFRPGDNAASEELANKLNGLFRADY-EETDGGEACDNAFDDAATGGFGCFRLTTSFVNE--------- 145 (706) T ss_pred HHhCC----CceEEecCCCCchHHHHHHHHHHHHHHH-HhcCchHHHHHHHHHHhhcCcceEEeeeccccc--------- Confidence 85544 338999975 568999999999999985 577777788899999999999977654332110 Q ss_pred ccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeee-ch-hh Q lcl|NC_013692. 172 EMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVC-DY-NN 249 (726) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v-~p-~~ 249 (726) .++. + ...++.|++| +| .+ T Consensus 146 ----------------------~d~~---------------------~----------------~~~~i~i~~v~~p~~~ 166 (706) T protein:vir:10 146 ----------------------YDPM---------------------D----------------ERQRIAVEPIYDPARS 166 (706) T ss_pred ----------------------cCCC---------------------C----------------CCccceeeeeccchhc Confidence 0000 0 0112233444 45 48 Q ss_pred eeeCCCCC-CchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 250 IVIDPSCG-SDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 250 ~~~dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) |||||.++ .|++||+|+|+++|||+++++++ |++..+.+...... ....++... .+.....|.. ++.+.++.|| T Consensus 167 v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-fp~~~~~~~~~~~~-~~~~d~~~~-d~~~~~eyy~--~~~~~~~~~~ 241 (706) T protein:vir:10 167 VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSE-YDKAPTSLDRVGSV-SWQYDWFTP-DVVYIAKYYE--VRKESVDVIS 241 (706) T ss_pred eecCchhcccChhhcceEeeeecCCHHHHHHh-cCCChhhhhhhccc-cccccccCC-Ccceeccccc--ccceeEEEEE Confidence 99999664 59999999999999999999998 55443333211110 000011110 1111111111 2233344455 Q ss_pred EEeecCC-------------------Cce----------EEEEEEEEECCEEEEeccCCCCCCccceEEeeeeee--cCc Q lcl|NC_013692. 329 GYYDIHG-------------------DGV----------LHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPR--KRD 377 (726) Q Consensus 329 ~~~~~~~-------------------~g~----------~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~--~~~ 377 (726) ++.+..+ .|+ .+.+..++.|.++| .+++||+|++|||+|+++++. +++ T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l-~~~~p~~~~~~P~vP~~g~r~~~d~~ 320 (706) T protein:vir:10 242 YRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDV 320 (706) T ss_pred eeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecccccc-ccCCCCCCCccceEEEeecccccccc Confidence 5533221 122 12234456777777 579999999999999999876 778 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCC----ceEeec-----Cccc--hhhhc Q lcl|NC_013692. 378 LYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRG----ENYEFN-----PGAD--PRAAV 446 (726) Q Consensus 378 ~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g----~vi~~~-----~~~~--~~~~i 446 (726) ..++|+++.++|+|+.+|+++|+++++++++.+...++..+.++........+. ..+.++ +|.. +...+ T Consensus 321 ~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~ 400 (706) T protein:vir:10 321 ERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVA 400 (706) T ss_pred CcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhcccccCCCCccccccccc Confidence 888999999999999999999999999988866555444333322111111111 111121 1211 12233 Q ss_pred ccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 447 HMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAM 526 (726) Q Consensus 447 ~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~l 526 (726) ..+..|.+|+++..|+++....++++|||+++++|..++ .++.+|++++++|.+.+..+++||..+++.+|+++|+| T Consensus 401 ~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn---~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~l 477 (706) T protein:vir:10 401 GYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN---VARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSM 477 (706) T ss_pred ccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455677899999999999999999999999999997654 47888999999999999999999999999999999999 Q ss_pred HHHhcCcCeEEEEecc----cceecch--------------hhcccccceeeecccchHH--HHHHHHHHHHHHHhhhcc Q lcl|NC_013692. 527 NAEFLDDVEVVRITNE----HFVDIRR--------------DDLAGNFDLKLDISTAEED--NAKVNDLTFMLQTMGPNM 586 (726) Q Consensus 527 i~q~~d~e~~iRi~~~----~~v~v~~--------------~~~~~~~dv~i~~~~~~~~--~~~~~~l~~l~q~~~~~~ 586 (726) |++||++++++||+|+ +++.+|. +...++|||.|..++.... .+....+.++++.+.+.. T Consensus 478 i~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~ 557 (706) T protein:vir:10 478 AREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQD 557 (706) T ss_pred HHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcc Confidence 9999999999999986 4565542 3346789999998876544 444456666666655433 Q ss_pred chh-HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHH Q lcl|NC_013692. 587 DPM-MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELML-LQAQIEAERARAAHYMSG--AGLQDSKVGTEQ 662 (726) Q Consensus 587 ~~~-~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~-~qaq~e~~~aq~q~~~~~--~~~~~~~~~~eq 662 (726) +.. ....++-+.+++++..++.++++.....+...++..++.+... .++|+++++++.+....+ ....++.++..+ T Consensus 558 ~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~ 637 (706) T protein:vir:10 558 PMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQ 637 (706) T ss_pred hhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 211 1122333445556666777777665544332222211111110 011111111111111111 111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhcC Q lcl|NC_013692. 663 AKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLK--RLDEATSARTSQK 726 (726) Q Consensus 663 aq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~--~~~~~~~a~~~~q 726 (726) +++.+.+.+..+.+++..+++... .+...++.....++..+ +......+.+++. T Consensus 638 a~~~q~~~~a~~a~~qa~~~~~~~----------~~~~~~a~~~~~~~~~q~~q~l~~~~a~q~~~ 693 (706) T protein:vir:10 638 NETVQTQIKAFTAQQDAMESQANT----------VYKLAQARNIDDKAVMETLRLLKEVAASQQQT 693 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 111111111111111111111100 00001110001111111 1112222222222 No 17 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=3e-73 Score=418.17 Aligned_cols=606 Identities=15% Similarity=0.143 Sum_probs=361.9 Q ss_pred CCC--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCC----------CCCCCcCCCHHHHHHHHHHHHHH Q lcl|NC_013692. 26 WSN--APSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKT----------EKGKSAVQPPTIRKQAEWRYSSL 93 (726) Q Consensus 26 ~~~--~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~----------~~grs~~v~~~v~~~v~~~~~~L 93 (726) +|+ ...|..+..+|+.+..+...--..-....+||+++|+.=++. -.||..++=+.|+-+|+|++-.- T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 432 245677777777666555444443344578888777542221 24888888899999999998655 Q ss_pred HHhhcCCCceEEEecCCcc-hHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccc Q lcl|NC_013692. 94 SEPFLSSPNIFEVNPVTWE-DAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYE 172 (726) Q Consensus 94 ~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~ 172 (726) . .+-+=+.|.|.+++ |.+.|+..|.+++|+.. .++.-....++|.++|++|.|++++.|++.... T Consensus 81 ~----~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~--------- 146 (720) T protein:vir:35 81 R----HNRITVKFRPGDKTASEALANKLNGLFRADYE-ETDGGEACDNAFDDGSTGGFGCFRLTTNLVNAL--------- 146 (720) T ss_pred H----hCCCceEEEcCCCcchHHHHHHHHHHHHHHHH-hcCchHHHhHHHHHhhhccceeEEeeecccccC--------- Confidence 3 23344799999665 99999999999999864 666666677999999999999999988742110 Q ss_pred cCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee--eechhhe Q lcl|NC_013692. 173 MMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ--VCDYNNI 250 (726) Q Consensus 173 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~--~v~p~~~ 250 (726) ++. +. ..++.++ ++|+++| T Consensus 147 ----------------------d~~---------------------~~----------------~~~i~i~~v~~~~~~v 167 (720) T protein:vir:35 147 ----------------------DPM---------------------DE----------------RQRICLEPIYDPARSV 167 (720) T ss_pred ----------------------CCC---------------------cc----------------cceeeEecccCchhhe Confidence 000 00 0111223 3578899 Q ss_pred eeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEE Q lcl|NC_013692. 251 VIDPSCGS-DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWG 329 (726) Q Consensus 251 ~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~ 329 (726) ||||.++. |++||+|+|+.+|||+++++++ |+++.+.+.. ... .+..+.....+.|+|+|||. T Consensus 168 ~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-yp~~a~~~~~-~~~--------------~~~~~d~~~~~~v~i~E~~~ 231 (720) T protein:vir:35 168 WFDPDAKKYDKSDAEWAFCMYSLSAEKYKAE-YNKDPATLMS-GIE--------------RSWDYDWYDVDVVYIAKYYE 231 (720) T ss_pred eecccccccChhhhhhhhhhcCCCHHHHHHh-CCCccccccc-ccc--------------ccccccccCCCceEEEEeeE Confidence 99998764 9999999999999999999988 6555433211 000 01111112356799999987 Q ss_pred EeecC------------------CC------------ce--------EEEEEEEE-ECCEEEEeccCCCCCCccceEEee Q lcl|NC_013692. 330 YYDIH------------------GD------------GV--------LHPIVATW-VGAVMIRMEENPFPDKRIPYVVVN 370 (726) Q Consensus 330 ~~~~~------------------~~------------g~--------~~~~~~~~-~g~~~l~~~~~P~~~~~~Pf~~~~ 370 (726) +..+. ++ |. ..+++.++ .+++++-.+++|+||++|||||++ T Consensus 232 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~ 311 (720) T protein:vir:35 232 VKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVY 311 (720) T ss_pred EEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEE Confidence 64310 00 11 12222222 356666677899999999999999 Q ss_pred eeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh---hhcCCce----Eee----- Q lcl|NC_013692. 371 YIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR---RFDRGEN----YEF----- 436 (726) Q Consensus 371 ~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~---~~~~g~v----i~~----- 436 (726) +++. ++..+++|+++.++|+|+.+|+++|.+++++..+ +.+++.|++++.+.. ...++.+ +.+ T Consensus 312 g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~---~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 388 (720) T protein:vir:35 312 GKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQD---TGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVD 388 (720) T ss_pred eeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcC---CccccccCcchHHHHHHHhhccccccccccccccccc Confidence 9877 5666668999999999999999999999998655 667888888765432 2233332 122 Q ss_pred cCccc--hhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 437 NPGAD--PRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA 514 (726) Q Consensus 437 ~~~~~--~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~ 514 (726) ++|.. +...+...+.|++|+....|++.....++++|||++.++|..+| .++.+|.+++++|.+.+..+++||.. T Consensus 389 ~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn---~SG~Ai~~rq~qg~~~~~~~~Dnl~~ 465 (720) T protein:vir:35 389 KQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN---IAKETVNHLMHRSDMSSFIYLDNMAK 465 (720) T ss_pred cCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc---hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 23355667778999999999999999999999999999997654 57888999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecc--------------hhhcccccceeeecccchHH--HHHHHH Q lcl|NC_013692. 515 GIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIR--------------RDDLAGNFDLKLDISTAEED--NAKVND 574 (726) Q Consensus 515 ~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~--------------~~~~~~~~dv~i~~~~~~~~--~~~~~~ 574 (726) +++.+|+++|+||++||++++++||+|+ .++.++ ++...++|||.+..++.... .+.... T Consensus 466 ~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~ 545 (720) T protein:vir:35 466 SLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSV 545 (720) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHH Confidence 9999999999999999999999999985 344333 34456899999999876543 344444 Q ss_pred HHHHHHHhhhccchh-HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 575 LTFMLQTMGPNMDPM-MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGL 653 (726) Q Consensus 575 l~~l~q~~~~~~~~~-~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~ 653 (726) +.+++..+.+..+.. ....++...+++.+..++.++++...+......+...+.++...+++.+.++++.+.+..++.+ T Consensus 546 m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l 625 (720) T protein:vir:35 546 LTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVL 625 (720) T ss_pred HHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHH Confidence 555555544432211 2222333444555555666666554433322222222222222222222222222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHH Q lcl|NC_013692. 654 QDSKVGTEQAKARALASQADMTDLNFLEQESGVQ-----QARKRELQQAQSEAQ----------GKLAMLNSQLKRLDEA 718 (726) Q Consensus 654 ~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~-----~~~e~e~~~~q~~~q----------~~~~~l~~~~~~~~~~ 718 (726) ++++++..++++++...+++..+.+...+....+ .++....+....++. .+.+...+++.+-+.. T Consensus 626 ~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~ 705 (720) T protein:vir:35 626 MQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATD 705 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccc Confidence 2222222222222111111111111110000000 000000000000000 0011111111111111 Q ss_pred HHHHHhcC Q lcl|NC_013692. 719 TSARTSQK 726 (726) Q Consensus 719 ~~a~~~~q 726 (726) ...+++.- T Consensus 706 ~~~~~~~~ 713 (720) T protein:vir:35 706 TQHKQNRD 713 (720) T ss_pred hhhhhhHH Confidence 11111110 No 18 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=2.2e-74 Score=424.44 Aligned_cols=554 Identities=14% Similarity=0.138 Sum_probs=367.1 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCCCCCCcCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTEKGKSAVQ 78 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~grs~~v 78 (726) |+--+-|.-+|-+. ..+-+-|..++.++..........=.+..+||.++-..+. -....|||++ T Consensus 1 ~~~~~~~~~~~~~~--------------~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~ 66 (584) T protein:vir:95 1 MSVKVAELNSLLVR--------------DSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTT 66 (584) T ss_pred CCcchhhhhhhccc--------------cchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccc Confidence 32222222222211 1122344445554333222221111334666655432111 2234699999 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHH--HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 79 PPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAES--ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 79 ~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~--A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) .+.++..++|++++||++||++++||+|+|.-++|+.+ |+....|+..++ ++.+-+.+++.+|++++++|+||+|++ T Consensus 67 ~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl-~e~~~~~~~~~~i~d~~~~G~~~~k~~ 145 (584) T protein:vir:95 67 LPKLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKC-RESHFRTEVSKLIYDYIDYGNAFATVS 145 (584) T ss_pred hhHHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhh-hhccHHHHHHHHHHhhccCCceEEEEe Confidence 99999999999999999999999999999999999888 776777766655 466788899999999999999999999 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) |+.++.+..+. . .++. T Consensus 146 ~~~~~~e~~e~-----------------------------~-----------------------------------~v~~ 161 (584) T protein:vir:95 146 FEAKYKEMTDG-----------------------------T-----------------------------------LVPD 161 (584) T ss_pred Eeecceeeecc-----------------------------c-----------------------------------cccc Confidence 98554322110 0 0011 Q ss_pred eccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcC----C-CcchhhcCcccch-----hhcccchhhhh Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADG----R-YQNLDKIQVEGQN-----LLSEPDYTGPS 306 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g----~-~~~~d~~~~~~~~-----~~~~~~~~~~~ 306 (726) ..+|++++|+|++|||||+| ++++|+.||+ +..+|+++|..+. + +-+.|.+...... .....+.+. . T Consensus 162 ~~~prieriSP~d~~~Dpsa-~~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~-~ 238 (584) T protein:vir:95 162 YIGPRLVRISPLDIVFNPLA-TSISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDK-A 238 (584) T ss_pred cccceEEeeChhheeecCCC-CCccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccccc-c Confidence 34688999999999999999 4699999999 5668999997662 2 2233333221111 001111111 1 Q ss_pred cccc----ccccCCcCCceEEEEEEEEEe-ecC-CCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccC Q lcl|NC_013692. 307 EGVR----NFDFQDKSRKRLVVHEYWGYY-DIH-GDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYG 380 (726) Q Consensus 307 ~~~~----~~~~~~~~~~~v~v~E~w~~~-~~~-~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g 380 (726) +... .+.+.......|+|+|+|+.+ +.. +++.....++++.|+++|+.+.||||++++||++++++|+++++|| T Consensus 239 ~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG 318 (584) T protein:vir:95 239 AGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWA 318 (584) T ss_pred cccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccC Confidence 1111 223444456789999999854 433 3344445566778999999999999999999999999999999999 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc-chhHHH Q lcl|NC_013692. 381 ESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE-IPQSAQ 459 (726) Q Consensus 381 ~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~-~~~~~~ 459 (726) +|+.+.+.|+|+++|+++|+++||++++++|.+ +..++. ++..+.||+++.....+ ++.+.+.|. .-...+ T Consensus 319 ~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~---k~~~~~-~~~~~~pg~~~~~~~~~----~~q~~~p~a~~~~s~~ 390 (584) T protein:vir:95 319 MGPLDNLVGMQYRIDHLENAKADAVDLIIQPPL---KIIGEV-EEFVWGPGAEIHLDQGG----DVQEIAKNVNYIINAD 390 (584) T ss_pred CCchhhhhhHHHHHhHHHHHHHHHHHHhcCcce---eecccc-chhcccCCceeecCCCC----CcceecCchhhhhHHH Confidence 999999999999999999999999999999833 333332 33457899998876543 233433331 112345 Q ss_pred HHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCcCeEEE Q lcl|NC_013692. 460 YMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGI-IEIGRKIIAMNAEFLDDVEVVR 538 (726) Q Consensus 460 ~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~-~~l~~~il~li~q~~d~e~~iR 538 (726) ..+++++..++++|||+.+++|... +.+.||+++++++++++..++.+++.|..++ ++++..++++...+++...++| T Consensus 391 ~~lq~~e~~me~~sGvp~~~~G~~~-~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr 469 (584) T protein:vir:95 391 NQIQMLEDRMELYAGAPREAMGIRT-PGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIR 469 (584) T ss_pred HHHHHHHHHHHhhhCCChhhccccc-chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcee Confidence 6689999999999999999999874 4478999999999999999999999999876 8888899999889999999999 Q ss_pred Eeccc-----ceecchhhcccccceeeecccchHHHHH-HHHHHHHHH-HhhhccchhHHHH-HHHHHHHhhhhhhhhhh Q lcl|NC_013692. 539 ITNEH-----FVDIRRDDLAGNFDLKLDISTAEEDNAK-VNDLTFMLQ-TMGPNMDPMMAQQ-IMGQIMELKKMPDFAKR 610 (726) Q Consensus 539 i~~~~-----~v~v~~~~~~~~~dv~i~~~~~~~~~~~-~~~l~~l~q-~~~~~~~~~~~~~-~~~~~~~~~~~~e~~~~ 610 (726) ++|++ |++|.+++++++|++....+.+...+.+ .+.+..+++ .+++.+.+..... +...+.++.+++...-. T Consensus 470 ~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~ 549 (584) T protein:vir:95 470 VMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIF 549 (584) T ss_pred eeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCccccc Confidence 99875 8999999999999999888876666533 445554444 4555554444433 33345566665532211 Q ss_pred HHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 611 IREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAG 652 (726) Q Consensus 611 l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~ 652 (726) . .+...+++++.++...++| +..+++++.....+- T Consensus 550 ~------~~~~~~~Q~~~q~~~~~~q-~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 550 R------PNVAVAEQAETQSLVAQAQ-EDLQLQAQMPAEGAI 584 (584) T ss_pred C------CCcccchhHHHHhhhHHHH-HHHHHHHhhhhccCC Confidence 1 0001111111111111111 000000000000000 No 19 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=2.3e-71 Score=407.81 Aligned_cols=608 Identities=14% Similarity=0.105 Sum_probs=358.8 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhccCCCCCC---------C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRW--LDYMHVRGEGKP---------K 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~--~~~y~~~~~~~~---------~ 69 (726) ||+ .....+..+...|+.+..+....-...... .+||.|.= -++ . T Consensus 1 ma~-----------------------~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Q-w~~~~~~~l~~~~ 56 (708) T protein:vir:17 1 MAE-----------------------TLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQ-WEGATAAGTKLDE 56 (708) T ss_pred Cch-----------------------hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCC-CCHHHHHHHHhhh Confidence 221 112456777778877776666555543233 46676421 121 1 Q ss_pred CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCc-chHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 70 TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTW-EDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 70 ~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~-~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) ...||..++=+.|+-+|+|++-.=. .+-.=+.|.|.++ +|.+.|+..|.+++|+.. .++.-....++|++++++ T Consensus 57 q~~~rP~~~~N~i~~~i~~v~g~e~----~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~ 131 (708) T protein:vir:17 57 QFEKYPKFEINKVATELNRIIAEYR----NNRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATG 131 (708) T ss_pred hhcCCCceEEcchHHHHHHHHhhHh----hCCcceEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhc Confidence 2357888999999999999986432 2333479999975 499999999999999864 666666777999999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.|.+++.=+++.+ .++.++. T Consensus 132 G~G~~~~~~d~~~e-------------------------------~d~~~~~---------------------------- 152 (708) T protein:vir:17 132 GFGCFRLTSMLVNE-------------------------------YDPMDDR---------------------------- 152 (708) T ss_pred ccceeeeeeccccc-------------------------------CCCCCCc---------------------------- Confidence 99976542221100 0000000 Q ss_pred eecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) ...+ +...++||.+|||||.++. |++||+|+|+++|||+++++++ |++....... .... T Consensus 153 ---~~i~----i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-yp~~a~~~~~----~~~~-------- 212 (708) T protein:vir:17 153 ---QRIA----IEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE-YGKKPPASLD----VTSM-------- 212 (708) T ss_pred ---cccc----eEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHh-Cccccchhhh----hhhh-------- Confidence 0000 1123357789999997754 9999999999999999999888 4443211100 0000 Q ss_pred cccccccCCcCCceEEEEEEEEEeec--------C---C-------------------Cce----------EEEEEEEEE Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDI--------H---G-------------------DGV----------LHPIVATWV 347 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~--------~---~-------------------~g~----------~~~~~~~~~ 347 (726) .... +.....++|+|+|||+|... + | .|. .++++++|. T Consensus 213 ~~~~--~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~ 290 (708) T protein:vir:17 213 TSWE--YDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVD 290 (708) T ss_pred cccc--ccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeec Confidence 0111 11223578999999987431 0 1 011 123334456 Q ss_pred CCEEEEeccCCCCCCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh Q lcl|NC_013692. 348 GAVMIRMEENPFPDKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR 425 (726) Q Consensus 348 g~~~l~~~~~P~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~ 425 (726) |..+| .+++|+||++|||+|+++++. ++...++|+++.++|+|+.+|+++|+++++++++++.+++++.+++.+... T Consensus 291 g~~~l-~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~ 369 (708) T protein:vir:17 291 GDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEK 369 (708) T ss_pred ccccc-cCCCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHH Confidence 66666 578999999999999999876 455555899999999999999999999999999999999999988754321 Q ss_pred h--------------hhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhH Q lcl|NC_013692. 426 R--------------RFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTA 491 (726) Q Consensus 426 ~--------------~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta 491 (726) . +..++.+..+++++.+. ...+.|.+|+.+..+++.....++++|||+++++|..++ .++ T Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~---~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn---~SG 443 (708) T protein:vir:17 370 HWEARNKKRPAFLPLREVRDKYGNIIAGATPA---GYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQ 443 (708) T ss_pred hhhhcccchhhhhhhhccCCcccccccccCCc---ccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc---hHH Confidence 1 11233333344444332 234577899999999999999999999999999997443 478 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecch--------------hhcc Q lcl|NC_013692. 492 TAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIRR--------------DDLA 553 (726) Q Consensus 492 ~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~~--------------~~~~ 553 (726) .+|++++++|.+.+..+++||..+++++|+++|+||.+||++++++||+|+ .++.+|. +... T Consensus 444 ~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~ 523 (708) T protein:vir:17 444 ETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSV 523 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeecccee Confidence 889999999999999999999999999999999999999999999999986 3444432 3345 Q ss_pred cccceeeecccchHHH--HHHHHHHHHHHHhhhccchh-HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH Q lcl|NC_013692. 554 GNFDLKLDISTAEEDN--AKVNDLTFMLQTMGPNMDPM-MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL 630 (726) Q Consensus 554 ~~~dv~i~~~~~~~~~--~~~~~l~~l~q~~~~~~~~~-~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~ 630 (726) ++|||.|..++.+..+ +..+.+.++++.+.+..+.. ....++...+++.+..++.++++..........+..++.++ T Consensus 524 g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q 603 (708) T protein:vir:17 524 GRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQ 603 (708) T ss_pred eeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHH Confidence 7899999987765543 34445555665554433221 22233445566666677777776655443333222221111 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_013692. 631 MLLQA-QIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLA--- 706 (726) Q Consensus 631 q~~qa-q~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~--- 706 (726) +..++ ++++++++.+..+++++....+++..++++++.+.+++..+.+....+...+..+..+ ++...+..++.+ T Consensus 604 ~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~-q~~~~~~~~~~~~~~ 682 (708) T protein:vir:17 604 IVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLA-QARNIDDKAVMEAIR 682 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 11111 1111111111111111111111111111111111111111111111111000011000 000011111111 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 707 MLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 707 ~l~~~~~~~~~~~~a~~~~q 726 (726) .++......+...++..+.- T Consensus 683 ~l~~~q~~q~q~~~a~p~~~ 702 (708) T protein:vir:17 683 LLKDVAESQQQQFQSPPQSP 702 (708) T ss_pred HhhhhhhhHHHHHhccccCc Confidence 11111111111111111111 No 20 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=2.2e-73 Score=418.93 Aligned_cols=606 Identities=15% Similarity=0.179 Sum_probs=385.1 Q ss_pred hcCCC-----CCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC----CC--------CCC Q lcl|NC_013692. 9 LTLPN-----EDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG----KP--------KTE 71 (726) Q Consensus 9 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~----~~--------~~~ 71 (726) -+++- +|.+..|+++++++....|..+-+.++++++.+..+.. +..+||...+++ .+ ..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~---e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (641) T protein:vir:94 1 MTIEMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWD---ETYELYRASAIDRQNTRARNFQTTGADDA 77 (641) T ss_pred CccCCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHH---HHHHHhhcchhhhhhcccccccccccchh Confidence 11111 25566777788888666666666666666655554432 224444433321 01 122 Q ss_pred CCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 72 KGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 72 ~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) .+||+++++.+.+.++|++|+||+.||++++||+|.|++++|+++|++.+.++|+++ +++++++++++|++++|+.|+| T Consensus 78 ~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l-~~~~~~~~~~~~~~d~~~~g~~ 156 (641) T protein:vir:94 78 DWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKL-EAASIRDIFETYVRNLVLYGVS 156 (641) T ss_pred cccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHH-hhcchHHHHHHHHHHHhhcCce Confidence 358999999999999999999999999999999999999999999999999999998 6899999999999999999999 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) |+|++|+..+.+..+.+...+ ..+... +... T Consensus 157 iv~~~w~~~~~~~~~~~~~~~----------------~~~~~~---------------------------------~~~~ 187 (641) T protein:vir:94 157 TYRLGWDTSMERQFKRTFVET----------------GDIFGG---------------------------------WEDV 187 (641) T ss_pred EEEeehhhHHHHhhhhhcccc----------------hhhccc---------------------------------cccc Confidence 999999865543222211000 000000 0011 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEE-EeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIET-FESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVR 310 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~-~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 310 (726) ...+.+..++++.|+|++|||||++. ..++.|++++ ..+|+.+|+++||| +++.+.+......... ...... T Consensus 188 ~v~~~~~~~r~~~v~~~di~~dps~~--~~~~~f~~~r~t~~t~~~l~~eg~~-~~d~v~~~~~~~~~~~----~~d~~~ 260 (641) T protein:vir:94 188 AVNRQRSELRIEPLSPYDVWLDTSGG--KNTGTFVRLRHTREELHELVTSGYY-DLDLTQVEQYVDYKFA----DPDTPK 260 (641) T ss_pred ceecccceeeEEecchhheeecCCCC--cccccceehhhhHHHHHHHHhcCCC-Chhhcchhhccccccc----cccccc Confidence 11222345677889999999999985 3566676655 45677778888875 4444332211111100 011111 Q ss_pred ccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 311 NFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 311 ~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) ..++.+ ..+++++|||+.++.+|..+ +.+++++.|++||+.+.++|+ +.+||+++++.++++++||.|++..+.+. T Consensus 261 d~~~~~--~~~~~~~e~~gd~~~d~~~~-~~~~~~~~g~~il~~~~~~~~-d~~Pf~~~r~~~~~~~~YG~gp~~~~l~d 336 (641) T protein:vir:94 261 DVNGTD--TSGWDIIEYYGPLLVEGVQF-WCVHAVFYGKQLIRLSDSKYW-CGSPFVTTTLLPDRDSVYGMSVLHPNLGA 336 (641) T ss_pred cccccc--ccccceeeeeeeeccCCCce-eeEEEEEeCCEEeeccccccc-CcCCeEEecceecCCcccCCChHHHHHHH Confidence 222222 33567899998665544432 346688899999999999875 46799999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc-chhHHHHHHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE-IPQSAQYMINLQQAEA 469 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~-~~~~~~~ll~~~~~~~ 469 (726) |+.+|++.+.+++++.++++|++++..+++.........||+++.++..+. +.+...+. .....+.+++++...+ T Consensus 337 qk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~----v~pl~~~~~~~~~~~~~~~~~~~~i 412 (641) T protein:vir:94 337 LHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGS----LQPIDMGRQDFVVTYQEAQVQESSV 412 (641) T ss_pred HHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCc----ceeecCCccccchhHHHHHHHHHHH Confidence 999999999999999999999999988887555667788999987765432 33333222 2234566788888899 Q ss_pred HHHhchHHHhhccCcccch-hhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecc----- Q lcl|NC_013692. 470 ESMTGVKAFNAGISGAALG-DTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNE----- 542 (726) Q Consensus 470 e~~tGv~~~~~G~~~~~~~-~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~----- 542 (726) ...+++..+++|..+.... .||+++++++++++.++..++++|.. ++..+++.++.+++++++.+.++|+.|. T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~ 492 (641) T protein:vir:94 413 YRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMD 492 (641) T ss_pred HHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcc Confidence 9999999998887765433 49999999999999999999999985 8889999999999999999999999986 Q ss_pred cceecchhhcccccceeeecccchH--HHHHHHHHHHHHHHhhhccc---hhHHHHHHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 543 HFVDIRRDDLAGNFDLKLDISTAEE--DNAKVNDLTFMLQTMGPNMD---PMMAQQIMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 543 ~~v~v~~~~~~~~~dv~i~~~~~~~--~~~~~~~l~~l~q~~~~~~~---~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) .++++.|+++++++++ +..+.+.. +.+..+.+..+++.++...+ ......++...++..++......++..+.+ T Consensus 493 ~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~ 571 (641) T protein:vir:94 493 GFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMRYIKKAEAP 571 (641) T ss_pred cCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchhhccCccCc Confidence 4788999999998887 34444332 23344555555565554211 111223455556666655554444432211 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAG----LQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRE 693 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~----~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e 693 (726) +.+.+. .+++ ++.+..++++..+. ...++...+ +.+.+..++.....+..+|+.+ T Consensus 572 ~~~~~~---------~~~~--~q~~~~~~a~~~~~~~~~~a~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-------- 630 (641) T protein:vir:94 572 PAAPPI---------APAE--PGALPPEMMNSVGGGLNDQAIAGMTPE--DVSDLASRIGIDTSDVAPEAMA-------- 630 (641) T ss_pred hhHHHH---------HHHH--HHHHHHHHHHHHHhhhHHHHHHHhhHH--HHHHHHHhhcCCchhhhHHHHh-------- Confidence 111000 0000 00000000000000 000000000 0011111111111111111110 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 694 LQQAQSEAQGKLAMLNSQL 712 (726) Q Consensus 694 ~~~~q~~~q~~~~~l~~~~ 712 (726) ++-.+. ...+| T Consensus 631 ------~~~~~~--~~~~~ 641 (641) T protein:vir:94 631 ------AATQQI--TSGAL 641 (641) T ss_pred ------cccccc--cccCC Confidence 000000 00001 No 21 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=6e-71 Score=405.55 Aligned_cols=604 Identities=15% Similarity=0.134 Sum_probs=362.4 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC----------C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPK----------T 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~----------~ 70 (726) |+| ..+..+..+...|+.+..+....-..-.+..+||+++|+.=++ . T Consensus 1 m~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q 57 (708) T protein:vir:10 1 MAE-----------------------TLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQ 57 (708) T ss_pred Cch-----------------------hHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhh Confidence 111 1234667777778777666665555444456788776754222 2 Q ss_pred CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcc-hHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcC Q lcl|NC_013692. 71 EKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWE-DAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEG 149 (726) Q Consensus 71 ~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~ 149 (726) ..||+-++=+.|+-+|+|++-.-.+ +-.=+.|.|.+++ |.+.|+..|.+++|+.. .++.-....++|.++|++| T Consensus 58 ~~grP~~~~N~i~~~v~~v~g~~~~----nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~d~i~~G 132 (708) T protein:vir:10 58 FEKYPKFEINKVATELNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGG 132 (708) T ss_pred hcCCCceEEcchHHHHHHHHHHHHh----CCcceEEEcCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHhhhhcc Confidence 3478889999999999999975543 4444799999765 99999999999999874 6666667779999999999 Q ss_pred CeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccccee Q lcl|NC_013692. 150 TIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEE 229 (726) Q Consensus 150 ~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 229 (726) .|.+++.-+++.+ .++... T Consensus 133 ~Gw~~~~~d~~~e-------------------------------~d~~~~------------------------------ 151 (708) T protein:vir:10 133 FGCFRLTSMLVNE-------------------------------YDPMDD------------------------------ 151 (708) T ss_pred cceeeeeeccccc-------------------------------cCCCCC------------------------------ Confidence 9977553321110 000000 Q ss_pred ecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCCcchhh-cCcccchhhcccchhhhhc Q lcl|NC_013692. 230 EEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRYQNLDK-IQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 230 ~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~~~d~-~~~~~~~~~~~~~~~~~~~ 307 (726) ....+ +...+.||++|||||.++. |++||+|+|+++|||+++++++ |++.... ..+.. . T Consensus 152 -~~~i~----i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-~p~~a~~~~d~~~-----~-------- 212 (708) T protein:vir:10 152 -RQRIA----IEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE-YGKKPPTSLDVTS-----M-------- 212 (708) T ss_pred -ccccc----eEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHh-CCCCccccccccc-----C-------- Confidence 00001 1123446689999997754 9999999999999999999988 4432211 11100 0 Q ss_pred cccccccCCcCCceEEEEEEEEEeec-----------CCC-------------------c--------e--EEEEEEEEE Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDI-----------HGD-------------------G--------V--LHPIVATWV 347 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~-----------~~~-------------------g--------~--~~~~~~~~~ 347 (726) ....++| ...+.|+|.|||.+..+ .|. | + .++++.++. T Consensus 213 ~~~~~~~--~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~ 290 (708) T protein:vir:10 213 TSWEYNW--FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVD 290 (708) T ss_pred CCccccc--cCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeec Confidence 0011111 23456888999876311 010 1 0 123344566 Q ss_pred CCEEEEeccCCCCCCccceEEeeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh Q lcl|NC_013692. 348 GAVMIRMEENPFPDKRIPYVVVNYIPR--KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR 425 (726) Q Consensus 348 g~~~l~~~~~P~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~ 425 (726) |..+| ..++||||++|||+|+++++. .+...++|+++.++|+|+.+|+++|++++++++++....+++.+++..... T Consensus 291 g~~~l-e~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~ 369 (708) T protein:vir:10 291 GDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEK 369 (708) T ss_pred chhhh-ccCCCCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHH Confidence 77777 678999999999999999876 455666899999999999999999999999999999999999888755432 Q ss_pred hh----hcCCceEe-----ecCccchhh--hcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHH Q lcl|NC_013692. 426 RR----FDRGENYE-----FNPGADPRA--AVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAV 494 (726) Q Consensus 426 ~~----~~~g~vi~-----~~~~~~~~~--~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i 494 (726) .. .....+.. .+.|..... .....+.|++|.++..|++.....++++||+++.++|..++ .|+.+| T Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn---~SG~aI 446 (708) T protein:vir:10 370 HWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN---IAQETV 446 (708) T ss_pred HHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc---hHHHHH Confidence 21 11111221 122221111 12233567899999999999999999999999999996433 478889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc----cceecc--------------hhhccccc Q lcl|NC_013692. 495 RGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE----HFVDIR--------------RDDLAGNF 556 (726) Q Consensus 495 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~----~~v~v~--------------~~~~~~~~ 556 (726) ++++++|++.+..+++||..+++.+|+++|+||++||++++++||+|+ .++.++ ++...++| T Consensus 447 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~y 526 (708) T protein:vir:10 447 NNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRY 526 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeE Confidence 999999999999999999999999999999999999999999999986 244333 34446799 Q ss_pred ceeeecccchH--HHHHHHHHHHHHHHhhhccchh-HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHH Q lcl|NC_013692. 557 DLKLDISTAEE--DNAKVNDLTFMLQTMGPNMDPM-MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLL 633 (726) Q Consensus 557 dv~i~~~~~~~--~~~~~~~l~~l~q~~~~~~~~~-~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~ 633 (726) ||.|..++.+. +.+..+.+.++++.+.+..+.. ....++-+++++.+..++.++++...+......+..++.+++.. T Consensus 527 Dv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~ 606 (708) T protein:vir:10 527 DVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQ 606 (708) T ss_pred EEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHH Confidence 99999877544 4445556666666665533221 12233445666777778888887765544333322221111111 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_013692. 634 QA-QIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQ-----ESGVQQARKRELQQAQ-SEAQGKLA 706 (726) Q Consensus 634 qa-q~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~q-----e~~~~~~~e~e~~~~q-~~~q~~~~ 706 (726) ++ ++++++++.+..+.+++.. +.++++++.++++.+.+....+. +...+..+. ++.+. .+..++.+ T Consensus 607 ~~q~~~q~q~~~~~~e~qa~~~-----~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~--~~~a~~~~~~~~~~ 679 (708) T protein:vir:10 607 QAQMAAQSQPNPEMVLAQAQMV-----AAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYK--LAQARNIDDKAVME 679 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH Confidence 11 1111121111111111111 11111111111111111111111 110111111 11110 01111111 Q ss_pred HHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 707 MLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 707 ~l~~~~~~~~~~~~a~~~~q 726 (726) . ...++..+..+..+..-+ T Consensus 680 ~-~q~l~~~q~~q~~~~~~~ 698 (708) T protein:vir:10 680 A-IRLLKDVAESQQQQFQSP 698 (708) T ss_pred H-HHHhhhhhhhHHHHHhcc Confidence 1 111111111111111111 No 22 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=1.2e-68 Score=392.87 Aligned_cols=561 Identities=14% Similarity=0.103 Sum_probs=377.5 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCC-----CCCCcCCCHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTE-----KGKSAVQPPTIRKQ 85 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~-----~grs~~v~~~v~~~ 85 (726) |+-...+--|-+.+--.....++.|...+.. +.+.+..+...|-+.|+---.-+..++ .=|+|+..+.+... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~---~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~ 77 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTN---MENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHL 77 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHh---hhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHH Confidence 3332222222222223444555566665553 555555555566444431111122222 23899999999999 Q ss_pred HHHHHHHHHHhhcCCCceEEEecCCcch--HHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 86 AEWRYSSLSEPFLSSPNIFEVNPVTWED--AESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 86 v~~~~~~L~~~f~~~~~~~~~~p~~~~D--~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ++.+.++++..+|.+++||+|+|..++| .++++....|+...+ ++.+-+.++..+|.+.+++|++|.++.|...... T Consensus 78 ~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl-~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~ 156 (599) T protein:vir:31 78 HLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKV-EASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTV 156 (599) T ss_pred HHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhh-hhcchHHHHHHHHhhhcccCceeEeeeEEEccee Confidence 9999999999999999999999999994 455666666766655 4668889999999999999999999988622111 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) + + +| .....+.+|+++ T Consensus 157 ----------~------------------~----------------------d~--------------~v~~~~~~P~~e 172 (599) T protein:vir:31 157 ----------T------------------A----------------------EN--------------QVIKNYSGTVTE 172 (599) T ss_pred ----------e------------------c----------------------cc--------------ccccccccceEE Confidence 0 0 00 112335678999 Q ss_pred eechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhc---CCCc--chhhcCcccchhhcccchhhhhccccccccCCcC Q lcl|NC_013692. 244 VCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKAD---GRYQ--NLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKS 318 (726) Q Consensus 244 ~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~---g~~~--~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (726) +|+|+||||||+| ++++|+.||+ |...|+++|..+ ++++ +.+.+........... .....+.+.....+.+ T Consensus 173 rvsP~Di~~Dp~A-~si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~--~~~~d~~~~~~g~D~~ 248 (599) T protein:vir:31 173 RLSPSDVFWDVTA-DSLPKAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIR--EALADGYNGRRKFDSL 248 (599) T ss_pred eecccceeeCCCC-CCCCcceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCC--ccccchhhhhhhcccc Confidence 9999999999999 5799999987 778899998763 3332 2232221100000000 0011112222222222 Q ss_pred C-------------ceEEEEEEEE-EeecCCCceEEEEEEEEECC-EEEEeccCCCCCCccceEEeeeeeecCcccCCCh Q lcl|NC_013692. 319 R-------------KRLVVHEYWG-YYDIHGDGVLHPIVATWVGA-VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESD 383 (726) Q Consensus 319 ~-------------~~v~v~E~w~-~~~~~~~g~~~~~~~~~~g~-~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~ 383 (726) . .-|+++|||+ .++..+|+..+.+++|++|+ ++++.+.|||||+++||++.++.|+++++||+|+ T Consensus 249 ~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~ 328 (599) T protein:vir:31 249 HKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGP 328 (599) T ss_pred ccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCC Confidence 2 2489999997 88899999999999999996 7789999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHH Q lcl|NC_013692. 384 GALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMIN 463 (726) Q Consensus 384 ~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~ 463 (726) ...+.++|..+|.++|+++|++.+...|.+ ...|.+.+.|.. +.||++|.+...+. +.+.+.|+....+..+++ T Consensus 329 l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l-~~~~dl~~eD~~-~~P~~v~~~~d~~~----vq~~~p~s~~~~a~~~is 402 (599) T protein:vir:31 329 LHRLTGMQYKLDKRENFREDLHDRFLHPSL-KKVGDVREKGMR-GGPNHVFEVEETGD----VQYMTPPAEVLQPDNQLS 402 (599) T ss_pred chhcchHHHHHHHHHHHhhhhhhhhhcccc-cccccccccCcc-CCCCcceeecCCCc----cccccCchhhhhHHHHHH Confidence 999999999999999999999999987733 334445554443 56999998876543 345555555667778899 Q ss_pred HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCcCeEEEEecc Q lcl|NC_013692. 464 LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGI-IEIGRKIIAMNAEFLDDVEVVRITNE 542 (726) Q Consensus 464 ~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~-~~l~~~il~li~q~~d~e~~iRi~~~ 542 (726) +++..+++.||++.+++|..+.. ..||+++++++++|+.+.+..++.|.+++ ++++++++++.++|++++.++|+.++ T Consensus 403 ~~e~~mee~sGvp~~~~G~~~ag-~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~ 481 (599) T protein:vir:31 403 ITLQLMEDLSGAPKESIGQRTAG-EKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNS 481 (599) T ss_pred HHHHHHHHhhccchhhcCCcccc-hhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecc Confidence 99999999999999999987655 58999999999999999999999999865 67999999999999999999999998 Q ss_pred c-----ceecchhhcccccceeeecccchHHHHH-HHHHHHH-HHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHH Q lcl|NC_013692. 543 H-----FVDIRRDDLAGNFDLKLDISTAEEDNAK-VNDLTFM-LQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQ 615 (726) Q Consensus 543 ~-----~v~v~~~~~~~~~dv~i~~~~~~~~~~~-~~~l~~l-~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~ 615 (726) + |++|.++++++.+++..-.+....++.+ .+.+..+ ...+++.+.|...+..+..+ .+....++..+ T Consensus 482 e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~------l~~~~~l~~~~ 555 (599) T protein:vir:31 482 ELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNA------VEYLGDLDAYG 555 (599) T ss_pred cccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHH------HHHHHhccccc Confidence 6 9999999999999984433332222222 2333333 33344445555444322222 22234455555 Q ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH Q lcl|NC_013692. 616 PQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQA---KARALA 669 (726) Q Consensus 616 ~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqa---q~~q~~ 669 (726) ..+.....+.+|++...+|+++++....+ +.++.- -+.+++ T Consensus 556 ~~~~~va~~eqq~~~~m~Q~~lq~~~~~~-------------~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 556 IFTFGIGVQEDQQLARMAQKSTQQTEETA-------------LTQEEVGGPTTDTGQ 599 (599) T ss_pred cCCCchhHHHHHHHHHHHHHHHHHhHhhh-------------hhhhhcCCCCcccCC Confidence 55554444444433333333332211000 000000 000000 No 23 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=1.5e-38 Score=227.94 Aligned_cols=609 Identities=12% Similarity=0.113 Sum_probs=318.2 Q ss_pred CCC-ccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCC Q lcl|NC_013692. 1 MAD-VDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQP 79 (726) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~ 79 (726) |.| -.-||.-- + +.+-..| ...|.+|+...++.-.....-.+.|.+.-+. +-.+ .++ - T Consensus 1 m~~~~~~~~~~t------p-e~la~~W---------~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~--~~~~-~~r--~ 59 (663) T protein:vir:34 1 MNESQPTDFADT------P-QGWAQRW---------QEEMSAAREPLEKWHTQGKEIVKRYRDERDS--AHDA-ETR--W 59 (663) T ss_pred CCccccccchhc------c-hhHHHHH---------HHHHHHHHhccchHHHHHHHHHHHhhccccC--CCcc-ccc--c Confidence 544 22222211 1 1122233 3456666665554444445556666543322 1122 234 3 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCceEEEecCCcc-hHHHHHHHHHHHHHHHhhccc-----chhHHHHHHHHHhhcCCeEE Q lcl|NC_013692. 80 PTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWE-DAESARQNGLVLNQQFNTKLN-----KQRFIDEYVRAGVDEGTIIV 153 (726) Q Consensus 80 ~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~-D~~~A~q~t~~~n~~~~~~~~-----~~~~~~~~~~~~l~~~~~i~ 153 (726) +.++..|+.++|++ .+..|++.|.|...+ |.+.++.+.++|+..+++-.. ....+.-.++++|++|-|++ T Consensus 60 nl~~sni~~i~P~i----Yar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~ 135 (663) T protein:vir:34 60 NLFSTNIQTQMASL----YGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLC 135 (663) T ss_pred chhhhhHHHHhhhh----hcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceE Confidence 78999999999999 889999999997766 545677777777776643332 34567788999999999999 Q ss_pred EEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccc Q lcl|NC_013692. 154 KVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEERE 233 (726) Q Consensus 154 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (726) ++.+..+.+ .+. +. +. ..+++.+ +..+..|-+ ++ T Consensus 136 ~v~Ye~~~~----~~~-~~---~~--------------~~D~~~~------------~~~a~~~~~---~e--------- 169 (663) T protein:vir:34 136 RIRYEVEWE----EVA-GV---DA--------------ILDEATG------------AELAAAVPP---TQ--------- 169 (663) T ss_pred EEEeecccc----hhc-cc---cc--------------cCCCccc------------cchhccccc---ch--------- Confidence 997753321 110 00 00 0001110 000111111 01 Q ss_pred ceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccc Q lcl|NC_013692. 234 ETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFD 313 (726) Q Consensus 234 ~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (726) ........|++|+|.||++||+ + .|++++||+.+.|||+.++.+. |..+.+............. ....++ T Consensus 170 ~~a~E~v~id~v~~~dfl~~pA-r-~W~ev~wva~r~~mtk~e~~~r-f~~~~~~~~~a~~~~~~~~-------~~~~~~ 239 (663) T protein:vir:34 170 RKAYECVETDYLHWQDVLWSPA-R-VWHEVRWLAFRNLLDMREFNAR-FDADGSRNLWASVPKVGKP-------KDGKDG 239 (663) T ss_pred hhcccceeeeeechhhcccchh-h-ccccccceeeeccCCHHHHHHh-hcCChhhhhhhhccCcCCc-------cccCCC Confidence 0112234579999999999995 3 4999999999999999998775 4444433221111100000 011112 Q ss_pred cCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECC--EEEEeccCCCCCCcc---ceEEeeeeeecCcccCCChHHHHH Q lcl|NC_013692. 314 FQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGA--VMIRMEENPFPDKRI---PYVVVNYIPRKRDLYGESDGALLI 388 (726) Q Consensus 314 ~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~--~~l~~~~~P~~~~~~---Pf~~~~~~~~~~~~~g~g~~~~~~ 388 (726) -.+.+.++.+|||.|.|- ..+++|++.+ .+|+.++-|.-...| ||..++. ...++.++...+-... T Consensus 240 ~~~~~~~~a~VwEIWdK~--------~~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~-~~~ds~ipvpd~~~y~ 310 (663) T protein:vir:34 240 QSCHPWDRAEVWEIWDKG--------GRKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLAN-WTTDKVVPRPDFVLAQ 310 (663) T ss_pred CCcchhcCcceeEEEecC--------CcEEEEEEcCcceecccCCCCCCCCCCCCCcccccce-ecCCCeecCCcHHHHH Confidence 223344589999999984 3456666654 467766555433233 5543333 3456777777777999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCceEeecCc------cchhhhcccccCccchhHHHH Q lcl|NC_013692. 389 DNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGENYEFNPG------ADPRAAVHMHTFPEIPQSAQY 460 (726) Q Consensus 389 d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~vi~~~~~------~~~~~~i~~~~~~~~~~~~~~ 460 (726) +.++++|.++. .+..+.-...++++++.|+-..... .....+.++.+... +.++++|.+++.+.+.+.+.. T Consensus 311 ~~~~E~n~~t~-Rin~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~ 389 (663) T protein:vir:34 311 DLYKEIDLVST-RITLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTS 389 (663) T ss_pred HHHHHHHHHHH-HHHHHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccchhHHHHHHH Confidence 99999998765 4566667788899998776532222 12333445544322 234567888888888888888 Q ss_pred HHHHHH---HHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEE Q lcl|NC_013692. 461 MINLQQ---AEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVV 537 (726) Q Consensus 461 ll~~~~---~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~i 537 (726) +.+... ....++||+.|++.|.. ..+.|+++.+...+.++.++..+.+.+.++.+++++...+.|.+.++-+.+- T Consensus 390 l~~~r~qir~d~~qITGiaDi~Rga~--~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~ 467 (663) T protein:vir:34 390 LRDYRRELVDALHQVTGMADIMRGAS--DPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASIL 467 (663) T ss_pred HHHHHHHHHHHHHHHHhHHHHhhccc--CcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHH Confidence 877654 45667999999999954 3358999999999999999999999999999999999999999988888877 Q ss_pred EEecccce---ecc-------hhhcccccceeeeccc-chHH----HHHHHHHHHH----HHHhhhccchh-HHHHHHHH Q lcl|NC_013692. 538 RITNEHFV---DIR-------RDDLAGNFDLKLDIST-AEED----NAKVNDLTFM----LQTMGPNMDPM-MAQQIMGQ 597 (726) Q Consensus 538 Ri~~~~~v---~v~-------~~~~~~~~dv~i~~~~-~~~~----~~~~~~l~~l----~q~~~~~~~~~-~~~~~~~~ 597 (726) +++|.+.- ++. .... ..|.+.|..+. ...+ +....++... .+...+.+... .....+.+ T Consensus 468 ~m~~~elp~~~ei~~~~~~L~n~~~-r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~E 546 (663) T protein:vir:34 468 AQANAEFTFDKELAPKAAELIKSRF-SMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQ 546 (663) T ss_pred HHhcCCCCcccchhHHHHHHhcCCC-cceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHH Confidence 77775432 121 1222 34455544322 2222 2222222111 11111111000 01112222 Q ss_pred HHHhh--hhh---hhhhhHHHHHhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 598 IMELK--KMP---DFAKRIREFQPQPDPIAQQK--AQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALAS 670 (726) Q Consensus 598 ~~~~~--~~~---e~~~~l~~~~~~~~~~~qq~--~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~ 670 (726) +.... ++. .+.-.+............+. ++...+.+++.++.++.+++...+++ +.+++..+... T Consensus 547 llk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeA--------q~e~q~~~~~~ 618 (663) T protein:vir:34 547 MLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKV--------QAEVQGDLLRI 618 (663) T ss_pred HHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHH--------HHHHHHHHHHH Confidence 22211 111 11111111111111111000 01111112222211111111111110 11222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 671 QADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 671 q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) |++.++.+.++++++...+ ....+..++..+-+.+..+++- T Consensus 619 ql~~~~~~~k~~~~a~~~~---------------~~a~q~~~~~~~~r~~~~~a~~ 659 (663) T protein:vir:34 619 QAETQANETKERQQAEWNV---------------REAAQKNLISQAARAMNPQARN 659 (663) T ss_pred HHHHHHHHHHHHHHHHHHH---------------HHHHHhhHHHHHHHhhchhhhc Confidence 3333333333332221111 1111111111111111111111 No 24 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.94 E-value=1.5e-24 Score=151.13 Aligned_cols=514 Identities=12% Similarity=0.057 Sum_probs=268.5 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCC---CCcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKG---KSAVQPPTIRKQAEWRYSSLSEPF 97 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~g---rs~~v~~~v~~~v~~~~~~L~~~f 97 (726) -+++.-+.|++.++..++-.+... ..|.++|. +...++ ....| .++++++.-.+.++.+.+.|+..+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e---~~w~e~~~~~lP~~~~~~~~-~~~~~~~~~~~~~dst~~~a~~~Las~l~~~l 76 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFE---SHWLDLSDFINPRGSRFLTS-DVNRDDRRNTKIVDPTGSMAQRILSSGMMSGI 76 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHH---HHHHHHHHHhccccCCcCCC-CCCcchhhcCccccchHHHHHHHHHHHHHHhh Confidence 344556667777776665544443 33444442 211111 12222 358899999999999999999999 Q ss_pred cC-CCceEEEecCCcchHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 98 LS-SPNIFEVNPVTWEDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 98 ~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) |+ +.+||.+.+-.++..+.+ +..+..+...|. .++.|..++..+++.++.|||++-+.++.. T Consensus 77 tpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~~~---------- 145 (556) T protein:vir:73 77 TSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFN-KSNLYQSLPVMYASLGTFGTGAMAVMEDDQ---------- 145 (556) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeeeeecCC---------- Confidence 98 899999988654433322 225566666664 577899999999999999999984322200 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) ..+++..+++.+| T Consensus 146 -------------------------------------------------------------------~~~r~~~~~l~~~ 158 (556) T protein:vir:73 146 -------------------------------------------------------------------DVIRTMPFPIGSY 158 (556) T ss_pred -------------------------------------------------------------------ceEEEEEeeccee Confidence 0123456788899 Q ss_pred eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEE-EE Q lcl|NC_013692. 251 VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEY-WG 329 (726) Q Consensus 251 ~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~-w~ 329 (726) ++..++..+++. |++++.+|...+.++ |..+ .++.. +....+ .+....+|.|++| |- T Consensus 159 ~~~~d~~G~vd~---i~r~~~~t~~ql~~~-fg~~--~l~~~---v~~~~~-------------~~~~~~~~~v~~~V~p 216 (556) T protein:vir:73 159 YLANSPRGSVDT---CIRQFSMTVRQMVQE-FGLD--NVSTS---VKGMWE-------------NGTYETWVEVNHCITP 216 (556) T ss_pred EEeeCCCCCeEE---EEEEEeccHHHHHHH-cCcc--cCCHH---HHHHHh-------------cCCccceEEEEEEEec Confidence 999987655433 688899999998776 3221 11110 000000 0111235777765 33 Q ss_pred EeecCCCc----eEEEEEEEE----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCC-hHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 330 YYDIHGDG----VLHPIVATW----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGES-DGALLIDNQRIIGAVTRG 400 (726) Q Consensus 330 ~~~~~~~g----~~~~~~~~~----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g-~~~~~~d~Q~~~N~~~~~ 400 (726) +.+.+.++ .+.+..++| .|.++++ ++.| .++||++..|...++..||+| .+....+-.+.+|.+.+. T Consensus 217 r~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~--esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~ 292 (556) T protein:vir:73 217 NVNRDSGKMDSKNKPYRSVYFESGGDSDKLLR--ESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKR 292 (556) T ss_pred cccccccccCcccceEEEEEEEecCCCceecc--cCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHH Confidence 33222111 111222222 2345664 4556 568999999999999999999 599999999999999999 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc-chhHHHHHHHHHHHHHHHHhchHHH- Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE-IPQSAQYMINLQQAEAESMTGVKAF- 478 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~-~~~~~~~ll~~~~~~~e~~tGv~~~- 478 (726) .+.++.+..+|+++++.+... ......||+++....++.. ..+.+...-. .-..+...++.+.+.+...--+.-+ T Consensus 293 ~l~~~~~~~~pp~~v~~~~~~--~~~~~~pgg~~~~~~~~~~-~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~ 369 (556) T protein:vir:73 293 KAQLIDKATNPPMVAPTSLKN--QRVSLLPGDVTYLDVISGQ-DGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFM 369 (556) T ss_pred HHHHHHHHhcCceeccccccc--cceeeccCccccccCCCCc-cceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhh Confidence 999999999999999887532 3445678887655433322 2344332111 1122223333333333322111100 Q ss_pred hhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcc-ccc Q lcl|NC_013692. 479 NAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLA-GNF 556 (726) Q Consensus 479 ~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~-~~~ 556 (726) +.+. .++..-||+++....+.....+..++.+|.. .+..+..+.+.++.+..--+ .-|+.+. ..+ T Consensus 370 ~l~~-~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP------------~~P~~l~~~~i 436 (556) T protein:vir:73 370 MLQN-INTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLP------------EPPDVLQGMPL 436 (556) T ss_pred hhcc-CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC------------CCchhhcCcee Confidence 1121 2233359999999999999999999999864 67888888888887743211 1222222 233 Q ss_pred ceeeecccchHHHH-H---HHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHH Q lcl|NC_013692. 557 DLKLDISTAEEDNA-K---VNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQ 627 (726) Q Consensus 557 dv~i~~~~~~~~~~-~---~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q 627 (726) .+......+...+. . ..++......+++ +.| .+....+..++...+++. ..+...+ +.++ T Consensus 437 ~v~yis~La~aqk~~~~~~i~~~~~~~~~laq-~~Pe~~d~id~d~~~~~~a~~~Gvp~--~~irs~e--------ev~~ 505 (556) T protein:vir:73 437 RIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQ-FKPEALDKLDVDQAIDAFSEMSGVSP--TVIVPQE--------QVQG 505 (556) T ss_pred EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChhhHhcCCHHHHHHHHHHHcCCCh--hhcCCHH--------HHHH Confidence 34433333322111 1 1222222222222 122 222334444444444442 1111110 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH Q lcl|NC_013692. 628 LELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQAD---MTDLNFLEQESGVQQA 689 (726) Q Consensus 628 ~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~---~~~~e~~~qe~~~~~~ 689 (726) +.+++.++|..++ +++...+. ++..+..+... ...++........-++ T Consensus 506 ~rq~r~~~qq~~~--~~~~~~~a------------~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 506 IREERAKQAQAAQ--AMAMGQAA------------AQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHHH--HHHHHHHH------------HHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 0000000000000 00000000 00000000000 0000000000000000 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.93 E-value=2.6e-24 Score=149.87 Aligned_cols=523 Identities=13% Similarity=0.055 Sum_probs=270.2 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hccCCCCCCCCCC---CCCcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDY---MHVRGEGKPKTEK---GKSAVQPPTIRKQAEWRYSSLSEPFLS- 99 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---y~~~~~~~~~~~~---grs~~v~~~v~~~v~~~~~~L~~~f~~- 99 (726) =++++.+.|++.++..++-.+...+.-.+..+| |.+...++ .... ..++++++...+.++.+.+.|+..+|+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTS-EVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCC-CCCcccccccccccchHHHHHHHHHHHHHHhhcCC Confidence 566677788888877666555554332222233 22221112 1222 246789999999999999999999998 Q ss_pred CCceEEEecCCcchHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccccccc Q lcl|NC_013692. 100 SPNIFEVNPVTWEDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEM 173 (726) Q Consensus 100 ~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~ 173 (726) +.+||.+.+..++..+.+ +..+..+...|. .++.|..++..+++.++.|||++-+.++.. T Consensus 80 ~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~Gta~l~~~~d~~------------- 145 (559) T protein:vir:95 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSNLYQSLPQLYGSLGTYSTGAMAVLDDDE------------- 145 (559) T ss_pred CCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeEeecCCC------------- Confidence 899999988554422222 223344555554 567888899999999999999984422200 Q ss_pred CCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeC Q lcl|NC_013692. 174 MPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVID 253 (726) Q Consensus 174 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~d 253 (726) ..+++..++..+|++. T Consensus 146 ----------------------------------------------------------------~~~r~~~~~l~~~~v~ 161 (559) T protein:vir:95 146 ----------------------------------------------------------------DIIRTMPFPIGSYYLA 161 (559) T ss_pred ----------------------------------------------------------------ceeEEEEeecCeEEEe Confidence 0123456778899998 Q ss_pred CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEE-EEEee Q lcl|NC_013692. 254 PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEY-WGYYD 332 (726) Q Consensus 254 p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~-w~~~~ 332 (726) .++..+++. |+++..+|..+|.++ |..+ .++... ....+ .+...+.|+|++| |.+.+ T Consensus 162 ~d~~G~vd~---i~r~~~~t~~ql~~~-fg~~--~l~~~~---~~~~~-------------~~~~~~~v~v~~~V~pr~~ 219 (559) T protein:vir:95 162 NSPRGSVDT---CFRKFSMTVRQLVQE-FGLN--NVSESV---KSMWE-------------SGTYEKWIEVMHSVYPNID 219 (559) T ss_pred eCCCCCeEE---EEEeEecCHHHHHHH-cCcc--cCCHHH---HHHHh-------------cCCCCCeEEEEEEEecccc Confidence 887554433 678899999998776 3221 111100 00000 0111245888876 33333 Q ss_pred cCCCce----EEEEEEEE----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCC-hHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 333 IHGDGV----LHPIVATW----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGES-DGALLIDNQRIIGAVTRGMID 403 (726) Q Consensus 333 ~~~~g~----~~~~~~~~----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g-~~~~~~d~Q~~~N~~~~~~~d 403 (726) .+.++. +.+..+++ .+.++++. +.| .++||++..|.+.++..||+| .+....+-.+.+|.+.+..+. T Consensus 220 ~~~~~~~~~~~pf~s~~~e~~~~~~~~l~e--sg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~ 295 (559) T protein:vir:95 220 RDTSKLDSKNKPFKSVYYEVGGDNDKLLRE--SGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQ 295 (559) T ss_pred ccccccccccceEEEEEEEecCCCceeeec--CCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHH Confidence 322211 11111222 12356653 445 569999999999999999999 699999999999999999999 Q ss_pred HHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc-chhHHHHHHHHHHHHHHHHhchHHH-hhc Q lcl|NC_013692. 404 TMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE-IPQSAQYMINLQQAEAESMTGVKAF-NAG 481 (726) Q Consensus 404 ~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~-~~~~~~~ll~~~~~~~e~~tGv~~~-~~G 481 (726) ++.+..+|+++++.+... ...+..||++..+..+.. ...+.+..... ....+...++.+.+.+...--..-+ +.+ T Consensus 296 ~~~~~~~pp~~v~~~~~~--~~~~l~pgg~~~~~~~~~-~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~ 372 (559) T protein:vir:95 296 LIDKATNPPMVAPTSLKN--QRASLLPGDITYIDQITG-QDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ 372 (559) T ss_pred HHHHHhcCceeccccccc--cceeeeccceeeeCCCCC-cccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhh Confidence 999999999999877642 334567999887755432 23344332111 1112222233333333322211100 011 Q ss_pred cCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcc-ccccee Q lcl|NC_013692. 482 ISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLA-GNFDLK 559 (726) Q Consensus 482 ~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~-~~~dv~ 559 (726) ..++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..--+ .-|+.+. ..+.++ T Consensus 373 -~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP------------~~p~~l~~~~i~v~ 439 (559) T protein:vir:95 373 -NINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLP------------PPPDVMEGMPLKVE 439 (559) T ss_pred -cCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCC------------CCcccccCcceEEE Confidence 11222349999999999999999999999864 67788888888887753211 1222221 233444 Q ss_pred eecccchHHHH----HHHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH Q lcl|NC_013692. 560 LDISTAEEDNA----KVNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL 630 (726) Q Consensus 560 i~~~~~~~~~~----~~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~ 630 (726) .....+...+. ...++......+++ +.| .+....+..++...+++. ..++... +.+++.+ T Consensus 440 ~is~La~aqk~~~~~~i~~~~~~~~~laq-~~Pevld~id~d~~~~~~a~~~Gvp~--~~irs~~--------ev~~~rq 508 (559) T protein:vir:95 440 YISVMAQAQKSIGLSSLASTVNFIGQLAQ-VKPEALDKLNVDQAIDAFADMSGVSP--TVIVPQE--------QVEQARQ 508 (559) T ss_pred eecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChhhhhcCCHHHHHHHHHHHhCCch--hhcCCHH--------HHHHHHH Confidence 43333322211 11222233333322 122 223334444444444442 1111110 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 631 MLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGK 704 (726) Q Consensus 631 q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~ 704 (726) +.+++|..++ +++...+. ++....-..++....+.. +++..+- .-.-++++ T Consensus 509 qr~~~qq~~q--~~~~~~~a------------a~~~~~~~~~~~~~~~~l---~~~~~~~------~~~~~~~~ 559 (559) T protein:vir:95 509 QRAQQQQQQQ--MMAMGMAA------------AQGVKTLSEAKTSDPSVL---SAMANAV------SGQGGQSQ 559 (559) T ss_pred HHHHHHHHHH--HHHHHHHH------------HHhhhccccccCCChhHH---HHHHHhh------cCccccCC Confidence 0000000000 00000000 000000000000000000 0000000 00000000 No 26 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.93 E-value=1.6e-23 Score=145.55 Aligned_cols=502 Identities=10% Similarity=0.007 Sum_probs=263.3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------CCCCCCCC-----CCCCcCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_013692. 30 PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVR------GEGKPKTE-----KGKSAVQPPTIRKQAEWRYSSLSEPFL 98 (726) Q Consensus 30 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~------~~~~~~~~-----~grs~~v~~~v~~~v~~~~~~L~~~f~ 98 (726) =..+.|.+.++..++-.... ...|.++|.-. ..++.+.. +-.++++++.-.+.++.+.+.||..+| T Consensus 1 ~~~~~l~~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNV---EQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CCHHHHHHHHHHHHHHhhHH---HHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhc Confidence 23344555555444333322 23454554311 11111111 123577889999999999999999999 Q ss_pred C-CCceEEEecCCcchHHH------HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccccc Q lcl|NC_013692. 99 S-SPNIFEVNPVTWEDAES------ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTY 171 (726) Q Consensus 99 ~-~~~~~~~~p~~~~D~~~------A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~ 171 (726) + +.+||.+.+..++..+. -...+..+...+. .++.+..++..+++.++.||+++-+..+. T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~d~------------ 144 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSNFNLEANETYIDLCGYGNAIMVEEEDE------------ 144 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEeccCC------------ Confidence 8 89999988754432221 2334555555554 56788889999999999999998553220 Q ss_pred ccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee Q lcl|NC_013692. 172 EMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV 251 (726) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~ 251 (726) + ....+++..++..+|+ T Consensus 145 ----~-----------------------------------------------------------~~~~~r~~~~pl~~~~ 161 (547) T protein:vir:10 145 ----D-----------------------------------------------------------EEGSVVFQSSPIQDSY 161 (547) T ss_pred ----C-----------------------------------------------------------CCCceeEEEeecceEE Confidence 0 0112345678889999 Q ss_pred eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEE- Q lcl|NC_013692. 252 IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGY- 330 (726) Q Consensus 252 ~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~- 330 (726) +..++..++++ |+|+..+|..+|.++ |..+ .++. . +..... ...+....++.+++|... T Consensus 162 v~~d~~G~v~~---i~r~~~~t~~qi~~~-fg~~--~l~~-~--v~~~~~-----------~~~~~~~~~~~v~~~v~~~ 221 (547) T protein:vir:10 162 FEEDSRGQVVN---FYRVFRWTPAQIYDR-FGDE--GTPE-A--IIKKAK-----------EASNQAALKQEVVMCVFTR 221 (547) T ss_pred EeeCCCcCeee---eeeeeeccHHHHHHh-cCcc--cCCH-H--HHHHHh-----------cCCCcccceEEEEEEEeec Confidence 98887555544 578889999998876 3221 1110 0 000000 001112235777766332 Q ss_pred eecCCCc---------eEEEEEEEE-EC--CEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHH Q lcl|NC_013692. 331 YDIHGDG---------VLHPIVATW-VG--AVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVT 398 (726) Q Consensus 331 ~~~~~~g---------~~~~~~~~~-~g--~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~ 398 (726) .+.+++. ...+..+++ .+ .+++. ++.| .++||++..|...++..||.|.+....+-.+.+|.+. T Consensus 222 ~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~--esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~ 297 (547) T protein:vir:10 222 YDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGE--EGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYV 297 (547) T ss_pred cCCCCCccccceeeccccceeEEEEEecCceeeee--cCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHH Confidence 2221110 001111221 22 34554 4445 5689999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHH Q lcl|NC_013692. 399 RGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAF 478 (726) Q Consensus 399 ~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~ 478 (726) +.+++++.+..+|+++++.+.+.. ..+..||+++.+.+.. .+.++............++.+.+.+...==+..+ T Consensus 298 ~~~l~~~~~~~~pp~~v~~~g~~~--~~~~~pgg~~~~~~~~----~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~ 371 (547) T protein:vir:10 298 ELVLRSSEKVIDPAIMVTERGLIS--DIDLGASGLTVVRDME----SMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQL 371 (547) T ss_pred HHHHHHHHHHhcCceecccccccc--cceecCCeeeecCCcc----cceeeecccchHHHHHHHHHHHHHHHHHhhhhhh Confidence 999999999999999988665532 3456799988664332 3444444433333444555554444432111111 Q ss_pred hhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhc----c Q lcl|NC_013692. 479 NAGISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDL----A 553 (726) Q Consensus 479 ~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~----~ 553 (726) +. .++..-||+++....+.....+..++.+|. +.+..+..+.+.++.+..--+. -|+.+ . T Consensus 372 --~~-~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~------------~p~~l~~~~~ 436 (547) T protein:vir:10 372 --QM-KDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGE------------LPSKLLESGK 436 (547) T ss_pred --hc-CCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC------------CchhhhccCc Confidence 21 223346999999999999999999999987 4667888888888776432211 12221 1 Q ss_pred cccceeeecccchHHHH-HHH---HHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhh Q lcl|NC_013692. 554 GNFDLKLDISTAEEDNA-KVN---DLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQ 624 (726) Q Consensus 554 ~~~dv~i~~~~~~~~~~-~~~---~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq 624 (726) ..++|+.....+...+. ..+ +.......+++ +.| .+....+..++...+++. ..+... T Consensus 437 ~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq-~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~---------- 503 (547) T protein:vir:10 437 AAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAE-INPEVLDIPDWDEMVRMLGSLLGAPQ--TLMRPK---------- 503 (547) T ss_pred ceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhc-cChhhhhcCCHHHHHHHHHHHhCCCh--hccCCH---------- Confidence 23334433333222111 111 22222222222 122 222334444444444431 111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) .+.+++.++.+ ++++.++++..+.+ +...-+....-.+.+++.+ T Consensus 504 ---eev~~~r~qr~----~~~q~~~qaa~~~~-~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 504 ---AKVTSIRKNRS----QTQQKAEQAAIAEA-EGNAMEAQGKGQAALKENQ 547 (547) T ss_pred ---HHHHHHHHHHH----HHHHHHHHHHHHHH-HHHHHHhhcCcccchhccC Confidence 01111100000 00000000000000 0000000000000000000 No 27 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.92 E-value=7.6e-23 Score=141.85 Aligned_cols=520 Identities=12% Similarity=0.026 Sum_probs=271.1 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CC----CCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_013692. 26 WSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVR-GE----GKPKTEKGK---SAVQPPTIRKQAEWRYSSLSEPF 97 (726) Q Consensus 26 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~-~~----~~~~~~~gr---s~~v~~~v~~~v~~~~~~L~~~f 97 (726) +.+....+.|.+.++..++-.+.. ...|.++|.-+ |. -+.....|+ .+++++.-.+.++.+.+.|+..+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESW---MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHH---HHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 777777778888887665544333 23344444211 10 011122332 56889999999999999999999 Q ss_pred cC-CCceEEEecCCcchHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 98 LS-SPNIFEVNPVTWEDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 98 ~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) |+ +.+||.+.+..++..+.+ ...+..+...|. .++.+..++..+++.+..|||++-+.++. T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~----------- 145 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDF----------- 145 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCC----------- Confidence 98 899999998655433222 224555555554 57788889999999999999998332210 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) ...+++..++..+| T Consensus 146 ------------------------------------------------------------------~~~~rf~~~pl~~~ 159 (555) T protein:vir:10 146 ------------------------------------------------------------------DAVVYHHSLTAGEY 159 (555) T ss_pred ------------------------------------------------------------------CceEEEEEeeccee Confidence 00123455778889 Q ss_pred eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEE Q lcl|NC_013692. 251 VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGY 330 (726) Q Consensus 251 ~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~ 330 (726) ++..++..++ .=++|+..+|..+|.++ |..+ .++... ....+ .+.....|+|+++... T Consensus 160 ~v~~d~~G~v---d~i~r~~~~t~~ql~~~-fg~~--~l~~~~---~~~~~-------------~~~~~~~v~v~~~V~p 217 (555) T protein:vir:10 160 AIAADNQGRV---NTLYREFQITVAQMVRE-FGKD--KCSTTV---QSLFD-------------RGALEQWVTVIHAIEP 217 (555) T ss_pred EEeeCCCCCE---EEEEEEEeccHHHHHHh-cCcc--cCCHHH---HHHHh-------------cCCCCceEEEEEEEee Confidence 9977765433 33568889999998877 3322 111100 00000 0011246888888543 Q ss_pred -eecCCCc---e-EEEEEEEE----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 331 -YDIHGDG---V-LHPIVATW----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGM 401 (726) Q Consensus 331 -~~~~~~g---~-~~~~~~~~----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~ 401 (726) .+.+.++ . +.+..+++ .|.+++. ++.| ..+||++..|.+.++..||+|.+....+-.+.+|.+.+.. T Consensus 218 r~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~--esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~ 293 (555) T protein:vir:10 218 RADRDPSKRDDRNMAWKSVYFEPGADETRTLR--ESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRK 293 (555) T ss_pred ccCcCcCCCCccccceEEEEEEeccCCccccc--cCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH Confidence 2221111 1 11111222 2345654 4445 5699999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHH-hh Q lcl|NC_013692. 402 IDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAF-NA 480 (726) Q Consensus 402 ~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~-~~ 480 (726) +.++.+..+|+++++.+.. .+.....||++..+.+|...+...+.......-+.+...++.+.+.+...- ..+. .+ T Consensus 294 l~~~~~~~~pp~~v~~~~~--~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~ 370 (555) T protein:vir:10 294 AQAIDYKSNPPLQLPVSAK--NQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLM 370 (555) T ss_pred HHHHHHHhcCceeeccccc--cccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhh Confidence 9999999999999988764 234567899876665544333222222221112333344444444443322 2221 11 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc-ccce Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG-NFDL 558 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~-~~dv 558 (726) ....++..-||+++....+.....+..++.++. +.+..+..+.+.++.+..-- +.-|..+.+ .+++ T Consensus 371 l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l------------P~~P~~l~~~~i~v 438 (555) T protein:vir:10 371 LANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL------------PPPPQEMQGVDLNV 438 (555) T ss_pred ccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC------------CCCchhhcCceeEE Confidence 222334446999999999999999999999986 46678888888887764221 112222222 2333 Q ss_pred eeecccchHHHH----HHHHHHHHHHHhhhccc----hhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH Q lcl|NC_013692. 559 KLDISTAEEDNA----KVNDLTFMLQTMGPNMD----PMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL 630 (726) Q Consensus 559 ~i~~~~~~~~~~----~~~~l~~l~q~~~~~~~----~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~ 630 (726) +.....+...+. ...+++.....+++..| ..+....+..++...+++. ..++..+ +. T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-------------ev 503 (555) T protein:vir:10 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-------------QV 503 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-------------HH Confidence 333333222111 11222222222222111 1222333444444444431 1111100 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 631 MLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQ 695 (726) Q Consensus 631 q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~ 695 (726) +++ .++++++++.++++.++.. ..+..+ .+...+....-....+.++-- --. T Consensus 504 ~~~----r~qr~~~~q~~~~a~~~~q-~~~~~~-------~~~~~~~~~~~~~~~~~~~~~-~~~ 555 (555) T protein:vir:10 504 ALI----RKQRADQQQAAQQAALLNQ-GADTAA-------KLGSVDTSKQNALTDVTRAFS-GYT 555 (555) T ss_pred HHH----HHHHHHHHHHHHHHHHHHH-HHHHHH-------HhcccccCcchhHHHHHhhhc-cCC Confidence 000 0000000000000000000 000000 000000000000000000000 000 No 28 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.92 E-value=7.6e-23 Score=141.85 Aligned_cols=520 Identities=12% Similarity=0.026 Sum_probs=271.1 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CC----CCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_013692. 26 WSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVR-GE----GKPKTEKGK---SAVQPPTIRKQAEWRYSSLSEPF 97 (726) Q Consensus 26 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~-~~----~~~~~~~gr---s~~v~~~v~~~v~~~~~~L~~~f 97 (726) +.+....+.|.+.++..++-.+.. ...|.++|.-+ |. -+.....|+ .+++++.-.+.++.+.+.|+..+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESW---MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHH---HHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 777777778888887665544333 23344444211 10 011122332 56889999999999999999999 Q ss_pred cC-CCceEEEecCCcchHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 98 LS-SPNIFEVNPVTWEDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 98 ~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) |+ +.+||.+.+..++..+.+ ...+..+...|. .++.+..++..+++.+..|||++-+.++. T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~----------- 145 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDF----------- 145 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCC----------- Confidence 98 899999998655433222 224555555554 57788889999999999999998332210 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) ...+++..++..+| T Consensus 146 ------------------------------------------------------------------~~~~rf~~~pl~~~ 159 (555) T protein:vir:10 146 ------------------------------------------------------------------DAVVYHHSLTAGEY 159 (555) T ss_pred ------------------------------------------------------------------CceEEEEEeeccee Confidence 00123455778889 Q ss_pred eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEE Q lcl|NC_013692. 251 VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGY 330 (726) Q Consensus 251 ~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~ 330 (726) ++..++..++ .=++|+..+|..+|.++ |..+ .++... ....+ .+.....|+|+++... T Consensus 160 ~v~~d~~G~v---d~i~r~~~~t~~ql~~~-fg~~--~l~~~~---~~~~~-------------~~~~~~~v~v~~~V~p 217 (555) T protein:vir:10 160 AIAADNQGRV---NTLYREFQITVAQMVRE-FGKD--KCSTTV---QSLFD-------------RGALEQWVTVIHAIEP 217 (555) T ss_pred EEeeCCCCCE---EEEEEEEeccHHHHHHh-cCcc--cCCHHH---HHHHh-------------cCCCCceEEEEEEEee Confidence 9977765433 33568889999998877 3322 111100 00000 0011246888888543 Q ss_pred -eecCCCc---e-EEEEEEEE----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 331 -YDIHGDG---V-LHPIVATW----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGM 401 (726) Q Consensus 331 -~~~~~~g---~-~~~~~~~~----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~ 401 (726) .+.+.++ . +.+..+++ .|.+++. ++.| ..+||++..|.+.++..||+|.+....+-.+.+|.+.+.. T Consensus 218 r~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~--esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~ 293 (555) T protein:vir:10 218 RADRDPSKRDDRNMAWKSVYFEPGADETRTLR--ESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRK 293 (555) T ss_pred ccCcCcCCCCccccceEEEEEEeccCCccccc--cCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH Confidence 2221111 1 11111222 2345654 4445 5699999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHH-hh Q lcl|NC_013692. 402 IDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAF-NA 480 (726) Q Consensus 402 ~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~-~~ 480 (726) +.++.+..+|+++++.+.. .+.....||++..+.+|...+...+.......-+.+...++.+.+.+...- ..+. .+ T Consensus 294 l~~~~~~~~pp~~v~~~~~--~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~ 370 (555) T protein:vir:10 294 AQAIDYKSNPPLQLPVSAK--NQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLM 370 (555) T ss_pred HHHHHHHhcCceeeccccc--cccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhh Confidence 9999999999999988764 234567899876665544333222222221112333344444444443322 2221 11 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc-ccce Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG-NFDL 558 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~-~~dv 558 (726) ....++..-||+++....+.....+..++.++. +.+..+..+.+.++.+..-- +.-|..+.+ .+++ T Consensus 371 l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l------------P~~P~~l~~~~i~v 438 (555) T protein:vir:10 371 LANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL------------PPPPQEMQGVDLNV 438 (555) T ss_pred ccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC------------CCCchhhcCceeEE Confidence 222334446999999999999999999999986 46678888888887764221 112222222 2333 Q ss_pred eeecccchHHHH----HHHHHHHHHHHhhhccc----hhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH Q lcl|NC_013692. 559 KLDISTAEEDNA----KVNDLTFMLQTMGPNMD----PMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL 630 (726) Q Consensus 559 ~i~~~~~~~~~~----~~~~l~~l~q~~~~~~~----~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~ 630 (726) +.....+...+. ...+++.....+++..| ..+....+..++...+++. ..++..+ +. T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-------------ev 503 (555) T protein:vir:10 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-------------QV 503 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-------------HH Confidence 333333222111 11222222222222111 1222333444444444431 1111100 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 631 MLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQ 695 (726) Q Consensus 631 q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~ 695 (726) +++ .++++++++.++++.++.. ..+..+ .+...+....-....+.++-- --. T Consensus 504 ~~~----r~qr~~~~q~~~~a~~~~q-~~~~~~-------~~~~~~~~~~~~~~~~~~~~~-~~~ 555 (555) T protein:vir:10 504 ALI----RKQRADQQQAAQQAALLNQ-GADTAA-------KLGSVDTSKQNALTDVTRAFS-GYT 555 (555) T ss_pred HHH----HHHHHHHHHHHHHHHHHHH-HHHHHH-------HhcccccCcchhHHHHHhhhc-cCC Confidence 000 0000000000000000000 000000 000000000000000000000 000 No 29 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.92 E-value=7.6e-23 Score=141.85 Aligned_cols=520 Identities=12% Similarity=0.026 Sum_probs=271.1 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CC----CCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_013692. 26 WSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVR-GE----GKPKTEKGK---SAVQPPTIRKQAEWRYSSLSEPF 97 (726) Q Consensus 26 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~-~~----~~~~~~~gr---s~~v~~~v~~~v~~~~~~L~~~f 97 (726) +.+....+.|.+.++..++-.+.. ...|.++|.-+ |. -+.....|+ .+++++.-.+.++.+.+.|+..+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESW---MSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHH---HHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 777777778888887665544333 23344444211 10 011122332 56889999999999999999999 Q ss_pred cC-CCceEEEecCCcchHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 98 LS-SPNIFEVNPVTWEDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 98 ~~-~~~~~~~~p~~~~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) |+ +.+||.+.+..++..+.+ ...+..+...|. .++.+..++..+++.+..|||++-+.++. T Consensus 78 tpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~----------- 145 (555) T protein:vir:98 78 TSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDF----------- 145 (555) T ss_pred cCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCC----------- Confidence 98 899999998655433222 224555555554 57788889999999999999998332210 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) ...+++..++..+| T Consensus 146 ------------------------------------------------------------------~~~~rf~~~pl~~~ 159 (555) T protein:vir:98 146 ------------------------------------------------------------------DAVVYHHSLTAGEY 159 (555) T ss_pred ------------------------------------------------------------------CceEEEEEeeccee Confidence 00123455778889 Q ss_pred eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEE Q lcl|NC_013692. 251 VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGY 330 (726) Q Consensus 251 ~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~ 330 (726) ++..++..++ .=++|+..+|..+|.++ |..+ .++... ....+ .+.....|+|+++... T Consensus 160 ~v~~d~~G~v---d~i~r~~~~t~~ql~~~-fg~~--~l~~~~---~~~~~-------------~~~~~~~v~v~~~V~p 217 (555) T protein:vir:98 160 AIAADNQGRV---NTLYREFQITVAQMVRE-FGKD--KCSTTV---QSLFD-------------RGALEQWVTVIHAIEP 217 (555) T ss_pred EEeeCCCCCE---EEEEEEEeccHHHHHHh-cCcc--cCCHHH---HHHHh-------------cCCCCceEEEEEEEee Confidence 9977765433 33568889999998877 3322 111100 00000 0011246888888543 Q ss_pred -eecCCCc---e-EEEEEEEE----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 331 -YDIHGDG---V-LHPIVATW----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGM 401 (726) Q Consensus 331 -~~~~~~g---~-~~~~~~~~----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~ 401 (726) .+.+.++ . +.+..+++ .|.+++. ++.| ..+||++..|.+.++..||+|.+....+-.+.+|.+.+.. T Consensus 218 r~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~--esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~ 293 (555) T protein:vir:98 218 RADRDPSKRDDRNMAWKSVYFEPGADETRTLR--ESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRK 293 (555) T ss_pred ccCcCcCCCCccccceEEEEEEeccCCccccc--cCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH Confidence 2221111 1 11111222 2345654 4445 5699999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHH-hh Q lcl|NC_013692. 402 IDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAF-NA 480 (726) Q Consensus 402 ~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~-~~ 480 (726) +.++.+..+|+++++.+.. .+.....||++..+.+|...+...+.......-+.+...++.+.+.+...- ..+. .+ T Consensus 294 l~~~~~~~~pp~~v~~~~~--~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~ 370 (555) T protein:vir:98 294 AQAIDYKSNPPLQLPVSAK--NQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLM 370 (555) T ss_pred HHHHHHHhcCceeeccccc--cccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhh Confidence 9999999999999988764 234567899876665544333222222221112333344444444443322 2221 11 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc-ccce Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG-NFDL 558 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~-~~dv 558 (726) ....++..-||+++....+.....+..++.++. +.+..+..+.+.++.+..-- +.-|..+.+ .+++ T Consensus 371 l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~l------------P~~P~~l~~~~i~v 438 (555) T protein:vir:98 371 LANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANIL------------PPPPQEMQGVDLNV 438 (555) T ss_pred ccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC------------CCCchhhcCceeEE Confidence 222334446999999999999999999999986 46678888888887764221 112222222 2333 Q ss_pred eeecccchHHHH----HHHHHHHHHHHhhhccc----hhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHH Q lcl|NC_013692. 559 KLDISTAEEDNA----KVNDLTFMLQTMGPNMD----PMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLEL 630 (726) Q Consensus 559 ~i~~~~~~~~~~----~~~~l~~l~q~~~~~~~----~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~ 630 (726) +.....+...+. ...+++.....+++..| ..+....+..++...+++. ..++..+ +. T Consensus 439 ~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-------------ev 503 (555) T protein:vir:98 439 EFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-------------QV 503 (555) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-------------HH Confidence 333333222111 11222222222222111 1222333444444444431 1111100 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 631 MLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQ 695 (726) Q Consensus 631 q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~ 695 (726) +++ .++++++++.++++.++.. ..+..+ .+...+....-....+.++-- --. T Consensus 504 ~~~----r~qr~~~~q~~~~a~~~~q-~~~~~~-------~~~~~~~~~~~~~~~~~~~~~-~~~ 555 (555) T protein:vir:98 504 ALI----RKQRADQQQAAQQAALLNQ-GADTAA-------KLGSVDTSKQNALTDVTRAFS-GYT 555 (555) T ss_pred HHH----HHHHHHHHHHHHHHHHHHH-HHHHHH-------HhcccccCcchhHHHHHhhhc-cCC Confidence 000 0000000000000000000 000000 000000000000000000000 000 No 30 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.91 E-value=3.6e-22 Score=138.16 Aligned_cols=506 Identities=12% Similarity=0.053 Sum_probs=252.3 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRK 84 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~ 84 (726) |+ ++|+ +-..+. ++++.++..++-.+.. ...|.++|. ..+.++... ....++++..-.+ T Consensus 1 m~-----~~~~---~~~~~~---~~~~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~-~~~~~~~dst~~~ 65 (535) T protein:vir:33 1 MA-----DSKR---TGLGED---GAKATYDRLTNDRRAY---ETRAENCAQYTIPSLFPKESDNES-TDYTTPWQAVGAR 65 (535) T ss_pred CC-----hhhh---hccChh---HHHHHHHHHHHHhhHH---HHHHHHHHHHhcccccCCCCCccc-ccccccccccHHH Confidence 21 1221 111111 2233333333222222 233444432 222221111 1124577888889 Q ss_pred HHHHHHHHHHHhhcCCCceEEEecCCcc-------hHH------HHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 85 QAEWRYSSLSEPFLSSPNIFEVNPVTWE-------DAE------SARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 85 ~v~~~~~~L~~~f~~~~~~~~~~p~~~~-------D~~------~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) .++.+.+.|+..+|.+.+||.+.+-.++ +.+ .-+..+..+...| ..++.|..++..+++.+..||| T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a 144 (535) T protein:vir:33 66 GLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAGNA 144 (535) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCce Confidence 9999999999999999999999774321 111 1123444555445 4778999999999999999999 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) ++.+.++... T Consensus 145 ~l~~~~~~~~---------------------------------------------------------------------- 154 (535) T protein:vir:33 145 LLYLPEPEGS---------------------------------------------------------------------- 154 (535) T ss_pred eEEeecCCCC---------------------------------------------------------------------- Confidence 9955333100 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) .++++.++..+|++..++..+++. ++++..+|..+|.+. |..+... +. T Consensus 155 -------~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~-~~~~~~~------------~~--------- 202 (535) T protein:vir:33 155 -------YNPMKLYRLSSYVVQRDAYGNVLQ---IVTRDQIAFGALPED-VRSAVEK------------SG--------- 202 (535) T ss_pred -------ceeeEEEEcCeeEEeeCCCCCeeE---EEeeEeecHHHHHHH-hhhhhcc------------cc--------- Confidence 012344556678888776544433 788999999998554 2221100 00 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQ 391 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 391 (726) ......+.+.+|+|.. .+. .+|...++. .+.+..+.-..+-|+.+.+||++..|...++..||+|.+....+-. T Consensus 203 --~~k~~~~~~~v~~~v~-~~~-~~~~~~~~~--~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~ 276 (535) T protein:vir:33 203 --GEKKMDEMVDVYTHVY-LDE-ESGDYLKYE--EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDL 276 (535) T ss_pred --cccccccCCeEEEEEE-eeC-CCCcEEEEE--EEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHH Confidence 0011234567777643 332 233223332 2333333334455667889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEeecccccchhh-hhhcCCceEeecCccchhhhccccc--CccchhHHHHHHHHHHHH Q lcl|NC_013692. 392 RIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR-RRFDRGENYEFNPGADPRAAVHMHT--FPEIPQSAQYMINLQQAE 468 (726) Q Consensus 392 ~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~-~~~~~g~vi~~~~~~~~~~~i~~~~--~~~~~~~~~~ll~~~~~~ 468 (726) +.+|.+.+..+.++..+.+|+++++.+.+..... ....+|.++...++ .+.+.+ ...-.+.....++.+.+. T Consensus 277 k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~v~g~~~-----~v~~~~~~~~~~~~~~~~~i~~~~~~ 351 (535) T protein:vir:33 277 RSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRE-----DIDFLQLEKQADFTVAKAVSDQIEAR 351 (535) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCceeeecCCcc-----cceeeecccccchhHHHHHHHHHHHH Confidence 9999999999999999999999998777643332 22233333332222 222222 112233445555555555 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceec Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDI 547 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v 547 (726) +...- ..+ +.+. -++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..--+. + T Consensus 352 I~~af-~~~-~~~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-----------~ 417 (535) T protein:vir:33 352 LSYAF-MLN-SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPE-----------L 417 (535) T ss_pred HHHHH-hhh-hccc-CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-----------C Confidence 54432 111 1111 1223369999999999999999999999875 667888888888765322111 1 Q ss_pred chhhcccccceeeecccchHHHH-HHHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 548 RRDDLAGNFDLKLDISTAEEDNA-KVNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 548 ~~~~~~~~~dv~i~~~~~~~~~~-~~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) . ...+.+++..+.+...+. ..+.+...++.++...|. .+...++..++...+++.. ..++. T Consensus 418 p----~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~-~i~~~-------- 484 (535) T protein:vir:33 418 P----KEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS-GILLT-------- 484 (535) T ss_pred C----ccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh-HhcCC-------- Confidence 1 112344444443322221 112222222233221111 1222333333333333311 00000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFL 680 (726) Q Consensus 622 ~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~ 680 (726) +++.++ ++++..+++++.+.+.+........ ...-...++.-....-++.- T Consensus 485 ~ee~~~-----~~~q~~~~~~~~~~~~~~g~~~~~~---~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 485 DEQKQA-----LMMQDAAQTGVENAAAAGGAGVGAL---ATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred HHHHHH-----HHHHHHHHHHHHHHHHhhhhhhcch---hhcCChhHHHHHHhccCCCC Confidence 000000 0000000000000000000000000 00000000000001111110 No 31 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.91 E-value=1.2e-21 Score=135.28 Aligned_cols=510 Identities=12% Similarity=0.029 Sum_probs=265.5 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCC--CCCCC---CcCCC Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPK--TEKGK---SAVQP 79 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~--~~~gr---s~~v~ 79 (726) |+. -+..++..|++.++..++-.+... ..|.++|. +.....++ ...|+ +++++ T Consensus 1 m~~-------------d~~~~~~~l~~r~~~l~~~R~~~e---~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~d 64 (549) T protein:vir:10 1 MTN-------------DDAKILQALNADHGRMKEKRQSYE---AVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFD 64 (549) T ss_pred CCc-------------chHHHHHHHHHHHHHHHHHhhhHH---HHHHHHHHHhccccccccccCCCCCCccccccccccc Confidence 221 224455666666666554444433 33444442 11111111 22343 36788 Q ss_pred HHHHHHHHHHHHHHHHhhcC-CCceEEEecCCcchHHH------HHHHHHHHHHHHh-hcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 80 PTIRKQAEWRYSSLSEPFLS-SPNIFEVNPVTWEDAES------ARQNGLVLNQQFN-TKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 80 ~~v~~~v~~~~~~L~~~f~~-~~~~~~~~p~~~~D~~~------A~q~t~~~n~~~~-~~~~~~~~~~~~~~~~l~~~~~ 151 (726) +.-.+.++.+.+.|+..+|+ +.+||.+.+-.+...+. -.+.+..+..+++ ...+.|..++..+++.++.||| T Consensus 65 stg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta 144 (549) T protein:vir:10 65 STAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPG 144 (549) T ss_pred chHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcce Confidence 88899999999999999998 78999998855443332 2333444444443 3677889999999999999999 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) ++-+..+.. T Consensus 145 ~l~~~~~~~----------------------------------------------------------------------- 153 (549) T protein:vir:10 145 ALMIEHDVG----------------------------------------------------------------------- 153 (549) T ss_pred eeEEeecCC----------------------------------------------------------------------- Confidence 985422100 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) ..+++..++..+|++..++..+++. ++++..+|...|.++ |..+ .++.. +....+ T Consensus 154 ------~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~-fg~~--~l~~~---v~~~~~---------- 208 (549) T protein:vir:10 154 ------KGIVYRNVPMQRLWFAENNSGLIDK---THVQWELTLRQAAQR-FGRE--NLSPS---MQSTLE---------- 208 (549) T ss_pred ------CeeEEEEEEcCeEEEeeCCCCCeEE---EEEEeecCHHHHHHh-cCcc--cCCHH---HHHHhh---------- Confidence 0113455677889998877544433 788999999998776 3221 22110 000000 Q ss_pred cccCCcCCceEEEEEEEEE-eecCC---Cce-EE--EEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGY-YDIHG---DGV-LH--PIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG 384 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~-~~~~~---~g~-~~--~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~ 384 (726) ..+.++|.||++=.. .+.+. ++. .. .++....|+++|.. +.| .++||++..|...++..||.|.+ T Consensus 209 ----~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~e--sg~--~e~P~~~~Rw~~~~ge~YGrgp~ 280 (549) T protein:vir:10 209 ----KDPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQN--SGF--RTFPFAIGRFYVGTDDVYGGSPA 280 (549) T ss_pred ----cCCCceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEeecc--CCc--ccCCcceeeeeecCCCccccchH Confidence 012457888865211 11110 111 11 12222345566654 345 56899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHH Q lcl|NC_013692. 385 ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINL 464 (726) Q Consensus 385 ~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~ 464 (726) ....+-.+.+|.+.+..+.++.+..+|+++++.+.+.. .....||++..+..+......+.+.......+..+.+++. T Consensus 281 ~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~--~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~ 358 (549) T protein:vir:10 281 YDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD--GFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQD 358 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc--cceeccCCccccccCCCCccceeeeccccchhHHHHHHHH Confidence 99999999999999999999999999999998765422 2335688765544332223334443333223344455555 Q ss_pred HHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 465 QQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 465 ~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) +.+.+...==+.-+ +...++..-||+++....+.....+...+.++. +.+..++.+.+.++.+..-- T Consensus 359 ~~~rI~~af~~d~~--~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~l---------- 426 (549) T protein:vir:10 359 TRQTINQWFYVTLF--QILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQL---------- 426 (549) T ss_pred HHHHHHHHHhhhhh--hhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------- Confidence 55544432211111 211233457999999999999999999999986 56778888888877763221 Q ss_pred ceecchhhcc---cccceeeecccchHHH-HHHH---HHHHHHHHhhhccchh-----HHHHHHHHHHHhhhhhhhhhhH Q lcl|NC_013692. 544 FVDIRRDDLA---GNFDLKLDISTAEEDN-AKVN---DLTFMLQTMGPNMDPM-----MAQQIMGQIMELKKMPDFAKRI 611 (726) Q Consensus 544 ~v~v~~~~~~---~~~dv~i~~~~~~~~~-~~~~---~l~~l~q~~~~~~~~~-----~~~~~~~~~~~~~~~~e~~~~l 611 (726) +--|..+. ..++++.....+...+ ...+ +.......+++ +.|. +....+..++...+++. ..+ T Consensus 427 --P~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq-~~Pe~ld~id~d~~~~~~a~~~Gvp~--~~i 501 (549) T protein:vir:10 427 --PDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQ-FDPAAAKVPNGARIARLLADYGGVPV--EAM 501 (549) T ss_pred --CCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChhHHhcCCHHHHHHHHHHhcCCCc--ccc Confidence 11222221 1233333222222111 1111 11222222211 2221 22233334444444331 111 Q ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 612 REFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMT 675 (726) Q Consensus 612 ~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~ 675 (726) +.. .+.++++++.++++ +.+++.+.+. .........++.. ...|..+- T Consensus 502 rs~-------------eev~~~r~~~~~qq-q~~~~~~~a~-~a~~~a~~~~~~~-ta~~~~~~ 549 (549) T protein:vir:10 502 STD-------------EELQAQQAAEAQAA-QMQQMLAAAP-VAAGAIKDLSDAQ-TAAQTARV 549 (549) T ss_pred CCH-------------HHHHHHHHHHHHHH-HHHHHHHHHH-HHHHHHHhhhhhc-CCCcccCC Confidence 110 00000000000000 0000000000 0000000000000 00011010 No 32 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.91 E-value=8.5e-22 Score=136.10 Aligned_cols=506 Identities=10% Similarity=0.032 Sum_probs=254.1 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRK 84 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~ 84 (726) |++.+- +-..+. .+++.++..++-.+.. ...|.++|. ..+.++... .-..++++..-.+ T Consensus 1 m~~~~~--------~~~~~~---~~k~r~~~l~~~R~~~---e~~w~e~~~~~lP~~~~~~~~~~~-~~~~~~~dst~~~ 65 (535) T protein:vir:15 1 MADSKR--------TGLGED---GAKATYDRLTNDRRAY---ETRAENCAQYTIPSLFPKESDNES-TDYTTPWQAVGAR 65 (535) T ss_pred CCccch--------hccchH---HHHHHHHHHHHHhhHH---HHHHHHHHHHhcccccCCCCCccc-ccccccccccHHH Confidence 322210 001111 2233333333322222 233434332 222222111 1124678888899 Q ss_pred HHHHHHHHHHHhhcCCCceEEEecCCc-------chHH------HHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 85 QAEWRYSSLSEPFLSSPNIFEVNPVTW-------EDAE------SARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 85 ~v~~~~~~L~~~f~~~~~~~~~~p~~~-------~D~~------~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) .++.+.+.|+..+|.+.+||.+.+-.. .+.+ .-+..+..+...| ..++.|..++..+++.+..||| T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a 144 (535) T protein:vir:15 66 GLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAGNA 144 (535) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCce Confidence 999999999999999999999977331 1111 1123445555555 4678999999999999999999 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) ++.+.++... T Consensus 145 ~l~~~~~~~~---------------------------------------------------------------------- 154 (535) T protein:vir:15 145 LLYLPEPEGS---------------------------------------------------------------------- 154 (535) T ss_pred eEEeecCCCC---------------------------------------------------------------------- Confidence 9865433100 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) .++++.++..+|++..++..+++ -++++..+|..+|-+. |.+++.. T Consensus 155 -------~~~f~~~pl~~~~v~~d~~G~vd---~i~r~~~~t~~~l~~~-~~~~~~~----------------------- 200 (535) T protein:vir:15 155 -------YNPMKLYRLSSYVVQRDAYGNVL---QIVTRDQIAFGALPED-VRSAVEK----------------------- 200 (535) T ss_pred -------ceeeEEEEcCeeEEeeCCCCCee---EEEEeEeecHHHHHHH-HhHhhhc----------------------- Confidence 01123455667888776644333 3778899999887443 2111100 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQ 391 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 391 (726) ........++|.||++..+ +. +++...++.. ..|.. +.-..+-|+.+.+||++..|...++..||+|.+....+-. T Consensus 201 ~~~~~~~~~~v~v~~~v~~-~~-~~~~~~~~~e-~~g~~-~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~ 276 (535) T protein:vir:15 201 AGGEKKMDEMVDVYTHVYL-DE-ESGDYLKYEE-VEDVE-IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDL 276 (535) T ss_pred cccccCCCCceeEEEEEEE-ec-CCCcEEEEEE-eeCcc-ccccccccccccCCceeeeeeecCCCccccchHHHHHHHH Confidence 0011123457889987543 22 2333233322 22333 3223345666889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEeecccccch-hhhhhcCCceEeecCccchhhhccccc--CccchhHHHHHHHHHHHH Q lcl|NC_013692. 392 RIIGAVTRGMIDTMARSANGQVGVMKGALDVT-NRRRFDRGENYEFNPGADPRAAVHMHT--FPEIPQSAQYMINLQQAE 468 (726) Q Consensus 392 ~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~-d~~~~~~g~vi~~~~~~~~~~~i~~~~--~~~~~~~~~~ll~~~~~~ 468 (726) +.+|.+.+..+.++..+.+|+++++.+.+... +.....+|.++...++ .+.+.+ ...-.+.....++.+.+. T Consensus 277 k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~v~g~~~-----~v~~~~~~~~~~~~~~~~~i~~~~~~ 351 (535) T protein:vir:15 277 RSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRE-----DIDFLQLEKQADFTVAKAVSDQIEAR 351 (535) T ss_pred HHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCceeeecCCcc-----cceeeecccccchhHHHHHHHHHHHH Confidence 99999999999999999999999987765433 3233334444332222 222222 112233445555555555 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceec Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDI 547 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v 547 (726) +...- ..+ +.+. -++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..--+. + T Consensus 352 I~~af-~~~-~~~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-----------~ 417 (535) T protein:vir:15 352 LSYAF-MLN-SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPE-----------L 417 (535) T ss_pred HHHHH-hhh-hccc-CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-----------C Confidence 54432 111 1111 1223369999999999999999999999875 667888888888765322111 1 Q ss_pred chhhcccccceeeecccchHHHH-HHHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 548 RRDDLAGNFDLKLDISTAEEDNA-KVNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 548 ~~~~~~~~~dv~i~~~~~~~~~~-~~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) . ...+.+++..+.+...+. ..+.+...++.++...|. .+...++..++...+++.. ..++. T Consensus 418 p----~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~-~i~~~-------- 484 (535) T protein:vir:15 418 P----KEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS-GILLT-------- 484 (535) T ss_pred C----ccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChh-hhcCC-------- Confidence 1 112344444443322221 122222222333221111 1222333333333333311 00000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFL 680 (726) Q Consensus 622 ~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~ 680 (726) ++ +.+.++++..+++++.+.+.+....... .....=+.+++...+.-++.- T Consensus 485 ~e-----ev~~~~~q~~~~~~~~~~a~~~g~~~~~---~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 485 DE-----QKQALMMQDAAQTGIENAAATGGAGVGA---LATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred HH-----HHHHHHHHHHHHHHHHHHHHHHHhhccc---hhccChHHHHHHHhccCCCCC Confidence 00 0000000000000000000000000000 000000011111111111111 No 33 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.91 E-value=1.2e-21 Score=135.31 Aligned_cols=520 Identities=11% Similarity=0.092 Sum_probs=245.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCce Q lcl|NC_013692. 31 SLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS-SPNI 103 (726) Q Consensus 31 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~-~~~~ 103 (726) -=..+++.++..++-.+.. ...|.++|. ....++... .-..++++..-.+.++.+.+.|+..+|+ +.+| T Consensus 1 m~~~~~~r~~~l~~~R~~~---e~~w~e~~~y~lP~~~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~W 76 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDY---LDSGRQSARLTLPYILTDEGHVQG-GYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSF 76 (555) T ss_pred ChhHHHHHHHHHHHHhhHH---HHHHHHHHHHhcccccCCCCCccc-ccccccccccHHHHHHHHHHHHHHhhcCCCCcc Confidence 1112333343332222211 233444442 222222111 1125788888899999999999999998 7899 Q ss_pred EEEecCCcchH------H-H--H----HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 104 FEVNPVTWEDA------E-S--A----RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 104 ~~~~p~~~~D~------~-~--A----~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) |.+.+-.++.. + . . ...+..+...| ..++.|..++..+++.+..||+++ |-+ .. T Consensus 77 F~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--y~~--------~~-- 143 (555) T protein:vir:17 77 FKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDI-AESSDRVHLEMAMKHLIVTGNALL--YQG--------KK-- 143 (555) T ss_pred cccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEE--Eec--------CC-- Confidence 99998433211 1 1 1 12344454444 367799999999999999999987 221 00 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) . ++.++..+| T Consensus 144 --------------------------~--------------------------------------------~~~~pl~~y 153 (555) T protein:vir:17 144 --------------------------N--------------------------------------------LKLYPLDRF 153 (555) T ss_pred --------------------------c--------------------------------------------eeEEEcCeE Confidence 0 011233457 Q ss_pred eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcc--hhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 251 VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQN--LDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 251 ~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~--~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) ++..++..++ .-++++.++|...|.+. |.++ .+... ...........................+++|.++ T Consensus 154 ~v~~d~~G~v---d~v~rk~~~t~~ql~~~-fg~~~l~~~~~----~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~ 225 (555) T protein:vir:17 154 VVSRDGEGNV---MEIVTEEQIDRSLLPEE-FQKVGGLEGAP----DSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYV 225 (555) T ss_pred EEeeCCCcCe---eEEEeeeeecHHHHHHH-hhhccccchhh----hhhhccccchhhhhhhhcccccCCCcceeEeecc Confidence 7766554333 33788999999998776 2221 11111 0000000000000000001112223456666654 Q ss_pred EEeecCCCceEEEEEEEEECCEEEE--eccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 329 GYYDIHGDGVLHPIVATWVGAVMIR--MEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMA 406 (726) Q Consensus 329 ~~~~~~~~g~~~~~~~~~~g~~~l~--~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~ 406 (726) .+- +|...+ ..-+++..+. ..++|| ..+||++..|...++..||.|.+....+-.+.+|.+.+..++++. T Consensus 226 ~~~----~~~~~~--~~e~~~~~v~~~l~e~g~--~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 297 (555) T protein:vir:17 226 CRK----DGQVKW--HQECDGKVIPGSNSSAPY--THNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSA 297 (555) T ss_pred ccc----CCeeEE--EEecCceeccccccccCc--ccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 331 222222 2223343332 356666 579999999999999999999999999999999999999999999 Q ss_pred hcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCc--cchhHHHHHHHHHHHHHHHHhchHHHhhccC- Q lcl|NC_013692. 407 RSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFP--EIPQSAQYMINLQQAEAESMTGVKAFNAGIS- 483 (726) Q Consensus 407 ~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~- 483 (726) ...+|+++++.+.+.....+...+++.+ .+|. + ..+.+.+.. ..-+..+..++.+.+.+.+. .+..+ T Consensus 298 ~~~~pp~lv~~~g~~~~~~l~~~~~g~v--~~g~-~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a------Fm~~~~ 367 (555) T protein:vir:17 298 ASAKVVFMVSPSATTKPQNLALAANGAI--IQGR-P-DDVSVVQANKAADFRTVLEMIQKLEQRISDA------FLMLQV 367 (555) T ss_pred HHhCCceeeccccccCcceeecCCCcee--ecCC-c-ccceeeeccccchhhHHHHHHHHHHHHHHHH------HhhcCC Confidence 9999999997776644333322222233 2322 1 223333321 11223344455544444433 22222 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeec Q lcl|NC_013692. 484 GAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDI 562 (726) Q Consensus 484 ~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~ 562 (726) .++..-||+++....+.....+..++.+|. +.+..++.+.+.++.+..--+. + |.+. ..+.+.+ T Consensus 368 ~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~-----------~-p~~~---v~~~i~~ 432 (555) T protein:vir:17 368 RQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQ-----------L-PKDL---VQPTVVA 432 (555) T ss_pred CCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCC-----------C-CHhh---hccceee Confidence 223345999999999999999999999997 4778888988888877533221 1 1111 1223333 Q ss_pred ccchH-HHHHHHHHHHHHHHhhhcc-chh-----HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHH Q lcl|NC_013692. 563 STAEE-DNAKVNDLTFMLQTMGPNM-DPM-----MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQA 635 (726) Q Consensus 563 ~~~~~-~~~~~~~l~~l~q~~~~~~-~~~-----~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qa 635 (726) +.... .....+.+...++.+++.. ++. +....+..+....++.-. ..++ .++..++ +++ T Consensus 433 ~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~-~ivr----s~eev~~---------~rq 498 (555) T protein:vir:17 433 GLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTL-QLIN----SPETMKQ---------LGD 498 (555) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChh-hhcC----CHHHHHH---------HHH Confidence 33222 1122222222233332221 111 112222333332222100 0000 0000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 636 QIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEA 701 (726) Q Consensus 636 q~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~ 701 (726) +.++++++++.+.+.++...+.. .. +.-.+....+++....-+++....--| .++.+ T Consensus 499 ~~~~~~~q~~~~~qa~~~~~~~~--~~----~~~~~~~~~~~~a~~~~~a~~~~~~~~---~~~~~ 555 (555) T protein:vir:17 499 QQKQDMVQASLINQAGQLAKTPM--AE----QAMQLIQQQQEGAQDAGAAESETSSAE---AQAGA 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHhhhh--hh----hHHhccccchhhhhHHHHHHhhcCCcc---cccCC Confidence 00000000000000000000000 00 000000000000000000010000000 00000 No 34 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.90 E-value=5.5e-21 Score=131.63 Aligned_cols=509 Identities=11% Similarity=0.075 Sum_probs=255.9 Q ss_pred CCCch---HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC Q lcl|NC_013692. 26 WSNAP---SLAQLKQDYQEAKQVT---DEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS 99 (726) Q Consensus 26 ~~~~~---~~~~~~~~~~~a~~~~---~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~ 99 (726) ++++. .-..+++.++..++-. .+...+..++.--|.....++ +...-+.+++++.-.+.++.+.+.|+..++. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD-NASTDYQTPWQAVGARGLNNLASKLMLALFP 79 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC-cccccccccccccHHHHHHHHHHHHHHhhcC Confidence 33322 2233444444333222 222222222222222222221 1111124688888899999999999999998 Q ss_pred CCceEEEecCCcc-------hHHH------HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEe Q lcl|NC_013692. 100 SPNIFEVNPVTWE-------DAES------ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKE 166 (726) Q Consensus 100 ~~~~~~~~p~~~~-------D~~~------A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~ 166 (726) +.+||.+.+..++ +... -...+..+...+ ..++.|..++..+++.+..||+++ |-+..... T Consensus 80 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--y~~e~~~~--- 153 (536) T protein:vir:21 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLL--YLPEPEGS--- 153 (536) T ss_pred CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeE--EEeeCCCC--- Confidence 8899998764433 1111 223455555555 467799999999999999999998 33310000 Q ss_pred cccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeec Q lcl|NC_013692. 167 QVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCD 246 (726) Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~ 246 (726) . ...++.++ T Consensus 154 ---------------------------------------------------~--------------------~~~f~~~p 162 (536) T protein:vir:21 154 ---------------------------------------------------N--------------------YNPMKLYR 162 (536) T ss_pred ---------------------------------------------------c--------------------eeeEEEEE Confidence 0 00124455 Q ss_pred hhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEE Q lcl|NC_013692. 247 YNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHE 326 (726) Q Consensus 247 p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E 326 (726) ..+|++..++..+++. ++|+..+|...|.+. |.+++. . ... .....++|+||+ T Consensus 163 l~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~-fg~~~~--~--------~~~-------------~~~~~~~v~v~~ 215 (536) T protein:vir:21 163 LSSYVVQRDAFGNVLQ---MVTRDQIAFGALPED-IRKAVE--G--------QGG-------------EKKADETIDVYT 215 (536) T ss_pred cCeEEEeeCCCCCeeE---EeeeeeccHHHHHHh-hhhhhc--c--------ccc-------------ccccccceeEEE Confidence 6778887766443333 788999999998765 322110 0 000 012235788887 Q ss_pred EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 327 YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMA 406 (726) Q Consensus 327 ~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~ 406 (726) |- +.+.+ ++...++. -..|.+++. +...|+.+.+||+++.|.+.++..||.|.+....+-.+.+|.+.+..+.+.. T Consensus 216 ~v-~~~~~-~~~~~~~~-e~~g~~v~~-~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (536) T protein:vir:21 216 HI-YLDED-SGEYLRYE-EVEGMEVQG-SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSM 291 (536) T ss_pred EE-EEecC-CCcEEEEe-ccCCeeecc-ccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 64 33322 33222222 233444433 3455667889999999999999999999999999999999999999999999 Q ss_pred hcCCCceEeecccccc-hhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcc Q lcl|NC_013692. 407 RSANGQVGVMKGALDV-TNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGA 485 (726) Q Consensus 407 ~~~~~~~~~~~gav~~-~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~ 485 (726) .+.+++++++.+.+.. .+.....+|.++...++.. .+.+......-+.....++.+.+.+....=+. +.+ ..+ T Consensus 292 ~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v---~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~-~~~ 365 (536) T protein:vir:21 292 ISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDI---SFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAV-QRT 365 (536) T ss_pred HHhcCCcccCcccccchhhhccCCCcceecCCcccc---eeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcc-cCC Confidence 9999999998777643 3334455666654333221 11112222223345556666666665543221 111 122 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeeccc Q lcl|NC_013692. 486 ALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIST 564 (726) Q Consensus 486 ~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~ 564 (726) +..-||+++....+.....+..++.+|.. .+..+..+.+.++....--+ ++..+. ..+.+..+. T Consensus 366 ~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP-----------~~p~~~----v~~~~vs~l 430 (536) T protein:vir:21 366 GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-----------ELPKEA----VEPTISTGL 430 (536) T ss_pred CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-----------CCChhh----ccceEEecH Confidence 33459999999999999999999999875 56778888888775432111 111111 222332232 Q ss_pred chHHH-HHHHHHHHHHHHhh---hc-cch-hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_013692. 565 AEEDN-AKVNDLTFMLQTMG---PN-MDP-MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIE 638 (726) Q Consensus 565 ~~~~~-~~~~~l~~l~q~~~---~~-~~~-~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e 638 (726) +...+ ...+.+....+.++ |. +.+ .+....+..+++..++. -...++. +.|.+++++|.+ T Consensus 431 ~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~-p~~~irt-------------~eev~~~r~q~~ 496 (536) T protein:vir:21 431 EAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGID-TSGILLT-------------EEQKQQKMAQQS 496 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCC-hhhhcCC-------------HHHHHHHHHHHH Confidence 22211 11122222222222 21 111 22233333333333331 0011111 001111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 639 AERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGV 686 (726) Q Consensus 639 ~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~ 686 (726) +++ +.+ .+...+..........--+.+.+...+..++- ++ T Consensus 497 ~~~-~~~--~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----~~ 536 (536) T protein:vir:21 497 MQM-GMD--NGAAALAQGMAAQATASPEAMAAAADSVGLQP-----GI 536 (536) T ss_pred HHH-HHH--HHHHHHHHHHHHHHhcChhhHHhhhhccccCC-----CC Confidence 000 000 00000000000000000001111111111100 00 No 35 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.90 E-value=4.9e-21 Score=131.93 Aligned_cols=509 Identities=11% Similarity=0.059 Sum_probs=256.0 Q ss_pred CCCch---HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC Q lcl|NC_013692. 26 WSNAP---SLAQLKQDYQEAKQV---TDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS 99 (726) Q Consensus 26 ~~~~~---~~~~~~~~~~~a~~~---~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~ 99 (726) ++++. .-..+++.++..++- +.+...+..++.--|.....++ +...-+.++++..-.+.++.+.+.|+..++. T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~-~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD-NASTDYQTPWQAVGARGLNNLASKLMLALFP 79 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC-cccccccccccccHHHHHHHHHHHHHhhhcC Confidence 33322 222344444432222 2222222222222222222222 1111224688888899999999999999998 Q ss_pred CCceEEEecCCcc-------hHHH------HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEe Q lcl|NC_013692. 100 SPNIFEVNPVTWE-------DAES------ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKE 166 (726) Q Consensus 100 ~~~~~~~~p~~~~-------D~~~------A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~ 166 (726) +.+||.+.+..++ +... -...+..+...+ ..++.|..++..+++.+..||+++ |-+..... T Consensus 80 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--y~~e~~~~--- 153 (536) T protein:vir:10 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLL--YLPEPEGS--- 153 (536) T ss_pred CCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeE--EEeeCCCC--- Confidence 8899998764433 1111 223455555555 467799999999999999999998 33310000 Q ss_pred cccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeec Q lcl|NC_013692. 167 QVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCD 246 (726) Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~ 246 (726) . ...++.++ T Consensus 154 ---------------------------------------------------~--------------------~~~~~~~p 162 (536) T protein:vir:10 154 ---------------------------------------------------N--------------------YNPMKLYR 162 (536) T ss_pred ---------------------------------------------------c--------------------eeeEEEEE Confidence 0 00124455 Q ss_pred hhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEE Q lcl|NC_013692. 247 YNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHE 326 (726) Q Consensus 247 p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E 326 (726) ..+|++..++..+++. ++|+..+|...|.+. |.+++ +. .. ......++|+||+ T Consensus 163 l~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~-fg~~~--~~--------~~-------------~~~~~~~~v~v~~ 215 (536) T protein:vir:10 163 LSSYVVQRDAFGNVLQ---MVTRDQIAFGALPED-IRKAV--EG--------QG-------------GEKKADETIDVYT 215 (536) T ss_pred cCeEEEeeCCCCCeeE---EeeeeeccHHHHHHh-hhhhh--cc--------cc-------------cccCcccceEEEE Confidence 6778887766443433 688999999998765 32211 00 00 0011235788888 Q ss_pred EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 327 YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMA 406 (726) Q Consensus 327 ~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~ 406 (726) |-.+ ..+ ++...++ ....|..++. ....|+.+.+||++..|.+.++..||.|.+....+-.+.+|.+.+..+.+.. T Consensus 216 ~V~~-~~~-~~~~~~~-~e~~g~~v~~-~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (536) T protein:vir:10 216 HIYL-DEA-SGEYLRY-EEVEGMEVQG-SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSM 291 (536) T ss_pred EEEE-ecC-CCcEEEE-EeecCccccc-cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 7433 222 2222222 2334444443 3455566889999999999999999999999999999999999999999999 Q ss_pred hcCCCceEeecccccc-hhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcc Q lcl|NC_013692. 407 RSANGQVGVMKGALDV-TNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGA 485 (726) Q Consensus 407 ~~~~~~~~~~~gav~~-~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~ 485 (726) .+.+++++++.+.+.. .+.....+|.++...++.. .+.+......-+.....++.+.+.+....=+. +.+ ..+ T Consensus 292 ~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v---~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~-~~~ 365 (536) T protein:vir:10 292 ISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDI---SFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAV-QRT 365 (536) T ss_pred HHhcCCcccCcccccchhhhccCCCcceecCCcccc---eeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcc-cCC Confidence 9999999998776643 3334455666654333221 11112222223345556666666665543221 111 122 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeeccc Q lcl|NC_013692. 486 ALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIST 564 (726) Q Consensus 486 ~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~ 564 (726) +..-||+++....+.....+..++.+|.. .+..+..+.+.++....--+ ++..+. ..+.+..+. T Consensus 366 ~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP-----------~~p~~~----v~~~~vs~l 430 (536) T protein:vir:10 366 GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-----------ELPKEA----VEPTISTGL 430 (536) T ss_pred CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCC-----------CCChhh----ccceEEecH Confidence 33459999999999999999999999875 56778888888775432111 111111 222332232 Q ss_pred chHHH-HHHHHHHHHHHHhh---hcc-ch-hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_013692. 565 AEEDN-AKVNDLTFMLQTMG---PNM-DP-MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIE 638 (726) Q Consensus 565 ~~~~~-~~~~~l~~l~q~~~---~~~-~~-~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e 638 (726) +...+ ...+.+....+.++ |.. .+ .+....+..+++..++. -...++. +.|.+++++|.+ T Consensus 431 ~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~-p~~~irt-------------~eev~~~r~q~~ 496 (536) T protein:vir:10 431 EAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGID-TSGILLT-------------EEQKQQKMAQQS 496 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCC-chhhcCC-------------HHHHHHHHHHHH Confidence 22211 11122222222222 211 11 12233333333333331 0011111 001111111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 639 AERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGV 686 (726) Q Consensus 639 ~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~ 686 (726) +++ +.+++ ..............--+.+.+...+..++- ++ T Consensus 497 ~~~-~~~~~--a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----~~ 536 (536) T protein:vir:10 497 MQM-GMDNG--AAALAQGMAAQATASPEAMAAAADSVGLQP-----GI 536 (536) T ss_pred HHH-HHHHH--HHHHHHHHHHHHhcCchhHHhhhhccccCC-----CC Confidence 000 00000 000000000000000001111111111110 00 No 36 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.88 E-value=5.4e-21 Score=131.72 Aligned_cols=494 Identities=10% Similarity=0.027 Sum_probs=246.6 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRK 84 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~ 84 (726) |.+-+..+.+ .+++.++..++-.+.. ...|.++|. ....++... .-+.+++++.-.+ T Consensus 1 ~~~~~~~~~~-------------~~~~r~~~l~~~R~~~---e~~w~e~~~y~lP~~~~~~~~~~~-~~~~~~~dst~~~ 63 (522) T protein:vir:94 1 MAEREGFAAE-------------GAKAVYDRLKNGRQPY---ETRAQNCAAVTIPSLFPKESDNSS-TEYTTPWQAVGAR 63 (522) T ss_pred CcccchhhHH-------------HHHHHHHHHHHHhhHH---HHHHHHHHHHhcccccCCCCCccc-ccccccccccHHH Confidence 4443333222 2233333222222111 233444442 212222111 1123578888889 Q ss_pred HHHHHHHHHHHhhcCCCceEEEecCCc-------chHHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 85 QAEWRYSSLSEPFLSSPNIFEVNPVTW-------EDAESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 85 ~v~~~~~~L~~~f~~~~~~~~~~p~~~-------~D~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) .++.+.+.|+..++.+.+||.+.+..+ ++...+ ...+..+...| ..++.|..++..+++.+..||| T Consensus 64 a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a 142 (522) T protein:vir:94 64 CLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYM-ETNSFRVPLFEALKQLIVSGNC 142 (522) T ss_pred HHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCcE Confidence 999999999999998889999986432 222222 33444454444 4677999999999999999999 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) ++ |++.... |.+ T Consensus 143 ~l--~~~~~~~------------------------------------------------------~~~------------ 154 (522) T protein:vir:94 143 LL--YIPEPEQ------------------------------------------------------GTY------------ 154 (522) T ss_pred eE--eeeccCC------------------------------------------------------Cce------------ Confidence 97 5541000 000 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) ..++.++..+|++..++..+++ =++++..++.+.|-. ++-.. . . . T Consensus 155 --------~~~~~~pl~~y~v~~d~~G~vd---~i~r~~~~~~~~l~~-----~~~~~-------~-----~-------~ 199 (522) T protein:vir:94 155 --------SPMRMYRLVSYVVQRDAFGNIL---QIVTIDKVAFSALPE-----DVKSQ-------L-----N-------A 199 (522) T ss_pred --------eeEEEEEcceEEEeeCCCcCeE---EEeeeeeccHHhcch-----HHHHH-------H-----h-------c Confidence 0123344556676655433222 245666777766421 11100 0 0 0 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQ 391 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 391 (726) . .....++|.||++..+- +++.. ++. -..|..+ .-.++-|+...+||++..|.+.++..||.|.+....+-. T Consensus 200 ~--~~~p~~~v~v~~~v~~~---~~~~~-~~~-~~~g~~~-~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~ 271 (522) T protein:vir:94 200 D--DYEPDTELEVYTHIYRQ---DDEYL-RYE-EVEGIEV-TGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDL 271 (522) T ss_pred c--cCCccceEEEEEEEEee---CCcee-EEe-eccCcee-cccCCCCccccCCceeeeeeecCCCccccchHHHHHHHH Confidence 0 01224679999886652 23332 222 1223333 223344566889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEeecccccch-hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHH Q lcl|NC_013692. 392 RIIGAVTRGMIDTMARSANGQVGVMKGALDVT-NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAE 470 (726) Q Consensus 392 ~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~-d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e 470 (726) +.+|.+.+..+.++..+.+|+++++.+.+... +.....+|.++.-.++. ..+.+...+...+.....++.+.+.+. T Consensus 272 k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v~g~~~~---v~~~~~~~~~~~~~~~~~i~~~~~rI~ 348 (522) T protein:vir:94 272 NSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVED---INFLQLTKGQDFTIAKSVADAIEQRLG 348 (522) T ss_pred HHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCceeecCCccc---ceeeecccccchhHHHHHHHHHHHHHH Confidence 99999999999999999999999987765433 33334445443322211 111111222223445566666666666 Q ss_pred HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecch Q lcl|NC_013692. 471 SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRR 549 (726) Q Consensus 471 ~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~ 549 (726) ..--+. +.+. -++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..--+. +. T Consensus 349 ~af~~~--~~~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~-----------~p- 413 (522) T protein:vir:94 349 WAFLLN--SAVQ-RNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPD-----------LP- 413 (522) T ss_pred HHHhhh--hhcc-CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-----------CC- Confidence 544222 1221 1223469999999999999999999999875 667888888888766433221 11 Q ss_pred hhcccccceeeecccchHHHH-HHHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_013692. 550 DDLAGNFDLKLDISTAEEDNA-KVNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQ 623 (726) Q Consensus 550 ~~~~~~~dv~i~~~~~~~~~~-~~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~q 623 (726) . ..+.+++..+.+...+. -.+.+...++.++...|. .+...++..++...+++- ...++.. + T Consensus 414 ~---~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~-~~ivr~~--------e 481 (522) T protein:vir:94 414 K---EAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDT-AGLLLTQ--------D 481 (522) T ss_pred c---ccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCCh-hhccCCH--------H Confidence 1 11334444433322221 112222222222221111 122223333333333310 0111100 0 Q ss_pred hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQI-EAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESG 685 (726) Q Consensus 624 q~~q~e~q~~qaq~-e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~ 685 (726) |.+++.+|. +++.+++.. ..+.+...+. +..+.. +.+. ++ T Consensus 482 -----e~~~~~~q~~~~~~~~~~~-~~~~~~~~a~----------~~~~~~-~~~~-----~~ 522 (522) T protein:vir:94 482 -----EKIQRMAEQSSQQAVVQGA-SAAGANMGAA----------VGQGAG-EDMA-----QA 522 (522) T ss_pred -----HHHHHHHHHHHHHHHHHHH-HHHHHHhhhh----------hhcccc-hhhh-----cC Confidence 000000000 000000000 0000000000 000000 0000 00 No 37 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.87 E-value=1.6e-19 Score=123.65 Aligned_cols=503 Identities=10% Similarity=0.031 Sum_probs=248.5 Q ss_pred CccchhcCCCCCCchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhccCCCCCCCCCCCC---CcCCCHHHHHHHHHH Q lcl|NC_013692. 16 GDPSKRLQPEWSNAPSLAQLKQDYQEAK---QVTDEKITQINRWLDYMHVRGEGKPKTEKGK---SAVQPPTIRKQAEWR 89 (726) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr---s~~v~~~v~~~v~~~ 89 (726) +-++|+. =..+ ..+++.++..+ +.+.+...+..++.--|.+..++ ..|. .++++..-.+.++.+ T Consensus 1 m~~~~~~---~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~----~~~~~~~~~~~dst~~~a~~~L 70 (532) T protein:vir:99 1 MAEVEKT---GFAA---DGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT----ADGSTSYTTPWQSIGARGLNNL 70 (532) T ss_pred Ccchhhc---cccH---HHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCC----CcchhhccccccchHHHHHHHH Confidence 2222211 0011 22334444333 23333333344444444333322 2332 478888889999999 Q ss_pred HHHHHHhhcC-CCceEEEecCCcch-------HHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEE Q lcl|NC_013692. 90 YSSLSEPFLS-SPNIFEVNPVTWED-------AESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKV 155 (726) Q Consensus 90 ~~~L~~~f~~-~~~~~~~~p~~~~D-------~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~ 155 (726) .+.|+..+|+ +.+||.+.+--++- .+.+ ...+..+...| ..++.|..++..+++.+..||+++=+ T Consensus 71 Aa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~ 149 (532) T protein:vir:99 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-ESNSFRPTLHAAIKQLLVAGNVLLYI 149 (532) T ss_pred HHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEe Confidence 9999999998 68999998843221 1111 13344444555 46889999999999999999999955 Q ss_pred eeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccce Q lcl|NC_013692. 156 GWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREET 235 (726) Q Consensus 156 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 235 (726) .++.... T Consensus 150 ~~~~~~~------------------------------------------------------------------------- 156 (532) T protein:vir:99 150 PSTEQVE------------------------------------------------------------------------- 156 (532) T ss_pred ccccccc------------------------------------------------------------------------- Confidence 4431000 Q ss_pred eeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccC Q lcl|NC_013692. 236 VENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQ 315 (726) Q Consensus 236 ~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (726) .....++.++..+|++..++..++++ ++++..++.+.| ++++-.. ... .... T Consensus 157 -~~~~~f~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l-----~e~~~~~-------------~~~------~~~~ 208 (532) T protein:vir:99 157 -GQSNAPKLYKLHNFVVERDAYDNVLQ---IVTEDKIARAAL-----PEDVRKS-------------LED------AQGD 208 (532) T ss_pred -CcccceEEEEcCeEEEeeCCCCCeee---EeeeeeecHHhc-----ChHHHHH-------------hhc------cccc Confidence 00012344556677776665433332 566667777664 1111000 000 0011 Q ss_pred CcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHH Q lcl|NC_013692. 316 DKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIG 395 (726) Q Consensus 316 ~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N 395 (726) ..+..+|.||++..+. .++.... +++ ...|..+ .-.++-|++..+||++..|...++..||.|.+....+-.+.+| T Consensus 209 ~~p~~~v~v~~~v~~~-~~~~~~~-~~~-~~~g~~~-~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~ 284 (532) T protein:vir:99 209 QNPSEEVTIYTHVYRD-PEAMVFR-SYQ-EIDGEIV-AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLE 284 (532) T ss_pred cCCCcceEEEEEEEec-CCCCeeE-EEE-eecCcee-cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 1234568999876543 2222222 222 2234332 2234445567899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCCceEeecccccc-hhhhhhcCCceEeecCccchhhhcccccC--ccchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 396 AVTRGMIDTMARSANGQVGVMKGALDV-TNRRRFDRGENYEFNPGADPRAAVHMHTF--PEIPQSAQYMINLQQAEAESM 472 (726) Q Consensus 396 ~~~~~~~d~l~~~~~~~~~~~~gav~~-~d~~~~~~g~vi~~~~~~~~~~~i~~~~~--~~~~~~~~~ll~~~~~~~e~~ 472 (726) .+.+..+.+...+.+|+++++.+.+.. .+.....+|.++.-.++ .+.+.+. ..--+.....++.+.+.+... T Consensus 285 ~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~-----~i~~~~~~~~~~~~~~~~~i~~~~~rI~~a 359 (532) T protein:vir:99 285 NLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQ-----DVEVFQLEKYNDFQVAKATADDIEKRLSYA 359 (532) T ss_pred HHHHHHHHHHHHHcCCCceeccccccchhhhccCCCcceecCCcc-----cceeeecccccchhHHHHHHHHHHHHHHHH Confidence 999999999999999999998776543 33334455555432221 2332221 112233444555555544432 Q ss_pred hchHHHhhccC-cccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchh Q lcl|NC_013692. 473 TGVKAFNAGIS-GAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRD 550 (726) Q Consensus 473 tGv~~~~~G~~-~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~ 550 (726) - +..++. .++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..- ++.-|. T Consensus 360 f----~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~------------lP~~p~ 423 (532) T protein:vir:99 360 F----MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSK------------IPNLPK 423 (532) T ss_pred H----hhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC------------CCCCCh Confidence 2 111221 2233359999999999999999999999875 667888888888765322 111222 Q ss_pred hcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhH----HHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHH Q lcl|NC_013692. 551 DLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMM----AQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKA 626 (726) Q Consensus 551 ~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~----~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~ 626 (726) +..+- ++...++.-. ..+....+...++.+++-.++.. ....+..++...+++- ...++. ++ T Consensus 424 ~~~~~-~iv~~is~La-raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~-~~i~r~--------~e--- 489 (532) T protein:vir:99 424 EAVEP-AIATGLEALG-RGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDT-TGLILT--------QQ--- 489 (532) T ss_pred hhccc-ceeecchHHH-HHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCCh-hhccCC--------HH--- Confidence 22221 2222222111 11122233333333333222221 1222222222222210 000100 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 627 QLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQA 689 (726) Q Consensus 627 q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~ 689 (726) +.+.++++.+++++ +++ ...++.....++. ....+++.++-.+ T Consensus 490 --e~~~~~~q~~~~~~--~~~----a~~~~~~~~~~~~------------~~~~~~~~~~~~~ 532 (532) T protein:vir:99 490 --DKQAKMAEASTAAG--MVT----AGQQMGAAGGQAA------------AAMMQQQAGMPTQ 532 (532) T ss_pred --HHHHHHHHHHHHHH--HHH----HHHHHHHHHHHhc------------chhHHhhcCCCCC Confidence 00000000000000 000 0000000000000 0000111110000 No 38 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.87 E-value=9.7e-20 Score=124.81 Aligned_cols=493 Identities=10% Similarity=0.047 Sum_probs=248.3 Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHhccCCCCCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhhcC-CCceEE Q lcl|NC_013692. 33 AQLKQDYQEAKQVTDEKI---TQINRWLDYMHVRGEGKPKTEKGK---SAVQPPTIRKQAEWRYSSLSEPFLS-SPNIFE 105 (726) Q Consensus 33 ~~~~~~~~~a~~~~~~~~---~~~~~~~~~y~~~~~~~~~~~~gr---s~~v~~~v~~~v~~~~~~L~~~f~~-~~~~~~ 105 (726) =.+++.++..++..+... .+..++..-|.+...+. ...|+ -+++++.-.+.++.+.+.|+..+|+ +.+||. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~--~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDIS--SRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCC--CCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 224445544444333322 23333343343333221 12222 3578888899999999999999998 588999 Q ss_pred EecCCcchH------------HHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccccccc Q lcl|NC_013692. 106 VNPVTWEDA------------ESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEM 173 (726) Q Consensus 106 ~~p~~~~D~------------~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~ 173 (726) +.+-.++.. ..-...+..+...+ ..++.|..++..+++.+..||+++ |-+.. T Consensus 79 l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--y~~~~------------- 142 (522) T protein:vir:10 79 LQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYI-AASNDRVAVHQALKHLIVGGNALI--FMGKD------------- 142 (522) T ss_pred ccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCceeE--EEcCC------------- Confidence 987443211 11223444555555 478899999999999999999997 33200 Q ss_pred CCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeC Q lcl|NC_013692. 174 MPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVID 253 (726) Q Consensus 174 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~d 253 (726) . ++.++..+|++. T Consensus 143 -----------------------~--------------------------------------------~~~~pl~~y~v~ 155 (522) T protein:vir:10 143 -----------------------G--------------------------------------------LKTFPLTRYVIN 155 (522) T ss_pred -----------------------C--------------------------------------------ceEEEcceEEEe Confidence 0 012334567887 Q ss_pred CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeec Q lcl|NC_013692. 254 PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI 333 (726) Q Consensus 254 p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~ 333 (726) .++..+++ -++++.++|...|.+. |..+ .+.. . ... ......+|+||+|.... . T Consensus 156 ~d~~G~vd---~i~r~~~~t~~ql~~~-fg~~--~~~~-~---------~~~---------~~~~~~~v~v~~~v~p~-~ 209 (522) T protein:vir:10 156 RDGDGNVL---EIVTKELISRKVLDIE-LPEP--KPNT-G---------IDE---------SSTTNDDVTIYTYVKLD-K 209 (522) T ss_pred eCCCCCee---EEEeeeeccHHHHHHh-cchh--ccch-h---------hhc---------ccCCCCceEEEEEEEee-c Confidence 76644333 3789999999998776 3221 1100 0 000 01124568898875432 2 Q ss_pred CCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_013692. 334 HGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQV 413 (726) Q Consensus 334 ~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~ 413 (726) + .+...++ ....+.++....+-++++.+||++..|...++..||+|.+....+-.+.+|.+.+..+.++..+.+|.+ T Consensus 210 ~-~~~~~~~--~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~ 286 (522) T protein:vir:10 210 S-SGRWVWH--QEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVF 286 (522) T ss_pred c-CCceEEE--EccCCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Confidence 2 2221222 223333443333434557799999999999999999999999999999999999999999999999999 Q ss_pred Eeecccccchhhh-hhcCCceEeecCccchhhhcccccC--ccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhh Q lcl|NC_013692. 414 GVMKGALDVTNRR-RFDRGENYEFNPGADPRAAVHMHTF--PEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDT 490 (726) Q Consensus 414 ~~~~gav~~~d~~-~~~~g~vi~~~~~~~~~~~i~~~~~--~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~t 490 (726) +++.+.+...... ....|.++ +|.. ..+.+.+. ....+.+...++.+.+.+... ++++...++..-| T Consensus 287 lv~~~~~~~~~~l~~~~~~~~v---~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~a-----Fl~~~~~d~~rvT 356 (522) T protein:vir:10 287 LVSPSSTTKPATIAKAGNGAIV---QGRP--EDVAVIQVGKTADFSTAANMATAIEKRLLEA-----FLVMNVRNAERVT 356 (522) T ss_pred eeccccccccccccCCCCccee---cCCC--ccceeecccccccchHHHHHHHHHHHHHHHH-----HhhccCCCCCCCC Confidence 9976665433332 22223322 2221 12222221 112233344555555555432 3334333444459 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHH Q lcl|NC_013692. 491 ATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDN 569 (726) Q Consensus 491 a~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~ 569 (726) |+++....+.....+..++.+|.. .+..++.+.+.++.+..- +..-|.+......++-....+ +. T Consensus 357 AtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~------------lP~~p~~~~~~~~v~~is~La--ra 422 (522) T protein:vir:10 357 AEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQ------------IPKLPKDIVRPTIVAGVNALG--RG 422 (522) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC------------CCCCCccccccccccchhHHH--HH Confidence 999999999999999999999864 667788888887765321 111222221111111111111 11 Q ss_pred HHHHHHHHHHHHhhhcc-chh-----HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 570 AKVNDLTFMLQTMGPNM-DPM-----MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERAR 643 (726) Q Consensus 570 ~~~~~l~~l~q~~~~~~-~~~-----~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq 643 (726) +..+.+....+.++... ++. +....+..++...+++-. ..++. ++ +.+..+.+.++++++ T Consensus 423 q~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~-~ivrt----~e---------ev~~~~q~~q~~~~~ 488 (522) T protein:vir:10 423 QDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVL-NLVKT----EQ---------QLAEEQQAAQQQAAQ 488 (522) T ss_pred HHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChh-hhcCC----HH---------HHHHHHHHHHHHHHH Confidence 22223333333332211 111 122233344444443210 00100 00 000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 644 AAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQA 689 (726) Q Consensus 644 ~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~ 689 (726) ++.+.+..+.....+..- +...+...+.+.-.++ T Consensus 489 ~~~~~~a~~~~~~~~~~~------------~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 489 QSLVDQAGQMTGSPLMDP------------TKNPQLMDEEQPPMEE 522 (522) T ss_pred HHHHHHHHHHhcccccCc------------cccHHHHHHhCCCCCC Confidence 000000000000000000 0000000111100000 No 39 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.86 E-value=3.2e-19 Score=121.96 Aligned_cols=516 Identities=11% Similarity=0.057 Sum_probs=249.0 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEK---ITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAE 87 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~---~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~ 87 (726) |++.+-+. ..+. .+++.++..++..+.. ..+..++.--|....+++.... -+.+++++.-.+.++ T Consensus 1 ~~~~~~~~--------~~~~---~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~-~~~~~~dst~~~a~~ 68 (543) T protein:vir:88 1 MAETKREG--------LAEE---GAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSST-DYTTPWQAVGARGLN 68 (543) T ss_pred CcccccCc--------chHH---HHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccc-cccccccchHHHHHH Confidence 33322111 1122 2334444433332222 2222222222222222111111 123578888899999 Q ss_pred HHHHHHHHhhcCCCceEEEecCCcc---------hHHHH----HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEE Q lcl|NC_013692. 88 WRYSSLSEPFLSSPNIFEVNPVTWE---------DAESA----RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVK 154 (726) Q Consensus 88 ~~~~~L~~~f~~~~~~~~~~p~~~~---------D~~~A----~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k 154 (726) .+.+.|+..++.+.+||.+.+-... +...+ ...+..+...| ..++.|..++..+++.+..|||++ T Consensus 69 ~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l- 146 (543) T protein:vir:88 69 NLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYM-EANSYRVTLFELIRQLALAGTALI- 146 (543) T ss_pred HHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCceee- Confidence 9999999999998999999773211 11111 22344454444 467799999999999999999998 Q ss_pred EeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccc Q lcl|NC_013692. 155 VGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREE 234 (726) Q Consensus 155 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 234 (726) |-+...++ +.. T Consensus 147 -y~~~~~~~-----------------------------------------------------~~~--------------- 157 (543) T protein:vir:88 147 -YLPPPDAS-----------------------------------------------------SNS--------------- 157 (543) T ss_pred -eeccCccc-----------------------------------------------------cce--------------- Confidence 32211000 000 Q ss_pred eeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccccccc Q lcl|NC_013692. 235 TVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDF 314 (726) Q Consensus 235 ~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (726) + -.+..++..+|++..++.. .-.-++++..+|...|... +..++ .... T Consensus 158 --~--~~~~~~pl~~y~v~~d~~G---~v~~i~r~~~~~~~~l~~~-~~~~v----------~~~~-------------- 205 (543) T protein:vir:88 158 --Y--NPMKLYTLHNHVVQRDAFG---NVLQIVTLDKVAYAALPED-VRNSL----------SGGQ-------------- 205 (543) T ss_pred --e--cceEEeEcceEEEeeCCCC---CeeeeeeeeeccHHHHhHH-hhHHH----------HHHh-------------- Confidence 0 0012344456666555432 2234567788888887433 11111 0000 Q ss_pred CCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHH Q lcl|NC_013692. 315 QDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRII 394 (726) Q Consensus 315 ~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~ 394 (726) ...+.++|.||++-.+. .+++.. .++ . .+.+..+....+-|+.+.+||++..|...++..||+|.+....+-.+.+ T Consensus 206 ~~~p~~~~~v~~~V~pr-~~~~~~-~~~-~-~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L 281 (543) T protein:vir:88 206 EYKPEQELEVYTHIYID-DESGDF-LSY-Q-EIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSL 281 (543) T ss_pred hcCCccceEEEEEEEee-cCCCcc-ccc-c-cccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHH Confidence 01123578888863322 222222 111 1 2234445445566777889999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCceEeecccccchhh-hhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 395 GAVTRGMIDTMARSANGQVGVMKGALDVTNR-RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMT 473 (726) Q Consensus 395 N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~-~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~t 473 (726) |.+.+..+.++....+|+++++.+.+..... ....+|.++.-.++. . ..++.. .+...+.....++.+.+.+...- T Consensus 282 ~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g~~v~g~~~~-v-~~~~~~-~~~~~~~~~~~i~~~~~rI~~af 358 (543) T protein:vir:88 282 ESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVAGRKAD-I-EFLQLE-KTADFTVAKSVADAIEARLSYVF 358 (543) T ss_pred HHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCceeecCCCCc-c-eeeecc-cccchhHHHHHHHHHHHHHHHHH Confidence 9999999999999999999997776543332 222233332222211 1 111122 22233445666666666665543 Q ss_pred chHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhc Q lcl|NC_013692. 474 GVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDL 552 (726) Q Consensus 474 Gv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~ 552 (726) =+ +.+... ++..-||+++....+.....+..++.+|.. .+..++.+.+.++.+..--+. +..+ T Consensus 359 ~~-~~~~~~--~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~-----------~p~~-- 422 (543) T protein:vir:88 359 ML-NSAVQR--SGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPN-----------LPQE-- 422 (543) T ss_pred hh-hhhccC--CCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-----------Cchh-- Confidence 22 211112 222359999999999999999999999875 667888888888776433221 1111 Q ss_pred ccccceeeecccchHHH-HHHHHHHHHHHHhhhccchhH-----HHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHH Q lcl|NC_013692. 553 AGNFDLKLDISTAEEDN-AKVNDLTFMLQTMGPNMDPMM-----AQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKA 626 (726) Q Consensus 553 ~~~~dv~i~~~~~~~~~-~~~~~l~~l~q~~~~~~~~~~-----~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~ 626 (726) ...+++..+.+...+ ...+.+...++.++...++.. ....+..+....+++- ...++. +++.+ T Consensus 423 --~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~-~~i~r~--------~~e~~ 491 (543) T protein:vir:88 423 --AVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDT-AGLLLT--------EAEKA 491 (543) T ss_pred --ceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCCh-hhhcCC--------HHHHH Confidence 123333333222222 222233333333332222221 2222333333333310 011111 00000 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 627 QLELMLLQAQIEAERA-RAAHYMSGAGLQ--DSKVGTEQAKARALASQADMTDLNFLEQESGVQQA 689 (726) Q Consensus 627 q~e~q~~qaq~e~~~a-q~q~~~~~~~~~--~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~ 689 (726) + +++|.+.+++ +++...+..... .....+ +. +++-.++..+..= +..+- T Consensus 492 ~-----~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~p------~~~~~ 543 (543) T protein:vir:88 492 Q-----AQSQEMLKQGGLNAAAGIGSGVAAQATASPE--AM-ESAMDTAGVQPGP------IATQV 543 (543) T ss_pred H-----HHHHHHHHHHHHHHHHHHhhchhhhhccChH--HH-HHHhhhcCCCCCC------CCCCC Confidence 0 0000000000 000000000000 000000 00 0000000000000 00000 No 40 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.85 E-value=1.4e-18 Score=118.47 Aligned_cols=501 Identities=11% Similarity=0.055 Sum_probs=244.0 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHH Q lcl|NC_013692. 26 WSNAPSLAQLKQDYQEAKQVTDEKITQI----NRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSE 95 (726) Q Consensus 26 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~----~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~ 95 (726) ++.+.....+..+ .++...+..-++| ..|.++|. ....++ +......++++..-.+.++.+.+.|+. T Consensus 1 ~~~~~~~~~~~~~--~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~-~~~~~~~~~~dst~~~a~~~Laa~l~~ 77 (535) T protein:vir:94 1 MASSQKREGFAEN--GAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSD-NASTDYTTPWQAVGARGLNNLASKLML 77 (535) T ss_pred CCchhhhhhHHHH--HHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC-ccccccCCcccccHHHHHHHHHHHHHh Confidence 3333332222221 1223333333333 33444442 112222 111223567888889999999999999 Q ss_pred hhcCCCceEEEecCCcc-------hHHH------HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeee Q lcl|NC_013692. 96 PFLSSPNIFEVNPVTWE-------DAES------ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSR 162 (726) Q Consensus 96 ~f~~~~~~~~~~p~~~~-------D~~~------A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~ 162 (726) .+|.+.+||.+.+-... +.+. -...+..+...| ..++.+..++..+++.+..||+++-+.++.... T Consensus 78 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 156 (535) T protein:vir:94 78 ALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI-ESNSYRVTLFETLKQLVVAGNALLYIPEPEGTY 156 (535) T ss_pred hhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcc Confidence 99998999998773211 1111 122233333334 578899999999999999999999654431100 Q ss_pred eEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeecccee Q lcl|NC_013692. 163 TVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTV 242 (726) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i 242 (726) ..+ T Consensus 157 -----------------------------------------------------------------------------~~f 159 (535) T protein:vir:94 157 -----------------------------------------------------------------------------NPM 159 (535) T ss_pred -----------------------------------------------------------------------------cce Confidence 011 Q ss_pred eeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceE Q lcl|NC_013692. 243 QVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRL 322 (726) Q Consensus 243 ~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 322 (726) +.++..+|++..++...++ =|+++..++++.|-.. +.+.+ .... .......| T Consensus 160 ~~~pl~~y~v~~d~~G~vd---~i~r~~~~~~~~l~~~-~~~~~----------~~~~--------------~~~~~~~v 211 (535) T protein:vir:94 160 KLYRLSSYVVQRDAFGTVL---QIVTLDKTAYAALPED-VRNSM----------DSSQ--------------EHKGDEMI 211 (535) T ss_pred EEEEcCeEEEeeCCCCCeE---EEEeeeeccHHHhhHH-HHHHH----------Hhcc--------------ccCCCcee Confidence 2233445666554433222 2456777888776332 11100 0000 01234568 Q ss_pred EEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 323 VVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMI 402 (726) Q Consensus 323 ~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~ 402 (726) .||+|. +.+.+ ++...++ ..+.+..+.-..+.++.+.+||++..|...++..||+|.+....+-.+.+|.+.+..+ T Consensus 212 ~v~~~v-~~~~~-~~~~~~~--~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 287 (535) T protein:vir:94 212 DVYTHI-YLDEE-SGEYLKY--EEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 (535) T ss_pred EEEEEE-EeeCC-CCcEEEE--EEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 888874 33322 2222222 2334433332334455588999999999999999999999999999999999999999 Q ss_pred HHHHhcCCCceEeecccccchh-hhhhcCCceEeecCccchhhhcccccCc--cchhHHHHHHHHHHHHHHHHhchHHHh Q lcl|NC_013692. 403 DTMARSANGQVGVMKGALDVTN-RRRFDRGENYEFNPGADPRAAVHMHTFP--EIPQSAQYMINLQQAEAESMTGVKAFN 479 (726) Q Consensus 403 d~l~~~~~~~~~~~~gav~~~d-~~~~~~g~vi~~~~~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~e~~tGv~~~~ 479 (726) .+...+.++.++++.+.+...+ .....+|.++...++ .+.+.+.. ...+....+++.+.+.+...- .. T Consensus 288 ~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~-----~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af----~~ 358 (535) T protein:vir:94 288 KMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPE-----DISFLQLEKAADFSVARAVSEQIEGRLSYAF----ML 358 (535) T ss_pred HHHHHhccCCcccccccccchhhcccCCCceeecCCcc-----cceeeecccccchhHHHHHHHHHHHHHHHHH----hH Confidence 9999999999999876653333 333445555433222 12222221 122334444555444444322 11 Q ss_pred hccC-cccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccc Q lcl|NC_013692. 480 AGIS-GAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFD 557 (726) Q Consensus 480 ~G~~-~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~d 557 (726) .++. .++..-||+++....+.....+..++.+|.. .+..+..+.+.++.+..--+. -|.+. .+ T Consensus 359 ~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~------------~p~~~---v~ 423 (535) T protein:vir:94 359 NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPE------------LPKEA---VE 423 (535) T ss_pred hhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCC------------CChhh---cc Confidence 1111 2233359999999999999999999999875 667888888887765422111 11111 22 Q ss_pred eeeecccchHHH-HHHHHHHHHHHHhhhccch-----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHH Q lcl|NC_013692. 558 LKLDISTAEEDN-AKVNDLTFMLQTMGPNMDP-----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELM 631 (726) Q Consensus 558 v~i~~~~~~~~~-~~~~~l~~l~q~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q 631 (726) +.+..+.+...+ ...+.+....+.++.-.|. .+....+..+.+..+++-. ..++.. .+.+ T Consensus 424 ~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~-~i~rs~-------------eev~ 489 (535) T protein:vir:94 424 PTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTS-GILKTP-------------EEKQ 489 (535) T ss_pred ceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChh-hhcCCH-------------HHHH Confidence 333333222211 1112222222322221111 1122223333333333200 001100 0001 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 632 LLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFL 680 (726) Q Consensus 632 ~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~ 680 (726) +++++.++++++...+.+....... .. ......++....+..+.-. T Consensus 490 ~~~~q~~~~~~~~~~~~~~g~~~~~-~~--~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 490 QEMAEAAQGTAMQNAAASAGAGAGT-MA--TASPENMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhc-cc--ccChHHHHHHHHHhccCCC Confidence 1111111100000000000000000 00 0000011111111111111 No 41 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.85 E-value=6.6e-19 Score=120.26 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=244.3 Q ss_pred CCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHH Q lcl|NC_013692. 14 EDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEK---ITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRY 90 (726) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~---~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~ 90 (726) +|+ .=.-..+.|.+.++..++-.+.. ..+..++.--|.+.... ......++++..-.+.++.+. T Consensus 1 ~~~----------~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~---~~~~~~~~~dstg~~a~~~LA 67 (517) T protein:vir:10 1 MDM----------RFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVN---DDLSSQNAWQDDGASATNFLS 67 (517) T ss_pred Ccc----------cccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCC---CCccccccccchHHHHHHHHH Confidence 222 21223345566666554333332 22233333333322221 122345788889999999999 Q ss_pred HHHHHhhcC-CCceEEEecCCcchH---------H----HHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 91 SSLSEPFLS-SPNIFEVNPVTWEDA---------E----SARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 91 ~~L~~~f~~-~~~~~~~~p~~~~D~---------~----~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) +.|+..+|+ +.+||.+.+-.+... . .-...+..+...+ ..++.|..++..+++.+..||+++ | T Consensus 68 a~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--y 144 (517) T protein:vir:10 68 NKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYG-ESLQFRPAVVEAFKHLIVTGNVMM--Y 144 (517) T ss_pred HHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEE--E Confidence 999999998 679999988432211 1 1122333444444 578899999999999999999986 3 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) -+. .. T Consensus 145 ~~~-------~~-------------------------------------------------------------------- 149 (517) T protein:vir:10 145 HPD-------KT-------------------------------------------------------------------- 149 (517) T ss_pred EeC-------CC-------------------------------------------------------------------- Confidence 220 00 Q ss_pred eccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCC Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQD 316 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (726) ..++.++..+|++..++..++.+ ++++.++|..+|.+. |...... .... ... T Consensus 150 ---~~~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~~~-~~~~~~~----------~~~~-----------~~~ 201 (517) T protein:vir:10 150 ---SPIQAVPLHHYCVRRDNNGTVLD---IVFLQEKALETFEPS-IRMAIQA----------SRKG-----------KQY 201 (517) T ss_pred ---CcEEEEEcCeEEEeeCCCcCeEE---EEeeeeccHHHHHHH-hhhhcch----------hhhh-----------hcc Confidence 00122344567777666444444 577889999998665 2211100 0000 001 Q ss_pred cCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHH Q lcl|NC_013692. 317 KSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGA 396 (726) Q Consensus 317 ~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~ 396 (726) .+.+.|+||+|-.+ . .+|... ++...-|.+++. ++-|+.+.+||++..|...++..||.|.+....+--+.+|. T Consensus 202 ~~~~~v~v~~~v~~-~--~~~~~~-~~~~~d~~~~~~--~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~ 275 (517) T protein:vir:10 202 KDKDNVKLYTHAKR-T--KDGKYL-IRQSADDVPVGK--ESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQF 275 (517) T ss_pred CCcCceEEEEEEEE-e--CCCceE-EEEEeCceeecc--ccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHH Confidence 12356888886433 2 244322 222233334433 45566688999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccC--ccchhHHHHHHHHHHHHHHHHhc Q lcl|NC_013692. 397 VTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTF--PEIPQSAQYMINLQQAEAESMTG 474 (726) Q Consensus 397 ~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~--~~~~~~~~~ll~~~~~~~e~~tG 474 (726) +.+..+.+...+.+|+++++.+.+...+. ..+|+.-.+.+|. + ..+.+.+. ..-.+.....++.+.+.+...-= T Consensus 276 l~~~~~~~~~~a~~~~~lv~~~~~~~~~~--l~~~~~g~~~~g~-~-~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~ 351 (517) T protein:vir:10 276 LSEALARGMALMADVKYLVKPGSYTDINQ--FVEGGSGAVLHGV-E-GDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFM 351 (517) T ss_pred HHHHHHHHHHHhccCCcccCcccccchhh--ccCCCccccccCC-c-ccceeeecccccchhHHHHHHHHHHHHHHHHHh Confidence 99999999999999999998877644332 2333322223332 1 22333221 11223445556665555554331 Q ss_pred hHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcc Q lcl|NC_013692. 475 VKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLA 553 (726) Q Consensus 475 v~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~ 553 (726) +.. ++. .++..-||+++....+.....+...+.+|.. .+..+..+.+..+...+..+ T Consensus 352 ~~~--l~~-~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~------------------- 409 (517) T protein:vir:10 352 MEA--MTR-RDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSK------------------- 409 (517) T ss_pred hhh--hhc-cCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCC------------------- Confidence 111 111 1222359999999888888888888888875 55666666665443322111 Q ss_pred cccceeeecccchHHH----HHHHHHHHHHHHhhhccchh-----HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhh Q lcl|NC_013692. 554 GNFDLKLDISTAEEDN----AKVNDLTFMLQTMGPNMDPM-----MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQ 624 (726) Q Consensus 554 ~~~dv~i~~~~~~~~~----~~~~~l~~l~q~~~~~~~~~-----~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq 624 (726) ...+.+..+.+...+ ....++...+..+++ .++. +....+..+++..+++. ..++..+ T Consensus 410 -~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~-~~~~~~~~id~d~~~~~~a~~~Gvp~--~~irs~~--------- 476 (517) T protein:vir:10 410 -NVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQ-WPEPLQQAIKWPDFTDWVQGQISANF--PFFKTQD--------- 476 (517) T ss_pred -CccceeeccHHHHHHHHHHHHHHHHHHHHHHhhc-CChHHHhcCCHHHHHHHHHHHhCCCh--hhcCCHH--------- Confidence 111222222221111 111222222222222 1221 22233444444444431 1111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADM 674 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~ 674 (726) |.++.+++..++++....+.+......+ .++.-+...+-.+ T Consensus 477 ----ev~~~~~~~~~~~~~~~~~~~ag~~~~~-----~~~~~~~~~~~~~ 517 (517) T protein:vir:10 477 ----ELNAEAQAQQEQEATKYAAEQAGKAIPD-----MVKNGQINPQGGQ 517 (517) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCCCCCCC Confidence 0000000000000000000000000000 0000000000000 No 42 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.85 E-value=1.4e-18 Score=118.52 Aligned_cols=482 Identities=11% Similarity=0.001 Sum_probs=239.1 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCceEEEec Q lcl|NC_013692. 31 SLAQLKQDYQEA-KQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS-SPNIFEVNP 108 (726) Q Consensus 31 ~~~~~~~~~~~a-~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~-~~~~~~~~p 108 (726) .=+.+++.++-- ++...+...+..++..-|....+++.....+ -+.++..-.+.++.+.+.|+..+|+ +.+||.+.+ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~-~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~ 79 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV-EHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccccCCCCcccccc-cCcccchHHHHHHHHHHHHHHhhcCCCCcccccCC Confidence 111222222211 1222222222233333222222222111111 2467888889999999999999998 578999987 Q ss_pred CCcch-------HHHH------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 109 VTWED-------AESA------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 109 ~~~~D-------~~~A------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) -.... .+.+ ...+..+...| ..++.|..++..+++.+..|++++ |-+.. . T Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--~~~~~------~-------- 142 (510) T protein:vir:78 80 TDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLAVLTQVIKLLIVTGNALL--YRNSD------E-------- 142 (510) T ss_pred ChHHhhhcccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEE--EEeCC------C-------- Confidence 43221 1111 11333344444 467889999999999999999876 32200 0 Q ss_pred cchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCC Q lcl|NC_013692. 176 DSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPS 255 (726) Q Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~ 255 (726) . +++.++..+|++..+ T Consensus 143 ------------------------------------------~----------------------~~~~~pl~~y~v~~d 158 (510) T protein:vir:78 143 ------------------------------------------A----------------------TVVAWSLRSYAVRRD 158 (510) T ss_pred ------------------------------------------C----------------------eEEEEEcceeEEeeC Confidence 0 012234456777666 Q ss_pred CCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCC Q lcl|NC_013692. 256 CGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHG 335 (726) Q Consensus 256 a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~ 335 (726) +..++++ ++++..+|..+|.+. |.+++.. .. ....+.+.|.||++..+.+..+ T Consensus 159 ~~G~vd~---i~rr~~~t~~~l~~~-~~~~~~~----------~~-------------~~~~~~~~v~v~~~V~~~~~~~ 211 (510) T protein:vir:78 159 ATGRWMD---IVLKQRYKSKDLDDV-YKQDLMR----------AG-------------RNLSGSGSVDLYTHVQRRKGTA 211 (510) T ss_pred CCcCeeE---EEeeeeccHHHHHHH-hhHHhhh----------hh-------------hccCCCceEEEEEEEEeecCCC Confidence 5433433 678899999998664 3221100 00 0112345788999876654222 Q ss_pred CceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEe Q lcl|NC_013692. 336 DGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGV 415 (726) Q Consensus 336 ~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~ 415 (726) .....++.- ..|..++. ++-|+..++||++..|...++..||.|.+....+--+.+|.+.+..+.+...+.++.+++ T Consensus 212 ~~~~sv~~e-~dg~~i~~--~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv 288 (510) T protein:vir:78 212 MDYAEMYHE-IDGVRVGE--TGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV 288 (510) T ss_pred CcEEEEEEE-ecCeeecc--ccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc Confidence 222211211 23445544 445666889999999999999999999999999999999999999999999999999999 Q ss_pred ecccccchhhhhh-cCCceEeecCccchhhhcccccCc--cchhHHHHHHHHHHHHHHHHhchHHHhhccC-cccchhhH Q lcl|NC_013692. 416 MKGALDVTNRRRF-DRGENYEFNPGADPRAAVHMHTFP--EIPQSAQYMINLQQAEAESMTGVKAFNAGIS-GAALGDTA 491 (726) Q Consensus 416 ~~gav~~~d~~~~-~~g~vi~~~~~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~-~~~~~~ta 491 (726) +.+.+...+.... .+|.+ .+|. + ..+.+.+.. .-.+.....++.+.+.+... ++..+. .++..-|| T Consensus 289 ~p~g~~~~~~l~~~~~g~~---v~g~-~-~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~a-----F~~~l~~~~~~rvTA 358 (510) T protein:vir:78 289 DEAKGAVVDDYQDAEMGDY---VPGG-A-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQA-----FMYGANQRDAERVTA 358 (510) T ss_pred CCccccchhhhccCCCcee---ecCC-c-ccccccccCcccchHHHHHHHHHHHHHHHHH-----HhhccccCCCCCcCH Confidence 8877644333322 22333 3332 1 223333221 12233445555555555443 222222 22223499 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHH Q lcl|NC_013692. 492 TAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNA 570 (726) Q Consensus 492 ~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~ 570 (726) +++....+.....+...+.++.. .+..+..+.+.++....- ..+.++..+. ..++- ++ +....+ T Consensus 359 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl------------~p~p~~~~~~-~~v~~-is-~Laraq 423 (510) T protein:vir:78 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL------------QGLITKQHKP-AIETG-LP-ALSRSA 423 (510) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccC------------CCCCcccccc-eeeec-cc-HHHHHH Confidence 99999988888889988888875 667788888777654321 1111111111 11111 11 111112 Q ss_pred HHHHHHHHHHHhhhccc------hhHHHHHHHHHHHhhhh-hhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 571 KVNDLTFMLQTMGPNMD------PMMAQQIMGQIMELKKM-PDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERAR 643 (726) Q Consensus 571 ~~~~l~~l~q~~~~~~~------~~~~~~~~~~~~~~~~~-~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq 643 (726) ....+....+.++.-.+ ..+....+..++...++ +. ..++. +.+.+.++++..+++++ T Consensus 424 ~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~--~ivrs-------------~eev~a~~~~~~~q~~~ 488 (510) T protein:vir:78 424 AVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTS--QFYKS-------------ADELQAEAEEQRRQAAQ 488 (510) T ss_pred HHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChh--hhcCC-------------HHHHHHHHHHHHHHHHH Confidence 22222222222221111 11222233333333332 10 01100 00000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_013692. 644 AAHYMSGAGLQDSKVGTEQAKARAL-ASQADM 674 (726) Q Consensus 644 ~q~~~~~~~~~~~~~~~eqaq~~q~-~~q~~~ 674 (726) ++ +.+++.+ .++.++ ....-+ T Consensus 489 ~~-~~~~a~~---------~~~~~~~~~~~g~ 510 (510) T protein:vir:78 489 AQ-AAQETLL---------EGASDMTNALAGV 510 (510) T ss_pred HH-HHHHHHH---------HhhhhhcccCCCC Confidence 00 0000000 000000 000001 No 43 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.85 E-value=3.5e-18 Score=116.25 Aligned_cols=482 Identities=10% Similarity=0.021 Sum_probs=237.8 Q ss_pred HHHHHHHH-----HHHHHHHHHHHHHHHHHhccCCCCCCCCCCCC-CcCCCHHHHHHHHHHHHHHHHhhcC-CCceEEEe Q lcl|NC_013692. 35 LKQDYQEA-----KQVTDEKITQINRWLDYMHVRGEGKPKTEKGK-SAVQPPTIRKQAEWRYSSLSEPFLS-SPNIFEVN 107 (726) Q Consensus 35 ~~~~~~~a-----~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr-s~~v~~~v~~~v~~~~~~L~~~f~~-~~~~~~~~ 107 (726) +++++..- ++...+.-.+..++.-.|.+...++.....++ -+.++..-...++.+.+.|+..+|+ +.+||.+. T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 33333211 12222222222222222221111111111121 1345777788899999999999998 57999998 Q ss_pred cCC-------cchHHHHHH------HHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccC Q lcl|NC_013692. 108 PVT-------WEDAESARQ------NGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMM 174 (726) Q Consensus 108 p~~-------~~D~~~A~q------~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~ 174 (726) |-. .+|.+.++. .+..+...| ..++.|..++..+++.+..|++++-+ +.... T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--~~~~~------------ 145 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRRL-FVNASLSKLHRILKLLVVTGNALFYR--EPGTG------------ 145 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEE--ecCCC------------ Confidence 731 223333322 333444444 46889999999999999999998742 21000 Q ss_pred CcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCC Q lcl|NC_013692. 175 PDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDP 254 (726) Q Consensus 175 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp 254 (726) .++.++..+|++.. T Consensus 146 ------------------------------------------------------------------~~~~~pl~~y~v~~ 159 (514) T protein:vir:80 146 ------------------------------------------------------------------KMLVWTMQSYTVRR 159 (514) T ss_pred ------------------------------------------------------------------cEEEEEcCeEEEee Confidence 01123345677766 Q ss_pred CCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecC Q lcl|NC_013692. 255 SCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIH 334 (726) Q Consensus 255 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~ 334 (726) ++..++.+ ++++.++|..+|-.. +.... ... .....+..+|.||+|..+.+.. T Consensus 160 d~~G~v~~---i~rr~~~~~~~l~~~-~~~~~------------~~~-----------~~~~~~~~~v~v~~~v~~~~~~ 212 (514) T protein:vir:80 160 TSHGDPAV---VVLRQQMPFRELTPE-IQADA------------QAK-----------QIAKRDSDKCDLYTVIEWQPTP 212 (514) T ss_pred CCCcCeEE---EEeeeeecHHHhhhh-hhhhh------------hhh-----------hccCCCCCceEEEEEEEeecCC Confidence 65444443 577889999887432 10000 000 0011234468888886665433 Q ss_pred CCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|NC_013692. 335 GDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVG 414 (726) Q Consensus 335 ~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~ 414 (726) +... +.++.-..|.+++. ++-|+..++||++..|...++..||.|.+....+--+.+|.+.+..+.+...+.++.++ T Consensus 213 ~~~~-~sv~~e~~g~~i~~--es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~ 289 (514) T protein:vir:80 213 NGKR-CAVWHELEGKRVGP--ESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNL 289 (514) T ss_pred CCeE-EEEEEeccceeecc--cCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Confidence 2222 22222234455544 45566688999999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhhhhcCCceEeecCccchhhhcccccCc--cchhHHHHHHHHHHHHHHHHhchHHHhhccC-cccchhhH Q lcl|NC_013692. 415 VMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFP--EIPQSAQYMINLQQAEAESMTGVKAFNAGIS-GAALGDTA 491 (726) Q Consensus 415 ~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~-~~~~~~ta 491 (726) ++.+.+...+....-+.+.+ .+|. + ..+.+.+.. .--+..+..++.+.+.+... +++... .++..-|| T Consensus 290 v~~~g~~~~~~l~~~~~g~~--v~g~-~-~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~a-----Fml~~~~rd~~rvTA 360 (514) T protein:vir:80 290 VDEAKGGAVDDYRDAETGDF--VPGQ-V-GSVASYERGDYNKIAQASASVESIVMRLNRA-----FMYTGQVRDAERVTV 360 (514) T ss_pred eCcccccchhhhcccCCcee--ecCC-C-ccceeeecCcccchHHHHHHHHHHHHHHHHH-----HhhhccCCCCCCCCH Confidence 98877644444333332222 2322 1 223333221 11223344444444444332 122211 22223499 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHH- Q lcl|NC_013692. 492 TAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDN- 569 (726) Q Consensus 492 ~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~- 569 (726) +++....+.-...+...+.++.. .+..+..+.+.++..... ....--|... ..+.+..+.+...+ T Consensus 361 tEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~----------g~lP~~p~~l---~~~~~vs~la~l~r~ 427 (514) T protein:vir:80 361 EEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNG----------GMLLGIAQGV---YRPSIITGIPALTRN 427 (514) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhcc----------CCCCCCCchh---hcceeeecHHHHHHH Confidence 99999888888888888888875 556677776666542210 0111111111 22333332221111 Q ss_pred ---HHHHHHHHHHHHhhhccch----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 570 ---AKVNDLTFMLQTMGPNMDP----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERA 642 (726) Q Consensus 570 ---~~~~~l~~l~q~~~~~~~~----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~a 642 (726) .........++.+++..+. .+....+..++...+++... +.. .++ +. +...+ +.+++++ T Consensus 428 ~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~--i~~---~~e----~~-~~~~~----~~~~~~~ 493 (514) T protein:vir:80 428 IETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLST--LSK---DPD----VV-AAEAE----QEAALAQ 493 (514) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhh--ccC---CHH----HH-HHHHH----HHHHHHH Confidence 1222223333333332221 12233344444444443210 100 000 00 00000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 643 RAAHYMSGAGLQDSKVGTEQAKARALAS 670 (726) Q Consensus 643 q~q~~~~~~~~~~~~~~~eqaq~~q~~~ 670 (726) ++++..+....+ .+.+.-+-. T Consensus 494 ~~~~~~~~~~~~-------~~~~~~~~~ 514 (514) T protein:vir:80 494 QQLDVASGALAA-------ETSAGVLTS 514 (514) T ss_pred HHHHHHHHHHHH-------hhhccccCC Confidence 000000000000 000000001 No 44 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.85 E-value=1.6e-18 Score=118.17 Aligned_cols=480 Identities=10% Similarity=0.022 Sum_probs=237.4 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCceEEEec Q lcl|NC_013692. 31 SLAQLKQDYQEA-KQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS-SPNIFEVNP 108 (726) Q Consensus 31 ~~~~~~~~~~~a-~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~-~~~~~~~~p 108 (726) .=+.+++.++.- ++...+...+..++.--|.....++.....+ .+.++..-.+.++.+.+.|+..+|+ +.+||.+.+ T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~-~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~ 79 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV-EHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccCCCCCCcccccc-CCCccchHHHHHHHHHHHHHhhhcCCCCcccccCC Confidence 222233333321 1222222222222222222222222111111 3578888899999999999999998 578999987 Q ss_pred CCcc-------hHHHHH------HHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 109 VTWE-------DAESAR------QNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 109 ~~~~-------D~~~A~------q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) -... +...++ ..+..+...| ..++.|..++..+++.+..||+++-+ +.. T Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~Li~~G~a~l~~--~~~--------------- 141 (510) T protein:vir:63 80 TDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLAVLTQVIKLLIVTGNALLYR--DSD--------------- 141 (510) T ss_pred ChHHhhcccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEEEE--cCC--------------- Confidence 4221 111111 2334444444 46889999999999999999987732 200 Q ss_pred cchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCC Q lcl|NC_013692. 176 DSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPS 255 (726) Q Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~ 255 (726) |. ++..++..+|++..+ T Consensus 142 -----------------------------------------~~----------------------~~~~~pl~~y~v~~d 158 (510) T protein:vir:63 142 -----------------------------------------AA----------------------TVVAWSLRSYAVRRD 158 (510) T ss_pred -----------------------------------------Cc----------------------EEEEEEcceeEEeeC Confidence 00 112344556787766 Q ss_pred CCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCC Q lcl|NC_013692. 256 CGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHG 335 (726) Q Consensus 256 a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~ 335 (726) +..++++ ++++.++|..+|-+. +...+.. .. ....+.+.|.||++-.+.+. T Consensus 159 ~~G~vd~---i~rr~~~t~~~l~e~-~~~~~~~----------~~-------------~~~~~~~~v~v~~~V~~~~~-- 209 (510) T protein:vir:63 159 ATGRWMD---IVLKQRYKSKDLDEE-YKQDLMR----------AG-------------RNLSGSGSVDLYTHVQRKKG-- 209 (510) T ss_pred CCcCeeE---EEeeeeccHHHHhHH-hhhhhhc----------cc-------------cccCCCcceEEEEEEEeecC-- Confidence 6444444 578889999887332 1111000 00 01123456888887655432 Q ss_pred CceEEEEEEEE--ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_013692. 336 DGVLHPIVATW--VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQV 413 (726) Q Consensus 336 ~g~~~~~~~~~--~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~ 413 (726) . -+..+.+++ .|.++... +-|+...+||++..|...++..||.|.+....+--+.+|.+.+..+.+.....++.+ T Consensus 210 ~-~~~~~sv~~e~dg~~~~~~--~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~ 286 (510) T protein:vir:63 210 T-AMEYAELYHEIDGVRVGKE--GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) T ss_pred C-CceEEEEEEEecCceeccc--cccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 1 122223332 34455444 445568899999999999999999999999999999999999999999999999999 Q ss_pred Eeecccccchhh-hhhcCCceEeecCccchhhhcccccCc--cchhHHHHHHHHHHHHHHHHhchHHHhhccC-cccchh Q lcl|NC_013692. 414 GVMKGALDVTNR-RRFDRGENYEFNPGADPRAAVHMHTFP--EIPQSAQYMINLQQAEAESMTGVKAFNAGIS-GAALGD 489 (726) Q Consensus 414 ~~~~gav~~~d~-~~~~~g~vi~~~~~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~-~~~~~~ 489 (726) +++.+.+...+. ....+|.++ +|. + ..+.+.+.. .--+.....++.+.+.+... ++..+. .++..- T Consensus 287 lv~p~g~~~~~~~~~~~~g~~v---~g~-~-~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~a-----f~~~l~~~~~~rv 356 (510) T protein:vir:63 287 LVDEAKGAVVDDYQDAEMGDYV---PGG-A-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQA-----FMYGANQRDAERV 356 (510) T ss_pred ccCcccccchhhhccCCCceee---cCC-c-ccceeeecCcccchHHHHHHHHHHHHHHHHH-----HHhhcccCCCCCc Confidence 998777644333 333334443 332 1 223333221 11233344555555544432 122122 223234 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchH- Q lcl|NC_013692. 490 TATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEE- 567 (726) Q Consensus 490 ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~- 567 (726) ||+++....+.....+...+.++.. .+..+..+.+.++.... ...+.++..+. ..++ +.... T Consensus 357 TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g------------l~p~p~~~~~~-~~v~---~is~La 420 (510) T protein:vir:63 357 TAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------LQGLITKQHKP-AIET---GLPALS 420 (510) T ss_pred CHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc------------CCCCCchhccc-ceec---chhHHH Confidence 9999999988888889988888875 66778888777665422 11112222211 1111 11111 Q ss_pred HHHHHHHHHHH---HHHhhhc---cchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 568 DNAKVNDLTFM---LQTMGPN---MDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAER 641 (726) Q Consensus 568 ~~~~~~~l~~l---~q~~~~~---~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~ 641 (726) ..+....+... +....+. .+..+....+..++...++.- ...++. +.+.+.++++..++. T Consensus 421 raq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p-~~ivrs-------------~eev~a~~~~~~qq~ 486 (510) T protein:vir:63 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDT-SQFYKS-------------ADELQAEAEQQRQQA 486 (510) T ss_pred HHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCCh-hHhcCC-------------HHHHHHHHHHHHHHH Confidence 11111222222 2222211 111122233333333333210 001100 000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_013692. 642 ARAAHYMSGAGLQDSKVGTEQAKARALA-SQADM 674 (726) Q Consensus 642 aq~q~~~~~~~~~~~~~~~eqaq~~q~~-~q~~~ 674 (726) ++++++++. . ...+.++. .-+-+ T Consensus 487 ~~~~~~~~~---~-------~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 487 AQAQAAQET---L-------LEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHH---H-------HHHHHhhcccccCC Confidence 000000000 0 00000000 00001 No 45 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.82 E-value=3e-18 Score=116.63 Aligned_cols=488 Identities=9% Similarity=0.040 Sum_probs=235.7 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccCCCCCCCCCCCCCcCCCHHHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYM---HVRGEGKPKTEKGKSAVQPPTIRKQAE 87 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y---~~~~~~~~~~~~grs~~v~~~v~~~v~ 87 (726) |.. ++.. -..-.-+.|++.++..++..+... ..|.++| .-...-+.+...+..++.++.-.+.++ T Consensus 1 ~~~-----~~~~----~~~~~~~~l~~r~~~L~~~R~~~e---~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~ 68 (516) T protein:vir:96 1 MKQ-----SIDL----EYGGKRSKIPKLWEKFSNKRSSFL---DRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATN 68 (516) T ss_pred Ccc-----hhhh----hhhhhHHHHHHHHHHHHHHhhHHH---HHHHHHHHhhcccccCCCCCccccCCcccchHHHHHH Confidence 111 1111 111122445555554443333222 3344443 211111223334445788999999999 Q ss_pred HHHHHHHHhhcC-CCceEEEecCCcch-------HHH------HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEE Q lcl|NC_013692. 88 WRYSSLSEPFLS-SPNIFEVNPVTWED-------AES------ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIV 153 (726) Q Consensus 88 ~~~~~L~~~f~~-~~~~~~~~p~~~~D-------~~~------A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~ 153 (726) .+.+.|+..+|+ +.+||.+.+-.... .+. -...+..+...| ..++.|..++..+++.+..|++++ T Consensus 69 ~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l 147 (516) T protein:vir:96 69 HLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKEL-EQRQFRPAVVEAFKHLIVAGSCML 147 (516) T ss_pred HHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEeE Confidence 999999999998 67999997732111 111 122444455455 467899999999999999999987 Q ss_pred EEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccc Q lcl|NC_013692. 154 KVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEERE 233 (726) Q Consensus 154 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (726) |-+... T Consensus 148 --~~d~~~------------------------------------------------------------------------ 153 (516) T protein:vir:96 148 --YKPSKG------------------------------------------------------------------------ 153 (516) T ss_pred --EecCCC------------------------------------------------------------------------ Confidence 222000 Q ss_pred ceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccc Q lcl|NC_013692. 234 ETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFD 313 (726) Q Consensus 234 ~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (726) .++.++..+|++..++..++.+ ++++.+++..+|-+. |....+.. .... T Consensus 154 -------~~~~~pl~~y~v~~d~~G~v~~---i~rr~~~~~~~l~~~-~~~~~~~~-------~~~~------------- 202 (516) T protein:vir:96 154 -------AISAIPMHHYVVNRDTNGDLLD---IILLQEKALRTFDPA-TRAVVEVG-------LKGK------------- 202 (516) T ss_pred -------CEEEEEcCeEEEeeCCCCCeee---ehhhhHhhHHHHHHh-hhhhhhhh-------hhhh------------- Confidence 0112334456776655444443 566778888886543 11110000 0000 Q ss_pred cCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHH Q lcl|NC_013692. 314 FQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRI 393 (726) Q Consensus 314 ~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~ 393 (726) .......|+||.|=.+ .+++... ++.-.-|.+++..+..|| ..|||++..|...++..||.|.+....+--+. T Consensus 203 -~~~~~~~v~v~~~v~~---~~~~~~~-~~~~~d~~~~~~es~~~~--~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~ 275 (516) T protein:vir:96 203 -KCKEDDSVKLYTHAKY---LGDGFWE-LKQSADDIPVGKVSKIKS--EKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFV 275 (516) T ss_pred -hcCCCCceEEEEeeee---eCCceeE-EEEEeCceeecccccccc--ccCCeeeeeeeecCCCCcccchHHHhhHHHHH Confidence 0012245666655332 3344432 222234445555554444 67999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc--chhHHHHHHHHHHHHHHH Q lcl|NC_013692. 394 IGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE--IPQSAQYMINLQQAEAES 471 (726) Q Consensus 394 ~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~--~~~~~~~ll~~~~~~~e~ 471 (726) +|.+.+..+.++..+.++.++++.+.+...+....-+.+.+ .+|. + ..+.+.+... --+.+...++.+.+.+.. T Consensus 276 L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i--~~g~-~-~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~ 351 (516) T protein:vir:96 276 IQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEV--VTGV-E-EDIHIVQLGKYADLTPISAVLEVYTRRIGV 351 (516) T ss_pred HHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCcee--ecCC-c-ccceeeecCcccchhHHHHHHHHHHHHHHH Confidence 99999999999999999999998776644333322222222 2332 1 2233332221 123334445544444443 Q ss_pred HhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecchh Q lcl|NC_013692. 472 MTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRD 550 (726) Q Consensus 472 ~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~ 550 (726) .- =..+....++..-||+++....+.....+...+.+|.. .+..+..+.+..+. + . T Consensus 352 af---~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~-----p---------------~ 408 (516) T protein:vir:96 352 VF---MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG-----E---------------S 408 (516) T ss_pred HH---hhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC-----C---------------C Confidence 21 11111112223359999998888888888888888774 45566555433221 1 0 Q ss_pred hcccccceeeecccchHHHH----HHHHHHHHHHHhhhccch----hHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhh Q lcl|NC_013692. 551 DLAGNFDLKLDISTAEEDNA----KVNDLTFMLQTMGPNMDP----MMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIA 622 (726) Q Consensus 551 ~~~~~~dv~i~~~~~~~~~~----~~~~l~~l~q~~~~~~~~----~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~ 622 (726) .-.....+.+..+.+...+. ........+..+.+..+. .+....+..++...+++. ..++..+ T Consensus 409 lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~--~~irs~e------- 479 (516) T protein:vir:96 409 FTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLKSAE------- 479 (516) T ss_pred CccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCc--cccCCHH------- Confidence 00111223333232222111 111122222222111111 122233333334333331 1111100 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 623 QQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALAS 670 (726) Q Consensus 623 qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~ 670 (726) +..++. +++.++++.+. +.. ...++.. .+++.+-.++ T Consensus 480 -ev~~~~----~~~~~~q~~~~--~a~--~~~~~~~--~~~~~~~~~~ 516 (516) T protein:vir:96 480 -EMAQEQ----EAQMQAQQAQM--LEE--GVAKAVP--GVIQQELKEA 516 (516) T ss_pred -HHHHHH----HHHHHHHHHHH--HHH--Hhhhhhh--HHhhcccccC Confidence 000000 00000000000 000 0000000 0000000000 No 46 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.82 E-value=1.3e-17 Score=113.12 Aligned_cols=492 Identities=10% Similarity=0.058 Sum_probs=230.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CC--CCCCCCCCCCCcC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHV-RG--EGKPKTEKGKSAV 77 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~-~~--~~~~~~~~grs~~ 77 (726) |-| |..--.-..+.|.+.++..++-.+.. ...|.++|.- .| ..+.+...+..++ T Consensus 1 ~~~--------------------~~~~~~~~~~~l~~r~~~Lk~~R~~~---e~~w~e~~~~tlP~~~~~~~~~~~~~~~ 57 (515) T protein:vir:70 1 MQD--------------------TILEYGGQRSKIPKLWEKFSKKRSPY---LDRAKHFAKLTLPYLMNNKGDNETSQNG 57 (515) T ss_pred Ccc--------------------hhhhhcCCHHHHHHHHHHHHHhhhHH---HHHHHHHHHHhcccccCCCCCccccccc Confidence 111 00100111233444444332222221 1234444421 11 1122222333458 Q ss_pred CCHHHHHHHHHHHHHHHHhhcC-CCceEEEecCCcch-------HHHHH------HHHHHHHHHHhhcccchhHHHHHHH Q lcl|NC_013692. 78 QPPTIRKQAEWRYSSLSEPFLS-SPNIFEVNPVTWED-------AESAR------QNGLVLNQQFNTKLNKQRFIDEYVR 143 (726) Q Consensus 78 v~~~v~~~v~~~~~~L~~~f~~-~~~~~~~~p~~~~D-------~~~A~------q~t~~~n~~~~~~~~~~~~~~~~~~ 143 (726) ++..-...++.+.+.|+..+|+ +.+||.+.+-.+.. .+.+. ..+..+...| ..++.|..++..++ T Consensus 58 ~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~ 136 (515) T protein:vir:70 58 WQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL-EQRQFRPAIVEVFK 136 (515) T ss_pred ccchHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCchHHHHHHHH Confidence 8888899999999999999998 57899998633221 22222 2334444444 46789999999999 Q ss_pred HHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeec Q lcl|NC_013692. 144 AGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAV 223 (726) Q Consensus 144 ~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 223 (726) +.+..||+++-+ +.. . | T Consensus 137 ~L~~~G~a~l~~--d~~--------~-----------------------------------------------~------ 153 (515) T protein:vir:70 137 HLIVAGNCLLYK--PSK--------G-----------------------------------------------A------ 153 (515) T ss_pred HHHhHCeEEEEE--eCC--------C-----------------------------------------------C------ Confidence 999999998732 200 0 0 Q ss_pred cccceeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchh Q lcl|NC_013692. 224 PVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYT 303 (726) Q Consensus 224 ~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~ 303 (726) ++.++..+|++..++..+++. ++++..+|..+|.+. |...... ... T Consensus 154 ------------------~~~~pl~~y~v~~d~~G~v~~---i~rr~~~t~~~l~~~-f~~~~~~----------~~~-- 199 (515) T protein:vir:70 154 ------------------MSAVPMHHYVVNRDTNGDLMD---VILLQEKALRTFDPA-TRMAIEV----------GMK-- 199 (515) T ss_pred ------------------eEEEEcCeEEEeeCCCcCeeE---EEeeeeccHHHHHHh-hhhhhhh----------hhh-- Confidence 122334567776665444443 678899999998765 2211100 000 Q ss_pred hhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCCh Q lcl|NC_013692. 304 GPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESD 383 (726) Q Consensus 304 ~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~ 383 (726) .. ...+.+.|+||.+- ...+++...++..+ -|.+++. ++-|+...+||++..|...++..||.|. T Consensus 200 ~~---------~~~~~~~v~i~~~v---~~~~~~~~~~~~e~-d~~~~~~--es~y~~~e~P~~~~Rw~~~~ge~YGrgp 264 (515) T protein:vir:70 200 GK---------KCKEDDNVKLYTHA---QYAGEGFWKINQSA-DDIPVGK--ESRIKSEKLPFIPLTWKRSYGEDWGRPL 264 (515) T ss_pred hh---------hcCCCCceEEEEEE---EecCCCceEEEEec-Cceeecc--ccccccccCCceeeeeeecCCCCcccch Confidence 00 00112456666542 23345544333222 3334443 4556668899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc--chhHHHHH Q lcl|NC_013692. 384 GALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE--IPQSAQYM 461 (726) Q Consensus 384 ~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~--~~~~~~~l 461 (726) +....+--+.+|.+.+..+.+...+.+|.++++.+.+...+....-+.+.+ .+|. ...+.+.+... --+.+... T Consensus 265 ~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~i--v~g~--~~~v~~~~~~~~~d~~~~~~~ 340 (515) T protein:vir:70 265 AEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEV--ITGV--AEDIHIVQLGKYADLTPISAV 340 (515) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCcee--ecCC--cccceeeecCcccchhHHHHH Confidence 999999999999999999999999999999998887754443322222222 2322 12233322211 12333444 Q ss_pred HHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEe Q lcl|NC_013692. 462 INLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRIT 540 (726) Q Consensus 462 l~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~ 540 (726) ++.+.+.+...-=+.. ....++..-||+++....+.-...+...+.++.. .+..+..+.+. . T Consensus 341 i~~~~~rI~~af~~~~---l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~---~----------- 403 (515) T protein:vir:70 341 LEVYTRRIGVIFMMET---MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ---E----------- 403 (515) T ss_pred HHHHHHHHHHHHhhhh---hhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH---h----------- Confidence 4544444443221111 1111222359999998888888888888888765 33444332211 0 Q ss_pred cccceecchhhcccccceeeecccchHHHH-HHHHHHHHHHHhh--hccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 541 NEHFVDIRRDDLAGNFDLKLDISTAEEDNA-KVNDLTFMLQTMG--PNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 541 ~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~-~~~~l~~l~q~~~--~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) ..+..|... .++.+..+.+...+. ....+...++.++ ...++... +.-+..+..+.+...-.. T Consensus 404 ---~~p~~P~~~---v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~--------~~id~d~~~~~~a~~~g~ 469 (515) T protein:vir:70 404 ---AGDSFTSEL---VDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQ--------RAIRWGDYMDWVRGQISA 469 (515) T ss_pred ---hCCCCChhh---cccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHH--------hhCCHHHHHHHHHHHhCC Confidence 011122211 222222222222111 1111222222221 11111111 111111111111111111 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMT 675 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~ 675 (726) +...- +.+.+.++.+++.+++++.+.......+.+..-... ..++. T Consensus 470 p~~~~--rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~----------~~~~~ 515 (515) T protein:vir:70 470 ELPFL--KSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ----------EMKEG 515 (515) T ss_pred Ccccc--CCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhh----------hhccC Confidence 11100 000111111111110000000000000000000000 00000 No 47 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.80 E-value=3.6e-18 Score=116.19 Aligned_cols=500 Identities=13% Similarity=0.087 Sum_probs=245.5 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHhc------cCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCce Q lcl|NC_013692. 32 LAQ-LKQDYQEAKQVTDEKITQINRWLDYMH------VRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLS-SPNI 103 (726) Q Consensus 32 ~~~-~~~~~~~a~~~~~~~~~~~~~~~~~y~------~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~-~~~~ 103 (726) ... .++.++..++-.+.. ...|.++|. +..+++.. ..-..++++..-.+.++.+.+.|+..+|+ +.+| T Consensus 1 mk~~a~~r~~~l~~~R~~~---e~~w~e~~~y~lP~~~~~~~~~~-~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~W 76 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDF---LDMARRCAALTLPYLLTEDGHAS-GGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSF 76 (542) T ss_pred ChhHHHHHHHHHHHHhhHH---HHHHHHHHHHhccccCCCCCCcc-cccccccccchHHHHHHHHHHHHHHhhcCCCCcc Confidence 111 223333322222211 233444442 22222111 11123778888899999999999999998 7999 Q ss_pred EEEecCCcc-------hHHHH-------HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccc Q lcl|NC_013692. 104 FEVNPVTWE-------DAESA-------RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVV 169 (726) Q Consensus 104 ~~~~p~~~~-------D~~~A-------~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~ 169 (726) |.+.+-... |.++. ...+.++...+ ..++.|..++..+++.+..|++++ |-+.. T Consensus 77 F~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l--~~~~~--------- 144 (542) T protein:vir:78 77 FKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQI-AESSDRVQLTAAMKHLIVTGNVLV--FAGKK--------- 144 (542) T ss_pred ccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEE--EecCC--------- Confidence 999874221 22111 12344555555 477899999999999999999987 22200 Q ss_pred ccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhh Q lcl|NC_013692. 170 TYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNN 249 (726) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~ 249 (726) . ++.++..+ T Consensus 145 ~-----------------------------------------------------------------------~~~~pl~~ 153 (542) T protein:vir:78 145 T-----------------------------------------------------------------------LKVYPLDR 153 (542) T ss_pred C-----------------------------------------------------------------------ceEEecce Confidence 0 11233456 Q ss_pred eeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEE Q lcl|NC_013692. 250 IVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWG 329 (726) Q Consensus 250 ~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~ 329 (726) |++..++...++. ++++..+|..+|.+. |.++ .+.. . +... ..+....++.+++.+. T Consensus 154 y~v~~d~~G~vd~---v~r~~~~t~~ql~~~-fg~~--~l~~-~--~~~~--------------~~~~~~~~~~v~~~v~ 210 (542) T protein:vir:78 154 YVIERDGDGNVIE---IITRELVDRSLLPAE-FQKQ--SLLE-G--KDSN--------------AVGEDGPKFGVAQGKG 210 (542) T ss_pred eEEeeCCCCCeEE---EeeeeecCHHHHHHh-hccc--cCch-H--HHhh--------------ccccCCCeEEEEEEee Confidence 7777665444443 788999999998776 3221 1110 0 0000 0011223455555433 Q ss_pred Eee-cC-------CCceEEEEEEEEECCEE-EEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 330 YYD-IH-------GDGVLHPIVATWVGAVM-IRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 330 ~~~-~~-------~~g~~~~~~~~~~g~~~-l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) ..+ .+ ..+...+ +.-..|..+ ....++|| ..+||++..|...++..||+|.+....+-.+.+|.+.+. T Consensus 211 pr~~~~~~~~~~~~~~~~s~-~~e~~g~~v~~~~~e~g~--~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 287 (542) T protein:vir:78 211 GRNDAEVFTCCKLVDGQHRW-HQECDGKEIKGSRSSSPL--KHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRS 287 (542) T ss_pred cccCCccccccccCCCeEEE-EEEecccccccccccccc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHH Confidence 211 10 1122222 222223333 22345555 679999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCCceEeecccccc-hhhhhhcCCceEeecCccchhhhccccc--CccchhHHHHHHHHHHHHHHHHhchHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDV-TNRRRFDRGENYEFNPGADPRAAVHMHT--FPEIPQSAQYMINLQQAEAESMTGVKA 477 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~-~d~~~~~~g~vi~~~~~~~~~~~i~~~~--~~~~~~~~~~ll~~~~~~~e~~tGv~~ 477 (726) .+.++..+.+|+++++.+.+.. .+.....+|.++.-.++ .+.+.+ .+.--+.....++.+.+.+...- T Consensus 288 ~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~~g~iv~g~~~-----~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aF---- 358 (542) T protein:vir:78 288 LIEGSAAAAKVVFMVSPSATTKPQSLARAGTGAIIQGRAE-----DVSVVQANKGADFRTVQEMIRDLSQRISDAF---- 358 (542) T ss_pred HHHHHHHHhcCceeeccccccchhhcccCCCceeecCCcc-----ceeeeecccccchhHHHHHHHHHHHHHHHHh---- Confidence 9999999999999998766533 33334445554432222 122222 22222334555566555555432 Q ss_pred HhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccccc Q lcl|NC_013692. 478 FNAGISGAALGDTATAVRGALDAASKRELGILRRLS-AGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNF 556 (726) Q Consensus 478 ~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~-~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~ 556 (726) +.....++..-||+++....+.....+..++.+|. +.+..++.+.+.++.+..--+. -|.+. . T Consensus 359 -l~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~------------~p~~l---v 422 (542) T protein:vir:78 359 -LILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPS------------LPKGL---V 422 (542) T ss_pred -cccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC------------Cchhc---e Confidence 11111223335999999999999999999999986 4667888888888776433222 11211 2 Q ss_pred ceeeecccchHHHH-HHHHHHHHHHHhhhcc-chh-----HHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHH Q lcl|NC_013692. 557 DLKLDISTAEEDNA-KVNDLTFMLQTMGPNM-DPM-----MAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLE 629 (726) Q Consensus 557 dv~i~~~~~~~~~~-~~~~l~~l~q~~~~~~-~~~-----~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e 629 (726) ++++..+.+...+. ..+.+...++.+++.. ++. +....+..++...+++.. ..++.. +..++ T Consensus 423 ~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~-~i~~s~----e~~~~------ 491 (542) T protein:vir:78 423 MPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTL-NLVKSP----ETMAN------ 491 (542) T ss_pred eeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHh-hccCCH----HHHHH------ Confidence 34444443322221 1122222222222211 111 122333333333343310 001100 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 630 LMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKREL 694 (726) Q Consensus 630 ~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~ 694 (726) .+++.+++++++....+....+....... ...+..+.. +.+-..- ..-.++ T Consensus 492 ---~~~q~q~~~~~~al~~~a~~~a~~~~~~~--~~~~~~a~~---~~~~~~~------~~~~~~ 542 (542) T protein:vir:78 492 ---EAQQAQQQQMTASLMGQAGQLAKSPIGEK--MMQQINAPG---QEAPAGP------QTGEDL 542 (542) T ss_pred ---HHHHHHHHHHHHHHHHhhhhccccccccc--hhhhcCCCC---cCCCCCC------cccccC Confidence 00000000000000000000000000000 000000000 0000000 000001 No 48 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.80 E-value=5e-17 Score=109.94 Aligned_cols=487 Identities=9% Similarity=0.033 Sum_probs=230.6 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCCCCCCCCCCCCcCCCHHHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMH---VRGEGKPKTEKGKSAVQPPTIRKQAE 87 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~---~~~~~~~~~~~grs~~v~~~v~~~v~ 87 (726) |..++... -.-..+.|++.++..++..+... ..|.++|. -...-+.+...+..++.++.-...++ T Consensus 1 ~~~~~~~~---------~~~~~~~l~~r~~~L~~~R~~~e---~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~ 68 (516) T protein:vir:10 1 MKQSTDLE---------YGGKRSKIPKLWEKFSTKRSSFL---DRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATN 68 (516) T ss_pred CCchhhHh---------hhhHHHHHHHHHHHHHHhhhHHH---HHHHHHHHhhcccccCCCCCcccccccccchHHHHHH Confidence 22211111 11123455555554443333222 33444442 11111223333444788999999999 Q ss_pred HHHHHHHHhhcC-CCceEEEecCCcch-------HH------HHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEE Q lcl|NC_013692. 88 WRYSSLSEPFLS-SPNIFEVNPVTWED-------AE------SARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIV 153 (726) Q Consensus 88 ~~~~~L~~~f~~-~~~~~~~~p~~~~D-------~~------~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~ 153 (726) .+.+.|+..+|+ +.+||.+.+-...+ .+ .-...+..+...| ..++.|..++..+++.+..|++++ T Consensus 69 ~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l 147 (516) T protein:vir:10 69 HLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKEL-EQRQFRPAVVEAFKHLIVAGSCML 147 (516) T ss_pred HHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEeE Confidence 999999999998 67999998632211 11 1122444454444 478899999999999999999986 Q ss_pred EEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccc Q lcl|NC_013692. 154 KVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEERE 233 (726) Q Consensus 154 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (726) |-+.. . | T Consensus 148 --~~d~~--------~-----------------------------------------------~---------------- 154 (516) T protein:vir:10 148 --YKPSK--------G-----------------------------------------------A---------------- 154 (516) T ss_pred --EecCC--------C-----------------------------------------------C---------------- Confidence 33200 0 0 Q ss_pred ceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccc Q lcl|NC_013692. 234 ETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFD 313 (726) Q Consensus 234 ~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (726) ++.++..+|++..++..++.+ ++++..++..+|.+. |..-.+. ..... T Consensus 155 --------~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~-~~~~~~~-------~~~~~------------- 202 (516) T protein:vir:10 155 --------ISAIPMHHYVVNRDTNGDLLD---IILLQEKSLRTFDPA-TRAVVEV-------GLKGK------------- 202 (516) T ss_pred --------eEEEEcCeEEEeeCCCCCeEE---EeeeecccHHHHHHH-hhhhhhh-------hhhhh------------- Confidence 112334467776665444444 567778888887544 2110000 00000 Q ss_pred cCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCE-EEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHH Q lcl|NC_013692. 314 FQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAV-MIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQR 392 (726) Q Consensus 314 ~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~-~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~ 392 (726) .......|.||.|=. ..+++.. .+..-.++. +...+. |+...+||++..|...++..||.|.+....+--+ T Consensus 203 -~~~~~~~~~i~t~v~---~~~~~~~--~~~~~~d~~~~~~~s~--~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k 274 (516) T protein:vir:10 203 -KCKEDDSIKLYTHAK---YLGEGFW--ELKQSADDIPVGKVSK--IKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLF 274 (516) T ss_pred -ccCCCCceEEEEEEE---ecCCCce--EEEEeeCceeeccccc--cccccCCeeeeeeeecCCCCcccchHHHhhHHHH Confidence 001233566665422 2334432 222223443 444344 4446899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccchhhhcccccCcc--chhHHHHHHHHHHHHHH Q lcl|NC_013692. 393 IIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPE--IPQSAQYMINLQQAEAE 470 (726) Q Consensus 393 ~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~--~~~~~~~ll~~~~~~~e 470 (726) .+|.+.+..+.++..+.+|.++++.+.+...+.. .+|+.-.+.+|. + ..+.+.+... --+.+...++.+.+.+. T Consensus 275 ~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l--~~~~~g~~~~g~-~-~~v~~~q~~~~~d~~~~~~~i~~~~~rI~ 350 (516) T protein:vir:10 275 VIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF--VNSGTGEVVTGV-E-EDIHIVQLGKYADLTPISAVLEVYTRRIG 350 (516) T ss_pred HHHHHHHHHHHHHHHhcCCCcccCcccccchhhh--ccCCCceeecCC-c-ccceeeecCcccchHHHHHHHHHHHHHHH Confidence 9999999999999999999999987776443332 233321122332 1 2233332221 11333444444444443 Q ss_pred HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcCeEEEEecccceecch Q lcl|NC_013692. 471 SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSA-GIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRR 549 (726) Q Consensus 471 ~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~-~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~ 549 (726) ..-=+. +...-++..-||+++....+.-...+...+.+|.. .+..+..+.+... + ..-| T Consensus 351 ~af~~~---~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~---~--------------p~~P 410 (516) T protein:vir:10 351 VVFMME---TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA---G--------------DSFT 410 (516) T ss_pred HHHhhh---hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh---C--------------CCCC Confidence 321111 11111222359999998888888888888888764 4455554432111 0 0011 Q ss_pred hhcccccceeeecccchHHH-HHHHHHHHHHHHhhh--ccchhH-----HHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 550 DDLAGNFDLKLDISTAEEDN-AKVNDLTFMLQTMGP--NMDPMM-----AQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 550 ~~~~~~~dv~i~~~~~~~~~-~~~~~l~~l~q~~~~--~~~~~~-----~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) ... .++.+..+.+...+ +....+...++.++. +.++.. ....+....+..+++. ..++.. T Consensus 411 ~~l---v~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~--~~irs~------- 478 (516) T protein:vir:10 411 SDL---VDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLKSA------- 478 (516) T ss_pred hhh---cCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh--hccCCH------- Confidence 111 12222222221111 111112222222221 112211 1112223333333221 111100 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMT 675 (726) Q Consensus 622 ~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~ 675 (726) .+.+.++++.. +.++... +...+. ++..--+..+.++. T Consensus 479 ------eev~~~r~~~~----~~q~~~~----~~~~~~--~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 479 ------EEMEQEQEAQM----QAQQAQM----LEEGVA--KAVPGVIQQELKEA 516 (516) T ss_pred ------HHHHHHHHHHH----HHHHHHH----HHHHhh--hcccchhhhhhhcC Confidence 00000000000 0000000 000000 00000000000000 No 49 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=99.75 E-value=1.5e-19 Score=123.83 Aligned_cols=590 Identities=15% Similarity=0.114 Sum_probs=293.7 Q ss_pred hhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhccC-C-----------CCCCCCCC Q lcl|NC_013692. 8 YLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQ---INRWLDYMHVR-G-----------EGKPKTEK 72 (726) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~~~~~y~~~-~-----------~~~~~~~~ 72 (726) ..+-|++..+.+-- .|.--++-.-..|++-++.++--....+.+ +++++..|..- + .++.|-.- T Consensus 1 maispsepninsfv-ytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V 79 (666) T protein:vir:10 1 MAISPSEPNINSFV-YTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQV 79 (666) T ss_pred CCcCCCCCcchhhh-hHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeecccccccCccee Confidence 12222222222211 244455666677888888777555556653 68888888521 1 11211111 Q ss_pred CCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 73 GKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 73 grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) =.-.+|+|-|-.+|++++++|.++|+||.++|.+.. +|...+-|++..-++.....-......++ -+++|++.++..- T Consensus 80 ~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~Li-L~L~D~~KYN~~~ 157 (666) T protein:vir:10 80 VNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELI-LCLQDAAKYNLVG 157 (666) T ss_pred eccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHH-HHHhhhhhcceee Confidence 145789999999999999999999999999999998 89999999999888877665444444444 4889999998877 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +.+.|. .+++|... .++...+ |.. +..++. T Consensus 158 ~ET~Ws--------~IE~~~~~--------------~~i~~~~---------------------~~K---~TlrR~---- 187 (666) T protein:vir:10 158 WETEWS--------HIETYDPQ--------------KEITDLE---------------------PGK---TTLRRN---- 187 (666) T ss_pred eeeccc--------cccccchh--------------hhhhcCC---------------------Cce---eecccc---- Confidence 777775 33333210 1111000 000 111111 Q ss_pred cceeeccceeeeechhheeeCCC-CCCch-hhCCeEEEEEeccHHHHHhcC---------CCcchhh--c--Ccccch-- Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIVIDPS-CGSDF-SKAKFLIETFESSYAELKADG---------RYQNLDK--I--QVEGQN-- 295 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~~dp~-a~~d~-~da~~~~~~~~~t~~el~~~g---------~~~~~d~--~--~~~~~~-- 295 (726) .+..-+|++++|+|++|||+ +..|+ ....|++....+++-.|+..- -|+.+-+ + ...+.+ T Consensus 188 ---~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~sD~T 264 (666) T protein:vir:10 188 ---YRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQGSDWT 264 (666) T ss_pred ---hhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccccccc Confidence 11123578999999999996 33444 456788877777765554320 0111100 0 000000 Q ss_pred --hhcccch--hhhhccccccccCCc------CCceEEEEE--------EEEEeec--------CCCceEEEEEEEEECC Q lcl|NC_013692. 296 --LLSEPDY--TGPSEGVRNFDFQDK------SRKRLVVHE--------YWGYYDI--------HGDGVLHPIVATWVGA 349 (726) Q Consensus 296 --~~~~~~~--~~~~~~~~~~~~~~~------~~~~v~v~E--------~w~~~~~--------~~~g~~~~~~~~~~g~ 349 (726) .....-+ .....+.+.+-|... ..++|-|-| .|.|+-. ..+...-|..+++-|+ T Consensus 265 ~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~~Y~RI~PSDF~~~~P~~N~~QIWK~v~IN~~ 344 (666) T protein:vir:10 265 DNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKAVMINRD 344 (666) T ss_pred cCCccCccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeeeeeeeccc Confidence 0000000 000111111111111 123343333 2333321 1244455667777788 Q ss_pred EEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhc Q lcl|NC_013692. 350 VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFD 429 (726) Q Consensus 350 ~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~ 429 (726) .++..++-.-.++.||.-..-+..+.-..-..|+.+..++.|+...++++..+-...+....+.++++..+.-.+.-... T Consensus 345 ~iIS~~~~I~AY~~~~~~~~~~LEDG~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a~~iNSP~ 424 (666) T protein:vir:10 345 AIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRANDINSPI 424 (666) T ss_pred eeEeeehhhhccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhhhhhcccCCC Confidence 99988866556677776554445444445567999999999999999988887777777777777766665332222222 Q ss_pred CCceEeecCccchhhhc--ccccCccchhHHHHHHH---HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHH Q lcl|NC_013692. 430 RGENYEFNPGADPRAAV--HMHTFPEIPQSAQYMIN---LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKR 504 (726) Q Consensus 430 ~g~vi~~~~~~~~~~~i--~~~~~~~~~~~~~~ll~---~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~ 504 (726) |-.-|.+++.+...+.. -+.+.|.-..+....++ ++-..-.+++|++...+|.---. +.|-.+-.-.+-.+..+ T Consensus 425 ~~~KIP~~~~sL~N~~~~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKG-NKt~~E~~~~MG~a~NR 503 (666) T protein:vir:10 425 PQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKG-NKTRAEFDTIMGNAENR 503 (666) T ss_pred CCcccceeehhhcccchhhhhccCCccccchhHHHhhhHHHHhhHHHhhccCCccccccccc-CcceeehhhhcCCcccc Confidence 33334444443332221 22333433333333333 34455566777777766632111 23444444555555555 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc-ccceeeecccchHHHH----HHHHHHHH Q lcl|NC_013692. 505 ELGILRRLSAGI-IEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG-NFDLKLDISTAEEDNA----KVNDLTFM 578 (726) Q Consensus 505 ~~~~~~~~~~~~-~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~-~~dv~i~~~~~~~~~~----~~~~l~~l 578 (726) ++...=-+..-+ ..+-+.+.-.+.+|.++-.+|.-+..+.+.|+-+.++. -....+.+|....++. ....++++ T Consensus 504 ~RLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DG~TP~SK~ASs~~lT~~LQM 583 (666) T protein:vir:10 504 MRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASKLASSDFLTALLQM 583 (666) T ss_pred eehhhHHhhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhhhhhhHHHHHHHHH Confidence 544333333222 23333333345678887777765555566666554432 1222344443333332 22222222 Q ss_pred -------HHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 579 -------LQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGA 651 (726) Q Consensus 579 -------~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~ 651 (726) +++.+++.+ .+..-++.+.++..+++.-...-+.=.+----.+++++..+|.+- | ... T Consensus 584 I~sS~~~~~A~G~~~P-----~M~AH~~QLGGVRG~E~Y~daalP~~~~~~~~~Q~LQ~~~LQ~~~-----Q-SA~---- 648 (666) T protein:vir:10 584 IMSSETTLQAFGTQVP-----GMIAHLAQLGGVRGFEKYADAALPQWQITYGMQQQLQQMLLQLQQ-----Q-SAM---- 648 (666) T ss_pred HhhhhhhHhhhcccch-----HHHHHHHHhccccchhhhhhccCCccccccchhHHHHHHHHHHhh-----h-hhc---- Confidence 223333333 234456667777776665433222111100000111111111110 0 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 652 GLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQ 688 (726) Q Consensus 652 ~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~ 688 (726) |.++.|-+.-.++..- .+ T Consensus 649 ----------Q~~A~Q~~L~~~Q~~P---------Sq 666 (666) T protein:vir:10 649 ----------QLQARQGELSNDQSQP---------SQ 666 (666) T ss_pred ----------ccccccccCcccccCC---------CC Confidence 0000000000000000 00 No 50 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=99.74 E-value=3.8e-19 Score=121.55 Aligned_cols=590 Identities=15% Similarity=0.113 Sum_probs=291.7 Q ss_pred hhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhccC-C-----------CCCCCCCC Q lcl|NC_013692. 8 YLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQ---INRWLDYMHVR-G-----------EGKPKTEK 72 (726) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~~~~~y~~~-~-----------~~~~~~~~ 72 (726) ..+-|++..+.+-- .|.--++-.-..|++-++.++--....+.+ +++++..|..- + .++.|-.- T Consensus 1 maispsepninsfv-ytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V 79 (666) T protein:vir:96 1 MAISPSEPNINSFV-YTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQV 79 (666) T ss_pred CccCCCCCcchhhh-hHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeeccccccccccee Confidence 12222222222211 244455666677888888777555556653 68888888521 1 11211111 Q ss_pred CCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 73 GKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 73 grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) =.-.+|+|-|-.+|++++++|.++|+||.++|.+.. +|...+-|++..-++.....-......++ -+++|++.++..- T Consensus 80 ~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~Li-L~L~D~~KYN~~~ 157 (666) T protein:vir:96 80 VNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELI-LCLQDAAKYNLVG 157 (666) T ss_pred eccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHH-HHHhhhhhcceee Confidence 145789999999999999999999999999999998 89999999999888877665444444444 4889999998877 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +.+.|. .+++|... .++...+ |.. +..++. T Consensus 158 ~ET~Ws--------~IE~~~~~--------------~~i~~~~---------------------~~K---~TlrR~---- 187 (666) T protein:vir:96 158 WETEWS--------NIETYDPQ--------------KEITDLE---------------------PGK---TTLRRN---- 187 (666) T ss_pred eeeccc--------cccccchh--------------hhhhcCC---------------------Cce---eeeccc---- Confidence 777775 33333210 1111100 000 111111 Q ss_pred cceeeccceeeeechhheeeCCC-CCCch-hhCCeEEEEEeccHHHHHhcC---------CCcchhh--c--Ccccch-- Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIVIDPS-CGSDF-SKAKFLIETFESSYAELKADG---------RYQNLDK--I--QVEGQN-- 295 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~~dp~-a~~d~-~da~~~~~~~~~t~~el~~~g---------~~~~~d~--~--~~~~~~-- 295 (726) .+..-+|++++|+|++|||+ +..|+ ....|++....+++-.|+..- -|+.+-+ + ...+.+ T Consensus 188 ---~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~sD~T 264 (666) T protein:vir:96 188 ---YRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQGSDWT 264 (666) T ss_pred ---hhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhhccccccc Confidence 01123578999999999996 33444 456788877777765554320 0111100 0 000000 Q ss_pred --hhcccch--hhhhccccccccCCc------CCceEEEEE--------EEEEeec--------CCCceEEEEEEEEECC Q lcl|NC_013692. 296 --LLSEPDY--TGPSEGVRNFDFQDK------SRKRLVVHE--------YWGYYDI--------HGDGVLHPIVATWVGA 349 (726) Q Consensus 296 --~~~~~~~--~~~~~~~~~~~~~~~------~~~~v~v~E--------~w~~~~~--------~~~g~~~~~~~~~~g~ 349 (726) .....-+ .....+.+.+-|... ..++|-|-| .|.|+-. ..+...-|..+++-|+ T Consensus 265 ~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~mY~RI~PSDF~~~~P~~N~~QIWK~v~IN~~ 344 (666) T protein:vir:96 265 DNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKAVMINRD 344 (666) T ss_pred cCCcccccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeeeeeeeccc Confidence 0000000 000111111111111 123343332 2334321 1244455667777788 Q ss_pred EEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhc Q lcl|NC_013692. 350 VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFD 429 (726) Q Consensus 350 ~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~ 429 (726) .++..++-.-.++.||.-..-+..+.-..-..|+.+..++.|+...++++..+-...+....+.++++..+.-.+.-... T Consensus 345 ~iIS~~~~I~AY~~~~~~~~~~LEDGmG~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a~~iNSP~ 424 (666) T protein:vir:96 345 AIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRANDINSPI 424 (666) T ss_pred eeEeeehhhcccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhhhhcccCCC Confidence 99988866556677776554445444445567999999999999999998887777777777777766665332222222 Q ss_pred CCceEeecCccchhhhc--ccccCccchhHHHHHHH---HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHH Q lcl|NC_013692. 430 RGENYEFNPGADPRAAV--HMHTFPEIPQSAQYMIN---LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKR 504 (726) Q Consensus 430 ~g~vi~~~~~~~~~~~i--~~~~~~~~~~~~~~ll~---~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~ 504 (726) |-.-|.+++.+...+.. -+.+.|.-..+....++ ++-..-.+++|++...+|.---. +.|-.+-.-.+-.+..+ T Consensus 425 ~~~KIP~~~~sL~N~~m~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQKG-NKt~~E~~~~MG~a~NR 503 (666) T protein:vir:96 425 PQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQKG-NKTRAEFDTIMGNAENR 503 (666) T ss_pred CCcccceeehhhhccchhhhhccCCccccchhHHHhhhHHHhhhHHHhhccCCccccccccc-CcceeehhhhcCCcccc Confidence 33334444443332221 22333433333333333 34455566777777766632111 23444445555555555 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc-ccceeeecccchHHHH----HHHHHHHH Q lcl|NC_013692. 505 ELGILRRLSAGI-IEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG-NFDLKLDISTAEEDNA----KVNDLTFM 578 (726) Q Consensus 505 ~~~~~~~~~~~~-~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~-~~dv~i~~~~~~~~~~----~~~~l~~l 578 (726) ++...--+..-+ ..+-+.+.-.+.+|.++-.+|.-+..+.+.|+-+.++. -....+.+|....++. ....++++ T Consensus 504 mRLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DGlTP~SKlASs~~lT~~LQM 583 (666) T protein:vir:96 504 MRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASKLASSDFLTALLQM 583 (666) T ss_pred eehhhHHHhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhhhhhhHHHHHHHHH Confidence 544333333222 23333333345578887777765544566666554432 1222344443333332 22222222 Q ss_pred -------HHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 579 -------LQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGA 651 (726) Q Consensus 579 -------~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~ 651 (726) +++.+++.+ .+..-++.+.++..+++..-..-++=.--=-..+++++..+|.+ .+.+ T Consensus 584 I~sS~~~~~A~G~~~P-----~M~AHl~QLGGVRG~E~Y~~~ALPqwqitygm~Q~LQ~~~LQ~~---------~QSA-- 647 (666) T protein:vir:96 584 IMSSETTLQAFGTQVP-----GMIAHLAQLGGVRGFEKYANAALPQWQITYGMQQQLQQMLLQLQ---------QQSA-- 647 (666) T ss_pred HhcchhhHhhhcccch-----HHHHHHHHhccccchhhcccccCcchhhhhhhhHHHHHHHHHHh---------hhhc-- Confidence 223333333 23445666777766665522111100000000011111111100 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 652 GLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQ 688 (726) Q Consensus 652 ~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~ 688 (726) -|.++.|-+.-.++..- .+ T Consensus 648 ---------~Q~~A~Q~~L~~~Q~~P---------Sq 666 (666) T protein:vir:96 648 ---------MQLQARQGELSNDQSQP---------SQ 666 (666) T ss_pred ---------cccccccccCcccccCC---------CC Confidence 00000010000000000 00 No 51 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.66 E-value=6.9e-15 Score=98.24 Aligned_cols=445 Identities=10% Similarity=0.075 Sum_probs=210.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--CCCCCCCC--Cc Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG--KPKTEKGK--SA 76 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~gr--s~ 76 (726) |---+.+--+||+++ +.... .|....+.|...+....+..+||+|.-+. .++..+++ .+ T Consensus 1 ~~~~~~~~~~~p~d~--------------~~~~~---~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~k 63 (453) T protein:vir:39 1 MKYKPPKLMTFPKDE--------------PITNE---VVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNR 63 (453) T ss_pred CeecCCcceEcCCCC--------------CCCHH---HHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccce Confidence 221111112333322 21111 12222335666677778889999876533 22334444 47 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 77 VQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 77 ~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) ++.+.....|+.....| ||.. +.|.+ +|.+ ..+.|+.+|. .|+.-..+...+++++.+|.+++.+| T Consensus 64 i~~n~~~~ivd~~~~~l----~g~~--~~~~~---~d~~----~~~~l~~i~~-~N~~~~~~~~~~~~~~~~G~~~~~v~ 129 (453) T protein:vir:39 64 LTVNFTKYIVDTFTGYF----NGIP--VKKSH---SDKE----TLSKLQEFDN-LNDMEDEESELAKMACIYGRAFELLY 129 (453) T ss_pred eecchHHHHHHHHhhhh----cccC--ceecc---CChH----HHHHHHHHHH-hcChhHHHHHHHHHHhhcCeEEEEEE Confidence 88888888888887766 4433 33433 2222 3346777764 56666678899999999999999887 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) ++.. T Consensus 130 ~d~~---------------------------------------------------------------------------- 133 (453) T protein:vir:39 130 QNEE---------------------------------------------------------------------------- 133 (453) T ss_pred ecCC---------------------------------------------------------------------------- Confidence 7510 Q ss_pred eccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccccccc Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDF 314 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (726) +.|++..++|.+++. |+... ....+.++ .+. + T Consensus 134 -g~~~i~~~~p~~~~~v~d~~~~---~~~~~~ir-~~~------------~----------------------------- 167 (453) T protein:vir:39 134 -TQTNVIYNTPENMFMVYDDTIK---QEPLFAVR-YGY------------D----------------------------- 167 (453) T ss_pred -CceEEEEEcccceEEEecCCCC---CeEEEEEE-EEE------------e----------------------------- Confidence 012334556666443 33221 11222221 110 0 Q ss_pred CCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHH Q lcl|NC_013692. 315 QDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRII 394 (726) Q Consensus 315 ~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~ 394 (726) .+.+.++|+|.. +.+ ++....++..--.++.|.+.+.+|+++++. ..+|.|.++.++++++.+ T Consensus 168 ----~~~~~~~~~yt~-----~~i---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~ 230 (453) T protein:vir:39 168 ----DDYKLYGEVYTK-----ETT---YALNGTMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAF 230 (453) T ss_pred ----CCeEEEEEEEeC-----CeE---EEEEecCCceeeecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHH Confidence 001223444431 111 111111222211233344446778766543 346889999999999999 Q ss_pred HHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCcc--chhhhcccccCccchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 395 GAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGA--DPRAAVHMHTFPEIPQSAQYMINLQQAEAESM 472 (726) Q Consensus 395 N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~--~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~ 472 (726) |..++.+.+.+...++|.+.+.-..++..+...+..++++.+..+. .....+.+...+.....+...+..+...+... T Consensus 231 ~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~ 310 (453) T protein:vir:39 231 NKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQT 310 (453) T ss_pred HHHHHHHHHHHHHhhCceeeeecCCCCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 9999999999999999887765444554455555666666554321 12233445544444456677788888888889 Q ss_pred hchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhc Q lcl|NC_013692. 473 TGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDL 552 (726) Q Consensus 473 tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~ 552 (726) |++++.+.+..++ .|+.++...............+.|..+++++++.++.+........ ++. ++ T Consensus 311 s~~p~~~~~~~gn---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~~-----~i 374 (453) T protein:vir:39 311 TMVANISDESFGS---SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKE--------AWK-----DI 374 (453) T ss_pred hCCcccccccccC---ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--------ccc-----cc Confidence 9998876654332 4666676665555566666666667777777666666543221110 011 11 Q ss_pred ccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHH Q lcl|NC_013692. 553 AGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELML 632 (726) Q Consensus 553 ~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~ 632 (726) .-.| +.....-.....+.+..+ ...++.... +. .+..+.+....+.....+.....+..+..+. T Consensus 375 ~v~f----~~~~p~~~~~~a~~~~kl----~g~is~et~---l~---~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~-- 438 (453) T protein:vir:39 375 EYTF----TRNEPKDIKEQAETANIL----MGITSQETA---LS---VISVIPDVQAEMEKIKKEEASTAIFDKDKQP-- 438 (453) T ss_pred eEEe----CCCCCcCHHHHHHHHHHH----hccCChHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHHHHHhccC-- Confidence 1111 111111011111111111 111221111 10 0111111111111110000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 633 LQAQIEAERARAAHYMSGAGLQDSKVGTE 661 (726) Q Consensus 633 ~qaq~e~~~aq~q~~~~~~~~~~~~~~~e 661 (726) -..--+....+.. +| T Consensus 439 ~~~~~~~~~~~~~--------------~e 453 (453) T protein:vir:39 439 SEKGTDTVVPETN--------------EE 453 (453) T ss_pred CCCCCCCCCCCcC--------------CC Confidence 0000000000000 00 No 52 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.65 E-value=9.7e-15 Score=97.40 Aligned_cols=444 Identities=9% Similarity=0.052 Sum_probs=214.1 Q ss_pred CCCccchhcCCCCCCch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCCCCC--cCCCHHHHHHHHH Q lcl|NC_013692. 14 EDGDPSKRLQPEWSNAP-SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEKGKS--AVQPPTIRKQAEW 88 (726) Q Consensus 14 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~grs--~~v~~~v~~~v~~ 88 (726) ++-.+++-. +--++++ +... |....+.|...+....+..+||.|.-+.. ++..++++ +++.+..+..|+. T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~----i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~ 75 (452) T protein:vir:36 1 MKYKPPKLM-TFSKDEPITVEV----VTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDT 75 (452) T ss_pred CcccCceeE-EcCCccCCCHHH----HHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHH Confidence 222222222 1112222 2223 33334467777777788999999865432 22334443 6777888888887 Q ss_pred HHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecc Q lcl|NC_013692. 89 RYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQV 168 (726) Q Consensus 89 ~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~ 168 (726) ....| ||.+ +.|.+ +|. ...+.|+.+|. .|+.-..+...+++++.+|.+.+.+||+.. T Consensus 76 ~~~~l----~g~~--~~~~~---~d~----~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-------- 133 (452) T protein:vir:36 76 FTGYF----NGIP--VKKSH---SDK----EILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQDED-------- 133 (452) T ss_pred Hhhhh----cccC--ceeec---CCh----hHHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEecCC-------- Confidence 77655 4444 34444 222 23456777764 566667788999999999999998877510 Q ss_pred cccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechh Q lcl|NC_013692. 169 VTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYN 248 (726) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~ 248 (726) +.|++..++|. T Consensus 134 ---------------------------------------------------------------------g~~~i~~~~p~ 144 (452) T protein:vir:36 134 ---------------------------------------------------------------------TQTNVVYNSPE 144 (452) T ss_pred ---------------------------------------------------------------------CeeEEEEEccc Confidence 01233445566 Q ss_pred hee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEE Q lcl|NC_013692. 249 NIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHE 326 (726) Q Consensus 249 ~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E 326 (726) +++ ||+.... ..-+. .+.|... +....+| T Consensus 145 ~~~~v~d~~~~~---~~~~~-i~~~~~~---------------------------------------------~~~~~~~ 175 (452) T protein:vir:36 145 NMFMVYDDTVKQ---EPLFA-VRYGVDE---------------------------------------------DKKLQGE 175 (452) T ss_pred ceEEEEcCCCCC---ceEEE-EEEEEec---------------------------------------------CceEEEE Confidence 653 3332211 11121 1222100 0011223 Q ss_pred EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 327 YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMA 406 (726) Q Consensus 327 ~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~ 406 (726) +|.. +.+ ++....++........|.+.+.+|++.++. ...|.|.+..++++++.+|..++.+.+.+. T Consensus 176 vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~ 242 (452) T protein:vir:36 176 VYTL-----LET---IKISGENDEISFGEGTYNPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVD 242 (452) T ss_pred EEec-----CeE---EEEEEcCCceEEecceeccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 3321 111 111111222222233344457778766643 235889999999999999999999999999 Q ss_pred hcCCCceEeecccccchhhhhhcCCceEeecCccch-hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcc Q lcl|NC_013692. 407 RSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADP-RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGA 485 (726) Q Consensus 407 ~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~-~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~ 485 (726) ..++|.+.+.-..++..+.....+++++.+..++.. ...+.+...+.....+...+..+...+...|++++.+.+..++ T Consensus 243 ~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn 322 (452) T protein:vir:36 243 YFSDQYLTFLGAAVEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGS 322 (452) T ss_pred HhcCceeEeecCCcCchhhhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccC Confidence 999998777544444444455566777777654322 1223444444445666777888888899999999877664433 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccc Q lcl|NC_013692. 486 ALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTA 565 (726) Q Consensus 486 ~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~ 565 (726) .|+.++...............+.|..+++.+++.++.+........ ++.. +...| +.... T Consensus 323 ---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~~~-----i~i~f----~~~~p 382 (452) T protein:vir:36 323 ---SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKD--------SWKD-----IEYTF----TRNEP 382 (452) T ss_pred ---CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--------cccc-----ceEEe----CCCCC Confidence 3666676666666666666667777777777777766554321110 1111 11111 11111 Q ss_pred hHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 566 EEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAA 645 (726) Q Consensus 566 ~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q 645 (726) .-.....+.+..+ ...++.... +. .+....+....++....+.... .+..+. ...-. T Consensus 383 ~d~~~~a~~~~k~----~g~iS~et~---~~---~~~~~~d~~~E~~ri~~E~~~~--------~~~~~~-----~~~~~ 439 (452) T protein:vir:36 383 KDIKEQAETANIL----MGITSQETA---LS---VISVIPDVQAEMEKIKKEEAST--------AIFDKD-----KQPSE 439 (452) T ss_pred cCHHHHHHHHHHH----hccCChHHH---HH---hCCCCCCHHHHHHHHHHHHHHH--------HHHHhh-----ccCCC Confidence 1011111111111 111111110 10 0111111111111110000000 000000 00000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_013692. 646 HYMSGAGLQDSKVGTE 661 (726) Q Consensus 646 ~~~~~~~~~~~~~~~e 661 (726) .-... ......+| T Consensus 440 ~~~~~---~~~~~~~e 452 (452) T protein:vir:36 440 KGTDT---VVSETNEE 452 (452) T ss_pred Ccccc---cCccccCC Confidence 00000 00000000 No 53 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.62 E-value=1.6e-13 Score=90.70 Aligned_cols=440 Identities=10% Similarity=0.038 Sum_probs=200.8 Q ss_pred CCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCCCC--CcCCCHHHHHHHHHH Q lcl|NC_013692. 14 EDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEKGK--SAVQPPTIRKQAEWR 89 (726) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~gr--s~~v~~~v~~~v~~~ 89 (726) ++..+.|-..-.=..+.+-. +|....+.|...+....+..+||.|.-+.. .+..+|+ .+++.+.....|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~ 76 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDK----VVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTF 76 (453) T ss_pred CccccceeeeccccccCCHH----HHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHh Confidence 55555555522222222222 333333456667767788999999765421 2233443 478888888888877 Q ss_pred HHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccc Q lcl|NC_013692. 90 YSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVV 169 (726) Q Consensus 90 ~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~ 169 (726) ...| ||.. +.|.+ +|.. ..+.++.+| ..|+.-..+..++++++++|.+.+.+|++.. T Consensus 77 ~~~l----~g~~--~~~~~---~d~~----~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------- 133 (453) T protein:vir:73 77 VGYF----NGIP--IKKTH---DDKS----VLEAMQLFD-NLNDMEDEESELAKIACVYGRAYELMYQNES--------- 133 (453) T ss_pred hhhh----cccC--ceeec---CChH----HHHHHHHHH-HhcChhHHHHHHHHHHHhcCeEEEEEEeCCC--------- Confidence 7555 4433 34444 2222 334566655 4577777888999999999999998877510 Q ss_pred ccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhh Q lcl|NC_013692. 170 TYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNN 249 (726) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~ 249 (726) +.|.+..++|.+ T Consensus 134 --------------------------------------------------------------------~~~~i~~~~p~~ 145 (453) T protein:vir:73 134 --------------------------------------------------------------------TESEVIYCSPLN 145 (453) T ss_pred --------------------------------------------------------------------CceEEEEEcccc Confidence 002233455555 Q ss_pred eee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEE Q lcl|NC_013692. 250 IVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEY 327 (726) Q Consensus 250 ~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~ 327 (726) ++. |+.. .. ..++..++.. +.+ .....+. T Consensus 146 ~~~v~dd~~----~~-~~~~~i~~~~-----------~~~---------------------------------~~~~~~v 176 (453) T protein:vir:73 146 VFMVYDDSI----KQ-KPLFAVYYGF-----------DEE---------------------------------GNLSGTV 176 (453) T ss_pred eEEEEeCCC----Cc-eeEEEEEEEE-----------ecC---------------------------------ceEEEEE Confidence 433 2211 11 1222222210 000 0001122 Q ss_pred EEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 328 WGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMAR 407 (726) Q Consensus 328 w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~ 407 (726) |.. +.+ +.....++........|.+.+.+|++.++. ..+|.|.+..++++++.+|..++.+.+.+.. T Consensus 177 yt~-----~~i---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~ 243 (453) T protein:vir:73 177 YTL-----LET---ISITGKAGEVKFGESTYNVYSDLPIVEYNF-----NEERQSIFEPVHSLINSYNKVTSEKANDVEY 243 (453) T ss_pred EeC-----CeE---EEEEecCCceEEccceeccCCceeEEEecC-----CCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Confidence 211 100 000111111111223344447788776643 3468899999999999999999999999999 Q ss_pred cCCCceEeecccccchhhhhhcCCceEeec---Cc---cc-hhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhh Q lcl|NC_013692. 408 SANGQVGVMKGALDVTNRRRFDRGENYEFN---PG---AD-PRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNA 480 (726) Q Consensus 408 ~~~~~~~~~~gav~~~d~~~~~~g~vi~~~---~~---~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~ 480 (726) .++|.+.+--..++..+......+.++... ++ .. ...-+.+...+.....+...++.+...+-..|++++++. T Consensus 244 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~ 323 (453) T protein:vir:73 244 FSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISD 323 (453) T ss_pred hccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCc Confidence 988887663222332222223233222111 11 00 111234444444455667778888888889999988766 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceee Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKL 560 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i 560 (726) +..++ .|+.++...............+.|..+++++++.++.+........ ++. ++ .+.. T Consensus 324 ~~~gn---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~~-----~i----~v~f 383 (453) T protein:vir:73 324 ENFGN---SSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKD--------AWK-----DI----EYTF 383 (453) T ss_pred ccccC---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--------ccc-----cc----eEEe Confidence 54332 3666666655555555555666666676666665554432111000 010 11 1111 Q ss_pred ecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 561 DISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAE 640 (726) Q Consensus 561 ~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~ 640 (726) +.....-..+..+.+..+ ..-++... .+ ..+....+....++. ++++.+.. T Consensus 384 ~~~~p~~~~~~a~~~~k~----~giis~et---~~---~~~~~~~d~~~E~~r-------------------i~~E~~~~ 434 (453) T protein:vir:73 384 TRNEPKDIKEQAETANIL----KGITSEET---AL---SVISVIPDVQAEMEK-------------------IKKKKLLQ 434 (453) T ss_pred CCCCCCCHHHHHHHHHHH----hccCcHHH---HH---HhCCCCCCHHHHHHH-------------------HHHHHHHH Confidence 111111111111111111 11111111 01 011111111110110 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 641 RARAAHYMSGAGLQDSKVGTEQAKARAL 668 (726) Q Consensus 641 ~aq~q~~~~~~~~~~~~~~~eqaq~~q~ 668 (726) ..+.+ . .......+....+ T Consensus 435 ~~~~~----~-----~~~~~~~~~~~~~ 453 (453) T protein:vir:73 435 LSLTR----T-----SNLVRMKQMRGNL 453 (453) T ss_pred HHHHH----h-----ccCCcchhhhcCC Confidence 00000 0 0000000000000 No 54 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.61 E-value=7.1e-14 Score=92.68 Aligned_cols=421 Identities=11% Similarity=0.040 Sum_probs=198.4 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--CCCCCCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCc Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG--KPKTEKGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPN 102 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~--~~~~~~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~ 102 (726) .....|..| |+ .|...+....+..+||.|.-+. .++..+++ -+++.+..+..|+.....| ||.. T Consensus 1 l~~~~l~~~---i~----~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~- 68 (429) T protein:vir:98 1 MTKDLLSEL---IQ----KHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYF----IGVP- 68 (429) T ss_pred CCHHHHHHH---HH----HHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhh----cccC- Confidence 222222222 22 3556666677889999876432 12333443 3788888888888887666 4433 Q ss_pred eEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHH Q lcl|NC_013692. 103 IFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELA 182 (726) Q Consensus 103 ~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (726) +.|.+ +| +.....++.+|. .|+.-..+..++++++++|.+++.+|++.. T Consensus 69 -~~~~~---~~----~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---------------------- 117 (429) T protein:vir:98 69 -VQTSH---EN----KQVSNYLELLDG-YNDQDDNNAELSKICSIYGHGYELVFNDEN---------------------- 117 (429) T ss_pred -ceeec---CC----hHHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEecCC---------------------- Confidence 34443 22 134446777654 566667788899999999999998766410 Q ss_pred HHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee--eCCCCCCch Q lcl|NC_013692. 183 QIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDF 260 (726) Q Consensus 183 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~ 260 (726) +.|.+..++|.+++ ||..... T Consensus 118 -------------------------------------------------------g~~~~~~~~p~~~~~v~dd~~~~-- 140 (429) T protein:vir:98 118 -------------------------------------------------------AEAGITYLTPLEAFIVYDDSIRQ-- 140 (429) T ss_pred -------------------------------------------------------CcEEEEEEcccceEEEEeCCCCC-- Confidence 01223445566553 2221111 Q ss_pred hhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEE Q lcl|NC_013692. 261 SKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLH 340 (726) Q Consensus 261 ~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~ 340 (726) -...+.+.+.+.+ .+..+++|.. +.+ . T Consensus 141 --~~~~~i~~~~~~~---------------------------------------------~~~~~~~~~~-----~~~-~ 167 (429) T protein:vir:98 141 --KPLFAVRYFYNKG---------------------------------------------GVLEGSYSDA-----SNI-T 167 (429) T ss_pred --ceEEEEEEEEecC---------------------------------------------ceEEEEEEeC-----ceE-E Confidence 1112222221100 0112222211 000 0 Q ss_pred EEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc Q lcl|NC_013692. 341 PIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL 420 (726) Q Consensus 341 ~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav 420 (726) .+ ....++..+ .+..|.+.+.+|++.++ ...+|.|.++.++++++.+|.+++.+.+.+...++|.+.+.-... T Consensus 168 ~~-~~~~~~~~~-~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~ 240 (429) T protein:vir:98 168 YF-KDGEKGIEI-GESEPHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAEL 240 (429) T ss_pred EE-EecCCceEe-cccccccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCC Confidence 00 000111111 12234444677776653 345789999999999999999999999999999998877642223 Q ss_pred cchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHH Q lcl|NC_013692. 421 DVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDA 500 (726) Q Consensus 421 ~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~ 500 (726) +.........++++.+..+..-...+.+...+.....+...+..+...+...|++++.+.+..+ +.|+.++...... T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g---n~Sg~Al~~~~~~ 317 (429) T protein:vir:98 241 DDETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFG---TASGIALRYRLQA 317 (429) T ss_pred CcchhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc---cchHHHHHHHHHH Confidence 3333344455667766543221222344444444455667788888999999999887655333 2366666665555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccc--hHHHHHHHHHHHH Q lcl|NC_013692. 501 ASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTA--EEDNAKVNDLTFM 578 (726) Q Consensus 501 ~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~--~~~~~~~~~l~~l 578 (726) .........+.|..+++++++.++.++...... .++ . ++.+.-... .-.......+..+ T Consensus 318 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~--------~d~-----~------~i~v~f~~~~p~~~~~~a~~~~kl 378 (429) T protein:vir:98 318 MDNLAKTKERKFMSGMNRRYKLIASYPTSKIGP--------KDW-----I------GIKYKFTRNLPANLLEESQIAGNL 378 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc--------ccc-----c------cceEEeCCCCCcCHHHHHHHHHHH Confidence 555555666666667666666555543211100 011 1 111111111 1111111111111 Q ss_pred HHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 579 LQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKV 658 (726) Q Consensus 579 ~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~ 658 (726) +..++... .+. .+..+.+....+...+.+.... ++.+ ......+.... T Consensus 379 ----~g~is~et---~~~---~l~~v~d~~~E~~ri~~E~~~~---------------~~~~-------~~~~~~~~~~~ 426 (429) T protein:vir:98 379 ----AGIVSEET---QVG---VLSIVENPQKEIERKNSDKSTL---------------ISRQ-------AGGLNGQNTTT 426 (429) T ss_pred ----hccCchHH---HHH---hCCCCCCHHHHHHHHHHHHHHH---------------HHHH-------HhhhcCCCCCC Confidence 11111111 110 1111111111111100000000 0000 00000000000 Q ss_pred HHH Q lcl|NC_013692. 659 GTE 661 (726) Q Consensus 659 ~~e 661 (726) -.+ T Consensus 427 ~~~ 429 (429) T protein:vir:98 427 ILE 429 (429) T ss_pred CCC Confidence 000 No 55 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.58 E-value=5.8e-14 Score=93.16 Aligned_cols=456 Identities=7% Similarity=0.003 Sum_probs=210.8 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-----------CCCCCCCcCCC Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-----------KTEKGKSAVQP 79 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~~~grs~~v~ 79 (726) |-|+-.+.+--++.-.........+...|..-...|...+....+..+||.|.-.... ...+-..+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 5554444443333333444444445555666566777777778889999988632211 11112346788 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeee Q lcl|NC_013692. 80 PTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNY 159 (726) Q Consensus 80 ~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~ 159 (726) +..+..|+.....| ||.. +.|.+ +|.+.. ++++..+ .|+....+...+++++++|.+.+.+|++. T Consensus 81 n~~~~ivd~~~~~l----~g~~--~~~~~---~d~~~~----~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~ 145 (472) T protein:vir:93 81 NFHANLVDQKVSYI----VGKP--IAFKH---TDDEVV----KRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDE 145 (472) T ss_pred chHHHHHHHHhhhh----cccC--eeecc---CChHHH----HHHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEECC Confidence 88888888888666 3433 34433 333333 3555554 35556777788999999999998877641 Q ss_pred eeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeecc Q lcl|NC_013692. 160 QSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENH 239 (726) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 239 (726) . +. T Consensus 146 d-----------------------------------------------------------------------------~~ 148 (472) T protein:vir:93 146 E-----------------------------------------------------------------------------GE 148 (472) T ss_pred C-----------------------------------------------------------------------------Cc Confidence 0 01 Q ss_pred ceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCc Q lcl|NC_013692. 240 PTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDK 317 (726) Q Consensus 240 p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (726) |++..++|.+++. |++.. .+..+. .+.|.+.++ ..+.. +. T Consensus 149 ~~i~~~~p~~~~~i~d~~~~---~~~~~~-ir~~~~~~~-------~~~~~-------------------------~~-- 190 (472) T protein:vir:93 149 FKLFRVPAEQGIPIWTDKEH---EELEAF-IRMYKLENE-------TKVEY-------------------------WD-- 190 (472) T ss_pred eEEEEEcccceEEEEcCCCC---CceEEE-EEEEEeecc-------eeEEE-------------------------Ee-- Confidence 3345677777654 33221 222222 222211100 00000 00 Q ss_pred CCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHH Q lcl|NC_013692. 318 SRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAV 397 (726) Q Consensus 318 ~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~ 397 (726) ...+..+. ..++..... .....+...+. ..|.+.+.+|++.+.. ..+|.|.++.++++++.+|.+ T Consensus 191 -~~~~~~~~------~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~ 255 (472) T protein:vir:93 191 -KVTVNYYV------YENGSLIPD-YSNNLENSKTH--FSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRR 255 (472) T ss_pred -cCeEEEEE------EecCeeeec-ccccccccccc--cccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHH Confidence 00111111 111111000 00001111122 2334447778776654 347899999999999999999 Q ss_pred HHHHHHHHHhcCCCceEeecccc-cchhh--hhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhc Q lcl|NC_013692. 398 TRGMIDTMARSANGQVGVMKGAL-DVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTG 474 (726) Q Consensus 398 ~~~~~d~l~~~~~~~~~~~~gav-~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tG 474 (726) ++.+.+.+...++|.+.+ .|.- ..... .....++++.+..+++ +.+...+.........+..+...+...++ T Consensus 256 ~s~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 330 (472) T protein:vir:93 256 LSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQ 330 (472) T ss_pred HHHHHHHHHHhcCceeEe-ecCCcccchhhHHHHhhccccccCCCCc----ceeEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 999999999999887765 3432 11111 1233445555554432 33444444456677788888899999999 Q ss_pred hHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccc Q lcl|NC_013692. 475 VKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAG 554 (726) Q Consensus 475 v~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~ 554 (726) +++.+.+..++ +.||.|+...............+.|..+++.+++.++.++-... ++..+ T Consensus 331 ~p~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-----------~~~~i------- 390 (472) T protein:vir:93 331 AVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----------EHKDV------- 390 (472) T ss_pred CCCCCcccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-----------cccee------- Confidence 99887654332 24666666666555566666666677777776666655442111 11111 Q ss_pred ccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHH Q lcl|NC_013692. 555 NFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQ 634 (726) Q Consensus 555 ~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~q 634 (726) .+..+.....-.....+....+ ..-++.... +. .+....+....++....+.....++.... .. T Consensus 391 --~v~f~~~~p~~~~~~~~~~~k~----~giis~et~---l~---~l~~~~d~~~E~~ri~~E~~~~~~~~~~~----~~ 454 (472) T protein:vir:93 391 --DISFNYNKVANTELQVQTAQQS----MGIVSHETV---LE---NHPFVEDLQAELERIEQEQMEYNKQLPNL----DD 454 (472) T ss_pred --eEEeCCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHHhccCc----Cc Confidence 1111111110001111111111 011111110 00 00111111111111100000000000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 635 AQIEAERARAAHYMSGAGLQD 655 (726) Q Consensus 635 aq~e~~~aq~q~~~~~~~~~~ 655 (726) ....... +.........+ T Consensus 455 ~~~d~~~---~~~~~~~~~~e 472 (472) T protein:vir:93 455 GGADGAQ---QQERSNNKESE 472 (472) T ss_pred ccCCCCC---CCCCCCcccCC Confidence 0000000 00000000000 No 56 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.55 E-value=5.1e-13 Score=87.99 Aligned_cols=434 Identities=9% Similarity=0.057 Sum_probs=205.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-----------------CCCCCC--CcCCCHHHHHHHHHHH Q lcl|NC_013692. 30 PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-----------------KTEKGK--SAVQPPTIRKQAEWRY 90 (726) Q Consensus 30 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-----------------~~~~gr--s~~v~~~v~~~v~~~~ 90 (726) -.+..+.+.|.+-...|...+....+..+||.|.-+... +...++ .+++.+..+..|+..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 556666777777777777777778889999987642210 001111 2577777777787777 Q ss_pred HHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 91 SSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 91 ~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) ..| ||.+ +.|.+ +|.+ ...+++..+. |+....+....++++.+|.+.+.+||+.+. T Consensus 81 ~yl----~G~p--~~~~~---~~~~----~~~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~--------- 136 (471) T protein:vir:10 81 AYA----LTYP--PTFDV---DDKK----VNDMIVDVLG--DDYERISKQLCVNAGNAGIAWLHVWKDASD--------- 136 (471) T ss_pred hhh----cccC--ceecc---CChH----HHHHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEEEeeCCC--------- Confidence 555 4433 34433 3332 2235555543 555566778889999999999988876210 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) +.+.+..++|.++ T Consensus 137 -------------------------------------------------------------------g~~~~~~~~p~~~ 149 (471) T protein:vir:10 137 -------------------------------------------------------------------NSFRYACVDSKEV 149 (471) T ss_pred -------------------------------------------------------------------CeeEEEEEcccce Confidence 0123455666665 Q ss_pred e--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 251 V--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 251 ~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) + ||++. .+-...+.+.|.+....- ...+..+|+| T Consensus 150 ~~i~d~~~----~~~~~~~ir~~~~~~~~~----------------------------------------~~~~~~~~vy 185 (471) T protein:vir:10 150 IPIYSKSL----DKKSIGVLRVYSSIDETD----------------------------------------GKNYTVYEYW 185 (471) T ss_pred EEEEcCCC----CCceEEEEEEEEeeccCC----------------------------------------CceeEEEEEE Confidence 4 33322 111222333332221100 0011222222 Q ss_pred EE-----eecCCCceEEE-------EEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHH Q lcl|NC_013692. 329 GY-----YDIHGDGVLHP-------IVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGA 396 (726) Q Consensus 329 ~~-----~~~~~~g~~~~-------~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~ 396 (726) .. +...+.+.... .......+........|.+.+.+|++.+.. ...|.|.+..++++++.+|. T Consensus 186 ~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~ 260 (471) T protein:vir:10 186 NDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDK 260 (471) T ss_pred eCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHH Confidence 10 00000000000 000011122222333344456777766544 44678999999999999999 Q ss_pred HHHHHHHHHHhcCCCceEeeccc-cc--chhhhhhcCCceEeecCcc-chhhhcccccCccchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 397 VTRGMIDTMARSANGQVGVMKGA-LD--VTNRRRFDRGENYEFNPGA-DPRAAVHMHTFPEIPQSAQYMINLQQAEAESM 472 (726) Q Consensus 397 ~~~~~~d~l~~~~~~~~~~~~ga-v~--~~d~~~~~~g~vi~~~~~~-~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~ 472 (726) ++|.+.+.+...++|.+.+ .|. .. .........++.+.+...+ .....+.+...+.........+..+...+-.. T Consensus 261 ~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 339 (471) T protein:vir:10 261 VFSGFVNDTDDVQEVIFVL-TNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFIS 339 (471) T ss_pred HHHHHHHHHHHhhCceeee-ecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHH Confidence 9999999999999886655 443 11 1112334455666554322 12223455554544566778888888999999 Q ss_pred hchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhc Q lcl|NC_013692. 473 TGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDL 552 (726) Q Consensus 473 tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~ 552 (726) |++++.+.+..++ .|+.++..............-+.|..+++++++.++.++..+ ++.. T Consensus 340 s~tp~~~~~~~gn---~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~------------d~~~------ 398 (471) T protein:vir:10 340 GQGVNPETDKLGN---SSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS------------DKLK------ 398 (471) T ss_pred hCCcCCCcccccC---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------CCce------ Confidence 9988876654333 356667666666666666666666667666666555543211 1111 Q ss_pred ccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHH Q lcl|NC_013692. 553 AGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELML 632 (726) Q Consensus 553 ~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~ 632 (726) ..+..+.....-..+..+.+..+ ...+.... .+ . .+..+.+....++....+.....++.....-.. T Consensus 399 ---i~i~f~~~~p~n~~e~~~~~~kl----~g~iS~et---~~-~--~~p~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 465 (471) T protein:vir:10 399 ---IKQTWTRNSINNDTEMAQVVSTL----ATITSREN---VA-K--SNPIVEDWQDELRLQKAEQEGRSEKLYDMEEVE 465 (471) T ss_pred ---eEEEeCCCCCCCHHHHHHHHHHH----hccCchHH---HH-H--hCCCCCCHHHHHHHHHHHHHHHHhcccccCCCC Confidence 11111111111111111111111 11111111 00 0 011111100000000000000000000000000 Q ss_pred HHHHHH Q lcl|NC_013692. 633 LQAQIE 638 (726) Q Consensus 633 ~qaq~e 638 (726) ...+.+ T Consensus 466 ~~~e~~ 471 (471) T protein:vir:10 466 HESEVE 471 (471) T ss_pred CccccC Confidence 000000 No 57 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.54 E-value=7e-13 Score=87.22 Aligned_cols=422 Identities=7% Similarity=0.017 Sum_probs=191.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCCC----CCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCc Q lcl|NC_013692. 38 DYQEAKQVTDEKITQINRWLDYMHVRGEGK----PKTEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTW 111 (726) Q Consensus 38 ~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~----~~~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~ 111 (726) -|. ..+..+....++..+||.|.-... ....++++ +++.+..+..|+.....| ||.+. .|..-.. T Consensus 1 ~~~---~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~~--~~~~~~~ 71 (440) T protein:vir:95 1 MLA---AFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYV----IGNPV--SIGVMEG 71 (440) T ss_pred Chh---hHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhhe----eccCc--eEeeCCC Confidence 111 122233334566678998764321 11223443 678888888888766544 55553 3433333 Q ss_pred chHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhh Q lcl|NC_013692. 112 EDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQI 191 (726) Q Consensus 112 ~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 191 (726) +|.+.. ..+..+| ..|+.-..+..++++++++|.+.+.+|++.. T Consensus 72 ~~~~~~----~~l~~~~-~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~------------------------------- 115 (440) T protein:vir:95 72 GSADQL----STIKDIE-WQNDINALNSDLAFDASVYGRAYEYHFRDKD------------------------------- 115 (440) T ss_pred ccHHHH----HHHHHHH-HhcCHhHHHHHHHHHHhhcCeEEEEEEecCC------------------------------- Confidence 333332 2454444 4666666777899999999999998876410 Q ss_pred hhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEE Q lcl|NC_013692. 192 REESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIET 269 (726) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~ 269 (726) +.|.+..++|.++++ ||... ....+.+ + T Consensus 116 ----------------------------------------------~~~~i~~~~p~~~~~~~d~~~~---~~~~~~i-~ 145 (440) T protein:vir:95 116 ----------------------------------------------KVDRVVLISPLEMFVIRDLTVE---QNIIAAV-H 145 (440) T ss_pred ----------------------------------------------CceEEEEEcccceEEEEcCCCC---CceEEEE-E Confidence 012334566666554 34321 1122222 2 Q ss_pred EeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECC Q lcl|NC_013692. 270 FESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGA 349 (726) Q Consensus 270 ~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~ 349 (726) .|... +.... .+ | ..+++..|+++. .++ . T Consensus 146 ~~~~~----------~~~~~-----~v-----------------y---t~~~~~~~~~~~----~~~------------~ 174 (440) T protein:vir:95 146 LPIYA----------DKVNM-----TV-----------------Y---TKDKVITYKPYS----NNS------------V 174 (440) T ss_pred EEEec----------CceEE-----EE-----------------E---eCCeEEEEEEec----CCc------------c Confidence 22100 00000 00 0 011122222211 000 0 Q ss_pred EEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc---c--cchh Q lcl|NC_013692. 350 VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA---L--DVTN 424 (726) Q Consensus 350 ~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga---v--~~~d 424 (726) ........|.+.+.+|++.++. ..+|.|.++.++++++.+|..++.+.+++...++|.+++ .|. . +..+ T Consensus 175 ~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~-~g~~~~~~~~~e~ 248 (440) T protein:vir:95 175 RLVVDDVKKHSYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLV-KGDLDGIKLSPED 248 (440) T ss_pred ceeecceeeccCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeee-ecccccCCCCccc Confidence 1111222233346677766543 446889999999999999999999999999998887665 332 1 2222 Q ss_pred hhhhcCCceEeecCcc-----chhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHH Q lcl|NC_013692. 425 RRRFDRGENYEFNPGA-----DPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALD 499 (726) Q Consensus 425 ~~~~~~g~vi~~~~~~-----~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~ 499 (726) .......+.+.+..+. .....+.+...+.....+...++.+...+...|++++.+.+..++ +.||.|+..... T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~ 326 (440) T protein:vir:95 249 AAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNS--TSSGIALLYKMI 326 (440) T ss_pred hhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHH Confidence 2233333333332111 111223444444444567778889999999999999887664322 236666776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHH Q lcl|NC_013692. 500 AASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFML 579 (726) Q Consensus 500 ~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~ 579 (726) ............|..+++++++.+..++...... + ++.. +..+........-.....+.+..+ T Consensus 327 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~---------~---~~~~----~v~i~f~~~~p~~~~~~ad~~~kl- 389 (440) T protein:vir:95 327 GLEQVRKDKETYFTKALRRRYELISNIHKAINGP---------V---IEAN----KLTFTFHPNIPQDVWTEIKAYIEA- 389 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc---------c---cccc----cceEEeCCCCCCCHHHHHHHHHHH- Confidence 6666666666777777777776665554321110 0 0000 011111111111111122222211 Q ss_pred HHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 580 QTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVG 659 (726) Q Consensus 580 q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~ 659 (726) ...++.... +. .+...... ..+.....+.. ....+. ....... .-. T Consensus 390 ---~g~iS~et~---~~---~l~~~d~~-~E~~ri~~E~~--------------~~~~~~-------~~~~~~~---~~~ 435 (440) T protein:vir:95 390 ---GGEISQETL---ME---NASFTDYK-TEHSRILKQGG--------------SSDLEI-------GQIVGDA---DVG 435 (440) T ss_pred ---hccCcHHHH---HH---hCCCCCcH-HHHHHHHHHHH--------------HhhhhH-------HhhccCC---CCC Confidence 111221111 11 11111000 00000000000 000000 0000000 000 Q ss_pred HHHHH Q lcl|NC_013692. 660 TEQAK 664 (726) Q Consensus 660 ~eqaq 664 (726) .+.++ T Consensus 436 ~~~~e 440 (440) T protein:vir:95 436 QADTE 440 (440) T ss_pred CcCCC Confidence 00000 No 58 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.53 E-value=9.3e-13 Score=86.54 Aligned_cols=452 Identities=8% Similarity=0.024 Sum_probs=205.2 Q ss_pred CCCc-cchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC--C------- Q lcl|NC_013692. 1 MADV-DEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPK--T------- 70 (726) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~--~------- 70 (726) |++. -+.-|+.=..... +..--.... ...|..-.+.|...+..+.++.+||.|.-+.... . T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~--------~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~ 71 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVE-QIKPQYETQ--------EEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEI 71 (468) T ss_pred CccccCCcCceeehheee-cccccccCc--------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccc Confidence 6654 1111111111111 111111111 1122333345666777788999999987432111 1 Q ss_pred --CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 --EKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 --~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) .+..-+++.+..+..|+.....| ||.+ +.|.+ +|.+..+ .+..+| +++....+...+++++.+ T Consensus 72 ~~~~~~~ki~~n~~~~Iv~~~~~~l----~g~p--~~~~~---~d~~~~~----~l~~~~--~n~~~~~~~~~~~~~~~~ 136 (468) T protein:vir:96 72 DPFKPDWRMYTNYHQNLVDQKVAYA----VANP--VTYGT---EDEKSLK----TIQEVL--NHKWDDKLVDILTAASNK 136 (468) T ss_pred cccccccccccchHHHHHHHHHhhh----ccCC--ceecc---CChHHHH----HHHHHH--hcCHHHHHHHHHHHHhhc Confidence 11223688888888888777655 4433 33432 3433333 444444 356667778899999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+.+.+||+.. T Consensus 137 G~~~~~v~~d~~-------------------------------------------------------------------- 148 (468) T protein:vir:96 137 GVEWIQPYVDEQ-------------------------------------------------------------------- 148 (468) T ss_pred CeEEEEEEEcCC-------------------------------------------------------------------- Confidence 999998887510 Q ss_pred eecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++..++|.+++ ||+.. ..+..+.+ +.|...+. ..++. T Consensus 149 ---------~~~~i~~~~p~~~~~v~~~~~---~~~~~~~i-r~~~~~~~-------~~~~~------------------ 190 (468) T protein:vir:96 149 ---------GEFKTFRVPAEQAIPIWTNKE---RDELKAFI-RLYELDGG-------ERVEY------------------ 190 (468) T ss_pred ---------CceEEEEEcccceEEEEcCCC---CCceEEEE-EEEEecCc-------eEEEE------------------ Confidence 01234556666665 33322 22322322 22210000 00000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) | ..+++..|.++. +..+.....-.............|.+.+++|++.+.. ...|.|.+.. T Consensus 191 -------~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e~ 250 (468) T protein:vir:96 191 -------W---TANDVTFYELKD-----GQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKN-----NPQEVSDLFM 250 (468) T ss_pred -------E---eCCeEEEEEEcC-----CceeecccccccccccceeeccccccCCcccEEEecC-----CCCCCCchHH Confidence 0 011222222211 0100000000000011112233455567888877754 3458899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh--hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN--RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINL 464 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d--~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~ 464 (726) ++++++.+|..+|.+.+.+...++|.+.+.-...+... ......++++.+..... +.+.+...+.-...+...++. T Consensus 251 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~--~~~~~l~~~~~~~~~~~~~~~ 328 (468) T protein:vir:96 251 YKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGS--GGVDTIQIDVPVQSAKEYLDM 328 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCC--CcceEEeecCChHHHHHHHHH Confidence 99999999999999999999988887765422222111 12234566676654322 224444444445677778888 Q ss_pred HHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccc Q lcl|NC_013692. 465 QQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHF 544 (726) Q Consensus 465 ~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~ 544 (726) +...+...|++++.+.+..++ +.||.++.................|..+++++++.+++++ ... T Consensus 329 l~~~I~~~s~~p~~~~~~~~~--n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----g~~---------- 392 (468) T protein:vir:96 329 LRDYVIEFGQGVDFQQDKFGN--SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----KLS---------- 392 (468) T ss_pred HHHHHHHHhCccccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC---------- Confidence 999999999999877553322 2466666666666666666666666677776666555542 211 Q ss_pred eecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhh Q lcl|NC_013692. 545 VDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQ 624 (726) Q Consensus 545 v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq 624 (726) ++..++. +..+.....-.....+ .+...+ -+.... .+. .+....+....++....+..... T Consensus 393 --~d~~~i~----i~f~~~~p~d~~e~a~----~~~~~g-~iS~et---~i~---~l~~v~D~~~E~~ri~~E~~~~~-- 453 (468) T protein:vir:96 393 --IKVQDVE----ITFNFNVMVNELEQSQ----IGVNSQ-YLSKET---VVT---NHPWVDDPVAEMERIDQEELALP-- 453 (468) T ss_pred --cccceee----EEecCCCCcCHHHHHH----HHHhcC-CCchHH---HHH---hCCCCCCHHHHHHHHHHHHHHHH-- Confidence 1111111 1111111111111111 111111 011110 010 01111111111111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKV 658 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~ 658 (726) ..+... -........ T Consensus 454 -------~~~~~~------------~~~~~~~~~ 468 (468) T protein:vir:96 454 -------SIEEGL------------NGKENNEPT 468 (468) T ss_pred -------HHhhcc------------CCCCCCCCC Confidence 000000 000000000 No 59 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.53 E-value=1.2e-12 Score=86.00 Aligned_cols=464 Identities=14% Similarity=0.099 Sum_probs=206.4 Q ss_pred CCCccchhc-C----CCCCCchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCC-CCCCCCC----CCcCCCHHH Q lcl|NC_013692. 14 EDGDPSKRL-Q----PEWSNAPSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEG-KPKTEKG----KSAVQPPTI 82 (726) Q Consensus 14 ~~~~~~~~~-~----~~~~~~~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~g----rs~~v~~~v 82 (726) +.....-+- . ..+---++|..+. ++-+ .-...++..+..|..||.|.... +-....| |-+...+.. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~---~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~ 77 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKIT---DDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMA 77 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhh---cccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchH Confidence 222111110 0 0010111122111 1111 12334445678899999876531 1111122 222333443 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSR 162 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~ 162 (726) +..++ -+++| +|+-..-|.+. +|. .+..+|+.++. .|+....++.++.+|+..|.+++|+||+.. T Consensus 78 ~~i~~-~~A~l---v~~e~~~i~v~----~~~----~~~e~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-- 142 (508) T protein:vir:15 78 KTAAR-RIASV---VFNEKAEIHVK----DNN----EADKFLNDVLE-DNDFKNKFEEALEKGVALGGFAMRPYIDGN-- 142 (508) T ss_pred HHHHH-HHHhh---hhCCCceEEeC----Cch----HHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEEeCC-- Confidence 33333 23333 34433233332 222 23346777763 666777899999999999999999999710 Q ss_pred eEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeecccee Q lcl|NC_013692. 163 TVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTV 242 (726) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i 242 (726) .+.| T Consensus 143 ----------------------------------------------------------------------------~~~i 146 (508) T protein:vir:15 143 ----------------------------------------------------------------------------HIKI 146 (508) T ss_pred ----------------------------------------------------------------------------eeEE Confidence 0234 Q ss_pred eeechhheeeC-CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 243 QVCDYNNIVID-PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 243 ~~v~p~~~~~d-p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) ++|+|..|++= .+. .++..|-|+..... .+-.. .+. T Consensus 147 ~~v~ad~~~P~~~d~-~~~~~~af~~~~~~---~~~~~---------------------------------------~~~ 183 (508) T protein:vir:15 147 AWVRADQFYPLQSNT-NDISEAAIASRTQR---TESNQ---------------------------------------TKY 183 (508) T ss_pred EEEcCCeeEEEEEcC-CCeEEEEEEEEEEe---ecCCC---------------------------------------ceE Confidence 55666666631 111 12344443322111 00000 001 Q ss_pred EEEEEEEEEeecCCCceEEEEEEEEEC------CEEEEeccC-------C---CC-CCccceEEeee----eeecCcccC Q lcl|NC_013692. 322 LVVHEYWGYYDIHGDGVLHPIVATWVG------AVMIRMEEN-------P---FP-DKRIPYVVVNY----IPRKRDLYG 380 (726) Q Consensus 322 v~v~E~w~~~~~~~~g~~~~~~~~~~g------~~~l~~~~~-------P---~~-~~~~Pf~~~~~----~~~~~~~~g 380 (726) ++.+|+|...+ +|.|..+ ..+|-+ +..+..... | +. ..+.||+.+.+ ....++.+| T Consensus 184 yt~lE~h~~~~-~~~~~I~--n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG 260 (508) T protein:vir:15 184 YTLLEFHQWQD-NGSYQIT--NELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLG 260 (508) T ss_pred EEEEEEEEEec-CcceEEE--EEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcC Confidence 22222221110 0111110 000000 000000000 0 01 12334554433 223468899 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCc-eEe-ecCccchhhhcccccCccchh Q lcl|NC_013692. 381 ESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGE-NYE-FNPGADPRAAVHMHTFPEIPQ 456 (726) Q Consensus 381 ~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~-vi~-~~~~~~~~~~i~~~~~~~~~~ 456 (726) .|++..+++.++.+|..++++.+.+ ..+.+++.++++.+..+.. ..+.++. ++. ++.+......|...++..... T Consensus 261 ~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e 339 (508) T protein:vir:15 261 LGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTV 339 (508) T ss_pred CchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccChH Confidence 9999999999999999999999998 6788899999988854322 2233332 222 222222223455555443444 Q ss_pred HHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeE Q lcl|NC_013692. 457 SAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEV 536 (726) Q Consensus 457 ~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~ 536 (726) .+...++.+...+....|++.-..|..+.. ..||+++....+..-+....+.+.+..+++++.+.++.+..-+.--.- T Consensus 340 ~~~~~~~~~l~~~~~~~gls~~~f~~~~~~-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~- 417 (508) T protein:vir:15 340 QYKDAIDHFIKEFEVQIGLSTGTFSYSNDG-VKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDD- 417 (508) T ss_pred HHHHHHHHHHHHHHHHhCCCchhcccccCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 566677777788888999988777765543 258888877666666777778888888999998888887654321110 Q ss_pred EEEecccceecchhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh-----hhhhh Q lcl|NC_013692. 537 VRITNEHFVDIRRDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKM-----PDFAK 609 (726) Q Consensus 537 iRi~~~~~v~v~~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~-----~e~~~ 609 (726) +... .........+++++. .+...-.....+....+. +.+. +..... + ++..+. .+..+ T Consensus 418 ----g~~~--~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v-~aGi-~s~e~~---i---~~~~g~~deea~~el~ 483 (508) T protein:vir:15 418 ----GKPL--FTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVL-AIGA-LSKQTF---L---QRNYGMTDEQAAEELA 483 (508) T ss_pred ----cccc--cccccccCCcceEEEeCCCCCCCHHHHHHHHHHHH-hcCC-CCHHHH---H---HhcCCCChHHHHHHHH Confidence 0000 011111122333332 222111111111111111 1111 111100 0 111111 11111 Q ss_pred hHHHHHhhhhhhhhhHHHHHHHHHH Q lcl|NC_013692. 610 RIREFQPQPDPIAQQKAQLELMLLQ 634 (726) Q Consensus 610 ~l~~~~~~~~~~~qq~~q~e~q~~q 634 (726) ++++.+....+.....-...--.-+ T Consensus 484 ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 484 KIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHhccccCccccccccCCCCCCC Confidence 1111111110000000000000000 No 60 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.52 E-value=9.5e-13 Score=86.50 Aligned_cols=458 Identities=9% Similarity=0.050 Sum_probs=203.0 Q ss_pred CCCccchhhcCCCCCCcc-chhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--C-------C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDP-SKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--K-------T 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~-------~ 70 (726) |+|+.--. -++...+ .+..-+... .. ...|.....-|..++....+...||+|..+... + . T Consensus 1 ~~~~~~~~---~~~~~~e~~~~~~~~~~--~~----~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~ 71 (478) T protein:vir:10 1 MISINWPW---DKPYHEQVVEQIKPKYE--TQ----EEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDY 71 (478) T ss_pred CccccCCC---CchhHHHHHHHHhhccC--Cc----HHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccc Confidence 77763200 0000000 011111111 11 112233334566677778889999987653211 1 1 Q ss_pred CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 EKGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 ~~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) .+++ .+++.+..+..|+.....| ||.+ +.|.. +|.+..+ .|..++. |+....+...+++++.+ T Consensus 72 ~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~--~~~~~---~~d~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~ 136 (478) T protein:vir:10 72 DETKPDWRMYTNYHQNLVDQKVAYA----VANP--VTFGV---DNDKALK----QIQHTLN--HKWDDKLVDILTAASNK 136 (478) T ss_pred ccccccceeccchHHHHHHHHHhhh----ccCC--eeeec---CChHHHH----HHHHHHh--cCHHHHHHHHHHHHHhc Confidence 1233 3678888888888777655 4433 33432 3333332 4444443 56667778899999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+++.+|++.. T Consensus 137 G~~~~~~~~d~~-------------------------------------------------------------------- 148 (478) T protein:vir:10 137 GIEWVQPYVDEE-------------------------------------------------------------------- 148 (478) T ss_pred CeEEEEEEecCC-------------------------------------------------------------------- Confidence 999998876510 Q ss_pred eecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++..++|.++++ |++.. .+..+. .+.|-... .+. ... T Consensus 149 ---------g~~~~~~~~p~~~~~i~d~~~~---~~~~~~-v~~~~~~~----------~~~-----~~~---------- 190 (478) T protein:vir:10 149 ---------GEFKTFRVPAEQAVPIWTNKER---DELQAF-IRVYELDG----------AER-----VEY---------- 190 (478) T ss_pred ---------CeeEEEEEcccceEEEEcCCCC---CceEEE-EEEEEecC----------ceE-----EEE---------- Confidence 012334566666553 33321 222222 22221000 000 000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEE--EEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPI--VATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG 384 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~--~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~ 384 (726) | ..++|..|++- +....... ...-... .......|.+.+.+|++.+.. ..+|.|.+ T Consensus 191 -------y---~~~~i~~~~~~------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~ 248 (478) T protein:vir:10 191 -------W---TKDDVTYYELK------EGQLIPDFYRSDDHIQP-HYYQGNKLMSWGRVPFIPFKN-----NPQEVSDL 248 (478) T ss_pred -------E---eCCeEEEEEEc------CCeeecccccccccccc-ceecccccccCCccceEEecc-----CCCCCCcH Confidence 0 01122222110 00000000 0000001 111233355557788766643 45789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cc--hhhhhhcCCceEeecCccchhhhcccccCccchhHHHHH Q lcl|NC_013692. 385 ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DV--TNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYM 461 (726) Q Consensus 385 ~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~--~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~l 461 (726) ..++++++.+|..++.+.+.+...++|.+.+ .|.- +. ........++++.+..... +.+.+...+.-....... T Consensus 249 ~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~ 325 (478) T protein:vir:10 249 FMYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKYYKAISVAGESG--SGVDTIKVEVPIDSVKEY 325 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccchhhhhhhhcceEEecCCCC--CcceEEeecCChHHHHHH Confidence 9999999999999999999999988886654 4442 11 1122334556666643211 123333333334566778 Q ss_pred HHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 462 INLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 462 l~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +..+...+...|++++.+.+..++ +.||.++...............+.|..+++++++.++++. ... T Consensus 326 ~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~----g~~------- 392 (478) T protein:vir:10 326 TKMLRDYIIEFGQGVDFQQDKFGN--SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----RLD------- 392 (478) T ss_pred HHHHHHHHHHHhCccccCcccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC------- Confidence 888889999999999877654322 2466667766666666666666666677776666555543 210 Q ss_pred ccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 542 EHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 542 ~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) ++..++ .+..+.....-....++.+..+ +..++... .+.. +....+....++....+.... T Consensus 393 -----~~~~~i----~i~f~~~~p~d~~e~a~~~~kl----~g~iS~et---~~~~---l~~v~D~~~E~~ri~~E~~~~ 453 (478) T protein:vir:10 393 -----VKVQDI----EITFNFNVMVNELENSQIAMNS----TGLLSKET---ILSN---HAWVEDPVAEMERIEQENIEL 453 (478) T ss_pred -----cccccc----eEEecCCCCCCHHHHHHHHHHH----hCCCChHH---HHHh---CCCCCCHHHHHHHHHHHHHHH Confidence 010111 1111111111111111111111 11111111 1110 011111111111111110000 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLL-QAQIEAERARAA 645 (726) Q Consensus 622 ~qq~~q~e~q~~-qaq~e~~~aq~q 645 (726) .+.......... ..+.+..-.+.+ T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 454 NQQLPDIEEGLNGEQQRQSENNQPE 478 (478) T ss_pred HhhccccccccCCCCCCCCCCCCCC Confidence 000000000000 000000000000 No 61 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.52 E-value=2e-12 Score=84.70 Aligned_cols=451 Identities=9% Similarity=0.022 Sum_probs=201.4 Q ss_pred CCCccc--------hhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCC-CCCC Q lcl|NC_013692. 1 MADVDE--------DYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKI-TQINRWLDYMHVRGEG-KPKT 70 (726) Q Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~~~-~~~~ 70 (726) |+|+.- +.-.||.++ +.+...|.+.|. .|.... ...++..+||.|.-.. ..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~i~~~i~----~~~~~~~~~~~~l~~Yy~g~~~i~~~~~ 63 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGE-------------KLTSNELLGFIA----YNETVLKPRYRENMKLYLGKHKILTAPE 63 (470) T ss_pred CccccCCcccccCCceEEeCCCC-------------CcCHHHHHHHHH----HHHHhhHHHHHHHHHHhccccccccCcc Confidence 666531 111344332 122233444443 343333 4567788999875432 1122 Q ss_pred CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 EKGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 ~~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) .+++ -+++.+.....|+.....| ||.+ +.|.. ++|....+ .++.+| ..|+....+...+++++++ T Consensus 64 ~~~~~~~ki~~n~~~~Ivd~~~~~l----~g~p--~~~~~--~~d~~~~~----~l~~~~-~~n~~~~~~~~~~~~~~~~ 130 (470) T protein:vir:99 64 KETGADNRIVVNSAKYVVDVYNGYF----CGIE--PKLAL--LNDSSKID----EIARWN-RQENFFDTINEISKQCDIF 130 (470) T ss_pred cccCCcceeecchHHHHHHHHhhhh----ccCC--eeEee--CCchhHHH----HHHHHH-HhcCHhHHHHHHHHHHHhc Confidence 2333 3677778788887776555 4443 33432 23333222 344444 3667777888999999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+.+.+|++.. T Consensus 131 G~~~~~v~~d~d-------------------------------------------------------------------- 142 (470) T protein:vir:99 131 GRSIASIYQGED-------------------------------------------------------------------- 142 (470) T ss_pred CeeEEEEEeCCC-------------------------------------------------------------------- Confidence 999888776410 Q ss_pred eecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++..++|.+++ ||+.... ...+ +.+.|...++ T Consensus 143 ---------g~~~i~~~~p~~~~~i~d~~~~~---~~~~-~vr~~~~~~~------------------------------ 179 (470) T protein:vir:99 143 ---------ARPHLMYSSPNHAFIIYDDTVQR---QPLA-FVHYQIDNSN------------------------------ 179 (470) T ss_pred ---------CeEEEEEEccceeEEEEcCCCCc---ceEE-EEEEEEEecC------------------------------ Confidence 01234556777754 3332211 1111 1222211000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) ......++.|.. +.+..+... -.+......+..|.+.+.+|++.+.. ..+|.|.++. T Consensus 180 ------------~~~~~~~~~~~~-----~~~~~~~~~-~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~ 236 (470) T protein:vir:99 180 ------------NWTDAYGVIQYA-----DKFYKFKGY-DIEEDTNAAGYAINPYGLVPAVEFFE-----NEERQGIFDS 236 (470) T ss_pred ------------CeeEEEEEEEec-----CeEEEEEec-ccccccccccccccCCCccceEeecC-----CCCCCcchHh Confidence 000111111110 000000000 00000111122233446778766543 4478999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh----hhhhcCCceEeecCcc-chhhhcccccCccchhHHHHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN----RRRFDRGENYEFNPGA-DPRAAVHMHTFPEIPQSAQYM 461 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d----~~~~~~g~vi~~~~~~-~~~~~i~~~~~~~~~~~~~~l 461 (726) ++++++.+|..++.+.+.+...++|.+.+.-......+ ......+.++.+.... .....+.+...+.....+... T Consensus 237 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 316 (470) T protein:vir:99 237 IKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENL 316 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHH Confidence 99999999999999999999999988876443332222 1223344455443211 122234455544445566777 Q ss_pred HHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 462 INLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 462 l~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +..+...+-..||+++.+.+..++ +.||.++..............-+.|..+++++++.++.++.......- T Consensus 317 ~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~------ 388 (470) T protein:vir:99 317 IQHLTDFIFMMAMVPNIQDKNFAG--NSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQE------ 388 (470) T ss_pred HHHHHHHHHHHhCCcccccccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc------ Confidence 888889999999999877654322 236666666555566666666666667777766666655432211100 Q ss_pred ccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 542 EHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 542 ~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) ++. +..+..+.....-..+..+.+..+. .-++.... +. + +..+ +....++....+.... T Consensus 389 -~~~---------~i~v~f~~~~p~~~~e~a~~~~kl~----giis~et~---l~-~--l~~v-d~~~E~eri~~E~~~~ 447 (470) T protein:vir:99 389 -LWS---------ELDFKFTRNLPEDMASAIDNAKNAE----GIVSKKTQ---LG-M--IPDI-EPDAEMKQIAKEKADA 447 (470) T ss_pred -ccc---------cceEEeCCCCCcCHHHHHHHHHHHh----ccCCHHHH---HH-h--CCCC-CHHHHHHHHHHHHHHH Confidence 000 0111111111111111111111111 11111110 10 0 1111 0000011000000000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 622 ~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq 662 (726) .+ ...+.. . ........... +++ T Consensus 448 ------~~---~~~~~~--~-~~d~~~~d~~~------ee~ 470 (470) T protein:vir:99 448 ------IK---QTQQLS--M-PIDILKRDNNA------EEE 470 (470) T ss_pred ------HH---HHHhhc--C-CCCcCCCCCCc------cCC Confidence 00 000000 0 00000000000 000 No 62 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.51 E-value=3.8e-12 Score=83.17 Aligned_cols=426 Identities=13% Similarity=0.034 Sum_probs=192.2 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC-CCCC----CCCcCCCHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP-KTEK----GKSAVQPPTIRKQAEWRYSSLSEPFLSSP 101 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~-~~~~----grs~~v~~~v~~~v~~~~~~L~~~f~~~~ 101 (726) .+......|+..+. .+..+.....+-..||.|....+. +... ..-++|....+..|+.....|. T Consensus 1 ~~~~~~~~i~~l~~----~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~------- 69 (441) T protein:vir:80 1 MNSDELALIEGMYD----RIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD------- 69 (441) T ss_pred CCccHHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc------- Confidence 22233233333333 234444455667899987654210 1111 1235666777777775554441 Q ss_pred ceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHH Q lcl|NC_013692. 102 NIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEEL 181 (726) Q Consensus 102 ~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~ 181 (726) +.+.+-+|.+ .+..+| ..|+....+...+++++++|.|.+.+|-+ . T Consensus 70 ----~~g~~~~d~~-------~l~~i~-~~n~~~~~~~~~~~~~~~~G~a~~~v~~d--------~-------------- 115 (441) T protein:vir:80 70 ----WLGWTNGDGY-------GLDGVY-AANRLATASCDVHLDALIFGLSFVAIIPH--------G-------------- 115 (441) T ss_pred ----cccccCCChH-------HHHHHH-HhcCHHHHHHHHHHHHhhcCeeEEEEEeC--------C-------------- Confidence 1111112211 244444 35777778889999999999998865311 0 Q ss_pred HHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee--eCCCCCCc Q lcl|NC_013692. 182 AQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV--IDPSCGSD 259 (726) Q Consensus 182 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d 259 (726) .+.|.+..++|.+++ |||.... T Consensus 116 -------------------------------------------------------~g~~~i~~~~p~~~~~i~d~~~~~- 139 (441) T protein:vir:80 116 -------------------------------------------------------DGTVSVRPQSPKNCTGKFSADGSR- 139 (441) T ss_pred -------------------------------------------------------CCceEEEEEccceEEEEEeCCCCc- Confidence 011234567777754 5653311 Q ss_pred hhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceE Q lcl|NC_013692. 260 FSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVL 339 (726) Q Consensus 260 ~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~ 339 (726) -..++ ++.. .+.+ .+...+.|.. +.+ T Consensus 140 --~~~~~-~~~~------------~~~~---------------------------------~~~~~~vy~~-----~~~- 165 (441) T protein:vir:80 140 --LDAGL-VVQQ------------TCDP---------------------------------EVVEAELLLP-----DVI- 165 (441) T ss_pred --eeEEE-EEEE------------EecC---------------------------------ceEEEEEEec-----CeE- Confidence 11111 1111 0000 0001111110 111 Q ss_pred EEEEEEEE-CCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeec Q lcl|NC_013692. 340 HPIVATWV-GAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMK 417 (726) Q Consensus 340 ~~~~~~~~-g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ 417 (726) +..... ++.....+..|.+.|++|++++...+..++++|.|.+ +.++++++.+|..++.+.+.+...+.|...+ . T Consensus 166 --~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i-~ 242 (441) T protein:vir:80 166 --VQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWV-T 242 (441) T ss_pred --EEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeee-e Confidence 001111 1122233445555688999999988888999999865 5699999999999999999999999987765 3 Q ss_pred cc-ccc--hhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHH---HhchHHHhhccCcccchhhH Q lcl|NC_013692. 418 GA-LDV--TNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAES---MTGVKAFNAGISGAALGDTA 491 (726) Q Consensus 418 ga-v~~--~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~---~tGv~~~~~G~~~~~~~~ta 491 (726) |+ .+. .+.....+++++.+..+..... +.+...+. +.....+..+...+.. .|+++....|..++. ..|| T Consensus 243 G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~-~~Sg 318 (441) T protein:vir:80 243 GVSADEFSQPGWVLSMASVWAVDKDDDGDT-PNVGSFPV--NSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSN-PPSG 318 (441) T ss_pred cCCccccccchhhhcccccccCCCCCCCCc-ceeEecCc--cchHHHHHHHHHHHHHHhcccCCCHHHhccCCCc-chHH Confidence 53 222 2234456788877665443322 22223332 2344455555555554 577777777755432 2366 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHH Q lcl|NC_013692. 492 TAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAK 571 (726) Q Consensus 492 ~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~ 571 (726) .|+...............+.|..+++++++.++.+.-......- .+.. ..++-+........+. T Consensus 319 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~-------~~~~---------i~~~f~~~~~~~~~e~ 382 (441) T protein:vir:80 319 EALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEAD-------FFGD---------VGLRWRDASTPTRAAT 382 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-------ccee---------eeEEeCCCCCcCHHHH Confidence 66766666655666666666667777766655544221111000 0000 0111111111111112 Q ss_pred HHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 572 VNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGA 651 (726) Q Consensus 572 ~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~ 651 (726) .+....+.+. +. ..... ..+ ....+..+ ..++..... ++ +.+...+++. T Consensus 383 ad~~~kl~~~-g~--~~~s~-~~~---~~~l~~~~--~e~~~~~~e-------~~--e~~~~~~~~~------------- 431 (441) T protein:vir:80 383 ADAVTKLVGA-GI--LPADS-RTV---LEMLGLDD--VQVEAVMRH-------RA--ESSDPLAVLA------------- 431 (441) T ss_pred HHHHHHHHhc-Cc--ccccH-HHH---HHhCCCCH--HHHHHHHHH-------HH--HHHHHHHHHh------------- Confidence 2222222221 11 10000 111 11111100 000000000 00 0000000000 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_013692. 652 GLQDSKVGTEQAKARAL 668 (726) Q Consensus 652 ~~~~~~~~~eqaq~~q~ 668 (726) ........| . T Consensus 432 --~~~~~~~~~-----~ 441 (441) T protein:vir:80 432 --GAISRQTNE-----V 441 (441) T ss_pred --hhhhccccc-----C Confidence 000000000 0 No 63 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.50 E-value=1.3e-12 Score=85.71 Aligned_cols=460 Identities=11% Similarity=0.093 Sum_probs=204.4 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---CCC---CCCCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG---KPK---TEKGK 74 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~~~---~~~gr 74 (726) |--++-=|-.+...+--.++ +..-.+... |+.-|.. +....+...+++.+||.|.-.. ..+ ..+++ T Consensus 6 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~---i~~~i~~---~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~ 77 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSD--LAELLKEEN---LRNFISR---HQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDK 77 (481) T ss_pred eehhchhcccccCceeeeec--chhhcCHHH---HHHHHHH---HHHHHHHHHHHHHHHhcCCCcccccCcccccccccc Confidence 44444444433333332221 112222222 2222221 2344556678899999875321 111 11233 Q ss_pred C--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 75 S--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 75 s--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) + +++.+.....|+.....| || .+ +.|.+ +|... .++|+.+|. .|+.-..+..++++++++|.+. T Consensus 78 ~~~ki~~n~~~~ivd~~~~~l----~g-~~-~~~~~---~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~ 143 (481) T protein:vir:10 78 ADHRAVHNYAKYVSRFIVGYL----TG-NP-ITITH---QDNQT----NDKIIELND-LNDADEVNSDLALNLSIYGRAY 143 (481) T ss_pred ccceeecchHHHHHHHHHhhh----cc-CC-ceEec---CChhH----HHHHHHHHH-hcChhHHHHHHHHHHHhcCeEE Confidence 2 577778788887776544 33 33 24444 23322 235555553 4665567788999999999999 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +.+|++.. T Consensus 144 ~~~~~d~d------------------------------------------------------------------------ 151 (481) T protein:vir:10 144 EIVYRDFE------------------------------------------------------------------------ 151 (481) T ss_pred EEEEeCCC------------------------------------------------------------------------ Confidence 98765410 Q ss_pred cceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccc Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVR 310 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 310 (726) +.|++..++|.+++ ||+... .....+.+.|...++ T Consensus 152 -----g~~~i~~~~p~~~~~v~d~~~~----~~~~~~i~~~~~~~~---------------------------------- 188 (481) T protein:vir:10 152 -----DRDTFKVLDPKSTFVVYDQTLD----KKVVAGVRYFEKQDK---------------------------------- 188 (481) T ss_pred -----CeEEEEEEcccceEEEEcCCCC----CceEEEEEEEEEeeC---------------------------------- Confidence 01234556777765 333221 111222222210000 Q ss_pred ccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 311 NFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 311 ~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) ....+..+|+|.. +.+ ++....|+..-..++.|.+.+.+|++.++. ..+|.|.++.++++ T Consensus 189 -------~~~~~~~~~~y~~-----~~i---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~l 248 (481) T protein:vir:10 189 -------DKVPVQHVEVYTT-----DKI---YYIEIKGGTYHRVEEVEHYYNDVPIIEYLN-----DQFKQGDFENVIAL 248 (481) T ss_pred -------CCceEEEEEEEec-----CeE---EEEEecCCceeecccccccCCceeEEEeec-----CCCCCCchhhHHHH Confidence 0112344455532 111 111222332222233344446778766543 44688999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchhhhhhcCCceEeecCcc-----chhhhcccccCccchhHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFDRGENYEFNPGA-----DPRAAVHMHTFPEIPQSAQYMINL 464 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d~~~~~~g~vi~~~~~~-----~~~~~i~~~~~~~~~~~~~~ll~~ 464 (726) ++.+|..++.+.+.+...++|.+.+.... .+..+...+..++++.+..+. .....+.+...+.....+...+.. T Consensus 249 ida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 328 (481) T protein:vir:10 249 IDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKR 328 (481) T ss_pred HHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHH Confidence 99999999999999999998887764322 222333334444444332211 111223444444334567777888 Q ss_pred HHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccc Q lcl|NC_013692. 465 QQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHF 544 (726) Q Consensus 465 ~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~ 544 (726) +...+...|+++....|..++ +.||.|+...............+.|..+++++++.++.++....... .++ T Consensus 329 l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-------~~~ 399 (481) T protein:vir:10 329 LQNDIHKYTNTPDLNDEQFSG--VQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQ-------HNY 399 (481) T ss_pred HHHHHHHHhCCcccccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc-------ccc Confidence 888899999999887764332 23555565555555555555556666677766666665543211100 000 Q ss_pred eecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhh Q lcl|NC_013692. 545 VDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQ 624 (726) Q Consensus 545 v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq 624 (726) .+ +.+........-.....+.+..+. + -++.... +. .+..+.+....++....+.....+. T Consensus 400 ~~---------i~v~f~~~~~~~~~~~a~~~~kl~---g-~is~et~---~~---~l~~i~d~~~E~~ri~~E~~~~~~~ 460 (481) T protein:vir:10 400 AE---------LTITFTPNLPKSMMESINAFNALS---G-GVSESTR---LS---LLDFIDNPKEELEKMQEEEAQREKQ 460 (481) T ss_pred ce---------eeEEeCCCCCcCHHHHHHHHHHHh---c-cCChHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHhh Confidence 00 011111111111111111111111 1 1111110 10 0111111111111100000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGA 651 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~ 651 (726) .+......... ....--..+. T Consensus 461 ---~~~~~~~~~~~---~~~~~dd~~g 481 (481) T protein:vir:10 461 ---ADKRGYGEAFE---NHLNVDDSNG 481 (481) T ss_pred ---hhhccCCccCC---CCCCCCCCCC Confidence 00000000000 0000000000 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.50 E-value=1.7e-12 Score=85.14 Aligned_cols=460 Identities=13% Similarity=0.075 Sum_probs=200.8 Q ss_pred CCCcc-chhcCCCCCCch-HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCC-CCCCCCCCCc---CC-CHHHHHH Q lcl|NC_013692. 14 EDGDP-SKRLQPEWSNAP-SLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEG-KPKTEKGKSA---VQ-PPTIRKQ 85 (726) Q Consensus 14 ~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~grs~---~v-~~~v~~~ 85 (726) +.... .|.....|.+.. ....|+.-.+|-+ +....++..+..|..||.|.... +...+.|+.+ .. .+.-... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 11111 111122222111 1112222222222 12234445578899999875531 1222333322 11 1222222 Q ss_pred HHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEE Q lcl|NC_013692. 86 AEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVK 165 (726) Q Consensus 86 v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~ 165 (726) .-.+.+.+|+-.+-|.+ +|. ..+++||.++ ..|+....+..++..|+..|.+++|+||+. + T Consensus 81 ----~~~~A~ll~~e~~~i~~-----~d~----~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~--~--- 141 (505) T protein:vir:79 81 ----SAKLASLIFNEQCQVTV-----SDE----TANDFLDDVF-QQNDFYTTFEEKLEEWIALGSGCVRPYVDS--G--- 141 (505) T ss_pred ----HHHHHhhhcCCCceeec-----CCh----HHHHHHHHHH-HhccHHHHHHHHHHHHhhcCCeEEEEEEeC--C--- Confidence 22233333443333333 333 4455788776 466777889999999999999999999971 0 Q ss_pred ecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeee Q lcl|NC_013692. 166 EQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVC 245 (726) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v 245 (726) .+.|++| T Consensus 142 -------------------------------------------------------------------------~~~i~~v 148 (505) T protein:vir:79 142 -------------------------------------------------------------------------KIKLAWA 148 (505) T ss_pred -------------------------------------------------------------------------ceEEEEE Confidence 0134556 Q ss_pred chhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEE Q lcl|NC_013692. 246 DYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVH 325 (726) Q Consensus 246 ~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 325 (726) +|..|++=.....++..|-|+.+.+.....+ ..| ++.+ T Consensus 149 ~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~---~~~---------------------------------------yt~l 186 (505) T protein:vir:79 149 TADQVYPLQADTNQVNELAIASRTTEVENHR---TIY---------------------------------------YTLL 186 (505) T ss_pred cCCeeEEEEEcCCCeEEEEEEEEEEEecCCc---ceE---------------------------------------EEEE Confidence 6666653111112344555443221111000 001 2222 Q ss_pred EEEEEeecCCCceEEEEEEEEEC------CEEEEeccCC----------C-CCCccceEEeee----eeecCcccCCChH Q lcl|NC_013692. 326 EYWGYYDIHGDGVLHPIVATWVG------AVMIRMEENP----------F-PDKRIPYVVVNY----IPRKRDLYGESDG 384 (726) Q Consensus 326 E~w~~~~~~~~g~~~~~~~~~~g------~~~l~~~~~P----------~-~~~~~Pf~~~~~----~~~~~~~~g~g~~ 384 (726) |+|... ++.|... ...|.+ +..+.....| + ...+.||+.+++ ....++.+|.|++ T Consensus 187 E~h~~~--~~~~~I~--n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~ 262 (505) T protein:vir:79 187 EFHQWD--HGDYVIT--NELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLI 262 (505) T ss_pred EEEEec--CceEEEE--EEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchh Confidence 222210 0001000 000000 0000000000 1 112334555432 2344678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh-------h---hhcCCce-EeecCccchhhhcccccCcc Q lcl|NC_013692. 385 ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR-------R---RFDRGEN-YEFNPGADPRAAVHMHTFPE 453 (726) Q Consensus 385 ~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~-------~---~~~~g~v-i~~~~~~~~~~~i~~~~~~~ 453 (726) ..+++..+.+|..++++.+.+.. +..++.++...+..... . .+..+.. +..-.+......+...++.. T Consensus 263 ~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~i 341 (505) T protein:vir:79 263 DNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPI 341 (505) T ss_pred hhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccC Confidence 99999999999999999998864 66677887776522110 0 1222211 11111112233455555443 Q ss_pred chhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_013692. 454 IPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDD 533 (726) Q Consensus 454 ~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~ 533 (726) ....+...++.+...+....|++.-..|..+.. ..||+++....+..-.....+...+..+++++.+.++.+..-|.-. T Consensus 342 r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~ 420 (505) T protein:vir:79 342 RVADYQATMDFFLREFENQTGLSQGTFTTSPSG-IQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFY 420 (505) T ss_pred CHHHHHHHHHHHHHHHHHHhCCChhhcCCCccc-cchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 334456667777777888888887777765543 3578888776665666666777778888888888888776655321 Q ss_pred Ce-EEEEecccceecchhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhh--hh Q lcl|NC_013692. 534 VE-VVRITNEHFVDIRRDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPD--FA 608 (726) Q Consensus 534 e~-~iRi~~~~~v~v~~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e--~~ 608 (726) .- ..+-.+ . ...+++++. .+...-.....+....+.. .+ -+.... . ++...+..+ .. T Consensus 421 ~~g~~~~~~----~------~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~-~G-i~s~e~---~---l~~~~~~~eeea~ 482 (505) T protein:vir:79 421 ADGQARWTG----D------VDSLDITINFNDGVFVDQESKRAADLQAVQ-AQ-VMPKKQ---F---LMRNYGLDEEEAD 482 (505) T ss_pred ccccccccC----C------CCceeEEEEeCCCCCCCHHHHHHHHHHHHH-cC-CCCHHH---H---HHhcCCCChHHHH Confidence 11 000000 0 112223322 2222111111111111111 11 111110 0 111111111 11 Q ss_pred hhHHHHHhhhhhhhhhHHHHHHHHHHHHHH Q lcl|NC_013692. 609 KRIREFQPQPDPIAQQKAQLELMLLQAQIE 638 (726) Q Consensus 609 ~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e 638 (726) +.+...+.+.... ..+.-..-. + T Consensus 483 ~el~ri~~E~~~~---~p~~~~~gg----~ 505 (505) T protein:vir:79 483 EWLAQIDAENSTA---EPEFNQFGG----D 505 (505) T ss_pred HHHHHHHHhcccc---CCCchhccC----C Confidence 1111111000000 000000000 0 No 65 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.50 E-value=1.2e-12 Score=86.03 Aligned_cols=466 Identities=13% Similarity=0.079 Sum_probs=202.5 Q ss_pred CCCCCCccchhcCCCCCCc-hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCC-CCC--CCCC----CCcCCCHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNA-PSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEG-KPK--TEKG----KSAVQPPT 81 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~-~~~--~~~g----rs~~v~~~ 81 (726) |- +.-++.+.+|-+. -.++.|+.-+++.. ..+.++++.+.+|..||.|.... ..+ ...| +-+++.+. T Consensus 1 m~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~ 76 (496) T protein:vir:38 1 MI----NQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNL 76 (496) T ss_pred Ch----hHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecch Confidence 11 2233345555333 23455565565543 44667777788999999875421 001 1112 22333344 Q ss_pred HHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeee Q lcl|NC_013692. 82 IRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQS 161 (726) Q Consensus 82 v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~ 161 (726) ....++.. ..-+|+-..-|.+ +|.. .+++|+.++. .++....+..++.+|+..|.+.+++||+.. T Consensus 77 ~k~i~~~~----a~~l~~~p~~i~~-----~d~~----~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~- 141 (496) T protein:vir:38 77 PKVTAKYM----SKLLFNEKVKINI-----DDKA----AEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGN- 141 (496) T ss_pred HHHHHHHH----hhhhhCCcceEee-----CChH----HHHHHHHHHh-ccCHHHHHHHHHHHHhhhCcEEEEEEEcCC- Confidence 33333332 2333444333332 4443 3447777663 677788899999999999999999999711 Q ss_pred eeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccce Q lcl|NC_013692. 162 RTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPT 241 (726) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~ 241 (726) +.|. T Consensus 142 ----------------------------------------------------------------------------~~~~ 145 (496) T protein:vir:38 142 ----------------------------------------------------------------------------KNVK 145 (496) T ss_pred ----------------------------------------------------------------------------CcEE Confidence 0123 Q ss_pred eeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 242 VQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 242 i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) +++|+|.+||+=..-..++..+-|+.+ + +.+. -+|-.++...+.+. .+..........+ .+.-... T Consensus 146 i~~v~~~~~~P~~~~~~~~~~~~f~~~--~-~~~~----~~y~~le~h~~~~~------~~~I~~~~y~~~~-~~~~g~~ 211 (496) T protein:vir:38 146 VSFATADCMYPLSNDSENVDECVIANS--F-HKNN----KYYTLLEWNEWQGD------VYTVTTELYQSDD-PNELGTK 211 (496) T ss_pred EEEEcccceEEEEecCCcEEEEEEEEE--E-EeCC----eEEEEEEEEEEeCc------eEEEEEEEEecCC-ccccCcc Confidence 566777776631111112333333211 1 1100 00000000000000 0000000000000 0000011 Q ss_pred EEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeee----eeecCcccCCChHHHHHHHHHHHHHH Q lcl|NC_013692. 322 LVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNY----IPRKRDLYGESDGALLIDNQRIIGAV 397 (726) Q Consensus 322 v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~ 397 (726) |.+.+.|.-+ ..........++||+.+.+ .....+.+|.|+++.++++++.+|.. T Consensus 212 v~~~~~~~~~---------------------~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~ 270 (496) T protein:vir:38 212 VSLTLLFDDI---------------------EPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLM 270 (496) T ss_pred cccccccccc---------------------ccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHH Confidence 1111111100 0000001113456665543 22456788999999999999999999 Q ss_pred HHHHHHHHHhcCCCceEeecccccchh----h--hhhc-CCceEeecCccch--hhhcccccCccchhHHHHHHHHHHHH Q lcl|NC_013692. 398 TRGMIDTMARSANGQVGVMKGALDVTN----R--RRFD-RGENYEFNPGADP--RAAVHMHTFPEIPQSAQYMINLQQAE 468 (726) Q Consensus 398 ~~~~~d~l~~~~~~~~~~~~gav~~~d----~--~~~~-~g~vi~~~~~~~~--~~~i~~~~~~~~~~~~~~ll~~~~~~ 468 (726) ++.+.+.+.. +.+++.++...+.... . ..+. +..++..-.+... ...+....+......+...++.+... T Consensus 271 ~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~ 349 (496) T protein:vir:38 271 FDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRI 349 (496) T ss_pred HHHHHHHHhh-cccceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHH Confidence 9999998875 6777888766653211 1 1111 1122211111111 12344333322234455666777777 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecc Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIR 548 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~ 548 (726) +....|++....|.++.. ..||+++.......-.....+.+.|..+++++++.++.+...+..-. +. .+. T Consensus 350 i~~~~g~~~~~f~~~~~g-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~------g~---~~~ 419 (496) T protein:vir:38 350 YAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYS------GE---VVE 419 (496) T ss_pred HHHhhCCChhhcCCCccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------CC---CCC Confidence 778888888887765432 24677776555444455556677777888888888887665332100 00 000 Q ss_pred hhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh--hhhhhHHHHHhhhhhh-h--- Q lcl|NC_013692. 549 RDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP--DFAKRIREFQPQPDPI-A--- 622 (726) Q Consensus 549 ~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~l~~~~~~~~~~-~--- 622 (726) .....-.| +.+...-.....+....+. ..+ -+.... .+ ....... +..+.+...+.+.... + T Consensus 420 ~~~i~v~f----~d~i~~d~~~~~~~~~~~~-~~G-iiS~et---~l---~~~~~~~d~ea~~el~ri~~E~~~~~~~~d 487 (496) T protein:vir:38 420 LDTITVDF----DDSIAQDEDTTINRYTNAK-NQG-MIPLKI---AL---QRAWNITEAEADEWAEMLAKEKQAEMPNND 487 (496) T ss_pred ccceEEEe----CCCCCCCHHHHHHHHHHHH-hcC-CCCHHH---HH---HhcCCCChHHHHHHHHHHHHhhhccCcccc Confidence 00111111 1111111111111111111 111 111110 00 0111110 0111111111000000 0 Q ss_pred --hhHHHHH Q lcl|NC_013692. 623 --QQKAQLE 629 (726) Q Consensus 623 --qq~~q~e 629 (726) ...-+.| T Consensus 488 ~~~~~~~~e 496 (496) T protein:vir:38 488 MNGIFGEEE 496 (496) T ss_pred ccCCCCCCC Confidence 0000000 No 66 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.49 E-value=4.7e-13 Score=88.17 Aligned_cols=463 Identities=13% Similarity=0.114 Sum_probs=199.8 Q ss_pred CCC-----ccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCC-CC----- Q lcl|NC_013692. 1 MAD-----VDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEG-KP----- 68 (726) Q Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~-~~----- 68 (726) |-| |-+-+++|. ....|+.-+++.. ..++++...+.+|..||.|.... .. T Consensus 1 m~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~ 62 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMG------------------LLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEH 62 (499) T ss_pred ChhHHHHHHHHHHHHhc------------------cccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhcccccc Confidence 222 222222321 1122333333332 34566667788899999865320 00 Q ss_pred -CCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 69 -KTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 69 -~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) +....+.+++.+.....++... +.+|+-..-|.+ +|. ..+++|+.++. .|+....+...+..|+. T Consensus 63 ~~~~~~~~~~s~n~~~~iv~~~a----~~l~~ep~~i~~-----~d~----~~~e~l~~~~~-~n~f~~~~~~~~~~a~~ 128 (499) T protein:vir:80 63 NGNPVNRRQLSMNLPKVTAKYMS----KLLFNEKVKINI-----DDE----TAEEFVLNVLK-TNGFTKNMERYIEYGEA 128 (499) T ss_pred CCCccccceeecchHHHHHHHHH----HhhhCCcceEee-----CCH----HHHHHHHHHHh-hccHHHHHHHHHHHHhh Confidence 1111233444555444444433 333444333333 343 45557777763 56677789999999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) .|.+++++||+.. T Consensus 129 ~G~~~~~~~~D~~------------------------------------------------------------------- 141 (499) T protein:vir:80 129 MGGFVIKVYHDGN------------------------------------------------------------------- 141 (499) T ss_pred cCcEEEEEEECCC------------------------------------------------------------------- Confidence 9999999999721 Q ss_pred eeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) +.|.|.+|+|..||+=.....++..|-|+-.. ++++ -+|.-++.--+ T Consensus 142 ----------~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~---~~~~----~~y~~lE~h~~---------------- 188 (499) T protein:vir:80 142 ----------KNVKVSFATADCMYPLSNDSENVDECLIANSF---HKNN----KYYKLLEWNEW---------------- 188 (499) T ss_pred ----------CcEEEEEEcCCceEEEEecCCCeEEEEEEEEE---eecC----eEEEEEEEEEe---------------- Confidence 01235667777766411111234445443221 1100 00000000000 Q ss_pred cccccccCCcCCceEEE-EEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCC-CCccceEEeeee----eecCcccCC Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVV-HEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFP-DKRIPYVVVNYI----PRKRDLYGE 381 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v-~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~-~~~~Pf~~~~~~----~~~~~~~g~ 381 (726) .+......+| .+.|...+...-|.......++.+ +. ...++. .++.||+.+.+. ...++.+|. T Consensus 189 -------~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~---~~-~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~ 257 (499) T protein:vir:80 189 -------KGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFND---IE-PVVPLPSLTRPTFIYIKPNIANNKNLTSPLGI 257 (499) T ss_pred -------cccceeeEEEEEEEEeccCccccCcccchhhhccC---cC-CceeecCCCccceEeecCCccccccCCCccCC Confidence 0000000000 011110000000000000000000 00 000111 245566665442 245778899 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh------hhhhc-CCceEeecCccch--hhhcccccCc Q lcl|NC_013692. 382 SDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN------RRRFD-RGENYEFNPGADP--RAAVHMHTFP 452 (726) Q Consensus 382 g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d------~~~~~-~g~vi~~~~~~~~--~~~i~~~~~~ 452 (726) |+++.++++.+.+|..++++.+.+.. +..++.++.+.+.... ...+. ...++....+... ...|...++. T Consensus 258 S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 336 (499) T protein:vir:80 258 SVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVE 336 (499) T ss_pred chHhhHHHHHHHHHHHHHHHHHHHHh-cccceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecCc Confidence 99999999999999999999998865 5777778777663211 11111 1222222222111 1235444433 Q ss_pred cchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 453 EIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLD 532 (726) Q Consensus 453 ~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d 532 (726) .....+...++.+...+....|++....|.+++. ..||+++.......-.....+...|..++.++.+.++.+..-+.- T Consensus 337 ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~ 415 (499) T protein:vir:80 337 IRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKA 415 (499) T ss_pred CChHHHHHHHHHHHHHHHHhcCCChhhcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 3334455667777777788888887777754432 347777766555555556667777777888888888776554321 Q ss_pred cCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh--hhhhh Q lcl|NC_013692. 533 DVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP--DFAKR 610 (726) Q Consensus 533 ~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~ 610 (726) -. +.. +.+....-.| +.+...-.....+....+. +.+ -+.... .+ ....+.. +..+. T Consensus 416 ~~------~~~---~~~~~v~v~f----~d~i~~d~~~~~~~~~~~~-~~G-i~S~et---~l---~~~~~~~d~ea~~e 474 (499) T protein:vir:80 416 YD------GDT---VELDTITVDF----DDSIAQDEDTTINRYTTAK-NQG-MIPLKI---AL---QRAWNITEAEADEW 474 (499) T ss_pred cc------CCC---CCccceEEEe----CCCCCCCHHHHHHHHHHHH-HcC-CCCHHH---HH---hhcCCCChHHHHHH Confidence 10 000 0000111111 1111111111111111111 001 011110 00 0010000 00011 Q ss_pred HHHHHhhhhh-hhhhH-----HHHH Q lcl|NC_013692. 611 IREFQPQPDP-IAQQK-----AQLE 629 (726) Q Consensus 611 l~~~~~~~~~-~~qq~-----~q~e 629 (726) +.+.+.++.. ..... -+.| T Consensus 475 l~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 475 AEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHHhhcCCCCCCccccCCCCC Confidence 1111111000 00000 0001 No 67 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.49 E-value=9.3e-13 Score=86.55 Aligned_cols=408 Identities=13% Similarity=0.085 Sum_probs=186.1 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSS 100 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~ 100 (726) -+...|..|.+.+.+ +.....+-.+||.|....+ |+..+..-+.|.+-.+..|+.+...| T Consensus 1 m~~~~i~~L~~~~~~-------~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl------- 66 (422) T protein:vir:97 1 MNYMGMGYLRRKLAL-------FKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRI------- 66 (422) T ss_pred CChHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcc------- Confidence 556666677666554 2334566789998755321 11111111233333344444333222 Q ss_pred CceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHH Q lcl|NC_013692. 101 PNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEE 180 (726) Q Consensus 101 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 180 (726) .|...+-.|.+ +..+| ..|+.-.....++++||++|.+++.++.+...+ T Consensus 67 ----~~~Gf~~~d~~--------l~~~w-~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~------------------ 115 (422) T protein:vir:97 67 ----IFREFTNDDFN--------AWEIF-KANNPDIFFDTAIQSALIASCCFVYIMPGAEDG------------------ 115 (422) T ss_pred ----ccceeeCCchh--------HHHHH-HhcChHHHHHHHHHHHHHhcceeEEEeeCCCCC------------------ Confidence 22223334432 34455 356655566678899999999999886541100 Q ss_pred HHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeCCCCCC Q lcl|NC_013692. 181 LAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VIDPSCGS 258 (726) Q Consensus 181 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~dp~a~~ 258 (726) .|.|..++|.++ +|||.... T Consensus 116 ----------------------------------------------------------~p~i~~~sp~~~~~i~D~~~~~ 137 (422) T protein:vir:97 116 ----------------------------------------------------------LPKMQVIEASKATGILDPTTFL 137 (422) T ss_pred ----------------------------------------------------------eeEEEEechhhEEEEEeCCCCc Confidence 122334455553 45663211 Q ss_pred chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCce Q lcl|NC_013692. 259 DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGV 338 (726) Q Consensus 259 d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~ 338 (726) + ....+++ ..++ .| .++...+|. ++. T Consensus 138 -~----~~a~~~~-~~~~---~~---------------------------------------~~~~~~~~~------~~~ 163 (422) T protein:vir:97 138 -L----TEGYAIL-ESDS---NG---------------------------------------NPTLEAYFT------DKD 163 (422) T ss_pred -c----eeeEEEE-EecC---CC---------------------------------------cEEEEEEEc------Cce Confidence 1 1111111 0000 00 000011110 000 Q ss_pred EEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeec Q lcl|NC_013692. 339 LHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMK 417 (726) Q Consensus 339 ~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ 417 (726) +.++.++......++|+ +++|++++...+...+.+|.|-+ +.++++|+.+|+.++.+.......+.|+..+ . T Consensus 164 ----~~~~~~~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~ 236 (422) T protein:vir:97 164 ----IWYYPKKGKPYNIKNPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV-L 236 (422) T ss_pred ----EEEEcCCCccccccCCC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-c Confidence 00000111111224554 67999999999999999999866 8899999999999999999999999988765 3 Q ss_pred ccc---cchhhhhhcCCceEeecCccchh-hhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHH Q lcl|NC_013692. 418 GAL---DVTNRRRFDRGENYEFNPGADPR-AAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATA 493 (726) Q Consensus 418 gav---~~~d~~~~~~g~vi~~~~~~~~~-~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~ 493 (726) |+- ++.+......+.++.+..+.+.. ..+..++..++ ..+...+..+...+-.+||+|....|...+ ...||.+ T Consensus 237 G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~A 314 (422) T protein:vir:97 237 GMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASM-APFMEHLKMYASLFAGGSGLTLDDLGFPSD-NPSSVES 314 (422) T ss_pred ccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCCCh-hHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHH Confidence 331 11223334456777665433211 12222222222 223334444444555668899888885442 1246777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHH-H Q lcl|NC_013692. 494 VRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAK-V 572 (726) Q Consensus 494 i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~-~ 572 (726) +......-..+.....+.|..+++.++++++.+.-..-..+ .++..+. -.|...++. . .....+ . T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~-------~~~~~~~-~~w~p~~~~----~--~~s~a~~a 380 (422) T protein:vir:97 315 IKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLR-------NQFMDTV-IKWEPLFEA----D--ANMLTLVG 380 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc-------hhhccce-EEEccCCCC----C--hHHHHHHH Confidence 77655555555566666677777777777665432211100 0010000 001100000 0 111111 1 Q ss_pred HHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHH Q lcl|NC_013692. 573 NDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQA 635 (726) Q Consensus 573 ~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qa 635 (726) .....+.+. .+........ .+..+..+....+..... .++.. T Consensus 381 Da~~Kl~~a----~~~~~~~~~~---~~~lg~~~~~~~~~~~~~--------------~~~d~ 422 (422) T protein:vir:97 381 DGAIKLNQA----IPGFMDADVI---RDLTGVKGADKPIPAITE--------------VTTDG 422 (422) T ss_pred HHHHHHHhh----ccccccHHHH---HHHcCCCchhHHHHHHHh--------------hhccC Confidence 112222221 1111111111 111122111111111100 00000 No 68 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.49 E-value=3.4e-12 Score=83.46 Aligned_cols=440 Identities=10% Similarity=0.051 Sum_probs=202.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC---C-C---------CCCCC--CcCCCHHHHHHHHHHHHHHH Q lcl|NC_013692. 30 PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK---P-K---------TEKGK--SAVQPPTIRKQAEWRYSSLS 94 (726) Q Consensus 30 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~---~-~---------~~~gr--s~~v~~~v~~~v~~~~~~L~ 94 (726) =.+..|+..|+.-...+...+....+-.+||.|.-+.. . + ...++ .+++.+.....|+.....| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl- 79 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYV- 79 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhe- Confidence 44556666666666677777777788899998753210 0 0 01111 2566666666666666443 Q ss_pred HhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccC Q lcl|NC_013692. 95 EPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMM 174 (726) Q Consensus 95 ~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~ 174 (726) ||.+ +.|. .+|....+...++++. +....+....++++.+|.+...+||+..- T Consensus 80 ---~G~p--~~~~---~~d~~~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d~~~------------- 132 (470) T protein:vir:10 80 ---ASVF--PDID---VGKDADNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWIDEDG------------- 132 (470) T ss_pred ---eccc--eeee---cCchHHHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEecCCC------------- Confidence 4544 3443 3444444544444332 33456667889999999999988875110 Q ss_pred CcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCC Q lcl|NC_013692. 175 PDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDP 254 (726) Q Consensus 175 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp 254 (726) .+++..++|.++++=. T Consensus 133 ----------------------------------------------------------------~~~~~~~~p~~~~~v~ 148 (470) T protein:vir:10 133 ----------------------------------------------------------------NFRYGIIQPDQITPIY 148 (470) T ss_pred ----------------------------------------------------------------ceEEEEEcccceEEEE Confidence 1233456666655322 Q ss_pred CCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEE---- Q lcl|NC_013692. 255 SCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGY---- 330 (726) Q Consensus 255 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~---- 330 (726) +. +......+ +.+.|.+.+. +. ...+..+|+|.. T Consensus 149 d~-~~~~~~~a-~ir~y~~~~~------~~----------------------------------~~~~~~~e~yt~~~~~ 186 (470) T protein:vir:10 149 AT-TLDNKLLG-ILRSYKQLDP------DS----------------------------------GKYFTVHEYWTDKEAQ 186 (470) T ss_pred cC-CCCCceEE-EEEEEEeeec------CC----------------------------------ceEEEEEEEEcCCcEE Confidence 11 11112222 2222211111 00 001222222210 Q ss_pred ---eecCCCceEEEEEEEE-----ECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 331 ---YDIHGDGVLHPIVATW-----VGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMI 402 (726) Q Consensus 331 ---~~~~~~g~~~~~~~~~-----~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~ 402 (726) ....+......+.... .+...-..+..|..++.+|++.++. ...|.|.++.++++++.+|.++|.+. T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~ 261 (470) T protein:vir:10 187 FFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFI 261 (470) T ss_pred EEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHH Confidence 0000000101000000 0000111122233345666665554 34689999999999999999999999 Q ss_pred HHHHhcCCCceEeecccccchh--hhhhcCCceEeecCccc-hhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHh Q lcl|NC_013692. 403 DTMARSANGQVGVMKGALDVTN--RRRFDRGENYEFNPGAD-PRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFN 479 (726) Q Consensus 403 d~l~~~~~~~~~~~~gav~~~d--~~~~~~g~vi~~~~~~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~ 479 (726) +.+...++|.+.+.-...+... .......+.+.+...+. ....+.+...+.........+..+...+-..+++++.+ T Consensus 262 ~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~ 341 (470) T protein:vir:10 262 NDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPA 341 (470) T ss_pred HHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCC Confidence 9999999998876432222211 22334445555543322 12335555555555677788889999999999999876 Q ss_pred hccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhccccccee Q lcl|NC_013692. 480 AGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLK 559 (726) Q Consensus 480 ~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~ 559 (726) .+..+ +.|+.++..............-+.|..+++.+++.++.++.. .+.++..+ .+. T Consensus 342 ~~~~g---n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~----------~~~d~~~i---------~i~ 399 (470) T protein:vir:10 342 NFESS---NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF----------SDADKRHI---------SQH 399 (470) T ss_pred ccccc---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------cCccccee---------eEE Confidence 55322 346777777777777777777777777777776666554321 01111110 111 Q ss_pred eecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHH Q lcl|NC_013692. 560 LDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEA 639 (726) Q Consensus 560 i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~ 639 (726) .+.....-....++.... ...-+.... .+. .+....+....++....+.....+...+. ..... T Consensus 400 f~~~~p~d~~e~~~~~~~----~~g~iS~et---~l~---~~p~v~D~~~E~eri~~E~~e~~~~~~~~------~~~~~ 463 (470) T protein:vir:10 400 WTRTKVEDSLTKAQIVST----VANYSSKEA---VAK---ANPIVDDWQQELKDLAKDKEENDPYSNQA------DELNG 463 (470) T ss_pred eccCCCCCHHHHHHHHHH----HhccCcHHH---HHH---hCCCCCCHHHHHHHHHHHHHHHHHhhccc------cccCC Confidence 111111111111111111 111111110 000 01111111110111000000000000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 640 ERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 640 ~~aq~q~~~~~~~~~~~~~~~eq 662 (726) .-.-.+| T Consensus 464 ----------------~~~dde~ 470 (470) T protein:vir:10 464 ----------------KGVNDEQ 470 (470) T ss_pred ----------------CCCCCCC Confidence 0000000 No 69 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.49 E-value=1e-12 Score=86.38 Aligned_cols=471 Identities=10% Similarity=0.039 Sum_probs=208.3 Q ss_pred CCCccchhhcCCCCCCc------cchhcCCCCCCchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCC-CC-CC--C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGD------PSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEK-ITQINRWLDYMHVRG-EG-KP--K 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~-~~~~~~~~~~y~~~~-~~-~~--~ 69 (726) ..++-.|+.+.--...- +.-..++.| ....|++-|. .|... .....+..+||.|.. .. .. . T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~i~----~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~ 79 (501) T protein:vir:27 8 DSTGQDLVLNLRFHRESRIRYRADNLEELMVN----NWELLKNFIN----HHKLRQAPRIQELLDYARGENHDVLQFGRR 79 (501) T ss_pred eccchhhhhhcccChhHHHhhccccccccccc----cHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCCccccccCcc Confidence 22233322211111000 000011111 1122444443 44433 345678899998753 21 11 1 Q ss_pred CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ..++++ +++.+.....|+.....| ||.. +.|... |...-+...++++.+| ..|+.-..+..+++++++ T Consensus 80 ~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p--~~~~~~---d~~~~~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~ 149 (501) T protein:vir:27 80 KDREMADKRAVHNYGRMISKFKTGYL----AGNP--IRVEYD---DNDNNSQNDDTIKRIG-RINDIDSHNRTLIRDLSQ 149 (501) T ss_pred CccccccceeccchHHHHHHHHhhhh----cccC--eeEecC---CccchHHHHHHHHHHH-HhcChhHHHHHHHHHHhh Confidence 223444 678888888888877666 4443 233321 2222334556677765 467777788899999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+|++.. T Consensus 150 ~G~a~~~vy~ded------------------------------------------------------------------- 162 (501) T protein:vir:27 150 TGRAYEVIYRNEY------------------------------------------------------------------- 162 (501) T ss_pred CCeEEEEEEeCCC------------------------------------------------------------------- Confidence 9999998887510 Q ss_pred eeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGP 305 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~ 305 (726) +.|++..++|.+++ ||++.. .+..+.+| .|....+ T Consensus 163 ----------~~~~i~~~~p~~~~~v~d~~~~---~~~~~~ir-~~~~~~~----------------------------- 199 (501) T protein:vir:27 163 ----------DETRIKRLNPLETFVIYDNSLE---DNSIAAVR-YYNRGTL----------------------------- 199 (501) T ss_pred ----------CceEEEEEccceeEEEecCCCC---CceEEEEE-EEEeeec----------------------------- Confidence 01234556677765 344321 11222222 2210000 Q ss_pred hccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH Q lcl|NC_013692. 306 SEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA 385 (726) Q Consensus 306 ~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 385 (726) ...+..+|+|.. +.+ ++....|+ .......|.+.+.+|++.+.. ..+|.|.+. T Consensus 200 -------------~~~~~~~~vyt~-----~~v---~~~~~~~~-~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e 252 (501) T protein:vir:27 200 -------------QNAKDVVEIYTN-----EHI---YTLDASDD-FNEISVTTHAFGTVPITEFLN-----NVDGIGDYE 252 (501) T ss_pred -------------CCcEEEEEEEeC-----CeE---EEEEeCCc-eeeccccccCCCcccEEEecC-----CCCCCCchh Confidence 001233444432 111 11111111 112223344447788776643 456899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cchh--hhhhcCCceEeecCcc-----chhhhcccccCccchhH Q lcl|NC_013692. 386 LLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVTN--RRRFDRGENYEFNPGA-----DPRAAVHMHTFPEIPQS 457 (726) Q Consensus 386 ~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d--~~~~~~g~vi~~~~~~-----~~~~~i~~~~~~~~~~~ 457 (726) .++++++.+|..++.+.+.+...+++.+.+ .|.. +..+ .......+.+.+..+. .....+.+...+..... T Consensus 253 ~v~~liDa~d~~~S~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 331 (501) T protein:vir:27 253 TELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSG 331 (501) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHH Confidence 999999999999999999999988887765 4432 2221 1222333444443221 11123444444444556 Q ss_pred HHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEE Q lcl|NC_013692. 458 AQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVV 537 (726) Q Consensus 458 ~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~i 537 (726) ....+..+...+...|++++.+.|..++ +.||.++...............+.|..+++++.+.+++++........ T Consensus 332 ~~~~~~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~-- 407 (501) T protein:vir:27 332 AEAYKTRLNRDIHIFTNIPDMSDTNFSG--NTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKD-- 407 (501) T ss_pred HHHHHHHHHHHHHHHhCCcccCcccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-- Confidence 7778889999999999999887664322 236666666666666666666777777887777777665432211100 Q ss_pred EEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 538 RITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 538 Ri~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) -++. .+.-.| ......-..+.++.+..+ ...++.... +. .+..+.+....++....+ T Consensus 408 ----~d~~-----~i~v~f----~~~~p~n~~e~ad~~~kl----~g~iS~et~---l~---~l~~v~D~~~E~eri~~E 464 (501) T protein:vir:27 408 ----FDES-----LLKITF----TPNLPKSLNEQVSILTGL----GGQVSQETA---LS---LSGLVESPNEELDKINKE 464 (501) T ss_pred ----cccc-----cceEEe----CCCCCcCHHHHHHHHHHH----hccCcHHHH---HH---hCCCCCCHHHHHHHHHHH Confidence 0000 000001 000000001111111111 111111110 00 011111101111111100 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTE 661 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~e 661 (726) ..... ....+..+.................+..-..| T Consensus 465 ~~e~~-------~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 465 VSEID-------FKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHhhh-------HhhhcCccccccccccCCCCCCccccccccCC Confidence 00000 00000000000000000000000000000000 No 70 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.49 E-value=6.3e-12 Score=81.98 Aligned_cols=432 Identities=7% Similarity=0.034 Sum_probs=205.1 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC---------CCCCCCC--cCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_013692. 30 PSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP---------KTEKGKS--AVQPPTIRKQAEWRYSSLSEPFL 98 (726) Q Consensus 30 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~---------~~~~grs--~~v~~~v~~~v~~~~~~L~~~f~ 98 (726) -++..|+.-|+ .|........+..+||.|.-+... ....+++ +++.+..+..|+.....| | T Consensus 1 l~~~~i~~~i~----~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl----~ 72 (451) T protein:vir:10 1 MELEKIRAIIS----ADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYM----F 72 (451) T ss_pred CCHHHHHHHHH----HHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhe----e Confidence 33444444443 455566667889999988543211 1111222 677888888888777655 4 Q ss_pred CCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcch Q lcl|NC_013692. 99 SSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSS 178 (726) Q Consensus 99 ~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~ 178 (726) |.+ +.|.. .+|.+.. ..+++.+ .|+.-..+....++++.+|.+.+.+|++..... T Consensus 73 G~p--~~~~~--~~~~~~~----~~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~--------------- 127 (451) T protein:vir:10 73 TYP--VLFDI--DNNKELN----EKVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSG--------------- 127 (451) T ss_pred ccc--ceeec--CCcHHHH----HHHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCccc--------------- Confidence 444 23332 2333333 3455443 355556667888999999999998887611000 Q ss_pred HHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheee--CCCC Q lcl|NC_013692. 179 EELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVI--DPSC 256 (726) Q Consensus 179 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a 256 (726) ..+..+.+++..++|.+++. |.+. T Consensus 128 ------------------------------------------------------~~~~~~~~~~~~i~p~~~~~vydd~~ 153 (451) T protein:vir:10 128 ------------------------------------------------------EQVTNQTFKYGVVNTEEIIPIYRNGI 153 (451) T ss_pred ------------------------------------------------------ccccccceeEEEEcccceEEEEcCCC Confidence 00112234566788888764 3221 Q ss_pred CCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCC Q lcl|NC_013692. 257 GSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGD 336 (726) Q Consensus 257 ~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~ 336 (726) . .+..+.+| .|...++- .| ......+..+|+|.. + T Consensus 154 ~---~~~~~~ir-~~~~~~~~--~~----------------------------------~~~~~~~~~~e~yt~-----~ 188 (451) T protein:vir:10 154 E---RELEAVIR-YYIQLEDV--KG----------------------------------QIQKQAYTYVEFWTD-----K 188 (451) T ss_pred C---CceEEEEE-EEEeeecc--cc----------------------------------cccceEEEEEEEEeC-----C Confidence 1 22233322 22111110 00 000112333444431 2 Q ss_pred ceEEEEEE--EEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceE Q lcl|NC_013692. 337 GVLHPIVA--TWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVG 414 (726) Q Consensus 337 g~~~~~~~--~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~ 414 (726) ++..+... -..|..++ ....|.+.+.+|++.++. ...|.|.++.++++++.+|.+.|.+.+.+...++|.+. T Consensus 189 ~~~~~~~~~~~~~~~~~~-~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~ 262 (451) T protein:vir:10 189 ILDKYKFFGVSCCGSQIE-HITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYI 262 (451) T ss_pred eEEEEEecccCccccccc-cccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceee Confidence 22111110 11122222 222333446667665543 34578999999999999999999999999999998776 Q ss_pred eecccc-cc--hhhhhhcCCceEeecCccch-hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhh Q lcl|NC_013692. 415 VMKGAL-DV--TNRRRFDRGENYEFNPGADP-RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDT 490 (726) Q Consensus 415 ~~~gav-~~--~d~~~~~~g~vi~~~~~~~~-~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~t 490 (726) + .|.- .. .........+++.+...... .+.+.++..+.........+..+...+...|++++.+.+..| +.| T Consensus 263 ~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~g---n~S 338 (451) T protein:vir:10 263 L-ENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFG---NAS 338 (451) T ss_pred e-ecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccc---ccc Confidence 5 3421 11 12233455666666543222 123455555555667788899999999999999987654322 246 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHH Q lcl|NC_013692. 491 ATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNA 570 (726) Q Consensus 491 a~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~ 570 (726) +.|+..............-+.|..+++++++.++.++-.+ ++.. ..+..+.....-..+ T Consensus 339 g~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~------------d~~~---------i~i~f~~~~p~n~~e 397 (451) T protein:vir:10 339 GVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT------------DYKK---------IQQTYTRNMMSNDLE 397 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------------Cccc---------eeEEecCCCCCCHHH Confidence 6667776666666666666667677766666665544211 1110 011111111111111 Q ss_pred HHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 571 KVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSG 650 (726) Q Consensus 571 ~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~ 650 (726) ..+.+..+ ...++.... + ..++...+..+.++....+ .+.+..+.+ ..- T Consensus 398 ~~~~~~kl----~g~iS~et~---~---~~~p~v~d~~~e~~~~~ee-----------------~~~~~~~~~----~~~ 446 (451) T protein:vir:10 398 DADIATKS----VGIIPTKII---L---RHHPWVDDVEEAEKLYLEE-----------------KKIQASKVS----DDY 446 (451) T ss_pred HHHHHHHH----hccCchHHH---H---HhCCCCCCHHHHHHHHHHH-----------------HHHHHHHHH----hhc Confidence 11111111 111111110 0 1111111110000000000 000000000 000 Q ss_pred HHHHH Q lcl|NC_013692. 651 AGLQD 655 (726) Q Consensus 651 ~~~~~ 655 (726) -..-. T Consensus 447 ~~~~~ 451 (451) T protein:vir:10 447 NNFTE 451 (451) T ss_pred CCCCC Confidence 00000 No 71 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.47 E-value=9.7e-13 Score=86.45 Aligned_cols=468 Identities=14% Similarity=0.103 Sum_probs=196.8 Q ss_pred CCCccchhcCCCCC--Cc--hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHHH Q lcl|NC_013692. 14 EDGDPSKRLQPEWS--NA--PSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTIR 83 (726) Q Consensus 14 ~~~~~~~~~~~~~~--~~--~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v~ 83 (726) +..-.. +.+|- .- -....|....+|.+ .-...++..+..|..||.|....- .|..+.|.....+.-. T Consensus 1 m~~~~~---~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~ 77 (500) T protein:vir:30 1 MGVIQK---IKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIAR 77 (500) T ss_pred CchHHH---HHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHH Confidence 111111 11220 00 00112232333322 233455566888999998653211 0112222222334333 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ..++ .|.+.+|+-.+-+.+ +|. ..+++|+.++ ..|+....+..++..|+..|.+++|+||+.. T Consensus 78 ~i~~----~~A~lv~~e~~~i~~-----~d~----~~~~~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--- 140 (500) T protein:vir:30 78 TAAK----KIASLVFNEQAEIKV-----DDD----AANEFISETL-KNDRFNKNFERYLESCLALGGLAMRPYVDGD--- 140 (500) T ss_pred HHHH----HHhhhhcCCcceEec-----CCh----HHHHHHHHHH-hhccHHHHHHHHHHHHhhcCCEEEEEEEeCC--- Confidence 3333 233334443333333 333 4555788776 3677778899999999999999999999610 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) .|.|+ T Consensus 141 ---------------------------------------------------------------------------~~~I~ 145 (500) T protein:vir:30 141 ---------------------------------------------------------------------------KVRVA 145 (500) T ss_pred ---------------------------------------------------------------------------ceEEE Confidence 01234 Q ss_pred eechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEE Q lcl|NC_013692. 244 VCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLV 323 (726) Q Consensus 244 ~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 323 (726) +|++..|++=......+..|-++++.. .+.+. ...||-.++..-+.... .+.-.+.-... ...+.-...|- T Consensus 146 ~v~ad~~~P~~~d~~~~~~~a~~~~~~-~~~~~--~~~~yt~lE~h~~~~~~-----~~~I~n~ly~~-~~~~~lG~~v~ 216 (500) T protein:vir:30 146 FVQAPVFLPLQSNTQDVSSAAVVIKSV-KTING--KEVYYTLIEFHEWQSSD-----DYVISNELYRS-DDKAKVGSRVP 216 (500) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEe-eeecC--CceEEEEEEEEEEeCCc-----eeEEEEEEEec-ccccccCcccc Confidence 455555553100001122222222111 00000 00011111000000000 00000000000 00000011222 Q ss_pred EEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeee----eeecCcccCCChHHHHHHHHHHHHHHHH Q lcl|NC_013692. 324 VHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNY----IPRKRDLYGESDGALLIDNQRIIGAVTR 399 (726) Q Consensus 324 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~ 399 (726) +.+.|.-+ .+.+.+. ...+.||+.+.+ ....++.+|.|++..+++..+.+|..++ T Consensus 217 l~~~~~~l---------------~~~~~~~------~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s 275 (500) T protein:vir:30 217 LSEVYKDL---------------KDEAKVT------DVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYD 275 (500) T ss_pred cccccCCc---------------CcceEec------cCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHH Confidence 22322110 0000110 112234444322 2345788999999999999999999999 Q ss_pred HHHHHHHhcCCCceEeecccccchhh---------hhhcCCc-eEe-ecCccchhhhcccccCccchhHHHHHHHHHHHH Q lcl|NC_013692. 400 GMIDTMARSANGQVGVMKGALDVTNR---------RRFDRGE-NYE-FNPGADPRAAVHMHTFPEIPQSAQYMINLQQAE 468 (726) Q Consensus 400 ~~~d~l~~~~~~~~~~~~gav~~~d~---------~~~~~g~-vi~-~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~ 468 (726) ++.+.+.. +..++.++.+.+..... ..+.+.. ++. ++........|...++......+...++.+... T Consensus 276 ~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (500) T protein:vir:30 276 EFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSL 354 (500) T ss_pred HHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHH Confidence 99998865 77788888877632211 0111111 222 221112223455544333234455666666666 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCcCeEEEEeccccee Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF--LDDVEVVRITNEHFVD 546 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~--~d~e~~iRi~~~~~v~ 546 (726) +....|++.-..|.+++. ..||+++....+..-.+...+...+..+++++.+.++.+..-+ +...- T Consensus 355 i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~----------- 422 (500) T protein:vir:30 355 FEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV----------- 422 (500) T ss_pred HHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----------- Confidence 777778777666655443 2578888776666667777788888888888888888765432 22110 Q ss_pred cchhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh--hhhhhHHHHHhhhhh-h Q lcl|NC_013692. 547 IRRDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP--DFAKRIREFQPQPDP-I 621 (726) Q Consensus 547 v~~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~l~~~~~~~~~-~ 621 (726) ...+++++. .+...-..........+... + -++.... + ++.-+.. +..+.+.+.+.+..+ . T Consensus 423 ------~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a-G-i~s~~~~---i---~~~~g~~eeea~~~l~~i~~E~~~~~ 488 (500) T protein:vir:30 423 ------PSMDNISISLDDGVFTDRDAELDYWIKVVNA-G-FGTREMA---I---QKVLNVTEEKAQEIAAEINTGIVDEI 488 (500) T ss_pred ------CCCcceEEEeCCCCCCCHHHHHHHHHHHHHc-C-CCCHHHH---H---HhcCCCCHHHHHHHHHHHHHhccccC Confidence 011122221 11111111111111111111 1 1111110 0 0000000 011111111111100 0 Q ss_pred hhhHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQ 634 (726) Q Consensus 622 ~qq~~q~e~q~~q 634 (726) ......... --+ T Consensus 489 ~~~~~~~~~-~g~ 500 (500) T protein:vir:30 489 NQQRTDTHL-YGE 500 (500) T ss_pred CCCCccccc-cCC Confidence 000000000 000 No 72 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.47 E-value=9.7e-13 Score=86.45 Aligned_cols=468 Identities=14% Similarity=0.103 Sum_probs=196.8 Q ss_pred CCCccchhcCCCCC--Cc--hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCHHHH Q lcl|NC_013692. 14 EDGDPSKRLQPEWS--NA--PSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEGK-----PKTEKGKSAVQPPTIR 83 (726) Q Consensus 14 ~~~~~~~~~~~~~~--~~--~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~~-----~~~~~grs~~v~~~v~ 83 (726) +..-.. +.+|- .- -....|....+|.+ .-...++..+..|..||.|....- .|..+.|.....+.-. T Consensus 1 m~~~~~---~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~ 77 (500) T protein:vir:98 1 MGVIQK---IKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIAR 77 (500) T ss_pred CchHHH---HHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHH Confidence 111111 11220 00 00112232333322 233455566888999998653211 0112222222334333 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ..++ .|.+.+|+-.+-+.+ +|. ..+++|+.++ ..|+....+..++..|+..|.+++|+||+.. T Consensus 78 ~i~~----~~A~lv~~e~~~i~~-----~d~----~~~~~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--- 140 (500) T protein:vir:98 78 TAAK----KIASLVFNEQAEIKV-----DDD----AANEFISETL-KNDRFNKNFERYLESCLALGGLAMRPYVDGD--- 140 (500) T ss_pred HHHH----HHhhhhcCCcceEec-----CCh----HHHHHHHHHH-hhccHHHHHHHHHHHHhhcCCEEEEEEEeCC--- Confidence 3333 233334443333333 333 4555788776 3677778899999999999999999999610 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) .|.|+ T Consensus 141 ---------------------------------------------------------------------------~~~I~ 145 (500) T protein:vir:98 141 ---------------------------------------------------------------------------KVRVA 145 (500) T ss_pred ---------------------------------------------------------------------------ceEEE Confidence 01234 Q ss_pred eechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEE Q lcl|NC_013692. 244 VCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLV 323 (726) Q Consensus 244 ~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 323 (726) +|++..|++=......+..|-++++.. .+.+. ...||-.++..-+.... .+.-.+.-... ...+.-...|- T Consensus 146 ~v~ad~~~P~~~d~~~~~~~a~~~~~~-~~~~~--~~~~yt~lE~h~~~~~~-----~~~I~n~ly~~-~~~~~lG~~v~ 216 (500) T protein:vir:98 146 FVQAPVFLPLQSNTQDVSSAAVVIKSV-KTING--KEVYYTLIEFHEWQSSD-----DYVISNELYRS-DDKAKVGSRVP 216 (500) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEe-eeecC--CceEEEEEEEEEEeCCc-----eeEEEEEEEec-ccccccCcccc Confidence 455555553100001122222222111 00000 00011111000000000 00000000000 00000011222 Q ss_pred EEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeee----eeecCcccCCChHHHHHHHHHHHHHHHH Q lcl|NC_013692. 324 VHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNY----IPRKRDLYGESDGALLIDNQRIIGAVTR 399 (726) Q Consensus 324 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~ 399 (726) +.+.|.-+ .+.+.+. ...+.||+.+.+ ....++.+|.|++..+++..+.+|..++ T Consensus 217 l~~~~~~l---------------~~~~~~~------~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s 275 (500) T protein:vir:98 217 LSEVYKDL---------------KDEAKVT------DVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYD 275 (500) T ss_pred cccccCCc---------------CcceEec------cCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHH Confidence 22322110 0000110 112234444322 2345788999999999999999999999 Q ss_pred HHHHHHHhcCCCceEeecccccchhh---------hhhcCCc-eEe-ecCccchhhhcccccCccchhHHHHHHHHHHHH Q lcl|NC_013692. 400 GMIDTMARSANGQVGVMKGALDVTNR---------RRFDRGE-NYE-FNPGADPRAAVHMHTFPEIPQSAQYMINLQQAE 468 (726) Q Consensus 400 ~~~d~l~~~~~~~~~~~~gav~~~d~---------~~~~~g~-vi~-~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~ 468 (726) ++.+.+.. +..++.++.+.+..... ..+.+.. ++. ++........|...++......+...++.+... T Consensus 276 ~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (500) T protein:vir:98 276 EFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSL 354 (500) T ss_pred HHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHH Confidence 99998865 77788888877632211 0111111 222 221112223455544333234455666666666 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCcCeEEEEeccccee Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF--LDDVEVVRITNEHFVD 546 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~--~d~e~~iRi~~~~~v~ 546 (726) +....|++.-..|.+++. ..||+++....+..-.+...+...+..+++++.+.++.+..-+ +...- T Consensus 355 i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~----------- 422 (500) T protein:vir:98 355 FEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV----------- 422 (500) T ss_pred HHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----------- Confidence 777778777666655443 2578888776666667777788888888888888888765432 22110 Q ss_pred cchhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh--hhhhhHHHHHhhhhh-h Q lcl|NC_013692. 547 IRRDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP--DFAKRIREFQPQPDP-I 621 (726) Q Consensus 547 v~~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~l~~~~~~~~~-~ 621 (726) ...+++++. .+...-..........+... + -++.... + ++.-+.. +..+.+.+.+.+..+ . T Consensus 423 ------~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~a-G-i~s~~~~---i---~~~~g~~eeea~~~l~~i~~E~~~~~ 488 (500) T protein:vir:98 423 ------PSMDNISISLDDGVFTDRDAELDYWIKVVNA-G-FGTREMA---I---QKVLNVTEEKAQEIAAEINTGIVDEI 488 (500) T ss_pred ------CCCcceEEEeCCCCCCCHHHHHHHHHHHHHc-C-CCCHHHH---H---HhcCCCCHHHHHHHHHHHHHhccccC Confidence 011122221 11111111111111111111 1 1111110 0 0000000 011111111111100 0 Q ss_pred hhhHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQ 634 (726) Q Consensus 622 ~qq~~q~e~q~~q 634 (726) ......... --+ T Consensus 489 ~~~~~~~~~-~g~ 500 (500) T protein:vir:98 489 NQQRTDTHL-YGE 500 (500) T ss_pred CCCCccccc-cCC Confidence 000000000 000 No 73 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.47 E-value=8.5e-13 Score=86.76 Aligned_cols=489 Identities=14% Similarity=0.057 Sum_probs=203.4 Q ss_pred CCCccc-hhcCCCCCCchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCC-CCCCCCCCCc----CCCHHHHHHH Q lcl|NC_013692. 14 EDGDPS-KRLQPEWSNAPSLAQLKQDYQEAK-QVTDEKITQINRWLDYMHVRGEG-KPKTEKGKSA----VQPPTIRKQA 86 (726) Q Consensus 14 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~-~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~grs~----~v~~~v~~~v 86 (726) +..... |..+..+..-.....|++.+++-+ ....++...+.+|..||.|...- +...+.|+.+ ...+ +...| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~-~~~~i 79 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLN-LRKLS 79 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecC-cHHHH Confidence 111111 111111111101112222222211 22344555678899999765531 1122223221 1112 22334 Q ss_pred HHHHHHHHHhhcCCCceEEEecC-Ccc-hHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeE Q lcl|NC_013692. 87 EWRYSSLSEPFLSSPNIFEVNPV-TWE-DAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTV 164 (726) Q Consensus 87 ~~~~~~L~~~f~~~~~~~~~~p~-~~~-D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~ 164 (726) -.-+|+|+ |+-..-+.+... ..+ +.....-+.++||.++ ..|+.+..+..++.+++..|.|++|+||+.. T Consensus 80 ~~~~A~Ll---~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~---- 151 (517) T protein:vir:98 80 ADVLSGLV---FNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVF-QHNKFIKNLSDYLEPTFALGGLTVRPYVDNG---- 151 (517) T ss_pred HHHhhhhh---cCCcceEEecccccccccccchhHHHHHHHHHH-HhccHHHHHHHHHHHHhhhCCEEEEEEEeCC---- Confidence 44455552 333333333321 111 1122223566888887 4777788899999999999999999999710 Q ss_pred EecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeee Q lcl|NC_013692. 165 KEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQV 244 (726) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~ 244 (726) .+.|+. T Consensus 152 --------------------------------------------------------------------------~~~I~~ 157 (517) T protein:vir:98 152 --------------------------------------------------------------------------EIEFSW 157 (517) T ss_pred --------------------------------------------------------------------------eeEEEE Confidence 012344 Q ss_pred echhheee-CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEE Q lcl|NC_013692. 245 CDYNNIVI-DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLV 323 (726) Q Consensus 245 v~p~~~~~-dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 323 (726) |++..||+ ..+. ..+..|-+++. .+.+.++ ...||-.++.-.+...... ...+.-.+.-... ...+.-..+|- T Consensus 158 v~ad~~~Pl~~~~-~~v~~~ai~~~-~~~~~~~--~~~~Yt~lE~H~~~~~~~~-~~~y~I~n~ly~s-~~~~~lG~~v~ 231 (517) T protein:vir:98 158 ALANAFYPLRSNS-NGISEGVMKSV-TTKVIGN--KTVYYTLLEFHEWEKTEEG-ESLYVITNELYKS-DNEGEIGKRIP 231 (517) T ss_pred EcCCeeEEEEecC-CCeEEEEEEEE-EEEeecC--CceEEEEEEEEecCceecc-CCcEEEEEEEEec-CCCcccccccc Confidence 55555553 1111 11222332221 1111111 0001110000000000000 0000000000000 00001112222 Q ss_pred EEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccc-eEEeee----eeecCcccCCChHHHHHHHHHHHHHHH Q lcl|NC_013692. 324 VHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIP-YVVVNY----IPRKRDLYGESDGALLIDNQRIIGAVT 398 (726) Q Consensus 324 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~P-f~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~ 398 (726) +.+.|.- +... +++ .+-..| |+.+.. ....++.+|.|++..+++..+.+|..+ T Consensus 232 L~~~~e~-------l~~~--~~~-------------~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~ 289 (517) T protein:vir:98 232 LEELYEG-------MQEK--TYI-------------QGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTY 289 (517) T ss_pred ccccccC-------CCcc--eeE-------------CCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHH Confidence 2222210 0000 111 111123 322211 223477899999999999999999999 Q ss_pred HHHHHHHHhcCCCceEeecccccchhh-hh------hc-CCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHH Q lcl|NC_013692. 399 RGMIDTMARSANGQVGVMKGALDVTNR-RR------FD-RGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAE 470 (726) Q Consensus 399 ~~~~d~l~~~~~~~~~~~~gav~~~d~-~~------~~-~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e 470 (726) +++++.+.+ +..++.++.+.+..... -. +. ...++..-.+......+...++......+...++.+-+.+. T Consensus 290 s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~ 368 (517) T protein:vir:98 290 DQFWWEIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLE 368 (517) T ss_pred HHHHHHHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHH Confidence 999998777 67788888888732211 00 11 11222221222223345555444444566777778888888 Q ss_pred HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCcCeEEEEecccceecc Q lcl|NC_013692. 471 SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF--LDDVEVVRITNEHFVDIR 548 (726) Q Consensus 471 ~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~--~d~e~~iRi~~~~~v~v~ 548 (726) ...|++.-..|.++... .||+++....+..-.....+...+..+++++.+.++.+..-+ +...- T Consensus 369 ~~~Gls~~t~~~~~~~~-kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~------------- 434 (517) T protein:vir:98 369 MELKLSVGTFSFDGRSM-KTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEI------------- 434 (517) T ss_pred HHhCCCccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC------------- Confidence 88999988888765543 588888776666666667777778888888888887665433 21110 Q ss_pred hhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHH-HHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhH Q lcl|NC_013692. 549 RDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMA-QQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQK 625 (726) Q Consensus 549 ~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~-~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~ 625 (726) ...+++++. .+...-..........+.. .+ -++.... ...+. . ....+.+...++++.....++..-.+ T Consensus 435 ----~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~-aG-~ms~~~~i~~~~g-~-~eeeA~~e~~~i~~E~~~~~~~~~~~ 506 (517) T protein:vir:98 435 ----PSAEHIGVDFDDGVFQDRSALLRFYGQAKT-FG-FIPTVEAIQRIFK-V-PKKTAEQWLEEIRKDQIELDPVTISQ 506 (517) T ss_pred ----CCCcceEEEcCCCCCCCHHHHHHHHHHHHh-cC-CCCHHHHHHHhCC-C-ChHHHHHHHHHHHHhccccCCCCccc Confidence 011222222 2221111121111211111 11 1111111 00000 0 00000001111111111001000000 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_013692. 626 AQLELMLLQAQIE 638 (726) Q Consensus 626 ~q~e~q~~qaq~e 638 (726) .+. ...--.-| T Consensus 507 ~~~--~~~~gd~e 517 (517) T protein:vir:98 507 RAQ--KRMFGDEE 517 (517) T ss_pred ccc--CCCCCCCC Confidence 000 00000000 No 74 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.47 E-value=6.6e-12 Score=81.89 Aligned_cols=457 Identities=9% Similarity=0.012 Sum_probs=210.1 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCC------- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTE------- 71 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~------- 71 (726) |++++-+-.+---....+.. .-.++..-..|+..|+ -|...+....+..+||.|.-+... ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~i~~~i~----~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~ 72 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQI----KPKYETQEEMIIRLIN----DHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEID 72 (474) T ss_pred CeeeccCCCchhhhhHHHHh----hhccCChHHHHHHHHH----HHHHHHHHHHHHHHHhccCCcchhccchhccccccc Confidence 88886554332222221111 1122333344444554 355566667888999987653211 111 Q ss_pred CCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcC Q lcl|NC_013692. 72 KGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEG 149 (726) Q Consensus 72 ~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~ 149 (726) +.+ .+++.+..+..|+.....| ||.+ +.|.+ +|.+..+ .++.++. ++....+...+++++.+| T Consensus 73 ~~~~~~ki~~n~~~~Ivd~~~~~l----~g~p--~~~~~---~d~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G 137 (474) T protein:vir:96 73 PLKPDWRMFTNYHQNLVDQKVAYA----VANP--VTFSS---DDDKSLK----TIQEVLN--HKWDDKLVDILTAASNKG 137 (474) T ss_pred ccccchhcccchHHHHHHhhhhhh----cccC--ceeec---CchHHHH----HHHHHHh--cCHHHHHHHHHHHHHhcC Confidence 112 2577777777777777555 4543 34433 3444333 4444443 455667778889999999 Q ss_pred CeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccccee Q lcl|NC_013692. 150 TIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEE 229 (726) Q Consensus 150 ~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 229 (726) .+.+.+||+.. T Consensus 138 ~~~~~~y~d~~--------------------------------------------------------------------- 148 (474) T protein:vir:96 138 IEWLQPYIDEN--------------------------------------------------------------------- 148 (474) T ss_pred eeEEEEEecCC--------------------------------------------------------------------- Confidence 99998876510 Q ss_pred ecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 230 EEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 230 ~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) +.+++..++|.++++ |++. ..+..+ +.+.|...+. ..+.. T Consensus 149 --------~~~~i~~~~p~~~~~v~d~~~---~~~~~~-~vr~~~~~~~-------~~~~~------------------- 190 (474) T protein:vir:96 149 --------GEFKTFRVPAEQAIPIWTNKE---RDTLKA-FIRYYRLDGA-------ERVEY------------------- 190 (474) T ss_pred --------CceEEEEEcccceEEEEcCCC---CCceEE-EEEEEeecCc-------eEEEE------------------- Confidence 012345577777663 3322 223332 2233211000 00000 Q ss_pred cccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEEC-CEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVG-AVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g-~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) | ..++|.. |.. .+.+..........+ .........|.+.+.+|++.+.. ...|.|.+.. T Consensus 191 ------y---t~~~v~~---~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~ 250 (474) T protein:vir:96 191 ------W---TDSDVTY---YEY---QDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN-----NPQEMSDLFM 250 (474) T ss_pred ------E---eCCeEEE---EEe---cCCceeeccccccccccccccccccccCCCceeEEEecc-----CCCCCCcHHH Confidence 0 0011111 111 111111100000000 00011223455567888877755 3468999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchh--hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTN--RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMIN 463 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d--~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~ 463 (726) ++++++.+|.++|.+.+.+...++|.+.+ .|. ..... ......++++.+...+ +.+.+...+.........++ T Consensus 251 v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~ 326 (474) T protein:vir:96 251 YKTIIDAMDKRLSDTQNTFDESTELIYIL-KGYEGQDLDEFMRNLKYYKAINVDGDG---SGVDTIQIEVPVQSSKEYLD 326 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccccchhhhhhcCceEEecCCC---CceeEEeecCChHHHHHHHH Confidence 99999999999999999999999887665 443 22111 2234456666664321 12455554444567778888 Q ss_pred HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 464 LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 464 ~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) .+...+-..|++++.+.+..++ +.||.|+...............+.|..+++++++.++.+.-..++ T Consensus 327 ~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~----------- 393 (474) T protein:vir:96 327 MLRDYVIEFGQGVDFQQDKFGN--SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIK----------- 393 (474) T ss_pred HHHHHHHHHhCCcccccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcc----------- Confidence 8999999999999887553332 246666766666666666666666777777766666554321111 Q ss_pred ceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_013692. 544 FVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQ 623 (726) Q Consensus 544 ~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~q 623 (726) +. .+ .+..+.....-.....+. +.. +..+.... .+. + +..+.+....++....+.....+ T Consensus 394 ~~-----~i----~i~f~~~~p~~~~e~~~~----~~~-ag~iS~et---~~~-~--~~~v~d~~~E~~ri~~E~~e~~~ 453 (474) T protein:vir:96 394 VQ-----DV----EITFNFNVMVNELEQSQI----GVQ-SQYLSKET---VVT-N--HPWVDDPVAELERIEQDNIDFNK 453 (474) T ss_pred cc-----ee----eEEeccCCCcCHHHHHHH----HHh-cCCCchHH---HHH-h--CCCCCCHHHHHHHHHHHHHHHHh Confidence 00 00 111111111111111111 111 11111111 111 0 11111111111111100000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQIEAERARAAHYMSGAGLQDSK 657 (726) Q Consensus 624 q~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~ 657 (726) ...... ...-..... ..+..- T Consensus 454 ~~~~~~------------~~~~~~~~d-~~~e~~ 474 (474) T protein:vir:96 454 QLPPLE------------GDANGRAQD-NESETN 474 (474) T ss_pred cccccc------------cccccccCC-CcccCC Confidence 000000 000000000 000000 No 75 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.46 E-value=4.7e-12 Score=82.66 Aligned_cols=472 Identities=10% Similarity=0.044 Sum_probs=207.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCc-----hHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCC-CC-CCC--C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNA-----PSLAQLKQDYQEAKQVTDEK-ITQINRWLDYMHVRG-EG-KPK--T 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~~~~-~~~~~~~~~~y~~~~-~~-~~~--~ 70 (726) -.++-.+-.+. ..-+.....-+..+. .....|.+-| ..|... .....+..+||.|.- .. ..+ . T Consensus 8 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i----~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:96 8 DSTGQERVLNL---RFHRESRIRYRADNLEELMVNNWELLKNFI----NHHKLRQAPRIQELLDYARGENHDVLKSGRRK 80 (501) T ss_pred ecccceecccc---ccchhHHhhhcccccccccCChHHHHHHHH----HHHHHHHHHHHHHHHHHhcCCCCcccCccccC Confidence 00000000000 000000000000111 1111233333 244433 345678889998753 21 111 1 Q ss_pred CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 EKGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 ~~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) ..++ .+++.+.....|+.....| ||.. +.|.....+ ..+....+|+.+|. .|+.-..+..++++++++ T Consensus 81 ~~~~~~~ri~~n~~k~Ivd~~~~yl----~g~p--~~~~~~~~~---~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~ 150 (501) T protein:vir:96 81 DNEMADKRAVHNYGRMISKFKTGYL----AGNP--IRVEYDDND---DNSQNDDAIKRIGR-INDLDSLNRTLIRDLSQT 150 (501) T ss_pred ccccccceeecchHHHHHHHHhhhh----cccC--eeEeeCCcc---chhHHHHHHHHHHH-hcCHHHHHHHHHHHHhhc Confidence 2233 3688888888888877655 3433 344443332 23455667887764 677777788999999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+.+.+|++.. T Consensus 151 G~a~~~v~~ded-------------------------------------------------------------------- 162 (501) T protein:vir:96 151 GRAYEVIYRSEY-------------------------------------------------------------------- 162 (501) T ss_pred CeEEEEEEEcCC-------------------------------------------------------------------- Confidence 999998877510 Q ss_pred eecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++..++|.+++ ||++.. ....+.++ .|..... T Consensus 163 ---------g~~~i~~~~p~~~~~v~d~~~~---~~~~~~v~-~~~~~~~------------------------------ 199 (501) T protein:vir:96 163 ---------DETRIKRLSPLETFVIYDNSLE---DNSIAAVR-YYNRGTL------------------------------ 199 (501) T ss_pred ---------CceEEEEEccceeEEEEcCCCC---CceEEEEE-EEEeecC------------------------------ Confidence 01234456666654 333221 11122211 1100000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) ...+.++++|.. +.+. .. -.++........|.+.+.+|++.+.. ..+|.|.+.. T Consensus 200 ------------~~~~~~~~vyt~-----~~i~---~~-~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~ 253 (501) T protein:vir:96 200 ------------QSAKDVVEIYTD-----EHIY---TL-DASDDFNEISVTTHAFGTVPITEYLN-----NIDGIGDYET 253 (501) T ss_pred ------------CCcEEEEEEEcC-----CcEE---EE-eeCCCceeccccccCCCccceEEecC-----CccCCCchhh Confidence 001233444421 1111 11 11111112233344457888776643 4468999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cch--hhhhhcCCceEeecCcc-----chhhhcccccCccchhHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVT--NRRRFDRGENYEFNPGA-----DPRAAVHMHTFPEIPQSA 458 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~--d~~~~~~g~vi~~~~~~-----~~~~~i~~~~~~~~~~~~ 458 (726) ++++++.+|..++.+.+.+...++|.+.+ .|.. ... ........+++.+.... .....+.+...+.....+ T Consensus 254 v~~liDa~d~~~s~~~~~~~~~~~~~l~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 332 (501) T protein:vir:96 254 ELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGA 332 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCceeee-ecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHH Confidence 99999999999999999999998887766 4432 222 12333444555544321 111223444444444667 Q ss_pred HHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEE Q lcl|NC_013692. 459 QYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVR 538 (726) Q Consensus 459 ~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iR 538 (726) ...+..+...+...|++++.+.|..++ +.||.|+...............+.|..+++++++.++.++........ T Consensus 333 ~~~~~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~--- 407 (501) T protein:vir:96 333 EAYKTRLNRDIHIFTNTPDMSDTNFSG--NTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKD--- 407 (501) T ss_pred HHHHHHHHHHHHHHhCCcccCcccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--- Confidence 778888899999999999888764332 246666766666666666666777777888877777766543211100 Q ss_pred EecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhh Q lcl|NC_013692. 539 ITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQP 618 (726) Q Consensus 539 i~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~ 618 (726) .+..+ ..+........-.....+.+..+ ...++.... +. .+..+.+....++....+. T Consensus 408 --------~d~~~----i~i~f~~~~p~n~~e~ad~~~kl----~g~iS~et~---~~---~l~~v~D~~~E~~ri~~E~ 465 (501) T protein:vir:96 408 --------FDESL----LKITFTPNLPKSLNEQVSILTGL----GGQVSQETA---LS---LSGLVESPNEELDKINKEM 465 (501) T ss_pred --------ccccc----ceEEeCCCCCcCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHH Confidence 00000 00111111111011111111111 111111110 00 0111111000011000000 Q ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 619 DPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 619 ~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq 662 (726) ....... ...+.............. ...+.+....+ T Consensus 466 ~~~~~~~-------~~~~~~~~~~~~~~~~~e-~~~d~~e~~~~ 501 (501) T protein:vir:96 466 SEIDFKG-------YSNDFNEHVGKYTDEVKE-THTDDFEREYE 501 (501) T ss_pred HHhhccc-------cccchhhcccccCCcCCC-CCCCccccccC Confidence 0000000 000000000000000000 00000000000 No 76 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.46 E-value=2.1e-12 Score=84.64 Aligned_cols=462 Identities=8% Similarity=0.006 Sum_probs=205.1 Q ss_pred CCCccc-hhhcCCC--CCCccchhc--CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC------- Q lcl|NC_013692. 1 MADVDE-DYLTLPN--EDGDPSKRL--QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP------- 68 (726) Q Consensus 1 ~~~~~~-~~~~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~------- 68 (726) ||..-- -=-.|-+ ++.++-+-- ..++..+.+ ...|..-...|..++....+..+||.|.-+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~----~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~ 76 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETL----EEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA 76 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhH----HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc Confidence 332100 0001111 122211111 222333333 233333334566777778889999988643211 Q ss_pred ----CCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHH Q lcl|NC_013692. 69 ----KTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRA 144 (726) Q Consensus 69 ----~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~ 144 (726) ...+-..+++.+..+..|+.....| +|.. +.|.+ +|.+.. .+|+..+. |+..+.+...+++ T Consensus 77 ~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l----~G~p--~~~~~---~d~~~~----~~l~~~~~--n~~~~~~~~~~~~ 141 (483) T protein:vir:12 77 TGAVDPLKPDDRMITNFHANLVDQKVSYI----VGKP--IAFKH---TDDEVV----KRIDEVLG--NRFDDKLHSVLTG 141 (483) T ss_pred cccccccccccccccchHHHHHHHHhhhh----cccC--ceecc---CChHHH----HHHHHHHh--ccHHHHHHHHHHH Confidence 1111224688888888888888665 4433 34432 343333 35555543 4556677788999 Q ss_pred HhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecc Q lcl|NC_013692. 145 GVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVP 224 (726) Q Consensus 145 ~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 224 (726) ++.+|.+.+.+||+.. T Consensus 142 ~~~~G~~y~~v~~d~d---------------------------------------------------------------- 157 (483) T protein:vir:12 142 ASNKGIEWLHPYLDEE---------------------------------------------------------------- 157 (483) T ss_pred HhhCCeEEEEEEEcCC---------------------------------------------------------------- Confidence 9999999998876510 Q ss_pred ccceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccch Q lcl|NC_013692. 225 VGSEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDY 302 (726) Q Consensus 225 ~~~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~ 302 (726) +.|.+..++|.++++ |++... +-.+ +.+.|...+. ..++.+ T Consensus 158 -------------~~~~i~~~~p~~~~~v~d~~~~~---~~~~-~ir~~~~~~~-------~~~~~y------------- 200 (483) T protein:vir:12 158 -------------GEFKLFRVPAEQGIPIWTDKEHE---ELEA-FIRMYKLENE-------TKVEYW------------- 200 (483) T ss_pred -------------CceEEEEEcccceEEEEcCCCCC---ceEE-EEEEEEeecc-------eEEEEE------------- Confidence 013445677777654 443221 2222 2222211000 000000 Q ss_pred hhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCC Q lcl|NC_013692. 303 TGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGES 382 (726) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g 382 (726) ...+|..+.+ .+...... .-...+...+.. .|.+.+.+|++.++. ..+|.| T Consensus 201 ---------------~~~~v~~~~~------~~~~~~~~-~~~~~~~~~~~~--~~~~~g~vPvv~~~n-----n~~g~s 251 (483) T protein:vir:12 201 ---------------DKVTVNYYVY------ENGSLIPD-YSNNLENSKTHF--STGSWGKIPFIPFKN-----NDLEIS 251 (483) T ss_pred ---------------ecCeEEEEEE------eCCeeeec-cccccccccccc--ccCCCCccceEEecC-----CCCCCC Confidence 0011111111 11111000 000011111222 233446677766543 456889 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cchhh--hhhcCCceEeecCccchhhhcccccCccchhHHH Q lcl|NC_013692. 383 DGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQ 459 (726) Q Consensus 383 ~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~ 459 (726) .++.++++++.+|..+|.+.+.+...++|.+.+ .|.- +.... ......+++.+..+++ +.+...+.....+. T Consensus 252 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~ 326 (483) T protein:vir:12 252 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENSK 326 (483) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcccchhHHHhhhhccccccCCCCc----ceEEeecCCHHHHH Confidence 999999999999999999999999999987765 4432 11111 2233445555554433 34444444456677 Q ss_pred HHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEE Q lcl|NC_013692. 460 YMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRI 539 (726) Q Consensus 460 ~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi 539 (726) ..+..+...+...|++++.+.+..++ +.||.|+...............+.|..+++++++.+++++-. . T Consensus 327 ~~~~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~----~----- 395 (483) T protein:vir:12 327 KYLDELYQKIMLFGQAVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----K----- 395 (483) T ss_pred HHHHHHHHHHHHHhCCCCCCcccccc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----C----- Confidence 78888889999999999877654332 246666766666666666666677777777766666554321 1 Q ss_pred ecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhh Q lcl|NC_013692. 540 TNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPD 619 (726) Q Consensus 540 ~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~ 619 (726) + ++..+ .+..+.....-.....+.+..+ ..-++.... +. .+..+.+....++....+.. T Consensus 396 -~-~~~~i---------~v~f~~~~p~~~~~~a~~~~kl----~GiiS~et~---~~---~~~~v~d~~~E~~ri~~E~~ 454 (483) T protein:vir:12 396 -G-EHKDV---------DISFNYNKVANTELQVQTAQQS----MGIVSHETV---LE---NHPFVEDLQAELERIEQEQM 454 (483) T ss_pred -C-cccee---------eEEeCCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHH Confidence 0 11111 1111111111111111111111 111111110 00 01111111111111000000 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 620 PIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTE 661 (726) Q Consensus 620 ~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~e 661 (726) ...++.....-. ...-+.+..+..... +| T Consensus 455 ~~~~~~~~~~~~--~~d~~~~~~~~~~~e-----------~e 483 (483) T protein:vir:12 455 EYNKQLPNLDDG--GADGAQQQERSNNKE-----------SE 483 (483) T ss_pred HHHhhccccccc--ccCCcccCCCCCccc-----------CC Confidence 000000000000 000000000000000 00 No 77 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.45 E-value=1.2e-11 Score=80.38 Aligned_cols=442 Identities=12% Similarity=0.053 Sum_probs=198.9 Q ss_pred CccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC---CCCCCC--CcC Q lcl|NC_013692. 3 DVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP---KTEKGK--SAV 77 (726) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~---~~~~gr--s~~ 77 (726) =..+||..++.+.- + +.. .|...|.+ +........++..+||+|.-+... ...+++ .++ T Consensus 1 ~~~~~~~~~~~~~~---------~-~~~---~~~~~i~~---~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki 64 (489) T protein:vir:99 1 MLQEDFEAIDYESK---------L-WID---QLKNYISR---FKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRI 64 (489) T ss_pred CCccceeeeCCCCC---------C-CHH---HHHHHHHH---HHHHHHHHHHHHHHHhcccCccccccccccccCCccee Confidence 23445555554421 1 111 12222322 222334446788889987754321 112233 368 Q ss_pred CCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEee Q lcl|NC_013692. 78 QPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGW 157 (726) Q Consensus 78 v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w 157 (726) +.+..+..|+.....| ||.. +.|.+ +|. ....+|+.+|. .|+.-.....+.++++++|.+++.+|+ T Consensus 65 ~~n~~~~iv~~~~~~l----~g~~--~~~~~---~d~----~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~ 130 (489) T protein:vir:99 65 ASDFAKYITVFEQGYM----LGVP--VEYKN---ENK----DLQAAIDLMSV-RNNEDYHNVKIKTDLSIYGRAYELLTV 130 (489) T ss_pred ecchHHHHHHHHhhhh----ccCC--ceeec---CCh----hHHHHHHHHHh-hcChhHHHHHHHHHHhhCCeEEEEEee Confidence 8888888888887655 4433 34443 232 23457777764 455445667899999999999998876 Q ss_pred eeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceee Q lcl|NC_013692. 158 NYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVE 237 (726) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 237 (726) .... + + . T Consensus 131 ~~~~--------------d----------------------------------------~-------------------~ 137 (489) T protein:vir:99 131 EKID--------------D----------------------------------------K-------------------K 137 (489) T ss_pred ccCc--------------C----------------------------------------C-------------------C Confidence 4100 0 0 1 Q ss_pred ccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccC Q lcl|NC_013692. 238 NHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQ 315 (726) Q Consensus 238 ~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (726) ..+.+..++|.++++ |+... ....+.++ .|... + . T Consensus 138 ~~~~i~~~~p~~~~~v~dd~~~---~~~~~~i~-~~~~~-~---------~----------------------------- 174 (489) T protein:vir:99 138 TEVKLYQLPAEQTFVIYDDTYQ---RNSLMAVH-FYDID-Y---------G----------------------------- 174 (489) T ss_pred cceEEEEEcccceEEEEcCCCC---CceEEEEE-EEEEe-c---------C----------------------------- Confidence 124456778888653 22221 12222222 22100 0 0 Q ss_pred CcCCceEEEEEEEEEeecCCCceEEEEEEEEE--CCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHH Q lcl|NC_013692. 316 DKSRKRLVVHEYWGYYDIHGDGVLHPIVATWV--GAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRI 393 (726) Q Consensus 316 ~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~--g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~ 393 (726) ....+.++++|.. +.+.+++..... +..+.. ..|.+.+.+|++.+.. ...|.|.+..++++++. T Consensus 175 --~~~~~~~~~~y~~-----~~i~~~~~~~~~~~~~~~~~--~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa 240 (489) T protein:vir:99 175 --SGKRKQIIKAYTS-----DTIYTYEDYNLETKGMRLKD--YEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDA 240 (489) T ss_pred --CCceEEEEEEEeC-----CcEEEEEecCCCcccceecc--cccccCCceeEEEeec-----CCCCCCchhhhHHHHHH Confidence 0011333444421 111111111111 111222 2233346778776653 34688999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCceEeecccc-cchhh------hhhcC------------CceEeecCccch---hhhcccccC Q lcl|NC_013692. 394 IGAVTRGMIDTMARSANGQVGVMKGAL-DVTNR------RRFDR------------GENYEFNPGADP---RAAVHMHTF 451 (726) Q Consensus 394 ~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d~------~~~~~------------g~vi~~~~~~~~---~~~i~~~~~ 451 (726) +|..++.+.+.+...+++.+.+ .|.. ...+. ....+ +.++.+.++... ...+.+... T Consensus 241 ~d~~~s~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 319 (489) T protein:vir:99 241 YDLSQSELANFQQDSVNALLVI-AGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKK 319 (489) T ss_pred HHHHHHHHHHHHHHhhhhhhhh-ccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeee Confidence 9999999999998888877655 3432 11111 11111 222222222211 112333333 Q ss_pred ccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_013692. 452 PEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFL 531 (726) Q Consensus 452 ~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~ 531 (726) +.........+..+...+...||+++.+.+..++ +.||.++...............+.|..+++.+.+.++.++.... T Consensus 320 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~ 397 (489) T protein:vir:99 320 EYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSG--VQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKG 397 (489) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 3334566667788888888999998876442221 23666666655555555666666677777777776666553221 Q ss_pred CcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh--hhh- Q lcl|NC_013692. 532 DDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP--DFA- 608 (726) Q Consensus 532 d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~--e~~- 608 (726) .... ......+ ..+..+.....-..+..+.+..+. + -++.... +.. +..+. +.. T Consensus 398 ~~~~---------~~~~~~~----i~v~f~~~~p~d~~~~~~~~~kl~---g-iis~et~---~~~---l~~v~~~d~~~ 454 (489) T protein:vir:99 398 NEAT---------TYSLVND----TSIVFTPNLPQNDNEIVTAAQNLY---G-IVSDQTI---FEI---LNTVTGVDAEA 454 (489) T ss_pred Cccc---------ccccccc----ceEEeCCCCCcCHHHHHHHHHHHh---c-cCCHHHH---HHh---cCCCCchhHHH Confidence 1100 0000000 011111111110111111111110 1 1111110 000 00100 000 Q ss_pred --hhHHHHHhhhhhh-----------hhhHHHHHH Q lcl|NC_013692. 609 --KRIREFQPQPDPI-----------AQQKAQLEL 630 (726) Q Consensus 609 --~~l~~~~~~~~~~-----------~qq~~q~e~ 630 (726) +++++.+...... +++..+.+. T Consensus 455 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 455 ELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 0110000000000 000000000 No 78 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.44 E-value=5.2e-12 Score=82.43 Aligned_cols=460 Identities=9% Similarity=0.044 Sum_probs=203.5 Q ss_pred CCCccchhhcCCCCCCccc-hhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCC--------- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPS-KRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKT--------- 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~--------- 70 (726) |+|+.--..+ +-..+. +..-+. ++.+-..|.+.+ .-|...+....+..+||.|..+..... T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~--~~~~~~~i~~~i----~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ 71 (478) T protein:vir:10 1 MISINWPWDK---PYHEQVVEQIKPK--YETQEEMILRLV----REHKENIDNITMGERYYNHHPDILDAPFKRDVNGDY 71 (478) T ss_pred CccccccCCc---hhhhHHHHHhhhc--cCChHHHHHHHH----HHHHHHHHHHHHHHHHhcccccccccchhhhccccc Confidence 7776321111 100000 111111 111112233333 345566666788899998765322111 Q ss_pred CCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 EKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 ~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) .++++ +++.+.....|+.....| ||.+ +.|.+ +|.+.. ..|+..|. |+....+..++++++.+ T Consensus 72 ~~~~~~~ki~~n~~k~ivd~~~~yl----~g~p--~~~~~---~~~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~ 136 (478) T protein:vir:10 72 DETKPDWRMYTNYHQNLVDQKVAYA----VANP--VTFGV---DNDKAL----KQIQHTLN--HKWDDKLVDILTAASNK 136 (478) T ss_pred ccccccceeccchHHHHHHHHhhhh----cccC--ceeec---CChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhC Confidence 12222 577788888888777665 4433 34433 333322 34555443 56667777889999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+.+.+||+.. T Consensus 137 G~~~~~v~~d~~-------------------------------------------------------------------- 148 (478) T protein:vir:10 137 GIEWVQPYVDEE-------------------------------------------------------------------- 148 (478) T ss_pred CeEEEEEEecCC-------------------------------------------------------------------- Confidence 999998877510 Q ss_pred eecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++..++|.+++ ||+... .+-.+. .+.+-..+. .. ... T Consensus 149 ---------~~~~~~~~~p~~~~~v~d~~~~---~~~~~~-ir~~~~~~~----------~~-----~~~---------- 190 (478) T protein:vir:10 149 ---------GEFKTFRVPAEQAVPIWTNKER---DELQAF-IRVYELDGA----------ER-----VEY---------- 190 (478) T ss_pred ---------CceEEEEEcccceEEEEcCCCC---CceEEE-EEEEeeeCc----------eE-----EEE---------- Confidence 01233456677654 343322 222222 222211000 00 000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEEC-CEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVG-AVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA 385 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g-~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 385 (726) | ..++|..|.+. +..+.........+ .........|.+.+.+|++.+.. ...|.|.++ T Consensus 191 -------y---~~~~i~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e 249 (478) T protein:vir:10 191 -------W---TKDDVTFYELK------EGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLF 249 (478) T ss_pred -------E---eCCcEEEEEec------CCeeeccccccccccccceecccccccCCcceEEEecc-----CCCCCCcHH Confidence 0 01122222211 11110000000000 01112233456667888877655 346889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cch-h-hhhhcCCceEeecCccchhhhcccccCccchhHHHHHH Q lcl|NC_013692. 386 LLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVT-N-RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMI 462 (726) Q Consensus 386 ~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~-d-~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll 462 (726) .++++++.+|.++|.+.+.+...++|.+.+ .|.- +.. + .......+++.+...... .+.+...+.....+...+ T Consensus 250 ~v~~liDa~~~~~S~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~~~~~~~~~~~~~ 326 (478) T protein:vir:10 250 MYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKYYKAISVAGESGS--GVDTIKVEVPIDSVKEYT 326 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCcceee-ecCCcccccchhhhhhhCceeEecCCCCC--cceEEeecCCHHHHHHHH Confidence 999999999999999999999988886654 4432 111 1 222344556666432222 234444444456677788 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE 542 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~ 542 (726) +.+...+...|++++.+.+..++ +.||.++...............+.|..+++++++.++++. ... . T Consensus 327 ~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~----~~~-------~ 393 (478) T protein:vir:10 327 KMLRDYIIEFGQGVDFQQDKFGN--SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY----RLD-------V 393 (478) T ss_pred HHHHHHHHHHhCCcCcCcccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-------c Confidence 88888999999998877553322 2466667666666666666666666677766666555443 210 0 Q ss_pred cceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhh Q lcl|NC_013692. 543 HFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIA 622 (726) Q Consensus 543 ~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~ 622 (726) ++. + ..++.+.....-.....+.+..+ ...++... .+. .+..+.+....+...+.+..... T Consensus 394 d~~-----~----i~i~f~~~~p~~~~e~~~~~~~~----~g~iS~et---~i~---~~~~v~d~~~E~~ri~~E~~~~~ 454 (478) T protein:vir:10 394 RVQ-----D----IEITFNFNVMVNELENSQIAMNS----TGLLSKET---ILG---NHSWVQDPVAEMERIEQENIELN 454 (478) T ss_pred ccc-----c----ceEEeCCCCCCCHHHHHHHHHHH----hCCCChHH---HHH---hCCCCCCHHHHHHHHHHHHHHHH Confidence 010 1 11111111111111111111111 11111110 010 01111111111100000000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 623 QQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALA 669 (726) Q Consensus 623 qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~ 669 (726) .+.... .. .... ....+....+.+ T Consensus 455 ~~~~~~----------------~~-----~~~d--~~~~~~~d~~~e 478 (478) T protein:vir:10 455 QQLPDI----------------EE-----GLND--EQQRQSEDNQSE 478 (478) T ss_pred Hhcccc----------------CC-----CCcc--cccccCcCCCCC Confidence 000000 00 0000 000000000000 No 79 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.44 E-value=4.2e-12 Score=82.93 Aligned_cols=480 Identities=9% Similarity=0.031 Sum_probs=212.5 Q ss_pred CCCccchhhcCCCCCCc------cchhcCCCCCCchHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC----- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGD------PSKRLQPEWSNAPSL-AQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK----- 67 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~----- 67 (726) |.-|.+ |-.-....+. ..+.....|...... .....+|....+.|..... ...+..+||.|.-... T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:96 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 433322 1100000010 112224456433322 2223344444445555443 4577899998764321 Q ss_pred -CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 68 -PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 68 -~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) +...+...+++.+...-.|+.....| ||.. +.|.+ +|.+. .++|+.+| ..|+.-..+...+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~~~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:96 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKDV----LEAIEAFN-DLNDVESHNRSLGLDLS 145 (511) T ss_pred CcccccCcceeecchHHHHHHHHHhhh----ccCC--ceeec---CchHH----HHHHHHHH-hhcCHHHHHHHHHHHHH Confidence 11112335778888888887776554 4433 34433 33332 34566665 45777777889999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+|++.. T Consensus 146 i~G~a~~~vy~ded------------------------------------------------------------------ 159 (511) T protein:vir:96 146 IYGKAYELMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999998877510 Q ss_pred ceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|++..++|.++++ |.+.. .. ...+.+.|.+... +. T Consensus 160 -----------~~~~i~~~~p~~~~~vydd~~~---~~-~~~~vr~~~~~~~----------d~---------------- 198 (511) T protein:vir:96 160 -----------DETRLYKSDAMSTFVIYDNTIE---RN-SIAGVRYLRTKPI----------DK---------------- 198 (511) T ss_pred -----------CceEEEEEccceeEEEEcCCCC---Cc-eEEEEEEEEeeec----------cc---------------- Confidence 012345567777663 33221 11 1222222211100 00 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCE--E---EEeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAV--M---IRMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~--~---l~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) ...+.+..+|+|.. +++.+ .+..++. . ....+.|.+.+.+|++.++. ..+ T Consensus 199 ------------~~~~~~~~~~iyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~ 253 (511) T protein:vir:96 199 ------------TDEDEVFTVDLFTS-----HGVYR---YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NER 253 (511) T ss_pred ------------cccceEEEEEEEeC-----CcEEE---EEecCCCcccccccccccccccCCceeeEEecC-----CCC Confidence 00112333444431 11111 1111111 0 01122333446666655542 346 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cchhhhhhcCCceEeecCc---------cchhhhcccc Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVTNRRRFDRGENYEFNPG---------ADPRAAVHMH 449 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d~~~~~~g~vi~~~~~---------~~~~~~i~~~ 449 (726) |.|.++.++++++.+|..+|.+.+.+...++|.+.+.-... +..+......+.++.+.+. ......+.+. T Consensus 254 g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:96 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI 333 (511) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEE Confidence 88999999999999999999999999988888766533222 2222222333444433221 1112234444 Q ss_pred cCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 450 TFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAE 529 (726) Q Consensus 450 ~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q 529 (726) ..+.-...+...+..+...+...|++++.+.+..++ +.||.++...............+.|..+++++++.++.++.. T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:96 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444445677788888999999999999987664322 246777777777777777777777778888877777665443 Q ss_pred hcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhh Q lcl|NC_013692. 530 FLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAK 609 (726) Q Consensus 530 ~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 609 (726) ...... +.++..+ .-.| +.....-.....+.+..+ ...++.... +. .+..+.+... T Consensus 412 ~~~~~~-----~~d~~~i-----~~~f----~~~~p~n~~e~~~~~~kl----~G~iS~et~---l~---~l~~v~D~~~ 467 (511) T protein:vir:96 412 TWSIDA-----NKDFNTV-----RYVY----NRNLPKSLIEELKAYIDS----GGKISQTTL---MS---LFSFFQDPEL 467 (511) T ss_pred hcCccc-----ccccccc-----eEEe----CCCCCCCHHHHHHHHHHH----hccCChHHH---HH---hCCCCCCHHH Confidence 211110 0011111 0011 111111111111111111 111111110 10 0111111111 Q ss_pred hHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 610 RIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 610 ~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) .++....+... +++..+ .......... .......+.+....+.+ T Consensus 468 E~~ri~~E~~~---------------~~~~~~-----~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 468 EVKKIEEDEKE---------------SIKKAQ-----KGIYKDPRDI---NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHH---------------HHHHHh-----hccccCCCCC---CCCCCCCcccccccccC Confidence 01100000000 000000 0000000000 00000000000000000 No 80 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.44 E-value=3.9e-12 Score=83.11 Aligned_cols=395 Identities=13% Similarity=0.047 Sum_probs=189.3 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSS 100 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~ 100 (726) -+...|..|.+.+.. +.....+-.+||.|....+ |+..+.+.+.|.+-.+..|+.+...|. T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~------ 67 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV------ 67 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc------ Confidence 556666666666542 2333556678998765321 122223345566666666666544331 Q ss_pred CceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHH Q lcl|NC_013692. 101 PNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEE 180 (726) Q Consensus 101 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 180 (726) |...+..|. -+..+| ..|+.-.....++++||++|.+++.++=+. T Consensus 68 -----~~Gf~~~d~--------~l~~i~-~~N~ld~~~~~~~~~aliyG~sf~~v~~~~--------------------- 112 (409) T protein:vir:94 68 -----FREFENDDF--------TVNEIF-EENNPDIFFDSAVLSSLIASCSFTYISKGE--------------------- 112 (409) T ss_pred -----cCcccCCch--------HHHHHH-HhcChhHHHHHHHHHHHHhcceeEEEecCC--------------------- Confidence 122223332 144454 466655667789999999999999763110 Q ss_pred HHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeCCCCCC Q lcl|NC_013692. 181 LAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VIDPSCGS 258 (726) Q Consensus 181 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~dp~a~~ 258 (726) .+.|.|..++|.++ +|||.... T Consensus 113 --------------------------------------------------------dg~~~i~~~sp~~~~~i~D~~~~~ 136 (409) T protein:vir:94 113 --------------------------------------------------------NDAVRLQVIEAVNATGIIDPITGL 136 (409) T ss_pred --------------------------------------------------------CCceEEEEeccceEEEEEecCCCc Confidence 00122344555553 45663221 Q ss_pred chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCce Q lcl|NC_013692. 259 DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGV 338 (726) Q Consensus 259 d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~ 338 (726) ...+.+.+ .++ . . .......+|.. +.. T Consensus 137 -----~~~a~~~~--~~d-----------~---~---------------------------~~~~~~~~~~~-----~~~ 163 (409) T protein:vir:94 137 -----LTEGYAVL--ERD-----------E---N---------------------------NNVVLEAHFLP-----DRT 163 (409) T ss_pred -----eeeeEEEE--Eec-----------C---C---------------------------CceEEEEEEec-----CcE Confidence 11111111 000 0 0 00001111110 000 Q ss_pred EEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeec Q lcl|NC_013692. 339 LHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMK 417 (726) Q Consensus 339 ~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ 417 (726) +..+..++ .....++|+ |.+|+|+|...+..++.+|.|-+ +.++++|+.+|+.+..+.......++|+..+ . T Consensus 164 ---~~~~~~~~-~~~~~~n~~--g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~ 236 (409) T protein:vir:94 164 ---DYYYRDSR-NNISIANPT--GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV-T 236 (409) T ss_pred ---EEEEecCc-eeEeeeCCC--CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee-E Confidence 00000111 112235555 78999999999999999999866 7899999999999999999999999987655 2 Q ss_pred ccc---cchhhhhhcCCceEeecCccch-hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHH Q lcl|NC_013692. 418 GAL---DVTNRRRFDRGENYEFNPGADP-RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATA 493 (726) Q Consensus 418 gav---~~~d~~~~~~g~vi~~~~~~~~-~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~ 493 (726) |.- ++.+.....++.++.+....+. ...+..++..++ +.+...+..+...+-.+||+|....|.... ...||.+ T Consensus 237 G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-NpsSa~A 314 (409) T protein:vir:94 237 GLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEA 314 (409) T ss_pred ecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHH Confidence 321 2223344456777766433221 112322222222 223333444444455577888888885432 1246666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHH-HH Q lcl|NC_013692. 494 VRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNA-KV 572 (726) Q Consensus 494 i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~-~~ 572 (726) +......-........+.|-.+++.++++++.+.-..-..+- ++..+. -.|...++. ...+.. .. T Consensus 315 l~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~-------~~~~~~-v~W~p~~~~------~~~~~a~~a 380 (409) T protein:vir:94 315 IKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLRE-------QFRKTK-PKWEPLFEA------DASMLSLIG 380 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc-------ccccce-EEeccCCCc------chHHHHHHH Confidence 766555555555555666667777777766655332211000 010000 011110110 011111 11 Q ss_pred HHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhh Q lcl|NC_013692. 573 NDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFA 608 (726) Q Consensus 573 ~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 608 (726) .....+.+. ++.+. .... ..+..+..+-. T Consensus 381 Da~~Kl~~a-g~~~~---~~~~---~~~~lG~~~~d 409 (409) T protein:vir:94 381 DGAIKLNQA-IPEFI---NKDT---IRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHh-ccccc---chhH---HHHHcCCCCCC Confidence 122222222 11111 0011 11222222111 No 81 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.43 E-value=1.5e-11 Score=79.97 Aligned_cols=450 Identities=8% Similarity=0.016 Sum_probs=200.7 Q ss_pred CCCccchhcCCCCCCch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-------------------CCCCCCC Q lcl|NC_013692. 14 EDGDPSKRLQPEWSNAP-SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG-------------------KPKTEKG 73 (726) Q Consensus 14 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~-------------------~~~~~~g 73 (726) ++. -++++...+.. +-..|...|+ .|........+...||.+.... .....++ T Consensus 1 ~~~---~~~~~~~~~~~~~~e~i~~~i~----~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:10 1 MTL---YKLIDDIEAQGILPKHIEALIE----SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred Cch---HHHHhhccccCCCHHHHHHHHH----HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 111 11112221111 1112333332 2333333334455555432210 0112334 Q ss_pred CC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 74 KS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 74 rs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) |+ +++.+-.+..|+.....| ||.+.-+.+.+ |...-+....+|+.+| ..|+.-......+++++++|.+ T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~----~~~~~e~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:10 74 SVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDE----NAEKNEKLKKFITNFA-IRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred CcccccccchHHHHHHhHhhhe----eccceeEeeCC----CCcchHHHHHHHHHHH-hhcCHhHHHHHHHHHHhhcCeE Confidence 44 678888888888776544 44443333333 2223334445677665 3566667788899999999998 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) .+.+|.+.. T Consensus 145 ~~~~~~d~~----------------------------------------------------------------------- 153 (474) T protein:vir:10 145 ARLAYIDTN----------------------------------------------------------------------- 153 (474) T ss_pred EEEEEeCCC----------------------------------------------------------------------- Confidence 886543200 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) +.+++..++|.++++=.+.. . +.-+.++ .|...++ T Consensus 154 ------~~~~~~~i~p~~~~~v~d~~--~-~~~~~i~-~~~~~~~----------------------------------- 188 (474) T protein:vir:10 154 ------GDIRIKNIDPYNVIFVGDNI--L-EPTYSLR-YFYEKDD----------------------------------- 188 (474) T ss_pred ------CeeEEEEEcccceEEEEcCC--C-ceEEEEE-EEEEeeC----------------------------------- Confidence 01334566777754322111 1 1122222 1211100 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEE-CCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWV-GAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~-g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) .....+..+++|.+ +.+ +....- ++.....++.|.+.|.+|++.++ ...+|.|.++.++++ T Consensus 189 -----~~~~~~~~~~~y~~-----~~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~l 250 (474) T protein:vir:10 189 -----DNGTDYVYAEFYDN-----AYY---YVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHL 250 (474) T ss_pred -----CCceEEEEEEEEcC-----ceE---EEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHH Confidence 00011223344422 111 110000 11112223333344667776654 355689999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEA 469 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~ 469 (726) ++.+|..+|.+.+.+...++|.+.+ .|. .+..+......++.+.+.++. ..+.+...+.....+...+..+...+ T Consensus 251 iDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I 326 (474) T protein:vir:10 251 IDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQETQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIEKNI 326 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhhhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHHHHH Confidence 9999999999999999999888766 453 222233334445555554322 12444444444566778888888999 Q ss_pred HHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecch Q lcl|NC_013692. 470 ESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRR 549 (726) Q Consensus 470 e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~ 549 (726) ...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++..-.....- .++. T Consensus 327 ~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~-----~~~~---- 395 (474) T protein:vir:10 327 MRFAKSVNFNSDEFNG--NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDD-----DSYL---- 395 (474) T ss_pred HHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc-----cccc---- Confidence 9999999887653322 2467777776666667777777777788888888777765432111000 0010 Q ss_pred hhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHH Q lcl|NC_013692. 550 DDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLE 629 (726) Q Consensus 550 ~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e 629 (726) ++.-.| ......-....++.+..+ ...++.... +. .+..+.+....++....+......+..... T Consensus 396 -~i~~~f----~~~~p~d~~e~a~~~~kl----~g~iS~et~---~~---~l~~v~d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:10 396 -NLIFKF----TRNIPVNKLEESQVLINL----KGQVSERTR---LG---QSQLVDDVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred -cceEEe----CCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 111111 111111111111111111 111111110 00 011111111111111000000000000000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_013692. 630 LMLLQAQIEAERARAA 645 (726) Q Consensus 630 ~q~~qaq~e~~~aq~q 645 (726) .....-+.+..+.+ T Consensus 461 --~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 461 --EGDANDKSQNNQSE 474 (474) T ss_pred --CCCcCCCCccccCC Confidence 00000000000000 No 82 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.43 E-value=1.5e-11 Score=79.97 Aligned_cols=450 Identities=8% Similarity=0.016 Sum_probs=200.7 Q ss_pred CCCccchhcCCCCCCch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-------------------CCCCCCC Q lcl|NC_013692. 14 EDGDPSKRLQPEWSNAP-SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG-------------------KPKTEKG 73 (726) Q Consensus 14 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~-------------------~~~~~~g 73 (726) ++. -++++...+.. +-..|...|+ .|........+...||.+.... .....++ T Consensus 1 ~~~---~~~~~~~~~~~~~~e~i~~~i~----~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:94 1 MTL---YKLIDDIEAQGILPKHIEALIE----SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred Cch---HHHHhhccccCCCHHHHHHHHH----HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 111 11112221111 1112333332 2333333334455555432210 0112334 Q ss_pred CC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCe Q lcl|NC_013692. 74 KS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTI 151 (726) Q Consensus 74 rs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~ 151 (726) |+ +++.+-.+..|+.....| ||.+.-+.+.+ |...-+....+|+.+| ..|+.-......+++++++|.+ T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~----~~~~~e~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:94 74 SVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDE----NAEKNEKLKKFITNFA-IRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred CcccccccchHHHHHHhHhhhe----eccceeEeeCC----CCcchHHHHHHHHHHH-hhcCHhHHHHHHHHHHhhcCeE Confidence 44 678888888888776544 44443333333 2223334445677665 3566667788899999999998 Q ss_pred EEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeec Q lcl|NC_013692. 152 IVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEE 231 (726) Q Consensus 152 i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 231 (726) .+.+|.+.. T Consensus 145 ~~~~~~d~~----------------------------------------------------------------------- 153 (474) T protein:vir:94 145 ARLAYIDTN----------------------------------------------------------------------- 153 (474) T ss_pred EEEEEeCCC----------------------------------------------------------------------- Confidence 886543200 Q ss_pred ccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 232 REETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 232 ~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) +.+++..++|.++++=.+.. . +.-+.++ .|...++ T Consensus 154 ------~~~~~~~i~p~~~~~v~d~~--~-~~~~~i~-~~~~~~~----------------------------------- 188 (474) T protein:vir:94 154 ------GDIRIKNIDPYNVIFVGDNI--L-EPTYSLR-YFYEKDD----------------------------------- 188 (474) T ss_pred ------CeeEEEEEcccceEEEEcCC--C-ceEEEEE-EEEEeeC----------------------------------- Confidence 01334566777754322111 1 1122222 1211100 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEE-CCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWV-GAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~-g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) .....+..+++|.+ +.+ +....- ++.....++.|.+.|.+|++.++ ...+|.|.++.++++ T Consensus 189 -----~~~~~~~~~~~y~~-----~~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~l 250 (474) T protein:vir:94 189 -----DNGTDYVYAEFYDN-----AYY---YVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHL 250 (474) T ss_pred -----CCceEEEEEEEEcC-----ceE---EEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHH Confidence 00011223344422 111 110000 11112223333344667776654 355689999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEA 469 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~ 469 (726) ++.+|..+|.+.+.+...++|.+.+ .|. .+..+......++.+.+.++. ..+.+...+.....+...+..+...+ T Consensus 251 iDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I 326 (474) T protein:vir:94 251 IDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQETQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIEKNI 326 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhhhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHHHHH Confidence 9999999999999999999888766 453 222233334445555554322 12444444444566778888888999 Q ss_pred HHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecch Q lcl|NC_013692. 470 ESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRR 549 (726) Q Consensus 470 e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~ 549 (726) ...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++..-.....- .++. T Consensus 327 ~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~-----~~~~---- 395 (474) T protein:vir:94 327 MRFAKSVNFNSDEFNG--NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDD-----DSYL---- 395 (474) T ss_pred HHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc-----cccc---- Confidence 9999999887653322 2467777776666667777777777788888888777765432111000 0010 Q ss_pred hhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHH Q lcl|NC_013692. 550 DDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLE 629 (726) Q Consensus 550 ~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e 629 (726) ++.-.| ......-....++.+..+ ...++.... +. .+..+.+....++....+......+..... T Consensus 396 -~i~~~f----~~~~p~d~~e~a~~~~kl----~g~iS~et~---~~---~l~~v~d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:94 396 -NLIFKF----TRNIPVNKLEESQVLINL----KGQVSERTR---LG---QSQLVDDVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred -cceEEe----CCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 111111 111111111111111111 111111110 00 011111111111111000000000000000 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_013692. 630 LMLLQAQIEAERARAA 645 (726) Q Consensus 630 ~q~~qaq~e~~~aq~q 645 (726) .....-+.+..+.+ T Consensus 461 --~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 461 --EGDANDKSQNNQSE 474 (474) T ss_pred --CCCcCCCCccccCC Confidence 00000000000000 No 83 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.43 E-value=5.6e-13 Score=87.75 Aligned_cols=466 Identities=12% Similarity=0.023 Sum_probs=197.0 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCC-C--CCCcCCCHHHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTE-K--GKSAVQPPTIRKQ 85 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~-~--grs~~v~~~v~~~ 85 (726) |++.+.... +..|..|.. .+..+.....+-.+||.|....+ +... . ..-++|..-.+.. T Consensus 1 ~~~~~~~d~---------~~~i~~L~~-------~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~i 64 (488) T protein:vir:23 1 MAETESIDP---------EKLRDQLLD-------AFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTY 64 (488) T ss_pred CCcccCCCH---------HHHHHHHHH-------HHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHH Confidence 444433322 123333332 23333344566678998765331 1111 1 1224667777777 Q ss_pred HHHHHHHHHH-hhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeE Q lcl|NC_013692. 86 AEWRYSSLSE-PFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTV 164 (726) Q Consensus 86 v~~~~~~L~~-~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~ 164 (726) |+.+...|.- -|+.+.++ .+..-..+|.+... .++.+| ..|+.-.....+.++++++|.+++.++....... T Consensus 65 vd~~a~~l~~~Gf~~~~~~-~~~~~~~~d~~~~~----~l~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~- 137 (488) T protein:vir:23 65 VDAIAERQELEGFRIPSAN-GEEPESGGENDPAS----ELWDWW-QANNLDIEATLGHTDALIYGTAYITISMPDPEVD- 137 (488) T ss_pred HHHHHHhhhccceeccCCc-ccccccccchhHHH----HHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc- Confidence 7777655521 13222211 12222233444443 455565 4777767788899999999999998876410000 Q ss_pred EecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeee Q lcl|NC_013692. 165 KEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQV 244 (726) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~ 244 (726) .. . -...+.|.. T Consensus 138 --------~~----------------------~--------------------------------------~~~~~~i~~ 149 (488) T protein:vir:23 138 --------FD----------------------V--------------------------------------DPEVPLIRV 149 (488) T ss_pred --------cC----------------------C--------------------------------------CCCcceEEE Confidence 00 0 011134456 Q ss_pred echhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceE Q lcl|NC_013692. 245 CDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRL 322 (726) Q Consensus 245 v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 322 (726) ++|.+++ |||... ....+.+++-+. + ...+ T Consensus 150 ~~p~~~~~~~d~~~~-----~~~~~~~~~~~~------------~-------------------------------~~~~ 181 (488) T protein:vir:23 150 EPPTALYAEVDPRTR-----KVLYAIRAIYGA------------D-------------------------------GNEI 181 (488) T ss_pred eccceeEEEEecCCC-----ceEEEEEEEEec------------C-------------------------------CCcE Confidence 7777755 454321 122222221000 0 0012 Q ss_pred EEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH-HHHHHHHHHHHHHHHH Q lcl|NC_013692. 323 VVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA-LLIDNQRIIGAVTRGM 401 (726) Q Consensus 323 ~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~-~~~d~Q~~~N~~~~~~ 401 (726) ..+++|.. +.+. ..+..++...-....|.+.+.+|+++|...+..+..+|.|.+. .++++++.+|..++.+ T Consensus 182 ~~~~~y~~-----~~~~---~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~ 253 (488) T protein:vir:23 182 VSATLYLP-----DTTM---TWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNM 253 (488) T ss_pred EEEEEEec-----CcEE---EEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHH Confidence 22222221 1110 1111112221223445666889999999888888999999885 6899999999999999 Q ss_pred HHHHHhcCCCceEeecccc-c--------chhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHH- Q lcl|NC_013692. 402 IDTMARSANGQVGVMKGAL-D--------VTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAES- 471 (726) Q Consensus 402 ~d~l~~~~~~~~~~~~gav-~--------~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~- 471 (726) .+.+...+.|+..+ .|.. + ........+|.++.+..|..+. +.+.+. ..+...+..+...++. T Consensus 254 ~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~----~~q~~~--~~~~~~~~~l~~~i~~~ 326 (488) T protein:vir:23 254 QGTANLMAIPQRLI-FGAKPEELGINAETGQRMFDAYMARILAFEGGEGAH----AEQFSA--AELRNFVDALDALDRKA 326 (488) T ss_pred HHHHHHhhhHHHHH-hCCCcccccccccccchhhhhhhhhhccCCCCCCce----eEecCC--CChHHHHHHHHHHHHHH Confidence 99999888876654 2321 1 0112233456665554443322 223332 2334455566655554 Q ss_pred --HhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecch Q lcl|NC_013692. 472 --MTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRR 549 (726) Q Consensus 472 --~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~ 549 (726) .|++++...|.... .+.||.++...............+.|..+++++++.++.+. ..... .. ++..+ T Consensus 327 ~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~----~~~~~---~~-~~~~i-- 395 (488) T protein:vir:23 327 ASYSGLPPQYLSSSSD-NPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMV----KGGDI---PT-EYYRM-- 395 (488) T ss_pred hcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCc---ch-hhccc-- Confidence 57777777774332 12366666666666666666666666677766666665432 21100 00 01110 Q ss_pred hhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHH Q lcl|NC_013692. 550 DDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLE 629 (726) Q Consensus 550 ~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e 629 (726) .++-.........+.......+.+.....++..... .++-. ..+..+.++.... +++ T Consensus 396 -------~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~----~~l~~--~~d~~~~~~~~~~------~~~---- 452 (488) T protein:vir:23 396 -------ETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGW----VDMGY--TIVEREQMRQWLE------QDQ---- 452 (488) T ss_pred -------eEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHH----HhCCC--CchHHHHHHHHHH------HHH---- Confidence 011000000000111111122211111111111110 00000 0000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_013692. 630 LMLLQAQIEAERARAAHYMSGAGLQ-DSKVGTEQAKA 665 (726) Q Consensus 630 ~q~~qaq~e~~~aq~q~~~~~~~~~-~~~~~~eqaq~ 665 (726) .....++...-............. ......+-+.+ T Consensus 453 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 453 -KQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred -HHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 000000000000000000000000 00000000000 No 84 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.42 E-value=6.9e-12 Score=81.76 Aligned_cols=451 Identities=9% Similarity=0.031 Sum_probs=201.7 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CC------- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSL-AQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KT------- 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~------- 70 (726) ---.-++|..|.-.+ .+.... ..|.+.|+ .|..++....+..+||.|.-+... +. T Consensus 25 ~~~~~~~~~~~~~~~-----------~~~~~~~~~i~~~i~----~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~ 89 (492) T protein:vir:97 25 QPTQTEIFDAIVRTN-----------NKPETLEEMIVRYIK----QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV 89 (492) T ss_pred chhhhhHhhhcccCC-----------CchhhHHHHHHHHHH----HHHHHHHHHHHHHHHhcccCccccccccccccccc Confidence 001112222222111 122222 22333333 456667677889999998753311 11 Q ss_pred --CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhc Q lcl|NC_013692. 71 --EKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDE 148 (726) Q Consensus 71 --~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~ 148 (726) .+-..+++.+..+..|+.....| +|.. +.|.+ +|.+. .++++..+ .|+....+....++++.+ T Consensus 90 ~~~~~~~ri~~n~~k~Ivd~~~~yl----~g~p--~~~~~---~d~~~----~~~l~~~~--~n~~~~~~~~~~~~~~~~ 154 (492) T protein:vir:97 90 DPLKPDDRMITNFHANLVDQKVSYI----VGKP--IAFKH---TDDEV----VKRIDEVL--GNRFDDKLHSVLTGASNK 154 (492) T ss_pred cccccccccccchHHHHHHHHhhhh----cccC--ceecc---CchHH----HHHHHHHH--hccHHHHHHHHHHHHhhc Confidence 11234788888888888887665 3332 34433 34333 33555544 356667777889999999 Q ss_pred CCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccce Q lcl|NC_013692. 149 GTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSE 228 (726) Q Consensus 149 ~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 228 (726) |.+.+.+|++.. T Consensus 155 G~a~~~v~~d~d-------------------------------------------------------------------- 166 (492) T protein:vir:97 155 GIEWLHPYLDEE-------------------------------------------------------------------- 166 (492) T ss_pred CeEEEEEEecCC-------------------------------------------------------------------- Confidence 999887765410 Q ss_pred eecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhh Q lcl|NC_013692. 229 EEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPS 306 (726) Q Consensus 229 ~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~ 306 (726) +.|++.+++|.++++ |++.. .+-.+ +.+.|...+. ..++. T Consensus 167 ---------g~~~~~~~~p~~~~~i~d~~~~---~~~~~-~vr~~~~~~~-------~~~~~------------------ 208 (492) T protein:vir:97 167 ---------GEFKLFRVPAEQGIPIWTDKEH---EELEA-FIRMYKLENE-------TKVEY------------------ 208 (492) T ss_pred ---------CceEEEEEcccceEEEEcCCCC---CceEE-EEEEEeeccc-------eeEEE------------------ Confidence 013345677777654 33221 12222 2222211000 00000 Q ss_pred ccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 307 EGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 307 ~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) | ...++..+.+ .+++... ......+...+... |.+.+.+|++.+.. ..+|.|.++. T Consensus 209 -------y---~~~~v~~~~~------~~~~~~~-~~~~~~~~~~~~~~--~~~~g~vPvv~~~n-----n~~g~sd~e~ 264 (492) T protein:vir:97 209 -------W---DKVTVNYYVY------ENGSLIP-DYSNNLENSKTHFS--TGSWGKIPFIPFKN-----NDLEISDIFM 264 (492) T ss_pred -------E---ecCeEEEEEE------ecCeeee-cccccccccccccc--cCCCCCcceEEecC-----CCCCCCchHh Confidence 0 0011111111 1111100 00001111222223 33446777766643 3468899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeeccccc-chh--hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALD-VTN--RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMIN 463 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~-~~d--~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~ 463 (726) ++++++.+|.++|.+.+.+...++|.+.+ .|.-. ... .......+++.+..+++ +.+...+.....+...+. T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~~~~~ 339 (492) T protein:vir:97 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENSKKYLD 339 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccchhHHHHHhhccceecCCCCc----ceeEeccCCHHHHHHHHH Confidence 99999999999999999999998887665 44321 111 12234445666655443 334333434456777888 Q ss_pred HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 464 LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 464 ~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) .+...+...|++++.+.+..++ +.||.++...............+.|..+++++++.++.++.... + T Consensus 340 ~L~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~-----------~ 406 (492) T protein:vir:97 340 ELYQKIMLFGQAVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----------E 406 (492) T ss_pred HHHHHHHHHhCCCCCCcccccc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-----------c Confidence 8889999999999877653332 24666676666666666666667777777776666655432111 1 Q ss_pred ceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_013692. 544 FVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQ 623 (726) Q Consensus 544 ~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~q 623 (726) +..+ +++.+.....-.....+.+..+ ...++.... +. .+..+.+....++....+.....+ T Consensus 407 ~~~i---------~v~f~~~~p~~~~e~a~~~~kl----~G~iS~et~---l~---~l~~v~d~~~Eleri~~E~~~~~~ 467 (492) T protein:vir:97 407 HKDV---------DISFNYNKVANTELQVQTAQQS----MGIVSHETV---LE---NHPFVEDLQAELERIEQEQTEYNK 467 (492) T ss_pred ccee---------eEEecCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHH Confidence 1111 1111111111011111111111 111111110 00 011111111111111000000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTE 661 (726) Q Consensus 624 q~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~e 661 (726) +.+.. .........+........ ++ T Consensus 468 ~~~~~--~~~~~~~~~~~~~~~~~~-----------~e 492 (492) T protein:vir:97 468 QLPNL--DDGGADSAQQQERSNNKE-----------SE 492 (492) T ss_pred hhhcc--ccCCCCCCcccccccccc-----------cC Confidence 00000 000000000000000000 00 No 85 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.40 E-value=5.8e-13 Score=87.68 Aligned_cols=462 Identities=12% Similarity=0.018 Sum_probs=194.3 Q ss_pred chhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCC---CCCcCCCHHHHHHHHHHHHHH Q lcl|NC_013692. 19 SKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEK---GKSAVQPPTIRKQAEWRYSSL 93 (726) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~---grs~~v~~~v~~~v~~~~~~L 93 (726) -....+.+.....-..+...| ...|..++....+...||.|.-..+ +...+ -+-++|..-....|+.....| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L---~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEM---VSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCcccchHHHHHHH---HHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhh Confidence 111123332222222222222 2344555666677789998766431 01111 122345566666666655544 Q ss_pred HHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEeccccccc Q lcl|NC_013692. 94 SEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEM 173 (726) Q Consensus 94 ~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~ 173 (726) + .+.+ . .+++.... ..++.+| ..|+.-.....++++++++|.+.+.+|++..... T Consensus 78 ---~--~~g~---~--~~~~~~~~----~~l~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~---------- 132 (485) T protein:vir:24 78 ---A--VEGF---R--LGDADEAD----EELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQID---------- 132 (485) T ss_pred ---c--cCce---e--cCCCchhH----HHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc---------- Confidence 1 1111 1 12222222 3345555 3566556677899999999999999877621000 Q ss_pred CCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--e Q lcl|NC_013692. 174 MPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--V 251 (726) Q Consensus 174 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~ 251 (726) + ..-.+.|+|..++|.++ + T Consensus 133 ~-----------------------------------------------------------~~~~~~~~i~~~~p~~~~~i 153 (485) T protein:vir:24 133 L-----------------------------------------------------------GWDPNVPLIRVEPPTRMYAE 153 (485) T ss_pred c-----------------------------------------------------------ccCCCcceEEEeccceeEEE Confidence 0 00012345677888887 4 Q ss_pred eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEe Q lcl|NC_013692. 252 IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYY 331 (726) Q Consensus 252 ~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~ 331 (726) |||+.. . ...+.+++-+ + . ...+..+++|.. T Consensus 154 ~D~~~~----~-~~~~~~~~~~--~---~--------------------------------------~~~~~~~~~y~~- 184 (485) T protein:vir:24 154 IDPRIG----R-PAKAIRVAYD--A---E--------------------------------------GNEIQAATLYTP- 184 (485) T ss_pred eeCCcC----c-eeEEEEEEEe--e---c--------------------------------------CCeEEEEEEEcC- Confidence 455321 1 1111111100 0 0 011222333321 Q ss_pred ecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH-HHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_013692. 332 DIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA-LLIDNQRIIGAVTRGMIDTMARSAN 410 (726) Q Consensus 332 ~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~-~~~d~Q~~~N~~~~~~~d~l~~~~~ 410 (726) +. .+..+..++........|.+.+.+|+++|...+..++.+|.|.+. .++++++.+|..++.+.+++...+. T Consensus 185 ----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~ 257 (485) T protein:vir:24 185 ----NE---TFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGV 257 (485) T ss_pred ----Cc---EEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 11 111112222222223334455889999999888888899998876 5899999999999999999999888 Q ss_pred CceEeeccc----ccch-----hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHH---hchHHH Q lcl|NC_013692. 411 GQVGVMKGA----LDVT-----NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESM---TGVKAF 478 (726) Q Consensus 411 ~~~~~~~ga----v~~~-----d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~---tGv~~~ 478 (726) |+..+- |. +... ......+|.++... +.++. +.+++. +.+...+..+...+..+ +++++. T Consensus 258 p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~----~~q~~~--~~~e~~~~~l~~~i~~~s~~~~~p~~ 329 (485) T protein:vir:24 258 PQRLIF-GIKPEEIGVDPETGQTLFDAYLARILAFE-DAEGK----IQQFSA--AELANFTNALDQIAKQVAAYTGLPPQ 329 (485) T ss_pred hhhhhc-cCCccccccccccccchhhhcccceeccC-CCCce----EEeecc--cchHHHHHHHHHHHHHHhcccCCCHH Confidence 877552 32 1101 11223455554443 22221 222322 23445666666666665 567777 Q ss_pred hhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--cCeEEEEecccceecchhhccccc Q lcl|NC_013692. 479 NAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLD--DVEVVRITNEHFVDIRRDDLAGNF 556 (726) Q Consensus 479 ~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d--~e~~iRi~~~~~v~v~~~~~~~~~ 556 (726) ..|..+. .+.||.++..............-+.|..+++++++.++.+...-.. +..-++++ |-...+.. T Consensus 330 ~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~---f~~~~~~s----- 400 (485) T protein:vir:24 330 YLSTAAD-NPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETV---WRDPSTPT----- 400 (485) T ss_pred HhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEE---ecCCCCCC----- Confidence 7774332 1236666776666666666666777777777777766553221000 00111110 10000000 Q ss_pred ceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh-hhhhhhHHHHHhhhhhhhhhHHHHHHHHHHH Q lcl|NC_013692. 557 DLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKM-PDFAKRIREFQPQPDPIAQQKAQLELMLLQA 635 (726) Q Consensus 557 dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qa 635 (726) ..+.......+.+.....++..... .+ .+. .+..+.++....+......+. ... +-. T Consensus 401 -----------~~~~ad~~~kl~~~g~~~~s~et~~----~~---l~~~~d~~~e~~~~~ee~~~~~~~~--~~~--~~~ 458 (485) T protein:vir:24 401 -----------YAAKADAATKLYGNGQGVIPRERAR----KD---MGYSIAEREEMRRWDEEEAAMGLGL--LGT--MVD 458 (485) T ss_pred -----------HHHHHHHHHHHHhcccccCCHHHHH----hh---CCCCHhHHHHHHHHHHHHhhhhhhH--HHh--hcc Confidence 0111111111111100011111000 00 000 000000000000000000000 000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 636 QIEAERARAAHYMSGAGLQDSKVGTEQAKAR 666 (726) Q Consensus 636 q~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~ 666 (726) .......+....... ..+.+... .... T Consensus 459 ~~~~~~~~~~~~e~~-~~~~~~~~---~~~a 485 (485) T protein:vir:24 459 ADPTVPGSPNPTPAP-KPQPAIEG---GDSA 485 (485) T ss_pred cCCCCCCCCCCCCCC-CCccCCCC---CCCC Confidence 000000000000000 00000000 0000 No 86 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.39 E-value=1e-11 Score=80.87 Aligned_cols=474 Identities=10% Similarity=0.026 Sum_probs=204.8 Q ss_pred CCCc--------cchhhcCCCCC--Ccc-chhcCCCCCCchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCC-CC- Q lcl|NC_013692. 1 MADV--------DEDYLTLPNED--GDP-SKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKI-TQINRWLDYMHVRG-EG- 66 (726) Q Consensus 1 ~~~~--------~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~~y~~~~-~~- 66 (726) |+-+ -.|+...--.. -.. ....+++... .....|++-|. .|.... ..+++..+||.|.- .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~i~~~i~----~h~~~~~~rl~~l~~yY~g~~~~i~ 75 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMV-NNWELLKNFIN----HHKLRQAPRIQELLDYARGENHDVL 75 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhcc-ccHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCCcccc Confidence 2211 11111000000 000 0001111110 01122333343 344333 34688999999853 21 Q ss_pred C--CCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHH Q lcl|NC_013692. 67 K--PKTEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYV 142 (726) Q Consensus 67 ~--~~~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~ 142 (726) + .....+++ +++.+.....|+.....| +|.. +.|.....+ ..+...++++.+|. .|+.-..+..++ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p--~~~~~~d~~---~~~~~~~~l~~~~~-~N~~~~~~~~~~ 145 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISKFKTGYL----AGNP--IRVEYDDNE---DNSQNDDAIKRIGR-INDIDTHNRNLI 145 (502) T ss_pred ccccccccccccceeecchHHHHHHHHhhhh----cccC--eeEecCCcc---chhHHHHHHHHHHh-hcCHhHHHHHHH Confidence 1 11223443 788888888888777655 3333 344442222 23445667777764 577667788999 Q ss_pred HHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceee Q lcl|NC_013692. 143 RAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRA 222 (726) Q Consensus 143 ~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 222 (726) ++++++|.+.+.+|++.. T Consensus 146 ~~~~~~G~a~~~v~~ded-------------------------------------------------------------- 163 (502) T protein:vir:48 146 RDLSQTGRAYEVIYRSEY-------------------------------------------------------------- 163 (502) T ss_pred HHHhhcCeEEEEEEeCCC-------------------------------------------------------------- Confidence 999999999988776410 Q ss_pred ccccceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhccc Q lcl|NC_013692. 223 VPVGSEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEP 300 (726) Q Consensus 223 ~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~ 300 (726) +.|++..++|.++++ |+... .+..+ +.+.|....+ T Consensus 164 ---------------g~~~i~~~~p~~~~~vydd~~~---~~~~~-~ir~~~~~~~------------------------ 200 (502) T protein:vir:48 164 ---------------DETRIKRLSPLETFVIYDNSLE---DNSIA-AVRYYNRGTL------------------------ 200 (502) T ss_pred ---------------CceEEEEEcccceEEEEcCCCC---CceEE-EEEEEEEeec------------------------ Confidence 012344566666543 33221 11222 2221110000 Q ss_pred chhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccC Q lcl|NC_013692. 301 DYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYG 380 (726) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g 380 (726) ...+.++|+|.. +.+ ++....|+. ......|.+.+.+|++.++. ...| T Consensus 201 ------------------~~~~~~~~iyt~-----~~i---~~~~~~~~~-~~~~~~~~~~g~vPvv~~~n-----n~~g 248 (502) T protein:vir:48 201 ------------------QNAKDVVEIYTN-----QHI---YTLDASDSF-NEISVTPHAFGTVPITEFLN-----NADG 248 (502) T ss_pred ------------------CCcEEEEEEEeC-----CeE---EEEEeCCce-eeccceecCCCccceEEecC-----CCCC Confidence 001233454432 111 111111221 12223344447788776643 4468 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh--hhhhcCCceEeecCcc-----chhhhcccccCcc Q lcl|NC_013692. 381 ESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN--RRRFDRGENYEFNPGA-----DPRAAVHMHTFPE 453 (726) Q Consensus 381 ~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d--~~~~~~g~vi~~~~~~-----~~~~~i~~~~~~~ 453 (726) .|.++.++++++.+|..++.+.+.+...++|.+.+.-......+ .......+.+.+.+.. .....+.+...+. T Consensus 249 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 328 (502) T protein:vir:48 249 IGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSY 328 (502) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecC Confidence 89999999999999999999999999998887765432222221 2222333334333211 1122344444444 Q ss_pred chhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_013692. 454 IPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDD 533 (726) Q Consensus 454 ~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~ 533 (726) ....+...+..+...+...|++++.+.|..++ +.||.++..............-+.|..+++++++.++.++...... T Consensus 329 ~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~--n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 406 (502) T protein:vir:48 329 DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSG--NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEF 406 (502) T ss_pred CHHHHHHHHHHHHHHHHHHhCCCCcCcccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 44567777888899999999999877664322 2466667766666666666666777777777776666654422110 Q ss_pred CeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHH Q lcl|NC_013692. 534 VEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIRE 613 (726) Q Consensus 534 e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 613 (726) .. .++.. + .+..+.....-....++.+..+ ...++.... +. + +..+.+....++. T Consensus 407 ~~------~d~~~-----i----~i~f~~~~p~d~~e~a~~~~kl----~g~iS~et~---l~-~--l~~v~D~~~E~~r 461 (502) T protein:vir:48 407 KD------FDESR-----L----KITFTPNLPKSLYEQVSILNDL----GGQVSQETA---LS-L--SGLVENPTEELDK 461 (502) T ss_pred cc------ccccc-----c----eEEeCCCCCcCHHHHHHHHHHH----hccCcHHHH---HH-h--CCCCCCHHHHHHH Confidence 00 00000 0 0010110000011111111111 111111110 00 0 0001100000110 Q ss_pred HHhhhhhhhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 614 FQPQPDPIAQQKAQLELMLLQAQIEAER-ARAAHYMSGAGLQDSKVGT 660 (726) Q Consensus 614 ~~~~~~~~~qq~~q~e~q~~qaq~e~~~-aq~q~~~~~~~~~~~~~~~ 660 (726) ...+..+.. ............ ............+....-+ T Consensus 462 i~~E~~~~~-------~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 462 INEESSKID-------FKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred HHHHHHhhh-------hhcccccccccccccCCCccCCCCcCcCCCCC Confidence 000000000 000000000000 0000000000000000000 No 87 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.39 E-value=1.4e-11 Score=80.12 Aligned_cols=477 Identities=9% Similarity=0.044 Sum_probs=211.2 Q ss_pred CCCccchhhcCCCCCCc------cchhcCCCCCCchHHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC----C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGD------PSKRLQPEWSNAPSLAQ-LKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK----P 68 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~----~ 68 (726) |.-|.+ |-.-....+. ..+.....|.+....+. -...|....+.|..... ...+..+||.|.-... . T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:10 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 443322 1111111111 11222445644333221 12344444455555443 4677899998764321 1 Q ss_pred CCCC--CCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 69 KTEK--GKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 69 ~~~~--grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) ...+ ...+++.+...-.|+.....| ||.. +.|.+ +|... ..+++.+| ..|+.-......+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:10 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKDV----LEAIEAFN-DLNDVESHNRSLGLDLS 145 (511) T ss_pred ccccccCcceeecchHHHHHHHHhhhh----cccC--ceeec---CchHH----HHHHHHHH-hhcCHHHHHHHHHHHHH Confidence 1112 335778888888888777655 3332 34433 33332 24566665 35666667788999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+|++.. T Consensus 146 i~G~ay~~vy~ded------------------------------------------------------------------ 159 (511) T protein:vir:10 146 IYGKAYEIMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999987776410 Q ss_pred ceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|++..++|.++++ |++... . ...+.+.|.+... .+ T Consensus 160 -----------g~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~~-------d~------------------- 198 (511) T protein:vir:10 160 -----------DETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKPI-------DK------------------- 198 (511) T ss_pred -----------CceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeeec-------cc------------------- Confidence 012345566777653 332211 1 2222222211100 00 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEE-----EEeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVM-----IRMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~-----l~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) ...+.+..+|+|.. +++.+ .+..++.. ....+.|.+.+.+|++.++. ..+ T Consensus 199 ------------~~~~~~~~~~iyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~ 253 (511) T protein:vir:10 199 ------------TDEDEVFTVDLFTS-----HGVYR---YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NER 253 (511) T ss_pred ------------CccceEEEEEEEeC-----CcEEE---EEecCCCcccccccccccccccCcceeEEEecC-----CCC Confidence 00122333444431 11111 11111110 11123334445666655432 446 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc--cchhhhhhcCCceEeecCcc---------chhhhccc Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL--DVTNRRRFDRGENYEFNPGA---------DPRAAVHM 448 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav--~~~d~~~~~~g~vi~~~~~~---------~~~~~i~~ 448 (726) |.|.++.++++++.+|..+|.+.+.+...++|.+.+ .|.. +..+......+.++.+.+.. .....+.+ T Consensus 254 g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 332 (511) T protein:vir:10 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGY 332 (511) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-eccccCCchhhccchhccceecccccccccccccCCCCcceeE Confidence 889999999999999999999999999888887665 3322 22222333344444443211 11223444 Q ss_pred ccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 449 HTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNA 528 (726) Q Consensus 449 ~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~ 528 (726) ...+.....+...+..+...+...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++. T Consensus 333 l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 410 (511) T protein:vir:10 333 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444445667788888999999999999887653322 23666777776666666767777777787777777666554 Q ss_pred HhcCcCeEEEEecccceecchhhcccccceeeecc--cchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhh Q lcl|NC_013692. 529 EFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS--TAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPD 606 (726) Q Consensus 529 q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~--~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e 606 (726) ........ .++. ++++.-. ...-.....+.+..+ .+ .++... .+. .+..+.+ T Consensus 411 ~~~~~~~~-----~d~~-----------~i~i~f~~~~p~d~~~~~~~~~kl---~G-~iS~et---~~~---~l~~v~d 464 (511) T protein:vir:10 411 NTRSIDAN-----KDFN-----------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQTT---LMS---LFSFFQD 464 (511) T ss_pred hhCCcccc-----cccc-----------eeeEEeCCCCCcCHHHHHHHHHHH---hc-cCcHHH---HHH---hCCCCCC Confidence 32111000 0011 1111111 111111111111111 11 111111 010 0111111 Q ss_pred hhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 607 FAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 607 ~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) ....++....+... +++.. .......... . .......+.+....+.+ T Consensus 465 ~~~E~~ri~~E~~~---------------~~~~~-----~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 465 PELEVKKIEEDEKE---------------SIKKA-----QKGIYKDPRD--I-NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHHH---------------HHHHH-----hhhcccCCCC--C-CCCCCCCcccCcccccC Confidence 11001100000000 00000 0000000000 0 00000000000000000 No 88 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.37 E-value=4.8e-11 Score=77.17 Aligned_cols=458 Identities=8% Similarity=0.006 Sum_probs=201.6 Q ss_pred CCCccchhhcCCCCCCccchhc--CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CC------ Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRL--QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KT------ 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~------ 70 (726) -..|-+-| .+|.+..+.- ..+-..+.+-..|.+.|+ .|.+++....+..+||.|.-+... +. T Consensus 17 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~----~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~ 88 (492) T protein:vir:94 17 GGNILYPS----QPTQTEIFDAIVRTNNKPETLEEMIVRYIK----QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA 88 (492) T ss_pred CCceeecC----ccchhhhhhcccccCCchhhHHHHHHHHHH----HHHHHHHHHHHHHHHhcccccccccccccccccc Confidence 01111111 1222222221 111122223333444443 455666667888999988643311 11 Q ss_pred -CCC--CCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 71 -EKG--KSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 71 -~~g--rs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) .+. ..+++.+..+..|+.....| ||.. +.|.. +|.+.. ++++..+. |+....+...++++++ T Consensus 89 ~~~~~~~~ri~~n~~k~Ivd~~~~yl----~G~p--~~~~~---~d~~~~----~~l~~~~~--n~~~~~~~~~~~~a~~ 153 (492) T protein:vir:94 89 VDPLKPDDRMITNFHANLVDQKVSYI----VGKP--IAFKH---TDDEVV----KRIDEVLG--NRFDDKLHSVLTGASN 153 (492) T ss_pred ccccccccccccchHHHHHHHHHhhh----cccC--ceecc---CchHHH----HHHHHHHh--ccHHHHHHHHHHHHhh Confidence 112 23678888888888777654 4433 33432 343333 34555443 5555677789999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+|++.. T Consensus 154 ~G~a~~~v~~d~d------------------------------------------------------------------- 166 (492) T protein:vir:94 154 KGIEWLHPYLDEE------------------------------------------------------------------- 166 (492) T ss_pred CCeEEEEEEecCC------------------------------------------------------------------- Confidence 9999988776410 Q ss_pred eeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGP 305 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~ 305 (726) +.|++..++|.+++ ||++... +-.+ +.+.|... + ... . T Consensus 167 ----------g~~~~~~~~p~~~~~v~d~~~~~---~~~a-~ir~~~~~-~---------~~~-----~----------- 206 (492) T protein:vir:94 167 ----------GEFKLFRVPAEQGIPIWTDKEHE---ELEA-FIRMYKLE-N---------ETK-----V----------- 206 (492) T ss_pred ----------CceEEEEEcccceEEEEcCCCCC---ceEE-EEEEEeec-c---------cee-----E----------- Confidence 01234556777754 3443321 2222 22222110 0 000 0 Q ss_pred hccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH Q lcl|NC_013692. 306 SEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA 385 (726) Q Consensus 306 ~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 385 (726) . .| ...+|..+++ .+++... ......+...+... |.+.+.+|++.+.. .-+|.|.++ T Consensus 207 -----~-~y---~~~~v~~~~~------~~~~~~~-~~~~~~~~~~~~~~--~~~~g~vPvv~~~n-----n~~~~sd~e 263 (492) T protein:vir:94 207 -----E-YW---DKVTVNYYVY------ENGSLIP-DYSNNLENSKTHFS--TGSWGKIPFIPFKN-----NDLEISDIF 263 (492) T ss_pred -----E-EE---ecCeEEEEEE------ecCeeee-cccccccccccccc--ccCCCccceEEecC-----CCCCCCchH Confidence 0 00 0011222211 1111110 00000111222233 33447778776644 346889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccch-h--hhhhcCCceEeecCccchhhhcccccCccchhHHHHHH Q lcl|NC_013692. 386 LLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVT-N--RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMI 462 (726) Q Consensus 386 ~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~-d--~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll 462 (726) .++++++.+|..+|.+.+.+...++|.+.+ .|.-... . .......+++.+..+++ +.+...+.........+ T Consensus 264 ~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:94 264 MYKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENSKKYL 338 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcccchhhHHHHhhccceecCCCCc----ceeEeccCCHHHHHHHH Confidence 999999999999999999999999887665 4432111 1 11223445555554433 33444444445677788 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecc Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE 542 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~ 542 (726) +.+...+...|++++.+.+..++ +.||.|+...............+.|..+++++++.++.++..-. +..-+ T Consensus 339 ~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~-~~~~i----- 410 (492) T protein:vir:94 339 DELYQKIMLFGQAVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDV----- 410 (492) T ss_pred HHHHHHHHHHhCCcCCCcccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-cccee----- Confidence 88899999999999877654332 24666666666666666666667777777776666655432111 10111 Q ss_pred cceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhh Q lcl|NC_013692. 543 HFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIA 622 (726) Q Consensus 543 ~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~ 622 (726) .++.+.....-.....+....+ ..-++.... +. .+....+....++....+..... T Consensus 411 --------------~v~f~~~~p~~~~e~~~~~~kl----~giiS~et~---~~---~l~~v~d~~~E~eri~~E~~~~~ 466 (492) T protein:vir:94 411 --------------DISFNYNKVANTELQVQTAQQS----MGIVSHETV---LE---NHPFVEDLQAELERIEQEQMEYN 466 (492) T ss_pred --------------eEEecCCCCCCHHHHHHHHHHH----hccCchHHH---HH---hCCCCCCHHHHHHHHHHHHHHHH Confidence 1111111111111111111111 111111110 10 01111111111111100000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 623 QQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTE 661 (726) Q Consensus 623 qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~e 661 (726) ++..... .....-+....+.. . ...| T Consensus 467 ~~~~~~~--~~~~~~~~~~~~~~----------~-~e~e 492 (492) T protein:vir:94 467 KQLPNLD--DGGADSAQQQERSN----------N-KESE 492 (492) T ss_pred hhccccc--cccCCCCccccCCc----------c-ccCC Confidence 0000000 00000000000000 0 0000 No 89 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.37 E-value=5.2e-11 Score=76.98 Aligned_cols=459 Identities=11% Similarity=0.067 Sum_probs=198.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCC--CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC---------C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEW--SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP---------K 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~---------~ 69 (726) |.. +..||-..-- ....+... .++.+-..|... .+.|...+....++.+||.|.-+... + T Consensus 1 ~~~----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~----i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~ 71 (474) T protein:vir:94 1 MFN----IIRMPWDKPY-GEEVVEQLKPQFETQEEMIVRL----IDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGN 71 (474) T ss_pred Ccc----cccccCCCch-hhHHHHhhhhcccCHHHHHHHH----HHHHHHHHHHHHHHHHHhccccchhcccchhccccc Confidence 211 1122222111 00111111 122222233333 34566677677889999987542210 1 Q ss_pred CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ...+++ +++.+.....|+.....| ||.+ +.|.. +|... ..+++..+ .++....+..++++++. T Consensus 72 ~~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p--~~~~~---~d~~~----~~~l~~~~--~n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:94 72 IDYDKPDWRITTNFHQNLVDQKVSYV----ASKP--VTYSC---EDENV----LKVIHDVL--DTRWDNKLIDILTATSN 136 (474) T ss_pred cccccCcceeecchHHHHHHHHHhhh----hcCC--ceecc---CcHHH----HHHHHHHH--hccHHHHHHHHHHHHhh Confidence 123333 678888888888777665 4433 34432 33332 33555544 46667778889999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+|++.. T Consensus 137 ~G~~~~~~~~d~~------------------------------------------------------------------- 149 (474) T protein:vir:94 137 KGIDWLQVYINEN------------------------------------------------------------------- 149 (474) T ss_pred cCceEEEEEecCC------------------------------------------------------------------- Confidence 9999988776410 Q ss_pred eeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) +.|++..++|.++++-.+. ++..+..+.+ +.|... +...+ . T Consensus 150 ----------~~~~i~~~~p~~~~~v~d~-~~~~~~~~~i-r~~~~~----------~~~~~---------~-------- 190 (474) T protein:vir:94 150 ----------GEMKLFRVPAEQAIPIWVD-KEREELKSFI-RYYKFN----------NEEKV---------E-------- 190 (474) T ss_pred ----------CeeEEEEEcccceEEEEcC-CCCCceEEEE-EEEEec----------CeEEE---------E-------- Confidence 0123445666666543221 1112223322 222100 00000 0 Q ss_pred cccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHH Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALL 387 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~ 387 (726) -|. ..+|..|.+ .+.+... . +..+.........|.+.+.+|++.+.. ..+|.|.+..+ T Consensus 191 -----~yt---~~~~~~y~~------~~~~~~~-~--~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v 248 (474) T protein:vir:94 191 -----FWT---DTTVTYYVL------ENGGLIP-D--YYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMY 248 (474) T ss_pred -----EEe---CCeEEEEEE------cCCcccc-c--cccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHH Confidence 000 011111111 1111100 0 000000011112233346677766543 44689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHH Q lcl|NC_013692. 388 IDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQ 465 (726) Q Consensus 388 ~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~ 465 (726) +++++.+|...+.+.+.+...++|.+.+.-...+.... .....++++.+..++. +.+...+.....+...+..+ T Consensus 249 ~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l 324 (474) T protein:vir:94 249 KSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYIDLM 324 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHHHH Confidence 99999999999999999999988877654322222121 1223455666655543 34444444456677788888 Q ss_pred HHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccce Q lcl|NC_013692. 466 QAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFV 545 (726) Q Consensus 466 ~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v 545 (726) ...+...|++++.+.+..++ +.||.|+.................|..+++++++.+++ ++... . ++. T Consensus 325 ~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~----~~~~~------~-d~~ 391 (474) T protein:vir:94 325 RVYIMEFGQGVDFQTDKFGS--APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIID----FNNLK------T-DVK 391 (474) T ss_pred HHHHHHHhCccccCcccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCC------c-ccc Confidence 99999999998876543222 24666666655555555555556666666665555544 33211 0 111 Q ss_pred ecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhH Q lcl|NC_013692. 546 DIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQK 625 (726) Q Consensus 546 ~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~ 625 (726) .+ .+..+.....-.....+. +... ..++.... +. .+..+.+....++....+.....+.. T Consensus 392 ~i---------~v~f~~~~p~~~~e~a~~----~~~~-g~iS~et~---l~---~l~~v~D~~~E~eri~~E~~~~~~~~ 451 (474) T protein:vir:94 392 DI---------EISFNFNRMMNDAEQSQI----IAQS-QYLSRETL---VK---SSPLVDDYKAELERIEQEQMEYNKQL 451 (474) T ss_pred ee---------eEEeccCcccCHHHHHHH----HHHc-CCCCHHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHhhc Confidence 11 011111111001111111 1111 11111111 10 11111111111111100000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 626 AQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 626 ~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq 662 (726) ... . ............. ..+ +.+ T Consensus 452 ~~~---------~--~~~~~~~~~~~~~-~~~--~~e 474 (474) T protein:vir:94 452 PNL---------D--DGGADGAQQQEGS-NNK--ESE 474 (474) T ss_pred ccc---------C--CCCCCCcccCCCC-ccc--ccC Confidence 000 0 0000000000000 000 000 No 90 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.37 E-value=5.2e-11 Score=76.98 Aligned_cols=459 Identities=11% Similarity=0.067 Sum_probs=198.5 Q ss_pred CCCccchhhcCCCCCCccchhcCCCC--CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC---------C Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEW--SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP---------K 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~---------~ 69 (726) |.. +..||-..-- ....+... .++.+-..|... .+.|...+....++.+||.|.-+... + T Consensus 1 ~~~----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~----i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~ 71 (474) T protein:vir:97 1 MFN----IIRMPWDKPY-GEEVVEQLKPQFETQEEMIVRL----IDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGN 71 (474) T ss_pred Ccc----cccccCCCch-hhHHHHhhhhcccCHHHHHHHH----HHHHHHHHHHHHHHHHHhccccchhcccchhccccc Confidence 211 1122222111 00111111 122222233333 34566677677889999987542210 1 Q ss_pred CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ...+++ +++.+.....|+.....| ||.+ +.|.. +|... ..+++..+ .++....+..++++++. T Consensus 72 ~~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p--~~~~~---~d~~~----~~~l~~~~--~n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:97 72 IDYDKPDWRITTNFHQNLVDQKVSYV----ASKP--VTYSC---EDENV----LKVIHDVL--DTRWDNKLIDILTATSN 136 (474) T ss_pred cccccCcceeecchHHHHHHHHHhhh----hcCC--ceecc---CcHHH----HHHHHHHH--hccHHHHHHHHHHHHhh Confidence 123333 678888888888777665 4433 34432 33332 33555544 46667778889999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+|++.. T Consensus 137 ~G~~~~~~~~d~~------------------------------------------------------------------- 149 (474) T protein:vir:97 137 KGIDWLQVYINEN------------------------------------------------------------------- 149 (474) T ss_pred cCceEEEEEecCC------------------------------------------------------------------- Confidence 9999988776410 Q ss_pred eeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) +.|++..++|.++++-.+. ++..+..+.+ +.|... +...+ . T Consensus 150 ----------~~~~i~~~~p~~~~~v~d~-~~~~~~~~~i-r~~~~~----------~~~~~---------~-------- 190 (474) T protein:vir:97 150 ----------GEMKLFRVPAEQAIPIWVD-KEREELKSFI-RYYKFN----------NEEKV---------E-------- 190 (474) T ss_pred ----------CeeEEEEEcccceEEEEcC-CCCCceEEEE-EEEEec----------CeEEE---------E-------- Confidence 0123445666666543221 1112223322 222100 00000 0 Q ss_pred cccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHH Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALL 387 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~ 387 (726) -|. ..+|..|.+ .+.+... . +..+.........|.+.+.+|++.+.. ..+|.|.+..+ T Consensus 191 -----~yt---~~~~~~y~~------~~~~~~~-~--~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v 248 (474) T protein:vir:97 191 -----FWT---DTTVTYYVL------ENGGLIP-D--YYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMY 248 (474) T ss_pred -----EEe---CCeEEEEEE------cCCcccc-c--cccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHH Confidence 000 011111111 1111100 0 000000011112233346677766543 44689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHH Q lcl|NC_013692. 388 IDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQ 465 (726) Q Consensus 388 ~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~ 465 (726) +++++.+|...+.+.+.+...++|.+.+.-...+.... .....++++.+..++. +.+...+.....+...+..+ T Consensus 249 ~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l 324 (474) T protein:vir:97 249 KSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYIDLM 324 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHHHH Confidence 99999999999999999999988877654322222121 1223455666655543 34444444456677788888 Q ss_pred HHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccce Q lcl|NC_013692. 466 QAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFV 545 (726) Q Consensus 466 ~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v 545 (726) ...+...|++++.+.+..++ +.||.|+.................|..+++++++.+++ ++... . ++. T Consensus 325 ~~~I~~~s~~p~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~----~~~~~------~-d~~ 391 (474) T protein:vir:97 325 RVYIMEFGQGVDFQTDKFGS--APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIID----FNNLK------T-DVK 391 (474) T ss_pred HHHHHHHhCccccCcccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCC------c-ccc Confidence 99999999998876543222 24666666655555555555556666666665555544 33211 0 111 Q ss_pred ecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhH Q lcl|NC_013692. 546 DIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQK 625 (726) Q Consensus 546 ~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~ 625 (726) .+ .+..+.....-.....+. +... ..++.... +. .+..+.+....++....+.....+.. T Consensus 392 ~i---------~v~f~~~~p~~~~e~a~~----~~~~-g~iS~et~---l~---~l~~v~D~~~E~eri~~E~~~~~~~~ 451 (474) T protein:vir:97 392 DI---------EISFNFNRMMNDAEQSQI----IAQS-QYLSRETL---VK---SSPLVDDYKAELERIEQEQMEYNKQL 451 (474) T ss_pred ee---------eEEeccCcccCHHHHHHH----HHHc-CCCCHHHH---HH---hCCCCCCHHHHHHHHHHHHHHHHhhc Confidence 11 011111111001111111 1111 11111111 10 11111111111111100000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 626 AQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 626 ~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq 662 (726) ... . ............. ..+ +.+ T Consensus 452 ~~~---------~--~~~~~~~~~~~~~-~~~--~~e 474 (474) T protein:vir:97 452 PNL---------D--DGGADGAQQQEGS-NNK--ESE 474 (474) T ss_pred ccc---------C--CCCCCCcccCCCC-ccc--ccC Confidence 000 0 0000000000000 000 000 No 91 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.37 E-value=3e-11 Score=78.30 Aligned_cols=395 Identities=12% Similarity=0.068 Sum_probs=186.9 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_013692. 27 SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSS 100 (726) Q Consensus 27 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~ 100 (726) -+...|..|.+.+.+ +.....+-.+||.|....+ |+..+.+-+.|.+-....|+.+...| T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl------- 66 (409) T protein:vir:16 1 MTEKGIGYLRFKLSV-------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL------- 66 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhc------- Confidence 566677777666653 2334556788998755321 12222223345555555566554333 Q ss_pred CceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHH Q lcl|NC_013692. 101 PNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEE 180 (726) Q Consensus 101 ~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~ 180 (726) .|...+..|.. +..+| ..|+.-.....+.++||++|.+++.++=. + T Consensus 67 ----~~~Gf~~~d~~--------l~~i~-~~N~ld~~~~~~~~~al~yG~sf~~v~~~-~-------------------- 112 (409) T protein:vir:16 67 ----VFREFENDDFT--------VNEIF-EENNPDIFFDSTVLSALIASCSFTYISKG-E-------------------- 112 (409) T ss_pred ----ccccccCcchH--------HHHHH-HhcChhHHHHHHHHHHHHhCceeEEEecC-C-------------------- Confidence 12222333321 34444 46766666778999999999999976311 0 Q ss_pred HHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeCCCCCC Q lcl|NC_013692. 181 LAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VIDPSCGS 258 (726) Q Consensus 181 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~dp~a~~ 258 (726) .+.|.|..++|.++ +|||.... T Consensus 113 --------------------------------------------------------dg~~~i~~~sP~~~~~i~D~~~~~ 136 (409) T protein:vir:16 113 --------------------------------------------------------NDAVRLQVIEATNATGIIDPITGL 136 (409) T ss_pred --------------------------------------------------------CCceEEEEEcccceEEEeeccccc Confidence 00123344555553 45553321 Q ss_pred chhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCce Q lcl|NC_013692. 259 DFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGV 338 (726) Q Consensus 259 d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~ 338 (726) +. + +.+.| +.+ ... ..+ .+.+|.. + . T Consensus 137 -~~-~---a~~~~-----------~~d-----~~~--------------------------~~~-~~~~~~~-----~-~ 162 (409) T protein:vir:16 137 -LT-E---GYAVL-----------ERD-----ENN--------------------------NVV-LEAHFLP-----D-R 162 (409) T ss_pred -ce-e---eeEEE-----------Eec-----CCC--------------------------ceE-EEEEEec-----C-c Confidence 10 1 11111 000 000 000 0111110 0 0 Q ss_pred EEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeec Q lcl|NC_013692. 339 LHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMK 417 (726) Q Consensus 339 ~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ 417 (726) .. .++-++..-...++|+ |.+|+|+|...++..+.+|.|-+ +.++++|+.+|+.+..+.......++|+..+ . T Consensus 163 ~~---~~~~~~~~~~~~~~~~--g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~ 236 (409) T protein:vir:16 163 TD---YYYRDSRNNISIANPT--GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV-T 236 (409) T ss_pred EE---EEEecCccccceecCC--CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee-E Confidence 00 0000111111234554 78999999999999999999865 7899999999999999999999999988765 3 Q ss_pred ccc---cchhhhhhcCCceEeecCccc-hhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHH Q lcl|NC_013692. 418 GAL---DVTNRRRFDRGENYEFNPGAD-PRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATA 493 (726) Q Consensus 418 gav---~~~d~~~~~~g~vi~~~~~~~-~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~ 493 (726) |.- ++.+.....++.++.+....+ ....+..++..++ +.+...+..+...+-.+||+|....|.... ...||.+ T Consensus 237 G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~A 314 (409) T protein:vir:16 237 GLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSM-SPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEA 314 (409) T ss_pred ecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCCh-hHHHHHHHHHHHHHhhhcCCCHHHcccccC-chhHHHH Confidence 331 122334445677777653322 1112322322222 233344444445555677888888885432 1145666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHH-H Q lcl|NC_013692. 494 VRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAK-V 572 (726) Q Consensus 494 i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~-~ 572 (726) +......-........+.|..+++.++++++.+.-..-...- ++..+. -.|..-++. ...+..+ . T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~-------~~~~~~-v~W~~~~~~------~~~s~a~~a 380 (409) T protein:vir:16 315 IKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLRE-------QFSKTK-PKWEPLFEA------DASMLSLIG 380 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccch-------hhccce-EEecCCCCc------chhhHHHHH Confidence 665554444455555566667777777766654322211000 000000 011110000 0111111 1 Q ss_pred HHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhh Q lcl|NC_013692. 573 NDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFA 608 (726) Q Consensus 573 ~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 608 (726) .....+.+. ++.+. ..... .+..+..+-. T Consensus 381 Da~~Kl~~a-~~~~~---~~~v~---~~~~g~~~~d 409 (409) T protein:vir:16 381 DGAIKLNQA-IPEFI---NKDTI---RDLTGIKGAE 409 (409) T ss_pred HHHHHHHhh-ccccc---chhHH---HHhccCCCCC Confidence 122222222 11111 11111 1122221111 No 92 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.37 E-value=1.2e-11 Score=80.37 Aligned_cols=393 Identities=13% Similarity=0.080 Sum_probs=184.9 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHH Q lcl|NC_013692. 43 KQVTDEKITQINRWLDYMHVRGEGK------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAES 116 (726) Q Consensus 43 ~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~ 116 (726) -++|.. ...+-.+||.|....+ |+..+.+.+.|.+-.+..|+.+...|. |...+-.|.. T Consensus 1 l~~~~~---r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~-----------~~Gf~~~d~~- 65 (410) T protein:vir:95 1 MNLYQS---RVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI-----------FRAFANDDFN- 65 (410) T ss_pred CCcchh---hHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc-----------cccccCCCch- Confidence 223332 3455678998766431 122233445566666667776654441 2222333322 Q ss_pred HHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccC Q lcl|NC_013692. 117 ARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESP 196 (726) Q Consensus 117 A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 196 (726) +..+| ..|+.-.....++++||++|.+++.++=. + T Consensus 66 -------l~~i~-~~N~ld~~~~~~~~~al~~G~sf~~v~~~------------------------------------~- 100 (410) T protein:vir:95 66 -------VTEIF-DRNNPDIFFDSAILSALIGSCSFVYISKG------------------------------------E- 100 (410) T ss_pred -------HHHHH-hhcChHHHHHHHHHHHHHhCceeEEEecC------------------------------------C- Confidence 44444 46766666778899999999999976310 0 Q ss_pred CchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeCCCCCCchhhCCeEEEEEeccH Q lcl|NC_013692. 197 SEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VIDPSCGSDFSKAKFLIETFESSY 274 (726) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~dp~a~~d~~da~~~~~~~~~t~ 274 (726) .+.|.|..++|.++ +|||... ...++.+.+ .. T Consensus 101 ----------------------------------------d~~~~i~~~sP~~~~~i~Dp~~~-----~~~~al~~~-~~ 134 (410) T protein:vir:95 101 ----------------------------------------DDEVRLQVIESSNATGVIDPITG-----LLVEGYAVL-AR 134 (410) T ss_pred ----------------------------------------CCceEEEEEcccceEEEEeCCCC-----ceEEEEEEE-Ee Confidence 00123455666664 4455221 111111111 00 Q ss_pred HHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEe Q lcl|NC_013692. 275 AELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRM 354 (726) Q Consensus 275 ~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~ 354 (726) ++ . .......+|.. + . +.++.++...+. T Consensus 135 ~~---~---------------------------------------~~~~~~~~~~~-----~-~----~~~~~~~~~~~~ 162 (410) T protein:vir:95 135 DD---Y---------------------------------------NRPTLEAYFEP-----N-A----THFIPKDGEPYS 162 (410) T ss_pred cC---C---------------------------------------CeEEEEEEEeC-----C-c----EEEEeeCCcccc Confidence 00 0 00111111110 0 0 001111111122 Q ss_pred ccCCCCCCccceEEeeeeeecCcccCCC-hHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc---cchhhhhhcC Q lcl|NC_013692. 355 EENPFPDKRIPYVVVNYIPRKRDLYGES-DGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL---DVTNRRRFDR 430 (726) Q Consensus 355 ~~~P~~~~~~Pf~~~~~~~~~~~~~g~g-~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav---~~~d~~~~~~ 430 (726) .++| .|.+|+|+|...+..++.+|.| |.+.++++|+.+|+.+..+.......++|+..+ .|.- ++.+...... T Consensus 163 ~~~~--~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-~G~d~d~~~~~~~~~~~ 239 (410) T protein:vir:95 163 VTNE--TGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYI-LGLDPDAEPMEKWKATV 239 (410) T ss_pred ccCC--CCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhee-eccCCCCCcCchhhhhh Confidence 3454 4789999999999999999988 458899999999999999999999999987765 3321 1123344456 Q ss_pred CceEeecCccch-hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 431 GENYEFNPGADP-RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGIL 509 (726) Q Consensus 431 g~vi~~~~~~~~-~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~ 509 (726) ++++.+..+.+. ...+..++..++ +.+...+..+...+-..||+|....|.... ...||.++......-........ T Consensus 240 ~~i~~~~~~~~~~~~~v~q~~~~~l-~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~ka~~k~ 317 (410) T protein:vir:95 240 SSLLTISSSDKGVKPSVGQFTTASM-SPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQ 317 (410) T ss_pred hhheeccCCCCCCcceEEecCCCCh-HHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHH Confidence 777776543321 112322333222 233344444445555677888888885432 12466666665555555555666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcC-eEEEEecccceecchhhcccccceeeecccchHHHHHHHH-HHHHHHHhhhccc Q lcl|NC_013692. 510 RRLSAGIIEIGRKIIAMNAEFLDDV-EVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVND-LTFMLQTMGPNMD 587 (726) Q Consensus 510 ~~~~~~~~~l~~~il~li~q~~d~e-~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~-l~~l~q~~~~~~~ 587 (726) +.|..+++.++++.+.+.-..-..+ ...++. + .|..-.+.. ..+..+... ...+.+. .+... T Consensus 318 ~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~------v---~W~p~~d~~------~~s~a~~aDa~~Kl~~a-~~g~~ 381 (410) T protein:vir:95 318 RSLGAGLLNVAYVAACLRDEFRYTRSQFVRTA------V---KWEPLFEAD------ANTMTMIGDGVVKLNQA-LPGYI 381 (410) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCcccccceee------E---EeeecCCcc------hhhHHHHHHHHHHHHHh-ccCCc Confidence 6677788887777766543321111 111110 0 111111111 111111111 1112121 11111 Q ss_pred hhHHHHHHHHHHHhhhhhh--hhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 588 PMMAQQIMGQIMELKKMPD--FAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAA 645 (726) Q Consensus 588 ~~~~~~~~~~~~~~~~~~e--~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q 645 (726) . .... .+..+..+ ..+...+.+. ++.+ T Consensus 382 ~---~~~~---~~~lg~~~~~~~~~~~~e~~-------------------------~~g~ 410 (410) T protein:vir:95 382 N---AETI---RDLTGIAGDMSAKPVVSEGG-------------------------SNGE 410 (410) T ss_pred c---HHHH---HHhcCCChHHHHHHHHHHHH-------------------------hCCC Confidence 1 1111 11112110 1010000000 0000 No 93 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.36 E-value=1.3e-11 Score=80.35 Aligned_cols=481 Identities=10% Similarity=0.052 Sum_probs=211.7 Q ss_pred CCCccchhhcCCCCCCc------cchhcCCCCCCchH-HHHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC--CCC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGD------PSKRLQPEWSNAPS-LAQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK--PKT 70 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~--~~~ 70 (726) |.-|.+ |.+-.-.++. ........|..... +.....+|....+.|..... ...+..+||.|.-... .+. T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:99 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 333321 1100000011 11222445543322 22223344444445554443 4577899998754321 111 Q ss_pred ----CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 71 ----EKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 71 ----~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) .+...+++.+...-.|+.....| ||.. +.|.. +|... .++++.+| ..|+.-......+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:99 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKDV----LEAIEAFN-DLNDVESHNRSLGLDLS 145 (511) T ss_pred ccccccCcceeecchHHHHHHHHHhhh----cccC--ceeec---CchHH----HHHHHHHH-hhcCHhHHHHHHHHHHH Confidence 12234688888888888777555 3433 34432 33332 34666665 35666677888999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+||+.. T Consensus 146 i~G~a~~~vy~ded------------------------------------------------------------------ 159 (511) T protein:vir:99 146 IYGKAYELMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999998877510 Q ss_pred ceeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|.+.+++|.+++ ||++.. ...-+ +.+.|.+... .+ T Consensus 160 -----------~~~~i~~~~p~~~~~vyd~~~~---~~~~~-~vr~~~~~~~-------~~------------------- 198 (511) T protein:vir:99 160 -----------DETRLYKSDAMSTFVIYDNTIE---RNSIA-GVRYLRTKPI-------DK------------------- 198 (511) T ss_pred -----------CceEEEEEccceeEEEEcCCCC---CceEE-EEEEEEeeec-------cc------------------- Confidence 01234567777765 444321 11222 2222211100 00 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEE---EeccCCCCCCccceEEeeeeeecCcccCC Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMI---RMEENPFPDKRIPYVVVNYIPRKRDLYGE 381 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l---~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~ 381 (726) ...+.+..+|+|.. +++.+++.-. .+...+ .....|.+.+.+|++.++. ..+|. T Consensus 199 ------------~~~~~~~~~~vyt~-----~~i~~~~~~~-~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~ 255 (511) T protein:vir:99 199 ------------TDEDEVFTVDLFTS-----HGVYRYLTSR-TNGLKLTPRENGFESHSFERMPITEFSN-----NERRK 255 (511) T ss_pred ------------CccceEEEEEEEeC-----CcEEEEEecC-CccccccccccccccCCCCccceEEecC-----CCCCC Confidence 00112333444431 2221111100 011110 1122333446677766543 34689 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc--ccchhhhhhcCCceEeecCc---------cchhhhccccc Q lcl|NC_013692. 382 SDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA--LDVTNRRRFDRGENYEFNPG---------ADPRAAVHMHT 450 (726) Q Consensus 382 g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga--v~~~d~~~~~~g~vi~~~~~---------~~~~~~i~~~~ 450 (726) |.++.++++++.+|..+|.+.+.+...++|.+.+ .|. .+..+......++++..... ......+.+.. T Consensus 256 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~ 334 (511) T protein:vir:99 256 GDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY 334 (511) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhh-ccCcccCchhhcccccccceecccccccccccccCCCCcceeEEe Confidence 9999999999999999999999998888887665 332 22222222333333332211 11112244444 Q ss_pred CccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 451 FPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF 530 (726) Q Consensus 451 ~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~ 530 (726) .+.-...+...+..+...+...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++... T Consensus 335 ~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~g--n~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:99 335 KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44445667778888889999999999887653322 2467777777766667777777777788888777777765432 Q ss_pred cCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhh Q lcl|NC_013692. 531 LDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKR 610 (726) Q Consensus 531 ~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 610 (726) .....- .++..+ .+........-..+..+.+..+ ...++.... +. .+..+.+.... T Consensus 413 ~~~~~~-----~~~~~i---------~i~f~~~~p~n~~e~~~~~~kl----~GiiS~et~---l~---~l~~v~D~~~E 468 (511) T protein:vir:99 413 RSIDVS-----KDFNTV---------RYVYNRNLPKSLIEELKAYIDS----GGKISQTTL---MS---LFSFFQDPELE 468 (511) T ss_pred CCcccc-----cccccc---------eEEeCCCCCcCHHHHHHHHHHH----hccCCHHHH---HH---hCCCCCCHHHH Confidence 211100 001100 0111111111111111111111 111111110 10 01111111111 Q ss_pred HHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 611 IREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 611 l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) ++....+.... +.. .+ ...................+.+.+..+ T Consensus 469 ~~ri~~E~~~~---------------~~~----~~----~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 469 VKKIEEDEKES---------------IKK----AQ----KNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHHHHHHHHH---------------HHH----Hh----hcccccCCCCCCCCCCCCCcCcccccC Confidence 11100000000 000 00 000000000000000000000000000 No 94 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.36 E-value=2.3e-11 Score=78.91 Aligned_cols=479 Identities=10% Similarity=0.049 Sum_probs=212.6 Q ss_pred CCCccchhhcCCCCCCc------cchhcCCCCCCchH-HHHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC--CC- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGD------PSKRLQPEWSNAPS-LAQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK--PK- 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~--~~- 69 (726) |.-|.+ |-.-....+. ........|.+... ......+|....+.|..... ...+..+||.|.-... ++ T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:93 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 433322 1100000011 11222455643332 22233445555555655443 4677899998765321 11 Q ss_pred ---CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 70 ---TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 70 ---~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) ..+..-+++.+...-.|+.....| +|.. +.|.+ +|... .++++.++ ..|+.-......+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:93 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKDV----LEVIEAFN-DLNDVESHNRSLGLDLS 145 (511) T ss_pred CcccccCcceeecchHHHHHHHHhhhh----cccC--eeecc---CChHH----HHHHHHHH-hhcCHhHHHHHHHHHHH Confidence 112234678888788888777555 3322 34432 33332 23555554 45666677889999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+|++.. T Consensus 146 ~~G~ay~~vy~de~------------------------------------------------------------------ 159 (511) T protein:vir:93 146 IYGKAYELMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999998877510 Q ss_pred ceeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|.+..++|.+++ ||+... .-...+.+.|.+... +. T Consensus 160 -----------~~~~i~~~~p~~~~~vydd~~~----~~~~~~vr~~~~~~~----------~~---------------- 198 (511) T protein:vir:93 160 -----------DETRLYKSDAMSTFVIYDNTIE----RNSIAGVRYLRTKPI----------DK---------------- 198 (511) T ss_pred -----------CceEEEEEccceeEEEEcCCCC----CceEEEEEEEEeeec----------cc---------------- Confidence 01234567777765 444321 112233333311100 00 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEE-----EEeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVM-----IRMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~-----l~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) ...+.+..+|+|.. +++.+++ ..++.. ....+.|.+.+.+|++.++. ..+ T Consensus 199 ------------~~~~~~~~~~iyt~-----~~i~~~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~ 253 (511) T protein:vir:93 199 ------------TDEDEVFTVDLFTS-----HGVYRYL---TSRTNGLKLTPRENGFESHSFERMPITEFSN-----NER 253 (511) T ss_pred ------------cccceEEEEEEEeC-----CcEEEEE---ecCCCccccccccccccccCCCccceEEecC-----CCC Confidence 00112333444421 1111111 111110 11122233446677665542 446 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc--cchhhhhhcCCceEeecCc---------cchhhhccc Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL--DVTNRRRFDRGENYEFNPG---------ADPRAAVHM 448 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav--~~~d~~~~~~g~vi~~~~~---------~~~~~~i~~ 448 (726) |.|.++.++++++.+|..+|.+.+.+...++|.+.+. |.. +..+......+.++.+.++ ......+.+ T Consensus 254 g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (511) T protein:vir:93 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-GNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGY 332 (511) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeee-cCcccCchhhcccccccceecccccccccccccCCCCcceeE Confidence 8899999999999999999999999998888876653 432 2222222333343333221 111223344 Q ss_pred ccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 449 HTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNA 528 (726) Q Consensus 449 ~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~ 528 (726) ...+.....+...+..+...+...|++++.+.+..++ +.||.|+..............-+.|..+++++++.++.++. T Consensus 333 l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~--n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~ 410 (511) T protein:vir:93 333 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444445667788888899999999999887653322 24677777777777777777777777888887777776543 Q ss_pred HhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhh Q lcl|NC_013692. 529 EFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFA 608 (726) Q Consensus 529 q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 608 (726) .......- .++..+ .+..+.....-..+..+.+..+ ...++... ++. .+..+.+.. T Consensus 411 ~~~~~~~~-----~d~~~i---------~~~f~~~~p~n~~e~~~~~~kl----~g~iS~et---~~~---~l~~v~d~~ 466 (511) T protein:vir:93 411 NTWSIDAN-----KDFNTV---------RYVYNRNLPKSLIEELKAYIDS----GGKISQTT---LMS---LFSFFQDPE 466 (511) T ss_pred hccCcccc-----cccccc---------eEEeCCCCCCCHHHHHHHHHHH----hccCchHH---HHH---hCCCCCCHH Confidence 32211110 011111 0111111111111111111111 11111111 110 011111111 Q ss_pred hhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 609 KRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 609 ~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) ..++....+... +++... ......... . .......+.+....+++ T Consensus 467 ~E~~ri~~E~~~---------------~~~~~~-----~~~~~~~~~--~-~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 467 LEVKKIEEDEKE---------------SIKKAQ-----KGIYKDPRD--I-NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHH---------------HHHHHh-----hhcccCCCC--C-CCCCCCCcccccccccC Confidence 101100000000 000000 000000000 0 00000000000000000 No 95 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.36 E-value=2.3e-11 Score=78.86 Aligned_cols=479 Identities=10% Similarity=0.034 Sum_probs=206.5 Q ss_pred CCCccc---hhhcCCCCCCccchhcCCCC---CCchHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC--CC-- Q lcl|NC_013692. 1 MADVDE---DYLTLPNEDGDPSKRLQPEW---SNAPSLAQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK--PK-- 69 (726) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~--~~-- 69 (726) |.-|.+ ++......+=.=.|..+-.+ +-+..+..-..++....+.|..... ...+..+||.|.-... ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 80 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 333321 11111100000001000111 1111111112233333334444433 3577889998765421 11 Q ss_pred CCCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGK--SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~gr--s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ..+++ .+++.+...-.|+.....| ||.. +.|.+ +|.. ..++|+.+| ..|+.-..+...++++++ T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p--~~~~~---~d~~----~~~~l~~~~-~~n~~~~~~~~~~~~~~i 146 (512) T protein:vir:97 81 KEEYMADNRVAHDYASYISDFINGYF----LGNP--IQCQD---DDKD----VLEAIEAFN-DLNDVESHNRSLGLDLSI 146 (512) T ss_pred cccccCcceeecchHHHHHHHHhhhh----cccC--ceecc---CChH----HHHHHHHHH-hhcCHHHHHHHHHHHHHh Confidence 11233 4778888888888777555 3322 34433 3332 234677766 356666788899999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+|++.. T Consensus 147 ~G~ay~~vy~ded------------------------------------------------------------------- 159 (512) T protein:vir:97 147 YGKAYELMIRNQD------------------------------------------------------------------- 159 (512) T ss_pred cCeEEEEEEeCCC------------------------------------------------------------------- Confidence 9999988776410 Q ss_pred eeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGP 305 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~ 305 (726) +.|++..++|.+++ ||++... . ...+.+.|.+... +. T Consensus 160 ----------~~~~i~~~~p~~~~~iyd~~~~~---~-~~~~vr~~~~~~~----------~~----------------- 198 (512) T protein:vir:97 160 ----------DETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKPI----------DK----------------- 198 (512) T ss_pred ----------CceEEEEEcccceEEEEcCCCCC---c-eEEEEEEEEeeec----------cc----------------- Confidence 01234557777765 4443321 1 1222222211100 00 Q ss_pred hccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCE--E---EEeccCCCCCCccceEEeeeeeecCcccC Q lcl|NC_013692. 306 SEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAV--M---IRMEENPFPDKRIPYVVVNYIPRKRDLYG 380 (726) Q Consensus 306 ~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~--~---l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g 380 (726) ...+.+..+|+|.. +++.+ ....++. . ....+.|.+++.+|++.+.. ..+| T Consensus 199 -----------~~~~~~~~~~vyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~ 254 (512) T protein:vir:97 199 -----------TDEDEVFTVDLFTS-----HGVYR---YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERR 254 (512) T ss_pred -----------cccceEEEEEEEeC-----CcEEE---EEecCCCcccccccccccccccCcccceEeecC-----CCCC Confidence 00112333444432 11111 1111111 0 11223344556777766543 3468 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc--cchhhhhhcCCceEeecCcc----------chhhhccc Q lcl|NC_013692. 381 ESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL--DVTNRRRFDRGENYEFNPGA----------DPRAAVHM 448 (726) Q Consensus 381 ~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav--~~~d~~~~~~g~vi~~~~~~----------~~~~~i~~ 448 (726) .|.++.++++++.+|..+|.+.+.+...++|.+.+ .|.. +..+......+.++...+.. ....-+.+ T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 333 (512) T protein:vir:97 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGY 333 (512) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEE Confidence 89999999999999999999999999988887765 3432 22222222333333222110 11112334 Q ss_pred ccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 449 HTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNA 528 (726) Q Consensus 449 ~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~ 528 (726) ...+.........+..+...+...|++++.+.|..++ +.||.|+...............+.|..+++++++.++.++. T Consensus 334 l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~g--n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~ 411 (512) T protein:vir:97 334 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 411 (512) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444445567778888899999999999988664322 24666777766666677777777777888777777766554 Q ss_pred HhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhh Q lcl|NC_013692. 529 EFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFA 608 (726) Q Consensus 529 q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 608 (726) .......- .++. ++. +..+.....-..+..+.+..+ ...++.... +. + +..+.+.. T Consensus 412 ~~~~~~~~-----~d~~-----~i~----~~f~~~~p~~~~e~~~~~~kl----~giiS~et~---~~-~--l~~v~d~~ 467 (512) T protein:vir:97 412 NTRSIDAN-----KDFN-----TVR----YVYNRNLPKSLIEELKAYIDS----GGKISQTTL---MS-L--FSFFQDPE 467 (512) T ss_pred hcCCcccc-----cccc-----cce----EEeCCCCCcCHHHHHHHHHHH----hccCchHHH---HH-h--CCCCCCHH Confidence 32211100 0010 001 111111111111111111111 111111110 10 0 11111110 Q ss_pred hhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 609 KRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQ-DSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 609 ~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~-~~~~~~eqaq~~q~~~q~~~~~ 676 (726) ..++....+... ++.. .+......... .......+.+ ....+++ T Consensus 468 ~E~eri~~E~~~---------------~~~~----~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 512 (512) T protein:vir:97 468 LEVKKIEEDEKE---------------SIKK----AQKGIYKDPRDINDDEQDDDTK-----DTVDKKE 512 (512) T ss_pred HHHHHHHHHHHH---------------HHHH----HhhcccCCCCCCCCCCCCCCcc-----ccccccC Confidence 001000000000 0000 00000000000 0000000000 0000000 No 96 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.36 E-value=1.5e-11 Score=79.89 Aligned_cols=480 Identities=9% Similarity=0.035 Sum_probs=208.3 Q ss_pred CCCccchhhcCCCCCC------ccchhcCCCCCCchHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC--CC- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDG------DPSKRLQPEWSNAPSL-AQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK--PK- 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~--~~- 69 (726) |.-|.+ |-.-..-.+ .........|.....+ ......|....+.|.+... ...+..+||.|.-... ++ T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:78 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCc Confidence 433322 000000000 0111223345332222 2222334443444444433 3577889998765321 11 Q ss_pred ---CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 70 ---TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 70 ---~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) ..+...+++.+...-.|+.....| ||.. +.|.+ +|.+ ..++|+.+|. .|+.-......+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:78 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKD----VLEAIEAFND-LNDVESHNRSLGLDLS 145 (511) T ss_pred ccccccCcceeecchHHHHHHHHhhhh----cccC--ceeec---CchH----HHHHHHHHHh-hcChhHHHHHHHHHHH Confidence 112235778788888888777555 3432 33432 3333 2345666653 5666667778999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+|++.. T Consensus 146 ~~G~a~~~vy~d~d------------------------------------------------------------------ 159 (511) T protein:vir:78 146 IYGKAYELMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999988776410 Q ss_pred ceeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|++..++|.+++ ||+... .. ...+.+.|.+... + T Consensus 160 -----------g~~~i~~~~p~~~~~v~dd~~~---~~-~~~~vr~~~~~~~----------~----------------- 197 (511) T protein:vir:78 160 -----------DETRLYKSDAMSTFIIYDNTVE---RN-SIAGVRYLRTKPI----------D----------------- 197 (511) T ss_pred -----------CceEEEEEcccceEEEEcCCCC---Cc-eEEEEEEEEeeec----------c----------------- Confidence 01334567777766 343321 11 1222222211100 0 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCE---EE--EeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAV---MI--RMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~---~l--~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) +...+.+..+|+|.. +++.+ +...++. +. .....|.+.+.+|++.+.. ..+ T Consensus 198 -----------~~~~~~~~~~~vyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~ 253 (511) T protein:vir:78 198 -----------KTDEDEVFTVDLFTS-----HGVYR---YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NER 253 (511) T ss_pred -----------ccccceEEEEEEEeC-----CcEEE---EEecCCCcccccccccccccCcCcccceEEecC-----CCC Confidence 000122333444431 11111 1111110 11 1123344446667665543 346 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchhhhhhcCCceEeecCc---------cchhhhcccc Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFDRGENYEFNPG---------ADPRAAVHMH 449 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d~~~~~~g~vi~~~~~---------~~~~~~i~~~ 449 (726) |.|.++.++++++.+|..+|.+.+.+...++|.+.+.-.. .+..+......+.++...++ ......+.+. T Consensus 254 g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:78 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEE Confidence 8899999999999999999999999998888877653222 22222222333333332211 1112223444 Q ss_pred cCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 450 TFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAE 529 (726) Q Consensus 450 ~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q 529 (726) ..+.-...+...+..+...+...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++.. T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:78 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444445666778888888999999999887664332 246667777666666666677777778888877777666543 Q ss_pred hcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhh Q lcl|NC_013692. 530 FLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAK 609 (726) Q Consensus 530 ~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 609 (726) ......- .++..+ .+..+.....-..+..+.+..+ .+ .++... .+. .+..+.+... T Consensus 412 ~~~~~~~-----~~~~~i---------~~~f~~~~p~n~~e~~d~~~kl---~G-~iS~et---~l~---~l~~v~d~~~ 467 (511) T protein:vir:78 412 TRSIDAN-----KDFNTV---------RYVYNRNLPKSLIEELKAYIDS---GG-KISQTT---LMS---LFSFFQDPEL 467 (511) T ss_pred cCCCccc-----cccccc---------eEEeCCCCCcCHHHHHHHHHHH---hc-cCChHH---HHH---hCCCCCCHHH Confidence 2211100 011000 0111111111111111111111 11 111111 010 0111111111 Q ss_pred hHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 610 RIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 610 ~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) .++....+... +++... ......... . .......+.+....+.+ T Consensus 468 El~ri~~E~~~---------------~~~~~~-----~~~~~~~~~--~-~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 468 EVKKIEEDEKE---------------SIKKAQ-----KGIYKDPRD--I-NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHH---------------HHHHHh-----hccccCCCC--C-CCCCCCCCccCcccccC Confidence 11110000000 000000 000000000 0 00000000000000000 No 97 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.36 E-value=1.5e-11 Score=79.89 Aligned_cols=480 Identities=9% Similarity=0.035 Sum_probs=208.3 Q ss_pred CCCccchhhcCCCCCC------ccchhcCCCCCCchHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHhccCCCCC--CC- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDG------DPSKRLQPEWSNAPSL-AQLKQDYQEAKQVTDEKIT-QINRWLDYMHVRGEGK--PK- 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~-~~~~~~~~y~~~~~~~--~~- 69 (726) |.-|.+ |-.-..-.+ .........|.....+ ......|....+.|.+... ...+..+||.|.-... ++ T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:96 1 MLKVNE-FETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccc-hhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCc Confidence 433322 000000000 0111223345332222 2222334443444444433 3577889998765321 11 Q ss_pred ---CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 70 ---TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 70 ---~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) ..+...+++.+...-.|+.....| ||.. +.|.+ +|.+ ..++|+.+|. .|+.-......+++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:96 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNP--IQYQD---DDKD----VLEAIEAFND-LNDVESHNRSLGLDLS 145 (511) T ss_pred ccccccCcceeecchHHHHHHHHhhhh----cccC--ceeec---CchH----HHHHHHHHHh-hcChhHHHHHHHHHHH Confidence 112235778788888888777555 3432 33432 3333 2345666653 5666667778999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) ++|.+.+.+|++.. T Consensus 146 ~~G~a~~~vy~d~d------------------------------------------------------------------ 159 (511) T protein:vir:96 146 IYGKAYELMIRNQD------------------------------------------------------------------ 159 (511) T ss_pred hcCeeEEEEEeCCC------------------------------------------------------------------ Confidence 99999988776410 Q ss_pred ceeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|++..++|.+++ ||+... .. ...+.+.|.+... + T Consensus 160 -----------g~~~i~~~~p~~~~~v~dd~~~---~~-~~~~vr~~~~~~~----------~----------------- 197 (511) T protein:vir:96 160 -----------DETRLYKSDAMSTFIIYDNTVE---RN-SIAGVRYLRTKPI----------D----------------- 197 (511) T ss_pred -----------CceEEEEEcccceEEEEcCCCC---Cc-eEEEEEEEEeeec----------c----------------- Confidence 01334567777766 343321 11 1222222211100 0 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCE---EE--EeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAV---MI--RMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~---~l--~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) +...+.+..+|+|.. +++.+ +...++. +. .....|.+.+.+|++.+.. ..+ T Consensus 198 -----------~~~~~~~~~~~vyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~ 253 (511) T protein:vir:96 198 -----------KTDEDEVFTVDLFTS-----HGVYR---YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NER 253 (511) T ss_pred -----------ccccceEEEEEEEeC-----CcEEE---EEecCCCcccccccccccccCcCcccceEEecC-----CCC Confidence 000122333444431 11111 1111110 11 1123344446667665543 346 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchhhhhhcCCceEeecCc---------cchhhhcccc Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFDRGENYEFNPG---------ADPRAAVHMH 449 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d~~~~~~g~vi~~~~~---------~~~~~~i~~~ 449 (726) |.|.++.++++++.+|..+|.+.+.+...++|.+.+.-.. .+..+......+.++...++ ......+.+. T Consensus 254 g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:96 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEE Confidence 8899999999999999999999999998888877653222 22222222333333332211 1112223444 Q ss_pred cCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 450 TFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAE 529 (726) Q Consensus 450 ~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q 529 (726) ..+.-...+...+..+...+...|++++.+.+..++ +.||.++..............-+.|..+++++++.++.++.. T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:96 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444445666778888888999999999887664332 246667777666666666677777778888877777666543 Q ss_pred hcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhh Q lcl|NC_013692. 530 FLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAK 609 (726) Q Consensus 530 ~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 609 (726) ......- .++..+ .+..+.....-..+..+.+..+ .+ .++... .+. .+..+.+... T Consensus 412 ~~~~~~~-----~~~~~i---------~~~f~~~~p~n~~e~~d~~~kl---~G-~iS~et---~l~---~l~~v~d~~~ 467 (511) T protein:vir:96 412 TRSIDAN-----KDFNTV---------RYVYNRNLPKSLIEELKAYIDS---GG-KISQTT---LMS---LFSFFQDPEL 467 (511) T ss_pred cCCCccc-----cccccc---------eEEeCCCCCcCHHHHHHHHHHH---hc-cCChHH---HHH---hCCCCCCHHH Confidence 2211100 011000 0111111111111111111111 11 111111 010 0111111111 Q ss_pred hHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 610 RIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 610 ~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) .++....+... +++... ......... . .......+.+....+.+ T Consensus 468 El~ri~~E~~~---------------~~~~~~-----~~~~~~~~~--~-~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 468 EVKKIEEDEKE---------------SIKKAQ-----KGIYKDPRD--I-NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHHH---------------HHHHHh-----hccccCCCC--C-CCCCCCCCccCcccccC Confidence 11110000000 000000 000000000 0 00000000000000000 No 98 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.36 E-value=6.3e-11 Score=76.52 Aligned_cols=457 Identities=11% Similarity=0.076 Sum_probs=198.0 Q ss_pred CCCccchhhcCCCCCCccchhcCCCC--CCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC---------CC Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEW--SNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK---------PK 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~---------~~ 69 (726) |..+ -+||.. ++.- .+| +-.+........|..-...|...+....++.+||.|.-+.. -+ T Consensus 1 ~~~~----~~~~~~---~~~~--~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~ 71 (474) T protein:vir:95 1 MFNI----IRMPWD---KPYG--EEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGN 71 (474) T ss_pred Ccce----eecCCC---Cchh--hHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccc Confidence 2211 012221 0000 001 00111111112233333345566666788999998753211 01 Q ss_pred CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ...+++ +++.+.....|+.....| ||.. +.|.. +|.+ ....+...+ +++....+...+++++. T Consensus 72 ~~~~~~~~ki~~n~~~~Ivd~~~~~l----~g~p--~~~~~---~d~~----~~~~l~~~~--~n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:95 72 IDYDKPDWRITTNFHQNLVDQKVSYV----ASKP--VTYSC---EDES----VLKIIHDVL--DTRWDNKLIDILTATSN 136 (474) T ss_pred cccccccceeccchHHHHHHHHHhhh----ccCC--ceecc---CchH----HHHHHHHHH--hccHHHHHHHHHHHHhh Confidence 122333 678888888888777555 4433 34432 3333 233555544 35666778889999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+||+.. T Consensus 137 ~G~~~~~v~~d~~------------------------------------------------------------------- 149 (474) T protein:vir:95 137 KGIDWLQVYINEN------------------------------------------------------------------- 149 (474) T ss_pred cCcEEEEEEecCC------------------------------------------------------------------- Confidence 9999998877510 Q ss_pred eeecccceeeccceeeeechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGP 305 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~ 305 (726) +.|++..++|.+++ ||+... .+..+++ +.|...++ ..++. T Consensus 150 ----------~~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-~~~~~~~~-------~~~~~----------------- 191 (474) T protein:vir:95 150 ----------GEMKLFRVPAEQAIPIWVDKER---EELKSFI-RYYKFNNE-------EKVEF----------------- 191 (474) T ss_pred ----------CceEEEEEcccceEEEEcCCCC---CceEEEE-EEEEEcCe-------eEEEE----------------- Confidence 01233456666655 333221 2222222 22211000 00000 Q ss_pred hccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH Q lcl|NC_013692. 306 SEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA 385 (726) Q Consensus 306 ~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 385 (726) | ...+|..|.+ .+.+... .+..+.........|.+.+.+|++.++. ...|.|.++ T Consensus 192 --------y---~~~~~~~~~~------~~~~~~~---~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e 246 (474) T protein:vir:95 192 --------W---TDTTVTYYVL------ENGGLIP---DYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIW 246 (474) T ss_pred --------E---eCCeEEEEEE------cCCcccc---ccccCcccccccccccCCCccceEeecC-----CCCCCCcHH Confidence 0 0011111111 1111100 0000111111122334446778777654 346889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCceEeecCccchhhhcccccCccchhHHHHHHH Q lcl|NC_013692. 386 LLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMIN 463 (726) Q Consensus 386 ~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~ 463 (726) .++++++.+|.+++.+.+.+...++|.+.+.-...+..+. .....++++.+..+++ +.+...+.....+...+. T Consensus 247 ~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:95 247 MYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYID 322 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHH Confidence 9999999999999999999999988877654322222111 2234455666655443 334444444566777788 Q ss_pred HHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 464 LQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 464 ~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) .+...+...+++++.+.|..++ +.||.++...............+.|..+++++++.++++. ... .+ T Consensus 323 ~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----g~~-------~d 389 (474) T protein:vir:95 323 LMRAYIMEFGQGVDFQTDKFGS--APSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFN----NLK-------MD 389 (474) T ss_pred HHHHHHHHHhCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-------cc Confidence 8888999999999877653322 2466667666666666666666666677776666655543 210 01 Q ss_pred ceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_013692. 544 FVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQ 623 (726) Q Consensus 544 ~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~q 623 (726) +..+ .+..+.....-.....+.+ ...+ .++... .+. .+....+....++....+.....+ T Consensus 390 ~~~i---------~v~f~~~~p~d~~e~a~~~----~~~g-~iS~et---~i~---~l~~v~d~~~E~~ri~~E~~~~~~ 449 (474) T protein:vir:95 390 VKDI---------EISFNFNRMMNDAEQSQII----AQSQ-YLSRET---LVK---SSPLVDDYKAELERIEQEQMEYNK 449 (474) T ss_pred ccee---------eEEeccCCCcCHHHHHHHH----HhcC-CCchHH---HHH---hCCCCCCHHHHHHHHHHHHHHHHh Confidence 1111 1111111111011111111 1111 111110 000 011111111111110000000000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAK 664 (726) Q Consensus 624 q~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq 664 (726) +... ..... .. ...+.......+-+ T Consensus 450 ~~~~------~~~~~---~d-------~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 450 QLPN------LDDGG---AD-------GAQQQERSNDKESE 474 (474) T ss_pred cccc------ccccc---CC-------CCcCCCCCccCCCC Confidence 0000 00000 00 00000000000000 No 99 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.35 E-value=4e-11 Score=77.58 Aligned_cols=477 Identities=12% Similarity=0.037 Sum_probs=190.6 Q ss_pred CCCccc-hhcCCCCCC-ch---HHHHHHHHHHHHHHHHHHHHHHHHHH-HH-H---hccCCCCCCCCCCCCCcCCCHHHH Q lcl|NC_013692. 14 EDGDPS-KRLQPEWSN-AP---SLAQLKQDYQEAKQVTDEKITQINRW-LD-Y---MHVRGEGKPKTEKGKSAVQPPTIR 83 (726) Q Consensus 14 ~~~~~~-~~~~~~~~~-~~---~~~~~~~~~~~a~~~~~~~~~~~~~~-~~-~---y~~~~~~~~~~~~grs~~v~~~v~ 83 (726) +...-. |...+.|-+ .| .+..+..... -+.+. +.+| .. | +-+.| ++|...- .++..+.-+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~~~~w~~~--~~~~~~~-~~~~~~l~~ 70 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLP---LVPDN----QKEWSKDSYLTSLWAQG--YVPTVHD-KLMNSGTGN 70 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhh---hcccc----hhhhhhhhhhhhhcccC--CCCcccc-ccccCChHH Confidence 222222 223555532 11 1111111110 00000 1111 01 1 11222 2232322 233444433 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ..++ -+++| .|+-..-|.|......|. ++++++|+.++ ..|+....++.++..++..|.+++|.+|+.. . T Consensus 71 ~i~~-~~A~l---l~~e~~~i~v~~~~~~d~---e~~~~~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--~ 140 (518) T protein:vir:78 71 EIVV-VAAEY---ISGKPLSIDVTGVNGSKD---ENLTKQLKEAL-RIDNFDSKSVKIVELAGGSGVSAVKINILNG--R 140 (518) T ss_pred HHHH-HHHHh---hcCCCceEEecCccccCc---HHHHHHHHHHH-HhccHHHHHHHHHHHhhccCceEEEEEEECC--e Confidence 3333 33344 244444456654333332 35677888877 4677788899999999999999999999620 0 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) |.|+ T Consensus 141 ----------------------------------------------------------------------------~~i~ 144 (518) T protein:vir:78 141 ----------------------------------------------------------------------------PSIS 144 (518) T ss_pred ----------------------------------------------------------------------------eEEE Confidence 1223 Q ss_pred eechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEE Q lcl|NC_013692. 244 VCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLV 323 (726) Q Consensus 244 ~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 323 (726) .|++..|++..+- .++-.|-|+ ... ..++ ...+|.-++.--+.... .....+ ..-.. T Consensus 145 ~v~ad~~~P~~~~-g~~~~~~f~--~~~-~~~~--k~~~y~~lE~he~~~~~-~~~~~~----------------~~~~I 201 (518) T protein:vir:78 145 VHSSSQFWIDFKN-NEPFRFNFF--EEI-PTSN--KADIYYLVESREIKQWD-KEGKKL----------------SGGFV 201 (518) T ss_pred EEcCCeeEEEeec-CcEEEEEEE--EEe-ecCC--cceeEEEEEeecccccc-ceeecc----------------cceeE Confidence 3444444432111 112222221 100 0000 00011000000000000 000000 00000 Q ss_pred EEEEEEEeecCCCc-------eEEEEEEE--EECCEEEEeccCCCC-CCccceEEeeeee-----ecCcccCCChHHHHH Q lcl|NC_013692. 324 VHEYWGYYDIHGDG-------VLHPIVAT--WVGAVMIRMEENPFP-DKRIPYVVVNYIP-----RKRDLYGESDGALLI 388 (726) Q Consensus 324 v~E~w~~~~~~~~g-------~~~~~~~~--~~g~~~l~~~~~P~~-~~~~Pf~~~~~~~-----~~~~~~g~g~~~~~~ 388 (726) -++.|. .+ .+++ +.+....+ +.|. .. ..-+. ....||+++.+.+ .+++.+|.|++..++ T Consensus 202 ~n~ly~-~~-~~~~v~~~~~~~~~~l~~~~~~~~~--~e--~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~ 275 (518) T protein:vir:78 202 TYSVIK-ID-GDKTTPISAERLPEQITSYLHTNDI--QL--NHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCT 275 (518) T ss_pred EEEEee-ec-CcccccccccccccccccccccccC--cc--ceeeccCCccceEEeeccccccccccCCCcCcchHhhhh Confidence 011110 00 0000 00000000 0000 00 00001 1234666654433 357788999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCceEeecccccchh-------hhhhcCC-ceE-eecC----ccchhhhcccccCccch Q lcl|NC_013692. 389 DNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTN-------RRRFDRG-ENY-EFNP----GADPRAAVHMHTFPEIP 455 (726) Q Consensus 389 d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d-------~~~~~~g-~vi-~~~~----~~~~~~~i~~~~~~~~~ 455 (726) +.++.+|..++++.+.+.. +.+++.++++.+.... ...+..+ .++ .++. +......|...++.... T Consensus 276 ~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~ 354 (518) T protein:vir:78 276 NYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRD 354 (518) T ss_pred HHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccCh Confidence 9999999999999999865 8888999888773211 1112221 222 2221 11111224444433333 Q ss_pred hHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCe Q lcl|NC_013692. 456 QSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVE 535 (726) Q Consensus 456 ~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~ 535 (726) ..+...++.+...+....|++....|.++. ..||+++....+..-+.+..+...+..+++.+...++.+..-+..... T Consensus 355 e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~--~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~ 432 (518) T protein:vir:78 355 GSYRETMEYFAQKAVSKSGYNPATFNLGNR--EVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKE 432 (518) T ss_pred HHHHHHHHHHHHHHHHhhCCChhhcCcccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 455666777777788888888877775432 368888877666655666677777777888777777776654432211 Q ss_pred EEEEecccceecchhhcccccceee--ecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHH Q lcl|NC_013692. 536 VVRITNEHFVDIRRDDLAGNFDLKL--DISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIRE 613 (726) Q Consensus 536 ~iRi~~~~~v~v~~~~~~~~~dv~i--~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 613 (726) .. .....+++++ +.+...-.....+....+.. .+ -+.....-..+.--.....+.+..+++++ T Consensus 433 ~~-------------~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~-aG-imS~e~~i~~~~~~~~deea~~e~~ri~~ 497 (518) T protein:vir:78 433 KA-------------IMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS-AL-AMSVEEKVKLIHPKWEDEEIQAEVKRIYL 497 (518) T ss_pred cc-------------cCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh-cC-CCCHHHHHHHhCCCCCHHHHHHHHHHHHH Confidence 00 0011112222 22222111222221111111 11 11111100000000000011111111111 Q ss_pred HHhhhhhh-hhhHHHHHHHHH Q lcl|NC_013692. 614 FQPQPDPI-AQQKAQLELMLL 633 (726) Q Consensus 614 ~~~~~~~~-~qq~~q~e~q~~ 633 (726) .+...... .....-.+...- T Consensus 498 E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 498 ENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HhcccCCCCCccccCCCCCCC Confidence 11100000 000000000000 No 100 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.35 E-value=2.6e-12 Score=84.14 Aligned_cols=459 Identities=10% Similarity=0.039 Sum_probs=189.1 Q ss_pred CccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCC---CCCCcCCCHHHHHHHHHHH Q lcl|NC_013692. 16 GDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTE---KGKSAVQPPTIRKQAEWRY 90 (726) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~---~grs~~v~~~v~~~v~~~~ 90 (726) ++-..+.+++...+..+..|...+.. +.....+..+||.|.-..+- ... ..+-++|..-.+..|+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~ 73 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTE-------RTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIA 73 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHH Confidence 22233334555555566666666542 22334567889987653210 000 0111344555555666555 Q ss_pred HHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 91 SSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 91 ~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) ..|. | -| |. .++|.+. ...++.+| ..|+.-.....++++++++|.+++.+|++..... T Consensus 74 ~~l~--~-~g-----~~--~~~~~~~----~~~l~~i~-~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~------- 131 (484) T protein:vir:77 74 ARQE--L-EG-----FR--LGGADKA----DEQLWDWW-QANDLDIESTLGHTDSLVHGRSYITISKPDPNID------- 131 (484) T ss_pred hhhc--c-Cc-----ee--cCCcchh----HHHHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcc------- Confidence 4441 1 11 11 1233332 23466666 4676666777899999999999999887611000 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) . ......|.|..++|.++ T Consensus 132 --------------------------~------------------------------------~~~~~~~~i~~~~p~~~ 149 (484) T protein:vir:77 132 --------------------------P------------------------------------GVDPEVPIIRVEPPTNL 149 (484) T ss_pred --------------------------c------------------------------------ccccccceEEEecccee Confidence 0 00012244566788887 Q ss_pred e--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 251 V--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 251 ~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) + |||..+ +..+.+ +.+.+ + . ...+..++.| T Consensus 150 ~~~~D~~~~----~~~~a~-~~~~~--~---~--------------------------------------~~~~~~~~~y 181 (484) T protein:vir:77 150 YAQIDPRTR----QVMRAI-RAIED--E---E--------------------------------------GNEVIGATLY 181 (484) T ss_pred EEEecCCCC----ceEEEE-EEEEe--e---c--------------------------------------CCcEEEEEEE Confidence 5 455321 111211 11111 0 0 0011122222 Q ss_pred EEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH-HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 329 GYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA-LLIDNQRIIGAVTRGMIDTMAR 407 (726) Q Consensus 329 ~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~-~~~d~Q~~~N~~~~~~~d~l~~ 407 (726) .. +.+.+ ....++.....+..|.+.|.+|+++|...+..++++|.|.+. .++++++.+|..++.+.+.+.. T Consensus 182 ~~-----~~~~~---~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~ 253 (484) T protein:vir:77 182 LP-----NNTVI---WNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAEL 253 (484) T ss_pred ec-----CeEEE---EEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHh Confidence 11 00000 001111111122334455889999999888889999999886 5899999999999999999998 Q ss_pred cCCCceEeecccc-cc---hh-----hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHH---hch Q lcl|NC_013692. 408 SANGQVGVMKGAL-DV---TN-----RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESM---TGV 475 (726) Q Consensus 408 ~~~~~~~~~~gav-~~---~d-----~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~---tGv 475 (726) .+.|+..+ .|.- +. .+ .....+|.++.. ++.++ .+.+++.. .....+.++...+..+ +++ T Consensus 254 ~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~~q~~~~--~~e~~~~~l~~~i~~~s~~~~~ 325 (484) T protein:vir:77 254 MGVPQRLL-FGVKGEELGVDPETGQTLFDAYLARILAF-EDHES----KAQQFSAA--ELRNFVDALDALDRKAAAYTGL 325 (484) T ss_pred hhhhHHHH-hCCCcchhcccccccchhhhhhhhhhccc-CCCCc----eeEeecCC--ChHHHHHHHHHHHHHHhcccCC Confidence 88887654 2321 10 00 111223444333 22222 12233322 2344556666666655 567 Q ss_pred HHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccc Q lcl|NC_013692. 476 KAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGN 555 (726) Q Consensus 476 ~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~ 555 (726) ++...|..+. ...||.++......-........+.|..+++++++.++.+. ..... . .++ T Consensus 326 p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~----~~~~~-~---~~~----------- 385 (484) T protein:vir:77 326 PPYYLSFSSE-NPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVM----NGGDI-P---PEY----------- 385 (484) T ss_pred CHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCc-c---ccc----------- Confidence 7777774332 12356666655555455555555666666666666554432 21100 0 000 Q ss_pred cceeeecccc-hHH-HHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh-hhhhhhHHHHHhhhhhhhhhHHHHHHHH Q lcl|NC_013692. 556 FDLKLDISTA-EED-NAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKM-PDFAKRIREFQPQPDPIAQQKAQLELML 632 (726) Q Consensus 556 ~dv~i~~~~~-~~~-~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~~l~~~~~~~~~~~qq~~q~e~q~ 632 (726) +++.+.-... ..+ .+.......+.+. +..+... ..+. +..+. ++-.+.++....+.. .+.+. . T Consensus 386 ~~i~v~w~~~~~~s~~~~ad~~~kl~~~-g~gi~s~---et~~---~~l~~~~~~~~e~~~~~~ee~------~~~~~-~ 451 (484) T protein:vir:77 386 YRMESIWRDPSTPTYAAKADAATKLYNN-GQGVIPK---ERAR---IDMGYSITEREEMRKWDEEEQ------AQGLG-L 451 (484) T ss_pred ccceEEecCCCCCCHHHHHHHHHHHHhc-cCCCCCH---HHHH---hcCCCChhHHHHHHHHHHHHH------HHHHH-H Confidence 0111110000 001 1111111111111 1001000 0000 00000 000000000000000 00000 0 Q ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 633 LQAQ--IEAERARAAHYMSGAGLQDSKVGTEQAKAR 666 (726) Q Consensus 633 ~qaq--~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~ 666 (726) +.+. ...+....-.... ..+.....++..+. T Consensus 452 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 452 MGTMFGTDPSGGGNPDNPE---TPEPQPNPAEEAAA 484 (484) T ss_pred HhhhccccccCCCCCCCCC---cccccCCCccccCC Confidence 0000 0000000000000 00000000000000 No 101 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.34 E-value=1.5e-11 Score=79.90 Aligned_cols=473 Identities=10% Similarity=0.030 Sum_probs=207.8 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCC--CCCc Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEK--GKSA 76 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~--grs~ 76 (726) ||=+= ..+++.+-. +.++..|...|+ -|...+..+.+..+||.|.-... ++..+ ..-+ T Consensus 1 ~~~~~-------------~~~~~~~~~-~~~~~~i~~~i~----~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~k 62 (499) T protein:vir:10 1 MAVVI-------------DKDLLDDVN-EPNIEAINYAIR----ELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAAN 62 (499) T ss_pred Cccch-------------hhhHHhhhh-cCCHHHHHHHHH----HHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcce Confidence 33221 122222221 222333444443 35566666788999998864321 11222 3457 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 77 VQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 77 ~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) ++.+..+..|+.....| ||.+ +.|.+ +|.+..+ .++.+| ..|+.-..+..+.++++++|.+.+.+| T Consensus 63 i~~n~~~~Iv~~~~~~l----~g~p--~~~~~---~~~~~~~----~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~ 128 (499) T protein:vir:10 63 VMVNHAKYITDMNVGFM----TGNP--VKYVA---EKGKNID----DILEVF-NQIDIHKHDIELEKDLSVFGYGYELLY 128 (499) T ss_pred eecchHHHHHHHHhhhh----cccC--ceeec---CChhHHH----HHHHHH-hhcCHhHHHHHHHHHHHhcCceEEEEE Confidence 77777777777777544 4433 34443 2333333 344444 345555678889999999999999887 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) ++.. ....+.. .. .....+. T Consensus 129 ~~~~------g~~~~~~--------------------~~----------------------------------~~~~~~~ 148 (499) T protein:vir:10 129 LKKT------DPISVRD--------------------EL----------------------------------GNEKLTP 148 (499) T ss_pred eccc------ccccccc--------------------cc----------------------------------ccccccc Confidence 7611 0000000 00 0000111 Q ss_pred eccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCC Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQD 316 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (726) ...+++..|+|++++.=.+.. ...-...+.+.+.+.+. + T Consensus 149 ~~~~~~~~v~p~~~~~v~~d~--~~~~~~~~i~~~~~~~~----------~----------------------------- 187 (499) T protein:vir:10 149 NTELKIEVIDPRATVVVCDDT--VEHDPLFAVFTQEKKDL----------E----------------------------- 187 (499) T ss_pred ccceEEEEEcccceEEEecCC--CCcceEEEEEEEEEeec----------C----------------------------- Confidence 223556778888865422111 11111222222211100 0 Q ss_pred cCCceEEEEEEEEEeecCCCceEEEEE----EEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHH Q lcl|NC_013692. 317 KSRKRLVVHEYWGYYDIHGDGVLHPIV----ATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQR 392 (726) Q Consensus 317 ~~~~~v~v~E~w~~~~~~~~g~~~~~~----~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~ 392 (726) ....+..+|.|.. +.+..+.. ....+..++...++|| +.+|++.+.. +.+|.|.++.++++++ T Consensus 188 -~~~~~~~~~iyt~-----~~i~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~~~~d~e~v~~liD 254 (499) T protein:vir:10 188 -GNTNGYSITVYMP-----QRIVEYRTKTTMEVSANDPIVYDGENLF--GAVPIIEFRN-----NEERQGDFEQLISLID 254 (499) T ss_pred -CCceEEEEEEEeC-----CeEEEEEecCCccccCcceecccccCCC--CccceEEecC-----CCCCCCchHhHHHHHH Confidence 0012233333321 11111100 0011122333344443 6777766543 4568899999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCceEeecccccch--hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHH Q lcl|NC_013692. 393 IIGAVTRGMIDTMARSANGQVGVMKGALDVT--NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAE 470 (726) Q Consensus 393 ~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~--d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e 470 (726) .+|.++|.+.+.+...++|.+.+.-..++.. .......+.++.+..+... .+.+...+.....+...+..+...+. T Consensus 255 ~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~--d~~~l~~~~~~~~~~~~~~~l~~~I~ 332 (499) T protein:vir:10 255 AYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGA--DIEWLTKSFDETQVNLLSQSIENDIH 332 (499) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCC--cceEEeccCCHHHHHHHHHHHHHHHH Confidence 9999999999999999998877643223221 1233455565554322222 23344444445667778888889999 Q ss_pred HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchh Q lcl|NC_013692. 471 SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRD 550 (726) Q Consensus 471 ~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~ 550 (726) ..|+++..+.+..++ +.||.++..............-+.|..+++++++.++.++.-.... .++. T Consensus 333 ~~s~~p~~~~~~~~g--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--------~d~~----- 397 (499) T protein:vir:10 333 KISYVPNMNDEKFMG--NVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGAN--------DDAS----- 397 (499) T ss_pred HHhCcccCCchhhcc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc--------cccc----- Confidence 999998766542221 2366667666666666666666667777776666666553211100 0110 Q ss_pred hcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhh---hhhHHHHHhh---------- Q lcl|NC_013692. 551 DLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDF---AKRIREFQPQ---------- 617 (726) Q Consensus 551 ~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~---~~~l~~~~~~---------- 617 (726) +.. +..+.....-.....+.+..+ ...++....-.+ +..+.+. .+++.+.+.. T Consensus 398 ~i~----i~f~~~~p~n~~e~~~~~~kl----~g~iS~et~~~~------l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (499) T protein:vir:10 398 GCK----ISLVANIPSNLSDVVNNVKNA----DGIIPRKYTYSW------LPDVDNPQDVIDEMNQQDAETIKKNQEALR 463 (499) T ss_pred cce----EEeCCCCCCCHHHHHHHHHHH----hccCChHHHHHh------CCCCCCHHHHHHHHHHHHHHHHHHHHhhhc Confidence 111 111111111011111111111 111111110000 0000000 0111110000 Q ss_pred ---hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 ---PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVG 659 (726) Q Consensus 618 ---~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~ 659 (726) ++.......+...+........+.++. -+.++- T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~ 499 (499) T protein:vir:10 464 GQDPDRLELEDKQDDSSENDKEAGSNHNQS---------HRTRAV 499 (499) T ss_pred cCCCCCCCCCCCCcccCCCCCCCccccccC---------CCCCCC Confidence 000000000000000000000000000 000000 No 102 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.34 E-value=8.8e-11 Score=75.72 Aligned_cols=450 Identities=9% Similarity=0.018 Sum_probs=201.9 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--C--------- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--K--------- 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~--------- 69 (726) |.++ -+|+.+-..-.....+. ..+...|.+....+ .++...+..+||.|.-+... . T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~-------~~~~~~i~~~~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~ 67 (479) T protein:vir:79 1 MLNI----YISETDLIKVQLKKEST-------INLVKVIEHYILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKV 67 (479) T ss_pred CCCc----eecccceEeeccccCCh-------hHHHHHHHHHHhhh--hHHHHHHHHHHhccCCcccccccccccccccc Confidence 4433 23343333222222221 12223333322222 34567888999987543210 0 Q ss_pred CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 70 TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 70 ~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ....|+ +++.+-.+..|+.....| ||.+ +.|.+ .|.. ...+++..+ .|+....+..++++++. T Consensus 68 ~~~~~~~~ki~~~~~~~Ivd~~~~~l----~g~p--~~~~~---~~~~----~~~~~~~~~--~n~~~~~~~~~~~~~~~ 132 (479) T protein:vir:79 68 DDFTKVNNKAINNYHKLLVDQKVGYS----VGNP--IVFNA---DDDN----LTKLLNDLL--GEEFDDTITELYLNASN 132 (479) T ss_pred cccccCcceeecchHHHHHHHHHhhh----hcCC--ceecc---CCHH----HHHHHHHHH--hcCHHHHHHHHHHHHHh Confidence 111222 577777777888766555 4433 34433 2222 223555543 36666778899999999 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +|.+.+.+||+.. T Consensus 133 ~G~~~~~v~~d~~------------------------------------------------------------------- 145 (479) T protein:vir:79 133 KGVEWLHPYINRK------------------------------------------------------------------- 145 (479) T ss_pred cCeEEEEEEeCCC------------------------------------------------------------------- Confidence 9999998877511 Q ss_pred eeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGP 305 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~ 305 (726) +.|++..++|.++++ |+... ....+ +.+.|...+. . T Consensus 146 ----------~~~~i~~~~p~~~~~v~d~~~~---~~~~~-~ir~y~~~~~---~------------------------- 183 (479) T protein:vir:79 146 ----------GEFKYVIIPAEEAIPIWDSKRQ---RELVA-FIRFYYIEDI---D------------------------- 183 (479) T ss_pred ----------CceEEEEEccceeEEEEeCCCC---CceEE-EEEEEEEeec---C------------------------- Confidence 012345566666543 33221 11122 2222211100 0 Q ss_pred hccccccccCCcCCceEEEEEEEEE-----eecCCCceEEE------EEEEEECCEEEEeccCCCCCCccceEEeeeeee Q lcl|NC_013692. 306 SEGVRNFDFQDKSRKRLVVHEYWGY-----YDIHGDGVLHP------IVATWVGAVMIRMEENPFPDKRIPYVVVNYIPR 374 (726) Q Consensus 306 ~~~~~~~~~~~~~~~~v~v~E~w~~-----~~~~~~g~~~~------~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~ 374 (726) .+.+..+|+|.. +...+++.... ...+............|.+.+.+||+.+.. T Consensus 184 -------------~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n--- 247 (479) T protein:vir:79 184 -------------GNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN--- 247 (479) T ss_pred -------------CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecC--- Confidence 000111122110 00011111000 000001011112233444456777776643 Q ss_pred cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cchh--hhhhcCCceEeecCccchhhhcccccC Q lcl|NC_013692. 375 KRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVTN--RRRFDRGENYEFNPGADPRAAVHMHTF 451 (726) Q Consensus 375 ~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d--~~~~~~g~vi~~~~~~~~~~~i~~~~~ 451 (726) ..+|.|.+..++++++.+|..++.+.+.+...++|.+.+ .|.- ...+ ......++++.+..+++ +.+... T Consensus 248 --n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~ 320 (479) T protein:vir:79 248 --NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVL-KEYPGTSLQEFIDNIRYYKSIKVDGGGG----VDKLEI 320 (479) T ss_pred --CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccccchhhhhhccceecCCCCc----ceEEec Confidence 456889999999999999999999999999998887765 4432 1111 12234566676665543 334444 Q ss_pred ccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_013692. 452 PEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFL 531 (726) Q Consensus 452 ~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~ 531 (726) +.........++.+...+...|++++++.|..++ .|+.|+..............-+.|..+++.+++.++.++.... T Consensus 321 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn---~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 397 (479) T protein:vir:79 321 NIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGD---KSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG 397 (479) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCccccccccccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 4445667778888888888999999887764332 3666666665555555555666666677666666555442211 Q ss_pred CcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhH Q lcl|NC_013692. 532 DDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRI 611 (726) Q Consensus 532 d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l 611 (726) . ..++..+. .+..+.....-.....+.+..+ ...++... .+. .+..+.+....+ T Consensus 398 ~------------~~~~~~~i----~i~f~~~~p~~~~~~a~~~~kl----~g~iS~et---~l~---~l~~v~d~~~E~ 451 (479) T protein:vir:79 398 N------------KSYDYKTV----QITFNHSMIINEAEKIDMAAKS----TGIVSDET---IVS---NHPWVEDVNDEL 451 (479) T ss_pred C------------Cccccccc----eEEeCCCCCcCHHHHHHHHHHH----hccCcHHH---HHH---hCCCCCCHHHHH Confidence 0 01111111 1111111111111111211111 11111111 111 111111111111 Q ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 612 REFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDS 656 (726) Q Consensus 612 ~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~ 656 (726) +....+.....+... ... ........++ T Consensus 452 ~ri~~E~~~~~~~~~--------------~~~---~~~~~~~~e~ 479 (479) T protein:vir:79 452 ERLKKQEDTQKEYDD--------------LIP---NNQDGVIDET 479 (479) T ss_pred HHHHHHHHHHHHHHh--------------ccC---cccCCCcCcC Confidence 100000000000000 000 0000000000 No 103 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.32 E-value=1.1e-10 Score=75.16 Aligned_cols=447 Identities=10% Similarity=0.047 Sum_probs=199.9 Q ss_pred Cccchhc-CCCCCCchHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC--CC-------CC--CCc Q lcl|NC_013692. 16 GDPSKRL-QPEWSNAPSL-------AQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPK--TE-------KG--KSA 76 (726) Q Consensus 16 ~~~~~~~-~~~~~~~~~~-------~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~--~~-------~g--rs~ 76 (726) +++.-+. +|+=.++..+ ......|..-.+.|..++....+..+||.|.-+.... .. +. ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWR 80 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccc Confidence 2221111 1211111111 1222223333345666666678889999886433211 11 11 226 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 77 VQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 77 ~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) ++.+..+-.|+.....| ||.. +.|.+ +|... ...++..+ .++..+.+..++++++.+|.+.+.+| T Consensus 81 i~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~~~~~----~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~ 145 (474) T protein:vir:96 81 ITTNFHQNLVDQKVSYV----AGKP--VTYAH---DDDKV----LDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVY 145 (474) T ss_pred cccchHHHHHHhhhhhh----cccC--ceecc---CChHH----HHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEee Confidence 77788788888777555 4433 34433 33322 23555544 35666778889999999999999887 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) ++.. T Consensus 146 ~d~~---------------------------------------------------------------------------- 149 (474) T protein:vir:96 146 INED---------------------------------------------------------------------------- 149 (474) T ss_pred eCCC---------------------------------------------------------------------------- Confidence 6410 Q ss_pred eccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCC Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQD 316 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (726) +.|++..++|.++|+=.+.. ...+..+ +.+.|... + T Consensus 150 -~~~~i~~~~p~~~~~v~d~~-~~~~~~a-~ir~~~~~----------~------------------------------- 185 (474) T protein:vir:96 150 -GELKLFRVPAEQAIPIWTDK-EREQLNA-FIRIFTFN----------G------------------------------- 185 (474) T ss_pred -CceEEEEEcccceEEEEcCC-CCCceEE-EEEEEeec----------C------------------------------- Confidence 01234556777766433211 1122222 22222100 0 Q ss_pred cCCceEEEEEEEEE-----eecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHH Q lcl|NC_013692. 317 KSRKRLVVHEYWGY-----YDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQ 391 (726) Q Consensus 317 ~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 391 (726) ...+|.|.. +...+.+... .+..+.........|.+.+.+|++.++. ...|.|.++.+++++ T Consensus 186 -----~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~li 252 (474) T protein:vir:96 186 -----ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFV 252 (474) T ss_pred -----eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHH Confidence 000111110 0001111100 0011111111122333446667665543 456889999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEeecccc-cch-h-hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHH Q lcl|NC_013692. 392 RIIGAVTRGMIDTMARSANGQVGVMKGAL-DVT-N-RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAE 468 (726) Q Consensus 392 ~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~-d-~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~ 468 (726) +.+|.+.|.+.+.+...++|.+.+ .|.- +.. + .......+++.+..+++ +.+...+.........+..+... T Consensus 253 Da~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l~~~ 327 (474) T protein:vir:96 253 DAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLKYYKAINVSSDGG----VETIQVEVPVASTKEYLDMMRAY 327 (474) T ss_pred HHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhhccceeeccCCCc----eeEEeccCCHHHHHHHHHHHHHH Confidence 999999999999999999887654 4532 111 1 12234445666655443 44555555566778888899999 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecc Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIR 548 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~ 548 (726) +-..|++++.+.+..++ +.||.++..............-+.|..+++++++.++++ +... ++ T Consensus 328 I~~~s~~p~~~~~~~~~--n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~----~g~~------------~d 389 (474) T protein:vir:96 328 IVEFGQGVDFQTDKFGS--ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDF----NKIK------------LD 389 (474) T ss_pred HHHHhCCcCcccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCCC------------cc Confidence 99999999877543322 246666766666666666666666777777666655543 2210 00 Q ss_pred hhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHH Q lcl|NC_013692. 549 RDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQL 628 (726) Q Consensus 549 ~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~ 628 (726) ..++.-.| +.....-..+..+. +. + . .-+..+. .+. .+....+....++....+.....++. T Consensus 390 ~~~i~i~f----~~~~p~~~~e~a~~-~~--~-~-giiS~et---~~~---~lp~v~D~~~E~eri~~E~~~~~~~~--- 451 (474) T protein:vir:96 390 AKEIEITF----NFNVMVNDLEQSQI-GA--Q-S-QYLSKET---LVR---HHPWVDDPKAELERLDEEQLELNKQL--- 451 (474) T ss_pred cceeeEEe----cCCCccCHHHHHHH-HH--H-c-CCCChHH---HHH---hCCCCCCHHHHHHHHHHHHHHHHhhc--- Confidence 01111111 11111101111111 10 0 0 1111110 000 01111111111111000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 629 ELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALA 669 (726) Q Consensus 629 e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~ 669 (726) . ........... ...+....+.+ T Consensus 452 --------------~---~~~~~~~~~~~-~~~~~~~~e~~ 474 (474) T protein:vir:96 452 --------------P---NLDDGGADGAQ-QQQQSENNQSK 474 (474) T ss_pred --------------c---ccccccCCCCC-CcCCCCccccC Confidence 0 00000000000 00000000000 No 104 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.32 E-value=1.1e-10 Score=75.16 Aligned_cols=447 Identities=10% Similarity=0.047 Sum_probs=199.9 Q ss_pred Cccchhc-CCCCCCchHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCC--CC-------CC--CCc Q lcl|NC_013692. 16 GDPSKRL-QPEWSNAPSL-------AQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPK--TE-------KG--KSA 76 (726) Q Consensus 16 ~~~~~~~-~~~~~~~~~~-------~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~--~~-------~g--rs~ 76 (726) +++.-+. +|+=.++..+ ......|..-.+.|..++....+..+||.|.-+.... .. +. ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWR 80 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccc Confidence 2221111 1211111111 1222223333345666666678889999886433211 11 11 226 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEe Q lcl|NC_013692. 77 VQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVG 156 (726) Q Consensus 77 ~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~ 156 (726) ++.+..+-.|+.....| ||.. +.|.+ +|... ...++..+ .++..+.+..++++++.+|.+.+.+| T Consensus 81 i~~n~~k~Iv~~~~~yl----~g~p--~~~~~---~~~~~----~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~ 145 (474) T protein:vir:95 81 ITTNFHQNLVDQKVSYV----AGKP--VTYAH---DDDKV----LDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVY 145 (474) T ss_pred cccchHHHHHHhhhhhh----cccC--ceecc---CChHH----HHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEee Confidence 77788788888777555 4433 34433 33322 23555544 35666778889999999999999887 Q ss_pred eeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccccee Q lcl|NC_013692. 157 WNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETV 236 (726) Q Consensus 157 w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (726) ++.. T Consensus 146 ~d~~---------------------------------------------------------------------------- 149 (474) T protein:vir:95 146 INED---------------------------------------------------------------------------- 149 (474) T ss_pred eCCC---------------------------------------------------------------------------- Confidence 6410 Q ss_pred eccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCC Q lcl|NC_013692. 237 ENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQD 316 (726) Q Consensus 237 ~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (726) +.|++..++|.++|+=.+.. ...+..+ +.+.|... + T Consensus 150 -~~~~i~~~~p~~~~~v~d~~-~~~~~~a-~ir~~~~~----------~------------------------------- 185 (474) T protein:vir:95 150 -GELKLFRVPAEQAIPIWTDK-EREQLNA-FIRIFTFN----------G------------------------------- 185 (474) T ss_pred -CceEEEEEcccceEEEEcCC-CCCceEE-EEEEEeec----------C------------------------------- Confidence 01234556777766433211 1122222 22222100 0 Q ss_pred cCCceEEEEEEEEE-----eecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHH Q lcl|NC_013692. 317 KSRKRLVVHEYWGY-----YDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQ 391 (726) Q Consensus 317 ~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 391 (726) ...+|.|.. +...+.+... .+..+.........|.+.+.+|++.++. ...|.|.++.+++++ T Consensus 186 -----~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~li 252 (474) T protein:vir:95 186 -----ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFV 252 (474) T ss_pred -----eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHH Confidence 000111110 0001111100 0011111111122333446667665543 456889999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCceEeecccc-cch-h-hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHH Q lcl|NC_013692. 392 RIIGAVTRGMIDTMARSANGQVGVMKGAL-DVT-N-RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAE 468 (726) Q Consensus 392 ~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~-d-~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~ 468 (726) +.+|.+.|.+.+.+...++|.+.+ .|.- +.. + .......+++.+..+++ +.+...+.........+..+... T Consensus 253 Da~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l~~~ 327 (474) T protein:vir:95 253 DAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLKYYKAINVSSDGG----VETIQVEVPVASTKEYLDMMRAY 327 (474) T ss_pred HHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhhccceeeccCCCc----eeEEeccCCHHHHHHHHHHHHHH Confidence 999999999999999999887654 4532 111 1 12234445666655443 44555555566778888899999 Q ss_pred HHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecc Q lcl|NC_013692. 469 AESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIR 548 (726) Q Consensus 469 ~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~ 548 (726) +-..|++++.+.+..++ +.||.++..............-+.|..+++++++.++++ +... ++ T Consensus 328 I~~~s~~p~~~~~~~~~--n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~----~g~~------------~d 389 (474) T protein:vir:95 328 IVEFGQGVDFQTDKFGS--ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDF----NKIK------------LD 389 (474) T ss_pred HHHHhCCcCcccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCCC------------cc Confidence 99999999877543322 246666766666666666666666777777666655543 2210 00 Q ss_pred hhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHH Q lcl|NC_013692. 549 RDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQL 628 (726) Q Consensus 549 ~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~ 628 (726) ..++.-.| +.....-..+..+. +. + . .-+..+. .+. .+....+....++....+.....++. T Consensus 390 ~~~i~i~f----~~~~p~~~~e~a~~-~~--~-~-giiS~et---~~~---~lp~v~D~~~E~eri~~E~~~~~~~~--- 451 (474) T protein:vir:95 390 AKEIEITF----NFNVMVNDLEQSQI-GA--Q-S-QYLSKET---LVR---HHPWVDDPKAELERLDEEQLELNKQL--- 451 (474) T ss_pred cceeeEEe----cCCCccCHHHHHHH-HH--H-c-CCCChHH---HHH---hCCCCCCHHHHHHHHHHHHHHHHhhc--- Confidence 01111111 11111101111111 10 0 0 1111110 000 01111111111111000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 629 ELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALA 669 (726) Q Consensus 629 e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~ 669 (726) . ........... ...+....+.+ T Consensus 452 --------------~---~~~~~~~~~~~-~~~~~~~~e~~ 474 (474) T protein:vir:95 452 --------------P---NLDDGGADGAQ-QQQQSENNQSK 474 (474) T ss_pred --------------c---ccccccCCCCC-CcCCCCccccC Confidence 0 00000000000 00000000000 No 105 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.31 E-value=1.5e-11 Score=79.89 Aligned_cols=577 Identities=11% Similarity=0.030 Sum_probs=177.5 Q ss_pred HHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHH---HHHHhhc----------ccchhHH-HHHHHHHhh Q lcl|NC_013692. 82 IRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVL---NQQFNTK----------LNKQRFI-DEYVRAGVD 147 (726) Q Consensus 82 v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~---n~~~~~~----------~~~~~~~-~~~~~~~l~ 147 (726) -.+....++..+++-|...-.+ .++=...|.....|. ..+|... ..|+..+ .+.++-.+ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~------~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v- 73 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSP------QEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTEL- 73 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhh------hHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHH- Confidence 2222233333333333211100 001011111122222 1122111 1122211 12222222 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +.-.+.++..++.++++|+...+...+++...-+..+ ..+........+.+|......|.||..+.. T Consensus 74 ----------~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~-- 140 (720) T protein:vir:35 74 ----------NRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRA-DYEETDGGEACDNAFDDGSTGGFGCFRLTT-- 140 (720) T ss_pred ----------HHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHH-HHHhcCchHHHhHHHHHhhhccceeEEeee-- Confidence 2223445667888888887443333333333333321 122445667889999999999999855421 Q ss_pred eeecccceeeccceeeeechhheeeCCCCCCchhhC---CeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKA---KFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da---~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) .++. ..+|+.+.-+...++..+++ -|=-..+..+.+|.. |-.+..+...+.....-++. T Consensus 141 d~~~------------~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar----~~~~~~~~~~d~~~~~yp~~-- 202 (720) T protein:vir:35 141 NLVN------------ALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAE----WAFCMYSLSAEKYKAEYNKD-- 202 (720) T ss_pred cccc------------cCCCCcccceeeEecccCchhheeecccccccChhhhh----hhhhhcCCCHHHHHHhCCCc-- Confidence 1110 01122111000000000111 111112223333311 00011110000000000000 Q ss_pred hhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEE--------EEE----ECCEEEEeccCC-------------- Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIV--------ATW----VGAVMIRMEENP-------------- 358 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~--------~~~----~g~~~l~~~~~P-------------- 358 (726) ..... .+.. .-.+++ | ++.+..-+.++++ +++ +|..+.....++ T Consensus 203 -a~~~~----~~~~--~~~~~d-~--~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 272 (720) T protein:vir:35 203 -PATLM----SGIE--RSWDYD-W--YDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIE 272 (720) T ss_pred -ccccc----cccc--cccccc-c--cCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhcccc Confidence 00000 0000 001111 1 1111111222211 111 122222111111 Q ss_pred -----------C-----CC------CccceEEeeeeeecCcccC-CCh---HHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_013692. 359 -----------F-----PD------KRIPYVVVNYIPRKRDLYG-ESD---GALLIDNQRIIGAVTRGMIDTMARSANGQ 412 (726) Q Consensus 359 -----------~-----~~------~~~Pf~~~~~~~~~~~~~g-~g~---~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~ 412 (726) | .| +.+||-.|++.|..+..+. .|. .-.++++-+...-+...+...++.. T Consensus 273 ~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~---- 348 (720) T protein:vir:35 273 AARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSA---- 348 (720) T ss_pred ccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHH---- Confidence 0 00 1134444444443332221 111 2333444444444443333333322 Q ss_pred eEeecccccchhh--hhhcCCceEeecCccc------------hhhhcccccCc----cchhHHHHHHHHHHHHHHHHhc Q lcl|NC_013692. 413 VGVMKGALDVTNR--RRFDRGENYEFNPGAD------------PRAAVHMHTFP----EIPQSAQYMINLQQAEAESMTG 474 (726) Q Consensus 413 ~~~~~gav~~~d~--~~~~~g~vi~~~~~~~------------~~~~i~~~~~~----~~~~~~~~ll~~~~~~~e~~tG 474 (726) ....+..... ..+..-.--+.+++.. ..+.+...+.+ +.+.-....++++......+.. T Consensus 349 ---~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~ 425 (720) T protein:vir:35 349 ---TQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQE 425 (720) T ss_pred ---HcCCccccccCcchHHHHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHH Confidence 1111111000 0000000011122111 01122111111 1234555666666666665543 Q ss_pred hHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcc Q lcl|NC_013692. 475 VKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLA 553 (726) Q Consensus 475 v~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~ 553 (726) .+|.++..+|..+. +++...++.+.... ....|.+.++...+.+-+++..+...- .+.+-.+.|..++-. T Consensus 426 ----vsGi~~~~lG~~sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~----y~~er~~RI~~ed~~ 496 (720) T protein:vir:35 426 ----VTGSSQAMQPMPSN-IAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAREV----YGSDRQVRIVNADGT 496 (720) T ss_pred ----HhCCChHHcCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCC Confidence 35777777776544 67877666655543 455666777777777777666543321 123334555443322 Q ss_pred ccc-------------------ceee---ecccchH-HHHHHHH-HHHHHHHhhhccchhHH-HH-HHHHHHHhhhhhhh Q lcl|NC_013692. 554 GNF-------------------DLKL---DISTAEE-DNAKVND-LTFMLQTMGPNMDPMMA-QQ-IMGQIMELKKMPDF 607 (726) Q Consensus 554 ~~~-------------------dv~i---~~~~~~~-~~~~~~~-l~~l~q~~~~~~~~~~~-~~-~~~~~~~~~~~~e~ 607 (726) .++ |++. .+..... ...-..+ ....+..+.+.+++... .. ++..+++.+.++.. T Consensus 497 ~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~ 576 (720) T protein:vir:35 497 DDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGL 576 (720) T ss_pred cceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhH Confidence 111 1110 0111111 1111111 12222222233433332 22 22334555555554 Q ss_pred hhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 608 AKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQ 687 (726) Q Consensus 608 ~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~ 687 (726) .+..........+..+.. +...+ .++++..++.+.++...+....++.+.+.+++. .++++.....+....+...+ T Consensus 577 ~e~~erirk~~~~~~~~~-~~~~e-~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~--~kaqa~~~~~qa~a~~aqa~ 652 (720) T protein:vir:35 577 DEFKEYNRKQLLTQGVVK-PRNTE-EEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEV--QKAKNEELAIQVKAFQAQTE 652 (720) T ss_pred HHHHHHHHhhcchhcccC-ccChh-HHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH Confidence 443322222111111111 11111 111111111122222222222222222222222 22222222222222221111 Q ss_pred HHHHH-HHHHHHHHHHHH-HHHHHHHHH--------HHH-HHHHHHHhcC Q lcl|NC_013692. 688 QARKR-ELQQAQSEAQGK-LAMLNSQLK--------RLD-EATSARTSQK 726 (726) Q Consensus 688 ~~~e~-e~~~~q~~~q~~-~~~l~~~~~--------~~~-~~~~a~~~~q 726 (726) .+.+. ......++.+.. +......++ +-+ ++..++..++ T Consensus 653 a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~ 702 (720) T protein:vir:35 653 ARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILK 702 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhc Confidence 11111 111111111100 000000010 000 1111111111 No 106 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.31 E-value=4.3e-11 Score=77.41 Aligned_cols=455 Identities=14% Similarity=0.040 Sum_probs=194.1 Q ss_pred CccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCC---CCCcCCCHHHHHHHHHHH Q lcl|NC_013692. 16 GDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEK---GKSAVQPPTIRKQAEWRY 90 (726) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~---grs~~v~~~v~~~v~~~~ 90 (726) ++-.-++...-..+. .+...|- ..+..+.....+-.+||+|....+ ++..+ .+-+++..-.+..|+.+. T Consensus 1 ~~~~i~~~~~~~~~~---~~~~~l~---~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~ 74 (485) T protein:vir:10 1 MTAPLPGQEEIEDPA---IARDEMV---SAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIA 74 (485) T ss_pred CCCCCCCCCCCCCHH---HHHHHHH---HHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHH Confidence 333333322222222 2222222 244455555677899998866431 11111 122344566677777666 Q ss_pred HHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 91 SSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 91 ~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) ..| ++ + -|. .++|.+..+ .++.+| ..|+.-.....++++++++|.+.+.+|.+..... T Consensus 75 ~~l---~~--~---g~~--~~~~~~~~~----~~~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~------- 132 (485) T protein:vir:10 75 ERQ---AV--E---GFR--FGDADEADE----ELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQID------- 132 (485) T ss_pred hhh---cc--c---cee--cCCCchhHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccc------- Confidence 554 11 1 122 133433333 445555 4566556777899999999999998766511000 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) . ..-.+.|.|..++|.++ T Consensus 133 --------------------------~------------------------------------~~~~~~~~i~~~~p~~~ 150 (485) T protein:vir:10 133 --------------------------L------------------------------------GWDPNTPIIRVEPPTRM 150 (485) T ss_pred --------------------------c------------------------------------ccCCCeeEEEEEcccee Confidence 0 00012345677888886 Q ss_pred e--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 251 V--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 251 ~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) + |||... .-.+.+++.+ .+ ....++.+++| T Consensus 151 ~~~~D~~~~----~~~~~~~~~~------------~~--------------------------------~~~~~~~~~~y 182 (485) T protein:vir:10 151 YAEIDPRIG----RVSKAIRVAY------------DA--------------------------------EGNEIQAATLY 182 (485) T ss_pred EEEEcCCCC----ceeEEEEEEE------------ee--------------------------------CCCeEEEEEEE Confidence 4 555321 1111111110 00 00112233333 Q ss_pred EEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH-HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 329 GYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA-LLIDNQRIIGAVTRGMIDTMAR 407 (726) Q Consensus 329 ~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~-~~~d~Q~~~N~~~~~~~d~l~~ 407 (726) .. +.+ +.....++........|.+.+.+|+++|...+..+..+|.|.+. .++++++.+|+.++.+.+.+.. T Consensus 183 ~~-----~~~---~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~ 254 (485) T protein:vir:10 183 TP-----NDI---FGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAEL 254 (485) T ss_pred eC-----CeE---EEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHh Confidence 21 111 11111122222223445566889999999999999999999886 5899999999999999999998 Q ss_pred cCCCceEeeccc-cc---ch-----hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHH---hch Q lcl|NC_013692. 408 SANGQVGVMKGA-LD---VT-----NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESM---TGV 475 (726) Q Consensus 408 ~~~~~~~~~~ga-v~---~~-----d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~---tGv 475 (726) .+.|+..+- |. .+ .. ......+|.++... +.++ .+.+++. ..+...++.+...+..+ |++ T Consensus 255 ~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~d~----k~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~ 326 (485) T protein:vir:10 255 MGVPQRLIF-GIKPEEIGVDPETGQTLFDAYLARILAFE-DAEG----KIQQFSA--AELANFTNALDQIAKQVAAYTGL 326 (485) T ss_pred hcchHHHHh-cCCcccccccccccchhhhhcccceeccC-CCCc----eEEeecc--cchHHHHHHHHHHHHHHhcccCC Confidence 888876542 22 11 11 11223455555443 2222 2223332 23445566666666665 666 Q ss_pred HHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccc Q lcl|NC_013692. 476 KAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGN 555 (726) Q Consensus 476 ~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~ 555 (726) ++...|..+.. +.||.++...............+.|..+++++++.++.+...-........|. -.|-+..+.... T Consensus 327 p~~~fg~~~~n-~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~-v~w~~~~~~~~~-- 402 (485) T protein:vir:10 327 PPQYLSTAADN-PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRME-TVWRDPSTPTYA-- 402 (485) T ss_pred CHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeee-EEecCCCCCCHH-- Confidence 66777743321 23566666666555555566666666677666665554321100000000100 001111111111 Q ss_pred cceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHH----------HHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhH Q lcl|NC_013692. 556 FDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQ----------IMGQIMELKKMPDFAKRIREFQPQPDPIAQQK 625 (726) Q Consensus 556 ~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~----------~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~ 625 (726) +.......+.+.....+....... .+..+.+...... ...+..+- .+.....-. T Consensus 403 --------------~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~-~~~~~~~~-~~~~~~~~~ 466 (485) T protein:vir:10 403 --------------AKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMG-LGLIGTMV-DPNPTVPGS 466 (485) T ss_pred --------------HHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHH-HHHHHHhh-ccCCCCCCC Confidence 111111111111000011000000 0000000000000 00000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 626 AQLELMLLQAQIEAERARAA 645 (726) Q Consensus 626 ~q~e~q~~qaq~e~~~aq~q 645 (726) .+.+.+..... ...--... T Consensus 467 ~~~~~~~~~~~-~~~~~~~~ 485 (485) T protein:vir:10 467 PSPAPAPKPAA-LESGGDAA 485 (485) T ss_pred CCccccccCcC-CCCCCCCC Confidence 00000000000 00000000 No 107 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.29 E-value=6.3e-12 Score=81.98 Aligned_cols=465 Identities=12% Similarity=0.032 Sum_probs=188.9 Q ss_pred CccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCCC---CCCcCCCHHHHHHHHHHH Q lcl|NC_013692. 16 GDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTEK---GKSAVQPPTIRKQAEWRY 90 (726) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~---grs~~v~~~v~~~v~~~~ 90 (726) .+-. ++-+.+......+-..|- ..+..+.....+-.+||+|....+. .... .+-++|..-.+..|+.+. T Consensus 1 ~~~~---~~~~~e~~~~~~~~~~l~---~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~ 74 (486) T protein:vir:42 1 MTAP---LPGMEEIEDPAVVREEMI---SAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVA 74 (486) T ss_pred CCCC---CCCCCCcccHHHHHHHHH---HHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHH Confidence 1111 222322222222222221 1344455556667889987654320 0000 111334445555555444 Q ss_pred HHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccc Q lcl|NC_013692. 91 SSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVT 170 (726) Q Consensus 91 ~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~ 170 (726) -.| .|. -|. .+++.... ..++.+| ..|+.-.....++++++++|.+.+.+|.+..... T Consensus 75 ~~l--~~~------g~~--~~~~~~~~----~~~~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~------- 132 (486) T protein:vir:42 75 ERQ--AVE------GFR--LGDADEAD----EELWQWW-QANNLDIEAPLGYTDAYVHGRSFITISKPDPQLD------- 132 (486) T ss_pred hhh--ccc------cee--cCCCchhH----HHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc------- Confidence 333 111 121 12222222 2345554 4566656677899999999999998866411000 Q ss_pred cccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe Q lcl|NC_013692. 171 YEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI 250 (726) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~ 250 (726) + . .....|.+..++|.++ T Consensus 133 ---~-----------------~------------------------------------------~~~~~~~i~~~~p~~~ 150 (486) T protein:vir:42 133 ---L-----------------G------------------------------------------WDQNVPIIRVEPPTRM 150 (486) T ss_pred ---c-----------------c------------------------------------------cCCCeeEEEEecccce Confidence 0 0 0012245567788875 Q ss_pred e--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEE Q lcl|NC_013692. 251 V--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYW 328 (726) Q Consensus 251 ~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 328 (726) + |||... . ..++.+++.+ + . .+.+..+++| T Consensus 151 ~~i~d~~~~----~-~~~~~~~~~~--~---~--------------------------------------~~~~~~~~~y 182 (486) T protein:vir:42 151 HAEIDPRIN----R-VSKAIRVAYD--K---E--------------------------------------GNEIQAATLY 182 (486) T ss_pred EEEEeCCCC----C-eEEEEEEEEe--c---C--------------------------------------CCeEEEEEEE Confidence 5 555321 1 1112222100 0 0 0112333444 Q ss_pred EEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHH-HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 329 GYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGA-LLIDNQRIIGAVTRGMIDTMAR 407 (726) Q Consensus 329 ~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~-~~~d~Q~~~N~~~~~~~d~l~~ 407 (726) .. +. .+..+..++........|.+.+.+|+++|...+..+..+|.|.+. .++++++.+|+.++.+.+++.. T Consensus 183 ~~-----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~ 254 (486) T protein:vir:42 183 TP-----ME---TIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAEL 254 (486) T ss_pred cC-----Cc---EEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHh Confidence 21 11 111111122222223345555889999999888889999999987 5889999999999999999888 Q ss_pred cCCCceEeeccc----ccchh-----hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHH---hch Q lcl|NC_013692. 408 SANGQVGVMKGA----LDVTN-----RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESM---TGV 475 (726) Q Consensus 408 ~~~~~~~~~~ga----v~~~d-----~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~---tGv 475 (726) .+.|+..+- |. +...+ .....+|.++... +.++ .+.+++. ..+..++..+...+..+ +++ T Consensus 255 ~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~q~~~--~~~e~~~~~l~~~i~~~s~~~~~ 326 (486) T protein:vir:42 255 MGVPQRLIF-GIKPEEIGVDSETGQTLFDAYLARILAFE-DAEG----KIQQFSA--AELANFTNALDQIAKQVAAYTGL 326 (486) T ss_pred hcchHHHhh-cCCccccccccccccchhhhhhchhcccC-CCCc----eEEeecc--cCHHHHHHHHHHHHHHHhcccCC Confidence 888876542 32 11111 1122345554432 2222 2233332 23455666777666665 666 Q ss_pred HHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccc Q lcl|NC_013692. 476 KAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGN 555 (726) Q Consensus 476 ~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~ 555 (726) ++...|..+. ...||.++...............+.|..+++++++.++.+. .... +. .++..+ T Consensus 327 p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~----~~~~-~~---~d~~~i-------- 389 (486) T protein:vir:42 327 PPQYLSTAAD-NPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIM----KGGD-VP---PDMLRM-------- 389 (486) T ss_pred CHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCC-cc---ccceee-------- Confidence 6666664332 12366667666666666666666777777777776655532 1100 00 000000 Q ss_pred cceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh-hhhhhhHHHHHhhhhhhhhhHHHHHHHHHH Q lcl|NC_013692. 556 FDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKM-PDFAKRIREFQPQPDPIAQQKAQLELMLLQ 634 (726) Q Consensus 556 ~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~~l~~~~~~~~~~~qq~~q~e~q~~q 634 (726) .++-.........+.......+.+.....+..... ..+ .+. ++..+.++....+....... .+ . T Consensus 390 -~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~----~~~---lg~~~d~~~e~~~~~~e~~~~~~~--~~-----~ 454 (486) T protein:vir:42 390 -ETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERA----RID---MGYSVKEREEMRRWDEEEAAMGLG--LL-----G 454 (486) T ss_pred -eEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHH----Hhc---CCCChhHHHHHHHHHHHHHHHHHH--HH-----H Confidence 00000000000011111111111111000111100 000 000 00000000000000000000 00 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 635 AQIEAERARAAHYMSGAGLQDSKVGTEQAKARA 667 (726) Q Consensus 635 aq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q 667 (726) +...... ..+...........+....++.-.. T Consensus 455 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 455 TMVDADP-TVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred HhhcCCC-CCCCCCCCCCCCCCCcccCCCCCCC Confidence 0000000 0000000000000000000000000 No 108 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.24 E-value=1.5e-11 Score=79.85 Aligned_cols=452 Identities=13% Similarity=0.063 Sum_probs=185.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCC-C--CCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEE Q lcl|NC_013692. 32 LAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTE-K--GKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEV 106 (726) Q Consensus 32 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~-~--grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~ 106 (726) +++-...|..-...|........+..+||.|.-..+- ... . ..-++|....+..|+...-.|. | + -| T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~--~---~---g~ 72 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--I---E---GF 72 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhc--c---C---ce Confidence 2222222222223455555556788899987654310 000 0 1224566666666666555441 1 1 11 Q ss_pred ecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhh Q lcl|NC_013692. 107 NPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQ 186 (726) Q Consensus 107 ~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (726) ..++|.+..+ .+..+| ..|+.-......+++++++|.+++.+|=. .. T Consensus 73 --~~~~d~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~~~--------~~------------------ 119 (480) T protein:vir:78 73 --RISEDSEGLE----ELWNWW-QANDLDEESVLGHDDSLTFGRSYITVSHP--------DV------------------ 119 (480) T ss_pred --ecCCCchhHH----HHHHHH-HhcCHHHHHHHHHHHHhhcCceEEEEecC--------cc------------------ Confidence 1334444433 455555 46666667778999999999998766310 00 Q ss_pred hhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee--eCCCCCCchhhCC Q lcl|NC_013692. 187 TAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKAK 264 (726) Q Consensus 187 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da~ 264 (726) .+++ -.+.|.+..++|.+++ |||.... ... T Consensus 120 -------~~~d--------------------------------------~~g~~~i~~~~p~~~~~~~D~~~~~---~~~ 151 (480) T protein:vir:78 120 -------ESGD--------------------------------------PAGIPLIRVESPLYMYAELDPRNTR---RVT 151 (480) T ss_pred -------ccCC--------------------------------------CCCeeEEEEEcccceEEEEcCCCcc---ceE Confidence 0000 0122445667777755 4443321 112 Q ss_pred eEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEE Q lcl|NC_013692. 265 FLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVA 344 (726) Q Consensus 265 ~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~ 344 (726) +.++ .+.+.++ ...+..+++|.. +.+.+ . T Consensus 152 ~~i~-~~~~~~~------------------------------------------~~~~~~~~~y~~-----~~~~~---~ 180 (480) T protein:vir:78 152 RAVR-LYTTRDD------------------------------------------VAVPDRATLYLP-----DETVP---L 180 (480) T ss_pred EEEE-EEEeecC------------------------------------------CCceEEEEEEeC-----CeEEE---E Confidence 2222 1111100 001222333321 11111 1 Q ss_pred EEECC----EEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc Q lcl|NC_013692. 345 TWVGA----VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL-LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA 419 (726) Q Consensus 345 ~~~g~----~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~-~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga 419 (726) ...++ .+....+.|.+.+.+|+++|...+..+.++|.|.+.. ++++++.+|..++.+.+.+...+.|...+ .|. T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~ 259 (480) T protein:vir:78 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGV 259 (480) T ss_pred EecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hcC Confidence 11111 1122233455568899999998888898999998875 89999999999999999999888887655 343 Q ss_pred c-cch-h-----hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHH---HHhchHHHhhccCcccchh Q lcl|NC_013692. 420 L-DVT-N-----RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAE---SMTGVKAFNAGISGAALGD 489 (726) Q Consensus 420 v-~~~-d-----~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e---~~tGv~~~~~G~~~~~~~~ 489 (726) - +.. + ......|.++... +..+ .+.+++.. .+...++.+...+. ..||++....|..+. .+. T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~ 331 (480) T protein:vir:78 260 TTDELTNDGENTTLDIYYGRILTLA-SEAA----KISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPA 331 (480) T ss_pred CccccccccccchhhhhhhhhccCC-CCCc----eEEecCcc--CHHHHHHHHHHHHHHHhcccCCChHHhccccC-cch Confidence 1 110 1 0122234343332 2222 12233321 23334455555544 457777777774332 113 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecc-cchHH Q lcl|NC_013692. 490 TATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS-TAEED 568 (726) Q Consensus 490 ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~-~~~~~ 568 (726) ||.++...............+.|..+++++++.++.+ ...... .++..+ +++-... ... . T Consensus 332 Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~----~g~~~~-----~~~~~i---------~v~f~~~~~~s-~ 392 (480) T protein:vir:78 332 SAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI----MGREVT-----EEYTRL---------ETVWRDPSTPT-V 392 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcc-----ccceee---------eEEecCCCCCC-H Confidence 5666666555545555555566666666665555443 221100 011111 0110000 000 0 Q ss_pred HHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh-hhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 569 NAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKM-PDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHY 647 (726) Q Consensus 569 ~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~ 647 (726) .+....+.++.+.....+...... +..+. ++..+.+.....+. .+. .+.. +.+. ....+..... T Consensus 393 ~~~ad~~~kl~~~g~~~~s~et~~-------~~lg~~~d~~~~~~~~~~e~--~~~---~~~~--~~~~-~~~~~~~~~~ 457 (480) T protein:vir:78 393 AAKADAVSKLYANGQGPIPKEQAR-------IDLGYTATQREQMRDWDKQE--TED---MIDT--LYST-TKAQADATPK 457 (480) T ss_pred HHHHHHHHHHHHhccccCCHHHHH-------hcCCCCHhHHHHHHHHHHHH--HHH---HHHH--hhcc-ccccCCCCCC Confidence 111112222222111111111110 00010 00000010000000 000 0000 0000 0000000000 Q ss_pred H----HHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 648 M----SGAGLQDSKVGTEQAKAR 666 (726) Q Consensus 648 ~----~~~~~~~~~~~~eqaq~~ 666 (726) . .....+.+....-..+.+ T Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 458 PTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred CCCCCCCCccccccCCCCcccCC Confidence 0 000000000000000000 No 109 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.22 E-value=2.9e-10 Score=72.91 Aligned_cols=451 Identities=13% Similarity=0.097 Sum_probs=177.1 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCCCCCC--- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTEKGKS--- 75 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~grs--- 75 (726) |.+. |. .++..+.....|..++- .-|...+...++..+||.|...... +....+. T Consensus 1 ~~~~-------p~----------~~l~~~~~~~~~~~~l~---~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 60 (479) T protein:vir:99 1 MIDL-------PD----------EDLSSEGLAKYLETKVF---PKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREV 60 (479) T ss_pred CccC-------Cc----------ccCChhHHHHHHHHHHH---HHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHH Confidence 2211 11 12222222222322222 2444555556778899988764321 1111110 Q ss_pred ---cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 76 ---AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 76 ---~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) ..|....+..|+.+...| .|.+.+..|.+..+. +..+| ..|+.-.....++++++++|.++ T Consensus 61 ~~~~~~~n~~~~iVd~~~~~l-----------~~~gf~~~d~~~~~~----~~~i~-~~N~~d~~~~~~~~~a~~~G~af 124 (479) T protein:vir:99 61 LQQLSRKPWMGLMVNSFAQQL-----------IVDGYRKTGTNENAK----GWDTW-RLNQMDKQQFWLNRAVLTFGYAF 124 (479) T ss_pred HHHHhhcCcHHHHHHHHHhhc-----------ccccccCCCchhhHH----HHHHH-HhcChhHHHHHHHHHHhhcCceE Confidence 113334444444333222 133334444444443 33444 34554455667889999999998 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +.+|+..... ++ T Consensus 125 ~~v~~~~~~~-----------------------------------------------------d~--------------- 136 (479) T protein:vir:99 125 IKVTSGISPL-----------------------------------------------------DG--------------- 136 (479) T ss_pred EEEecCCCCc-----------------------------------------------------CC--------------- Confidence 8765420000 00 Q ss_pred cceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccc Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVR 310 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 310 (726) ...|.+..++|.+++. |..... .-..|.. +. +. T Consensus 137 ----~g~~~i~~~~p~~~~~iydd~~~~--~~~~~~~-----~~--------~~-------------------------- 171 (479) T protein:vir:99 137 ----TTVARIKCIDPRDAFAIWEDPYWD--EWPKYLL-----ER--------QP-------------------------- 171 (479) T ss_pred ----CCceEEEEechhheEEEecCCccc--ceeeEEE-----ee--------cC-------------------------- Confidence 1123455677777643 222111 0011110 00 00 Q ss_pred ccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHH Q lcl|NC_013692. 311 NFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDN 390 (726) Q Consensus 311 ~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 390 (726) ... +.+|... ..+.....++......+.|-+.|.+|+++|...+..+. +|.|.++.++++ T Consensus 172 --------~~~---~~~~~~~--------~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~-~g~sd~e~v~~l 231 (479) T protein:vir:99 172 --------NGQ---YWWWTEE--------DYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRG-VCYGDVEPLVTV 231 (479) T ss_pred --------cee---EEEEecc--------eEEEEEecCCceeeccccccCCCCcceEEeecCCCcCc-CCcchhHHHHHH Confidence 000 0111110 00000011111111123343448899999988877754 699999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCceEeecccccch------hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHH Q lcl|NC_013692. 391 QRIIGAVTRGMIDTMARSANGQVGVMKGALDVT------NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINL 464 (726) Q Consensus 391 Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~------d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~ 464 (726) ++.+|+.++.+...+...+.|+..+. |....+ .......++++...+ .++ .+.+.+. ......+.. T Consensus 232 iDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~----~~~q~~~--~~~~~~~~~ 303 (479) T protein:vir:99 232 AKAIDKTGLDILLVQHHQSFQIRWAT-GLMLPEGANADQEKMRFAQESMLISQN-EKA----SFGAIPA--APLDGLLNA 303 (479) T ss_pred HHHHHHHHHHHHHHHHHhhchhhhhc-CCCcccccccchhccccccccceeecC-CCc----eEEEecc--cchHHHHHH Confidence 99999999999999998888876542 332111 112223445554432 222 2223332 223344445 Q ss_pred HHHHHH---HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 465 QQAEAE---SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 465 ~~~~~e---~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +...+. ..|+++....|..++ .|+.++...............+.|..+++.+++.++.+.-. ......+.|+- T Consensus 304 l~~~i~~i~~~t~~p~~~~g~~~n---~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~-~~~~~~~~i~~ 379 (479) T protein:vir:99 304 YKESLLEFLALAQLPPHIAGQIVN---VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGR-TEEATDLDFTI 379 (479) T ss_pred HHHHHHHHhccCCCCHHHcccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC-CccccceeeeE Confidence 554444 456777777775443 36666666655555555566666667777766665443211 00111111110 Q ss_pred ccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHH--------HHHHHHhhhh-hhhhhhHH Q lcl|NC_013692. 542 EHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQI--------MGQIMELKKM-PDFAKRIR 612 (726) Q Consensus 542 ~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~--------~~~~~~~~~~-~e~~~~l~ 612 (726) .|-... +.+. .+.......+.+. + .++....... +..+.+..+. ........ T Consensus 380 -~w~~~~---------------~~s~-~~~ad~~~kl~~a-g-~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~ 440 (479) T protein:vir:99 380 -TWQDVT---------------IQSL-AQFADAWAKMVES-L-KIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMR 440 (479) T ss_pred -EecCCC---------------CCCH-HHHHHHHHHHHhc-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHH Confidence 010000 0000 0111111111111 0 1111111000 0000000000 00000000 Q ss_pred HHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 613 EFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKV 658 (726) Q Consensus 613 ~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~ 658 (726) .......+..+... . .-.....++....-. -+..-..-. T Consensus 441 ~~~~~~~~~~~~~~-~-----~~~~~~~~~~~~~~~-~~~~~~~~~ 479 (479) T protein:vir:99 441 KLQNGPDPAEQRGG-P-----NGATNMQQANNKTGE-PASLNKSGA 479 (479) T ss_pred HHhcccCcccccCC-C-----CCCCCCCCCCCCCcc-hhccCCCCC Confidence 00000000000000 0 000000000000000 000000000 No 110 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.21 E-value=1.2e-10 Score=74.96 Aligned_cols=453 Identities=11% Similarity=0.033 Sum_probs=184.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEE Q lcl|NC_013692. 32 LAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFE 105 (726) Q Consensus 32 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~ 105 (726) +++-...|..-...+........+..+||+|.-..+ ++.. ..-++|..-....|+...-.| + .+. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~-~~~~~~~n~~~~ivd~~~~~l---~--~~g--- 71 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPEL-AYLDVQPGWVATYLRTLSDRL---D--IEG--- 71 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhh-hhhhhhcchHHHHHHHHHhhh---c--cCc--- Confidence 222222222222234444555677789998765321 0111 122355566666666555444 1 111 Q ss_pred EecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHh Q lcl|NC_013692. 106 VNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIY 185 (726) Q Consensus 106 ~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (726) | ..++|.+. ...++.+| ..|+.-......+++++++|.+++.+| . . .. T Consensus 72 ~--~~~~d~~~----~~~l~~i~-~~N~~~~~~~~~~~~a~~~G~ay~~v~-~---~----~~----------------- 119 (480) T protein:vir:78 72 F--RISEDSEG----LEELWNWW-QANDLDEESVLGHDDSLTFGRAYITVS-H---P----DV----------------- 119 (480) T ss_pred e--ecCCCchh----HHHHHHHH-HhcCHHHHHHHHHHHHhhcCceEEEee-c---C----cc----------------- Confidence 1 12344332 34556665 467666777789999999999988763 1 0 00 Q ss_pred hhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee--eCCCCCCchhhC Q lcl|NC_013692. 186 QTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV--IDPSCGSDFSKA 263 (726) Q Consensus 186 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~--~dp~a~~d~~da 263 (726) .+.+ -.+.|.|..++|.+++ |||+... .- T Consensus 120 --------~~~d--------------------------------------~~~~~~i~~~~p~~~~~i~D~~~~~---~~ 150 (480) T protein:vir:78 120 --------ESGD--------------------------------------PAGIPLIRVESPLYMYAELDPRNTR---RV 150 (480) T ss_pred --------ccCC--------------------------------------CCCeeEEEEEcccceEEEEcCCCcc---ce Confidence 0000 0122456678888855 4553321 22 Q ss_pred CeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEE Q lcl|NC_013692. 264 KFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIV 343 (726) Q Consensus 264 ~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~ 343 (726) .+.++. +.+.++ .+.+..+++|.. +.+.+ T Consensus 151 ~~~i~~-~~~~d~------------------------------------------~~~~~~~~~y~~-----~~~~~--- 179 (480) T protein:vir:78 151 TRAVRL-YTTRDD------------------------------------------VAVPDRATLYLP-----DETVP--- 179 (480) T ss_pred EEEEEE-EEeecC------------------------------------------CcceEEEEEEeC-----CeEEE--- Confidence 222221 111100 001122333321 11111 Q ss_pred EEEECC----EEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEeecc Q lcl|NC_013692. 344 ATWVGA----VMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL-LIDNQRIIGAVTRGMIDTMARSANGQVGVMKG 418 (726) Q Consensus 344 ~~~~g~----~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~-~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~g 418 (726) ....|+ .....+..|.+.|.+|+++|...+..+..+|.|.+.. ++++++.+|..++.+...+...++|+..+ .| T Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G 258 (480) T protein:vir:78 180 LRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SG 258 (480) T ss_pred EEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hC Confidence 111111 1122233455558899999998888888999998874 89999999999999999999888887655 34 Q ss_pred cc-cc--hh----hhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHH---HhchHHHhhccCcccch Q lcl|NC_013692. 419 AL-DV--TN----RRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAES---MTGVKAFNAGISGAALG 488 (726) Q Consensus 419 av-~~--~d----~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~---~tGv~~~~~G~~~~~~~ 488 (726) .- +. .+ ......|.++... +..+ .+.+.+.. .+..+++.+...+.. +|+++....|..+. .. T Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~ 330 (480) T protein:vir:78 259 VTTDELTNDGENTTLDIYYGRILTLA-SEAA----KISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NP 330 (480) T ss_pred CCccccccccccchhhhhhhhhccCC-CCCc----eEEecCcc--CHHHHHHHHHHHHHHHhcccCCCHHHhccccC-ch Confidence 31 11 01 1122334443332 2222 12233321 233344555555554 56777677774331 11 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecc---cc Q lcl|NC_013692. 489 DTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS---TA 565 (726) Q Consensus 489 ~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~---~~ 565 (726) .||.++......-.......-+.|..+++++++.++. +..... . .++ +.+.+.=. .. T Consensus 331 ~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~----~~~~~~--~---~~~-----------~~i~v~w~~~~~~ 390 (480) T protein:vir:78 331 ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ----IMGREV--T---EEY-----------TRLETVWRDPSTP 390 (480) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HcCCCc--c---ccc-----------eeeeEEecCCCCC Confidence 3566666655555555555566666676666665443 322110 0 001 01111100 00 Q ss_pred hHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh-hhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 566 EEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP-DFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARA 644 (726) Q Consensus 566 ~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~-e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~ 644 (726) . ..+....+..+.+.....+.... .. +..+.. +..+.+..... ++.+.......... T Consensus 391 s-~~~~ad~~~kl~~~g~~~~s~et----~~---~~lg~~~d~~~e~~~~~~--------------~~~~~~~~~~~~~~ 448 (480) T protein:vir:78 391 T-VAAKADAVSKLYANGQGPIPKEQ----AR---IDLGYTATQREQMRDWDK--------------QETEDMIDTLYSTT 448 (480) T ss_pred C-HHHHHHHHHHHHHhcccCCCHHH----HH---hcCCCCHhHHHHHHHHHH--------------HHHHHHHHHhhccc Confidence 0 01111122222211111111111 00 011110 00000000000 00000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 645 AHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 645 q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) +........ ..........+....+.-+.... T Consensus 449 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 449 KAQADATPK--PTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred cCCCccccC--CCCCCCCCccCCCcccCCCcCCC Confidence 000000000 00000000000000000000000 No 111 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.19 E-value=6.9e-10 Score=70.80 Aligned_cols=484 Identities=13% Similarity=0.049 Sum_probs=189.7 Q ss_pred CC---CccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-CCCCCCC--- Q lcl|NC_013692. 1 MA---DVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG-KPKTEKG--- 73 (726) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~-~~~~~~g--- 73 (726) |. -+-+=|+||-.--.+++.. +..+.+ .+ ..+.+++..+.+|..||.|.... ......| T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~---~i~~~~-------~i----~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~ 66 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLN---SILEHP-------KI----AVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIK 66 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccch---hccccC-------CC----CCCHHHHHHHHHHHHHhcCCcccccccccCcchh Confidence 11 1111123322111111100 000000 01 12455666688999999764321 0011112 Q ss_pred -CCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 74 -KSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 74 -rs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) |.....+.-...++ -+++| .|+-.+-+.+ +|. .++++|+.++. .|+.+..++.++..++..|.++ T Consensus 67 ~~~~~slnl~~~i~~-~~A~l---v~~e~~~i~v-----~d~----~~~~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a 132 (522) T protein:vir:47 67 SRPMNHLPIARTASK-KIASL---VYNEQATITT-----KNE----ILQKFLDDMLT-NDRFNKNFERYLESCLALGGLA 132 (522) T ss_pred cccceecchHHHHHH-HHhhh---hcCCcceeec-----CCh----HHHHHHHHHHh-hcchHHHHHHHHHHhhccCCEE Confidence 12222233333333 23333 3332222222 233 45557777764 6777788999999999999999 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +|.||+.. . T Consensus 133 ~k~~~d~~--~--------------------------------------------------------------------- 141 (522) T protein:vir:47 133 MRPYIDGD--K--------------------------------------------------------------------- 141 (522) T ss_pred EEEEEcCC--c--------------------------------------------------------------------- Confidence 99999721 0 Q ss_pred cceeeccceeeeechhheeeC-CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIVID-PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~~d-p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) ++|.+|++..|++= ... .+...|-++.+..+... . ..-||.-++.-.+... +.........+ T Consensus 142 -------~~i~~v~ad~~~P~~~~~-~~~~e~a~~~~~~~~~~-~--~~~~yt~lE~he~~~~------~~~~~~~~~~~ 204 (522) T protein:vir:47 142 -------VRVAFIQAPVFFPLESNT-QDVSSAAILTKTIKSEG-R--KNVYYTLVEFHEWVTA------DGQETGSTNDK 204 (522) T ss_pred -------eEEEEEcCCceEEEEEcC-CceEEEEEEEEEEeecc-c--ceeEEEEEEEeeeccc------ccccccccccC Confidence 01223333333321 000 01222222222111000 0 0000000000000000 00000000000 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEE--EECCEEEEeccCCCCC-CccceEEee----eeeecCcccCCChH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVAT--WVGAVMIRMEENPFPD-KRIPYVVVN----YIPRKRDLYGESDG 384 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~--~~g~~~l~~~~~P~~~-~~~Pf~~~~----~~~~~~~~~g~g~~ 384 (726) . ...| -++.|...+.+.-|.......+ +.+ |.. ..-+.+ .+.+|+.+. .....++.+|.|++ T Consensus 205 ~------~~~I-~n~ly~~~~~~~lG~~v~l~~~~e~~~---l~~-~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~ 273 (522) T protein:vir:47 205 K------YYRI-TNELYRSDVNDVLGQRVNLSELDKYKN---LEP-VTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIF 273 (522) T ss_pred C------ceEE-EEEEeecCCCcccCccccccccccccC---CCC-ceEeCCCCcceEEEecCCcccccccCCCcCCchh Confidence 0 0000 1111111000000000000000 000 000 000111 222344332 22345788999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhh---------hhcCCc-eEe-ecCccchhhhcccccCcc Q lcl|NC_013692. 385 ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRR---------RFDRGE-NYE-FNPGADPRAAVHMHTFPE 453 (726) Q Consensus 385 ~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~---------~~~~g~-vi~-~~~~~~~~~~i~~~~~~~ 453 (726) ..+++..+.+|..++++.+-+.+. ..++.+++..+...... .+..+. ++. ++........|...++.. T Consensus 274 ~~~~~~id~lD~~~s~~~~e~~~g-~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~i 352 (522) T protein:vir:47 274 DNAKTTIDFINRSYDEFMWEVRMG-QRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPI 352 (522) T ss_pred hhhHHHHHHHHHHHHHHHHHHHhc-cceeecchHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeecccc Confidence 999999999999999999887754 45677877776332110 122222 221 221111223465554433 Q ss_pred chhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-- Q lcl|NC_013692. 454 IPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFL-- 531 (726) Q Consensus 454 ~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~-- 531 (726) -...+...+..+...+....|++.-..|.++.. ..||+++....+..-.....+...+..+++++...++.+...+. T Consensus 353 r~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~ 431 (522) T protein:vir:47 353 RANDYILAISEGLKLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVY 431 (522) T ss_pred ChHHHHHHHHHHHHHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 334455566777777777888877666655443 36888887766666677777888888888888888887764321 Q ss_pred CcCeEEEEecccceecchhhcccccceee--ecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhh Q lcl|NC_013692. 532 DDVEVVRITNEHFVDIRRDDLAGNFDLKL--DISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAK 609 (726) Q Consensus 532 d~e~~iRi~~~~~v~v~~~~~~~~~dv~i--~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 609 (726) ...- ...+++++ +.+...-.....+....+.. .+ -+.....-...-..- ...+.+... T Consensus 432 ~~~~-----------------~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~-aG-~~s~e~~i~~~~g~~-eeea~~el~ 491 (522) T protein:vir:47 432 SGEI-----------------PELDDISVNLDDGVFTDRHAELDYWAKMVA-AG-FSTKKRAIGKTLNIS-GVEAEKELN 491 (522) T ss_pred cCCC-----------------CCcceeEEEcCCCCCCCHHHHHHHHHHHHh-cC-CCCHHHHHHhcCCCC-hHHHHHHHH Confidence 1100 00111111 11111111111111111110 01 011110000000000 000000111 Q ss_pred hHHHHHhhhhhh-------hhhHHHHHHHHH Q lcl|NC_013692. 610 RIREFQPQPDPI-------AQQKAQLELMLL 633 (726) Q Consensus 610 ~l~~~~~~~~~~-------~qq~~q~e~q~~ 633 (726) ++++.+....+. ..+..+.--++- T Consensus 492 ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 492 AINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 111111000000 000000000000 No 112 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.19 E-value=7.1e-10 Score=70.75 Aligned_cols=475 Identities=11% Similarity=0.018 Sum_probs=202.8 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CC--------- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PK--------- 69 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~--------- 69 (726) |||.=.=++....+...-.+.. ..=..+..+..|...++. | ......+..+||.|.-+.. ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~i~~----~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~ 73 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVES-AKEIAEPDTTMIQKLIDE----H--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQ 73 (503) T ss_pred CcccccCChhhHHhHHHhhhhh-hhhccchhHHHHHHHHHh----h--cHHHHHHHHHHhccccchhhccchhccccccc Confidence 7765433332222111111111 111223333333333331 1 2334677889998764321 00 Q ss_pred -CCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHh Q lcl|NC_013692. 70 -TEKGKS--AVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGV 146 (726) Q Consensus 70 -~~~grs--~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l 146 (726) ..++++ +++.+..+..|+.....| ||.. +.|. .+|.+ ...+++..+ .|+....+..++++++ T Consensus 74 ~~~~~~~~~ri~~n~~~~ivd~~~~yl----~g~~--~~~~---~~d~~----~~~~l~~~~--~n~~~~~~~~~~~~~~ 138 (503) T protein:vir:59 74 LVDDTKTNNRTSHAWHKLFVDQKTQYL----VGEP--VTFT---SDNKT----LLEYVNELA--DDDFDDILNETVKNMS 138 (503) T ss_pred ccccccccceeecchHHHHHHHHHhhh----hcCC--eeec---cCcHH----HHHHHHHHH--hcCHHHHHHHHHHHHh Confidence 111222 567777788888777655 3333 2343 23333 333566544 3666777888999999 Q ss_pred hcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeecccc Q lcl|NC_013692. 147 DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVG 226 (726) Q Consensus 147 ~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 226 (726) .+|.+.+.+||+.. T Consensus 139 ~~G~~~~~v~~d~d------------------------------------------------------------------ 152 (503) T protein:vir:59 139 NKGIEYWHPFVDEE------------------------------------------------------------------ 152 (503) T ss_pred hCCeEEEEEeecCC------------------------------------------------------------------ Confidence 99999999887510 Q ss_pred ceeecccceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhh Q lcl|NC_013692. 227 SEEEEREETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 227 ~~~~~~~~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~ 304 (726) +.|++..++|.+++. |+... ....+ +.+.|.+... . T Consensus 153 -----------g~~~i~~~~p~~~~~i~d~~~~---~~~~~-~ir~~~~~~~---~------------------------ 190 (503) T protein:vir:59 153 -----------GEFDYVIFPAEEMIVVYKDNTR---RDILF-ALRYYSYKGI---M------------------------ 190 (503) T ss_pred -----------CceEEEEEccceeEEEEeCCCC---CceEE-EEEEEEEecC---C------------------------ Confidence 013345677777653 33221 22222 2233321110 0 Q ss_pred hhccccccccCCcCCceEEEEEEEEE-----eecCCCceEEEEEE-EEECCEEEEeccCCCCCCccceEEeeeeeecCcc Q lcl|NC_013692. 305 PSEGVRNFDFQDKSRKRLVVHEYWGY-----YDIHGDGVLHPIVA-TWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDL 378 (726) Q Consensus 305 ~~~~~~~~~~~~~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~-~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~ 378 (726) ...+..+|.|.. +...+++....... .......+.....|++.+.+||+.+.. .. T Consensus 191 --------------~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~ 251 (503) T protein:vir:59 191 --------------GEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKN-----NE 251 (503) T ss_pred --------------CceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecC-----CC Confidence 000112222211 00111110000000 000000111223345557778776643 44 Q ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc-ccchh--hhhhcCCceEeecCccchhhhcccccCccch Q lcl|NC_013692. 379 YGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTN--RRRFDRGENYEFNPGADPRAAVHMHTFPEIP 455 (726) Q Consensus 379 ~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga-v~~~d--~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~ 455 (726) +|.|.+..++++++.+|.+++.+.+.+...++|.+.+ .|. ..... ......++++.+..+++ +.+....... T Consensus 252 ~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~ 326 (503) T protein:vir:59 252 EMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVL-KNYDGENPKEFTANLRYHSVIKVSGDGG----VDTLRAEIPV 326 (503) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEe-ecCCccccchhhhhhhcccceeccCCCc----ceeEeccCCH Confidence 6899999999999999999999999999999887765 343 21111 12234455665554443 3344333334 Q ss_pred hHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCe Q lcl|NC_013692. 456 QSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVE 535 (726) Q Consensus 456 ~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~ 535 (726) ......++.+...+...+++++.+.+..++ +.||.++...............+.|..+++++++.++.++........ T Consensus 327 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~ 404 (503) T protein:vir:59 327 DSAAKELERIQDELYKSAQAVDNSPETIGG--GATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDF 404 (503) T ss_pred HHHHHHHHHHHHHHHHHhcccCCCcccccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 566777888888888889888776543222 235666666655555555666666667777766666655543221100 Q ss_pred EEEEecccceecchhhcccccceeee--cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHH Q lcl|NC_013692. 536 VVRITNEHFVDIRRDDLAGNFDLKLD--ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIRE 613 (726) Q Consensus 536 ~iRi~~~~~v~v~~~~~~~~~dv~i~--~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 613 (726) ....++.+. .....-..+..+.+..+... + -++.... +. .+....+....++. T Consensus 405 -----------------~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~-G-iiS~et~---l~---~l~~v~d~~~E~~r 459 (503) T protein:vir:59 405 -----------------NPDKELTMTFTRTRIQNDSEIVQSLVQGVTG-G-IMSKETA---VA---RNPFVQDPEEELAR 459 (503) T ss_pred -----------------ccccceeEEeCCCCCCCHHHHHHHHHHHHhC-C-CCchHHH---HH---hCCCCCCHHHHHHH Confidence 000011111 11111111111111111110 0 0111110 00 01111100000000 Q ss_pred HHhhhhhhhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 614 FQPQPDPIAQQKAQLELMLLQAQIEAERAR-AAHYMSGAGLQDSKVGTEQAK 664 (726) Q Consensus 614 ~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq-~q~~~~~~~~~~~~~~~eqaq 664 (726) ...+.....++.... ....... .+.+.....-+..+...-++. T Consensus 460 i~~E~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 460 IEEEMNQYAEMQGNL--------LDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHHHHHHHhhhccc--------cCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 000000000000000 0000000 000000000000000000000 No 113 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.19 E-value=7.4e-11 Score=76.13 Aligned_cols=501 Identities=13% Similarity=0.096 Sum_probs=209.1 Q ss_pred CCCCCC--ccchh----cCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCC--CcCCCHHH Q lcl|NC_013692. 11 LPNEDG--DPSKR----LQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGK--SAVQPPTI 82 (726) Q Consensus 11 ~~~~~~--~~~~~----~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr--s~~v~~~v 82 (726) |+-+.. .+.++ ...+|-.+.. ..++..+.-=++||++.---..+...|+ --+-.+.- T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D---------------~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~ 65 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDEND---------------KNRVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPSG 65 (563) T ss_pred CCccccccCCCcccccccccccCCHHH---------------HHHHHHHHHHHHhhcCchhhhhhhcCCCceeeeccchH Confidence 332210 11111 1344421111 1133334445778865432112223343 23444566 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeee Q lcl|NC_013692. 83 RKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSR 162 (726) Q Consensus 83 ~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~ 162 (726) +..|++.+ -|||.+--+-+.|.. +|....+....||+..+.+++ -.-.+...-++|++.|-|+.++-||.... T Consensus 66 r~~V~~~~-----~~Lg~~~~~~Ve~~~-~de~~~~avq~~Lr~~~~~e~-l~~~~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 66 RKIVEAVH-----RFLGVGFDYLVEPDM-GDEGIRQSLNAYFRTTFKREA-IKAKFTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred HHHHHHHH-----HhcCCCcEEecCccc-cCcchHHHHHHHHHHHHHHhh-hHHHHHHHHHhhhhhcceeEEEeeccccc Confidence 77888744 345666666777755 677777778889998876544 44455567799999999999999996543 Q ss_pred eEEecccccccCCcchHHHHHHhhhhhhhhh-ccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccce Q lcl|NC_013692. 163 TVKEQVVTYEMMPDSSEELAQIYQTAAQIRE-ESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPT 241 (726) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~ 241 (726) +..+-+ ..++-|. .++. +.++. ..|+ T Consensus 139 ~g~R~r-v~~vDP~-------------~~fp~~dpd~----------------v~g~----------------------- 165 (563) T protein:vir:74 139 AGERIS-VDEVDPR-------------QIFLIEDGST----------------VVGF----------------------- 165 (563) T ss_pred cCCCce-EeecCCc-------------eeeeccCCCC----------------cccc----------------------- Confidence 211111 1111010 0000 00000 0011 Q ss_pred eeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 242 VQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 242 i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) -+.++..++.-.. +..+.||+++..++ +|...|.|.. +...+ T Consensus 166 ----~~v~v~~~~~~pd--d~~~~~~r~~~~~~-~lndeg~~~~-----------~~~~d-------------------- 207 (563) T protein:vir:74 166 ----HMVDIVQDFRSPD--DPSKKLARRRTFRR-VRNDEGMFTG-----------RISSE-------------------- 207 (563) T ss_pred ----eeeecccCCCCCc--chhccceeeeeeee-eeCCCCCccc-----------eeeec-------------------- Confidence 0112222221111 12245566554333 1111121110 00000 Q ss_pred EEEEE--EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHH Q lcl|NC_013692. 322 LVVHE--YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTR 399 (726) Q Consensus 322 v~v~E--~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~ 399 (726) ++.|| -|-....+.......-.-++...+.++....|.+.+.+||++++..|.+++.||.|....+..+.+++|...+ T Consensus 208 ae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~T 287 (563) T protein:vir:74 208 LTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLT 287 (563) T ss_pred cchhccccccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhh Confidence 00011 1111111111111112223333344444545556688999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCCCceEeeccc-cc----chhhhhhcCCceEeecCccchhhhccc-ccCccchhHHHHHHHHH-HHHHHHH Q lcl|NC_013692. 400 GMIDTMARSANGQVGVMKGA-LD----VTNRRRFDRGENYEFNPGADPRAAVHM-HTFPEIPQSAQYMINLQ-QAEAESM 472 (726) Q Consensus 400 ~~~d~l~~~~~~~~~~~~ga-v~----~~d~~~~~~g~vi~~~~~~~~~~~i~~-~~~~~~~~~~~~ll~~~-~~~~e~~ 472 (726) -..-++..+++|.+..+... ++ .....+..||.++..-..... ..+.. ...|++. .+..-+..+ ...+.++ T Consensus 288 d~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~-g~l~~v~g~~~l~-~~q~Hm~~l~eral~~~ 365 (563) T protein:vir:74 288 DEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRND-NYFERVSGVQDVS-PFQDHMKWIDEKGIAEG 365 (563) T ss_pred HHHHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCCccc-cceeeecchhhhH-HHHHHHHHHHHHHHHhh Confidence 99999999999987775322 22 122344678888877532110 11111 1122222 222223333 3467889 Q ss_pred hchHHHhhccCcccchhhHHHHHHHHHH---HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhc---CcCe---------- Q lcl|NC_013692. 473 TGVKAFNAGISGAALGDTATAVRGALDA---ASKREL-GILRRLSAGIIEIGRKIIAMNAEFL---DDVE---------- 535 (726) Q Consensus 473 tGv~~~~~G~~~~~~~~ta~~i~~~~~~---~~~~~~-~~~~~~~~~~~~l~~~il~li~q~~---d~e~---------- 535 (726) +|++++..|.-..+-.-|+.++...+.- ...+-. .+..-+..++-+...++|.+.+..+ +-+. T Consensus 366 s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~ 445 (563) T protein:vir:74 366 SGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNE 445 (563) T ss_pred ccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCc Confidence 9999999994333222344333332221 111111 1333333444555666666555421 2111 Q ss_pred -EEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccch---h--------HHHHHHHHH----- Q lcl|NC_013692. 536 -VVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDP---M--------MAQQIMGQI----- 598 (726) Q Consensus 536 -~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~---~--------~~~~~~~~~----- 598 (726) .+.|+=....++|....-.+.-.-+..|..+.+. .... +...+-..+. . +...++.+. T Consensus 446 ~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSret-Av~~----L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~ 520 (563) T protein:vir:74 446 CSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKM-AVAK----LRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADAS 520 (563) T ss_pred eEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHH-HHHH----HHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCc Confidence 1111111122222111100000000011111100 0001 1111111111 0 000000000 Q ss_pred -----HHhhhhhhh-----hhhHHHHHhhhhhhhhhHHHHHHHH Q lcl|NC_013692. 599 -----MELKKMPDF-----AKRIREFQPQPDPIAQQKAQLELML 632 (726) Q Consensus 599 -----~~~~~~~e~-----~~~l~~~~~~~~~~~qq~~q~e~q~ 632 (726) +...++.+- -+-+..+. .+-....--.+.-... T Consensus 521 ~~~~a~~~~g~~~~~~dd~g~p~~~~~-~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 521 LGLSAMDNGGAGEQQFDDQGNPIDQFG-NPVEIPPDVTQVPLSP 563 (563) T ss_pred ccceecccCCCCcccccccCCchhHcC-CcccCCccccccCCCC Confidence 000000000 00000000 0000000000000000 No 114 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.18 E-value=8.1e-10 Score=70.43 Aligned_cols=461 Identities=9% Similarity=0.023 Sum_probs=199.6 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---CC--CCCCCC- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEG---KP--KTEKGK- 74 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~---~~--~~~~gr- 74 (726) |- =|+-.-.+....-. ..+.++.-+ .|...|++ .........++..+||+|.-.. .. ...+|+ T Consensus 1 ~~---~~~~~~~~~~~~~~-~~~~~l~~~----~i~~li~~---~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~ 69 (506) T protein:vir:94 1 MD---YDLTEHKQANLIYQ-ESLENLTPN----KIMKFITH---HFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKA 69 (506) T ss_pred CC---cchhhhhcceeecc-cchhcCCHH----HHHHHHHH---HHHHHHHHHHHHHHHhcCCCccccccccccccccCC Confidence 21 11110001111000 001112112 22333332 1122233457788899876421 11 122344 Q ss_pred -CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEE Q lcl|NC_013692. 75 -SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIV 153 (726) Q Consensus 75 -s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~ 153 (726) .+++.+..+..|+.....| ||.+ +.|.+- |. ...+.|+.+| ..|+....+....++++.+|.+.+ T Consensus 70 ~~ki~~n~~~~Iv~~~~~~l----~G~p--~~~~~~---d~----~~~~~l~~~~-~~N~~~~~~~~~~~~~~~~G~a~~ 135 (506) T protein:vir:94 70 DHRATHSFAKYIADFQTSYS----VGNP--INVKLP---DD----GSNSGFDTFN-KANDVDAENYDLFLDMSRYGRAYE 135 (506) T ss_pred cceeecchHHHHHHHhhhhh----cccC--ceeecC---cc----hHHHHHHHHH-hccCHhHHHHHHHHHHHhcCeEEE Confidence 4678888888888877655 4433 345442 22 2345677766 467766778889999999999999 Q ss_pred EEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeeccc Q lcl|NC_013692. 154 KVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEERE 233 (726) Q Consensus 154 k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (726) .+||+.. T Consensus 136 ~v~~ded------------------------------------------------------------------------- 142 (506) T protein:vir:94 136 YVYRGED------------------------------------------------------------------------- 142 (506) T ss_pred EEEecCC------------------------------------------------------------------------- Confidence 8887510 Q ss_pred ceeeccceeeeechhheee--CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccc Q lcl|NC_013692. 234 ETVENHPTVQVCDYNNIVI--DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRN 311 (726) Q Consensus 234 ~~~~~~p~i~~v~p~~~~~--dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 311 (726) +.|.+.+++|.++++ |+... .. ...+.+.|..... ..+ T Consensus 143 ----~~~~i~~~~p~~~~~v~dd~~~---~~-~~~~v~~~~~~~~------~~~-------------------------- 182 (506) T protein:vir:94 143 ----NEEHLAKLDPLDTFVIYSTDVD---PK-PIMAVRYHQIELV------DDN-------------------------- 182 (506) T ss_pred ----CeeEEEEEcccceEEEecCCCC---Cc-eEEEEEEEeeeec------cCC-------------------------- Confidence 012344566666543 33221 12 2223333311100 000 Q ss_pred cccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEEC----CEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHH Q lcl|NC_013692. 312 FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVG----AVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALL 387 (726) Q Consensus 312 ~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g----~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~ 387 (726) .....+..++.|.. .++.++.+ ..+....++ +.+.+|++.++.. -.|.|.++.+ T Consensus 183 -----~~~~~~~~~~~yt~----------~~~~~~~~~~~~~~~~~~~~~--~~g~vPvv~~~n~-----~~~~sd~e~~ 240 (506) T protein:vir:94 183 -----QVSTINYVPETWTA----------DTYTLYNPTPIMGKMQVDTTK--PITTFPVVEFKNS-----NFRLGDFENV 240 (506) T ss_pred -----ceeEEEEEEEEEeC----------ceEEEeccccCccceeccccc--cCCccceEEecCC-----CCCCCchhhh Confidence 00001222333321 01111211 122222333 3467787666443 3478999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cch--------------------------hhhhhcCCceEeecCcc Q lcl|NC_013692. 388 IDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVT--------------------------NRRRFDRGENYEFNPGA 440 (726) Q Consensus 388 ~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~--------------------------d~~~~~~g~vi~~~~~~ 440 (726) +++++.+|..+|.+.+.+...+++.+.+- |.. ... ......-++++.+.+++ T Consensus 241 ~~liDa~d~~~S~~~~~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (506) T protein:vir:94 241 LPLIDLYDAAQSDTANYMTDLNEAMLIIQ-GDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGM 319 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHh-cCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccc Confidence 99999999999999998887776665442 211 000 00111223344444333 Q ss_pred ch-----hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 441 DP-----RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAG 515 (726) Q Consensus 441 ~~-----~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~ 515 (726) .. ..-+.+...+.....+...+..+...+...|++++.+.+..++ +.||.++..............-+.|..+ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~--n~Sg~Aik~~~~~l~~k~~~k~~~~~~~ 397 (506) T protein:vir:94 320 TVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFAS--NSSGVAMQYKVLGTVELASTKRRMFERG 397 (506) T ss_pred cccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 1123333444445677778888899999999999876543222 2466677776666666666677777778 Q ss_pred HHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHH Q lcl|NC_013692. 516 IIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIM 595 (726) Q Consensus 516 ~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~ 595 (726) ++++++.++.++........ + ++. +.. +..+.....-..+..+.+.. +...++.... + T Consensus 398 l~~~~~li~~~~~~~~~~~~-~-----d~~-----~i~----i~f~~~~p~d~~e~a~~~~k----l~g~iS~et~---~ 455 (506) T protein:vir:94 398 LYARYQIISDIENSIHGDWT-F-----DPQ-----ELT----FTFRDNLPADNISQIKALVQ----AGATLPQKYL---Y 455 (506) T ss_pred HHHHHHHHHHHHHhcCCccc-c-----ccc-----cce----EEeCCCCCcCHHHHHHHHHH----HhccCChHHH---H Confidence 88877777776543221100 0 010 001 11111111111111111111 1111221111 1 Q ss_pred HHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 596 GQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQAD 673 (726) Q Consensus 596 ~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~ 673 (726) . .+....+....++....+........ ..... ....++...... +...+.+ T Consensus 456 ~---~lp~v~d~~~E~~ri~~E~~~~~~~~---~~~~~-------~~~~~~~~~~~~--------------~~~~e~~ 506 (506) T protein:vir:94 456 Q---QLPGVTNPQDIVDMMKEQSANGDYSF---DQNGV-------ISNDGQTNTTAT--------------QTDEEVR 506 (506) T ss_pred H---hCCCCCCHHHHHHHHHHHHHHHhhcc---hhhcC-------CCcccCcccccc--------------ccccCCC Confidence 0 11111111111111100000000000 00000 000000000000 0000000 No 115 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.11 E-value=1.9e-09 Score=68.39 Aligned_cols=582 Identities=12% Similarity=0.024 Sum_probs=185.4 Q ss_pred HHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHH---HHHhh----------cccchhH-HHHHHHHHhh Q lcl|NC_013692. 82 IRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLN---QQFNT----------KLNKQRF-IDEYVRAGVD 147 (726) Q Consensus 82 v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n---~~~~~----------~~~~~~~-~~~~~~~~l~ 147 (726) -.+..+.++..|++-|..--.. .++=...|.....|.+ .+|.. +..|+.. ..+.++-.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~------~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v- 73 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSP------QKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATEL- 73 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHh------hHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHH- Confidence 2222233333333333111000 0000111111111221 12211 1111111 112222222 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) +.-.+.++..++.++++|+...+...+++....+... ..+........+.+|.+....|.||..+. . T Consensus 74 ----------~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~--~ 140 (708) T protein:vir:10 74 ----------NRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRA-DYEETDGGEACDNAFDDAATGGFGCFRLT--S 140 (708) T ss_pred ----------HHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHH-HHHhcCchHHHHHHHHhhhhcccceeeee--e Confidence 2223445667788888888433323344443333321 12244566789999999999999985432 1 Q ss_pred eeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhc Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSE 307 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~ 307 (726) .+..+.-+..+ + ..+ +....+||. ...-|=-..+..+++|.. |. ....+...+.....-++.... T Consensus 141 d~~~e~d~~~~-~--~~i-~i~~~~~p~-----~~v~~Dp~a~~~D~sDar---~~-~~~~~~~~d~~~~~~p~~a~~-- 205 (708) T protein:vir:10 141 MLVNEYDPMDD-R--QRI-AIEPIYDPS-----RSVWFDPDAKKYDKSDAL---WA-FCMYSLSPEKYEAEYGKKPPT-- 205 (708) T ss_pred ccccccCCCCC-c--ccc-ceEEeecch-----hhcccCccccccChhhhh---hh-hhccCCCHHHHHHhCCCCccc-- Confidence 11111100000 0 000 000111110 000000111112333311 10 000110000000011111110 Q ss_pred cccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEE--------E----ECCEEEEeccCC----------------- Q lcl|NC_013692. 308 GVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVAT--------W----VGAVMIRMEENP----------------- 358 (726) Q Consensus 308 ~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~--------~----~g~~~l~~~~~P----------------- 358 (726) ..++..... ...+ |.. .+...+.+.|+.. + +|..+...+... T Consensus 206 ---~~d~~~~~~---~~~~-~~~--~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~ 276 (708) T protein:vir:10 206 ---SLDVTSMTS---WEYN-WFG--ADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVAR 276 (708) T ss_pred ---ccccccCCC---cccc-ccC--CCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhhe Confidence 011111000 0111 211 1111122222211 1 122222111100 Q ss_pred -------------------CCCCccceEEeeeeeecCcccC-CCh---HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEe Q lcl|NC_013692. 359 -------------------FPDKRIPYVVVNYIPRKRDLYG-ESD---GALLIDNQRIIGAVTRGMIDTMARSANGQVGV 415 (726) Q Consensus 359 -------------------~~~~~~Pf~~~~~~~~~~~~~g-~g~---~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~ 415 (726) ...+.+||-.|++.|..+..++ .|. .-.++++-+....+...+...+...+..+..+ T Consensus 277 r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~ 356 (708) T protein:vir:10 277 RSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQI 356 (708) T ss_pred eeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcc Confidence 0112356666666655544331 121 23344444444444444444444443333222 Q ss_pred ecccccchhhhhhcCCceEeecCcc-------chhhhcccccCc----cchhHHHHHHHHHHHHHHHHhchHHHhhccCc Q lcl|NC_013692. 416 MKGALDVTNRRRFDRGENYEFNPGA-------DPRAAVHMHTFP----EIPQSAQYMINLQQAEAESMTGVKAFNAGISG 484 (726) Q Consensus 416 ~~gav~~~d~~~~~~g~vi~~~~~~-------~~~~~i~~~~~~----~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~ 484 (726) .-...........+.+..-..+... .+.+.+.....+ ..+.....++++++....++.-+ +|.++ T Consensus 357 ~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~v----sG~~~ 432 (708) T protein:vir:10 357 PIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEV----TGGSQ 432 (708) T ss_pred cccChhhhhhHHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHH----hCcCh Confidence 1111111111111111211111111 111112211112 22344555777777777666554 46666 Q ss_pred ccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccc-------- Q lcl|NC_013692. 485 AALGDTATAVRGALDAASKREL-GILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGN-------- 555 (726) Q Consensus 485 ~~~~~ta~~i~~~~~~~~~~~~-~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~-------- 555 (726) ..+|. .+.+++...++.+... .....|.+.++.-.+.+.+++..+..+- .+.+..+.|..++-..+ T Consensus 433 ~~lG~-~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----y~~er~~RI~~edg~~~~v~in~~~ 507 (708) T protein:vir:10 433 AMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV----YGSEREVRIVNEDGSDDIAVLSAQV 507 (708) T ss_pred hHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCCcceEEeccee Confidence 66664 3346776666555544 3455566666666677777666554331 12334555654432111 Q ss_pred -----------cceee---ecccchH-HHHH-HHHHHHHHHHhhhccchhHHHH--HHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 556 -----------FDLKL---DISTAEE-DNAK-VNDLTFMLQTMGPNMDPMMAQQ--IMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 556 -----------~dv~i---~~~~~~~-~~~~-~~~l~~l~q~~~~~~~~~~~~~--~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) .|+++ .+..... .... .++....+..+.+..++..... ++..+++.+.++...+........ T Consensus 508 ~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~ 587 (708) T protein:vir:10 508 VDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQ 587 (708) T ss_pred ccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHh Confidence 11111 1111111 1111 1222222333333344433322 223344455555543333322222 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQES-GVQQARKRELQQ 696 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~-~~~~~~e~e~~~ 696 (726) .....+..+. .+.+++..+.+ +.+++...+....+++++..++++++++.+.+..+.+. +.+...+.++. T Consensus 588 ~~~~~~~~~~-------~~ee~q~~~~~-q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a- 658 (708) T protein:vir:10 588 LLISGIAKPR-------NEKEQQIVQQA-QMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMES- 658 (708) T ss_pred hccccccccc-------chhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 2111111111 11111111111 11111111222223344444444444444433322111 11111111111 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHH------HHHhcC Q lcl|NC_013692. 697 AQSEAQGKLAMLNSQLK-RLDEATS------ARTSQK 726 (726) Q Consensus 697 ~q~~~q~~~~~l~~~~~-~~~~~~~------a~~~~q 726 (726) +.++-+.++......+ ...+..+ ..++++ T Consensus 659 -~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~ 694 (708) T protein:vir:10 659 -QANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQ 694 (708) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHH Confidence 1111111111111100 0011111 011111 No 116 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.11 E-value=5.3e-10 Score=71.42 Aligned_cols=472 Identities=13% Similarity=0.060 Sum_probs=181.3 Q ss_pred CCCccchhhcCCCCCCccchhcCCCCCCch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCCCCCC--- Q lcl|NC_013692. 1 MADVDEDYLTLPNEDGDPSKRLQPEWSNAP-SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKTEKGK--- 74 (726) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~gr--- 74 (726) |. |.=|.- .+ ++.+ .=.++++- +-..+...+..-...|..+....++-.+||+|....+ ++..+.. T Consensus 1 ~~-~~~~~~-----~~-~~~~-~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~ 72 (501) T protein:vir:25 1 MT-VPVDVI-----AD-APAA-DVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKE 72 (501) T ss_pred Cc-ccchhh-----hc-cCcc-cccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhh Confidence 11 000000 00 1111 11121111 1111222222222245555556677889998765321 1111111 Q ss_pred --CcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeE Q lcl|NC_013692. 75 --SAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTII 152 (726) Q Consensus 75 --s~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i 152 (726) -+.|..-.+..|+.+.-.| ++ .+.+-+|..... .+..+| ..|+.-.....++++++++|.++ T Consensus 73 ~~~~~v~n~~~~ivd~~a~~l---~~--------~gf~~~d~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~i~G~ay 136 (501) T protein:vir:25 73 LAKLSVKNVLSLVRDSFAQNL---SV--------VGYRNALAKEND----PAWEMW-QRNRMDARQAEVHRPALTYGASY 136 (501) T ss_pred hHhhhhcChHHHHHHHHHhhh---cc--------cceecCCccchH----HHHHHH-HhcChhHHHHHHHHHHhhcCceE Confidence 1244455555555443222 11 111222222222 234444 46665566678899999999999 Q ss_pred EEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecc Q lcl|NC_013692. 153 VKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEER 232 (726) Q Consensus 153 ~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 232 (726) +.+|.+.. T Consensus 137 ~~v~~de~------------------------------------------------------------------------ 144 (501) T protein:vir:25 137 VTVTPTDE------------------------------------------------------------------------ 144 (501) T ss_pred EEEecCCC------------------------------------------------------------------------ Confidence 87765410 Q ss_pred cceeeccceeeeechhhee--e-CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccc Q lcl|NC_013692. 233 EETVENHPTVQVCDYNNIV--I-DPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGV 309 (726) Q Consensus 233 ~~~~~~~p~i~~v~p~~~~--~-dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 309 (726) .+.|..++|++++ | ||.... ...++++ .+....+ . + .. .. ... T Consensus 145 ------~~~i~~~sp~~~~~iy~D~~~~~---~~~~ai~-~~~~~~~---~----~--~~---~~----~~~-------- 190 (501) T protein:vir:25 145 ------GPVFRTRSPRQILAVYADPSVDA---WPQYALE-TWVAQKD---A----K--PH---RR----GVL-------- 190 (501) T ss_pred ------CCeEEEeccccEEEEEecCCCCc---ceeEEEE-EEeeccc---c----C--cc---ee----EEE-------- Confidence 0112345566653 3 564321 1222222 2211111 0 0 00 00 000 Q ss_pred cccccCCcCCceEEEEE---EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHH Q lcl|NC_013692. 310 RNFDFQDKSRKRLVVHE---YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGAL 386 (726) Q Consensus 310 ~~~~~~~~~~~~v~v~E---~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 386 (726) |. ...+..+. .|......+.........+..++. ......|.+++.+||+.++..+..+ .+|.|.++. T Consensus 191 ----y~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vPiv~f~N~~~~~-~~g~sdie~ 261 (501) T protein:vir:25 191 ----YD---DTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDV-IEHGATFEGKPVCPVVRFVNGRDAD-DMIVGEVAP 261 (501) T ss_pred ----ec---CeeEEEEecCceeeeeccccccccccccccccccc-cccccccCCccceeeEeccCccccC-ccccchhhh Confidence 00 00000000 000000000000011111111111 1122234445778888887766554 468999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cchhhhhhcCCceEeecCccchhhhcccccCccc-hhHHHHHHHH Q lcl|NC_013692. 387 LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEI-PQSAQYMINL 464 (726) Q Consensus 387 ~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~-~~~~~~ll~~ 464 (726) ++++++.+|+.++.+...+...+.|+..+ .|.- +..+......+.++... +.++ .+ .+++.. .+.+...+.. T Consensus 262 v~~l~Da~~~~~s~~~~~~e~~a~p~~~i-~G~~~~~~~~~~~~~~~i~~~~-~~~~--~~--~q~~~~~~~~~~~~l~~ 335 (501) T protein:vir:25 262 LILLQQAINSVNFDRLIVSRFGANPQRVI-SGWTGSKAEVLKASALRVWTFE-DPEV--KA--QAFPPASVEPYNLILEE 335 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHhhccHHHHH-hCCCCCccchhhhcccceeccC-CCCc--eE--EEecccChHHHHHHHHH Confidence 99999999999999999998888876544 4442 22334455667766553 3222 22 233321 1223333444 Q ss_pred HHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCe---EEEEec Q lcl|NC_013692. 465 QQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVE---VVRITN 541 (726) Q Consensus 465 ~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~---~iRi~~ 541 (726) +...+...|++|....|...+ +.||.++......-........+.|..+++.+++.++. +.+... ...+. T Consensus 336 ~i~~i~~~s~~P~~~~~~~~~--N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~----~~~~~~~~~~~~i~- 408 (501) T protein:vir:25 336 MLQHVAMVAQISPAQVTGKMI--NVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAE----MDDDPDTAADSGAE- 408 (501) T ss_pred HHHHHHhhcCCChhhhccccC--ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCCccccceeee- Confidence 444444567788777773221 23666666666655556666666677777766665543 322211 01110 Q ss_pred ccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHH--------HHHHHHHhhhhhhhhhhHHH Q lcl|NC_013692. 542 EHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQ--------IMGQIMELKKMPDFAKRIRE 613 (726) Q Consensus 542 ~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~--------~~~~~~~~~~~~e~~~~l~~ 613 (726) -.|-...+..... .......+.+. + ++...... ....+.+...-......+.. T Consensus 409 v~w~~~~~~s~~~----------------~ada~~kl~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~ 469 (501) T protein:vir:25 409 VLWRDTEARSFGA----------------VVDGITKLASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDK 469 (501) T ss_pred EEecCCCCCCHHH----------------HHHHHHHHHhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHH Confidence 0011111111111 11111111110 0 11110000 00011110000000000000 Q ss_pred HHh-hhhhhhhhHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_013692. 614 FQP-QPDPIAQQKAQLELM-LLQAQIEAERARA 644 (726) Q Consensus 614 ~~~-~~~~~~qq~~q~e~q-~~qaq~e~~~aq~ 644 (726) ... .+.+......+...+ ......... .-+ T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~ 501 (501) T protein:vir:25 470 LLSNEPAPVPPPPPQAAAQALNEGGVNGN-GGA 501 (501) T ss_pred hhccCcCCCCCCCCCCCccccccccCCCC-CCC Confidence 000 000000000000000 000000000 000 No 117 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.11 E-value=2e-09 Score=68.28 Aligned_cols=590 Identities=11% Similarity=-0.003 Sum_probs=210.8 Q ss_pred CCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHH---HHHhhcccchhHHHHHHH----HHhhcCC Q lcl|NC_013692. 78 QPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLN---QQFNTKLNKQRFIDEYVR----AGVDEGT 150 (726) Q Consensus 78 v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n---~~~~~~~~~~~~~~~~~~----~~l~~~~ 150 (726) -+-+-++..+ .+++-|...-+. .++-...|.....|.+ .+|..+.. ..|...-+ -+|..+ T Consensus 1 m~e~~~~~~~----~~~~~~~~~~~~------~~~~r~~~~~d~~f~~~~G~QW~~~~~--~~l~~~~q~~grP~~~~N- 67 (706) T protein:vir:10 1 MAESRQKQHE----RVMLRFDRAWSP------QQVVREKCIEATRFVRVPGGQWEGATV--AGTKLDEQFEKYPKFEIN- 67 (706) T ss_pred CCcchHHHHH----HHHHHHHHHHHH------HHHHHHHHHHHHHhhccCCccCCHHHH--HHHHhhhhhcCCCceEec- Confidence 2212222222 333333111000 1122233333444442 24433321 11111100 011111 Q ss_pred eEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceee Q lcl|NC_013692. 151 IIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEE 230 (726) Q Consensus 151 ~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 230 (726) .|+..++.-.+.++..++.+.++|+.......+++...-+... ..+........+.+|.+....|.||..+. ..+. T Consensus 68 -~i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~-~~~~~~~~~a~s~Af~d~i~~G~G~~ev~--~d~~ 143 (706) T protein:vir:10 68 -KVATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRA-DYEETDGGEACDNAFDDAATGGFGCFRLT--TSFV 143 (706) T ss_pred -chHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHH-HHHhcCchHHHHHHHHHHhhcCcceEEee--eccc Confidence 1111223334555678888888885332222233333333221 12244567789999999999999985432 1110 Q ss_pred cccceeeccceeeeechhheeeCCCCCCchhh-CCeEE---EEEeccHHHHHhcCCCc--chhhcCcccchhhcccchhh Q lcl|NC_013692. 231 EREETVENHPTVQVCDYNNIVIDPSCGSDFSK-AKFLI---ETFESSYAELKADGRYQ--NLDKIQVEGQNLLSEPDYTG 304 (726) Q Consensus 231 ~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~d-a~~~~---~~~~~t~~el~~~g~~~--~~d~~~~~~~~~~~~~~~~~ 304 (726) .. -+|++|-.++.... +.+ .+-|+ ..+..+.+|..-..+.+ +.+.+ ...-++... T Consensus 144 -------~~-----~d~~~~~~~i~i~~-v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~------~~~fp~~~~ 204 (706) T protein:vir:10 144 -------NE-----YDPMDERQRIAVEP-IYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKY------QSEYDKAPT 204 (706) T ss_pred -------cc-----cCCCCCCccceeee-eccchhceecCchhcccChhhcceEeeeecCCHHHH------HHhcCCChh Confidence 00 11222222111100 001 00111 11122333311110111 11111 000011000 Q ss_pred hhccccccccC--CcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEec-------------------cCCC---- Q lcl|NC_013692. 305 PSEGVRNFDFQ--DKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRME-------------------ENPF---- 359 (726) Q Consensus 305 ~~~~~~~~~~~--~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~-------------------~~P~---- 359 (726) ......+.++. ....+.|++.|||.+....-. +. ++...+.++...... .-+. T Consensus 205 ~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 282 (706) T protein:vir:10 205 SLDRVGSVSWQYDWFTPDVVYIAKYYEVRKESVD-VI-SYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRR 282 (706) T ss_pred hhhhhccccccccccCCCcceecccccccceeEE-EE-EeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceee Confidence 00011111111 123467899999876432111 11 111111111111000 0000 Q ss_pred -------------CCCccceEEeeeeeecCcc---cCCCh-HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccc Q lcl|NC_013692. 360 -------------PDKRIPYVVVNYIPRKRDL---YGESD-GALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDV 422 (726) Q Consensus 360 -------------~~~~~Pf~~~~~~~~~~~~---~g~g~-~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~ 422 (726) ...-||.-.|++.|..+.. .|.+. .-.++++-+....++..+...+++.+..+..+..++++. T Consensus 283 v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~ 362 (706) T protein:vir:10 283 IYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQ 362 (706) T ss_pred EEEEeeccccccccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhH Confidence 0011233334444443332 22232 344566778888888888899999999999988888765 Q ss_pred hhhhhhcCCceEeecCccch-------hhhcccccCc----cchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhH Q lcl|NC_013692. 423 TNRRRFDRGENYEFNPGADP-------RAAVHMHTFP----EIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTA 491 (726) Q Consensus 423 ~d~~~~~~g~vi~~~~~~~~-------~~~i~~~~~~----~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta 491 (726) .+...-.....-...+.... .+.+.....+ ..+......++++......+. ..+|.++..+|..+ T Consensus 363 i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~----~vsGi~~~~lG~~s 438 (706) T protein:vir:10 363 IRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQ----EVTGSSQAMQQMPS 438 (706) T ss_pred HHHHHHHhhhcccccccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHH----HHhCCCHHHcCCcc Confidence 44433222221111111110 1122111111 122333445555444444432 34577776666543 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeec-------- Q lcl|NC_013692. 492 TAVRGALDAASKREL-GILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDI-------- 562 (726) Q Consensus 492 ~~i~~~~~~~~~~~~-~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~-------- 562 (726) .+++...++.+... .....|.+.++...+.+-+++..+...- .+.+-.+.|..++-..+ .+.++. T Consensus 439 -n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~~----y~~~R~~RI~~ed~~~~-~v~in~~~~d~~~G 512 (706) T protein:vir:10 439 -NVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAREI----YGSDREVRIVHEDGTDD-IALMNAAVLDNQTG 512 (706) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCCcc-ceeeccceeccccC Confidence 36777666655544 3455666777777777777776554331 12333455554332211 111111 Q ss_pred ---------------ccchHHH-HHH-HHHHHHHHHhhhccchhHHHH--HHHHHHHhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_013692. 563 ---------------STAEEDN-AKV-NDLTFMLQTMGPNMDPMMAQQ--IMGQIMELKKMPDFAKRIREFQPQPDPIAQ 623 (726) Q Consensus 563 ---------------~~~~~~~-~~~-~~l~~l~q~~~~~~~~~~~~~--~~~~~~~~~~~~e~~~~l~~~~~~~~~~~q 623 (726) ....... .-. ++....+..+.+..++..... ++..+.+.+.++...+..........+... T Consensus 513 ~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~ 592 (706) T protein:vir:10 513 RVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGI 592 (706) T ss_pred ceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCC Confidence 1111111 111 112222222233333333322 223344555555554444333322222212 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQG 703 (726) Q Consensus 624 q~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~ 703 (726) . ++...+..+...+++++++++.. .+..+.+++..+.++++.+.+.+. .+...+.. ..+.++++++++. T Consensus 593 ~-~~~~~~eq~~~~q~qq~q~~q~~-------~~~~~~~aq~~~~qA~~~k~~a~~--~q~~~~a~-~a~~qa~~~~~~~ 661 (706) T protein:vir:10 593 V-KPRNQQEQAIVQQAQQAQATQPD-------PNMLLAQAQMVVAQAEAQKSQNET--VQTQIKAF-TAQQDAMESQANT 661 (706) T ss_pred c-cccchhHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH-HHHHHHHHHHHHH Confidence 1 11111211111222222221111 111122222222222211111111 11111110 1112222222221 Q ss_pred H--HHH-HHHHHHHHHHHHHHH---HhcC Q lcl|NC_013692. 704 K--LAM-LNSQLKRLDEATSAR---TSQK 726 (726) Q Consensus 704 ~--~~~-l~~~~~~~~~~~~a~---~~~q 726 (726) - ... .+.+.....+...+- +..| T Consensus 662 ~~~~~~a~~~~~~~~~q~~q~l~~~~a~q 690 (706) T protein:vir:10 662 VYKLAQARNIDDKAVMETLRLLKEVAASQ 690 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1 111 111222222222211 1122 No 118 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.10 E-value=1.6e-10 Score=74.35 Aligned_cols=613 Identities=11% Similarity=0.041 Sum_probs=190.0 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCC--CCCCCCCCC-CCcCCCHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINR----WLDYMHVRG--EGKPKTEKG-KSAVQPPTIR 83 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~----~~~~y~~~~--~~~~~~~~g-rs~~v~~~v~ 83 (726) |.. +.++++ ...-|.+.+ +.+.+..+. .|..-+.+.+ T Consensus 1 ~~~------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 44 (776) T protein:vir:93 1 MFD------------------------------------LNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAV 44 (776) T ss_pred CCC------------------------------------ccccccccccccccccccCCCCCcccchhcccCCCCCHHHH Confidence 110 001111 122233333 111111111 1334444555 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHH-HHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVL-NQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSR 162 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~-n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~ 162 (726) +....++..+.+-+....++ ...|.-.-+|+ ..+|..... ..|+..-+.++..+. |+..++.-.+ T Consensus 45 ~~~~~l~~~~~~~~~~~~~~----------r~~a~~d~~fy~G~Qw~~~~~--~~l~~~g~p~~~~N~--i~~~i~~v~g 110 (776) T protein:vir:93 45 ELHSRLLSYYRQELSRQQDN----------RAEMAVDEDYYDNIQWSQDEI--DELKERGQAPTVYNV--ISQSVNWIIG 110 (776) T ss_pred HHHHHHHHHHHHHHhhchHH----------HHHHHHHHHHhCCCCCCHHHH--HHHHhcCCceEEecc--hHHHHHHHHH Confidence 55555555444444221111 11222222222 222221110 111111111111110 1111222233 Q ss_pred eEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeecccee Q lcl|NC_013692. 163 TVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTV 242 (726) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i 242 (726) ..+..++.+++.|+.. ....++.....+..+ ....+......+.+|.+....|.||..+.. ..++ T Consensus 111 ~~~~nr~~~~~~p~~~-~d~~~Ae~l~~~~~~-~~~~~~~~~~~~~af~d~~~~G~G~~~v~~-----------d~~~-- 175 (776) T protein:vir:93 111 SEKRGRSDFKVLPRRK-DGGKAAERKTALLKY-LSDVNHTPFERSMAFEETTKAGIGWLESQV-----------QDEN-- 175 (776) T ss_pred HHHhCCcceEEecCCh-hHHHHHHHHHHHHHH-HHHhhcHHHHHHHHHHHhhhcCcceEEEEe-----------eccC-- Confidence 4445666677777633 333344444444432 234456677888899999999999854311 0000 Q ss_pred eeechhheeeCCCCCC--chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Ccccchhhcccchhhhhcccccccc Q lcl|NC_013692. 243 QVCDYNNIVIDPSCGS--DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEGQNLLSEPDYTGPSEGVRNFDF 314 (726) Q Consensus 243 ~~v~p~~~~~dp~a~~--d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (726) .=+|-.+. +..+.-|=-..+..+++|..-..+. -+.+.+ +.....+.................. T Consensus 176 --------~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 247 (776) T protein:vir:93 176 --------DGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDG 247 (776) T ss_pred --------CCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccc Confidence 00110000 1111100001122234442211111 111111 1000000000000000000000000 Q ss_pred CCcCCceEEEEEEEEEeec-----CCCc--eEEEEEEEEE---------C--CEE------------------------- Q lcl|NC_013692. 315 QDKSRKRLVVHEYWGYYDI-----HGDG--VLHPIVATWV---------G--AVM------------------------- 351 (726) Q Consensus 315 ~~~~~~~v~v~E~w~~~~~-----~~~g--~~~~~~~~~~---------g--~~~------------------------- 351 (726) .+ ......+..+|..... ..+- +.++++..+. | ..+ T Consensus 248 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~ 326 (776) T protein:vir:93 248 DD-AMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPM 326 (776) T ss_pred cc-cccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheee Confidence 00 0000111112211110 0011 1232221111 0 011 Q ss_pred ------EEeccCCCCCCccc--eEEeeeeeecCcccC-CChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccc Q lcl|NC_013692. 352 ------IRMEENPFPDKRIP--YVVVNYIPRKRDLYG-ESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDV 422 (726) Q Consensus 352 ------l~~~~~P~~~~~~P--f~~~~~~~~~~~~~g-~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~ 422 (726) +..+.....++..| +-.|++.++++...+ .|....+...=.-.-.+.+...-.+ ..++..+.+-. T Consensus 327 ~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~------~~~l~~~~~~~ 400 (776) T protein:vir:93 327 MRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKA------LYILSTNKVLM 400 (776) T ss_pred eeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHH------HHhhcCCceee Confidence 11111112222233 345556666555443 3443333333333333333222111 12333333211 Q ss_pred hhhhhhcCCceE--eecCccch---hhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHH Q lcl|NC_013692. 423 TNRRRFDRGENY--EFNPGADP---RAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGA 497 (726) Q Consensus 423 ~d~~~~~~g~vi--~~~~~~~~---~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~ 497 (726) .+.........+ ..++++.. .+++....+...++....+++++......+..+ .|.+..++|..+.++++. T Consensus 401 ~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~----tGi~~~~~G~~~n~~Sg~ 476 (776) T protein:vir:93 401 EEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQV----GGVTDEMLGRTTNAVSGV 476 (776) T ss_pred ccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHh----hCcChHHhCCCcchhhHH Confidence 111000011111 12333322 223334444455566777777777777766554 476666666666777776 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecc------------- Q lcl|NC_013692. 498 LDAASKREL-GILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS------------- 563 (726) Q Consensus 498 ~~~~~~~~~-~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~------------- 563 (726) ..++.+... ..+..+.+.+....+.+.+++..+.-.- .+.+..+.|...+-..++ |.|+.+ T Consensus 477 ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~----~~~~r~~ri~~~~~~~~~-v~in~~~~~nd~~~~~~dv 551 (776) T protein:vir:93 477 AIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQY----MTEEKQFRITNSRGNPEY-VTVNDGLPENDITRTKADF 551 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCcceEEEEeecCCCcce-EEecccchhhhhccceeeE Confidence 655544432 2345555566666666666555443321 122234445443322222 223221 Q ss_pred -cchHHHHHH-HHH-HHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 564 -TAEEDNAKV-NDL-TFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAE 640 (726) Q Consensus 564 -~~~~~~~~~-~~l-~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~ 640 (726) ......... .+. ...+..+....++.....+...+++.+..+...+..+..+........... .....+...+ T Consensus 552 ~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~----~~~~e~~~~q 627 (776) T protein:vir:93 552 IIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQD----EPTPEEIARE 627 (776) T ss_pred EEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchh----hcchhHHHHH Confidence 111111111 111 111122223344444444555555555555443333332221111111100 0011111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH Q lcl|NC_013692. 641 RARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLK---RLDE 717 (726) Q Consensus 641 ~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~---~~~~ 717 (726) +++.+.+..++..+.+++...+++..+.++++...+.+. .+...+......+ .....+++... ..+. T Consensus 628 q~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa------~~~~~~a~~~~~~----a~q~a~qa~~~~~~~~~~ 697 (776) T protein:vir:93 628 QAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKA------KHISRMAIREGVG----AVKDATDAATAIAFMPEL 697 (776) T ss_pred HHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhh------hhhhhcchhhhhh----hhhhhhhhhhhhhhhhhh Confidence 111111111111111111111111111111111100000 0000000011111 11111111000 0001 Q ss_pred HHHHHHhcC Q lcl|NC_013692. 718 ATSARTSQK 726 (726) Q Consensus 718 ~~~a~~~~q 726 (726) +..+.+..+ T Consensus 698 a~~a~~~~~ 706 (776) T protein:vir:93 698 AGLSDGILR 706 (776) T ss_pred hhhhhhhhc Confidence 111111111 No 119 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.08 E-value=2.8e-09 Score=67.44 Aligned_cols=459 Identities=13% Similarity=0.030 Sum_probs=188.9 Q ss_pred CCCCCCccchhc--CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCC--CCCC---CCCcCCCHHHH Q lcl|NC_013692. 11 LPNEDGDPSKRL--QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKP--KTEK---GKSAVQPPTIR 83 (726) Q Consensus 11 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~--~~~~---grs~~v~~~v~ 83 (726) |.+-+-+.++-. .+...++. ...|...+. .+..+.....+-.+||.|....+. +... .+-.+|..-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e-~~~i~~L~~----~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~ 75 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDV-VDKVNGLYQ----QLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSA 75 (504) T ss_pred CCccCCcccccccccCCCCHHH-HHHHHHHHH----HHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHH Confidence 544444444433 23333332 112222222 233344455667889986553210 0000 01112333333 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 84 KQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 84 ~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) ..|+.+.-.| ++-| |. .+++.+.. ..+..+| ..|+.-.....++++++++|.+++.+|=+ T Consensus 76 ~iVd~~a~rl---~~~G-----f~--~~d~~~~~----~~l~~i~-~~N~ld~~~~~~~~~a~iyG~af~~v~~~----- 135 (504) T protein:vir:99 76 KAVDTLARRC---NLES-----FV--WPDGDYGS----IGGPDVW-DENFFATKANNAMVSSLIHGPAFLINTEG----- 135 (504) T ss_pred HHHHHHHhhh---ccce-----ee--CCCCChhh----HHHHHHH-HhcChhhHHHHHHHHHHhhCceeEEEecC----- Confidence 3343332221 1111 21 12222222 2344444 46665556778999999999999876311 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) ++ | ...|.|. T Consensus 136 -------------------------------~d--------------------~-------------------~~~~~I~ 145 (504) T protein:vir:99 136 -------------------------------GA--------------------G-------------------EPDSLIH 145 (504) T ss_pred -------------------------------CC--------------------C-------------------CceeEEE Confidence 00 0 0113456 Q ss_pred eechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 244 VCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 244 ~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) .++|.+++ |||.... -.+.++ ++ .++. .| . T Consensus 146 ~~sP~~~~~iyD~~~~~----~~~a~~-~~-~~d~---~g---------------------------------------~ 177 (504) T protein:vir:99 146 VKSAMQATGEWNSRRNA----MDSLLS-IT-SRDA---EG---------------------------------------H 177 (504) T ss_pred EeccceeEEEEeCCCCc----eeEEEE-EE-EecC---CC---------------------------------------e Confidence 67888864 6764321 111111 11 0000 00 0 Q ss_pred EEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHHH Q lcl|NC_013692. 322 LVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTRG 400 (726) Q Consensus 322 v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~~ 400 (726) ....++|. ++.. +.....++.....+..|.+++ +|++++...+...+.+|.|-+ +.++++++.+|+.++. T Consensus 178 ~~~~~~y~------~~~~--~~~~~~~~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~ 248 (504) T protein:vir:99 178 PTGIALYE------DGVT--VTADMDDDGDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIR 248 (504) T ss_pred EEEEEEEc------CCcE--EEEEEcCCceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHH Confidence 11112221 0000 000011111111223344445 799999988888888998754 6899999999999999 Q ss_pred HHHHHHhcCCCceEeeccccc---------chhhhhhcCCceEeecCccchh----hhcccccCccchhHHHHHHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALD---------VTNRRRFDRGENYEFNPGADPR----AAVHMHTFPEIPQSAQYMINLQQA 467 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~---------~~d~~~~~~g~vi~~~~~~~~~----~~i~~~~~~~~~~~~~~ll~~~~~ 467 (726) +.......+.|+..+ .|+-. +........++++.+....+.. ....+.+++. ..+..++.++.. T Consensus 249 ~~~~~e~~a~p~r~i-~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~--~~l~~~~~~l~~ 325 (504) T protein:vir:99 249 MDGHADVYSFPQLIL-LGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPA--SSPQPHIEMLEQ 325 (504) T ss_pred HHHHHHHhcchhhhh-ccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCC--CChHHHHHHHHH Confidence 999988888887655 33311 1112233345555554322110 0111122222 123344555555 Q ss_pred HHH---HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-cCeEEEEeccc Q lcl|NC_013692. 468 EAE---SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLD-DVEVVRITNEH 543 (726) Q Consensus 468 ~~e---~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d-~e~~iRi~~~~ 543 (726) .+. ..|+++....|..+.+.+.||.++......-........+.|..++++++++++.+....-. .....++.= . T Consensus 326 ~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v-~ 404 (504) T protein:vir:99 326 IAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDS-K 404 (504) T ss_pred HHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccee-E Confidence 555 45899998998765544457777777666666666677777778888888877665433211 011111100 0 Q ss_pred ceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhh-------------hccchhHHHHHHHHHHHhhhhhhhhhh Q lcl|NC_013692. 544 FVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMG-------------PNMDPMMAQQIMGQIMELKKMPDFAKR 610 (726) Q Consensus 544 ~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~-------------~~~~~~~~~~~~~~~~~~~~~~e~~~~ 610 (726) |-...+.......| ....+.+... ...+..+ ..+.....+. ....+... T Consensus 405 w~d~~~~s~a~~aD----------------a~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei-~r~~~e~~~~-~~~~~~~~ 466 (504) T protein:vir:99 405 FRSPLYLSKAAQAD----------------AGAKMLGAGPEWLKETEVGLELLGLTPQQA-KRALAERRRA-SSVSIIEA 466 (504) T ss_pred ecCCCccCHHHHHH----------------HHHHHHhhccccccchHHHHhhcCCCHHHH-HHHHHHHHHH-hhHHHHHH Confidence 21111111100000 0111111000 0000000 0001110000 00111111 Q ss_pred HHHHHhhhhhh----hhhHHH-HHHHHHHHHHHHHHHHH Q lcl|NC_013692. 611 IREFQPQPDPI----AQQKAQ-LELMLLQAQIEAERARA 644 (726) Q Consensus 611 l~~~~~~~~~~----~qq~~q-~e~q~~qaq~e~~~aq~ 644 (726) +......+... .+...+ +....-.+-....+ .- T Consensus 467 l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~-~~ 504 (504) T protein:vir:99 467 LNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTL-VG 504 (504) T ss_pred HhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCccc-CC Confidence 11000000000 000000 00000000000000 00 No 120 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.02 E-value=5.2e-09 Score=66.00 Aligned_cols=495 Identities=12% Similarity=0.063 Sum_probs=204.9 Q ss_pred HHHHHHHHHHHHHHH----------HHHHHHHHH----HHHHhccCC-CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 32 LAQLKQDYQEAKQVT----------DEKITQINR----WLDYMHVRG-EGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEP 96 (726) Q Consensus 32 ~~~~~~~~~~a~~~~----------~~~~~~~~~----~~~~y~~~~-~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~ 96 (726) .-+=++.++.++... ..+=..|-. =.+||++.- ....+ ..|. +-+.+..-..||... T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~-lrg~------~~~~~r~~~~ps~~~- 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVI-LRGG------DEGDQRPIYVPNGEK- 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeee-cCCc------cccccceeeehhhHH- Confidence 111122333333221 111112333 366676431 11111 1111 112233333444432 Q ss_pred hcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCc Q lcl|NC_013692. 97 FLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPD 176 (726) Q Consensus 97 f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~ 176 (726) .++...-|-+.+....+...++.....|+- |...++-+..+..--+++++.|-|+.++.|+...++- +|++.. T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~-~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~--~R~~v~---- 145 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIKV-LFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG--SRLSLH---- 145 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHH-HHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC--CCceEe---- Confidence 245555555566555555566666667765 4566777777888889999999999999999655421 111100 Q ss_pred chHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCCC Q lcl|NC_013692. 177 SSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSC 256 (726) Q Consensus 177 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a 256 (726) ..+|..... . .++.+. -.+..|+.-+-|..|.. T Consensus 146 ----------------~~DP~~~f~----------~--ed~d~~-------------------~~v~~v~~~~~~~~P~d 178 (527) T protein:vir:10 146 ----------------EVDPSTYFP----------Y--EDPRYP-------------------GQVLGVYLVDEYPHPDS 178 (527) T ss_pred ----------------ecCcceeee----------e--ecCCCC-------------------CceeeEEEeeeccCCcc Confidence 001111000 0 000000 00111211112344422 Q ss_pred CCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEE-EEE--Eee- Q lcl|NC_013692. 257 GSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHE-YWG--YYD- 332 (726) Q Consensus 257 ~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~w~--~~~- 332 (726) -...-+|.|.+++++ +|-..|.+ ....++++.+ .|. +++ T Consensus 179 ---~~~~~~~ar~~~~~~-~l~~~g~~---------------------------------~~~G~~~yt~~~w~lg~w~d 221 (527) T protein:vir:10 179 ---EKKNEKCARVQKYMK-TLDDDGKP---------------------------------VPGGAIKYTEELYEPGKWDD 221 (527) T ss_pred ---ccccceehhhhhhhh-hcCccccc---------------------------------ccCcceeeeeceeecccccc Confidence 122223444333333 21111100 0011334333 332 111 Q ss_pred cCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_013692. 333 IHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQ 412 (726) Q Consensus 333 ~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~ 412 (726) .+.-.+-.-.+.+..++++++..++|+ +.+||++++..+.+++.||+|-..+++++.+.+|+.++-...++..+++|. T Consensus 222 ~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 222 RPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 110011122345567788888777776 679999999999999999999999999999999999999999999998887 Q ss_pred eEeeccc--ccch---hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccc Q lcl|NC_013692. 413 VGVMKGA--LDVT---NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAAL 487 (726) Q Consensus 413 ~~~~~ga--v~~~---d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~ 487 (726) +.. .|+ ++.. +.....||.+|....++.... + ...+. -+.+...+..+...+.+++|++.+..|..+.+. T Consensus 300 ~~~-tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~-v--~~~~~-la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~ 374 (527) T protein:vir:10 300 YAT-DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYR-V--NGVAS-LEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAV 374 (527) T ss_pred eee-cccccccccCCcCccccCCceeEecCCCcceee-c--cchhh-hHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc Confidence 765 333 2211 223456888888766543320 1 11122 234667788888899999999999999544332 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccch Q lcl|NC_013692. 488 GDTATAVRGALDAASKRELGILRR-LSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAE 566 (726) Q Consensus 488 ~~ta~~i~~~~~~~~~~~~~~~~~-~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~ 566 (726) +-|+.++...+.. .-.+.--+. +..++.+.+. ..++..++..-+.+-+. +......+++.-+... T Consensus 375 ~~SG~ALeL~L~P--Llar~~rk~L~~~~vqrq~~--~~~~~~~L~aye~v~~~----------d~~~~~~v~ivf~p~l 440 (527) T protein:vir:10 375 AESGIALDLKLSA--ILSSCAEQELELKSVLKQFF--YNLVTQWLPAYEGVGID----------DADKKLTVTITFRDPK 440 (527) T ss_pred CcHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhh--hhhHHHHHHHhhhcccC----------CCccccceEEEecccC Confidence 3344443332222 111000000 0111111111 11222211111111111 1111122333333221 Q ss_pred --HHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH---H Q lcl|NC_013692. 567 --EDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAE---R 641 (726) Q Consensus 567 --~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~---~ 641 (726) -+.+..++...+.+. .-+... ..+..+.+...+.+.+..+++.... ..+++.++++.. . T Consensus 441 P~D~~avie~v~tL~~a--Gi~S~~---tAv~~L~~~~g~eD~E~E~~~I~~e-----------ra~~a~a~a~A~~~~~ 504 (527) T protein:vir:10 441 PVNSEKRFNQLLQLWEA--GLIPAK---KLTEELSKIMGFELTEEDFKQATED-----------KKTQGIAQAEAADPFG 504 (527) T ss_pred CCCHHHHHHHHHHHHHc--CchhHH---HHHHHHHhccCCCChHHHHHHHHHH-----------HHHHhHHhhhhcCchh Confidence 122222222221111 001100 0111111111111111111100000 000000000000 0 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 642 ARAAHY--MSGAGLQDSKVGTEQAKARAL 668 (726) Q Consensus 642 aq~q~~--~~~~~~~~~~~~~eqaq~~q~ 668 (726) +++-.. ....+.-++-.. .-+ T Consensus 505 a~~~~~~g~~~~~~d~~~~~------~~~ 527 (527) T protein:vir:10 505 AQMAAEQGIPDEEDDQALNG------QPL 527 (527) T ss_pred hhhccccCCCCCCcccccCC------CCC Confidence 000000 000000000000 000 No 121 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.01 E-value=5.6e-09 Score=65.81 Aligned_cols=495 Identities=12% Similarity=0.063 Sum_probs=204.6 Q ss_pred HHHHHHHHHHHHHHH----------HHHHHHHHH----HHHHhccCC-CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHh Q lcl|NC_013692. 32 LAQLKQDYQEAKQVT----------DEKITQINR----WLDYMHVRG-EGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEP 96 (726) Q Consensus 32 ~~~~~~~~~~a~~~~----------~~~~~~~~~----~~~~y~~~~-~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~ 96 (726) .-+=++.++.++... ..+=..|-. =.+||++.- ....+ ..|. +-+.+..-..||... T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~-lrg~------~~~~~r~~~~ps~~~- 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVI-LRGG------DEGDQRPIYVPNGEK- 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeee-cCCc------cccccceeeehhhHH- Confidence 111122333333221 111112333 366676431 11111 1111 112233333444432 Q ss_pred hcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCc Q lcl|NC_013692. 97 FLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPD 176 (726) Q Consensus 97 f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~ 176 (726) .++...-|-+.+....+...++.....|+- |...++-+..+..--+++++.|-|+.++.|+...++- +|++.. T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~-~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~--~R~~v~---- 145 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIRV-LFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG--SRLSLH---- 145 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHH-HHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC--CCceEe---- Confidence 245555555566555555566666667765 4566777777888889999999999999999655421 111100 Q ss_pred chHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCCC Q lcl|NC_013692. 177 SSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSC 256 (726) Q Consensus 177 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a 256 (726) ..+|..... . .++.+. -.+..|+.-+-|..|.. T Consensus 146 ----------------~~DP~~~f~----------~--ed~d~~-------------------~~v~~v~~~~~~~~P~d 178 (527) T protein:vir:10 146 ----------------EVDPSTYFP----------Y--EDPRYP-------------------GQVLGVYLVDEYPHPDS 178 (527) T ss_pred ----------------ecCcceeee----------e--ecCCCC-------------------CceeeEEEeeeccCCcc Confidence 001111000 0 000000 00111211112344422 Q ss_pred CCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEE-EEE--Eee- Q lcl|NC_013692. 257 GSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHE-YWG--YYD- 332 (726) Q Consensus 257 ~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~w~--~~~- 332 (726) -...-+|.|.+++++ +|-..|.+ ....++++.+ .|. +++ T Consensus 179 ---~~~~~~~ar~~~~~~-~l~~~g~~---------------------------------~~~G~~~yt~~~w~lg~w~d 221 (527) T protein:vir:10 179 ---EKKNEKCARVQKYMK-TLDDDGKP---------------------------------VPGGAIKYTEELYEPGKWDD 221 (527) T ss_pred ---ccccceehhhhhhhh-hcCccccc---------------------------------ccCcceeeeeceeecccccc Confidence 122223444333333 21111100 0011334333 332 111 Q ss_pred cCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_013692. 333 IHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQ 412 (726) Q Consensus 333 ~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~ 412 (726) .+.-.+-.-.+.+..++++++..++|+ +.+||++++..+.+++.||+|-..+++++.+.+|+.++-...++..+++|. T Consensus 222 ~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 222 RPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 110011122345567788888777776 679999999999999999999999999999999999999999999998887 Q ss_pred eEeeccc--ccch---hhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccc Q lcl|NC_013692. 413 VGVMKGA--LDVT---NRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAAL 487 (726) Q Consensus 413 ~~~~~ga--v~~~---d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~ 487 (726) +.. .|+ ++.. +.....||.+|....++.... + ...+. -+.+...+..+...+.+++|++.+..|..+.+. T Consensus 300 ~~~-tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~-v--~~~~~-la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~ 374 (527) T protein:vir:10 300 YAT-DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYR-V--NGVAS-LEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAV 374 (527) T ss_pred eee-cccccccccCCcCccccCCceeEecCCCcceee-c--cchhh-hHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc Confidence 765 333 2211 223456888888766543320 1 11122 234677788888899999999999999544332 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccch Q lcl|NC_013692. 488 GDTATAVRGALDAASKRELGILRR-LSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAE 566 (726) Q Consensus 488 ~~ta~~i~~~~~~~~~~~~~~~~~-~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~ 566 (726) +-|+.++...+.. .-.+.--+. +..++.+.+. ..++..++..-+.+-+. +......+++.-+... T Consensus 375 ~~SG~ALeL~L~P--Llar~~rk~L~~~~Vqrq~~--~~~~~~~L~aye~v~~~----------d~~~~~~v~ivf~p~l 440 (527) T protein:vir:10 375 AESGIALDLKLSA--ILSSCAEQELELKSVLKQFF--YNLVTQWLPAYEGVGID----------DADKKLTVTITFRDPK 440 (527) T ss_pred CcHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhh--hhhHHHHHHHhhhcccC----------CCccccceEEEecccC Confidence 3344443332222 111000000 0111111111 11222211111111111 1111122233333221 Q ss_pred H--HHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH---H Q lcl|NC_013692. 567 E--DNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAE---R 641 (726) Q Consensus 567 ~--~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~---~ 641 (726) . +.+..++...+.+. .-+... ..+..+.+...+.+.+..+++.... ..+++.++++.. . T Consensus 441 P~D~~avie~v~tL~~a--GiiS~e---tAv~~L~~~~g~eD~E~E~~~I~~e-----------ra~~a~a~a~a~~~~~ 504 (527) T protein:vir:10 441 PVNNEKRFAQLLELWEA--GLIPAK---KLTEELSKIMGFELTEEDFRQATED-----------KKTQGIAQAEAADPFG 504 (527) T ss_pred CCCHHHHHHHHHHHHHc--CchhHH---HHHHHHHhccCCCchHHHHHHHHHH-----------HHHHhHHhhhhcCchh Confidence 1 12222222111111 001100 0111111111111111111100000 000000000000 0 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 642 ARAAHY--MSGAGLQDSKVGTEQAKARAL 668 (726) Q Consensus 642 aq~q~~--~~~~~~~~~~~~~eqaq~~q~ 668 (726) +++-.. ....+.-++-.. .-+ T Consensus 505 a~~~~~~g~~~~~~d~~~~~------~~~ 527 (527) T protein:vir:10 505 AQMAAEQGIPDEEDDQALNG------QPL 527 (527) T ss_pred hhhccccCCCCCCcccccCC------CCC Confidence 000000 000000000000 000 No 122 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=98.91 E-value=1.4e-09 Score=69.13 Aligned_cols=600 Identities=10% Similarity=0.050 Sum_probs=191.7 Q ss_pred CC-CCCC-CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 64 GE-GKPK-TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 64 ~~-~~~~-~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) |+ .... ..++-|-.....-.+...|+.. ..-...++ -.++.+....|-..+|..+. ...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------R~~a~~d~~fy~G~Qw~~~~--~~~l~~~ 65 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCS----DIDSQPKW---------RDAANKACAYYDGDQLPPEV--LQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHH----HHHhhHHH---------HHHHHHHHHhhcCCCCCHHH--HHHHHhc Confidence 32 1100 1122222221111111122221 11111111 11111111112222221111 0112211 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccce Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQV 220 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 220 (726) -+.++..+ .|+..++.-.+.++..++.+.++|+. +++...++.....+..+- ..........+.+|.+....|.|| T Consensus 66 g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~-~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:81 66 GQPMTIHN--LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADA-CRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred CCCcEEec--cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHH-HHhhchhHHHHHHHHHhhhcCcce Confidence 12222111 11112233345566788888888864 333333333333333221 224456678888999999999987 Q ss_pred eeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Cccc Q lcl|NC_013692. 221 RAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEG 293 (726) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~ 293 (726) ..+. +++..|--++..+. +..+.-|=-..+..+++|..=..+. .+.+.+ +... T Consensus 143 ~~~~--------------------~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a 202 (714) T protein:vir:81 143 VEVR--------------------RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA 202 (714) T ss_pred EEec--------------------cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch Confidence 3321 11111111111111 1111110001112233332111000 111111 1000 Q ss_pred chhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCC----------ceEEEEEEEE---------ECCEEEE Q lcl|NC_013692. 294 QNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGD----------GVLHPIVATW---------VGAVMIR 353 (726) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~----------g~~~~~~~~~---------~g~~~l~ 353 (726) ..+..... ...+..+.. +.......+.-++.+..++.... -+.++++-.+ .|+.+.. T Consensus 203 ~~i~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~ 279 (714) T protein:vir:81 203 QVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAF 279 (714) T ss_pred hhhhhhhh---hhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEe Confidence 00000000 000000000 00001111112222111111111 0112221110 1222222 Q ss_pred eccCC-------------------------------CCCCccce--EEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 354 MEENP-------------------------------FPDKRIPY--VVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 354 ~~~~P-------------------------------~~~~~~Pf--~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) +..+| ...+..|| -.|++.|+.+... +......-+.|. T Consensus 280 d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---------~~~g~~~G~vr~ 350 (714) T protein:vir:81 280 DKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---------DKTGEPYGLISR 350 (714) T ss_pred CccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeee---------eccCceeehhhh Confidence 22111 00011122 1233333222221 111112234444 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-----Cccchhhhcccc-------------cCccchhHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-----PGADPRAAVHMH-------------TFPEIPQSAQYMI 462 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-----~~~~~~~~i~~~-------------~~~~~~~~~~~ll 462 (726) ++|+-...|.....+.. +++ ........|++.... ..+.+.+.+.+. .+.+.+......+ T Consensus 351 ~~d~Qr~~N~~~s~~~~-~l~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:81 351 AIPAQDEVNFRRIKLTW-LLQ-AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred chhHHHHHHHHHHHHHH-hhc-CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHH Confidence 54443322211111000 111 011112233332111 112222222221 1222344556666 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +++......+. ..+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- .+. T Consensus 429 ~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----~~~ 500 (714) T protein:vir:81 429 QVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----LKK 500 (714) T ss_pred HHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCC Confidence 66666555443 346887777887777888887776666543 355566667777777776666543321 233 Q ss_pred ccceecchhhcc---cccceeee------------------cccchHHHHHH--HHHHHHHHHhhhccchhHHHHHHHHH Q lcl|NC_013692. 542 EHFVDIRRDDLA---GNFDLKLD------------------ISTAEEDNAKV--NDLTFMLQTMGPNMDPMMAQQIMGQI 598 (726) Q Consensus 542 ~~~v~v~~~~~~---~~~dv~i~------------------~~~~~~~~~~~--~~l~~l~q~~~~~~~~~~~~~~~~~~ 598 (726) +..+.|..++-. .++ +.++ +.......... ++....+..+....++.....+...+ T Consensus 501 erv~RI~~e~~~~~~~~~-v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~ 579 (714) T protein:vir:81 501 RRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLW 579 (714) T ss_pred CcEEEEeccCCCcCcceE-EeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHH Confidence 344555422111 011 1111 11111111112 22232333334456666666667777 Q ss_pred HHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 599 MELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 599 ~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) ++.+..+...+.+.......... ....+.+++..+++.+++.++.++.+.+.+..+++.+..++++.+.++++.+...+ T Consensus 580 l~~~d~p~~~el~~~ir~~~~~~-~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~ 658 (714) T protein:vir:81 580 VNLLDVPQKQEFVERIRAALGTP-KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNAS 658 (714) T ss_pred HHhcCCCCHHHHHHHHHHHcCCC-CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888877766555554433222 12222233333333333222222222222222222222222222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 679 FLEQESGVQQARKRELQQAQSEA---QGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 679 ~~~qe~~~~~~~e~e~~~~q~~~---q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........++... +..++.+ .+.++.++....-.+++..+...++ T Consensus 659 a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~ 708 (714) T protein:vir:81 659 AQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHH Confidence 111000000000000 0000000 0001111111111111111111111 No 123 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=98.91 E-value=1.4e-09 Score=69.13 Aligned_cols=600 Identities=10% Similarity=0.050 Sum_probs=191.7 Q ss_pred CC-CCCC-CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 64 GE-GKPK-TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 64 ~~-~~~~-~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) |+ .... ..++-|-.....-.+...|+.. ..-...++ -.++.+....|-..+|..+. ...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------R~~a~~d~~fy~G~Qw~~~~--~~~l~~~ 65 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCS----DIDSQPKW---------RDAANKACAYYDGDQLPPEV--LQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHH----HHHhhHHH---------HHHHHHHHHhhcCCCCCHHH--HHHHHhc Confidence 32 1100 1122222221111111122221 11111111 11111111112222221111 0112211 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccce Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQV 220 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 220 (726) -+.++..+ .|+..++.-.+.++..++.+.++|+. +++...++.....+..+- ..........+.+|.+....|.|| T Consensus 66 g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~-~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:99 66 GQPMTIHN--LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADA-CRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred CCCcEEec--cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHH-HHhhchhHHHHHHHHHhhhcCcce Confidence 12222111 11112233345566788888888864 333333333333333221 224456678888999999999987 Q ss_pred eeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Cccc Q lcl|NC_013692. 221 RAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEG 293 (726) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~ 293 (726) ..+. +++..|--++..+. +..+.-|=-..+..+++|..=..+. .+.+.+ +... T Consensus 143 ~~~~--------------------~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a 202 (714) T protein:vir:99 143 VEVR--------------------RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA 202 (714) T ss_pred EEec--------------------cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch Confidence 3321 11111111111111 1111110001112233332111000 111111 1000 Q ss_pred chhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCC----------ceEEEEEEEE---------ECCEEEE Q lcl|NC_013692. 294 QNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGD----------GVLHPIVATW---------VGAVMIR 353 (726) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~----------g~~~~~~~~~---------~g~~~l~ 353 (726) ..+..... ...+..+.. +.......+.-++.+..++.... -+.++++-.+ .|+.+.. T Consensus 203 ~~i~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~ 279 (714) T protein:vir:99 203 QVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAF 279 (714) T ss_pred hhhhhhhh---hhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEe Confidence 00000000 000000000 00001111112222111111111 0112221110 1222222 Q ss_pred eccCC-------------------------------CCCCccce--EEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 354 MEENP-------------------------------FPDKRIPY--VVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 354 ~~~~P-------------------------------~~~~~~Pf--~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) +..+| ...+..|| -.|++.|+.+... +......-+.|. T Consensus 280 d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---------~~~g~~~G~vr~ 350 (714) T protein:vir:99 280 DKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---------DKTGEPYGLISR 350 (714) T ss_pred CccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeee---------eccCceeehhhh Confidence 22111 00011122 1233333222221 111112234444 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-----Cccchhhhcccc-------------cCccchhHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-----PGADPRAAVHMH-------------TFPEIPQSAQYMI 462 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-----~~~~~~~~i~~~-------------~~~~~~~~~~~ll 462 (726) ++|+-...|.....+.. +++ ........|++.... ..+.+.+.+.+. .+.+.+......+ T Consensus 351 ~~d~Qr~~N~~~s~~~~-~l~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:99 351 AIPAQDEVNFRRIKLTW-LLQ-AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred chhHHHHHHHHHHHHHH-hhc-CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHH Confidence 54443322211111000 111 011112233332111 112222222221 1222344556666 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +++......+. ..+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- .+. T Consensus 429 ~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----~~~ 500 (714) T protein:vir:99 429 QVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----LKK 500 (714) T ss_pred HHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCC Confidence 66666555443 346887777887777888887776666543 355566667777777776666543321 233 Q ss_pred ccceecchhhcc---cccceeee------------------cccchHHHHHH--HHHHHHHHHhhhccchhHHHHHHHHH Q lcl|NC_013692. 542 EHFVDIRRDDLA---GNFDLKLD------------------ISTAEEDNAKV--NDLTFMLQTMGPNMDPMMAQQIMGQI 598 (726) Q Consensus 542 ~~~v~v~~~~~~---~~~dv~i~------------------~~~~~~~~~~~--~~l~~l~q~~~~~~~~~~~~~~~~~~ 598 (726) +..+.|..++-. .++ +.++ +.......... ++....+..+....++.....+...+ T Consensus 501 erv~RI~~e~~~~~~~~~-v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~ 579 (714) T protein:vir:99 501 RRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLW 579 (714) T ss_pred CcEEEEeccCCCcCcceE-EeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHH Confidence 344555422111 011 1111 11111111112 22232333334456666666667777 Q ss_pred HHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 599 MELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 599 ~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) ++.+..+...+.+.......... ....+.+++..+++.+++.++.++.+.+.+..+++.+..++++.+.++++.+...+ T Consensus 580 l~~~d~p~~~el~~~ir~~~~~~-~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~ 658 (714) T protein:vir:99 580 VNLLDVPQKQEFVERIRAALGTP-KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNAS 658 (714) T ss_pred HHhcCCCCHHHHHHHHHHHcCCC-CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888877766555554433222 12222233333333333222222222222222222222222222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 679 FLEQESGVQQARKRELQQAQSEA---QGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 679 ~~~qe~~~~~~~e~e~~~~q~~~---q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........++... +..++.+ .+.++.++....-.+++..+...++ T Consensus 659 a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~ 708 (714) T protein:vir:99 659 AQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHH Confidence 111000000000000 0000000 0001111111111111111111111 No 124 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=98.91 E-value=1.4e-09 Score=69.13 Aligned_cols=600 Identities=10% Similarity=0.050 Sum_probs=191.7 Q ss_pred CC-CCCC-CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 64 GE-GKPK-TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 64 ~~-~~~~-~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) |+ .... ..++-|-.....-.+...|+.. ..-...++ -.++.+....|-..+|..+. ...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------R~~a~~d~~fy~G~Qw~~~~--~~~l~~~ 65 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCS----DIDSQPKW---------RDAANKACAYYDGDQLPPEV--LQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHH----HHHhhHHH---------HHHHHHHHHhhcCCCCCHHH--HHHHHhc Confidence 32 1100 1122222221111111122221 11111111 11111111112222221111 0112211 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccce Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQV 220 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 220 (726) -+.++..+ .|+..++.-.+.++..++.+.++|+. +++...++.....+..+- ..........+.+|.+....|.|| T Consensus 66 g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~-~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:32 66 GQPMTIHN--LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADA-CRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred CCCcEEec--cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHH-HHhhchhHHHHHHHHHhhhcCcce Confidence 12222111 11112233345566788888888864 333333333333333221 224456678888999999999987 Q ss_pred eeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Cccc Q lcl|NC_013692. 221 RAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEG 293 (726) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~ 293 (726) ..+. +++..|--++..+. +..+.-|=-..+..+++|..=..+. .+.+.+ +... T Consensus 143 ~~~~--------------------~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a 202 (714) T protein:vir:32 143 VEVR--------------------RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA 202 (714) T ss_pred EEec--------------------cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch Confidence 3321 11111111111111 1111110001112233332111000 111111 1000 Q ss_pred chhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCC----------ceEEEEEEEE---------ECCEEEE Q lcl|NC_013692. 294 QNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGD----------GVLHPIVATW---------VGAVMIR 353 (726) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~----------g~~~~~~~~~---------~g~~~l~ 353 (726) ..+..... ...+..+.. +.......+.-++.+..++.... -+.++++-.+ .|+.+.. T Consensus 203 ~~i~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~ 279 (714) T protein:vir:32 203 QVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAF 279 (714) T ss_pred hhhhhhhh---hhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEe Confidence 00000000 000000000 00001111112222111111111 0112221110 1222222 Q ss_pred eccCC-------------------------------CCCCccce--EEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 354 MEENP-------------------------------FPDKRIPY--VVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 354 ~~~~P-------------------------------~~~~~~Pf--~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) +..+| ...+..|| -.|++.|+.+... +......-+.|. T Consensus 280 d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---------~~~g~~~G~vr~ 350 (714) T protein:vir:32 280 DKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---------DKTGEPYGLISR 350 (714) T ss_pred CccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeee---------eccCceeehhhh Confidence 22111 00011122 1233333222221 111112234444 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-----Cccchhhhcccc-------------cCccchhHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-----PGADPRAAVHMH-------------TFPEIPQSAQYMI 462 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-----~~~~~~~~i~~~-------------~~~~~~~~~~~ll 462 (726) ++|+-...|.....+.. +++ ........|++.... ..+.+.+.+.+. .+.+.+......+ T Consensus 351 ~~d~Qr~~N~~~s~~~~-~l~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:32 351 AIPAQDEVNFRRIKLTW-LLQ-AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred chhHHHHHHHHHHHHHH-hhc-CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHH Confidence 54443322211111000 111 011112233332111 112222222221 1222344556666 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +++......+. ..+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- .+. T Consensus 429 ~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----~~~ 500 (714) T protein:vir:32 429 QVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----LKK 500 (714) T ss_pred HHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCC Confidence 66666555443 346887777887777888887776666543 355566667777777776666543321 233 Q ss_pred ccceecchhhcc---cccceeee------------------cccchHHHHHH--HHHHHHHHHhhhccchhHHHHHHHHH Q lcl|NC_013692. 542 EHFVDIRRDDLA---GNFDLKLD------------------ISTAEEDNAKV--NDLTFMLQTMGPNMDPMMAQQIMGQI 598 (726) Q Consensus 542 ~~~v~v~~~~~~---~~~dv~i~------------------~~~~~~~~~~~--~~l~~l~q~~~~~~~~~~~~~~~~~~ 598 (726) +..+.|..++-. .++ +.++ +.......... ++....+..+....++.....+...+ T Consensus 501 erv~RI~~e~~~~~~~~~-v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~ 579 (714) T protein:vir:32 501 RRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLW 579 (714) T ss_pred CcEEEEeccCCCcCcceE-EeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHH Confidence 344555422111 011 1111 11111111112 22232333334456666666667777 Q ss_pred HHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 599 MELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 599 ~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) ++.+..+...+.+.......... ....+.+++..+++.+++.++.++.+.+.+..+++.+..++++.+.++++.+...+ T Consensus 580 l~~~d~p~~~el~~~ir~~~~~~-~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~ 658 (714) T protein:vir:32 580 VNLLDVPQKQEFVERIRAALGTP-KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNAS 658 (714) T ss_pred HHhcCCCCHHHHHHHHHHHcCCC-CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888877766555554433222 12222233333333333222222222222222222222222222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 679 FLEQESGVQQARKRELQQAQSEA---QGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 679 ~~~qe~~~~~~~e~e~~~~q~~~---q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........++... +..++.+ .+.++.++....-.+++..+...++ T Consensus 659 a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~ 708 (714) T protein:vir:32 659 AQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHH Confidence 111000000000000 0000000 0001111111111111111111111 No 125 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=98.91 E-value=1.4e-09 Score=69.13 Aligned_cols=600 Identities=10% Similarity=0.050 Sum_probs=191.7 Q ss_pred CC-CCCC-CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 64 GE-GKPK-TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 64 ~~-~~~~-~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) |+ .... ..++-|-.....-.+...|+.. ..-...++ -.++.+....|-..+|..+. ...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------R~~a~~d~~fy~G~Qw~~~~--~~~l~~~ 65 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCS----DIDSQPKW---------RDAANKACAYYDGDQLPPEV--LQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHH----HHHhhHHH---------HHHHHHHHHhhcCCCCCHHH--HHHHHhc Confidence 32 1100 1122222221111111122221 11111111 11111111112222221111 0112211 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccce Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQV 220 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 220 (726) -+.++..+ .|+..++.-.+.++..++.+.++|+. +++...++.....+..+- ..........+.+|.+....|.|| T Consensus 66 g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~-~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:10 66 GQPMTIHN--LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADA-CRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred CCCcEEec--cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHH-HHhhchhHHHHHHHHHhhhcCcce Confidence 12222111 11112233345566788888888864 333333333333333221 224456678888999999999987 Q ss_pred eeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Cccc Q lcl|NC_013692. 221 RAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEG 293 (726) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~ 293 (726) ..+. +++..|--++..+. +..+.-|=-..+..+++|..=..+. .+.+.+ +... T Consensus 143 ~~~~--------------------~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a 202 (714) T protein:vir:10 143 VEVR--------------------RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA 202 (714) T ss_pred EEec--------------------cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch Confidence 3321 11111111111111 1111110001112233332111000 111111 1000 Q ss_pred chhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCC----------ceEEEEEEEE---------ECCEEEE Q lcl|NC_013692. 294 QNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGD----------GVLHPIVATW---------VGAVMIR 353 (726) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~----------g~~~~~~~~~---------~g~~~l~ 353 (726) ..+..... ...+..+.. +.......+.-++.+..++.... -+.++++-.+ .|+.+.. T Consensus 203 ~~i~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~ 279 (714) T protein:vir:10 203 QVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAF 279 (714) T ss_pred hhhhhhhh---hhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEe Confidence 00000000 000000000 00001111112222111111111 0112221110 1222222 Q ss_pred eccCC-------------------------------CCCCccce--EEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 354 MEENP-------------------------------FPDKRIPY--VVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 354 ~~~~P-------------------------------~~~~~~Pf--~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) +..+| ...+..|| -.|++.|+.+... +......-+.|. T Consensus 280 d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---------~~~g~~~G~vr~ 350 (714) T protein:vir:10 280 DKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---------DKTGEPYGLISR 350 (714) T ss_pred CccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeee---------eccCceeehhhh Confidence 22111 00011122 1233333222221 111112234444 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-----Cccchhhhcccc-------------cCccchhHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-----PGADPRAAVHMH-------------TFPEIPQSAQYMI 462 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-----~~~~~~~~i~~~-------------~~~~~~~~~~~ll 462 (726) ++|+-...|.....+.. +++ ........|++.... ..+.+.+.+.+. .+.+.+......+ T Consensus 351 ~~d~Qr~~N~~~s~~~~-~l~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:10 351 AIPAQDEVNFRRIKLTW-LLQ-AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred chhHHHHHHHHHHHHHH-hhc-CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHH Confidence 54443322211111000 111 011112233332111 112222222221 1222344556666 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +++......+. ..+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- .+. T Consensus 429 ~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----~~~ 500 (714) T protein:vir:10 429 QVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----LKK 500 (714) T ss_pred HHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCC Confidence 66666555443 346887777887777888887776666543 355566667777777776666543321 233 Q ss_pred ccceecchhhcc---cccceeee------------------cccchHHHHHH--HHHHHHHHHhhhccchhHHHHHHHHH Q lcl|NC_013692. 542 EHFVDIRRDDLA---GNFDLKLD------------------ISTAEEDNAKV--NDLTFMLQTMGPNMDPMMAQQIMGQI 598 (726) Q Consensus 542 ~~~v~v~~~~~~---~~~dv~i~------------------~~~~~~~~~~~--~~l~~l~q~~~~~~~~~~~~~~~~~~ 598 (726) +..+.|..++-. .++ +.++ +.......... ++....+..+....++.....+...+ T Consensus 501 erv~RI~~e~~~~~~~~~-v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~ 579 (714) T protein:vir:10 501 RRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLW 579 (714) T ss_pred CcEEEEeccCCCcCcceE-EeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHH Confidence 344555422111 011 1111 11111111112 22232333334456666666667777 Q ss_pred HHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 599 MELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 599 ~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) ++.+..+...+.+.......... ....+.+++..+++.+++.++.++.+.+.+..+++.+..++++.+.++++.+...+ T Consensus 580 l~~~d~p~~~el~~~ir~~~~~~-~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~ 658 (714) T protein:vir:10 580 VNLLDVPQKQEFVERIRAALGTP-KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNAS 658 (714) T ss_pred HHhcCCCCHHHHHHHHHHHcCCC-CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888877766555554433222 12222233333333333222222222222222222222222222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 679 FLEQESGVQQARKRELQQAQSEA---QGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 679 ~~~qe~~~~~~~e~e~~~~q~~~---q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........++... +..++.+ .+.++.++....-.+++..+...++ T Consensus 659 a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~ 708 (714) T protein:vir:10 659 AQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHH Confidence 111000000000000 0000000 0001111111111111111111111 No 126 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=98.91 E-value=1.4e-09 Score=69.13 Aligned_cols=600 Identities=10% Similarity=0.050 Sum_probs=191.7 Q ss_pred CC-CCCC-CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHH Q lcl|NC_013692. 64 GE-GKPK-TEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEY 141 (726) Q Consensus 64 ~~-~~~~-~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~ 141 (726) |+ .... ..++-|-.....-.+...|+.. ..-...++ -.++.+....|-..+|..+. ...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------R~~a~~d~~fy~G~Qw~~~~--~~~l~~~ 65 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCS----DIDSQPKW---------RDAANKACAYYDGDQLPPEV--LQVLKDR 65 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHH----HHHhhHHH---------HHHHHHHHHhhcCCCCCHHH--HHHHHhc Confidence 32 1100 1122222221111111122221 11111111 11111111112222221111 0112211 Q ss_pred HHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccce Q lcl|NC_013692. 142 VRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQV 220 (726) Q Consensus 142 ~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 220 (726) -+.++..+ .|+..++.-.+.++..++.+.++|+. +++...++.....+..+- ..........+.+|.+....|.|| T Consensus 66 g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~-~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:27 66 GQPMTIHN--LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADA-CRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred CCCcEEec--cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHH-HHhhchhHHHHHHHHHhhhcCcce Confidence 12222111 11112233345566788888888864 333333333333333221 224456678888999999999987 Q ss_pred eeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhhc----Cccc Q lcl|NC_013692. 221 RAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDKI----QVEG 293 (726) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~~----~~~~ 293 (726) ..+. +++..|--++..+. +..+.-|=-..+..+++|..=..+. .+.+.+ +... T Consensus 143 ~~~~--------------------~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a 202 (714) T protein:vir:27 143 VEVR--------------------RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA 202 (714) T ss_pred EEec--------------------cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch Confidence 3321 11111111111111 1111110001112233332111000 111111 1000 Q ss_pred chhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCC----------ceEEEEEEEE---------ECCEEEE Q lcl|NC_013692. 294 QNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGD----------GVLHPIVATW---------VGAVMIR 353 (726) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~----------g~~~~~~~~~---------~g~~~l~ 353 (726) ..+..... ...+..+.. +.......+.-++.+..++.... -+.++++-.+ .|+.+.. T Consensus 203 ~~i~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~ 279 (714) T protein:vir:27 203 QVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAF 279 (714) T ss_pred hhhhhhhh---hhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEe Confidence 00000000 000000000 00001111112222111111111 0112221110 1222222 Q ss_pred eccCC-------------------------------CCCCccce--EEeeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 354 MEENP-------------------------------FPDKRIPY--VVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRG 400 (726) Q Consensus 354 ~~~~P-------------------------------~~~~~~Pf--~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 400 (726) +..+| ...+..|| -.|++.|+.+... +......-+.|. T Consensus 280 d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~---------~~~g~~~G~vr~ 350 (714) T protein:vir:27 280 DKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRK---------DKTGEPYGLISR 350 (714) T ss_pred CccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeee---------eccCceeehhhh Confidence 22111 00011122 1233333222221 111112234444 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-----Cccchhhhcccc-------------cCccchhHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-----PGADPRAAVHMH-------------TFPEIPQSAQYMI 462 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-----~~~~~~~~i~~~-------------~~~~~~~~~~~ll 462 (726) ++|+-...|.....+.. +++ ........|++.... ..+.+.+.+.+. .+.+.+......+ T Consensus 351 ~~d~Qr~~N~~~s~~~~-~l~-~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (714) T protein:vir:27 351 AIPAQDEVNFRRIKLTW-LLQ-AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQF 428 (714) T ss_pred chhHHHHHHHHHHHHHH-hhc-CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHH Confidence 54443322211111000 111 011112233332111 112222222221 1222344556666 Q ss_pred HHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEec Q lcl|NC_013692. 463 NLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITN 541 (726) Q Consensus 463 ~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~ 541 (726) +++......+. ..+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- .+. T Consensus 429 ~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----~~~ 500 (714) T protein:vir:27 429 QVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----LKK 500 (714) T ss_pred HHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCC Confidence 66666555443 346887777887777888887776666543 355566667777777776666543321 233 Q ss_pred ccceecchhhcc---cccceeee------------------cccchHHHHHH--HHHHHHHHHhhhccchhHHHHHHHHH Q lcl|NC_013692. 542 EHFVDIRRDDLA---GNFDLKLD------------------ISTAEEDNAKV--NDLTFMLQTMGPNMDPMMAQQIMGQI 598 (726) Q Consensus 542 ~~~v~v~~~~~~---~~~dv~i~------------------~~~~~~~~~~~--~~l~~l~q~~~~~~~~~~~~~~~~~~ 598 (726) +..+.|..++-. .++ +.++ +.......... ++....+..+....++.....+...+ T Consensus 501 erv~RI~~e~~~~~~~~~-v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~ 579 (714) T protein:vir:27 501 RRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLW 579 (714) T ss_pred CcEEEEeccCCCcCcceE-EeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHH Confidence 344555422111 011 1111 11111111112 22232333334456666666667777 Q ss_pred HHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 599 MELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLN 678 (726) Q Consensus 599 ~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e 678 (726) ++.+..+...+.+.......... ....+.+++..+++.+++.++.++.+.+.+..+++.+..++++.+.++++.+...+ T Consensus 580 l~~~d~p~~~el~~~ir~~~~~~-~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~ 658 (714) T protein:vir:27 580 VNLLDVPQKQEFVERIRAALGTP-KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNAS 658 (714) T ss_pred HHhcCCCCHHHHHHHHHHHcCCC-CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77888877766555554433222 12222233333333333222222222222222222222222222222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 679 FLEQESGVQQARKRELQQAQSEA---QGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 679 ~~~qe~~~~~~~e~e~~~~q~~~---q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........++... +..++.+ .+.++.++....-.+++..+...++ T Consensus 659 a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~ 708 (714) T protein:vir:27 659 AQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHH Confidence 111000000000000 0000000 0001111111111111111111111 No 127 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.87 E-value=2.3e-08 Score=62.41 Aligned_cols=440 Identities=12% Similarity=0.052 Sum_probs=184.1 Q ss_pred Cccchhc-CCCCCCch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCC--------cCCCHHHHH Q lcl|NC_013692. 16 GDPSKRL-QPEWSNAP--SLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKS--------AVQPPTIRK 84 (726) Q Consensus 16 ~~~~~~~-~~~~~~~~--~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs--------~~v~~~v~~ 84 (726) .++++-. ++-..+++ .|..|.+.+ ..+.....+...||.|....+ .-|.+ +.|.+-.+. T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~-------~~~~~~~~~~~~Yy~G~~~~~---~~~~~~p~~~r~~~~v~nw~~~ 70 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQI-------ENLRWKNLLRTSYYENKRTIQ---YVGTLIPPQYFNLGLVLGWTGK 70 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHH-------HHHhhHHHHHHHHhccCCChh---hccccccHHHHHHHhhcChHHH Confidence 4444433 34443333 233333333 333334566678997654321 11111 123333333 Q ss_pred HHHHHHHHHHHhhcCCCceEEEe-cCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeee Q lcl|NC_013692. 85 QAEWRYSSLSEPFLSSPNIFEVN-PVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRT 163 (726) Q Consensus 85 ~v~~~~~~L~~~f~~~~~~~~~~-p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~ 163 (726) .|+.+.-.| .+--|. |-+.+|. ..+..+| ..|+.-.....++++||++|.+++.|+.... T Consensus 71 ~Vd~~a~rl--------~~~Gf~~~d~~~~~-------~~l~~iw-~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d--- 131 (474) T protein:vir:81 71 AVDALARRC--------NLEGFVWPDGDLDS-------LGGTEVV-DDNHLLSEIDSAIVAAMQHGPAFLINTVGED--- 131 (474) T ss_pred HHHHHHhhh--------cccceECCCCCccc-------hHHHHHH-HhcChhHHHHHHHHHHHhhCceeEEEecCCC--- Confidence 344333222 111222 2111111 1244444 4666555677889999999999998765300 Q ss_pred EEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceee Q lcl|NC_013692. 164 VKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQ 243 (726) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~ 243 (726) | ...|.|. T Consensus 132 -----------------------------------------------------~-------------------~~~~~i~ 139 (474) T protein:vir:81 132 -----------------------------------------------------D-------------------EPEALIH 139 (474) T ss_pred -----------------------------------------------------C-------------------CceeEEE Confidence 0 0013355 Q ss_pred eechhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCce Q lcl|NC_013692. 244 VCDYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKR 321 (726) Q Consensus 244 ~v~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (726) .++|.+++ |||... .+. + .+.+...+. .| . T Consensus 140 ~~sp~~~~~~~D~~~~-~~~-~--al~~~~~~~-----~g---------------------------------------~ 171 (474) T protein:vir:81 140 VKDASEATGEWNRRRR-GLN-N--LLSIIDKDK-----EG---------------------------------------K 171 (474) T ss_pred EeccceEEEEEeCCCC-cce-e--eeEEEEEcC-----CC---------------------------------------c Confidence 67777765 677432 111 1 111100000 00 0 Q ss_pred EEEEEEEEEeecCCCceEEEEEEEEECCE-EEEeccCCCCCCccceEEeeeeeecCcccCCChH-HHHHHHHHHHHHHHH Q lcl|NC_013692. 322 LVVHEYWGYYDIHGDGVLHPIVATWVGAV-MIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDG-ALLIDNQRIIGAVTR 399 (726) Q Consensus 322 v~v~E~w~~~~~~~~g~~~~~~~~~~g~~-~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~Q~~~N~~~~ 399 (726) ++...+|.. +....+..-..+.. .....++|+ | .|++++...+.-...+|.|-+ +.++++|+.+|+.+. T Consensus 172 ~~~~~ly~~------~~~~~~~~~~~~~~w~~~~~~~~~--g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~ 242 (474) T protein:vir:81 172 VLSLALYLD------NETVTAQRDKATLKWQVDRDEHVY--G-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELA 242 (474) T ss_pred EEEEEEEeC------CcEEEEEEcCccceeeeccCCCCC--C-cceEEecccccccCcCCccccchhHHHHHHHHHHHHH Confidence 001111110 00000000000010 112234444 5 699999999888888998754 899999999999999 Q ss_pred HHHHHHHhcCCCceEeecccccc---------hhhhhhcCCceEeecCccchhh----hcccccCccchhHHHHHHHHHH Q lcl|NC_013692. 400 GMIDTMARSANGQVGVMKGALDV---------TNRRRFDRGENYEFNPGADPRA----AVHMHTFPEIPQSAQYMINLQQ 466 (726) Q Consensus 400 ~~~d~l~~~~~~~~~~~~gav~~---------~d~~~~~~g~vi~~~~~~~~~~----~i~~~~~~~~~~~~~~ll~~~~ 466 (726) .+.......+.|+..+ .|+-.. .+......+.++.+..+.+... .....+++. +.+..++..+. T Consensus 243 ~~~~~~e~~a~pqr~i-~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~--a~l~~~~~~l~ 319 (474) T protein:vir:81 243 RREGHMDVFSYPEFWL-LGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPA--ASPDAHWSDIN 319 (474) T ss_pred HHHHHHHHhcchhhee-ecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCC--CChhHHHHHHH Confidence 9999999999988765 333211 1122223455555544322110 011222322 12333444444 Q ss_pred H---HHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 467 A---EAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 467 ~---~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) . .+-..||+|....|....+...||.++......-........+.|..+++.++++.+.+.-.+--++ +. .+ T Consensus 320 ~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~----~~-~~ 394 (474) T protein:vir:81 320 GLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDE----IP-DE 394 (474) T ss_pred HHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc----cc-hh Confidence 4 4445688888888854323235676777655555555666667777777777777765442221110 00 00 Q ss_pred ceecchhhcccccceeeecc-cchHHHHH-HHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhh Q lcl|NC_013692. 544 FVDIRRDDLAGNFDLKLDIS-TAEEDNAK-VNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPI 621 (726) Q Consensus 544 ~v~v~~~~~~~~~dv~i~~~-~~~~~~~~-~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~ 621 (726) + +.+.+.=. +...+..+ ......+.+. +..+.... .... ..+... ..++.... T Consensus 395 ~-----------~~~~v~W~d~~~~s~a~~aDa~~Kl~~a-~~~~~~~~---~~~~---~lg~t~--~~i~~~~~----- 449 (474) T protein:vir:81 395 W-----------KSIDAKWRDPRYLSKSAQADAGMKQLAA-VPWLAETE---VGLE---LIGLTP--QQARRAMA----- 449 (474) T ss_pred h-----------ccceeEecCCCccCHHHHHHHHHHHHhc-ccCCCcHH---HHHh---hcCCCH--HHHHHHHH----- Confidence 1 11111100 00111111 1112222221 11111111 1111 111110 00100000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 622 AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ 662 (726) Q Consensus 622 ~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq 662 (726) +.+.+..+..+....+. ..... ..| T Consensus 450 -----~~~~~~~~~~~~~l~~~---------~~~~~--~aq 474 (474) T protein:vir:81 450 -----DKRRVQGRGTLQALIDR---------SNNGA--TAQ 474 (474) T ss_pred -----HHHHHhHHHHHHHHHhc---------CCCCC--CCC Confidence 00000000011100000 00000 000 No 128 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=98.86 E-value=2.5e-08 Score=62.27 Aligned_cols=592 Identities=10% Similarity=0.066 Sum_probs=197.5 Q ss_pred CCCC--CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHH-HHHhhcccchhHHHH Q lcl|NC_013692. 64 GEGK--PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLN-QQFNTKLNKQRFIDE 140 (726) Q Consensus 64 ~~~~--~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n-~~~~~~~~~~~~~~~ 140 (726) |.-+ ++.+.+-+.-+..... ..+..+........ .. ...|.-..+|++ .+|..+. ...|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~~~~~~---------~~-r~~a~~d~~fy~G~Qw~~~~--~~~l~~ 64 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQ----RQLLSLCSDIDSQP---------LW-RDAANKACAYYDGDQLAPEV--IQVLKD 64 (714) T ss_pred CCcCcCcccCCCcchhhhhhhH----HHHHHHHHHHhhhH---------HH-HHHHHHHHHhhcCCCCCHHH--HHHHHh Confidence 4333 2223333332222111 11222211111100 00 122222222221 2221111 011211 Q ss_pred HHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc-hHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccc Q lcl|NC_013692. 141 YVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS-SEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQ 219 (726) Q Consensus 141 ~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 219 (726) .-+-++..+ .|+..++.-.+.++..++.+.++|+. .++...++.....+..+ ..+........+.+|.+....|.| T Consensus 65 ~g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~~~~~~G~G 141 (714) T protein:vir:10 65 RGQPMTIHN--LIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFAD-ACRLGNMNKARSDAYAEQIKAGLS 141 (714) T ss_pred cCCCcEEec--cHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHH-HHHhhchhHHHHHHHHHhhhcccc Confidence 112222111 11122333345566788888899863 33322233333333321 222445667888999999999998 Q ss_pred eeeccccceeecccceeeccceeeeechhheeeCCCCCC-chhhCCeEEEEEeccHHHHHhcCCC--cchhh----cCcc Q lcl|NC_013692. 220 VRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS-DFSKAKFLIETFESSYAELKADGRY--QNLDK----IQVE 292 (726) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~--~~~d~----~~~~ 292 (726) |..+ +++++.|--++..+. +..+.-|=-..+..+++|..-..+. -+++. ++.. T Consensus 142 ~~~~--------------------~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~ 201 (714) T protein:vir:10 142 WVEV--------------------RRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGM 201 (714) T ss_pred eEEe--------------------eeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCc Confidence 8431 112222111111111 1111110001122333332111000 01111 1110 Q ss_pred cchhhcccchhhhhccccccc-cCCcCCceEEEEEEEEEeecCCCc----------eEEEEEEEE---------ECCEEE Q lcl|NC_013692. 293 GQNLLSEPDYTGPSEGVRNFD-FQDKSRKRLVVHEYWGYYDIHGDG----------VLHPIVATW---------VGAVMI 352 (726) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~v~E~w~~~~~~~~g----------~~~~~~~~~---------~g~~~l 352 (726) ...+..... ...+..+.. +.......+.-++.+..++...++ +.++++-.+ .|..+. T Consensus 202 a~~i~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~ 278 (714) T protein:vir:10 202 AQVIDYAID---DWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVA 278 (714) T ss_pred hhhhhccch---hhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeee Confidence 000100000 000000000 000111112223332222222111 122221100 122221 Q ss_pred EeccCC-------------------------C------CCC--ccceEEeeeeeecCcccC-CChHHHHHHHHHHHHHHH Q lcl|NC_013692. 353 RMEENP-------------------------F------PDK--RIPYVVVNYIPRKRDLYG-ESDGALLIDNQRIIGAVT 398 (726) Q Consensus 353 ~~~~~P-------------------------~------~~~--~~Pf~~~~~~~~~~~~~g-~g~~~~~~d~Q~~~N~~~ 398 (726) .+..+| | ..+ -||+..|++.|+.+.+.. .|... -+. T Consensus 279 ~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~----------G~v 348 (714) T protein:vir:10 279 FDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY----------GLI 348 (714) T ss_pred eCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccc----------eeh Confidence 111111 0 011 134444555555444332 23322 233 Q ss_pred HHHHHHHHhcCCCceEeecccccchhhhhhcCCceEee-----cCccchhhhcccc-------------cCccchhHHHH Q lcl|NC_013692. 399 RGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEF-----NPGADPRAAVHMH-------------TFPEIPQSAQY 460 (726) Q Consensus 399 ~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~-----~~~~~~~~~i~~~-------------~~~~~~~~~~~ 460 (726) |.+.|+-...|........ .++. .......|++... ..++.+...+.+. .+.+.++.... T Consensus 349 r~~~d~Qr~~N~~~s~~~~-~l~~-~~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (714) T protein:vir:10 349 SRAIPAQDEVNFRRIKLTW-LLQA-KRVIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQ 426 (714) T ss_pred hhhhhHHHHHHHHHHHHHH-HHhC-CceeeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHH Confidence 3333332222211110000 0110 0011223333210 1112222222221 22222344556 Q ss_pred HHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEE Q lcl|NC_013692. 461 MINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRI 539 (726) Q Consensus 461 ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi 539 (726) .++++......+.. .+|.++..+|....++++...++.+.... ....+.+.++...+.+.+++..+...- . T Consensus 427 ~~~llq~~~~~i~~----~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~----~ 498 (714) T protein:vir:10 427 QFQVMQESEKLIQD----TMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDD----L 498 (714) T ss_pred HHHHHHHHHHHHHH----hhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----c Confidence 66666666655443 45877777777777788877666655543 455566667776777777666654321 2 Q ss_pred ecccceecchhhc---ccccceeee------------------cccchHHHH-HHHHHH-HHHHHhhhccchhHHHHHHH Q lcl|NC_013692. 540 TNEHFVDIRRDDL---AGNFDLKLD------------------ISTAEEDNA-KVNDLT-FMLQTMGPNMDPMMAQQIMG 596 (726) Q Consensus 540 ~~~~~v~v~~~~~---~~~~dv~i~------------------~~~~~~~~~-~~~~l~-~l~q~~~~~~~~~~~~~~~~ 596 (726) +.+..+.|..+.- ...+ +.++ +........ -..+.. ..+..+....+|......+. T Consensus 499 ~~~rv~RI~~e~~~~~~~~~-~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~ 577 (714) T protein:vir:10 499 KKRRNHAVVINRDDRQRRQT-IVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLD 577 (714) T ss_pred CCCcEEEEeccCCCccccee-EeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHH Confidence 3334455532211 1111 1111 111111111 223322 12222334556666666666 Q ss_pred HHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 597 QIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTD 676 (726) Q Consensus 597 ~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~ 676 (726) .+++.+..+...+.++.......... ....++.+..+++.+++.++.++ .+.+.++.+++.+++++++.+.+ T Consensus 578 ~~le~~d~p~~~ei~~~ir~~~~~~~-~~~~~~~e~q~~q~~~~~~~~~q-------~~l~~~e~~a~~~k~eaea~~~~ 649 (714) T protein:vir:10 578 LWVNLLDVPQKQEFVERIRAALGTPK-SPDEMTPEEQEVAAQQQALQQQQ-------AELQMREMAGRVAKLEADAARAH 649 (714) T ss_pred HHHHhcCCcCHHHHHHHHHHHcCCCC-CccccCcchhHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHH Confidence 77777777766655554443322211 11222222222222222222222 22222223333333333333222 Q ss_pred HHHHHHHHHHHHH-HHHHHHH-HHHHHHHHHHHHHHHH--HHHHHHHHHHHhcC Q lcl|NC_013692. 677 LNFLEQESGVQQA-RKRELQQ-AQSEAQGKLAMLNSQL--KRLDEATSARTSQK 726 (726) Q Consensus 677 ~e~~~qe~~~~~~-~e~e~~~-~q~~~q~~~~~l~~~~--~~~~~~~~a~~~~q 726 (726) .+........+.. ...+.+. .....+++...+.+.. -.+.....++++.| T Consensus 650 aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q 703 (714) T protein:vir:10 650 AAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHH Confidence 2111111000000 0011111 0111111111111111 11122222233333 No 129 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.79 E-value=4.6e-08 Score=60.81 Aligned_cols=433 Identities=12% Similarity=0.056 Sum_probs=175.8 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_013692. 23 QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKT--EKGKS---AVQPPTIRKQAEWRYSSLSE 95 (726) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~--~~grs---~~v~~~v~~~v~~~~~~L~~ 95 (726) +|+=+-+..|..|.. .|..++....+-.+||+|....+ ++. .++++ ++|..-.+..|+...-.|. T Consensus 1 ~~~~t~~~~~~~l~~-------~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTK-------RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 333333333333322 34445555677889998865321 111 12333 4677777777777665541 Q ss_pred hhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 96 PFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 96 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) ++++ .+ + ...|.+.... +..+| ..|+.-.....++++++++|.+.+.+|=+ + T Consensus 73 ----~~~~-~~-~-~~~d~~~~~~----~~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~d-~--------------- 124 (456) T protein:vir:10 73 ----PNGI-TV-G-GSADSDLALR----ARRIW-RDNRMDSVCKQWVKYGLDFGESYLTCWRR-D--------------- 124 (456) T ss_pred ----cCCe-ec-C-CCCCcchHHH----HHHHH-HhcChhhHHHHHHHHHhhcCeeEEEEeeC-C--------------- Confidence 3323 22 1 2233333332 23334 34554445567889999999988754311 0 Q ss_pred cchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeC Q lcl|NC_013692. 176 DSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VID 253 (726) Q Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~d 253 (726) .+.|.+..++|.++ +|| T Consensus 125 -------------------------------------------------------------~g~~~i~~~~p~~~~~i~d 143 (456) T protein:vir:10 125 -------------------------------------------------------------DGTATITADSPETMVVSVD 143 (456) T ss_pred -------------------------------------------------------------CCceEEEEEccceeEEEEc Confidence 01123456677774 455 Q ss_pred CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeec Q lcl|NC_013692. 254 PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI 333 (726) Q Consensus 254 p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~ 333 (726) |.... ...++++. +.+.++ .. ..... + +. ..-+..+..+..+.. T Consensus 144 ~~~~~---~~~~~i~~-~~~~d~--------~~------~~~~~----~-----------~~---~~~~~~~~~~~~~~~ 187 (456) T protein:vir:10 144 PLQPW---RIRAAMRW-WRDLDA--------ES------DFAIV----W-----------SG---DGWQKFARPCFVQSS 187 (456) T ss_pred CCCCc---ceEEEEEE-EEecCC--------ce------eEEEE----E-----------ec---cceeEEEEEEEEeec Confidence 54321 12222221 111100 00 00000 0 00 000111111110000 Q ss_pred CCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_013692. 334 HGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQV 413 (726) Q Consensus 334 ~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~ 413 (726) ......++.++........|...+.+|++++ ....|.|.++.++++++.+|..++.++..+...+.|+. T Consensus 188 -----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:10 188 -----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred -----ccceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhH Confidence 0111222233333222333444455565543 23468899999999999999999988877777666654 Q ss_pred Eeeccc------cc-------chhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhh Q lcl|NC_013692. 414 GVMKGA------LD-------VTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNA 480 (726) Q Consensus 414 ~~~~ga------v~-------~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~ 480 (726) .+ .|. ++ ..+.....+|.++...++.. +...+..+ ...+...+..+...+-..||++.... T Consensus 257 ~i-~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~ 330 (456) T protein:vir:10 257 AL-KSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD----IWESQAND-FTPMLSAIKEHIRQLSSATKTPLPML 330 (456) T ss_pred hh-hccCcccccccccccccchhhhhhhhccccccCCCCcc----eEEecccC-hhHHHHHHHHHHHHHHhccCCChHHh Confidence 43 121 11 11112234455555544332 22222111 12233334444444556678887777 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceee Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKL 560 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i 560 (726) |... .+.||.++......-........+.|..+++++++.++.+ ..... .. ...+.- T Consensus 331 ~~~~--~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~----~g~~~--------~~---------~~~v~w 387 (456) T protein:vir:10 331 MPDS--ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI----EGESV--------ED---------TVDVSF 387 (456) T ss_pred cccc--cChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCc--------cc---------ceeEEe Confidence 6322 2346777777666666666666677777777776665432 12110 00 001110 Q ss_pred ecc-cchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHH Q lcl|NC_013692. 561 DIS-TAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEA 639 (726) Q Consensus 561 ~~~-~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~ 639 (726) ... +.+. .+.......+.+. .+... ..... ..+... ..+.. .+.+....+... T Consensus 388 ~~~~~~~~-~~~ada~~kl~~~---gi~~~---~~~~~---~lg~~~--~~i~~--------------~e~er~~~e~~~ 441 (456) T protein:vir:10 388 ESPDRVTL-GEKYSAASLAKAA---GESWA---SIRRN---ILNYNA--DQIKQ--------------DDLDRAREQITL 441 (456) T ss_pred cCCCCcCH-HHHHHHHHHHHHc---CCChH---HHHHh---hCCCCH--HHHHH--------------HHHHHHHHHHHH Confidence 000 0110 1111111111111 11111 01111 111100 00000 000000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 640 ERARAAHYMSGAGLQDSK 657 (726) Q Consensus 640 ~~aq~q~~~~~~~~~~~~ 657 (726) .+............+ T Consensus 442 ---~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 442 ---FAGNPVQRPQEDGSR 456 (456) T ss_pred ---HhhhhhhcCCCCCCC Confidence 000000000000000 No 130 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.79 E-value=4.6e-08 Score=60.81 Aligned_cols=433 Identities=12% Similarity=0.056 Sum_probs=175.8 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_013692. 23 QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKT--EKGKS---AVQPPTIRKQAEWRYSSLSE 95 (726) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~--~~grs---~~v~~~v~~~v~~~~~~L~~ 95 (726) +|+=+-+..|..|.. .|..++....+-.+||+|....+ ++. .++++ ++|..-.+..|+...-.|. T Consensus 1 ~~~~t~~~~~~~l~~-------~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTK-------RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 333333333333322 34445555677889998865321 111 12333 4677777777777665541 Q ss_pred hhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 96 PFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 96 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) ++++ .+ + ...|.+.... +..+| ..|+.-.....++++++++|.+.+.+|=+ + T Consensus 73 ----~~~~-~~-~-~~~d~~~~~~----~~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~d-~--------------- 124 (456) T protein:vir:10 73 ----PNGI-TV-G-GSADSDLALR----ARRIW-RDNRMDSVCKQWVKYGLDFGESYLTCWRR-D--------------- 124 (456) T ss_pred ----cCCe-ec-C-CCCCcchHHH----HHHHH-HhcChhhHHHHHHHHHhhcCeeEEEEeeC-C--------------- Confidence 3323 22 1 2233333332 23334 34554445567889999999988754311 0 Q ss_pred cchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhe--eeC Q lcl|NC_013692. 176 DSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VID 253 (726) Q Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~d 253 (726) .+.|.+..++|.++ +|| T Consensus 125 -------------------------------------------------------------~g~~~i~~~~p~~~~~i~d 143 (456) T protein:vir:10 125 -------------------------------------------------------------DGTATITADSPETMVVSVD 143 (456) T ss_pred -------------------------------------------------------------CCceEEEEEccceeEEEEc Confidence 01123456677774 455 Q ss_pred CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeec Q lcl|NC_013692. 254 PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI 333 (726) Q Consensus 254 p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~ 333 (726) |.... ...++++. +.+.++ .. ..... + +. ..-+..+..+..+.. T Consensus 144 ~~~~~---~~~~~i~~-~~~~d~--------~~------~~~~~----~-----------~~---~~~~~~~~~~~~~~~ 187 (456) T protein:vir:10 144 PLQPW---RIRAAMRW-WRDLDA--------ES------DFAIV----W-----------SG---DGWQKFARPCFVQSS 187 (456) T ss_pred CCCCc---ceEEEEEE-EEecCC--------ce------eEEEE----E-----------ec---cceeEEEEEEEEeec Confidence 54321 12222221 111100 00 00000 0 00 000111111110000 Q ss_pred CCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_013692. 334 HGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQV 413 (726) Q Consensus 334 ~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~ 413 (726) ......++.++........|...+.+|++++ ....|.|.++.++++++.+|..++.++..+...+.|+. T Consensus 188 -----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:10 188 -----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred -----ccceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhH Confidence 0111222233333222333444455565543 23468899999999999999999988877777666654 Q ss_pred Eeeccc------cc-------chhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhh Q lcl|NC_013692. 414 GVMKGA------LD-------VTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNA 480 (726) Q Consensus 414 ~~~~ga------v~-------~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~ 480 (726) .+ .|. ++ ..+.....+|.++...++.. +...+..+ ...+...+..+...+-..||++.... T Consensus 257 ~i-~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~ 330 (456) T protein:vir:10 257 AL-KSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD----IWESQAND-FTPMLSAIKEHIRQLSSATKTPLPML 330 (456) T ss_pred hh-hccCcccccccccccccchhhhhhhhccccccCCCCcc----eEEecccC-hhHHHHHHHHHHHHHHhccCCChHHh Confidence 43 121 11 11112234455555544332 22222111 12233334444444556678887777 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceee Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKL 560 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i 560 (726) |... .+.||.++......-........+.|..+++++++.++.+ ..... .. ...+.- T Consensus 331 ~~~~--~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~----~g~~~--------~~---------~~~v~w 387 (456) T protein:vir:10 331 MPDS--ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI----EGESV--------ED---------TVDVSF 387 (456) T ss_pred cccc--cChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCc--------cc---------ceeEEe Confidence 6322 2346777777666666666666677777777776665432 12110 00 001110 Q ss_pred ecc-cchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHH Q lcl|NC_013692. 561 DIS-TAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEA 639 (726) Q Consensus 561 ~~~-~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~ 639 (726) ... +.+. .+.......+.+. .+... ..... ..+... ..+.. .+.+....+... T Consensus 388 ~~~~~~~~-~~~ada~~kl~~~---gi~~~---~~~~~---~lg~~~--~~i~~--------------~e~er~~~e~~~ 441 (456) T protein:vir:10 388 ESPDRVTL-GEKYSAASLAKAA---GESWA---SIRRN---ILNYNA--DQIKQ--------------DDLDRAREQITL 441 (456) T ss_pred cCCCCcCH-HHHHHHHHHHHHc---CCChH---HHHHh---hCCCCH--HHHHH--------------HHHHHHHHHHHH Confidence 000 0110 1111111111111 11111 01111 111100 00000 000000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 640 ERARAAHYMSGAGLQDSK 657 (726) Q Consensus 640 ~~aq~q~~~~~~~~~~~~ 657 (726) .+............+ T Consensus 442 ---~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 442 ---FAGNPVQRPQEDGSR 456 (456) T ss_pred ---HhhhhhhcCCCCCCC Confidence 000000000000000 No 131 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.78 E-value=1.5e-08 Score=63.48 Aligned_cols=416 Identities=12% Similarity=0.017 Sum_probs=166.1 Q ss_pred hccCCCCCCCCCC-CCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHH Q lcl|NC_013692. 60 MHVRGEGKPKTEK-GKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFI 138 (726) Q Consensus 60 y~~~~~~~~~~~~-grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~ 138 (726) |- .++++ +..+ ++-++|....+..|+.+.-.|. |- +.+-.|.+.- ..++.+| ..|+.-... T Consensus 1 ~l-~~~~~-~~~~~~~~~~v~n~~~~ivd~~~~~l~--~~---------gf~~~d~~~~----~~~~~i~-~~N~~d~~~ 62 (434) T protein:vir:98 1 ML-PKNAE-QAFLDFQRKARTNFCGLIANASVHRLL--AL---------GVTGPDGEPD----TRASRWW-QANRLDSRQ 62 (434) T ss_pred CC-CCCcc-HHHHHhhhhhhccchHHHHHHHHhhhc--cC---------ceecCCCchH----HHHHHHH-HhcChhHHH Confidence 31 12221 2222 3334566677777776554331 11 1122232221 2234444 356666667 Q ss_pred HHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhccc Q lcl|NC_013692. 139 DEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGI 218 (726) Q Consensus 139 ~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~ 218 (726) ..++++++++|.+.+.+|.+.. .+ . . T Consensus 63 ~~~~~~a~i~G~ay~~v~~~~~-~~----~----------------------------~--------------------- 88 (434) T protein:vir:98 63 KLVWRMAMAQSAGYMLVGAHPT-RT----E----------------------------D--------------------- 88 (434) T ss_pred HHHHHHHhhcCceEEEEecCCC-cc----c----------------------------c--------------------- Confidence 7889999999999998865410 00 0 0 Q ss_pred ceeeccccceeecccceeeccceeeeechhhe--eeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchh Q lcl|NC_013692. 219 QVRAVPVGSEEEEREETVENHPTVQVCDYNNI--VIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNL 296 (726) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~--~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~ 296 (726) .....|.|..++|.++ +|||... .-.+.+++...+.+ +.. . T Consensus 89 ----------------~~~~~~~I~~~~p~~~~~i~D~~~~----~~~~ai~~~~~~~~-----~~~----------~-- 131 (434) T protein:vir:98 89 ----------------NGRPSPLITMEHPSECIVEYDPETG----EPLVGLKVWHNDID-----GFG----------Y-- 131 (434) T ss_pred ----------------cCCceeEEEEeccceeEEEEeCCCC----ceEEEEEEEEeccC-----Cce----------E-- Confidence 0012345667888885 4555321 22233332111100 000 0 Q ss_pred hcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecC Q lcl|NC_013692. 297 LSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKR 376 (726) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~ 376 (726) .... + ...+.++ +.+....+.+..... .+......-... |.+.|.+|+++|...+..+ T Consensus 132 ~~~~-------------~----~~~~~~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~~--~h~~g~vPvv~f~N~~~~~ 189 (434) T protein:vir:98 132 ARVF-------------F----DDTSFPY--RTRERTGARLPWGPD-SWVYTGTADSGD--VHDLGGMQLVEFARMPDLG 189 (434) T ss_pred EEEE-------------E----eCcEEEE--EEeeccccccccccc-cceecccccccc--cCCCCccceEEeccCCCcC Confidence 0000 0 0001111 111000000000000 000011111112 2344788999988777765 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccc-cc-hh----------hhhhcCCceEeecCccchhh Q lcl|NC_013692. 377 DLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGAL-DV-TN----------RRRFDRGENYEFNPGADPRA 444 (726) Q Consensus 377 ~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav-~~-~d----------~~~~~~g~vi~~~~~~~~~~ 444 (726) . +|.|.++.+++.++.+|+.++.+...+...+.|+..+ .|.- .. .+ .....++.++.. ++.++ T Consensus 190 ~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~-- 264 (434) T protein:vir:98 190 E-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWI-KGHKFAKRTDPATGMTVVDQPFVPSPSAVWAS-EGENT-- 264 (434) T ss_pred c-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cCCCcccccccccccchhhhhhhccccccccC-CCCCc-- Confidence 5 5999999999999999999999999999888887655 2321 10 01 111223333322 22222 Q ss_pred hcccccCccchhHHHHHHHHHHHHHH---HHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 445 AVHMHTFPEIPQSAQYMINLQQAEAE---SMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGR 521 (726) Q Consensus 445 ~i~~~~~~~~~~~~~~ll~~~~~~~e---~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~ 521 (726) .+.+++. .....++.++...++ ..|+++....|. ...+.||.++......-........+.|..+++++++ T Consensus 265 --~~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~--~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~r 338 (434) T protein:vir:98 265 --QFGQLDA--TDLSGFLKEHASDVRDMLTISQTPTYLYAT--DLVNISADTIGALDILHVAKVREHIASFSEGLESVLA 338 (434) T ss_pred --eEEEecC--cchHHHHHHHHHHHHHHhcccCCCHHHhcc--ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222 223444555554444 456777677663 2223577777776666666666666777777777776 Q ss_pred HHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHh Q lcl|NC_013692. 522 KIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMEL 601 (726) Q Consensus 522 ~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~ 601 (726) .++.+.- ......++ .+.-.........+..+....+.+ .+ .+.. .+.++... T Consensus 339 l~~~~~g---~~~~~~~~-----------------~v~w~~~~~~s~~~~ada~~kl~~-~g--~~~e----~~~~~lg~ 391 (434) T protein:vir:98 339 LAAAQAG---VPEDYTEA-----------------EVRWANPAHVTMAVKADAATKLKS-IG--YPLD----VIAEELDE 391 (434) T ss_pred HHHHhcC---CChhheee-----------------eEEecCCCCCCHHHHHHHHHHHHh-cC--CcHH----HHHHhCCC Confidence 6554311 11111010 010000000000111111111111 11 1111 11111100 Q ss_pred hhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 602 KKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDS 656 (726) Q Consensus 602 ~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~ 656 (726) .. .++ +++... ..++.........+ ....-+...- ...... .- T Consensus 392 ~~-~e~-~r~~~e------~~~~~~~~~~~~~~--~~~~~~g~~~-~~~~~~-dg 434 (434) T protein:vir:98 392 SP-ARV-RRIVAG------AASQALLAASLLPA--PGAPSAGNVP-DSGGAV-DG 434 (434) T ss_pred CH-HHH-HHHHHH------HHHHHHHHHhhhcc--CCCCCCCCCC-cccCCC-CC Confidence 00 000 000000 00000000000000 0000000000 000000 00 No 132 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.68 E-value=1.1e-07 Score=58.69 Aligned_cols=591 Identities=11% Similarity=0.000 Sum_probs=186.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHH-HHHHhhcCCCceEEEecCCcchHHHH Q lcl|NC_013692. 39 YQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYS-SLSEPFLSSPNIFEVNPVTWEDAESA 117 (726) Q Consensus 39 ~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~-~L~~~f~~~~~~~~~~p~~~~D~~~A 117 (726) ..+++ ....+++.+|... +..+-+|+-- ..-.-|..|+.| .+...+ T Consensus 1 m~d~~-------~~~~~~~~~~~~~------------------~~~~~~~R~~a~~d~~fy~G~QW--------~~~~~~ 47 (725) T protein:vir:10 1 MADNE-------NRLESILSRFDAD------------------WTASDEARREAKNDLFFSRVSQW--------DDWLSQ 47 (725) T ss_pred CCchH-------HHHHHHHHHHHHH------------------HHhhHHHHHHHHHHHHhhcCCCC--------CHHHHH Confidence 11100 1123344333211 1111122221 122234455544 222222 Q ss_pred HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCC Q lcl|NC_013692. 118 RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPS 197 (726) Q Consensus 118 ~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 197 (726) . + +..|... .+.+.-.+.. -.+.++..++.+.++|+.. ....+++...-+..+ .. T Consensus 48 ~-----l------~~q~rp~-~N~i~~~v~~-----------v~g~e~~nr~d~~v~p~~~-~d~~~Ae~l~~~~~~-~~ 102 (725) T protein:vir:10 48 Y-----T------TLQYRGQ-FDVVRPVVRK-----------LVSEMRQNPIDVLYRPKDG-ASPDAADVLMGMYRT-DM 102 (725) T ss_pred H-----H------HhcCCCc-ccchHHHHHH-----------HHhhHHhCCcceEEecCCc-chHHHHHHHHHHHHH-HH Confidence 1 1 1122221 1333322211 1233455677777788643 333333333333322 13 Q ss_pred chhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHH Q lcl|NC_013692. 198 EYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAEL 277 (726) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el 277 (726) +........+.+|.+....|.||..+. ..|.... + .+..+. |....++.+|.. ..|=-+.+..+++|. T Consensus 103 ~~~~~~~~~s~Af~~~i~~G~G~~ev~--~d~~~~d-~--~~~~~~-i~~~~i~~~~~~------v~~Dp~a~~~D~sDa 170 (725) T protein:vir:10 103 RHNTAKIAVNIAVREQIEAGVGAWRLV--TDYEDQS-P--TSNNQV-IRREPIHSACSH------VIWDSNSKLMDKSDA 170 (725) T ss_pred HhcCcchHHhHHHHHHhhcCcceeeee--ccccCCC-C--CCCcee-eeeeecccCHhH------cccCchhhccChhhh Confidence 345566788899999999999985542 1111000 0 001110 111111112211 111112223334442 Q ss_pred HhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEEEEEEC--------- Q lcl|NC_013692. 278 KADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVG--------- 348 (726) Q Consensus 278 ~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g--------- 348 (726) .=..+.+.++. ... . ...... ......+.+...-. ..+.-|.. .+.--+.++++..... T Consensus 171 r~~~~~~~~~~---~~~--~---~~~~~~-~~~a~~~~~~~~~~-~~~~~~~~--~~~vrv~E~~~r~~~~~~~~~~~d~ 238 (725) T protein:vir:10 171 RHCTVIHSMSQ---NGW--D---DFAEKY-DLDADNIPSFQNPN-DWVFPWLT--QDTIQIAEFYEVVEKKETAFIYQDP 238 (725) T ss_pred hhhhhhccCCH---HHH--H---HHHHhC-CCcccccccccccc-cccccccC--CCeEEEEEEEEEEEEeeEEEEeccC Confidence 10000111110 000 0 000000 00000111100000 00111211 1111122333222211 Q ss_pred --CEEEEeccCCCC-----------------------------------CC--ccceEEeeeeeecCcccC-CC-hH--H Q lcl|NC_013692. 349 --AVMIRMEENPFP-----------------------------------DK--RIPYVVVNYIPRKRDLYG-ES-DG--A 385 (726) Q Consensus 349 --~~~l~~~~~P~~-----------------------------------~~--~~Pf~~~~~~~~~~~~~g-~g-~~--~ 385 (726) +.++.+.+..+. ++ .+|.-.|++.|..+...+ .| .. - T Consensus 239 ~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~~~G 318 (725) T protein:vir:10 239 VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEG 318 (725) T ss_pred CCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcceeee Confidence 112221111100 01 122222344444333221 12 11 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccch----------hhhcc--cccCcc Q lcl|NC_013692. 386 LLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADP----------RAAVH--MHTFPE 453 (726) Q Consensus 386 ~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~----------~~~i~--~~~~~~ 453 (726) .++++-+....++..+...+...+..+.....+..+..+..... +-++...+ .+++. ...... T Consensus 319 ~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~-----~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~ 393 (725) T protein:vir:10 319 VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHM-----YDGNDDYPYYLLNRTDENNGEMPTQPLAYYE 393 (725) T ss_pred eeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHH-----HhccCCceeeecccccccCcccccccCcccC Confidence 55666666666666667777777666665555544433333211 12222221 01111 111223 Q ss_pred chhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 454 IPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLD 532 (726) Q Consensus 454 ~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d 532 (726) .++-...+++++......+. ..+|.++..+|..+.++++...++.+.... ....|.+.++.-.+.+-+++..+.. T Consensus 394 ~~~~p~~~~~ll~~~~~~i~----~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~ 469 (725) T protein:vir:10 394 NPEVPQANAYMLEAATAAVK----EVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVN 469 (725) T ss_pred CCCchHHHHHHHHHHHHHHH----HHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34555566666666666544 345877777887777888887777766554 4555666777666666666654433 Q ss_pred cCeEEEEecccceecchhhcccccceeeeccc---chHHHHHHHHHH---HHHHHhhhccchhHHHHHHHHHHHhhh-hh Q lcl|NC_013692. 533 DVEVVRITNEHFVDIRRDDLAGNFDLKLDIST---AEEDNAKVNDLT---FMLQTMGPNMDPMMAQQIMGQIMELKK-MP 605 (726) Q Consensus 533 ~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~---~~~~~~~~~~l~---~l~q~~~~~~~~~~~~~~~~~~~~~~~-~~ 605 (726) .- .+.+..+.|...+-..+ -+.++... .+......+.+. ...-..++..+.. ....+..+.++.. ++ T Consensus 470 ~~----~~~er~~RI~~edg~~~-~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~-r~~~~~~l~qll~~~~ 543 (725) T protein:vir:10 470 DI----YDVPRNVTITLEDGSEK-EVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSM-KQQNRSEILELLGKTP 543 (725) T ss_pred HH----cCCCcEEEEecCCCCcc-eeEeccccccccccchhhhhccccceeEEEeeccCcHHH-HHHHHHHHHHHHHhcc Confidence 21 12333455543332111 12222111 111111111110 0000001111100 0111111111100 00 Q ss_pred h--------hhhhH--------HHHHh---hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 606 D--------FAKRI--------REFQP---QPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKAR 666 (726) Q Consensus 606 e--------~~~~l--------~~~~~---~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~ 666 (726) . +...+ .+... .....+....+...++.++..+++++++++...+...++++..+.+++.+ T Consensus 544 ~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ 623 (725) T protein:vir:10 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELA 623 (725) T ss_pred ccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 0 00000 00000 00000000001111111111111111111111111112222222222222 Q ss_pred HHHHHHHHHHHHH-----HHHHHHHHHH-----HHHHHHH-----HHHHHHHHHHHHHH---HHHHHHHHHHHHHhcC Q lcl|NC_013692. 667 ALASQADMTDLNF-----LEQESGVQQA-----RKRELQQ-----AQSEAQGKLAMLNS---QLKRLDEATSARTSQK 726 (726) Q Consensus 667 q~~~q~~~~~~e~-----~~qe~~~~~~-----~e~e~~~-----~q~~~q~~~~~l~~---~~~~~~~~~~a~~~~q 726 (726) +.+++..+.+.+. .++..+.+.. ...+... .+..++.+.+.... ..+...+++...+.++ T Consensus 624 ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~ 701 (725) T protein:vir:10 624 KAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQR 701 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 2222222222111 1111111110 0000000 01111111111111 1111112222222222 No 133 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.66 E-value=1.3e-07 Score=58.26 Aligned_cols=431 Identities=14% Similarity=0.070 Sum_probs=175.2 Q ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--CCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_013692. 23 QPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGK--PKT--EKGKS---AVQPPTIRKQAEWRYSSLSE 95 (726) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~--~~~--~~grs---~~v~~~v~~~v~~~~~~L~~ 95 (726) +++-.-+-.+..|.+ .+..+.....+-..||.|.-... ++. .+.|+ ++|..-....|+.....| T Consensus 1 ~~~~t~~~~~~~l~~-------~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-- 71 (456) T protein:vir:79 1 MTASTPAEWLPVLTK-------RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) T ss_pred CCCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhh-- Confidence 333222223333322 24444555677789998765321 111 12222 345556666666655444 Q ss_pred hhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCC Q lcl|NC_013692. 96 PFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMP 175 (726) Q Consensus 96 ~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~ 175 (726) -++++ .+ ....|.+..+. ++.+| ..|+.-.....++++++++|.+.+.+|=+ + T Consensus 72 ---~~~g~-~~--~~~~d~~~~~~----~~~~~-~~n~~d~~~~~~~~~a~~~G~a~~~~~~~-e--------------- 124 (456) T protein:vir:79 72 ---IPNGI-TV--GGSADSDLALR----ARRIW-RDNRMDSVCKQWVKYGLDFGESYLTCWRR-D--------------- 124 (456) T ss_pred ---ccCCe-ec--CCCCCccHHHH----HHHHH-HhcChhHHHHHHHHHHhhcCeeEEEEeeC-C--------------- Confidence 12222 22 12334443333 33444 34655556668899999999987754321 0 Q ss_pred cchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhhee--eC Q lcl|NC_013692. 176 DSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIV--ID 253 (726) Q Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~--~d 253 (726) .+.|.+..++|++++ || T Consensus 125 -------------------------------------------------------------dg~~~i~~~~p~~~~~i~d 143 (456) T protein:vir:79 125 -------------------------------------------------------------DGTATITADSPETMVVSVD 143 (456) T ss_pred -------------------------------------------------------------CCceEEEEeccceeEEEEc Confidence 011234556677644 44 Q ss_pred CCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeec Q lcl|NC_013692. 254 PSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDI 333 (726) Q Consensus 254 p~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~ 333 (726) |.... ...+.++ .+-+.++ .... ... +. ....+.++.+|..+. T Consensus 144 ~~~~~---~~~~~~~-~~~~~d~--------~~~~----------~~~------------~~--~~~~~~~~~~~~~~~- 186 (456) T protein:vir:79 144 PLQPW---RIRSAMR-WWRDLDA--------ESDF----------AIV------------WS--GDGWQKFARPCFVQS- 186 (456) T ss_pred CCCCC---ceEEEEE-EEEecCC--------ceeE----------EEE------------Ec--CCceEEEEEEEEeec- Confidence 43221 1222222 1111100 0000 000 00 001122222221110 Q ss_pred CCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCce Q lcl|NC_013692. 334 HGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQV 413 (726) Q Consensus 334 ~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~ 413 (726) .. .....+..++........|..++.+|++++. ...|.|.++.++++++.+|..++.+...+...+.+.. T Consensus 187 ---~~-~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~ 256 (456) T protein:vir:79 187 ---SS-RRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQR 256 (456) T ss_pred ---cc-cceeeeccCCceeecccccCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHH Confidence 00 1111122222222223334455667776542 3567899999999999999999998877777766655 Q ss_pred Eeecccc-------------cchhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhh Q lcl|NC_013692. 414 GVMKGAL-------------DVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNA 480 (726) Q Consensus 414 ~~~~gav-------------~~~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~ 480 (726) .+ .|.- +..+.....+|.++..+++.. +...+..+ ...+...+..+...+-..||++.... T Consensus 257 ~~-~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~----~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~ 330 (456) T protein:vir:79 257 AL-KSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVD----IWESQTND-FTPMLSAIKEHIRQLSSATKTPLPML 330 (456) T ss_pred HH-hcCCcccccccccccccchhhhhhhhccccccCCCCcc----eeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHh Confidence 44 2221 111122234555555544332 22222221 23344445555556667778888887 Q ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCe--EEEEecccceecchhhcccccce Q lcl|NC_013692. 481 GISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVE--VVRITNEHFVDIRRDDLAGNFDL 558 (726) Q Consensus 481 G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~--~iRi~~~~~v~v~~~~~~~~~dv 558 (726) |... .+.|+.++......-.......-+.|..+++++++.++. +..... .+++. |-...+.......|. T Consensus 331 ~~~~--~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~----~~g~~~~~~i~v~---w~~~~~~s~~~~ada 401 (456) T protein:vir:79 331 MPDS--ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ----IEGESVEDTVDVS---FESPDRVTLGEKYSA 401 (456) T ss_pred cccc--cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hcCCCccccceEE---eCCCCCcCHHHHHHH Confidence 7322 234666666666555555556666666777666665543 333211 11110 111111100000010 Q ss_pred eee-cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhh Q lcl|NC_013692. 559 KLD-ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIA 622 (726) Q Consensus 559 ~i~-~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~ 622 (726) ... .+.+..+.. ..+..++ ..+..+....+....+. .......+- ..++++... T Consensus 402 ~~kl~~~G~~~~~------~~~~~lg-~~~~~i~~~e~~r~~~e--~~~~~~~~~-~~~~~~~~~ 456 (456) T protein:vir:79 402 ASLAKAAGESWAS------IRRNILN-YNADQIKQDDLDRAREQ--ITLFAGNPV-QRPQEDGSR 456 (456) T ss_pred HHHHHhcCCChHH------HHHhcCC-CCHHHHHHHHHHHHHHH--HHHHhhhHh-hcCCCCCCC Confidence 000 000000000 0000000 00111111000000000 000000000 001111111 No 134 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.51 E-value=4e-07 Score=55.66 Aligned_cols=585 Identities=12% Similarity=0.009 Sum_probs=182.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHH-HHHHhhcCCCceEEEecCCcchHHHH Q lcl|NC_013692. 39 YQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYS-SLSEPFLSSPNIFEVNPVTWEDAESA 117 (726) Q Consensus 39 ~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~-~L~~~f~~~~~~~~~~p~~~~D~~~A 117 (726) ..+++ ....+++.+|...- ..+-+|+-- ..-.-|..|+.| .+...+ T Consensus 1 m~d~~-------~~~~~~~~~~~~~~------------------~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~ 47 (725) T protein:vir:92 1 MADNE-------NRLESILSRFDADW------------------TASDEARREAKNDLFFSRISQW--------DDWLSQ 47 (725) T ss_pred CCchH-------HHHHHHHHHHHHHH------------------HhhHHHHHHHHHHHHhhcCCCC--------CHHHHH Confidence 11100 11333333332111 111112221 122234455533 222222 Q ss_pred HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCC Q lcl|NC_013692. 118 RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPS 197 (726) Q Consensus 118 ~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 197 (726) . + +..|... .+.+.-.+.. -.+.++..++.++++|+.. ....+++...-+..+- . T Consensus 48 ~-----l------~~q~rp~-~N~i~~~i~~-----------v~g~e~~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~-~ 102 (725) T protein:vir:92 48 Y-----T------TLQYRGQ-FDVVRPVVRK-----------LVSEMRQNPIDVLYRPKDG-ASPDAADVLMGMYRTD-M 102 (725) T ss_pred H-----H------HhcCCCc-ccchHHHHHH-----------HHhhHHhCCcceEEecCCc-cHHHHHHHHHHHHHHH-H Confidence 1 1 1122221 1233322211 1233445677777778643 3333444333333222 2 Q ss_pred chhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhh--eeeCCCC-CCchhhCCeEEEEEeccH Q lcl|NC_013692. 198 EYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNN--IVIDPSC-GSDFSKAKFLIETFESSY 274 (726) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~--~~~dp~a-~~d~~da~~~~~~~~~t~ 274 (726) +........+.+|.+....|.||..+. ..+.. -+|++ +.+.... ..++...-|--+.+..++ T Consensus 103 ~~~~~~~a~s~Af~~~i~~G~G~~ev~--~d~~~-------------~d~~~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~ 167 (725) T protein:vir:92 103 RHNTAKIAVNVAVREQIESGVGAWRLV--TDYED-------------QSPTSNNQVIRREPIHSACSHVIWDSNSKLMDK 167 (725) T ss_pred HhhCchHHHHHHHHHHhhcCcceeeee--ecccC-------------CCCCCCceeeEEeeccCChhhcccCchhhccCh Confidence 345667788999999999999984431 11110 01111 1111000 001111112222233344 Q ss_pred HHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCc--eEEEEEEEEE----- Q lcl|NC_013692. 275 AELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDG--VLHPIVATWV----- 347 (726) Q Consensus 275 ~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g--~~~~~~~~~~----- 347 (726) +|..=..+.+.++. +........+....... ..+.+... ...-|.. .+. +.++++.... T Consensus 168 sDar~~~~~~~~~~----d~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~----~d~vrv~e~~~r~~~~~~~~ 233 (725) T protein:vir:92 168 SDSRHCTVIHSMSQ----NGWEDFAEKYDLDADDI--PSFQNPND----WVFPWLT----QDTIQIAEFYEVVEKKETAF 233 (725) T ss_pred hhHHHHHHHhcCCH----HHHHHHHhhcCcchhhh--hhcccCCc----ccccccC----CCeEEEEEEEEEEEEeeeEE Confidence 44211101111110 00000000110000000 01111000 0111221 122 2233222111 Q ss_pred ------CCEEEEeccCCC--------------------------C---------CC--ccceEEeeeeeecCcccC-CC- Q lcl|NC_013692. 348 ------GAVMIRMEENPF--------------------------P---------DK--RIPYVVVNYIPRKRDLYG-ES- 382 (726) Q Consensus 348 ------g~~~l~~~~~P~--------------------------~---------~~--~~Pf~~~~~~~~~~~~~g-~g- 382 (726) ++.++.+.++.+ + ++ .+|.-.|++.|..+...+ .| T Consensus 234 ~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~ 313 (725) T protein:vir:92 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) T ss_pred eecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCc Confidence 111222111100 0 01 122222344444333221 12 Q ss_pred hH--HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccch----------hhhc--cc Q lcl|NC_013692. 383 DG--ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADP----------RAAV--HM 448 (726) Q Consensus 383 ~~--~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~----------~~~i--~~ 448 (726) .. -.++++-+....+...+...+...+........+..+..+...... -++...+ .+++ .. T Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:92 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY-----DGNDDYPYYLLNRTDENNGEMPTQP 388 (725) T ss_pred ccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHH-----hccCccceeeccccccccccccccC Confidence 11 4455566666666666666666666555544444333333322111 1221111 1111 12 Q ss_pred ccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 449 HTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMN 527 (726) Q Consensus 449 ~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li 527 (726) ......++-...+++++......+. ..+|.++..+|...+++++...++.+.... ....|.+.++.-.+.+-+++ T Consensus 389 i~~~~~~~~p~~~~~ll~~~~~~i~----~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~l 464 (725) T protein:vir:92 389 LAYYENPEVPQANAYMLEAATAAVK----EVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIY 464 (725) T ss_pred CcccCCCCchHHHHHHHHHHHHHHH----HHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222344555666676666666544 345877777887778889988887776654 45667777777666666666 Q ss_pred HHhcCcCeEEEEecccceecchhhcccccceeeeccc---chHHHHHHHHHHH---HHHHhhhccchhHHHHHHHHHHHh Q lcl|NC_013692. 528 AEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIST---AEEDNAKVNDLTF---MLQTMGPNMDPMMAQQIMGQIMEL 601 (726) Q Consensus 528 ~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~---~~~~~~~~~~l~~---l~q~~~~~~~~~~~~~~~~~~~~~ 601 (726) ..+...- .+.+..+.|..++-. ...+.++... .+......+.+.. ..-..++..+. .....+..+.++ T Consensus 465 L~lI~~~----~~~~r~~RI~~edg~-~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s-~r~~~~~~l~ql 538 (725) T protein:vir:92 465 QSIVNDI----YDVPRNVTITLEDGS-EKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQS-MKQQNRAEILEL 538 (725) T ss_pred HHHHHHh----cCCCcEEEEecCCCC-cceEEeccccccccccchhhhhccccceeeEEeeccChHH-HHHHHHHHHHHH Confidence 6543321 122234455433321 1222232211 0111111111100 00000111000 000111111111 Q ss_pred ------------------hhhhhhhhhHHHHHh---hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 602 ------------------KKMPDFAKRIREFQP---QPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGT 660 (726) Q Consensus 602 ------------------~~~~e~~~~l~~~~~---~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~ 660 (726) ....+.. ...+... .......+..+...+..++..+++++++++...+....++++.. T Consensus 539 ~~~~~~~~~~~~~~l~~~~~~~d~~-~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~ 617 (725) T protein:vir:92 539 LGKTPQGTPEYQLLLLQYFTLLDGK-GVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) T ss_pred HHhcccchhHHHHHHHHHhhcccch-HHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHH Confidence 1111000 0000000 00000100000011111111122222222221111111222222 Q ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHH Q lcl|NC_013692. 661 EQAKARALASQADMTD-----LNFLEQESGVQQA-----RKRELQQAQSEAQGKLAMLN--------SQLKRLDEATSAR 722 (726) Q Consensus 661 eqaq~~q~~~q~~~~~-----~e~~~qe~~~~~~-----~e~e~~~~q~~~q~~~~~l~--------~~~~~~~~~~~a~ 722 (726) .++..++.+++..+.+ .+...+..+.+.. ...+.+....++....+.++ +..+...++.... T Consensus 618 ~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~ 697 (725) T protein:vir:92 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQT 697 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHH Confidence 2222212222211111 1111111111100 00011100000000001000 0000000111111 Q ss_pred HhcC Q lcl|NC_013692. 723 TSQK 726 (726) Q Consensus 723 ~~~q 726 (726) .+++ T Consensus 698 ~~~~ 701 (725) T protein:vir:92 698 HKQR 701 (725) T ss_pred HHHH Confidence 1111 No 135 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.48 E-value=5e-07 Score=55.12 Aligned_cols=482 Identities=11% Similarity=0.033 Sum_probs=192.7 Q ss_pred CCCC-CCchHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCCC--CCC----------CCCCC--cCCCHHHHHH Q lcl|NC_013692. 23 QPEW-SNAPSLAQLKQDYQEA-KQVT-DEKITQINRWLDYMHVRGEGK--PKT----------EKGKS--AVQPPTIRKQ 85 (726) Q Consensus 23 ~~~~-~~~~~~~~~~~~~~~a-~~~~-~~~~~~~~~~~~~y~~~~~~~--~~~----------~~grs--~~v~~~v~~~ 85 (726) +|+= .+. .+..++.-|... ..++ +.+.....+..+||.|.-+.- +.. .+.++ +++..-.... T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~I 79 (537) T protein:vir:78 1 MTSPLLNK-PIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTEL 79 (537) T ss_pred CCcccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHH Confidence 2221 111 123334333221 1222 334455667789998764321 001 11111 5666666666 Q ss_pred HHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEE Q lcl|NC_013692. 86 AEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVK 165 (726) Q Consensus 86 v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~ 165 (726) |+.....| ||.+ +.|.+...++ ......++..+ .++..+.+...++++..+|.+...+||+.. T Consensus 80 vd~~~~yl----~G~P--v~~~~~d~~~----~e~~~~l~~~~--~~~~~~~~~el~~~~s~~G~ay~~~y~de~----- 142 (537) T protein:vir:78 80 VDQLAQYL----LSNG--VEVKVKDEDN----TQLDEILQEYF--DEDFQATIDTLVTNASKKGFEGIFARTTSE----- 142 (537) T ss_pred HHHHhhhh----cccC--ceeecCcchh----HHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeeEEEeeecCC----- Confidence 66666554 5544 3455432222 22344566544 355556777889999999999887776511 Q ss_pred ecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeee Q lcl|NC_013692. 166 EQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVC 245 (726) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v 245 (726) +.+++..+ T Consensus 143 ------------------------------------------------------------------------~~~~~~~i 150 (537) T protein:vir:78 143 ------------------------------------------------------------------------GKLKFQTV 150 (537) T ss_pred ------------------------------------------------------------------------CceEEEEE Confidence 01234556 Q ss_pred chhhee--eCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEE Q lcl|NC_013692. 246 DYNNIV--IDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLV 323 (726) Q Consensus 246 ~p~~~~--~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 323 (726) +|.++| ||.+ .+...++|.......+. ++. ....+. T Consensus 151 ~p~~~~pv~d~~-----~~~~~~~~~y~~~~~~~------~~~-------------------------------~~~~~~ 188 (537) T protein:vir:78 151 DGLTLIPVFDDY-----GVLKMIIRWYSEIRYST------KQQ-------------------------------STETIW 188 (537) T ss_pred ccceeEEEEcCC-----CCceeEEEEEeeeeccc------ccc-------------------------------CcceEE Confidence 677754 3321 12222222221111000 000 001122 Q ss_pred EEEEEEE-----eecCCCceEE-------------EEEEEEECC----EE--EEeccCCCCCCccceEEeeeeeecCccc Q lcl|NC_013692. 324 VHEYWGY-----YDIHGDGVLH-------------PIVATWVGA----VM--IRMEENPFPDKRIPYVVVNYIPRKRDLY 379 (726) Q Consensus 324 v~E~w~~-----~~~~~~g~~~-------------~~~~~~~g~----~~--l~~~~~P~~~~~~Pf~~~~~~~~~~~~~ 379 (726) .+|+|.. +...+.+... ..+.++... .. ......|.+++.+||+.+.. .-+ T Consensus 189 ~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~ 263 (537) T protein:vir:78 189 HADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKD 263 (537) T ss_pred EEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEecc-----Ccc Confidence 2222210 0011111100 001111000 00 01122334446677665554 346 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh--hhhcCCceEeecCccchhhhcccccCccchhH Q lcl|NC_013692. 380 GESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR--RRFDRGENYEFNPGADPRAAVHMHTFPEIPQS 457 (726) Q Consensus 380 g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~--~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~ 457 (726) |.|.++.++++++.+|.++|.+.+.+...++|.+.+.-..++.... ......+++.+.... +.+.+...+..... T Consensus 264 ~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~---~~v~~l~~~~~~~~ 340 (537) T protein:vir:78 264 GMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDN---AGMEIQTVSIPYEA 340 (537) T ss_pred CCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCC---CceeEEEecCCHHH Confidence 8899999999999999999999999999998877653222222221 223334455554211 22455555555566 Q ss_pred HHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---cC Q lcl|NC_013692. 458 AQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLD---DV 534 (726) Q Consensus 458 ~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d---~e 534 (726) ....+..+...+...|.+++......| +.|+.|+..+...........-+.|..+++++++.++.++..... +. T Consensus 341 ~e~~ld~L~~~I~~~s~~~~~~~~~~g---n~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~ 417 (537) T protein:vir:78 341 RKAKMDIDVENIYRSGMGFNSTAVGDG---NVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDS 417 (537) T ss_pred HHHHHHHHHHHHHHhcCCCCCcccccc---CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc Confidence 777888888888888766654433222 246667777777766666666666777777777777766543211 01 Q ss_pred eEEEEecccceecchhhcccccceeee-cccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHH Q lcl|NC_013692. 535 EVVRITNEHFVDIRRDDLAGNFDLKLD-ISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIRE 613 (726) Q Consensus 535 ~~iRi~~~~~v~v~~~~~~~~~dv~i~-~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~ 613 (726) .-+.++ |....|.+.....+.... .+.+..+.. . ++. +.+.........+..+..+ ....++.+.+.+ T Consensus 418 ~~i~i~---f~~~~P~n~~e~a~~~~~l~~~giiS~e-----T-~l~-~~p~vdd~e~ek~~~ee~~-~~~~~~~~~~~~ 486 (537) T protein:vir:78 418 NDICFE---IEPHVLANELDIATTRKTEAETEALKIG-----N-IMT-VAPRIGDDETLKLIAEELD-LDYNELKDALAE 486 (537) T ss_pred ceeeEE---eccCCCCCHHHHHHHHHHHHhcCcchHH-----H-HHH-hCCCCCCHHHHHHHHHHHH-hhhhhhhhhhhh Confidence 111111 111111111100000000 000000000 0 000 0000000000000000000 000000000000 Q ss_pred HHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHH Q lcl|NC_013692. 614 FQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAG------LQDSKVGTEQAKA 665 (726) Q Consensus 614 ~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~------~~~~~~~~eqaq~ 665 (726) .+.+...... ..+.........-++.-. ......-.- --.+-- +. T Consensus 487 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~-----~~ 537 (537) T protein:vir:78 487 QDAQSLDVSP-DVQAMLDGLPVNANQPPV-DPNQPVADPNVVPPTDPNAVP-----QT 537 (537) T ss_pred hcccccCcCc-chhhhcCCCCCCCCCCCC-CccCCCCCCCCCCCCCCccCC-----CC Confidence 0000000000 000000000000000000 000000000 000000 00 No 136 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.42 E-value=7.3e-07 Score=54.22 Aligned_cols=592 Identities=12% Similarity=0.018 Sum_probs=186.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHH-HHHhhcCCCceEEEecCCcchHHHH Q lcl|NC_013692. 39 YQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSS-LSEPFLSSPNIFEVNPVTWEDAESA 117 (726) Q Consensus 39 ~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~-L~~~f~~~~~~~~~~p~~~~D~~~A 117 (726) ..+++ ....+++.+|...- ..+-+|+--. .-.-|..|+.| .+...+ T Consensus 1 m~d~~-------~~~~~~~~~~~~~~------------------~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~ 47 (725) T protein:vir:77 1 MADNE-------NRLESILSRFDADW------------------TASDEARREAKNDLFFSRVSQW--------DDWLSQ 47 (725) T ss_pred CCchH-------HHHHHHHHHHHHHH------------------HhhHHHHHHHHHHHHhhCCCCC--------CHHHHH Confidence 11110 11233333332111 1111122211 11234455533 222222 Q ss_pred HHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCC Q lcl|NC_013692. 118 RQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPS 197 (726) Q Consensus 118 ~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 197 (726) . + +..|... .+.+.-.+.. -.+.++..++.++++|+.. ....+++...-+..+- . T Consensus 48 ~-----l------~~q~rp~-~N~i~~~i~~-----------v~g~~~~nr~d~~v~P~~~-~d~~~Ae~l~~~~~~~-~ 102 (725) T protein:vir:77 48 Y-----T------TLQYRGQ-FDVVRPVVRK-----------LVSEMRQNPIDVLYRPKDG-ARPDAADVLMGMYRTD-M 102 (725) T ss_pred H-----H------HhcCCCc-cccHHHHHHH-----------HHhhHHhCCcceEEecCCc-cHHHHHHHHHHHHHHH-H Confidence 1 1 1122221 1333332211 1233445677777788643 3333333333333221 2 Q ss_pred chhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHH Q lcl|NC_013692. 198 EYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAEL 277 (726) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el 277 (726) +........+.+|.+....|.||..+. ..+. ..-+..+.+.|++++ ++.||... -|--+.+..+++|. T Consensus 103 ~~~~~~~a~s~Af~~~i~~G~G~~ev~--~d~~-~~d~~~~~~~i~~~~---~~~~~~~v------~~Dp~a~~~D~sDa 170 (725) T protein:vir:77 103 RHNTAKIAVNIAVREQIEAGVGAWRLV--TDYE-DQSPTSNNQVIRREP---IHSACSHV------IWDSNSKLMDKSDA 170 (725) T ss_pred HhhCchhHHHHHHHHHhhcCcceeeee--eccc-CCCCCCCceeeEEee---cccChhhc------eeCchhhccChhhH Confidence 345666788999999999999985532 2221 111222233333322 22233221 11112223344442 Q ss_pred HhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCc--eEEEEEEEEEC------- Q lcl|NC_013692. 278 KADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDG--VLHPIVATWVG------- 348 (726) Q Consensus 278 ~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g--~~~~~~~~~~g------- 348 (726) .=..+.+.++. +......+.+....... ..+.+... ...-|.. .+. +.++++..... T Consensus 171 r~~~~~~~~~~----d~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~----~d~vrv~E~~~r~~~~~~~~~~~ 236 (725) T protein:vir:77 171 RHCTVIHSMSQ----NGWEDFAEKYDLDADDI--PSFQNPND----WVFPWLT----QDTIQIAEFYEVVEKKETAFIYQ 236 (725) T ss_pred HHHHHHhcCCH----HHHHHHHhhCCcchhhc--cccccccc----ccccccC----CCeeEEEEEEEEEEEeeEEEEec Confidence 11101111110 00000000110000000 01111000 0111221 121 22322221111 Q ss_pred ----CEEEEeccCC--------------------------CC---------C--CccceEEeeeeeecCcccC-CChH-- Q lcl|NC_013692. 349 ----AVMIRMEENP--------------------------FP---------D--KRIPYVVVNYIPRKRDLYG-ESDG-- 384 (726) Q Consensus 349 ----~~~l~~~~~P--------------------------~~---------~--~~~Pf~~~~~~~~~~~~~g-~g~~-- 384 (726) +.++.+..+. |+ . ..+|.-.|++.|..+...+ .|.. T Consensus 237 ~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~ 316 (725) T protein:vir:77 237 DPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVY 316 (725) T ss_pred CCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCcccc Confidence 1111111000 00 0 0122222333333322211 1221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccch-------hhhc--ccccCccc Q lcl|NC_013692. 385 -ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGADP-------RAAV--HMHTFPEI 454 (726) Q Consensus 385 -~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~~-------~~~i--~~~~~~~~ 454 (726) -.++++-+....+...+...+...+..+.....+..+..+........+-.. .... .+++ ........ T Consensus 317 ~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~g~~~~~~i~~~~~ 394 (725) T protein:vir:77 317 EGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDY--PYYLLNRTDENSGDLPTQPLAYYEN 394 (725) T ss_pred cchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCC--ceecccccccCCCcccccCccccCC Confidence 3444555555655555666666666665554444444333333221111110 0010 1111 11112233 Q ss_pred hhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_013692. 455 PQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDD 533 (726) Q Consensus 455 ~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~ 533 (726) +.-....++++......+. ..+|.++..+|....++++...++.+.... ....|.+.++.-.+.+-+++..+... T Consensus 395 ~~lp~~~~~ll~~~~~~i~----~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~ 470 (725) T protein:vir:77 395 PEVPQANAYMLEAATSAVK----EVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVND 470 (725) T ss_pred CCchHHHHHHHHHHHHHHH----HHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4445566666666666543 345888878887777888887777766543 45566677777777777666654332 Q ss_pred CeEEEEecccceecchhhcccccceeeecc---cchHHHHHHHHHHH---HHHHhhhccchhHHHHHHHHHHHhhh-hhh Q lcl|NC_013692. 534 VEVVRITNEHFVDIRRDDLAGNFDLKLDIS---TAEEDNAKVNDLTF---MLQTMGPNMDPMMAQQIMGQIMELKK-MPD 606 (726) Q Consensus 534 e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~---~~~~~~~~~~~l~~---l~q~~~~~~~~~~~~~~~~~~~~~~~-~~e 606 (726) - .+.+..+.|...+-. ...+.++.. ..+......+.+.. ..-..++..+. .....+..+.++.. ++. T Consensus 471 ~----~~~~rv~RI~~ed~~-~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s-~r~~~~~~l~qll~~~~~ 544 (725) T protein:vir:77 471 I----YDVPRNVTITLEDGS-EKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQS-MKQQNRAEILELLGKTPQ 544 (725) T ss_pred H----cCCCcEEEEecCCCC-cceeeecccccccccchhHhhhhhccceeeEEeeccchHH-HHHHHHHHHHHHHHhccc Confidence 1 123334555443322 122333311 11111111122110 00000111110 00111111111110 000 Q ss_pred ----hhhhHHHH---------HhhhhhhhhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 607 ----FAKRIREF---------QPQPDPIAQQKA------QLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARA 667 (726) Q Consensus 607 ----~~~~l~~~---------~~~~~~~~qq~~------q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q 667 (726) ....+... .........+.. +......+...+.++++.++...+...+++++...+++..+ T Consensus 545 ~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~k 624 (725) T protein:vir:77 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) T ss_pred cchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 00000000 000000111100 00011111111111111111111111111111122222111 Q ss_pred HHHHHHHHHHH-----HHHHHHHHHHH-----HHHHHHHH-----HHHHHHHHH---HHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 668 LASQADMTDLN-----FLEQESGVQQA-----RKRELQQA-----QSEAQGKLA---MLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 668 ~~~q~~~~~~e-----~~~qe~~~~~~-----~e~e~~~~-----q~~~q~~~~---~l~~~~~~~~~~~~a~~~~q 726 (726) .+++..+.+.+ ...+.++.+.. ...+.+.. +...+.+.+ ...+..+...+++...++++ T Consensus 625 aq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~ 701 (725) T protein:vir:77 625 AQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQR 701 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhH Confidence 22221111111 11111111100 00000000 000011110 11111122222222222222 No 137 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.14 E-value=3.7e-06 Score=50.38 Aligned_cols=588 Identities=11% Similarity=0.025 Sum_probs=184.3 Q ss_pred HHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHH--HH-HHHHhhcccchhHHHHHHH----HHhhcCCeEEE Q lcl|NC_013692. 82 IRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGL--VL-NQQFNTKLNKQRFIDEYVR----AGVDEGTIIVK 154 (726) Q Consensus 82 v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~--~~-n~~~~~~~~~~~~~~~~~~----~~l~~~~~i~k 154 (726) -.+..+.++..|++-|...-.+ ..+=...+..... |+ ..+|..... ..|+..-+ -+|..+ .|+ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~------~~~~r~~~~~d~~f~~y~G~Qw~~~~~--~~l~~~~q~~~rP~~~~N--~i~ 70 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSP------QQEVREKCIEATRFARVPGGQWEGATA--AGTKLDEQFEKYPKFEIN--KVA 70 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhh------hHHHHHHHHHHHHhhccCCCCCCHHHH--HHHHhhhhhcCCCceEEc--chH Confidence 4555555666777766432111 1111122222222 11 113322111 11111111 112111 122 Q ss_pred EeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccc Q lcl|NC_013692. 155 VGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREE 234 (726) Q Consensus 155 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 234 (726) ..++.-.+.++..++.++++|+...+...+++...-+..+ ..+........+.+|......|.||..+.. .|..+.. T Consensus 71 ~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~-~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~--d~~~e~d 147 (708) T protein:vir:17 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRA-DYEETDGGEACDNAFDDAATGGFGCFRLTS--MLVNEYD 147 (708) T ss_pred HHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHH-HHHhcCchhHHhHHHHHhhhcccceeeeee--cccccCC Confidence 2334445667778889999999533323344433333321 222445667899999999999999854321 1110000 Q ss_pred eeeccceeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhcccccccc Q lcl|NC_013692. 235 TVENHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDF 314 (726) Q Consensus 235 ~~~~~p~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (726) +.. .|.++.+-+.- .++...-|=-..+..+++|..=. ....+...+.....-++.... ......+ T Consensus 148 ~~~--------~~~~i~i~~~~-~~~~~v~~Dp~a~~~D~sDar~~----~~~~~~~~d~~~~~yp~~a~~--~~~~~~~ 212 (708) T protein:vir:17 148 PMD--------DRQRIAIEPIY-DPSRSVWFDPDAKKYDKSDALWA----FCMYSLSPEKYEAEYGKKPPA--SLDVTSM 212 (708) T ss_pred CCC--------CccccceEeec-cchhheecCccccccChhhhhhh----hhhccCCHHHHHHhCccccch--hhhhhhh Confidence 000 01111110000 00000000011112233331100 000000000000000000000 0000000 Q ss_pred CCcCCceEEEEEEEEEeecCCCc--eEEEEEEE--------E----ECCEEEEeccCC---------------------- Q lcl|NC_013692. 315 QDKSRKRLVVHEYWGYYDIHGDG--VLHPIVAT--------W----VGAVMIRMEENP---------------------- 358 (726) Q Consensus 315 ~~~~~~~v~v~E~w~~~~~~~~g--~~~~~~~~--------~----~g~~~l~~~~~P---------------------- 358 (726) . -..++ |. ..+. +.+.++.. + +|..+...+... T Consensus 213 ~------~~~~~-~~----~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r 281 (708) T protein:vir:17 213 T------SWEYD-WF----DADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKR 281 (708) T ss_pred c------ccccc-cc----CCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeE Confidence 0 01111 21 1121 12222211 1 122211111100 Q ss_pred --------------CCCCccceEEeeeeeecCcccC-CChH---HHHHHHHHHHHHHHHHHHHHHHhcCCCceEeeccc- Q lcl|NC_013692. 359 --------------FPDKRIPYVVVNYIPRKRDLYG-ESDG---ALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA- 419 (726) Q Consensus 359 --------------~~~~~~Pf~~~~~~~~~~~~~g-~g~~---~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~ga- 419 (726) ...+.+||-.|++.|..+..+. .|.. -.++++-+....+...+...+...+.. .... T Consensus 282 ~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~----~~~~~ 357 (708) T protein:vir:17 282 RRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQD----PGQIP 357 (708) T ss_pred EEEEEEeecccccccCCCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhc----CCcce Confidence 0112245555555555443221 1222 233334333333322222222222111 1111 Q ss_pred c-cchhhhhh--cCCc-------eEeecCccchhhhccc----ccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcc Q lcl|NC_013692. 420 L-DVTNRRRF--DRGE-------NYEFNPGADPRAAVHM----HTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGA 485 (726) Q Consensus 420 v-~~~d~~~~--~~g~-------vi~~~~~~~~~~~i~~----~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~ 485 (726) + +......+ .... ...+++..++...+.. ....+.+......+++++.....+.-+ +|.++. T Consensus 358 i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~----tGi~d~ 433 (708) T protein:vir:17 358 IVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEV----TGGSQA 433 (708) T ss_pred eechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHh----cCCChH Confidence 1 10011011 1000 0111111111111211 112233455556666666666555443 576666 Q ss_pred cchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeec-- Q lcl|NC_013692. 486 ALGDTATAVRGALDAASKRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDI-- 562 (726) Q Consensus 486 ~~~~ta~~i~~~~~~~~~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~-- 562 (726) ++|.+ +.+++...++.+.... ....|.+.++.-.+.+.+++..+..+- .+.+-.+.|..++-..++ +.++. T Consensus 434 ~~G~~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~----y~~~R~~RI~~edg~~~~-v~in~~~ 507 (708) T protein:vir:17 434 MQQMP-SNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV----YGSEREVRIVNEDGSDDI-AVLSAQV 507 (708) T ss_pred HccCc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCCcce-eeeccee Confidence 66653 3467776666555443 355566677777777777666543332 122334555443321111 11110 Q ss_pred ---------------------ccchHHH-HHHHH-HHHHHHHhhhccchhHHHH--HHHHHHHhhhhhhhhhhHHHHHhh Q lcl|NC_013692. 563 ---------------------STAEEDN-AKVND-LTFMLQTMGPNMDPMMAQQ--IMGQIMELKKMPDFAKRIREFQPQ 617 (726) Q Consensus 563 ---------------------~~~~~~~-~~~~~-l~~l~q~~~~~~~~~~~~~--~~~~~~~~~~~~e~~~~l~~~~~~ 617 (726) ....... .-..+ ....+..+....++..... ++..+.+.+..+...+........ T Consensus 508 ~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~ 587 (708) T protein:vir:17 508 VDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQ 587 (708) T ss_pred ccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHH Confidence 0011101 11111 1112222223333322221 122233334444433322222222 Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 618 PDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQA 697 (726) Q Consensus 618 ~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~ 697 (726) .....+..+. .....+...++++++.++.......++++....++++++.++++.+.+.+..+.+.. ..+...+.. T Consensus 588 ~~~~~~~~~~-~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~---~~~a~~~a~ 663 (708) T protein:vir:17 588 LLISGIAKPR-NEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQD---AMESQANTV 663 (708) T ss_pred hhccccccCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Confidence 2111111111 111111111111111111111111222222222222222222222222211111110 111111111 Q ss_pred HHHHHH---HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 698 QSEAQG---KLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 698 q~~~q~---~~~~l~~~~~~~~~~~~a~~~~q 726 (726) +.-.++ .+.+.....+.++..+...+++. T Consensus 664 q~~~q~~~~~~~~~~~~~~~l~~~q~~q~q~~ 695 (708) T protein:vir:17 664 YKLAQARNIDDKAVMEAIRLLKDVAESQQQQF 695 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHH Confidence 111111 11112222222222221111111 No 138 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.07 E-value=5.4e-06 Score=49.44 Aligned_cols=595 Identities=12% Similarity=0.062 Sum_probs=135.6 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEE Q lcl|NC_013692. 26 WSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFE 105 (726) Q Consensus 26 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~ 105 (726) ++++...+.+.. ++ .-+.|...|.....|.=..|..- T Consensus 1 ~~k~~~~~~~~~---------~~-------------------------~~~~~~~~~~~a~~~~~~~~~~~--------- 37 (705) T protein:vir:88 1 MAKRRKIKPMDD---------EQ-------------------------VLRHLDQLVNDALDFNSSELSKQ--------- 37 (705) T ss_pred CCcccccccCCH---------HH-------------------------HHHHHHHHHHHHHhhhhhHHHHH--------- Confidence 111111111000 00 01223333333322211111100 Q ss_pred EecCCcchHHHHHHHHHHHHHHHhhcccchh--------HHHHHHHHHhhc----CCeEEEEeeeeeeeeEEeccccccc Q lcl|NC_013692. 106 VNPVTWEDAESARQNGLVLNQQFNTKLNKQR--------FIDEYVRAGVDE----GTIIVKVGWNYQSRTVKEQVVTYEM 173 (726) Q Consensus 106 ~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~--------~~~~~~~~~l~~----~~~i~k~~w~~~~~~~~~~~~~~~~ 173 (726) ..+....|+...+.....|.+ ..-+|+...|+. |.-+++ + T Consensus 38 ----------~~~~~~~y~g~~~~~~~~~~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~------------------~ 89 (705) T protein:vir:88 38 ----------RSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVK------------------Y 89 (705) T ss_pred ----------HHHHHHHHhCCCCCcccCCCCccccHHHHHHHHHHHHHHHHhhcCCCceEE------------------E Confidence 011111111111111111100 111455555532 222222 1 Q ss_pred CCcc--hHHHHHHhhhhhhh-hhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccce----eeccceeeeec Q lcl|NC_013692. 174 MPDS--SEELAQIYQTAAQI-REESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREET----VENHPTVQVCD 246 (726) Q Consensus 174 ~~~~--~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~----~~~~p~i~~v~ 246 (726) .|.. +...+......... ..+. +.....+...+.+....|.|+..+ .|...... +++-+ -+. T Consensus 90 ~p~~~~D~~~a~~~~~~~~~~~~~~----~~~~~~~~~~~~dal~~g~gi~kv----~we~~~~~~~e~~~~~~---~~~ 158 (705) T protein:vir:88 90 EPDTAEDVEQAEQETEYVNYLFMRK----NEGFKVMFDWFQDTLMMKTGVVKV----YVEEVLKPTFERFSGLS---EDM 158 (705) T ss_pred eeCChhHHHHHHHHHHHHhHHHhhc----cchhHHHHHHHHHHhhcCCeEEEe----ccccccchhhhhhccCC---hhh Confidence 2221 12222222222221 1111 111233445566666777776443 22222111 11111 112 Q ss_pred hhheeeCCCCC----CchhhCCe-----------EEEEEeccHHHHHhcCCCcchhhcCcccc----------hhh-ccc Q lcl|NC_013692. 247 YNNIVIDPSCG----SDFSKAKF-----------LIETFESSYAELKADGRYQNLDKIQVEGQ----------NLL-SEP 300 (726) Q Consensus 247 p~~~~~dp~a~----~d~~da~~-----------~~~~~~~t~~el~~~g~~~~~d~~~~~~~----------~~~-~~~ 300 (726) .-.++.||.+. ++..+..| .++...++..++ .++++.-.+....+ ... ... T Consensus 159 l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~---~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~ 235 (705) T protein:vir:88 159 VADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENF---LVDRLATCIDDARFLCHREKYTVSDLRLLGV 235 (705) T ss_pred hhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHc---eecCCCCCcccCcEEEEEEeccHHHHHhhcC Confidence 23344455321 11111111 111122333332 11221100000000 000 000 Q ss_pred chhh-hhccccc------------cccCCcCCceEEEEEEEEEeecCCCceEEEEEEE-EECCEEEEeccCCCCCCc--- Q lcl|NC_013692. 301 DYTG-PSEGVRN------------FDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVAT-WVGAVMIRMEENPFPDKR--- 363 (726) Q Consensus 301 ~~~~-~~~~~~~------------~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~~~-~~g~~~l~~~~~P~~~~~--- 363 (726) +... ....... .+..+.. ....+.+.|.......--+.++|+.+ .-|+.+.+.....|..+. T Consensus 236 ~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~-~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~ 314 (705) T protein:vir:88 236 PEDVIEELPYDEYEFSDSQPERLVRDNFDMT-GQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIIS 314 (705) T ss_pred ChhHhhhhhcccccchhhhhhhccccccccc-cccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccc Confidence 0000 0000000 0000000 01111221111000000001111111 111111111111111111 Q ss_pred -cceEEeeeeeecCcccCCCh-HHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeecCccc Q lcl|NC_013692. 364 -IPYVVVNYIPRKRDLYGESD-GALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFNPGAD 441 (726) Q Consensus 364 -~Pf~~~~~~~~~~~~~g~g~-~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~~~~~ 441 (726) .|+-.+++...+--..+.++ ...+.+.-.-+....+.+...+.-.. .....+...+ +.+. ..+...+...||+. T Consensus 315 ~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~-~~~~~~~~~~-~~g~--v~~~d~~~~~pg~v 390 (705) T protein:vir:88 315 NEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNI-YRTNQGRSVV-LDGQ--VNLEDLLTNEAAGI 390 (705) T ss_pred cccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHH-HhccCCceec-cccc--cCcccccccCCCee Confidence 11111111111111111122 23334333333333343333332111 1111111212 1111 12333444445442 Q ss_pred hhhhc-ccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHH-----HHHHHHHHH Q lcl|NC_013692. 442 PRAAV-HMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKREL-----GILRRLSAG 515 (726) Q Consensus 442 ~~~~i-~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~-----~~~~~~~~~ 515 (726) ..-.- .-..+-+.+.-...+.++++. +...-+...|.+.-..|-.+.++..-..++...+. ..+..+.+. T Consensus 391 v~~~~~~~i~~~~~~~~~~~~~~ll~~----~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~ 466 (705) T protein:vir:88 391 VRVKSMNSITPLETPQLSGEVYGMLDR----LEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARM 466 (705) T ss_pred EEecCCCccccccCCcCcHHHHHHHHH----HHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 22111 111222233333334444443 33344556676554433222222211111111111 112222222 Q ss_pred HH-HHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeee--cccc-------hH-HHHHHHHHHHHHHHhhh Q lcl|NC_013692. 516 II-EIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLD--ISTA-------EE-DNAKVNDLTFMLQTMGP 584 (726) Q Consensus 516 ~~-~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~--~~~~-------~~-~~~~~~~l~~l~q~~~~ 584 (726) +. ...+.++.++....-.- ......+.+.. .+ +.+. ...+ .. ......+....+..+.. T Consensus 467 ~a~~~~~~l~~~~~~li~~~----~~~~~~~ri~g-----~~-v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~ 536 (705) T protein:vir:88 467 FAETGVKRLFQLLHDHAIKY----QNQEEVFQLRG-----KW-VAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWE 536 (705) T ss_pred HHHHHHHHHHHHHHHHHHHh----CCCceEEeecc-----ch-hccchHhhccCCceEEeeccccchHHHHHHHHHHHHH Confidence 21 22233333333322211 11222344432 21 1111 0000 00 00111121111111111 Q ss_pred ccchhHHHHHHHHHHHhhhhhhhhhhH---------HHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 585 NMDPMMAQQIMGQIMELKKMPDFAKRI---------REFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQD 655 (726) Q Consensus 585 ~~~~~~~~~~~~~~~~~~~~~e~~~~l---------~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~ 655 (726) ..+.......+..+.......+....+ ..+...+...+++..+++..+.+.+.++..++++...+.++. . T Consensus 537 ~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~-e 615 (705) T protein:vir:88 537 MAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQS-D 615 (705) T ss_pred HHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHH-H Confidence 000000000001011111111111111 111111111111111111111111111111111111111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhcC Q lcl|NC_013692. 656 SKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLN-------SQLKRLDEATSARTSQK 726 (726) Q Consensus 656 ~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~-------~~~~~~~~~~~a~~~~q 726 (726) +...+.+++..+.++++++++++.++++.... +++.+.++.+.+.+++..+++ .+++..+.........+ T Consensus 616 ~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~-~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~ 692 (705) T protein:vir:88 616 ALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQ-QREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGK 692 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11112222333344445444444443332222 222222221111111111111 11111111111111111 No 139 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=98.01 E-value=7.4e-06 Score=48.73 Aligned_cols=583 Identities=11% Similarity=0.041 Sum_probs=182.8 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHH Q lcl|NC_013692. 41 EAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQN 120 (726) Q Consensus 41 ~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~ 120 (726) -|+....|. -.+.|- . +.+...+.+.--....+...+|+...+. . ..+=...|.-. T Consensus 1 ~~~~~~~~~------~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~------~~~~r~~a~~d 56 (711) T protein:vir:10 1 MAKKQKKSR------VEQLYA--K--KAKVYAKNNDDDRALLATARERARDGAT--------Y------WKDNWEAAEDD 56 (711) T ss_pred CCccccccc------ccchhH--H--HHHhcccCcchHHHHHHHHHHHHHHHHh--------h------hHHHHHHHHHH Confidence 011000000 111121 0 0111111111111122222233322110 0 11111112211 Q ss_pred HHHHH-HHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcc---------------------h Q lcl|NC_013692. 121 GLVLN-QQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDS---------------------S 178 (726) Q Consensus 121 t~~~n-~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~---------------------~ 178 (726) -.|++ .+|..+. ...++..-+.++..+ +|+..++.-.+.++..++.+.++|+. . T Consensus 57 ~~fy~G~Qw~~~~--~~~l~~~g~p~~~~N--~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~ 132 (711) T protein:vir:10 57 LKFLGGEQWPSQV--RTERELEQRPCLVNN--VLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAG 132 (711) T ss_pred HHHhCCCCCCHHH--HHHHHhcCCCcEEEc--chHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCC Confidence 22211 1111000 001111111111111 11112222234445566666666642 1 Q ss_pred HHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccceeeeechhheeeCCCCCC Q lcl|NC_013692. 179 EELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHPTVQVCDYNNIVIDPSCGS 258 (726) Q Consensus 179 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p~i~~v~p~~~~~dp~a~~ 258 (726) .....+++..+.+... -..........+.+|.+....|.||.. +++|+... T Consensus 133 ~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~af~d~~~~G~G~~e---------------------------v~~d~~~~- 183 (711) T protein:vir:10 133 KNDYELAEVFTGLIKN-IEYNCDAETEYDIAFQGAVESGMGYLR---------------------------VRSDYLAD- 183 (711) T ss_pred hhHHHHHHHHHHHHHH-HHHhcChhHHHHHHHHHhhhcCcceEE---------------------------EEecccCC- Confidence 2222222333333221 122345566788889888889998743 22222211 Q ss_pred chhhCCeEEEE-------------EeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCceEEEE Q lcl|NC_013692. 259 DFSKAKFLIET-------------FESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVH 325 (726) Q Consensus 259 d~~da~~~~~~-------------~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 325 (726) |.-+-.+++.+ +..+.+|..=.. +..+...+.....-++... .... ...+.-+ T Consensus 184 d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~----~~~~~~~~~~~~~yp~~a~-----~~~~-----~~~~~~~ 249 (711) T protein:vir:10 184 DSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCL----IDDTMSKEKFKALYPDATA-----EPVY-----EDSVADY 249 (711) T ss_pred CCCCCCeEEeeecChhheeeCccccccChhhhccee----eeecCCHHHHHHhCCchhh-----hhhh-----ccccccc Confidence 11111222211 112222211000 0000000000000000000 0000 0011112 Q ss_pred EEEEEeecCCCce--EEE--------EEEEEECCEEEEeccC-C---------------------------CC-----CC Q lcl|NC_013692. 326 EYWGYYDIHGDGV--LHP--------IVATWVGAVMIRMEEN-P---------------------------FP-----DK 362 (726) Q Consensus 326 E~w~~~~~~~~g~--~~~--------~~~~~~g~~~l~~~~~-P---------------------------~~-----~~ 362 (726) ..|+.- +.+ .+. +++.+.++.......+ + +- ++ T Consensus 250 ~~~~~~----~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~ 325 (711) T protein:vir:10 250 DTWFTE----KSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEG 325 (711) T ss_pred CcccCc----ceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecC Confidence 223221 111 111 1111122111111110 0 00 01 Q ss_pred --ccceEEeeeeeecCccc---CCChHHH-HHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceE-- Q lcl|NC_013692. 363 --RIPYVVVNYIPRKRDLY---GESDGAL-LIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENY-- 434 (726) Q Consensus 363 --~~Pf~~~~~~~~~~~~~---g~g~~~~-~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi-- 434 (726) -+|+-.|++.|..+... +.+.... +.++-+....++..+.-.+...+.. ..+-+ ....|.+- T Consensus 326 ~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~----~~~~~------~~~~gai~~~ 395 (711) T protein:vir:10 326 PVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALA----PKAPF------IGSEGNVEGR 395 (711) T ss_pred CCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhc----CCCce------eecCcccCCh Confidence 12222233333322211 2232232 2334343333333333333333211 11111 11222221 Q ss_pred -------eecCccchh-----hhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHH Q lcl|NC_013692. 435 -------EFNPGADPR-----AAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAAS 502 (726) Q Consensus 435 -------~~~~~~~~~-----~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~ 502 (726) ..++++... .+.....+...++-....+++++.....+.. .+|.+..++|..+.++++...++. T Consensus 396 ~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~----~tGi~~~~~G~~~n~~Sg~ai~~~ 471 (711) T protein:vir:10 396 EDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKS----TMGMYDASLGAMGNETSGRAIIAR 471 (711) T ss_pred HHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHH----HhCCChHHcCCCccchHHHHHHHH Confidence 012222110 0111122223344556666666666665554 457777777776777888777666 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchhhcccccceeeecc------------------ Q lcl|NC_013692. 503 KRELG-ILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRDDLAGNFDLKLDIS------------------ 563 (726) Q Consensus 503 ~~~~~-~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~------------------ 563 (726) +.... .+..|.+.++...+.+.+++..+...- .+.+..+.|..++-..++ +.++.. T Consensus 472 q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~----~~~er~~rI~ged~~~~~-v~ln~~~~~~~~G~~~~~nDi~~g 546 (711) T protein:vir:10 472 QRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHI----YDTERVVRLKFPDETEDF-VKLNEQIFDEESGEWVTIHDLNVQ 546 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCeEEEEecCCCCcce-EEecccccccccccceeeecccee Confidence 66543 355566777777777777766543321 123335556544322221 222211 Q ss_pred -----cchH-HHHHHHHHH-HHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHH Q lcl|NC_013692. 564 -----TAEE-DNAKVNDLT-FMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQ 636 (726) Q Consensus 564 -----~~~~-~~~~~~~l~-~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq 636 (726) .... ......+.. ..+..+.+.. |.....++..+++.+.++...+..........+. ...+..+.+..+++ T Consensus 547 ~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~-p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~-~~~~~~~~~~qq~~ 624 (711) T protein:vir:10 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQAV-PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPN-VLSKDEREAIEEDM 624 (711) T ss_pred eeEEEEeeccCchhHHHHHHHHHHHHHhhc-chhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcc-cCcchhhhHHHHHH Confidence 1111 111122221 1122233322 3455555556666666666544433333222211 11112221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_013692. 637 IEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQS-EAQGKLAMLNSQLKRL 715 (726) Q Consensus 637 ~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~-~~q~~~~~l~~~~~~~ 715 (726) .++ +.+..+.+.+.+.++....+++++..++++.+.+.+ ....+...+....+.++ ...++++.+..++++. T Consensus 625 ~e~---qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~----~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~ 697 (711) T protein:vir:10 625 PEQ---TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQ----LETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQA 697 (711) T ss_pred HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 111111111122222222222222222222211111 11111111111111111 1222333333444444 Q ss_pred HHHHHHHHhcC Q lcl|NC_013692. 716 DEATSARTSQK 726 (726) Q Consensus 716 ~~~~~a~~~~q 726 (726) +.+..+.+.+- T Consensus 698 qaelq~~q~~~ 708 (711) T protein:vir:10 698 LAEITASQANV 708 (711) T ss_pred HHHHHHHHHHh Confidence 44444444333 No 140 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=97.34 E-value=9e-05 Score=42.75 Aligned_cols=580 Identities=11% Similarity=0.050 Sum_probs=162.9 Q ss_pred HHHHHHHHHHH------------HHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhh Q lcl|NC_013692. 80 PTIRKQAEWRY------------SSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVD 147 (726) Q Consensus 80 ~~v~~~v~~~~------------~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~ 147 (726) ..|-+....-+ +.++.-|+. +.......-.++.+....|-..+|..+. ...|+..-+-+|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~q~~~r~~a~~d~~fy~G~QW~~~~--~~~l~~~g~p~~~ 73 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINY-----EIEDQPAWRAVADKEMDYADGNQLDTEL--LRRQQALGIPPAV 73 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHH-----HHhccHHHHHHHHHHHHhhcCCCCCHHH--HHHHHhcCCCcEE Confidence 11111111111 011111110 0000000111111111112122221111 0112221122221 Q ss_pred cCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccc Q lcl|NC_013692. 148 EGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGS 227 (726) Q Consensus 148 ~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 227 (726) .+ .|+..++.-.+.++..++.++++|+.......+++....+..+ ..+........+.+|.+....|.||..+ T Consensus 74 ~N--~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~-~~~~~~~~~~~s~Af~~~i~~G~Gw~e~---- 146 (772) T protein:vir:10 74 ED--LIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNT-AERQSGADRACSEAFRPQIACGIGWVEV---- 146 (772) T ss_pred Ec--chHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHH-HHHhcChHHHHHHHHHHhhhcCceeEEe---- Confidence 11 1222334445667778889999997543333444444443322 2224456678889999999999988431 Q ss_pred eeecccceeeccceeeeechhh--eee---CCCCCCchhhCCeEEEEEeccHHHHHhcCCCcch--hh----cCcccchh Q lcl|NC_013692. 228 EEEEREETVENHPTVQVCDYNN--IVI---DPSCGSDFSKAKFLIETFESSYAELKADGRYQNL--DK----IQVEGQNL 296 (726) Q Consensus 228 ~~~~~~~~~~~~p~i~~v~p~~--~~~---dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~--d~----~~~~~~~~ 296 (726) .+..+ |+. +++ ||...- |+ . .. ..+++|..=..+.+.+ +. ++.....+ T Consensus 147 -------~~~~d-------~~~~~i~i~~v~p~~v~-~D-p----~a-~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~ 205 (772) T protein:vir:10 147 -------SRESD-------PFKFPYRCRPIRRDEIH-WD-M----KC-GDDWEACRFLRRQRWLSPDRIALVFPEHAELI 205 (772) T ss_pred -------ccccC-------CCCCCeEEEeeCcccce-ec-C----CC-CCCHHHhhhhhhhccCCHHHHHHhCCCchhHH Confidence 01111 111 111 221110 00 0 00 1244442111111111 11 11111001 Q ss_pred hcccchhhhhccccccccC-CcCCc----eEEEEEEEEEe-----ecCCC--ceEE-EEEE-----EE---ECCEEEEec Q lcl|NC_013692. 297 LSEPDYTGPSEGVRNFDFQ-DKSRK----RLVVHEYWGYY-----DIHGD--GVLH-PIVA-----TW---VGAVMIRME 355 (726) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~-~~~~~----~v~v~E~w~~~-----~~~~~--g~~~-~~~~-----~~---~g~~~l~~~ 355 (726) ....+......+..+.++. +.... .......|... +...+ -+.+ |++. ++ .|+.+..+. T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~ 285 (772) T protein:vir:10 206 GMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDP 285 (772) T ss_pred HhhhhhcccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCc Confidence 1000000000011111110 00000 00011111110 00000 0122 2111 11 122222111 Q ss_pred c-------------------------------CCCCCCccce--EEeeeeeecCcccC-CChHH-HHHHHHHHHHHHHHH Q lcl|NC_013692. 356 E-------------------------------NPFPDKRIPY--VVVNYIPRKRDLYG-ESDGA-LLIDNQRIIGAVTRG 400 (726) Q Consensus 356 ~-------------------------------~P~~~~~~Pf--~~~~~~~~~~~~~g-~g~~~-~~~d~Q~~~N~~~~~ 400 (726) . .-...+..|| -.|++.|+.+.... .|... .++++-+....+... T Consensus 286 ~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~ 365 (772) T protein:vir:10 286 NNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSG 365 (772) T ss_pred ccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHH Confidence 1 1122222233 33455544333322 23222 223333333322222 Q ss_pred HHHHHHhcCCCceEeecccccchhhhhhcCCceEee-----cCccchhhhccccc-----------CccchhHHHHHHHH Q lcl|NC_013692. 401 MIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEF-----NPGADPRAAVHMHT-----------FPEIPQSAQYMINL 464 (726) Q Consensus 401 ~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~-----~~~~~~~~~i~~~~-----------~~~~~~~~~~ll~~ 464 (726) +...+ +++...++- ...|.+--. ...+.+...|.+.+ +...+......+++ T Consensus 366 ~S~~~-------~~l~~~~~~------~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 432 (772) T protein:vir:10 366 VSKLR-------WGMSVARVE------RTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQM 432 (772) T ss_pred HHHHH-------HHHhccccc------ccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHH Confidence 22222 222222221 122322111 11122222222221 11234445667777 Q ss_pred HHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEeccc Q lcl|NC_013692. 465 QQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKREL-GILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEH 543 (726) Q Consensus 465 ~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~-~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~ 543 (726) +......+.-| +|.++..+|....+++++..++.+... .....|.+.++...+.+.+++..+...- .+.+. T Consensus 433 lq~~~~~i~~v----sGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~----y~~er 504 (772) T protein:vir:10 433 LQDNRATIERV----SNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVED----IGQER 504 (772) T ss_pred HHHHHHHHHHH----hCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCc Confidence 77766655433 487777788777888888777666553 3455666667776777766666544331 12233 Q ss_pred ceecchhhcc-cccceeeeccc---chHHHHHHHHHHHHHHH----hhhccchhHHHHHHHHHHHhhhh--hhhhhhHHH Q lcl|NC_013692. 544 FVDIRRDDLA-GNFDLKLDIST---AEEDNAKVNDLTFMLQT----MGPNMDPMMAQQIMGQIMELKKM--PDFAKRIRE 613 (726) Q Consensus 544 ~v~v~~~~~~-~~~dv~i~~~~---~~~~~~~~~~l~~l~q~----~~~~~~~~~~~~~~~~~~~~~~~--~e~~~~l~~ 613 (726) .+.|...+-. .+.-+.++... .+......+.+...... .++..+. .....+..++++... +.....+-. T Consensus 505 ~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t-~r~~~~~~m~ql~~~~~P~~~~~~~~ 583 (772) T protein:vir:10 505 TEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNS-YRGQQLNAMSEAVKSMPPQYQAAVLP 583 (772) T ss_pred EEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchH-HHHHHHHHHHHHHhccChhHHHHHHH Confidence 4555433321 11112221100 00000000000000000 0000000 000111111111100 111111100 Q ss_pred HHhhhhhh------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 614 FQPQPDPI------AQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQ 687 (726) Q Consensus 614 ~~~~~~~~------~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~ 687 (726) +....... .....++..++...+.+++..++.+...++. ..+++.+++.++..+.+.+.++.+ .+ T Consensus 584 ~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~-------~~el~~~q~~a~~~~~~A~a~~~~--aq 654 (772) T protein:vir:10 584 FLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKA-------GNDIKLRELEIKERKADSEISGLN--AK 654 (772) T ss_pred HHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHH--HH Confidence 00000000 0000000000000000000000000000000 011111111111111111111000 00 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 688 QARKRELQQAQSEAQGKLAM-LNSQLKRLDEATSARTSQK 726 (726) Q Consensus 688 ~~~e~e~~~~q~~~q~~~~~-l~~~~~~~~~~~~a~~~~q 726 (726) ... ...+.+..+.+.-+.. ...++.+..+......-.+ T Consensus 655 a~~-~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~g~~ 693 (772) T protein:vir:10 655 AVQ-IGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSAGYQ 693 (772) T ss_pred HHH-HHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhcccc Confidence 000 0000011111100000 0001111111111111111 No 141 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=89.66 E-value=0.025 Score=29.36 Aligned_cols=131 Identities=14% Similarity=0.139 Sum_probs=15.9 Q ss_pred HHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 593 QIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQA 672 (726) Q Consensus 593 ~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~ 672 (726) ..++++.....+......+.++.......+.+..++.....++..+.+....+ ...........+..++...++.++ T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~e---e~i~~l~~~~~el~e~~~~l~~ei 77 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVE---DEINKLEGEKTELEEKKSKLEGEI 77 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222222222222222222222111111111111000000000000000 000000000000011111111111 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHH----HHHHHhcC Q lcl|NC_013692. 673 DMTDLNFLEQESGVQ-QARKREL-QQAQ-SEAQGKLAMLNSQLKRLDEA----TSARTSQK 726 (726) Q Consensus 673 ~~~~~e~~~qe~~~~-~~~e~e~-~~~q-~~~q~~~~~l~~~~~~~~~~----~~a~~~~q 726 (726) ...+.+..+...... ...+... ...+ ..............+..... ...+...+ T Consensus 78 ~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (466) T protein:vir:80 78 KELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVK 138 (466) T ss_pred HHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHH Confidence 111100000000000 0000000 0000 00000000011111111100 00011111 No 142 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=81.12 E-value=0.089 Score=26.35 Aligned_cols=426 Identities=12% Similarity=0.071 Sum_probs=163.2 Q ss_pred CCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCC----------CcCCCH Q lcl|NC_013692. 11 LPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQVTDEKITQINRWLDYMHVRGEGKPKTEKGK----------SAVQPP 80 (726) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~gr----------s~~v~~ 80 (726) |+= ... -|.......+.+-.+.... -.+++..-+..+.|...|. ..+..+ T Consensus 1 m~V------~~~------hp~y~a~~~~W~~~rd~~~--------G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n 60 (452) T protein:vir:94 1 MPI------ETK------HPEYLAYENDWIDCRVASL--------GQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYS 60 (452) T ss_pred CCC------CCc------CHHHHHHHHHHHHHHHHhc--------ChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCc Confidence 221 111 1111111111111000000 0122211111123334442 256678 Q ss_pred HHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHhhcCCeEEEEeeeee Q lcl|NC_013692. 81 TIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLVLNQQFNTKLNKQRFIDEYVRAGVDEGTIIVKVGWNYQ 160 (726) Q Consensus 81 ~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~~i~k~~w~~~ 160 (726) -+++++++++-.+ |.-++.+++ | . .. .. +|. =...++...++..++..+|.+|.+-+-|.|.. T Consensus 61 ~~~~t~~~~~G~v----f~k~p~~~~-p---~--~l----~~-~~~-D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~- 123 (452) T protein:vir:94 61 ITSKTLSALSGMV----LDQPPVITH-P---D--AM----SK-YFE-DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL- 123 (452) T ss_pred hHHHHHHHHhchh----hcCCceecc-c---H--HH----HH-HHh-cccCCCHHHHHHHHHHHHHhcCeEEEEEeecc- Confidence 8888888877554 444544442 1 1 11 11 121 02334446678899999999999888875520 Q ss_pred eeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhhcccceeeccccceeecccceeeccc Q lcl|NC_013692. 161 SRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVENHP 240 (726) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~p 240 (726) .| .+| T Consensus 124 -------------------------------------------------------~g--------------------~rP 128 (452) T protein:vir:94 124 -------------------------------------------------------TG--------------------GDP 128 (452) T ss_pred -------------------------------------------------------CC--------------------Cce Confidence 00 136 Q ss_pred eeeeechhheeeCCCCCCchhhCCeEEEEEeccHHHHHhcCCCcchhhcCcccchhhcccchhhhhccccccccCCcCCc Q lcl|NC_013692. 241 TVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEGVRNFDFQDKSRK 320 (726) Q Consensus 241 ~i~~v~p~~~~~dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (726) ++-.++|.+|+ ++....+ ....++.-+......| ..+.++..... T Consensus 129 y~~~~~~~~Ii-~W~~~~~-g~l~~v~lre~~~~~d---------------------------------~~d~f~~~~~~ 173 (452) T protein:vir:94 129 YISVYTTENIL-NWEEDED-GRLLMVVLREFYTVRD---------------------------------TADRYVQNIRV 173 (452) T ss_pred EEEEechhhhc-Ccccccc-CCeeEEEEEEEEEEec---------------------------------CCCcccceeEE Confidence 67777888866 4432211 1111111111000000 00001111111 Q ss_pred eEEEEE-------EEEEeecCCCceEEEEEEEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHH Q lcl|NC_013692. 321 RLVVHE-------YWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRI 393 (726) Q Consensus 321 ~v~v~E-------~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~ 393 (726) .+++++ +|... ..+++..... -......+.+| .+.+||+++...... ...+.+..-.+..++.. T Consensus 174 ~yRvL~l~~g~~~v~~~~-~~~~~~~~~~-----~~~~~~~~~~~--l~~IP~v~~~~~~~~-~~~~~pPLl~LA~ln~~ 244 (452) T protein:vir:94 174 RYRCLELVDGLLQITVHE-TQDGKVWELA-----KTSTIQNVGVT--MDYIPFFCITPSGLS-MTPAKPPMIDIVDINYS 244 (452) T ss_pred EEEEEEEeCCeEEEEEEE-ccCCceeeec-----cceeecCCCcc--cceeEEEEEcCCCCC-CCCCccchHHHHHHHHH Confidence 222222 11111 1111111100 01122233333 366788766544332 33567777788888888 Q ss_pred HHHHHHHHHHHHHhcCCCceEeecccccchhhhhhcCCceEeec-CccchhhhcccccCccch-hHHHHHHHHHHHHHHH Q lcl|NC_013692. 394 IGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDRGENYEFN-PGADPRAAVHMHTFPEIP-QSAQYMINLQQAEAES 471 (726) Q Consensus 394 ~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~~~~~~g~vi~~~-~~~~~~~~i~~~~~~~~~-~~~~~ll~~~~~~~e~ 471 (726) +....+-..++++.++.|...+ .|. +..+....-++.+|.+. +|+. ..+..+...+ .....-+.-+...+.. T Consensus 245 hy~~~sd~~~~l~~~~~P~l~~-~g~-~~~~~i~iG~~~~~~lpe~~~~----~~yie~~g~~i~~~~~~l~~le~~m~~ 318 (452) T protein:vir:94 245 HYRTSADLEHGRHFTGLPTPWI-TGA-ESQSTMHIGSTKAWVIPEVAAK----VGFLEFTGQGLQSLEKALSEKQAQLAS 318 (452) T ss_pred HhcchhHHHHHHHHcccceeEe-ecC-cCCCceEecccccccCCCCCCc----ceEEccCchhHHHHHHHHHHHHHHHHH Confidence 8877777889999999996665 332 22334445566666665 2432 2233322111 1112222222333322 Q ss_pred HhchHHHhhccCcccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCeEEEEecccceecchh Q lcl|NC_013692. 472 MTGVKAFNAGISGAALGDTATAVR-GALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRD 550 (726) Q Consensus 472 ~tGv~~~~~G~~~~~~~~ta~~i~-~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~~d~e~~iRi~~~~~v~v~~~ 550 (726) + |. +...+ ...+++++... .........+..++.++..++ .+++.++..|.....-+. +.++++ T Consensus 319 ~-Ga-~ll~~---~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al----~~~l~~~a~w~g~~~~~~------v~~n~d 383 (452) T protein:vir:94 319 L-SA-RLIDN---STRGSEATETVKLRYMSETASLKSVTRAVEALL----NKAYSCIMDMESMGGTLN------IKLNSA 383 (452) T ss_pred H-HH-Hhhcc---CCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHH----HHHHHHHHHHcCCCCceE------EEeccc Confidence 2 22 12222 11123333222 222223455566666666554 567777777766543222 222222 Q ss_pred hcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhh---hhhHHHHHhhhhh--hhhh- Q lcl|NC_013692. 551 DLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDF---AKRIREFQPQPDP--IAQQ- 624 (726) Q Consensus 551 ~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~---~~~l~~~~~~~~~--~~qq- 624 (726) -....++ ....+.+..+.+ +..+..... +..+. ..++.+. ...+....+.+.+ .... T Consensus 384 F~~~~~~-----------~~~~~al~~~~~--~G~is~~t~---~~~L~-~~gvl~~~~e~~~i~~E~~~~~~~~~~~~~ 446 (452) T protein:vir:94 384 FLDSKLT-----------AAELKAWVEAYL--SGGISKEIY---IHALK-VGKVLPPPGESMGVIPDPPAPEPSPSNTPP 446 (452) T ss_pred cccccCC-----------HHHHHHHHHHHh--cCCCcHHHH---HHHHH-hCCCCCCccCHHHHHHHhhccCcccCCCCC Confidence 1111111 111111111111 111221111 11111 1111100 0001000000000 0000 Q ss_pred HHHHHH Q lcl|NC_013692. 625 KAQLEL 630 (726) Q Consensus 625 ~~q~e~ 630 (726) ..--+. T Consensus 447 ~~~~~~ 452 (452) T protein:vir:94 447 NPSSKA 452 (452) T ss_pred CCccCC Confidence 000000 No 143 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=78.92 E-value=0.11 Score=25.85 Aligned_cols=147 Identities=12% Similarity=0.123 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhh-hhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 570 AKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMP-DFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYM 648 (726) Q Consensus 570 ~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~-e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~ 648 (726) +...++-..+......+ .....++.... +..+...+.......... .+.+...++.+++......+... T Consensus 1 Mki~elk~el~~~~~el--------~~~~~elr~~~~~~~~~~~el~~~~~e~~~--~~~ei~el~~~l~~~~~~~~~~~ 70 (437) T protein:vir:10 1 MKIEKLKKDLATKTAEL--------NTKKAEIRSFTESEDKTIDEVKAGMTEIKE--KEDEIKEIRSNIEVLEQASALKV 70 (437) T ss_pred CCHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111110000 00000000000 000000000000000000 00011111111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH-HHHHHHHHHH-----HHHHHHHHH Q lcl|NC_013692. 649 SGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQAR-KRELQQAQSEAQ-GKLAMLNSQL-----KRLDEATSA 721 (726) Q Consensus 649 ~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~-e~e~~~~q~~~q-~~~~~l~~~~-----~~~~~~~~a 721 (726) .+.+........++...................++....... ..+......... .......... ......... T Consensus 71 e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (437) T protein:vir:10 71 EEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKT 150 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHh Confidence 000000000000000000000000000000000000000000 000000000000 0000000000 000000000 Q ss_pred HHhcC Q lcl|NC_013692. 722 RTSQK 726 (726) Q Consensus 722 ~~~~q 726 (726) ..... T Consensus 151 ~e~~~ 155 (437) T protein:vir:10 151 GEVRD 155 (437) T ss_pred hhhhh Confidence 00000 No 144 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=57.94 E-value=0.42 Score=22.63 Aligned_cols=99 Identities=8% Similarity=0.097 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQ------ 698 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q------ 698 (726) +...+.+.+++++...+.+........... ....+..+...+..+.+....+..+.+.... ..+...+... T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~--~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~-~~e~~~e~~~~~~~~~ 77 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNA--LESDDLEAARSIKAEVEQAKANLVEAENDLK-LYESSVEVGGAENIGG 77 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhhhccccccc Confidence 222223333333222211111111000000 0000001111111111111111111000000 0000000000 Q ss_pred ---HHHHH-HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 699 ---SEAQG-KLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 699 ---~~~q~-~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..... .......-.+............+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (394) T protein:vir:97 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFE 109 (394) T ss_pred cccchhhHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 00000 00000111111000000000000 No 145 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=48.40 E-value=0.67 Score=21.53 Aligned_cols=127 Identities=15% Similarity=0.139 Sum_probs=11.3 Q ss_pred HHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_013692. 593 QIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQD--SKVGTEQAKARALAS 670 (726) Q Consensus 593 ~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~--~~~~~eqaq~~q~~~ 670 (726) ..+++++...++......+.++............++.. +++..+...+.......... .......+....++. T Consensus 1 ~~~~~~~~~~el~~~~~~l~el~~~~~el~~~~~el~~-----~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~ 75 (425) T protein:vir:95 1 MALRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQ-----AIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEG 75 (425) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222111111111111111111111111111100 00000000000000000000 000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH---HHHHHHHHHHHHHH-----HHHHHHHhcC Q lcl|NC_013692. 671 QADMTDLNFLEQESGVQQARKRELQQAQ-SEAQ---GKLAMLNSQLKRLD-----EATSARTSQK 726 (726) Q Consensus 671 q~~~~~~e~~~qe~~~~~~~e~e~~~~q-~~~q---~~~~~l~~~~~~~~-----~~~~a~~~~q 726 (726) +....+.+...- ...+.......... .... .........++... +......... T Consensus 76 ~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (425) T protein:vir:95 76 EIAQLEDELEQI--NSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFR 138 (425) T ss_pred HHHHHHHHHHHh--hhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHH Confidence 000000000000 00000000000000 0000 00000000000000 0000000000 No 146 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=44.29 E-value=0.81 Score=21.08 Aligned_cols=141 Identities=9% Similarity=0.072 Sum_probs=13.1 Q ss_pred cchHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHh-hhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 564 TAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMEL-KKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERA 642 (726) Q Consensus 564 ~~~~~~~~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~-~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~a 642 (726) .+..+.....++..+... + ..+..+..++ ....++.+.+.+... ..+....+.+++.... T Consensus 1 ~~~~~~~l~~~~~~~~~~----l-----~el~e~~~~l~k~~~el~~~l~ea~~----------~ee~~~~ee~i~~l~~ 61 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAA----L-----AELLEQEKALQKRSEELEAAIDEANT----------DEEIAVVEDEINKLEG 61 (466) T ss_pred CchHHHHHHHHHHHHHHH----H-----HHHHHHHHHHHHHHHHHHHHHHhhhh----------HHHHHHHHHHHHHHHH Confidence 111111100000000000 0 0011111000 001111111111110 0000001111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH--H----HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 643 RAAHYMSGAGLQDSKVGTEQAKARALASQADMT----DLNFLEQESGVQ--Q----ARKRELQQAQSEAQGKLAMLNSQL 712 (726) Q Consensus 643 q~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~----~~e~~~qe~~~~--~----~~e~e~~~~q~~~q~~~~~l~~~~ 712 (726) ......+............+.+...+....... ......+..... . .....+...+........+.+.-+ T Consensus 62 ~~~el~e~~~~l~~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141 (466) T protein:vir:80 62 EKTELEEKKSKLEGEIKELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFL 141 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHH Confidence 000000000000000000000000000000000 000000000000 0 000000000111111111111111 Q ss_pred HHHHHHHHHHHhcC Q lcl|NC_013692. 713 KRLDEATSARTSQK 726 (726) Q Consensus 713 ~~~~~~~~a~~~~q 726 (726) ...... ..... T Consensus 142 ~~~~~~---~~~~~ 152 (466) T protein:vir:80 142 AQVRTL---AQQKR 152 (466) T ss_pred HHHHHH---hhhhh Confidence 111100 01111 No 147 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=43.21 E-value=0.85 Score=20.96 Aligned_cols=128 Identities=16% Similarity=0.198 Sum_probs=9.4 Q ss_pred hhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_013692. 588 PMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQD--SKVGTEQAKA 665 (726) Q Consensus 588 ~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~--~~~~~eqaq~ 665 (726) ..+..+.+. +..+..+.++.++.+.. .+...+++.... .+.+........... ...++ ........+. T Consensus 1 ~~~~~~~~~---~e~~~~e~a~~~~~~~~-----~~k~~e~~~~~k-e~~~~~l~~~~e~~~-k~~~E~~~~le~~~ee~ 70 (458) T protein:vir:10 1 MTIDINKLK---EELGLGDLAKSLEGLTA-----AQKAQEAERMRK-EQEEKELARMNDLVS-KAVGEDRKRLEEALELV 70 (458) T ss_pred Cccchhhhh---hhhchhhHHHHHHHHHH-----HHHHHHHHHHHH-HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 000000011 11111111111110000 000000000000 000000000000000 00000 0000000000 Q ss_pred HHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHH----HHH-hcC Q lcl|NC_013692. 666 RALASQADMTDL----NFLEQESGVQQARKRELQQAQSEAQG-------KLAMLNSQLKRLDEATS----ART-SQK 726 (726) Q Consensus 666 ~q~~~q~~~~~~----e~~~qe~~~~~~~e~e~~~~q~~~q~-------~~~~l~~~~~~~~~~~~----a~~-~~q 726 (726) +.+..+.++... ...+.........+ +........+. ....+....+....... ... ..+ T Consensus 71 k~l~ee~~~~~~~~a~~~e~~~~~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 146 (458) T protein:vir:10 71 KSLDEKSKKSNELFAQTVEKQQETIVGLQD-EIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEK 146 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhh Confidence 011111000000 00000000000000 00000000000 00000000000000000 000 000 No 148 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=40.02 E-value=0.99 Score=20.61 Aligned_cols=96 Identities=11% Similarity=0.130 Sum_probs=9.4 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_013692. 624 QKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADM--TDLNFLEQES-GVQQARKRELQQAQSE 700 (726) Q Consensus 624 q~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~--~~~e~~~qe~-~~~~~~e~e~~~~q~~ 700 (726) .......+.++++.+...+.... .........++. +..+.+.+. ++.+....+. .++...+.+....... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~-----~~~~~~~~~ee~--~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~ 73 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQG-----KVEKNEVRSEEL--AAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKD 73 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH-----HHhccCccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 00000011111111111111111 000000001110 111111111 1111100000 0000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 701 AQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 701 ~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ...+...-....+...+.......++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~e~~ 99 (434) T protein:vir:62 74 DDPEKKEDPTAKENPNEKTELSEEQR 99 (434) T ss_pred chhhhhcchhhhcchhhhHHHHHHHH Confidence 00000000000000000000000111 No 149 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=33.90 E-value=1.3 Score=19.91 Aligned_cols=120 Identities=9% Similarity=0.077 Sum_probs=6.2 Q ss_pred hhhhhhhhhHHHHHhhh-----------hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 602 KKMPDFAKRIREFQPQP-----------DPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALAS 670 (726) Q Consensus 602 ~~~~e~~~~l~~~~~~~-----------~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~ 670 (726) |.+.++.+.+.+...+. ........++ +....+++....+.....++........ +....+.+. T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el--~~~~~e~~~~~~ei~el~~~l~~~~~~~---~~~~e~~~~ 75 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEV--KAGMTEIKEKEDEIKEIRSNIEVLEQAS---ALKVEEKRD 75 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Confidence 11111111111100000 0000000000 0000000000000000000000000000 000000000 Q ss_pred HHHHH--HHHHHHH--HHHHHHHHHHHH----HHHHHHHHHHHHHHHHH----HHHHHHHHHHH--HhcC Q lcl|NC_013692. 671 QADMT--DLNFLEQ--ESGVQQARKREL----QQAQSEAQGKLAMLNSQ----LKRLDEATSAR--TSQK 726 (726) Q Consensus 671 q~~~~--~~e~~~q--e~~~~~~~e~e~----~~~q~~~q~~~~~l~~~----~~~~~~~~~a~--~~~q 726 (726) ..... +.+.... +.........+. ................. ........... ..+. T Consensus 76 ~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (437) T protein:vir:10 76 DSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFA 145 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhH Confidence 00000 0000000 000000000000 00000000000000000 00000000000 0000 No 150 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=27.68 E-value=1.8 Score=19.16 Aligned_cols=98 Identities=13% Similarity=0.051 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH--HHHH- Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQES---GVQQARKREL--QQAQ- 698 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~---~~~~~~e~e~--~~~q- 698 (726) +.. .+++++++...+.+........... ....+..+...+..+.+....+..+.+. .++....... .... T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (415) T protein:vir:98 1 MKT--KEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE 76 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 111 1111111111111111110000000 0000001111111111110000000000 0000000000 0000 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 699 ---SEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 699 ---~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........................++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:98 77 VNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred cchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 0000000000000000111101111111 No 151 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=27.68 E-value=1.8 Score=19.16 Aligned_cols=98 Identities=13% Similarity=0.051 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH--HHHH- Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQES---GVQQARKREL--QQAQ- 698 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~---~~~~~~e~e~--~~~q- 698 (726) +.. .+++++++...+.+........... ....+..+...+..+.+....+..+.+. .++....... .... T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (415) T protein:vir:79 1 MKT--KEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE 76 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 111 1111111111111111110000000 0000001111111111110000000000 0000000000 0000 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 699 ---SEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 699 ---~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........................++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:79 77 VNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred cchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 0000000000000000111101111111 No 152 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=27.68 E-value=1.8 Score=19.16 Aligned_cols=98 Identities=13% Similarity=0.051 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH--HHHH- Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQES---GVQQARKREL--QQAQ- 698 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~---~~~~~~e~e~--~~~q- 698 (726) +.. .+++++++...+.+........... ....+..+...+..+.+....+..+.+. .++....... .... T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~--l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (415) T protein:vir:81 1 MKT--KEELQSEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVE 76 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHH--hchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 111 1111111111111111110000000 0000001111111111110000000000 0000000000 0000 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 699 ---SEAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 699 ---~~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) ..........................++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:81 77 VNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred cchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 0000000000000000111101111111 No 153 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=27.37 E-value=1.8 Score=19.12 Aligned_cols=98 Identities=12% Similarity=0.080 Sum_probs=9.3 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Q lcl|NC_013692. 621 IAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKREL--QQAQ 698 (726) Q Consensus 621 ~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~--~~~q 698 (726) +.. .+++++++...+.+............ ......+...+..+.+..+.+..+.+.......+... .... T Consensus 1 mk~------~~el~~~l~el~~~~~~~~~~~~~~~--~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~ 72 (415) T protein:vir:94 1 MKT------KEELQSEISDIKRQIDLKVKYATRAL--NNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQ 72 (415) T ss_pred CCh------HHHHHHHHHHHHHHHHHHHHHHHHHh--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 111 01111111111111000000000000 0000001111111111111111100000000000000 0000 Q ss_pred H-------HHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 699 S-------EAQGKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 699 ~-------~~q~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) . .........................++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 107 (415) T protein:vir:94 73 QSVEVNEASTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred ccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHH Confidence 0 000000000000000010000001111 No 154 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=25.56 E-value=2 Score=18.88 Aligned_cols=96 Identities=13% Similarity=0.074 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGK 704 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~ 704 (726) ++-+ +.+.+++...+.+.................+ ....++.+.+..+.+..+.+... .+..+........... T Consensus 1 mk~~--~em~~~l~el~~~~~~~~~e~~~~~~~~~~e--~~~~~~~ev~~l~~~i~~~~~~~--~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:46 1 MKTK--EELQSEISDIKRQIDLKVKYATRALNNDELE--KAEKLEQEITDLRSQIQEKQEEL--DKLKEKDRTSENNQQS 74 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhchhhHH--HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhhhhcccc Confidence 1000 0111111111111000000000000000000 00011111100000000000000 0000000000000000 Q ss_pred HHH-----------HHHHHHHHHHHHHHH---HhcC Q lcl|NC_013692. 705 LAM-----------LNSQLKRLDEATSAR---TSQK 726 (726) Q Consensus 705 ~~~-----------l~~~~~~~~~~~~a~---~~~q 726 (726) ... ............... ..+. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (415) T protein:vir:46 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFT 110 (415) T ss_pred cccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHH Confidence 000 000000000000000 0000 No 155 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=25.56 E-value=2 Score=18.88 Aligned_cols=96 Identities=13% Similarity=0.074 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGK 704 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~ 704 (726) ++-+ +.+.+++...+.+.................+ ....++.+.+..+.+..+.+... .+..+........... T Consensus 1 mk~~--~em~~~l~el~~~~~~~~~e~~~~~~~~~~e--~~~~~~~ev~~l~~~i~~~~~~~--~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:47 1 MKTK--EELQSEISDIKRQIDLKVKYATRALNNDELE--KAEKLEQEITDLRSQIQEKQEEL--DKLKEKDRTSENNQQS 74 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhchhhHH--HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhhhhcccc Confidence 1000 0111111111111000000000000000000 00011111100000000000000 0000000000000000 Q ss_pred HHH-----------HHHHHHHHHHHHHHH---HhcC Q lcl|NC_013692. 705 LAM-----------LNSQLKRLDEATSAR---TSQK 726 (726) Q Consensus 705 ~~~-----------l~~~~~~~~~~~~a~---~~~q 726 (726) ... ............... ..+. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (415) T protein:vir:47 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFT 110 (415) T ss_pred cccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHH Confidence 000 000000000000000 0000 No 156 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=21.55 E-value=2.6 Score=18.32 Aligned_cols=107 Identities=16% Similarity=0.183 Sum_probs=10.2 Q ss_pred hhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH----HHHHHHHHHHHHHHHHH Q lcl|NC_013692. 607 FAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQ--DSKVGTEQA----KARALASQADMTDLNFL 680 (726) Q Consensus 607 ~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~~~--~~~~~~eqa----q~~q~~~q~~~~~~e~~ 680 (726) +.+.++++...-....+++.++ ..+.+.....++.+ ......+.+ ....++.+++..+. .. T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l------------~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~-~~ 67 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATL------------KAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVED-LD 67 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 2222222211111111111110 00000000000000 000000000 00111111111000 00 Q ss_pred HH--HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHH---HHHHHHHHHHHHhcC Q lcl|NC_013692. 681 EQ--ESGVQQARKRELQQAQS-----EAQGKLAMLNSQ---LKRLDEATSARTSQK 726 (726) Q Consensus 681 ~q--e~~~~~~~e~e~~~~q~-----~~q~~~~~l~~~---~~~~~~~~~a~~~~q 726 (726) ++ +...+..+......... ..+.+....... .............+. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (477) T protein:vir:84 68 EQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMA 123 (477) T ss_pred HHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhh Confidence 00 00000000000000000 000000000000 000000000000011 No 157 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=21.45 E-value=2.6 Score=18.31 Aligned_cols=635 Identities=12% Similarity=0.030 Sum_probs=160.4 Q ss_pred CCCccchh---------hcCCCCCCccchhcCCCCCCchHHHHHHHHHHHHHHH---------------------HHHHH Q lcl|NC_013692. 1 MADVDEDY---------LTLPNEDGDPSKRLQPEWSNAPSLAQLKQDYQEAKQV---------------------TDEKI 50 (726) Q Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~---------------------~~~~~ 50 (726) |..-.+.. .|+++=..++..+.|.+|.+...-. +...+..+..| ++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v 79 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPS-HTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLV 79 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcc-hhHHHHHHHHHHHhhhccccCcccccCCCccccCHHH Confidence 43333333 2455555666666666653321111 11111111111 11222 Q ss_pred HHHHHHHHHhccCCCCC-------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCceEEEecCCcchHHHHHHHHHH Q lcl|NC_013692. 51 TQINRWLDYMHVRGEGK-------PKTEKGKSAVQPPTIRKQAEWRYSSLSEPFLSSPNIFEVNPVTWEDAESARQNGLV 123 (726) Q Consensus 51 ~~~~~~~~~y~~~~~~~-------~~~~~grs~~v~~~v~~~v~~~~~~L~~~f~~~~~~~~~~p~~~~D~~~A~q~t~~ 123 (726) ....+|+.-..-+.+.. .|...| +-...++.--++=+++..- .|. T Consensus 80 ~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~-----D~~~A~q~t~~~n~~~~~~--------------~~~--------- 131 (763) T protein:vir:95 80 RRQAEWRYSALTEPFLGSNKLFKVTPVTWE-----DVQGARQNELVLNYQFRTK--------------LNR--------- 131 (763) T ss_pred HHHHHHHHHHHHHhhcCCCcEEEEecCCcc-----hHHHHHHHHHHHHHHHhhc--------------Cch--------- Confidence 22333432221111110 011111 1111222211221221111 110 Q ss_pred HHHHHhhcccchhHHHHHHHHHh-hcCCeEEEEeeeeeeeeEEecccccccCCcchH----HHHHHhhhhhh-hhhccCC Q lcl|NC_013692. 124 LNQQFNTKLNKQRFIDEYVRAGV-DEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSE----ELAQIYQTAAQ-IREESPS 197 (726) Q Consensus 124 ~n~~~~~~~~~~~~~~~~~~~~l-~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~-l~~~~~~ 197 (726) ..-.++.+++++-..+ +.+++.-. -++..++. ......+........ +...+...... ..+.-+. T Consensus 132 -------~~~~~~~~~~~l~~~~gv~k~~W~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (763) T protein:vir:95 132 -------VSFIDNYVRSVVDDGTGIVRVGWNR-EIRKEKQE-VPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDE 202 (763) T ss_pred -------hhHHHHHHHHHhhcCcceEEEeeee-eeeeeeee-ehhhhhccccchhHHHHHHHHHHhhhhhhccccccccc Confidence 0111223333332222 12222211 12222211 222222222222111 11111111111 1111111 Q ss_pred chhhhHHHHHHhhhhhhhcccceeeccccceeecccceeec--------cceeee-echhheeeCCCCC--CchhhCCeE Q lcl|NC_013692. 198 EYPEIPEDVRLGLEETEANGIQVRAVPVGSEEEEREETVEN--------HPTVQV-CDYNNIVIDPSCG--SDFSKAKFL 266 (726) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~--------~p~i~~-v~p~~~~~dp~a~--~d~~da~~~ 266 (726) .+.+.+......-...-..+.+........+ .....+++. +|.... ++=-+|++..... .||-+-.|. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~ 281 (763) T protein:vir:95 203 AIKESVRFFDETGQATYAVQTGTTTTEVEVP-LANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDR 281 (763) T ss_pred hhhhhhhhccccCcceeeecccceeEEEEEE-ecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCC Confidence 2222221111111111112222111111111 122222222 232221 1111354444222 245333322 Q ss_pred EEEEeccHHHHHhcCCCcchhhc--Ccccchhhcccchh-hhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEEE Q lcl|NC_013692. 267 IETFESSYAELKADGRYQNLDKI--QVEGQNLLSEPDYT-GPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIV 343 (726) Q Consensus 267 ~~~~~~t~~el~~~g~~~~~d~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~~ 343 (726) +. ++ +++... .+.+.+.- .+.........+.. ....-...+...+...+. +.| |+++-.-|+.+.+. T Consensus 282 y~--~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg--~~~-~~~v~~~g~~iL~~-- 351 (763) T protein:vir:95 282 YH--NL--NKIDWQ-SSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNG--VLE-PIVATWIGSTLIRL-- 351 (763) T ss_pred cc--cc--chhcch-hccccccccccccchhhccCCCcccceEEEEEeeeeeccCCcc--eeE-EEEEEEEcCeeeec-- Confidence 11 11 111000 00000000 00000000000000 000000000001111111 122 22322223322222 Q ss_pred EEEECCEEEEeccCCCCCCccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHH---HHHHHhcCCCceEeecccc Q lcl|NC_013692. 344 ATWVGAVMIRMEENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGM---IDTMARSANGQVGVMKGAL 420 (726) Q Consensus 344 ~~~~g~~~l~~~~~P~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~---~d~l~~~~~~~~~~~~gav 420 (726) +......+..||-+ +||.+...- ..+..++..+.+.-...-...|..+..+ .........+.+ -..+.+ T Consensus 352 ----~~~p~~~~~~PFv~--~~~~p~~~~-~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav-~~~d~~ 423 (763) T protein:vir:95 352 ----EKNPYPDGKLPFVL--IPYMPVKRD-MYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGML-DALNSR 423 (763) T ss_pred ----ccccccCCCcCEEE--ecceeecCc-ccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccc-cchhhh Confidence 22222334445532 444443332 2233333333333333333334333222 111122233332 223322 Q ss_pred cc--hhhhhhcCCceEeecCccchhhhcccccCccchhHHHHHHHHHHHH----HHHHhchHHHhhccCcccchhhHHHH Q lcl|NC_013692. 421 DV--TNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAE----AESMTGVKAFNAGISGAALGDTATAV 494 (726) Q Consensus 421 ~~--~d~~~~~~g~vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~----~e~~tGv~~~~~G~~~~~~~~ta~~i 494 (726) .. .......+|+.+.-... ...+.+.|..+..+..+++...+. .....|+.....|..+++. ...+ T Consensus 424 ~~~pg~v~~v~~g~~~~~~~~-----~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v---~~l~ 495 (763) T protein:vir:95 424 RYREGEDYEYNPTQNPAQMII-----EHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGI---RGVL 495 (763) T ss_pred cccCCceEEeeCCCChhhhcc-----cccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHH---HHHH Confidence 21 11233445544332211 111223455555555555554443 3344466555444333222 1122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCcCeEEEEecccceecchhhcccccceeeecccchHHHH Q lcl|NC_013692. 495 RGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF----LDDVEVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNA 570 (726) Q Consensus 495 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~il~li~q~----~d~e~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~ 570 (726) .....+...-+..+.+.+....+.+...+..+.-.- ...+..+.++...+.. .-+.. .++.. ......... T Consensus 496 qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~--~~DV~--V~~~~-as~~~q~~~ 570 (763) T protein:vir:95 496 DAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKG--NFDLE--VDIST-AEVDNQKSQ 570 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcC--CcceE--Eeccc-chHHHHHHH Confidence 222222333333444444444444444444432110 0111223332221110 00000 00000 011111122 Q ss_pred HHHHHHHHH-HHhhhccchhHHH---------HHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 571 KVNDLTFML-QTMGPNMDPMMAQ---------QIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAE 640 (726) Q Consensus 571 ~~~~l~~l~-q~~~~~~~~~~~~---------~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~ 640 (726) +..+++..+ ..+.+.....+.. .+...+.....-++ ..+..+.+.+.++.+++.+.++++++.. T Consensus 571 ~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d------~~~q~qaqle~~~~q~e~~~~~akaq~~ 644 (763) T protein:vir:95 571 DLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPD------PVQEQLKQLAVEKAQLENEELRSKIRLN 644 (763) T ss_pred HHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCcc------chhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222211 1111111111100 11111111111011 1111112223333344444444444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 641 RARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRELQQAQSEAQGKLAMLNSQLKRLDEATS 720 (726) Q Consensus 641 ~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e~~~~q~~~q~~~~~l~~~~~~~~~~~~ 720 (726) ++++++..+++....+. +.+++.+.|..+ +....+.....+++ ++...++.+.+. +.+...++ + . T Consensus 645 qaqa~~~~aq~e~~~~d-----~~~~e~~~Q~~~---e~~~~~~~~eaq~~--l~~~~a~~~~~~-ea~~~~~~--~--~ 709 (763) T protein:vir:95 645 DAQAQKAMAERDNKNLD-----YLEQESGTKHAR---DLEKMKAQSQGNQQ--LEITKALTKPRK-EGELPPNL--S--A 709 (763) T ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHH---HHHHHHHHHHHHHH--HHHHHHHHHHHH-HhccChhH--H--H Confidence 44443333322221111 111111111111 11111111111111 111111111111 11111111 1 1 Q ss_pred HHHhcC Q lcl|NC_013692. 721 ARTSQK 726 (726) Q Consensus 721 a~~~~q 726 (726) +..+-. T Consensus 710 ~~~~~~ 715 (763) T protein:vir:95 710 AIGYNA 715 (763) T ss_pred hhhhcc Confidence 111111 No 158 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=21.38 E-value=2.6 Score=18.30 Aligned_cols=449 Identities=12% Similarity=0.066 Sum_probs=147.6 Q ss_pred CCCCCCCCCCC-CcC---CCHHHHHHHHHHHHHHHHhhcCCCceE-E---EecCCcchHHHHHHHHHHHHHHHhhcccch Q lcl|NC_013692. 64 GEGKPKTEKGK-SAV---QPPTIRKQAEWRYSSLSEPFLSSPNIF-E---VNPVTWEDAESARQNGLVLNQQFNTKLNKQ 135 (726) Q Consensus 64 ~~~~~~~~~gr-s~~---v~~~v~~~v~~~~~~L~~~f~~~~~~~-~---~~p~~~~D~~~A~q~t~~~n~~~~~~~~~~ 135 (726) |- . .-|+ +.| .+-.....-.|- +++..|+|+..+ . +.|..+. .. ..+.|-+|+-. T Consensus 1 ~~-~---~~~~~~~V~~~hp~y~a~~~~W~---~ird~~~G~~~~~~r~~yl~~~~~--~~--~e~~Y~~rl~r------ 63 (489) T protein:vir:78 1 ML-T---ENGQGSGVKTKHREWLHYAPKWQ---KVRHALAGELVSYLRNVGLNEPDK--AY--GEARQAEYEAG------ 63 (489) T ss_pred Cc-c---CCCccCCCCccCHHHHHHHHHHH---HHHHHhcCcccccccCCCCCCCCC--CC--ChHHHHHHHhc------ Confidence 31 1 1232 322 222333444554 488888887542 1 3332211 00 01225555321 Q ss_pred hHHHHHHHHHhhcCCeEEEEeeeeeeeeEEecccccccCCcchHHHHHHhhhhhhhhhccCCchhhhHHHHHHhhhhhhh Q lcl|NC_013692. 136 RFIDEYVRAGVDEGTIIVKVGWNYQSRTVKEQVVTYEMMPDSSEELAQIYQTAAQIREESPSEYPEIPEDVRLGLEETEA 215 (726) Q Consensus 136 ~~~~~~~~~~l~~~~~i~k~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 215 (726) -.+.+++++.|-.=+|.+.. ..++.+ +|.. +.. +.++=+..-...-.-+...+...-. T Consensus 64 A~~~n~~~~tl~~l~G~vfr-----------k~p~~~-~p~~---l~~-------l~~d~D~~G~~L~~f~~~~~~~~l~ 121 (489) T protein:vir:78 64 GIVYNFTRRTLSGMVGSVMR-----------KEPEIN-IPKE---LEY-------LLKNADGSGVGLIQHAQDTLMEIDS 121 (489) T ss_pred cccCChHHHHHHHHhchhhc-----------CCccee-ccHH---HHH-------HHhccCCCCCCHHHHHHHHHHHHHh Confidence 13456667776544444421 111111 1211 111 1111111111122223333444444 Q ss_pred cccceeecccccee---ecccceeeccceeeeechhheeeCCCCCC-c-hhhCCeEEEEEeccHHHHHhcCCCcchhhcC Q lcl|NC_013692. 216 NGIQVRAVPVGSEE---EEREETVENHPTVQVCDYNNIVIDPSCGS-D-FSKAKFLIETFESSYAELKADGRYQNLDKIQ 290 (726) Q Consensus 216 ~g~~~~~~~~~~~~---~~~~~~~~~~p~i~~v~p~~~~~dp~a~~-d-~~da~~~~~~~~~t~~el~~~g~~~~~d~~~ 290 (726) .|..+..+.....- .-+.+..-.+|++..++|.+|+ ++.... + .....++..+.-...+ T Consensus 122 ~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii-nW~~~~v~G~~~Lt~v~lrE~~~~~--------------- 185 (489) T protein:vir:78 122 VGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV-NWRLTRVGSVNRVTMVVLRETWEYN--------------- 185 (489) T ss_pred cCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc-CceeeeeCCccceeEEEEEEeEEee--------------- Confidence 44444333332110 0011111225777778888865 442211 1 0112222111100000 Q ss_pred cccchhhcccchhhhhccccccccCCcCCceEEEEEEEEEeecCCCceEEEE--EEEEECCE------EE-EeccCCCCC Q lcl|NC_013692. 291 VEGQNLLSEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPI--VATWVGAV------MI-RMEENPFPD 361 (726) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~~~~~~~~g~~~~~--~~~~~g~~------~l-~~~~~P~~~ 361 (726) ...+.+......+++|++. +.+|....+ +..-.|.. ++ ..+.+ +. T Consensus 186 ------------------d~~~~f~~~~~~q~RvL~~------~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~--~l 239 (489) T protein:vir:78 186 ------------------EPGNEFETKYGEQYRVLDI------DSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGES--LR 239 (489) T ss_pred ------------------cCCCCccceeEEEEEEEec------CCCcceEEEEEEeecCCcccceeeEEeccCCCC--cc Confidence 0011122222233444331 111111110 00001111 11 11112 23 Q ss_pred CccceEEeeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCceEeecccccchhh---------hhhcCCc Q lcl|NC_013692. 362 KRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNR---------RRFDRGE 432 (726) Q Consensus 362 ~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~d~l~~~~~~~~~~~~gav~~~d~---------~~~~~g~ 432 (726) +.+||+++..... +...+.+..-.+..+.-..=...+-.-++++.++.|...+. |.-+.++. ..+-++. T Consensus 240 ~~IPfv~~~~~~~-~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~d~~~~~~~~~~~~~~i~~g~~~ 317 (489) T protein:vir:78 240 GVIPFTFIGATNN-DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGENLTPQAFKEANPNGIKFGSRR 317 (489) T ss_pred CeeeEEEEecCCC-CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCccCCcccccccCccceeeCCcc Confidence 5567666554321 22223444555555543332233344577888888877653 33211111 1111222 Q ss_pred eEeecCccchhhhcccccCccchhHHHHHHHHHHHHHHHHhchHHHhhccCcccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 433 NYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDTATAVRGALDAASKRELGILRRL 512 (726) Q Consensus 433 vi~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~e~~tGv~~~~~G~~~~~~~~ta~~i~~~~~~~~~~~~~~~~~~ 512 (726) .+.+..++. ..+..+....-.-..|....++|+. .|.. +.. . + .+.|+++.+.-..+....+..++.++ T Consensus 318 ~~~lp~~~~----~~~ie~~~~~~~r~~l~~le~qm~~--lGa~--l~~-~-~-~~~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (489) T protein:vir:78 318 GHNLGYGGS----AQLIQAGENNLARQNMLDKEQQAIQ--IGAQ--LIT-P-T-QQITAQSARIQRGADTSVMATIARNV 386 (489) T ss_pred cccCCCCCC----cceeccCcchHHHHHHHHHHHHHHH--Hhhh--hcc-C-C-cchhHHHHHHHHHHhhHHHHHHHHHH Confidence 222222211 1222222222111222222222221 1221 111 1 1 14677777776666666677777776 Q ss_pred HHHHHHHHHHHHHHHHHhcCcC--eEEEEecccceecchhhcccccceeeecccchHHHHHHHHHHHHHHHhhhccchhH Q lcl|NC_013692. 513 SAGIIEIGRKIIAMNAEFLDDV--EVVRITNEHFVDIRRDDLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMM 590 (726) Q Consensus 513 ~~~~~~l~~~il~li~q~~d~e--~~iRi~~~~~v~v~~~~~~~~~dv~i~~~~~~~~~~~~~~l~~l~q~~~~~~~~~~ 590 (726) ..++ .+++.++..|.... ..+.+ .++++-....++ .+....+..+.+ +..+.... T Consensus 387 e~al----~~~l~~~a~w~G~~~~~~~~i------~~n~dF~~~~~d-----------~~~~~al~~~~~--~G~is~~t 443 (489) T protein:vir:78 387 SQAY----TDALRWVAVMLGKPEDTEVEF------RLNMDFFLEPMT-----------AQDRAAWMADIN--AGLLPATA 443 (489) T ss_pred HHHH----HHHHHHHHHHcCCCCCCceEE------EeecccCcccCC-----------HHHHHHHHHHHh--cCCCCHHH Confidence 6655 55666777775432 11111 122111111111 111112222111 11122111 Q ss_pred HHHHHHH--HHHhhhhhhhhhhHHHHHh-----hhhhhhhhHHHHHH Q lcl|NC_013692. 591 AQQIMGQ--IMELKKMPDFAKRIREFQP-----QPDPIAQQKAQLEL 630 (726) Q Consensus 591 ~~~~~~~--~~~~~~~~e~~~~l~~~~~-----~~~~~~qq~~q~e~ 630 (726) ....+.. +++ ....++...+..... -....++..++.++ T Consensus 444 ~~~~L~~~gv~d-~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 444 YYAALRKAGVTD-WTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHHhCCCCC-ccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 1110100 000 000111111111000 00001111111010 No 159 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=21.24 E-value=2.6 Score=18.28 Aligned_cols=145 Identities=12% Similarity=0.096 Sum_probs=11.6 Q ss_pred HHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_013692. 575 LTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAG-L 653 (726) Q Consensus 575 l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e~q~~qaq~e~~~aq~q~~~~~~~-~ 653 (726) +..-++.+.+.............+.......+.. .+...+.... .. ....+... .++..+.+......+.+ + T Consensus 1 ~~~~~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~-~~~ke~~~~~-l~-~~~e~~~k----~~~E~~~~le~~~ee~k~l 73 (458) T protein:vir:10 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAQEAE-RMRKEQEEKE-LA-RMNDLVSK----AVGEDRKRLEEALELVKSL 73 (458) T ss_pred CccchhhhhhhhchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHH-HH-HHHHHHHH----HHHHHHHHHHHHHHHHHHH Confidence 1111111111111110000000000000000000 0000000000 00 00000000 00000000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH--HHH-----HHHHHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 654 QDSKVGTEQAKARALASQADMTDLNFLEQES--GVQQARKR--ELQ-----QAQSEA--QGKLAMLNSQLKRLDEATSAR 722 (726) Q Consensus 654 ~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~--~~~~~~e~--e~~-----~~q~~~--q~~~~~l~~~~~~~~~~~~a~ 722 (726) .....+.....+...+...+.......+... .....+.. +.. ...... ..+.................. T Consensus 74 ~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~ 153 (458) T protein:vir:10 74 DEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEH 153 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhh Confidence 0000000000000000000000000000000 00000000 000 000000 000000011000000000000 Q ss_pred HhcC Q lcl|NC_013692. 723 TSQK 726 (726) Q Consensus 723 ~~~q 726 (726) .... T Consensus 154 ~~~~ 157 (458) T protein:vir:10 154 GQRH 157 (458) T ss_pred hhhh Confidence 0001 No 160 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=21.16 E-value=2.6 Score=18.26 Aligned_cols=126 Identities=12% Similarity=0.143 Sum_probs=8.1 Q ss_pred HHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhhhhHHHHHhhhhhhhhhHHHHH-----------HHHHHHHHHH Q lcl|NC_013692. 571 KVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKRIREFQPQPDPIAQQKAQLE-----------LMLLQAQIEA 639 (726) Q Consensus 571 ~~~~l~~l~q~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~~~~qq~~q~e-----------~q~~qaq~e~ 639 (726) +...+..+.. .+.++.+.+.++.........+..+++ .....+++.. T Consensus 1 m~~k~~~l~~----------------------~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~ 58 (397) T protein:vir:96 1 MALKQLILNK----------------------QIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADD 58 (397) T ss_pred CcHHHHHHHH----------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHH Confidence 0000000000 000111111111110000000000000 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 640 ERARAAHYMSGAGLQDSKVGTEQAKARALASQADMTDLNFLEQESGVQQARKRE-LQQAQSEAQGKLAMLNSQLKRLDEA 718 (726) Q Consensus 640 ~~aq~q~~~~~~~~~~~~~~~eqaq~~q~~~q~~~~~~e~~~qe~~~~~~~e~e-~~~~q~~~q~~~~~l~~~~~~~~~~ 718 (726) .+.+... ......+.+.+...++.+.......... .......+... ..............+....+..... T Consensus 59 l~~~i~~-------l~~~i~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (397) T protein:vir:96 59 LEKQVKD-------LDEKIAELQKEKQDLEDELAKAADPTDQ-KPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAE 130 (397) T ss_pred HHHHHHH-------HHHHHHHHHHHHHHHHHHHHhhhhhhhh-hhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhh Confidence 0000000 0000000000000000000000000000 00000000000 0000000000000000000000000 Q ss_pred HHHH-------------------HhcC Q lcl|NC_013692. 719 TSAR-------------------TSQK 726 (726) Q Consensus 719 ~~a~-------------------~~~q 726 (726) .... +..+ T Consensus 131 ~~~~~~~~~~~~~vp~~~~~~i~~~~~ 157 (397) T protein:vir:96 131 KRDGFTSVEGGALIPQELLQPQLEPKD 157 (397) T ss_pred hhhcccccccccchhHHHHHHHHHhhh Confidence 0000 0000 No 161 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=20.55 E-value=2.7 Score=18.17 Aligned_cols=117 Identities=12% Similarity=0.136 Sum_probs=8.7 Q ss_pred hhhhhhhhhHHHHHhhhhhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--HHHHHH Q lcl|NC_013692. 602 KKMPDFAKRIREFQPQPDPIAQQKA---QLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQ-AKARALA--SQADMT 675 (726) Q Consensus 602 ~~~~e~~~~l~~~~~~~~~~~qq~~---q~e~q~~qaq~e~~~aq~q~~~~~~~~~~~~~~~eq-aq~~q~~--~q~~~~ 675 (726) |.+.++.+...+...+......... .+.. ....+.+..+.+.... ..++...+ +++.... ...... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~-ee~~~~~~l~~ei~~l-------~~~I~~~e~~~~~~~~~~~~~~~~ 72 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSV-EQQAEFDQLSSKFSEL-------TAQIERAEAAERMAAAAAVPVDPN 72 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCH-HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccccch Confidence 2222222211111110000000000 0000 0000000000000000 00000000 0000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH--------HHHHHHhcC Q lcl|NC_013692. 676 DLNFLEQESGVQQARKRELQQAQSEAQG---KLAMLNSQLKRLDE--------ATSARTSQK 726 (726) Q Consensus 676 ~~e~~~qe~~~~~~~e~e~~~~q~~~q~---~~~~l~~~~~~~~~--------~~~a~~~~q 726 (726) ..............+............+ .+...+........ ...+..... T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (435) T protein:vir:14 73 PTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNT 134 (435) T ss_pred hhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhccc Confidence 0000000000000000000000000000 00000000000000 000010111 No 162 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=20.44 E-value=2.8 Score=18.16 Aligned_cols=95 Identities=13% Similarity=0.162 Sum_probs=8.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 625 KAQLELMLLQAQIEAERARAAHYMSG--AGLQDSKVGTEQAKARALASQAD--MTDLNFLEQESGVQQARKRELQQAQSE 700 (726) Q Consensus 625 ~~q~e~q~~qaq~e~~~aq~q~~~~~--~~~~~~~~~~eqaq~~q~~~q~~--~~~~e~~~qe~~~~~~~e~e~~~~q~~ 700 (726) ++.+ ..+++.+.....+.+..... ......+...++.. .++.+.+ ..+.+..+.+ .... +.+....... T Consensus 1 Mk~l--~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~--~~~~~~~~l~~~~~~l~~~--~~~~-e~~~~~~~~~ 73 (387) T protein:vir:93 1 MPTL--YELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIK--QLETEKAGLQQRFNIVERQ--VKDI-EEKEKAKVKD 73 (387) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHH--HHHHHHHHHHHHHHHHHHH--HHHH-HHHHHHhhhh Confidence 1111 11111111111111111000 00000000001110 0111110 0011100000 0000 0000000000 Q ss_pred HH-------HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013692. 701 AQ-------GKLAMLNSQLKRLDEATSARTSQK 726 (726) Q Consensus 701 ~q-------~~~~~l~~~~~~~~~~~~a~~~~q 726 (726) .. .+.....+..+-............ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~ 106 (387) T protein:vir:93 74 TGEAYQSLNDHEKMVKAKAEFYRHAILPNEFEK 106 (387) T ss_pred ccccCCCcchhhHHHHHHHHHHHHHhhhhhhhh Confidence 00 000000000000000000000000 Done!