Query lcl|NC_021540.1_cdsid_YP_008126825.1 [gene=M612_gp41] [protein=portal protein] [protein_id=YP_008126825.1] [location=complement(53746..55863)] Match_columns 705 No_of_seqs 218 out of 356 Neff 9.0 Searched_HMMs 1612 Date Thu Nov 7 17:34:36 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_62 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_62_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95821 Length: 763 100.0 5E-168 3E-171 937.7 71.2 691 5-705 1-703 (763) 2 protein:vir:8846 Length: 705 # 100.0 9E-131 6E-134 733.5 68.5 655 9-705 1-705 (705) 3 protein:vir:80165 Length: 651 100.0 2.8E-98 2E-101 555.4 59.1 613 5-705 1-651 (651) 4 protein:vir:93630 Length: 776 100.0 2.8E-99 2E-102 560.9 53.3 615 1-705 1-713 (776) 5 protein:vir:108295 Length: 711 100.0 2.8E-93 1.7E-96 528.0 64.0 607 1-702 1-711 (711) 6 protein:vir:2764 Length: 714 # 100.0 6.6E-91 4.1E-94 515.0 60.3 614 1-705 1-711 (714) 7 protein:vir:9950 Length: 714 # 100.0 6.6E-91 4.1E-94 515.0 60.3 614 1-705 1-711 (714) 8 protein:vir:817 Length: 714 # 100.0 6.6E-91 4.1E-94 515.0 60.3 614 1-705 1-711 (714) 9 protein:vir:3296 Length: 714 # 100.0 6.6E-91 4.1E-94 515.0 60.3 614 1-705 1-711 (714) 10 protein:vir:10117 Length: 714 100.0 6.6E-91 4.1E-94 515.0 60.3 614 1-705 1-711 (714) 11 protein:vir:104437 Length: 714 100.0 1.9E-89 1.2E-92 507.0 57.6 613 1-705 1-711 (714) 12 protein:vir:105619 Length: 772 100.0 4E-89 2.5E-92 505.2 55.4 610 2-705 1-700 (772) 13 protein:vir:9263 Length: 725 # 100.0 5.4E-82 3.3E-85 466.1 56.0 595 1-705 1-717 (725) 14 protein:vir:77597 Length: 725 100.0 1.9E-81 1.2E-84 463.1 58.5 595 1-705 1-717 (725) 15 protein:vir:105520 Length: 706 100.0 5.7E-80 3.5E-83 455.0 61.1 604 9-705 1-692 (706) 16 protein:vir:100920 Length: 725 100.0 2.3E-80 1.4E-83 457.2 58.3 595 1-705 1-717 (725) 17 protein:vir:94599 Length: 641 100.0 2.3E-81 1.4E-84 462.7 47.3 607 1-697 6-641 (641) 18 protein:vir:3520 Length: 720 # 100.0 1.5E-78 9.5E-82 447.2 58.9 594 20-705 1-712 (720) 19 protein:vir:172 Length: 708 # 100.0 5.2E-77 3.2E-80 438.8 58.5 595 1-705 1-696 (708) 20 protein:vir:105429 Length: 708 100.0 7.5E-77 4.7E-80 437.9 57.3 599 1-705 1-696 (708) 21 protein:vir:95449 Length: 584 100.0 1.1E-76 6.5E-80 437.1 44.5 549 9-649 1-584 (584) 22 protein:vir:3139 Length: 599 # 100.0 6.6E-69 4.1E-72 394.4 40.1 555 5-654 1-599 (599) 23 protein:vir:345 Length: 663 # 100.0 1.4E-45 8.8E-49 266.5 42.4 602 1-705 1-659 (663) 24 protein:vir:7321 Length: 556 # 100.0 2.3E-29 1.5E-32 177.5 46.7 519 20-674 1-556 (556) 25 protein:vir:107822 Length: 555 100.0 4.1E-29 2.6E-32 176.2 47.1 522 20-676 1-555 (555) 26 protein:vir:98506 Length: 555 100.0 4.1E-29 2.6E-32 176.2 47.1 522 20-676 1-555 (555) 27 protein:vir:107404 Length: 555 100.0 4.1E-29 2.6E-32 176.2 47.1 522 20-676 1-555 (555) 28 protein:vir:95315 Length: 559 100.0 7.3E-29 4.5E-32 174.8 46.3 524 21-689 1-559 (559) 29 protein:vir:102668 Length: 547 100.0 2.6E-28 1.6E-31 171.8 45.5 503 24-668 1-547 (547) 30 protein:vir:103765 Length: 549 100.0 5.1E-27 3.2E-30 164.7 46.7 512 20-671 1-549 (549) 31 protein:vir:1538 Length: 535 # 100.0 3.8E-27 2.3E-30 165.4 42.9 512 9-670 1-535 (535) 32 protein:vir:1785 Length: 555 # 100.0 1.1E-26 6.5E-30 163.0 45.2 527 20-690 1-555 (555) 33 protein:vir:2198 Length: 536 # 100.0 2E-26 1.3E-29 161.4 44.7 511 20-671 1-536 (536) 34 protein:vir:10447 Length: 536 100.0 2E-26 1.3E-29 161.5 44.6 511 20-671 1-536 (536) 35 protein:vir:3361 Length: 535 # 100.0 1.1E-26 7E-30 162.8 43.0 512 9-665 1-535 (535) 36 protein:vir:99672 Length: 532 99.9 2.4E-25 1.5E-28 155.5 44.9 509 9-674 1-532 (532) 37 protein:vir:94709 Length: 522 99.9 1.4E-24 8.5E-28 151.4 43.3 498 9-674 1-522 (522) 38 protein:vir:8883 Length: 543 # 99.9 5.6E-25 3.5E-28 153.5 40.0 519 9-705 1-542 (543) 39 protein:vir:100039 Length: 522 99.9 3.6E-24 2.2E-27 149.1 42.4 497 27-674 1-522 (522) 40 protein:vir:94572 Length: 535 99.9 1.3E-23 8E-27 146.1 44.8 513 9-680 1-535 (535) 41 protein:vir:103330 Length: 517 99.9 4.3E-23 2.6E-26 143.2 43.0 493 18-667 1-517 (517) 42 protein:vir:80211 Length: 514 99.9 6.8E-23 4.2E-26 142.1 43.8 483 29-655 1-514 (514) 43 protein:vir:96988 Length: 516 99.9 4.3E-23 2.7E-26 143.2 40.5 491 9-657 1-516 (516) 44 protein:vir:78942 Length: 510 99.9 7E-22 4.3E-25 136.6 45.6 480 20-671 1-510 (510) 45 protein:vir:6322 Length: 510 # 99.9 8.3E-22 5.1E-25 136.2 45.2 479 25-671 1-510 (510) 46 protein:vir:78696 Length: 542 99.9 5.4E-23 3.4E-26 142.6 38.1 505 20-677 1-542 (542) 47 protein:vir:7017 Length: 515 # 99.9 3.9E-21 2.4E-24 132.5 44.2 491 9-674 1-515 (515) 48 protein:vir:105641 Length: 516 99.9 4.6E-21 2.9E-24 132.1 42.4 491 1-662 1-516 (516) 49 protein:vir:3964 Length: 453 # 99.7 2.5E-15 1.6E-18 100.6 39.2 442 9-646 1-453 (453) 50 protein:vir:103385 Length: 666 99.7 1.3E-18 8.3E-22 118.6 19.3 577 1-656 1-666 (666) 51 protein:vir:3609 Length: 452 # 99.7 2.2E-15 1.3E-18 101.0 34.8 444 9-649 1-452 (452) 52 protein:vir:96403 Length: 666 99.7 3.3E-18 2E-21 116.5 19.2 577 1-667 1-666 (666) 53 protein:vir:9871 Length: 429 # 99.7 4.1E-15 2.6E-18 99.4 34.8 423 17-646 1-429 (429) 54 protein:vir:38 Length: 496 # N 99.6 5.1E-14 3.2E-17 93.5 36.4 456 9-637 1-496 (496) 55 protein:vir:80680 Length: 441 99.6 2.8E-13 1.7E-16 89.4 38.3 426 20-662 1-441 (441) 56 protein:vir:96494 Length: 501 99.6 1.6E-13 9.9E-17 90.7 36.5 463 1-658 1-501 (501) 57 protein:vir:96179 Length: 468 99.6 1.7E-14 1E-17 96.1 30.8 447 1-641 1-468 (468) 58 protein:vir:79703 Length: 505 99.6 2E-13 1.2E-16 90.2 36.6 465 1-635 3-505 (505) 59 protein:vir:93747 Length: 472 99.6 1.5E-13 9.1E-17 90.9 35.7 449 5-648 1-472 (472) 60 protein:vir:733 Length: 453 # 99.6 1.1E-13 6.8E-17 91.6 34.8 441 10-655 1-453 (453) 61 protein:vir:99522 Length: 470 99.6 5.9E-13 3.7E-16 87.6 40.8 452 1-648 1-470 (470) 62 protein:vir:1587 Length: 508 # 99.6 6E-13 3.7E-16 87.6 39.0 468 1-637 1-508 (508) 63 protein:vir:95806 Length: 440 99.6 1.2E-13 7.6E-17 91.4 34.0 421 29-644 1-440 (440) 64 protein:vir:106639 Length: 481 99.6 6.2E-13 3.9E-16 87.5 40.5 449 1-643 14-481 (481) 65 protein:vir:80959 Length: 499 99.6 6.7E-13 4.1E-16 87.3 39.5 457 1-637 1-499 (499) 66 protein:vir:1236 Length: 483 # 99.6 1.2E-13 7.3E-17 91.5 33.2 452 1-648 1-483 (483) 67 protein:vir:102950 Length: 471 99.6 1.5E-13 9.4E-17 90.9 33.7 435 20-646 1-471 (471) 68 protein:vir:105292 Length: 478 99.6 2.8E-14 1.7E-17 94.9 29.4 453 1-651 1-478 (478) 69 protein:vir:2732 Length: 501 # 99.6 2.4E-13 1.5E-16 89.7 33.8 457 1-656 1-501 (501) 70 protein:vir:107112 Length: 478 99.6 4.2E-14 2.6E-17 93.9 29.2 452 1-649 1-478 (478) 71 protein:vir:4898 Length: 502 # 99.6 1.5E-12 9.1E-16 85.5 37.0 458 1-647 1-502 (502) 72 protein:vir:9306 Length: 511 # 99.5 2.7E-13 1.7E-16 89.5 31.8 475 1-676 1-511 (511) 73 protein:vir:96240 Length: 511 99.5 3E-13 1.8E-16 89.3 32.0 475 1-676 1-511 (511) 74 protein:vir:94742 Length: 409 99.5 4.3E-13 2.7E-16 88.4 32.6 393 21-597 1-409 (409) 75 protein:vir:98883 Length: 517 99.5 2.4E-12 1.5E-15 84.3 36.4 472 23-628 1-517 (517) 76 protein:vir:9751 Length: 422 # 99.5 2.5E-13 1.6E-16 89.6 30.8 406 21-634 1-422 (422) 77 protein:vir:105889 Length: 474 99.5 7.2E-13 4.5E-16 87.1 33.1 447 1-656 3-474 (474) 78 protein:vir:94101 Length: 474 99.5 7.2E-13 4.5E-16 87.1 33.1 447 1-656 3-474 (474) 79 protein:vir:103951 Length: 511 99.5 1.4E-12 9E-16 85.5 34.7 467 1-649 1-511 (511) 80 protein:vir:78805 Length: 511 99.5 3.5E-13 2.2E-16 88.9 30.8 475 1-668 1-511 (511) 81 protein:vir:96366 Length: 511 99.5 3.5E-13 2.2E-16 88.9 30.8 475 1-668 1-511 (511) 82 protein:vir:97336 Length: 492 99.5 3.6E-12 2.3E-15 83.3 35.8 449 1-648 7-492 (492) 83 protein:vir:9922 Length: 489 # 99.5 2.1E-12 1.3E-15 84.6 34.5 455 1-645 1-489 (489) 84 protein:vir:99781 Length: 511 99.5 3.2E-13 2E-16 89.1 29.5 475 1-666 1-511 (511) 85 protein:vir:94498 Length: 474 99.5 6E-13 3.7E-16 87.6 30.2 449 1-656 1-474 (474) 86 protein:vir:97447 Length: 474 99.5 6E-13 3.7E-16 87.6 30.2 449 1-656 1-474 (474) 87 protein:vir:102330 Length: 451 99.5 5.5E-12 3.4E-15 82.3 36.9 433 21-642 1-451 (451) 88 protein:vir:3028 Length: 500 # 99.5 4.3E-12 2.7E-15 82.9 34.5 473 1-639 3-500 (500) 89 protein:vir:9815 Length: 500 # 99.5 4.3E-12 2.7E-15 82.9 34.5 473 1-639 3-500 (500) 90 protein:vir:105461 Length: 470 99.5 4.2E-12 2.6E-15 83.0 34.2 440 24-647 1-470 (470) 91 protein:vir:94805 Length: 492 99.5 7.1E-12 4.4E-15 81.7 35.3 448 1-648 7-492 (492) 92 protein:vir:106571 Length: 499 99.5 2.1E-12 1.3E-15 84.6 32.0 487 1-671 1-499 (499) 93 protein:vir:1634 Length: 409 # 99.5 2.1E-12 1.3E-15 84.6 31.7 393 21-597 1-409 (409) 94 protein:vir:96839 Length: 474 99.5 1E-11 6.4E-15 80.8 37.5 456 1-648 1-474 (474) 95 protein:vir:97171 Length: 512 99.5 1.1E-11 6.5E-15 80.8 35.3 466 1-649 1-512 (512) 96 protein:vir:95113 Length: 474 99.5 6.2E-12 3.9E-15 82.0 33.4 444 1-649 1-474 (474) 97 protein:vir:2341 Length: 488 # 99.4 1.5E-11 9.4E-15 79.9 35.2 463 9-650 1-488 (488) 98 protein:vir:9568 Length: 410 # 99.4 3.4E-12 2.1E-15 83.4 30.5 396 37-628 1-410 (410) 99 protein:vir:7430 Length: 563 # 99.4 3E-12 1.9E-15 83.7 29.9 519 1-659 1-563 (563) 100 protein:vir:95899 Length: 474 99.4 1.3E-11 7.9E-15 80.3 32.9 451 1-649 1-474 (474) 101 protein:vir:96266 Length: 474 99.4 1.3E-11 7.9E-15 80.3 32.9 451 1-649 1-474 (474) 102 protein:vir:78227 Length: 480 99.4 1.1E-12 6.6E-16 86.2 26.9 455 17-661 1-480 (480) 103 protein:vir:2427 Length: 485 # 99.4 3E-11 1.9E-14 78.3 34.7 450 11-648 1-485 (485) 104 protein:vir:78537 Length: 480 99.4 1.4E-12 8.8E-16 85.5 26.1 455 17-667 1-480 (480) 105 protein:vir:94546 Length: 506 99.4 9E-12 5.6E-15 81.1 29.4 457 1-665 6-506 (506) 106 protein:vir:5961 Length: 503 # 99.4 5.6E-11 3.5E-14 76.8 39.1 470 1-675 1-503 (503) 107 protein:vir:99072 Length: 479 99.4 2.1E-11 1.3E-14 79.1 30.4 456 1-662 1-479 (479) 108 protein:vir:104082 Length: 485 99.3 7.1E-11 4.4E-14 76.2 32.2 453 9-648 1-485 (485) 109 protein:vir:2500 Length: 501 # 99.3 3.3E-11 2.1E-14 78.0 28.6 478 1-666 1-501 (501) 110 protein:vir:102239 Length: 527 99.3 4.7E-11 2.9E-14 77.2 29.2 496 1-643 1-527 (527) 111 protein:vir:101494 Length: 527 99.3 5.3E-11 3.3E-14 76.9 29.2 496 1-643 1-527 (527) 112 protein:vir:79043 Length: 479 99.3 1.6E-10 9.7E-14 74.3 39.6 446 1-641 1-479 (479) 113 protein:vir:4223 Length: 486 # 99.3 1.6E-10 1E-13 74.3 33.5 459 11-652 1-486 (486) 114 protein:vir:4782 Length: 522 # 99.3 3.1E-10 1.9E-13 72.7 33.4 486 1-647 3-522 (522) 115 protein:vir:7768 Length: 484 # 99.2 3.3E-10 2E-13 72.6 33.3 458 9-653 1-484 (484) 116 protein:vir:99916 Length: 504 99.2 8.8E-10 5.4E-13 70.2 36.3 459 1-636 1-504 (504) 117 protein:vir:105819 Length: 456 99.2 1E-09 6.3E-13 69.9 31.9 433 17-649 1-456 (456) 118 protein:vir:102602 Length: 456 99.2 1E-09 6.3E-13 69.9 31.9 433 17-649 1-456 (456) 119 protein:vir:8184 Length: 474 # 99.1 1.2E-09 7.7E-13 69.4 34.0 445 9-644 1-474 (474) 120 protein:vir:78907 Length: 518 99.1 1.4E-09 8.5E-13 69.2 36.0 474 1-630 7-518 (518) 121 protein:vir:105520 Length: 706 99.1 2.9E-09 1.8E-12 67.4 37.1 572 72-705 1-689 (706) 122 protein:vir:7987 Length: 456 # 99.0 6.9E-09 4.3E-12 65.3 37.0 434 17-649 1-456 (456) 123 protein:vir:93630 Length: 776 98.9 1.2E-08 7.5E-12 64.0 37.1 601 5-705 1-703 (776) 124 protein:vir:98444 Length: 434 98.9 1E-08 6.4E-12 64.4 23.8 416 55-643 1-434 (434) 125 protein:vir:104437 Length: 714 98.8 4.3E-08 2.7E-11 61.0 32.1 579 54-705 1-703 (714) 126 protein:vir:8846 Length: 705 # 98.6 1.6E-07 9.9E-11 57.8 35.9 576 59-705 1-685 (705) 127 protein:vir:3520 Length: 720 # 98.6 1.9E-07 1.2E-10 57.5 29.4 563 44-705 1-691 (720) 128 protein:vir:105429 Length: 708 98.6 2.4E-07 1.5E-10 56.9 34.9 572 76-705 1-691 (708) 129 protein:vir:817 Length: 714 # 98.5 4.7E-07 2.9E-10 55.3 31.4 578 58-705 1-703 (714) 130 protein:vir:2764 Length: 714 # 98.5 4.7E-07 2.9E-10 55.3 31.4 578 58-705 1-703 (714) 131 protein:vir:10117 Length: 714 98.5 4.7E-07 2.9E-10 55.3 31.4 578 58-705 1-703 (714) 132 protein:vir:9950 Length: 714 # 98.5 4.7E-07 2.9E-10 55.3 31.4 578 58-705 1-703 (714) 133 protein:vir:3296 Length: 714 # 98.5 4.7E-07 2.9E-10 55.3 31.4 578 58-705 1-703 (714) 134 protein:vir:108295 Length: 711 98.4 6.8E-07 4.2E-10 54.4 38.0 576 35-705 1-710 (711) 135 protein:vir:78083 Length: 537 98.3 1.5E-06 9.1E-10 52.6 40.2 443 12-614 1-537 (537) 136 protein:vir:100920 Length: 725 98.3 1.7E-06 1.1E-09 52.2 39.7 579 26-705 1-703 (725) 137 protein:vir:77597 Length: 725 98.1 3.8E-06 2.4E-09 50.3 40.4 577 26-705 1-700 (725) 138 protein:vir:9263 Length: 725 # 98.0 7.2E-06 4.5E-09 48.8 39.4 571 26-705 1-699 (725) 139 protein:vir:105619 Length: 772 98.0 8.6E-06 5.4E-09 48.3 31.9 541 101-705 1-693 (772) 140 protein:vir:95821 Length: 763 92.9 0.0094 5.8E-06 31.7 31.9 575 1-705 87-715 (763) 141 protein:vir:94956 Length: 452 84.2 0.062 3.9E-05 27.2 34.6 416 9-613 1-452 (452) 142 protein:vir:78393 Length: 489 78.9 0.11 6.8E-05 25.9 27.7 443 58-616 1-489 (489) 143 protein:vir:96783 Length: 488 78.8 0.11 6.9E-05 25.8 26.0 440 1-610 1-488 (488) 144 protein:vir:78641 Length: 278 75.8 0.14 8.8E-05 25.2 21.2 258 213-541 1-278 (278) 145 protein:vir:1084 Length: 437 # 62.0 0.34 0.00021 23.1 13.2 112 591-705 1-124 (437) 146 protein:vir:172 Length: 708 # 60.7 0.37 0.00023 23.0 35.3 566 76-705 1-691 (708) 147 protein:vir:95014 Length: 491 58.4 0.41 0.00026 22.7 29.5 457 58-626 1-491 (491) 148 protein:vir:80453 Length: 535 55.4 0.48 0.0003 22.3 21.7 470 35-595 1-535 (535) 149 protein:vir:1084 Length: 437 # 29.7 1.6 0.001 19.4 14.4 126 575-705 1-134 (437) 150 protein:vir:3870 Length: 400 # 20.4 2.8 0.0017 18.1 10.5 111 578-705 1-135 (400) No 1 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=5.3e-168 Score=937.66 Aligned_cols=691 Identities=45% Similarity=0.781 Sum_probs=594.8 Q ss_pred hhhhhcc-------cccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHH Q lcl|NC_021540. 5 NEEFLED-------TVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIR 77 (705) Q Consensus 5 ~~~~~~~-------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~ 77 (705) +|..... ++|.|||+|||+++|++|++||+.|++++++++++..+|++||+++++.+|++++|||+|||++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~ 80 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVR 80 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHH Confidence 4444444 488999999999999999999999999999999999999999999999888899999999999999 Q ss_pred HHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhh Q lcl|NC_021540. 78 KQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETK 157 (705) Q Consensus 78 ~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~ 157 (705) ++|||++|+|+++|||+++||+|.|+++||+++|+|+|+||||+|+++|+||+++++||++||++|+|||||||++++++ T Consensus 81 ~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~ 160 (763) T protein:vir:95 81 RQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRK 160 (763) T ss_pred HHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988877 Q ss_pred hhhcccccc-cccCCchhHHHHHHHHHHHhhchhhhcc-hHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceE Q lcl|NC_021540. 158 VTENVPVFQ-YVEATGESIDLINQAVQMYQMNPSILDT-MPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEV 235 (705) Q Consensus 158 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i 235 (705) .++....++ +...+...........++..+++....+ ..+.+..+.......|.++..++.+.....+.+..+++|+| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~i 240 (763) T protein:vir:95 161 EKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTV 240 (763) T ss_pred eeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEE Confidence 777666665 3344444444444555555555554332 44556666777778899999999998888888888999999 Q ss_pred EEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCc-Ccchhhhhhhhhhcccccc--ccccccccccccCeEE Q lcl|NC_021540. 236 TICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYS-NLEYIKEDSSTSTSSDHYS--SDTSFTFSDKARKKIV 312 (705) Q Consensus 236 ~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~-d~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~v~ 312 (705) ++|+|++|||||+|+.|++||+||+|++++|+++|+++|++. +++.+++............ .....++.+.++++|+ T Consensus 241 e~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~ 320 (763) T protein:vir:95 241 EMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVV 320 (763) T ss_pred EeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEE Confidence 999999999999998789999999999999999999997643 3444544332222222111 1123445667789999 Q ss_pred EEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 313 VYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 313 v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) |||||+++|++|||++++|+++|+|+++|+.+++||+|++|||++++++|+++++||+|+++.++|+|+.+|+++|+++| T Consensus 321 v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d 400 (763) T protein:vir:95 321 AYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMID 400 (763) T ss_pred EEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) +++++++|+|++++|++++.+.++++||+++++++|.++...+.+..+|++++..+.+++++...++++|||+++++|.+ T Consensus 401 ~l~~~~~~~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~ 480 (763) T protein:vir:95 401 LLGRSANGQRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVT 480 (763) T ss_pred HHHhhcCCcEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcC Confidence 99999999999999999999999999999999999999888899999999999999999999999999999999999999 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeecc Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS 552 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~ 552 (705) ++.++.+|++++++++++++++..++|||+++++++|++++.||++||+++++|||+|++|++|+++++.++|||.|+++ T Consensus 481 ~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~ 560 (763) T protein:vir:95 481 GESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS 560 (763) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc Confidence 99888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 553 NAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKL 632 (705) Q Consensus 553 ~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~ 632 (705) +++.+.++.+++.+|++.+++.+++.....++..++++..+..+...++..++++++.++++. |++++++ T Consensus 561 ~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qa----------qle~~~~ 630 (763) T protein:vir:95 561 TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLK----------QLAVEKA 630 (763) T ss_pred cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHH----------HHHHHHH Confidence 988888889999999999999999998889999999998888888887777666555433221 2223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 633 QAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 633 qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) +++++..+++++..+++++..+.+++.+.+++.+++.++++++++++...++++++++++++++.+.+.++.. T Consensus 631 q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~ 703 (763) T protein:vir:95 631 QLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGEL 703 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 4444444455555555555556666666677777777778888888888888888899999998888887777 No 2 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=9.2e-131 Score=733.50 Aligned_cols=655 Identities=16% Similarity=0.178 Sum_probs=461.7 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHH-HHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVA-IIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSAL 87 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l 87 (705) |.+.+| --.+++..+++.|..++++|+++++++++ +..+|++||+|++.+ ...+|||+||+++|+++|||++|+| T Consensus 1 ~~k~~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~--~~~~~~s~~~~~~v~~~v~~~~~~l 76 (705) T protein:vir:88 1 MAKRRK--IKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFG--NERPGKSGIVSRDVQETVDWIMPSL 76 (705) T ss_pred CCcccc--cccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCC--cccCCCCccccHHHHHHHHHHHHHH Confidence 666655 44668899999999999999999999997 567999999999865 4578999999999999999999999 Q ss_pred HHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccc Q lcl|NC_021540. 88 SEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQY 167 (705) Q Consensus 88 ~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~ 167 (705) +++||+|+++|.|.|++++|+++|++.|+|+||+|+++|++++++++||++||++|+||+||+|+.+.++.++.. T Consensus 77 ~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~----- 151 (705) T protein:vir:88 77 MKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERF----- 151 (705) T ss_pred HHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhh----- Confidence 999999999999999999999999999999999999999999999999999999999999999986665554421 Q ss_pred ccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCC Q lcl|NC_021540. 168 VEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDP 247 (705) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp 247 (705) ....+. .+.....++.+. .+....+....+..........++|+|++|||++||||| T Consensus 152 ~~~~~~-------~l~~~~~d~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp 208 (705) T protein:vir:88 152 SGLSED-------MVADILSDPDTS----------------ILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDR 208 (705) T ss_pred ccCChh-------hhhhhhhhhhhh----------------cccccccccceeeeEEeeeeecCceeeeeccHHHceecC Confidence 111111 111111122210 000011111111122223345689999999999999999 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcch-hhhh---hhhhh----ccccccccc----cccccccccCeEEEEE Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEY-IKED---SSTST----SSDHYSSDT----SFTFSDKARKKIVVYE 315 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~-~~~~---~~~~~----~~~~~~~~~----~~~~~~~~~~~v~v~E 315 (705) +|+ +++||+|++|++++|+++|.++||+.+... +... +.... .....+... ...+.+..+++|++|| T Consensus 209 ~a~-~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E 287 (705) T protein:vir:88 209 LAT-CIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASE 287 (705) T ss_pred CCC-CcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEE Confidence 987 799999999999999999999999875321 1111 00000 011111111 1112334567899999 Q ss_pred EEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 316 YWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMA 395 (705) Q Consensus 316 ~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~ 395 (705) ||++++++|||+.++++++|+|+++|+.++ ++++||++++++|+++++||+|+++.++|+|+.+|+++|+++|+++ T Consensus 288 ~y~~~d~~~d~~~~~~~~~~~g~~il~~~~----~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~ 363 (705) T protein:vir:88 288 CYTLLDVDGDGISELRRILYVGDYIISNEP----WDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIY 363 (705) T ss_pred eeeEecccCCcceeeEEEEEeCcccccccc----CCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999998653 3789999999999999999999999999999999999999999999 Q ss_pred hcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccc Q lcl|NC_021540. 396 RSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDS 475 (705) Q Consensus 396 ~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~ 475 (705) ++++|++++++|+++..+.++++||++++++++ .++.+.++|++++.+++|++++.+.++++|||+++++|.++++ T Consensus 364 ~~~~~~~~~~~g~v~~~d~~~~~pg~vv~~~~~----~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~ 439 (705) T protein:vir:88 364 RTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSM----NSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNT 439 (705) T ss_pred hccCCceeccccccCcccccccCCCeeEEecCC----CccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCccc Confidence 999999999999999999999999999999865 3678889999999999999999999999999999999998877 Q ss_pred c--chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeecc Q lcl|NC_021540. 476 L--GTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS 552 (705) Q Consensus 476 ~--~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~ 552 (705) + +.||++++++++++++++..++++|++ +++++|++++.||++|+++++++||+| .|++++|+++.+++++.++++ T Consensus 440 ~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g-~~v~v~~~~~~~~~~v~v~v~ 518 (705) T protein:vir:88 440 LHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRG-KWVAVNPANWRERSDLTVTVG 518 (705) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeecc-chhccchHhhccCCceEEeec Confidence 6 469999999999999999999999986 689999999999999999999999997 589999999999999987665 Q ss_pred ch----hHHHHHHHHHHHHHHHHhh------hchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHH----H Q lcl|NC_021540. 553 NA----ETDAIKAQELSFMLQTMGQ------SLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQI----K 618 (705) Q Consensus 553 ~~----~~~~~~~q~~~~llq~~~~------~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~----~ 618 (705) .+ ..+.+.+..++++.+.+.+ ...+.....++.++.+..++....+++..+.......+++..++ + T Consensus 519 ~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~ 598 (705) T protein:vir:88 519 IGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQP 598 (705) T ss_pred cccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhH Confidence 33 2334444455554444322 22233444566677777777777666654332111111111000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-----HHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 619 QLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEA--DLNTLDFVEQE-----TGVK-QERELELMQAQAKGNTQR 690 (705) Q Consensus 619 q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea--~~~~~~~~~q~-----~~~k-q~~e~e~~~~q~~~~~~~ 690 (705) +.+..+.|+++++++++++.++++++.++++++..+.+. ++++....+++ ...+ +++..++++++.+++.++ T Consensus 599 ~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~ 678 (705) T protein:vir:88 599 KPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHL 678 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111112333344444444333333333222221111111 11111111110 0000 001111111111112122 Q ss_pred HHHHHH------------HHHHhhccC Q lcl|NC_021540. 691 DIVKTF------------LDTNKQGNQ 705 (705) Q Consensus 691 ~~~k~~------------~~~~~q~~~ 705 (705) +...+. .+..+..+| T Consensus 679 e~~q~~~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 679 EATQARAAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHhcC Confidence 111111 111112222 No 3 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=2.8e-98 Score=555.37 Aligned_cols=613 Identities=14% Similarity=0.188 Sum_probs=416.2 Q ss_pred hhhhhcccccccCCCCCC-HHHHHHHHHHHHHhhHHhhHHHHHH----------HHHHHHhccCCCC--CCCCCCCCCcC Q lcl|NC_021540. 5 NEEFLEDTVPSLQEDWKN-KPKVSDLLNDFNNAKSTKDTQVAII----------DDWLAQLNVTGAY--KPKQQVGRSSV 71 (705) Q Consensus 5 ~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~--~~~~~~grs~~ 71 (705) ++-..+-.+| |...+.+ +.+.+.|.++++..+++.+....++ .++++||++.... .+++.+|||+| T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~ 79 (651) T protein:vir:80 1 MKLATTTTDK-NRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKI 79 (651) T ss_pred Ccccccccch-hhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccc Confidence 1111111111 1222333 3378888888888887776544443 3578999987643 35667899999 Q ss_pred CCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhc--CCcch-HHHHHHHHHhcCCeEEE Q lcl|NC_021540. 72 QPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQL--DKVKL-IDTMVRTAVNEGTVIFR 148 (705) Q Consensus 72 v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~--~~~~~-~~~~~~~al~~g~gi~k 148 (705) |+++|+.+|||++|+|+++||++++||+|.|. +|++.|++.+++|||++..+. .+|.. +..+++++|+.|+||+| T Consensus 80 ~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~--~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~k 157 (651) T protein:vir:80 80 TTGKAFEAIETIHAYLMSATFPNKNWFDVVPA--KPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLA 157 (651) T ss_pred cChhHHHHHHHHHHHHHHhhcCCCceeEeccC--CchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEE Confidence 99999999999999999999999999999995 555679999999999998763 34544 44678999999999999 Q ss_pred EeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeee Q lcl|NC_021540. 149 TSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKT 228 (705) Q Consensus 149 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 228 (705) |+|+.++++.++.... + .....|-+.. ......+. T Consensus 158 v~we~~~~~~~~~~~~--------~-------------------------------~~~~~~~~~~------~v~~~~~~ 192 (651) T protein:vir:80 158 LPWRVETAEVKKKVQV--------R-------------------------------TPLFEDEPTF------EVVSEERE 192 (651) T ss_pred Eeecceeeeeehheec--------c-------------------------------ccccccccce------eeecccee Confidence 9998776555432210 0 0000011111 01111233 Q ss_pred ccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHH---HHhcCCcCcchhhhhhhhhhc--ccccccc---cc Q lcl|NC_021540. 229 VKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDL---EKYGIYSNLEYIKEDSSTSTS--SDHYSSD---TS 300 (705) Q Consensus 229 ~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el---~~~g~~~d~~~~~~~~~~~~~--~~~~~~~---~~ 300 (705) ..++|+|++|||++|||||+|+ +++||.|++|+++ |+.++ .++|+|.+.+........... ....... .. T Consensus 193 ~~~~~~i~~v~p~~~~~dp~a~-~~~d~~~v~~~~~-t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (651) T protein:vir:80 193 VKSSPDFEVLDMFDCFYDPNVT-DPNRGAFIRKLTK-TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQG 270 (651) T ss_pred eeceeEEEEecHHHeeecCCCc-Cccccceeeeeee-eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccC Confidence 4689999999999999999985 8999999988754 56554 456888765433221111100 0000000 00 Q ss_pred c-cccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHH Q lcl|NC_021540. 301 F-TFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDN 379 (705) Q Consensus 301 ~-~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~ 379 (705) . ..+..+.++|.|||||++++.+++++ +.+++++.|+++|+.+++||++ .+||++++|+|+++++||+|+++.+.|. T Consensus 271 ~d~~~~~~~~~v~v~E~~~~~d~e~~~~-~~~~v~~~g~~il~~~~~~~~~-~~Pf~~~~~~~~~~~~yG~g~~~~~~~~ 348 (651) T protein:vir:80 271 VTTSLWSPHQNVELLEYWGDIHLENKTY-HDVVVTIMGNEVLRFEQNPYWC-GRPFVIGTYIPTARQPYAMGALQPNLGM 348 (651) T ss_pred CCccccccccceEEEEEEEEeeccCCce-EEEEEEEcCcEEecccccCCCC-CCCeeeecceecCccccCCChHHHHhHH Confidence 1 11223567899999999999999887 5678888899999999999986 4699999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccC-ccchHHHHHHHHHHHHHH Q lcl|NC_021540. 380 QKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKY-PELPASSYNMLQMFTLEA 458 (705) Q Consensus 380 Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~-~~i~~~~~~~l~~~~~~~ 458 (705) |+.+|+++|+++++++++++|++++++|++...+.+.++||++|+++.+.. +.++++ ++.++..+++++++.+.+ T Consensus 349 q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~~----~~~l~~~~~~~~~~~~~l~~l~~~~ 424 (651) T protein:vir:80 349 LHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHGD----LQPLANQSSNFSITYQESSFLESTI 424 (651) T ss_pred HHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCCC----ceeeccCcccchhHHHHHHHHHHHH Confidence 999999999999999999999999998887777777889999998876543 333332 334567789999999999 Q ss_pred HHHhCcchHhcCCCcccc-chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCc---- Q lcl|NC_021540. 459 DALSGVKSFSQGLTGDSL-GTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEE---- 532 (705) Q Consensus 459 ~~~tGv~d~~~G~~~~~~-~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~---- 532 (705) +++|||+++++|..+... ..||++|+++++++++++..++++|++ ++++++++++.|+++|++.++++|++|+. T Consensus 425 ~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~ 504 (651) T protein:vir:80 425 DKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAY 504 (651) T ss_pred HHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccc Confidence 999999999999877654 358999999999999999999999998 78999999999999999999999999863 Q ss_pred -eeeechhhcccceeEEeeccc--hhHHHHHHHHHHHHHHHHhhhchh---HHHHHHHHHHHhhhccchhhhhhhccccc Q lcl|NC_021540. 533 -FVQINRDNLVGSFDIKLSISN--AETDAIKAQELSFMLQTMGQSLPF---DMTKLILGEIAKLRGMPDLSKMISKYNPE 606 (705) Q Consensus 533 -~v~i~~~~~~~~~dv~v~~~~--~~~~~~~~q~~~~llq~~~~~~~~---~~~~~il~~l~e~~~~~~~~~~~~~~~~q 606 (705) ++.++++++.+++++. ..|. ...+.+..+++.++++.+++..+. .....++.++++..|++....++..+.++ T Consensus 505 ~~~~i~~~dl~~~~~iv-~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~ 583 (651) T protein:vir:80 505 EYYELDVEDLQKEVRLV-PIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQ 583 (651) T ss_pred cccccCccceeeeeeee-eccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccc Confidence 6778888888888874 3333 234566777777888877654322 23455677888999999988888665543 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 607 PSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKG 686 (705) Q Consensus 607 ~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~ 686 (705) +++.+++.. ..+++....+++.+.++.++ ++.+ +.++++++.+.+++.+. T Consensus 584 ~~~~~~~~~-~~q~~~~~~~a~~~~~~~~~-------~~~~----------------------~~~~~~~~~~~~~~~~~ 633 (651) T protein:vir:80 584 APANPQEAL-LSQAKDVGGQAMSNMLQNQL-------QADG----------------------GTQMMSEMYGTPNADQM 633 (651) T ss_pred hhhhhhHHH-HhhHHHHHHHHHHHHHHHHH-------HHHH----------------------HHHHHHHHHHHHHHHHH Confidence 322222111 11111111111111111100 0000 00111111111111111 Q ss_pred HHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 687 NTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 687 ~~~~~~~k~~~~~~~q~~~ 705 (705) ++++.+.+..+++ ++-.+ T Consensus 634 ~~~~~~~~~~l~~-~~~~~ 651 (651) T protein:vir:80 634 QQELMATTPNVSE-QQLTQ 651 (651) T ss_pred HHHHHHHHHHHHH-hhccC Confidence 1111111111111 11111 No 4 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=2.8e-99 Score=560.89 Aligned_cols=615 Identities=14% Similarity=0.135 Sum_probs=407.7 Q ss_pred Ccchhh--------------------hhhcccccccCCCCCCHH---HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccC Q lcl|NC_021540. 1 MSDINE--------------------EFLEDTVPSLQEDWKNKP---KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVT 57 (705) Q Consensus 1 ~~~~~~--------------------~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (705) |-+++. +..+++++ .....++. +++.|...+..+......-.++..+.++||+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~ 78 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKP--ANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNI 78 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcc--cCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC Confidence 333332 11122222 12334444 555555555555544443344556789999987 Q ss_pred CCCCC----CCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHH Q lcl|NC_021540. 58 GAYKP----KQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLID 133 (705) Q Consensus 58 ~~~~~----~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~ 133 (705) -.... .+..|++.+|-+.|.-+|+|++....+ +..-+.|.|++++|++.|+..|.++||+ ...++...... T Consensus 79 Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~----nr~~~~~~p~~~~d~~~Ae~l~~~~~~~-~~~~~~~~~~~ 153 (776) T protein:vir:93 79 QWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKR----GRSDFKVLPRRKDGGKAAERKTALLKYL-SDVNHTPFERS 153 (776) T ss_pred CCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHh----CCcceEEecCChhHHHHHHHHHHHHHHH-HHhhcHHHHHH Confidence 54432 234799999999999999999988855 5566999999999999999999999997 58889999999 Q ss_pred HHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccc Q lcl|NC_021540. 134 TMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPI 213 (705) Q Consensus 134 ~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 213 (705) ++++++|++|.|+++++|++... T Consensus 154 ~af~d~~~~G~G~~~v~~d~~~~--------------------------------------------------------- 176 (776) T protein:vir:93 154 MAFEETTKAGIGWLESQVQDEND--------------------------------------------------------- 176 (776) T ss_pred HHHHHhhhcCcceEEEEeeccCC--------------------------------------------------------- Confidence 99999999999999999853200 Q ss_pred eeccCcccccceeeeccCcceEEEechhheeeCCCcc-CChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhh-- Q lcl|NC_021540. 214 LAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCN-GNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTST-- 290 (705) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~-- 290 (705) .+.+++++|+|++|||||+++ .|++||+|++|++|+|+++|+++ |+...+.+........ T Consensus 177 ----------------~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-~p~~~~~~~~~~~~~~~~ 239 (776) T protein:vir:93 177 ----------------GEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAI-FPERAAQLRAAAVDNFET 239 (776) T ss_pred ----------------CCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHh-cCCchHHHHHhhhhcccc Confidence 012345689999999999775 59999999999999999999998 4433222211110000 Q ss_pred -------cc------cc--ccccccccccccccCeEEEEEEEEEeeec-------------------------------- Q lcl|NC_021540. 291 -------SS------DH--YSSDTSFTFSDKARKKIVVYEYWGYWDID-------------------------------- 323 (705) Q Consensus 291 -------~~------~~--~~~~~~~~~~~~~~~~v~v~E~w~k~~~~-------------------------------- 323 (705) .. .. ........+.+..+++|+|+|||+|..+. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~ 319 (776) T protein:vir:93 240 WGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRA 319 (776) T ss_pred cchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCce Confidence 00 00 00111223445677899999999985321 Q ss_pred ---CCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021540. 324 ---GSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANG 400 (705) Q Consensus 324 ---~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~ 400 (705) ..++.++|+++|+|+++|+.+++||+|++|||||+|+++++++++|+|+++.++|+|+++|+++|+++|++ +++ T Consensus 320 ~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~ 396 (776) T protein:vir:93 320 VLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STN 396 (776) T ss_pred eehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCC Confidence 01224568889999999999999999999999999999999999999999999999999999999999887 567 Q ss_pred cEEeeccccCchhhhh---hcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccc Q lcl|NC_021540. 401 QRGMSKNLLDPVNERK---FKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLG 477 (705) Q Consensus 401 ~~~~~~~av~~~d~~~---~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~ 477 (705) ++++++|++++.+.+. ++||++|++++|+.. .+.+...+++++.++++++++.+.++++|||+++++|..+|+.+ T Consensus 397 ~~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~S 474 (776) T protein:vir:93 397 KVLMEEGAVDDIDEFRREAARPDAVMTVKNGKLG--AVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVS 474 (776) T ss_pred ceeeccccccchHHHHHhcccCCceeeeCCcccc--ccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhh Confidence 8999999998776544 689999999998654 34555677899999999999999999999999999999988766 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecC----ceeeechhh-----cccceeEE Q lcl|NC_021540. 478 TTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDE----EFVQINRDN-----LVGSFDIK 548 (705) Q Consensus 478 ~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~----~~v~i~~~~-----~~~~~dv~ 548 (705) ++| +.+++++|++++..++|||+++++++|+++|+||.+||++++++||+|+ .||.||... ..++|||. T Consensus 475 g~a--i~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~ 552 (776) T protein:vir:93 475 GVA--IQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFI 552 (776) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEE Confidence 554 7777799999999999999999999999999999999999999999986 499998543 34789998 Q ss_pred eeccc--hhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH----HHHHHHHHHHH Q lcl|NC_021540. 549 LSISN--AETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA----QLEIQIKQLEA 622 (705) Q Consensus 549 v~~~~--~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~----q~~~q~~q~~~ 622 (705) |..+. ++.+++..+.++++++.+.+.+.......+ .+++++.+...+.+.++...+++++.+ +++.++.++++ T Consensus 553 v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~-~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~ 631 (776) T protein:vir:93 553 IDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLL-VENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQ 631 (776) T ss_pred EeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHH-HHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhh Confidence 87754 455667777777777766554443333332 244555566666666665444333222 11112222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQ 702 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q 702 (705) +..+++.+.++++++.+++++....++++..+.++.....+...+..... +..++...+..............+....+ T Consensus 632 ~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~-q~a~qa~~~~~~~~~~a~~a~~~~~~a~~ 710 (776) T protein:vir:93 632 QQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAV-KDATDAATAIAFMPELAGLSDGILRESGW 710 (776) T ss_pred HHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhh-hhhhhhhhhhhhhhhhhhhhhhhhccccc Confidence 22222222222322222333222222222222222211111100000000 00000000000000000001111111000 Q ss_pred ccC Q lcl|NC_021540. 703 GNQ 705 (705) Q Consensus 703 ~~~ 705 (705) ... T Consensus 711 ~~p 713 (776) T protein:vir:93 711 DDP 713 (776) T ss_pred ccc Confidence 000 No 5 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=2.8e-93 Score=527.99 Aligned_cols=607 Identities=13% Similarity=0.096 Sum_probs=416.4 Q ss_pred Cc------chhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCc Q lcl|NC_021540. 1 MS------DINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSS 70 (705) Q Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~ 70 (705) |+ -|-+-+.-+.+.-....-.++.++..++..|..+..+.....+.-.++.+||+|+--+. ..+..|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 80 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCc Confidence 43 23333332222222334455568888888888888888777776678899999864332 134579999 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCC----------------------cchHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 71 VQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKT----------------------WQDREAARQNEAILNYQFNNQLDK 128 (705) Q Consensus 71 ~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~----------------------~~D~~~A~~~t~~~n~~~~~~~~~ 128 (705) ++-+.|+-+|+|++..-.+ +-.-+.|.|+. .+|++.|++.|.+++|+.. .++. T Consensus 81 ~~~N~i~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~-~~~~ 155 (711) T protein:vir:10 81 LVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY-NCDA 155 (711) T ss_pred EEEcchHHHHHHHhhhHhh----CCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHH-hcCh Confidence 9999999999999887754 55668889875 7899999999999999544 5566 Q ss_pred cchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhh Q lcl|NC_021540. 129 VKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVA 208 (705) Q Consensus 129 ~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (705) -....++++++|++|.|+++++|++.... T Consensus 156 ~~~~s~af~d~~~~G~G~~ev~~d~~~~d--------------------------------------------------- 184 (711) T protein:vir:10 156 ETEYDIAFQGAVESGMGYLRVRSDYLADD--------------------------------------------------- 184 (711) T ss_pred hHHHHHHHHHhhhcCcceEEEEecccCCC--------------------------------------------------- Confidence 56777999999999999999887532110 Q ss_pred cCccceeccCcccccceeeeccCcceEEEe-chhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCc-chhhhh Q lcl|NC_021540. 209 NNRPILAIINGYEEQEVIKTVKNQPEVTIC-DYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNL-EYIKED 285 (705) Q Consensus 209 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V-~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~-~~~~~~ 285 (705) ...++|+|.+| +|.+|||||.+ +.|++||+|+++++|||+++++++ |+... ..+. T Consensus 185 -------------------~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-yp~~a~~~~~-- 242 (711) T protein:vir:10 185 -------------------SFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKAL-YPDATAEPVY-- 242 (711) T ss_pred -------------------CCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHh-CCchhhhhhh-- Confidence 11356778788 79999999966 469999999999999999999998 54321 1110 Q ss_pred hhhhhccccccccccccccccccCeEEEEEEEEEeeec------CCC-----------------------------eeEE Q lcl|NC_021540. 286 SSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDID------GSG-----------------------------VTTP 330 (705) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~------~dg-----------------------------~~~~ 330 (705) ..+ ...++ .....++|+|.|||++.... ++| ..+. T Consensus 243 ---~~~------~~~~~-~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v 312 (711) T protein:vir:10 243 ---EDS------VADYD-TWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKT 312 (711) T ss_pred ---ccc------ccccC-cccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeE Confidence 000 00000 11234789999999874311 111 0133 Q ss_pred EEEEEECCEEEecccCCCCCCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc Q lcl|NC_021540. 331 IVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL 408 (705) Q Consensus 331 ~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a 408 (705) ++.+|+|+++| .+++||+|++|||+|+++++. +++++|+|+++.++|+|+++|+++|+++|++++++++++++++|+ T Consensus 313 ~~~~~~G~~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~ga 391 (711) T protein:vir:10 313 YWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) T ss_pred EEEEEecceee-cCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcc Confidence 55678899999 688999999999999999864 788899999999999999999999999999999999999999999 Q ss_pred cCchhh-hh---hcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHH Q lcl|NC_021540. 409 LDPVNE-RK---FKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQ 484 (705) Q Consensus 409 v~~~d~-~~---~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~ 484 (705) |++.+. +. .+||+++++|++......+.+.+.|++|+.+++|++++.+.++++|||+++++|..+|+.++ .+|+ T Consensus 392 i~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg--~ai~ 469 (711) T protein:vir:10 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSG--RAII 469 (711) T ss_pred cCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHH--HHHH Confidence 987554 32 68999999999998888899999999999999999999999999999999999999887555 4578 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecC----ceeeechh--------------hccccee Q lcl|NC_021540. 485 GVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDE----EFVQINRD--------------NLVGSFD 546 (705) Q Consensus 485 ~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~----~~v~i~~~--------------~~~~~~d 546 (705) +++++|++++..++|||+++++++|+++|+||.+||++++++||+|+ +|+.||+. ...++|| T Consensus 470 ~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~D 549 (711) T protein:vir:10 470 ARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYD 549 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeE Confidence 88899999999999999999999999999999999999999999986 58888753 3467889 Q ss_pred EEeecc--chhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHH--HHHHHHHHHH Q lcl|NC_021540. 547 IKLSIS--NAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQ--LEIQIKQLEA 622 (705) Q Consensus 547 v~v~~~--~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q--~~~q~~q~~~ 622 (705) |.|+++ +++.+.+....+++++++++. ..+. ...++.+++++++..++.+.++...+++.+..+ .+.++.+++. T Consensus 550 v~i~~~p~~~s~r~~~~~~l~ql~~~~p~-~~~~-~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~ 627 (711) T protein:vir:10 550 VVVTTGPAFATQRIEAAEAMIQFAQAVPS-AAAV-MADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQ 627 (711) T ss_pred EEEeeccCchhHHHHHHHHHHHHHhhcch-hhhH-HHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHH Confidence 888774 455666666677777665532 2222 223445677777777888887766554433221 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H--HHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQ---E--REL-ELMQAQAKGNTQRDIVKTF 696 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq---~--~e~-e~~~~q~~~~~~~~~~k~~ 696 (705) ++...+++.++++++...++++++.++++..+++++....+...+....+. . +.+ +...+.++.+++++..+++ T Consensus 628 qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qaelq~~q~~ 707 (711) T protein:vir:10 628 TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQAN 707 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111222222222222222222222211211111111111111111 1 111 1111222233333333333 Q ss_pred HHHHhh Q lcl|NC_021540. 697 LDTNKQ 702 (705) Q Consensus 697 ~~~~~q 702 (705) +.+ | T Consensus 708 ~~q--~ 711 (711) T protein:vir:10 708 VTE--Q 711 (711) T ss_pred hhc--C Confidence 322 2 No 6 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=6.6e-91 Score=514.98 Aligned_cols=614 Identities=11% Similarity=0.051 Sum_probs=404.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |.+ |+ ..-+.|+.+=.+..+...+...+..+...+..--+...+..+||+|.=-.. -....|+..+|-+.| T Consensus 1 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:27 1 MKN--ET---NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCc--cc---ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccH Confidence 321 22 222333332222233334444444333333322234457889999863322 123569999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +-+|+|++..-.+ +-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++++++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:27 76 APTVDGVLGMEAK----TRTDLVVMSDEPDDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHHh----CCcceEEecCCCCchhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcCcceEEeccccC Confidence 9999999888754 556699999987665 68999999999987743 3334566899999999999877765310 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ...++++ T Consensus 151 -------------------------------------------------------------------------~~~~~i~ 157 (714) T protein:vir:27 151 -------------------------------------------------------------------------PFGPEFK 157 (714) T ss_pred -------------------------------------------------------------------------CCCCCeE Confidence 0124578 Q ss_pred EEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------hhc-------cccc Q lcl|NC_021540. 235 VTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------STS-------SDHY 295 (705) Q Consensus 235 i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~~~-------~~~~ 295 (705) |++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... ... .... T Consensus 158 i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (714) T protein:vir:27 158 VSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQ 236 (714) T ss_pred EEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhhc Confidence 999999999999965 569999999999999999999998 44322222111000 000 0001 Q ss_pred ccc-ccccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEEE Q lcl|NC_021540. 296 SSD-TSFTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVMI 341 (705) Q Consensus 296 ~~~-~~~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~iL 341 (705) .++ ....+.+..+++|+|+|||+|... ++ ..+.+.++++|+|.++| T Consensus 237 ~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L 316 (714) T protein:vir:27 237 SWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFI 316 (714) T ss_pred cccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCccc Confidence 111 122345566889999999987421 11 12356788999999999 Q ss_pred ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h---hh Q lcl|NC_021540. 342 RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R---KF 417 (705) Q Consensus 342 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~---~~ 417 (705) +.+++||+|++|||+|+++++.+..+.++|+++.++|+|+.+|+++|++++++ ++ .++++.+|+++..+. + -+ T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~-~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:27 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QA-KRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cC-CceeeecCcccccHHHHHHhcc Confidence 99999999999999999999998888899999999999999999999998865 34 456688888877542 2 26 Q ss_pred cCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 418 KMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 418 ~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +||+++.+||+. .+...+.+.+.+++|+.++++++++.+.++++|||+++++|..+|+.||.| |++++++|++. T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 471 (714) T protein:vir:27 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGATT 471 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHHHHH Confidence 999999998753 334567888889999999999999999999999999999999999877766 77777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc-------eeeechh---------hcccceeEEeeccc--hh Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE-------FVQINRD---------NLVGSFDIKLSISN--AE 555 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~-------~v~i~~~---------~~~~~~dv~v~~~~--~~ 555 (705) +..++|||+++++.+|+++|+||.+||++++++||+|++ ++.+|+. ...++|||.|+.+. ++ T Consensus 472 l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t 551 (714) T protein:vir:27 472 LAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPA 551 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchH Confidence 999999999999999999999999999999999999752 7888754 34678899887654 45 Q ss_pred HHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQELQMR 628 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~q~e 628 (705) .+.+..+.++++++.++|........ ++.+++++++...+.+++++..+++++.. +.+.+++++++++.+++ T Consensus 552 ~r~~~~~~l~~l~~~~~p~~~~~~~~-~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:27 552 FKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 56777778888887766554433333 44477777777788888776554433211 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-H-Hhhc Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL--DFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFLD-T-NKQG 703 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~--~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~~-~-~~q~ 703 (705) +.+++++++..+++++..++++.+...++..... +...... ...++++ +.++..+..+++..++.++.. . +++. T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:27 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333443333333333333333222222211100 0000011 1111111 111111222223333222221 1 1222 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) +. T Consensus 710 ~~ 711 (714) T protein:vir:27 710 NE 711 (714) T ss_pred Hh Confidence 22 No 7 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=6.6e-91 Score=514.98 Aligned_cols=614 Identities=11% Similarity=0.051 Sum_probs=404.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |.+ |+ ..-+.|+.+=.+..+...+...+..+...+..--+...+..+||+|.=-.. -....|+..+|-+.| T Consensus 1 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:99 1 MKN--ET---NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCc--cc---ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccH Confidence 321 22 222333332222233334444444333333322234457889999863322 123569999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +-+|+|++..-.+ +-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++++++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:99 76 APTVDGVLGMEAK----TRTDLVVMSDEPDDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHHh----CCcceEEecCCCCchhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcCcceEEeccccC Confidence 9999999888754 556699999987665 68999999999987743 3334566899999999999877765310 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ...++++ T Consensus 151 -------------------------------------------------------------------------~~~~~i~ 157 (714) T protein:vir:99 151 -------------------------------------------------------------------------PFGPEFK 157 (714) T ss_pred -------------------------------------------------------------------------CCCCCeE Confidence 0124578 Q ss_pred EEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------hhc-------cccc Q lcl|NC_021540. 235 VTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------STS-------SDHY 295 (705) Q Consensus 235 i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~~~-------~~~~ 295 (705) |++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... ... .... T Consensus 158 i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (714) T protein:vir:99 158 VSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQ 236 (714) T ss_pred EEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhhc Confidence 999999999999965 569999999999999999999998 44322222111000 000 0001 Q ss_pred ccc-ccccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEEE Q lcl|NC_021540. 296 SSD-TSFTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVMI 341 (705) Q Consensus 296 ~~~-~~~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~iL 341 (705) .++ ....+.+..+++|+|+|||+|... ++ ..+.+.++++|+|.++| T Consensus 237 ~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L 316 (714) T protein:vir:99 237 SWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFI 316 (714) T ss_pred cccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCccc Confidence 111 122345566889999999987421 11 12356788999999999 Q ss_pred ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h---hh Q lcl|NC_021540. 342 RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R---KF 417 (705) Q Consensus 342 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~---~~ 417 (705) +.+++||+|++|||+|+++++.+..+.++|+++.++|+|+.+|+++|++++++ ++ .++++.+|+++..+. + -+ T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~-~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:99 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QA-KRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cC-CceeeecCcccccHHHHHHhcc Confidence 99999999999999999999998888899999999999999999999998865 34 456688888877542 2 26 Q ss_pred cCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 418 KMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 418 ~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +||+++.+||+. .+...+.+.+.+++|+.++++++++.+.++++|||+++++|..+|+.||.| |++++++|++. T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 471 (714) T protein:vir:99 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGATT 471 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHHHHH Confidence 999999998753 334567888889999999999999999999999999999999999877766 77777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc-------eeeechh---------hcccceeEEeeccc--hh Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE-------FVQINRD---------NLVGSFDIKLSISN--AE 555 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~-------~v~i~~~---------~~~~~~dv~v~~~~--~~ 555 (705) +..++|||+++++.+|+++|+||.+||++++++||+|++ ++.+|+. ...++|||.|+.+. ++ T Consensus 472 l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t 551 (714) T protein:vir:99 472 LAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPA 551 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchH Confidence 999999999999999999999999999999999999752 7888754 34678899887654 45 Q ss_pred HHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQELQMR 628 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~q~e 628 (705) .+.+..+.++++++.++|........ ++.+++++++...+.+++++..+++++.. +.+.+++++++++.+++ T Consensus 552 ~r~~~~~~l~~l~~~~~p~~~~~~~~-~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:99 552 FKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 56777778888887766554433333 44477777777788888776554433211 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-H-Hhhc Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL--DFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFLD-T-NKQG 703 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~--~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~~-~-~~q~ 703 (705) +.+++++++..+++++..++++.+...++..... +...... ...++++ +.++..+..+++..++.++.. . +++. T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:99 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333443333333333333333222222211100 0000011 1111111 111111222223333222221 1 1222 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) +. T Consensus 710 ~~ 711 (714) T protein:vir:99 710 NE 711 (714) T ss_pred Hh Confidence 22 No 8 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=6.6e-91 Score=514.98 Aligned_cols=614 Identities=11% Similarity=0.051 Sum_probs=404.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |.+ |+ ..-+.|+.+=.+..+...+...+..+...+..--+...+..+||+|.=-.. -....|+..+|-+.| T Consensus 1 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:81 1 MKN--ET---NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCc--cc---ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccH Confidence 321 22 222333332222233334444444333333322234457889999863322 123569999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +-+|+|++..-.+ +-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++++++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:81 76 APTVDGVLGMEAK----TRTDLVVMSDEPDDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHHh----CCcceEEecCCCCchhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcCcceEEeccccC Confidence 9999999888754 556699999987665 68999999999987743 3334566899999999999877765310 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ...++++ T Consensus 151 -------------------------------------------------------------------------~~~~~i~ 157 (714) T protein:vir:81 151 -------------------------------------------------------------------------PFGPEFK 157 (714) T ss_pred -------------------------------------------------------------------------CCCCCeE Confidence 0124578 Q ss_pred EEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------hhc-------cccc Q lcl|NC_021540. 235 VTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------STS-------SDHY 295 (705) Q Consensus 235 i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~~~-------~~~~ 295 (705) |++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... ... .... T Consensus 158 i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (714) T protein:vir:81 158 VSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQ 236 (714) T ss_pred EEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhhc Confidence 999999999999965 569999999999999999999998 44322222111000 000 0001 Q ss_pred ccc-ccccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEEE Q lcl|NC_021540. 296 SSD-TSFTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVMI 341 (705) Q Consensus 296 ~~~-~~~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~iL 341 (705) .++ ....+.+..+++|+|+|||+|... ++ ..+.+.++++|+|.++| T Consensus 237 ~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L 316 (714) T protein:vir:81 237 SWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFI 316 (714) T ss_pred cccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCccc Confidence 111 122345566889999999987421 11 12356788999999999 Q ss_pred ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h---hh Q lcl|NC_021540. 342 RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R---KF 417 (705) Q Consensus 342 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~---~~ 417 (705) +.+++||+|++|||+|+++++.+..+.++|+++.++|+|+.+|+++|++++++ ++ .++++.+|+++..+. + -+ T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~-~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:81 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QA-KRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cC-CceeeecCcccccHHHHHHhcc Confidence 99999999999999999999998888899999999999999999999998865 34 456688888877542 2 26 Q ss_pred cCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 418 KMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 418 ~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +||+++.+||+. .+...+.+.+.+++|+.++++++++.+.++++|||+++++|..+|+.||.| |++++++|++. T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 471 (714) T protein:vir:81 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGATT 471 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHHHHH Confidence 999999998753 334567888889999999999999999999999999999999999877766 77777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc-------eeeechh---------hcccceeEEeeccc--hh Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE-------FVQINRD---------NLVGSFDIKLSISN--AE 555 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~-------~v~i~~~---------~~~~~~dv~v~~~~--~~ 555 (705) +..++|||+++++.+|+++|+||.+||++++++||+|++ ++.+|+. ...++|||.|+.+. ++ T Consensus 472 l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t 551 (714) T protein:vir:81 472 LAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPA 551 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchH Confidence 999999999999999999999999999999999999752 7888754 34678899887654 45 Q ss_pred HHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQELQMR 628 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~q~e 628 (705) .+.+..+.++++++.++|........ ++.+++++++...+.+++++..+++++.. +.+.+++++++++.+++ T Consensus 552 ~r~~~~~~l~~l~~~~~p~~~~~~~~-~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:81 552 FKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 56777778888887766554433333 44477777777788888776554433211 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-H-Hhhc Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL--DFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFLD-T-NKQG 703 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~--~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~~-~-~~q~ 703 (705) +.+++++++..+++++..++++.+...++..... +...... ...++++ +.++..+..+++..++.++.. . +++. T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:81 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333443333333333333333222222211100 0000011 1111111 111111222223333222221 1 1222 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) +. T Consensus 710 ~~ 711 (714) T protein:vir:81 710 NE 711 (714) T ss_pred Hh Confidence 22 No 9 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=6.6e-91 Score=514.98 Aligned_cols=614 Identities=11% Similarity=0.051 Sum_probs=404.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |.+ |+ ..-+.|+.+=.+..+...+...+..+...+..--+...+..+||+|.=-.. -....|+..+|-+.| T Consensus 1 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:32 1 MKN--ET---NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCc--cc---ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccH Confidence 321 22 222333332222233334444444333333322234457889999863322 123569999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +-+|+|++..-.+ +-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++++++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:32 76 APTVDGVLGMEAK----TRTDLVVMSDEPDDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHHh----CCcceEEecCCCCchhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcCcceEEeccccC Confidence 9999999888754 556699999987665 68999999999987743 3334566899999999999877765310 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ...++++ T Consensus 151 -------------------------------------------------------------------------~~~~~i~ 157 (714) T protein:vir:32 151 -------------------------------------------------------------------------PFGPEFK 157 (714) T ss_pred -------------------------------------------------------------------------CCCCCeE Confidence 0124578 Q ss_pred EEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------hhc-------cccc Q lcl|NC_021540. 235 VTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------STS-------SDHY 295 (705) Q Consensus 235 i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~~~-------~~~~ 295 (705) |++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... ... .... T Consensus 158 i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (714) T protein:vir:32 158 VSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQ 236 (714) T ss_pred EEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhhc Confidence 999999999999965 569999999999999999999998 44322222111000 000 0001 Q ss_pred ccc-ccccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEEE Q lcl|NC_021540. 296 SSD-TSFTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVMI 341 (705) Q Consensus 296 ~~~-~~~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~iL 341 (705) .++ ....+.+..+++|+|+|||+|... ++ ..+.+.++++|+|.++| T Consensus 237 ~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L 316 (714) T protein:vir:32 237 SWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFI 316 (714) T ss_pred cccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCccc Confidence 111 122345566889999999987421 11 12356788999999999 Q ss_pred ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h---hh Q lcl|NC_021540. 342 RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R---KF 417 (705) Q Consensus 342 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~---~~ 417 (705) +.+++||+|++|||+|+++++.+..+.++|+++.++|+|+.+|+++|++++++ ++ .++++.+|+++..+. + -+ T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~-~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:32 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QA-KRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cC-CceeeecCcccccHHHHHHhcc Confidence 99999999999999999999998888899999999999999999999998865 34 456688888877542 2 26 Q ss_pred cCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 418 KMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 418 ~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +||+++.+||+. .+...+.+.+.+++|+.++++++++.+.++++|||+++++|..+|+.||.| |++++++|++. T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 471 (714) T protein:vir:32 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGATT 471 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHHHHH Confidence 999999998753 334567888889999999999999999999999999999999999877766 77777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc-------eeeechh---------hcccceeEEeeccc--hh Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE-------FVQINRD---------NLVGSFDIKLSISN--AE 555 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~-------~v~i~~~---------~~~~~~dv~v~~~~--~~ 555 (705) +..++|||+++++.+|+++|+||.+||++++++||+|++ ++.+|+. ...++|||.|+.+. ++ T Consensus 472 l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t 551 (714) T protein:vir:32 472 LAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPA 551 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchH Confidence 999999999999999999999999999999999999752 7888754 34678899887654 45 Q ss_pred HHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQELQMR 628 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~q~e 628 (705) .+.+..+.++++++.++|........ ++.+++++++...+.+++++..+++++.. +.+.+++++++++.+++ T Consensus 552 ~r~~~~~~l~~l~~~~~p~~~~~~~~-~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:32 552 FKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 56777778888887766554433333 44477777777788888776554433211 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-H-Hhhc Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL--DFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFLD-T-NKQG 703 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~--~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~~-~-~~q~ 703 (705) +.+++++++..+++++..++++.+...++..... +...... ...++++ +.++..+..+++..++.++.. . +++. T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:32 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333443333333333333333222222211100 0000011 1111111 111111222223333222221 1 1222 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) +. T Consensus 710 ~~ 711 (714) T protein:vir:32 710 NE 711 (714) T ss_pred Hh Confidence 22 No 10 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=6.6e-91 Score=514.98 Aligned_cols=614 Identities=11% Similarity=0.051 Sum_probs=404.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |.+ |+ ..-+.|+.+=.+..+...+...+..+...+..--+...+..+||+|.=-.. -....|+..+|-+.| T Consensus 1 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:10 1 MKN--ET---NTMATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCc--cc---ccccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccH Confidence 321 22 222333332222233334444444333333322234457889999863322 123569999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +-+|+|++..-.+ +-.=+.|.|++++|+ +.|+..|.+++|+.... +.-....++++++|++|.|++.++|++. T Consensus 76 ~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~-~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:10 76 APTVDGVLGMEAK----TRTDLVVMSDEPDDETEKLAEAINAEFADACRLG-NMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred HHHHHHHHhHHHh----CCcceEEecCCCCchhHHHHHHHHHHHHHHHHhh-chhHHHHHHHHHhhhcCcceEEeccccC Confidence 9999999888754 556699999987665 68999999999987743 3334566899999999999877765310 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ...++++ T Consensus 151 -------------------------------------------------------------------------~~~~~i~ 157 (714) T protein:vir:10 151 -------------------------------------------------------------------------PFGPEFK 157 (714) T ss_pred -------------------------------------------------------------------------CCCCCeE Confidence 0124578 Q ss_pred EEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------hhc-------cccc Q lcl|NC_021540. 235 VTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------STS-------SDHY 295 (705) Q Consensus 235 i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~~~-------~~~~ 295 (705) |++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... ... .... T Consensus 158 i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 236 (714) T protein:vir:10 158 VSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQ 236 (714) T ss_pred EEecchhheeeccccccCChhhccceeeeecCCHHHHHHh-cCCchhhhhhhhhhhccccccccccccccccccchhhhc Confidence 999999999999965 569999999999999999999998 44322222111000 000 0001 Q ss_pred ccc-ccccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEEE Q lcl|NC_021540. 296 SSD-TSFTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVMI 341 (705) Q Consensus 296 ~~~-~~~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~iL 341 (705) .++ ....+.+..+++|+|+|||+|... ++ ..+.+.++++|+|.++| T Consensus 237 ~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L 316 (714) T protein:vir:10 237 SWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFI 316 (714) T ss_pred cccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCccc Confidence 111 122345566889999999987421 11 12356788999999999 Q ss_pred ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h---hh Q lcl|NC_021540. 342 RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R---KF 417 (705) Q Consensus 342 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~---~~ 417 (705) +.+++||+|++|||+|+++++.+..+.++|+++.++|+|+.+|+++|++++++ ++ .++++.+|+++..+. + -+ T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~-~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:10 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QA-KRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cC-CceeeecCcccccHHHHHHhcc Confidence 99999999999999999999998888899999999999999999999998865 34 456688888877542 2 26 Q ss_pred cCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 418 KMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 418 ~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +||+++.+||+. .+...+.+.+.+++|+.++++++++.+.++++|||+++++|..+|+.||.| |++++++|++. T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 471 (714) T protein:vir:10 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGATT 471 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHH--HHHHHHHHHHH Confidence 999999998753 334567888889999999999999999999999999999999999877766 77777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc-------eeeechh---------hcccceeEEeeccc--hh Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE-------FVQINRD---------NLVGSFDIKLSISN--AE 555 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~-------~v~i~~~---------~~~~~~dv~v~~~~--~~ 555 (705) +..++|||+++++.+|+++|+||.+||++++++||+|++ ++.+|+. ...++|||.|+.+. ++ T Consensus 472 l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t 551 (714) T protein:vir:10 472 LAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPA 551 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchH Confidence 999999999999999999999999999999999999752 7888754 34678899887654 45 Q ss_pred HHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQELQMR 628 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~q~e 628 (705) .+.+..+.++++++.++|........ ++.+++++++...+.+++++..+++++.. +.+.+++++++++.+++ T Consensus 552 ~r~~~~~~l~~l~~~~~p~~~~~~~~-~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:10 552 FKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 56777778888887766554433333 44477777777788888776554433211 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-H-Hhhc Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL--DFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFLD-T-NKQG 703 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~--~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~~-~-~~q~ 703 (705) +.+++++++..+++++..++++.+...++..... +...... ...++++ +.++..+..+++..++.++.. . +++. T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q~~~~~~ 709 (714) T protein:vir:10 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRM 709 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHHHHHHHH Confidence 3333443333333333333333222222211100 0000011 1111111 111111222223333222221 1 1222 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) +. T Consensus 710 ~~ 711 (714) T protein:vir:10 710 NE 711 (714) T ss_pred Hh Confidence 22 No 11 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=1.9e-89 Score=506.96 Aligned_cols=613 Identities=11% Similarity=0.037 Sum_probs=398.5 Q ss_pred CcchhhhhhcccccccCCCCCCHH-HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKP-KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKL 75 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~ 75 (705) |++ ..+.+...-+++ .+.- ....|...+.+ ......--+...++.+||+|.=-.. -.+..|+..++-+. T Consensus 1 ~~~-~~~~~~~~~~~~----~~~~~~~~~l~~~~~~-~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~ 74 (714) T protein:vir:10 1 MKN-EINTTAMKNDHG----STPRFSQRQLLSLCSD-IDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNL 74 (714) T ss_pred CCc-CcCcccCCCcch----hhhhhhHHHHHHHHHH-HhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEecc Confidence 544 111111222222 2221 22222222222 2222222234568899999863321 12356999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchH--HHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 76 IRKQAEWRYSALSEPFLNDENIFSIAPKTWQDR--EAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 76 v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~--~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) |+-+|+|++....+ +-.=+.|.|++++|+ +.|+..|.+++|+....+-. ....+++.+++.+|.|+++++|++ T Consensus 75 i~~~v~~v~g~~~~----nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~-~~~s~af~~~~~~G~G~~~~~~d~ 149 (714) T protein:vir:10 75 IAPTVDGVLGMEAK----TRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMN-KARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred HHHHHHHHHHHHHh----CCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchh-HHHHHHHHHhhhcccceEEeeecc Confidence 99999999888754 555689999987765 68999999999987754433 356689999999999977776631 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) . ...+++ T Consensus 150 d-------------------------------------------------------------------------~~~~~i 156 (714) T protein:vir:10 150 E-------------------------------------------------------------------------PFGPEF 156 (714) T ss_pred C-------------------------------------------------------------------------CCCCCe Confidence 1 113467 Q ss_pred eEEEechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh-----------h-------hcccc Q lcl|NC_021540. 234 EVTICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST-----------S-------TSSDH 294 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~-----------~-------~~~~~ 294 (705) +|++|||++|||||++ +.|++||+|++|++|||+++++++ |+...+.+...... . ..+.. T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~-fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:10 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKAT-FPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHh-cCCchhhhhccchhhcCcccchhhhhhcccccccchhh Confidence 8999999999999965 579999999999999999999998 44322222211000 0 00011 Q ss_pred cccccc-ccccccccCeEEEEEEEEEeee---------------cC------------------CCeeEEEEEEEECCEE Q lcl|NC_021540. 295 YSSDTS-FTFSDKARKKIVVYEYWGYWDI---------------DG------------------SGVTTPIVASWVDDVM 340 (705) Q Consensus 295 ~~~~~~-~~~~~~~~~~v~v~E~w~k~~~---------------~~------------------dg~~~~~~~~~~g~~i 340 (705) ..++.. ..+.+..+++|+|+|||++... ++ ..+.+.++++|+|.++ T Consensus 236 ~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:10 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred cccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchh Confidence 112211 2234566789999999988321 11 1234567889999999 Q ss_pred EecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-hh--- Q lcl|NC_021540. 341 IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-RK--- 416 (705) Q Consensus 341 L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~~--- 416 (705) |+.+++||+|++|||+|+|+++.+..+.++|+++.++|+|+.+|+++|++++.+ +..++++.+|+++..+. +. T Consensus 316 L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l---~~~~~~~~~gav~~~d~~~~e~~ 392 (714) T protein:vir:10 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL---QAKRVIMDEDATQLSDNDLMEQL 392 (714) T ss_pred hhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH---hCCceeeccccccccHHHHHHhc Confidence 999999999999999999999999888899999999999999999999998866 34478889999977543 22 Q ss_pred hcCCcceeecCCc----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHH Q lcl|NC_021540. 417 FKMGEDYKYNPGT----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGK 492 (705) Q Consensus 417 ~~pg~~i~~~~~~----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~ 492 (705) ++||++|.+|++. .+...+.+.+++++|+.++++++++...++++|||+++++|..+|+.||+| |++++++|++ T Consensus 393 ~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA--I~~r~~qg~~ 470 (714) T protein:vir:10 393 ERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA--ISNLVEQGAT 470 (714) T ss_pred cCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHH--HHHHHHHHHH Confidence 5899999998743 344568888899999999999999999999999999999999999877766 7778899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecC-------ceeeech---------hhcccceeEEeecc--ch Q lcl|NC_021540. 493 RELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDE-------EFVQINR---------DNLVGSFDIKLSIS--NA 554 (705) Q Consensus 493 ~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~-------~~v~i~~---------~~~~~~~dv~v~~~--~~ 554 (705) .+..++|||+++++.+|+++|+||.+||++++++||+++ .++.+|. +...++|||.|+++ ++ T Consensus 471 ~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~ 550 (714) T protein:vir:10 471 TLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcH Confidence 999999999999999999999999999999999999975 2677774 33457889888665 45 Q ss_pred hHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhh-------HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 555 ETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQ-------AQLEIQIKQLEAQELQM 627 (705) Q Consensus 555 ~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~-------~q~~~q~~q~~~q~~q~ 627 (705) +.+.+..+.++++++.++|.+....+. ++.++++.++...+.+++++..+++.+. ++.+.++++++.++.++ T Consensus 551 s~r~~~~~~l~ql~~~~~p~~~~~~~~-~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l 629 (714) T protein:vir:10 551 AFKAQLAQRMSEVIQGLPPQVQAVVLD-LWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHH-HHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHHHHHHHHH Confidence 567777778888887665544433333 3345666666666777766554432221 11111222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH--HHHhh Q lcl|NC_021540. 628 RIAKLQAEIQLMPYEAQAEAAKARKANTEADLN--TLDFVEQETGVKQEREL-ELMQAQAKGNTQRDIVKTFL--DTNKQ 702 (705) Q Consensus 628 e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~--~~~~~~q~~~~kq~~e~-e~~~~q~~~~~~~~~~k~~~--~~~~q 702 (705) ++.+++++++..+++++..++++.+...++... .++.+.... ...++++ ++.+.....+++...+.++. ..++| T Consensus 630 ~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~-~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q~~~~~ 708 (714) T protein:vir:10 630 QMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQR 708 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHH Confidence 333444444333333333233322222222111 111111111 1111111 11111111111222221111 11122 Q ss_pred ccC Q lcl|NC_021540. 703 GNQ 705 (705) Q Consensus 703 ~~~ 705 (705) .++ T Consensus 709 ~~~ 711 (714) T protein:vir:10 709 MNE 711 (714) T ss_pred HHh Confidence 222 No 12 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=4e-89 Score=505.21 Aligned_cols=610 Identities=14% Similarity=0.088 Sum_probs=399.9 Q ss_pred cchhhhhhccccccc-CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 2 SDINEEFLEDTVPSL-QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 2 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) -+|-|.. .++.+ |.+=.+.++...+...|..+...+....+...+..+||+|+=-.. -....|+..++-+.| T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i 77 (772) T protein:vir:10 1 MQITEND---RQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLI 77 (772) T ss_pred CCcchhh---HHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcch Confidence 2333322 22222 222234444445555555444443333444567889999863322 123579999999999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCC-cchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKT-WQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEE 155 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~-~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~ 155 (705) +-+|+|++..-.+ +-.=+.|.|.+ .+|++.|+..|.+++|+.... +.-....++++++|++|.|++.++++.. T Consensus 78 ~~~v~~v~g~~~~----nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~-~~~~~~s~Af~~~i~~G~Gw~e~~~~~d- 151 (772) T protein:vir:10 78 GPALLSLQGYEAV----TRTDWRVTPNGDVGGQEVADALNYRLNTAERQS-GADRACSEAFRPQIACGIGWVEVSRESD- 151 (772) T ss_pred HHHHHHHHHHHHh----cCcceEEecCCCchHHHHHHHHHHHHHHHHHhc-ChHHHHHHHHHHhhhcCceeEEeccccC- Confidence 9999999888754 56669999985 699999999999999986643 3334466899999999999655433100 Q ss_pred hhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceE Q lcl|NC_021540. 156 TKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEV 235 (705) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i 235 (705) ...+.++| T Consensus 152 ------------------------------------------------------------------------~~~~~i~i 159 (772) T protein:vir:10 152 ------------------------------------------------------------------------PFKFPYRC 159 (772) T ss_pred ------------------------------------------------------------------------CCCCCeEE Confidence 01235789 Q ss_pred EEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhh--hhh-------c-------------cc Q lcl|NC_021540. 236 TICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSS--TST-------S-------------SD 293 (705) Q Consensus 236 ~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~--~~~-------~-------------~~ 293 (705) ++|+|++|||||+|+.|++||+|+++.+|||+++++++ |+...+.+..... ... . .. T Consensus 160 ~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~-fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (772) T protein:vir:10 160 RPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALV-FPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNE 238 (772) T ss_pred EeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHh-CCCchhHHHhhhhhcccccCcccccccccccccccccccch Confidence 99999999999998779999999999999999999998 4432222211100 000 0 00 Q ss_pred cccc-cccccccccccCeEEEEEEEEEeee--------cCCC-------------------------eeEEEEEEEECCE Q lcl|NC_021540. 294 HYSS-DTSFTFSDKARKKIVVYEYWGYWDI--------DGSG-------------------------VTTPIVASWVDDV 339 (705) Q Consensus 294 ~~~~-~~~~~~~~~~~~~v~v~E~w~k~~~--------~~dg-------------------------~~~~~~~~~~g~~ 339 (705) ...+ .....+.+.++++|+|+|||+|..+ +|.+ ..+.++++|+|.+ T Consensus 239 ~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~ 318 (772) T protein:vir:10 239 ARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPH 318 (772) T ss_pred hhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecce Confidence 0001 1112344566889999999998532 1111 2356778999999 Q ss_pred EEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h--- Q lcl|NC_021540. 340 MIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R--- 415 (705) Q Consensus 340 iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~--- 415 (705) +|+.+++||+|++|||||+|+++++.++.+||+++.++|+|+++|++.|+++++++.+ ++++++|+|++.+. + T Consensus 319 ~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~---~~~~~~gav~~~d~~~~e~ 395 (772) T protein:vir:10 319 CLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVA---RVERTKGAVAMTDAQFRRQ 395 (772) T ss_pred eeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhcc---cccccCCCccchhHHHHHh Confidence 9999999999999999999999999999999999999999999999999999988554 68899999998663 2 Q ss_pred hhcCCcceeecCCcc--cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 416 KFKMGEDYKYNPGTN--PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 416 ~~~pg~~i~~~~~~~--~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) -.+|+++|.+|++.. +...+.+.+.+.+++.++++++...++++++|||+++++|..+|+.||+| |++++++|++. T Consensus 396 ~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvA--i~~rq~qg~~~ 473 (772) T protein:vir:10 396 IARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQ--EQQQIEQSNQS 473 (772) T ss_pred ccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHH--HHHHHHHHHHH Confidence 258999999998754 34567788889999999999999999999999999999999988877766 66777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCc------eeeech--------------hhcccceeEEeeccc Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEE------FVQINR--------------DNLVGSFDIKLSISN 553 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~------~v~i~~--------------~~~~~~~dv~v~~~~ 553 (705) +..++|||+++++.+|+++|+||.+||+++|++||+|++ ++.||. +...++|||.|+.+. T Consensus 474 l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p 553 (772) T protein:vir:10 474 IGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVP 553 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeeccc Confidence 999999999999999999999999999999999999753 466653 345678998887764 Q ss_pred --hhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 554 --AETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 554 --~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k 631 (705) ++.+++..+.++++++.+.|.+....+ .++.+++++++...+.+.+++..+++++.+++..+ .+..+..+++ T Consensus 554 ~~~t~r~~~~~~m~ql~~~~~P~~~~~~~-~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~-----~q~~qq~~~~ 627 (772) T protein:vir:10 554 STNSYRGQQLNAMSEAVKSMPPQYQAAVL-PFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQI-----DQAVQDALAK 627 (772) T ss_pred cchHHHHHHHHHHHHHHhccChhHHHHHH-HHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHH-----HHHHHHHHHH Confidence 344555566666665544333322222 22345556666677888887766554443322111 1112222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 632 LQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 632 ~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) ++++++..+..++..+..++..+.+++........+..+++..+....+.+.+..-.. .+.++-....+.+.. T Consensus 628 ~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~-~l~~~g~~~~~~~~~ 700 (772) T protein:vir:10 628 AGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADA-VMQSAGYQRPNPAGD 700 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHH-HHHhccccccccccc Confidence 3444433333333333333222222222222222222222222111111111110000 000111110011111 No 13 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=5.4e-82 Score=466.13 Aligned_cols=595 Identities=15% Similarity=0.136 Sum_probs=377.9 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC----CCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP----KQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~grs~~v~~~v 76 (705) |+ + +..++..++..|..+......--....+.++||+|.--+.. ....||. +-+.| T Consensus 1 m~--------d----------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i 60 (725) T protein:vir:92 1 MA--------D----------NENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRG--QFDVV 60 (725) T ss_pred CC--------c----------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccch Confidence 22 1 22467777777776666555444455678999998644321 2234654 56999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +-+|+|++..-.+ +-.-+.|.|+.++|++.|+..|.+++|+.. .++.-....++++++|.||.|.+.+.|++... T Consensus 61 ~~~i~~v~g~e~~----nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:92 61 RPVVRKLVSEMRQ----NPIDVLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNVAVREQIESGVGAWRLVTDYEDQ 135 (725) T ss_pred HHHHHHHHhhHHh----CCcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchHHHHHHHHHHhhcCcceeeeeecccCC Confidence 9999999776644 677799999999999999999999999855 55555566799999999999977766543210 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc--ce Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ--PE 234 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~ 234 (705) .. ..+. |+ T Consensus 136 d~----------------------------------------------------------------------~~~~~~i~ 145 (725) T protein:vir:92 136 SP----------------------------------------------------------------------TSNNQVIR 145 (725) T ss_pred CC----------------------------------------------------------------------CCCceeeE Confidence 00 0011 22 Q ss_pred EEEe--chhheeeCCCcc-CChhhCCeEEEEEeccHHHHHHh--cCCcCcchhhhhhhhhhccccccccccccccccccC Q lcl|NC_021540. 235 VTIC--DYHNVTIDPTCN-GNLDEAKFVIYSFESSRSDLEKY--GIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARK 309 (705) Q Consensus 235 i~~V--~~~~~~~Dp~a~-~d~~da~~~~~~~~~t~~el~~~--g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (705) +..| |+.+|||||.++ .|++||+|+|++.||+++++..+ .|..+.... . ......+ +.....+++ T Consensus 146 ~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~----~--~~~~~~~----~~~~~~~~d 215 (725) T protein:vir:92 146 REPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDI----P--SFQNPND----WVFPWLTQD 215 (725) T ss_pred EeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhh----h--hcccCCc----ccccccCCC Confidence 3333 566799999764 69999999999999999876543 122211100 0 0011111 111123457 Q ss_pred eEEEEEEEEEeeec-----------C-------------------CCe--------eEEEE--EEEECCEEEecccCCCC Q lcl|NC_021540. 310 KIVVYEYWGYWDID-----------G-------------------SGV--------TTPIV--ASWVDDVMIRLEKNPYP 349 (705) Q Consensus 310 ~v~v~E~w~k~~~~-----------~-------------------dg~--------~~~~~--~~~~g~~iL~~~~~p~~ 349 (705) +|+|+|||++..+. | .|. ...++ .+++|.++| .+++||+ T Consensus 216 ~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l-~~~~~~~ 294 (725) T protein:vir:92 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL-KDKQLIA 294 (725) T ss_pred eEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhh-cCCCCCC Confidence 89999999975321 1 111 11122 234566665 4578999 Q ss_pred CCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcce---- Q lcl|NC_021540. 350 DGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDY---- 423 (705) Q Consensus 350 ~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i---- 423 (705) |++|||||+++++. ++..|++|+++.++|+|+.+|+++|+++++++++++.+++++.|+++.......+|..+. T Consensus 295 ~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) T protein:vir:92 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) T ss_pred CCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeec Confidence 99999999999976 667778899999999999999999999999999999999999999987655555555442 Q ss_pred ---eecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 424 ---KYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRR 500 (705) Q Consensus 424 ---~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n 500 (705) ..++|..+...+.+.+.|++|+++++|++...+.++++|||++.++|..+|+.|+.| |.+++++|++.+..++|| T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~l~~~~Dn 452 (725) T protein:vir:92 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT--VNQLNMRADLETYVFQDN 452 (725) T ss_pred cccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHH--HHHHHHHHHHHHHHHHHH Confidence 335666666778888999999999999999999999999999999999988766655 667779999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceeEeEecC----ceeeech-------------hhcccceeEEeecc--chhHHHHHH Q lcl|NC_021540. 501 LANGLTEVAKKILAMNSVWLSDEEVIRITDE----EFVQINR-------------DNLVGSFDIKLSIS--NAETDAIKA 561 (705) Q Consensus 501 ~~~~~~~~~~~~l~li~q~~~~~~~iri~~~----~~v~i~~-------------~~~~~~~dv~v~~~--~~~~~~~~~ 561 (705) |+.+++.+|+++|+||.+||++++++||+|+ .++.||. .++.|+|||.|+++ +++.+++.. T Consensus 453 l~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~ 532 (725) T protein:vir:92 453 LATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHH Confidence 9999999999999999999999999999986 4787875 34567899998775 456677778 Q ss_pred HHHHHHHHHHhhhchhHHHHHHHH---HHHhhhccchhhhhhhccccc-----chhh--HHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021540. 562 QELSFMLQTMGQSLPFDMTKLILG---EIAKLRGMPDLSKMISKYNPE-----PSPQ--AQLEIQIKQLEAQELQMRIA- 630 (705) Q Consensus 562 q~~~~llq~~~~~~~~~~~~~il~---~l~e~~~~~~~~~~~~~~~~q-----~~~~--~q~~~q~~q~~~q~~q~e~~- 630 (705) ..+++|++++++..|. ...++. .+++..+...+.+.++...++ |... ++...++++++.++.++++. T Consensus 533 ~~l~ql~~~~~~~~~~--~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~ 610 (725) T protein:vir:92 533 AEILELLGKTPQGTPE--YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQ 610 (725) T ss_pred HHHHHHHHhcccchhH--HHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHH Confidence 8888888877665442 122222 333344455555555432211 1111 11111122222222222222 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H-----------------------HHHHHHHHHHHHH Q lcl|NC_021540. 631 ------KLQAEIQLMPYEAQAEAAKARKANTEADLNTLDF---V-----------------------EQETGVKQERELE 678 (705) Q Consensus 631 ------k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~---~-----------------------~q~~~~kq~~e~e 678 (705) +++++++..+++....++++.+.+.+++..+++. . +++...+..+|+. T Consensus 611 ~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~ 690 (725) T protein:vir:92 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELL 690 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHH Confidence 2222221111111111111111111111111000 0 0001111112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 679 LMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 679 ~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) +++.++..+++.++.++...++.+++- T Consensus 691 l~~~~~~~~~~~d~~~~~~~~~~~~~~ 717 (725) T protein:vir:92 691 LKGNEQTHKQRMDIANILQSQRQNQPS 717 (725) T ss_pred HHHHHHHHHHHHHHHHHhcchhccCCc Confidence 333333333333333333333333333 No 14 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=1.9e-81 Score=463.11 Aligned_cols=595 Identities=15% Similarity=0.144 Sum_probs=382.6 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |++ +..++..++..|..+......--....+.++||+|.--+. .....||. +-+.| T Consensus 1 m~d------------------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i 60 (725) T protein:vir:77 1 MAD------------------NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVV 60 (725) T ss_pred CCc------------------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ccccH Confidence 221 2336777777777666655544444567889999864432 12234654 55999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +-+|+|++.+-.+ +-.-+.|.|+.++|++.|+..|.+++|+.. .++.-....++++++|.+|.|.+.+.|++... T Consensus 61 ~~~i~~v~g~~~~----nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:77 61 RPVVRKLVSEMRQ----NPIDVLYRPKDGARPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) T ss_pred HHHHHHHHhhHHh----CCcceEEecCCccHHHHHHHHHHHHHHHHH-hhCchhHHHHHHHHHhhcCcceeeeeecccCC Confidence 9999999777655 777799999999999999999999999855 55555556799999999999987776543311 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEE Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVT 236 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~ 236 (705) .. ..+.++|+ T Consensus 136 d~----------------------------------------------------------------------~~~~~~i~ 145 (725) T protein:vir:77 136 SP----------------------------------------------------------------------TSNNQVIR 145 (725) T ss_pred CC----------------------------------------------------------------------CCCceeeE Confidence 00 00112222 Q ss_pred ----EechhheeeCCCcc-CChhhCCeEEEEEeccHHHHHHh--cCCcCcchhhhhhhhhhccccccccccccccccccC Q lcl|NC_021540. 237 ----ICDYHNVTIDPTCN-GNLDEAKFVIYSFESSRSDLEKY--GIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARK 309 (705) Q Consensus 237 ----~V~~~~~~~Dp~a~-~d~~da~~~~~~~~~t~~el~~~--g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (705) +.+|.+|||||.++ .|++||+|+|+.+||+++++..+ .|..+.... ........ +.+...+++ T Consensus 146 ~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~------~~~~~~~~----~~~~~~~~d 215 (725) T protein:vir:77 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDI------PSFQNPND----WVFPWLTQD 215 (725) T ss_pred EeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhc------cccccccc----ccccccCCC Confidence 23788999999764 69999999999999999977654 122111000 00001111 111122457 Q ss_pred eEEEEEEEEEeeec------------------------------CCCee--------EEEE--EEEECCEEEecccCCCC Q lcl|NC_021540. 310 KIVVYEYWGYWDID------------------------------GSGVT--------TPIV--ASWVDDVMIRLEKNPYP 349 (705) Q Consensus 310 ~v~v~E~w~k~~~~------------------------------~dg~~--------~~~~--~~~~g~~iL~~~~~p~~ 349 (705) +|+|+|||++..+. +.|.. ..++ ++|.|.++| .+++||+ T Consensus 216 ~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l-~~~~~~~ 294 (725) T protein:vir:77 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL-KDKQLIA 294 (725) T ss_pred eeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceee-ccCCcCC Confidence 79999999975321 11211 1122 224455544 5788999 Q ss_pred CCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcc----- Q lcl|NC_021540. 350 DGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGED----- 422 (705) Q Consensus 350 ~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~----- 422 (705) |++|||||+++++. ++..|++|+++.++|+|+.+|+++|+++++++++++.++++..|+++..+....+|+++ T Consensus 295 ~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) T protein:vir:77 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) T ss_pred CCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecc Confidence 99999999999975 67777889999999999999999999999999999999999999998766554455443 Q ss_pred --eeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 423 --YKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRR 500 (705) Q Consensus 423 --i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n 500 (705) +..++|..+.+.+.+.+.|++|+.+++|++.....++++|||+++++|..+|++||.| |.+++++|.+.+..++|| T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~~~~~~Dn 452 (725) T protein:vir:77 375 NRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDT--VNQLNMRADLETYVFQDN 452 (725) T ss_pred cccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHH--HHHHHHHHHHHHHHHHHH Confidence 5557777777788899999999999999999999999999999999999998766655 677779999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceeEeEecCc----eeeech-------------hhcccceeEEeecc--chhHHHHHH Q lcl|NC_021540. 501 LANGLTEVAKKILAMNSVWLSDEEVIRITDEE----FVQINR-------------DNLVGSFDIKLSIS--NAETDAIKA 561 (705) Q Consensus 501 ~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~----~v~i~~-------------~~~~~~~dv~v~~~--~~~~~~~~~ 561 (705) |+.+++.+|+++|+||.+||++++++||+|++ ++.||. .++.|+|||.|+++ +++.+++.. T Consensus 453 l~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~ 532 (725) T protein:vir:77 453 LATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHH Confidence 99999999999999999999999999999874 788874 24567899998775 456677888 Q ss_pred HHHHHHHHHHhhhchhHHHHHHHHH---HHhhhccchhhhhhhccccc-----ch-h-hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 562 QELSFMLQTMGQSLPFDMTKLILGE---IAKLRGMPDLSKMISKYNPE-----PS-P-QAQLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 562 q~~~~llq~~~~~~~~~~~~~il~~---l~e~~~~~~~~~~~~~~~~q-----~~-~-~~q~~~q~~q~~~q~~q~e~~k 631 (705) +.+++|++++++..|. ...++.. +++..+...+.+.+++..++ +. + .++..++.+++++++.++++.+ T Consensus 533 ~~l~qll~~~~~~~~~--~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q 610 (725) T protein:vir:77 533 AEILELLGKTPQGTPE--YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQ 610 (725) T ss_pred HHHHHHHHhccccchh--HHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHH Confidence 8888888887765442 2222223 33334444445444432211 11 1 1111111122222222222222 Q ss_pred -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------------HHHHHHHHHHHHHH Q lcl|NC_021540. 632 -------LQAEIQLMPYEAQAEAAKARKANTEADLNTLDF--------------------------VEQETGVKQERELE 678 (705) Q Consensus 632 -------~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~--------------------------~~q~~~~kq~~e~e 678 (705) ++++++..+++....++++.+.+.++...+.+. .+++..+++.+|+. T Consensus 611 ~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~ 690 (725) T protein:vir:77 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELL 690 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH Confidence 222211111111000001100001100000000 00001111222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 679 LMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 679 ~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) +++++...+++.++.++...++.+++= T Consensus 691 ~~~~~~~~~q~~~~~~~~~~~~~~~~~ 717 (725) T protein:vir:77 691 LKGDEQTHKQRMDIANILQSQRQNQPS 717 (725) T ss_pred HHhhhHHHhhHHHHHHHHHHHHhcCCC Confidence 344444445555555544444444433 No 15 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=5.7e-80 Score=455.02 Aligned_cols=604 Identities=14% Similarity=0.121 Sum_probs=376.9 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCC----------CCCCCCcCCCHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPK----------QQVGRSSVQPKLIRK 78 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~grs~~v~~~v~~ 78 (705) |.+ ++..++..++..|+.+......--....+..+||+++|.=-+. ...||..++.+.|+- T Consensus 1 m~e---------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~ 71 (706) T protein:vir:10 1 MAE---------SRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVAT 71 (706) T ss_pred CCc---------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHH Confidence 222 3446888899999988888765555555667889876522111 234899999999999 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEeCCC-cchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhh Q lcl|NC_021540. 79 QAEWRYSALSEPFLNDENIFSIAPKT-WQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETK 157 (705) Q Consensus 79 ~~e~~~~~l~~~f~~~~~~~~~~p~~-~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~ 157 (705) +|+|+++...+ +-.=+.|.|.. .+|++.|+..|.+++|+. ..++.-....+++++++++|.|++++.-+++.. T Consensus 72 ~v~~v~g~~~~----nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~-~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~- 145 (706) T protein:vir:10 72 ELNRIISEYRN----NRISVKFRPGDNAASEELANKLNGLFRADY-EETDGGEACDNAFDDAATGGFGCFRLTTSFVNE- 145 (706) T ss_pred HHHHHhhHHHh----CCCceEEecCCCCchHHHHHHHHHHHHHHH-HhcCchHHHHHHHHHHhhcCcceEEeeeccccc- Confidence 99999998865 44459999964 668999999999999984 456666667799999999999966654221100 Q ss_pred hhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEE Q lcl|NC_021540. 158 VTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTI 237 (705) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~ 237 (705) ........+|.|++ T Consensus 146 ------------------------------------------------------------------~d~~~~~~~i~i~~ 159 (706) T protein:vir:10 146 ------------------------------------------------------------------YDPMDERQRIAVEP 159 (706) T ss_pred ------------------------------------------------------------------cCCCCCCccceeee Confidence 00001133556676 Q ss_pred e-ch-hheeeCCCc-cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccc-ccCeEEE Q lcl|NC_021540. 238 C-DY-HNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDK-ARKKIVV 313 (705) Q Consensus 238 V-~~-~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~v 313 (705) | +| .+|||||.+ +.|++||+|+++++|||+++++++ |++....+....+.....++...++. ..... .++.+.+ T Consensus 160 v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-fp~~~~~~~~~~~~~~~~d~~~~d~~-~~~eyy~~~~~~~ 237 (706) T protein:vir:10 160 IYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSE-YDKAPTSLDRVGSVSWQYDWFTPDVV-YIAKYYEVRKESV 237 (706) T ss_pred eccchhceecCchhcccChhhcceEeeeecCCHHHHHHh-cCCChhhhhhhccccccccccCCCcc-eecccccccceeE Confidence 5 34 589999966 469999999999999999999998 44332222221111111111111111 11111 1223334 Q ss_pred EEEEEEeeecC-------------------CCee----------EEEEEEEECCEEEecccCCCCCCCcceEEeeeeee- Q lcl|NC_021540. 314 YEYWGYWDIDG-------------------SGVT----------TPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPV- 363 (705) Q Consensus 314 ~E~w~k~~~~~-------------------dg~~----------~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~- 363 (705) ..||++.+..+ .|.. +.+..+++|.++| .+++||+|++|||||+++++. T Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l-~~~~p~~~~~~P~vP~~g~r~~ 316 (706) T protein:vir:10 238 DVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWF 316 (706) T ss_pred EEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecccccc-ccCCCCCCCccceEEEeecccc Confidence 44666533211 1211 1133345676666 679999999999999999986 Q ss_pred -cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhh---hcCC------c---ceeecCCc- Q lcl|NC_021540. 364 -KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERK---FKMG------E---DYKYNPGT- 429 (705) Q Consensus 364 -~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~---~~pg------~---~i~~~~~~- 429 (705) +++..+||+++.++|+|+.+|+++|+++++++++.+...+ ++++..+.+. ..+. . .+..++|. T Consensus 317 ~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i 393 (706) T protein:vir:10 317 IDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPI---VDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNV 393 (706) T ss_pred ccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccc---cchhHHHHHHHHhhhcccccccchhcccccCCCCcc Confidence 7888899999999999999999999999999877654444 4433322110 0110 0 11112221 Q ss_pred -ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 430 -NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEV 508 (705) Q Consensus 430 -~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~ 508 (705) .+.+.+....+|.+++.+++++++....++++|||+++++|..+| .| +.+|++++++|++.+..++|||+++++.+ T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~S--G~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~ 470 (706) T protein:vir:10 394 VAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-VA--RETVNSLLNRSDMASFIYLDNMAKSLKRA 470 (706) T ss_pred cccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-hH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123345666788899999999999999999999999999998765 34 45588888999999999999999999999 Q ss_pred HHHHHHHHHHhcCCceeEeEecC----ceeeech--------------hhcccceeEEeecc--chhHHHHHHHHHHHHH Q lcl|NC_021540. 509 AKKILAMNSVWLSDEEVIRITDE----EFVQINR--------------DNLVGSFDIKLSIS--NAETDAIKAQELSFML 568 (705) Q Consensus 509 ~~~~l~li~q~~~~~~~iri~~~----~~v~i~~--------------~~~~~~~dv~v~~~--~~~~~~~~~q~~~~ll 568 (705) |+++|+||.+||+++|++||+|+ +++.||. +...|+|||.|+.+ +++.+.+..+.+++|+ T Consensus 471 g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~ 550 (706) T protein:vir:10 471 GEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLL 550 (706) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHH Confidence 99999999999999999999985 4677763 44567899988764 4567888888899999 Q ss_pred HHHhhhchhH-HHHHHHHHHHhhhccchhhhhhhcccccchh-----hHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 569 QTMGQSLPFD-MTKLILGEIAKLRGMPDLSKMISKYNPEPSP-----QAQ--LEIQIKQLEAQELQMRIAKLQAEIQLMP 640 (705) Q Consensus 569 q~~~~~~~~~-~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~-----~~q--~~~q~~q~~~q~~q~e~~k~qa~~q~~~ 640 (705) +.++|..+.. .+..++.+++++++...+.+.+++..++... +++ ...+++++++++.+.++.++++++...+ T Consensus 551 ~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~q 630 (706) T protein:vir:10 551 QGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQ 630 (706) T ss_pred HhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9887754422 1223345667777777888877654432211 111 1111222223333333333333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 641 YEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 641 ~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) ++++..++++.+.+.++...+++..+++++..+..... .+...++..+..+...+....+.+ T Consensus 631 A~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a---~~~~~~~~~q~~q~l~~~~a~q~~ 692 (706) T protein:vir:10 631 AEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQA---RNIDDKAVMETLRLLKEVAASQQQ 692 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhccC Confidence 33222222222222222223333323332222211111 111112222222222221111112 No 16 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=2.3e-80 Score=457.17 Aligned_cols=595 Identities=15% Similarity=0.139 Sum_probs=371.9 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC----CCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK----PKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~grs~~v~~~v 76 (705) |+ + +..++..++..|..+......--....+..+||+|.=-+. ..+..||. +-+.| T Consensus 1 m~--------d----------~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp--~~N~i 60 (725) T protein:vir:10 1 MA--------D----------NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVV 60 (725) T ss_pred CC--------c----------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccch Confidence 22 1 2235666777766666555443344557789999753321 12234554 56999 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +-+|+|++..-.+ +-.=+.|.|+.++|++.|+..|.+++|+.. .++.-....+++.++|.||.|.+.+.|++... T Consensus 61 ~~~v~~v~g~e~~----nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:10 61 RPVVRKLVSEMRQ----NPIDVLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) T ss_pred HHHHHHHHhhHHh----CCcceEEecCCcchHHHHHHHHHHHHHHHH-hcCcchHHhHHHHHHhhcCcceeeeeccccCC Confidence 9999999887755 556699999999999999999999999844 34444445689999999999987776543310 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc--ce Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ--PE 234 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~ 234 (705) -. + .+. |+ T Consensus 136 d~-------------------------------------------------------------~---------~~~~~i~ 145 (725) T protein:vir:10 136 SP-------------------------------------------------------------T---------SNNQVIR 145 (725) T ss_pred CC-------------------------------------------------------------C---------CCceeee Confidence 00 0 011 22 Q ss_pred EE--EechhheeeCCCc-cCChhhCCeEEEEEeccHHHHHHh--cCCcCcchhhhhhhhhhccccccccccccccccccC Q lcl|NC_021540. 235 VT--ICDYHNVTIDPTC-NGNLDEAKFVIYSFESSRSDLEKY--GIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARK 309 (705) Q Consensus 235 i~--~V~~~~~~~Dp~a-~~d~~da~~~~~~~~~t~~el~~~--g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (705) +. +.||.+|||||.+ +.|++||+|+|+.+||++..+... .|+.+..... ..... ..+.+...+++ T Consensus 146 ~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~~----~~~~~~~~~~~ 215 (725) T protein:vir:10 146 REPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIP------SFQNP----NDWVFPWLTQD 215 (725) T ss_pred eeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccc------ccccc----ccccccccCCC Confidence 22 3478899999966 569999999999999998654321 1222111000 00001 11122233467 Q ss_pred eEEEEEEEEEeeec-----------C-------------------CCe--------eEEEE--EEEECCEEEecccCCCC Q lcl|NC_021540. 310 KIVVYEYWGYWDID-----------G-------------------SGV--------TTPIV--ASWVDDVMIRLEKNPYP 349 (705) Q Consensus 310 ~v~v~E~w~k~~~~-----------~-------------------dg~--------~~~~~--~~~~g~~iL~~~~~p~~ 349 (705) +|+|+|||++.++. | .|. ...++ .+|+|.++| .+++||+ T Consensus 216 ~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l-~~~~~~~ 294 (725) T protein:vir:10 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVL-KDKQLIA 294 (725) T ss_pred eEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhh-cCCCCCC Confidence 79999999986421 1 111 11122 234566666 4578999 Q ss_pred CCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceee-- Q lcl|NC_021540. 350 DGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKY-- 425 (705) Q Consensus 350 ~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~-- 425 (705) |++|||||+++++. ++..|++|++|.++|+|+.+|+++|+++++++++++.+++++.++++.......+|+.+..+ T Consensus 295 ~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~ 374 (725) T protein:vir:10 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) T ss_pred CCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeec Confidence 99999999999975 66777889999999999999999999999999999999999999998766555666665433 Q ss_pred -----cCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 426 -----NPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRR 500 (705) Q Consensus 426 -----~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n 500 (705) ++|..+...+.+.+.|++|+++++|+++..+.++++|||+++++|..+|+.|+.| |.+++++|++.+..++|| T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a--i~~rq~qg~~~l~~~~Dn 452 (725) T protein:vir:10 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT--VNQLNMRADLETYVFQDN 452 (725) T ss_pred ccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHH--HHHHHHHHHHHHHHHHHH Confidence 5666666778888999999999999999999999999999999999988766655 677779999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCceeEeEecCc----eeeech-------------hhcccceeEEeecc--chhHHHHHH Q lcl|NC_021540. 501 LANGLTEVAKKILAMNSVWLSDEEVIRITDEE----FVQINR-------------DNLVGSFDIKLSIS--NAETDAIKA 561 (705) Q Consensus 501 ~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~----~v~i~~-------------~~~~~~~dv~v~~~--~~~~~~~~~ 561 (705) |+.+++.+|+++|+||.+||++++++||+|++ ++.||. .++.|+|||.|+++ +++.+.+.. T Consensus 453 l~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~ 532 (725) T protein:vir:10 453 LATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHH Confidence 99999999999999999999999999999874 788874 34567899998775 456677788 Q ss_pred HHHHHHHHHHhhhchhHHHHHHHH---HHHhhhccchhhhhhhccccc-----ch-hh-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 562 QELSFMLQTMGQSLPFDMTKLILG---EIAKLRGMPDLSKMISKYNPE-----PS-PQ-AQLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 562 q~~~~llq~~~~~~~~~~~~~il~---~l~e~~~~~~~~~~~~~~~~q-----~~-~~-~q~~~q~~q~~~q~~q~e~~k 631 (705) +.+++|++++++..|. ...++. .+++..+...+.+.+++..++ +. ++ +++..+++++++++.+.++.+ T Consensus 533 ~~l~qll~~~~~~~~~--~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q 610 (725) T protein:vir:10 533 SEILELLGKTPQGTPE--YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQ 610 (725) T ss_pred HHHHHHHHhccccchh--HHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHH Confidence 8888888887765443 122222 334445555555555543221 11 11 111111222222222222222 Q ss_pred HH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHH------------ Q lcl|NC_021540. 632 LQ-------AEIQLMPYEAQAEAAKARKANTEADLNTLDFV------------EQETGVKQERELELM------------ 680 (705) Q Consensus 632 ~q-------a~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~------------~q~~~~kq~~e~e~~------------ 680 (705) ++ ++++..+++....++++.+.+.++...+++.. +++...+..+.++++ T Consensus 611 ~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~ 690 (725) T protein:vir:10 611 AQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELL 690 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH Confidence 22 22111111110011111111111111111000 000001111111111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 681 --QAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 681 --~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) .+..+.++++++.|+...+++|++= T Consensus 691 ~~~~~~~~~~~~~~~~~~~~q~~~~~~ 717 (725) T protein:vir:10 691 LKGNEQTHKQRMDIANILQSQRQNQPS 717 (725) T ss_pred HHHHHHHHHHHhhhhhccccccccCCC Confidence 1111111222222211111111111 No 17 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=2.3e-81 Score=462.67 Aligned_cols=607 Identities=15% Similarity=0.157 Sum_probs=408.7 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC------------CCCCCCCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY------------KPKQQVGR 68 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~gr 68 (705) =-+|+| +++++...++++.+.+.|.+.++.+++..+...+++++-.+||..++.. .....++| T Consensus 6 ~~~~~~-----~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 80 (641) T protein:vir:94 6 PTPIIE-----DKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWR 80 (641) T ss_pred Cccccc-----CCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhccc Confidence 235555 5666666777777999999999999998887766665556666543321 12234579 Q ss_pred CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEE Q lcl|NC_021540. 69 SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFR 148 (705) Q Consensus 69 s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k 148 (705) |+++++.+.+.++|++|+||+.||++++||+|.|.+++|+++|++.+.|+|+++ ++++++.+++++++++|+.|+||+| T Consensus 81 ~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l-~~~~~~~~~~~~~~d~~~~g~~iv~ 159 (641) T protein:vir:94 81 HRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKL-EAASIRDIFETYVRNLVLYGVSTYR 159 (641) T ss_pred ccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHH-hhcchHHHHHHHHHHHhhcCceEEE Confidence 999999999999999999999999999999999999999999999999999998 7889999999999999999999999 Q ss_pred EeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeee Q lcl|NC_021540. 149 TSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKT 228 (705) Q Consensus 149 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 228 (705) ++|+.+..+.+++... .+|... +......... T Consensus 160 ~~w~~~~~~~~~~~~~-------------------------------------------~~~~~~-----~~~~~~~v~~ 191 (641) T protein:vir:94 160 LGWDTSMERQFKRTFV-------------------------------------------ETGDIF-----GGWEDVAVNR 191 (641) T ss_pred eehhhHHHHhhhhhcc-------------------------------------------cchhhc-----ccccccceec Confidence 9998776654332110 000000 0000011112 Q ss_pred ccCcceEEEechhheeeCCCccCChhhCCeEEEE-EeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccc Q lcl|NC_021540. 229 VKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYS-FESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKA 307 (705) Q Consensus 229 ~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~-~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (705) ....++++.|+|++|||||+++ .+++.|+++| +.+|+.+|+++||+ +.+.+.............+.....+.. . T Consensus 192 ~~~~~r~~~v~~~di~~dps~~--~~~~~f~~~r~t~~t~~~l~~eg~~-~~d~v~~~~~~~~~~~~~d~~~d~~~~--~ 266 (641) T protein:vir:94 192 QRSELRIEPLSPYDVWLDTSGG--KNTGTFVRLRHTREELHELVTSGYY-DLDLTQVEQYVDYKFADPDTPKDVNGT--D 266 (641) T ss_pred ccceeeEEecchhheeecCCCC--cccccceehhhhHHHHHHHHhcCCC-Chhhcchhhcccccccccccccccccc--c Confidence 3456889999999999999975 4456666555 56677778888876 223222211110000001111111112 2 Q ss_pred cCeEEEEEEEEEeeecCCCeeE-EEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHH Q lcl|NC_021540. 308 RKKIVVYEYWGYWDIDGSGVTT-PIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGAL 386 (705) Q Consensus 308 ~~~v~v~E~w~k~~~~~dg~~~-~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~ 386 (705) ..++++||+|+.++ ++|... .+++++.|+++|+.+.++|++ .+||++++|.+.++++||.|+++.+.|.|+.+|++ T Consensus 267 ~~~~~~~e~~gd~~--~d~~~~~~~~~~~~g~~il~~~~~~~~d-~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l 343 (641) T protein:vir:94 267 TSGWDIIEYYGPLL--VEGVQFWCVHAVFYGKQLIRLSDSKYWC-GSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVL 343 (641) T ss_pred ccccceeeeeeeec--cCCCceeeEEEEEeCCEEeecccccccC-cCCeEEecceecCCcccCCChHHHHHHHHHHHHHH Confidence 34567889997554 455433 366888999999999998754 67999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCcc-chHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_021540. 387 TRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPE-LPASSYNMLQMFTLEADALSGVK 465 (705) Q Consensus 387 ~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~-i~~~~~~~l~~~~~~~~~~tGv~ 465 (705) .+.+++++.++++|++++..+++.....++..||++++.+.... +.++..+. ......+.++++...+.+.+|+. T Consensus 344 ~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~----v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 419 (641) T protein:vir:94 344 TNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGS----LQPIDMGRQDFVVTYQEAQVQESSVYRNTSTG 419 (641) T ss_pred HHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCc----ceeecCCccccchhHHHHHHHHHHHHHhhhhh Confidence 99999999999999999988887666778889999998765432 33433222 12334567788888999999999 Q ss_pred hHhcCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecC-----ceeeech Q lcl|NC_021540. 466 SFSQGLTGDSLG-TTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDE-----EFVQINR 538 (705) Q Consensus 466 d~~~G~~~~~~~-~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~-----~~v~i~~ 538 (705) .+.+|.++...+ -||+++++++++++.++..++++|+. +++.+++.++.+++++++.+.++|+.|. .|+++.| T Consensus 420 ~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p 499 (641) T protein:vir:94 420 PLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSP 499 (641) T ss_pred hhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCc Confidence 888887654433 38999999999999999999999996 7889999999999999999999999985 4788999 Q ss_pred hhcccceeEEeecc--chhHHHHHHHHHHHHHHHHhhh---chhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHH Q lcl|NC_021540. 539 DNLVGSFDIKLSIS--NAETDAIKAQELSFMLQTMGQS---LPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQL 613 (705) Q Consensus 539 ~~~~~~~dv~v~~~--~~~~~~~~~q~~~~llq~~~~~---~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~ 613 (705) +++++++++ +..+ ....+.+.++++.++++.++.. ........++..+++..|++.+..+++....++.+.++. T Consensus 500 ~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~~~~~ 578 (641) T protein:vir:94 500 EYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPAAPPIA 578 (641) T ss_pred cceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchhhccCccCchhHHHHH Confidence 999998886 3333 3445667778888888776642 111233456778888889888888887654433322211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 614 EIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEA--AKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRD 691 (705) Q Consensus 614 ~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~--a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~ 691 (705) +. +++++.. .++|+-......++.... +..+.+..++..+..+.+.|++..- . -. T Consensus 579 ~~-----~~q~~~~--~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~--~~ 635 (641) T protein:vir:94 579 PA-----EPGALPP--EMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAAA--------------T--QQ 635 (641) T ss_pred HH-----HHHHHHH--HHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhcc--------------c--cc Confidence 11 1111111 111111111111111100 0000000000001111111111100 0 00 Q ss_pred HHHHHH Q lcl|NC_021540. 692 IVKTFL 697 (705) Q Consensus 692 ~~k~~~ 697 (705) +...++ T Consensus 636 ~~~~~~ 641 (641) T protein:vir:94 636 ITSGAL 641 (641) T ss_pred ccccCC Confidence 000111 No 18 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=1.5e-78 Score=447.18 Aligned_cols=594 Identities=12% Similarity=0.109 Sum_probs=371.8 Q ss_pred CCCH--HHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCC----------CCCCCCcCCCHHHHHHHHHHHHHH Q lcl|NC_021540. 20 WKNK--PKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPK----------QQVGRSSVQPKLIRKQAEWRYSAL 87 (705) Q Consensus 20 ~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~grs~~v~~~v~~~~e~~~~~l 87 (705) |++. .++..+..+++.+......--.+-.+-.+||+++|.=-+. ...||..++-+.|+-+|+|++..- T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 4444 3666777777766665544333333456788866522111 135788889999999999998877 Q ss_pred HHhhcCCCCEEEEeCCCcc-hHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccc Q lcl|NC_021540. 88 SEPFLNDENIFSIAPKTWQ-DREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQ 166 (705) Q Consensus 88 ~~~f~~~~~~~~~~p~~~~-D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~ 166 (705) .+ +-.=+.|.|++.+ |++.|+..|.+++|+.. .++.-....+++.++|++|.|++++.|++...-. T Consensus 81 ~~----nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d-------- 147 (720) T protein:vir:35 81 RH----NRITVKFRPGDKTASEALANKLNGLFRADYE-ETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALD-------- 147 (720) T ss_pred Hh----CCCceEEEcCCCcchHHHHHHHHHHHHHHHH-hcCchHHHhHHHHHhhhccceeEEeeecccccCC-------- Confidence 44 5556999999664 99999999999999765 3444444568999999999999988876431100 Q ss_pred cccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEE--echhhee Q lcl|NC_021540. 167 YVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTI--CDYHNVT 244 (705) Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~--V~~~~~~ 244 (705) + . .....++|++ +|+++|| T Consensus 148 ----------------------~---------------------~----------------~~~~~i~i~~v~~~~~~v~ 168 (720) T protein:vir:35 148 ----------------------P---------------------M----------------DERQRICLEPIYDPARSVW 168 (720) T ss_pred ----------------------C---------------------C----------------cccceeeEecccCchhhee Confidence 0 0 0012345555 4789999 Q ss_pred eCCCcc-CChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeee- Q lcl|NC_021540. 245 IDPTCN-GNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDI- 322 (705) Q Consensus 245 ~Dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~- 322 (705) |||.++ .|++||+|+++..|||+++++++ |+.+...+..... .+..+++ ...+.|+++|||.+..+ T Consensus 169 ~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-yp~~a~~~~~~~~---------~~~~~d~--~~~~~v~i~E~~~~~~~~ 236 (720) T protein:vir:35 169 FDPDAKKYDKSDAEWAFCMYSLSAEKYKAE-YNKDPATLMSGIE---------RSWDYDW--YDVDVVYIAKYYEVKKES 236 (720) T ss_pred ecccccccChhhhhhhhhhcCCCHHHHHHh-CCCcccccccccc---------ccccccc--cCCCceEEEEeeEEEEEE Confidence 999875 59999999999999999999998 5544322211100 0111111 22466999999977432 Q ss_pred -----------------cCC------------Ce--------eEEEEE-EEECCEEEecccCCCCCCCcceEEeeeeee- Q lcl|NC_021540. 323 -----------------DGS------------GV--------TTPIVA-SWVDDVMIRLEKNPYPDGKLPFVVVPYLPV- 363 (705) Q Consensus 323 -----------------~~d------------g~--------~~~~~~-~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~- 363 (705) +++ |. ..+++. .+++|+++-.+++|+||++|||||+++++. T Consensus 237 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~ 316 (720) T protein:vir:35 237 VDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWF 316 (720) T ss_pred EEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeec Confidence 011 10 111122 234666666788999999999999999886 Q ss_pred -cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhh---hcCCcc---------eeecCCcc Q lcl|NC_021540. 364 -KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERK---FKMGED---------YKYNPGTN 430 (705) Q Consensus 364 -~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~---~~pg~~---------i~~~~~~~ 430 (705) ++..++||+++.++|+|+++|+++|++++++++ .+.+++.|++++.+... .+++.+ +..++|.. T Consensus 317 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~---~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~ 393 (720) T protein:vir:35 317 IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ---DTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNI 393 (720) T ss_pred cCCCcccceeeecchhHHHHHHHHHHHHHHHHHc---CCccccccCcchHHHHHHHhhccccccccccccccccccCccc Confidence 666777999999999999999999999999954 47778888887755433 233332 22334432 Q ss_pred --cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 431 --PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEV 508 (705) Q Consensus 431 --~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~ 508 (705) ..+.+.+.+.+++++.++++++.....++++|||+++++|..+| .| +.+|++++++|++.+..++|||+++++.+ T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn-~S--G~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~ 470 (720) T protein:vir:35 394 IAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN-IA--KETVNHLMHRSDMSSFIYLDNMAKSLKRA 470 (720) T ss_pred ccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc-hH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23456788889999999999999999999999999999998766 34 44577888999999999999999999999 Q ss_pred HHHHHHHHHHhcCCceeEeEecC----ceeeec--------------hhhcccceeEEeecc--chhHHHHHHHHHHHHH Q lcl|NC_021540. 509 AKKILAMNSVWLSDEEVIRITDE----EFVQIN--------------RDNLVGSFDIKLSIS--NAETDAIKAQELSFML 568 (705) Q Consensus 509 ~~~~l~li~q~~~~~~~iri~~~----~~v~i~--------------~~~~~~~~dv~v~~~--~~~~~~~~~q~~~~ll 568 (705) |+++|+||.+||+++|++||+|+ .++.+| ++...|+|||.|+++ +++.+++..+.+++++ T Consensus 471 g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll 550 (720) T protein:vir:35 471 GEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLL 550 (720) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHH Confidence 99999999999999999999985 355444 345568899998775 4566788888888888 Q ss_pred HHHhhhchhH-HHHHHHHHHHhhhccchhhhhhhcccccc------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 569 QTMGQSLPFD-MTKLILGEIAKLRGMPDLSKMISKYNPEP------SPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPY 641 (705) Q Consensus 569 q~~~~~~~~~-~~~~il~~l~e~~~~~~~~~~~~~~~~q~------~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~ 641 (705) +.++|..+.. ....++.+++++++...+.+++++..+.. .++.++..+.++++.++.+.+++++|++++..++ T Consensus 551 ~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqa 630 (720) T protein:vir:35 551 AGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQA 630 (720) T ss_pred HhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHH Confidence 8877754432 22223345556666666666665433211 1222222233333334444444444444332222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHH----------HHHHHHHHHHH-----HHHHh Q lcl|NC_021540. 642 EAQAEAAKARKANTEADLNTLDFVEQETGVKQ-----ERELELMQAQAK----------GNTQRDIVKTF-----LDTNK 701 (705) Q Consensus 642 ~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq-----~~e~e~~~~q~~----------~~~~~~~~k~~-----~~~~~ 701 (705) +++..+ ++....+++..+.+...+..+.++ ++.+..+..+.+ .+.+.+...++ ....+ T Consensus 631 e~~kaq--a~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~ 708 (720) T protein:vir:35 631 EVQKAK--NEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATDTQH 708 (720) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccchhh Confidence 222111 111111111111111111000000 000000000000 00000111111 11111 Q ss_pred hccC Q lcl|NC_021540. 702 QGNQ 705 (705) Q Consensus 702 q~~~ 705 (705) .+++ T Consensus 709 ~~~~ 712 (720) T protein:vir:35 709 KQNR 712 (720) T ss_pred hhhH Confidence 1111 No 19 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=5.2e-77 Score=438.79 Aligned_cols=595 Identities=14% Similarity=0.118 Sum_probs=363.0 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHH--HHhccCCCCCC--------CCCCCCCc Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWL--AQLNVTGAYKP--------KQQVGRSS 70 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~--------~~~~grs~ 70 (705) ||+.. ..++..+..-|+.+.++......+..+.+ +||.|.=-... ....||.. T Consensus 1 ma~~~-----------------~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~ 63 (708) T protein:vir:17 1 MAETL-----------------EKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPK 63 (708) T ss_pred CchhH-----------------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCc Confidence 22211 13556666666666665555544443333 45655322211 12357899 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCc-chHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEE Q lcl|NC_021540. 71 VQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTW-QDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRT 149 (705) Q Consensus 71 ~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~-~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~ 149 (705) ++-+.|+-+|+|++..=.+ +-.=+.|.|+++ +|.+.|+..|.+++|+.. .++.-....+++++++.+|.|.+++ T Consensus 64 ~~~N~i~~~i~~v~g~e~~----nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~~ 138 (708) T protein:vir:17 64 FEINKVATELNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRL 138 (708) T ss_pred eEEcchHHHHHHHHhhHhh----CCcceEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhcccceeee Confidence 9999999999999776533 445599999975 499999999999999765 3444445668999999999996554 Q ss_pred eecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeec Q lcl|NC_021540. 150 SWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTV 229 (705) Q Consensus 150 ~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 229 (705) .-+++.. . ++. .. T Consensus 139 ~~d~~~e--------------~----------------d~~-------------------------------------~~ 151 (708) T protein:vir:17 139 TSMLVNE--------------Y----------------DPM-------------------------------------DD 151 (708) T ss_pred eeccccc--------------C----------------CCC-------------------------------------CC Confidence 3111100 0 000 00 Q ss_pred cCcceEEE--echhheeeCCCcc-CChhhCCeEEEEEeccHHHHHHhcCCcCcc-hhhhhhhhhhccccccccccccccc Q lcl|NC_021540. 230 KNQPEVTI--CDYHNVTIDPTCN-GNLDEAKFVIYSFESSRSDLEKYGIYSNLE-YIKEDSSTSTSSDHYSSDTSFTFSD 305 (705) Q Consensus 230 ~~~~~i~~--V~~~~~~~Dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (705) ..++.|+. +||.+|||||.++ .|++||+|++++.|||+++++++ |++... ..+. ....+ ..+.. T Consensus 152 ~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~-yp~~a~~~~~~-------~~~~~----~~~~~ 219 (708) T protein:vir:17 152 RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE-YGKKPPASLDV-------TSMTS----WEYDW 219 (708) T ss_pred ccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHh-Cccccchhhhh-------hhhcc----ccccc Confidence 11233434 4789999999774 59999999999999999999998 443211 1110 00001 11112 Q ss_pred cccCeEEEEEEEEEeeec---------CC---------------------Ce--------e--EEEEEEEECCEEEeccc Q lcl|NC_021540. 306 KARKKIVVYEYWGYWDID---------GS---------------------GV--------T--TPIVASWVDDVMIRLEK 345 (705) Q Consensus 306 ~~~~~v~v~E~w~k~~~~---------~d---------------------g~--------~--~~~~~~~~g~~iL~~~~ 345 (705) ...++|+|+|||+|.... .. |. . +++.++|.|..+| .++ T Consensus 220 ~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l-~~~ 298 (708) T protein:vir:17 220 FDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFL-EKP 298 (708) T ss_pred cCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccc-cCC Confidence 234789999999875321 01 11 1 1122334455555 678 Q ss_pred CCCCCCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh----h---- Q lcl|NC_021540. 346 NPYPDGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE----R---- 415 (705) Q Consensus 346 ~p~~~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~----~---- 415 (705) +|+||++|||||+++++. ++...+||++|.++|+|+.+|+++|+++++++++++.+++++.+++..... . T Consensus 299 ~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~ 378 (708) T protein:vir:17 299 RRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKR 378 (708) T ss_pred CCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccch Confidence 999999999999999976 566666999999999999999999999999999999999999988743221 1 Q ss_pred ------hhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHH Q lcl|NC_021540. 416 ------KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGA 489 (705) Q Consensus 416 ------~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~ 489 (705) +..++.+-.+++++ ..+...++|++++.++++++....+++++|||+++++|..+| +| +.+|++++++ T Consensus 379 ~~~~~~~~~~~~~g~v~~~a---~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn-~S--G~Ai~~rq~q 452 (708) T protein:vir:17 379 PAFLPLREVRDKYGNIIAGA---TPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-IA--QETVNNLMNR 452 (708) T ss_pred hhhhhhhccCCccccccccc---CCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc-hH--HHHHHHHHHH Confidence 11123222223322 234556688999999999999999999999999999997654 34 4457778899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecC----ceeeech--------------hhcccceeEEeec Q lcl|NC_021540. 490 SGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDE----EFVQINR--------------DNLVGSFDIKLSI 551 (705) Q Consensus 490 ~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~----~~v~i~~--------------~~~~~~~dv~v~~ 551 (705) |++.+..++||++.+++.+|+++|+||.+||+++|++||+|+ .++.+|. +...|+|||.|+. T Consensus 453 g~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~ 532 (708) T protein:vir:17 453 ADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDV 532 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEec Confidence 999999999999999999999999999999999999999986 3666653 3445788988876 Q ss_pred c--chhHHHHHHHHHHHHHHHHhhhchhH-HHHHHHHHHHhhhccchhhhhhhcccccch------hhH-HHHHHHHHHH Q lcl|NC_021540. 552 S--NAETDAIKAQELSFMLQTMGQSLPFD-MTKLILGEIAKLRGMPDLSKMISKYNPEPS------PQA-QLEIQIKQLE 621 (705) Q Consensus 552 ~--~~~~~~~~~q~~~~llq~~~~~~~~~-~~~~il~~l~e~~~~~~~~~~~~~~~~q~~------~~~-q~~~q~~q~~ 621 (705) + +++.+++..+.++++++++++..+.. ....++.++++.++..++.+.++...++.. ++. ++.+++.+++ T Consensus 533 ~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~ 612 (708) T protein:vir:17 533 GPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAA 612 (708) T ss_pred ccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHH Confidence 4 45667777888888888887754432 223344566677777777777765433211 111 1111111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021540. 622 AQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELEL-MQAQ-AKGNTQRDIVKTFLDT 699 (705) Q Consensus 622 ~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~-~~~q-~~~~~~~~~~k~~~~~ 699 (705) +++.++++.+++++....++++ ++++++..+.+++..+.+....+...+..+.++. ...+ .+....++.++.+.+. T Consensus 613 q~q~~~~~~eaqa~~~~~qAe~--~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~ 690 (708) T protein:vir:17 613 QSQPNPEMVLAQAQMVAAQAEA--QKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAES 690 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Confidence 2222222222232222222211 1222222222221111111111111111101110 0000 0011112222232222 Q ss_pred HhhccC Q lcl|NC_021540. 700 NKQGNQ 705 (705) Q Consensus 700 ~~q~~~ 705 (705) ++|+.+ T Consensus 691 q~q~~~ 696 (708) T protein:vir:17 691 QQQQFQ 696 (708) T ss_pred HHHHHh Confidence 232222 No 20 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=7.5e-77 Score=437.92 Aligned_cols=599 Identities=15% Similarity=0.126 Sum_probs=371.0 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC----------CCCCCCCc Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP----------KQQVGRSS 70 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~grs~ 70 (705) ||+ . ...++..++..|+.+..+...-.....+..+||+++|.=-+ ....||.. T Consensus 1 m~~--------~---------~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~ 63 (708) T protein:vir:10 1 MAE--------T---------LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPK 63 (708) T ss_pred Cch--------h---------HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCc Confidence 221 1 12467777778877777666555545556778886653221 12358899 Q ss_pred CCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcc-hHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEE Q lcl|NC_021540. 71 VQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQ-DREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRT 149 (705) Q Consensus 71 ~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~-D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~ 149 (705) ++-+.|+-+|+|++..-.+ +-.=+.|.|.+++ |++.|+..|.+++|+...- +.-....+++++++++|.|.+++ T Consensus 64 ~~~N~i~~~v~~v~g~~~~----nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~-~~~~~~s~Af~d~i~~G~Gw~~~ 138 (708) T protein:vir:10 64 FEINKVATELNRIIAEYRN----NRITVKFRPGDREASEELANKLNGLFRADYEET-DGGEACDNAFDDAATGGFGCFRL 138 (708) T ss_pred eEEcchHHHHHHHHHHHHh----CCcceEEEcCCCCchHHHHHHHHHHHHHHHHhc-CchHHHHHHHHhhhhcccceeee Confidence 9999999999999887755 5566999999765 9999999999999986643 33345668999999999996655 Q ss_pred eecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeec Q lcl|NC_021540. 150 SWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTV 229 (705) Q Consensus 150 ~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 229 (705) .-+++.. .++. .. T Consensus 139 ~~d~~~e------------------------------~d~~-------------------------------------~~ 151 (708) T protein:vir:10 139 TSMLVNE------------------------------YDPM-------------------------------------DD 151 (708) T ss_pred eeccccc------------------------------cCCC-------------------------------------CC Confidence 3221100 0000 00 Q ss_pred cCcceE--EEechhheeeCCCcc-CChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccc Q lcl|NC_021540. 230 KNQPEV--TICDYHNVTIDPTCN-GNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDK 306 (705) Q Consensus 230 ~~~~~i--~~V~~~~~~~Dp~a~-~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (705) ..+++| .+.|+.+|||||.++ .|++||+|+++++|||++++++++..+.....+. ... ....+... T Consensus 152 ~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~-------~~~----~~~~~~~~ 220 (708) T protein:vir:10 152 RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDV-------TSM----TSWEYNWF 220 (708) T ss_pred ccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCccccccc-------ccC----CCcccccc Confidence 112233 344678999999775 6999999999999999999999833221111100 000 01111112 Q ss_pred ccCeEEEEEEEEEeee---------cCCC-----------------------------ee--EEEEEEEECCEEEecccC Q lcl|NC_021540. 307 ARKKIVVYEYWGYWDI---------DGSG-----------------------------VT--TPIVASWVDDVMIRLEKN 346 (705) Q Consensus 307 ~~~~v~v~E~w~k~~~---------~~dg-----------------------------~~--~~~~~~~~g~~iL~~~~~ 346 (705) ..+.|+|.|||.+..+ ...| +. +.+..+|.|..+| ..++ T Consensus 221 ~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l-e~~~ 299 (708) T protein:vir:10 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFL-EKPR 299 (708) T ss_pred CCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhh-ccCC Confidence 2355888888876321 0111 01 1122344566666 6789 Q ss_pred CCCCCCcceEEeeeeee--cCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhh----hhc-- Q lcl|NC_021540. 347 PYPDGKLPFVVVPYLPV--KDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNER----KFK-- 418 (705) Q Consensus 347 p~~~~~~Pfv~~~~~~~--~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~----~~~-- 418 (705) |++|++|||+|+++++. ++...+||+++.++|+|+++|+++|++.+++++++....+++.+++...... +.. T Consensus 300 ~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~ 379 (708) T protein:vir:10 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) T ss_pred CCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccch Confidence 99999999999999986 5667779999999999999999999999999999999999988887543221 111 Q ss_pred ---CCcceeecCCccc--ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 419 ---MGEDYKYNPGTNP--VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKR 493 (705) Q Consensus 419 ---pg~~i~~~~~~~~--~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~ 493 (705) +...+..+.|... ...+...+++++++.++++++.....++++||++++++|..+| . ++.+|++++++|++. T Consensus 380 ~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn-~--SG~aI~~rq~qg~~~ 456 (708) T protein:vir:10 380 AFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN-I--AQETVNNLMNRADMA 456 (708) T ss_pred hhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc-h--HHHHHHHHHHHHHHH Confidence 1111222222211 2234556678999999999999999999999999999997544 3 445688888999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecC----ceeeec--------------hhhcccceeEEeecc--c Q lcl|NC_021540. 494 ELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDE----EFVQIN--------------RDNLVGSFDIKLSIS--N 553 (705) Q Consensus 494 ~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~----~~v~i~--------------~~~~~~~~dv~v~~~--~ 553 (705) +..++|||+.+++.+|+++|+||.+||+++|++||+|+ +++.+| .+...|+|||.|+.+ + T Consensus 457 l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~ 536 (708) T protein:vir:10 457 SFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSY 536 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCc Confidence 99999999999999999999999999999999999986 355554 345567889988764 5 Q ss_pred hhHHHHHHHHHHHHHHHHhhhchhH-HHHHHHHHHHhhhccchhhhhhhcccccchhhH-------HHHHHHHHHHHHHH Q lcl|NC_021540. 554 AETDAIKAQELSFMLQTMGQSLPFD-MTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA-------QLEIQIKQLEAQEL 625 (705) Q Consensus 554 ~~~~~~~~q~~~~llq~~~~~~~~~-~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~-------q~~~q~~q~~~q~~ 625 (705) ++.+++..+.++++++.++|..|.. ....++.+++++++...+.+++++..+++.+.. ++.+++.+++.++. T Consensus 537 ~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~ 616 (708) T protein:vir:10 537 TARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) T ss_pred hhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHH Confidence 6778888889999999887754432 122344567777888888888776543322111 11111112222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELM--QAQAKGNTQRDIVKTFLDTNKQG 703 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~--~~q~~~~~~~~~~k~~~~~~~q~ 703 (705) ++++..++++....+ +++++++++..+.+++..+.+....+.+.+..+.++.. .+..+....+++++.....++++ T Consensus 617 ~~~~~e~qa~~~~~q--Ae~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~ 694 (708) T protein:vir:10 617 NPEMVLAQAQMVAAQ--AEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQ 694 (708) T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHH Confidence 222222222222222 22222222222222211111111111111111111100 00000011122222222222322 Q ss_pred cC Q lcl|NC_021540. 704 NQ 705 (705) Q Consensus 704 ~~ 705 (705) .+ T Consensus 695 ~~ 696 (708) T protein:vir:10 695 FQ 696 (708) T ss_pred Hh Confidence 22 No 21 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=1.1e-76 Score=437.11 Aligned_cols=549 Identities=15% Similarity=0.134 Sum_probs=382.4 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~grs~~v~~~v~~~~e~~~~~ 86 (705) |+.+.+.-.+-..-..+-+.|.++++++.+.++..+-.+.+-.+||..+-.. ...+.++|||++.++++..++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 4444443333333334668899999999998886644334445565554222 24567799999999999999999999 Q ss_pred HHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhc---CCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQL---DKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVP 163 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~---~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~ 163 (705) ||++||++++||+|.|.-++|..+ .....+++.+..|+ +=..+++.+|++++..|+||+|++|.....+..+. T Consensus 81 l~~~~Fp~~~w~~~v~~~~~~~~~--~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~-- 156 (584) T protein:vir:95 81 YFSSLFPNDDWLRWVGYGKGDSTK--TKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDG-- 156 (584) T ss_pred HHHhhcCccceeeeecCCCchhhH--HHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecc-- Confidence 999999999999999999999987 22445666655555 34456899999999999999999997654333110 Q ss_pred ccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe Q lcl|NC_021540. 164 VFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV 243 (705) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~ 243 (705) +. ...+.+|+|++|+|++| T Consensus 157 -----------------------------------------------------------~~--v~~~~~prieriSP~d~ 175 (584) T protein:vir:95 157 -----------------------------------------------------------TL--VPDYIGPRLVRISPLDI 175 (584) T ss_pred -----------------------------------------------------------cc--ccccccceEEeeChhhe Confidence 00 01245889999999999 Q ss_pred eeCCCccCChhhCCeEEEEEeccHHHHHHhc----C-CcCcchhhhhhhhhhccccc-----------cccccc-ccccc Q lcl|NC_021540. 244 TIDPTCNGNLDEAKFVIYSFESSRSDLEKYG----I-YSNLEYIKEDSSTSTSSDHY-----------SSDTSF-TFSDK 306 (705) Q Consensus 244 ~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g----~-~~d~~~~~~~~~~~~~~~~~-----------~~~~~~-~~~~~ 306 (705) ||||+|+ +++|+.||+ +..+|+++|.++. + +-+.+.+.......-+..+. ..+..+ .++.. T Consensus 176 ~~Dpsa~-~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~ 253 (584) T protein:vir:95 176 VFNPLAT-SISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDVDGFGNLYEYY 253 (584) T ss_pred eecCCCC-Cccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccccccccccccccccccccc Confidence 9999995 899999998 6668999998772 1 22333333222111011111 111111 11122 Q ss_pred ccCeEEEEEEEEEe-eecCCCeeEE-EEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHH Q lcl|NC_021540. 307 ARKKIVVYEYWGYW-DIDGSGVTTP-IVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIG 384 (705) Q Consensus 307 ~~~~v~v~E~w~k~-~~~~dg~~~~-~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN 384 (705) ...+|.++|+|+.+ +...++...+ .++++.|+++|+.+.||||++++||+.+++.|+++++||+|+.+.+.|+|+++| T Consensus 254 ~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~ln 333 (584) T protein:vir:95 254 MSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRID 333 (584) T ss_pred CCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHh Confidence 34579999999854 4444544444 455678999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCc--cchHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 385 ALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYP--ELPASSYNMLQMFTLEADALS 462 (705) Q Consensus 385 ~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~--~i~~~~~~~l~~~~~~~~~~t 462 (705) +++|+++||+.++++|. .+..++. .....+||++|+..... .+.+..+| .+. ..++.++++++.+++.| T Consensus 334 a~~r~~iDnl~l~~~pv---~k~~~~~-~~~~~~pg~~~~~~~~~----~~q~~~p~a~~~~-s~~~~lq~~e~~me~~s 404 (584) T protein:vir:95 334 HLENAKADAVDLIIQPP---LKIIGEV-EEFVWGPGAEIHLDQGG----DVQEIAKNVNYII-NADNQIQMLEDRMELYA 404 (584) T ss_pred HHHHHHHHHHHHhcCcc---eeecccc-chhcccCCceeecCCCC----CcceecCchhhhh-HHHHHHHHHHHHHHhhh Confidence 99999999999999983 3444443 33467899999885432 23344333 222 33455899999999999 Q ss_pred CcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCceeEeEecCc-----eeee Q lcl|NC_021540. 463 GVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGL-TEVAKKILAMNSVWLSDEEVIRITDEE-----FVQI 536 (705) Q Consensus 463 Gv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~-~~~~~~~l~li~q~~~~~~~iri~~~~-----~v~i 536 (705) ||+.+++|.++. ...||+++++++++++..++.+++.|.+.+ ++++..++.+..++++...++|++|++ |++| T Consensus 405 Gvp~~~~G~~~~-~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i 483 (584) T protein:vir:95 405 GAPREAMGIRTP-GEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSV 483 (584) T ss_pred CCChhhcccccc-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeecccccccccccc Confidence 999999998744 468999999999999999999999999965 788888888888889999999999975 8999 Q ss_pred chhhcccceeEEeeccchhH-HHHHHHHHHHHHH-HHhhhchhHHHH-HHHHHHHhhhccchhhhhhhcccccchhhHHH Q lcl|NC_021540. 537 NRDNLVGSFDIKLSISNAET-DAIKAQELSFMLQ-TMGQSLPFDMTK-LILGEIAKLRGMPDLSKMISKYNPEPSPQAQL 613 (705) Q Consensus 537 ~~~~~~~~~dv~v~~~~~~~-~~~~~q~~~~llq-~~~~~~~~~~~~-~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~ 613 (705) .+++++|++++....+++.. +.+..+.+.+++| ++++.+.+.... .++..++++.+++.. .+..+ ......|+ T Consensus 484 ~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~--~~~~~--~~~~~~Q~ 559 (584) T protein:vir:95 484 TREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGY--EIFRP--NVAVAEQA 559 (584) T ss_pred ChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcc--cccCC--CcccchhH Confidence 99999999999887766543 4566788888887 566655444433 444446677666632 12221 11111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 614 EIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAK 649 (705) Q Consensus 614 ~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~ 649 (705) ..|+..+++|+. ..+|++ ..++.|- T Consensus 560 ~~q~~~~~~q~~----~~~~~~-------~~~~~~~ 584 (584) T protein:vir:95 560 ETQSLVAQAQED----LQLQAQ-------MPAEGAI 584 (584) T ss_pred HHHhhhHHHHHH----HHHHHh-------hhhccCC Confidence 222222111111 111111 1111111 No 22 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=6.6e-69 Score=394.39 Aligned_cols=555 Identities=14% Similarity=0.088 Sum_probs=375.3 Q ss_pred hhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhcc---C--CCCCCCCCCCCCcCCCHHHHHH Q lcl|NC_021540. 5 NEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNV---T--GAYKPKQQVGRSSVQPKLIRKQ 79 (705) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~---~--~~~~~~~~~grs~~v~~~v~~~ 79 (705) ++-.+....+--.+.=+....+++|...+....+.++ .++++|.|.|++ + ..-.+.+.+||+|+..+.+.+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~---~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~ 77 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARA---QKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHL 77 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhh---hhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHH Confidence 2222222222112233344566778877776555555 345566665553 2 2334667889999999999999 Q ss_pred HHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhc---CCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 80 AEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQL---DKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 80 ~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~---~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) ++.++++++..+|++++||+|.|..++|.. +-.+..+...+..|+ +=..++..+|.+.++.|++|.++.|....+ T Consensus 78 ~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~--~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~ 155 (599) T protein:vir:31 78 HLMITTSYMEHLLPNRNWVDFVGFDNDSVN--AEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMT 155 (599) T ss_pred HHHHHHHHHhhhcCCccceEeeecCCchhH--HHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcce Confidence 999999999999999999999999999653 333556665556666 444568899999999999999998853322 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEE Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVT 236 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~ 236 (705) +.. .| -....+.+|+++ T Consensus 156 ~~~-------------------------------------------------d~--------------~v~~~~~~P~~e 172 (599) T protein:vir:31 156 VTA-------------------------------------------------EN--------------QVIKNYSGTVTE 172 (599) T ss_pred eec-------------------------------------------------cc--------------ccccccccceEE Confidence 210 00 011235688999 Q ss_pred EechhheeeCCCccCChhhCCeEEEEEeccHHHHHHh---cCCc--Ccchhhhhhhhhhcccccccc--ccccccccccC Q lcl|NC_021540. 237 ICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKY---GIYS--NLEYIKEDSSTSTSSDHYSSD--TSFTFSDKARK 309 (705) Q Consensus 237 ~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~---g~~~--d~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 309 (705) +|+|+||||||+|+ +++||.||+ |...|+++|..+ +++. +++.+.............+.+ ......|.++. T Consensus 173 rvsP~Di~~Dp~A~-si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~ 250 (599) T protein:vir:31 173 RLSPSDVFWDVTAD-SLPKAAKCI-RQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDSLHK 250 (599) T ss_pred eecccceeeCCCCC-CCCcceeee-ehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhcccccc Confidence 99999999999995 899998877 888999999875 2321 233333221111111111111 11222232222 Q ss_pred -------------eEEEEEEEE-EeeecCCCeeEEEEEEEECC-EEEecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 310 -------------KIVVYEYWG-YWDIDGSGVTTPIVASWVDD-VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 310 -------------~v~v~E~w~-k~~~~~dg~~~~~~~~~~g~-~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) -|.++|+|+ .++.++|+..+.++++++|+ ++++.+.|||++|++||+..++.|+++++||+|+.. T Consensus 251 d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~ 330 (599) T protein:vir:31 251 KGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLH 330 (599) T ss_pred ccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCCch Confidence 278899996 78889999988899999995 788999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHH Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMF 454 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~ 454 (705) .+.+.|..+|.++|+++|++.++..|.+. ..+.+.+.| ....||++|++.... .+.+..+|.-......+++++ T Consensus 331 ~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~-~~~dl~~eD-~~~~P~~v~~~~d~~----~vq~~~p~s~~~~a~~~is~~ 404 (599) T protein:vir:31 331 RLTGMQYKLDKRENFREDLHDRFLHPSLK-KVGDVREKG-MRGGPNHVFEVEETG----DVQYMTPPAEVLQPDNQLSIT 404 (599) T ss_pred hcchHHHHHHHHHHHhhhhhhhhhccccc-ccccccccC-ccCCCCcceeecCCC----ccccccCchhhhhHHHHHHHH Confidence 99999999999999999999999877333 333344433 335699999886543 234444443344455578999 Q ss_pred HHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCceeEeEecCc- Q lcl|NC_021540. 455 TLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGL-TEVAKKILAMNSVWLSDEEVIRITDEE- 532 (705) Q Consensus 455 ~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~-~~~~~~~l~li~q~~~~~~~iri~~~~- 532 (705) ...+++.||++.++.|.++.. ..||+++++++++++.+.+.+++.|.+.+ ++++++++.+.++|++++.++|+++++ T Consensus 405 e~~mee~sGvp~~~~G~~~ag-~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~ 483 (599) T protein:vir:31 405 LQLMEDLSGAPKESIGQRTAG-EKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSEL 483 (599) T ss_pred HHHHHHhhccchhhcCCcccc-hhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccc Confidence 999999999999999988765 47999999999999999999999999965 679999999999999999999999986 Q ss_pred ----eeeechhhcccceeEEeeccch--hHHHHHHHHHHHHHH-HHhhhchhHHHHH-HHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 533 ----FVQINRDNLVGSFDIKLSISNA--ETDAIKAQELSFMLQ-TMGQSLPFDMTKL-ILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 533 ----~v~i~~~~~~~~~dv~v~~~~~--~~~~~~~q~~~~llq-~~~~~~~~~~~~~-il~~l~e~~~~~~~~~~~~~~~ 604 (705) |++|.++++++.+++ +..|+. ..+.+..+.+.++++ ++++...+.+.++ ++..+.. ...+...+ T Consensus 484 ~~~~f~~i~redl~~~~~~-v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~-------~~~l~~~~ 555 (599) T protein:vir:31 484 GTATFLDITADDLNLNGQM-VAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEY-------LGDLDAYG 555 (599) T ss_pred cceeeEEeehhhhhCCeee-eechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHH-------HHhccccc Confidence 999999999999999 455543 234445555556553 2333333333332 2222222 22334444 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQL----MPYEAQAEAAKARKAN 654 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~----~~~~~q~e~a~a~~~~ 654 (705) .++.+.+.+++|. +..-+|++++. +.++...-.--+.+.+ T Consensus 556 ~~~~~va~~eqq~----------~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 556 IFTFGIGVQEDQQ----------LARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred cCCCchhHHHHHH----------HHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 4444544333211 11111111111 1110000000000000 No 23 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=1.4e-45 Score=266.46 Aligned_cols=602 Identities=13% Similarity=0.103 Sum_probs=320.6 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQA 80 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~ 80 (705) |++-..-.-.++ |.. +...-.+.+..|+...++.=.+.++--+.|-+.-.... ...+. .+.+..+| T Consensus 1 m~~~~~~~~~~t-pe~--------la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~---~~~~r--~nl~~sni 66 (663) T protein:vir:34 1 MNESQPTDFADT-PQG--------WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAH---DAETR--WNLFSTNI 66 (663) T ss_pred CCccccccchhc-chh--------HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCC---ccccc--cchhhhhH Confidence 555443333333 211 22223333444544333332223333455544332222 22233 48999999 Q ss_pred HHHHHHHHHhhcCCCCEEEEeCCCcc-hHHHHHHHHHHHHHHHHhhcCC----c-chHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 81 EWRYSALSEPFLNDENIFSIAPKTWQ-DREAARQNEAILNYQFNNQLDK----V-KLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 81 e~~~~~l~~~f~~~~~~~~~~p~~~~-D~~~A~~~t~~~n~~~~~~~~~----~-~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) +.++|++ .+..+++.|.|+..+ |.+.++.+.++++..+++-..+ + ..+...++++|+||.|++++.+..+ T Consensus 67 ~~i~P~i----Yar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~ 142 (663) T protein:vir:34 67 QTQMASL----YGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVE 142 (663) T ss_pred HHHhhhh----hcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecc Confidence 9999999 889999999998887 5567888888888877544433 2 3478899999999999999976432 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ..+ ..+.... .++....+ ...+ ....+...+.+++ T Consensus 143 ~~~-------~~~~~~~---------------~D~~~~~~-----------~a~~------------~~~~e~~a~E~v~ 177 (663) T protein:vir:34 143 WEE-------VAGVDAI---------------LDEATGAE-----------LAAA------------VPPTQRKAYECVE 177 (663) T ss_pred cch-------hcccccc---------------CCCccccc-----------hhcc------------cccchhhccccee Confidence 110 0000000 00000000 0001 1122334467899 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVY 314 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 314 (705) |++|+|.||++||+ + .|++++||+++.|||++++++. |..+.+.....+.. ........+.+-.+.+.++.+|| T Consensus 178 id~v~~~dfl~~pA-r-~W~ev~wva~r~~mtk~e~~~r-f~~~~~~~~~a~~~---~~~~~~~~~~~~~~~~~~~a~Vw 251 (663) T protein:vir:34 178 TDYLHWQDVLWSPA-R-VWHEVRWLAFRNLLDMREFNAR-FDADGSRNLWASVP---KVGKPKDGKDGQSCHPWDRAEVW 251 (663) T ss_pred eeeechhhcccchh-h-ccccccceeeeccCCHHHHHHh-hcCChhhhhhhhcc---CcCCccccCCCCCcchhcCccee Confidence 99999999999996 3 6999999999999999999887 33333211111111 11111111111123334579999 Q ss_pred EEEEEeeecCCCeeEEEEEEEECC--EEEecccCCCCCCCc---ceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHH Q lcl|NC_021540. 315 EYWGYWDIDGSGVTTPIVASWVDD--VMIRLEKNPYPDGKL---PFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRG 389 (705) Q Consensus 315 E~w~k~~~~~dg~~~~~~~~~~g~--~iL~~~~~p~~~~~~---Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~ 389 (705) |+|.|- ..+++|++.| .+|+.++.|.....| ||..++... +++.+|.+.+-...+.++++|-++.+ T Consensus 252 EIWdK~--------~~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~~-~ds~ipvpd~~~y~~~~~E~n~~t~R 322 (663) T protein:vir:34 252 EIWDKG--------GRKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANWT-TDKVVPRPDFVLAQDLYKEIDLVSTR 322 (663) T ss_pred EEEecC--------CcEEEEEEcCcceecccCCCCCCCCCCCCCcccccceec-CCCeecCCcHHHHHHHHHHHHHHHHH Confidence 999983 3456777765 477766666555444 665544443 45677777777999999999987766 Q ss_pred HHHHHHhcCCCcEEeeccccCchhh-h-hhcCCcceeecCC------cccccccccccCccchHHHHHHHH---HHHHHH Q lcl|NC_021540. 390 MIDAMARSANGQRGMSKNLLDPVNE-R-KFKMGEDYKYNPG------TNPVTDIIEHKYPELPASSYNMLQ---MFTLEA 458 (705) Q Consensus 390 ~~d~~~~~~~~~~~~~~~av~~~d~-~-~~~pg~~i~~~~~------~~~~~~i~~~~~~~i~~~~~~~l~---~~~~~~ 458 (705) + ..+.-...++++++.|+...... + ....+..+.+... +.....|..++.+++.+.+..+.+ .+...+ T Consensus 323 i-n~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~ 401 (663) T protein:vir:34 323 I-TLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDAL 401 (663) T ss_pred H-HHHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHH Confidence 5 55566688999999776533221 2 1112233333221 112345777777777766666654 577888 Q ss_pred HHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCce---ee Q lcl|NC_021540. 459 DALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEF---VQ 535 (705) Q Consensus 459 ~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~---v~ 535 (705) .++||++|+++|... .+.||++.+...+.++.|+..+.+.+.++.+++++....+|.+.++-+.+-+++|.+. ++ T Consensus 402 ~qITGiaDi~Rga~~--a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~e 479 (663) T protein:vir:34 402 HQVTGMADIMRGASD--PRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKE 479 (663) T ss_pred HHHHhHHHHhhcccC--cchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccc Confidence 899999999999653 3577776666668899999999999999999999999999999998888878877542 22 Q ss_pred ec-------hhhcccceeEEeecc-chhHH----H----HHHHHHHHHHHHHhhhchh-HHHHHHHHHHHh-----hhcc Q lcl|NC_021540. 536 IN-------RDNLVGSFDIKLSIS-NAETD----A----IKAQELSFMLQTMGQSLPF-DMTKLILGEIAK-----LRGM 593 (705) Q Consensus 536 i~-------~~~~~~~~dv~v~~~-~~~~~----~----~~~q~~~~llq~~~~~~~~-~~~~~il~~l~e-----~~~~ 593 (705) |. .+.+ ..|.+.|..+ +...+ . ..+..+..+++++++.... .....++.++.. +.+. T Consensus 480 i~~~~~~L~n~~~-r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~ 558 (663) T protein:vir:34 480 LAPKAAELIKSRF-SMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGS 558 (663) T ss_pred hhHHHHHHhcCCC-cceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChh Confidence 22 2222 3455555322 22111 1 1122223333333222100 001111222211 1111 Q ss_pred chh-------hhhhh---cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 594 PDL-------SKMIS---KYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLD 663 (705) Q Consensus 594 ~~~-------~~~~~---~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~ 663 (705) ..+ ....+ +...+|.++.+.......+++.+.|.+++++|+++|..+++.+.+.+. .+.++++ T Consensus 559 ~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e~q~~~~~~ql~~~~-------~~~k~~~ 631 (663) T protein:vir:34 559 STIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAEVQGDLLRIQAETQA-------NETKERQ 631 (663) T ss_pred hhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHH Confidence 111 11111 111112222211111122222333333333333333333333322211 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 664 FVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 664 ~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) .. +......+++.+..++...+ +.+++- T Consensus 632 ~a--~~~~~~a~q~~~~~~~~r~~------------~~~a~~ 659 (663) T protein:vir:34 632 QA--EWNVREAAQKNLISQAARAM------------NPQARN 659 (663) T ss_pred HH--HHHHHHHHHhhHHHHHHHhh------------chhhhc Confidence 00 00001122222222222211 111111 No 24 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=2.3e-29 Score=177.54 Aligned_cols=519 Identities=12% Similarity=0.073 Sum_probs=288.9 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC--CCCCCCCCC---CCcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG--AYKPKQQVG---RSSVQPKLIRKQAEWRYSALSEPFLN- 93 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g---rs~~v~~~v~~~~e~~~~~l~~~f~~- 93 (705) |.+ ...+.|++.++..++..++..+++++-.+|..-.. ........| .++++++.....++.+.+.||..+|+ T Consensus 1 m~~-~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (556) T protein:vir:73 1 MAE-TEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSP 79 (556) T ss_pred CCh-hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCC Confidence 444 46777888999999988877665554444432111 111112222 35789999999999999999999998 Q ss_pred CCCEEEEeCCCcchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccc Q lcl|NC_021540. 94 DENIFSIAPKTWQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQY 167 (705) Q Consensus 94 ~~~~~~~~p~~~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~ 167 (705) +.+||.+.+..++..+.+.. .+..+.-.|. .++-+..++..+++.+..|||++.+.++ T Consensus 80 ~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~--------------- 143 (556) T protein:vir:73 80 ARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFN-KSNLYQSLPVMYASLGTFGTGAMAVMED--------------- 143 (556) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeeeeec--------------- Confidence 89999999876543333322 3444433333 3455566778888888888887743221 Q ss_pred ccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCC Q lcl|NC_021540. 168 VEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDP 247 (705) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp 247 (705) ..+.+++..+++.+|++.. T Consensus 144 -------------------------------------------------------------~~~~~r~~~~~l~~~~~~~ 162 (556) T protein:vir:73 144 -------------------------------------------------------------DQDVIRTMPFPIGSYYLAN 162 (556) T ss_pred -------------------------------------------------------------CCceEEEEEeecceeEEee Confidence 1123468899999999999 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcch-hhhhhhhhhccccccccccccccccccCeEEEEEE-EEEeeecCC Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEY-IKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEY-WGYWDIDGS 325 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~-w~k~~~~~d 325 (705) ++.+.++. |++++.+|..++.+++....++. +...+ .. +....+|.|+.+ |.+.+.+.+ T Consensus 163 d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---------~~-------~~~~~~~~v~~~V~pr~~~~~~ 223 (556) T protein:vir:73 163 SPRGSVDT---CIRQFSMTVRQMVQEFGLDNVSTSVKGMW---------EN-------GTYETWVEVNHCITPNVNRDSG 223 (556) T ss_pred CCCCCeEE---EEEEEeccHHHHHHHcCcccCCHHHHHHH---------hc-------CCccceEEEEEEEecccccccc Confidence 88765544 78999999999988743332221 11110 00 011234666554 333332221 Q ss_pred ---CeeEEEEEE-EE----CCEEEecccCCCCCCCcceEEeeeeeecCcccCCc-hHHHhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 326 ---GVTTPIVAS-WV----DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEA-DAELLSDNQKLIGALTRGMIDAMAR 396 (705) Q Consensus 326 ---g~~~~~~~~-~~----g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g-~~~~~~d~Q~~iN~~~~~~~d~~~~ 396 (705) +.-..+..+ |. ++++++ ++.| ..|||++..|.+.++..||+| ++....+..+.+|.+.+..+.+..+ T Consensus 224 ~~~~~~~p~~s~~~~~~~~~~~vl~--esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~ 299 (556) T protein:vir:73 224 KMDSKNKPYRSVYFESGGDSDKLLR--ESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDK 299 (556) T ss_pred ccCcccceEEEEEEEecCCCceecc--cCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHH Confidence 111222222 21 245664 4556 569999999999999999999 6999999999999999999999999 Q ss_pred cCCCcEEeeccccCchhhhhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchH-hcCCCc Q lcl|NC_021540. 397 SANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSF-SQGLTG 473 (705) Q Consensus 397 ~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~-~~G~~~ 473 (705) +++|+++++.+... ...+..||+++....+.. ...+.+.. .+.+ ..+.+.++.+.+.|....-..-+ +++. . T Consensus 300 ~~~pp~~v~~~~~~--~~~~~~pgg~~~~~~~~~-~~~i~p~~~~~~d~-~~~~~~i~~~~~rI~~af~~d~~~~l~~-~ 374 (556) T protein:vir:73 300 ATNPPMVAPTSLKN--QRVSLLPGDVTYLDVISG-QDGFKPAYLVNPNT-ADLLADIQDTRQTINSAYFVDLFMMLQN-I 374 (556) T ss_pred HhcCceeccccccc--cceeeccCccccccCCCC-ccceeeeccccccH-HHHHHHHHHHHHHHHHHhhcchhhhhcc-C Confidence 99999999887532 345677888766543322 22344432 2232 33344566677777665533321 1222 2 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeecc Q lcl|NC_021540. 474 DSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS 552 (705) Q Consensus 474 ~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~ 552 (705) ++..-||++|..+.+.....+..+.-++.. .+..+..+.+.++.+..-=|.. |+.+.+ .++.|..- T Consensus 375 ~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------P~~l~~-~~i~v~yi 441 (556) T protein:vir:73 375 NTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEP------------PDVLQG-MPLRIEYI 441 (556) T ss_pred CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhhcC-ceeEEEee Confidence 334469999999999999999998888865 7789999999988875432221 222222 12333222 Q ss_pred chhHHHH---H---HHHHHHHHHHHhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHH Q lcl|NC_021540. 553 NAETDAI---K---AQELSFMLQTMGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEA 622 (705) Q Consensus 553 ~~~~~~~---~---~q~~~~llq~~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~ 622 (705) ++-...+ . +.++++.+..+++..| ..+...++..+++..|.+. ..++.. . ..++..++++++ T Consensus 442 s~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~--~~irs~-e-----ev~~~rq~r~~~ 513 (556) T protein:vir:73 442 SVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSP--TVIVPQ-E-----QVQGIREERAKQ 513 (556) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCh--hhcCCH-H-----HHHHHHHHHHHH Confidence 2322222 2 2233333333333223 2345566777777777763 233321 1 111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQE 674 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~ 674 (705) ++.+..++.+++ .++.....+.+.. .....++......+.=++ T Consensus 514 qq~~~~~~~~~~-----a~~~~~~~~~~~~----~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 514 AQAAQAMAMGQA-----AAQGAKTLSETQT----SDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHH-----HHHHHHHhhhccC----CCHHHHHHHHHhhcCCCC Confidence 000000000000 0000000000000 000011000000000000 No 25 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=4.1e-29 Score=176.20 Aligned_cols=522 Identities=10% Similarity=0.004 Sum_probs=296.2 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC-CC-CCCCCCC---CCcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG-AY-KPKQQVG---RSSVQPKLIRKQAEWRYSALSEPFLN- 93 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~g---rs~~v~~~v~~~~e~~~~~l~~~f~~- 93 (705) |.+....+.|++.++..++..++..+++++-.+|..-.. .. .+....| ..+++++...+.++.+.+.||..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 777778888999999999988877665554444432111 01 1112223 35589999999999999999999998 Q ss_pred CCCEEEEeCCCcchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccc Q lcl|NC_021540. 94 DENIFSIAPKTWQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQY 167 (705) Q Consensus 94 ~~~~~~~~p~~~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~ 167 (705) +.+||++.+..++..+.+.. .+..+.-.|. .++-+..++..+++.+..|||++-+..+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 89999999976554433322 3333433333 3445555888888888888887743210 Q ss_pred ccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCC Q lcl|NC_021540. 168 VEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDP 247 (705) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp 247 (705) ..+.+++..+++.+|++.. T Consensus 145 -------------------------------------------------------------~~~~~rf~~~pl~~~~v~~ 163 (555) T protein:vir:10 145 -------------------------------------------------------------FDAVVYHHSLTAGEYAIAA 163 (555) T ss_pred -------------------------------------------------------------CCceEEEEEeecceeEEee Confidence 1123567889999999988 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee-ecC-- Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD-IDG-- 324 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~-~~~-- 324 (705) ++.+.++ =+++++.||..++.+++....++... .... +. +....+|.|+++.+..+ .+. T Consensus 164 d~~G~vd---~i~r~~~~t~~ql~~~fg~~~l~~~~----~~~~----~~-------~~~~~~v~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:10 164 DNQGRVN---TLYREFQITVAQMVREFGKDKCSTTV----QSLF----DR-------GALEQWVTVIHAIEPRADRDPSK 225 (555) T ss_pred CCCCCEE---EEEEEEeccHHHHHHhcCcccCCHHH----HHHH----hc-------CCCCceEEEEEEEeeccCcCcCC Confidence 8765552 36789999999999884333232110 0000 00 01123578888765422 211 Q ss_pred -CCeeEEEEEE-E---E-CCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 325 -SGVTTPIVAS-W---V-DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 325 -dg~~~~~~~~-~---~-g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) ++....+..+ | + |.++| .++.| ..|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.++.+.+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~ 301 (555) T protein:vir:10 226 RDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKS 301 (555) T ss_pred CCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111112222 2 1 33565 34455 479999999999999999999999999999999999999999999999 Q ss_pred CCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcch-HhcCCCccccc Q lcl|NC_021540. 399 NGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKS-FSQGLTGDSLG 477 (705) Q Consensus 399 ~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d-~~~G~~~~~~~ 477 (705) +|++.++.+... +..+..||++..+.+|..............-.+...+.++.+.+.|.... ..+ +.++...++.. T Consensus 302 ~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~ 378 (555) T protein:vir:10 302 NPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQ 378 (555) T ss_pred cCceeecccccc--ccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCc Confidence 999999887632 34677899987776554322222222222223445566788888887665 333 22333345555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhH Q lcl|NC_021540. 478 TTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAET 556 (705) Q Consensus 478 ~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~ 556 (705) -||++|....+.....+..++-++.. .+..+..+.+.++....-=|.. |+.+.+ .++.|..-++-. T Consensus 379 ~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------P~~l~~-~~i~v~yis~La 445 (555) T protein:vir:10 379 MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPP------------PQEMQG-VDLNVEFVSMLA 445 (555) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhhcC-ceeEEEeccHHH Confidence 79999999988899999988888864 7788999999888775322221 233332 224443333433 Q ss_pred HHHHHH---HHHHHHH---HHhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 557 DAIKAQ---ELSFMLQ---TMGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 557 ~~~~~q---~~~~llq---~~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) +.+... .+..+++ .+.+..| ..+...++..+++..|.+. ..++... + .++...|+.+ T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-e---v~~~r~qr~~------- 512 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-Q---VALIRKQRAD------- 512 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-H---HHHHHHHHHH------- Confidence 333322 2223333 3322222 2344566677777777663 2332211 0 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 627 MRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERE 676 (705) Q Consensus 627 ~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e 676 (705) ++++..++.++.+.++..+.. .+++...+...-..++.----. T Consensus 513 ~~q~~~~a~~~~q~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 513 QQQAAQQAALLNQGADTAAKL-------GSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHh-------cccccCcchhHHHHHhhhccCC Confidence 000000000000000000000 0000000000000000000000 No 26 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=4.1e-29 Score=176.20 Aligned_cols=522 Identities=10% Similarity=0.004 Sum_probs=296.2 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC-CC-CCCCCCC---CCcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG-AY-KPKQQVG---RSSVQPKLIRKQAEWRYSALSEPFLN- 93 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~g---rs~~v~~~v~~~~e~~~~~l~~~f~~- 93 (705) |.+....+.|++.++..++..++..+++++-.+|..-.. .. .+....| ..+++++...+.++.+.+.||..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 777778888999999999988877665554444432111 01 1112223 35589999999999999999999998 Q ss_pred CCCEEEEeCCCcchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccc Q lcl|NC_021540. 94 DENIFSIAPKTWQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQY 167 (705) Q Consensus 94 ~~~~~~~~p~~~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~ 167 (705) +.+||++.+..++..+.+.. .+..+.-.|. .++-+..++..+++.+..|||++-+..+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:98 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 89999999976554433322 3333433333 3445555888888888888887743210 Q ss_pred ccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCC Q lcl|NC_021540. 168 VEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDP 247 (705) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp 247 (705) ..+.+++..+++.+|++.. T Consensus 145 -------------------------------------------------------------~~~~~rf~~~pl~~~~v~~ 163 (555) T protein:vir:98 145 -------------------------------------------------------------FDAVVYHHSLTAGEYAIAA 163 (555) T ss_pred -------------------------------------------------------------CCceEEEEEeecceeEEee Confidence 1123567889999999988 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee-ecC-- Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD-IDG-- 324 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~-~~~-- 324 (705) ++.+.++ =+++++.||..++.+++....++... .... +. +....+|.|+++.+..+ .+. T Consensus 164 d~~G~vd---~i~r~~~~t~~ql~~~fg~~~l~~~~----~~~~----~~-------~~~~~~v~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:98 164 DNQGRVN---TLYREFQITVAQMVREFGKDKCSTTV----QSLF----DR-------GALEQWVTVIHAIEPRADRDPSK 225 (555) T ss_pred CCCCCEE---EEEEEEeccHHHHHHhcCcccCCHHH----HHHH----hc-------CCCCceEEEEEEEeeccCcCcCC Confidence 8765552 36789999999999884333232110 0000 00 01123578888765422 211 Q ss_pred -CCeeEEEEEE-E---E-CCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 325 -SGVTTPIVAS-W---V-DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 325 -dg~~~~~~~~-~---~-g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) ++....+..+ | + |.++| .++.| ..|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.++.+.+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~ 301 (555) T protein:vir:98 226 RDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKS 301 (555) T ss_pred CCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111112222 2 1 33565 34455 479999999999999999999999999999999999999999999999 Q ss_pred CCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcch-HhcCCCccccc Q lcl|NC_021540. 399 NGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKS-FSQGLTGDSLG 477 (705) Q Consensus 399 ~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d-~~~G~~~~~~~ 477 (705) +|++.++.+... +..+..||++..+.+|..............-.+...+.++.+.+.|.... ..+ +.++...++.. T Consensus 302 ~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~ 378 (555) T protein:vir:98 302 NPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQ 378 (555) T ss_pred cCceeecccccc--ccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCc Confidence 999999887632 34677899987776554322222222222223445566788888887665 333 22333345555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhH Q lcl|NC_021540. 478 TTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAET 556 (705) Q Consensus 478 ~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~ 556 (705) -||++|....+.....+..++-++.. .+..+..+.+.++....-=|.. |+.+.+ .++.|..-++-. T Consensus 379 ~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------P~~l~~-~~i~v~yis~La 445 (555) T protein:vir:98 379 MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPP------------PQEMQG-VDLNVEFVSMLA 445 (555) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhhcC-ceeEEEeccHHH Confidence 79999999988899999988888864 7788999999888775322221 233332 224443333433 Q ss_pred HHHHHH---HHHHHHH---HHhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 557 DAIKAQ---ELSFMLQ---TMGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 557 ~~~~~q---~~~~llq---~~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) +.+... .+..+++ .+.+..| ..+...++..+++..|.+. ..++... + .++...|+.+ T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-e---v~~~r~qr~~------- 512 (555) T protein:vir:98 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-Q---VALIRKQRAD------- 512 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-H---HHHHHHHHHH------- Confidence 333322 2223333 3322222 2344566677777777663 2332211 0 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 627 MRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERE 676 (705) Q Consensus 627 ~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e 676 (705) ++++..++.++.+.++..+.. .+++...+...-..++.----. T Consensus 513 ~~q~~~~a~~~~q~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 513 QQQAAQQAALLNQGADTAAKL-------GSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHh-------cccccCcchhHHHHHhhhccCC Confidence 000000000000000000000 0000000000000000000000 No 27 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=4.1e-29 Score=176.20 Aligned_cols=522 Identities=10% Similarity=0.004 Sum_probs=296.2 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC-CC-CCCCCCC---CCcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG-AY-KPKQQVG---RSSVQPKLIRKQAEWRYSALSEPFLN- 93 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~g---rs~~v~~~v~~~~e~~~~~l~~~f~~- 93 (705) |.+....+.|++.++..++..++..+++++-.+|..-.. .. .+....| ..+++++...+.++.+.+.||..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 777778888999999999988877665554444432111 01 1112223 35589999999999999999999998 Q ss_pred CCCEEEEeCCCcchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccc Q lcl|NC_021540. 94 DENIFSIAPKTWQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQY 167 (705) Q Consensus 94 ~~~~~~~~p~~~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~ 167 (705) +.+||++.+..++..+.+.. .+..+.-.|. .++-+..++..+++.+..|||++-+..+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d--------------- 144 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPD--------------- 144 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecC--------------- Confidence 89999999976554433322 3333433333 3445555888888888888887743210 Q ss_pred ccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCC Q lcl|NC_021540. 168 VEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDP 247 (705) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp 247 (705) ..+.+++..+++.+|++.. T Consensus 145 -------------------------------------------------------------~~~~~rf~~~pl~~~~v~~ 163 (555) T protein:vir:10 145 -------------------------------------------------------------FDAVVYHHSLTAGEYAIAA 163 (555) T ss_pred -------------------------------------------------------------CCceEEEEEeecceeEEee Confidence 1123567889999999988 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee-ecC-- Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD-IDG-- 324 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~-~~~-- 324 (705) ++.+.++ =+++++.||..++.+++....++... .... +. +....+|.|+++.+..+ .+. T Consensus 164 d~~G~vd---~i~r~~~~t~~ql~~~fg~~~l~~~~----~~~~----~~-------~~~~~~v~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:10 164 DNQGRVN---TLYREFQITVAQMVREFGKDKCSTTV----QSLF----DR-------GALEQWVTVIHAIEPRADRDPSK 225 (555) T ss_pred CCCCCEE---EEEEEEeccHHHHHHhcCcccCCHHH----HHHH----hc-------CCCCceEEEEEEEeeccCcCcCC Confidence 8765552 36789999999999884333232110 0000 00 01123578888765422 211 Q ss_pred -CCeeEEEEEE-E---E-CCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 325 -SGVTTPIVAS-W---V-DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 325 -dg~~~~~~~~-~---~-g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) ++....+..+ | + |.++| .++.| ..|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.++.+.+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~ 301 (555) T protein:vir:10 226 RDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKS 301 (555) T ss_pred CCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111112222 2 1 33565 34455 479999999999999999999999999999999999999999999999 Q ss_pred CCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcch-HhcCCCccccc Q lcl|NC_021540. 399 NGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKS-FSQGLTGDSLG 477 (705) Q Consensus 399 ~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d-~~~G~~~~~~~ 477 (705) +|++.++.+... +..+..||++..+.+|..............-.+...+.++.+.+.|.... ..+ +.++...++.. T Consensus 302 ~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~dlf~~l~~~~~~~ 378 (555) T protein:vir:10 302 NPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASF-YADLFLMLANGTNPQ 378 (555) T ss_pred cCceeecccccc--ccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCc Confidence 999999887632 34677899987776554322222222222223445566788888887665 333 22333345555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhH Q lcl|NC_021540. 478 TTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAET 556 (705) Q Consensus 478 ~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~ 556 (705) -||++|....+.....+..++-++.. .+..+..+.+.++....-=|.. |+.+.+ .++.|..-++-. T Consensus 379 ~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------P~~l~~-~~i~v~yis~La 445 (555) T protein:vir:10 379 MTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPP------------PQEMQG-VDLNVEFVSMLA 445 (555) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhhcC-ceeEEEeccHHH Confidence 79999999988899999988888864 7788999999888775322221 233332 224443333433 Q ss_pred HHHHHH---HHHHHHH---HHhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 557 DAIKAQ---ELSFMLQ---TMGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 557 ~~~~~q---~~~~llq---~~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) +.+... .+..+++ .+.+..| ..+...++..+++..|.+. ..++... + .++...|+.+ T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~e-e---v~~~r~qr~~------- 512 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDP--ELIVPGN-Q---VALIRKQRAD------- 512 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCc--cccCCHH-H---HHHHHHHHHH------- Confidence 333322 2223333 3322222 2344566677777777663 2332211 0 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 627 MRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERE 676 (705) Q Consensus 627 ~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e 676 (705) ++++..++.++.+.++..+.. .+++...+...-..++.----. T Consensus 513 ~~q~~~~a~~~~q~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 513 QQQAAQQAALLNQGADTAAKL-------GSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHh-------cccccCcchhHHHHHhhhccCC Confidence 000000000000000000000 0000000000000000000000 No 28 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=7.3e-29 Score=174.85 Aligned_cols=524 Identities=13% Similarity=0.073 Sum_probs=287.9 Q ss_pred CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCC--CCCCCCC---CCCcCCCHHHHHHHHHHHHHHHHhhcC-C Q lcl|NC_021540. 21 KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGA--YKPKQQV---GRSSVQPKLIRKQAEWRYSALSEPFLN-D 94 (705) Q Consensus 21 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---grs~~v~~~v~~~~e~~~~~l~~~f~~-~ 94 (705) =++++...|++.++..++..++..+++++-.+|..-... ....... ..++++++.....++.+.+.||..+|+ + T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 334478889999999999999886655544454321111 1111122 246789999999999999999999998 8 Q ss_pred CCEEEEeCCCcchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccc Q lcl|NC_021540. 95 ENIFSIAPKTWQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYV 168 (705) Q Consensus 95 ~~~~~~~p~~~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~ 168 (705) .+||++.+..++..+.+.. .+..+.-.|. .++-+..++..+++.+..|||++.+.++ T Consensus 81 ~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~Gta~l~~~~d---------------- 143 (559) T protein:vir:95 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSNLYQSLPQLYGSLGTYSTGAMAVLDD---------------- 143 (559) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeEeecC---------------- Confidence 9999998865543333222 2222322222 3444555777788888888887743221 Q ss_pred cCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCC Q lcl|NC_021540. 169 EATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPT 248 (705) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~ 248 (705) ..+.+++..+++.+|++..+ T Consensus 144 ------------------------------------------------------------~~~~~r~~~~~l~~~~v~~d 163 (559) T protein:vir:95 144 ------------------------------------------------------------DEDIIRTMPFPIGSYYLANS 163 (559) T ss_pred ------------------------------------------------------------CCceeEEEEeecCeEEEeeC Confidence 11235788999999999998 Q ss_pred ccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEE-EEEeeecCCCe Q lcl|NC_021540. 249 CNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEY-WGYWDIDGSGV 327 (705) Q Consensus 249 a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~-w~k~~~~~dg~ 327 (705) +.+.++. |++++.+|..++..++....++.. ..... .. +...+.|.|+++ |.+.+.+.++. T Consensus 164 ~~G~vd~---i~r~~~~t~~ql~~~fg~~~l~~~---~~~~~-~~-----------~~~~~~v~v~~~V~pr~~~~~~~~ 225 (559) T protein:vir:95 164 PRGSVDT---CFRKFSMTVRQLVQEFGLNNVSES---VKSMW-ES-----------GTYEKWIEVMHSVYPNIDRDTSKL 225 (559) T ss_pred CCCCeEE---EEEeEecCHHHHHHHcCcccCCHH---HHHHH-hc-----------CCCCCeEEEEEEEecccccccccc Confidence 8665544 688999999999987433322211 00000 00 011235777765 33333332221 Q ss_pred ---eEEEEEE-EE---C-CEEEecccCCCCCCCcceEEeeeeeecCcccCCc-hHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 328 ---TTPIVAS-WV---D-DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEA-DAELLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 328 ---~~~~~~~-~~---g-~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g-~~~~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) ...+..+ |. + .++++ ++.| .+|||++..|.+.++..||+| ++..+.+..+.+|.+.+..+.+..++. T Consensus 226 ~~~~~pf~s~~~e~~~~~~~~l~--esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~ 301 (559) T protein:vir:95 226 DSKNKPFKSVYYEVGGDNDKLLR--ESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKAT 301 (559) T ss_pred ccccceEEEEEEEecCCCceeee--cCCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHh Confidence 1112222 22 2 35665 4455 469999999999999999999 699999999999999999999999999 Q ss_pred CCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccC--ccchHHHHHHHHHHHHHHHHHhCcchHhcCCCcccc Q lcl|NC_021540. 399 NGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKY--PELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSL 476 (705) Q Consensus 399 ~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~--~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~ 476 (705) +|+++++.+... ...+..||+++.+..+.. ...+.+... +.+ ..+...++.+.+.|....-..-+.+-...++. T Consensus 302 ~pp~~v~~~~~~--~~~~l~pgg~~~~~~~~~-~~~i~p~~~~~~~~-~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~ 377 (559) T protein:vir:95 302 NPPMVAPTSLKN--QRASLLPGDITYIDQITG-QDGFRPAYLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) T ss_pred cCceeccccccc--cceeeeccceeeeCCCCC-cccceeecccccch-HHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCC Confidence 999999877543 335567999887765432 223444322 222 22234456667777665544322111122333 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchh Q lcl|NC_021540. 477 GTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAE 555 (705) Q Consensus 477 ~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~ 555 (705) .-||++|..+.+.....+..+.-++.. .+..++.+.+.++....-=|.. |+.+.+ .++.|..-++- T Consensus 378 rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------p~~l~~-~~i~v~~is~L 444 (559) T protein:vir:95 378 SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPP------------PDVMEG-MPLKVEYISVM 444 (559) T ss_pred CCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------cccccC-cceEEEeecHH Confidence 459999999999999999998888865 7789999999988876432221 222222 12223222232 Q ss_pred HHHH---H---HHHHHHHHHHHhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAI---K---AQELSFMLQTMGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 556 ~~~~---~---~q~~~~llq~~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) ...+ . +.++++.+..+++..| ..+...++..+++..|.+. ..++.. .+ .++..++++++++.+ T Consensus 445 a~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~--~~irs~-~e----v~~~rqqr~~~qq~~ 517 (559) T protein:vir:95 445 AQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSP--TVIVPQ-EQ----VEQARQQRAQQQQQQ 517 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCch--hhcCCH-HH----HHHHHHHHHHHHHHH Confidence 2222 2 2333333333333223 2445566777777777762 233321 10 000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQ 689 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~ 689 (705) +.++... ..+ ++.+...+++....+.. |.+... . .-.-++++ T Consensus 518 ----q~~~~~~--~aa-------~~~~~~~~~~~~~~~~l-~~~~~~-------~-~~~~~~~~ 559 (559) T protein:vir:95 518 ----QMMAMGM--AAA-------QGVKTLSEAKTSDPSVL-SAMANA-------V-SGQGGQSQ 559 (559) T ss_pred ----HHHHHHH--HHH-------HhhhccccccCCChhHH-HHHHHh-------h-cCccccCC Confidence 0000000 000 00000011110000000 000000 0 00000000 No 29 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=2.6e-28 Score=171.80 Aligned_cols=503 Identities=11% Similarity=0.037 Sum_probs=282.3 Q ss_pred HHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC-C----CCCCC------CCCCCcCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_021540. 24 PKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG-A----YKPKQ------QVGRSSVQPKLIRKQAEWRYSALSEPFL 92 (705) Q Consensus 24 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~------~~grs~~v~~~v~~~~e~~~~~l~~~f~ 92 (705) =..+.|++-++..++..++..+ .|.++|..+. . ..... .+..++++++.....++.+.+.||..+| T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~---~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQ---IWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHH---HHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhc Confidence 2345667777778887776644 4555554331 1 00001 1224668899999999999999999999 Q ss_pred C-CCCEEEEeCCCcchHHHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccc Q lcl|NC_021540. 93 N-DENIFSIAPKTWQDREAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVF 165 (705) Q Consensus 93 ~-~~~~~~~~p~~~~D~~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~ 165 (705) + +.+||.+.+...+..+.+ ...+..+.-.|. ..+-+..++..+++.+..|||++.+..+ T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~d------------- 143 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSNFNLEANETYIDLCGYGNAIMVEEED------------- 143 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEeccC------------- Confidence 8 799999987554322222 122333333333 3444455777777888888887754210 Q ss_pred ccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheee Q lcl|NC_021540. 166 QYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTI 245 (705) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~ 245 (705) ....+.+++..++..+|++ T Consensus 144 -------------------------------------------------------------~~~~~~~r~~~~pl~~~~v 162 (547) T protein:vir:10 144 -------------------------------------------------------------EDEEGSVVFQSSPIQDSYF 162 (547) T ss_pred -------------------------------------------------------------CCCCCceeEEEeecceEEE Confidence 0113456789999999999 Q ss_pred CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEE-eeecC Q lcl|NC_021540. 246 DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGY-WDIDG 324 (705) Q Consensus 246 Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k-~~~~~ 324 (705) ..++.+.++. +++++.||..++.++.....++.. ..... ..+ .+....++.++.|... .+.+. T Consensus 163 ~~d~~G~v~~---i~r~~~~t~~qi~~~fg~~~l~~~---v~~~~-----~~~-----~~~~~~~~~v~~~v~~~~~~~~ 226 (547) T protein:vir:10 163 EEDSRGQVVN---FYRVFRWTPAQIYDRFGDEGTPEA---IIKKA-----KEA-----SNQAALKQEVVMCVFTRYDKKQ 226 (547) T ss_pred eeCCCcCeee---eeeeeeccHHHHHHhcCcccCCHH---HHHHH-----hcC-----CCcccceEEEEEEEeeccCCCC Confidence 9988666644 688999999999887433333211 11111 000 1111234666665433 22221 Q ss_pred CC---e-----eEEEEEEE--EC--CEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 325 SG---V-----TTPIVASW--VD--DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 325 dg---~-----~~~~~~~~--~g--~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) +. . -.....+| .+ .++++ ++.| .+|||++..|.+.++..||.|++..+.+..+.+|.+.+.++. T Consensus 227 ~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~--esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~ 302 (547) T protein:vir:10 227 NRNAGTVLAPTERPFGKKWILKEGAVQLGE--EGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLR 302 (547) T ss_pred CccccceeeccccceeEEEEEecCceeeee--cCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHH Confidence 11 0 00111222 23 34554 4455 469999999999999999999999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ++.++++|+++++.+.+.. .++..||+++...+.. .+.+++...-.......++.+.+.|....=+.-+.+. T Consensus 303 ~~~~~~~pp~~v~~~g~~~--~~~~~pgg~~~~~~~~----~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~-- 374 (547) T protein:vir:10 303 SSEKVIDPAIMVTERGLIS--DIDLGASGLTVVRDME----SMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMK-- 374 (547) T ss_pred HHHHHhcCceecccccccc--cceecCCeeeecCCcc----cceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcC-- Confidence 9999999999998665432 2556799988765443 3334444433344556677777777765433222221 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhc-c-cceeEEe Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNL-V-GSFDIKL 549 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~-~-~~~dv~v 549 (705) ++..-||++|..+.+.....+..+..+|.. .+..+..+.+.++....-=|.+ |+.+ . +-.++.| T Consensus 375 -~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------p~~l~~~~~~~~~v 441 (547) T protein:vir:10 375 -DSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGEL------------PSKLLESGKAAMDI 441 (547) T ss_pred -CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhhhccCcceEEE Confidence 334579999999999999999998888874 7788999999888775332221 1221 1 1223444 Q ss_pred eccchhHHHHHH---HHHHHHHHH---Hhhhch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHH Q lcl|NC_021540. 550 SISNAETDAIKA---QELSFMLQT---MGQSLP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQ 619 (705) Q Consensus 550 ~~~~~~~~~~~~---q~~~~llq~---~~~~~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q 619 (705) ..-++-.+.+.. +.+..+++. +++..| ..+...++..++...|.+. ..++.. .+ .+ +..+++ T Consensus 442 ~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~--~~irs~-ee----v~-~~r~qr 513 (547) T protein:vir:10 442 VYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQ--TLMRPK-AK----VT-SIRKNR 513 (547) T ss_pred EeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCCh--hccCCH-HH----HH-HHHHHH Confidence 443443333332 222333333 222222 2345566677777777652 223221 10 00 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 620 LEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQE 668 (705) Q Consensus 620 ~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~ 668 (705) +++++ ...|+.+..+..++...+...... .++- + T Consensus 514 ~~~~q-----~~~qaa~~~~~g~~m~~~~~~~a~-----~~~~-----~ 547 (547) T protein:vir:10 514 SQTQQ-----KAEQAAIAEAEGNAMEAQGKGQAA-----LKEN-----Q 547 (547) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHhhcCcccc-----hhcc-----C Confidence 00000 000110000001111111000000 0000 0 No 30 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.96 E-value=5.1e-27 Score=164.72 Aligned_cols=512 Identities=15% Similarity=0.063 Sum_probs=288.5 Q ss_pred CCCH--HHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccC-----CCCCCCCCCCC---CcCCCHHHHHHHHHHHHHHHH Q lcl|NC_021540. 20 WKNK--PKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVT-----GAYKPKQQVGR---SSVQPKLIRKQAEWRYSALSE 89 (705) Q Consensus 20 ~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~gr---s~~v~~~v~~~~e~~~~~l~~ 89 (705) |+|+ .++..|++.++..++..++..+++++-.+|-.-. ..+.+....|+ ++++++.-...++.+.+.||. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 5554 4778888888888888887755544444442211 11112223343 357888889999999999999 Q ss_pred hhcC-CCCEEEEeCCCcchHHHHH------HHHHHHHHHHH-hhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhc Q lcl|NC_021540. 90 PFLN-DENIFSIAPKTWQDREAAR------QNEAILNYQFN-NQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTEN 161 (705) Q Consensus 90 ~f~~-~~~~~~~~p~~~~D~~~A~------~~t~~~n~~~~-~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~ 161 (705) .+|+ +.+||.+.+-.+...+.+. +.+..+.-++. ...+-+..++..+++.+..|||++.+.. T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~---------- 150 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEH---------- 150 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEee---------- Confidence 9998 7899999886554433332 22333333332 2344556677788888888888875421 Q ss_pred ccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechh Q lcl|NC_021540. 162 VPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYH 241 (705) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~ 241 (705) ...+.+++..++.. T Consensus 151 ------------------------------------------------------------------~~~~~~~f~~~pl~ 164 (549) T protein:vir:10 151 ------------------------------------------------------------------DVGKGIVYRNVPMQ 164 (549) T ss_pred ------------------------------------------------------------------cCCCeeEEEEEEcC Confidence 01123578889999 Q ss_pred heeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEE-e Q lcl|NC_021540. 242 NVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGY-W 320 (705) Q Consensus 242 ~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k-~ 320 (705) +|++..++.+.++. +++++.||..+|.++.-...++. .. ..... ..+.++|.||++=+. . T Consensus 165 ~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l~~---~v-~~~~~------------~~~~~~~~v~~~V~pr~ 225 (549) T protein:vir:10 165 RLWFAENNSGLIDK---THVQWELTLRQAAQRFGRENLSP---SM-QSTLE------------KDPEKSAIFYHAVEPRA 225 (549) T ss_pred eEEEeeCCCCCeEE---EEEEeecCHHHHHHhcCcccCCH---HH-HHHhh------------cCCCceEEEEEEeecCC Confidence 99999887655533 78999999999988733222221 10 00000 012356777765221 1 Q ss_pred eec---CCCeeEEEE-EEE--ECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 321 DID---GSGVTTPIV-ASW--VDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAM 394 (705) Q Consensus 321 ~~~---~dg~~~~~~-~~~--~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~ 394 (705) +.+ .++.-..+. +++ .++++|+. +.| .+|||++..|.+.++..||.|++....+-.+.+|.+.+..+... T Consensus 226 ~~~~~~~~~~~~pf~sv~~e~~~~~il~e--sg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 301 (549) T protein:vir:10 226 DRDPRKLDGRNMQFASYWLDEGRDRIVQN--SGF--RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGA 301 (549) T ss_pred CCCccccccccCceEEEEEEecCCEeecc--CCc--ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 111 122111222 222 34566654 445 46999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCcc Q lcl|NC_021540. 395 ARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGD 474 (705) Q Consensus 395 ~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~ 474 (705) .++.+|.++++.+.+.. ..+..||++..+..+......+.++....-.+....+++.+.+.|....-..-+..-. + T Consensus 302 ~~~~~p~~~v~~~g~~~--~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~--~ 377 (549) T protein:vir:10 302 QKLVDPPLLANEDGVLD--GFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILV--D 377 (549) T ss_pred HHHhcCceeeccccccc--cceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhc--C Confidence 99999999998765422 2345678765543332223344554444334456667788888887765433322222 3 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcc-cceeEEeecc Q lcl|NC_021540. 475 SLGTTTAGVQGVIGASGKRELGILRRLA-NGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLV-GSFDIKLSIS 552 (705) Q Consensus 475 ~~~~~a~~i~~l~~~~~~~~~~~~~n~~-~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~-~~~dv~v~~~ 552 (705) +..-||++|....+.....+..+.-++. +.+..++.+.+.++.+..-=|+. |+.+. ...++.|..- T Consensus 378 ~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~------------p~~l~~~~~~~~i~yi 445 (549) T protein:vir:10 378 SGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDM------------PQELIDAGADVDVEYD 445 (549) T ss_pred CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------ChhhhcCCceeEEEee Confidence 4457999999998888889999888886 57889999999888774332221 22221 1223334333 Q ss_pred chhHHHHH---HHHHHHHHHHHh---hhchh----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHH Q lcl|NC_021540. 553 NAETDAIK---AQELSFMLQTMG---QSLPF----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEA 622 (705) Q Consensus 553 ~~~~~~~~---~q~~~~llq~~~---~~~~~----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~ 622 (705) ++-.+.+. .+.+..+++.++ +..|. .+...++..+++..|.+. ..++.. ++.+ +..++.+ T Consensus 446 s~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~--~~irs~-------eev~-~~r~~~~ 515 (549) T protein:vir:10 446 SPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPV--EAMSTD-------EELQ-AQQAAEA 515 (549) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCc--cccCCH-------HHHH-HHHHHHH Confidence 33333333 223333333322 22222 334456666777777663 222221 0100 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~ 671 (705) ++.|.+++.+.+. ..+.+....++++.. .|.+.+ T Consensus 516 ~qqq~~~~~~~a~---~a~~~a~~~~~~~ta------------~~~~~~ 549 (549) T protein:vir:10 516 QAAQMQQMLAAAP---VAAGAIKDLSDAQTA------------AQTARV 549 (549) T ss_pred HHHHHHHHHHHHH---HHHHHHHhhhhhcCC------------CcccCC Confidence 0000000000000 000010011111000 000000 No 31 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.96 E-value=3.8e-27 Score=165.44 Aligned_cols=512 Identities=10% Similarity=0.022 Sum_probs=279.6 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCC--CCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQV--GRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--grs~~v~~~v~~~~e~~~~~ 86 (705) |.++++.+|. -..+++-++..++..++..+++++-.+|..-.-...+.... ...+++++.....++.+.+. T Consensus 1 m~~~~~~~~~-------~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (535) T protein:vir:15 1 MADSKRTGLG-------EDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASK 73 (535) T ss_pred CCccchhccc-------hHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHH Confidence 6677766555 12356667777777776655444444443322111111111 12457888888999999999 Q ss_pred HHHhhcCCCCEEEEeCCCc-------chHHHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTW-------QDREAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~-------~D~~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) ||..+|++.+||.+.+... ++.+.+ +..+..+.-.| ..++-+..++..+++.+..|||++.+.++ T Consensus 74 l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~~~~- 151 (535) T protein:vir:15 74 LMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAGNALLYLPEP- 151 (535) T ss_pred HHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCceeEEeecC- Confidence 9999999999999987432 111111 12333443333 34666777888999999999998865321 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) ..+.+ T Consensus 152 ---------------------------------------------------------------------------~~~~~ 156 (535) T protein:vir:15 152 ---------------------------------------------------------------------------EGSYN 156 (535) T ss_pred ---------------------------------------------------------------------------CCCce Confidence 01223 Q ss_pred eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 234 EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) +++.++..+|++..++.+.++. +++++.+|..+|... +-.+ ...... ......+|.| T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~-~~~~-----------~~~~~~--------~~~~~~~v~v 213 (535) T protein:vir:15 157 PMKLYRLSSYVVQRDAYGNVLQ---IVTRDQIAFGALPED-VRSA-----------VEKAGG--------EKKMDEMVDV 213 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeE---EEEeEeecHHHHHHH-HhHh-----------hhcccc--------ccCCCCceeE Confidence 5677888999998887655543 789999999988543 1111 000000 0112346888 Q ss_pred EEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 314 YEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 314 ~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |++.++. .+++...+++ .+.+..+....+.|+.+.|||++..|.+.++..||.|++..+.+..+.+|.+.+..+.. T Consensus 214 ~~~v~~~--~~~~~~~~~~--e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 289 (535) T protein:vir:15 214 YTHVYLD--EESGDYLKYE--EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKM 289 (535) T ss_pred EEEEEEe--cCCCcEEEEE--EeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 8876442 2233222222 23343333334455557899999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 394 MARSANGQRGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 394 ~~~~~~~~~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..++.+|.++++.+.+.... .....+|.++.-+++. + .+.......-.+.....++.+.+.|.... ..+..... T Consensus 290 ~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~v~g~~~~-v--~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~- 364 (535) T protein:vir:15 290 SMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED-I--DFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQR- 364 (535) T ss_pred HHHHhcCceeecccccccchhcccCCceeeecCCccc-c--eeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccC- Confidence 99999999999776654433 3333444433222221 1 11222222233456677788888887765 22222112 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) ++..-||++|..+.+.....+..+..+|.. .+..++.+.+.++.+..-=+.+ .. ..+.+.+.. T Consensus 365 -~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~-----------p~----~~v~~~yis 428 (535) T protein:vir:15 365 -TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL-----------PK----EAVEPTIST 428 (535) T ss_pred -CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-----------Cc----cceeEEEec Confidence 223468999999999999999999888886 6788999999888764322221 11 112333322 Q ss_pred cc-hhHHHHHHHHHHHHHHHHhhhchh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SN-AETDAIKAQELSFMLQTMGQSLPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 552 ~~-~~~~~~~~q~~~~llq~~~~~~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) +. +..+...++.+...++++....|. .+...++..+++..|.+.. ..++. + + +.++.++++++. T Consensus 429 ~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~-~i~~~-~-----e---ev~~~~~q~~~~ 498 (535) T protein:vir:15 429 GLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS-GILLT-D-----E---QKQALMMQDAAQ 498 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChh-hhcCC-H-----H---HHHHHHHHHHHH Confidence 21 122333344444444444332222 2344566666666666521 11111 0 0 000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETG 670 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~ 670 (705) +..++.+ +.+.... ...+.+ .-+......+..-.++. T Consensus 499 ~~~~~~a-~~~g~~~----~~~~~~---~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 499 TGIENAA-ATGGAGV----GALATS---SPEAMQGAAAQAGLDAT 535 (535) T ss_pred HHHHHHH-HHHHhhc----cchhcc---ChHHHHHHHhccCCCCC Confidence 0000000 0000000 000000 00000000000000000 No 32 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=99.96 E-value=1.1e-26 Score=163.01 Aligned_cols=527 Identities=15% Similarity=0.104 Sum_probs=266.0 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC--CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG--AYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLN-DEN 96 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~-~~~ 96 (705) |.. .|++-++..++..++..+++++-.+|..-.- ..+.....-..+++++.....++.+.+.||..+|+ +.+ T Consensus 1 m~~-----~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 75 (555) T protein:vir:17 1 MKH-----SAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTS 75 (555) T ss_pred Chh-----HHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCc Confidence 222 2666677777777766554443333332211 11111111235688888999999999999999998 789 Q ss_pred EEEEeCCCcchH------HH-HH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccc Q lcl|NC_021540. 97 IFSIAPKTWQDR------EA-AR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVP 163 (705) Q Consensus 97 ~~~~~p~~~~D~------~~-A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~ 163 (705) ||.+.+..++.. +. +. ..+..+...| ..++-+..++..+++.+..|||++-+. + T Consensus 76 WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~---~--------- 142 (555) T protein:vir:17 76 FFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDI-AESSDRVHLEMAMKHLIVTGNALLYQG---K--------- 142 (555) T ss_pred ccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEec---C--------- Confidence 999998543311 11 11 1222333222 345566668888888888888876210 0 Q ss_pred ccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe Q lcl|NC_021540. 164 VFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV 243 (705) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~ 243 (705) -.++.++..+| T Consensus 143 ---------------------------------------------------------------------~~~~~~pl~~y 153 (555) T protein:vir:17 143 ---------------------------------------------------------------------KNLKLYPLDRF 153 (555) T ss_pred ---------------------------------------------------------------------CceeEEEcCeE Confidence 00234566778 Q ss_pred eeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcC-cchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeee Q lcl|NC_021540. 244 TIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSN-LEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDI 322 (705) Q Consensus 244 ~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~ 322 (705) ++..++.+.++. +++++.+|..+|.+..-... .+.+............................+.+|.++.+. T Consensus 154 ~v~~d~~G~vd~---v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~-- 228 (555) T protein:vir:17 154 VVSRDGEGNVME---IVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRK-- 228 (555) T ss_pred EEeeCCCcCeeE---EEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeeccccc-- Confidence 887776554433 78999999999987732111 111111000000000000000000011122335566554431 Q ss_pred cCCCeeEEEEEEEECCEEEe--cccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021540. 323 DGSGVTTPIVASWVDDVMIR--LEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANG 400 (705) Q Consensus 323 ~~dg~~~~~~~~~~g~~iL~--~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~ 400 (705) +|... +..-+++.++. ..++|| ..|||++..|.+.++..||.|++..+.+..+.+|.+.+..+.+..++.+| T Consensus 229 --~~~~~--~~~e~~~~~v~~~l~e~g~--~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p 302 (555) T protein:vir:17 229 --DGQVK--WHQECDGKVIPGSNSSAPY--THNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKV 302 (555) T ss_pred --CCeeE--EEEecCceeccccccccCc--ccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 22211 11223444432 356777 47999999999999999999999999999999999999999999999999 Q ss_pred cEEeeccccCchhhhhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccch Q lcl|NC_021540. 401 QRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGT 478 (705) Q Consensus 401 ~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~ 478 (705) .++++.+.+.....+...+.+.|. +|.. ..+.+.+ .+.--+.....++.+.+.|.+...+. + ..++..- T Consensus 303 p~lv~~~g~~~~~~l~~~~~g~v~--~g~~--~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~----~-~~d~~r~ 373 (555) T protein:vir:17 303 VFMVSPSATTKPQNLALAANGAII--QGRP--DDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML----Q-VRQSERT 373 (555) T ss_pred ceeeccccccCcceeecCCCceee--cCCc--ccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc----C-CCCcccc Confidence 999977765444433333323332 3221 1233332 22223445566777777777654321 1 2233446 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHH Q lcl|NC_021540. 479 TTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETD 557 (705) Q Consensus 479 ~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~ 557 (705) ||++|..+.+.....+..++.+|.. .+..++.+.+.++.+..-=+.+ |+... ..++.+... +..+ T Consensus 374 TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~------------p~~~v-~~~i~~~l~-~l~r 439 (555) T protein:vir:17 374 TATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQL------------PKDLV-QPTVVAGLW-GVGR 439 (555) T ss_pred hHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC------------CHhhh-ccceeehHH-HHHH Confidence 8999999999999999999999874 7888999999988875432221 11111 122333222 2234 Q ss_pred HHHHHHHHHHHHHHhhhc-hh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 558 AIKAQELSFMLQTMGQSL-PF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 558 ~~~~q~~~~llq~~~~~~-~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k 631 (705) ++.++.+...+++++... |+ .+...++..++...|++ ....++.+. ..++..++++++++++.+++++ T Consensus 440 ~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~-p~~ivrs~e----ev~~~rq~~~~~~~q~~~~~qa- 513 (555) T protein:vir:17 440 GQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGID-TLQLINSPE----TMKQLGDQQKQDMVQASLINQA- 513 (555) T ss_pred HHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCC-hhhhcCCHH----HHHHHHHHHHHHHHHHHHHHHH- Confidence 555566666665554332 11 23334555566655553 112222110 0001000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 632 LQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQR 690 (705) Q Consensus 632 ~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~ 690 (705) ++ .+.+ .. +.+.........+. .+.... +..+ .+-.++.+.. T Consensus 514 --~~----~~~~--~~--~~~~~~~~~~~~~~--a~~~~~---a~~~--~~~~~~~~~~ 555 (555) T protein:vir:17 514 --GQ----LAKT--PM--AEQAMQLIQQQQEG--AQDAGA---AESE--TSSAEAQAGA 555 (555) T ss_pred --HH----HHhh--hh--hhhHHhccccchhh--hhHHHH---HHhh--cCCcccccCC Confidence 00 0000 00 00000000000000 000000 0000 0000000000 No 33 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.95 E-value=2e-26 Score=161.45 Aligned_cols=511 Identities=12% Similarity=0.058 Sum_probs=280.1 Q ss_pred CCCHH---HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC--CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 20 WKNKP---KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG--AYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLND 94 (705) Q Consensus 20 ~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~ 94 (705) |+++. ....|++-++..++..++..+++++-.+|..-.- .++......+.+++++.....++.+.+.||..+|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC Confidence 44432 4556777788777777766555444444433221 111111112346888899999999999999999998 Q ss_pred CCEEEEeCCCcc-------hHHH------HHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhc Q lcl|NC_021540. 95 ENIFSIAPKTWQ-------DREA------ARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTEN 161 (705) Q Consensus 95 ~~~~~~~p~~~~-------D~~~------A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~ 161 (705) .+||.+.+..++ +... -+..+..+...| ..++-+..++..+++.+..|||++.+- + T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~---e------- 149 (536) T protein:vir:21 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLLYLP---E------- 149 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEEe---e------- Confidence 899999875433 1111 122334443333 345666678888888888899987431 0 Q ss_pred ccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechh Q lcl|NC_021540. 162 VPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYH 241 (705) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~ 241 (705) . ...+...++.++.. T Consensus 150 ---------~--------------------------------------------------------~~~~~~~f~~~pl~ 164 (536) T protein:vir:21 150 ---------P--------------------------------------------------------EGSNYNPMKLYRLS 164 (536) T ss_pred ---------C--------------------------------------------------------CCCceeeEEEEEcC Confidence 0 00111246788889 Q ss_pred heeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee Q lcl|NC_021540. 242 NVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD 321 (705) Q Consensus 242 ~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~ 321 (705) +|++..++.+.++. +++++.+|..+|..... .+.. ..... ....++|.||++-.+ . T Consensus 165 ~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~fg-~~~~-----------~~~~~--------~~~~~~v~v~~~v~~-~ 220 (536) T protein:vir:21 165 SYVVQRDAFGNVLQ---MVTRDQIAFGALPEDIR-KAVE-----------GQGGE--------KKADETIDVYTHIYL-D 220 (536) T ss_pred eEEEeeCCCCCeeE---EeeeeeccHHHHHHhhh-hhhc-----------ccccc--------cccccceeEEEEEEE-e Confidence 99998876654544 78999999999887622 1100 00000 112346777766432 2 Q ss_pred ecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_021540. 322 IDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQ 401 (705) Q Consensus 322 ~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~ 401 (705) .+ ++...+|. - +++..+.-+...|+...|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.....+.++. T Consensus 221 ~~-~~~~~~~~-e-~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~ 297 (536) T protein:vir:21 221 ED-SGEYLRYE-E-VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) T ss_pred cC-CCcEEEEe-c-cCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 22 22222222 1 244444334455556789999999999999999999999999999999999999999999999999 Q ss_pred EEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHH Q lcl|NC_021540. 402 RGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTT 480 (705) Q Consensus 402 ~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a 480 (705) ++++.+.+...+ .....+|.++.-+++. ..+.+.....-.+.....++.+.+.|....-+. +++ ..++..-|| T Consensus 298 ~lv~p~g~~~~~~~~~~~~g~~v~g~~~~---v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~-~~~~~r~TA 371 (536) T protein:vir:21 298 GLVNPAGITQPRRLTKAQTGDFVTGRPED---ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAV-QRTGERVTA 371 (536) T ss_pred cccCcccccchhhhccCCCcceecCCccc---ceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcc-cCCCCCccH Confidence 999877664433 3455666654333221 112233333334556677888888888766332 122 123334689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHH Q lcl|NC_021540. 481 AGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAI 559 (705) Q Consensus 481 ~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~ 559 (705) ++|..+.+.....+..+..+|.. .+..+..+.+.++....-=+. +..+... .++...++ +-.+.+ T Consensus 372 tEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-----------~p~~~v~--~~~vs~l~-~l~r~~ 437 (536) T protein:vir:21 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE-----------LPKEAVE--PTISTGLE-AIGRGQ 437 (536) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCC-----------CChhhcc--ceEEecHH-HHHHHH Confidence 99999998899999888888876 677888888888865432111 1111111 22222222 223344 Q ss_pred HHHHHHHHHHHHhhhch-----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 560 KAQELSFMLQTMGQSLP-----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQA 634 (705) Q Consensus 560 ~~q~~~~llq~~~~~~~-----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa 634 (705) .++.+...++.+....| ..+...++..+++..|.. ....++.. +..++..++++++++. ++++ T Consensus 438 ~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~-p~~~irt~------eev~~~r~q~~~~~~~-----~~~a 505 (536) T protein:vir:21 438 DLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGID-TSGILLTE------EQKQQKMAQQSMQMGM-----DNGA 505 (536) T ss_pred HHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCC-hhhhcCCH------HHHHHHHHHHHHHHHH-----HHHH Confidence 44445444444333222 235556666677766652 22233221 1000000000000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 635 EIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 635 ~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~ 671 (705) .. ..+.+. +++. ...+...++.+...++-++ T Consensus 506 ~~---~~~~~~--~~~~-~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 506 AA---LAQGMA--AQAT-ASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HH---HHHHHH--HHHh-cChhhHHhhhhccccCCCC Confidence 00 000000 0000 0000000000000111111 No 34 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.95 E-value=2e-26 Score=161.45 Aligned_cols=511 Identities=12% Similarity=0.049 Sum_probs=280.3 Q ss_pred CCCHH---HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC--CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 20 WKNKP---KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG--AYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLND 94 (705) Q Consensus 20 ~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~ 94 (705) |+++. ....|++-++..++..++..+++++-.+|..-.- .++......+.+++++.....++.+.+.||..+|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC Confidence 44432 4556777778777777766555544444443221 111111112346888889999999999999999998 Q ss_pred CCEEEEeCCCcc-------hHHH------HHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhc Q lcl|NC_021540. 95 ENIFSIAPKTWQ-------DREA------ARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTEN 161 (705) Q Consensus 95 ~~~~~~~p~~~~-------D~~~------A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~ 161 (705) .+||.+.+..++ +... -+..+..+...| ..++-+..++..+++.+..|||++.+- + T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~---e------- 149 (536) T protein:vir:10 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFEALKQLVVAGNVLLYLP---E------- 149 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEEe---e------- Confidence 899999875433 1111 122334443333 345666678888888888899987431 0 Q ss_pred ccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechh Q lcl|NC_021540. 162 VPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYH 241 (705) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~ 241 (705) . ...+...++.++.. T Consensus 150 ---------~--------------------------------------------------------~~~~~~~~~~~pl~ 164 (536) T protein:vir:10 150 ---------P--------------------------------------------------------EGSNYNPMKLYRLS 164 (536) T ss_pred ---------C--------------------------------------------------------CCCceeeEEEEEcC Confidence 0 00111246788889 Q ss_pred heeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee Q lcl|NC_021540. 242 NVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD 321 (705) Q Consensus 242 ~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~ 321 (705) +|++..++.+.++. +++++.+|..+|..... .+.. .... .....++|.||++-++. T Consensus 165 ~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~fg-~~~~-----------~~~~--------~~~~~~~v~v~~~V~~~- 220 (536) T protein:vir:10 165 SYVVQRDAFGNVLQ---MVTRDQIAFGALPEDIR-KAVE-----------GQGG--------EKKADETIDVYTHIYLD- 220 (536) T ss_pred eEEEeeCCCCCeeE---EeeeeeccHHHHHHhhh-hhhc-----------cccc--------ccCcccceEEEEEEEEe- Confidence 99998877655544 78999999999877621 1100 0000 01123468888775432 Q ss_pred ecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_021540. 322 IDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQ 401 (705) Q Consensus 322 ~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~ 401 (705) ++++...+|+ . +++..+..+.+.|+...|||++..|.+.++..||.|++..+.+-.+.+|.+.+..+.....+.++. T Consensus 221 -~~~~~~~~~~-e-~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~ 297 (536) T protein:vir:10 221 -EASGEYLRYE-E-VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) T ss_pred -cCCCcEEEEE-e-ecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 2233222222 2 344433333455555789999999999999999999999999999999999999999999999999 Q ss_pred EEeeccccCch-hhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHH Q lcl|NC_021540. 402 RGMSKNLLDPV-NERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTT 480 (705) Q Consensus 402 ~~~~~~av~~~-d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a 480 (705) ++++.+.+... +.....+|.++.-+++. ..+.+.....-.+.....++.+.+.|....-+. +++ ..++..-|| T Consensus 298 ~lv~p~g~~~~~~~~~~~~g~~v~g~~~~---v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~-~~~~~r~TA 371 (536) T protein:vir:10 298 GLVNPAGITQPRRLTKAQTGDFVTGRPED---ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAV-QRTGERVTA 371 (536) T ss_pred cccCcccccchhhhccCCCcceecCCccc---ceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcc-cCCCCCccH Confidence 99987766443 33455666654333221 112233333334556677888888888766332 122 123334689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHH Q lcl|NC_021540. 481 AGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAI 559 (705) Q Consensus 481 ~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~ 559 (705) ++|..+.+.....+..+..+|.. .+..+..+.+.++....-=+. +..+... .++...++ +-.+.+ T Consensus 372 tEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-----------~p~~~v~--~~~vs~l~-~l~r~~ 437 (536) T protein:vir:10 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE-----------LPKEAVE--PTISTGLE-AIGRGQ 437 (536) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCC-----------CChhhcc--ceEEecHH-HHHHHH Confidence 99999999899999988888876 677888888888865422111 1111111 22222222 223344 Q ss_pred HHHHHHHHHHHHhhhch-----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 560 KAQELSFMLQTMGQSLP-----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQA 634 (705) Q Consensus 560 ~~q~~~~llq~~~~~~~-----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa 634 (705) .++.+...++.+....| ..+...++..+++..|.. ....++.. +..++..++++++++ .++++ T Consensus 438 ~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~-p~~~irt~------eev~~~r~q~~~~~~-----~~~~a 505 (536) T protein:vir:10 438 DLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGID-TSGILLTE------EQKQQKMAQQSMQMG-----MDNGA 505 (536) T ss_pred HHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCC-chhhcCCH------HHHHHHHHHHHHHHH-----HHHHH Confidence 44444444444333222 234556666666666651 22223221 100000000000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 635 EIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 635 ~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~ 671 (705) .. ..+.+.. ++. ...+...++.+...++-++ T Consensus 506 ~~---~~~~~~~--~~~-~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 506 AA---LAQGMAA--QAT-ASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HH---HHHHHHH--HHh-cCchhHHhhhhccccCCCC Confidence 00 0000000 000 0000000000000111111 No 35 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.95 E-value=1.1e-26 Score=162.83 Aligned_cols=512 Identities=11% Similarity=0.025 Sum_probs=276.2 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCC--CCCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQ--VGRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~grs~~v~~~v~~~~e~~~~~ 86 (705) |.+++...+. -..+++-++..++..++..+++++-.+|..-.-...+... ....+++++.....++.+.+. T Consensus 1 m~~~~~~~~~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (535) T protein:vir:33 1 MADSKRTGLG-------EDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASK 73 (535) T ss_pred CChhhhhccC-------hhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHH Confidence 4444433332 1235666777777777665544444444332211111111 122457788888999999999 Q ss_pred HHHhhcCCCCEEEEeCCCcc-------hHHHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTWQ-------DREAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~~-------D~~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) ||..+|++.+||.+.+..++ +.+.+ +..+..+...| ..++-+..++..+++.+..|||++.+.++ T Consensus 74 l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~- 151 (535) T protein:vir:33 74 LMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAGNALLYLPEP- 151 (535) T ss_pred HHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCceeEEeecC- Confidence 99999999999999875421 11111 22333343333 45666677888999999999998865321 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) ..+.+ T Consensus 152 ---------------------------------------------------------------------------~~~~~ 156 (535) T protein:vir:33 152 ---------------------------------------------------------------------------EGSYN 156 (535) T ss_pred ---------------------------------------------------------------------------CCCce Confidence 01123 Q ss_pred eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 234 EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) +++.++..+|++..++.+.++. +++++.+|..+|.+... .+.. ..... ....+.+.+ T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~~~-~~~~-----------~~~~~--------k~~~~~~~v 213 (535) T protein:vir:33 157 PMKLYRLSSYVVQRDAYGNVLQ---IVTRDQIAFGALPEDVR-SAVE-----------KSGGE--------KKMDEMVDV 213 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeE---EEeeEeecHHHHHHHhh-hhhc-----------ccccc--------cccccCCeE Confidence 5778889999998887655544 78999999999865411 1100 00000 011234566 Q ss_pred EEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 314 YEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 314 ~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |.|.++ +. .+|...+++ .+.+..+....+.|+.+.|||++..|.+.++..||.|++..+.+..+.+|.+.+..+.. T Consensus 214 ~~~v~~-~~-~~~~~~~~~--~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 289 (535) T protein:vir:33 214 YTHVYL-DE-ESGDYLKYE--EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKM 289 (535) T ss_pred EEEEEe-eC-CCCcEEEEE--EEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 666433 22 223223332 34444444445556667899999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccccCchhh-hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 394 MARSANGQRGMSKNLLDPVNE-RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 394 ~~~~~~~~~~~~~~av~~~d~-~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..++.+|.++++.+.+..... ....+|.++.-+++. + .+.......-.+.....++.+.+.|.... ..+..... T Consensus 290 ~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~v~g~~~~-v--~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~- 364 (535) T protein:vir:33 290 SMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED-I--DFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQR- 364 (535) T ss_pred HHHHhcCceeeccccccchhhcccCCceeeecCCccc-c--eeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccC- Confidence 999999999998766544333 333333333222221 1 12222222233456677788888887765 22222112 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) ++..-||++|..+.+.....+..+..+|.. .+..++.+.+.++.+..-=+.+ .. ..+.+.+.. T Consensus 365 -~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~-----------p~----~~v~~~yis 428 (535) T protein:vir:33 365 -TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL-----------PK----EAVEPTIST 428 (535) T ss_pred -CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-----------Cc----cceeEEEec Confidence 223468999999999999999999888886 6788999999888764322221 11 112333322 Q ss_pred cc-hhHHHHHHHHHHHHHHHHhhhchh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SN-AETDAIKAQELSFMLQTMGQSLPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 552 ~~-~~~~~~~~q~~~~llq~~~~~~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) +. +..+...++.+...++++....|. .+...++..+++..|.+.. ..++. +.+ .++.+++++.. T Consensus 429 ~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~-~i~~~-~ee--------~~~~~~q~~~~ 498 (535) T protein:vir:33 429 GLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS-GILLT-DEQ--------KQALMMQDAAQ 498 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh-HhcCC-HHH--------HHHHHHHHHHH Confidence 21 122333344444444444332222 2344566666676666421 11111 100 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFV 665 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~ 665 (705) +..++.+.+..+. .+....... +..+..+...=++.. T Consensus 499 ~~~~~~~~~~g~~-~~~~~~~~~--~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 499 TGVENAAAAGGAG-VGALATSSP--EAMQGAAAKAGLNAT 535 (535) T ss_pred HHHHHHHHhhhhh-hcchhhcCC--hhHHHHHHhccCCCC Confidence 0000000000000 000000000 000000000000000 No 36 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=99.95 E-value=2.4e-25 Score=155.54 Aligned_cols=509 Identities=11% Similarity=0.058 Sum_probs=277.3 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCC---CcCCCHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGR---SSVQPKLIRKQAEWRYS 85 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr---s~~v~~~v~~~~e~~~~ 85 (705) |.+++...+. ...+++-++..++..++..++.++-.+|..-.- ..+....|. .+++++.....++.+.+ T Consensus 1 m~~~~~~~~~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~~~~~~~~~~~~~~~dst~~~a~~~LAa 72 (532) T protein:vir:99 1 MAEVEKTGFA-------ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV-FPSATADGSTSYTTPWQSIGARGLNNLAS 72 (532) T ss_pred Ccchhhcccc-------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-cCCCCCcchhhccccccchHHHHHHHHHH Confidence 4444432222 234666777777777766555444444443221 111122332 45788889999999999 Q ss_pred HHHHhhcC-CCCEEEEeCCCcch-------HHHHH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEee Q lcl|NC_021540. 86 ALSEPFLN-DENIFSIAPKTWQD-------REAAR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSW 151 (705) Q Consensus 86 ~l~~~f~~-~~~~~~~~p~~~~D-------~~~A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W 151 (705) .||..+|+ +.+||.+.+..++- ...++ ..+..+...| ..++-+..++..+++.+..|||++-+.+ T Consensus 73 ~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~ 151 (532) T protein:vir:99 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-ESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) T ss_pred HHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEecc Confidence 99999998 69999999853321 11111 2223333233 4466667788899998899999886543 Q ss_pred cchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccC Q lcl|NC_021540. 152 CLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKN 231 (705) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 231 (705) ... ...+ T Consensus 152 ~~~-------------------------------------------------------------------------~~~~ 158 (532) T protein:vir:99 152 TEQ-------------------------------------------------------------------------VEGQ 158 (532) T ss_pred ccc-------------------------------------------------------------------------ccCc Confidence 100 0012 Q ss_pred cceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeE Q lcl|NC_021540. 232 QPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKI 311 (705) Q Consensus 232 ~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 311 (705) ...++.++..+|++..++.+.+++ ++++..++.+.|-+. +.. ....... .. ....+| T Consensus 159 ~~~f~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~--------~~~----~~~~~~~-------~~-~p~~~v 215 (532) T protein:vir:99 159 SNAPKLYKLHNFVVERDAYDNVLQ---IVTEDKIARAALPED--------VRK----SLEDAQG-------DQ-NPSEEV 215 (532) T ss_pred ccceEEEEcCeEEEeeCCCCCeee---EeeeeeecHHhcChH--------HHH----Hhhcccc-------cc-CCCcce Confidence 235677888899998877655543 677888887765211 110 1100000 01 123458 Q ss_pred EEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHH Q lcl|NC_021540. 312 VVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMI 391 (705) Q Consensus 312 ~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~ 391 (705) .||++..+.+ ++..+ .+| ..+++..+...++-|+...|||++..|.+.++..||.|++....+-.+.+|.+.+..+ T Consensus 216 ~v~~~v~~~~-~~~~~-~~~--~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 291 (532) T protein:vir:99 216 TIYTHVYRDP-EAMVF-RSY--QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIV 291 (532) T ss_pred EEEEEEEecC-CCCee-EEE--EeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 8888766532 22112 222 2334433333344444567999999999999999999999999999999999999999 Q ss_pred HHHHhcCCCcEEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcC Q lcl|NC_021540. 392 DAMARSANGQRGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQG 470 (705) Q Consensus 392 d~~~~~~~~~~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G 470 (705) .....+.+|.++++.+.+...+ .....+|.++.-+++ ...+.+.....-.+.....++.+.+.|....= .+. +. T Consensus 292 ~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~---~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~-~~ 366 (532) T protein:vir:99 292 KMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQ---DVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNS-AV 366 (532) T ss_pred HHHHHHcCCCceeccccccchhhhccCCCcceecCCcc---cceeeecccccchhHHHHHHHHHHHHHHHHHh-hhh-cc Confidence 9999999999999876654433 334455554322221 11122222222334556677888888877551 121 11 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEe Q lcl|NC_021540. 471 LTGDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKL 549 (705) Q Consensus 471 ~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v 549 (705) ..++..-||++|....+.....+..+..++.. .+..++.+.+.++.+..-=+.+ |+...+ .++.+ T Consensus 367 -~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------p~~~~~-~~iv~ 432 (532) T protein:vir:99 367 -QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNL------------PKEAVE-PAIAT 432 (532) T ss_pred -cCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------Chhhcc-cceee Confidence 12333458999999988899999888888875 6788989999888764322211 222222 12322 Q ss_pred eccchhHHHHHHHHHHHHHHHHhhhchhH----HHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 550 SISNAETDAIKAQELSFMLQTMGQSLPFD----MTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 550 ~~~~~~~~~~~~q~~~~llq~~~~~~~~~----~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) .++ +-.+.+.++.+...++.+....|.. +...++..+++..|.+-. ..++. ++ + .++..++++.. T Consensus 433 ~is-~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~-~i~r~------~e-e--~~~~~~q~~~~ 501 (532) T protein:vir:99 433 GLE-ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT-GLILT------QQ-D--KQAKMAEASTA 501 (532) T ss_pred cch-HHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChh-hccCC------HH-H--HHHHHHHHHHH Confidence 222 4455666666666666655544432 233444555555554211 11111 00 0 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQE 674 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~ 674 (705) + + +.++..++.+...+ .......++.++-++ T Consensus 502 ~---~---~~~a~~~~~~~~~~------------~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 502 A---G---MVTAGQQMGAAGGQ------------AAAAMMQQQAGMPTQ 532 (532) T ss_pred H---H---HHHHHHHHHHHHHH------------hcchhHHhhcCCCCC Confidence 0 0 00000000000000 000011111111111 No 37 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.94 E-value=1.4e-24 Score=151.40 Aligned_cols=498 Identities=9% Similarity=0.018 Sum_probs=268.7 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC--CCCCCCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP--KQQVGRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~grs~~v~~~v~~~~e~~~~~ 86 (705) |+...- -....|++-++..++..++..+++++-.+|..-.....+ .....+.+++++.....++.+.+. T Consensus 1 ~~~~~~---------~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~ 71 (522) T protein:vir:94 1 MAEREG---------FAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAK 71 (522) T ss_pred Ccccch---------hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHH Confidence 222100 123446677777777777665544444444332211111 111123447888888999999999 Q ss_pred HHHhhcCCCCEEEEeCCCc-------chHHHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTW-------QDREAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~-------~D~~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) ||..+|++.+||.+.+..+ ++...+ ...+..+... ...++-+..++..+++.+..|||++.+ .. T Consensus 72 l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-~~~snf~~~~~~~~~~L~~~G~a~l~~--~~ 148 (522) T protein:vir:94 72 LMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAY-METNSFRVPLFEALKQLIVSGNCLLYI--PE 148 (522) T ss_pred HHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhCcEeEee--ec Confidence 9999998889999987532 222222 2223333222 344566667888888888889988732 10 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccC-c Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKN-Q 232 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~-~ 232 (705) ...+ . T Consensus 149 --------------------------------------------------------------------------~~~~~~ 154 (522) T protein:vir:94 149 --------------------------------------------------------------------------PEQGTY 154 (522) T ss_pred --------------------------------------------------------------------------cCCCce Confidence 0001 1 Q ss_pred ceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEE Q lcl|NC_021540. 233 PEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIV 312 (705) Q Consensus 233 ~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 312 (705) ..++.++..++++..++.+.++. ++++..++.+.|-.. +. ....... .....+|. T Consensus 155 ~~~~~~pl~~y~v~~d~~G~vd~---i~r~~~~~~~~l~~~--------~~----~~~~~~~----------~~p~~~v~ 209 (522) T protein:vir:94 155 SPMRMYRLVSYVVQRDAFGNILQ---IVTIDKVAFSALPED--------VK----SQLNADD----------YEPDTELE 209 (522) T ss_pred eeEEEEEcceEEEeeCCCcCeEE---EeeeeeccHHhcchH--------HH----HHHhccc----------CCccceEE Confidence 24667788888887776554433 677888887765221 11 1110000 01135688 Q ss_pred EEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 313 VYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 313 v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) ||++.++. +++.. ++.. +.+..+.-.++-|+...|||++..|.+.++..||.|++..+.+..+.+|.+.+..+. T Consensus 210 v~~~v~~~---~~~~~-~~~~--~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~ 283 (522) T protein:vir:94 210 VYTHIYRQ---DDEYL-RYEE--VEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITK 283 (522) T ss_pred EEEEEEee---CCcee-EEee--ccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH Confidence 88887653 23322 2221 234333333444555789999999999999999999999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGL 471 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~ 471 (705) +..++.+|+++++++.+.... .....+|.++.-+++ ...+.+...+.-.+.....++.+.+.|....-+. +++. T Consensus 284 ~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v~g~~~---~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~ 358 (522) T protein:vir:94 284 MAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVE---DINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQ 358 (522) T ss_pred HHHHHhCCceeecccccccchheeccCCceeecCCcc---cceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhcc Confidence 999999999999876654443 334445543322221 1122233333334456677888888888876443 2221 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEee Q lcl|NC_021540. 472 TGDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLS 550 (705) Q Consensus 472 ~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~ 550 (705) .++..-||++|..+.+.....+..+..+|.. .+..++.+.+.++....-=+.+ |+. .+.+.+. T Consensus 359 -~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------p~~---~v~v~~~ 422 (522) T protein:vir:94 359 -RNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDL------------PKE---AVEPTVS 422 (522) T ss_pred -CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------Ccc---cEEeeEe Confidence 2233468999999999999999998888876 6788999988888765432221 111 1222222 Q ss_pred ccc-hhHHHHHHHHHHHHHHHHhhhchh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHH Q lcl|NC_021540. 551 ISN-AETDAIKAQELSFMLQTMGQSLPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQE 624 (705) Q Consensus 551 ~~~-~~~~~~~~q~~~~llq~~~~~~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~ 624 (705) .+. +-.+...++.+...++.++...|. .+...++..+++..|.+- ...++.. .+.++..++ + + T Consensus 423 s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~-~~ivr~~-----ee~~~~~~q-~---~- 491 (522) T protein:vir:94 423 TGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDT-AGLLLTQ-----DEKIQRMAE-Q---S- 491 (522) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCCh-hhccCCH-----HHHHHHHHH-H---H- Confidence 111 122333344444444433332222 234455566666666521 1122210 000000000 0 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 625 LQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQE 674 (705) Q Consensus 625 ~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~ 674 (705) +.++.++++.. +.+...+.. .....+.+. ++ T Consensus 492 -~~~~~~~~~~~--~~~~~~a~~-------~~~~~~~~~---------~~ 522 (522) T protein:vir:94 492 -SQQAVVQGASA--AGANMGAAV-------GQGAGEDMA---------QA 522 (522) T ss_pred -HHHHHHHHHHH--HHHHhhhhh-------hcccchhhh---------cC Confidence 00000000000 000000000 000000000 00 No 38 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.94 E-value=5.6e-25 Score=153.55 Aligned_cols=519 Identities=12% Similarity=0.066 Sum_probs=273.3 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC--CCCCCCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP--KQQVGRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~grs~~v~~~v~~~~e~~~~~ 86 (705) |.+++..+.+ -..|++-++..++..++..+++++-.+|..-.....+ .....+.+++++.....++.+.+. T Consensus 1 ~~~~~~~~~~-------~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (543) T protein:vir:88 1 MAETKREGLA-------EEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAK 73 (543) T ss_pred CcccccCcch-------HHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHH Confidence 5555543333 2345666777777777665544444444432211111 111112357888889999999999 Q ss_pred HHHhhcCCCCEEEEeCCCcc-------hHHHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTWQ-------DREAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~~-------D~~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) ||..+|++.+||.+.+.... +.+.+ ...+..+.-. ...++-+..++..+++.+..|||++.+. T Consensus 74 l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-~~~snf~~~~~~~~~~L~~~G~a~ly~~--- 149 (543) T protein:vir:88 74 VMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSY-MEANSYRVTLFELIRQLALAGTALIYLP--- 149 (543) T ss_pred HHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhCceeeeec--- Confidence 99999999999999874322 11111 1222333322 3456666778888999889999987321 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) + . + + ...+.- T Consensus 150 ~----------------~-----------------~--------------------~-----------------~~~~~~ 159 (543) T protein:vir:88 150 P----------------P-----------------D--------------------A-----------------SSNSYN 159 (543) T ss_pred c----------------C-----------------c--------------------c-----------------ccceec Confidence 0 0 0 0 000111 Q ss_pred eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 234 EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) .+..++..++++..++.+.+ .-++++..++..+|... +.. ... .... ....++|.| T Consensus 160 ~~~~~pl~~y~v~~d~~G~v---~~i~r~~~~~~~~l~~~-~~~-----------~v~-~~~~--------~~p~~~~~v 215 (543) T protein:vir:88 160 PMKLYTLHNHVVQRDAFGNV---LQIVTLDKVAYAALPED-VRN-----------SLS-GGQE--------YKPEQELEV 215 (543) T ss_pred ceEEeEcceEEEeeCCCCCe---eeeeeeeeccHHHHhHH-hhH-----------HHH-HHhh--------cCCccceEE Confidence 24566777888876665434 33678899999987543 111 110 0000 011245788 Q ss_pred EEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 314 YEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 314 ~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |++-++. ++.+...++. -+.+..+....+-|+.+.|||++..|.+.++..||.|++..+.+..+.+|.+.+..+.. T Consensus 216 ~~~V~pr--~~~~~~~~~~--~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 291 (543) T protein:vir:88 216 YTHIYID--DESGDFLSYQ--EIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKF 291 (543) T ss_pred EEEEEee--cCCCcccccc--cccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 7763322 2222111111 23455555555666667899999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 394 MARSANGQRGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 394 ~~~~~~~~~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..++.+|.++++.+.+.... .....+|.++ +|......+.......-.+.....++.+.+.|....-+. ..... T Consensus 292 ~~~~~~pp~~v~~~g~~~~~~~~~~~~g~~v---~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-~~~~~- 366 (543) T protein:vir:88 292 AMISSKVVGLVNPNGITQVRRLVKAQTGDFV---AGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLN-SAVQR- 366 (543) T ss_pred HHHHhcCceeeccccccchhhcccCCCceee---cCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhh-hhccC- Confidence 99999999999776654333 2233333322 222111112222222233456677888888888766332 22222 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) ++..-||++|..+.+.....+..++.+|.. .+..++.+.+.++....-=+.+ ..+ .+.+.+.. T Consensus 367 -~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~-----------p~~----~v~~~~vs 430 (543) T protein:vir:88 367 -SGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNL-----------PQE----AVEPTVTT 430 (543) T ss_pred -CCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-----------chh----ceeeeEEe Confidence 233468999999999999999999888876 6788999999988775433221 111 12222211 Q ss_pred c-chhHHHHHHHHHHHHHHHHhhhchh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 S-NAETDAIKAQELSFMLQTMGQSLPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 552 ~-~~~~~~~~~q~~~~llq~~~~~~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) + .+-.+.+..+.+...++.++...++ .+...++..+++..|.+ ....++.. .+.++.+++++.. T Consensus 431 ~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~-~~~i~r~~---------~e~~~~~~q~~~q 500 (543) T protein:vir:88 431 GAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGID-TAGLLLTE---------AEKAQAQSQEMLK 500 (543) T ss_pred cHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCC-hhhhcCCH---------HHHHHHHHHHHHH Confidence 1 2334455555566666655443332 23344555555555552 11122211 0111110000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) ++.++.+.++..-..+.+.. .-+.++..-.....+. .--..| T Consensus 501 ~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~--------------------------~p~~~~ 542 (543) T protein:vir:88 501 QGGLNAAAGIGSGVAAQATA------------SPEAMESAMDTAGVQP--------------------------GPIATQ 542 (543) T ss_pred HHHHHHHHHHhhchhhhhcc------------ChHHHHHHhhhcCCCC--------------------------CCCCCC Confidence 00000000000000000000 0000000000000000 000000 No 39 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=99.93 E-value=3.6e-24 Score=149.10 Aligned_cols=497 Identities=13% Similarity=0.060 Sum_probs=270.7 Q ss_pred HHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC-CCCCCC---CcCCCHHHHHHHHHHHHHHHHhhcC-CCCEEEEe Q lcl|NC_021540. 27 SDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP-KQQVGR---SSVQPKLIRKQAEWRYSALSEPFLN-DENIFSIA 101 (705) Q Consensus 27 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gr---s~~v~~~v~~~~e~~~~~l~~~f~~-~~~~~~~~ 101 (705) ..+++-++..++..++..+++++-.+|..-.....+ ....|+ .+++++...+.++.+.+.|+..+|+ +.+||.+. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKLQ 80 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 457778888888887775654444444432111111 111222 3478888899999999999999998 58999999 Q ss_pred CCCcchHH------------HHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccccc Q lcl|NC_021540. 102 PKTWQDRE------------AARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVE 169 (705) Q Consensus 102 p~~~~D~~------------~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~ 169 (705) +...+..+ .-...+..+...| ..++-+..++..+++.+..|||++-+. + T Consensus 81 ~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~---~--------------- 141 (522) T protein:vir:10 81 VRDDKLGEELDPQIRSELDLSFSKMERMIMDYI-AASNDRVAVHQALKHLIVGGNALIFMG---K--------------- 141 (522) T ss_pred CChHHHhhhcChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCceeEEEc---C--------------- Confidence 85432111 1122333333333 356667778888888899999986320 0 Q ss_pred CCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCc Q lcl|NC_021540. 170 ATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTC 249 (705) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a 249 (705) . .++.++..+|++..++ T Consensus 142 ---------------------------------------~------------------------~~~~~pl~~y~v~~d~ 158 (522) T protein:vir:10 142 ---------------------------------------D------------------------GLKTFPLTRYVINRDG 158 (522) T ss_pred ---------------------------------------C------------------------CceEEEcceEEEeeCC Confidence 0 0234567889988776 Q ss_pred cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeE Q lcl|NC_021540. 250 NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTT 329 (705) Q Consensus 250 ~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~ 329 (705) .+.++. +++++++|..+|....-...++ ...+ .. .....+|.||++.+.. .+.|... T Consensus 159 ~G~vd~---i~r~~~~t~~ql~~~fg~~~~~----~~~~----~~----------~~~~~~v~v~~~v~p~--~~~~~~~ 215 (522) T protein:vir:10 159 DGNVLE---IVTKELISRKVLDIELPEPKPN----TGID----ES----------STTNDDVTIYTYVKLD--KSSGRWV 215 (522) T ss_pred CCCeeE---EEeeeeccHHHHHHhcchhccc----hhhh----cc----------cCCCCceEEEEEEEee--ccCCceE Confidence 655543 7899999999998762211110 0000 00 1123458888875542 1222211 Q ss_pred EEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc Q lcl|NC_021540. 330 PIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLL 409 (705) Q Consensus 330 ~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av 409 (705) ++ ....+.++...++-++...|||++..|...++..||.|++..+.+-.+.+|.+.+..+....++.+|.++++.+.+ T Consensus 216 ~~--~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~ 293 (522) T protein:vir:10 216 WH--QEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSST 293 (522) T ss_pred EE--EccCCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccc Confidence 11 1234444433334334467999999999999999999999999999999999999999999999999999976665 Q ss_pred Cchhhh-hhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHH Q lcl|NC_021540. 410 DPVNER-KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIG 488 (705) Q Consensus 410 ~~~d~~-~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~ 488 (705) .....+ ....| .+ .+|....-.+.......-.+.....++.+.+.+.+.. +++...++..-||++|....+ T Consensus 294 ~~~~~l~~~~~~-~~--v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aF-----l~~~~~d~~rvTAtEV~~r~~ 365 (522) T protein:vir:10 294 TKPATIAKAGNG-AI--VQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAF-----LVMNVRNAERVTAEEVRLTQL 365 (522) T ss_pred cccccccCCCCc-ce--ecCCCccceeecccccccchHHHHHHHHHHHHHHHHH-----hhccCCCCCCCCHHHHHHHHH Confidence 443333 22222 22 2322111111112222223445666777777777653 344344455569999999988 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHH Q lcl|NC_021540. 489 ASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFM 567 (705) Q Consensus 489 ~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~l 567 (705) .....+..++.++.. .+..++.+.+.++.+..-=+.+ |+++. +.. .|...++-.+.+.++.+... T Consensus 366 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~------------p~~~~-~~~-~v~~is~Laraq~~~~l~~~ 431 (522) T protein:vir:10 366 ELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKL------------PKDIV-RPT-IVAGVNALGRGQDRESLTAF 431 (522) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------Ccccc-ccc-cccchhHHHHHHHHHHHHHH Confidence 899999998888865 7788989988887764211111 12221 111 23333344566667777777 Q ss_pred HHHHhhhc-hhH-----HHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 568 LQTMGQSL-PFD-----MTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPY 641 (705) Q Consensus 568 lq~~~~~~-~~~-----~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~ 641 (705) ++.++... |+. +...++..++...|.+.. ..++.+ . ..+++++++++.++++..++++ . T Consensus 432 ~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~-~ivrt~-e---ev~~~~q~~q~~~~~~~~~~~a----------~ 496 (522) T protein:vir:10 432 VGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVL-NLVKTE-Q---QLAEEQQAAQQQAAQQSLVDQA----------G 496 (522) T ss_pred HHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChh-hhcCCH-H---HHHHHHHHHHHHHHHHHHHHHH----------H Confidence 66664433 222 223445666666665421 111110 0 0000000000000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 642 EAQAEAAKARKANTEADLNTLDFVEQETGVKQE 674 (705) Q Consensus 642 ~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~ 674 (705) ++ +.. .... -.+.++...|...-.++ T Consensus 497 ~~----~~~--~~~~-~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 497 QM----TGS--PLMD-PTKNPQLMDEEQPPMEE 522 (522) T ss_pred HH----hcc--cccC-ccccHHHHHHhCCCCCC Confidence 00 000 0000 00000000000000000 No 40 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=99.93 E-value=1.3e-23 Score=146.06 Aligned_cols=513 Identities=11% Similarity=0.042 Sum_probs=269.0 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCC--CCCCCCCCCCCcCCCHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTG--AYKPKQQVGRSSVQPKLIRKQAEWRYSA 86 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~grs~~v~~~v~~~~e~~~~~ 86 (705) |..++. + +.-.-..+++-++..++..++..+++++-.+|..-.. ...........+++++.....++.+.+. T Consensus 1 ~~~~~~---~---~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 74 (535) T protein:vir:94 1 MASSQK---R---EGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASK 74 (535) T ss_pred CCchhh---h---hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHH Confidence 322222 1 0011223666677777777766554444444433221 1111112223557888899999999999 Q ss_pred HHHhhcCCCCEEEEeCCCcc-------hHHHHH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 87 LSEPFLNDENIFSIAPKTWQ-------DREAAR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 87 l~~~f~~~~~~~~~~p~~~~-------D~~~A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) ||..+|++.+||.+.+.... +.+.+. ..+..+. .....++-+..++..+++.+..|||++.+.++. T Consensus 75 l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~-~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~ 153 (535) T protein:vir:94 75 LMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILM-NYIESNSYRVTLFETLKQLVVAGNALLYIPEPE 153 (535) T ss_pred HHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHH-HHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCc Confidence 99999999999999774311 122111 1222222 223456666778888888889999988653310 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) ...+ T Consensus 154 ----------------------------------------------------------------------------~~~~ 157 (535) T protein:vir:94 154 ----------------------------------------------------------------------------GTYN 157 (535) T ss_pred ----------------------------------------------------------------------------Cccc Confidence 0012 Q ss_pred eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 234 EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) +++.++..+|++..++.+.++. +++++.++.+.|-.. + ........ .......|.| T Consensus 158 ~f~~~pl~~y~v~~d~~G~vd~---i~r~~~~~~~~l~~~-~-----------~~~~~~~~---------~~~~~~~v~v 213 (535) T protein:vir:94 158 PMKLYRLSSYVVQRDAFGTVLQ---IVTLDKTAYAALPED-V-----------RNSMDSSQ---------EHKGDEMIDV 213 (535) T ss_pred ceEEEEcCeEEEeeCCCCCeEE---EEeeeeccHHHhhHH-H-----------HHHHHhcc---------ccCCCceeEE Confidence 4556777888887776554443 678889998886432 1 11110000 0112355778 Q ss_pred EEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 314 YEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 314 ~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |++-++ +. +++...+ ...++|..+.-..+.++...|||++..|.+.++..||.|++....+-.+.+|.+.+..+.+ T Consensus 214 ~~~v~~-~~-~~~~~~~--~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 289 (535) T protein:vir:94 214 YTHIYL-DE-ESGEYLK--YEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKM 289 (535) T ss_pred EEEEEe-eC-CCCcEEE--EEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 877433 22 2222222 2345555543334444447899999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccccCchh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 394 MARSANGQRGMSKNLLDPVN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 394 ~~~~~~~~~~~~~~av~~~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..++.++.++++.+.+...+ .....+|.++.-.++. . .+.+.....-.+....+++.+.+.|....= .+. ++ . T Consensus 290 ~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~-v--~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~-~~-~ 363 (535) T protein:vir:94 290 SMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPED-I--SFLQLEKAADFSVARAVSEQIEGRLSYAFM-LNS-AV-Q 363 (535) T ss_pred HHHhccCCcccccccccchhhcccCCCceeecCCccc-c--eeeecccccchhHHHHHHHHHHHHHHHHHh-Hhh-hc-c Confidence 99999999999876654433 3344455443322211 1 122223222334556677888888876551 111 11 1 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) .++..-||++|..+.+.....+..+.-+|.. .+..+..+.+.++....-=+. + |++.. +.++.... T Consensus 364 ~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~-----------~-p~~~v-~~~~vs~l 430 (535) T protein:vir:94 364 RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPE-----------L-PKEAV-EPTISTGM 430 (535) T ss_pred CCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCC-----------C-Chhhc-cceEeehH Confidence 2333458999999988888899888888875 678888998888876532221 1 11111 12221111 Q ss_pred cchhHHHHHHHHHHHHHHHHhhhchh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SNAETDAIKAQELSFMLQTMGQSLPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 552 ~~~~~~~~~~q~~~~llq~~~~~~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) .+-.+.+.++.+...++.+....|. .+...++..+++..|.+.. ..++.. +. .++.++++++.+ T Consensus 431 -a~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~-~i~rs~------ee---v~~~~~q~~~~~ 499 (535) T protein:vir:94 431 -EALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTS-GILKTP------EE---KQQEMAEAAQGT 499 (535) T ss_pred -HHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChh-hhcCCH------HH---HHHHHHHHHHHH Confidence 1223344445555555544433222 2344455666666665521 112111 10 011000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 627 MRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELM 680 (705) Q Consensus 627 ~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~ 680 (705) .+++.+.+..+.... .+.. ....++....++++. -- T Consensus 500 ~~~~~~~~~g~~~~~-----~~~~-------~~~~~~~~~~~~g~~------~~ 535 (535) T protein:vir:94 500 AMQNAAASAGAGAGT-----MATA-------SPENMKAAAAQAGMA------PN 535 (535) T ss_pred HHHHHHHHHHHhhhc-----cccc-------ChHHHHHHHHHhccC------CC Confidence 000000000000000 0000 000000000000000 00 No 41 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=99.92 E-value=4.3e-23 Score=143.22 Aligned_cols=493 Identities=13% Similarity=0.068 Sum_probs=269.5 Q ss_pred CCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_021540. 18 EDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLN-DEN 96 (705) Q Consensus 18 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~-~~~ 96 (705) -+|.=....+.|++-++..++..++..++.++-.+|..-.-...+.......+++++.....++.+.+.|+..+|+ +.+ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 80 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLFPAQRS 80 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 4555445667888888888888887766555444444432111111112345688888999999999999999998 579 Q ss_pred EEEEeCCCcchHH---------HHHH----HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccc Q lcl|NC_021540. 97 IFSIAPKTWQDRE---------AARQ----NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVP 163 (705) Q Consensus 97 ~~~~~p~~~~D~~---------~A~~----~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~ 163 (705) ||.+.+..++..+ .++. .+..+. .....++-+..++..+++.+..||+++.+. T Consensus 81 WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~-~~l~~snf~~~~~~~~~~L~~~G~a~ly~~------------- 146 (517) T protein:vir:10 81 FFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAM-LYGESLQFRPAVVEAFKHLIVTGNVMMYHP------------- 146 (517) T ss_pred cccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHH-HHHHhcCcHHHHHHHHHHHHhHCeEEEEEe------------- Confidence 9999985432111 1111 122222 223456666778888888888888866210 Q ss_pred ccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe Q lcl|NC_021540. 164 VFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV 243 (705) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~ 243 (705) .+...++.++..+| T Consensus 147 ------------------------------------------------------------------~~~~~~~~~pl~~y 160 (517) T protein:vir:10 147 ------------------------------------------------------------------DKTSPIQAVPLHHY 160 (517) T ss_pred ------------------------------------------------------------------CCCCcEEEEEcCeE Confidence 00113456777889 Q ss_pred eeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeec Q lcl|NC_021540. 244 TIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDID 323 (705) Q Consensus 244 ~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~ 323 (705) ++..++.+.+++ ++++.++|..+|...+- .+.. .. ..... .....+|.||++-++ . T Consensus 161 ~v~~d~~G~v~~---ivrr~~~~~~~l~~~~~-~~~~--------~~-~~~~~--------~~~~~~v~v~~~v~~---~ 216 (517) T protein:vir:10 161 CVRRDNNGTVLD---IVFLQEKALETFEPSIR-MAIQ--------AS-RKGKQ--------YKDKDNVKLYTHAKR---T 216 (517) T ss_pred EEeeCCCcCeEE---EEeeeeccHHHHHHHhh-hhcc--------hh-hhhhc--------cCCcCceEEEEEEEE---e Confidence 998777655555 67899999999876521 1110 00 00000 012245777776443 2 Q ss_pred CCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEE Q lcl|NC_021540. 324 GSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRG 403 (705) Q Consensus 324 ~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~ 403 (705) .+|...++ .- +++..+ ..++-|+...|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.....+.+|.++ T Consensus 217 ~~~~~~~~-~~-~d~~~~-~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~l 293 (517) T protein:vir:10 217 KDGKYLIR-QS-ADDVPV-GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYL 293 (517) T ss_pred CCCceEEE-EE-eCceee-ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 34433222 22 244433 2345555578999999999999999999999999999999999999999999999999999 Q ss_pred eeccccCchhhhhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHH Q lcl|NC_021540. 404 MSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTA 481 (705) Q Consensus 404 ~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~ 481 (705) ++.+.+...+.+ .+|+.-.+.+|.. ..+.+.+ ...-.+.....++.+.+.|....=+.. ++. .++..-||+ T Consensus 294 v~~~~~~~~~~l--~~~~~g~~~~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~-~~~~rvTAt 366 (517) T protein:vir:10 294 VKPGSYTDINQF--VEGGSGAVLHGVE--GDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA--MTR-RDAERVTAY 366 (517) T ss_pred cCcccccchhhc--cCCCccccccCCc--ccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hhc-cCCccccHH Confidence 988766443332 2333322233321 1222222 222234556677888888887653221 121 122235899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHH Q lcl|NC_021540. 482 GVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIK 560 (705) Q Consensus 482 ~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~ 560 (705) +|....+.....+..++-++.. .+..+..+.+..+...+..+. +.++ +.... .+-.+.+. T Consensus 367 EV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~-----------v~~~-------~~s~l-a~l~r~~~ 427 (517) T protein:vir:10 367 EIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSKN-----------VSPT-------ILTGI-EALGRMAE 427 (517) T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCCC-----------ccce-------eeccH-HHHHHHHH Confidence 9998888888888888887775 667777777766654332221 1111 11111 12233334 Q ss_pred HHHHHHHHHHHhhh--chh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 561 AQELSFMLQTMGQS--LPF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQ 633 (705) Q Consensus 561 ~q~~~~llq~~~~~--~~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~q 633 (705) .+.+..+++.++.. .|+ .+...++..+++..|.+. ..++... +.++.+++.+..+ +++.. T Consensus 428 ~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~--~~irs~~---------ev~~~~~~~~~~~--~~~~~ 494 (517) T protein:vir:10 428 LDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANF--PFFKTQD---------ELNAEAQAQQEQE--ATKYA 494 (517) T ss_pred HHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCCh--hhcCCHH---------HHHHHHHHHHHHH--HHHHH Confidence 44444444433321 122 234456666777777663 2333211 0001000000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 634 AEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQ 667 (705) Q Consensus 634 a~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q 667 (705) + ....++....++ ..+...+ -.| T Consensus 495 ~---~~ag~~~~~~~~-------~~~~~~~-~~~ 517 (517) T protein:vir:10 495 A---EQAGKAIPDMVK-------NGQINPQ-GGQ 517 (517) T ss_pred H---HHHHHHHHHHHh-------CCCCCCC-CCC Confidence 0 000000000000 0000000 000 No 42 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.92 E-value=6.8e-23 Score=142.11 Aligned_cols=483 Identities=12% Similarity=0.044 Sum_probs=252.1 Q ss_pred HHHHHHHh--hHHhhHHHHHHHHHHHHhccCCCCCCCC-C--CCC-CcCCCHHHHHHHHHHHHHHHHhhcC-CCCEEEEe Q lcl|NC_021540. 29 LLNDFNNA--KSTKDTQVAIIDDWLAQLNVTGAYKPKQ-Q--VGR-SSVQPKLIRKQAEWRYSALSEPFLN-DENIFSIA 101 (705) Q Consensus 29 l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~gr-s~~v~~~v~~~~e~~~~~l~~~f~~-~~~~~~~~ 101 (705) +++.+..- +...++..+++++-.+|..-.....|.. . .++ -+.++..-...++.+.+.||..+|+ +.+||.+. T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 23332221 2233344343333334433221111111 1 111 2346777888899999999999998 57999998 Q ss_pred CCC-------cchHHHHHH------HHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccc Q lcl|NC_021540. 102 PKT-------WQDREAARQ------NEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYV 168 (705) Q Consensus 102 p~~-------~~D~~~A~~------~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~ 168 (705) |-. .+|.+.++. .+..+.-. ...++-+..++..+++.+..|++++.+. . T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~~---~-------------- 142 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRR-LFVNASLSKLHRILKLLVVTGNALFYRE---P-------------- 142 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhHCeEEEEEe---c-------------- Confidence 732 223332222 22222222 3446666778888888888888876420 0 Q ss_pred cCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCC Q lcl|NC_021540. 169 EATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPT 248 (705) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~ 248 (705) +.-.+..++..+|++..+ T Consensus 143 --------------------------------------------------------------~~~~~~~~pl~~y~v~~d 160 (514) T protein:vir:80 143 --------------------------------------------------------------GTGKMLVWTMQSYTVRRT 160 (514) T ss_pred --------------------------------------------------------------CCCcEEEEEcCeEEEeeC Confidence 000244567788888777 Q ss_pred ccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCee Q lcl|NC_021540. 249 CNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVT 328 (705) Q Consensus 249 a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~ 328 (705) +.+.+++ ++++.++|+++|-.... . ... ... .......+|.||.+.++.+..+.+.. T Consensus 161 ~~G~v~~---i~rr~~~~~~~l~~~~~-~-------~~~----~~~--------~~~~~~~~v~v~~~v~~~~~~~~~~~ 217 (514) T protein:vir:80 161 SHGDPAV---VVLRQQMPFRELTPEIQ-A-------DAQ----AKQ--------IAKRDSDKCDLYTVIEWQPTPNGKRC 217 (514) T ss_pred CCcCeEE---EEeeeeecHHHhhhhhh-h-------hhh----hhh--------ccCCCCCceEEEEEEEeecCCCCeEE Confidence 6655554 68899999988744310 0 000 000 01112345788887766543322222 Q ss_pred EEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc Q lcl|NC_021540. 329 TPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL 408 (705) Q Consensus 329 ~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a 408 (705) .+ +.-..|.+++ .++-|+..+|||++..|...++..||.|++..+.+-.+.+|++.+..+.....+.++.++++.+. T Consensus 218 sv-~~e~~g~~i~--~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g 294 (514) T protein:vir:80 218 AV-WHELEGKRVG--PESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAK 294 (514) T ss_pred EE-EEeccceeec--ccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCccc Confidence 22 2222344554 44556667899999999999999999999999999999999999999999999999999998876 Q ss_pred cCchhhhhhcCCcceeecCCcccccccccccC--ccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHH Q lcl|NC_021540. 409 LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKY--PELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGV 486 (705) Q Consensus 409 v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~--~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l 486 (705) +...+.+..-+.+.| .+|.. ..+.+.+. ..-.+.....++.+.+.|....=+. +...++..-||++|... T Consensus 295 ~~~~~~l~~~~~g~~--v~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~----~~~rd~~rvTAtEV~~r 366 (514) T protein:vir:80 295 GGAVDDYRDAETGDF--VPGQV--GSVASYERGDYNKIAQASASVESIVMRLNRAFMYT----GQVRDAERVTVEEIRTV 366 (514) T ss_pred ccchhhhcccCCcee--ecCCC--ccceeeecCcccchHHHHHHHHHHHHHHHHHHhhh----ccCCCCCCCCHHHHHHH Confidence 655444443332223 23321 22333222 2223444566777777777643111 11112223489999988 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHH---HHHHH Q lcl|NC_021540. 487 IGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETD---AIKAQ 562 (705) Q Consensus 487 ~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~---~~~~q 562 (705) .+.....+..++.++.. .+..+..+.+.++..... .. +-.+ |+.+. +.++.... .+-.+ ...+. T Consensus 367 ~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~-g~--------lP~~-p~~l~-~~~~vs~l-a~l~r~~~~~~l~ 434 (514) T protein:vir:80 367 AEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNG-GM--------LLGI-AQGVY-RPSIITGI-PALTRNIETANIL 434 (514) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhcc-CC--------CCCC-Cchhh-cceeeecH-HHHHHHHHHHHHH Confidence 88888888888887775 677888887777653210 00 0011 11111 12222111 12222 23344 Q ss_pred HHHHHHHHHhhhchh----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 563 ELSFMLQTMGQSLPF----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQ-LEIQIKQLEAQELQMRIAKLQAEIQ 637 (705) Q Consensus 563 ~~~~llq~~~~~~~~----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q-~~~q~~q~~~q~~q~e~~k~qa~~q 637 (705) ++.+.++.+++..|. .+...++..+++..|.+... +..- +... ...++.+++ +++++ + T Consensus 435 ~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~--i~~~-----~e~~~~~~~~~~~~-------~~~~~---~ 497 (514) T protein:vir:80 435 RATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLST--LSKD-----PDVVAAEAEQEAAL-------AQQQL---D 497 (514) T ss_pred HHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhh--ccCC-----HHHHHHHHHHHHHH-------HHHHH---H Confidence 444445555444332 33455666777777766321 1111 1100 000000000 00000 0 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 638 LMPYEAQAEAAKARKANT 655 (705) Q Consensus 638 ~~~~~~q~e~a~a~~~~~ 655 (705) ..+..+..++ .+..... T Consensus 498 ~~~~~~~~~~-~~~~~~~ 514 (514) T protein:vir:80 498 VASGALAAET-SAGVLTS 514 (514) T ss_pred HHHHHHHHhh-hccccCC Confidence 0000000000 0000000 No 43 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=99.92 E-value=4.3e-23 Score=143.18 Aligned_cols=491 Identities=13% Similarity=0.036 Sum_probs=260.4 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALS 88 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~ 88 (705) |...+....- -.-+.|++-++..++..++..+++++-.+|..-.....+....+..++.++.-...+..+.+.|| T Consensus 1 ~~~~~~~~~~-----~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~ 75 (516) T protein:vir:96 1 MKQSIDLEYG-----GKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLA 75 (516) T ss_pred Ccchhhhhhh-----hhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcccchHHHHHHHHHHHHH Confidence 2222221111 13366778888888888877665554445544332222222333446888899999999999999 Q ss_pred HhhcC-CCCEEEEeCCCcch-------HHHHH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 89 EPFLN-DENIFSIAPKTWQD-------REAAR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 89 ~~f~~-~~~~~~~~p~~~~D-------~~~A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) ..+|+ +.+||.+.+....+ .+.+. .++..+.-. ...++-+..++..+.+.+..|+|++.+ + + T Consensus 76 ~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~--d-~ 151 (516) T protein:vir:96 76 QVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKE-LEQRQFRPAVVEAFKHLIVAGSCMLYK--P-S 151 (516) T ss_pred hhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhHCeEeEEe--c-C Confidence 99998 57999998743211 11111 123333323 234566677888888888888887632 0 0 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ... T Consensus 152 -----------------------------------------------------------------------------~~~ 154 (516) T protein:vir:96 152 -----------------------------------------------------------------------------KGA 154 (516) T ss_pred -----------------------------------------------------------------------------CCC Confidence 001 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVY 314 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 314 (705) ++.++..+|++..++.+.+.+ ++++.++++.+|.... ....+.. . ... . ......|.|| T Consensus 155 ~~~~pl~~y~v~~d~~G~v~~---i~rr~~~~~~~l~~~~-~~~~~~~-----~---~~~-~--------~~~~~~v~v~ 213 (516) T protein:vir:96 155 ISAIPMHHYVVNRDTNGDLLD---IILLQEKALRTFDPAT-RAVVEVG-----L---KGK-K--------CKEDDSVKLY 213 (516) T ss_pred EEEEEcCeEEEeeCCCCCeee---ehhhhHhhHHHHHHhh-hhhhhhh-----h---hhh-h--------cCCCCceEEE Confidence 345677888887776555544 6778888988876542 1111000 0 000 0 0111335555 Q ss_pred EEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 315 EYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAM 394 (705) Q Consensus 315 E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~ 394 (705) .+=. .+.++...+ +.-.-|.+++..+.+|| ..|||++..|.+.++..||.|++....+-.+.+|.+.+..+.+. T Consensus 214 ~~v~---~~~~~~~~~-~~~~d~~~~~~es~~~~--~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~ 287 (516) T protein:vir:96 214 THAK---YLGDGFWEL-KQSADDIPVGKVSKIKS--EKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGA 287 (516) T ss_pred Eeee---eeCCceeEE-EEEeCceeecccccccc--ccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHH Confidence 4433 334553322 22233445655555554 57999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccC--ccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 395 ARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKY--PELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 395 ~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~--~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..+.+|.++++.+.+...+.+..-+.+.| .+|.. ..+.+.+. +.-.+.....++.+.+.|....=+. +... T Consensus 288 ~~a~~~~~lv~p~g~~~~~~l~~~~~g~i--~~g~~--~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~---~l~~ 360 (516) T protein:vir:96 288 ALMADIKYLIRPGAQTDVDHFVNSGTGEV--VTGVE--EDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME---TMTR 360 (516) T ss_pred HHhcCCccccCcccccchhhhccCCCcee--ecCCc--ccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh---hhcc Confidence 99999999998776654443332222222 33321 12233322 1222455566777777777654221 1112 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEe-e Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKL-S 550 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v-~ 550 (705) .++..-||++|....+.....+..++-++.. .+..+..+++..+. +++ | .+.+++.+ + T Consensus 361 r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~-----p~l------------p---~~~v~~~~vs 420 (516) T protein:vir:96 361 RDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG-----ESF------------T---SDLVDPVIIT 420 (516) T ss_pred CCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC-----CCC------------c---cccccceeec Confidence 2333358999998888888888888777765 55666665543321 111 0 01111111 1 Q ss_pred ccchhHHHHHHHHHHHHHHHHhhh---ch----hHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHH Q lcl|NC_021540. 551 ISNAETDAIKAQELSFMLQTMGQS---LP----FDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQ 623 (705) Q Consensus 551 ~~~~~~~~~~~q~~~~llq~~~~~---~~----~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q 623 (705) .-.+-.+.+....+...++.++.. .| ..+...++..+++..|.+. ..++.. +...+..+++++++ T Consensus 421 ~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~--~~irs~------eev~~~~~~~~~~q 492 (516) T protein:vir:96 421 GIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLKSA------EEMAQEQEAQMQAQ 492 (516) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCc--cccCCH------HHHHHHHHHHHHHH Confidence 112233444444444444443322 12 2334456666777777663 233221 11111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 624 ELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEA 657 (705) Q Consensus 624 ~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea 657 (705) +.+ .+++ ...++....++++. .++ T Consensus 493 ~~~-----~~a~---~~~~~~~~~~~~~~--~~~ 516 (516) T protein:vir:96 493 QAQ-----MLEE---GVAKAVPGVIQQEL--KEA 516 (516) T ss_pred HHH-----HHHH---HhhhhhhHHhhccc--ccC Confidence 000 0000 00000000111100 000 No 44 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=99.91 E-value=7e-22 Score=136.57 Aligned_cols=480 Identities=12% Similarity=0.022 Sum_probs=253.1 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccC-CC-CCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVT-GA-YKPKQQVGR---SSVQPKLIRKQAEWRYSALSEPFLN- 93 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~gr---s~~v~~~v~~~~e~~~~~l~~~f~~- 93 (705) |+. .+++-++..+ .++. +..|.+++..+ |. ..+....++ -+.++......+..+.+.||..+|+ T Consensus 1 mk~-----~~~~~~~~lk--r~~~---e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 70 (510) T protein:vir:78 1 MKS-----TAAMLWEKLR--DGSV---EQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) T ss_pred Chh-----HHHHHHHHHh--ccch---HHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCC Confidence 111 1122222111 3333 33566666543 11 111111111 2367888889999999999999998 Q ss_pred CCCEEEEeCCCcchHH------HHHHHH-------HHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhh Q lcl|NC_021540. 94 DENIFSIAPKTWQDRE------AARQNE-------AILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTE 160 (705) Q Consensus 94 ~~~~~~~~p~~~~D~~------~A~~~t-------~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~ 160 (705) +.+||.+.+......+ .+.... ..+.- ....++-+..++..+++.+..|++++-+. + T Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~L~~~G~a~l~~~---~------ 140 (510) T protein:vir:78 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQ-RLFQNASLAVLTQVIKLLIVTGNALLYRN---S------ 140 (510) T ss_pred CCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHHHhhCeEEEEEe---C------ Confidence 5789999875432211 111111 22221 22345566667777777777777755210 0 Q ss_pred cccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEech Q lcl|NC_021540. 161 NVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDY 240 (705) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~ 240 (705) ...+++.++. T Consensus 141 ----------------------------------------------------------------------~~~~~~~~pl 150 (510) T protein:vir:78 141 ----------------------------------------------------------------------DEATVVAWSL 150 (510) T ss_pred ----------------------------------------------------------------------CCCeEEEEEc Confidence 0013556778 Q ss_pred hheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEe Q lcl|NC_021540. 241 HNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYW 320 (705) Q Consensus 241 ~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~ 320 (705) .+|++..++.+.++. +++++.+|..+|... |..+.. ... ......++|.||++.++. T Consensus 151 ~~y~v~~d~~G~vd~---i~rr~~~t~~~l~~~-~~~~~~-----------~~~--------~~~~~~~~v~v~~~V~~~ 207 (510) T protein:vir:78 151 RSYAVRRDATGRWMD---IVLKQRYKSKDLDDV-YKQDLM-----------RAG--------RNLSGSGSVDLYTHVQRR 207 (510) T ss_pred ceeEEeeCCCcCeeE---EEeeeeccHHHHHHH-hhHHhh-----------hhh--------hccCCCceEEEEEEEEee Confidence 889987776555544 788999999998765 211100 000 011223568888887765 Q ss_pred eecCCCeeEEEEEEE-ECC-EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 321 DIDGSGVTTPIVASW-VDD-VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 321 ~~~~dg~~~~~~~~~-~g~-~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) +..+.. .+.+++ +++ +++. ++-|+..+|||++..|.+.++..||.|++..+.+-.+.+|++.+..+.....+. T Consensus 208 ~~~~~~---~~sv~~e~dg~~i~~--~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~ 282 (510) T protein:vir:78 208 KGTAMD---YAEMYHEIDGVRVGE--TGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESL 282 (510) T ss_pred cCCCCc---EEEEEEEecCeeecc--ccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 422211 122222 344 4543 445555789999999999999999999999999999999999999999999999 Q ss_pred CCcEEeeccccCchhhhhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCcccc Q lcl|NC_021540. 399 NGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSL 476 (705) Q Consensus 399 ~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~ 476 (705) ++.++++.+.+...+.+..-+.+.| .+|.. ..+.+.+ ...-.+.....++.+.+.|....=+ . +...++. T Consensus 283 ~~~~lv~p~g~~~~~~l~~~~~g~~--v~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~---~-l~~~~~~ 354 (510) T protein:vir:78 283 EVLNLVDEAKGAVVDDYQDAEMGDY--VPGGA--EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY---G-ANQRDAE 354 (510) T ss_pred cCCcccCCccccchhhhccCCCcee--ecCCc--ccccccccCcccchHHHHHHHHHHHHHHHHHHhh---c-cccCCCC Confidence 9999998876644443332221222 34321 1233332 2222344556677778877765411 1 1122333 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchh Q lcl|NC_021540. 477 GTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAE 555 (705) Q Consensus 477 ~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~ 555 (705) .-||++|....+.....+..+.-++.. .+..+..+.+.++....--+ +-++..+ - ..|...++- T Consensus 355 rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p------------~p~~~~~--~-~~v~~is~L 419 (510) T protein:vir:78 355 RVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQG------------LITKQHK--P-AIETGLPAL 419 (510) T ss_pred CcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCC------------CCccccc--c-eeeecccHH Confidence 458999999888888888888877775 67888888888776543111 1111111 1 113333344 Q ss_pred HHHHHHHHHHHHHHHHhhh------chhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 556 TDAIKAQELSFMLQTMGQS------LPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRI 629 (705) Q Consensus 556 ~~~~~~q~~~~llq~~~~~------~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~ 629 (705) .+.+..+.+..+++.+... .|..+...++..+++..|.. ....++. ++ +.+..+++++++ + T Consensus 420 araq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~-p~~ivrs------~e-ev~a~~~~~~~q-----~ 486 (510) T protein:vir:78 420 SRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVD-TSQFYKS------AD-ELQAEAEEQRRQ-----A 486 (510) T ss_pred HHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCC-hhhhcCC------HH-HHHHHHHHHHHH-----H Confidence 5555555555444433222 23334455566666666641 1112221 01 000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 630 AKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 630 ~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~ 671 (705) +++++. +++..+++- +. .....++ T Consensus 487 ~~~~~~-----~~a~~~~~~-~~------------~~~~~g~ 510 (510) T protein:vir:78 487 AQAQAA-----QETLLEGAS-DM------------TNALAGV 510 (510) T ss_pred HHHHHH-----HHHHHHhhh-hh------------cccCCCC Confidence 000000 000000000 00 0000011 No 45 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=99.91 E-value=8.3e-22 Score=136.16 Aligned_cols=479 Identities=11% Similarity=0.015 Sum_probs=254.3 Q ss_pred HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccC-CC-CCCCCCCCC---CcCCCHHHHHHHHHHHHHHHHhhcC-CCCEE Q lcl|NC_021540. 25 KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVT-GA-YKPKQQVGR---SSVQPKLIRKQAEWRYSALSEPFLN-DENIF 98 (705) Q Consensus 25 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~gr---s~~v~~~v~~~~e~~~~~l~~~f~~-~~~~~ 98 (705) .=+.+++-++..+ .++. +..|.+++..+ |. ..+....++ .+.+++.....+..+.+.||..+|+ +.+|| T Consensus 1 mk~~~~~~~~~lk--R~~~---e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSV---EQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHh--ccch---HHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 1122333333222 3333 33566666543 11 111111122 3478888889999999999999998 57899 Q ss_pred EEeCCCcch-------HHHHH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccc Q lcl|NC_021540. 99 SIAPKTWQD-------REAAR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVF 165 (705) Q Consensus 99 ~~~p~~~~D-------~~~A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~ 165 (705) .+.+....+ ...++ ..+..+.-. ...++-+..++..+++.+..|++++.+. T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~Li~~G~a~l~~~--------------- 139 (510) T protein:vir:63 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQR-LFQNASLAVLTQVIKLLIVTGNALLYRD--------------- 139 (510) T ss_pred ccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhCeEEEEEc--------------- Confidence 998753221 11111 122223222 2446666778888888888888866320 Q ss_pred ccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheee Q lcl|NC_021540. 166 QYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTI 245 (705) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~ 245 (705) ....++..++..+|++ T Consensus 140 ----------------------------------------------------------------~~~~~~~~~pl~~y~v 155 (510) T protein:vir:63 140 ----------------------------------------------------------------SDAATVVAWSLRSYAV 155 (510) T ss_pred ----------------------------------------------------------------CCCcEEEEEEcceeEE Confidence 0011355678888999 Q ss_pred CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCC Q lcl|NC_021540. 246 DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGS 325 (705) Q Consensus 246 Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~d 325 (705) ..++.+.++. ++++.++|..+|-.. +-.+ ... ... .....+.|.||.+-++.+ +. T Consensus 156 ~~d~~G~vd~---i~rr~~~t~~~l~e~-~~~~-------~~~----~~~--------~~~~~~~v~v~~~V~~~~--~~ 210 (510) T protein:vir:63 156 RRDATGRWMD---IVLKQRYKSKDLDEE-YKQD-------LMR----AGR--------NLSGSGSVDLYTHVQRKK--GT 210 (510) T ss_pred eeCCCcCeeE---EEeeeeccHHHHhHH-hhhh-------hhc----ccc--------ccCCCcceEEEEEEEeec--CC Confidence 8777655554 688999999887442 1110 000 000 011224577777765543 22 Q ss_pred CeeEEEEEEE-ECC-EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEE Q lcl|NC_021540. 326 GVTTPIVASW-VDD-VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRG 403 (705) Q Consensus 326 g~~~~~~~~~-~g~-~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~ 403 (705) + ...+.+++ +++ ++... +-|+...|||++..|...++..||.|++..+.+-.+.+|++.+..+.....+.+|.++ T Consensus 211 ~-~~~~sv~~e~dg~~~~~~--~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~l 287 (510) T protein:vir:63 211 A-MEYAELYHEIDGVRVGKE--GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNL 287 (510) T ss_pred C-ceEEEEEEEecCceeccc--cccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 1 22223333 344 44434 4445578999999999999999999999999999999999999999999999999999 Q ss_pred eeccccCchhh-hhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHH Q lcl|NC_021540. 404 MSKNLLDPVNE-RKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTT 480 (705) Q Consensus 404 ~~~~av~~~d~-~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a 480 (705) ++.+.+...+. ....+|.+ .+|.. ..+.+.+ .+.-.+.....++.+.+.|....=+ . +...++..-|| T Consensus 288 v~p~g~~~~~~~~~~~~g~~---v~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~---~-l~~~~~~rvTA 358 (510) T protein:vir:63 288 VDEAKGAVVDDYQDAEMGDY---VPGGA--EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY---G-ANQRDAERVTA 358 (510) T ss_pred cCcccccchhhhccCCCcee---ecCCc--ccceeeecCcccchHHHHHHHHHHHHHHHHHHHh---h-cccCCCCCcCH Confidence 98876644333 33333433 23321 1233332 2222344556777777777775311 1 12223334589 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHH Q lcl|NC_021540. 481 AGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAI 559 (705) Q Consensus 481 ~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~ 559 (705) ++|....+.....+..++-++.. .+..+..+.+.++.... ++ .+-++.... . .|...++-.+.+ T Consensus 359 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g----l~--------p~p~~~~~~-~--~v~~is~Laraq 423 (510) T protein:vir:63 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL----LQ--------GLITKQHKP-A--IETGLPALSRSA 423 (510) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc----CC--------CCCchhccc-c--eecchhHHHHHH Confidence 99999888888888888777765 67888888888776532 11 111222211 1 122233444555 Q ss_pred HHHHHHHHHHHHhh------hchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 560 KAQELSFMLQTMGQ------SLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQ 633 (705) Q Consensus 560 ~~q~~~~llq~~~~------~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~q 633 (705) ..+.+..+++.+.. ..|..+...++..+++..|.+- ...++.. +..++. ++++.++.+ +++.++ T Consensus 424 ~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p-~~ivrs~------eev~a~-~~~~~qq~~--~~~~~~ 493 (510) T protein:vir:63 424 AVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDT-SQFYKSA------DELQAE-AEQQRQQAA--QAQAAQ 493 (510) T ss_pred HHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCCh-hHhcCCH------HHHHHH-HHHHHHHHH--HHHHHH Confidence 55555444443322 2233344555566666655421 1122111 110000 000000000 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 634 AEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 634 a~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~ 671 (705) +.+.....+ .....+ ++ T Consensus 494 ~~~~~~a~~---------~~~~~~------------g~ 510 (510) T protein:vir:63 494 ETLLEGASD---------MTNALA------------GV 510 (510) T ss_pred HHHHHHHHh---------hccccc------------CC Confidence 000000000 000000 00 No 46 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=99.91 E-value=5.4e-23 Score=142.65 Aligned_cols=505 Identities=14% Similarity=0.073 Sum_probs=264.5 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCC--CCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGA--YKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLN-DEN 96 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~-~~~ 96 (705) |++ ..++-++..++..++..+++++-.+|..-... ++........+++++...+.++.+.+.||..+|+ +.+ T Consensus 1 mk~-----~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 75 (542) T protein:vir:78 1 MKG-----LAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTS 75 (542) T ss_pred Chh-----HHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 222 23455667777776665544444444332211 1111111234678888899999999999999998 799 Q ss_pred EEEEeCCCcc-------hHHHHH-------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcc Q lcl|NC_021540. 97 IFSIAPKTWQ-------DREAAR-------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENV 162 (705) Q Consensus 97 ~~~~~p~~~~-------D~~~A~-------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~ 162 (705) ||.+.+-..+ |.++.. ..+.++.-. ...++-+..++..+++.+..|++++-+. + T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~~---~-------- 143 (542) T protein:vir:78 76 FFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQ-IAESSDRVQLTAAMKHLIVTGNVLVFAG---K-------- 143 (542) T ss_pred cccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhhCeEEEEec---C-------- Confidence 9999985322 222111 122333333 3355666778888888888899876210 0 Q ss_pred cccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhh Q lcl|NC_021540. 163 PVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHN 242 (705) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~ 242 (705) ..++.++..+ T Consensus 144 ----------------------------------------------------------------------~~~~~~pl~~ 153 (542) T protein:vir:78 144 ----------------------------------------------------------------------KTLKVYPLDR 153 (542) T ss_pred ----------------------------------------------------------------------CCceEEecce Confidence 0134566778 Q ss_pred eeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee- Q lcl|NC_021540. 243 VTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD- 321 (705) Q Consensus 243 ~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~- 321 (705) +++..++.+.++. +++++.+|..+|.++.-...++.. ..... .+....++.+++.+...+ T Consensus 154 y~v~~d~~G~vd~---v~r~~~~t~~ql~~~fg~~~l~~~----~~~~~------------~~~~~~~~~v~~~v~pr~~ 214 (542) T protein:vir:78 154 YVIERDGDGNVIE---IITRELVDRSLLPAEFQKQSLLEG----KDSNA------------VGEDGPKFGVAQGKGGRND 214 (542) T ss_pred eEEeeCCCCCeEE---EeeeeecCHHHHHHhhccccCchH----HHhhc------------cccCCCeEEEEEEeecccC Confidence 8888776655544 789999999999887322212110 00000 001123344544433211 Q ss_pred e-------cCCCeeEEEEEEEECCEEE--ecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 322 I-------DGSGVTTPIVASWVDDVMI--RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 322 ~-------~~dg~~~~~~~~~~g~~iL--~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) . ...+...+ + .-+++..+ ...+++| ..|||++..|.+.++..||.|++..+.+-.+.+|.+.+..+. T Consensus 215 ~~~~~~~~~~~~~~s~-~-~e~~g~~v~~~~~e~g~--~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~ 290 (542) T protein:vir:78 215 AEVFTCCKLVDGQHRW-H-QECDGKEIKGSRSSSPL--KHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIE 290 (542) T ss_pred CccccccccCCCeEEE-E-EEecccccccccccccc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH Confidence 1 01222111 1 11333332 2355565 579999999999999999999999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCc-hhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDP-VNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGL 471 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~-~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~ 471 (705) ...++.+|.++++.+.+.. .+.....+|.++.-+++. . .+.+...+.-.+.....++.+.+.|....-+. . T Consensus 291 ~~~~a~~pp~lv~~~g~~~~~~~~~~~~g~iv~g~~~~-v--~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~-----~ 362 (542) T protein:vir:78 291 GSAAAAKVVFMVSPSATTKPQSLARAGTGAIIQGRAED-V--SVVQANKGADFRTVQEMIRDLSQRISDAFLIL-----N 362 (542) T ss_pred HHHHHhcCceeeccccccchhhcccCCCceeecCCccc-e--eeeecccccchhHHHHHHHHHHHHHHHHhccc-----c Confidence 9999999999997766543 333445555443222211 1 12222333233456677888888888765332 1 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEee Q lcl|NC_021540. 472 TGDSLGTTTAGVQGVIGASGKRELGILRRLA-NGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLS 550 (705) Q Consensus 472 ~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~-~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~ 550 (705) ..++..-||++|..+.+.....+..++.+|. +.+..++.+.+.++....-=+.+ |+++ +++.+. T Consensus 363 ~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~------------p~~l---v~~~~~ 427 (542) T protein:vir:78 363 VRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSL------------PKGL---VMPTVV 427 (542) T ss_pred cCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC------------chhc---eeeeee Confidence 2233345999999999999999999999886 47788999999988775432321 1111 222222 Q ss_pred ccc-hhHHHHHHHHHHHHHHHHhhhc-hh-----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHH Q lcl|NC_021540. 551 ISN-AETDAIKAQELSFMLQTMGQSL-PF-----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQ 623 (705) Q Consensus 551 ~~~-~~~~~~~~q~~~~llq~~~~~~-~~-----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q 623 (705) .+. +..+.+..+.+...++.++... |+ .+...++..+++..|.+... .++.+ +. .++++++++ T Consensus 428 s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~-i~~s~------e~---~~~~~~q~q 497 (542) T protein:vir:78 428 AGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLN-LVKSP------ET---MANEAQQAQ 497 (542) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhh-ccCCH------HH---HHHHHHHHH Confidence 221 2223333444444444443322 22 23345556666666665211 11111 10 011100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 624 ELQMRIAKLQAEIQLMPYEAQA-EAAKARKANTEADLNTLDFVEQETGVKQEREL 677 (705) Q Consensus 624 ~~q~e~~k~qa~~q~~~~~~q~-e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~ 677 (705) ++ .+|+.+..+...+.. -.......+..+.... +=..=+.+ .++ T Consensus 498 ~~-----~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~-~~~~~~~~----~~~ 542 (542) T protein:vir:78 498 QQ-----QMTASLMGQAGQLAKSPIGEKMMQQINAPGQE-APAGPQTG----EDL 542 (542) T ss_pred HH-----HHHHHHHHhhhhccccccccchhhhcCCCCcC-CCCCCccc----ccC Confidence 00 000000000000000 0000000000000000 00000000 001 No 47 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=99.90 E-value=3.9e-21 Score=132.49 Aligned_cols=491 Identities=13% Similarity=0.061 Sum_probs=252.0 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALS 88 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~ 88 (705) |.++ .|--.=..+.|++.++..++..++..+++++-.+|..-.-...+....+..+++++.....+..+.+.|| T Consensus 1 ~~~~------~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~ 74 (515) T protein:vir:70 1 MQDT------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLA 74 (515) T ss_pred Ccch------hhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcccccccccchHHHHHHHHHHHHH Confidence 1111 1111113567788888888888777665555555554322111111222334788888999999999999 Q ss_pred HhhcC-CCCEEEEeCCCcch-------HHHHH------HHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 89 EPFLN-DENIFSIAPKTWQD-------REAAR------QNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 89 ~~f~~-~~~~~~~~p~~~~D-------~~~A~------~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) ..+|+ +.+||.+.+..... ...+. ..+..+... ...++-+..++..+++.+..||+++.+- + T Consensus 75 ~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~d---~ 150 (515) T protein:vir:70 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKA-LEQRQFRPAIVEVFKHLIVAGNCLLYKP---S 150 (515) T ss_pred HhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHH-HHhcCchHHHHHHHHHHHhHCeEEEEEe---C Confidence 99998 57999998743322 12221 122223222 3345666778888888888888876320 0 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) .+ . T Consensus 151 ---------------------------------------------------------------------------~~--~ 153 (515) T protein:vir:70 151 ---------------------------------------------------------------------------KG--A 153 (515) T ss_pred ---------------------------------------------------------------------------CC--C Confidence 00 1 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVY 314 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 314 (705) ++.++..+|++..++.+.++. ++++..+|..+|...+- .... . . ...... ...+.|.+| T Consensus 154 ~~~~pl~~y~v~~d~~G~v~~---i~rr~~~t~~~l~~~f~-~~~~---~-----~-~~~~~~--------~~~~~v~i~ 212 (515) T protein:vir:70 154 MSAVPMHHYVVNRDTNGDLMD---VILLQEKALRTFDPATR-MAIE---V-----G-MKGKKC--------KEDDNVKLY 212 (515) T ss_pred eEEEEcCeEEEeeCCCcCeeE---EEeeeeccHHHHHHhhh-hhhh---h-----h-hhhhhc--------CCCCceEEE Confidence 345677888887776655544 78899999999987622 1100 0 0 000000 011335555 Q ss_pred EEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 315 EYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAM 394 (705) Q Consensus 315 E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~ 394 (705) .+ +...++|...++..+ -|.++++ ++-|+...|||++..|...++..||.|++..+.+-.+.+|.+.+..+... T Consensus 213 ~~---v~~~~~~~~~~~~e~-d~~~~~~--es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 286 (515) T protein:vir:70 213 TH---AQYAGEGFWKINQSA-DDIPVGK--ESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGA 286 (515) T ss_pred EE---EEecCCCceEEEEec-Cceeecc--ccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHH Confidence 43 333445554332222 2333443 45555578999999999999999999999999999999999999999999 Q ss_pred HhcCCCcEEeeccccCchhhhhhcCCcceeecCCccccccccccc--CccchHHHHHHHHHHHHHHHHHhCcchHhcCCC Q lcl|NC_021540. 395 ARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHK--YPELPASSYNMLQMFTLEADALSGVKSFSQGLT 472 (705) Q Consensus 395 ~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~--~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~ 472 (705) ..+.+|.++++.+.+...+.+..-+.+.| .+|.. ..+.+.+ .+.-.+.....++.+.+.|....=+...... T Consensus 287 ~~a~~p~~lv~~~g~~~~~~l~~~~~g~i--v~g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r-- 360 (515) T protein:vir:70 287 ALMADIKYLIRPGSQTDVDHFVNSGTGEV--ITGVA--EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRR-- 360 (515) T ss_pred HHhcCCCeeeCcccccchhhccccCCcee--ecCCc--ccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhcc-- Confidence 99999999998877655444433222222 33321 1222322 2222345556677777777765533322222 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 473 GDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 473 ~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) ++..-||++|....+.....+..++-++.. .+..+..+.+. ..+.. -|.... +..+ |+. T Consensus 361 -d~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~---~~~p~--------------~P~~~v-~~~~-vs~ 420 (515) T protein:vir:70 361 -DAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ---EAGDS--------------FTSELV-DPVI-VTG 420 (515) T ss_pred -CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH---hhCCC--------------CChhhc-ccce-ehh Confidence 223358999998887777778777777765 44454333211 11111 111111 1111 121 Q ss_pred cchhHHHHHHHHHHHHHHHHhhh--chhH-----HHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SNAETDAIKAQELSFMLQTMGQS--LPFD-----MTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQE 624 (705) Q Consensus 552 ~~~~~~~~~~q~~~~llq~~~~~--~~~~-----~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~ 624 (705) -++-.+.+..+.+..+++.++.. .++. +...++..+++..+.+. ..++ ...+ .++.. +++ T Consensus 421 l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~--~~~r-------s~ee--v~~~r--~q~ 487 (515) T protein:vir:70 421 IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAEL--PFLK-------SEEE--MQQEM--AQQ 487 (515) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCc--cccC-------CHHH--HHHHH--HHH Confidence 12333444444444444443311 1111 22223333333333221 1111 1111 11100 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 625 LQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQE 674 (705) Q Consensus 625 ~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~ 674 (705) .|++++ +.+..... ++........ +|+. T Consensus 488 ~~~~~~---~~~~~~~~-------~a~~~~~~~~------------~~~~ 515 (515) T protein:vir:70 488 AQAQQE---AMLNEGVA-------KAVPGVIQQE------------MKEG 515 (515) T ss_pred HHHHHH---HHHHHhhh-------hhcccchhhh------------hccC Confidence 000000 00000000 0000000000 0000 No 48 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.89 E-value=4.6e-21 Score=132.08 Aligned_cols=491 Identities=13% Similarity=0.057 Sum_probs=255.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQA 80 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~ 80 (705) |.+=++. .+ .-..+.|++-++..++..++..+++++-.+|..-.....+....+..++.++.-...+ T Consensus 1 ~~~~~~~--------~~-----~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~ 67 (516) T protein:vir:10 1 MKQSTDL--------EY-----GGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQAT 67 (516) T ss_pred CCchhhH--------hh-----hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccccccccchHHHHH Confidence 2221111 11 1245678888888888888776655544444443322222222334468888999999 Q ss_pred HHHHHHHHHhhcC-CCCEEEEeCCCcchH-------HHH------HHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeE Q lcl|NC_021540. 81 EWRYSALSEPFLN-DENIFSIAPKTWQDR-------EAA------RQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVI 146 (705) Q Consensus 81 e~~~~~l~~~f~~-~~~~~~~~p~~~~D~-------~~A------~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi 146 (705) ..+.+.||..+|+ +.+||.+.+....+. +.+ ..++..+.. ....++-+..++..+.+.+..|+|+ T Consensus 68 ~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~L~~~G~a~ 146 (516) T protein:vir:10 68 NHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMK-ELEQRQFRPAVVEAFKHLIVAGSCM 146 (516) T ss_pred HHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHHHhHCeEe Confidence 9999999999998 579999987432211 111 112222222 2344566667888888888888886 Q ss_pred EEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccccccee Q lcl|NC_021540. 147 FRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVI 226 (705) Q Consensus 147 ~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 226 (705) +-+ + + T Consensus 147 l~~--d-~------------------------------------------------------------------------ 151 (516) T protein:vir:10 147 LYK--P-S------------------------------------------------------------------------ 151 (516) T ss_pred EEe--c-C------------------------------------------------------------------------ Confidence 521 0 0 Q ss_pred eeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccc Q lcl|NC_021540. 227 KTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDK 306 (705) Q Consensus 227 ~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (705) ...++.++..+|++..++.+.+.+ ++++..++..+|.+. |.+ . ......... .. T Consensus 152 -----~~~~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~-~~~-~------~~~~~~~~~----------~~ 205 (516) T protein:vir:10 152 -----KGAISAIPMHHYVVNRDTNGDLLD---IILLQEKSLRTFDPA-TRA-V------VEVGLKGKK----------CK 205 (516) T ss_pred -----CCCeEEEEcCeEEEeeCCCCCeEE---EeeeecccHHHHHHH-hhh-h------hhhhhhhhc----------cC Confidence 001345677788887776555544 678889999888665 211 0 000000000 01 Q ss_pred ccCeEEEEEEEEEeeecCCCeeEEEEEEEECCE-EEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 307 ARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDV-MIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 307 ~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~-iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) ....+.+|.+ +..+.++... +..-+++. +...+..|| ..|||++..|...++..||.|++....+-.+.+|. T Consensus 206 ~~~~~~i~t~---v~~~~~~~~~--~~~~~d~~~~~~~s~~~~--~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~ 278 (516) T protein:vir:10 206 EDDSIKLYTH---AKYLGEGFWE--LKQSADDIPVGKVSKIKS--EKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQF 278 (516) T ss_pred CCCceEEEEE---EEecCCCceE--EEEeeCceeecccccccc--ccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHH Confidence 1233555443 2233344322 12223444 444444444 58999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccccccccccC--ccchHHHHHHHHHHHHHHHHHhC Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVTDIIEHKY--PELPASSYNMLQMFTLEADALSG 463 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~--~~i~~~~~~~l~~~~~~~~~~tG 463 (705) +.+..+.....+++|.++++.+.+...+.+. ||+.-.+.+|.. ..+.+.+. +.-.+.....++.+.+.|....= T Consensus 279 l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~--~~~~g~~~~g~~--~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~ 354 (516) T protein:vir:10 279 LSEAVARGAALMADIKYLIRPGAQTDVDHFV--NSGTGEVVTGVE--EDIHIVQLGKYADLTPISAVLEVYTRRIGVVFM 354 (516) T ss_pred HHHHHHHHHHHhcCCCcccCcccccchhhhc--cCCCceeecCCc--ccceeeecCcccchHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999999877664433332 333222233321 12233322 11224455667777777776543 Q ss_pred cchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcc Q lcl|NC_021540. 464 VKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLV 542 (705) Q Consensus 464 v~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~ 542 (705) +.... .-++..-||++|....+.....+..++-++.. .+..+..+.+..+ +. .+ |..+. T Consensus 355 ~~~l~---~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~---~p--~~------------P~~lv 414 (516) T protein:vir:10 355 METMT---RRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA---GD--SF------------TSDLV 414 (516) T ss_pred hhhhh---ccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh---CC--CC------------Chhhc Confidence 32221 11223358999998888777788877777764 5556555443211 11 10 11111 Q ss_pred cceeEEeeccchhHHHHHHHHHHHHHHHHhhh---chh----HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 543 GSFDIKLSISNAETDAIKAQELSFMLQTMGQS---LPF----DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 543 ~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~---~~~----~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) +.++ |..-++-.+.+.++.+..+++.++.. .|. .+....+..+++..|.+. ..++... + .++... T Consensus 415 -~~~~-v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~--~~irs~e-e---v~~~r~ 486 (516) T protein:vir:10 415 -DPVI-ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL--PFLKSAE-E---MEQEQE 486 (516) T ss_pred -Ccce-ehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh--hccCCHH-H---HHHHHH Confidence 1112 22222334455555555555544322 221 122334555666666542 2222110 0 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTL 662 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~ 662 (705) |++ +.++.. ++..+ ..++.......+.++. T Consensus 487 ~~~-------~~q~~~-~~~~~---------~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 487 AQM-------QAQQAQ-MLEEG---------VAKAVPGVIQQELKEA 516 (516) T ss_pred HHH-------HHHHHH-HHHHH---------hhhcccchhhhhhhcC Confidence 000 000000 00000 0011111111111110 No 49 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.72 E-value=2.5e-15 Score=100.60 Aligned_cols=442 Identities=13% Similarity=0.063 Sum_probs=214.2 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCC--CcCCCHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGR--SSVQPKLIRKQAEWRY 84 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gr--s~~v~~~v~~~~e~~~ 84 (705) |--++|-.-.-=+++++...+ +....+.|...+.+.++..+||.|.-.. .+...++| .+++.+..+..|+... T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~---l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 77 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEV---VTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFT 77 (453) T ss_pred CeecCCcceEcCCCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHh Confidence 444444332211344443333 3333445666677777888999986311 22233343 4678888888888877 Q ss_pred HHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccc Q lcl|NC_021540. 85 SALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPV 164 (705) Q Consensus 85 ~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~ 164 (705) ..| ||.+..+. + +|.+ ..+.++.+|.. |+--..+...+++++..|.|++.+|++ T Consensus 78 ~~l----~g~~~~~~--~---~d~~----~~~~l~~i~~~-N~~~~~~~~~~~~~~~~G~~~~~v~~d------------ 131 (453) T protein:vir:39 78 GYF----NGIPVKKS--H---SDKE----TLSKLQEFDNL-NDMEDEESELAKMACIYGRAFELLYQN------------ 131 (453) T ss_pred hhh----cccCceec--c---CChH----HHHHHHHHHHh-cChhHHHHHHHHHHhhcCeEEEEEEec------------ Confidence 766 55443332 2 2322 23456555444 444456778999999999999887652 Q ss_pred cccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee Q lcl|NC_021540. 165 FQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT 244 (705) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~ 244 (705) ..+.|+|..++|.+++ T Consensus 132 ----------------------------------------------------------------~~g~~~i~~~~p~~~~ 147 (453) T protein:vir:39 132 ----------------------------------------------------------------EETQTNVIYNTPENMF 147 (453) T ss_pred ----------------------------------------------------------------CCCceEEEEEcccceE Confidence 0234677888888865 Q ss_pred e--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeee Q lcl|NC_021540. 245 I--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDI 322 (705) Q Consensus 245 ~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~ 322 (705) . |+... ....+.+ +.+.. .....++|+|.. T Consensus 148 ~v~d~~~~---~~~~~~i-r~~~~-----------------------------------------~~~~~~~~~yt~--- 179 (453) T protein:vir:39 148 MVYDDTIK---QEPLFAV-RYGYD-----------------------------------------DDYKLYGEVYTK--- 179 (453) T ss_pred EEecCCCC---CeEEEEE-EEEEe-----------------------------------------CCeEEEEEEEeC--- Confidence 4 33221 1122222 22100 001233455532 Q ss_pred cCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_021540. 323 DGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQR 402 (705) Q Consensus 323 ~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~ 402 (705) +. .++....++..-..+..|.+.|.+|+++++. ..+|.|.+..++++++.+|..++.+.+.+...+.|.+ T Consensus 180 --~~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~ 249 (453) T protein:vir:39 180 --ET---TYALNGTMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYL 249 (453) T ss_pred --Ce---EEEEEecCCceeeecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCcee Confidence 11 1111222222211223333346778777653 3468999999999999999999999999999899888 Q ss_pred EeeccccCchhhhhhcCCcceeecCCcc--cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHH Q lcl|NC_021540. 403 GMSKNLLDPVNERKFKMGEDYKYNPGTN--PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTT 480 (705) Q Consensus 403 ~~~~~av~~~d~~~~~pg~~i~~~~~~~--~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a 480 (705) ++....++..+....+.++++...++.. ..+.+.+...+.-.......++.+...+...|++++.+.+..++. |+. T Consensus 250 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~-Sg~- 327 (453) T protein:vir:39 250 TFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSS-SGV- 327 (453) T ss_pred eeecCCCCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCC-hHH- Confidence 7764445544444555555555443211 222344444333345566678889999999999988777655432 344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHH Q lcl|NC_021540. 481 AGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIK 560 (705) Q Consensus 481 ~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~ 560 (705) ++...............+.|..+++++++.++.+....... .+.. ...+..+...+...... T Consensus 328 -Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~-------------~~~~----~i~v~f~~~~p~~~~~~ 389 (453) T protein:vir:39 328 -SLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNK-------------EAWK----DIEYTFTRNEPKDIKEQ 389 (453) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc-------------cccc----cceEEeCCCCCcCHHHH Confidence 34444344445556666666667777766666554322111 0001 11122222222111122 Q ss_pred HHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHH-HHHHHHHHHHHHH-- Q lcl|NC_021540. 561 AQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQE-LQMRIAKLQAEIQ-- 637 (705) Q Consensus 561 ~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~-~q~e~~k~qa~~q-- 637 (705) .+.+..+. ..++.. -.+. .++...+.. .+.++.+++... .+........... T Consensus 390 a~~~~kl~----g~is~e---t~l~---~l~~v~D~~---------------~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 444 (453) T protein:vir:39 390 AETANILM----GITSQE---TALS---VISVIPDVQ---------------AEMEKIKKEEASTAIFDKDKQPSEKGTD 444 (453) T ss_pred HHHHHHHh----ccCChH---HHHH---hCCCCCCHH---------------HHHHHHHHHHHHHHHHHHhccCCCCCCC Confidence 22222211 111110 0111 111111111 111111111000 0000000000000 Q ss_pred HHHHHHHHH Q lcl|NC_021540. 638 LMPYEAQAE 646 (705) Q Consensus 638 ~~~~~~q~e 646 (705) .....-..| T Consensus 445 ~~~~~~~~e 453 (453) T protein:vir:39 445 TVVPETNEE 453 (453) T ss_pred CCCCCcCCC Confidence 000000000 No 50 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=99.71 E-value=1.3e-18 Score=118.58 Aligned_cols=577 Identities=17% Similarity=0.147 Sum_probs=286.3 Q ss_pred Ccchhhhhhccccccc----CCCCCCHHHHHHHHHHHHHhhHHhhHHHHH---HHHHHHHhccCC-----CCCC------ Q lcl|NC_021540. 1 MSDINEEFLEDTVPSL----QEDWKNKPKVSDLLNDFNNAKSTKDTQVAI---IDDWLAQLNVTG-----AYKP------ 62 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~-----~~~~------ 62 (705) |+ +.-..|.- .|.--++-+-..|++-++.++--...+|++ ++++..-|-..- .+.+ T Consensus 1 ma------ispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~ 74 (666) T protein:vir:10 1 MA------ISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAK 74 (666) T ss_pred CC------cCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeeccccccc Confidence 22 11112210 111123334455666666666555555554 345555443210 0111 Q ss_pred -CCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHHhhcCCc-chHHHHHHHH Q lcl|NC_021540. 63 -KQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFNNQLDKV-KLIDTMVRTA 139 (705) Q Consensus 63 -~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~~~~~~~-~~~~~~~~~a 139 (705) +|.-=.-.+|+|.|-.+|+.+.+.|.++|+||-++|-+.- +|.--+-|++..-++. |... .++ +-|-=.++|+ T Consensus 75 V~C~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~---~~~~~~LiL~L~D~ 150 (666) T protein:vir:10 75 VRCQVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTM---TSSIPELILCLQDA 150 (666) T ss_pred CcceeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhh---hhhHHHHHHHHhhh Confidence 1111134589999999999999999999999999988876 6666677777666553 2211 110 1122244555 Q ss_pred HhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCc Q lcl|NC_021540. 140 VNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIING 219 (705) Q Consensus 140 l~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 219 (705) +..-..-+-+.|...+.-. .. .++ +...+ T Consensus 151 ~KYN~~~~ET~Ws~IE~~~-----------~~-------------------------~~i-----~~~~~---------- 179 (666) T protein:vir:10 151 AKYNLVGWETEWSHIETYD-----------PQ-------------------------KEI-----TDLEP---------- 179 (666) T ss_pred hhcceeeeeeccccccccc-----------hh-------------------------hhh-----hcCCC---------- Confidence 5544443334443221100 00 000 00011 Q ss_pred ccccceeeeccCcc-eEEEechhheeeCCCc-cCCh-hhCCeEEEEEeccHHHHHHhcCC-cC---cc---hhhhhhhhh Q lcl|NC_021540. 220 YEEQEVIKTVKNQP-EVTICDYHNVTIDPTC-NGNL-DEAKFVIYSFESSRSDLEKYGIY-SN---LE---YIKEDSSTS 289 (705) Q Consensus 220 ~~~~~~~~~~~~~~-~i~~V~~~~~~~Dp~a-~~d~-~da~~~~~~~~~t~~el~~~g~~-~d---~~---~~~~~~~~~ 289 (705) .+...+.++... +|++++|.|++|||+. -+|. ....|++....+++-.|++.-.+ .| +. -+....... T Consensus 180 --~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s 257 (666) T protein:vir:10 180 --GKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSS 257 (666) T ss_pred --ceeecccchhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhh Confidence 111112223333 7899999999999974 2343 35678888888888777653110 00 00 001110000 Q ss_pred -----------hcc--------cccccccccccc---ccccCeEEEEE--EEE------Ee-------eecCCCeeEEEE Q lcl|NC_021540. 290 -----------TSS--------DHYSSDTSFTFS---DKARKKIVVYE--YWG------YW-------DIDGSGVTTPIV 332 (705) Q Consensus 290 -----------~~~--------~~~~~~~~~~~~---~~~~~~v~v~E--~w~------k~-------~~~~dg~~~~~~ 332 (705) ++. .+-+||.-..++ ....+||-|.| .|+ |+ .++.......++ T Consensus 258 ~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~~Y~RI~PSDF~~~~P~~N~~QIWK 337 (666) T protein:vir:10 258 FQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWK 337 (666) T ss_pred ccccccccCCccCccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeee Confidence 000 011122211111 12345555444 222 22 122333345566 Q ss_pred EEEE-CCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc Q lcl|NC_021540. 333 ASWV-DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDP 411 (705) Q Consensus 333 ~~~~-g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~ 411 (705) ++++ |+.+|..++..-..+.||.-......+.-..-..|+.+...|.|+...++++..+-...+....+.++++..+.. T Consensus 338 ~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDG~G~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a 417 (666) T protein:vir:10 338 AVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA 417 (666) T ss_pred eeeeccceeEeeehhhhccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhccChhhhhh Confidence 6655 467888886655667777655444444444456889999999999999999988777777777777777766644 Q ss_pred hhhhhhcCCcceeecCCccccccc--ccccCccchHHHHHHHH---HHHHHHHHHhCcchHhcCCCccccchHHHHHHHH Q lcl|NC_021540. 412 VNERKFKMGEDYKYNPGTNPVTDI--IEHKYPELPASSYNMLQ---MFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGV 486 (705) Q Consensus 412 ~d~~~~~pg~~i~~~~~~~~~~~i--~~~~~~~i~~~~~~~l~---~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l 486 (705) .+.-...|.--|.+++.+..+..+ -+.+.|.-..+....++ .+.+.-++++|++...+|.-- .-+++-.+..-. T Consensus 418 ~~iNSP~~~~KIP~~~~sL~N~~~~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQ-KGNKt~~E~~~~ 496 (666) T protein:vir:10 418 NDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQ-KGNKTRAEFDTI 496 (666) T ss_pred hcccCCCCCcccceeehhhcccchhhhhccCCccccchhHHHhhhHHHHhhHHHhhccCCccccccc-ccCcceeehhhh Confidence 333222333334444444333322 23344444444444443 456677789999999999531 123444455566 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccc-eeEEeeccc-hhHHH---HH Q lcl|NC_021540. 487 IGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGS-FDIKLSISN-AETDA---IK 560 (705) Q Consensus 487 ~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~-~dv~v~~~~-~~~~~---~~ 560 (705) |..+..|+++-+-.+++ .+..+-+.+.--+.+|.++..++.-+.++.+.|+-+.++.. ....+.+|- +.... .. T Consensus 497 MG~a~NR~RLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DG~TP~SK~ASs~~ 576 (666) T protein:vir:10 497 MGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASKLASSDF 576 (666) T ss_pred cCCcccceehhhHHhhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhhhhhhHH Confidence 67777777776666655 23444444433456788888888887777888887766642 223333331 21222 22 Q ss_pred HHHHH-------HHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccccc--chhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 561 AQELS-------FMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPE--PSPQAQLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 561 ~q~~~-------~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q--~~~~~q~~~q~~q~~~q~~q~e~~k 631 (705) +..+. .+++++|+..|. +...++.+.|..-+.++....-|+ +...-+++.|+.-.+ .++ T Consensus 577 lT~~LQMI~sS~~~~~A~G~~~P~-----M~AH~~QLGGVRG~E~Y~daalP~~~~~~~~~Q~LQ~~~LQ--~~~----- 644 (666) T protein:vir:10 577 LTALLQMIMSSETTLQAFGTQVPG-----MIAHLAQLGGVRGFEKYADAALPQWQITYGMQQQLQQMLLQ--LQQ----- 644 (666) T ss_pred HHHHHHHHhhhhhhHhhhcccchH-----HHHHHHHhccccchhhhhhccCCccccccchhHHHHHHHHH--Hhh----- Confidence 22222 233455655543 455666777666665554332222 222212222111110 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 632 LQAEIQLMPYEAQAEAAKARKANTE 656 (705) Q Consensus 632 ~qa~~q~~~~~~q~e~a~a~~~~~e 656 (705) |..+|.+..+-+.-. .+-...+ T Consensus 645 -QSA~Q~~A~Q~~L~~--~Q~~PSq 666 (666) T protein:vir:10 645 -QSAMQLQARQGELSN--DQSQPSQ 666 (666) T ss_pred -hhhcccccccccCcc--cccCCCC Confidence 000000000000000 0000000 No 51 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.69 E-value=2.2e-15 Score=100.97 Aligned_cols=444 Identities=13% Similarity=0.051 Sum_probs=219.6 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCCC--cCCCHHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGRS--SVQPKLIRKQAEWRY 84 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~grs--~~v~~~v~~~~e~~~ 84 (705) |.-..|.-.+.=+++++.. ..+....+.|...+.+..+..+||.|...- .+...++|+ +++.+..+..|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 77 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITV---EVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFT 77 (452) T ss_pred CcccCceeEEcCCccCCCH---HHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHh Confidence 4444444444334554432 234444556666677777888999986422 222333433 577778887777776 Q ss_pred HHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccc Q lcl|NC_021540. 85 SALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPV 164 (705) Q Consensus 85 ~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~ 164 (705) ..| ||.+.-+ .+ +|.. ..+.++.++. .|+--..+...+++++..|.+.+.+||+ T Consensus 78 ~~l----~g~~~~~--~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d------------ 131 (452) T protein:vir:36 78 GYF----NGIPVKK--SH---SDKE----ILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQD------------ 131 (452) T ss_pred hhh----cccCcee--ec---CChh----HHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEec------------ Confidence 655 5655443 33 2222 2345665543 3444455778999999999998877652 Q ss_pred cccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee Q lcl|NC_021540. 165 FQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT 244 (705) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~ 244 (705) ..+.|++..++|.+++ T Consensus 132 ----------------------------------------------------------------~~g~~~i~~~~p~~~~ 147 (452) T protein:vir:36 132 ----------------------------------------------------------------EDTQTNVVYNSPENMF 147 (452) T ss_pred ----------------------------------------------------------------CCCeeEEEEEcccceE Confidence 0234677788888875 Q ss_pred e--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeee Q lcl|NC_021540. 245 I--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDI 322 (705) Q Consensus 245 ~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~ 322 (705) + |+.... ..-+ +.+.+... .....+|+|.. T Consensus 148 ~v~d~~~~~---~~~~-~i~~~~~~-----------------------------------------~~~~~~~vyt~--- 179 (452) T protein:vir:36 148 MVYDDTVKQ---EPLF-AVRYGVDE-----------------------------------------DKKLQGEVYTL--- 179 (452) T ss_pred EEEcCCCCC---ceEE-EEEEEEec-----------------------------------------CceEEEEEEec--- Confidence 3 432211 1112 22222100 00122344432 Q ss_pred cCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_021540. 323 DGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQR 402 (705) Q Consensus 323 ~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~ 402 (705) +. .++....++........|.+.|.+|++.++. ...|.|.+..++++++.+|..++.+.+.+...++|.+ T Consensus 180 --~~---i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~ 249 (452) T protein:vir:36 180 --LE---TIKISGENDEISFGEGTYNPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYL 249 (452) T ss_pred --Ce---EEEEEEcCCceEEecceeccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 11 1111222222222233344446778877643 3358899999999999999999999999999999888 Q ss_pred EeeccccCchhhhhhcCCcceeecCCccc-ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHH Q lcl|NC_021540. 403 GMSKNLLDPVNERKFKMGEDYKYNPGTNP-VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTA 481 (705) Q Consensus 403 ~~~~~av~~~d~~~~~pg~~i~~~~~~~~-~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~ 481 (705) ++....++..+....++++.+.+.+++.. ...+.+...+.-.......++.+...+...|++++.+.+..++. |+.| T Consensus 250 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~-Sg~A- 327 (452) T protein:vir:36 250 TFLGAAVEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSS-SGVS- 327 (452) T ss_pred EeecCCcCchhhhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCC-cHHH- Confidence 77654445444455566666666553322 22234444343345666778899999999999998877765443 4444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHH Q lcl|NC_021540. 482 GVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKA 561 (705) Q Consensus 482 ~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~ 561 (705) +...............+.|..+++++++.++.++....... ++. ...+..+...+....... T Consensus 328 -l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~~---------~i~i~f~~~~p~d~~~~a 389 (452) T protein:vir:36 328 -LAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKD--------SWK---------DIEYTFTRNEPKDIKEQA 389 (452) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--------ccc---------cceEEeCCCCCcCHHHHH Confidence 44444445555566667777777777777766554321110 111 112222222221111112 Q ss_pred HHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_021540. 562 QELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKL-QAEIQLMP 640 (705) Q Consensus 562 q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~-qa~~q~~~ 640 (705) +.+..+ .+ .++. .-++. .++...+ +. .+.++.+++.+. ..+..+. +....-.. T Consensus 390 ~~~~k~---~g-~iS~---et~~~---~~~~~~d-------------~~--~E~~ri~~E~~~-~~~~~~~~~~~~~~~~ 443 (452) T protein:vir:36 390 ETANIL---MG-ITSQ---ETALS---VISVIPD-------------VQ--AEMEKIKKEEAS-TAIFDKDKQPSEKGTD 443 (452) T ss_pred HHHHHH---hc-cCCh---HHHHH---hCCCCCC-------------HH--HHHHHHHHHHHH-HHHHHhhccCCCCccc Confidence 211111 11 1111 11111 1111111 11 111111111100 0000000 00000000 Q ss_pred HHHHHHHHH Q lcl|NC_021540. 641 YEAQAEAAK 649 (705) Q Consensus 641 ~~~q~e~a~ 649 (705) ........+ T Consensus 444 ~~~~~~~~e 452 (452) T protein:vir:36 444 TVVSETNEE 452 (452) T ss_pred ccCccccCC Confidence 000000000 No 52 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=99.69 E-value=3.3e-18 Score=116.46 Aligned_cols=577 Identities=17% Similarity=0.148 Sum_probs=283.5 Q ss_pred Ccchhhhhhccccccc----CCCCCCHHHHHHHHHHHHHhhHHhhHHHHH---HHHHHHHhccCC-----CCCC------ Q lcl|NC_021540. 1 MSDINEEFLEDTVPSL----QEDWKNKPKVSDLLNDFNNAKSTKDTQVAI---IDDWLAQLNVTG-----AYKP------ 62 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~-----~~~~------ 62 (705) |+ +.-..|.- .|.--++-+-..|++-++.++--...+|++ ++++..-|-..- .+.+ T Consensus 1 ma------ispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~ 74 (666) T protein:vir:96 1 MA------ISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAK 74 (666) T ss_pred Cc------cCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeeccccccc Confidence 22 11112210 111123334456666666666555555554 345555443210 0111 Q ss_pred -CCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHHhhcCCc-chHHHHHHHH Q lcl|NC_021540. 63 -KQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFNNQLDKV-KLIDTMVRTA 139 (705) Q Consensus 63 -~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~~~~~~~-~~~~~~~~~a 139 (705) +|.-=.-.+|+|.|-.+|+.+.+.|.++|+||-++|-+.- +|.--+-|++..-++. |... .++ +-|-=.++|+ T Consensus 75 V~C~V~~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~---~~~~~~LiL~L~D~ 150 (666) T protein:vir:96 75 VRCQVVNKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTM---TSSIPELILCLQDA 150 (666) T ss_pred ccceeeccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhh---hhhHHHHHHHHhhh Confidence 1111134589999999999999999999999999988876 6666677777666553 2211 110 1111244454 Q ss_pred HhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCc Q lcl|NC_021540. 140 VNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIING 219 (705) Q Consensus 140 l~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 219 (705) +..-..-+-+.|.-.+. +.+ ..++ +...+ T Consensus 151 ~KYN~~~~ET~Ws~IE~-------------~~~-----------------------~~~i-----~~~~~---------- 179 (666) T protein:vir:96 151 AKYNLVGWETEWSNIET-------------YDP-----------------------QKEI-----TDLEP---------- 179 (666) T ss_pred hhcceeeeeeccccccc-------------cch-----------------------hhhh-----hcCCC---------- Confidence 44444333333321110 000 0000 00011 Q ss_pred ccccceeeeccCcc-eEEEechhheeeCCCc-cCCh-hhCCeEEEEEeccHHHHHHhcCC-cC---cc---hhhhhhhhh Q lcl|NC_021540. 220 YEEQEVIKTVKNQP-EVTICDYHNVTIDPTC-NGNL-DEAKFVIYSFESSRSDLEKYGIY-SN---LE---YIKEDSSTS 289 (705) Q Consensus 220 ~~~~~~~~~~~~~~-~i~~V~~~~~~~Dp~a-~~d~-~da~~~~~~~~~t~~el~~~g~~-~d---~~---~~~~~~~~~ 289 (705) .+...+.++... +|++++|.|++|||+. -+|. ....|++....+++-.|++.-.+ .| +. -+....... T Consensus 180 --~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~R~~LKK~LN~LT~EKkltykkvV~~Al~~s 257 (666) T protein:vir:96 180 --GKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLNRIQLKKYLNYLTNEKKLTYKKVVNEALKSS 257 (666) T ss_pred --ceeeeccchhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHHhhh Confidence 111112223333 7899999999999974 2343 35678888888888777653110 00 00 001110000 Q ss_pred -----------hcc--------cccccccccccc---ccccCeEEEEE--E------EEEe-------eecCCCeeEEEE Q lcl|NC_021540. 290 -----------TSS--------DHYSSDTSFTFS---DKARKKIVVYE--Y------WGYW-------DIDGSGVTTPIV 332 (705) Q Consensus 290 -----------~~~--------~~~~~~~~~~~~---~~~~~~v~v~E--~------w~k~-------~~~~dg~~~~~~ 332 (705) ++. .+-+||.-..++ ....+||-|.| . |.|+ .++.......++ T Consensus 258 ~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~sS~~~rvpvneqg~Y~k~~mY~RI~PSDF~~~~P~~N~~QIWK 337 (666) T protein:vir:96 258 FQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETSSTNRRVPVNEQGVYCKHTMYLRIIPSDFEMNVPNRNQVQIWK 337 (666) T ss_pred ccccccccCCcccccccccchhhccchhhcCcccccccccccccccccccceeeeeeeeeeccccceecCCCCCcceeee Confidence 000 011122211111 12345555444 2 2222 122333345566 Q ss_pred EEEE-CCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc Q lcl|NC_021540. 333 ASWV-DDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDP 411 (705) Q Consensus 333 ~~~~-g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~ 411 (705) ++++ |+.+|..++..-..+.||.-......+.-..-..|+.+...|.|+...++++..+-...+....+.++++..+.. T Consensus 338 ~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDGmG~QTQ~~~E~~~P~Q~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a 417 (666) T protein:vir:96 338 AVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQSATTELWNAYIQGARRAVMDRALYNPSMIRA 417 (666) T ss_pred eeeeccceeEeeehhhcccchhhhhhhhhhhhccccccccccccccchhhhhhHHhhhhhhhhhhhhhhhhhcchhhhhh Confidence 6655 467888886655667777655444444444456889999999999999999988877777777777777766644 Q ss_pred hhhhhhcCCcceeecCCccccccc--ccccCccchHHHHHHHH---HHHHHHHHHhCcchHhcCCCccccchHHHHHHHH Q lcl|NC_021540. 412 VNERKFKMGEDYKYNPGTNPVTDI--IEHKYPELPASSYNMLQ---MFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGV 486 (705) Q Consensus 412 ~d~~~~~pg~~i~~~~~~~~~~~i--~~~~~~~i~~~~~~~l~---~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l 486 (705) .+.-...|.--|.+++.+..+..+ -+.+.|.-..+....++ .+.+.-++++|++...+|.-- .-+++-.+..-. T Consensus 418 ~~iNSP~~~~KIP~~~~sL~N~~m~~~Y~~IPFD~RG~E~~~Q~A~~l~~~~r~L~GMN~~~~GQFQ-KGNKt~~E~~~~ 496 (666) T protein:vir:96 418 NDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLTDWQRELSGMNSATRGQFQ-KGNKTRAEFDTI 496 (666) T ss_pred hcccCCCCCcccceeehhhhccchhhhhccCCccccchhHHHhhhHHHhhhHHHhhccCCccccccc-ccCcceeehhhh Confidence 333222333334444444333322 23344444444444443 456667789999999999531 123444455566 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccc-eeEEeeccc-hhHHH---HH Q lcl|NC_021540. 487 IGASGKRELGILRRLAN-GLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGS-FDIKLSISN-AETDA---IK 560 (705) Q Consensus 487 ~~~~~~~~~~~~~n~~~-~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~-~dv~v~~~~-~~~~~---~~ 560 (705) +..+..|+++-+-.+++ .+..+-+.+.--+.+|.++..++.-+.++.+.|+-+.++.. ....+.+|- +.... .. T Consensus 497 MG~a~NRmRLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~~vDi~~L~~~~L~F~~~DGlTP~SKlASs~~ 576 (666) T protein:vir:96 497 MGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGVRVDIKELQDLGLKFELGDGLTPASKLASSDF 576 (666) T ss_pred cCCcccceehhhHHHhhhhhhhHHHHHhhhhhhccccchhcccccCceeeeeHHHHhhhhheeeeccCCCchhhhhhhHH Confidence 67777777776666655 23444444433456788888888887777888887766642 223333331 21222 22 Q ss_pred HHHHH-------HHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 561 AQELS-------FMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQA--QLEIQIKQLEAQELQMRIAK 631 (705) Q Consensus 561 ~q~~~-------~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~--q~~~q~~q~~~q~~q~e~~k 631 (705) +..+. .+++++++..| .+...++.+.|..-+.++--..-|+=.-.- +++.|+. -+|... T Consensus 577 lT~~LQMI~sS~~~~~A~G~~~P-----~M~AHl~QLGGVRG~E~Y~~~ALPqwqitygm~Q~LQ~~-----~LQ~~~-- 644 (666) T protein:vir:96 577 LTALLQMIMSSETTLQAFGTQVP-----GMIAHLAQLGGVRGFEKYANAALPQWQITYGMQQQLQQM-----LLQLQQ-- 644 (666) T ss_pred HHHHHHHHhcchhhHhhhcccch-----HHHHHHHHhccccchhhcccccCcchhhhhhhhHHHHHH-----HHHHhh-- Confidence 22222 22345555554 355667777776666555222111100000 1111110 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 632 LQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQ 667 (705) Q Consensus 632 ~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q 667 (705) |..+|.+..+-+. ..++.+-. | T Consensus 645 -QSA~Q~~A~Q~~L--------~~~Q~~PS-----q 666 (666) T protein:vir:96 645 -QSAMQLQARQGEL--------SNDQSQPS-----Q 666 (666) T ss_pred -hhccccccccccC--------cccccCCC-----C Confidence 0000000000000 00000000 0 No 53 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.68 E-value=4.1e-15 Score=99.43 Aligned_cols=423 Identities=11% Similarity=0.024 Sum_probs=200.9 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCC--CcCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGR--SSVQPKLIRKQAEWRYSALSEPFL 92 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gr--s~~v~~~v~~~~e~~~~~l~~~f~ 92 (705) +| ...|..|.. .|...+.+.++..+||.|.-.. .+.+.+++ .+++.+..+..|+.....| | T Consensus 1 l~----~~~l~~~i~-------~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~ 65 (429) T protein:vir:98 1 MT----KDLLSELIQ-------KHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYF----I 65 (429) T ss_pred CC----HHHHHHHHH-------HHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhh----c Confidence 22 223444433 3444455566778899986311 12233333 4678888888888877666 5 Q ss_pred CCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCc Q lcl|NC_021540. 93 NDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATG 172 (705) Q Consensus 93 ~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~ 172 (705) |.+.- |.+ +|. ...+.++.++. .|+--..+...+++++..|.|.+.+|++ T Consensus 66 g~~~~--~~~---~~~----~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d-------------------- 115 (429) T protein:vir:98 66 GVPVQ--TSH---ENK----QVSNYLELLDG-YNDQDDNNAELSKICSIYGHGYELVFND-------------------- 115 (429) T ss_pred ccCce--eec---CCh----HHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEec-------------------- Confidence 54433 332 221 23445655543 3444455778999999999998876542 Q ss_pred hhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheee--CCCcc Q lcl|NC_021540. 173 ESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCN 250 (705) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~ 250 (705) ..+.|++..++|.+++. |.... T Consensus 116 --------------------------------------------------------~~g~~~~~~~~p~~~~~v~dd~~~ 139 (429) T protein:vir:98 116 --------------------------------------------------------ENAEAGITYLTPLEAFIVYDDSIR 139 (429) T ss_pred --------------------------------------------------------CCCcEEEEEEcccceEEEEeCCCC Confidence 02346677888888753 32211 Q ss_pred CChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEE Q lcl|NC_021540. 251 GNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTP 330 (705) Q Consensus 251 ~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~ 330 (705) .-...+.+.+.+. ..+..+++|.. +.+ . T Consensus 140 ----~~~~~~i~~~~~~-----------------------------------------~~~~~~~~~~~-----~~~-~- 167 (429) T protein:vir:98 140 ----QKPLFAVRYFYNK-----------------------------------------GGVLEGSYSDA-----SNI-T- 167 (429) T ss_pred ----CceEEEEEEEEec-----------------------------------------CceEEEEEEeC-----ceE-E- Confidence 1112222332110 01122333321 000 0 Q ss_pred EEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC Q lcl|NC_021540. 331 IVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLD 410 (705) Q Consensus 331 ~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~ 410 (705) ++..-.++..+ .+..|.+.+.+|+++++ ...+|.|.+..++++++.+|...+.+.+.+...+.|.+++.....+ T Consensus 168 ~~~~~~~~~~~-~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~ 241 (429) T protein:vir:98 168 YFKDGEKGIEI-GESEPHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELD 241 (429) T ss_pred EEEecCCceEe-cccccccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCC Confidence 00000111111 12233344677877754 3457999999999999999999999999999999888776533333 Q ss_pred chhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHH Q lcl|NC_021540. 411 PVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGAS 490 (705) Q Consensus 411 ~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~ 490 (705) .........++++.+..+..-...+.+...+.-.+.+...++.+...+...|++++.+.+..++ .|+. ++....... T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~--Al~~~~~~l 318 (429) T protein:vir:98 242 DETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGT-ASGI--ALRYRLQAM 318 (429) T ss_pred cchhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccc-chHH--HHHHHHHHH Confidence 3333344455666654332112223444433334456667899999999999998877665443 2343 344444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHH Q lcl|NC_021540. 491 GKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQT 570 (705) Q Consensus 491 ~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~ 570 (705) ........+.|..+++++++.++.++.... +. .+.. ...+..+...+.......+.+..+ T Consensus 319 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~----------~~---~d~~----~i~v~f~~~~p~~~~~~a~~~~kl--- 378 (429) T protein:vir:98 319 DNLAKTKERKFMSGMNRRYKLIASYPTSKI----------GP---KDWI----GIKYKFTRNLPANLLEESQIAGNL--- 378 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCC----------Cc---cccc----cceEEeCCCCCcCHHHHHHHHHHH--- Confidence 455555666666666666666655432111 10 0100 122333322221111222222221 Q ss_pred HhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 571 MGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAE 646 (705) Q Consensus 571 ~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e 646 (705) ...++. .-++. .++... ++. .+.++.+++..+. ..+.+..+..+...--.+ T Consensus 379 -~g~is~---et~~~---~l~~v~-------------d~~--~E~~ri~~E~~~~---~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 379 -AGIVSE---ETQVG---VLSIVE-------------NPQ--KEIERKNSDKSTL---ISRQAGGLNGQNTTTILE 429 (429) T ss_pred -hccCch---HHHHH---hCCCCC-------------CHH--HHHHHHHHHHHHH---HHHHHhhhcCCCCCCCCC Confidence 111111 11111 111111 111 1111111111100 000000000000000000 No 54 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.64 E-value=5.1e-14 Score=93.46 Aligned_cols=456 Identities=11% Similarity=0.087 Sum_probs=207.2 Q ss_pred hcccccccCCCCCCHH-HHHHHHHHHHHh-hHHhhHHHHHHHHHHHHhccCCCCC--CC-CCCC----CCcCCCHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKP-KVSDLLNDFNNA-KSTKDTQVAIIDDWLAQLNVTGAYK--PK-QQVG----RSSVQPKLIRKQ 79 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~-~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~g----rs~~v~~~v~~~ 79 (705) |-+.-+.+.-.|-+.- ..+.++.-+++. ...+.+.+.++++|..||.|....- +. ...| +.+++.+.-... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i 80 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHH Confidence 2222222222222221 123333333332 2235566777889999999864221 10 1112 223344433333 Q ss_pred HHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhh Q lcl|NC_021540. 80 AEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVT 159 (705) Q Consensus 80 ~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~ 159 (705) ++.. .+.+||-+.-+.+ +|.+.++ +|+.++. .++-.+.+..++..|+..|.|.+++||+. T Consensus 81 ~~~~----a~~l~~~p~~i~~-----~d~~~~e----~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~------ 140 (496) T protein:vir:38 81 AKYM----SKLLFNEKVKINI-----DDKAAEE----FVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDG------ 140 (496) T ss_pred HHHH----hhhhhCCcceEee-----CChHHHH----HHHHHHh-ccCHHHHHHHHHHHHhhhCcEEEEEEEcC------ Confidence 3333 3334555544444 4544444 5555443 35555778899999999999999998841 Q ss_pred hcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEec Q lcl|NC_021540. 160 ENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICD 239 (705) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~ 239 (705) .+.|+|++|+ T Consensus 141 ----------------------------------------------------------------------~~~~~i~~v~ 150 (496) T protein:vir:38 141 ----------------------------------------------------------------------NKNVKVSFAT 150 (496) T ss_pred ----------------------------------------------------------------------CCcEEEEEEc Confidence 2346788999 Q ss_pred hhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEE Q lcl|NC_021540. 240 YHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGY 319 (705) Q Consensus 240 ~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k 319 (705) |..||+=..-...+..+-|+.+ + +.+ ...+..+|+|.. T Consensus 151 ~~~~~P~~~~~~~~~~~~f~~~--~-~~~---------------------------------------~~~y~~le~h~~ 188 (496) T protein:vir:38 151 ADCMYPLSNDSENVDECVIANS--F-HKN---------------------------------------NKYYTLLEWNEW 188 (496) T ss_pred ccceEEEEecCCcEEEEEEEEE--E-EeC---------------------------------------CeEEEEEEEEEE Confidence 9998741111123333333211 1 100 011222222221 Q ss_pred eeecCCCeeEEEEEEEE---C---CEE---------EecccCCCCCCCcceEEeee----eeecCcccCCchHHHhhHHH Q lcl|NC_021540. 320 WDIDGSGVTTPIVASWV---D---DVM---------IRLEKNPYPDGKLPFVVVPY----LPVKDSVYGEADAELLSDNQ 380 (705) Q Consensus 320 ~~~~~dg~~~~~~~~~~---g---~~i---------L~~~~~p~~~~~~Pfv~~~~----~~~~~~~~g~g~~~~~~d~Q 380 (705) .+ +.+..+ +.+|- + |.. +..........+.||+.+.. ....++.+|.|.+..+++++ T Consensus 189 ~~--~~~~I~--~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~li 264 (496) T protein:vir:38 189 QG--DVYTVT--TELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTL 264 (496) T ss_pred eC--ceEEEE--EEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHH Confidence 00 000000 00000 0 000 00000001113455655533 22457788999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh------hhhcC-CcceeecCCccc--ccccccccCccc-hHHHHHH Q lcl|NC_021540. 381 KLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE------RKFKM-GEDYKYNPGTNP--VTDIIEHKYPEL-PASSYNM 450 (705) Q Consensus 381 ~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~------~~~~p-g~~i~~~~~~~~--~~~i~~~~~~~i-~~~~~~~ 450 (705) +.+|..++.+.+.+.. +.+++.++.+.+..... ..+.+ ..++..-.+... ...+.... |.+ ....... T Consensus 265 d~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~i~~e~~~~~ 342 (496) T protein:vir:38 265 KTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDIS-VEIRSTEFIES 342 (496) T ss_pred HHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeec-cccCHHHHHHH Confidence 9999999999999876 67788887766532111 01111 112221111111 11233333 333 3456667 Q ss_pred HHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEec Q lcl|NC_021540. 451 LQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITD 530 (705) Q Consensus 451 l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~ 530 (705) ++.+...+...+|+++...|..++. ..||+++.......-.......+.|..+++++++.++.+...+..-. + T Consensus 343 l~~~l~~i~~~~g~~~~~f~~~~~g-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~------g 415 (496) T protein:vir:38 343 INAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYS------G 415 (496) T ss_pred HHHHHHHHHHhhCCChhhcCCCccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------C Confidence 7888888888999999998865432 34677776654555555566777788888898888887765432100 0 Q ss_pred CceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhh Q lcl|NC_021540. 531 EEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQ 610 (705) Q Consensus 531 ~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~ 610 (705) . .+. .....+..+.+...-.....+..+.+.+. |. ++.. .+ +....+... .. T Consensus 416 ~---~~~----~~~i~v~f~d~i~~d~~~~~~~~~~~~~~-Gi-iS~e---t~---l~~~~~~~d-------------~e 467 (496) T protein:vir:38 416 E---VVE----LDTITVDFDDSIAQDEDTTINRYTNAKNQ-GM-IPLK---IA---LQRAWNITE-------------AE 467 (496) T ss_pred C---CCC----ccceEEEeCCCCCCCHHHHHHHHHHHHhc-CC-CCHH---HH---HHhcCCCCh-------------HH Confidence 0 000 01122333333222222233333333221 11 1100 00 011111110 00 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHH Q lcl|NC_021540. 611 AQLEIQIKQLEAQEL--QMRIAKLQAEIQ 637 (705) Q Consensus 611 ~q~~~q~~q~~~q~~--q~e~~k~qa~~q 637 (705) +....++.+.+.+.. +........+.+ T Consensus 468 a~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 468 ADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred HHHHHHHHHHhhhccCccccccCCCCCCC Confidence 100111100000000 000000000000 No 55 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.61 E-value=2.8e-13 Score=89.39 Aligned_cols=426 Identities=10% Similarity=0.000 Sum_probs=189.4 Q ss_pred CCCHH--HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCC---CCCcCCCHHHHHHHHHHHHHHHHhhc Q lcl|NC_021540. 20 WKNKP--KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQV---GRSSVQPKLIRKQAEWRYSALSEPFL 92 (705) Q Consensus 20 ~~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---grs~~v~~~v~~~~e~~~~~l~~~f~ 92 (705) +++++ ++..|... +.....+.++..+||.|.... .+...+ ..-++|.+-.+..|+.....|. T Consensus 1 ~~~~~~~~i~~l~~~-------~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~---- 69 (441) T protein:vir:80 1 MNSDELALIEGMYDR-------IQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD---- 69 (441) T ss_pred CCccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc---- Confidence 33332 23333332 233334455667899887432 111111 1234666666666665444331 Q ss_pred CCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCc Q lcl|NC_021540. 93 NDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATG 172 (705) Q Consensus 93 ~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~ 172 (705) +.+.+-+|.+ .+..+| ..|+-...+...++++++.|.|.+.+|= T Consensus 70 -------~~g~~~~d~~-------~l~~i~-~~n~~~~~~~~~~~~~~~~G~a~~~v~~--------------------- 113 (441) T protein:vir:80 70 -------WLGWTNGDGY-------GLDGVY-AANRLATASCDVHLDALIFGLSFVAIIP--------------------- 113 (441) T ss_pred -------cccccCCChH-------HHHHHH-HhcCHHHHHHHHHHHHhhcCeeEEEEEe--------------------- Confidence 1111112211 122222 2355555566777888888888665420 Q ss_pred hhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCcc Q lcl|NC_021540. 173 ESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTCN 250 (705) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~ 250 (705) ...+.|+|..++|.+++ |||... T Consensus 114 -------------------------------------------------------d~~g~~~i~~~~p~~~~~i~d~~~~ 138 (441) T protein:vir:80 114 -------------------------------------------------------HGDGTVSVRPQSPKNCTGKFSADGS 138 (441) T ss_pred -------------------------------------------------------CCCCceEEEEEccceEEEEEeCCCC Confidence 11345678889999965 565432 Q ss_pred CChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEE Q lcl|NC_021540. 251 GNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTP 330 (705) Q Consensus 251 ~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~ 330 (705) .+ ...++++. .+.+ .+...+.|.. +. . T Consensus 139 -~~---~~~~~~~~------------~~~~-----------------------------~~~~~~vy~~-----~~---~ 165 (441) T protein:vir:80 139 -RL---DAGLVVQQ------------TCDP-----------------------------EVVEAELLLP-----DV---I 165 (441) T ss_pred -ce---eEEEEEEE------------EecC-----------------------------ceEEEEEEec-----Ce---E Confidence 11 11111111 0000 0011222221 11 0 Q ss_pred EEEEEEC-CEEEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc Q lcl|NC_021540. 331 IVASWVD-DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL 408 (705) Q Consensus 331 ~~~~~~g-~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a 408 (705) +.....| +.....+..|.+.|.+|+++++..+..++++|.|-+ +.++++++.+|..++.+.+.+...+.|...+- |+ T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~ 244 (441) T protein:vir:80 166 VQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GV 244 (441) T ss_pred EEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cC Confidence 1111111 222333444555588999999988888999999865 56999999999999999999999998877653 43 Q ss_pred -cCc--hhhhhhcCCcceeecCCcccccccccccCccc-hHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHH Q lcl|NC_021540. 409 -LDP--VNERKFKMGEDYKYNPGTNPVTDIIEHKYPEL-PASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQ 484 (705) Q Consensus 409 -v~~--~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i-~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~ 484 (705) .+. .+.....+++++.+.++.... .+.+.+.+.- ...+...+......+-..+++++...|..++.. .+|.+++ T Consensus 245 ~~~~~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~-~Sg~Al~ 322 (441) T protein:vir:80 245 SADEFSQPGWVLSMASVWAVDKDDDGD-TPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNP-PSGEALA 322 (441) T ss_pred CccccccchhhhcccccccCCCCCCCC-cceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcc-hHHHHHH Confidence 221 234556778887765543322 1222222221 122223333344444455888888888765432 2444455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHH Q lcl|NC_021540. 485 GVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQEL 564 (705) Q Consensus 485 ~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~ 564 (705) .....-........+.|..+++++++.++.+.-...... +. -....+..+...+.......+.+ T Consensus 323 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~---------------~~-~~~i~~~f~~~~~~~~~e~ad~~ 386 (441) T protein:vir:80 323 AEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEA---------------DF-FGDVGLRWRDASTPTRAATADAV 386 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc---------------cc-ceeeeEEeCCCCCcCHHHHHHHH Confidence 444444444455556666666666655444322111000 00 01122333333222223333333 Q ss_pred HHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 565 SFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQ 644 (705) Q Consensus 565 ~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q 644 (705) ..|.++..+... ...++ +..+.. +++.+ +.+..+++.+ ..++.+-.. T Consensus 387 ~kl~~~g~~~~s---~~~~~----~~l~~~------------~~e~~--~~~~e~~e~~---~~~~~~~~~--------- 433 (441) T protein:vir:80 387 TKLVGAGILPAD---SRTVL----EMLGLD------------DVQVE--AVMRHRAESS---DPLAVLAGA--------- 433 (441) T ss_pred HHHHhcCccccc---HHHHH----HhCCCC------------HHHHH--HHHHHHHHHH---HHHHHHhhh--------- Confidence 333333211111 00011 111110 00000 0000000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 645 AEAAKARKANTEADLNTL 662 (705) Q Consensus 645 ~e~a~a~~~~~ea~~~~~ 662 (705) ...+..++ T Consensus 434 ----------~~~~~~~~ 441 (441) T protein:vir:80 434 ----------ISRQTNEV 441 (441) T ss_pred ----------hhcccccC Confidence 00000000 No 56 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.61 E-value=1.6e-13 Score=90.73 Aligned_cols=463 Identities=10% Similarity=0.018 Sum_probs=208.2 Q ss_pred Ccc-------h-------------hhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhh-HHHHHHHHHHHHhccCC- Q lcl|NC_021540. 1 MSD-------I-------------NEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKD-TQVAIIDDWLAQLNVTG- 58 (705) Q Consensus 1 ~~~-------~-------------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~- 58 (705) |.+ + +..+.... =..++.+.- ..|++- ...|. ....+.++..+||.|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~----~~i~~~----i~~~~~~~~~r~~~~~~yY~g~~~ 71 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADN-LEELMVNNW----ELLKNF----INHHKLRQAPRIQELLDYARGENH 71 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccc-cccccCChH----HHHHHH----HHHHHHHHHHHHHHHHHHhcCCCC Confidence 111 0 00000000 011221111 112222 22333 22345667789999852 Q ss_pred CC---CCCCCCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHH Q lcl|NC_021540. 59 AY---KPKQQVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLID 133 (705) Q Consensus 59 ~~---~~~~~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~ 133 (705) .. ......++ .+++.+-....|+.....| ||.+. .|.....+| -+...++++.+|. .|+--..+. T Consensus 72 ~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl----~g~p~--~~~~~~~~~---~~~~~~~l~~~~~-~n~~~~~~~ 141 (501) T protein:vir:96 72 DVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL----AGNPI--RVEYDDNDD---NSQNDDAIKRIGR-INDLDSLNR 141 (501) T ss_pred cccCccccCccccccceeecchHHHHHHHHhhhh----cccCe--eEeeCCccc---hhHHHHHHHHHHH-hcCHHHHHH Confidence 22 11122233 3678888888888877655 45443 333322222 2345566666544 355445677 Q ss_pred HHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccc Q lcl|NC_021540. 134 TMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPI 213 (705) Q Consensus 134 ~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 213 (705) .++++++..|.+.+.+||+. T Consensus 142 ~~~~~~~~~G~a~~~v~~de------------------------------------------------------------ 161 (501) T protein:vir:96 142 TLIRDLSQTGRAYEVIYRSE------------------------------------------------------------ 161 (501) T ss_pred HHHHHHhhcCeEEEEEEEcC------------------------------------------------------------ Confidence 89999999999988876520 Q ss_pred eeccCcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhc Q lcl|NC_021540. 214 LAIINGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTS 291 (705) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~ 291 (705) .+.|+|..++|.++++ |+.... +..+.+ +.+.... . T Consensus 162 ----------------dg~~~i~~~~p~~~~~v~d~~~~~---~~~~~v-~~~~~~~-----------~----------- 199 (501) T protein:vir:96 162 ----------------YDETRIKRLSPLETFVIYDNSLED---NSIAAV-RYYNRGT-----------L----------- 199 (501) T ss_pred ----------------CCceEEEEEccceeEEEEcCCCCC---ceEEEE-EEEEeec-----------C----------- Confidence 1345677888888754 333211 122222 2221000 0 Q ss_pred cccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCc Q lcl|NC_021540. 292 SDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEA 371 (705) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g 371 (705) ...+.++++|.. +.+ ++ +-.++........|.+.|.+|++++. ...+|.| T Consensus 200 ----------------~~~~~~~~vyt~-----~~i---~~-~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~s 249 (501) T protein:vir:96 200 ----------------QSAKDVVEIYTD-----EHI---YT-LDASDDFNEISVTTHAFGTVPITEYL-----NNIDGIG 249 (501) T ss_pred ----------------CCcEEEEEEEcC-----CcE---EE-EeeCCCceeccccccCCCccceEEec-----CCccCCC Confidence 001234555543 111 11 11122222223334444788888764 3456899 Q ss_pred hHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchh--hhhhcCCcceeecCCcc-----cccccccccCccch Q lcl|NC_021540. 372 DAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVN--ERKFKMGEDYKYNPGTN-----PVTDIIEHKYPELP 444 (705) Q Consensus 372 ~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d--~~~~~pg~~i~~~~~~~-----~~~~i~~~~~~~i~ 444 (705) .+..++++++.+|...+.+.+.+...+.|.+++........+ .......+.+.+..... ....+.++..+.-. T Consensus 250 d~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 329 (501) T protein:vir:96 250 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV 329 (501) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCH Confidence 999999999999999999999999988887766433222221 22333444444432211 11123333333333 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ..+...++.+...+...|++++.+.|..++..|+.| +...............+.|..+++++++.++.++........ T Consensus 330 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 407 (501) T protein:vir:96 330 SGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEA--LKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKD 407 (501) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCcccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 556677899999999999999988876544445544 444334445555666677777777777777766543221100 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) . ++ ....+..+...+.......+.+..+. + .++. .-++. .++....+. T Consensus 408 ~------d~---------~~i~i~f~~~~p~n~~e~ad~~~kl~---g-~iS~---et~~~---~l~~v~D~~------- 455 (501) T protein:vir:96 408 F------DE---------SLLKITFTPNLPKSLNEQVSILTGLG---G-QVSQ---ETALS---LSGLVESPN------- 455 (501) T ss_pred c------cc---------ccceEEeCCCCCcCHHHHHHHHHHHh---c-cCch---HHHHH---hCCCCCCHH------- Confidence 0 00 01222222222221222222222211 1 1111 00110 111111110 Q ss_pred ccchhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQ--MRIAKLQAEIQLMPYEAQAEAAKARKANTEAD 658 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q--~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~ 658 (705) .++++.+.+.++.. ........... .....+.+ ..+......-+ T Consensus 456 --------~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~e-~~~d~~e~~~~ 501 (501) T protein:vir:96 456 --------EELDKINKEMSEIDFKGYSNDFNEHVG-KYTDEVKE-THTDDFEREYE 501 (501) T ss_pred --------HHHHHHHHHHHHhhccccccchhhccc-ccCCcCCC-CCCCccccccC Confidence 01111111111100 00000000000 00000000 00000000000 No 57 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.60 E-value=1.7e-14 Score=96.11 Aligned_cols=447 Identities=11% Similarity=0.025 Sum_probs=204.3 Q ss_pred Ccchhhhhh----ccccccc--CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCC-------C Q lcl|NC_021540. 1 MSDINEEFL----EDTVPSL--QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQ-------Q 65 (705) Q Consensus 1 ~~~~~~~~~----~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-------~ 65 (705) |.++.-..- +.+-.+. ... .+..+|+ ...+.|...+.+..++.+||.|..... +.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~i~-------~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~ 72 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYE-TQEEMIL-------RLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEID 72 (468) T ss_pred CccccCCcCceeehheeeccccccc-CcHHHHH-------HHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccc Confidence 777744322 1111111 111 1112232 333455555666778999999873111 001 1 Q ss_pred CC--CCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcC Q lcl|NC_021540. 66 VG--RSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEG 143 (705) Q Consensus 66 ~g--rs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g 143 (705) +. ..+++.+..+..|+.....| ||.+.-+. .+|.+.- ..+..++. ++-...+...+++++..| T Consensus 73 ~~~~~~ki~~n~~~~Iv~~~~~~l----~g~p~~~~-----~~d~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G 137 (468) T protein:vir:96 73 PFKPDWRMYTNYHQNLVDQKVAYA----VANPVTYG-----TEDEKSL----KTIQEVLN--HKWDDKLVDILTAASNKG 137 (468) T ss_pred ccccccccccchHHHHHHHHHhhh----ccCCceec-----cCChHHH----HHHHHHHh--cCHHHHHHHHHHHHhhcC Confidence 12 23688888888888777666 55444432 2343332 23444442 455556677889999999 Q ss_pred CeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccccc Q lcl|NC_021540. 144 TVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQ 223 (705) Q Consensus 144 ~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 223 (705) .+.+.+||+ T Consensus 138 ~~~~~v~~d----------------------------------------------------------------------- 146 (468) T protein:vir:96 138 VEWIQPYVD----------------------------------------------------------------------- 146 (468) T ss_pred eEEEEEEEc----------------------------------------------------------------------- Confidence 998887752 Q ss_pred ceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccc Q lcl|NC_021540. 224 EVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSF 301 (705) Q Consensus 224 ~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~ 301 (705) ..+.++|..++|.++++ |+.. ..+..+.+ +.+...+ .. .... + T Consensus 147 -----~~~~~~i~~~~p~~~~~v~~~~~---~~~~~~~i-r~~~~~~----------~~-------------~~~~---~ 191 (468) T protein:vir:96 147 -----EQGEFKTFRVPAEQAIPIWTNKE---RDELKAFI-RLYELDG----------GE-------------RVEY---W 191 (468) T ss_pred -----CCCceEEEEEcccceEEEEcCCC---CCceEEEE-EEEEecC----------ce-------------EEEE---E Confidence 01346788889988763 3332 22323332 3331000 00 0000 0 Q ss_pred cccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHH Q lcl|NC_021540. 302 TFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQK 381 (705) Q Consensus 302 ~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~ 381 (705) ...++..|.++. +..+.....-.............|.+.+.+|++++.. ...|.|.+..++++++ T Consensus 192 -----~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n-----~~~g~sd~e~v~~liD 256 (468) T protein:vir:96 192 -----TANDVTFYELKD-----GQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKN-----NPQEVSDLFMYKTIID 256 (468) T ss_pred -----eCCeEEEEEEcC-----CceeecccccccccccceeeccccccCCcccEEEecC-----CCCCCCchHHHHHHHH Confidence 001122221110 0000000000000000111223344557788887754 3468999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEeeccccCchhh--hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 382 LIGALTRGMIDAMARSANGQRGMSKNLLDPVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEAD 459 (705) Q Consensus 382 ~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~ 459 (705) .+|...|.+.+.+...++|.+++.....+.... ...+.++++.+.+... +.+.+...+.-.......++.+...+. T Consensus 257 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~--~~~~~l~~~~~~~~~~~~~~~l~~~I~ 334 (468) T protein:vir:96 257 AMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGS--GGVDTIQIDVPVQSAKEYLDMLRDYVI 334 (468) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCC--CcceEEeecCChHHHHHHHHHHHHHHH Confidence 999999999999998888877765332222121 2234455666654321 223444433334566677899999999 Q ss_pred HHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechh Q lcl|NC_021540. 460 ALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRD 539 (705) Q Consensus 460 ~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~ 539 (705) ..+++++.+.+..++..|+.| +...............+.|..+++++++.++.+ +... ++.. T Consensus 335 ~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~----~g~~------------~d~~ 396 (468) T protein:vir:96 335 EFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTLTALQELLQYIIDF----YKLS------------IKVQ 396 (468) T ss_pred HHhCcccccccccccchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCCC------------cccc Confidence 999998887665443334443 444444444445555566666666665555543 2210 1111 Q ss_pred hcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHH Q lcl|NC_021540. 540 NLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQ 619 (705) Q Consensus 540 ~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q 619 (705) ...+..+.+.+.......+ .++..+. ++ ..-++. .++...++ +.+.++.+ T Consensus 397 ----~i~i~f~~~~p~d~~e~a~----~~~~~g~-iS---~et~i~---~l~~v~D~---------------~~E~~ri~ 446 (468) T protein:vir:96 397 ----DVEITFNFNVMVNELEQSQ----IGVNSQY-LS---KETVVT---NHPWVDDP---------------VAEMERID 446 (468) T ss_pred ----eeeEEecCCCCcCHHHHHH----HHHhcCC-Cc---hHHHHH---hCCCCCCH---------------HHHHHHHH Confidence 1122222222211111111 1111110 10 011111 11111111 11111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 620 LEAQELQMRIAKLQAEIQLMPY 641 (705) Q Consensus 620 ~~~q~~q~e~~k~qa~~q~~~~ 641 (705) .+..+........-....-.-. T Consensus 447 ~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 447 QEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHHHhhccCCCCCCCCC Confidence 1111000000000000000000 No 58 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.60 E-value=2e-13 Score=90.20 Aligned_cols=465 Identities=11% Similarity=0.046 Sum_probs=216.0 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhh-HHhhHHHHHHHHHHHHhccCCCCC-CCCCCCCCc---CCC-H Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAK-STKDTQVAIIDDWLAQLNVTGAYK-PKQQVGRSS---VQP-K 74 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~grs~---~v~-~ 74 (705) |-+-++..+-...+. + -+...|.+-.++-+ ...+.++.+++.|..||.|++... .....|+.+ ..+ + T Consensus 3 ~~~~ik~~~~~~~~~-~------~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~sln 75 (505) T protein:vir:79 3 FWDTLKNLFRKGSAA-V------GMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVN 75 (505) T ss_pred hHHHHHHHHHHhhhh-h------cchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecc Confidence 333333333221110 0 01122332222221 122345667788999998875321 222333221 222 2 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 75 LIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 75 ~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) .-...++ .+.+.+|+-..-|.+ +|.+ .+++|+.++. .|+-...+..++..|+..|.+++|+||+ T Consensus 76 l~~~i~~----~~A~ll~~e~~~i~~-----~d~~----~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~~~k~~~D-- 139 (505) T protein:vir:79 76 VTKLASA----KLASLIFNEQCQVTV-----SDET----ANDFLDDVFQ-QNDFYTTFEEKLEEWIALGSGCVRPYVD-- 139 (505) T ss_pred hHHHHHH----HHHhhhcCCCceeec-----CChH----HHHHHHHHHH-hccHHHHHHHHHHHHhhcCCeEEEEEEe-- Confidence 2222222 223333444444444 3433 4456666543 2444566788999999999999999983 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) .+.++ T Consensus 140 ---------------------------------------------------------------------------~~~~~ 144 (505) T protein:vir:79 140 ---------------------------------------------------------------------------SGKIK 144 (505) T ss_pred ---------------------------------------------------------------------------CCceE Confidence 12356 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVY 314 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 314 (705) |+.|++..|++=..-..++.+|-|+.+.+..... . ..-++.+ T Consensus 145 i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~---~-----------------------------------~~~yt~l 186 (505) T protein:vir:79 145 LAWATADQVYPLQADTNQVNELAIASRTTEVENH---R-----------------------------------TIYYTLL 186 (505) T ss_pred EEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCC---c-----------------------------------ceEEEEE Confidence 8889998887421111245555544322111100 0 0012233 Q ss_pred EEEEEeeecCCCeeEEEEEEEEC------CEEEecccCC-----------CCCCCcceEEee----eeeecCcccCCchH Q lcl|NC_021540. 315 EYWGYWDIDGSGVTTPIVASWVD------DVMIRLEKNP-----------YPDGKLPFVVVP----YLPVKDSVYGEADA 373 (705) Q Consensus 315 E~w~k~~~~~dg~~~~~~~~~~g------~~iL~~~~~p-----------~~~~~~Pfv~~~----~~~~~~~~~g~g~~ 373 (705) |+|...+ +.|..+ .-.|.+ |..+.....| ....+.+|+.++ .....++.+|.|++ T Consensus 187 E~h~~~~--~~~~I~--n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~ 262 (505) T protein:vir:79 187 EFHQWDH--GDYVIT--NELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLI 262 (505) T ss_pred EEEEecC--ceEEEE--EEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchh Confidence 4333210 111100 111110 1101001111 111233454442 22345778999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-------h---hhcCCcceeec-CCcccccccccccCcc Q lcl|NC_021540. 374 ELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-------R---KFKMGEDYKYN-PGTNPVTDIIEHKYPE 442 (705) Q Consensus 374 ~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-------~---~~~pg~~i~~~-~~~~~~~~i~~~~~~~ 442 (705) ..+++..+.+|..++++.+.+.+ +..++.++.+++...-. . .+..+..+... .+......+...++.- T Consensus 263 ~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~i 341 (505) T protein:vir:79 263 DNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPI 341 (505) T ss_pred hhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccC Confidence 99999999999999999998865 66778887776532110 0 11122211111 1111223455555433 Q ss_pred chHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021540. 443 LPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSD 522 (705) Q Consensus 443 i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~ 522 (705) ....++..++.+...+...+|++....|..++. ..||+++....+..-.....+.+.+..+++++.+.++.+..-+.-. T Consensus 342 r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~ 420 (505) T protein:vir:79 342 RVADYQATMDFFLREFENQTGLSQGTFTTSPSG-IQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFY 420 (505) T ss_pred CHHHHHHHHHHHHHHHHHHhCCChhhcCCCccc-cchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 334567778888888888999999988876554 3578888876666666777888888888999988888876655421 Q ss_pred ceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhc Q lcl|NC_021540. 523 EEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISK 602 (705) Q Consensus 523 ~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~ 602 (705) .-- ...-....-.....+..+.+...-.....+..+++.+.. . ++.. .-+++..++.+ T Consensus 421 ~~g-------~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~G-i-~s~e------~~l~~~~~~~e------- 478 (505) T protein:vir:79 421 ADG-------QARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQ-V-MPKK------QFLMRNYGLDE------- 478 (505) T ss_pred ccc-------cccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcC-C-CCHH------HHHHhcCCCCh------- Confidence 100 000000000112233334443332333344444443321 1 1110 00112222211 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 603 YNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAE 635 (705) Q Consensus 603 ~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~ 635 (705) ..+....++.+.+......+.-..-.+ T Consensus 479 ------eea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 479 ------EEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred ------HHHHHHHHHHHHhccccCCCchhccCC Confidence 011111111111000000000000001 No 59 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.60 E-value=1.5e-13 Score=90.94 Aligned_cols=449 Identities=9% Similarity=0.016 Sum_probs=202.7 Q ss_pred hhhhhcccccccCCCCCCHHH-HHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--C---------CCCCCCCcCC Q lcl|NC_021540. 5 NEEFLEDTVPSLQEDWKNKPK-VSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--P---------KQQVGRSSVQ 72 (705) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~---------~~~~grs~~v 72 (705) |.-.|.-++.-+ ++-...+. ...+...+......+...+.+.++..+||.|.-... + ...+-..+++ T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~ 79 (472) T protein:vir:93 1 MYPSQPTQTEIF-DAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 79 (472) T ss_pred CCCCCCcchhhh-hceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccc Confidence 333332222211 11111111 112333445555566677777788899999862110 0 0111234678 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeec Q lcl|NC_021540. 73 PKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWC 152 (705) Q Consensus 73 ~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~ 152 (705) .+..+..|+.....| ||.+. .|.+ +|.+.. +.++..+. |+-...+...+++++.+|.|.+.+|++ T Consensus 80 ~n~~~~ivd~~~~~l----~g~~~--~~~~---~d~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d 144 (472) T protein:vir:93 80 TNFHANLVDQKVSYI----VGKPI--AFKH---TDDEVV----KRIDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLD 144 (472) T ss_pred cchHHHHHHHHhhhh----cccCe--eecc---CChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhcCeEEEEEEEC Confidence 888888888877666 45442 3322 333332 34444432 444455667889999999998876542 Q ss_pred chhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc Q lcl|NC_021540. 153 LEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ 232 (705) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 232 (705) ..+. T Consensus 145 ----------------------------------------------------------------------------~d~~ 148 (472) T protein:vir:93 145 ----------------------------------------------------------------------------EEGE 148 (472) T ss_pred ----------------------------------------------------------------------------CCCc Confidence 1234 Q ss_pred ceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCe Q lcl|NC_021540. 233 PEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKK 310 (705) Q Consensus 233 ~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (705) |++..++|.++++ |++.. .+-.+ +.+.+.+..+ .. T Consensus 149 ~~i~~~~p~~~~~i~d~~~~---~~~~~-~ir~~~~~~~---------------------------------------~~ 185 (472) T protein:vir:93 149 FKLFRVPAEQGIPIWTDKEH---EELEA-FIRMYKLENE---------------------------------------TK 185 (472) T ss_pred eEEEEEcccceEEEEcCCCC---CceEE-EEEEEEeecc---------------------------------------ee Confidence 6788899999775 33322 12222 2333311000 00 Q ss_pred EEEE---EEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHH Q lcl|NC_021540. 311 IVVY---EYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALT 387 (705) Q Consensus 311 v~v~---E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~ 387 (705) +.+| ++|. +-.++++.... .....+....... |.+.+.+|+++++. +.+|.|.+..++++++.+|.++ T Consensus 186 ~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~ 256 (472) T protein:vir:93 186 VEYWDKVTVNY-YVYENGSLIPD-YSNNLENSKTHFS--TGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRL 256 (472) T ss_pred EEEEecCeEEE-EEEecCeeeec-ccccccccccccc--cCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHH Confidence 1111 1110 00111111000 0000111112222 33346778877753 3478999999999999999999 Q ss_pred HHHHHHHHhcCCCcEEeeccccCchhhh--hhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_021540. 388 RGMIDAMARSANGQRGMSKNLLDPVNER--KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVK 465 (705) Q Consensus 388 ~~~~d~~~~~~~~~~~~~~~av~~~d~~--~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~ 465 (705) +.+.+.+...+.|.+++...-....... ....++++.+..++. +.+...+.-.......++.+...+...++++ T Consensus 257 s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 332 (472) T protein:vir:93 257 SDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAV 332 (472) T ss_pred HHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCCCCc----ceeEeecCCHHHHHHHHHHHHHHHHHHhCCC Confidence 9999999998888776543211111111 223445555544332 3344333334566677899999999999999 Q ss_pred hHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccce Q lcl|NC_021540. 466 SFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSF 545 (705) Q Consensus 466 d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~ 545 (705) +.+.+..++..|+.| +...............+.|..+++++++.++.++ ... .++. +. T Consensus 333 ~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~----~~~-------~~~~---------~i 390 (472) T protein:vir:93 333 DFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF----DIK-------GEHK---------DV 390 (472) T ss_pred CCCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-------cccc---------ee Confidence 887765544444444 4443344444455555666666666655555443 211 0111 12 Q ss_pred eEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHH- Q lcl|NC_021540. 546 DIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQE- 624 (705) Q Consensus 546 dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~- 624 (705) .+..+...+.......+.+..+. + .++.. -.+. .+........ +.++.+.+..+ T Consensus 391 ~v~f~~~~p~~~~~~~~~~~k~~---g-iis~e---t~l~---~l~~~~d~~~---------------E~~ri~~E~~~~ 445 (472) T protein:vir:93 391 DISFNYNKVANTELQVQTAQQSM---G-IVSHE---TVLE---NHPFVEDLQA---------------ELERIEQEQMEY 445 (472) T ss_pred eEEeCCCCCCCHHHHHHHHHHHh---c-cCchH---HHHH---hCCCCCCHHH---------------HHHHHHHHHHHH Confidence 22222222211111122111111 1 11110 0010 1111111100 11111100000 Q ss_pred --HHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_021540. 625 --LQMRIAKLQAEIQ-LMPYEAQAEAA 648 (705) Q Consensus 625 --~q~e~~k~qa~~q-~~~~~~q~e~a 648 (705) ............. .....-+.+.+ T Consensus 446 ~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 446 NKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHhccCcCcccCCCCCCCCCCCcccCC Confidence 0000000000000 00000000000 No 60 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.60 E-value=1.1e-13 Score=91.63 Aligned_cols=441 Identities=10% Similarity=0.017 Sum_probs=200.1 Q ss_pred cccccccCCCCC-CHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCC--CcCCCHHHHHHHHHHH Q lcl|NC_021540. 10 EDTVPSLQEDWK-NKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGR--SSVQPKLIRKQAEWRY 84 (705) Q Consensus 10 ~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gr--s~~v~~~v~~~~e~~~ 84 (705) -+-.|-++=.++ ++.+.+ .++....+.|...+.+.++..+||.|.-.- .+...+|+ .+++.+..+..|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 77 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITD---KVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFV 77 (453) T ss_pred CccccceeeeccccccCCH---HHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhh Confidence 122233322222 222222 233344455666666677888999986321 12233443 4688888888888776 Q ss_pred HHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccc Q lcl|NC_021540. 85 SALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPV 164 (705) Q Consensus 85 ~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~ 164 (705) ..| ||.+. .|.+ +|.. ..+.++.++ ..|+--..+..++++++.+|.|.+.+|++. T Consensus 78 ~~l----~g~~~--~~~~---~d~~----~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~----------- 132 (453) T protein:vir:73 78 GYF----NGIPI--KKTH---DDKS----VLEAMQLFD-NLNDMEDEESELAKIACVYGRAYELMYQNE----------- 132 (453) T ss_pred hhh----cccCc--eeec---CChH----HHHHHHHHH-HhcChhHHHHHHHHHHHhcCeEEEEEEeCC----------- Confidence 555 55443 3333 2322 223444443 335554567789999999999988776520 Q ss_pred cccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee Q lcl|NC_021540. 165 FQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT 244 (705) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~ 244 (705) .+.|++..++|.+++ T Consensus 133 -----------------------------------------------------------------~~~~~i~~~~p~~~~ 147 (453) T protein:vir:73 133 -----------------------------------------------------------------STESEVIYCSPLNVF 147 (453) T ss_pred -----------------------------------------------------------------CCceEEEEEcccceE Confidence 124567778888865 Q ss_pred eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecC Q lcl|NC_021540. 245 IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG 324 (705) Q Consensus 245 ~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~ 324 (705) +=.+- ...+-...+.+.+.+ .+ .....++|.. T Consensus 148 ~v~dd--~~~~~~~~~i~~~~~------------~~-----------------------------~~~~~~vyt~----- 179 (453) T protein:vir:73 148 MVYDD--SIKQKPLFAVYYGFD------------EE-----------------------------GNLSGTVYTL----- 179 (453) T ss_pred EEEeC--CCCceeEEEEEEEEe------------cC-----------------------------ceEEEEEEeC----- Confidence 42211 011111112222210 00 0012233322 Q ss_pred CCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEe Q lcl|NC_021540. 325 SGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGM 404 (705) Q Consensus 325 dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~ 404 (705) +. .++...-++..-.....|.+.|.+|+++++. ..+|.|.+..++++++.+|..+|.+.+.+...++|.+++ T Consensus 180 ~~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~ 251 (453) T protein:vir:73 180 LE---TISITGKAGEVKFGESTYNVYSDLPIVEYNF-----NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVF 251 (453) T ss_pred Ce---EEEEEecCCceEEccceeccCCceeEEEecC-----CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeee Confidence 11 1111111111111222333346788887653 346889999999999999999999999998888888776 Q ss_pred eccccCchhhhhhcCCcceeec---CC---ccc-ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccc Q lcl|NC_021540. 405 SKNLLDPVNERKFKMGEDYKYN---PG---TNP-VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLG 477 (705) Q Consensus 405 ~~~av~~~d~~~~~pg~~i~~~---~~---~~~-~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~ 477 (705) ....++..+......+.++... ++ ... ...+.+...+.-...+...++.+...+...|++++.+.+..++ +| T Consensus 252 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~S 330 (453) T protein:vir:73 252 LGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGN-SS 330 (453) T ss_pred ecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccC-cc Confidence 4322332222222222222211 11 111 1113344333333455667888999999999998887765443 34 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHH Q lcl|NC_021540. 478 TTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETD 557 (705) Q Consensus 478 ~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~ 557 (705) +.| +...............+.|..+++++++.++.++.... . ..+.. ...+..+...+... T Consensus 331 g~A--l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~----------~---~~~~~----~i~v~f~~~~p~~~ 391 (453) T protein:vir:73 331 GVA--LAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNAS----------N---KDAWK----DIEYTFTRNEPKDI 391 (453) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------C---ccccc----cceEEeCCCCCCCH Confidence 444 44433444445555566666666666666555432111 0 00000 12233333322212 Q ss_pred HHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 558 AIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQ 637 (705) Q Consensus 558 ~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q 637 (705) .+..+.+..+. + .++. ..+ +..+.... ++.+ +.++.+.+.++. .+.+.... T Consensus 392 ~~~a~~~~k~~---g-iis~----et~--~~~~~~~~-------------d~~~--E~~ri~~E~~~~----~~~~~~~~ 442 (453) T protein:vir:73 392 KEQAETANILK---G-ITSE----ETA--LSVISVIP-------------DVQA--EMEKIKKKKLLQ----LSLTRTSN 442 (453) T ss_pred HHHHHHHHHHh---c-cCcH----HHH--HHhCCCCC-------------CHHH--HHHHHHHHHHHH----HHHHHhcc Confidence 22222211111 1 1111 111 01111111 1111 111111110000 00000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 638 LMPYEAQAEAAKARKANT 655 (705) Q Consensus 638 ~~~~~~q~e~a~a~~~~~ 655 (705) ..+. .+.+.-+ T Consensus 443 ~~~~-------~~~~~~~ 453 (453) T protein:vir:73 443 LVRM-------KQMRGNL 453 (453) T ss_pred CCcc-------hhhhcCC Confidence 0000 0000000 No 61 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.59 E-value=5.9e-13 Score=87.61 Aligned_cols=452 Identities=13% Similarity=0.022 Sum_probs=206.6 Q ss_pred Ccchhhhhhccccccc--CCCCCCH-HHHHHHHHHHHHhhHHhhH-HHHHHHHHHHHhccCCCC-CCCCCCC--CCcCCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSL--QEDWKNK-PKVSDLLNDFNNAKSTKDT-QVAIIDDWLAQLNVTGAY-KPKQQVG--RSSVQP 73 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~g--rs~~v~ 73 (705) |.+++.---.-+.... .+ .++ .+...|++.++ .|.. ...+.++..+||.|.-.. .....++ .-+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~i~~~i~----~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~ 74 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFP--KGEKLTSNELLGFIA----YNETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVV 74 (470) T ss_pred CccccCCcccccCCceEEeC--CCCCcCHHHHHHHHH----HHHHhhHHHHHHHHHHhccccccccCcccccCCcceeec Confidence 8888876543333322 22 111 12233333333 2322 224456677899986321 1111223 345777 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 74 KLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 74 ~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) +.....|+.....| ||.+.-|.. .+|....+ .+..++ ..|+-...+...+++++..|.+.+.+|++ T Consensus 75 n~~~~Ivd~~~~~l----~g~p~~~~~----~~d~~~~~----~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d- 140 (470) T protein:vir:99 75 NSAKYVVDVYNGYF----CGIEPKLAL----LNDSSKID----EIARWN-RQENFFDTINEISKQCDIFGRSIASIYQG- 140 (470) T ss_pred chHHHHHHHHhhhh----ccCCeeEee----CCchhHHH----HHHHHH-HhcCHhHHHHHHHHHHHhcCeeEEEEEeC- Confidence 77777777766655 555533332 23332222 233332 34555566788999999999997776642 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) ..+.| T Consensus 141 ---------------------------------------------------------------------------~dg~~ 145 (470) T protein:vir:99 141 ---------------------------------------------------------------------------EDARP 145 (470) T ss_pred ---------------------------------------------------------------------------CCCeE Confidence 02346 Q ss_pred eEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeE Q lcl|NC_021540. 234 EVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKI 311 (705) Q Consensus 234 ~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 311 (705) ++..++|.++++ |+.... . ...+.+.+.... . .... T Consensus 146 ~i~~~~p~~~~~i~d~~~~~---~-~~~~vr~~~~~~------------------------------~--------~~~~ 183 (470) T protein:vir:99 146 HLMYSSPNHAFIIYDDTVQR---Q-PLAFVHYQIDNS------------------------------N--------NWTD 183 (470) T ss_pred EEEEEccceeEEEEcCCCCc---c-eEEEEEEEEEec------------------------------C--------CeeE Confidence 778889998754 332211 1 111222221100 0 0001 Q ss_pred EEEEEEEEeeecCCCeeEEEEEEEEC----CEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHH Q lcl|NC_021540. 312 VVYEYWGYWDIDGSGVTTPIVASWVD----DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALT 387 (705) Q Consensus 312 ~v~E~w~k~~~~~dg~~~~~~~~~~g----~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~ 387 (705) ..+++|.. +. .+.....+ ..+....++| .+.+|++++.. ..+|.|.+..++++++.+|..+ T Consensus 184 ~~~~~~~~-----~~---~~~~~~~~~~~~~~~~~~~~~~--~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~ 248 (470) T protein:vir:99 184 AYGVIQYA-----DK---FYKFKGYDIEEDTNAAGYAINP--YGLVPAVEFFE-----NEERQGIFDSIKTLINALDKVI 248 (470) T ss_pred EEEEEEec-----Ce---EEEEEecccccccccccccccC--CCccceEeecC-----CCCCCcchHhHHHHHHHHHHHH Confidence 11222221 00 00000001 1122223333 36778777643 4578999999999999999999 Q ss_pred HHHHHHHHhcCCCcEEeeccccCchhh----hhhcCCcceeecCCc-ccccccccccCccchHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 388 RGMIDAMARSANGQRGMSKNLLDPVNE----RKFKMGEDYKYNPGT-NPVTDIIEHKYPELPASSYNMLQMFTLEADALS 462 (705) Q Consensus 388 ~~~~d~~~~~~~~~~~~~~~av~~~d~----~~~~pg~~i~~~~~~-~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~t 462 (705) +.+.+.+...++|.+++........+. ......+++.+.+.. ...+.+.++..+.....+...++.+...+-..| T Consensus 249 s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 328 (470) T protein:vir:99 249 SQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMA 328 (470) T ss_pred HHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHh Confidence 999999999999888776443332221 122334444443221 112234455444444556667899999999999 Q ss_pred CcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcc Q lcl|NC_021540. 463 GVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLV 542 (705) Q Consensus 463 Gv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~ 542 (705) |+++.+.+..++.+|+.| +...............+.|..+++++++.++.++....... + + . T Consensus 329 ~~p~~~~~~~~~n~Sg~A--i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~---------~---~----~ 390 (470) T protein:vir:99 329 MVPNIQDKNFAGNSSGVA--LQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQ---------E---L----W 390 (470) T ss_pred CCccccccccccCchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc---------c---c----c Confidence 999887765444344444 44433444555566666666677776666665543322110 0 0 0 Q ss_pred cceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHH Q lcl|NC_021540. 543 GSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEA 622 (705) Q Consensus 543 ~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~ 622 (705) ..+.+..+...+.......+.+..+. + .++. .-++. .+.+. ++. .+.++.+++. T Consensus 391 ~~i~v~f~~~~p~~~~e~a~~~~kl~---g-iis~---et~l~---~l~~v--------------d~~--~E~eri~~E~ 444 (470) T protein:vir:99 391 SELDFKFTRNLPEDMASAIDNAKNAE---G-IVSK---KTQLG---MIPDI--------------EPD--AEMKQIAKEK 444 (470) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh---c-cCCH---HHHHH---hCCCC--------------CHH--HHHHHHHHHH Confidence 02223333322221222222222211 1 1111 11111 11111 000 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAA 648 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a 648 (705) ........+..-...........+.. T Consensus 445 ~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 445 ADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHHHHHHHhhcCCCCcCCCCCCccCC Confidence 11000001100000000000000000 No 62 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.59 E-value=6e-13 Score=87.58 Aligned_cols=468 Identities=13% Similarity=0.092 Sum_probs=216.6 Q ss_pred Ccc--hhhhhhcccccccCCCCCCHHHHHHHHHHHHHh-hHHhhHHHHHHHHHHHHhccCCCCCC-CCCCC----CCcCC Q lcl|NC_021540. 1 MSD--INEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNA-KSTKDTQVAIIDDWLAQLNVTGAYKP-KQQVG----RSSVQ 72 (705) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g----rs~~v 72 (705) |+= =++..+-.. ...|--... |++-.++- ..-...++.+.+.|..||.|...... ...+| |-... T Consensus 1 m~~~~~~k~~~~~~----~~~~~~~~~---~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~s 73 (508) T protein:vir:15 1 MGLIQRIKDLFWKG----AAATGVTGS---LSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNT 73 (508) T ss_pred CChHHHHHHHHHHH----HHHhccccc---hHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceee Confidence 211 111111000 000000001 11111111 11223445667899999998753221 11122 22233 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeec Q lcl|NC_021540. 73 PKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWC 152 (705) Q Consensus 73 ~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~ 152 (705) .+.-+..++. +++| +|+-..-+.+. +|.. ...+|+.++. .|+-...++.++..|+..|.|++|+||+ T Consensus 74 ln~~~~i~~~-~A~l---v~~e~~~i~v~----~~~~----~~e~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d 140 (508) T protein:vir:15 74 INMAKTAARR-IASV---VFNEKAEIHVK----DNNE----ADKFLNDVLE-DNDFKNKFEEALEKGVALGGFAMRPYID 140 (508) T ss_pred cchHHHHHHH-HHhh---hhCCCceEEeC----CchH----HHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEEe Confidence 3444444432 2333 34443334432 2222 2335655543 3444566888999999999999999984 Q ss_pred chhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc Q lcl|NC_021540. 153 LEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ 232 (705) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 232 (705) .++ T Consensus 141 -----------------------------------------------------------------------------~~~ 143 (508) T protein:vir:15 141 -----------------------------------------------------------------------------GNH 143 (508) T ss_pred -----------------------------------------------------------------------------CCe Confidence 124 Q ss_pred ceEEEechhheee-CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeE Q lcl|NC_021540. 233 PEVTICDYHNVTI-DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKI 311 (705) Q Consensus 233 ~~i~~V~~~~~~~-Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 311 (705) ++|++|++..||+ ..+. .++..|-|+..... + +- ...+.+ T Consensus 144 ~~i~~v~ad~~~P~~~d~-~~~~~~af~~~~~~-~--~~-----------------------------------~~~~~y 184 (508) T protein:vir:15 144 IKIAWVRADQFYPLQSNT-NDISEAAIASRTQR-T--ES-----------------------------------NQTKYY 184 (508) T ss_pred eEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEe-e--cC-----------------------------------CCceEE Confidence 5788899988884 1121 23445444322211 0 00 000112 Q ss_pred EEEEEEEEeeecCCCeeEEEEEEEEC------CEEEecccCC----------C-CCCCcceEEeee----eeecCcccCC Q lcl|NC_021540. 312 VVYEYWGYWDIDGSGVTTPIVASWVD------DVMIRLEKNP----------Y-PDGKLPFVVVPY----LPVKDSVYGE 370 (705) Q Consensus 312 ~v~E~w~k~~~~~dg~~~~~~~~~~g------~~iL~~~~~p----------~-~~~~~Pfv~~~~----~~~~~~~~g~ 370 (705) +.+|+|...+ ++.|..+ ..+|-+ |..+.....| + ...+.||+.+.. ....++.+|. T Consensus 185 t~lE~h~~~~-~~~~~I~--n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~ 261 (508) T protein:vir:15 185 TLLEFHQWQD-NGSYQIT--NELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGL 261 (508) T ss_pred EEEEEEEEec-CcceEEE--EEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCC Confidence 3333332210 1111111 111110 1111111100 0 112234444322 2234788999 Q ss_pred chHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh--hhhcCCc-cee-ecCCcccccccccccCccchHH Q lcl|NC_021540. 371 ADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE--RKFKMGE-DYK-YNPGTNPVTDIIEHKYPELPAS 446 (705) Q Consensus 371 g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~--~~~~pg~-~i~-~~~~~~~~~~i~~~~~~~i~~~ 446 (705) |++..+++.++.+|..++++.+.+ ..+.+++.++++.+..+.. ..+.++. +++ ++.+......+...++.--... T Consensus 262 S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~ 340 (508) T protein:vir:15 262 GVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQ 340 (508) T ss_pred chHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccChHH Confidence 999999999999999999999999 5788899999988754322 2233332 222 2222222234555444433456 Q ss_pred HHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 447 SYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 447 ~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) +...++.+...+....|++....|..++. ..||+++....+..-.....+.+.+..+++++.+.++.+..-+.--.- T Consensus 341 ~~~~~~~~l~~~~~~~gls~~~f~~~~~~-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~-- 417 (508) T protein:vir:15 341 YKDAIDHFIKEFEVQIGLSTGTFSYSNDG-VKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDD-- 417 (508) T ss_pred HHHHHHHHHHHHHHHhCCCchhcccccCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 77788888889999999999988876543 357888887777777777888888989999999998887654321110 Q ss_pred eEecCceeeechhhcccc--eeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 527 RITDEEFVQINRDNLVGS--FDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 527 ri~~~~~v~i~~~~~~~~--~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) .....+.+..... ..+..+++...-..+.++..+.+.++ |. ++.. . -+++..++.. T Consensus 418 -----g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~a-Gi-~s~e---~---~i~~~~g~~d--------- 475 (508) T protein:vir:15 418 -----GKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAI-GA-LSKQ---T---FLQRNYGMTD--------- 475 (508) T ss_pred -----cccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhc-CC-CCHH---H---HHHhcCCCCh--------- Confidence 0000111111112 33333444333333444444444332 11 1110 0 0111112111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHH--HHHHHH--HHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMR--IAKLQA--EIQ 637 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e--~~k~qa--~~q 637 (705) .+++...++.+.+....... ...... .-+ T Consensus 476 ----eea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 476 ----EQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred ----HHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 00111111111100000000 000000 000 No 63 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.59 E-value=1.2e-13 Score=91.37 Aligned_cols=421 Identities=8% Similarity=-0.025 Sum_probs=191.9 Q ss_pred HHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC---CC-CCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeC Q lcl|NC_021540. 29 LLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY---KP-KQQVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAP 102 (705) Q Consensus 29 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p 102 (705) |..+ .+..+..+.++..+||.|.-.. .+ ...++++ +++.+..+..|+.....| ||.+.-+.+ T Consensus 1 ~~~~------~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~~~~~~-- 68 (440) T protein:vir:95 1 MLAA------FLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYV----IGNPVSIGV-- 68 (440) T ss_pred Chhh------HHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhhe----eccCceEee-- Confidence 2211 1222233344556899886321 11 1123443 578888888888765554 666644433 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHH Q lcl|NC_021540. 103 KTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAV 182 (705) Q Consensus 103 ~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (705) -..+|.+... .+..+ ...|+--.....+.++++..|.+.+.+|++ T Consensus 69 ~~~~~~~~~~----~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d------------------------------ 113 (440) T protein:vir:95 69 MEGGSADQLS----TIKDI-EWQNDINALNSDLAFDASVYGRAYEYHFRD------------------------------ 113 (440) T ss_pred CCCccHHHHH----HHHHH-HHhcCHhHHHHHHHHHHhhcCeEEEEEEec------------------------------ Confidence 2333333222 23222 234444455678999999999998887642 Q ss_pred HHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEE Q lcl|NC_021540. 183 QMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVI 260 (705) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~ 260 (705) ..+.|++..++|.++++ ||.... ...+. T Consensus 114 ----------------------------------------------~~~~~~i~~~~p~~~~~~~d~~~~~---~~~~~- 143 (440) T protein:vir:95 114 ----------------------------------------------KDKVDRVVLISPLEMFVIRDLTVEQ---NIIAA- 143 (440) T ss_pred ----------------------------------------------CCCceEEEEEcccceEEEEcCCCCC---ceEEE- Confidence 02346778889988775 333211 12222 Q ss_pred EEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC--C Q lcl|NC_021540. 261 YSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD--D 338 (705) Q Consensus 261 ~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g--~ 338 (705) .+++... +. ..+++|.. +++.++ ....+ + T Consensus 144 i~~~~~~----------~~--------------------------------~~~~vyt~-----~~~~~~--~~~~~~~~ 174 (440) T protein:vir:95 144 VHLPIYA----------DK--------------------------------VNMTVYTK-----DKVITY--KPYSNNSV 174 (440) T ss_pred EEEEEec----------Cc--------------------------------eEEEEEeC-----CeEEEE--EEecCCcc Confidence 2222100 00 01122211 111110 01110 1 Q ss_pred EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--c--Cchhh Q lcl|NC_021540. 339 VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL--L--DPVNE 414 (705) Q Consensus 339 ~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a--v--~~~d~ 414 (705) ........|.+.+.+|++.++- ..+|.|.++.++++++.+|..++.+.+.+...+.|.+++.-.. . +.... T Consensus 175 ~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~ 249 (440) T protein:vir:95 175 RLVVDDVKKHSYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDA 249 (440) T ss_pred ceeecceeeccCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccch Confidence 1111222233335677776543 4468999999999999999999999999999888877653211 1 22222 Q ss_pred hhhcCCcceeecCCc-----ccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHH Q lcl|NC_021540. 415 RKFKMGEDYKYNPGT-----NPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGA 489 (705) Q Consensus 415 ~~~~pg~~i~~~~~~-----~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~ 489 (705) ......+.+.+..+. .....+.++..+.-.......++.+...+...|++++.+.+.-++..|+.| +...... T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~ 327 (440) T protein:vir:95 250 AKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIA--LLYKMIG 327 (440) T ss_pred hhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHH Confidence 233323333222110 111123344333333556677899999999999999987775443344444 5544444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHH Q lcl|NC_021540. 490 SGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQ 569 (705) Q Consensus 490 ~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq 569 (705) .........+.|..+++++++.+..++..... .. ++. ....+..+...+.......+.+..+ T Consensus 328 l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~---------~~---~~~----~~v~i~f~~~~p~~~~~~ad~~~kl-- 389 (440) T protein:vir:95 328 LEQVRKDKETYFTKALRRRYELISNIHKAING---------PV---IEA----NKLTFTFHPNIPQDVWTEIKAYIEA-- 389 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC---------cc---ccc----ccceEEeCCCCCCCHHHHHHHHHHH-- Confidence 45555666666777777766666555432211 00 110 1222333333322222222222221 Q ss_pred HHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 570 TMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQ 644 (705) Q Consensus 570 ~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q 644 (705) ...++. .-++. .+.+. ++. .+.++.+++......+....-....-...+.+ T Consensus 390 --~g~iS~---et~~~---~l~~~--------------d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 390 --GGEISQ---ETLME---NASFT--------------DYK--TEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred --hccCcH---HHHHH---hCCCC--------------CcH--HHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 111111 11111 11111 111 11111111111111111100000000000000 No 64 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.59 E-value=6.2e-13 Score=87.49 Aligned_cols=449 Identities=9% Similarity=0.050 Sum_probs=203.4 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC---C---CCCCCCC--CcCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY---K---PKQQVGR--SSVQ 72 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~gr--s~~v 72 (705) -.--+.+++..+-...++ ...|..+..+. ....+.+.+++.+||.|.-.. . .....++ .+++ T Consensus 14 ~~~~~~~~~~~~~~~~~~----~~~i~~~i~~~------~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~ 83 (481) T protein:vir:10 14 SPLANDDFVVSDLAELLK----EENLRNFISRH------QTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAV 83 (481) T ss_pred ccccCceeeeecchhhcC----HHHHHHHHHHH------HHHHHHHHHHHHHHhcCCCcccccCccccccccccccceee Confidence 111223333333222222 23333333321 234456677888999886321 1 1112233 3567 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeec Q lcl|NC_021540. 73 PKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWC 152 (705) Q Consensus 73 ~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~ 152 (705) .+.....|+.....| ||.+. .|.+ +|.... ++++-+|. .|+--..+..++++++..|.+.+.+|++ T Consensus 84 ~n~~~~ivd~~~~~l----~g~~~--~~~~---~d~~~~----~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~~~~d 149 (481) T protein:vir:10 84 HNYAKYVSRFIVGYL----TGNPI--TITH---QDNQTN----DKIIELND-LNDADEVNSDLALNLSIYGRAYEIVYRD 149 (481) T ss_pred cchHHHHHHHHHhhh----ccCCc--eEec---CChhHH----HHHHHHHH-hcChhHHHHHHHHHHHhcCeEEEEEEeC Confidence 777788777766544 44333 3333 233322 24443332 2444445778999999999998876541 Q ss_pred chhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc Q lcl|NC_021540. 153 LEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ 232 (705) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 232 (705) ..+. T Consensus 150 ----------------------------------------------------------------------------~dg~ 153 (481) T protein:vir:10 150 ----------------------------------------------------------------------------FEDR 153 (481) T ss_pred ----------------------------------------------------------------------------CCCe Confidence 0234 Q ss_pred ceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCe Q lcl|NC_021540. 233 PEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKK 310 (705) Q Consensus 233 ~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (705) |++..++|.++++ |+... .....+.+.+.... ..... T Consensus 154 ~~i~~~~p~~~~~v~d~~~~----~~~~~~i~~~~~~~-------------------------------------~~~~~ 192 (481) T protein:vir:10 154 DTFKVLDPKSTFVVYDQTLD----KKVVAGVRYFEKQD-------------------------------------KDKVP 192 (481) T ss_pred EEEEEEcccceEEEEcCCCC----CceEEEEEEEEEee-------------------------------------CCCce Confidence 6778889988864 33321 11122222221100 00122 Q ss_pred EEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHH Q lcl|NC_021540. 311 IVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGM 390 (705) Q Consensus 311 v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~ 390 (705) +..+|+|.. +. .++....++..-..++.|.+.+.+|+++++- ..+|.|.+..++++++.+|..++.+ T Consensus 193 ~~~~~~y~~-----~~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~lida~~~~~s~~ 259 (481) T protein:vir:10 193 VQHVEVYTT-----DK---IYYIEIKGGTYHRVEEVEHYYNDVPIIEYLN-----DQFKQGDFENVIALIDLYDSAQSDT 259 (481) T ss_pred EEEEEEEec-----Ce---EEEEEecCCceeecccccccCCceeEEEeec-----CCCCCCchhhHHHHHHHHHHHHHHH Confidence 444566643 11 1222222333222233343446778776542 4568999999999999999999999 Q ss_pred HHHHHhcCCCcEEeecccc-CchhhhhhcCCcceeecCCcc-----cccccccccCccchHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021540. 391 IDAMARSANGQRGMSKNLL-DPVNERKFKMGEDYKYNPGTN-----PVTDIIEHKYPELPASSYNMLQMFTLEADALSGV 464 (705) Q Consensus 391 ~d~~~~~~~~~~~~~~~av-~~~d~~~~~pg~~i~~~~~~~-----~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv 464 (705) .+.+...+.|.+++..... +..+...++.++.+.+..+.. ....+.+...+.-...+...++.+...+...|++ T Consensus 260 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 339 (481) T protein:vir:10 260 ANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNT 339 (481) T ss_pred HHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 9999988888887653322 223333444444443322111 1122333333332345666788899999999999 Q ss_pred chHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccc Q lcl|NC_021540. 465 KSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGS 544 (705) Q Consensus 465 ~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~ 544 (705) ++.+.|..++..|+.| +...............+.|..+++++++.++.++....... .++ .. T Consensus 340 p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-------~~~---------~~ 401 (481) T protein:vir:10 340 PDLNDEQFSGVQSGES--MKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQ-------HNY---------AE 401 (481) T ss_pred ccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc-------ccc---------ce Confidence 9988875443344444 33333333444455555666666666665555443211100 000 01 Q ss_pred eeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHH Q lcl|NC_021540. 545 FDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQE 624 (705) Q Consensus 545 ~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~ 624 (705) +.+..+...+.......+.+..+. + .++. .-.+. .+.......+.++ ..+.++.+..+ T Consensus 402 i~v~f~~~~~~~~~~~a~~~~kl~---g-~is~---et~~~---~l~~i~d~~~E~~------------ri~~E~~~~~~ 459 (481) T protein:vir:10 402 LTITFTPNLPKSMMESINAFNALS---G-GVSE---STRLS---LLDFIDNPKEELE------------KMQEEEAQREK 459 (481) T ss_pred eeEEeCCCCCcCHHHHHHHHHHHh---c-cCCh---HHHHH---hCCCCCCHHHHHH------------HHHHHHHHHHh Confidence 223333332222222222222221 1 1111 01111 1111111110000 00000000000 Q ss_pred HHHH--HHHHH-HHHHHHHHHH Q lcl|NC_021540. 625 LQMR--IAKLQ-AEIQLMPYEA 643 (705) Q Consensus 625 ~q~e--~~k~q-a~~q~~~~~~ 643 (705) .... ...+- .......-+- T Consensus 460 ~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 460 QADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred hhhhccCCccCCCCCCCCCCCC Confidence 0000 00000 0000000000 No 65 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.58 E-value=6.7e-13 Score=87.33 Aligned_cols=457 Identities=11% Similarity=0.088 Sum_probs=208.0 Q ss_pred Ccc----hhhhhhcccccccCCCCCCHHHHHHHHHHHHHhh-HHhhHHHHHHHHHHHHhccCCCC--CC-----CCCCCC Q lcl|NC_021540. 1 MSD----INEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAK-STKDTQVAIIDDWLAQLNVTGAY--KP-----KQQVGR 68 (705) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~-----~~~~gr 68 (705) |=+ -+.+.+.. +- ....|+.-+++.. ..+++++.++++|..||.|.... .+ .....+ T Consensus 1 m~~~~~~~~~~~~~~--------~~---~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 69 (499) T protein:vir:80 1 MINQIIAGVKGVMRR--------MG---LLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNR 69 (499) T ss_pred ChhHHHHHHHHHHHH--------hc---cccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcccc Confidence 111 11111110 00 0111222222211 23556667788999999886321 11 011123 Q ss_pred CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEE Q lcl|NC_021540. 69 SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFR 148 (705) Q Consensus 69 s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k 148 (705) .+++.+.-...++.. .+.+|+-+.-+.+ +|.+ .+++++.++. .|+-.+.+...+..|+..|.+.+| T Consensus 70 ~~~s~n~~~~iv~~~----a~~l~~ep~~i~~-----~d~~----~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~ 135 (499) T protein:vir:80 70 RQLSMNLPKVTAKYM----SKLLFNEKVKINI-----DDET----AEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIK 135 (499) T ss_pred ceeecchHHHHHHHH----HHhhhCCcceEee-----CCHH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCcEEEE Confidence 344455545444443 2334555444544 3433 4446665543 344556688999999999999999 Q ss_pred EeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeee Q lcl|NC_021540. 149 TSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKT 228 (705) Q Consensus 149 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 228 (705) +||+. T Consensus 136 ~~~D~--------------------------------------------------------------------------- 140 (499) T protein:vir:80 136 VYHDG--------------------------------------------------------------------------- 140 (499) T ss_pred EEECC--------------------------------------------------------------------------- Confidence 99841 Q ss_pred ccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccccc Q lcl|NC_021540. 229 VKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKAR 308 (705) Q Consensus 229 ~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (705) .++|+|+.|++..||+=.....++..|-|+-.. ++++ T Consensus 141 -~~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~---~~~~--------------------------------------- 177 (499) T protein:vir:80 141 -NKNVKVSFATADCMYPLSNDSENVDECLIANSF---HKNN--------------------------------------- 177 (499) T ss_pred -CCcEEEEEEcCCceEEEEecCCCeEEEEEEEEE---eecC--------------------------------------- Confidence 135678899999988522111345555443211 1100 Q ss_pred CeEEEEEEEEEeeecCCCeeEEEEEEEE-------CCEE----Eec---ccCCC-CCCCcceEEeee----eeecCcccC Q lcl|NC_021540. 309 KKIVVYEYWGYWDIDGSGVTTPIVASWV-------DDVM----IRL---EKNPY-PDGKLPFVVVPY----LPVKDSVYG 369 (705) Q Consensus 309 ~~v~v~E~w~k~~~~~dg~~~~~~~~~~-------g~~i----L~~---~~~p~-~~~~~Pfv~~~~----~~~~~~~~g 369 (705) +.+..+|+|...+.. .+.-......|. |..+ +.. ...++ ..++.||+.+.. ....++.+| T Consensus 178 ~~y~~lE~h~~~~~~-~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG 256 (499) T protein:vir:80 178 KYYKLLEWNEWKGEK-EEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLG 256 (499) T ss_pred eEEEEEEEEEecccc-eeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccC Confidence 011112222110000 000000000010 1000 000 00011 123455655433 124577889 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh------hhhcC-CcceeecCCcc--cccccccccC Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE------RKFKM-GEDYKYNPGTN--PVTDIIEHKY 440 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~------~~~~p-g~~i~~~~~~~--~~~~i~~~~~ 440 (705) .|++..++++.+.+|..++.+.+.+.. +..++.++.+++..... ..+.+ ..++....+.. ....+...++ T Consensus 257 ~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 335 (499) T protein:vir:80 257 ISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISV 335 (499) T ss_pred CchHhhHHHHHHHHHHHHHHHHHHHHh-cccceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecC Confidence 999999999999999999999999876 67778887776633211 11111 12222222111 1123444443 Q ss_pred ccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021540. 441 PELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWL 520 (705) Q Consensus 441 ~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~ 520 (705) .-....+...++.+...+....|++....|..++. ..||+++....+..-.....+.+.|..++.++.+.++.+...+. T Consensus 336 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~ 414 (499) T protein:vir:80 336 EIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIK 414 (499) T ss_pred cCChHHHHHHHHHHHHHHHHhcCCChhhcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33334566778888888888999999888865433 35787787655555556677778888888888888887655432 Q ss_pred CCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhh Q lcl|NC_021540. 521 SDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMI 600 (705) Q Consensus 521 ~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~ 600 (705) -.. +..+ . .....+..+.+...-.....+..+.+.+.. . ++.. .. ++...+... T Consensus 415 ~~~------~~~~---~----~~~v~v~f~d~i~~d~~~~~~~~~~~~~~G-i-~S~e---t~---l~~~~~~~d----- 468 (499) T protein:vir:80 415 AYD------GDTV---E----LDTITVDFDDSIAQDEDTTINRYTTAKNQG-M-IPLK---IA---LQRAWNITE----- 468 (499) T ss_pred ccc------CCCC---C----ccceEEEeCCCCCCCHHHHHHHHHHHHHcC-C-CCHH---HH---HhhcCCCCh----- Confidence 110 0000 0 012223333332222223333333333221 1 1100 00 111111111 Q ss_pred hcccccchhhHHHHHHHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_021540. 601 SKYNPEPSPQAQLEIQIKQLEAQELQM--RIAKLQAEIQ 637 (705) Q Consensus 601 ~~~~~q~~~~~q~~~q~~q~~~q~~q~--e~~k~qa~~q 637 (705) .++....++.+.+....-. ...-+..+.+ T Consensus 469 --------~ea~~el~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 469 --------AEADEWAEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred --------HHHHHHHHHHHHHhhcCCCCCCccccCCCCC Confidence 0000000000000000000 0000000000 No 66 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.58 E-value=1.2e-13 Score=91.45 Aligned_cols=452 Identities=10% Similarity=0.043 Sum_probs=208.6 Q ss_pred CcchhhhhhcccccccCC--CCCCHH----H-HHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CC-------- Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQE--DWKNKP----K-VSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PK-------- 63 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~----~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-------- 63 (705) |+|-.-.+.----|..-| +..|.- . -..+...+......|...+.+..+..+||.|.-... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~ 80 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV 80 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Confidence 777655554333333211 111110 0 112233344555556666777778899999863111 10 Q ss_pred -CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhc Q lcl|NC_021540. 64 -QQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNE 142 (705) Q Consensus 64 -~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~ 142 (705) ..+-..+++.+..+..|+.....| ||.+.- |. .+|.+.. +.++..+. |+-...+....++++.+ T Consensus 81 ~~~~~~~ki~~n~~k~Ivd~~~~~l----~G~p~~--~~---~~d~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~ 145 (483) T protein:vir:12 81 DPLKPDDRMITNFHANLVDQKVSYI----VGKPIA--FK---HTDDEVV----KRIDEVLG--NRFDDKLHSVLTGASNK 145 (483) T ss_pred cccccccccccchHHHHHHHHhhhh----cccCce--ec---cCChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhC Confidence 111234688888888888877665 554433 32 2343322 34444433 34445566788999999 Q ss_pred CCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccc Q lcl|NC_021540. 143 GTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEE 222 (705) Q Consensus 143 g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 222 (705) |.|.+.+||+ T Consensus 146 G~~y~~v~~d---------------------------------------------------------------------- 155 (483) T protein:vir:12 146 GIEWLHPYLD---------------------------------------------------------------------- 155 (483) T ss_pred CeEEEEEEEc---------------------------------------------------------------------- Confidence 9998877652 Q ss_pred cceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccc Q lcl|NC_021540. 223 QEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTS 300 (705) Q Consensus 223 ~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~ 300 (705) ..+.|++..++|.++++ |++.... -.+ +.+.+.... . T Consensus 156 ------~d~~~~i~~~~p~~~~~v~d~~~~~~---~~~-~ir~~~~~~----------~--------------------- 194 (483) T protein:vir:12 156 ------EEGEFKLFRVPAEQGIPIWTDKEHEE---LEA-FIRMYKLEN----------E--------------------- 194 (483) T ss_pred ------CCCceEEEEEcccceEEEEcCCCCCc---eEE-EEEEEEeec----------c--------------------- Confidence 02346788899999754 4433222 122 233331100 0 Q ss_pred ccccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHH Q lcl|NC_021540. 301 FTFSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAEL 375 (705) Q Consensus 301 ~~~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~ 375 (705) ..+ |+|.. +..++.+.... .-...+...+...++ +.+.+|+++++. +.+|.|.+.. T Consensus 195 --------~~~---~~y~~~~v~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~g~vPvv~~~n-----n~~g~sd~e~ 255 (483) T protein:vir:12 195 --------TKV---EYWDKVTVNYYVYENGSLIPD-YSNNLENSKTHFSTG--SWGKIPFIPFKN-----NDLEISDIFM 255 (483) T ss_pred --------eEE---EEEecCeEEEEEEeCCeeeec-ccccccccccccccC--CCCccceEEecC-----CCCCCCchhh Confidence 001 11111 00111110000 000001112222333 335677776653 4578999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh--hhhcCCcceeecCCcccccccccccCccchHHHHHHHHH Q lcl|NC_021540. 376 LSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQM 453 (705) Q Consensus 376 ~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~ 453 (705) ++++++.+|...|.+.+.+...+.|.+++.....+.... ...+.++++.+..++ .+.++..+.-.......++. T Consensus 256 v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~~~~~~~~~~~~ 331 (483) T protein:vir:12 256 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNG----GVDTIQVEVPVENSKKYLDE 331 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCCC----cceEEeecCCHHHHHHHHHH Confidence 999999999999999999999888877654322222111 122344555554443 23444433334566677889 Q ss_pred HHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCce Q lcl|NC_021540. 454 FTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEF 533 (705) Q Consensus 454 ~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~ 533 (705) +.+.+...+++++.+.+..++.+|+.| +...............+.|..+++++++.++.++ ... .++ T Consensus 332 l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~----~~~-------~~~ 398 (483) T protein:vir:12 332 LYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF----DIK-------GEH 398 (483) T ss_pred HHHHHHHHhCCCCCCccccccCcHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC-------Ccc Confidence 999999999999887765444444444 4444444445555556666666666666555443 211 011 Q ss_pred eeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHH Q lcl|NC_021540. 534 VQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQL 613 (705) Q Consensus 534 v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~ 613 (705) . +..+..+...+.......+.+..+. | .++. .-.+ ..++...+ +. . T Consensus 399 ~---------~i~v~f~~~~p~~~~~~a~~~~kl~---G-iiS~---et~~---~~~~~v~d-------------~~--~ 444 (483) T protein:vir:12 399 K---------DVDISFNYNKVANTELQVQTAQQSM---G-IVSH---ETVL---ENHPFVED-------------LQ--A 444 (483) T ss_pred c---------eeeEEeCCCCCCCHHHHHHHHHHHh---c-cCch---HHHH---HhCCCCCC-------------HH--H Confidence 1 1223333332221112222222111 1 1110 0000 01111111 11 1 Q ss_pred HHHHHHHHHHHHHHHH---HHHHHHHH-HHHHHHHHHHH Q lcl|NC_021540. 614 EIQIKQLEAQELQMRI---AKLQAEIQ-LMPYEAQAEAA 648 (705) Q Consensus 614 ~~q~~q~~~q~~q~e~---~k~qa~~q-~~~~~~q~e~a 648 (705) +.++.+.+..+..... ........ .....-+.++. T Consensus 445 E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 445 ELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 1111111100000000 00000000 00000001110 No 67 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.58 E-value=1.5e-13 Score=90.88 Aligned_cols=435 Identities=10% Similarity=0.029 Sum_probs=202.6 Q ss_pred CCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC-------------CC----CCCCC--CcCCCHHHHHHH Q lcl|NC_021540. 20 WKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK-------------PK----QQVGR--SSVQPKLIRKQA 80 (705) Q Consensus 20 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~----~~~gr--s~~v~~~v~~~~ 80 (705) |. +..+.+.+......|...+.+..+..+||.|.-... +. ...++ .+++.+..+..| T Consensus 1 ~~----~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 76 (471) T protein:vir:10 1 ME----IEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLL 76 (471) T ss_pred CC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHH Confidence 22 233344455555666666677778889999852110 00 00111 246777777777 Q ss_pred HHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhh Q lcl|NC_021540. 81 EWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTE 160 (705) Q Consensus 81 e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~ 160 (705) +.....| ||.+.-+ .+ +|.+ ..++++..+. |+-...+....++++..|.+.+.+||+. T Consensus 77 d~~~~yl----~G~p~~~--~~---~~~~----~~~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~------- 134 (471) T protein:vir:10 77 DQKKAYA----LTYPPTF--DV---DDKK----VNDMIVDVLG--DDYERISKQLCVNAGNAGIAWLHVWKDA------- 134 (471) T ss_pred Hhhhhhh----cccCcee--cc---CChH----HHHHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEEEeeC------- Confidence 7766555 5655433 22 3332 2234554443 3333445678889999999988776620 Q ss_pred cccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEech Q lcl|NC_021540. 161 NVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDY 240 (705) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~ 240 (705) ..+++++..++| T Consensus 135 --------------------------------------------------------------------~~g~~~~~~~~p 146 (471) T protein:vir:10 135 --------------------------------------------------------------------SDNSFRYACVDS 146 (471) T ss_pred --------------------------------------------------------------------CCCeeEEEEEcc Confidence 023567888999 Q ss_pred hheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEE Q lcl|NC_021540. 241 HNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWG 318 (705) Q Consensus 241 ~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~ 318 (705) .++++ |+.. .+-...+.+.+.+.... ....+..+|+|. T Consensus 147 ~~~~~i~d~~~----~~~~~~~ir~~~~~~~~------------------------------------~~~~~~~~~vy~ 186 (471) T protein:vir:10 147 KEVIPIYSKSL----DKKSIGVLRVYSSIDET------------------------------------DGKNYTVYEYWN 186 (471) T ss_pred cceEEEEcCCC----CCceEEEEEEEEeeccC------------------------------------CCceeEEEEEEe Confidence 98753 3322 11122233333221110 001122333332 Q ss_pred Ee-----eecCCCeeEE-------EEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHH Q lcl|NC_021540. 319 YW-----DIDGSGVTTP-------IVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGAL 386 (705) Q Consensus 319 k~-----~~~~dg~~~~-------~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~ 386 (705) .. -..+.+.... .......+........|.+.|.+|++.+.. ...|.|.+..++++++.+|.+ T Consensus 187 ~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~ 261 (471) T protein:vir:10 187 DKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKV 261 (471) T ss_pred CCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHHH Confidence 10 0000100000 000011122333333343446677766543 456889999999999999999 Q ss_pred HHHHHHHHHhcCCCcEEeeccccC--chhhhhhcCCcceeecCCc-ccccccccccCccchHHHHHHHHHHHHHHHHHhC Q lcl|NC_021540. 387 TRGMIDAMARSANGQRGMSKNLLD--PVNERKFKMGEDYKYNPGT-NPVTDIIEHKYPELPASSYNMLQMFTLEADALSG 463 (705) Q Consensus 387 ~~~~~d~~~~~~~~~~~~~~~av~--~~d~~~~~pg~~i~~~~~~-~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tG 463 (705) .|.+.+.+...++|.+++...... .........++.+.+.... .....+.++..+.-.......++.+.+.+-..++ T Consensus 262 ~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 341 (471) T protein:vir:10 262 FSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQ 341 (471) T ss_pred HHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhC Confidence 999999999998887665432111 1112233445555554322 1222344555444445667788999999999999 Q ss_pred cchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhccc Q lcl|NC_021540. 464 VKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVG 543 (705) Q Consensus 464 v~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~ 543 (705) .++.+.+..++ +|+.| +..+............+.|..+++++++.++.++..+ ++. T Consensus 342 tp~~~~~~~gn-~Sg~A--lk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~------------d~~--------- 397 (471) T protein:vir:10 342 GVNPETDKLGN-SSGVA--LKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS------------DKL--------- 397 (471) T ss_pred CcCCCcccccC-ccHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------CCc--------- Confidence 98877665544 34544 5544444555555556666666666665555543211 111 Q ss_pred ceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHH Q lcl|NC_021540. 544 SFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQ 623 (705) Q Consensus 544 ~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q 623 (705) .+.+..+...+.......+.+..+ .+ .++. .-++ ..++... ++ +.+.++.+.+.. T Consensus 398 ~i~i~f~~~~p~n~~e~~~~~~kl---~g-~iS~---et~~---~~~p~v~-------------D~--~~E~eri~~E~~ 452 (471) T protein:vir:10 398 KIKQTWTRNSINNDTEMAQVVSTL---AT-ITSR---ENVA---KSNPIVE-------------DW--QDELRLQKAEQE 452 (471) T ss_pred eeEEEeCCCCCCCHHHHHHHHHHH---hc-cCch---HHHH---HhCCCCC-------------CH--HHHHHHHHHHHH Confidence 122333322221111111211111 11 1110 0011 0111111 11 111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 624 ELQMRIAKLQAEIQLMPYEAQAE 646 (705) Q Consensus 624 ~~q~e~~k~qa~~q~~~~~~q~e 646 (705) +. ++..........+.+.+ T Consensus 453 ~~----~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 453 GR----SEKLYDMEEVEHESEVE 471 (471) T ss_pred HH----HhcccccCCCCCccccC Confidence 00 00000000000000111 No 68 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.58 E-value=2.8e-14 Score=94.90 Aligned_cols=453 Identities=10% Similarity=0.036 Sum_probs=202.0 Q ss_pred Ccchhh--------hhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC--CC------ Q lcl|NC_021540. 1 MSDINE--------EFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP--KQ------ 64 (705) Q Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~------ 64 (705) |++|+- +.++-=+|...+ + ...+......|...+.+.++..+||+|...... .+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~---~-------~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~ 70 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYET---Q-------EEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGD 70 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCC---c-------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccc Confidence 888743 111111111111 1 112333444556666777788899998642211 11 Q ss_pred -CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 65 -QVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 65 -~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) .+++ .+++.+..+..|+.....| ||.+.-+. .+|.+..+ .+..++. |+-...+...+++++. T Consensus 71 ~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~~~~~-----~~~d~~~~----~l~~~~~--n~~~~~~~~~~~~~~~ 135 (478) T protein:vir:10 71 YDETKPDWRMYTNYHQNLVDQKVAYA----VANPVTFG-----VDNDKALK----QIQHTLN--HKWDDKLVDILTAASN 135 (478) T ss_pred cccccccceeccchHHHHHHHHHhhh----ccCCeeee-----cCChHHHH----HHHHHHh--cCHHHHHHHHHHHHHh Confidence 1223 3578888888888776655 55544442 23333332 3333332 4444556778899999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.|++.+|++ T Consensus 136 ~G~~~~~~~~d--------------------------------------------------------------------- 146 (478) T protein:vir:10 136 KGIEWVQPYVD--------------------------------------------------------------------- 146 (478) T ss_pred cCeEEEEEEec--------------------------------------------------------------------- Confidence 99998887652 Q ss_pred ccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.+++..++|.++++ |++... +-.+. .+.+.... ..+ ...+ T Consensus 147 -------~~g~~~~~~~~p~~~~~i~d~~~~~---~~~~~-v~~~~~~~----------~~~------------~~~y-- 191 (478) T protein:vir:10 147 -------EEGEFKTFRVPAEQAVPIWTNKERD---ELQAF-IRVYELDG----------AER------------VEYW-- 191 (478) T ss_pred -------CCCeeEEEEEcccceEEEEcCCCCC---ceEEE-EEEEEecC----------ceE------------EEEE-- Confidence 01346777888888774 443322 22222 23321100 000 0000 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEE-EEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPI-VASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSD 378 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~-~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d 378 (705) ...++..|++- .+....... ...-.... ......|.+.+.+|+++++. +.+|.|.+..+++ T Consensus 192 -------~~~~i~~~~~~-----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~ 253 (478) T protein:vir:10 192 -------TKDDVTYYELK-----EGQLIPDFYRSDDHIQPH-YYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKT 253 (478) T ss_pred -------eCCeEEEEEEc-----CCeeeccccccccccccc-eecccccccCCccceEEecc-----CCCCCCcHHHHHH Confidence 00111111110 000000000 00000111 11223344457788877643 4578999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc--hhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 379 NQKLIGALTRGMIDAMARSANGQRGMSKNLL-DP--VNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 379 ~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av-~~--~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) +++.+|...+.+.+.+...+.|.+++. |.- +. ........++++.+.+... +.+.+...+.-.......++.+. T Consensus 254 liDa~~~~~S~~~~~~~~~~~p~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~~l~ 330 (478) T protein:vir:10 254 IIDALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDFMHNLKYYKAISVAGESG--SGVDTIKVEVPIDSVKEYTKMLR 330 (478) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeee-cCCccccchhhhhhhhcceEEecCCCC--CcceEEeecCChHHHHHHHHHHH Confidence 999999999999999998888876643 332 21 1122334455665543211 22334433333355667789999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+...+++++.+.+..++.+|+.| +...............+.|..+++++++.++. ++.. . T Consensus 331 ~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~----~~g~---------~--- 392 (478) T protein:vir:10 331 DYIIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTLTALQELLQYIID----FYRL---------D--- 392 (478) T ss_pred HHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCC---------C--- Confidence 9999999999887765444444444 44444444555555566666666665555544 3321 0 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) ++.. ...+..+...+.......+.+..+ ...++. ..++..+ +..... . .+. T Consensus 393 ~~~~----~i~i~f~~~~p~d~~e~a~~~~kl----~g~iS~---et~~~~l---~~v~D~-------------~--~E~ 443 (478) T protein:vir:10 393 VKVQ----DIEITFNFNVMVNELENSQIAMNS----TGLLSK---ETILSNH---AWVEDP-------------V--AEM 443 (478) T ss_pred cccc----cceEEecCCCCCCHHHHHHHHHHH----hCCCCh---HHHHHhC---CCCCCH-------------H--HHH Confidence 0101 122333332221111111211111 111111 1111110 111111 1 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKAR 651 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~ 651 (705) ++.+.+................ ...+.+.+-.+.+ T Consensus 444 ~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 478 (478) T protein:vir:10 444 ERIEQENIELNQQLPDIEEGLN-GEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHHHHhhccccccccC-CCCCCCCCCCCCC Confidence 1111110000000000000000 0000000000000 No 69 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.57 E-value=2.4e-13 Score=89.72 Aligned_cols=457 Identities=10% Similarity=0.032 Sum_probs=207.5 Q ss_pred Ccc---hhh-----------------hhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhH-HHHHHHHHHHHhccCC- Q lcl|NC_021540. 1 MSD---INE-----------------EFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDT-QVAIIDDWLAQLNVTG- 58 (705) Q Consensus 1 ~~~---~~~-----------------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~- 58 (705) |.| |++ .+..... ..++.+ ....|++-++ .|.. ...+.++..+||.|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~~~~l~~~i~----~~~~~~~~r~~~l~~yY~g~~~ 71 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNL-EELMVN----NWELLKNFIN----HHKLRQAPRIQELLDYARGENH 71 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccc-cccccc----cHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCCc Confidence 221 111 1111111 112211 1222333333 3432 2355678889999863 Q ss_pred CCC---CCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHH Q lcl|NC_021540. 59 AYK---PKQQVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLID 133 (705) Q Consensus 59 ~~~---~~~~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~ 133 (705) ... ....++++ +++.+..+..|+.....| ||.+.-+.... ...-+...++++.++ ..|+--..+. T Consensus 72 ~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p~~~~~~d-----~~~~~~~~~~l~~~~-~~n~~~~~~~ 141 (501) T protein:vir:27 72 DVLQFGRRKDREMADKRAVHNYGRMISKFKTGYL----AGNPIRVEYDD-----NDNNSQNDDTIKRIG-RINDIDSHNR 141 (501) T ss_pred cccccCccCccccccceeccchHHHHHHHHhhhh----cccCeeEecCC-----ccchHHHHHHHHHHH-HhcChhHHHH Confidence 221 11223443 677888888887776666 55554343322 222233445565543 4455556677 Q ss_pred HHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccc Q lcl|NC_021540. 134 TMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPI 213 (705) Q Consensus 134 ~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 213 (705) ..+++++..|.+.+.+|++ T Consensus 142 ~~~~~~~~~G~a~~~vy~d------------------------------------------------------------- 160 (501) T protein:vir:27 142 TLIRDLSQTGRAYEVIYRN------------------------------------------------------------- 160 (501) T ss_pred HHHHHHhhCCeEEEEEEeC------------------------------------------------------------- Confidence 8999999999998887752 Q ss_pred eeccCcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhc Q lcl|NC_021540. 214 LAIINGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTS 291 (705) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~ 291 (705) ..++|+|..++|.++++ |+.... +..+.+ +++..... T Consensus 161 ---------------ed~~~~i~~~~p~~~~~v~d~~~~~---~~~~~i-r~~~~~~~---------------------- 199 (501) T protein:vir:27 161 ---------------EYDETRIKRLNPLETFVIYDNSLED---NSIAAV-RYYNRGTL---------------------- 199 (501) T ss_pred ---------------CCCceEEEEEccceeEEEecCCCCC---ceEEEE-EEEEeeec---------------------- Confidence 01346788888988764 443221 222222 22211000 Q ss_pred cccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCc Q lcl|NC_021540. 292 SDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEA 371 (705) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g 371 (705) ...+..+|+|.. +.+ ++.. .++........|.+.|.+|++.++ ....|.| T Consensus 200 ----------------~~~~~~~~vyt~-----~~v---~~~~-~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~s 249 (501) T protein:vir:27 200 ----------------QNAKDVVEIYTN-----EHI---YTLD-ASDDFNEISVTTHAFGTVPITEFL-----NNVDGIG 249 (501) T ss_pred ----------------CCcEEEEEEEeC-----CeE---EEEE-eCCceeeccccccCCCcccEEEec-----CCCCCCC Confidence 011234555543 111 1111 122222223333344778887764 3457899 Q ss_pred hHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchh--hhhhcCCcceeecCCc-----ccccccccccCccch Q lcl|NC_021540. 372 DAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVN--ERKFKMGEDYKYNPGT-----NPVTDIIEHKYPELP 444 (705) Q Consensus 372 ~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d--~~~~~pg~~i~~~~~~-----~~~~~i~~~~~~~i~ 444 (705) .+..++++++.+|..++.+.+.+...+.|.+++........+ .......+.+.+..+. .....+.++..+.-. T Consensus 250 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 329 (501) T protein:vir:27 250 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV 329 (501) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCH Confidence 999999999999999999999999888877765432222221 1122233344443211 111123344333333 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ......++.+...+...|++++.+.|..++..|+.| +...............+.|..+++++++.++.++........ T Consensus 330 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 407 (501) T protein:vir:27 330 SGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEA--LKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKD 407 (501) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 456667899999999999999887775444345544 443334444455666677777777777776665432211100 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) -++. ...+..+...+.......+.+..+. + .++. ..++. .++....+ T Consensus 408 ------~d~~---------~i~v~f~~~~p~n~~e~ad~~~kl~---g-~iS~---et~l~---~l~~v~D~-------- 454 (501) T protein:vir:27 408 ------FDES---------LLKITFTPNLPKSLNEQVSILTGLG---G-QVSQ---ETALS---LSGLVESP-------- 454 (501) T ss_pred ------cccc---------cceEEeCCCCCcCHHHHHHHHHHHh---c-cCcH---HHHHH---hCCCCCCH-------- Confidence 0000 1122222222211111111111111 1 1111 00110 11111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYE--------AQAEAAKARKANTE 656 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~--------~q~e~a~a~~~~~e 656 (705) . .+.++.+.+..+... ...+..+...... ..-+...+- | T Consensus 455 -----~--~E~eri~~E~~e~~~--~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~----~ 501 (501) T protein:vir:27 455 -----N--EELDKINKEVSEIDF--KGYSNDFNEHVGKYTDEVKETHTDDFERAY----E 501 (501) T ss_pred -----H--HHHHHHHHHHHhhhH--hhhcCccccccccccCCCCCCccccccccC----C Confidence 1 111111111111000 0000000000000 000000000 0 No 70 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.56 E-value=4.2e-14 Score=93.93 Aligned_cols=452 Identities=10% Similarity=0.043 Sum_probs=201.2 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHH-------HHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCC--C------- Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDL-------LNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPK--Q------- 64 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~------- 64 (705) |++|+--.-..- ++..+..+ ...+......|...+.+.++..+||.|....... + T Consensus 1 ~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ 71 (478) T protein:vir:10 1 MISINWPWDKPY---------HEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDY 71 (478) T ss_pred CccccccCCchh---------hhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccccc Confidence 877753322111 01111111 1112333445555667777889999987421110 0 Q ss_pred CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhc Q lcl|NC_021540. 65 QVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNE 142 (705) Q Consensus 65 ~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~ 142 (705) .+++ .+++.+..+..|+.....| ||.+.-+ .+ +|.+.. +.++..+. |+-...+...+++++.. T Consensus 72 ~~~~~~~ki~~n~~k~ivd~~~~yl----~g~p~~~--~~---~~~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~ 136 (478) T protein:vir:10 72 DETKPDWRMYTNYHQNLVDQKVAYA----VANPVTF--GV---DNDKAL----KQIQHTLN--HKWDDKLVDILTAASNK 136 (478) T ss_pred ccccccceeccchHHHHHHHHhhhh----cccCcee--ec---CChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhC Confidence 1222 2477777777777776665 5555333 22 333322 34444442 44445566788999999 Q ss_pred CCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccc Q lcl|NC_021540. 143 GTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEE 222 (705) Q Consensus 143 g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 222 (705) |.|.+.+||+ T Consensus 137 G~~~~~v~~d---------------------------------------------------------------------- 146 (478) T protein:vir:10 137 GIEWVQPYVD---------------------------------------------------------------------- 146 (478) T ss_pred CeEEEEEEec---------------------------------------------------------------------- Confidence 9998887652 Q ss_pred cceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccc Q lcl|NC_021540. 223 QEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTS 300 (705) Q Consensus 223 ~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~ 300 (705) ..+.|++..++|.+++ ||+.... +-.+. .+.+-+.. ..++ .. T Consensus 147 ------~~~~~~~~~~~p~~~~~v~d~~~~~---~~~~~-ir~~~~~~----------~~~~----------~~------ 190 (478) T protein:vir:10 147 ------EEGEFKTFRVPAEQAVPIWTNKERD---ELQAF-IRVYELDG----------AERV----------EY------ 190 (478) T ss_pred ------CCCceEEEEEcccceEEEEcCCCCC---ceEEE-EEEEeeeC----------ceEE----------EE------ Confidence 0134677788998875 4443322 22222 23321100 0000 00 Q ss_pred ccccccccCeEEEEEEEEEeeecCCCeeE-EEE-EEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhH Q lcl|NC_021540. 301 FTFSDKARKKIVVYEYWGYWDIDGSGVTT-PIV-ASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSD 378 (705) Q Consensus 301 ~~~~~~~~~~v~v~E~w~k~~~~~dg~~~-~~~-~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d 378 (705) + ...+|..|.+. +.+... ... ....... ......|.+.|.+|++++.. ...|.|.+..+++ T Consensus 191 y-----~~~~i~~~~~~------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~ 253 (478) T protein:vir:10 191 W-----TKDDVTFYELK------EGQLIPDFYRSEDHIQPH-YYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKT 253 (478) T ss_pred E-----eCCcEEEEEec------CCeeeccccccccccccc-eecccccccCCcceEEEecc-----CCCCCCcHHHHHH Confidence 0 00112211111 000000 000 0000111 11233355557788887764 3468999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc-hh-hhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 379 NQKLIGALTRGMIDAMARSANGQRGMSKNL-LDP-VN-ERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 379 ~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~-~d-~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) +++.+|.+.|.+.+.+...+.|.+++. |+ .+. .+ .......+++.+.+... +.+.++..+.-...+...++.+. T Consensus 254 liDa~~~~~S~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~~l~ 330 (478) T protein:vir:10 254 IIDALDKRLSDTQNTFDESVELIYILK-GYEGEDMKDFMHNLKYYKAISVAGESG--SGVDTIKVEVPIDSVKEYTKMLR 330 (478) T ss_pred HHHHHHHHHHHHHHHHHHhhCcceeee-cCCcccccchhhhhhhCceeEecCCCC--CcceEEeecCCHHHHHHHHHHHH Confidence 999999999999999998888876653 32 121 11 12223345555543321 12344443333455666788999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) +.+...|++++.+.+..++..|+.| +..+............+.|..+++++++.++.+ ... . T Consensus 331 ~~I~~~s~~p~~~~~~~~~n~Sg~A--i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~----~~~---------~--- 392 (478) T protein:vir:10 331 DYIIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTLTALQELLQYIIDF----YRL---------D--- 392 (478) T ss_pred HHHHHHhCCcCcCccccccchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCC---------C--- Confidence 9999999998877665443334443 444444445555555666666666655555443 321 0 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) ++.. .+.+..+...+.......+.+..+ .+ .++. .-++ ..++.... + +.+. T Consensus 393 ~d~~----~i~i~f~~~~p~~~~e~~~~~~~~---~g-~iS~---et~i---~~~~~v~d-------------~--~~E~ 443 (478) T protein:vir:10 393 VRVQ----DIEITFNFNVMVNELENSQIAMNS---TG-LLSK---ETIL---GNHSWVQD-------------P--VAEM 443 (478) T ss_pred cccc----cceEEeCCCCCCCHHHHHHHHHHH---hC-CCCh---HHHH---HhCCCCCC-------------H--HHHH Confidence 0000 122333333221111111111111 11 1110 0011 01111111 1 1111 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQ-AEIQLMPYEAQAEAAK 649 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~q-a~~q~~~~~~q~e~a~ 649 (705) ++.+++..+...+..... ........+..-.+.+ T Consensus 444 ~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 444 ERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred HHHHHHHHHHHHhccccCCCCcccccccCcCCCCC Confidence 111111110000000000 0000000000000000 No 71 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.55 E-value=1.5e-12 Score=85.47 Aligned_cols=458 Identities=9% Similarity=0.005 Sum_probs=205.0 Q ss_pred Ccch----hhhhhccc-------------ccccCCCCCCHHHHHHHHHHHHHhhHHhhH-HHHHHHHHHHHhccCC-CC- Q lcl|NC_021540. 1 MSDI----NEEFLEDT-------------VPSLQEDWKNKPKVSDLLNDFNNAKSTKDT-QVAIIDDWLAQLNVTG-AY- 60 (705) Q Consensus 1 ~~~~----~~~~~~~~-------------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~- 60 (705) |... ++.+..+. ....++++... ....|++-++ .|.. ...+.++..+||+|.- .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~i~----~h~~~~~~rl~~l~~yY~g~~~~i~ 75 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVN-NWELLKNFIN----HHKLRQAPRIQELLDYARGENHDVL 75 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccc-cHHHHHHHHH----HHHHHHHHHHHHHHHHhcCCCcccc Confidence 2221 11111110 01111211110 1223333333 3332 2345677889999852 22 Q ss_pred --CCCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHH Q lcl|NC_021540. 61 --KPKQQVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMV 136 (705) Q Consensus 61 --~~~~~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~ 136 (705) ......+++ +++.+.....|+.....| ||.+.-+.+ ...+| -+...++++.++. .|+--..+...+ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p~~~~~--~d~~~---~~~~~~~l~~~~~-~N~~~~~~~~~~ 145 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISKFKTGYL----AGNPIRVEY--DDNED---NSQNDDAIKRIGR-INDIDTHNRNLI 145 (502) T ss_pred ccccccccccccceeecchHHHHHHHHhhhh----cccCeeEec--CCccc---hhHHHHHHHHHHh-hcCHhHHHHHHH Confidence 112223443 778888888888777665 444443333 22222 2344556666543 354445677899 Q ss_pred HHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceec Q lcl|NC_021540. 137 RTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAI 216 (705) Q Consensus 137 ~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 216 (705) ++++..|.+.+.+|++ T Consensus 146 ~~~~~~G~a~~~v~~d---------------------------------------------------------------- 161 (502) T protein:vir:48 146 RDLSQTGRAYEVIYRS---------------------------------------------------------------- 161 (502) T ss_pred HHHhhcCeEEEEEEeC---------------------------------------------------------------- Confidence 9999999998877642 Q ss_pred cCcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccc Q lcl|NC_021540. 217 INGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDH 294 (705) Q Consensus 217 ~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~ 294 (705) ..+.+++..++|.++++ |+.... +..+ +.+++.... . T Consensus 162 ------------edg~~~i~~~~p~~~~~vydd~~~~---~~~~-~ir~~~~~~-----------~-------------- 200 (502) T protein:vir:48 162 ------------EYDETRIKRLSPLETFVIYDNSLED---NSIA-AVRYYNRGT-----------L-------------- 200 (502) T ss_pred ------------CCCceEEEEEcccceEEEEcCCCCC---ceEE-EEEEEEEee-----------c-------------- Confidence 02346778888888764 433211 2222 222221100 0 Q ss_pred ccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 295 YSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 295 ~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) ...+.++|+|.. + +.++....++. ......|.+.|.+|++.++. ...|.|.+. T Consensus 201 -------------~~~~~~~~iyt~-----~---~i~~~~~~~~~-~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e 253 (502) T protein:vir:48 201 -------------QNAKDVVEIYTN-----Q---HIYTLDASDSF-NEISVTPHAFGTVPITEFLN-----NADGIGDYE 253 (502) T ss_pred -------------CCcEEEEEEEeC-----C---eEEEEEeCCce-eeccceecCCCccceEEecC-----CCCCCCchh Confidence 011234566643 1 11111112222 22223333446788877653 446899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchh--hhhhcCCcceeecCCc-----ccccccccccCccchHHH Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVN--ERKFKMGEDYKYNPGT-----NPVTDIIEHKYPELPASS 447 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d--~~~~~pg~~i~~~~~~-----~~~~~i~~~~~~~i~~~~ 447 (705) .++++++.+|..++.+.+.+...+.|.+++........+ .......+.+.+.++. .....+.++..+.-.... T Consensus 254 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~ 333 (502) T protein:vir:48 254 TELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGA 333 (502) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHH Confidence 999999999999999999999888887776543222211 1222223333332211 111223344433333556 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEe Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIR 527 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~ir 527 (705) ...+..+...+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ T Consensus 334 ~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~--- 408 (502) T protein:vir:48 334 EAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEA--LKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKD--- 408 (502) T ss_pred HHHHHHHHHHHHHHhCCCCcCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--- Confidence 667899999999999999887775443345554 444334445555666667777777777666665543211100 Q ss_pred EecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccc Q lcl|NC_021540. 528 ITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEP 607 (705) Q Consensus 528 i~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~ 607 (705) .++ ....+..+...+.......+.+..+. + .++. ..++.. +....... T Consensus 409 ---~d~---------~~i~i~f~~~~p~d~~e~a~~~~kl~---g-~iS~---et~l~~---l~~v~D~~---------- 456 (502) T protein:vir:48 409 ---FDE---------SRLKITFTPNLPKSLYEQVSILNDLG---G-QVSQ---ETALSL---SGLVENPT---------- 456 (502) T ss_pred ---ccc---------ccceEEeCCCCCcCHHHHHHHHHHHh---c-cCcH---HHHHHh---CCCCCCHH---------- Confidence 000 01122222222111111111111111 1 1111 000100 01111000 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHH--------HHHHH---HHHHHHHHHHH Q lcl|NC_021540. 608 SPQAQLEIQIKQLEAQELQMRIAK--------LQAEI---QLMPYEAQAEA 647 (705) Q Consensus 608 ~~~~q~~~q~~q~~~q~~q~e~~k--------~qa~~---q~~~~~~q~e~ 647 (705) .+.++.+.+..+.+..... ..... .....+...-. T Consensus 457 -----~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 457 -----EELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred -----HHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 0011110000000000000 00000 00000000000 No 72 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.54 E-value=2.7e-13 Score=89.51 Aligned_cols=475 Identities=11% Similarity=0.024 Sum_probs=210.0 Q ss_pred Ccchhhhhhccc----------cccc-CCCCCCHHH-HHHHHHHHHHhhHHhhHH-HHHHHHHHHHhccCCCC----C-- Q lcl|NC_021540. 1 MSDINEEFLEDT----------VPSL-QEDWKNKPK-VSDLLNDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY----K-- 61 (705) Q Consensus 1 ~~~~~~~~~~~~----------~~~~-~~~~~~~~~-~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~-- 61 (705) |-+|+|=-.-.+ +..+ -..|++..- ......++....+.|... ..+.++..+||.|.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC Confidence 555543211000 0111 123443321 111233344445555433 23456778999986321 1 Q ss_pred CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 62 PKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 62 ~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) +...+...+++.+.....|+.....| +|.+.-+ . .+|... .++++.++ ..|+--.......++++. T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~---~~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~~ 146 (511) T protein:vir:93 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--Q---DDDKDV----LEVIEAFN-DLNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhh----cccCeee--c---cCChHH----HHHHHHHH-hhcCHhHHHHHHHHHHHh Confidence 11112235678888888777776555 4543333 2 233332 23343332 334444556788999999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+|++ T Consensus 147 ~G~ay~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:93 147 YGKAYELMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99998877652 Q ss_pred ccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.|++..++|.+++ ||+.... -...+.+++.+.. .+. T Consensus 158 -------e~~~~~i~~~~p~~~~~vydd~~~~----~~~~~vr~~~~~~------~~~---------------------- 198 (511) T protein:vir:93 158 -------QDDETRLYKSDAMSTFVIYDNTIER----NSIAGVRYLRTKP------IDK---------------------- 198 (511) T ss_pred -------CCCceEEEEEccceeEEEEcCCCCC----ceEEEEEEEEeee------ccc---------------------- Confidence 0134678889999986 4444321 1233334432210 000 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEE-----EecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVM-----IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~i-----L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) .....+..+|+|.. +++.+ ....++.. ......|.+.+.+|++.++ ...+|.|.++ T Consensus 199 ------~~~~~~~~~~iyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~gd~e 259 (511) T protein:vir:93 199 ------TDEDEVFTVDLFTS-----HGVYR---YLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYE 259 (511) T ss_pred ------cccceEEEEEEEeC-----CcEEE---EEecCCCccccccccccccccCCCccceEEec-----CCCCCCCchh Confidence 00122334455543 22111 11111110 1112223333566776654 2346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-CchhhhhhcCCcceeecCC---------cccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLL-DPVNERKFKMGEDYKYNPG---------TNPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av-~~~d~~~~~pg~~i~~~~~---------~~~~~~i~~~~~~~i~ 444 (705) .++++++.+|..+|.+.+.+...+.|.+++..... +..+......+.++...++ ....+.+.++..+.-. T Consensus 260 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:93 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCH Confidence 99999999999999999999888887776543222 2222222233333332221 1112223344433334 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ..+...+..+...+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ T Consensus 340 ~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~ 417 (511) T protein:vir:93 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 566777889999999999999987765444444444 554445555566666777777777777777765543221110 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) - .++. ...+..+...+.......+.+..+ .+ .++. .-++. .++... T Consensus 418 ~-----~d~~---------~i~~~f~~~~p~n~~e~~~~~~kl---~g-~iS~---et~~~---~l~~v~---------- 463 (511) T protein:vir:93 418 N-----KDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQ---TTLMS---LFSFFQ---------- 463 (511) T ss_pred c-----cccc---------cceEEeCCCCCCCHHHHHHHHHHH---hc-cCch---HHHHH---hCCCCC---------- Confidence 0 0110 122223332222122222222221 11 1111 00111 111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERE 676 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e 676 (705) ++. .+.++.+.+.. .+ ....+.... ......... .... + .+....+. | T Consensus 464 ---d~~--~E~~ri~~E~~-~~--~~~~~~~~~-----~~~~~~~~~---~~~~-~-----~~~~~~~~--~ 511 (511) T protein:vir:93 464 ---DPE--LEVKKIEEDEK-ES--IKKAQKGIY-----KDPRDINDD---EQDD-D-----TKDTVDKK--E 511 (511) T ss_pred ---CHH--HHHHHHHHHHH-HH--HHHHhhhcc-----cCCCCCCCC---CCCC-c-----cccccccc--C Confidence 111 11111111100 00 000000000 000000000 0000 0 00000000 0 No 73 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.54 E-value=3e-13 Score=89.28 Aligned_cols=475 Identities=11% Similarity=0.027 Sum_probs=210.5 Q ss_pred Ccchhhhhhcc-----------cccccCCCCCCHHHHH-HHHHHHHHhhHHhhHHH-HHHHHHHHHhccCCCC------C Q lcl|NC_021540. 1 MSDINEEFLED-----------TVPSLQEDWKNKPKVS-DLLNDFNNAKSTKDTQV-AIIDDWLAQLNVTGAY------K 61 (705) Q Consensus 1 ~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~------~ 61 (705) |-+|+|=-.-. .+-..-..|+...... ....++......|.... .+.++..+||.|.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC Confidence 55554321100 0111112344332211 11233444444444332 3456788999886321 1 Q ss_pred CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 62 PKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 62 ~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) +...+...+++.+...-.|+.....| ||.+.-+. .+|.+. .++++.++ ..|+--......++++++ T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~~-----~~~~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~i 146 (511) T protein:vir:96 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQYQ-----DDDKDV----LEAIEAFN-DLNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHHhhh----ccCCceee-----cCchHH----HHHHHHHH-hhcCHHHHHHHHHHHHHh Confidence 11122345678888888887776555 45443332 233332 23444443 335544567789999999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+|++ T Consensus 147 ~G~a~~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:96 147 YGKAYELMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99998877652 Q ss_pred ccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.|++..++|.++++ |++... . ...+.+++.+.. .+ . T Consensus 158 -------ed~~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~------~d--------------------~-- 198 (511) T protein:vir:96 158 -------QDDETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKP------ID--------------------K-- 198 (511) T ss_pred -------CCCceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeee------cc--------------------c-- Confidence 02346788889999874 433211 1 223333331100 00 0 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCE--E---EecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDV--M---IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~--i---L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) .....+..+|+|.. +++. +.+..++. . ......|.+.+.+|++.++ ..-+|.|.++ T Consensus 199 ------~~~~~~~~~~iyt~-----~~i~---~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~gd~e 259 (511) T protein:vir:96 199 ------TDEDEVFTVDLFTS-----HGVY---RYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYE 259 (511) T ss_pred ------cccceEEEEEEEeC-----CcEE---EEEecCCCcccccccccccccccCCceeeEEec-----CCCCCCCchh Confidence 00122334455542 2211 11111111 0 1112223333566666654 2346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-CchhhhhhcCCcceeecCC---------cccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLL-DPVNERKFKMGEDYKYNPG---------TNPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av-~~~d~~~~~pg~~i~~~~~---------~~~~~~i~~~~~~~i~ 444 (705) .++++++.+|...|.+.+.+...++|.+++..... +..+......+.++...+. ......+.++..+.-. T Consensus 260 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:96 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCH Confidence 99999999999999999999888887776543222 2222222333333333221 1112223444444344 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ......+..+...+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ T Consensus 340 ~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:96 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 566777889999999999999987765443344444 555555566666667777777777777777665543221100 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) +.++. ...+..+...+.......+.+..+ .| .++. .-++. .++... T Consensus 418 -----~~d~~---------~i~~~f~~~~p~n~~e~~~~~~kl---~G-~iS~---et~l~---~l~~v~---------- 463 (511) T protein:vir:96 418 -----NKDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQ---TTLMS---LFSFFQ---------- 463 (511) T ss_pred -----ccccc---------cceEEeCCCCCCCHHHHHHHHHHH---hc-cCCh---HHHHH---hCCCCC---------- Confidence 00111 112223332222112222221111 11 1111 00110 111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERE 676 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e 676 (705) ++. .+.++.+.+... +....+.... ...... .-.+.+-..+....+. | T Consensus 464 ---D~~--~E~~ri~~E~~~---~~~~~~~~~~-----~~~~~~---------~~~~~~~~~~~~~~~~--~ 511 (511) T protein:vir:96 464 ---DPE--LEVKKIEEDEKE---SIKKAQKGIY-----KDPRDI---------NDDEQDDDTKDTVDKK--E 511 (511) T ss_pred ---CHH--HHHHHHHHHHHH---HHHHHhhccc-----cCCCCC---------CCCCCCCccccccccc--C Confidence 111 111111111000 0000000000 000000 0000000000000000 0 No 74 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.54 E-value=4.3e-13 Score=88.36 Aligned_cols=393 Identities=12% Similarity=0.026 Sum_probs=195.7 Q ss_pred CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC------CCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 21 KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY------KPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLND 94 (705) Q Consensus 21 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~ 94 (705) -+..+|..|.+.+.. ...+.+...+||.|.... .|+..+.+.+.|.+-.+..|+.+...| T Consensus 1 ~~~~~i~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl------- 66 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL------- 66 (409) T ss_pred CCHHHHHHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhc------- Confidence 344577777666543 223344556899986422 122222234455566666666553332 Q ss_pred CCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchh Q lcl|NC_021540. 95 ENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGES 174 (705) Q Consensus 95 ~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~ 174 (705) .|...+..|.. +..+| ..|+--.....++++||+.|.+++.|+= T Consensus 67 ----~~~Gf~~~d~~--------l~~i~-~~N~ld~~~~~~~~~aliyG~sf~~v~~----------------------- 110 (409) T protein:vir:94 67 ----VFREFENDDFT--------VNEIF-EENNPDIFFDSAVLSSLIASCSFTYISK----------------------- 110 (409) T ss_pred ----ccCcccCCchH--------HHHHH-HhcChhHHHHHHHHHHHHhcceeEEEec----------------------- Confidence 12222233422 22333 3344333456788888988988776530 Q ss_pred HHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe--eeCCCccCC Q lcl|NC_021540. 175 IDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV--TIDPTCNGN 252 (705) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~--~~Dp~a~~d 252 (705) ...+.|+|..++|.++ +|||... . T Consensus 111 -----------------------------------------------------~~dg~~~i~~~sp~~~~~i~D~~~~-~ 136 (409) T protein:vir:94 111 -----------------------------------------------------GENDAVRLQVIEAVNATGIIDPITG-L 136 (409) T ss_pred -----------------------------------------------------CCCCceEEEEeccceEEEEEecCCC-c Confidence 0023467778888875 4555321 1 Q ss_pred hhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEE Q lcl|NC_021540. 253 LDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIV 332 (705) Q Consensus 253 ~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~ 332 (705) ...+.+++ +.+. . .......+|.. +. .+. T Consensus 137 ----~~~a~~~~-----------~~d~------------------~----------~~~~~~~~~~~-----~~---~~~ 165 (409) T protein:vir:94 137 ----LTEGYAVL-----------ERDE------------------N----------NNVVLEAHFLP-----DR---TDY 165 (409) T ss_pred ----eeeeEEEE-----------EecC------------------C----------CceEEEEEEec-----Cc---EEE Confidence 11111211 0000 0 00011111111 00 000 Q ss_pred EEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEe---eccc Q lcl|NC_021540. 333 ASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGMIDAMARSANGQRGM---SKNL 408 (705) Q Consensus 333 ~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~---~~~a 408 (705) . +.++......++|+ |.+|+|+++..+..++.+|.|-+ +.++++|+.+|+.+..++......+.|+..+ +++. T Consensus 166 ~-~~~~~~~~~~~n~~--g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~ 242 (409) T protein:vir:94 166 Y-YRDSRNNISIANPT--GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA 242 (409) T ss_pred E-EecCceeEeeeCCC--CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC Confidence 1 11111122345665 68999999999999999999976 7899999999999999999999999997765 2222 Q ss_pred cCchhhhhhcCCcceeecCCcc-cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHH Q lcl|NC_021540. 409 LDPVNERKFKMGEDYKYNPGTN-PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVI 487 (705) Q Consensus 409 v~~~d~~~~~pg~~i~~~~~~~-~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~ 487 (705) +..+.+...++.++.+....+ ....+..++..++. .+...+..+...+-.+||+++..+|...+. +.+|.++.... T Consensus 243 -~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~t~lP~~~lg~~~~N-psSa~Al~a~~ 319 (409) T protein:vir:94 243 -EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLDDLGFVSDN-PSSVEAIKASH 319 (409) T ss_pred -cccchhhhhHHHhhcCCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHHHhccccCc-hhHHHHHHHHH Confidence 233456666677777643322 11223333444443 334555666666667789999999965431 23454555433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEee---ccchhHHHHHHHHH Q lcl|NC_021540. 488 GASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLS---ISNAETDAIKAQEL 564 (705) Q Consensus 488 ~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~---~~~~~~~~~~~q~~ 564 (705) ..-........+.|..++++++++++.+.-.. +..+ +++. +..+.-. ........+....+ T Consensus 320 ~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~-~~~~--------------~~~~-~~~v~W~p~~~~~~~~~a~~aDa~ 383 (409) T protein:vir:94 320 ENLRLAGRKAQRSLGAGLLNVAYLAACLRDDA-PYLR--------------EQFR-KTKPKWEPLFEADASMLSLIGDGA 383 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-Cccc--------------cccc-cceEEeccCCCcchHHHHHHHHHH Confidence 33333344555556666777777666543322 1100 1100 1111111 11111122233334 Q ss_pred HHHHHHHhhhchhHHHHHHHHHHHhhhccchhh Q lcl|NC_021540. 565 SFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLS 597 (705) Q Consensus 565 ~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~ 597 (705) .-|.++..+..+. .-..+..|+.... T Consensus 384 ~Kl~~ag~~~~~~-------~~~~~~lG~~~~d 409 (409) T protein:vir:94 384 IKLNQAIPEFINK-------DTIRDLTGIEGGE 409 (409) T ss_pred HHHHHhcccccch-------hHHHHHcCCCCCC Confidence 4444443221111 1122334443322 No 75 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.53 E-value=2.4e-12 Score=84.33 Aligned_cols=472 Identities=10% Similarity=-0.004 Sum_probs=211.3 Q ss_pred HHHHHHHHHHH----------------HH-hhHHhhHHHHHHHHHHHHhccCCCCC-CCCCCCCCc---CCCHHHHHHHH Q lcl|NC_021540. 23 KPKVSDLLNDF----------------NN-AKSTKDTQVAIIDDWLAQLNVTGAYK-PKQQVGRSS---VQPKLIRKQAE 81 (705) Q Consensus 23 ~~~~~~l~~~~----------------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~grs~---~v~~~v~~~~e 81 (705) =.++..||..+ ++ -..-..++...++.|..||.|.+... +....|+.+ ..+--+...|- T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 01222222222 11 11123345667888999998876332 222233221 11112223333 Q ss_pred HHHHHHHHhhcCCCCEEEEeCCC-cch-HHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhh Q lcl|NC_021540. 82 WRYSALSEPFLNDENIFSIAPKT-WQD-REAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVT 159 (705) Q Consensus 82 ~~~~~l~~~f~~~~~~~~~~p~~-~~D-~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~ 159 (705) .-+++| .|+-..-+.+.... .+. -....-++++||.++. .|+-...+..++..++-.|.|++|+||+ T Consensus 81 ~~~A~L---l~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d------- 149 (517) T protein:vir:98 81 DVLSGL---VFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ-HNKFIKNLSDYLEPTFALGGLTVRPYVD------- 149 (517) T ss_pred HHhhhh---hcCCcceEEecccccccccccchhHHHHHHHHHHH-hccHHHHHHHHHHHHhhhCCEEEEEEEe------- Confidence 344455 24444445444321 111 1122224567776644 4455677889999999999999999994 Q ss_pred hcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEec Q lcl|NC_021540. 160 ENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICD 239 (705) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~ 239 (705) .+.++|+.|+ T Consensus 150 ----------------------------------------------------------------------~~~~~I~~v~ 159 (517) T protein:vir:98 150 ----------------------------------------------------------------------NGEIEFSWAL 159 (517) T ss_pred ----------------------------------------------------------------------CCeeEEEEEc Confidence 1234677888 Q ss_pred hhheee-CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc-cccccccccCeEEEEEEE Q lcl|NC_021540. 240 YHNVTI-DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT-SFTFSDKARKKIVVYEYW 317 (705) Q Consensus 240 ~~~~~~-Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~v~E~w 317 (705) +..||+ ..+. ..+..|-+++ ..+.+... +..||.-++ .+.+.. ..+. ..-...+|+| T Consensus 160 ad~~~Pl~~~~-~~v~~~ai~~-~~~~~~~~--~~~~Yt~lE-------------~H~~~~~~~~~----~~y~I~n~ly 218 (517) T protein:vir:98 160 ANAFYPLRSNS-NGISEGVMKS-VTTKVIGN--KTVYYTLLE-------------FHEWEKTEEGE----SLYVITNELY 218 (517) T ss_pred CCeeEEEEecC-CCeEEEEEEE-EEEEeecC--CceEEEEEE-------------EEecCceeccC----CcEEEEEEEE Confidence 888774 1111 1233333322 12211110 000111000 000000 0000 0000112222 Q ss_pred EEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeee-----eecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 318 GYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYL-----PVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 318 ~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~-----~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) ..-....-|......-++.+ |. +...+.+-..|.+.+... ...++.+|.|++..+++..+.+|..++++++ T Consensus 219 ~s~~~~~lG~~v~L~~~~e~---l~-~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~ 294 (517) T protein:vir:98 219 KSDNEGEIGKRIPLEELYEG---MQ-EKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWW 294 (517) T ss_pred ecCCCccccccccccccccC---CC-cceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHH Confidence 11000000000000000110 00 000001111243322222 2347889999999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCchhh-hhhcC-------CcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPVNE-RKFKM-------GEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGV 464 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~d~-~~~~p-------g~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv 464 (705) .+.+ +..++.++.+++..+.. -...+ ..+++.-.+......+...++.-....++..++.+.+.+....|+ T Consensus 295 e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gl 373 (517) T protein:vir:98 295 EIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKL 373 (517) T ss_pred HHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCC Confidence 8877 77788999888732211 01111 122222122222334454444333456778888888889999999 Q ss_pred chHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceeEeEecCceeeechhhcc Q lcl|NC_021540. 465 KSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVW--LSDEEVIRITDEEFVQINRDNLV 542 (705) Q Consensus 465 ~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~--~~~~~~iri~~~~~v~i~~~~~~ 542 (705) +.-..|..+.. ..||++|..-.+..-.....+.+.+..+++++.+.++.+..-+ +...- ++ . T Consensus 374 s~~t~~~~~~~-~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~------------~~---~ 437 (517) T protein:vir:98 374 SVGTFSFDGRS-MKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEI------------PS---A 437 (517) T ss_pred Ccccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC------------CC---C Confidence 99999977654 3688888876666666677788888888888888887765533 22110 00 0 Q ss_pred cceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchh-----hhhhhcccccchhhHHHHHHH Q lcl|NC_021540. 543 GSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDL-----SKMISKYNPEPSPQAQLEIQI 617 (705) Q Consensus 543 ~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~-----~~~~~~~~~q~~~~~q~~~q~ 617 (705) .+..+..+++...-..+..+...++.++.. .-....+.. .-|+.+- .+.++......++..-.+.+. T Consensus 438 ~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~-ms~~~~i~~-------~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~ 509 (517) T protein:vir:98 438 EHIGVDFDDGVFQDRSALLRFYGQAKTFGF-IPTVEAIQR-------IFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQ 509 (517) T ss_pred cceEEEcCCCCCCCHHHHHHHHHHHHhcCC-CCHHHHHHH-------hCCCChHHHHHHHHHHHHhccccCCCCcccccc Confidence 112233344433333334444444433321 111111111 1111000 000000000000000000000 Q ss_pred HHHHHHHHHHH Q lcl|NC_021540. 618 KQLEAQELQMR 628 (705) Q Consensus 618 ~q~~~q~~q~e 628 (705) ..-. .+.| T Consensus 510 -~~~~--gd~e 517 (517) T protein:vir:98 510 -KRMF--GDEE 517 (517) T ss_pred -CCCC--CCCC Confidence 0000 0000 No 76 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.53 E-value=2.5e-13 Score=89.65 Aligned_cols=406 Identities=12% Similarity=0.045 Sum_probs=186.6 Q ss_pred CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CC----CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 21 KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PK----QQVGRSSVQPKLIRKQAEWRYSALSEPFLND 94 (705) Q Consensus 21 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~----~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~ 94 (705) -+...+..|.+.+..- ..+.++..+||.|..... ++ ..+...+.|.+-.+..|+.+...| T Consensus 1 m~~~~i~~L~~~~~~~-------~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl------- 66 (422) T protein:vir:97 1 MNYMGMGYLRRKLALF-------KTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRI------- 66 (422) T ss_pred CChHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcc------- Confidence 2334566665554442 233446678998864321 11 111111223333333333332211 Q ss_pred CCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchh Q lcl|NC_021540. 95 ENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGES 174 (705) Q Consensus 95 ~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~ 174 (705) .|.+.+-+|.+ +..+| ..|+--.....++++||+.|.+++.|+.. T Consensus 67 ----~~~Gf~~~d~~--------l~~~w-~~N~ld~~~~~~~~~al~~G~sf~~v~~~---------------------- 111 (422) T protein:vir:97 67 ----IFREFTNDDFN--------AWEIF-KANNPDIFFDTAIQSALIASCCFVYIMPG---------------------- 111 (422) T ss_pred ----ccceeeCCchh--------HHHHH-HhcChHHHHHHHHHHHHHhcceeEEEeeC---------------------- Confidence 12223334432 22233 33443333456788889999988876431 Q ss_pred HHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCccCC Q lcl|NC_021540. 175 IDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTCNGN 252 (705) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d 252 (705) ...+.|.|..++|.+++ |||... . T Consensus 112 -----------------------------------------------------~~~~~p~i~~~sp~~~~~i~D~~~~-~ 137 (422) T protein:vir:97 112 -----------------------------------------------------AEDGLPKMQVIEASKATGILDPTTF-L 137 (422) T ss_pred -----------------------------------------------------CCCCeeEEEEechhhEEEEEeCCCC-c Confidence 00134567777888754 455421 1 Q ss_pred hhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEE Q lcl|NC_021540. 253 LDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIV 332 (705) Q Consensus 253 ~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~ 332 (705) + ..+.+.+ +.+.. .. ... .+|+. ++. + T Consensus 138 ~----~~a~~~~-----------~~~~~---------------------------~~-~~~-~~~~~-----~~~----~ 164 (422) T protein:vir:97 138 L----TEGYAIL-----------ESDSN---------------------------GN-PTL-EAYFT-----DKD----I 164 (422) T ss_pred c----eeeEEEE-----------EecCC---------------------------Cc-EEE-EEEEc-----Cce----E Confidence 1 1111111 00000 00 000 01100 000 0 Q ss_pred EEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-- Q lcl|NC_021540. 333 ASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLL-- 409 (705) Q Consensus 333 ~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av-- 409 (705) .++.++......++|+ |..|+++++..+..++.+|.|-+ +.++++|+.+|+.++.++......+.|+..+- |+- T Consensus 165 ~~~~~~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d 241 (422) T protein:vir:97 165 WYYPKKGKPYNIKNPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVL-GMDPD 241 (422) T ss_pred EEEcCCCccccccCCC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-ccCcc Confidence 0111111111235655 67899999999999999999976 88999999999999999999999999987652 221 Q ss_pred -CchhhhhhcCCcceeecCCccc-ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHH Q lcl|NC_021540. 410 -DPVNERKFKMGEDYKYNPGTNP-VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVI 487 (705) Q Consensus 410 -~~~d~~~~~pg~~i~~~~~~~~-~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~ 487 (705) ...+.+....+.++.+....+. ...+..++...+. .+...+..+...+-.+||+++..+|...+. +.+|.++.... T Consensus 242 ~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~N-psSa~Ai~a~~ 319 (422) T protein:vir:97 242 AKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKMYASLFAGGSGLTLDDLGFPSDN-PSSVESIKAAH 319 (422) T ss_pred cccCchhhhhhhhhhccCCCCCCCcceeeecCCCChh-HHHHHHHHHHHHHhcccCCCHHHhccccCc-hhHHHHHHHHH Confidence 1233455566777776543321 1123333333433 344455666666667789999999976532 23454554433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEee---ccchhHHHHHHHHH Q lcl|NC_021540. 488 GASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLS---ISNAETDAIKAQEL 564 (705) Q Consensus 488 ~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~---~~~~~~~~~~~q~~ 564 (705) ..-........+.|..++++++++++.+.-.. ... ++.+. +..+.-. .....+..+....+ T Consensus 320 ~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~-~~~--------------~~~~~-~~~~~w~p~~~~~~~s~a~~aDa~ 383 (422) T protein:vir:97 320 ENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEF-PYL--------------RNQFM-DTVIKWEPLFEADANMLTLVGDGA 383 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-ccc--------------chhhc-cceEEEccCCCCChHHHHHHHHHH Confidence 33333345555666666666666665443221 100 11111 1112211 11111122222333 Q ss_pred HHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 565 SFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQA 634 (705) Q Consensus 565 ~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa 634 (705) ..|.++.....+ .... .+..|+....... +..++. ++.. T Consensus 384 ~Kl~~a~~~~~~----~~~~---~~~lg~~~~~~~~------------~~~~~~------------~~d~ 422 (422) T protein:vir:97 384 IKLNQAIPGFMD----ADVI---RDLTGVKGADKPI------------PAITEV------------TTDG 422 (422) T ss_pred HHHHhhcccccc----HHHH---HHHcCCCchhHHH------------HHHHhh------------hccC Confidence 333343211111 1111 1222332211100 000000 0000 No 77 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.53 E-value=7.2e-13 Score=87.14 Aligned_cols=447 Identities=9% Similarity=0.004 Sum_probs=200.7 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC------------------C- Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY------------------K- 61 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~- 61 (705) |-+..++.-+ ..+| -+ . |++-++ .|.....+..+..+||.+.... . T Consensus 3 ~~~~~~~~~~----~~~~---~e-~---i~~~i~----~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:10 3 LYKLIDDIEA----QGIL---PK-H---IEALIE----SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred hHHHHhhccc----cCCC---HH-H---HHHHHH----HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 4444433322 2222 11 1 222221 1221222222333444332110 0 Q ss_pred CCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHH Q lcl|NC_021540. 62 PKQQVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTA 139 (705) Q Consensus 62 ~~~~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~a 139 (705) -...++|+ +++.+-.+..|+.....| ||.+.-+.+.+-+..|+. ...+++.+ ...|+--......++++ T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~~~~~~e~----~~~~l~~~-~~~n~~~~~~~~~~~~~ 138 (474) T protein:vir:10 68 VRRLDVSVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDENAEKNEK----LKKFITNF-AIRNSVDDEDSEIGKMA 138 (474) T ss_pred ccccccCcccccccchHHHHHHhHhhhe----eccceeEeeCCCCcchHH----HHHHHHHH-HhhcCHhHHHHHHHHHH Confidence 01223343 677887777777766554 666665666443333333 23344443 23354445577889999 Q ss_pred HhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCc Q lcl|NC_021540. 140 VNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIING 219 (705) Q Consensus 140 l~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 219 (705) +..|.+.+.+|.+ T Consensus 139 ~~~G~a~~~~~~d------------------------------------------------------------------- 151 (474) T protein:vir:10 139 AICGYGARLAYID------------------------------------------------------------------- 151 (474) T ss_pred hhcCeEEEEEEeC------------------------------------------------------------------- Confidence 9999987765321 Q ss_pred ccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 220 YEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 220 ~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.+++..++|.++++=.+-+. +.-+.+ +++....+ T Consensus 152 ---------~~~~~~~~~i~p~~~~~v~d~~~---~~~~~i-~~~~~~~~------------------------------ 188 (474) T protein:vir:10 152 ---------TNGDIRIKNIDPYNVIFVGDNIL---EPTYSL-RYFYEKDD------------------------------ 188 (474) T ss_pred ---------CCCeeEEEEEcccceEEEEcCCC---ceEEEE-EEEEEeeC------------------------------ Confidence 12346788889988754222111 112222 22211000 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC---CEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHh Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD---DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELL 376 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g---~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~ 376 (705) ....-+..+++|.. +. +..|.+ +.....++.|.+.|.+|+++++ ...+|.|.+..+ T Consensus 189 ------~~~~~~~~~~~y~~-----~~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v 247 (474) T protein:vir:10 189 ------DNGTDYVYAEFYDN-----AY-----YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKV 247 (474) T ss_pred ------CCceEEEEEEEEcC-----ce-----EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHH Confidence 00011223444432 11 111111 1112222233333667777654 355789999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 377 SDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 377 ~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) +++++.+|...|.+.+.+...+.|.+++. |. .+..........+.+.+.++. ..+.++..+.-.......++.+. T Consensus 248 ~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~ 323 (474) T protein:vir:10 248 IHLIDAYDLTMSDASSEISQTRLAYLVLR-GMGMSEEMIQETQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIE 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCCCCchhhhhhhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHH Confidence 99999999999999999998888887663 43 232233334444555554322 12344444433456677789999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ . T Consensus 324 ~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~----------~ 391 (474) T protein:vir:10 324 KNIMRFAKSVNFNSDEFNGNVPIIG--MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLD----------D 391 (474) T ss_pred HHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC----------c Confidence 9999999999887764433344444 554445555566666777777888887777776543221100 0 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) .+.. +..+..+...+.......+.+..+. + .++. .-++. .++...+ + +.+. T Consensus 392 ~~~~----~i~~~f~~~~p~d~~e~a~~~~kl~---g-~iS~---et~~~---~l~~v~d-------------~--~~E~ 442 (474) T protein:vir:10 392 DSYL----NLIFKFTRNIPVNKLEESQVLINLK---G-QVSE---RTRLG---QSQLVDD-------------V--DYEL 442 (474) T ss_pred cccc----cceEEeCCCCCCCHHHHHHHHHHHh---c-cCch---HHHHH---hCCCCCC-------------H--HHHH Confidence 0000 1122222222211112222221111 1 1110 00111 0111111 1 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTE 656 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~e 656 (705) ++.+.+..+. .+...+........+.+. . +.+ T Consensus 443 eri~~E~~e~----~~~~~~~~~~~~~~~~~~--~---~s~ 474 (474) T protein:vir:10 443 DEMEKESLEF----NDKLPDIDEGDANDKSQN--N---QSE 474 (474) T ss_pred HHHHHHHHHH----HhhcccccCCCcCCCCcc--c---cCC Confidence 1111111000 000000000000000000 0 000 No 78 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.53 E-value=7.2e-13 Score=87.14 Aligned_cols=447 Identities=9% Similarity=0.004 Sum_probs=200.7 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC------------------C- Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY------------------K- 61 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~- 61 (705) |-+..++.-+ ..+| -+ . |++-++ .|.....+..+..+||.+.... . T Consensus 3 ~~~~~~~~~~----~~~~---~e-~---i~~~i~----~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:94 3 LYKLIDDIEA----QGIL---PK-H---IEALIE----SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred hHHHHhhccc----cCCC---HH-H---HHHHHH----HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 4444433322 2222 11 1 222221 1221222222333444332110 0 Q ss_pred CCCCCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHH Q lcl|NC_021540. 62 PKQQVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTA 139 (705) Q Consensus 62 ~~~~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~a 139 (705) -...++|+ +++.+-.+..|+.....| ||.+.-+.+.+-+..|+. ...+++.+ ...|+--......++++ T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~yl----~g~pv~~~~~~~~~~~e~----~~~~l~~~-~~~n~~~~~~~~~~~~~ 138 (474) T protein:vir:94 68 VRRLDVSVNNKLNNSFDSEIVDTRVGYL----HGVPVTYDLDENAEKNEK----LKKFITNF-AIRNSVDDEDSEIGKMA 138 (474) T ss_pred ccccccCcccccccchHHHHHHhHhhhe----eccceeEeeCCCCcchHH----HHHHHHHH-HhhcCHhHHHHHHHHHH Confidence 01223343 677887777777766554 666665666443333333 23344443 23354445577889999 Q ss_pred HhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCc Q lcl|NC_021540. 140 VNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIING 219 (705) Q Consensus 140 l~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 219 (705) +..|.+.+.+|.+ T Consensus 139 ~~~G~a~~~~~~d------------------------------------------------------------------- 151 (474) T protein:vir:94 139 AICGYGARLAYID------------------------------------------------------------------- 151 (474) T ss_pred hhcCeEEEEEEeC------------------------------------------------------------------- Confidence 9999987765321 Q ss_pred ccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 220 YEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 220 ~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.+++..++|.++++=.+-+. +.-+.+ +++....+ T Consensus 152 ---------~~~~~~~~~i~p~~~~~v~d~~~---~~~~~i-~~~~~~~~------------------------------ 188 (474) T protein:vir:94 152 ---------TNGDIRIKNIDPYNVIFVGDNIL---EPTYSL-RYFYEKDD------------------------------ 188 (474) T ss_pred ---------CCCeeEEEEEcccceEEEEcCCC---ceEEEE-EEEEEeeC------------------------------ Confidence 12346788889988754222111 112222 22211000 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC---CEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHh Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD---DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELL 376 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g---~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~ 376 (705) ....-+..+++|.. +. +..|.+ +.....++.|.+.|.+|+++++ ...+|.|.+..+ T Consensus 189 ------~~~~~~~~~~~y~~-----~~-----~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v 247 (474) T protein:vir:94 189 ------DNGTDYVYAEFYDN-----AY-----YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKV 247 (474) T ss_pred ------CCceEEEEEEEEcC-----ce-----EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHH Confidence 00011223444432 11 111111 1112222233333667777654 355789999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 377 SDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 377 ~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) +++++.+|...|.+.+.+...+.|.+++. |. .+..........+.+.+.++. ..+.++..+.-.......++.+. T Consensus 248 ~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~ 323 (474) T protein:vir:94 248 IHLIDAYDLTMSDASSEISQTRLAYLVLR-GMGMSEEMIQETQKSGAFELFDKD---MDVKYLTKDVNDTMIENHLDRIE 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCCCCchhhhhhhhcceeEecCCC---CceeEEeccCCHHHHHHHHHHHH Confidence 99999999999999999998888887663 43 232233334444555554322 12344444433456677789999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ . T Consensus 324 ~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~----------~ 391 (474) T protein:vir:94 324 KNIMRFAKSVNFNSDEFNGNVPIIG--MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLD----------D 391 (474) T ss_pred HHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC----------c Confidence 9999999999887764433344444 554445555566666777777888887777776543221100 0 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) .+.. +..+..+...+.......+.+..+. + .++. .-++. .++...+ + +.+. T Consensus 392 ~~~~----~i~~~f~~~~p~d~~e~a~~~~kl~---g-~iS~---et~~~---~l~~v~d-------------~--~~E~ 442 (474) T protein:vir:94 392 DSYL----NLIFKFTRNIPVNKLEESQVLINLK---G-QVSE---RTRLG---QSQLVDD-------------V--DYEL 442 (474) T ss_pred cccc----cceEEeCCCCCCCHHHHHHHHHHHh---c-cCch---HHHHH---hCCCCCC-------------H--HHHH Confidence 0000 1122222222211112222221111 1 1110 00111 0111111 1 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTE 656 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~e 656 (705) ++.+.+..+. .+...+........+.+. . +.+ T Consensus 443 eri~~E~~e~----~~~~~~~~~~~~~~~~~~--~---~s~ 474 (474) T protein:vir:94 443 DEMEKESLEF----NDKLPDIDEGDANDKSQN--N---QSE 474 (474) T ss_pred HHHHHHHHHH----HhhcccccCCCcCCCCcc--c---cCC Confidence 1111111000 000000000000000000 0 000 No 79 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.52 E-value=1.4e-12 Score=85.50 Aligned_cols=467 Identities=9% Similarity=0.002 Sum_probs=209.9 Q ss_pred Ccchhhhhhccc----------ccccCC-CCCCHHHHHH-HHHHHHHhhHHhhHH-HHHHHHHHHHhccCCCC----CCC Q lcl|NC_021540. 1 MSDINEEFLEDT----------VPSLQE-DWKNKPKVSD-LLNDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY----KPK 63 (705) Q Consensus 1 ~~~~~~~~~~~~----------~~~~~~-~~~~~~~~~~-l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~~ 63 (705) |-+|+|=-.-.+ +..+.. .|++...... -...+....+.|... ..+.++..+||.|.-.. ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 555544211000 111221 3443322111 112344444444433 24456788999986321 111 Q ss_pred --CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 64 --QQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 64 --~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) ..+...+++.+..+-.|+.....| ||.+.-+ .+ +|...- +.++.++ ..|+--......++++++ T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~~---~d~~~~----~~l~~~~-~~n~~~~~~~~~~~~~~i 146 (511) T protein:vir:10 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--QD---DDKDVL----EAIEAFN-DLNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhh----cccCcee--ec---CchHHH----HHHHHHH-hhcCHHHHHHHHHHHHHh Confidence 112335678888888888776655 4444333 22 333322 3444433 235444556788999999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+|++ T Consensus 147 ~G~ay~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:10 147 YGKAYEIMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99997776542 Q ss_pred ccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.+++..++|.++++ |++... . ...+.+++.+.. ++ . T Consensus 158 -------edg~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~------~d--------------------~-- 198 (511) T protein:vir:10 158 -------QDDETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKP------ID--------------------K-- 198 (511) T ss_pred -------CCCceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeee------cc--------------------c-- Confidence 02346788889998774 333211 1 223333332110 00 0 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEE-----EecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVM-----IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~i-----L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) .....+..+|+|.. +++ ++....++.. ......|.+.+.+|++.++ ..-+|.|.++ T Consensus 199 ------~~~~~~~~~~iyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~-----nn~~g~gd~e 259 (511) T protein:vir:10 199 ------TDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYE 259 (511) T ss_pred ------CccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCcceeEEEec-----CCCCCCCchh Confidence 00122344455543 211 1111111111 1112223334566666654 2346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCCc---------ccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPGT---------NPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~~---------~~~~~i~~~~~~~i~ 444 (705) .++++++.+|...|.+.+.+...++|.+++.... .+..+......+.++...+.. .....+.++..+.-. T Consensus 260 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~ 339 (511) T protein:vir:10 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCH Confidence 9999999999999999999988888777654322 222222333334444332211 111223444434334 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ..+...+..+...+...|++++.+.+..++.+|+.| ++..............+.|..+++++++.++.++........ T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:10 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccc Confidence 566678899999999999999887765433344444 555445555556666677777777777776665543221100 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) ..++. ...+..+...+.......+.+..+ .+ .++. .-++. .++...+..+ T Consensus 418 -----~~d~~---------~i~i~f~~~~p~d~~~~~~~~~kl---~G-~iS~---et~~~---~l~~v~d~~~------ 467 (511) T protein:vir:10 418 -----NKDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQ---TTLMS---LFSFFQDPEL------ 467 (511) T ss_pred -----ccccc---------eeeEEeCCCCCcCHHHHHHHHHHH---hc-cCcH---HHHHH---hCCCCCCHHH------ Confidence 00110 122233332222222222222222 11 1111 00111 1111111111 Q ss_pred ccchhhHHHHHHHHHHHHH-HHHHHHHHHHH-------HHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQ-ELQMRIAKLQA-------EIQLMPYEAQAEAAK 649 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q-~~q~e~~k~qa-------~~q~~~~~~q~e~a~ 649 (705) +.++.+.+.+ ..+........ .....+.+-..++.+ T Consensus 468 ---------E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 468 ---------EVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ---------HHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 1111111100 00000000000 000000000000000 No 80 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.52 E-value=3.5e-13 Score=88.89 Aligned_cols=475 Identities=10% Similarity=0.018 Sum_probs=207.5 Q ss_pred Ccchhhhhhcccc----------cccC-CCCCCHHH-HHHHHHHHHHhhHHhhHH-HHHHHHHHHHhccCCCC---C-CC Q lcl|NC_021540. 1 MSDINEEFLEDTV----------PSLQ-EDWKNKPK-VSDLLNDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY---K-PK 63 (705) Q Consensus 1 ~~~~~~~~~~~~~----------~~~~-~~~~~~~~-~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~-~~ 63 (705) |-+|+|=-...+. ..+. -.|..... ......++....+.|.+. ..+.++..+||.|.-.. . .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 5555442111111 1111 12333221 111122233334444332 23456778899886321 1 11 Q ss_pred --CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 64 --QQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 64 --~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) ..+...+++.+-..-.|+.....| ||.+.-+ .+ +|.+. .+.++.++ ..|+--.......+++++ T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~~---~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~~ 146 (511) T protein:vir:78 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--QD---DDKDV----LEAIEAFN-DLNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhh----cccCcee--ec---CchHH----HHHHHHHH-hhcChhHHHHHHHHHHHh Confidence 112235678888888888776655 4544333 22 33332 23444433 334444556788899999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+|++ T Consensus 147 ~G~a~~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:78 147 YGKAYELMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99997776652 Q ss_pred ccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.|++..++|.+++ ||+.... . ...+.+++.+.. ++ T Consensus 158 -------~dg~~~i~~~~p~~~~~v~dd~~~~---~-~~~~vr~~~~~~------~~----------------------- 197 (511) T protein:vir:78 158 -------QDDETRLYKSDAMSTFIIYDNTVER---N-SIAGVRYLRTKP------ID----------------------- 197 (511) T ss_pred -------CCCceEEEEEcccceEEEEcCCCCC---c-eEEEEEEEEeee------cc----------------------- Confidence 0234678889999987 4443221 1 223333332110 00 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCE---E--EecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDV---M--IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~---i--L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) +.....+..+|+|.. +++ ++....++. + -.....|.+.+.+|++.++. ..+|.|.+. T Consensus 198 -----~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e 259 (511) T protein:vir:78 198 -----KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYE 259 (511) T ss_pred -----ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchh Confidence 000122334455542 221 111111111 0 01122333445667766542 346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCC---------cccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPG---------TNPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~---------~~~~~~i~~~~~~~i~ 444 (705) .++++++.+|...|.+.+.+...++|.+++.... .+..+......+..+...++ ......+.++..+.-. T Consensus 260 ~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:78 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCH Confidence 9999999999999999999988888877654322 22222222222333322211 1111223344433334 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ......+..+...+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:78 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 556677888999999999999987775543344444 444444455555666677777777777777666543221110 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) - .++. ...+..+...+.......+.+..+ .+ .++.. -++. .++...+ T Consensus 418 ~-----~~~~---------~i~~~f~~~~p~n~~e~~d~~~kl---~G-~iS~e---t~l~---~l~~v~d--------- 464 (511) T protein:vir:78 418 N-----KDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQT---TLMS---LFSFFQD--------- 464 (511) T ss_pred c-----cccc---------cceEEeCCCCCcCHHHHHHHHHHH---hc-cCChH---HHHH---hCCCCCC--------- Confidence 0 0110 122333333232222222222221 11 11110 0110 0111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQE 668 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~ 668 (705) + +.++++.+.+.+ +..+.+.. ..........-. ......+....+.+ T Consensus 465 ----~--~~El~ri~~E~~----~~~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 465 ----P--ELEVKKIEEDEK----ESIKKAQK----GIYKDPRDINDD---EQDDDTKDTVDKKE 511 (511) T ss_pred ----H--HHHHHHHHHHHH----HHHHHHhh----ccccCCCCCCCC---CCCCCccCcccccC Confidence 1 111111111100 00000000 000000000000 00000000000000 No 81 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.52 E-value=3.5e-13 Score=88.89 Aligned_cols=475 Identities=10% Similarity=0.018 Sum_probs=207.5 Q ss_pred Ccchhhhhhcccc----------cccC-CCCCCHHH-HHHHHHHHHHhhHHhhHH-HHHHHHHHHHhccCCCC---C-CC Q lcl|NC_021540. 1 MSDINEEFLEDTV----------PSLQ-EDWKNKPK-VSDLLNDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY---K-PK 63 (705) Q Consensus 1 ~~~~~~~~~~~~~----------~~~~-~~~~~~~~-~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~-~~ 63 (705) |-+|+|=-...+. ..+. -.|..... ......++....+.|.+. ..+.++..+||.|.-.. . .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 5555442111111 1111 12333221 111122233334444332 23456778899886321 1 11 Q ss_pred --CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 64 --QQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 64 --~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) ..+...+++.+-..-.|+.....| ||.+.-+ .+ +|.+. .+.++.++ ..|+--.......+++++ T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~--~~---~d~~~----~~~l~~~~-~~n~~~~~~~~~~~~~~~ 146 (511) T protein:vir:96 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQY--QD---DDKDV----LEAIEAFN-DLNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhh----cccCcee--ec---CchHH----HHHHHHHH-hhcChhHHHHHHHHHHHh Confidence 112235678888888888776655 4544333 22 33332 23444433 334444556788899999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+|++ T Consensus 147 ~G~a~~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:96 147 YGKAYELMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99997776652 Q ss_pred ccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.|++..++|.+++ ||+.... . ...+.+++.+.. ++ T Consensus 158 -------~dg~~~i~~~~p~~~~~v~dd~~~~---~-~~~~vr~~~~~~------~~----------------------- 197 (511) T protein:vir:96 158 -------QDDETRLYKSDAMSTFIIYDNTVER---N-SIAGVRYLRTKP------ID----------------------- 197 (511) T ss_pred -------CCCceEEEEEcccceEEEEcCCCCC---c-eEEEEEEEEeee------cc----------------------- Confidence 0234678889999987 4443221 1 223333332110 00 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCE---E--EecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDV---M--IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~---i--L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) +.....+..+|+|.. +++ ++....++. + -.....|.+.+.+|++.++. ..+|.|.+. T Consensus 198 -----~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e 259 (511) T protein:vir:96 198 -----KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYE 259 (511) T ss_pred -----ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchh Confidence 000122334455542 221 111111111 0 01122333445667766542 346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCC---------cccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPG---------TNPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~---------~~~~~~i~~~~~~~i~ 444 (705) .++++++.+|...|.+.+.+...++|.+++.... .+..+......+..+...++ ......+.++..+.-. T Consensus 260 ~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 339 (511) T protein:vir:96 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCH Confidence 9999999999999999999988888877654322 22222222222333322211 1111223344433334 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ......+..+...+...|++++.+.+..++.+|+.| +...............+.|..+++++++.++.++........ T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:96 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 556677888999999999999987775543344444 444444455555666677777777777777666543221110 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) - .++. ...+..+...+.......+.+..+ .+ .++.. -++. .++...+ T Consensus 418 ~-----~~~~---------~i~~~f~~~~p~n~~e~~d~~~kl---~G-~iS~e---t~l~---~l~~v~d--------- 464 (511) T protein:vir:96 418 N-----KDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQT---TLMS---LFSFFQD--------- 464 (511) T ss_pred c-----cccc---------cceEEeCCCCCcCHHHHHHHHHHH---hc-cCChH---HHHH---hCCCCCC--------- Confidence 0 0110 122333333232222222222221 11 11110 0110 0111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQE 668 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~ 668 (705) + +.++++.+.+.+ +..+.+.. ..........-. ......+....+.+ T Consensus 465 ----~--~~El~ri~~E~~----~~~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 465 ----P--ELEVKKIEEDEK----ESIKKAQK----GIYKDPRDINDD---EQDDDTKDTVDKKE 511 (511) T ss_pred ----H--HHHHHHHHHHHH----HHHHHHhh----ccccCCCCCCCC---CCCCCccCcccccC Confidence 1 111111111100 00000000 000000000000 00000000000000 No 82 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.51 E-value=3.6e-12 Score=83.29 Aligned_cols=449 Identities=10% Similarity=0.067 Sum_probs=200.7 Q ss_pred Ccchhhhhhcc---ccccc---CCCC-------CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCC- Q lcl|NC_021540. 1 MSDINEEFLED---TVPSL---QEDW-------KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQ- 64 (705) Q Consensus 1 ~~~~~~~~~~~---~~~~~---~~~~-------~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~- 64 (705) .|+|-.--+.. -.|.. ...+ ++..+ +...+....+.|...+.+..+..+||.|..... +.+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 83 (492) T protein:vir:97 7 ISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPET---LEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhh---HHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc Confidence 22222211100 01100 0011 12222 222344444556666677778899999873211 111 Q ss_pred --------CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHH Q lcl|NC_021540. 65 --------QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMV 136 (705) Q Consensus 65 --------~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~ 136 (705) .+-..+++.+..+..|+.....| +|.+. .|.+ +|.+. .++++..+. |+-...+.... T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl----~g~p~--~~~~---~d~~~----~~~l~~~~~--n~~~~~~~~~~ 148 (492) T protein:vir:97 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYI----VGKPI--AFKH---TDDEV----VKRIDEVLG--NRFDDKLHSVL 148 (492) T ss_pred cccccccccccccccccchHHHHHHHHhhhh----cccCc--eecc---CchHH----HHHHHHHHh--ccHHHHHHHHH Confidence 11234678888888888877665 45443 3322 34332 234444432 44445566788 Q ss_pred HHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceec Q lcl|NC_021540. 137 RTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAI 216 (705) Q Consensus 137 ~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 216 (705) ++++.+|.+.+.+|++ T Consensus 149 ~~~~~~G~a~~~v~~d---------------------------------------------------------------- 164 (492) T protein:vir:97 149 TGASNKGIEWLHPYLD---------------------------------------------------------------- 164 (492) T ss_pred HHHhhcCeEEEEEEec---------------------------------------------------------------- Confidence 9999999987766531 Q ss_pred cCcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccc Q lcl|NC_021540. 217 INGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDH 294 (705) Q Consensus 217 ~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~ 294 (705) ..+.|++..++|.++++ |++.... -. .+.+.+.... . T Consensus 165 ------------~dg~~~~~~~~p~~~~~i~d~~~~~~---~~-~~vr~~~~~~----------~--------------- 203 (492) T protein:vir:97 165 ------------EEGEFKLFRVPAEQGIPIWTDKEHEE---LE-AFIRMYKLEN----------E--------------- 203 (492) T ss_pred ------------CCCceEEEEEcccceEEEEcCCCCCc---eE-EEEEEEeecc----------c--------------- Confidence 02346788899999765 3332212 22 2333331100 0 Q ss_pred ccccccccccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccC Q lcl|NC_021540. 295 YSSDTSFTFSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYG 369 (705) Q Consensus 295 ~~~~~~~~~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g 369 (705) ..+ |+|.. +.+.+++... ......+...+...+++ .+.+|+++++. +.+| T Consensus 204 --------------~~~---~~y~~~~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~--~g~vPvv~~~n-----n~~g 258 (492) T protein:vir:97 204 --------------TKV---EYWDKVTVNYYVYENGSLIP-DYSNNLENSKTHFSTGS--WGKIPFIPFKN-----NDLE 258 (492) T ss_pred --------------eeE---EEEecCeEEEEEEecCeeee-cccccccccccccccCC--CCCcceEEecC-----CCCC Confidence 001 11111 1111111100 00000111222233333 36677777653 3468 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-hhhh-hhcCCcceeecCCcccccccccccCccchHHH Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDP-VNER-KFKMGEDYKYNPGTNPVTDIIEHKYPELPASS 447 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~-~d~~-~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~ 447 (705) .|.+..++++++.+|.+.|.+.+.+...+.|.+++....... .+.. ......++.+..++. +.+...+.-.... T Consensus 259 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~ 334 (492) T protein:vir:97 259 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVENS 334 (492) T ss_pred CCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCCCCc----ceeEeccCCHHHH Confidence 999999999999999999999999999888877654211111 1111 224445565554432 3343333333556 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEe Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIR 527 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~ir 527 (705) ...++.+.+.+...|++++.+.+..++..|+.| +...............+.|..+++++++.++.++ ... T Consensus 335 ~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~----~~~---- 404 (492) T protein:vir:97 335 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF----DIK---- 404 (492) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCcHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC---- Confidence 677899999999999998877765444444444 4444444444555556666666666666555433 211 Q ss_pred EecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccc Q lcl|NC_021540. 528 ITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEP 607 (705) Q Consensus 528 i~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~ 607 (705) .++. ...+..+...+.......+.+..+. + .++.. -.+ ..++...+.. T Consensus 405 ---~~~~---------~i~v~f~~~~p~~~~e~a~~~~kl~---G-~iS~e---t~l---~~l~~v~d~~---------- 452 (492) T protein:vir:97 405 ---GEHK---------DVDISFNYNKVANTELQVQTAQQSM---G-IVSHE---TVL---ENHPFVEDLQ---------- 452 (492) T ss_pred ---cccc---------eeeEEecCCCCCCHHHHHHHHHHHh---c-cCchH---HHH---HhCCCCCCHH---------- Confidence 0111 1223333322221112222222211 1 11110 011 0111111111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHH---HHHHHHH-HHHHHHHHHH Q lcl|NC_021540. 608 SPQAQLEIQIKQLEAQELQMRIAK---LQAEIQL-MPYEAQAEAA 648 (705) Q Consensus 608 ~~~~q~~~q~~q~~~q~~q~e~~k---~qa~~q~-~~~~~q~e~a 648 (705) .+.++.+.+..+....... ....... .+.....+.+ T Consensus 453 -----~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 453 -----AELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred -----HHHHHHHHHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 1111111111000000000 0000000 0000000000 No 83 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.51 E-value=2.1e-12 Score=84.62 Aligned_cols=455 Identities=10% Similarity=0.007 Sum_probs=195.4 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CC-CCCCC--CcCCCHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PK-QQVGR--SSVQPKL 75 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~gr--s~~v~~~ 75 (705) |++=+=.++ +++.+++ ...+..|.... + .....+.++..+||.|.-... +. ..+++ .+++.+. T Consensus 1 ~~~~~~~~~--~~~~~~~----~~~~~~~i~~~---~---~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~ 68 (489) T protein:vir:99 1 MLQEDFEAI--DYESKLW----IDQLKNYISRF---K---AEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDF 68 (489) T ss_pred CCccceeee--CCCCCCC----HHHHHHHHHHH---H---HHHHHHHHHHHHHhcccCccccccccccccCCcceeecch Confidence 322211111 2222233 12222232222 1 223445667889999874221 11 12233 3688888 Q ss_pred HHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCc-chHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 76 IRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKV-KLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 76 v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~-~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) .+..|+.....| ||.+.- |.+ +|.. ..++++.++.. | .+ .......++++..|.|.+.+|+... T Consensus 69 ~~~iv~~~~~~l----~g~~~~--~~~---~d~~----~~~~l~~~~~~-n-~~~~~~~~~~~~~~~~G~~~~~v~~~~~ 133 (489) T protein:vir:99 69 AKYITVFEQGYM----LGVPVE--YKN---ENKD----LQAAIDLMSVR-N-NEDYHNVKIKTDLSIYGRAYELLTVEKI 133 (489) T ss_pred HHHHHHHHhhhh----ccCCce--eec---CChh----HHHHHHHHHhh-c-ChhHHHHHHHHHHhhCCeEEEEEeeccC Confidence 888888877665 554433 332 3322 34566665543 3 44 4567889999999999887765200 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) ....++++ T Consensus 134 ------------------------------------------------------------------------~d~~~~~~ 141 (489) T protein:vir:99 134 ------------------------------------------------------------------------DDKKTEVK 141 (489) T ss_pred ------------------------------------------------------------------------cCCCcceE Confidence 01134678 Q ss_pred EEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEE Q lcl|NC_021540. 235 VTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIV 312 (705) Q Consensus 235 i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 312 (705) |..++|.++++ |+... .+..+.+ +++... + .....+. T Consensus 142 i~~~~p~~~~~v~dd~~~---~~~~~~i-~~~~~~----------~---------------------------~~~~~~~ 180 (489) T protein:vir:99 142 LYQLPAEQTFVIYDDTYQ---RNSLMAV-HFYDID----------Y---------------------------GSGKRKQ 180 (489) T ss_pred EEEEcccceEEEEcCCCC---CceEEEE-EEEEEe----------c---------------------------CCCceEE Confidence 89999999854 32221 1222222 222100 0 0001233 Q ss_pred EEEEEEEeeecCCCeeEEEEEEEE-CC-EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHH Q lcl|NC_021540. 313 VYEYWGYWDIDGSGVTTPIVASWV-DD-VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGM 390 (705) Q Consensus 313 v~E~w~k~~~~~dg~~~~~~~~~~-g~-~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~ 390 (705) ++++|.. +.+.++...... ++ .+. ...|.+.+.+|++++.. ...|.|.+..++++++.+|..++.+ T Consensus 181 ~~~~y~~-----~~i~~~~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~ 248 (489) T protein:vir:99 181 IIKAYTS-----DTIYTYEDYNLETKGMRLK--DYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSEL 248 (489) T ss_pred EEEEEeC-----CcEEEEEecCCCcccceec--ccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHH Confidence 4455532 111111111101 11 122 22233336778877653 3468899999999999999999999 Q ss_pred HHHHHhcCCCcEEeeccccCchh------hhhhcCC------------cceeecCCccc---ccccccccCccchHHHHH Q lcl|NC_021540. 391 IDAMARSANGQRGMSKNLLDPVN------ERKFKMG------------EDYKYNPGTNP---VTDIIEHKYPELPASSYN 449 (705) Q Consensus 391 ~d~~~~~~~~~~~~~~~av~~~d------~~~~~pg------------~~i~~~~~~~~---~~~i~~~~~~~i~~~~~~ 449 (705) .+.+...+.+.+++-.......+ .....++ .++...++... ...+.++..+.-...... T Consensus 249 ~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 328 (489) T protein:vir:99 249 ANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEA 328 (489) T ss_pred HHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHH Confidence 99998888777655321111111 1111111 12222222111 112233333323345556 Q ss_pred HHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEe Q lcl|NC_021540. 450 MLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRIT 529 (705) Q Consensus 450 ~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~ 529 (705) .++.+...+...||+++.+.+..++..|+.| +...............+.|..+++++++.++.++....... T Consensus 329 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~------ 400 (489) T protein:vir:99 329 YKNRLVADILRFTFTPDTQDMKFSGVQSGES--MKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEA------ 400 (489) T ss_pred HHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc------ Confidence 7788899999999998876553333334444 44333444444555566666677777766666553221110 Q ss_pred cCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchh Q lcl|NC_021540. 530 DEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSP 609 (705) Q Consensus 530 ~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~ 609 (705) . .........+..+...+.......+.+..+. + .++... .+.. +.+... +++ T Consensus 401 ---~----~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~---g-iis~et---~~~~---l~~v~~-----------~d~ 452 (489) T protein:vir:99 401 ---T----TYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY---G-IVSDQT---IFEI---LNTVTG-----------VDA 452 (489) T ss_pred ---c----cccccccceEEeCCCCCcCHHHHHHHHHHHh---c-cCCHHH---HHHh---cCCCCc-----------hhH Confidence 0 0000011223333322221222222222211 1 111100 0000 001000 000 Q ss_pred hHHHHHHHHHHHHH-HHHHHHHHHHHH--HHHHHHHHHH Q lcl|NC_021540. 610 QAQLEIQIKQLEAQ-ELQMRIAKLQAE--IQLMPYEAQA 645 (705) Q Consensus 610 ~~q~~~q~~q~~~q-~~q~e~~k~qa~--~q~~~~~~q~ 645 (705) +.++++.+.+.. +.+......-.. -+.+....+- T Consensus 453 --~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 453 --EAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred --HHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 000000000000 000000000000 0000000000 No 84 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.50 E-value=3.2e-13 Score=89.05 Aligned_cols=475 Identities=10% Similarity=0.038 Sum_probs=208.6 Q ss_pred Ccchhhhhhcc----------cccccCC-CCCCHHH-HHHHHHHHHHhhHHhhHH-HHHHHHHHHHhccCCCC----C-- Q lcl|NC_021540. 1 MSDINEEFLED----------TVPSLQE-DWKNKPK-VSDLLNDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY----K-- 61 (705) Q Consensus 1 ~~~~~~~~~~~----------~~~~~~~-~~~~~~~-~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~-- 61 (705) |-+|++=-.-- .+..+.. .|..... +..-..++......|... ..+.++..+||.|.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 44443311100 0111111 3433221 111123344444444433 23456788999886321 1 Q ss_pred CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHh Q lcl|NC_021540. 62 PKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVN 141 (705) Q Consensus 62 ~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~ 141 (705) +...+...+++.+...-.|+.....| ||.+.-+. . +|... .++++.++. .|+--......+++++. T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl----~g~p~~~~--~---~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~i 146 (511) T protein:vir:99 81 KEEYMADNRVAHDYASYISDFINGYF----LGNPIQYQ--D---DDKDV----LEAIEAFND-LNDVESHNRSLGLDLSI 146 (511) T ss_pred cccccCcceeecchHHHHHHHHHhhh----cccCceee--c---CchHH----HHHHHHHHh-hcCHhHHHHHHHHHHHh Confidence 11112235688888888888776655 45443332 2 33332 234444432 35444567789999999 Q ss_pred cCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCccc Q lcl|NC_021540. 142 EGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYE 221 (705) Q Consensus 142 ~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 221 (705) .|.+.+.+||+ T Consensus 147 ~G~a~~~vy~d--------------------------------------------------------------------- 157 (511) T protein:vir:99 147 YGKAYELMIRN--------------------------------------------------------------------- 157 (511) T ss_pred cCeeEEEEEeC--------------------------------------------------------------------- Confidence 99998877652 Q ss_pred ccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccc Q lcl|NC_021540. 222 EQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDT 299 (705) Q Consensus 222 ~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~ 299 (705) ..+.|++..++|.++| ||+.... . ...+.+++.+.. ++. T Consensus 158 -------ed~~~~i~~~~p~~~~~vyd~~~~~---~-~~~~vr~~~~~~------~~~---------------------- 198 (511) T protein:vir:99 158 -------QDDETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKP------IDK---------------------- 198 (511) T ss_pred -------CCCceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeee------ccc---------------------- Confidence 0134678889999986 4444221 1 222333331110 000 Q ss_pred cccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECC--EEE---ecccCCCCCCCcceEEeeeeeecCcccCCchHH Q lcl|NC_021540. 300 SFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDD--VMI---RLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE 374 (705) Q Consensus 300 ~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~--~iL---~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~ 374 (705) .....+..+|+|.. +++.. ...-++ ..+ .....|.+.+.+|++.++. ..+|.|.+. T Consensus 199 ------~~~~~~~~~~vyt~-----~~i~~---~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e 259 (511) T protein:vir:99 199 ------TDEDEVFTVDLFTS-----HGVYR---YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYE 259 (511) T ss_pred ------CccceEEEEEEEeC-----CcEEE---EEecCCccccccccccccccCCCCccceEEecC-----CCCCCCchh Confidence 00112334455543 22111 111111 100 1122233336677777653 346899999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCC---------cccccccccccCccch Q lcl|NC_021540. 375 LLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPG---------TNPVTDIIEHKYPELP 444 (705) Q Consensus 375 ~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~---------~~~~~~i~~~~~~~i~ 444 (705) .++++++.+|..+|.+.+.+...++|.+++.... .+..+......++++...+. ......+.++..+.-. T Consensus 260 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~ 339 (511) T protein:vir:99 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDV 339 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCH Confidence 9999999999999999999988888776654322 22222222222333322111 0111223444433334 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEE 524 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~ 524 (705) ......+..+...+...|++++.+.+..++.+|+.| +..+............+.|..+++++++.++.++........ T Consensus 340 ~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~A--lk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:99 340 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDV 417 (511) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc Confidence 556677889999999999999987765433344444 555445555566666777777888877777776644321100 Q ss_pred eEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccc Q lcl|NC_021540. 525 VIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYN 604 (705) Q Consensus 525 ~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~ 604 (705) ..++. ...+..+...+.......+.+..+ .+ .++. .-++. .++... T Consensus 418 -----~~~~~---------~i~i~f~~~~p~n~~e~~~~~~kl---~G-iiS~---et~l~---~l~~v~---------- 463 (511) T protein:vir:99 418 -----SKDFN---------TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQ---TTLMS---LFSFFQ---------- 463 (511) T ss_pred -----ccccc---------cceEEeCCCCCcCHHHHHHHHHHH---hc-cCCH---HHHHH---hCCCCC---------- Confidence 00110 122333333222111222221111 11 1111 00111 111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 605 PEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVE 666 (705) Q Consensus 605 ~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~ 666 (705) ++. .+.++.+.+... .....+.... .+.....-.. .........+..| T Consensus 464 ---D~~--~E~~ri~~E~~~---~~~~~~~~~~-----~~~~~~~~~~-~~~~~~~~~d~~e 511 (511) T protein:vir:99 464 ---DPE--LEVKKIEEDEKE---SIKKAQKNMY-----QDPRNINDDE-QDDSTKDSIDKKE 511 (511) T ss_pred ---CHH--HHHHHHHHHHHH---HHHHHhhccc-----ccCCCCCCCC-CCCCCcCcccccC Confidence 111 111111111000 0000000000 0000000000 0000000000000 No 85 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.49 E-value=6e-13 Score=87.59 Aligned_cols=449 Identities=10% Similarity=0.047 Sum_probs=197.1 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHH-------HHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC---------CC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLL-------NDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP---------KQ 64 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~ 64 (705) |-.+.+=.. ++|- ++.++..|. ..+....+.|...+.+.+++.+||.|.-.... .. T Consensus 1 ~~~~~~~~~--~~~~------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:94 1 MFNIIRMPW--DKPY------GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNI 72 (474) T ss_pred CcccccccC--CCch------hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccc Confidence 322221111 1110 111111111 22333445566666777889999998631110 01 Q ss_pred CCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhc Q lcl|NC_021540. 65 QVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNE 142 (705) Q Consensus 65 ~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~ 142 (705) ..+++ +++.+.....|+.....| ||.+.- |. -+|.. ....++..+ .|+-...+...+++++.. T Consensus 73 ~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p~~--~~---~~d~~----~~~~l~~~~--~n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:94 73 DYDKPDWRITTNFHQNLVDQKVSYV----ASKPVT--YS---CEDEN----VLKVIHDVL--DTRWDNKLIDILTATSNK 137 (474) T ss_pred ccccCcceeecchHHHHHHHHHhhh----hcCCce--ec---cCcHH----HHHHHHHHH--hccHHHHHHHHHHHHhhc Confidence 22333 578888888888776666 554433 32 23333 223444433 355556667788999999 Q ss_pred CCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccc Q lcl|NC_021540. 143 GTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEE 222 (705) Q Consensus 143 g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 222 (705) |.+.+.+|++ T Consensus 138 G~~~~~~~~d---------------------------------------------------------------------- 147 (474) T protein:vir:94 138 GIDWLQVYIN---------------------------------------------------------------------- 147 (474) T ss_pred CceEEEEEec---------------------------------------------------------------------- Confidence 9987776542 Q ss_pred cceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccc Q lcl|NC_021540. 223 QEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFT 302 (705) Q Consensus 223 ~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 302 (705) ..+.|++..++|.++++-.+-. ...+-.++ .+.+... + T Consensus 148 ------~~~~~~i~~~~p~~~~~v~d~~-~~~~~~~~-ir~~~~~----------~------------------------ 185 (474) T protein:vir:94 148 ------ENGEMKLFRVPAEQAIPIWVDK-EREELKSF-IRYYKFN----------N------------------------ 185 (474) T ss_pred ------CCCeeEEEEEcccceEEEEcCC-CCCceEEE-EEEEEec----------C------------------------ Confidence 1234678888999877543211 12222332 2332100 0 Q ss_pred ccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhh Q lcl|NC_021540. 303 FSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLS 377 (705) Q Consensus 303 ~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~ 377 (705) ...+|+|.. +-.++.+.. .....-.+.... ...|.+.+.+|++++.. ..+|.|.+..++ T Consensus 186 --------~~~~~~yt~~~~~~y~~~~~~~~-~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~ 249 (474) T protein:vir:94 186 --------EEKVEFWTDTTVTYYVLENGGLI-PDYYYGANHVQS--HFSNGNWGRVPFIAFKN-----NPEEVSDIWMYK 249 (474) T ss_pred --------eEEEEEEeCCeEEEEEEcCCccc-cccccCcCcccc--cccccCCCccceEEecC-----CcCCCCcHHHHH Confidence 001222211 000111100 000000011111 12222336677777643 457899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhh--hhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 378 DNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNER--KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 378 d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~--~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) ++++.+|.+.+.+.+.+...+.|.+++.....+..... ....+.++.+.+++. +.+...+.-...+...++.+. T Consensus 250 ~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:94 250 SIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYIDLMR 325 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHHHHH Confidence 99999999999999999988888877654333222221 123445566655442 344443333455666788999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+...+++++.+.+.-++.+|+.| +..+............+.|..+++++++.++ +++... .++. T Consensus 326 ~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~----~~~~~~-------~d~~- 391 (474) T protein:vir:94 326 VYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKATVAIQELISFII----DFNNLK-------TDVK- 391 (474) T ss_pred HHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCCC-------cccc- Confidence 9999999998877654433334443 4433333444444445555555555555444 443211 0111 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) ...+..+...+....... +.+.+.+ .++. ..++. .++...+ +. .+. T Consensus 392 --------~i~v~f~~~~p~~~~e~a----~~~~~~g-~iS~---et~l~---~l~~v~D-------------~~--~E~ 437 (474) T protein:vir:94 392 --------DIEISFNFNRMMNDAEQS----QIIAQSQ-YLSR---ETLVK---SSPLVDD-------------YK--AEL 437 (474) T ss_pred --------eeeEEeccCcccCHHHHH----HHHHHcC-CCCH---HHHHH---hCCCCCC-------------HH--HHH Confidence 111222222221111111 1111111 1111 11111 1111111 11 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTE 656 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~e 656 (705) ++.+.+.... ++..............+.........| T Consensus 438 eri~~E~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 438 ERIEQEQMEY----NKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHH----HhhccccCCCCCCCcccCCCCcccccC Confidence 1111111100 000000000000000000000000000 No 86 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.49 E-value=6e-13 Score=87.59 Aligned_cols=449 Identities=10% Similarity=0.047 Sum_probs=197.1 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHH-------HHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC---------CC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLL-------NDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP---------KQ 64 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~ 64 (705) |-.+.+=.. ++|- ++.++..|. ..+....+.|...+.+.+++.+||.|.-.... .. T Consensus 1 ~~~~~~~~~--~~~~------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:97 1 MFNIIRMPW--DKPY------GEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNI 72 (474) T ss_pred CcccccccC--CCch------hhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccc Confidence 322221111 1110 111111111 22333445566666777889999998631110 01 Q ss_pred CCCCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhc Q lcl|NC_021540. 65 QVGRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNE 142 (705) Q Consensus 65 ~~grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~ 142 (705) ..+++ +++.+.....|+.....| ||.+.- |. -+|.. ....++..+ .|+-...+...+++++.. T Consensus 73 ~~~~~~~ki~~n~~k~Ivd~~~~~l----~g~p~~--~~---~~d~~----~~~~l~~~~--~n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:97 73 DYDKPDWRITTNFHQNLVDQKVSYV----ASKPVT--YS---CEDEN----VLKVIHDVL--DTRWDNKLIDILTATSNK 137 (474) T ss_pred ccccCcceeecchHHHHHHHHHhhh----hcCCce--ec---cCcHH----HHHHHHHHH--hccHHHHHHHHHHHHhhc Confidence 22333 578888888888776666 554433 32 23333 223444433 355556667788999999 Q ss_pred CCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccc Q lcl|NC_021540. 143 GTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEE 222 (705) Q Consensus 143 g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 222 (705) |.+.+.+|++ T Consensus 138 G~~~~~~~~d---------------------------------------------------------------------- 147 (474) T protein:vir:97 138 GIDWLQVYIN---------------------------------------------------------------------- 147 (474) T ss_pred CceEEEEEec---------------------------------------------------------------------- Confidence 9987776542 Q ss_pred cceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccc Q lcl|NC_021540. 223 QEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFT 302 (705) Q Consensus 223 ~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 302 (705) ..+.|++..++|.++++-.+-. ...+-.++ .+.+... + T Consensus 148 ------~~~~~~i~~~~p~~~~~v~d~~-~~~~~~~~-ir~~~~~----------~------------------------ 185 (474) T protein:vir:97 148 ------ENGEMKLFRVPAEQAIPIWVDK-EREELKSF-IRYYKFN----------N------------------------ 185 (474) T ss_pred ------CCCeeEEEEEcccceEEEEcCC-CCCceEEE-EEEEEec----------C------------------------ Confidence 1234678888999877543211 12222332 2332100 0 Q ss_pred ccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhh Q lcl|NC_021540. 303 FSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLS 377 (705) Q Consensus 303 ~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~ 377 (705) ...+|+|.. +-.++.+.. .....-.+.... ...|.+.+.+|++++.. ..+|.|.+..++ T Consensus 186 --------~~~~~~yt~~~~~~y~~~~~~~~-~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~ 249 (474) T protein:vir:97 186 --------EEKVEFWTDTTVTYYVLENGGLI-PDYYYGANHVQS--HFSNGNWGRVPFIAFKN-----NPEEVSDIWMYK 249 (474) T ss_pred --------eEEEEEEeCCeEEEEEEcCCccc-cccccCcCcccc--cccccCCCccceEEecC-----CcCCCCcHHHHH Confidence 001222211 000111100 000000011111 12222336677777643 457899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhh--hhcCCcceeecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 378 DNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNER--KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 378 d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~--~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) ++++.+|.+.+.+.+.+...+.|.+++.....+..... ....+.++.+.+++. +.+...+.-...+...++.+. T Consensus 250 ~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:97 250 SIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYIDLMR 325 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHHHHH Confidence 99999999999999999988888877654333222221 123445566655442 344443333455666788999 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+...+++++.+.+.-++.+|+.| +..+............+.|..+++++++.++ +++... .++. T Consensus 326 ~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~----~~~~~~-------~d~~- 391 (474) T protein:vir:97 326 VYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKATVAIQELISFII----DFNNLK-------TDVK- 391 (474) T ss_pred HHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCCC-------cccc- Confidence 9999999998877654433334443 4433333444444445555555555555444 443211 0111 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) ...+..+...+....... +.+.+.+ .++. ..++. .++...+ +. .+. T Consensus 392 --------~i~v~f~~~~p~~~~e~a----~~~~~~g-~iS~---et~l~---~l~~v~D-------------~~--~E~ 437 (474) T protein:vir:97 392 --------DIEISFNFNRMMNDAEQS----QIIAQSQ-YLSR---ETLVK---SSPLVDD-------------YK--AEL 437 (474) T ss_pred --------eeeEEeccCcccCHHHHH----HHHHHcC-CCCH---HHHHH---hCCCCCC-------------HH--HHH Confidence 111222222221111111 1111111 1111 11111 1111111 11 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTE 656 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~e 656 (705) ++.+.+.... ++..............+.........| T Consensus 438 eri~~E~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 438 ERIEQEQMEY----NKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHH----HhhccccCCCCCCCcccCCCCcccccC Confidence 1111111100 000000000000000000000000000 No 87 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.49 E-value=5.5e-12 Score=82.33 Aligned_cols=433 Identities=9% Similarity=0.021 Sum_probs=201.6 Q ss_pred CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--C-------CCCCCCC--cCCCHHHHHHHHHHHHHHHH Q lcl|NC_021540. 21 KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--P-------KQQVGRS--SVQPKLIRKQAEWRYSALSE 89 (705) Q Consensus 21 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-------~~~~grs--~~v~~~v~~~~e~~~~~l~~ 89 (705) .+...|..| .+.|...+.+..+..+||.|.-... + ....+++ +++.+..+..|+.....| T Consensus 1 l~~~~i~~~-------i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl-- 71 (451) T protein:vir:10 1 MELEKIRAI-------ISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYM-- 71 (451) T ss_pred CCHHHHHHH-------HHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhe-- Confidence 223333333 2334445555678889999853110 0 0111222 677888888888776655 Q ss_pred hhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccccc Q lcl|NC_021540. 90 PFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVE 169 (705) Q Consensus 90 ~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~ 169 (705) ||.+.-+.. .+|.+.. ..+++.+ .|+--.......++++.+|.|.+.+|++.... T Consensus 72 --~G~p~~~~~----~~~~~~~----~~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~------------- 126 (451) T protein:vir:10 72 --FTYPVLFDI----DNNKELN----EKVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYS------------- 126 (451) T ss_pred --ecccceeec----CCcHHHH----HHHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcc------------- Confidence 565543332 2333333 3444433 24444455678899999999988877631100 Q ss_pred CCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheee--CC Q lcl|NC_021540. 170 ATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTI--DP 247 (705) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp 247 (705) ......+.+++..++|.++++ |. T Consensus 127 -------------------------------------------------------~~~~~~~~~~~~~i~p~~~~~vydd 151 (451) T protein:vir:10 127 -------------------------------------------------------GEQVTNQTFKYGVVNTEEIIPIYRN 151 (451) T ss_pred -------------------------------------------------------cccccccceeEEEEcccceEEEEcC Confidence 000123466788899999864 33 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCe Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGV 327 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~ 327 (705) +... +..+ +.|++....+- . +......+..+|+|.. +.+ T Consensus 152 ~~~~---~~~~-~ir~~~~~~~~----------------------~----------~~~~~~~~~~~e~yt~-----~~~ 190 (451) T protein:vir:10 152 GIER---ELEA-VIRYYIQLEDV----------------------K----------GQIQKQAYTYVEFWTD-----KIL 190 (451) T ss_pred CCCC---ceEE-EEEEEEeeecc----------------------c----------ccccceEEEEEEEEeC-----CeE Confidence 2221 2223 33333211100 0 0001122334455542 221 Q ss_pred eEEEEE--EEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEee Q lcl|NC_021540. 328 TTPIVA--SWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMS 405 (705) Q Consensus 328 ~~~~~~--~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~ 405 (705) ..+... -..++.++ ....|-+.|.+|++.++. .-.|.|.+..++++++.+|.+.|.+.+.+.-.++|.+++. T Consensus 191 ~~~~~~~~~~~~~~~~-~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~ 264 (451) T protein:vir:10 191 DKYKFFGVSCCGSQIE-HITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILE 264 (451) T ss_pred EEEEecccCccccccc-cccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee Confidence 111100 01122222 222233335666666543 3457899999999999999999999999999998877654 Q ss_pred ccccCc--hhhhhhcCCcceeecCCccc-ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHH Q lcl|NC_021540. 406 KNLLDP--VNERKFKMGEDYKYNPGTNP-VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAG 482 (705) Q Consensus 406 ~~av~~--~d~~~~~pg~~i~~~~~~~~-~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~ 482 (705) ...... .........+++.+.+.... .+.+.++..+.-.......++.+...+...|++++.+.+..+| +|+.| T Consensus 265 g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~A-- 341 (451) T protein:vir:10 265 NFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGN-ASGVA-- 341 (451) T ss_pred cCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc-ccHHH-- Confidence 311111 12233445556666543221 2234555544445666778999999999999998876654443 34444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHH Q lcl|NC_021540. 483 VQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQ 562 (705) Q Consensus 483 i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q 562 (705) +..+............+.|..+++++++.++.++ ... ++ . .+.+..+...+.-.....+ T Consensus 342 lk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~----~~~--------d~-----~----~i~i~f~~~~p~n~~e~~~ 400 (451) T protein:vir:10 342 LKFFYRKLELKSGLLETEFRTSFDKLIKAILYFL----GVT--------DY-----K----KIQQTYTRNMMSNDLEDAD 400 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC--------Cc-----c----ceeEEecCCCCCCHHHHHH Confidence 4444444444555555666666666555555443 210 11 0 1222222222211111111 Q ss_pred HHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 563 ELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYE 642 (705) Q Consensus 563 ~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~ 642 (705) .+..+. + .++. ..+ +..++.... +. ...++...+.+. +..+.+....- .-+ T Consensus 401 ~~~kl~---g-~iS~----et~--~~~~p~v~d-------------~~--~e~~~~~ee~~~---~~~~~~~~~~~-~~~ 451 (451) T protein:vir:10 401 IATKSV---G-IIPT----KII--LRHHPWVDD-------------VE--EAEKLYLEEKKI---QASKVSDDYNN-FTE 451 (451) T ss_pred HHHHHh---c-cCch----HHH--HHhCCCCCC-------------HH--HHHHHHHHHHHH---HHHHHHhhcCC-CCC Confidence 111111 1 1111 110 011111111 11 111111111110 01111111000 000 No 88 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.49 E-value=4.3e-12 Score=82.90 Aligned_cols=473 Identities=12% Similarity=0.073 Sum_probs=207.8 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhH-HhhHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKS-TKDTQVAIIDDWLAQLNVTGAYK-----PKQQVGRSSVQPK 74 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~grs~~v~~ 74 (705) |-+-++..+-.... ++. ...|.+..++.+= -...++.+.+.|..||.|..... .+....|.....+ T Consensus 3 ~~~~~k~~~~~~~~-~~~-------~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~sln 74 (500) T protein:vir:30 3 VIQKIKNLVTRSKY-VMT-------TQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLP 74 (500) T ss_pred hHHHHHHHHHHHHH-Hhh-------cchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecc Confidence 33333333322111 010 0112222222111 22345577889999998763221 1111122222333 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 75 LIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 75 ~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) .-...++. +.+.+|+-.+-+.+ +|.. .+++++.++. .|+-...+..++..|+..|.|++|+||+ T Consensus 75 l~~~i~~~----~A~lv~~e~~~i~~-----~d~~----~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d-- 138 (500) T protein:vir:30 75 IARTAAKK----IASLVFNEQAEIKV-----DDDA----ANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVD-- 138 (500) T ss_pred hHHHHHHH----HhhhhcCCcceEec-----CChH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEe-- Confidence 33333332 23334554444444 3443 4446665543 3455667889999999999999999983 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) .++|+ T Consensus 139 ---------------------------------------------------------------------------~~~~~ 143 (500) T protein:vir:30 139 ---------------------------------------------------------------------------GDKVR 143 (500) T ss_pred ---------------------------------------------------------------------------CCceE Confidence 12356 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh--hhccccccccccccccccccCeEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST--STSSDHYSSDTSFTFSDKARKKIV 312 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~ 312 (705) |++|++..|++=.........|-++++.. .+... ...||.-++.=....+. -+....+..++. +.-...|- T Consensus 144 I~~v~ad~~~P~~~d~~~~~~~a~~~~~~-~~~~~--~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~----~~lG~~v~ 216 (500) T protein:vir:30 144 VAFVQAPVFLPLQSNTQDVSSAAVVIKSV-KTING--KEVYYTLIEFHEWQSSDDYVISNELYRSDDK----AKVGSRVP 216 (500) T ss_pred EEEEcCCeeEEEEEcCCCeEEEEEEEEEe-eeecC--CceEEEEEEEEEEeCCceeEEEEEEEecccc----cccCcccc Confidence 77888888774111111233332222111 11000 00011000000000000 000000000000 00001111 Q ss_pred EEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEe----eeeeecCcccCCchHHHhhHHHHHHHHHHH Q lcl|NC_021540. 313 VYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVV----PYLPVKDSVYGEADAELLSDNQKLIGALTR 388 (705) Q Consensus 313 v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~----~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~ 388 (705) +.++| .+ |..........+.||+.+ +-....++.+|.|++..++++.+.+|..++ T Consensus 217 l~~~~------------------~~---l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s 275 (500) T protein:vir:30 217 LSEVY------------------KD---LKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYD 275 (500) T ss_pred ccccc------------------CC---cCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHH Confidence 11111 10 000000011122234332 222345788999999999999999999999 Q ss_pred HHHHHHHhcCCCcEEeeccccCchhh-h--------hhcCCc-cee-ecCCcccccccccccCccchHHHHHHHHHHHHH Q lcl|NC_021540. 389 GMIDAMARSANGQRGMSKNLLDPVNE-R--------KFKMGE-DYK-YNPGTNPVTDIIEHKYPELPASSYNMLQMFTLE 457 (705) Q Consensus 389 ~~~d~~~~~~~~~~~~~~~av~~~d~-~--------~~~pg~-~i~-~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~ 457 (705) ++.+.+.. +..++.++.+++..+-. . .+.++. +++ ++.+......+...++.-....+...++.+... T Consensus 276 ~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (500) T protein:vir:30 276 EFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSL 354 (500) T ss_pred HHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHH Confidence 99999866 77788888877643211 0 111111 222 222212223455444332234566777777888 Q ss_pred HHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceeEeEecCceee Q lcl|NC_021540. 458 ADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVW--LSDEEVIRITDEEFVQ 535 (705) Q Consensus 458 ~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~--~~~~~~iri~~~~~v~ 535 (705) +....|++.-..|..++. ..||+++....+..-.....+.+.+..+++++.+.++.+..-+ +.... T Consensus 355 i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~----------- 422 (500) T protein:vir:30 355 FEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV----------- 422 (500) T ss_pred HHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----------- Confidence 888899998888876543 3578888876667777778888888889999988888765432 22110 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) +. ...+.+..+++...-.....+..+.+.++. . ++... -+++.-+..+ .++.... T Consensus 423 -~~---~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG-i-~s~~~------~i~~~~g~~e-------------eea~~~l 477 (500) T protein:vir:30 423 -PS---MDNISISLDDGVFTDRDAELDYWIKVVNAG-F-GTREM------AIQKVLNVTE-------------EKAQEIA 477 (500) T ss_pred -CC---CcceEEEeCCCCCCCHHHHHHHHHHHHHcC-C-CCHHH------HHHhcCCCCH-------------HHHHHHH Confidence 00 001223333333322333344444443331 1 11100 0111111110 0111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLM 639 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~ 639 (705) ++.+.+... +.-.......+--+ T Consensus 478 ~~i~~E~~~-~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 478 AEINTGIVD-EINQQRTDTHLYGE 500 (500) T ss_pred HHHHHhccc-cCCCCCccccccCC Confidence 110000000 00000000000000 No 89 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.49 E-value=4.3e-12 Score=82.90 Aligned_cols=473 Identities=12% Similarity=0.073 Sum_probs=207.8 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhH-HhhHHHHHHHHHHHHhccCCCCC-----CCCCCCCCcCCCH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKS-TKDTQVAIIDDWLAQLNVTGAYK-----PKQQVGRSSVQPK 74 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~grs~~v~~ 74 (705) |-+-++..+-.... ++. ...|.+..++.+= -...++.+.+.|..||.|..... .+....|.....+ T Consensus 3 ~~~~~k~~~~~~~~-~~~-------~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~sln 74 (500) T protein:vir:98 3 VIQKIKNLVTRSKY-VMT-------TQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLP 74 (500) T ss_pred hHHHHHHHHHHHHH-Hhh-------cchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecc Confidence 33333333322111 010 0112222222111 22345577889999998763221 1111122222333 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecch Q lcl|NC_021540. 75 LIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLE 154 (705) Q Consensus 75 ~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~ 154 (705) .-...++. +.+.+|+-.+-+.+ +|.. .+++++.++. .|+-...+..++..|+..|.|++|+||+ T Consensus 75 l~~~i~~~----~A~lv~~e~~~i~~-----~d~~----~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d-- 138 (500) T protein:vir:98 75 IARTAAKK----IASLVFNEQAEIKV-----DDDA----ANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVD-- 138 (500) T ss_pred hHHHHHHH----HhhhhcCCcceEec-----CChH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEe-- Confidence 33333332 23334554444444 3443 4446665543 3455667889999999999999999983 Q ss_pred hhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce Q lcl|NC_021540. 155 ETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE 234 (705) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 234 (705) .++|+ T Consensus 139 ---------------------------------------------------------------------------~~~~~ 143 (500) T protein:vir:98 139 ---------------------------------------------------------------------------GDKVR 143 (500) T ss_pred ---------------------------------------------------------------------------CCceE Confidence 12356 Q ss_pred EEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhh--hhccccccccccccccccccCeEE Q lcl|NC_021540. 235 VTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSST--STSSDHYSSDTSFTFSDKARKKIV 312 (705) Q Consensus 235 i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~ 312 (705) |++|++..|++=.........|-++++.. .+... ...||.-++.=....+. -+....+..++. +.-...|- T Consensus 144 I~~v~ad~~~P~~~d~~~~~~~a~~~~~~-~~~~~--~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~----~~lG~~v~ 216 (500) T protein:vir:98 144 VAFVQAPVFLPLQSNTQDVSSAAVVIKSV-KTING--KEVYYTLIEFHEWQSSDDYVISNELYRSDDK----AKVGSRVP 216 (500) T ss_pred EEEEcCCeeEEEEEcCCCeEEEEEEEEEe-eeecC--CceEEEEEEEEEEeCCceeEEEEEEEecccc----cccCcccc Confidence 77888888774111111233332222111 11000 00011000000000000 000000000000 00001111 Q ss_pred EEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEe----eeeeecCcccCCchHHHhhHHHHHHHHHHH Q lcl|NC_021540. 313 VYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVV----PYLPVKDSVYGEADAELLSDNQKLIGALTR 388 (705) Q Consensus 313 v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~----~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~ 388 (705) +.++| .+ |..........+.||+.+ +-....++.+|.|++..++++.+.+|..++ T Consensus 217 l~~~~------------------~~---l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s 275 (500) T protein:vir:98 217 LSEVY------------------KD---LKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYD 275 (500) T ss_pred ccccc------------------CC---cCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHH Confidence 11111 10 000000011122234332 222345788999999999999999999999 Q ss_pred HHHHHHHhcCCCcEEeeccccCchhh-h--------hhcCCc-cee-ecCCcccccccccccCccchHHHHHHHHHHHHH Q lcl|NC_021540. 389 GMIDAMARSANGQRGMSKNLLDPVNE-R--------KFKMGE-DYK-YNPGTNPVTDIIEHKYPELPASSYNMLQMFTLE 457 (705) Q Consensus 389 ~~~d~~~~~~~~~~~~~~~av~~~d~-~--------~~~pg~-~i~-~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~ 457 (705) ++.+.+.. +..++.++.+++..+-. . .+.++. +++ ++.+......+...++.-....+...++.+... T Consensus 276 ~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (500) T protein:vir:98 276 EFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSL 354 (500) T ss_pred HHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHH Confidence 99999866 77788888877643211 0 111111 222 222212223455444332234566777777888 Q ss_pred HHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceeEeEecCceee Q lcl|NC_021540. 458 ADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVW--LSDEEVIRITDEEFVQ 535 (705) Q Consensus 458 ~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~--~~~~~~iri~~~~~v~ 535 (705) +....|++.-..|..++. ..||+++....+..-.....+.+.+..+++++.+.++.+..-+ +.... T Consensus 355 i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~----------- 422 (500) T protein:vir:98 355 FEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEV----------- 422 (500) T ss_pred HHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----------- Confidence 888899998888876543 3578888876667777778888888889999988888765432 22110 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHH Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEI 615 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~ 615 (705) +. ...+.+..+++...-.....+..+.+.++. . ++... -+++.-+..+ .++.... T Consensus 423 -~~---~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG-i-~s~~~------~i~~~~g~~e-------------eea~~~l 477 (500) T protein:vir:98 423 -PS---MDNISISLDDGVFTDRDAELDYWIKVVNAG-F-GTREM------AIQKVLNVTE-------------EKAQEIA 477 (500) T ss_pred -CC---CcceEEEeCCCCCCCHHHHHHHHHHHHHcC-C-CCHHH------HHHhcCCCCH-------------HHHHHHH Confidence 00 001223333333322333344444443331 1 11100 0111111110 0111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLM 639 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~ 639 (705) ++.+.+... +.-.......+--+ T Consensus 478 ~~i~~E~~~-~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 478 AEINTGIVD-EINQQRTDTHLYGE 500 (500) T ss_pred HHHHHhccc-cCCCCCccccccCC Confidence 110000000 00000000000000 No 90 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.48 E-value=4.2e-12 Score=82.97 Aligned_cols=440 Identities=9% Similarity=0.021 Sum_probs=200.5 Q ss_pred HHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CC---------CC--CCCC--CcCCCHHHHHHHHHHHHHHH Q lcl|NC_021540. 24 PKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KP---------KQ--QVGR--SSVQPKLIRKQAEWRYSALS 88 (705) Q Consensus 24 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---------~~--~~gr--s~~v~~~v~~~~e~~~~~l~ 88 (705) =.+..|++.++.....+...+.+.++-.+||.|.-.. .+ .. ..++ .+++.+.....|+.....| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl- 79 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYV- 79 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhe- Confidence 2334455555555556666666677788999985211 00 00 1111 2466666666666655444 Q ss_pred HhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccc Q lcl|NC_021540. 89 EPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYV 168 (705) Q Consensus 89 ~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~ 168 (705) ||.+.-+. .+|....+...++++. +-...+....++++.+|.+.+.+||+ T Consensus 80 ---~G~p~~~~-----~~d~~~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d---------------- 129 (470) T protein:vir:10 80 ---ASVFPDID-----VGKDADNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWID---------------- 129 (470) T ss_pred ---eccceeee-----cCchHHHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEec---------------- Confidence 66553332 2444444444444432 22344556778899999998877652 Q ss_pred cCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCC Q lcl|NC_021540. 169 EATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPT 248 (705) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~ 248 (705) ..+.+++..++|.++++=.+ T Consensus 130 ------------------------------------------------------------~~~~~~~~~~~p~~~~~v~d 149 (470) T protein:vir:10 130 ------------------------------------------------------------EDGNFRYGIIQPDQITPIYA 149 (470) T ss_pred ------------------------------------------------------------CCCceEEEEEcccceEEEEc Confidence 01346778889988775322 Q ss_pred ccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEE-----eeec Q lcl|NC_021540. 249 CNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGY-----WDID 323 (705) Q Consensus 249 a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k-----~~~~ 323 (705) -. ......+ +.+++.+.+. +. ...+..+|+|.. +-.. T Consensus 150 ~~-~~~~~~a-~ir~y~~~~~------~~------------------------------~~~~~~~e~yt~~~~~~~~~~ 191 (470) T protein:vir:10 150 TT-LDNKLLG-ILRSYKQLDP------DS------------------------------GKYFTVHEYWTDKEAQFFRTN 191 (470) T ss_pred CC-CCCceEE-EEEEEEeeec------CC------------------------------ceEEEEEEEEcCCcEEEEEee Confidence 11 1112222 2233322100 00 011223343321 0000 Q ss_pred CCC--eeEEEE-EEE----ECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 324 GSG--VTTPIV-ASW----VDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMAR 396 (705) Q Consensus 324 ~dg--~~~~~~-~~~----~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~ 396 (705) +.+ ..+.+. .+. .+...-..+..|.+.+.+|++.++- +-+|.|.+..++++++.+|.+.|.+.+.+.. T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 266 (470) T protein:vir:10 192 ATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFINDLDD 266 (470) T ss_pred cCcceeccccccccccccccccccccccccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 110 000000 000 0000011122222235566665553 3468999999999999999999999999999 Q ss_pred cCCCcEEeeccccCc-hhh-hhhcCCcceeecCCcc-cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCc Q lcl|NC_021540. 397 SANGQRGMSKNLLDP-VNE-RKFKMGEDYKYNPGTN-PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTG 473 (705) Q Consensus 397 ~~~~~~~~~~~av~~-~d~-~~~~pg~~i~~~~~~~-~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~ 473 (705) .++|.+++.....+. .+. ......+.+.++.... ....+.++..+.-.......++.+...+-..+++++.+.+..+ T Consensus 267 ~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~g 346 (470) T protein:vir:10 267 VQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESS 346 (470) T ss_pred hcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccc Confidence 998888775433322 121 2234445555543222 1223455554444556677889999999999999988776543 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccc Q lcl|NC_021540. 474 DSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISN 553 (705) Q Consensus 474 ~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~ 553 (705) + +|+.| +..+............+.|..+++++++.++.++.. .+.++. ...+..+... T Consensus 347 n-~Sg~A--lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~----------~~~d~~---------~i~i~f~~~~ 404 (470) T protein:vir:10 347 N-ASGVA--IKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF----------SDADKR---------HISQHWTRTK 404 (470) T ss_pred c-chHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------cCcccc---------eeeEEeccCC Confidence 2 34444 555555566666666666666666666665544321 111111 1122222222 Q ss_pred hhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 554 AETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQ 633 (705) Q Consensus 554 ~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~q 633 (705) +.-.....+.+..+ ...++. .-++. .++.... + +.+.++.+++.++......+.. T Consensus 405 p~d~~e~~~~~~~~----~g~iS~---et~l~---~~p~v~D-------------~--~~E~eri~~E~~e~~~~~~~~~ 459 (470) T protein:vir:10 405 VEDSLTKAQIVSTV----ANYSSK---EAVAK---ANPIVDD-------------W--QQELKDLAKDKEENDPYSNQAD 459 (470) T ss_pred CCCHHHHHHHHHHH----hccCcH---HHHHH---hCCCCCC-------------H--HHHHHHHHHHHHHHHHhhcccc Confidence 21111111111111 111110 00010 0111111 1 1111111111111100000000 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_021540. 634 AEIQLMPYEAQAEA 647 (705) Q Consensus 634 a~~q~~~~~~q~e~ 647 (705) .. .....--++ T Consensus 460 -~~--~~~~~dde~ 470 (470) T protein:vir:10 460 -EL--NGKGVNDEQ 470 (470) T ss_pred -cc--CCCCCCCCC Confidence 00 000000000 No 91 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.48 E-value=7.1e-12 Score=81.70 Aligned_cols=448 Identities=12% Similarity=0.099 Sum_probs=199.6 Q ss_pred Ccchhhhhhcc---ccccc---CCCC-------CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCC- Q lcl|NC_021540. 1 MSDINEEFLED---TVPSL---QEDW-------KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQ- 64 (705) Q Consensus 1 ~~~~~~~~~~~---~~~~~---~~~~-------~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~- 64 (705) .|+|-.--+.. --|.. ...+ ++.. .+...+......|.+.+.+.++..+||.|..... +.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~ 83 (492) T protein:vir:94 7 ISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPE---TLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHhcCCceeecCccchhhhhhcccccCCchh---hHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Confidence 23322211100 00100 0011 1111 1222233334445566666778889999863111 111 Q ss_pred ------CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHH Q lcl|NC_021540. 65 ------QVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMV 136 (705) Q Consensus 65 ------~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~ 136 (705) .+.| .+++.+..+..|+.....| ||.+.-+. .+|.+.. ++++..+. |+-...+...+ T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl----~G~p~~~~-----~~d~~~~----~~l~~~~~--n~~~~~~~~~~ 148 (492) T protein:vir:94 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYI----VGKPIAFK-----HTDDEVV----KRIDEVLG--NRFDDKLHSVL 148 (492) T ss_pred cccccccccccccccccchHHHHHHHHHhhh----cccCceec-----cCchHHH----HHHHHHHh--ccHHHHHHHHH Confidence 1122 4678888888888777655 55553332 2343332 34444433 44445566788 Q ss_pred HHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceec Q lcl|NC_021540. 137 RTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAI 216 (705) Q Consensus 137 ~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 216 (705) ++++.+|.|.+.+|++ T Consensus 149 ~~a~~~G~a~~~v~~d---------------------------------------------------------------- 164 (492) T protein:vir:94 149 TGASNKGIEWLHPYLD---------------------------------------------------------------- 164 (492) T ss_pred HHHhhCCeEEEEEEec---------------------------------------------------------------- Confidence 9999999998776542 Q ss_pred cCcccccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccc Q lcl|NC_021540. 217 INGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDH 294 (705) Q Consensus 217 ~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~ 294 (705) ..+.|++..++|.+++ ||++....+ .+ +.+.+.... . T Consensus 165 ------------~dg~~~~~~~~p~~~~~v~d~~~~~~~---~a-~ir~~~~~~----------~--------------- 203 (492) T protein:vir:94 165 ------------EEGEFKLFRVPAEQGIPIWTDKEHEEL---EA-FIRMYKLEN----------E--------------- 203 (492) T ss_pred ------------CCCceEEEEEcccceEEEEcCCCCCce---EE-EEEEEeecc----------c--------------- Confidence 0234678888999965 444432222 22 233331100 0 Q ss_pred ccccccccccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccC Q lcl|NC_021540. 295 YSSDTSFTFSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYG 369 (705) Q Consensus 295 ~~~~~~~~~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g 369 (705) ..+ |+|.. +...+++... ......+.......++| .+.+|+++++. +-+| T Consensus 204 --------------~~~---~~y~~~~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~--~g~vPvv~~~n-----n~~~ 258 (492) T protein:vir:94 204 --------------TKV---EYWDKVTVNYYVYENGSLIP-DYSNNLENSKTHFSTGS--WGKIPFIPFKN-----NDLE 258 (492) T ss_pred --------------eeE---EEEecCeEEEEEEecCeeee-ccccccccccccccccC--CCccceEEecC-----CCCC Confidence 001 11111 0011111100 00000112222333343 36778877653 3468 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-chhh--hhhcCCcceeecCCcccccccccccCccchHH Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLD-PVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPAS 446 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~-~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~ 446 (705) .|.+..++++++.+|.+.|.+.+.+...+.|.+++. |.-. .... ......+++.+..++. +.++..+.-... T Consensus 259 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~ 333 (492) T protein:vir:94 259 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK-NYDDQELPEFKRLLRYYGAIKVSDNGG----VDTIQVEVPVEN 333 (492) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCcccchhhHHHHhhccceecCCCCc----ceeEeccCCHHH Confidence 999999999999999999999999999888877653 3221 1111 1223344555544332 334333333345 Q ss_pred HHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 447 SYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 447 ~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ....++.+...+...+++++.+.+.-++.+|+.| +...............+.|..+++++++.++.++.... T Consensus 334 ~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~------ 405 (492) T protein:vir:94 334 SKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------ 405 (492) T ss_pred HHHHHHHHHHHHHHHhCCcCCCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc------ Confidence 6667889999999999998877765444444444 44433444455566666666666666666555432110 Q ss_pred eEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhccccc Q lcl|NC_021540. 527 RITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPE 606 (705) Q Consensus 527 ri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q 606 (705) ++. ++.+..+...+.......+.+..+. + .++.. -++. .+....+... T Consensus 406 -----~~~---------~i~v~f~~~~p~~~~e~~~~~~kl~---g-iiS~e---t~~~---~l~~v~d~~~-------- 453 (492) T protein:vir:94 406 -----EHK---------DVDISFNYNKVANTELQVQTAQQSM---G-IVSHE---TVLE---NHPFVEDLQA-------- 453 (492) T ss_pred -----ccc---------eeeEEecCCCCCCHHHHHHHHHHHh---c-cCchH---HHHH---hCCCCCCHHH-------- Confidence 111 1223333332221112222211111 1 11110 0110 1111111111 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHH---HHHHH-HHHHHHHHHHH Q lcl|NC_021540. 607 PSPQAQLEIQIKQLEAQELQMRIAKL---QAEIQ-LMPYEAQAEAA 648 (705) Q Consensus 607 ~~~~~q~~~q~~q~~~q~~q~e~~k~---qa~~q-~~~~~~q~e~a 648 (705) +.++.+.+.++........ ...-. .....-+.+.. T Consensus 454 -------E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 454 -------ELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred -------HHHHHHHHHHHHHhhccccccccCCCCccccCCccccCC Confidence 1111111100000000000 00000 00000000000 No 92 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.48 E-value=2.1e-12 Score=84.63 Aligned_cols=487 Identities=10% Similarity=0.003 Sum_probs=211.7 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCC--CCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQ--QVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~grs~~v~~~v 76 (705) |+=+++..+.++ +++.+ ...|.+-+ +.|...+.+.++..+||.|.-.. .+.. .+...+++.+.. T Consensus 1 ~~~~~~~~~~~~----~~~~~----~~~i~~~i----~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~ 68 (499) T protein:vir:10 1 MAVVIDKDLLDD----VNEPN----IEAINYAI----RELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHA 68 (499) T ss_pred CccchhhhHHhh----hhcCC----HHHHHHHH----HHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchH Confidence 776666665433 22222 22222222 23445556667788999986211 1111 223456777777 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +..|+.....| ||.+.-+. + +|.+..+. ++.++ ..|+--..+..+.++++..|.+.+.+|++..-. T Consensus 69 ~~Iv~~~~~~l----~g~p~~~~--~---~~~~~~~~----l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~ 134 (499) T protein:vir:10 69 KYITDMNVGFM----TGNPVKYV--A---EKGKNIDD----ILEVF-NQIDIHKHDIELEKDLSVFGYGYELLYLKKTDP 134 (499) T ss_pred HHHHHHHhhhh----cccCceee--c---CChhHHHH----HHHHH-hhcCHhHHHHHHHHHHHhcCceEEEEEeccccc Confidence 77777766544 66553333 2 23333332 33322 234333457789999999999988877631100 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEE Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVT 236 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~ 236 (705) -. ... ............+++. T Consensus 135 ~~-------------------------------------------------------~~~----~~~~~~~~~~~~~~~~ 155 (499) T protein:vir:10 135 IS-------------------------------------------------------VRD----ELGNEKLTPNTELKIE 155 (499) T ss_pred cc-------------------------------------------------------ccc----cccccccccccceEEE Confidence 00 000 0000001123456888 Q ss_pred EechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEE Q lcl|NC_021540. 237 ICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEY 316 (705) Q Consensus 237 ~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~ 316 (705) .|+|.+++.=.+.. ...-...+.+.+.+.+. . ....+..+|+ T Consensus 156 ~v~p~~~~~v~~d~--~~~~~~~~i~~~~~~~~------------------------------~------~~~~~~~~~i 197 (499) T protein:vir:10 156 VIDPRATVVVCDDT--VEHDPLFAVFTQEKKDL------------------------------E------GNTNGYSITV 197 (499) T ss_pred EEcccceEEEecCC--CCcceEEEEEEEEEeec------------------------------C------CCceEEEEEE Confidence 99999865422211 11111222233211000 0 0112334455 Q ss_pred EEEeeecCCCeeEEEE----EEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 317 WGYWDIDGSGVTTPIV----ASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 317 w~k~~~~~dg~~~~~~----~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) |.. +.+..+.. ....+..++...++|| |.+|++++.. +.+|.|.+..++++++.+|...|.+.+ T Consensus 198 yt~-----~~i~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~~~~ 265 (499) T protein:vir:10 198 YMP-----QRIVEYRTKTTMEVSANDPIVYDGENLF--GAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTDRIS 265 (499) T ss_pred EeC-----CeEEEEEecCCccccCcceecccccCCC--CccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHH Confidence 542 11111000 0001223333444444 6778777653 456899999999999999999999999 Q ss_pred HHHhcCCCcEEeeccccCch--hhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPV--NERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQG 470 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~--d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G 470 (705) .+...+.|.+++.-..++.. .......+.++.+..+.. ..+.++..+.-...+...++.+...|...|++++.+.+ T Consensus 266 ~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 343 (499) T protein:vir:10 266 DKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEG--ADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDE 343 (499) T ss_pred HHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCC--CcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCch Confidence 99999988887753333221 122334555555443222 12344444433456667789999999999998877655 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEee Q lcl|NC_021540. 471 LTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLS 550 (705) Q Consensus 471 ~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~ 550 (705) .-++.+|+.| +..+............+.|..+++++++.++.++.-.... .++ . ...+..+ T Consensus 344 ~~~gn~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--------~d~-----~----~i~i~f~ 404 (499) T protein:vir:10 344 KFMGNVSGEA--MKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGAN--------DDA-----S----GCKISLV 404 (499) T ss_pred hhcccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc--------ccc-----c----cceEEeC Confidence 4333344544 4444455555566666667667666666666554311110 011 0 1223233 Q ss_pred ccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 551 ISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIA 630 (705) Q Consensus 551 ~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~ 630 (705) ...+.......+.+..+. + .++. .-++. .++........++. ...++.+......+.. T Consensus 405 ~~~p~n~~e~~~~~~kl~---g-~iS~---et~~~---~l~~v~d~~~E~~r------------i~~E~~~~~~~~~~~~ 462 (499) T protein:vir:10 405 ANIPSNLSDVVNNVKNAD---G-IIPR---KYTYS---WLPDVDNPQDVIDE------------MNQQDAETIKKNQEAL 462 (499) T ss_pred CCCCCCHHHHHHHHHHHh---c-cCCh---HHHHH---hCCCCCCHHHHHHH------------HHHHHHHHHHHHHhhh Confidence 222221112222222211 1 1111 00110 11111111100100 0000000000000000 Q ss_pred HHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 631 KLQAEIQLMPYEAQA-EAAK-ARKANTEADLNTLDFVEQETGV 671 (705) Q Consensus 631 k~qa~~q~~~~~~q~-e~a~-a~~~~~ea~~~~~~~~~q~~~~ 671 (705) ..+..........+. .+.. -... ....+. .|-.++ T Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~----~~~~~~ 499 (499) T protein:vir:10 463 RGQDPDRLELEDKQDDSSENDKEAG--SNHNQS----HRTRAV 499 (499) T ss_pred ccCCCCCCCCCCCCcccCCCCCCCc--cccccC----CCCCCC Confidence 000000000000000 0000 0000 000000 000000 No 93 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.47 E-value=2.1e-12 Score=84.65 Aligned_cols=393 Identities=12% Similarity=0.031 Sum_probs=192.0 Q ss_pred CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC------CCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 21 KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY------KPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLND 94 (705) Q Consensus 21 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~ 94 (705) -+..+|..|.+.+..- ..+.+...+||.|.... .|+..+.+-+.|.+-....|+.+...| T Consensus 1 ~~~~~i~~L~~~~~~~-------~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl------- 66 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVH-------KRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL------- 66 (409) T ss_pred CCHHHHHHHHHHHHHH-------hHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhc------- Confidence 2334677776665442 23344566899986432 122221223344455555555543322 Q ss_pred CCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchh Q lcl|NC_021540. 95 ENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGES 174 (705) Q Consensus 95 ~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~ 174 (705) .|...+..|.. +..+ +..|+--.....+.++||+.|.+++.|+ . T Consensus 67 ----~~~Gf~~~d~~--------l~~i-~~~N~ld~~~~~~~~~al~yG~sf~~v~-~---------------------- 110 (409) T protein:vir:16 67 ----VFREFENDDFT--------VNEI-FEENNPDIFFDSTVLSALIASCSFTYIS-K---------------------- 110 (409) T ss_pred ----ccccccCcchH--------HHHH-HHhcChhHHHHHHHHHHHHhCceeEEEe-c---------------------- Confidence 12222333422 2223 3344433445678888888888877653 0 Q ss_pred HHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCccCC Q lcl|NC_021540. 175 IDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTCNGN 252 (705) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d 252 (705) ...+.|+|..++|.+++ |||... . T Consensus 111 -----------------------------------------------------~~dg~~~i~~~sP~~~~~i~D~~~~-~ 136 (409) T protein:vir:16 111 -----------------------------------------------------GENDAVRLQVIEATNATGIIDPITG-L 136 (409) T ss_pred -----------------------------------------------------CCCCceEEEEEcccceEEEeecccc-c Confidence 00234567788887754 455322 1 Q ss_pred hhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEE Q lcl|NC_021540. 253 LDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIV 332 (705) Q Consensus 253 ~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~ 332 (705) +. .+.+++ +.+.. . . ...+.+|.. +. . + T Consensus 137 ~~----~a~~~~-----------~~d~~-------------~--------------~-~~~~~~~~~-----~~-~---~ 164 (409) T protein:vir:16 137 LT----EGYAVL-----------ERDEN-------------N--------------N-VVLEAHFLP-----DR-T---D 164 (409) T ss_pred ce----eeeEEE-----------EecCC-------------C--------------c-eEEEEEEec-----Cc-E---E Confidence 11 111111 00000 0 0 001111110 00 0 0 Q ss_pred EEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--- Q lcl|NC_021540. 333 ASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL--- 408 (705) Q Consensus 333 ~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a--- 408 (705) .++-++..-...++|+ |.+|+|+|+..++.+..+|.|-+ +.++++|+.+|+.+..+.......+.|+..+- |+ T Consensus 165 ~~~~~~~~~~~~~~~~--g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d 241 (409) T protein:vir:16 165 YYYRDSRNNISIANPT--GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-GLSDD 241 (409) T ss_pred EEEecCccccceecCC--CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-ecCCC Confidence 0001111112235555 78999999999999999998865 78999999999999999999999999987652 22 Q ss_pred cCchhhhhhcCCcceeecCCcc-cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHH Q lcl|NC_021540. 409 LDPVNERKFKMGEDYKYNPGTN-PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVI 487 (705) Q Consensus 409 v~~~d~~~~~pg~~i~~~~~~~-~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~ 487 (705) .+..+.+...++.++.+....+ ....+..++..++. .+...+..+...+-.+||++...+|..... +.+|.++.... T Consensus 242 ~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~N-psSa~Ai~a~~ 319 (409) T protein:vir:16 242 AEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAGETGLTLDDLGFVSDN-PSSVEAIKASH 319 (409) T ss_pred CCccchhhhhhhHhhccCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhhhcCCCHHHcccccCc-hhHHHHHHHHH Confidence 1233456666777777653322 11223334444443 345556666666667889999999965431 23455555433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeecc---chhHHHHHHHHH Q lcl|NC_021540. 488 GASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS---NAETDAIKAQEL 564 (705) Q Consensus 488 ~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~---~~~~~~~~~q~~ 564 (705) ..-........+.|..++++++++++.+.-..-.. ++.+. +..+.-... ...+-.+....+ T Consensus 320 ~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~---------------~~~~~-~~~v~W~~~~~~~~~s~a~~aDa~ 383 (409) T protein:vir:16 320 ENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYL---------------REQFS-KTKPKWEPLFEADASMLSLIGDGA 383 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---------------chhhc-cceEEecCCCCcchhhHHHHHHHH Confidence 33333334445556666666666666543322110 01000 111111110 011112233334 Q ss_pred HHHHHHHhhhchhHHHHHHHHHHHhhhccchhh Q lcl|NC_021540. 565 SFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLS 597 (705) Q Consensus 565 ~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~ 597 (705) .-|.++.....+. .+. .+..|+.... T Consensus 384 ~Kl~~a~~~~~~~----~v~---~~~~g~~~~d 409 (409) T protein:vir:16 384 IKLNQAIPEFINK----DTI---RDLTGIKGAE 409 (409) T ss_pred HHHHhhcccccch----hHH---HHhccCCCCC Confidence 4444443221111 111 2333443322 No 94 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.46 E-value=1e-11 Score=80.83 Aligned_cols=456 Identities=10% Similarity=0.032 Sum_probs=203.5 Q ss_pred Ccchhhhhhcc--cccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCC-------CCC- Q lcl|NC_021540. 1 MSDINEEFLED--TVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQ-------VGR- 68 (705) Q Consensus 1 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-------~gr- 68 (705) |++|.=...+. +.+-..-.=.++.+-..|++.++ .|...+.+.++..+||.|...-. +.+. +.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~----~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLIN----DHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHH----HHHHHHHHHHHHHHHhccCCcchhccchhccccccccccc Confidence 77773221100 00000001111223333333333 34455666778889999874211 1111 112 Q ss_pred -CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEE Q lcl|NC_021540. 69 -SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIF 147 (705) Q Consensus 69 -s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~ 147 (705) .+++.+..+..|+.....| ||.+.-+ .+ +|.+..+ .++..+. ++-...+...+++++..|.+.+ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l----~g~p~~~--~~---~d~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~ 141 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYA----VANPVTF--SS---DDDKSLK----TIQEVLN--HKWDDKLVDILTAASNKGIEWL 141 (474) T ss_pred chhcccchHHHHHHhhhhhh----cccCcee--ec---CchHHHH----HHHHHHh--cCHHHHHHHHHHHHHhcCeeEE Confidence 2467777777777776655 6655433 22 3443333 3444432 4445556678889999999988 Q ss_pred EEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceee Q lcl|NC_021540. 148 RTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIK 227 (705) Q Consensus 148 k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 227 (705) .+||+ T Consensus 142 ~~y~d--------------------------------------------------------------------------- 146 (474) T protein:vir:96 142 QPYID--------------------------------------------------------------------------- 146 (474) T ss_pred EEEec--------------------------------------------------------------------------- Confidence 77652 Q ss_pred eccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccc Q lcl|NC_021540. 228 TVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKA 307 (705) Q Consensus 228 ~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (705) ..+++++..++|.++++-.+.. ...+..+ +.+.+.... ..+ ..-+ . T Consensus 147 -~~~~~~i~~~~p~~~~~v~d~~-~~~~~~~-~vr~~~~~~----------~~~------------~~~y---------t 192 (474) T protein:vir:96 147 -ENGEFKTFRVPAEQAIPIWTNK-ERDTLKA-FIRYYRLDG----------AER------------VEYW---------T 192 (474) T ss_pred -CCCceEEEEEcccceEEEEcCC-CCCceEE-EEEEEeecC----------ceE------------EEEE---------e Confidence 0234678889999988432211 1223333 333331100 000 0000 0 Q ss_pred cCeEEEEEEEEEeeecCCCeeEEEE--E-EEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHH Q lcl|NC_021540. 308 RKKIVVYEYWGYWDIDGSGVTTPIV--A-SWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIG 384 (705) Q Consensus 308 ~~~v~v~E~w~k~~~~~dg~~~~~~--~-~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN 384 (705) ..+|. +|.. .+.+...... . ...++.+ ....|.+.+.+|++.++. ..+|.|.+..++++++.+| T Consensus 193 ~~~v~---~~~~---~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d 259 (474) T protein:vir:96 193 DSDVT---YYEY---QDGILIPDYYHGEEHIQSHYY--VGNKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMD 259 (474) T ss_pred CCeEE---EEEe---cCCceeecccccccccccccc--ccccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHH Confidence 00111 1111 1111100000 0 0001111 123344457788887764 4568999999999999999 Q ss_pred HHHHHHHHHHHhcCCCcEEeeccccCchhh--hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 385 ALTRGMIDAMARSANGQRGMSKNLLDPVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALS 462 (705) Q Consensus 385 ~~~~~~~d~~~~~~~~~~~~~~~av~~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~t 462 (705) ...|.+.+.+...+.|.+++.......... .....++++.+.+.. +.+.++..+.-.......++.+...+-..| T Consensus 260 ~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 336 (474) T protein:vir:96 260 KRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDG---SGVDTIQIEVPVQSSKEYLDMLRDYVIEFG 336 (474) T ss_pred HHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCC---CceeEEeecCChHHHHHHHHHHHHHHHHHh Confidence 999999999999998877654322222111 233455666665321 123454444334566677899999999999 Q ss_pred CcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcc Q lcl|NC_021540. 463 GVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLV 542 (705) Q Consensus 463 Gv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~ 542 (705) ++++.+.+..++.+|+.| +...............+.|..+++++++.++.+.-..++ + . T Consensus 337 ~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~-----------~-----~--- 395 (474) T protein:vir:96 337 QGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIK-----------V-----Q--- 395 (474) T ss_pred CCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcc-----------c-----c--- Confidence 999887765444444444 444444444555555666666666666665554311111 1 0 Q ss_pred cceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHH Q lcl|NC_021540. 543 GSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEA 622 (705) Q Consensus 543 ~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~ 622 (705) ...+..+...+.......+ ++.+.+ .++ ...++. .++...+.. .+.++.+.+. T Consensus 396 -~i~i~f~~~~p~~~~e~~~----~~~~ag-~iS---~et~~~---~~~~v~d~~---------------~E~~ri~~E~ 448 (474) T protein:vir:96 396 -DVEITFNFNVMVNELEQSQ----IGVQSQ-YLS---KETVVT---NHPWVDDPV---------------AELERIEQDN 448 (474) T ss_pred -eeeEEeccCCCcCHHHHHH----HHHhcC-CCc---hHHHHH---hCCCCCCHH---------------HHHHHHHHHH Confidence 1122222222211111111 111111 111 001111 111111111 1111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLMPYEAQAEAA 648 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~~~~~q~e~a 648 (705) .+.......+..+.--....-+.+-- T Consensus 449 ~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 449 IDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHHhcccccccccccccCCCcccCC Confidence 00000000000000000000000000 No 95 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.46 E-value=1.1e-11 Score=80.77 Aligned_cols=466 Identities=10% Similarity=-0.004 Sum_probs=210.8 Q ss_pred Ccchhhh-----hhcc-----cccccCCCCCCHHHHHHHH---HHHHHhhHHhhHH-HHHHHHHHHHhccCCCC----CC Q lcl|NC_021540. 1 MSDINEE-----FLED-----TVPSLQEDWKNKPKVSDLL---NDFNNAKSTKDTQ-VAIIDDWLAQLNVTGAY----KP 62 (705) Q Consensus 1 ~~~~~~~-----~~~~-----~~~~~~~~~~~~~~~~~l~---~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~ 62 (705) |-+|+|= ...+ .+-+++++= ..++-..+. .++......|... ..+.++..+||.|.-.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~ 79 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYT-YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccc-cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 8777742 1111 122223210 011112222 2233334444332 23456778899986422 11 Q ss_pred CCCCC--CCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHH Q lcl|NC_021540. 63 KQQVG--RSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAV 140 (705) Q Consensus 63 ~~~~g--rs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al 140 (705) ...++ ..+++.+...-.|+.....| ||.+.- |.+ +|.. ..++++.++. .|+--.......++++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Ivd~~~~yl----~g~p~~--~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~ 145 (512) T protein:vir:97 80 RKEEYMADNRVAHDYASYISDFINGYF----LGNPIQ--CQD---DDKD----VLEAIEAFND-LNDVESHNRSLGLDLS 145 (512) T ss_pred ccccccCcceeecchHHHHHHHHhhhh----cccCce--ecc---CChH----HHHHHHHHHh-hcCHHHHHHHHHHHHH Confidence 11222 35678888888888776655 443333 322 3332 2234555432 3544456778999999 Q ss_pred hcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcc Q lcl|NC_021540. 141 NEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGY 220 (705) Q Consensus 141 ~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 220 (705) ..|.+.+.+|++ T Consensus 146 i~G~ay~~vy~d-------------------------------------------------------------------- 157 (512) T protein:vir:97 146 IYGKAYELMIRN-------------------------------------------------------------------- 157 (512) T ss_pred hcCeEEEEEEeC-------------------------------------------------------------------- Confidence 999998776652 Q ss_pred cccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccc Q lcl|NC_021540. 221 EEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSD 298 (705) Q Consensus 221 ~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~ 298 (705) ..+.|++..++|.+++. |++... . ...+.+++.+.. ++. T Consensus 158 --------ed~~~~i~~~~p~~~~~iyd~~~~~---~-~~~~vr~~~~~~------~~~--------------------- 198 (512) T protein:vir:97 158 --------QDDETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKP------IDK--------------------- 198 (512) T ss_pred --------CCCceEEEEEcccceEEEEcCCCCC---c-eEEEEEEEEeee------ccc--------------------- Confidence 02346788899999774 444321 1 223333332110 000 Q ss_pred ccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCE-----EEecccCCCCCCCcceEEeeeeeecCcccCCchH Q lcl|NC_021540. 299 TSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDV-----MIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA 373 (705) Q Consensus 299 ~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~-----iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~ 373 (705) .....+..+|+|.. +++ ++....++. .....+.|.+.+.+|+++++. ..+|.|.+ T Consensus 199 -------~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~gd~ 258 (512) T protein:vir:97 199 -------TDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDY 258 (512) T ss_pred -------cccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCcccceEeecC-----CCCCCCch Confidence 00112334455543 111 111111111 011123334446677776643 34689999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCCc----------ccccccccccCcc Q lcl|NC_021540. 374 ELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPGT----------NPVTDIIEHKYPE 442 (705) Q Consensus 374 ~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~~----------~~~~~i~~~~~~~ 442 (705) +.++++++.+|...|.+.+.+...+.|.+++.... .+..+......+.++...+.. .....+.++..+. T Consensus 259 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 338 (512) T protein:vir:97 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecC Confidence 99999999999999999999988888877654322 122222222333333222110 1111233444333 Q ss_pred chHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021540. 443 LPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSD 522 (705) Q Consensus 443 i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~ 522 (705) -.......+..+...+...|++++.+.|..++.+|+.| +...............+.|..+++++++.++.++...... T Consensus 339 ~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~A--l~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~ 416 (512) T protein:vir:97 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI 416 (512) T ss_pred CHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 34556677899999999999999988775443344444 5444455555566667777777777777776665432211 Q ss_pred ceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhc Q lcl|NC_021540. 523 EEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISK 602 (705) Q Consensus 523 ~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~ 602 (705) .. . .++. ...+..+...+.......+.+..+ .+ .++. .-++. .+....... T Consensus 417 ~~----------~---~d~~-~i~~~f~~~~p~~~~e~~~~~~kl---~g-iiS~---et~~~---~l~~v~d~~----- 467 (512) T protein:vir:97 417 DA----------N---KDFN-TVRYVYNRNLPKSLIEELKAYIDS---GG-KISQ---TTLMS---LFSFFQDPE----- 467 (512) T ss_pred cc----------c---cccc-cceEEeCCCCCcCHHHHHHHHHHH---hc-cCch---HHHHH---hCCCCCCHH----- Confidence 00 0 0000 122333332222222222222222 11 1111 01111 111111111 Q ss_pred ccccchhhHHHHHHHHHHHHHH-HHHHHHHHHHH-------HHHHHHHHHHHHHH Q lcl|NC_021540. 603 YNPEPSPQAQLEIQIKQLEAQE-LQMRIAKLQAE-------IQLMPYEAQAEAAK 649 (705) Q Consensus 603 ~~~q~~~~~q~~~q~~q~~~q~-~q~e~~k~qa~-------~q~~~~~~q~e~a~ 649 (705) + +.++.+.+.++ ++......... ....+.+....+.+ T Consensus 468 --------~--E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 468 --------L--EVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred --------H--HHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 0 11111111000 00000000000 00000000000000 No 96 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.46 E-value=6.2e-12 Score=82.02 Aligned_cols=444 Identities=10% Similarity=0.056 Sum_probs=197.5 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHH-------HHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--C-------CC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDL-------LNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--P-------KQ 64 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-------~~ 64 (705) |-.+++- --++| .++..+..| ...+......+...+.+.+++.+||.|.-... + .. T Consensus 1 ~~~~~~~--~~~~~------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~ 72 (474) T protein:vir:95 1 MFNIIRM--PWDKP------YGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNI 72 (474) T ss_pred Ccceeec--CCCCc------hhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccccccccccc Confidence 3333221 11111 001111111 11233333455566677788999999862110 0 01 Q ss_pred CCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhc Q lcl|NC_021540. 65 QVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNE 142 (705) Q Consensus 65 ~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~ 142 (705) ..++ .+++.+..+..|+.....| ||.+.- |. .+|.+ ..+.+...+. ++-...+...+++++.+ T Consensus 73 ~~~~~~~ki~~n~~~~Ivd~~~~~l----~g~p~~--~~---~~d~~----~~~~l~~~~~--n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:95 73 DYDKPDWRITTNFHQNLVDQKVSYV----ASKPVT--YS---CEDES----VLKIIHDVLD--TRWDNKLIDILTATSNK 137 (474) T ss_pred ccccccceeccchHHHHHHHHHhhh----ccCCce--ec---cCchH----HHHHHHHHHh--ccHHHHHHHHHHHHhhc Confidence 2233 3677888888787776655 554433 32 23333 2334444443 44445567788999999 Q ss_pred CCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccc Q lcl|NC_021540. 143 GTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEE 222 (705) Q Consensus 143 g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 222 (705) |.|.+.+||+ T Consensus 138 G~~~~~v~~d---------------------------------------------------------------------- 147 (474) T protein:vir:95 138 GIDWLQVYIN---------------------------------------------------------------------- 147 (474) T ss_pred CcEEEEEEec---------------------------------------------------------------------- Confidence 9998877652 Q ss_pred cceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccc Q lcl|NC_021540. 223 QEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTS 300 (705) Q Consensus 223 ~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~ 300 (705) ..+.+++..++|.+++ ||+... .+..++ .+.+..... T Consensus 148 ------~~~~~~i~~~~p~~~~~v~d~~~~---~~~~~~-i~~~~~~~~------------------------------- 186 (474) T protein:vir:95 148 ------ENGEMKLFRVPAEQAIPIWVDKER---EELKSF-IRYYKFNNE------------------------------- 186 (474) T ss_pred ------CCCceEEEEEcccceEEEEcCCCC---CceEEE-EEEEEEcCe------------------------------- Confidence 0234677888888877 343322 222222 232211000 Q ss_pred ccccccccCeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHH Q lcl|NC_021540. 301 FTFSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAEL 375 (705) Q Consensus 301 ~~~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~ 375 (705) ..+++|.. +-..+.+.. . ....+.........+.+.+.+|+++++. ...|.|.+.. T Consensus 187 -----------~~~~~y~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~ 247 (474) T protein:vir:95 187 -----------EKVEFWTDTTVTYYVLENGGLI-P--DYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWM 247 (474) T ss_pred -----------eEEEEEeCCeEEEEEEcCCccc-c--ccccCcccccccccccCCCccceEeecC-----CCCCCCcHHH Confidence 01122211 000111110 0 0001111111122233446788887654 3468999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhh--hhcCCcceeecCCcccccccccccCccchHHHHHHHHH Q lcl|NC_021540. 376 LSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNER--KFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQM 453 (705) Q Consensus 376 ~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~--~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~ 453 (705) ++++++.+|.+.+.+.+.+...+.|.+++.....+..... ....+.++.+.+++. +.+...+.-...+...+.. T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 248 YKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGG----VETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc----eeEEeecCCHHHHHHHHHH Confidence 9999999999999999999988888777654322222221 223445666655432 3344433334556667888 Q ss_pred HHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCce Q lcl|NC_021540. 454 FTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEF 533 (705) Q Consensus 454 ~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~ 533 (705) +...+...+++++.+.+..++.+|+.| +..+............+.|..+++++++.++.++ ... .++ T Consensus 324 l~~~i~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~----g~~-------~d~ 390 (474) T protein:vir:95 324 MRAYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKATVAIQELIGFIIDFN----NLK-------MDV 390 (474) T ss_pred HHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-------ccc Confidence 999999999999877665443344444 5444444444555555666666666665555442 210 011 Q ss_pred eeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHH Q lcl|NC_021540. 534 VQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQL 613 (705) Q Consensus 534 v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~ 613 (705) . ...+..+.+.+.......+ .+.+.+ .++. ...+ ..++...+.. . T Consensus 391 ~---------~i~v~f~~~~p~d~~e~a~----~~~~~g-~iS~---et~i---~~l~~v~d~~---------------~ 435 (474) T protein:vir:95 391 K---------DIEISFNFNRMMNDAEQSQ----IIAQSQ-YLSR---ETLV---KSSPLVDDYK---------------A 435 (474) T ss_pred c---------eeeEEeccCCCcCHHHHHH----HHHhcC-CCch---HHHH---HhCCCCCCHH---------------H Confidence 1 1112222222211111111 111111 1110 0000 0111111110 0 Q ss_pred HHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 614 EIQIKQLEAQE-LQM--RIAKLQAEIQLMPYEAQAEAAK 649 (705) Q Consensus 614 ~~q~~q~~~q~-~q~--e~~k~qa~~q~~~~~~q~e~a~ 649 (705) +.++.+.+..+ .+. ..............+..-+.-+ T Consensus 436 E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 436 ELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 11111000000 000 0000000000000000000000 No 97 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.44 E-value=1.5e-11 Score=79.90 Aligned_cols=463 Identities=11% Similarity=0.027 Sum_probs=195.9 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCC---CCCcCCCHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQV---GRSSVQPKLIRKQAEWR 83 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---grs~~v~~~v~~~~e~~ 83 (705) |.+ ......+.++..|.. .+.....+.++..+||.|.... .+...+ ..-++|.+-.+..|+.+ T Consensus 1 ~~~-----~~~~d~~~~i~~L~~-------~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~ 68 (488) T protein:vir:23 1 MAE-----TESIDPEKLRDQLLD-------AFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAI 68 (488) T ss_pred CCc-----ccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHH Confidence 211 112223334454443 3333344455667899987422 111111 12235667666677766 Q ss_pred HHHHH-HhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcc Q lcl|NC_021540. 84 YSALS-EPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENV 162 (705) Q Consensus 84 ~~~l~-~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~ 162 (705) ...|. .-|+.+.+ +.+.....+|.+..+ .++.+| ..|+--.....+.+++++.|.+++.+++...... T Consensus 69 a~~l~~~Gf~~~~~-~~~~~~~~~d~~~~~----~l~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~----- 137 (488) T protein:vir:23 69 AERQELEGFRIPSA-NGEEPESGGENDPAS----ELWDWW-QANNLDIEATLGHTDALIYGTAYITISMPDPEVD----- 137 (488) T ss_pred HHhhhccceeccCC-cccccccccchhHHH----HHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc----- Confidence 55442 11221111 111222234444433 344433 3455555567899999999999888764210000 Q ss_pred cccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhh Q lcl|NC_021540. 163 PVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHN 242 (705) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~ 242 (705) .....+.++|..++|.+ T Consensus 138 ---------------------------------------------------------------~~~~~~~~~i~~~~p~~ 154 (488) T protein:vir:23 138 ---------------------------------------------------------------FDVDPEVPLIRVEPPTA 154 (488) T ss_pred ---------------------------------------------------------------cCCCCCcceEEEeccce Confidence 00113345777889998 Q ss_pred ee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEe Q lcl|NC_021540. 243 VT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYW 320 (705) Q Consensus 243 ~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~ 320 (705) ++ |||... ....+.+++.+ . + ...+..+++|.. T Consensus 155 ~~~~~d~~~~-----~~~~~~~~~~~----------~--------------------~---------~~~~~~~~~y~~- 189 (488) T protein:vir:23 155 LYAEVDPRTR-----KVLYAIRAIYG----------A--------------------D---------GNEIVSATLYLP- 189 (488) T ss_pred eEEEEecCCC-----ceEEEEEEEEe----------c--------------------C---------CCcEEEEEEEec- Confidence 66 554321 12222222210 0 0 001222333322 Q ss_pred eecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021540. 321 DIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE-LLSDNQKLIGALTRGMIDAMARSAN 399 (705) Q Consensus 321 ~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~-~~~d~Q~~iN~~~~~~~d~~~~~~~ 399 (705) +. .+..+-.++........|.+.|.+|+++|...+..+..+|.|-+. .++++++.+|..++.+.+.+...+. T Consensus 190 ----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~ 262 (488) T protein:vir:23 190 ----DT---TMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAI 262 (488) T ss_pred ----Cc---EEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 11 111111222222233445555889999999888888899999885 6899999999999999999988888 Q ss_pred CcEEeeccc----c-----CchhhhhhcCCcceeecCCcccccccccccCccch-HHHHHHHHHHHHHHHHHhCcchHhc Q lcl|NC_021540. 400 GQRGMSKNL----L-----DPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELP-ASSYNMLQMFTLEADALSGVKSFSQ 469 (705) Q Consensus 400 ~~~~~~~~a----v-----~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~-~~~~~~l~~~~~~~~~~tGv~d~~~ 469 (705) |+..+- |+ + .....+...+|.++...+|..+ .+.+.+..+ ..+...+......+-..+++++..+ T Consensus 263 p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~----~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~ 337 (488) T protein:vir:23 263 PQRLIF-GAKPEELGINAETGQRMFDAYMARILAFEGGEGA----HAEQFSAAELRNFVDALDALDRKAASYSGLPPQYL 337 (488) T ss_pred HHHHHh-CCCcccccccccccchhhhhhhhhhccCCCCCCc----eeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHh Confidence 766542 21 1 1112234455666655444322 222322221 2233334444444445678888888 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEe Q lcl|NC_021540. 470 GLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKL 549 (705) Q Consensus 470 G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v 549 (705) |..... +.++.++......-........+.|..+++++++.++.+. ..... +.++. ...+.. T Consensus 338 g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~----~~~~~------------~~~~~-~i~v~f 399 (488) T protein:vir:23 338 SSSSDN-PASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMV----KGGDI------------PTEYY-RMETVW 399 (488) T ss_pred ccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCc------------chhhc-cceEEe Confidence 854321 1244445544444444445555556556666665555432 21100 00000 112222 Q ss_pred eccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhcc-chhhhhhhcccccchhhHHHHHHHHHHHHH-HHHH Q lcl|NC_021540. 550 SISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGM-PDLSKMISKYNPEPSPQAQLEIQIKQLEAQ-ELQM 627 (705) Q Consensus 550 ~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~-~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q-~~q~ 627 (705) .........+....+..+.+.....++.. .+.+ +.++ +...+.++.. .+++.++.. .+.. T Consensus 400 ~~~~~~s~~~~ada~~kl~~~g~~~~s~e----t~~~---~l~~~~d~~~~~~~~-----------~~~~~~~~~~~~~~ 461 (488) T protein:vir:23 400 RDPSTPTYAAKADAAAKLFANGAGLIPRE----RGWV---DMGYTIVEREQMRQW-----------LEQDQKQGLGLIGS 461 (488) T ss_pred cCCCCCCHHHHHHHHHHHHhcccccCCHH----HHHH---hCCCCchHHHHHHHH-----------HHHHHHHHHHHHHH Confidence 22211122222233333333221111110 0000 0010 0000000000 000000000 0000 Q ss_pred HHHHHHHHHH----HHHHHHHHHHHHH Q lcl|NC_021540. 628 RIAKLQAEIQ----LMPYEAQAEAAKA 650 (705) Q Consensus 628 e~~k~qa~~q----~~~~~~q~e~a~a 650 (705) .....+...+ ........+-+-| T Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 462 LYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HhccCCCcccCCCCCCCCCCCCCCCCC Confidence 0000000000 0000000000000 No 98 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.44 E-value=3.4e-12 Score=83.45 Aligned_cols=396 Identities=11% Similarity=0.048 Sum_probs=189.4 Q ss_pred hHHhhHHHHHHHHHHHHhccCCCCC------CCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHH Q lcl|NC_021540. 37 KSTKDTQVAIIDDWLAQLNVTGAYK------PKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREA 110 (705) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~ 110 (705) .+.|.+. .+...+||.|..... |...+.+.+.|.+-.+..|+.+...|. |...+..|.. T Consensus 1 l~~~~~r---~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~-----------~~Gf~~~d~~- 65 (410) T protein:vir:95 1 MNLYQSR---VNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI-----------FRAFANDDFN- 65 (410) T ss_pred CCcchhh---HHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc-----------cccccCCCch- Confidence 4555444 445668999874321 122223344566666666666544331 2222333322 Q ss_pred HHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchh Q lcl|NC_021540. 111 ARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPS 190 (705) Q Consensus 111 A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (705) +..+| ..|+--.....++++||+.|.+++.|+ . T Consensus 66 -------l~~i~-~~N~ld~~~~~~~~~al~~G~sf~~v~-~-------------------------------------- 98 (410) T protein:vir:95 66 -------VTEIF-DRNNPDIFFDSAILSALIGSCSFVYIS-K-------------------------------------- 98 (410) T ss_pred -------HHHHH-hhcChHHHHHHHHHHHHHhCceeEEEe-c-------------------------------------- Confidence 22333 344433345678888999999877653 0 Q ss_pred hhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCccCChhhCCeEEEEEeccHH Q lcl|NC_021540. 191 ILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRS 268 (705) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~ 268 (705) ...+.|+|..++|.+++ |||... . ...+.+.+ T Consensus 99 -------------------------------------~~d~~~~i~~~sP~~~~~i~Dp~~~-~----~~~al~~~---- 132 (410) T protein:vir:95 99 -------------------------------------GEDDEVRLQVIESSNATGVIDPITG-L----LVEGYAVL---- 132 (410) T ss_pred -------------------------------------CCCCceEEEEEcccceEEEEeCCCC-c----eEEEEEEE---- Confidence 00234567788888854 555311 1 11111111 Q ss_pred HHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCC Q lcl|NC_021540. 269 DLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPY 348 (705) Q Consensus 269 el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~ 348 (705) +.+. ........+|.. + ...++.++..-+..++|+ T Consensus 133 -------~~~~----------------------------~~~~~~~~~~~~-----~-----~~~~~~~~~~~~~~~~~~ 167 (410) T protein:vir:95 133 -------ARDD----------------------------YNRPTLEAYFEP-----N-----ATHFIPKDGEPYSVTNET 167 (410) T ss_pred -------EecC----------------------------CCeEEEEEEEeC-----C-----cEEEEeeCCccccccCCC Confidence 0000 000111112210 0 011111111112235554 Q ss_pred CCCCcceEEeeeeeecCcccCCch-HHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc---CchhhhhhcCCccee Q lcl|NC_021540. 349 PDGKLPFVVVPYLPVKDSVYGEAD-AELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLL---DPVNERKFKMGEDYK 424 (705) Q Consensus 349 ~~~~~Pfv~~~~~~~~~~~~g~g~-~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av---~~~d~~~~~pg~~i~ 424 (705) |.+|+|+|+..+..++.+|.|- .+.++++|+.+|+.+..+.......+.|+..+- |+- +..+.+...++.++. T Consensus 168 --g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~d~d~~~~~~~~~~~~~i~~ 244 (410) T protein:vir:95 168 --GIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYIL-GLDPDAEPMEKWKATVSSLLT 244 (410) T ss_pred --CCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-ccCCCCCcCchhhhhhhhhee Confidence 7899999999999999999884 588999999999999999999999999977652 221 223445666777777 Q ss_pred ecCCcc-cccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 425 YNPGTN-PVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLAN 503 (705) Q Consensus 425 ~~~~~~-~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~ 503 (705) +..+.. ....+..++..++. .+...+..+...+-.+||++...+|...+. +.+|.++......-........+.|.. T Consensus 245 ~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s~lP~~~lg~~~~N-psSa~Al~a~~~~L~~ka~~k~~~fg~ 322 (410) T protein:vir:95 245 ISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEMGLTLDDLGFVSDN-PSSVEAIKASHENLRLAGRKAQRSLGA 322 (410) T ss_pred ccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhcCCCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654322 11223334444443 344556666666667789999999965431 234545554333333344555666667 Q ss_pred HHHHHHHHHHHHHHHhcCCc-eeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHH Q lcl|NC_021540. 504 GLTEVAKKILAMNSVWLSDE-EVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKL 582 (705) Q Consensus 504 ~~~~~~~~~l~li~q~~~~~-~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~ 582 (705) ++++++++++.+.-..-..+ ...++. -.|-.+ .+. .+. +..+....+.-|.++.. .... .. T Consensus 323 ~l~~~~rla~~i~~~~~~~~~~~~~~~-v~W~p~--------~d~----~~~-s~a~~aDa~~Kl~~a~~-g~~~---~~ 384 (410) T protein:vir:95 323 GLLNVAYVAACLRDEFRYTRSQFVRTA-VKWEPL--------FEA----DAN-TMTMIGDGVVKLNQALP-GYIN---AE 384 (410) T ss_pred HHHHHHHHHHHHhcCCCCcccccceee-EEeeec--------CCc----chh-hHHHHHHHHHHHHHhcc-CCcc---HH Confidence 77777777666543321111 110000 001111 011 111 11222223333333321 1111 11 Q ss_pred HHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 583 ILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMR 628 (705) Q Consensus 583 il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e 628 (705) + +.+..|+..- .+.. +. .+.++.+-+ T Consensus 385 ~---~~~~lg~~~~--------------~~~~-~~--~~e~~~~g~ 410 (410) T protein:vir:95 385 T---IRDLTGIAGD--------------MSAK-PV--VSEGGSNGE 410 (410) T ss_pred H---HHHhcCCChH--------------HHHH-HH--HHHHHhCCC Confidence 1 1122232110 0000 00 000000000 No 99 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.43 E-value=3e-12 Score=83.73 Aligned_cols=519 Identities=13% Similarity=0.055 Sum_probs=224.6 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhh-HHHHHHHHHHHHhccCCCC-CCCCCCC--CCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKD-TQVAIIDDWLAQLNVTGAY-KPKQQVG--RSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~g--rs~~v~~~v 76 (705) |.+=-. ++.++.+-+-...+.. .+-++ -.+..-+...+||++.--. ++. -+| |-.+-.+.- T Consensus 1 m~~~~~-----------q~~p~~~~fp~~~a~w---V~~~D~~RlaaY~ly~d~y~n~~~el~~i-l~G~dr~~~~~ps~ 65 (563) T protein:vir:74 1 MPYNHK-----------QYDPAKPFLRGGDDNI---VDENDKNRVRAYDLYENIYLNSAETLKLV-LRGDDSVPILMPSG 65 (563) T ss_pred CCcccc-----------ccCCCccccccccccc---CCHHHHHHHHHHHHHHHhhcCchhhhhhh-cCCCceeeeccchH Confidence 333222 2333332111111111 00011 0122223456778765322 221 234 333555667 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +..|+.. ++ |||.+-.+-|.|.. +|++..+....|++-.+.+++-..+ +...-++|+.-|-|++++.|+-..+ T Consensus 66 r~~V~~~----~~-~Lg~~~~~~Ve~~~-~de~~~~avq~~Lr~~~~~e~l~~~-~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 66 RKIVEAV----HR-FLGVGFDYLVEPDM-GDEGIRQSLNAYFRTTFKREAIKAK-FTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred HHHHHHH----HH-hcCCCcEEecCccc-cCcchHHHHHHHHHHHHHHhhhHHH-HHHHHHhhhhhcceeEEEeeccccc Confidence 7888874 33 45777778888876 5777777788899988888776655 4456788999999999999975332 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcce-E Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPE-V 235 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~-i 235 (705) +. +......+- | ..+.+.... .....++ + T Consensus 139 ~g-~R~rv~~vD--------------------P---------------------~~~fp~~dp--------d~v~g~~~v 168 (563) T protein:vir:74 139 AG-ERISVDEVD--------------------P---------------------RQIFLIEDG--------STVVGFHMV 168 (563) T ss_pred cC-CCceEeecC--------------------C---------------------ceeeeccCC--------CCcccceee Confidence 11 000000000 0 000000000 0001111 1 Q ss_pred EEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE Q lcl|NC_021540. 236 TICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE 315 (705) Q Consensus 236 ~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E 315 (705) +.+. -|+.|.. . .+.|+++...++. +-..|.|. . +-..-.| T Consensus 169 ~v~~---~~~~pdd---~--~~~~~r~~~~~~~-lndeg~~~---------------~---------------~~~~dae 209 (563) T protein:vir:74 169 DIVQ---DFRSPDD---P--SKKLARRRTFRRV-RNDEGMFT---------------G---------------RISSELT 209 (563) T ss_pred eccc---CCCCCcc---h--hccceeeeeeeee-eCCCCCcc---------------c---------------eeeeccc Confidence 1111 1222221 1 1234444332211 00011000 0 0001112 Q ss_pred EE-----EEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHH Q lcl|NC_021540. 316 YW-----GYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGM 390 (705) Q Consensus 316 ~w-----~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~ 390 (705) .| .....+..-....-.-++...+..+...-|.+.+.+||+.++..|.+++.||.|-...+..+.+++|...+-. T Consensus 210 ~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~ 289 (563) T protein:vir:74 210 HWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDE 289 (563) T ss_pred hhccccccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHH Confidence 22 1111111111112223344444444454455557899999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCcEEeeccc-cC----chhhhhhcCCcceeecCCcccccccc-cccCccchHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021540. 391 IDAMARSANGQRGMSKNL-LD----PVNERKFKMGEDYKYNPGTNPVTDII-EHKYPELPASSYNMLQMFTLEADALSGV 464 (705) Q Consensus 391 ~d~~~~~~~~~~~~~~~a-v~----~~d~~~~~pg~~i~~~~~~~~~~~i~-~~~~~~i~~~~~~~l~~~~~~~~~~tGv 464 (705) -.++..+++|.++.+..+ ++ ....++..||.+++.-..... ..+. ....+++..-..-|=......+.+++|+ T Consensus 290 s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~-g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~t 368 (563) T protein:vir:74 290 DATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRND-NYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGT 368 (563) T ss_pred HHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCCccc-cceeeecchhhhHHHHHHHHHHHHHHHHhhccC Confidence 999999998888776322 22 122345678888887532111 1122 2222333222222222334467889999 Q ss_pred chHhcC--CCccccchHHHHHHH--HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechh Q lcl|NC_021540. 465 KSFSQG--LTGDSLGTTTAGVQG--VIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRD 539 (705) Q Consensus 465 ~d~~~G--~~~~~~~~~a~~i~~--l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~ 539 (705) ++...| -.+...|+.|-.++. |. +...+-+ .+..-+..++.....++|.+++..+-.-.- .+|+.++ T Consensus 369 PavA~G~vD~~~~~SGiALeL~L~PL~-a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~-----~~~~g~~-- 440 (563) T protein:vir:74 369 PEVAIGRVDVTSAESGISLELQLKPLL-AANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDG-----SRPFASA-- 440 (563) T ss_pred cceeecccccccccchhhhhhhhhHHH-HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcc-----ccccccc-- Confidence 999999 345556666543332 11 1111112 234444445555666666655543222111 1222221 Q ss_pred hcccceeE--EeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHH Q lcl|NC_021540. 540 NLVGSFDI--KLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQI 617 (705) Q Consensus 540 ~~~~~~dv--~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~ 617 (705) .+.++..| +...-.+.-+++..++...|.++.-.. .......+.+. |.+......+....+...... T Consensus 441 ~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiS-----retAv~~L~~~-g~~~pdae~e~~~ie~~~i~~----- 509 (563) T protein:vir:74 441 DLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLIL-----RKMAVAKLRSI-GWEYPEVDDQGNALTDDDIAD----- 509 (563) T ss_pred ccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchh-----HHHHHHHHHhC-CCCCCcHHHHHhhcCHHHHHH----- Confidence 22222222 223333444455555555554432110 11111122221 222211000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH------HH-------HHHH----H----HHHHHHHHHH Q lcl|NC_021540. 618 KQLEAQELQMRIAKLQAEIQLMPYE------AQ-------AEAA----K----ARKANTEADL 659 (705) Q Consensus 618 ~q~~~q~~q~e~~k~qa~~q~~~~~------~q-------~e~a----~----a~~~~~ea~~ 659 (705) +.+.++++-+-...+..- .+ ..+- + .-+.... . T Consensus 510 -------~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~~~~~--~ 563 (563) T protein:vir:74 510 -------MLLAEAEADASLGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQVPLS--P 563 (563) T ss_pred -------HHHHHhhccCcccceecccCCCCcccccccCCchhHcCCcccCCccccccCCC--C Confidence 000000000000000000 00 0000 0 0000000 0 No 100 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.43 E-value=1.3e-11 Score=80.30 Aligned_cols=451 Identities=11% Similarity=0.059 Sum_probs=203.2 Q ss_pred CcchhhhhhcccccccCCCCCCHH-HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCC-------CC--C Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKP-KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQ-------VG--R 68 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-------~g--r 68 (705) |-.+++--. ++|-......... ........+....+.|...+.+..+..+||.|.-... +.+. +. . T Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:95 1 MINIIRMPW--DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred CcccccCCC--CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 555544332 2232222111111 1112222244455556666777778889999863111 1111 11 2 Q ss_pred CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEE Q lcl|NC_021540. 69 SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFR 148 (705) Q Consensus 69 s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k 148 (705) .+++.+..+-.|+.....| ||.+.- |.+ +|... .+.++..+. ++-...+...+++++..|.+.+. T Consensus 79 ~ki~~n~~k~Iv~~~~~yl----~g~p~~--~~~---~~~~~----~~~l~~~~~--n~~~~~~~~l~~~~~~~G~~~~~ 143 (474) T protein:vir:95 79 WRITTNFHQNLVDQKVSYV----AGKPVT--YAH---DDDKV----LDVIHQVLD--TRWDNKLIDILTAASNKGIDWLQ 143 (474) T ss_pred cccccchHHHHHHhhhhhh----cccCce--ecc---CChHH----HHHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEE Confidence 3577787787777776665 554433 322 33222 234444432 45556677889999999999887 Q ss_pred EeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeee Q lcl|NC_021540. 149 TSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKT 228 (705) Q Consensus 149 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 228 (705) +|++ T Consensus 144 ~~~d---------------------------------------------------------------------------- 147 (474) T protein:vir:95 144 VYIN---------------------------------------------------------------------------- 147 (474) T ss_pred eeeC---------------------------------------------------------------------------- Confidence 7652 Q ss_pred ccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccccc Q lcl|NC_021540. 229 VKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKAR 308 (705) Q Consensus 229 ~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (705) ..+.+++..++|.++|+=.+.. ...+.- .+.+.+... + T Consensus 148 ~~~~~~i~~~~p~~~~~v~d~~-~~~~~~-a~ir~~~~~----------~------------------------------ 185 (474) T protein:vir:95 148 EDGELKLFRVPAEQAIPIWTDK-EREQLN-AFIRIFTFN----------G------------------------------ 185 (474) T ss_pred CCCceEEEEEcccceEEEEcCC-CCCceE-EEEEEEeec----------C------------------------------ Confidence 0234677889999987533321 112222 223333110 0 Q ss_pred CeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHH Q lcl|NC_021540. 309 KKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLI 383 (705) Q Consensus 309 ~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~i 383 (705) ...+|+|.. +-..+.+.. ..+..++........|.+.+.+|++.++. +..|.|.+..++++++.+ T Consensus 186 --~~~~~vy~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~ 255 (474) T protein:vir:95 186 --ETKVEYWTAETVTYYVYENGGLI---PDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAI 255 (474) T ss_pred --eeEEEEEeCCeEEEEEEcCCcee---eccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHH Confidence 001223221 000111100 00011111111122233335677776653 456899999999999999 Q ss_pred HHHHHHHHHHHHhcCCCcEEeeccc-cCchhh--hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHH Q lcl|NC_021540. 384 GALTRGMIDAMARSANGQRGMSKNL-LDPVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADA 460 (705) Q Consensus 384 N~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~ 460 (705) |.+.|.+.+.+...+.|.+++. |. .+.... ...+...++.+..++ .+.++..+.-.......++.+...+-. T Consensus 256 d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~~~~~~~~~~~i~~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~ 330 (474) T protein:vir:95 256 DKRLSDVQNMFDESVELIYILR-GYEGEDLSEFMEGLKYYKAINVSSDG----GVETIQVEVPVASTKEYLDMMRAYIVE 330 (474) T ss_pred HHHHHHHHHHHHHhhcchhhhc-CCCcccccchhhhhhccceeeccCCC----ceeEEeccCCHHHHHHHHHHHHHHHHH Confidence 9999999999999988877653 43 111111 122334456555543 234544444456677788999999999 Q ss_pred HhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhh Q lcl|NC_021540. 461 LSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDN 540 (705) Q Consensus 461 ~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~ 540 (705) .|++++.+.+..++.+|+.| ++.+............+.|..+++++++.++. +.... .++ . T Consensus 331 ~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~g~~-------~d~-----~- 391 (474) T protein:vir:95 331 FGQGVDFQTDKFGSATSGIA--LKFLYTNLNLKANKLKNKANVALQELMQFILD----FNKIK-------LDA-----K- 391 (474) T ss_pred HhCCcCccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCC-------ccc-----c- Confidence 99998887665444444444 44444444555555555666666665555544 32210 011 0 Q ss_pred cccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHH Q lcl|NC_021540. 541 LVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQL 620 (705) Q Consensus 541 ~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~ 620 (705) .+.+..+...+.......+ ++.+.+ .++. .-++. .++...+ +. .+.++.+. T Consensus 392 ---~i~i~f~~~~p~~~~e~a~----~~~~~g-iiS~---et~~~---~lp~v~D-------------~~--~E~eri~~ 442 (474) T protein:vir:95 392 ---EIEITFNFNVMVNDLEQSQ----IGAQSQ-YLSK---ETLVR---HHPWVDD-------------PK--AELERLDE 442 (474) T ss_pred ---eeeEEecCCCccCHHHHHH----HHHHcC-CCCh---HHHHH---hCCCCCC-------------HH--HHHHHHHH Confidence 1122222222211111111 111111 1110 00110 1111111 11 11111111 Q ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 621 EAQELQMRIAKL---QAEIQLMPYEAQAEAAK 649 (705) Q Consensus 621 ~~q~~q~e~~k~---qa~~q~~~~~~q~e~a~ 649 (705) +..+....+... .............++.+ T Consensus 443 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 443 EQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 111000000000 00000000000000000 No 101 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.43 E-value=1.3e-11 Score=80.30 Aligned_cols=451 Identities=11% Similarity=0.059 Sum_probs=203.2 Q ss_pred CcchhhhhhcccccccCCCCCCHH-HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCC-------CC--C Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKP-KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQ-------VG--R 68 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-------~g--r 68 (705) |-.+++--. ++|-......... ........+....+.|...+.+..+..+||.|.-... +.+. +. . T Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:96 1 MINIIRMPW--DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred CcccccCCC--CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 555544332 2232222111111 1112222244455556666777778889999863111 1111 11 2 Q ss_pred CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEE Q lcl|NC_021540. 69 SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFR 148 (705) Q Consensus 69 s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k 148 (705) .+++.+..+-.|+.....| ||.+.- |.+ +|... .+.++..+. ++-...+...+++++..|.+.+. T Consensus 79 ~ki~~n~~k~Iv~~~~~yl----~g~p~~--~~~---~~~~~----~~~l~~~~~--n~~~~~~~~l~~~~~~~G~~~~~ 143 (474) T protein:vir:96 79 WRITTNFHQNLVDQKVSYV----AGKPVT--YAH---DDDKV----LDVIHQVLD--TRWDNKLIDILTAASNKGIDWLQ 143 (474) T ss_pred cccccchHHHHHHhhhhhh----cccCce--ecc---CChHH----HHHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEE Confidence 3577787787777776665 554433 322 33222 234444432 45556677889999999999887 Q ss_pred EeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeee Q lcl|NC_021540. 149 TSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKT 228 (705) Q Consensus 149 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 228 (705) +|++ T Consensus 144 ~~~d---------------------------------------------------------------------------- 147 (474) T protein:vir:96 144 VYIN---------------------------------------------------------------------------- 147 (474) T ss_pred eeeC---------------------------------------------------------------------------- Confidence 7652 Q ss_pred ccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccccccccc Q lcl|NC_021540. 229 VKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKAR 308 (705) Q Consensus 229 ~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (705) ..+.+++..++|.++|+=.+.. ...+.- .+.+.+... + T Consensus 148 ~~~~~~i~~~~p~~~~~v~d~~-~~~~~~-a~ir~~~~~----------~------------------------------ 185 (474) T protein:vir:96 148 EDGELKLFRVPAEQAIPIWTDK-EREQLN-AFIRIFTFN----------G------------------------------ 185 (474) T ss_pred CCCceEEEEEcccceEEEEcCC-CCCceE-EEEEEEeec----------C------------------------------ Confidence 0234677889999987533321 112222 223333110 0 Q ss_pred CeEEEEEEEEE-----eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHH Q lcl|NC_021540. 309 KKIVVYEYWGY-----WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLI 383 (705) Q Consensus 309 ~~v~v~E~w~k-----~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~i 383 (705) ...+|+|.. +-..+.+.. ..+..++........|.+.+.+|++.++. +..|.|.+..++++++.+ T Consensus 186 --~~~~~vy~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~ 255 (474) T protein:vir:96 186 --ETKVEYWTAETVTYYVYENGGLI---PDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAI 255 (474) T ss_pred --eeEEEEEeCCeEEEEEEcCCcee---eccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHH Confidence 001223221 000111100 00011111111122233335677776653 456899999999999999 Q ss_pred HHHHHHHHHHHHhcCCCcEEeeccc-cCchhh--hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHH Q lcl|NC_021540. 384 GALTRGMIDAMARSANGQRGMSKNL-LDPVNE--RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADA 460 (705) Q Consensus 384 N~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~--~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~ 460 (705) |.+.|.+.+.+...+.|.+++. |. .+.... ...+...++.+..++ .+.++..+.-.......++.+...+-. T Consensus 256 d~~~S~~~~~~~~~~~p~lv~~-g~~~~~~~~~~~~~~~~~~i~~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~ 330 (474) T protein:vir:96 256 DKRLSDVQNMFDESVELIYILR-GYEGEDLSEFMEGLKYYKAINVSSDG----GVETIQVEVPVASTKEYLDMMRAYIVE 330 (474) T ss_pred HHHHHHHHHHHHHhhcchhhhc-CCCcccccchhhhhhccceeeccCCC----ceeEEeccCCHHHHHHHHHHHHHHHHH Confidence 9999999999999988877653 43 111111 122334456555543 234544444456677788999999999 Q ss_pred HhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhh Q lcl|NC_021540. 461 LSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDN 540 (705) Q Consensus 461 ~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~ 540 (705) .|++++.+.+..++.+|+.| ++.+............+.|..+++++++.++. +.... .++ . T Consensus 331 ~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~g~~-------~d~-----~- 391 (474) T protein:vir:96 331 FGQGVDFQTDKFGSATSGIA--LKFLYTNLNLKANKLKNKANVALQELMQFILD----FNKIK-------LDA-----K- 391 (474) T ss_pred HhCCcCccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhCCC-------ccc-----c- Confidence 99998887665444444444 44444444555555555666666665555544 32210 011 0 Q ss_pred cccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHH Q lcl|NC_021540. 541 LVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQL 620 (705) Q Consensus 541 ~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~ 620 (705) .+.+..+...+.......+ ++.+.+ .++. .-++. .++...+ +. .+.++.+. T Consensus 392 ---~i~i~f~~~~p~~~~e~a~----~~~~~g-iiS~---et~~~---~lp~v~D-------------~~--~E~eri~~ 442 (474) T protein:vir:96 392 ---EIEITFNFNVMVNDLEQSQ----IGAQSQ-YLSK---ETLVR---HHPWVDD-------------PK--AELERLDE 442 (474) T ss_pred ---eeeEEecCCCccCHHHHHH----HHHHcC-CCCh---HHHHH---hCCCCCC-------------HH--HHHHHHHH Confidence 1122222222211111111 111111 1110 00110 1111111 11 11111111 Q ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_021540. 621 EAQELQMRIAKL---QAEIQLMPYEAQAEAAK 649 (705) Q Consensus 621 ~~q~~q~e~~k~---qa~~q~~~~~~q~e~a~ 649 (705) +..+....+... .............++.+ T Consensus 443 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 443 EQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred HHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 111000000000 00000000000000000 No 102 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.42 E-value=1.1e-12 Score=86.21 Aligned_cols=455 Identities=12% Similarity=0.047 Sum_probs=190.5 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCCC---CCCcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQV---GRSSVQPKLIRKQAEWRYSALSEPF 91 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---grs~~v~~~v~~~~e~~~~~l~~~f 91 (705) ++ +-.++|..|.+. +.....+..+..+||.|.-... +...+ ..-++|.+..+..|+.....|. | T Consensus 1 ~~--t~~~~i~~L~~~-------~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~--~ 69 (480) T protein:vir:78 1 MT--TYHEHVERLQGL-------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--I 69 (480) T ss_pred CC--CHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhc--c Confidence 22 222344444443 3334445567779999874321 11111 1224666666666665555441 1 Q ss_pred cCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCC Q lcl|NC_021540. 92 LNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEAT 171 (705) Q Consensus 92 ~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~ 171 (705) +. | ..++|.+..+ .+..+| ..|+--......++++++.|.+.+.+| ..+ . T Consensus 70 ---~g---~--~~~~d~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~-~~~---~------------- 119 (480) T protein:vir:78 70 ---EG---F--RISEDSEGLE----ELWNWW-QANDLDEESVLGHDDSLTFGRSYITVS-HPD---V------------- 119 (480) T ss_pred ---Cc---e--ecCCCchhHH----HHHHHH-HhcCHHHHHHHHHHHHhhcCceEEEEe-cCc---c------------- Confidence 11 1 1334444333 344333 345444556788999999999977653 000 0 Q ss_pred chhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCc Q lcl|NC_021540. 172 GESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTC 249 (705) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a 249 (705) ......+.++|..++|.+++ |||.. T Consensus 120 -----------------------------------------------------~~~d~~g~~~i~~~~p~~~~~~~D~~~ 146 (480) T protein:vir:78 120 -----------------------------------------------------ESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) T ss_pred -----------------------------------------------------ccCCCCCeeEEEEEcccceEEEEcCCC Confidence 00012356788899999966 44443 Q ss_pred cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeE Q lcl|NC_021540. 250 NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTT 329 (705) Q Consensus 250 ~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~ 329 (705) .. .-.+.+ +++.+.++ . ..+..+++|.. +. T Consensus 147 ~~---~~~~~i-~~~~~~~~------------------------------~--------~~~~~~~~y~~-----~~--- 176 (480) T protein:vir:78 147 TR---RVTRAV-RLYTTRDD------------------------------V--------AVPDRATLYLP-----DE--- 176 (480) T ss_pred cc---ceEEEE-EEEEeecC------------------------------C--------CceEEEEEEeC-----Ce--- Confidence 21 112222 22211000 0 01223344432 11 Q ss_pred EEEEEEECC----EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHH-hhHHHHHHHHHHHHHHHHHHhcCCCcEEe Q lcl|NC_021540. 330 PIVASWVDD----VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAEL-LSDNQKLIGALTRGMIDAMARSANGQRGM 404 (705) Q Consensus 330 ~~~~~~~g~----~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~-~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~ 404 (705) .++....++ .+...+..|.+.|.+|+++|+..+..+..+|.|.+.. ++++++.+|..++.+.+.+...+.|+..+ T Consensus 177 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i 256 (480) T protein:vir:78 177 TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI 256 (480) T ss_pred EEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 111111111 1112233344457899999998888888999998875 89999999999999999999888887655 Q ss_pred ecccc-Cc---h---hhhhhcCCcceeecCCcccccccccccCccc-hHHHHHHHHHHHHHHHHHhCcchHhcCCCcccc Q lcl|NC_021540. 405 SKNLL-DP---V---NERKFKMGEDYKYNPGTNPVTDIIEHKYPEL-PASSYNMLQMFTLEADALSGVKSFSQGLTGDSL 476 (705) Q Consensus 405 ~~~av-~~---~---d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i-~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~ 476 (705) - |+- +. + ..+....+.++...++ . ..+.+.+.- ...+...+......+-..+|+++...|..+.. T Consensus 257 ~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n- 329 (480) T protein:vir:78 257 S-GVTTDELTNDGENTTLDIYYGRILTLASE-A----AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN- 329 (480) T ss_pred h-cCCccccccccccchhhhhhhhhccCCCC-C----ceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCc- Confidence 3 321 11 1 1122233434333221 1 122222221 12333444555555555688888888864421 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhH Q lcl|NC_021540. 477 GTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAET 556 (705) Q Consensus 477 ~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~ 556 (705) +.++.++......-........+.|..+++++++.++ .+...... .++. ...+......... T Consensus 330 ~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~----~~~g~~~~-----~~~~---------~i~v~f~~~~~~s 391 (480) T protein:vir:78 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM----QIMGREVT-----EEYT---------RLETVWRDPSTPT 391 (480) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHcCCCcc-----ccce---------eeeEEecCCCCCC Confidence 2244445443333334444445555555555555444 33321100 0111 1122222221111 Q ss_pred HHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021540. 557 DAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIA-KLQAE 635 (705) Q Consensus 557 ~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~-k~qa~ 635 (705) ..+....+..+.++....++. ..+. +..++- +++.+..... .+++.+..-..+. ....+ T Consensus 392 ~~~~ad~~~kl~~~g~~~~s~----et~~---~~lg~~------------~d~~~~~~~~-~~e~~~~~~~~~~~~~~~~ 451 (480) T protein:vir:78 392 VAAKADAVSKLYANGQGPIPK----EQAR---IDLGYT------------ATQREQMRDW-DKQETEDMIDTLYSTTKAQ 451 (480) T ss_pred HHHHHHHHHHHHHhccccCCH----HHHH---hcCCCC------------HhHHHHHHHH-HHHHHHHHHHHhhcccccc Confidence 223333344444432211111 1111 111111 0011110000 0000000000000 00000 Q ss_pred HHHH----HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 636 IQLM----PYEAQAEAAKARKANTEADLNT 661 (705) Q Consensus 636 ~q~~----~~~~q~e~a~a~~~~~ea~~~~ 661 (705) .... .....-+...+-...-.+ ..+ T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 480 (480) T protein:vir:78 452 ADATPKPTVTETKTETQTSPSGFNRT-KTR 480 (480) T ss_pred CCCCCCCCCCCCCCccccccCCCCcc-cCC Confidence 0000 000000000000000000 000 No 103 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.40 E-value=3e-11 Score=78.26 Aligned_cols=450 Identities=10% Similarity=0.002 Sum_probs=192.9 Q ss_pred ccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCC---CCCcCCCHHHHHHHHHHHH Q lcl|NC_021540. 11 DTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQV---GRSSVQPKLIRKQAEWRYS 85 (705) Q Consensus 11 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---grs~~v~~~v~~~~e~~~~ 85 (705) =+.| ...+.....-..+..+ -...+..++.+.++..+||.|.... .+...+ .+-++|.+-....|+.... T Consensus 1 ~~~~--i~~~~~~~~~~~~~~~---L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 75 (485) T protein:vir:24 1 MTAP--LPGQEEIADPAIARDE---MVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAE 75 (485) T ss_pred CCCC--CCCCCcccchHHHHHH---HHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhh Confidence 1111 1122111111122111 1334455556666778999987532 111111 1223555666666666555 Q ss_pred HHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccc Q lcl|NC_021540. 86 ALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVF 165 (705) Q Consensus 86 ~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~ 165 (705) .| + .+.++ ..++.... ..++.+| ..|+--.....+++++++.|.+.+.+|++..... T Consensus 76 ~l---~--~~g~~-----~~~~~~~~----~~l~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~-------- 132 (485) T protein:vir:24 76 RQ---A--VEGFR-----LGDADEAD----EELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQID-------- 132 (485) T ss_pred hh---c--cCcee-----cCCCchhH----HHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc-------- Confidence 44 1 11111 12223322 2334333 3344334567899999999999998875311000 Q ss_pred ccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe-- Q lcl|NC_021540. 166 QYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV-- 243 (705) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~-- 243 (705) .....+.|+|..++|.++ T Consensus 133 ------------------------------------------------------------~~~~~~~~~i~~~~p~~~~~ 152 (485) T protein:vir:24 133 ------------------------------------------------------------LGWDPNVPLIRVEPPTRMYA 152 (485) T ss_pred ------------------------------------------------------------cccCCCcceEEEeccceeEE Confidence 001134567888999998 Q ss_pred eeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeec Q lcl|NC_021540. 244 TIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDID 323 (705) Q Consensus 244 ~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~ 323 (705) +|||+.. . ...+.+++-+ .+ ...+..+++|.. T Consensus 153 i~D~~~~-~----~~~~~~~~~~----------~~-----------------------------~~~~~~~~~y~~---- 184 (485) T protein:vir:24 153 EIDPRIG-R----PAKAIRVAYD----------AE-----------------------------GNEIQAATLYTP---- 184 (485) T ss_pred EeeCCcC-c----eeEEEEEEEe----------ec-----------------------------CCeEEEEEEEcC---- Confidence 4555421 1 1112121100 00 011222333322 Q ss_pred CCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_021540. 324 GSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE-LLSDNQKLIGALTRGMIDAMARSANGQR 402 (705) Q Consensus 324 ~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~-~~~d~Q~~iN~~~~~~~d~~~~~~~~~~ 402 (705) +. .++.+-.++........|.+.|.+|+|+|+..+..+..+|.|-+. .++++++.+|..++.+...+...+.|+. T Consensus 185 -~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~ 260 (485) T protein:vir:24 185 -NE---TFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR 260 (485) T ss_pred -Cc---EEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 11 111222233332223334444789999999888888889999876 5899999999999999999998888877 Q ss_pred Eeec---cccCc-----hhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHH---hCcchHhcCC Q lcl|NC_021540. 403 GMSK---NLLDP-----VNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADAL---SGVKSFSQGL 471 (705) Q Consensus 403 ~~~~---~av~~-----~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~---tGv~d~~~G~ 471 (705) .+-. ..+.. ...+...+|.++....+ . ....+.+. .....+++.++..+..+ +++++..+|. T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~----~~~~q~~~--~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~ 333 (485) T protein:vir:24 261 LIFGIKPEEIGVDPETGQTLFDAYLARILAFEDA-E----GKIQQFSA--AELANFTNALDQIAKQVAAYTGLPPQYLST 333 (485) T ss_pred hhccCCccccccccccccchhhhcccceeccCCC-C----ceEEeecc--cchHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 6531 11111 11234455655544321 1 11222221 12234455566666555 6788888885 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 472 TGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 472 ~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) .+.. +.++.++......-........+.|..+++++++.++.+.. ....+ . ++. ...+.... T Consensus 334 ~~~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~-~~~~~-~------d~~---------~i~v~f~~ 395 (485) T protein:vir:24 334 AADN-PASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMK-GGDVP-P------DML---------RMETVWRD 395 (485) T ss_pred ccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCc-c------ccc---------eeeEEecC Confidence 5421 12344455444444555566666666777777766655322 11100 0 000 11122211 Q ss_pred cchhHHHHHHHHHHHHHHHHhhhchhHHHHH----------HHHHHHhhhcc--chhhhhhhcccc----cchhhHHHHH Q lcl|NC_021540. 552 SNAETDAIKAQELSFMLQTMGQSLPFDMTKL----------ILGEIAKLRGM--PDLSKMISKYNP----EPSPQAQLEI 615 (705) Q Consensus 552 ~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~----------il~~l~e~~~~--~~~~~~~~~~~~----q~~~~~q~~~ 615 (705) ............+..|.+.....++...+.. -+..+.+.... ......+....+ +++...++.. T Consensus 396 ~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 475 (485) T protein:vir:24 396 PSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKP 475 (485) T ss_pred CCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCC Confidence 1111112222222222222111111000000 00000000000 000000000000 0000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 616 QIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAA 648 (705) Q Consensus 616 q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a 648 (705) +.. ..-...| T Consensus 476 ~~~-----------------------~~~~~~a 485 (485) T protein:vir:24 476 QPA-----------------------IEGGDSA 485 (485) T ss_pred ccC-----------------------CCCCCCC Confidence 000 0000000 No 104 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.40 E-value=1.4e-12 Score=85.53 Aligned_cols=455 Identities=12% Similarity=0.049 Sum_probs=189.4 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCC-C--CCCcCCCHHHHHHHHHHHHHHHHhh Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQ-V--GRSSVQPKLIRKQAEWRYSALSEPF 91 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~--grs~~v~~~v~~~~e~~~~~l~~~f 91 (705) ++ .-..+|..|.+.+ .....+..+..+||.|.-... +... + ..-++|.+-....|+.....| + T Consensus 1 ~~--t~~d~i~~L~~~~-------~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l---~ 68 (480) T protein:vir:78 1 MT--TYHEHVERLQGLL-------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL---D 68 (480) T ss_pred CC--CHHHHHHHHHHHH-------HHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh---c Confidence 22 1122444444433 233344456678999874221 1111 1 122356666666666554444 1 Q ss_pred cCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCC Q lcl|NC_021540. 92 LNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEAT 171 (705) Q Consensus 92 ~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~ 171 (705) .+. | ...+|.+. .+.+..+| ..|+--......++++++.|.+.+.+| .- +. T Consensus 69 --~~g---~--~~~~d~~~----~~~l~~i~-~~N~~~~~~~~~~~~a~~~G~ay~~v~-~~---~~------------- 119 (480) T protein:vir:78 69 --IEG---F--RISEDSEG----LEELWNWW-QANDLDEESVLGHDDSLTFGRAYITVS-HP---DV------------- 119 (480) T ss_pred --cCc---e--ecCCCchh----HHHHHHHH-HhcCHHHHHHHHHHHHhhcCceEEEee-cC---cc------------- Confidence 111 1 12233332 33454444 345544556788999999999977663 00 00 Q ss_pred chhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCCCc Q lcl|NC_021540. 172 GESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDPTC 249 (705) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp~a 249 (705) ......+.++|..++|.+++ |||.. T Consensus 120 -----------------------------------------------------~~~d~~~~~~i~~~~p~~~~~i~D~~~ 146 (480) T protein:vir:78 120 -----------------------------------------------------ESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) T ss_pred -----------------------------------------------------ccCCCCCeeEEEEEcccceEEEEcCCC Confidence 00012456788899999955 55543 Q ss_pred cCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeE Q lcl|NC_021540. 250 NGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTT 329 (705) Q Consensus 250 ~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~ 329 (705) .. .-.+.+ +++...++ . ..+..+++|.. +. T Consensus 147 ~~---~~~~~i-~~~~~~d~------------------------------~--------~~~~~~~~y~~-----~~--- 176 (480) T protein:vir:78 147 TR---RVTRAV-RLYTTRDD------------------------------V--------AVPDRATLYLP-----DE--- 176 (480) T ss_pred cc---ceEEEE-EEEEeecC------------------------------C--------cceEEEEEEeC-----Ce--- Confidence 21 122222 22211100 0 01123344432 11 Q ss_pred EEEEEEECC----EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHH-hhHHHHHHHHHHHHHHHHHHhcCCCcEEe Q lcl|NC_021540. 330 PIVASWVDD----VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAEL-LSDNQKLIGALTRGMIDAMARSANGQRGM 404 (705) Q Consensus 330 ~~~~~~~g~----~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~-~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~ 404 (705) .+.....++ .+...+..|.+.|.+|+++|+..+..+..+|.|-+.. ++++++.+|..++.+...+...+.|+..+ T Consensus 177 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i 256 (480) T protein:vir:78 177 TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI 256 (480) T ss_pred EEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 111111111 1122233344457899999998888888999998874 89999999999999999999888887655 Q ss_pred ecccc-C---ch---hhhhhcCCcceeecCCcccccccccccCccc-hHHHHHHHHHHHHHHHHHhCcchHhcCCCcccc Q lcl|NC_021540. 405 SKNLL-D---PV---NERKFKMGEDYKYNPGTNPVTDIIEHKYPEL-PASSYNMLQMFTLEADALSGVKSFSQGLTGDSL 476 (705) Q Consensus 405 ~~~av-~---~~---d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i-~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~ 476 (705) - |.- + .+ ..+...+|.++...++ . ..+.+.+.- ...+...+......+-.++++++...|..+.. T Consensus 257 ~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n- 329 (480) T protein:vir:78 257 S-GVTTDELTNDGENTTLDIYYGRILTLASE-A----AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN- 329 (480) T ss_pred h-CCCccccccccccchhhhhhhhhccCCCC-C----ceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCc- Confidence 3 321 1 11 1122233444433322 1 122222221 12334445555555556678888888854321 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhH Q lcl|NC_021540. 477 GTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAET 556 (705) Q Consensus 477 ~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~ 556 (705) +.++.++......-........+.|..+++++++.++ .+...... .++. ...+.-....... T Consensus 330 ~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~----~~~~~~~~-----~~~~---------~i~v~w~~~~~~s 391 (480) T protein:vir:78 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM----QIMGREVT-----EEYT---------RLETVWRDPSTPT 391 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHcCCCcc-----ccce---------eeeEEecCCCCCC Confidence 1244444443333344444555555555655555444 33321100 0110 1122222221111 Q ss_pred HHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHH-HHHHH- Q lcl|NC_021540. 557 DAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRI-AKLQA- 634 (705) Q Consensus 557 ~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~-~k~qa- 634 (705) .......+..|.++....... .++. +..++. +++.+... +.++++.+.....+ +..++ T Consensus 392 ~~~~ad~~~kl~~~g~~~~s~----et~~---~~lg~~------------~d~~~e~~-~~~~~~~~~~~~~~~~~~~~~ 451 (480) T protein:vir:78 392 VAAKADAVSKLYANGQGPIPK----EQAR---IDLGYT------------ATQREQMR-DWDKQETEDMIDTLYSTTKAQ 451 (480) T ss_pred HHHHHHHHHHHHHhcccCCCH----HHHH---hcCCCC------------HhHHHHHH-HHHHHHHHHHHHHhhccccCC Confidence 222333333444332111111 1110 111110 00111100 00000000000000 00000 Q ss_pred -HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 635 -EIQLM--PYEAQAEAAKARKANTEADLNTLDFVEQ 667 (705) Q Consensus 635 -~~q~~--~~~~q~e~a~a~~~~~ea~~~~~~~~~q 667 (705) ..+.. ......+ ++.......+..- + T Consensus 452 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~----~ 480 (480) T protein:vir:78 452 ADATPKPTVTETKTE---TQTSPSGFNRTKT----R 480 (480) T ss_pred CccccCCCCCCCCCc---cCCCcccCCCcCC----C Confidence 00000 0000000 0000000000000 0 No 105 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.38 E-value=9e-12 Score=81.14 Aligned_cols=457 Identities=9% Similarity=0.028 Sum_probs=201.3 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC-----CCCCCCCC--CcCCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY-----KPKQQVGR--SSVQP 73 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~gr--s~~v~ 73 (705) --..+.... .|..+..+.-+.++. .++.-+.. ...+.++..+||.|.-.. .....+++ .+++. T Consensus 6 ~~~~~~~~~---~~~~~~~l~~~~i~~----li~~~~~~---~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~ 75 (506) T protein:vir:94 6 TEHKQANLI---YQESLENLTPNKIMK----FITHHFNY---QRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATH 75 (506) T ss_pred hhhhcceee---cccchhcCCHHHHHH----HHHHHHHH---HHHHHHHHHHHhcCCCccccccccccccccCCcceeec Confidence 000111110 111222222222333 23222222 223456778899986321 11122333 46788 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 74 KLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 74 ~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) +..+..|+.....| ||.+. .|.+ +|.. ..+.++.+| ..|+--..+....++++..|.+.+.+||+ T Consensus 76 n~~~~Iv~~~~~~l----~G~p~--~~~~---~d~~----~~~~l~~~~-~~N~~~~~~~~~~~~~~~~G~a~~~v~~d- 140 (506) T protein:vir:94 76 SFAKYIADFQTSYS----VGNPI--NVKL---PDDG----SNSGFDTFN-KANDVDAENYDLFLDMSRYGRAYEYVYRG- 140 (506) T ss_pred chHHHHHHHhhhhh----cccCc--eeec---Ccch----HHHHHHHHH-hccCHhHHHHHHHHHHHhcCeEEEEEEec- Confidence 88888888877665 55543 3333 2222 234565544 33444455778899999999998887752 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) ..+.| T Consensus 141 ---------------------------------------------------------------------------ed~~~ 145 (506) T protein:vir:94 141 ---------------------------------------------------------------------------EDNEE 145 (506) T ss_pred ---------------------------------------------------------------------------CCCee Confidence 02346 Q ss_pred eEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeE Q lcl|NC_021540. 234 EVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKI 311 (705) Q Consensus 234 ~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 311 (705) ++..++|.++++ |+... .....+.+++..... + .......+ T Consensus 146 ~i~~~~p~~~~~v~dd~~~----~~~~~~v~~~~~~~~------~---------------------------~~~~~~~~ 188 (506) T protein:vir:94 146 HLAKLDPLDTFVIYSTDVD----PKPIMAVRYHQIELV------D---------------------------DNQVSTIN 188 (506) T ss_pred EEEEEcccceEEEecCCCC----CceEEEEEEEeeeec------c---------------------------CCceeEEE Confidence 778889998754 33221 122333344322100 0 00000112 Q ss_pred EEEEEEEEeeecCCCeeEEEEEEEEC----CEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHH Q lcl|NC_021540. 312 VVYEYWGYWDIDGSGVTTPIVASWVD----DVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALT 387 (705) Q Consensus 312 ~v~E~w~k~~~~~dg~~~~~~~~~~g----~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~ 387 (705) ..+|+|.. .+..++.+ ..+....++|| +.+|+++++.. -.|.|.+..++++++.+|..+ T Consensus 189 ~~~~~yt~----------~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n~-----~~~~sd~e~~~~liDa~d~~~ 251 (506) T protein:vir:94 189 YVPETWTA----------DTYTLYNPTPIMGKMQVDTTKPI--TTFPVVEFKNS-----NFRLGDFENVLPLIDLYDAAQ 251 (506) T ss_pred EEEEEEeC----------ceEEEeccccCccceeccccccC--CccceEEecCC-----CCCCCchhhhHHHHHHHHHHH Confidence 23344422 11122221 22333333443 67788776543 357899999999999999999 Q ss_pred HHHHHHHHhcCCCcEEeeccccCch----------------------h----hhhhcCCcceeecCCccc-----ccccc Q lcl|NC_021540. 388 RGMIDAMARSANGQRGMSKNLLDPV----------------------N----ERKFKMGEDYKYNPGTNP-----VTDII 436 (705) Q Consensus 388 ~~~~d~~~~~~~~~~~~~~~av~~~----------------------d----~~~~~pg~~i~~~~~~~~-----~~~i~ 436 (705) |.+.+.+.-.+++.+++........ + ....+-++.+.+.++... ...+. T Consensus 252 S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 331 (506) T protein:vir:94 252 SDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAK 331 (506) T ss_pred HHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccce Confidence 9999988766666544321110000 0 001112233444433211 11233 Q ss_pred cccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 437 EHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMN 516 (705) Q Consensus 437 ~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li 516 (705) ++..+.-.......++.+...+...|++++.+.+..++.+|+.| +..+............+.|..+++++++.++.++ T Consensus 332 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--ik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 409 (506) T protein:vir:94 332 YINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVA--MQYKVLGTVELASTKRRMFERGLYARYQIISDIE 409 (506) T ss_pred eeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333445666777899999999999999877654434444444 5554455555556666777777777777777765 Q ss_pred HHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchh Q lcl|NC_021540. 517 SVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDL 596 (705) Q Consensus 517 ~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~ 596 (705) ....... .++.. ...+..+...+.......+.+..+ ...++. ..++ ..+++..+. T Consensus 410 ~~~~~~~-----------~~d~~----~i~i~f~~~~p~d~~e~a~~~~kl----~g~iS~---et~~---~~lp~v~d~ 464 (506) T protein:vir:94 410 NSIHGDW-----------TFDPQ----ELTFTFRDNLPADNISQIKALVQA----GATLPQ---KYLY---QQLPGVTNP 464 (506) T ss_pred HhcCCcc-----------ccccc----cceEEeCCCCCcCHHHHHHHHHHH----hccCCh---HHHH---HhCCCCCCH Confidence 4322110 01100 122333333322111222211111 111111 0111 111111111 Q ss_pred hhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 597 SKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFV 665 (705) Q Consensus 597 ~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~ 665 (705) . .+.++.+.+.++........ .....+-+.+. .+ +....+.+ T Consensus 465 -------------~--~E~~ri~~E~~~~~~~~~~~----~~~~~~~~~~~-~~-------~~~~~e~~ 506 (506) T protein:vir:94 465 -------------Q--DIVDMMKEQSANGDYSFDQN----GVISNDGQTNT-TA-------TQTDEEVR 506 (506) T ss_pred -------------H--HHHHHHHHHHHHHhhcchhh----cCCCcccCccc-cc-------cccccCCC Confidence 1 11111111111000000000 00000000000 00 00000011 No 106 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.37 E-value=5.6e-11 Score=76.80 Aligned_cols=470 Identities=9% Similarity=-0.007 Sum_probs=202.9 Q ss_pred Ccchhhhhh---cc------cccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CC------ Q lcl|NC_021540. 1 MSDINEEFL---ED------TVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PK------ 63 (705) Q Consensus 1 ~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~------ 63 (705) |++|-..+- +. +.+..++ +.....|.+.++ .+ ...+.++..+||.|.-... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~i~~~i~----~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~ 70 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIA----EPDTTMIQKLID----EH--NPEPLLKGVRYYMCENDIEKKRRTYYDAA 70 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhcc----chhHHHHHHHHH----hh--cHHHHHHHHHHhccccchhhccchhcccc Confidence 555432221 11 1111122 122223332222 11 2344567889998863110 00 Q ss_pred ----CCCCC--CcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHH Q lcl|NC_021540. 64 ----QQVGR--SSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVR 137 (705) Q Consensus 64 ----~~~gr--s~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~ 137 (705) ..++| .+++.+..+..|+.....| ||.+. .|. .+|.. ..++++..+ .|+-...+..+++ T Consensus 71 ~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl----~g~~~--~~~---~~d~~----~~~~l~~~~--~n~~~~~~~~~~~ 135 (503) T protein:vir:59 71 GQQLVDDTKTNNRTSHAWHKLFVDQKTQYL----VGEPV--TFT---SDNKT----LLEYVNELA--DDDFDDILNETVK 135 (503) T ss_pred cccccccccccceeecchHHHHHHHHHhhh----hcCCe--eec---cCcHH----HHHHHHHHH--hcCHHHHHHHHHH Confidence 11122 2567777777777776655 44443 232 23333 233555543 2555566778999 Q ss_pred HHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceecc Q lcl|NC_021540. 138 TAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAII 217 (705) Q Consensus 138 ~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 217 (705) +++..|.+.+.+||+ T Consensus 136 ~~~~~G~~~~~v~~d----------------------------------------------------------------- 150 (503) T protein:vir:59 136 NMSNKGIEYWHPFVD----------------------------------------------------------------- 150 (503) T ss_pred HHhhCCeEEEEEeec----------------------------------------------------------------- Confidence 999999998888762 Q ss_pred CcccccceeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccc Q lcl|NC_021540. 218 NGYEEQEVIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHY 295 (705) Q Consensus 218 ~~~~~~~~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~ 295 (705) ..+++++..++|.+++. |+.. ..+..+ +.+++.+... + T Consensus 151 -----------~dg~~~i~~~~p~~~~~i~d~~~---~~~~~~-~ir~~~~~~~--------~----------------- 190 (503) T protein:vir:59 151 -----------EEGEFDYVIFPAEEMIVVYKDNT---RRDILF-ALRYYSYKGI--------M----------------- 190 (503) T ss_pred -----------CCCceEEEEEccceeEEEEeCCC---CCceEE-EEEEEEEecC--------C----------------- Confidence 02346788899999774 3332 122222 3333322100 0 Q ss_pred cccccccccccccCeEEEEEEEEE-----eeecCCCeeEEEEE-EEECCEEEecccCCCCCCCcceEEeeeeeecCcccC Q lcl|NC_021540. 296 SSDTSFTFSDKARKKIVVYEYWGY-----WDIDGSGVTTPIVA-SWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYG 369 (705) Q Consensus 296 ~~~~~~~~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~~~~-~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g 369 (705) ...+..+|+|.. +...+.+....... .......+.....|.+.+.+||+.+.. +.+| T Consensus 191 ------------~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~~~ 253 (503) T protein:vir:59 191 ------------GEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKN-----NEEM 253 (503) T ss_pred ------------CceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecC-----CCCC Confidence 011222333322 11111111000000 000000111222344446778777653 4468 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCch-hh-hhhcCCcceeecCCcccccccccccCccchHHH Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPV-NE-RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASS 447 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~-d~-~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~ 447 (705) .|.+..++++++.+|.+.+.+.+.+...+.|.+++...-.... +. .....+.++.+..++. +.++....-.... T Consensus 254 ~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~ 329 (503) T protein:vir:59 254 VSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGG----VDTLRAEIPVDSA 329 (503) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCc----ceeEeccCCHHHH Confidence 9999999999999999999999999999988776542211111 11 1233445565554432 3333333333455 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEe Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIR 527 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~ir 527 (705) ...++.+...+...+++++.+.+..++..|+.| +...............+.|..+++++++.++.++....... T Consensus 330 ~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A--i~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~---- 403 (503) T protein:vir:59 330 AKELERIQDELYKSAQAVDNSPETIGGGATGPA--LENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGD---- 403 (503) T ss_pred HHHHHHHHHHHHHHhcccCCCcccccccccHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---- Confidence 667888999999999988876554333345544 44433444444455566666666666666665554322110 Q ss_pred EecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccc Q lcl|NC_021540. 528 ITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEP 607 (705) Q Consensus 528 i~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~ 607 (705) +. .. ..+.+..+...+.......+.+..+.+.. .++. ..+.. .++... T Consensus 404 -----~~-----~~-~~i~i~f~~~~p~d~~~~~~~~~kl~~~G--iiS~----et~l~--~l~~v~------------- 451 (503) T protein:vir:59 404 -----FN-----PD-KELTMTFTRTRIQNDSEIVQSLVQGVTGG--IMSK----ETAVA--RNPFVQ------------- 451 (503) T ss_pred -----cc-----cc-cceeEEeCCCCCCCHHHHHHHHHHHHhCC--CCch----HHHHH--hCCCCC------------- Confidence 00 00 01233333333322223333333333321 1111 11100 111111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 608 SPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQER 675 (705) Q Consensus 608 ~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~ 675 (705) ++. .+.++.+.+.... .+.+....-...-...+.+... .. . + .+....-+.+ T Consensus 452 d~~--~E~~ri~~E~~~~----~~~~~~~~~~~~~~~~~~~~~~----~~-~-~----~~~~~~g~~~ 503 (503) T protein:vir:59 452 DPE--EELARIEEEMNQY----AEMQGNLLDDEGGDDDLEEDDP----NA-G-A----AESGGAGQVS 503 (503) T ss_pred CHH--HHHHHHHHHHHHH----HhhhccccCccCCCCCCCcCCC----CC-C-c----ccCCCCCCcC Confidence 111 1111111110000 0000000000000000000000 00 0 0 0000000000 No 107 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.36 E-value=2.1e-11 Score=79.14 Aligned_cols=456 Identities=11% Similarity=0.052 Sum_probs=179.9 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCC--CCCCCCC------cCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKP--KQQVGRS------SVQ 72 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~grs------~~v 72 (705) |-.+ | .+++..+.+...|..++ ...|...+.+.++..+||.|...... ...+.+. ..| T Consensus 1 ~~~~---------p--~~~l~~~~~~~~~~~~l---~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 66 (479) T protein:vir:99 1 MIDL---------P--DEDLSSEGLAKYLETKV---FPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSR 66 (479) T ss_pred CccC---------C--cccCChhHHHHHHHHHH---HHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhh Confidence 2222 2 12334443333333222 23344444556677789999854321 1111110 113 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeec Q lcl|NC_021540. 73 PKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWC 152 (705) Q Consensus 73 ~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~ 152 (705) .+-.+..|+.+...| + |.+.+..|.+..+... .+| ..|+--.....+++++++.|.+++.+|+. T Consensus 67 ~n~~~~iVd~~~~~l---~--------~~gf~~~d~~~~~~~~----~i~-~~N~~d~~~~~~~~~a~~~G~af~~v~~~ 130 (479) T protein:vir:99 67 KPWMGLMVNSFAQQL---I--------VDGYRKTGTNENAKGW----DTW-RLNQMDKQQFWLNRAVLTFGYAFIKVTSG 130 (479) T ss_pred cCcHHHHHHHHHhhc---c--------cccccCCCchhhHHHH----HHH-HhcChhHHHHHHHHHHhhcCceEEEEecC Confidence 344444444332222 1 2333334444333333 232 23433344557889999999998766531 Q ss_pred chhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc Q lcl|NC_021540. 153 LEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ 232 (705) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 232 (705) .. .....+. T Consensus 131 ~~-----------------------------------------------------------------------~~d~~g~ 139 (479) T protein:vir:99 131 IS-----------------------------------------------------------------------PLDGTTV 139 (479) T ss_pred CC-----------------------------------------------------------------------CcCCCCc Confidence 00 0011345 Q ss_pred ceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCe Q lcl|NC_021540. 233 PEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKK 310 (705) Q Consensus 233 ~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (705) ++|..++|.+++. |...+ + . +..+.. + ++. ... T Consensus 140 ~~i~~~~p~~~~~iydd~~~-~--~--~~~~~~--~--------~~~------------------------------~~~ 174 (479) T protein:vir:99 140 ARIKCIDPRDAFAIWEDPYW-D--E--WPKYLL--E--------RQP------------------------------NGQ 174 (479) T ss_pred eEEEEechhheEEEecCCcc-c--c--eeeEEE--e--------ecC------------------------------cee Confidence 6788889999753 32211 1 1 111100 0 000 000 Q ss_pred EEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHH Q lcl|NC_021540. 311 IVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGM 390 (705) Q Consensus 311 v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~ 390 (705) +.+|... ..+.....++........|-+.|.+|+++|...+..+. +|.|.++.++++++.+|+.++.+ T Consensus 175 ---~~~~~~~--------~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~-~g~sd~e~v~~liDa~~~~~s~~ 242 (479) T protein:vir:99 175 ---YWWWTEE--------DYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRG-VCYGDVEPLVTVAKAIDKTGLDI 242 (479) T ss_pred ---EEEEecc--------eEEEEEecCCceeeccccccCCCCcceEEeecCCCcCc-CCcchhHHHHHHHHHHHHHHHHH Confidence 0111110 00011111121111122333347899999998887754 79999999999999999999999 Q ss_pred HHHHHhcCCCcEEeeccccCc------hhhhhhcCCcceeecCCcccccccccccCccc-hHHHHHHHHHHHHHHHHHhC Q lcl|NC_021540. 391 IDAMARSANGQRGMSKNLLDP------VNERKFKMGEDYKYNPGTNPVTDIIEHKYPEL-PASSYNMLQMFTLEADALSG 463 (705) Q Consensus 391 ~d~~~~~~~~~~~~~~~av~~------~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i-~~~~~~~l~~~~~~~~~~tG 463 (705) ...+...+.|+..+. |.... .+.+....+.++...++ . ....+.+.. ...+...++.+...+-..|+ T Consensus 243 ~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~i~~~~~~-~----~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~ 316 (479) T protein:vir:99 243 LLVQHHQSFQIRWAT-GLMLPEGANADQEKMRFAQESMLISQNE-K----ASFGAIPAAPLDGLLNAYKESLLEFLALAQ 316 (479) T ss_pred HHHHHHhhchhhhhc-CCCcccccccchhccccccccceeecCC-C----ceEEEecccchHHHHHHHHHHHHHHhccCC Confidence 999998888876543 32111 11122233444444322 1 122233221 12333344444455555678 Q ss_pred cchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhccc Q lcl|NC_021540. 464 VKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVG 543 (705) Q Consensus 464 v~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~ 543 (705) +++...|..+| .| +.++......-........+.|..+++++++.++.+ .+...- ..+.. T Consensus 317 ~p~~~~g~~~n-~S--g~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~----~~~~~~-----~~~~~-------- 376 (479) T protein:vir:99 317 LPPHIAGQIVN-VA--ADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKI----EGRTEE-----ATDLD-------- 376 (479) T ss_pred CCHHHcccccc-hH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcc-----cccee-------- Confidence 88888886554 23 334444333444444555556666666666555442 221100 00000 Q ss_pred ceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhh-hhhhcccccchhhHHHHHHHHHHHH Q lcl|NC_021540. 544 SFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLS-KMISKYNPEPSPQAQLEIQIKQLEA 622 (705) Q Consensus 544 ~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~-~~~~~~~~q~~~~~q~~~q~~q~~~ 622 (705) +.+.-.........+..+.+..|.++.+ ++.. .++ ..+.++.... +.++. ...++.+. T Consensus 377 -i~~~w~~~~~~s~~~~ad~~~kl~~ag~--is~e---t~l---~~l~gv~~~~~e~~~~------------~~~~~~~~ 435 (479) T protein:vir:99 377 -FTITWQDVTIQSLAQFADAWAKMVESLK--IPAE---GVW---DMIPNLDQSTVNGWKE------------IYDREGDF 435 (479) T ss_pred -eeEEecCCCCCCHHHHHHHHHHHHhcCC--CCHH---HHH---HhcCCCCHHHHHHHHH------------HHHHHHHH Confidence 1111111111111222333333333311 1111 111 1111221100 00000 00000000 Q ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 623 QELQMRIAKLQAEIQLM-----PYEAQAEAAKARKANTEADLNTL 662 (705) Q Consensus 623 q~~q~e~~k~qa~~q~~-----~~~~q~e~a~a~~~~~ea~~~~~ 662 (705) ......+.......... ...++...... ...++-.+.-- T Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 479 (479) T protein:vir:99 436 GKYMRKLQNGPDPAEQRGGPNGATNMQQANNKT-GEPASLNKSGA 479 (479) T ss_pred HHHHHHHhcccCcccccCCCCCCCCCCCCCCCC-cchhccCCCCC Confidence 00000000000000000 00000000000 00000000000 No 108 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.35 E-value=7.1e-11 Score=76.23 Aligned_cols=453 Identities=11% Similarity=0.031 Sum_probs=192.3 Q ss_pred hcccccccCCCCCCHH--HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCC---CCCcCCCHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKP--KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQV---GRSSVQPKLIRKQAE 81 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---grs~~v~~~v~~~~e 81 (705) |+..-| ++.=.+.+ ++..|.+. +.....+.++..+||.|.... .++..+ .+-+++.+-.+..|+ T Consensus 1 ~~~~i~--~~~~~~~~~~~~~~l~~~-------~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd 71 (485) T protein:vir:10 1 MTAPLP--GQEEIEDPAIARDEMVSA-------FEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVD 71 (485) T ss_pred CCCCCC--CCCCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHH Confidence 443333 43212222 33333333 333444556778999987532 111111 122345566677777 Q ss_pred HHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhc Q lcl|NC_021540. 82 WRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTEN 161 (705) Q Consensus 82 ~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~ 161 (705) .+...| ++ + -|. ..+|.+..+ .++.+| ..|+--.....++++|++.|.+.+.+|.... T Consensus 72 ~~~~~l---~~--~---g~~--~~~~~~~~~----~~~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~------- 129 (485) T protein:vir:10 72 SIAERQ---AV--E---GFR--FGDADEADE----ELWQWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDP------- 129 (485) T ss_pred HHHhhh---cc--c---cee--cCCCchhHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEeeCCc------- Confidence 665544 11 1 122 123433333 333333 3444445567899999999999888754200 Q ss_pred ccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechh Q lcl|NC_021540. 162 VPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYH 241 (705) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~ 241 (705) +.. .....+.++|..++|. T Consensus 130 ------------------------------------------------~~~-------------~~~~~~~~~i~~~~p~ 148 (485) T protein:vir:10 130 ------------------------------------------------QID-------------LGWDPNTPIIRVEPPT 148 (485) T ss_pred ------------------------------------------------ccc-------------cccCCCeeEEEEEccc Confidence 000 0012345788889999 Q ss_pred hee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEE Q lcl|NC_021540. 242 NVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGY 319 (705) Q Consensus 242 ~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k 319 (705) +++ |||.-. +-.+.+++.. .+ ....+..+++|.. T Consensus 149 ~~~~~~D~~~~----~~~~~~~~~~------------~~----------------------------~~~~~~~~~~y~~ 184 (485) T protein:vir:10 149 RMYAEIDPRIG----RVSKAIRVAY------------DA----------------------------EGNEIQAATLYTP 184 (485) T ss_pred eeEEEEcCCCC----ceeEEEEEEE------------ee----------------------------CCCeEEEEEEEeC Confidence 965 565421 1111111111 00 0011223334432 Q ss_pred eeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021540. 320 WDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE-LLSDNQKLIGALTRGMIDAMARSA 398 (705) Q Consensus 320 ~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~-~~~d~Q~~iN~~~~~~~d~~~~~~ 398 (705) +. .++....++........|.+.|.+|+++|+..+..+..+|.|-+. .++++++.+|+.++.+.......+ T Consensus 185 -----~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a 256 (485) T protein:vir:10 185 -----ND---IFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMG 256 (485) T ss_pred -----Ce---EEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 11 111111222222223445555889999999999999999999886 589999999999999999999888 Q ss_pred CCcEEeec---cccC--c---hhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHH---hCcchH Q lcl|NC_021540. 399 NGQRGMSK---NLLD--P---VNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADAL---SGVKSF 467 (705) Q Consensus 399 ~~~~~~~~---~av~--~---~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~---tGv~d~ 467 (705) .|+..+-. +.+. + ...+...+|.++.... .. ..+.+.+.- .....++.++..+..+ +++++. T Consensus 257 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~d----~k~~q~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~ 329 (485) T protein:vir:10 257 VPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFED-AE----GKIQQFSAA--ELANFTNALDQIAKQVAAYTGLPPQ 329 (485) T ss_pred chHHHHhcCCcccccccccccchhhhhcccceeccCC-CC----ceEEeeccc--chHHHHHHHHHHHHHHhcccCCCHH Confidence 88765431 1111 1 1123445566555432 11 122222221 1233455555555555 777778 Q ss_pred hcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeE Q lcl|NC_021540. 468 SQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDI 547 (705) Q Consensus 468 ~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv 547 (705) .+|..+.. +.++.++...............+.|..+++++++.++.+. ..... . .++. ...+ T Consensus 330 ~fg~~~~n-~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~----~~~~~-~---~~~~---------~i~v 391 (485) T protein:vir:10 330 YLSTAADN-PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMM----KGGDV-P---PDML---------RMET 391 (485) T ss_pred HhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCC-c---ccce---------eeeE Confidence 88754321 1233344443344444445555555566666665554432 11100 0 0000 1112 Q ss_pred EeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccch-hhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 548 KLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPD-LSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 548 ~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~-~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) ...........+..+.+..|.++....++... +. ++.++.. ..+.++... + .+.++..... T Consensus 392 ~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et----~~---~~lg~~~~~~~~~~~~~-------e----e~~~~~~~~~ 453 (485) T protein:vir:10 392 VWRDPSTPTYAAKADAASKLYNGGTGVIPRER----AR---KDMGYSIAEREEMRRWD-------E----EEAAMGLGLI 453 (485) T ss_pred EecCCCCCCHHHHHHHHHHHHhccccCCCHHH----HH---HhCCCCHhHHHHHHHHH-------H----HHHHHHHHHH Confidence 22222111122222333333332111111100 00 1111100 000000000 0 0000000000 Q ss_pred HHHHHHH--H--HHHHHHHHHH------HHHH Q lcl|NC_021540. 627 MRIAKLQ--A--EIQLMPYEAQ------AEAA 648 (705) Q Consensus 627 ~e~~k~q--a--~~q~~~~~~q------~e~a 648 (705) ..+.... . +-+..++-.. -..| T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 454 GTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 0000000 0 0000000000 0000 No 109 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.31 E-value=3.3e-11 Score=78.01 Aligned_cols=478 Identities=10% Similarity=0.041 Sum_probs=191.4 Q ss_pred CcchhhhhhcccccccCCCCCCHH-HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCCCC-----CcCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKP-KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQVGR-----SSVQ 72 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gr-----s~~v 72 (705) |.--+.++ ...|.+.=.+.++. +-..+...+..-...|..+..+.++..+||.|.... .+...+.+ -..| T Consensus 1 ~~~~~~~~--~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v 78 (501) T protein:vir:25 1 MTVPVDVI--ADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSV 78 (501) T ss_pred Ccccchhh--hccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhh Confidence 43222222 33454443344332 222223333333344445555667778999987421 12222211 1245 Q ss_pred CHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeec Q lcl|NC_021540. 73 PKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWC 152 (705) Q Consensus 73 ~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~ 152 (705) .+-.+..|+.+.-.| ++ .+.+-+|....+ .+.. +...|+--.....+++++++.|.|++.+|.+ T Consensus 79 ~n~~~~ivd~~a~~l---~~--------~gf~~~d~~~~~----~l~~-i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d 142 (501) T protein:vir:25 79 KNVLSLVRDSFAQNL---SV--------VGYRNALAKEND----PAWE-MWQRNRMDARQAEVHRPALTYGASYVTVTPT 142 (501) T ss_pred cChHHHHHHHHHhhh---cc--------cceecCCccchH----HHHH-HHHhcChhHHHHHHHHHHhhcCceEEEEecC Confidence 555555666433222 11 111222222111 2322 3344554444567899999999998776531 Q ss_pred chhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCc Q lcl|NC_021540. 153 LEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQ 232 (705) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 232 (705) ... T Consensus 143 -----------------------------------------------------------------------------e~~ 145 (501) T protein:vir:25 143 -----------------------------------------------------------------------------DEG 145 (501) T ss_pred -----------------------------------------------------------------------------CCC Confidence 001 Q ss_pred ceEEEechhhee--e-CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccC Q lcl|NC_021540. 233 PEVTICDYHNVT--I-DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARK 309 (705) Q Consensus 233 ~~i~~V~~~~~~--~-Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (705) ++|..++|.+++ | ||..... ..+++ +.+....+ .+ ... T Consensus 146 ~~i~~~sp~~~~~iy~D~~~~~~---~~~ai-~~~~~~~~-------~~----------------------------~~~ 186 (501) T protein:vir:25 146 PVFRTRSPRQILAVYADPSVDAW---PQYAL-ETWVAQKD-------AK----------------------------PHR 186 (501) T ss_pred CeEEEeccccEEEEEecCCCCcc---eeEEE-EEEeeccc-------cC----------------------------cce Confidence 346677888874 3 5653211 12222 22211110 00 000 Q ss_pred eEEEE--EEEEEeeec-------CCC--eeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhH Q lcl|NC_021540. 310 KIVVY--EYWGYWDID-------GSG--VTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSD 378 (705) Q Consensus 310 ~v~v~--E~w~k~~~~-------~dg--~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d 378 (705) ++.+| .+++.+... ..+ ..........++. ......|-+.+.+|+++++-.+..+ .+|.|.++.+++ T Consensus 187 ~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vPiv~f~N~~~~~-~~g~sdie~v~~ 264 (501) T protein:vir:25 187 RGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDV-IEHGATFEGKPVCPVVRFVNGRDAD-DMIVGEVAPLIL 264 (501) T ss_pred eEEEecCeeEEEEecCceeeeeccccccccccccccccccc-cccccccCCccceeeEeccCccccC-ccccchhhhhHH Confidence 01111 000111000 000 0011111111111 1122233334678888887766554 468999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHH Q lcl|NC_021540. 379 NQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLE 457 (705) Q Consensus 379 ~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~ 457 (705) +++.+|+.++.+.......+.|+..+- |+ .+..+.+...++.++....+ . ..+.-++...+ ..+...+..+... T Consensus 265 l~Da~~~~~s~~~~~~e~~a~p~~~i~-G~~~~~~~~~~~~~~~i~~~~~~-~--~~~~q~~~~~~-~~~~~~l~~~i~~ 339 (501) T protein:vir:25 265 LQQAINSVNFDRLIVSRFGANPQRVIS-GWTGSKAEVLKASALRVWTFEDP-E--VKAQAFPPASV-EPYNLILEEMLQH 339 (501) T ss_pred HHHHHHHHHHHHHHHHHhhccHHHHHh-CCCCCccchhhhcccceeccCCC-C--ceEEEecccCh-HHHHHHHHHHHHH Confidence 999999999999999988888765443 33 23334456667776655422 1 12222222222 2344455666666 Q ss_pred HHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeec Q lcl|NC_021540. 458 ADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQIN 537 (705) Q Consensus 458 ~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~ 537 (705) +-..|++++...|...+.+ ++.++......-........+.|..+++++++.++ .+.....- ..+. T Consensus 340 i~~~s~~P~~~~~~~~~N~--Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~----~~~~~~~~-----~~~~--- 405 (501) T protein:vir:25 340 VAMVAQISPAQVTGKMINV--SAEALAAAEANQQRKLAAKRESFGESWEQLLRLAA----EMDDDPDT-----AADS--- 405 (501) T ss_pred HHhhcCCChhhhccccCCh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhCCCcc-----ccce--- Confidence 6667889988888543323 34344443333344445555556566666555544 33322110 0000 Q ss_pred hhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHH Q lcl|NC_021540. 538 RDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQI 617 (705) Q Consensus 538 ~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~ 617 (705) ...+.-.........+..+.+..|.+. + .+.. .+ +..+.++... .......+. T Consensus 406 ------~i~v~w~~~~~~s~~~~ada~~kl~~~-g--is~e---t~---~~~~~g~~~~------------~ie~~~~~~ 458 (501) T protein:vir:25 406 ------GAEVLWRDTEARSFGAVVDGITKLASA-G--IPIE---HL---LSMVPGMTQQ------------TIQAIKDSL 458 (501) T ss_pred ------eeeEEecCCCCCCHHHHHHHHHHHHhc-C--CCHH---HH---HHHcCCCCHH------------HHHHHHHHH Confidence 111222222121222222323333322 1 1111 01 1112222110 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 618 KQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVE 666 (705) Q Consensus 618 ~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~ 666 (705) ++ +..+.....+.+...........+.+.+..-..+..-.. - . T Consensus 459 ~e---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g-~ 501 (501) T protein:vir:25 459 RG---GEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNG--G-A 501 (501) T ss_pred HH---HhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCC--C-C Confidence 00 000000111000000000000000000000000000000 0 0 No 110 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.31 E-value=4.7e-11 Score=77.18 Aligned_cols=496 Identities=12% Similarity=0.054 Sum_probs=214.3 Q ss_pred Ccchhhhhhcccccc--cCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPS--LQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRK 78 (705) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~ 78 (705) |.+=-.-+ +-++|= +--.+.|. +.+.+.++ +..-+...+||++.........+|. +-+. T Consensus 1 ~~~~~~~~-~~~~~~~~g~~~~p~~------v~~~d~~R------l~aY~l~~~~y~n~~~~~~~~lrg~------~~~~ 61 (527) T protein:vir:10 1 MGQDKRQY-GSTQQLRAGEANFPNA------VTDFDKAR------LASYRLYEDMYLTNTSDYQVILRGG------DEGD 61 (527) T ss_pred CCcccccc-CCCcCcCCccccCccc------CCHHHHHH------HHHHHHHHHHhcCchhheeeecCCc------cccc Confidence 33211111 112221 00112222 22222221 1112234567776421111111111 1122 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhh Q lcl|NC_021540. 79 QAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKV 158 (705) Q Consensus 79 ~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~ 158 (705) ...-.+||. .-.++...-|-+.+....+...++.....++-.+.. ++=...++..-++++.-|-|++++.|+...++. T Consensus 62 ~r~~~~ps~-~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~-e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 62 QRPIYVPNG-EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDR-ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred cceeeehhh-HHhhCCcceeeccCccccccchhHHHHHHHHHHHHH-hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 333334444 333566666777777766666676677777654444 444445677888999999999999998544321 Q ss_pred -hhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEE Q lcl|NC_021540. 159 -TENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTI 237 (705) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~ 237 (705) +..+. .+ +|. .+.++. +..+.-.+.. T Consensus 140 ~R~~v~---~~-------------------DP~---------------------~~f~~e----------d~d~~~~v~~ 166 (527) T protein:vir:10 140 SRLSLH---EV-------------------DPS---------------------TYFPYE----------DPRYPGQVLG 166 (527) T ss_pred CCceEe---ec-------------------Ccc---------------------eeeeee----------cCCCCCceee Confidence 10000 00 000 000000 0000001222 Q ss_pred echhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE-E Q lcl|NC_021540. 238 CDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE-Y 316 (705) Q Consensus 238 V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~ 316 (705) |+.-+-|..|.. ....-+|.+..+.++ +|-..|.+ - ..-++|+.+ . T Consensus 167 v~~~~~~~~P~d---~~~~~~~ar~~~~~~-~l~~~g~~-------------------------~----~~G~~~yt~~~ 213 (527) T protein:vir:10 167 VYLVDEYPHPDS---EKKNEKCARVQKYMK-TLDDDGKP-------------------------V----PGGAIKYTEEL 213 (527) T ss_pred EEEeeeccCCcc---ccccceehhhhhhhh-hcCccccc-------------------------c----cCcceeeeece Confidence 222112333321 111123332222221 11000000 0 001233333 3 Q ss_pred EE--Eee-ecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 317 WG--YWD-IDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 317 w~--k~~-~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |. +++ .+.-..-.-.+.+.+++++++..++|+ +.+|+|+++-.|.+++.||+|-...++.+++++|+.++....+ T Consensus 214 w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~i 291 (527) T protein:vir:10 214 YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLI 291 (527) T ss_pred eeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHH Confidence 43 121 111010011234557888988888877 6789999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccc-cCc---hhhhhhcCCcceeecCCcccccccccccC-ccchHHHHHHHHHHHHHHHHHhCcchHh Q lcl|NC_021540. 394 MARSANGQRGMSKNL-LDP---VNERKFKMGEDYKYNPGTNPVTDIIEHKY-PELPASSYNMLQMFTLEADALSGVKSFS 468 (705) Q Consensus 394 ~~~~~~~~~~~~~~a-v~~---~d~~~~~pg~~i~~~~~~~~~~~i~~~~~-~~i~~~~~~~l~~~~~~~~~~tGv~d~~ 468 (705) +..++.|.+...--. ++. .+.+...||++|....++. +..... +.+ ..+...+..+...+.+++|++..+ T Consensus 292 s~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak----~~~v~~~~~l-a~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 292 MVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNK----IYRVNGVASL-EPSQTHMNKAEEAMQQTKGIPDIA 366 (527) T ss_pred HHHhCCceeeecccccccccCCcCccccCCceeEecCCCcc----eeeccchhhh-HHHHHHHHHHHHHHHHhhcCCeee Confidence 999888877663221 221 1224456888887765432 222222 233 345666888889999999999999 Q ss_pred cCC--CccccchHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH-HHHHHHHhcCCceeEeEecCceeeechhhccc Q lcl|NC_021540. 469 QGL--TGDSLGTTTAGVQGVIGASGKRELGILRRLANG--LTEVAKK-ILAMNSVWLSDEEVIRITDEEFVQINRDNLVG 543 (705) Q Consensus 469 ~G~--~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~--~~~~~~~-~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~ 543 (705) .|. .++..|+.|-.++. +.. +.+.-+.. ++-+.++ ..+++..++..-+-+-+.+ -.-.. T Consensus 367 ~G~vD~s~~~SG~ALeL~L----~PL----lar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d--------~~~~~ 430 (527) T protein:vir:10 367 VGVVDAAVAESGIALDLKL----SAI----LSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDD--------ADKKL 430 (527) T ss_pred eccccCCcCcHHHHHHHHH----HHH----HHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCC--------Ccccc Confidence 994 34444554432221 111 11111111 0111111 0111111111101111111 00011 Q ss_pred ceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHH Q lcl|NC_021540. 544 SFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQ 623 (705) Q Consensus 544 ~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q 623 (705) .+.+......+.-+.+..+++..+.++... .....+..+.+..++......++.... ...++ . T Consensus 431 ~v~ivf~p~lP~D~~avie~v~tL~~aGii-----S~etAv~~L~~~~g~eD~E~E~~~I~~-------era~~-----a 493 (527) T protein:vir:10 431 TVTITFRDPKPVNNEKRFAQLLELWEAGLI-----PAKKLTEELSKIMGFELTEEDFRQATE-------DKKTQ-----G 493 (527) T ss_pred ceEEEecccCCCCHHHHHHHHHHHHHcCch-----hHHHHHHHHHhccCCCchHHHHHHHHH-------HHHHH-----h Confidence 223333443444444555555554443211 122223344444443333222221100 00000 0 Q ss_pred HHHHHHH---HHHH--H---------HHHHHHHH Q lcl|NC_021540. 624 ELQMRIA---KLQA--E---------IQLMPYEA 643 (705) Q Consensus 624 ~~q~e~~---k~qa--~---------~q~~~~~~ 643 (705) ..+++.. .+++ . .+..-.-+ T Consensus 494 ~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 494 IAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 0000000 0000 0 00000000 No 111 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.30 E-value=5.3e-11 Score=76.91 Aligned_cols=496 Identities=12% Similarity=0.054 Sum_probs=214.0 Q ss_pred Ccchhhhhhcccccc--cCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPS--LQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRK 78 (705) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~ 78 (705) |.+=-.-+ +-++|= +--.+.|. +.+.+.++ +..-+...+||++.........+|. +-+. T Consensus 1 ~~~~~~~~-~~~~~~~~g~~~~p~~------v~~~d~~R------l~aY~l~~~~y~n~~~~~~~~lrg~------~~~~ 61 (527) T protein:vir:10 1 MGQDKRQY-GSTQQLRAGEANFPNA------VTDFDKAR------LASYRLYEDMYLTNTSDYQVILRGG------DEGD 61 (527) T ss_pred CCcccccc-CCCcCcCCccccCccc------CCHHHHHH------HHHHHHHHHHhcCchhheeeecCCc------cccc Confidence 33211111 112221 00112222 22222221 1112234567776421111111111 1122 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhh Q lcl|NC_021540. 79 QAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKV 158 (705) Q Consensus 79 ~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~ 158 (705) ..+-.+||. .-.++...-|-+.+....+...++.....++-.+.. ++=...++..-++++.-|-|++++.|+...++. T Consensus 62 ~r~~~~ps~-~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~-e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 62 QRPIYVPNG-EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDR-ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred cceeeehhh-HHhhCCcceeeccCccccccchhHHHHHHHHHHHHH-hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 333334444 333566666777777766666676677777664444 444445677888999999999999998554321 Q ss_pred -hhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEE Q lcl|NC_021540. 159 -TENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTI 237 (705) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~ 237 (705) +..+. .+ +|. .+.++. +..+.-.+.. T Consensus 140 ~R~~v~---~~-------------------DP~---------------------~~f~~e----------d~d~~~~v~~ 166 (527) T protein:vir:10 140 SRLSLH---EV-------------------DPS---------------------TYFPYE----------DPRYPGQVLG 166 (527) T ss_pred CCceEe---ec-------------------Ccc---------------------eeeeee----------cCCCCCceee Confidence 10000 00 000 000000 0000001222 Q ss_pred echhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE-E Q lcl|NC_021540. 238 CDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE-Y 316 (705) Q Consensus 238 V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~ 316 (705) |+.-+-|..|.. ....-+|.+..+.++ +|-..|.+ - ..-++|+.+ . T Consensus 167 v~~~~~~~~P~d---~~~~~~~ar~~~~~~-~l~~~g~~-------------------------~----~~G~~~yt~~~ 213 (527) T protein:vir:10 167 VYLVDEYPHPDS---EKKNEKCARVQKYMK-TLDDDGKP-------------------------V----PGGAIKYTEEL 213 (527) T ss_pred EEEeeeccCCcc---ccccceehhhhhhhh-hcCccccc-------------------------c----cCcceeeeece Confidence 222112333321 111123332222221 11000000 0 001233333 3 Q ss_pred EE--Eee-ecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 317 WG--YWD-IDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDA 393 (705) Q Consensus 317 w~--k~~-~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~ 393 (705) |. +++ .+.-..-.-.+.+.+++++++..++|+ +.+|+|+++-.|.+++.||+|-...++.+++++|+.++....+ T Consensus 214 w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~i 291 (527) T protein:vir:10 214 YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLI 291 (527) T ss_pred eeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHH Confidence 43 121 111010011234557888988888877 6789999999999999999999999999999999999999999 Q ss_pred HHhcCCCcEEeeccc-cCc---hhhhhhcCCcceeecCCcccccccccccC-ccchHHHHHHHHHHHHHHHHHhCcchHh Q lcl|NC_021540. 394 MARSANGQRGMSKNL-LDP---VNERKFKMGEDYKYNPGTNPVTDIIEHKY-PELPASSYNMLQMFTLEADALSGVKSFS 468 (705) Q Consensus 394 ~~~~~~~~~~~~~~a-v~~---~d~~~~~pg~~i~~~~~~~~~~~i~~~~~-~~i~~~~~~~l~~~~~~~~~~tGv~d~~ 468 (705) +..++.|.+...--. ++. .+.+...||++|....++. +..... +.+ ..+...+..+...+.+++|++..+ T Consensus 292 s~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak----~~~v~~~~~l-a~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 292 MVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNK----IYRVNGVASL-EPSQTHMTKAEEAMQQTKGIPDIA 366 (527) T ss_pred HHHhCCceeeecccccccccCCcCccccCCceeEecCCCcc----eeeccchhhh-HHHHHHHHHHHHHHHHhhcCCeee Confidence 999888877663221 221 1224456888887765432 222222 233 345666788889999999999999 Q ss_pred cCC--CccccchHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH-HHHHHHHhcCCceeEeEecCceeeechhhccc Q lcl|NC_021540. 469 QGL--TGDSLGTTTAGVQGVIGASGKRELGILRRLANG--LTEVAKK-ILAMNSVWLSDEEVIRITDEEFVQINRDNLVG 543 (705) Q Consensus 469 ~G~--~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~--~~~~~~~-~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~ 543 (705) .|. .++..|+.|-.++. +.. +.+.-+.. ++-+.++ ..+++..++..-+-+-+.+ -.-.. T Consensus 367 ~G~vD~s~~~SG~ALeL~L----~PL----lar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d--------~~~~~ 430 (527) T protein:vir:10 367 VGVVDAAVAESGIALDLKL----SAI----LSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDD--------ADKKL 430 (527) T ss_pred eccccCCcCcHHHHHHHHH----HHH----HHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCC--------Ccccc Confidence 994 34444554432221 111 11111111 0111111 0111111111101111111 00011 Q ss_pred ceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHH Q lcl|NC_021540. 544 SFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQ 623 (705) Q Consensus 544 ~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q 623 (705) .+.+......+.-+.+..+++..+.++... .....+..+.+..++......++.... ...++ . T Consensus 431 ~v~ivf~p~lP~D~~avie~v~tL~~aGi~-----S~~tAv~~L~~~~g~eD~E~E~~~I~~-------era~~-----a 493 (527) T protein:vir:10 431 TVTITFRDPKPVNSEKRFNQLLQLWEAGLI-----PAKKLTEELSKIMGFELTEEDFKQATE-------DKKTQ-----G 493 (527) T ss_pred ceEEEecccCCCCHHHHHHHHHHHHHcCch-----hHHHHHHHHHhccCCCChHHHHHHHHH-------HHHHH-----h Confidence 223333433444444455555444443211 122223344444443333222221110 00000 0 Q ss_pred HHHHHHH---HHHH--H---------HHHHHHHH Q lcl|NC_021540. 624 ELQMRIA---KLQA--E---------IQLMPYEA 643 (705) Q Consensus 624 ~~q~e~~---k~qa--~---------~q~~~~~~ 643 (705) ..+++.. .+++ . .+..-.-+ T Consensus 494 ~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 494 IAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 0000000 0000 0 00000000 No 112 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.30 E-value=1.6e-10 Score=74.34 Aligned_cols=446 Identities=10% Similarity=0.033 Sum_probs=201.0 Q ss_pred Ccc--hhhhhh-cccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCC---------CCC Q lcl|NC_021540. 1 MSD--INEEFL-EDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPK---------QQV 66 (705) Q Consensus 1 ~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~---------~~~ 66 (705) |-. |+.-.+ ...-+...+ ..+.+.+++....+ .+++.+++.+||.|.-.. .+. ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--------~~~~~~i~~~~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~ 70 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKKEST--------INLVKVIEHYILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDF 70 (479) T ss_pred CCCceecccceEeeccccCCh--------hHHHHHHHHHHhhh--hHHHHHHHHHHhccCCccccccccccccccccccc Confidence 321 111111 222222222 12223333333333 245567888999885321 011 111 Q ss_pred CCC--cCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCC Q lcl|NC_021540. 67 GRS--SVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGT 144 (705) Q Consensus 67 grs--~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~ 144 (705) .|+ +++.+-.+..|+.....| ||.+.-+ .+ +|.. ..++++..+. |+-...+..++++++..|. T Consensus 71 ~~~~~ki~~~~~~~Ivd~~~~~l----~g~p~~~--~~---~~~~----~~~~~~~~~~--n~~~~~~~~~~~~~~~~G~ 135 (479) T protein:vir:79 71 TKVNNKAINNYHKLLVDQKVGYS----VGNPIVF--NA---DDDN----LTKLLNDLLG--EEFDDTITELYLNASNKGV 135 (479) T ss_pred ccCcceeecchHHHHHHHHHhhh----hcCCcee--cc---CCHH----HHHHHHHHHh--cCHHHHHHHHHHHHHhcCe Confidence 222 577777777777666555 5554333 22 2222 2235554432 4444556788999999999 Q ss_pred eEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccc Q lcl|NC_021540. 145 VIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQE 224 (705) Q Consensus 145 gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 224 (705) |.+.+||+ T Consensus 136 ~~~~v~~d------------------------------------------------------------------------ 143 (479) T protein:vir:79 136 EWLHPYIN------------------------------------------------------------------------ 143 (479) T ss_pred EEEEEEeC------------------------------------------------------------------------ Confidence 98887652 Q ss_pred eeeeccCcceEEEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcccccccccccc Q lcl|NC_021540. 225 VIKTVKNQPEVTICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFT 302 (705) Q Consensus 225 ~~~~~~~~~~i~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 302 (705) ..+++++..++|.++++ |+.... ...+ +.+.+.... .+ T Consensus 144 ----~~~~~~i~~~~p~~~~~v~d~~~~~---~~~~-~ir~y~~~~--------~~------------------------ 183 (479) T protein:vir:79 144 ----RKGEFKYVIIPAEEAIPIWDSKRQR---ELVA-FIRFYYIED--------ID------------------------ 183 (479) T ss_pred ----CCCceEEEEEccceeEEEEeCCCCC---ceEE-EEEEEEEee--------cC------------------------ Confidence 02346788899999754 333211 1122 233332110 00 Q ss_pred ccccccCeEEEEEEEEE-----eeecCCCeeEE------EEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCc Q lcl|NC_021540. 303 FSDKARKKIVVYEYWGY-----WDIDGSGVTTP------IVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEA 371 (705) Q Consensus 303 ~~~~~~~~v~v~E~w~k-----~~~~~dg~~~~------~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g 371 (705) .+.+..+|+|.. +-..+++.... ................|.+.+.+||+++.. ..+|.| T Consensus 184 -----~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~s 253 (479) T protein:vir:79 184 -----GNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVS 253 (479) T ss_pred -----CceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecC-----CCCCCc Confidence 011222333321 00111111000 000011111112233344446677776643 456899 Q ss_pred hHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCch-hh-hhhcCCcceeecCCcccccccccccCccchHHHHH Q lcl|NC_021540. 372 DAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPV-NE-RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYN 449 (705) Q Consensus 372 ~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~-d~-~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~ 449 (705) .+..++++++.+|...|.+.+.+...++|.+++........ +. ...+.++++.+++++. +.+...+.-...... T Consensus 254 d~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~l~~~~~~~~~~~ 329 (479) T protein:vir:79 254 DLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGG----VDKLEINIPVEAKKE 329 (479) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCc----ceEEeccCCHHHHHH Confidence 99999999999999999999999998888776543211111 11 2234556666665543 344443333456667 Q ss_pred HHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEe Q lcl|NC_021540. 450 MLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRIT 529 (705) Q Consensus 450 ~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~ 529 (705) .++.+...+...+++++.+.+..++ .|+. ++...............+.|..+++++++.++.++.... T Consensus 330 ~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~--Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--------- 397 (479) T protein:vir:79 330 LLDRLEKNIIIFGQGVNPESQNTGD-KSGV--ALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG--------- 397 (479) T ss_pred HHHHHHHHHHHHhCccccccccccc-hhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--------- Confidence 7889999999999999988775543 3444 344444444444455555566666666666555443211 Q ss_pred cCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchh Q lcl|NC_021540. 530 DEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSP 609 (705) Q Consensus 530 ~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~ 609 (705) + ..++.. ...+..+...+.......+.+..+ .+ .++. ...+. .++...+. T Consensus 398 -~--~~~~~~----~i~i~f~~~~p~~~~~~a~~~~kl---~g-~iS~---et~l~---~l~~v~d~------------- 447 (479) T protein:vir:79 398 -N--KSYDYK----TVQITFNHSMIINEAEKIDMAAKS---TG-IVSD---ETIVS---NHPWVEDV------------- 447 (479) T ss_pred -C--Cccccc----cceEEeCCCCCcCHHHHHHHHHHH---hc-cCcH---HHHHH---hCCCCCCH------------- Confidence 0 011111 122333322221111111111111 11 1111 11111 11111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Q lcl|NC_021540. 610 QAQLEIQIKQLEAQELQMRIAKLQAEIQ--LMPY 641 (705) Q Consensus 610 ~~q~~~q~~q~~~q~~q~e~~k~qa~~q--~~~~ 641 (705) . .+.++.+++................ ..++ T Consensus 448 ~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 448 N--DELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred H--HHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 1 1111111111110000000000000 0000 No 113 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.30 E-value=1.6e-10 Score=74.27 Aligned_cols=459 Identities=10% Similarity=-0.018 Sum_probs=190.1 Q ss_pred ccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCCC---CCCcCCCHHHHHHHHHHHH Q lcl|NC_021540. 11 DTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQV---GRSSVQPKLIRKQAEWRYS 85 (705) Q Consensus 11 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---grs~~v~~~v~~~~e~~~~ 85 (705) =+.| .+-+.+......+...| ...+.....+.++..+||.|..... +...+ .+-++|.+-.+..|+.+.. T Consensus 1 ~~~~--~~~~~e~~~~~~~~~~l---~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~ 75 (486) T protein:vir:42 1 MTAP--LPGMEEIEDPAVVREEM---ISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAE 75 (486) T ss_pred CCCC--CCCCCCcccHHHHHHHH---HHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHh Confidence 1112 22222222222222222 2223334455566678999875321 11111 1113344555555554433 Q ss_pred HHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccc Q lcl|NC_021540. 86 ALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVF 165 (705) Q Consensus 86 ~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~ 165 (705) .| .| .-|. .+++.... ..++.+| ..|+--.....++++|++.|.+.+.||..... T Consensus 76 ~l--~~------~g~~--~~~~~~~~----~~~~~i~-~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~---------- 130 (486) T protein:vir:42 76 RQ--AV------EGFR--LGDADEAD----EELWQWW-QANNLDIEAPLGYTDAYVHGRSFITISKPDPQ---------- 130 (486) T ss_pred hh--cc------ccee--cCCCchhH----HHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcc---------- Confidence 33 11 1121 12222222 2333333 34554455678999999999998887532100 Q ss_pred ccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee- Q lcl|NC_021540. 166 QYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT- 244 (705) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~- 244 (705) .. .....+.++|..++|.+++ T Consensus 131 ---------------------------------------------~~-------------~~~~~~~~~i~~~~p~~~~~ 152 (486) T protein:vir:42 131 ---------------------------------------------LD-------------LGWDQNVPIIRVEPPTRMHA 152 (486) T ss_pred ---------------------------------------------cc-------------cccCCCeeEEEEecccceEE Confidence 00 0011345678889999866 Q ss_pred -eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeec Q lcl|NC_021540. 245 -IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDID 323 (705) Q Consensus 245 -~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~ 323 (705) |||... . ...+.+++.+ + ....+..+++|.. T Consensus 153 i~d~~~~----~-~~~~~~~~~~-----------~----------------------------~~~~~~~~~~y~~---- 184 (486) T protein:vir:42 153 EIDPRIN----R-VSKAIRVAYD-----------K----------------------------EGNEIQAATLYTP---- 184 (486) T ss_pred EEeCCCC----C-eEEEEEEEEe-----------c----------------------------CCCeEEEEEEEcC---- Confidence 555421 1 1122222210 0 0011333444432 Q ss_pred CCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_021540. 324 GSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE-LLSDNQKLIGALTRGMIDAMARSANGQR 402 (705) Q Consensus 324 ~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~-~~~d~Q~~iN~~~~~~~d~~~~~~~~~~ 402 (705) +. .++.+..++........|.+.|.+|+++|+..+..+..+|.|-+. .++++++.+|+.++.+.......+.|+. T Consensus 185 -~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~ 260 (486) T protein:vir:42 185 -ME---TIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR 260 (486) T ss_pred -Cc---EEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHH Confidence 11 111111222222223334455789999999888889999999987 5889999999999999999888888876 Q ss_pred Eee---ccccCch-----hhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHH---hCcchHhcCC Q lcl|NC_021540. 403 GMS---KNLLDPV-----NERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADAL---SGVKSFSQGL 471 (705) Q Consensus 403 ~~~---~~av~~~-----d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~---tGv~d~~~G~ 471 (705) .+- ...+... ..+...+|.++....+ . ..+.+.+.- ....+++.++..+..+ +++++...|. T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~q~~~~--~~e~~~~~l~~~i~~~s~~~~~p~~~fg~ 333 (486) T protein:vir:42 261 LIFGIKPEEIGVDSETGQTLFDAYLARILAFEDA-E----GKIQQFSAA--ELANFTNALDQIAKQVAAYTGLPPQYLST 333 (486) T ss_pred HhhcCCccccccccccccchhhhhhchhcccCCC-C----ceEEeeccc--CHHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 543 1111111 1123345555444321 1 122233322 2334556666666555 6788888875 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec Q lcl|NC_021540. 472 TGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI 551 (705) Q Consensus 472 ~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~ 551 (705) .+.. +.++.+++.....-........+.|..+++++++.++.+... ...+ .++. ...+.... T Consensus 334 ~~~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~-~~~~-------~d~~---------~i~v~w~~ 395 (486) T protein:vir:42 334 AADN-PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG-GDVP-------PDML---------RMETVWRD 395 (486) T ss_pred ccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCcc-------ccce---------eeeEEecC Confidence 4321 124444544444444444555666666777776665543211 0000 0010 11222222 Q ss_pred cchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhcc-chhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGM-PDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIA 630 (705) Q Consensus 552 ~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~-~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~ 630 (705) .......+..+.+..|.++....++... +.+ +.++ +...+.++....+ +.+.....-..+. T Consensus 396 ~~~~s~~~~ad~~~kl~~~~~g~~s~et----~~~---~lg~~~d~~~e~~~~~~e-----------~~~~~~~~~~~~~ 457 (486) T protein:vir:42 396 PSTPTYAAKADAATKLYGNGQGVIPRER----ARI---DMGYSVKEREEMRRWDEE-----------EAAMGLGLLGTMV 457 (486) T ss_pred CCCCCHHHHHHHHHHHHhcccCCCCHHH----HHh---cCCCChhHHHHHHHHHHH-----------HHHHHHHHHHHhh Confidence 2221222223333333332211111100 000 0010 0000000000000 0000000000000 Q ss_pred HH--HHHHHH-----HHHHHHHHHHHHHH Q lcl|NC_021540. 631 KL--QAEIQL-----MPYEAQAEAAKARK 652 (705) Q Consensus 631 k~--qa~~q~-----~~~~~q~e~a~a~~ 652 (705) .. ..+.+. ...+....++..-. T Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 458 DADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred cCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 00 000000 00000000000000 No 114 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.25 E-value=3.1e-10 Score=72.70 Aligned_cols=486 Identities=11% Similarity=0.059 Sum_probs=206.3 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHh--hHHhhHHHHHHHHHHHHhccCCCCCC-CCCCC----CCcCCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNA--KSTKDTQVAIIDDWLAQLNVTGAYKP-KQQVG----RSSVQP 73 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g----rs~~v~ 73 (705) |-+-++..+.+-.+. ++ .. .|. ++.+. ..-+..++.+.+.|..||.|...... ....| |..... T Consensus 3 ~~~~~k~~~~k~~~~-~~---~~----~~~-~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sl 73 (522) T protein:vir:47 3 LFQKVKDFFSRGRYY-MQ---TS----NLN-SILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHL 73 (522) T ss_pred hHHHHHHHHHHHHHH-hh---cc----cch-hccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceec Confidence 222222222221110 00 00 000 01100 11144556677899999987543210 11111 122222 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 74 KLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 74 ~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) +.-...++. +++| .|+-..-+.+ +|. ..+++++.++. .|+-...++.++..++-.|.+++|+||+. T Consensus 74 nl~~~i~~~-~A~l---v~~e~~~i~v-----~d~----~~~~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~ 139 (522) T protein:vir:47 74 PIARTASKK-IASL---VYNEQATITT-----KNE----ILQKFLDDMLT-NDRFNKNFERYLESCLALGGLAMRPYIDG 139 (522) T ss_pred chHHHHHHH-Hhhh---hcCCcceeec-----CCh----HHHHHHHHHHh-hcchHHHHHHHHHHhhccCCEEEEEEEcC Confidence 333333332 2333 3443333333 343 34456666554 34555678899999999999999999941 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) +++ T Consensus 140 -----------------------------------------------------------------------------~~~ 142 (522) T protein:vir:47 140 -----------------------------------------------------------------------------DKV 142 (522) T ss_pred -----------------------------------------------------------------------------Cce Confidence 234 Q ss_pred eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 234 EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 234 ~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) +|..|++..|++=-.-..+...|-++.+....... .--||.-+++-....... ..+.... ....-... T Consensus 143 ~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~---~~~~yt~lE~he~~~~~~------~~~~~~~---~~~~~~I~ 210 (522) T protein:vir:47 143 RVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGR---KNVYYTLVEFHEWVTADG------QETGSTN---DKKYYRIT 210 (522) T ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeccc---ceeEEEEEEEeeeccccc------ccccccc---cCCceEEE Confidence 56677777776411101123334333322221100 000111000000000000 0000000 00000111 Q ss_pred EEEEEEeeecCCCeeEEEEEE--EECCEEEecccCCCCC-CCcceEEe----eeeeecCcccCCchHHHhhHHHHHHHHH Q lcl|NC_021540. 314 YEYWGYWDIDGSGVTTPIVAS--WVDDVMIRLEKNPYPD-GKLPFVVV----PYLPVKDSVYGEADAELLSDNQKLIGAL 386 (705) Q Consensus 314 ~E~w~k~~~~~dg~~~~~~~~--~~g~~iL~~~~~p~~~-~~~Pfv~~----~~~~~~~~~~g~g~~~~~~d~Q~~iN~~ 386 (705) ++.|.-..-+.-|......-+ +.+ |. +...+.+ .+.+|+.+ +-....++.+|.|++..+++..+.+|.. T Consensus 211 n~ly~~~~~~~lG~~v~l~~~~e~~~---l~-~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~ 286 (522) T protein:vir:47 211 NELYRSDVNDVLGQRVNLSELDKYKN---LE-PVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRS 286 (522) T ss_pred EEEeecCCCcccCccccccccccccC---CC-CceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHH Confidence 122211000000100000000 000 00 0001111 22234332 2223458889999999999999999999 Q ss_pred HHHHHHHHHhcCCCcEEeeccccCchhhh---------hhcCCc-cee-ecCCcccccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 387 TRGMIDAMARSANGQRGMSKNLLDPVNER---------KFKMGE-DYK-YNPGTNPVTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 387 ~~~~~d~~~~~~~~~~~~~~~av~~~d~~---------~~~pg~-~i~-~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) ++++++-+.+ +..++.+++.++...... .+.++. +++ ++.+......+...++.--...+...++.+. T Consensus 287 ~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l 365 (522) T protein:vir:47 287 YDEFMWEVRM-GQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGL 365 (522) T ss_pred HHHHHHHHHh-ccceeecchHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHH Confidence 9999998875 555788887776432111 111122 222 1211122234555543333345666778888 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCceeEeEecCce Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWL--SDEEVIRITDEEF 533 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~--~~~~~iri~~~~~ 533 (705) ..+....|++.-..|.++.. ..||+++....+..-.....+.+.+..+++++...++.+...+. ..... + T Consensus 366 ~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~-----~-- 437 (522) T protein:vir:47 366 KLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIP-----E-- 437 (522) T ss_pred HHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCC-----C-- Confidence 88888899998888876543 46888888777777777788888998899999888887764321 11000 0 Q ss_pred eeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHH Q lcl|NC_021540. 534 VQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQL 613 (705) Q Consensus 534 v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~ 613 (705) .....+..+++...-....++..+++.++. . ++.. .-+++..++.+ .+++. T Consensus 438 --------~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG-~-~s~e------~~i~~~~g~~e-------------eea~~ 488 (522) T protein:vir:47 438 --------LDDISVNLDDGVFTDRHAELDYWAKMVAAG-F-STKK------RAIGKTLNISG-------------VEAEK 488 (522) T ss_pred --------cceeEEEcCCCCCCCHHHHHHHHHHHHhcC-C-CCHH------HHHHhcCCCCh-------------HHHHH Confidence 011223334333322333344444433321 1 1100 00111111110 00000 Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHH----HHHHHHHHHHH Q lcl|NC_021540. 614 EIQIKQLEAQELQMRIAKL---QAEI----QLMPYEAQAEA 647 (705) Q Consensus 614 ~~q~~q~~~q~~q~e~~k~---qa~~----q~~~~~~q~e~ 647 (705) ... +++.|...+ .... ..++.....+- T Consensus 489 el~-------ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 489 ELN-------AINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHH-------HHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 000 000000000 0000 00000000000 No 115 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.25 E-value=3.3e-10 Score=72.59 Aligned_cols=458 Identities=10% Similarity=0.032 Sum_probs=189.0 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCCCC---CCCcCCCHHHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQQV---GRSSVQPKLIRKQAEWR 83 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---grs~~v~~~v~~~~e~~ 83 (705) |+..-| .++.+..+.++..|.+.+..-. .+.++..+||.|.-.. .+...+ .+-++|.+-.+..|+.+ T Consensus 1 ~~~~~~-~~~~~~~~~~~~~l~~~~~~~~-------~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~ 72 (484) T protein:vir:77 1 MTSPLQ-KQENVDPEKAREEMLNLFTERT-------QDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAI 72 (484) T ss_pred CCCccc-ccCCCCHHHHHHHHHHHHHHHH-------HHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHH Confidence 443333 3455666667777776665322 2334567899986422 111111 11124455555555555 Q ss_pred HHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccc Q lcl|NC_021540. 84 YSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVP 163 (705) Q Consensus 84 ~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~ 163 (705) ...|. | .| |. .++|... ...++.+| ..|+--.....++++|++.|.+.+.||+.... T Consensus 73 ~~~l~--~-~g-----~~--~~~~~~~----~~~l~~i~-~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~-------- 129 (484) T protein:vir:77 73 AARQE--L-EG-----FR--LGGADKA----DEQLWDWW-QANDLDIESTLGHTDSLVHGRSYITISKPDPN-------- 129 (484) T ss_pred Hhhhc--c-Cc-----ee--cCCcchh----HHHHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEecCCCC-------- Confidence 44431 1 11 11 1233332 23454444 34544455678999999999999988763110 Q ss_pred ccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhe Q lcl|NC_021540. 164 VFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNV 243 (705) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~ 243 (705) . ........++|..++|.++ T Consensus 130 -----------------------------------------------~-------------~~~~~~~~~~i~~~~p~~~ 149 (484) T protein:vir:77 130 -----------------------------------------------I-------------DPGVDPEVPIIRVEPPTNL 149 (484) T ss_pred -----------------------------------------------c-------------ccccccccceEEEecccee Confidence 0 0001123567888899998 Q ss_pred e--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEee Q lcl|NC_021540. 244 T--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWD 321 (705) Q Consensus 244 ~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~ 321 (705) + |||.. .. ..+ +.+.+.+. + ...+..+++|.. T Consensus 150 ~~~~D~~~-~~---~~~-a~~~~~~~------------------------------~---------~~~~~~~~~y~~-- 183 (484) T protein:vir:77 150 YAQIDPRT-RQ---VMR-AIRAIEDE------------------------------E---------GNEVIGATLYLP-- 183 (484) T ss_pred EEEecCCC-Cc---eEE-EEEEEEee------------------------------c---------CCcEEEEEEEec-- Confidence 5 55532 11 111 22222110 0 001122233321 Q ss_pred ecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHH-HhhHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021540. 322 IDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAE-LLSDNQKLIGALTRGMIDAMARSANG 400 (705) Q Consensus 322 ~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~-~~~d~Q~~iN~~~~~~~d~~~~~~~~ 400 (705) +. .+.....++.....+..|-+.|.+|+++|+..+..+..+|.|.+. .++++++.+|..++.+.......+.| T Consensus 184 ---~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p 257 (484) T protein:vir:77 184 ---NN---TVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVP 257 (484) T ss_pred ---Ce---EEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 10 011111112111122233444789999999888889999999886 58999999999999999999888887 Q ss_pred cEEeecccc-C---c-----hhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHH---hCcchHh Q lcl|NC_021540. 401 QRGMSKNLL-D---P-----VNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADAL---SGVKSFS 468 (705) Q Consensus 401 ~~~~~~~av-~---~-----~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~---tGv~d~~ 468 (705) +..+- |.- + . ...+...+|.++... +.. ..+.+.+..+ ...++..++..+..+ +++++.. T Consensus 258 ~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~q~~~~~--~e~~~~~l~~~i~~~s~~~~~p~~~ 329 (484) T protein:vir:77 258 QRLLF-GVKGEELGVDPETGQTLFDAYLARILAFE-DHE----SKAQQFSAAE--LRNFVDALDALDRKAAAYTGLPPYY 329 (484) T ss_pred HHHHh-CCCcchhcccccccchhhhhhhhhhcccC-CCC----ceeEeecCCC--hHHHHHHHHHHHHHHhcccCCCHHH Confidence 76543 221 1 0 111223344444332 111 1222332221 223445555555554 6788888 Q ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEE Q lcl|NC_021540. 469 QGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIK 548 (705) Q Consensus 469 ~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~ 548 (705) +|..+.. +.++.++......-........+.|..+++++++.++.+ ...... . .++. ...+. T Consensus 330 fg~~~~n-~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~----~~~~~~-~---~~~~---------~i~v~ 391 (484) T protein:vir:77 330 LSFSSEN-PASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKV----MNGGDI-P---PEYY---------RMESI 391 (484) T ss_pred hccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hCCCCc-c---cccc---------cceEE Confidence 8854321 123434443333333333444455555555555554432 211000 0 0000 11222 Q ss_pred eeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccc-hhhhhhhcccccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 549 LSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMP-DLSKMISKYNPEPSPQAQLEIQIKQLEAQELQM 627 (705) Q Consensus 549 v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~-~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~ 627 (705) ..........+....+..|.+.....++.. .+ .+..++- ...+.++....+ +..+. .+.+.. T Consensus 392 w~~~~~~s~~~~ad~~~kl~~~g~gi~s~e----t~---~~~l~~~~~~~~e~~~~~~e---------e~~~~-~~~~~~ 454 (484) T protein:vir:77 392 WRDPSTPTYAAKADAATKLYNNGQGVIPKE----RA---RIDMGYSITEREEMRKWDEE---------EQAQG-LGLMGT 454 (484) T ss_pred ecCCCCCCHHHHHHHHHHHHhccCCCCCHH----HH---HhcCCCChhHHHHHHHHHHH---------HHHHH-HHHHhh Confidence 222111112222222223322211111100 00 0101110 000000000000 00000 000000 Q ss_pred HHHH-HH----HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 628 RIAK-LQ----AEIQLMPYEAQAEAAKARKA 653 (705) Q Consensus 628 e~~k-~q----a~~q~~~~~~q~e~a~a~~~ 653 (705) .... .+ .... ...+.+......... T Consensus 455 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 455 MFGTDPSGGGNPDNP-ETPEPQPNPAEEAAA 484 (484) T ss_pred hccccccCCCCCCCC-CcccccCCCccccCC Confidence 0000 00 0000 000000000000000 No 116 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.17 E-value=8.8e-10 Score=70.23 Aligned_cols=459 Identities=11% Similarity=0.002 Sum_probs=189.6 Q ss_pred CcchhhhhhcccccccCCCCCCHH--HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCCCC---CCcCCC Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKP--KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQVG---RSSVQP 73 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g---rs~~v~ 73 (705) |..|+...-..+.. .+.++++. +|..|.+.+ .....+.++..+||.|..... +...+- +-.+|. T Consensus 1 ~~~~~~~~~~~~~~--~~~l~~~e~~~i~~L~~~~-------~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~ 71 (504) T protein:vir:99 1 MTEETTSASKFTFR--IPELNDDVVDKVNGLYQQL-------VDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVL 71 (504) T ss_pred CCccCCcccccccc--cCCCCHHHHHHHHHHHHHH-------HHHhHHHHHHHHHHhccccchhccccccHHHHHHhhcc Confidence 66665544433322 44444443 333333333 223344556678998764221 111110 111233 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecc Q lcl|NC_021540. 74 KLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCL 153 (705) Q Consensus 74 ~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~ 153 (705) +-.+..|+.+.-.| ++.| |. .+++.... ..+..+ ...|+--.....++++|++.|.+++.|| .. T Consensus 72 n~~~~iVd~~a~rl---~~~G-----f~--~~d~~~~~----~~l~~i-~~~N~ld~~~~~~~~~a~iyG~af~~v~-~~ 135 (504) T protein:vir:99 72 GWSAKAVDTLARRC---NLES-----FV--WPDGDYGS----IGGPDV-WDENFFATKANNAMVSSLIHGPAFLINT-EG 135 (504) T ss_pred CcHHHHHHHHHhhh---ccce-----ee--CCCCChhh----HHHHHH-HHhcChhhHHHHHHHHHHhhCceeEEEe-cC Confidence 33343444332211 1111 11 12222222 233333 3344433346688999999999988763 00 Q ss_pred hhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcc Q lcl|NC_021540. 154 EETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQP 233 (705) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 233 (705) + ....++ T Consensus 136 ~-------------------------------------------------------------------------d~~~~~ 142 (504) T protein:vir:99 136 G-------------------------------------------------------------------------AGEPDS 142 (504) T ss_pred C-------------------------------------------------------------------------CCCcee Confidence 0 001245 Q ss_pred eEEEechhhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeE Q lcl|NC_021540. 234 EVTICDYHNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKI 311 (705) Q Consensus 234 ~i~~V~~~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 311 (705) .|..++|.+++ |||... .+ ...+ +++ +.+. ++ .. T Consensus 143 ~I~~~sP~~~~~iyD~~~~-~~---~~a~-~~~-----------~~d~------------------~g----------~~ 178 (504) T protein:vir:99 143 LIHVKSAMQATGEWNSRRN-AM---DSLL-SIT-----------SRDA------------------EG----------HP 178 (504) T ss_pred EEEEeccceeEEEEeCCCC-ce---eEEE-EEE-----------EecC------------------CC----------eE Confidence 68888999874 776532 11 1111 111 0000 00 01 Q ss_pred EEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHH Q lcl|NC_021540. 312 VVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGM 390 (705) Q Consensus 312 ~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~ 390 (705) ...++|.. +. .+.....++.....+..|.+.| +|+|+++..+..+..+|.|-+ +.++++++.+|+.++.+ T Consensus 179 ~~~~~y~~------~~--~~~~~~~~~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~ 249 (504) T protein:vir:99 179 TGIALYED------GV--TVTADMDDDGDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRM 249 (504) T ss_pred EEEEEEcC------Cc--EEEEEEcCCceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHH Confidence 11222221 00 0011111111111222233334 799999988888888998855 68999999999999999 Q ss_pred HHHHHhcCCCcEEeecccc---------CchhhhhhcCCcceeecCCccc------ccccccccCccchHHHHHHHHHHH Q lcl|NC_021540. 391 IDAMARSANGQRGMSKNLL---------DPVNERKFKMGEDYKYNPGTNP------VTDIIEHKYPELPASSYNMLQMFT 455 (705) Q Consensus 391 ~d~~~~~~~~~~~~~~~av---------~~~d~~~~~pg~~i~~~~~~~~------~~~i~~~~~~~i~~~~~~~l~~~~ 455 (705) +......+.|+..+- |+- +....+....+.++.+...... ...+..++..++. .+...+..+. T Consensus 250 ~~~~e~~a~p~r~i~-G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~-~~~~~l~~~i 327 (504) T protein:vir:99 250 DGHADVYSFPQLILL-GADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQ-PHIEMLEQIA 327 (504) T ss_pred HHHHHHhcchhhhhc-cCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChH-HHHHHHHHHH Confidence 999988888876542 221 1112344455666655432211 1112222222222 2334445555 Q ss_pred HHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceee Q lcl|NC_021540. 456 LEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQ 535 (705) Q Consensus 456 ~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~ 535 (705) ..+-.+||+++..+|..++..+.+|.++......-........+.|..++++++++++.+.... +.... ++.. T Consensus 328 ~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~-~~~~~------~~~~ 400 (504) T protein:vir:99 328 MMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGL-DRIPP------EWKT 400 (504) T ss_pred HHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Ccccc------cccc Confidence 5555569999999996654433455556544444444456666777777777777776654432 11000 0000 Q ss_pred echhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhc------------hhHHHHHHHHHHHhhhccchhhhhhhcc Q lcl|NC_021540. 536 INRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSL------------PFDMTKLILGEIAKLRGMPDLSKMISKY 603 (705) Q Consensus 536 i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~------------~~~~~~~il~~l~e~~~~~~~~~~~~~~ 603 (705) ..+.-.........+....+..|.++..... .+.....+..+..+..+...+. .+... T Consensus 401 ---------~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~-~l~~~ 470 (504) T protein:vir:99 401 ---------IDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIE-ALNRR 470 (504) T ss_pred ---------ceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHH-HHhcc Confidence 0011111111111111222222222211100 0111111111111100000000 00000 Q ss_pred cc--------cchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 604 NP--------EPSPQAQLEIQIKQLEAQELQMRIAKLQAEI 636 (705) Q Consensus 604 ~~--------q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~ 636 (705) .+ ...+..+......-+...+. ..+- T Consensus 471 ~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p-------~~~~ 504 (504) T protein:vir:99 471 QQEAATAGEDQDQGAGEPPANEPPAALGRP-------TLVG 504 (504) T ss_pred cCCCCCCCCCCCcCCCCCCCCCCCccCCCc-------ccCC Confidence 00 00000000000000000000 0000 No 117 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.16 E-value=1e-09 Score=69.89 Aligned_cols=433 Identities=12% Similarity=0.053 Sum_probs=184.7 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQ--QVGRS---SVQPKLIRKQAEWRYSALSE 89 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~grs---~~v~~~v~~~~e~~~~~l~~ 89 (705) +|..+-+.++..|... |..++.+.++..+||.|.... .++. .+.|+ ++|.+-.+..|+.....|. T Consensus 1 ~~~~t~~~~~~~l~~~-------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKR-------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 4444434456655443 333455556777999997522 2222 12333 4677777777777666542 Q ss_pred hhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccccc Q lcl|NC_021540. 90 PFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVE 169 (705) Q Consensus 90 ~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~ 169 (705) ++.+. + + ...|.+.......+ | ..|+--.....+++++++.|.+.+.+| . T Consensus 73 ----~~~~~-~-~-~~~d~~~~~~~~~i----~-~~N~~d~~~~~~~~~a~i~G~ay~~v~-~----------------- 122 (456) T protein:vir:10 73 ----PNGIT-V-G-GSADSDLALRARRI----W-RDNRMDSVCKQWVKYGLDFGESYLTCW-R----------------- 122 (456) T ss_pred ----cCCee-c-C-CCCCcchHHHHHHH----H-HhcChhhHHHHHHHHHhhcCeeEEEEe-e----------------- Confidence 22332 1 1 12233332222222 2 234333334567788888888866442 0 Q ss_pred CCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCC Q lcl|NC_021540. 170 ATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDP 247 (705) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp 247 (705) ...+.++|..++|.+++ ||| T Consensus 123 ----------------------------------------------------------d~~g~~~i~~~~p~~~~~i~d~ 144 (456) T protein:vir:10 123 ----------------------------------------------------------RDDGTATITADSPETMVVSVDP 144 (456) T ss_pred ----------------------------------------------------------CCCCceEEEEEccceeEEEEcC Confidence 01245678888999854 555 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE-EEEEeeecCCC Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE-YWGYWDIDGSG 326 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~w~k~~~~~dg 326 (705) .... ...++++ ++.+.+ ....+. ..+ ..+ .-+..+. +|..... T Consensus 145 ~~~~---~~~~~i~-~~~~~d--------~~~~~~----------~~~-------~~~---~~~~~~~~~~~~~~~---- 188 (456) T protein:vir:10 145 LQPW---RIRAAMR-WWRDLD--------AESDFA----------IVW-------SGD---GWQKFARPCFVQSSS---- 188 (456) T ss_pred CCCc---ceEEEEE-EEEecC--------CceeEE----------EEE-------ecc---ceeEEEEEEEEeecc---- Confidence 4321 2222222 221110 000000 000 000 0011111 1111110 Q ss_pred eeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeec Q lcl|NC_021540. 327 VTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSK 406 (705) Q Consensus 327 ~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~ 406 (705) ......+.++........|...+..|++++. +..|.|.++.++++++.+|..++.++......+.|+..+- T Consensus 189 --~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~- 259 (456) T protein:vir:10 189 --RRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK- 259 (456) T ss_pred --cceeeeecCCceeeccccCCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh- Confidence 1112222333332223333333555665542 3468899999999999999999998877777766654432 Q ss_pred cc-------------cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCc Q lcl|NC_021540. 407 NL-------------LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTG 473 (705) Q Consensus 407 ~a-------------v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~ 473 (705) |. ++..+.+...+|.++..+++.. +..++..++ ..+...+..+...+-..||+++...|... T Consensus 260 G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~q~~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~ 334 (456) T protein:vir:10 260 STEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD----IWESQANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS 334 (456) T ss_pred ccCcccccccccccccchhhhhhhhccccccCCCCcc----eEEecccCh-hHHHHHHHHHHHHHHhccCCChHHhcccc Confidence 11 1112223345565655554432 222222222 33445566666666677899999888543 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccc Q lcl|NC_021540. 474 DSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISN 553 (705) Q Consensus 474 ~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~ 553 (705) +..| +.+++.....-........+.|..+++++++.++. ...... + ..+.+.-.... T Consensus 335 ~N~S--g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~----~~g~~~--------~---------~~~~v~w~~~~ 391 (456) T protein:vir:10 335 ANQS--AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ----IEGESV--------E---------DTVDVSFESPD 391 (456) T ss_pred cChH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hcCCCc--------c---------cceeEEecCCC Confidence 2233 44455444444444555566666666666665543 222110 0 01222222221 Q ss_pred hhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 554 AETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQ 633 (705) Q Consensus 554 ~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~q 633 (705) ........+.+..|.++ + .+.. .++ .+..|+.. +...+.+++ +...+.. ++ T Consensus 392 ~~~~~~~ada~~kl~~~-g--i~~~---~~~---~~~lg~~~------------~~i~~~e~e-------r~~~e~~-~~ 442 (456) T protein:vir:10 392 RVTLGEKYSAASLAKAA-G--ESWA---SIR---RNILNYNA------------DQIKQDDLD-------RAREQIT-LF 442 (456) T ss_pred CcCHHHHHHHHHHHHHc-C--CChH---HHH---HhhCCCCH------------HHHHHHHHH-------HHHHHHH-HH Confidence 11112222222332222 1 1110 011 11112110 000000110 0000000 00 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021540. 634 AEIQLMPYEAQAEAAK 649 (705) Q Consensus 634 a~~q~~~~~~q~e~a~ 649 (705) +. ......+-+..+ T Consensus 443 ~~--~~~~~~~~~~~~ 456 (456) T protein:vir:10 443 AG--NPVQRPQEDGSR 456 (456) T ss_pred hh--hhhhcCCCCCCC Confidence 00 000000000011 No 118 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.16 E-value=1e-09 Score=69.89 Aligned_cols=433 Identities=12% Similarity=0.053 Sum_probs=184.7 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQ--QVGRS---SVQPKLIRKQAEWRYSALSE 89 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~grs---~~v~~~v~~~~e~~~~~l~~ 89 (705) +|..+-+.++..|... |..++.+.++..+||.|.... .++. .+.|+ ++|.+-.+..|+.....|. T Consensus 1 ~~~~t~~~~~~~l~~~-------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKR-------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 4444434456655443 333455556777999997522 2222 12333 4677777777777666542 Q ss_pred hhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccccc Q lcl|NC_021540. 90 PFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVE 169 (705) Q Consensus 90 ~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~ 169 (705) ++.+. + + ...|.+.......+ | ..|+--.....+++++++.|.+.+.+| . T Consensus 73 ----~~~~~-~-~-~~~d~~~~~~~~~i----~-~~N~~d~~~~~~~~~a~i~G~ay~~v~-~----------------- 122 (456) T protein:vir:10 73 ----PNGIT-V-G-GSADSDLALRARRI----W-RDNRMDSVCKQWVKYGLDFGESYLTCW-R----------------- 122 (456) T ss_pred ----cCCee-c-C-CCCCcchHHHHHHH----H-HhcChhhHHHHHHHHHhhcCeeEEEEe-e----------------- Confidence 22332 1 1 12233332222222 2 234333334567788888888866442 0 Q ss_pred CCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCC Q lcl|NC_021540. 170 ATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDP 247 (705) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp 247 (705) ...+.++|..++|.+++ ||| T Consensus 123 ----------------------------------------------------------d~~g~~~i~~~~p~~~~~i~d~ 144 (456) T protein:vir:10 123 ----------------------------------------------------------RDDGTATITADSPETMVVSVDP 144 (456) T ss_pred ----------------------------------------------------------CCCCceEEEEEccceeEEEEcC Confidence 01245678888999854 555 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE-EEEEeeecCCC Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE-YWGYWDIDGSG 326 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E-~w~k~~~~~dg 326 (705) .... ...++++ ++.+.+ ....+. ..+ ..+ .-+..+. +|..... T Consensus 145 ~~~~---~~~~~i~-~~~~~d--------~~~~~~----------~~~-------~~~---~~~~~~~~~~~~~~~---- 188 (456) T protein:vir:10 145 LQPW---RIRAAMR-WWRDLD--------AESDFA----------IVW-------SGD---GWQKFARPCFVQSSS---- 188 (456) T ss_pred CCCc---ceEEEEE-EEEecC--------CceeEE----------EEE-------ecc---ceeEEEEEEEEeecc---- Confidence 4321 2222222 221110 000000 000 000 0011111 1111110 Q ss_pred eeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeec Q lcl|NC_021540. 327 VTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSK 406 (705) Q Consensus 327 ~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~ 406 (705) ......+.++........|...+..|++++. +..|.|.++.++++++.+|..++.++......+.|+..+- T Consensus 189 --~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~- 259 (456) T protein:vir:10 189 --RRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK- 259 (456) T ss_pred --cceeeeecCCceeeccccCCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh- Confidence 1112222333332223333333555665542 3468899999999999999999998877777766654432 Q ss_pred cc-------------cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCc Q lcl|NC_021540. 407 NL-------------LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTG 473 (705) Q Consensus 407 ~a-------------v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~ 473 (705) |. ++..+.+...+|.++..+++.. +..++..++ ..+...+..+...+-..||+++...|... T Consensus 260 G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~q~~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~ 334 (456) T protein:vir:10 260 STEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD----IWESQANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS 334 (456) T ss_pred ccCcccccccccccccchhhhhhhhccccccCCCCcc----eEEecccCh-hHHHHHHHHHHHHHHhccCCChHHhcccc Confidence 11 1112223345565655554432 222222222 33445566666666677899999888543 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccc Q lcl|NC_021540. 474 DSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISN 553 (705) Q Consensus 474 ~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~ 553 (705) +..| +.+++.....-........+.|..+++++++.++. ...... + ..+.+.-.... T Consensus 335 ~N~S--g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~----~~g~~~--------~---------~~~~v~w~~~~ 391 (456) T protein:vir:10 335 ANQS--AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ----IEGESV--------E---------DTVDVSFESPD 391 (456) T ss_pred cChH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hcCCCc--------c---------cceeEEecCCC Confidence 2233 44455444444444555566666666666665543 222110 0 01222222221 Q ss_pred hhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 554 AETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQ 633 (705) Q Consensus 554 ~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~q 633 (705) ........+.+..|.++ + .+.. .++ .+..|+.. +...+.+++ +...+.. ++ T Consensus 392 ~~~~~~~ada~~kl~~~-g--i~~~---~~~---~~~lg~~~------------~~i~~~e~e-------r~~~e~~-~~ 442 (456) T protein:vir:10 392 RVTLGEKYSAASLAKAA-G--ESWA---SIR---RNILNYNA------------DQIKQDDLD-------RAREQIT-LF 442 (456) T ss_pred CcCHHHHHHHHHHHHHc-C--CChH---HHH---HhhCCCCH------------HHHHHHHHH-------HHHHHHH-HH Confidence 11112222222332222 1 1110 011 11112110 000000110 0000000 00 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_021540. 634 AEIQLMPYEAQAEAAK 649 (705) Q Consensus 634 a~~q~~~~~~q~e~a~ 649 (705) +. ......+-+..+ T Consensus 443 ~~--~~~~~~~~~~~~ 456 (456) T protein:vir:10 443 AG--NPVQRPQEDGSR 456 (456) T ss_pred hh--hhhhcCCCCCCC Confidence 00 000000000011 No 119 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.15 E-value=1.2e-09 Score=69.41 Aligned_cols=445 Identities=12% Similarity=0.033 Sum_probs=187.1 Q ss_pred hcccccccCCCCCCHH--HHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCC--CCCCC--CC-CcCCCHHHHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKP--KVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYK--PKQQV--GR-SSVQPKLIRKQAE 81 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~--gr-s~~v~~~v~~~~e 81 (705) |--.+-..++.+++++ ++..|.+.+..- ..+.+...+||.|..... +...+ -| -+.|.+-.+..|+ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~-------~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd 73 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENL-------RWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVD 73 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHH-------hhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHH Confidence 2111111133444444 344444433332 233445568998874321 11110 00 0123333333333 Q ss_pred HHHHHHHHhhcCCCCEEEEe-CCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhh Q lcl|NC_021540. 82 WRYSALSEPFLNDENIFSIA-PKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTE 160 (705) Q Consensus 82 ~~~~~l~~~f~~~~~~~~~~-p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~ 160 (705) .+.-.| .+--|. |-+.+|. ..+..+ ...|+--.....++++||+.|.+++.|+... T Consensus 74 ~~a~rl--------~~~Gf~~~d~~~~~-------~~l~~i-w~~N~ld~~~~~~~~~al~~G~sf~~V~~~~------- 130 (474) T protein:vir:81 74 ALARRC--------NLEGFVWPDGDLDS-------LGGTEV-VDDNHLLSEIDSAIVAAMQHGPAFLINTVGE------- 130 (474) T ss_pred HHHhhh--------cccceECCCCCccc-------hHHHHH-HHhcChhHHHHHHHHHHHhhCceeEEEecCC------- Confidence 332221 111121 2112211 123223 3334433345678899999999988775310 Q ss_pred cccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEech Q lcl|NC_021540. 161 NVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDY 240 (705) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~ 240 (705) .....|.|..++| T Consensus 131 -------------------------------------------------------------------d~~~~~~i~~~sp 143 (474) T protein:vir:81 131 -------------------------------------------------------------------DDEPEALIHVKDA 143 (474) T ss_pred -------------------------------------------------------------------CCCceeEEEEecc Confidence 0122467888899 Q ss_pred hhee--eCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEE Q lcl|NC_021540. 241 HNVT--IDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWG 318 (705) Q Consensus 241 ~~~~--~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~ 318 (705) .+++ |||... .+. + .+.+... +.+ +. ....-+|. T Consensus 144 ~~~~~~~D~~~~-~~~-~--al~~~~~----------~~~--------------------g~----------~~~~~ly~ 179 (474) T protein:vir:81 144 SEATGEWNRRRR-GLN-N--LLSIIDK----------DKE--------------------GK----------VLSLALYL 179 (474) T ss_pred ceEEEEEeCCCC-cce-e--eeEEEEE----------cCC--------------------Cc----------EEEEEEEe Confidence 9877 777532 121 1 1111100 000 00 00111111 Q ss_pred EeeecCCCeeEEEEEEEEC-CE--EEecccCCCCCCCcceEEeeeeeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 319 YWDIDGSGVTTPIVASWVD-DV--MIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADA-ELLSDNQKLIGALTRGMIDAM 394 (705) Q Consensus 319 k~~~~~dg~~~~~~~~~~g-~~--iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~-~~~~d~Q~~iN~~~~~~~d~~ 394 (705) . +.. +.+...+ +. .....++|+ | .|+|+++..+.....+|.|-+ +.++++|+.+|+.+..+.... T Consensus 180 ~------~~~--~~~~~~~~~~~w~~~~~~~~~--g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~ 248 (474) T protein:vir:81 180 D------NET--VTAQRDKATLKWQVDRDEHVY--G-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHM 248 (474) T ss_pred C------CcE--EEEEEcCccceeeeccCCCCC--C-cceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 0 000 0000001 11 112234554 4 699999999998888998855 799999999999999999999 Q ss_pred HhcCCCcEEeecccc---------CchhhhhhcCCcceeecCCcccc------cccccccCccchHHHHHHHHHHHHHHH Q lcl|NC_021540. 395 ARSANGQRGMSKNLL---------DPVNERKFKMGEDYKYNPGTNPV------TDIIEHKYPELPASSYNMLQMFTLEAD 459 (705) Q Consensus 395 ~~~~~~~~~~~~~av---------~~~d~~~~~pg~~i~~~~~~~~~------~~i~~~~~~~i~~~~~~~l~~~~~~~~ 459 (705) ...+.|+..+- |+- ...+.+....+.++.+..+.+.. ..+.-++..++. .+...+..+...+- T Consensus 249 e~~a~pqr~i~-G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~-~~~~~l~~~~~~~a 326 (474) T protein:vir:81 249 DVFSYPEFWLL-GADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPD-AHWSDINGLAKLFA 326 (474) T ss_pred HHhcchhheee-cCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChh-HHHHHHHHHHHHHH Confidence 99999987653 221 11223444455565554432211 112222222222 23334455555555 Q ss_pred HHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechh Q lcl|NC_021540. 460 ALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRD 539 (705) Q Consensus 460 ~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~ 539 (705) ..||++...+|......+.+|.++......-........+.|..++++++++.+.+.-.+--++ +.. ++. T Consensus 327 ~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~----~~~-~~~----- 396 (474) T protein:vir:81 327 REASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDE----IPD-EWK----- 396 (474) T ss_pred hhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc----cch-hhc----- Confidence 6789999999854222234555565544444444455666677777777777665442221110 000 000 Q ss_pred hcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHH Q lcl|NC_021540. 540 NLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQ 619 (705) Q Consensus 540 ~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q 619 (705) ...+.-.........+....+..+.++.....+.. ++. +..|+... ..........+ T Consensus 397 ----~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~----~~~---~~lg~t~~------------~i~~~~~~~~~ 453 (474) T protein:vir:81 397 ----SIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETE----VGL---ELIGLTPQ------------QARRAMADKRR 453 (474) T ss_pred ----cceeEecCCCccCHHHHHHHHHHHHhcccCCCcHH----HHH---hhcCCCHH------------HHHHHHHHHHH Confidence 11111111111122233333334444321111111 111 11222100 00000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 620 LEAQELQMRIAKLQAEIQLMPYEAQ 644 (705) Q Consensus 620 ~~~q~~q~e~~k~qa~~q~~~~~~q 644 (705) + +.+..+..+.+ .......+| T Consensus 454 ~---~~~~~~~~l~~-~~~~~~~aq 474 (474) T protein:vir:81 454 V---QGRGTLQALID-RSNNGATAQ 474 (474) T ss_pred H---hHHHHHHHHHh-cCCCCCCCC Confidence 0 01111111000 000111111 No 120 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.14 E-value=1.4e-09 Score=69.19 Aligned_cols=474 Identities=11% Similarity=0.011 Sum_probs=198.8 Q ss_pred CcchhhhhhcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHH-HHHHHHH-HHhccC--CCCCCCCCCCCCcCCCHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQV-AIIDDWL-AQLNVT--GAYKPKQQVGRSSVQPKLI 76 (705) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~--~~~~~~~~~grs~~v~~~v 76 (705) |-.-+..-+..+.+... +..+.. .+. .+ ....+|. ..|.+. +...++.. .+.++..+.- T Consensus 7 ~~~~i~~w~~~~~~~~~--------~~~~~~----~~~----~~~~~~~~~~~~~~~~~~w~~~~~~~~-~~~~~~~~l~ 69 (518) T protein:vir:78 7 MTRFIKGWLNGKPNGSE--------PELIPK----YLP----LVPDNQKEWSKDSYLTSLWAQGYVPTV-HDKLMNSGTG 69 (518) T ss_pred HHHHHHHhhcCCCCccc--------hhccHH----Hhh----hcccchhhhhhhhhhhhhcccCCCCcc-ccccccCChH Confidence 11111111111111100 000000 000 00 1111221 111111 11111111 1223333333 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +..++ -+++| .|+-..-+.|......|.+ +++++++.++.. |+-...+..++..++-.|.+++|++|+ T Consensus 70 ~~i~~-~~A~l---l~~e~~~i~v~~~~~~d~e---~~~~~l~~il~~-n~f~~~~~~~~e~a~a~G~~~~k~~~d---- 137 (518) T protein:vir:78 70 NEIVV-VAAEY---ISGKPLSIDVTGVNGSKDE---NLTKQLKEALRI-DNFDSKSVKIVELAGGSGVSAVKINIL---- 137 (518) T ss_pred HHHHH-HHHHh---hcCCCceEEecCccccCcH---HHHHHHHHHHHh-ccHHHHHHHHHHHhhccCceEEEEEEE---- Confidence 33333 23333 3555555777654444433 456677765433 555566889999999999999999883 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEE Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVT 236 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~ 236 (705) .++++|+ T Consensus 138 -------------------------------------------------------------------------~~~~~i~ 144 (518) T protein:vir:78 138 -------------------------------------------------------------------------NGRPSIS 144 (518) T ss_pred -------------------------------------------------------------------------CCeeEEE Confidence 1245677 Q ss_pred EechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEE Q lcl|NC_021540. 237 ICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEY 316 (705) Q Consensus 237 ~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~ 316 (705) .|++..|++..+- .++..|-|+-.... .+ +..+|.-+++ +..............-...++. T Consensus 145 ~v~ad~~~P~~~~-g~~~~~~f~~~~~~---~~--k~~~y~~lE~-------------he~~~~~~~~~~~~~~~I~n~l 205 (518) T protein:vir:78 145 VHSSSQFWIDFKN-NEPFRFNFFEEIPT---SN--KADIYYLVES-------------REIKQWDKEGKKLSGGFVTYSV 205 (518) T ss_pred EEcCCeeEEEeec-CcEEEEEEEEEeec---CC--cceeEEEEEe-------------eccccccceeecccceeEEEEE Confidence 8888888864332 23333333211111 00 0001111100 0000000000000000111222 Q ss_pred EEEeeecCCCe-------eEEEEEEE-ECCEEEecccCCC-CCCCcceEEeeeee-----ecCcccCCchHHHhhHHHHH Q lcl|NC_021540. 317 WGYWDIDGSGV-------TTPIVASW-VDDVMIRLEKNPY-PDGKLPFVVVPYLP-----VKDSVYGEADAELLSDNQKL 382 (705) Q Consensus 317 w~k~~~~~dg~-------~~~~~~~~-~g~~iL~~~~~p~-~~~~~Pfv~~~~~~-----~~~~~~g~g~~~~~~d~Q~~ 382 (705) |- -+ .++++ .+....++ ..+.. +..-+ .....||+++...+ ..++.+|.|++..++++++. T Consensus 206 y~-~~-~~~~v~~~~~~~~~~l~~~~~~~~~~---e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~ 280 (518) T protein:vir:78 206 IK-ID-GDKTTPISAERLPEQITSYLHTNDIQ---LNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFA 280 (518) T ss_pred ee-ec-CcccccccccccccccccccccccCc---cceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHH Confidence 21 00 01110 00000000 00100 00001 11245777664443 35788899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCcEEeeccccCchhh-------hhhcCC-ccee-ecC----CcccccccccccCccchHHHHH Q lcl|NC_021540. 383 IGALTRGMIDAMARSANGQRGMSKNLLDPVNE-------RKFKMG-EDYK-YNP----GTNPVTDIIEHKYPELPASSYN 449 (705) Q Consensus 383 iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-------~~~~pg-~~i~-~~~----~~~~~~~i~~~~~~~i~~~~~~ 449 (705) +|...+++.+.+.+ +.+++.++++++..+.. ..+..+ ..+. ++. +......+...++.--...+.. T Consensus 281 lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~ 359 (518) T protein:vir:78 281 VDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRE 359 (518) T ss_pred HHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHH Confidence 99999999999966 88899998887642211 112211 1121 221 1111122444443322346667 Q ss_pred HHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEe Q lcl|NC_021540. 450 MLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRIT 529 (705) Q Consensus 450 ~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~ 529 (705) .++.+...+....|++....|.++. ..||+++....+..-.....+...+..+++++...++.+..-++...... T Consensus 360 ~~~~~l~~~~~~~G~s~~tfg~~~~--~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~--- 434 (518) T protein:vir:78 360 TMEYFAQKAVSKSGYNPATFNLGNR--EVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKA--- 434 (518) T ss_pred HHHHHHHHHHHhhCCChhhcCcccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccc--- Confidence 7788888888889999998886532 36788887766666666777788888888888888777765543221100 Q ss_pred cCceeeechhhccccee--EEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcc---- Q lcl|NC_021540. 530 DEEFVQINRDNLVGSFD--IKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKY---- 603 (705) Q Consensus 530 ~~~~v~i~~~~~~~~~d--v~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~---- 603 (705) .....+. +..+++...-.....+....+.++ |..-....+..+.....+..-...+ ++++.. T Consensus 435 ----------~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~a-GimS~e~~i~~~~~~~~deea~~e~-~ri~~E~~~~ 502 (518) T protein:vir:78 435 ----------IMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSA-LAMSVEEKVKLIHPKWEDEEIQAEV-KRIYLENAIG 502 (518) T ss_pred ----------cCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhc-CCCCHHHHHHHhCCCCCHHHHHHHH-HHHHHHhccc Confidence 0011222 222333222222222222222222 1100000000000000000000000 000000 Q ss_pred -cccchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 604 -NPEPSPQAQLEIQIKQLEAQELQMRIA 630 (705) Q Consensus 604 -~~q~~~~~q~~~q~~q~~~q~~q~e~~ 630 (705) .++|++..=+...+ . T Consensus 503 ~~~~p~~~~g~~~~~------------g 518 (518) T protein:vir:78 503 EVPDPEAIGGMETKG------------G 518 (518) T ss_pred CCCCCccccCCCCCC------------C Confidence 00010000000000 0 No 121 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.07 E-value=2.9e-09 Score=67.39 Aligned_cols=572 Identities=11% Similarity=0.021 Sum_probs=172.4 Q ss_pred CCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCC-cchHHHHHHHHHHHHH---HHHhh----------cCCc-chHHHHH Q lcl|NC_021540. 72 QPKLIRKQAEWRYSALSEPFLNDENIFSIAPKT-WQDREAARQNEAILNY---QFNNQ----------LDKV-KLIDTMV 136 (705) Q Consensus 72 v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~-~~D~~~A~~~t~~~n~---~~~~~----------~~~~-~~~~~~~ 136 (705) -+-+-++.. -.+++-|. ..... .+=...|....+|.++ +|... ..|. .+..|.| T Consensus 1 m~e~~~~~~----~~~~~~~~-------~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i 69 (706) T protein:vir:10 1 MAESRQKQH----ERVMLRFD-------RAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKV 69 (706) T ss_pred CCcchHHHH----HHHHHHHH-------HHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecch Confidence 111122222 22222221 11111 1111222223333321 32221 1122 2334555 Q ss_pred HHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceec Q lcl|NC_021540. 137 RTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAI 216 (705) Q Consensus 137 ~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 216 (705) +-.+..-.| ..+.+++.+++.+...+......+.+..+.......+....+++.+|.+.+.+|+||.++ T Consensus 70 ~~~v~~v~g-----------~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev 138 (706) T protein:vir:10 70 ATELNRIIS-----------EYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRL 138 (706) T ss_pred HHHHHHHhh-----------HHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEe Confidence 555555444 445566677888765544444444554444444556678999999999999999998752 Q ss_pred cCcccccceeeeccCcc---eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCc--Ccchhhhhhhhhhc Q lcl|NC_021540. 217 INGYEEQEVIKTVKNQP---EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYS--NLEYIKEDSSTSTS 291 (705) Q Consensus 217 ~~~~~~~~~~~~~~~~~---~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~--d~~~~~~~~~~~~~ 291 (705) . ......+ .++....-...+||--..-| | -..+..+.++..-..+.+ +.+.+...+.+... T Consensus 139 ~---------~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~-D----p~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~ 204 (706) T protein:vir:10 139 T---------TSFVNEYDPMDERQRIAVEPIYDPARSVWF-D----PDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPT 204 (706) T ss_pred e---------eccccccCCCCCCccceeeeeccchhceec-C----chhcccChhhcceEeeeecCCHHHHHHhcCCChh Confidence 1 1111111 11111111112222110000 0 011222333322111112 12222222221111 Q ss_pred ccccccccccccc--ccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCC--------------------- Q lcl|NC_021540. 292 SDHYSSDTSFTFS--DKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPY--------------------- 348 (705) Q Consensus 292 ~~~~~~~~~~~~~--~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~--------------------- 348 (705) .-. ...+.++. ....+.|++.|||.+....-+- .+++-.+.++........++ T Consensus 205 ~~~--~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 280 (706) T protein:vir:10 205 SLD--RVGSVSWQYDWFTPDVVYIAKYYEVRKESVDV--ISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKR 280 (706) T ss_pred hhh--hhccccccccccCCCcceecccccccceeEEE--EEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccce Confidence 000 00111111 1123567888988764211110 11111111111110000000 Q ss_pred ---------------CCCCcceEEeeeeeecCcc---cCCchHHHhhHHHHHHHHHHHHHHHHH-HhcCCCcEEeecccc Q lcl|NC_021540. 349 ---------------PDGKLPFVVVPYLPVKDSV---YGEADAELLSDNQKLIGALTRGMIDAM-ARSANGQRGMSKNLL 409 (705) Q Consensus 349 ---------------~~~~~Pfv~~~~~~~~~~~---~g~g~~~~~~d~Q~~iN~~~~~~~d~~-~~~~~~~~~~~~~av 409 (705) ...-||.-.||+.|.-+.. .|.+.+..+...=+..=...|...-.+ ++.+..+.. .-+ T Consensus 281 ~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~---~~~ 357 (706) T protein:vir:10 281 RRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQ---TPI 357 (706) T ss_pred eeEEEEeeccccccccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCc---ccc Confidence 0001111222222221111 112222222222222222222111111 110000000 000 Q ss_pred CchhhhhhcCCcceeecCC----------ccccccc-ccccCc---cchHHHHHHHHHHHHHHHHHhCcchHhcCCCccc Q lcl|NC_021540. 410 DPVNERKFKMGEDYKYNPG----------TNPVTDI-IEHKYP---ELPASSYNMLQMFTLEADALSGVKSFSQGLTGDS 475 (705) Q Consensus 410 ~~~d~~~~~pg~~i~~~~~----------~~~~~~i-~~~~~~---~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~ 475 (705) ...+.........-.-+.. ....+.+ .+.+.+ +.+.-....++++......+ ....|..... T Consensus 358 ~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i----~~vsGi~~~~ 433 (706) T protein:vir:10 358 VDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADI----QEVTGSSQAM 433 (706) T ss_pred cchhHHHHHHHHhhhcccccccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHH----HHHhCCCHHH Confidence 0000000000000000000 0001111 111111 11122222444444444433 3355665444 Q ss_pred cchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEe----- Q lcl|NC_021540. 476 LGTTTAGVQGVIGASGKRE-LGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKL----- 549 (705) Q Consensus 476 ~~~~a~~i~~l~~~~~~~~-~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v----- 549 (705) ++ ..+..++..-+..+.. ....-.|-+.++...+.+-.++..+... . .+.+..+.|...+-..++ +.+ T Consensus 434 lG-~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~--~--y~~~R~~RI~~ed~~~~~-v~in~~~~ 507 (706) T protein:vir:10 434 QQ-MPSNVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMARE--I--YGSDREVRIVHEDGTDDI-ALMNAAVL 507 (706) T ss_pred cC-CccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--H--cCCCcEEEEecCCCCccc-eeecccee Confidence 43 2223333221111111 1111112223333334444444433221 0 123334455443322111 111 Q ss_pred -----------eccchhH----------HHHHHHHHHHHHHHHhhhchhHHH-HHH-HHHHHhhhccchhhhhhhccccc Q lcl|NC_021540. 550 -----------SISNAET----------DAIKAQELSFMLQTMGQSLPFDMT-KLI-LGEIAKLRGMPDLSKMISKYNPE 606 (705) Q Consensus 550 -----------~~~~~~~----------~~~~~q~~~~llq~~~~~~~~~~~-~~i-l~~l~e~~~~~~~~~~~~~~~~q 606 (705) ++..+.. .+.+.+.+..|++.+ +.+++... ... +.-+.++..++...+..+....+ T Consensus 508 d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~-~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~ 586 (706) T protein:vir:10 508 DNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLL-QGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQ 586 (706) T ss_pred ccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHH-HhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHh Confidence 1111111 134444555555544 44444333 332 33456667777666565555433 Q ss_pred chhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 607 PSPQAQLEIQIK---QLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQ 683 (705) Q Consensus 607 ~~~~~q~~~q~~---q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q 683 (705) ..+......... +..++..|++++ +++++..+++++..+++++..+.+++..+.+.....+ .++...++ T Consensus 587 ~~~q~~~~~~~~~eq~~~~q~qq~q~~--q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a------~~qa~~~~ 658 (706) T protein:vir:10 587 LLTQGIVKPRNQQEQAIVQQAQQAQAT--QPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTA------QQDAMESQ 658 (706) T ss_pred hcccCCccccchhHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHH Confidence 222222111111 111111122222 2222222222222222222222222222222111111 11111111 Q ss_pred HHHHHH----HHHHHHHHHHHhh-----ccC Q lcl|NC_021540. 684 AKGNTQ----RDIVKTFLDTNKQ-----GNQ 705 (705) Q Consensus 684 ~~~~~~----~~~~k~~~~~~~q-----~~~ 705 (705) +..-.. .++..++..+..| +.. T Consensus 659 ~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~ 689 (706) T protein:vir:10 659 ANTVYKLAQARNIDDKAVMETLRLLKEVAAS 689 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111111 1111111111111 111 No 122 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.99 E-value=6.9e-09 Score=65.33 Aligned_cols=434 Identities=12% Similarity=0.028 Sum_probs=181.7 Q ss_pred CCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCC--CCCC--CCCCC---cCCCHHHHHHHHHHHHHHHH Q lcl|NC_021540. 17 QEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAY--KPKQ--QVGRS---SVQPKLIRKQAEWRYSALSE 89 (705) Q Consensus 17 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~grs---~~v~~~v~~~~e~~~~~l~~ 89 (705) +++..-..+++.|.+. +..+..+.++..+||.|.... .++. .+.|+ ++|.+-....|+.....| T Consensus 1 ~~~~t~~~~~~~l~~~-------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-- 71 (456) T protein:vir:79 1 MTASTPAEWLPVLTKR-------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) T ss_pred CCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhh-- Confidence 3322222345544443 333344456677999986421 1111 12332 245666666666665544 Q ss_pred hhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhccccccccc Q lcl|NC_021540. 90 PFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVE 169 (705) Q Consensus 90 ~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~ 169 (705) + ++.+. .....|.+..+....++ ..|+--......++++++.|.+.+.+|= T Consensus 72 --~-~~g~~---~~~~~d~~~~~~~~~~~-----~~n~~d~~~~~~~~~a~~~G~a~~~~~~------------------ 122 (456) T protein:vir:79 72 --I-PNGIT---VGGSADSDLALRARRIW-----RDNRMDSVCKQWVKYGLDFGESYLTCWR------------------ 122 (456) T ss_pred --c-cCCee---cCCCCCccHHHHHHHHH-----HhcChhHHHHHHHHHHhhcCeeEEEEee------------------ Confidence 1 22221 12233443333333332 2344334456788899999988665420 Q ss_pred CCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhhee--eCC Q lcl|NC_021540. 170 ATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVT--IDP 247 (705) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~--~Dp 247 (705) ...+.+++..++|.+++ ||| T Consensus 123 ----------------------------------------------------------~edg~~~i~~~~p~~~~~i~d~ 144 (456) T protein:vir:79 123 ----------------------------------------------------------RDDGTATITADSPETMVVSVDP 144 (456) T ss_pred ----------------------------------------------------------CCCCceEEEEeccceeEEEEcC Confidence 01244677888888854 444 Q ss_pred CccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCe Q lcl|NC_021540. 248 TCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGV 327 (705) Q Consensus 248 ~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~ 327 (705) .... ...+.+ +++-+.++ ...+. .. +.. ...+.++.+|.... + T Consensus 145 ~~~~---~~~~~~-~~~~~~d~--------~~~~~----------~~--~~~--------~~~~~~~~~~~~~~---~-- 187 (456) T protein:vir:79 145 LQPW---RIRSAM-RWWRDLDA--------ESDFA----------IV--WSG--------DGWQKFARPCFVQS---S-- 187 (456) T ss_pred CCCC---ceEEEE-EEEEecCC--------ceeEE----------EE--EcC--------CceEEEEEEEEeec---c-- Confidence 3221 112222 22211100 00000 00 000 01111222221110 0 Q ss_pred eEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecc Q lcl|NC_021540. 328 TTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKN 407 (705) Q Consensus 328 ~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~ 407 (705) .........++........|...+.+|++++. +..|.|.+..++++++.+|..++.+...+...+.|+..+. | T Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~-G 260 (456) T protein:vir:79 188 SRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-S 260 (456) T ss_pred ccceeeeccCCceeecccccCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHh-c Confidence 01111222233322233334445667776652 3567899999999999999999998887777666655442 2 Q ss_pred c-------------cCchhhhhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCcc Q lcl|NC_021540. 408 L-------------LDPVNERKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGD 474 (705) Q Consensus 408 a-------------v~~~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~ 474 (705) . ++..+.+...+|.++..+++... ...+..++ ..+...+..+...+-..||+++...|...+ T Consensus 261 ~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~----~q~~~~~~-~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~ 335 (456) T protein:vir:79 261 SEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDI----WESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA 335 (456) T ss_pred CCcccccccccccccchhhhhhhhccccccCCCCcce----eeecccCh-HHHHHHHHHHHHHHHhhcCCChhHhccccc Confidence 1 11122233456666655554322 22222222 335556677777777888999998885433 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccch Q lcl|NC_021540. 475 SLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNA 554 (705) Q Consensus 475 ~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~ 554 (705) .+|+. ++......-........+.|..+++++++.++ .+..... +. ...+.-..... T Consensus 336 N~Sg~--Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~----~~~g~~~--------~~---------~i~v~w~~~~~ 392 (456) T protein:vir:79 336 NQSAE--GAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL----QIEGESV--------ED---------TVDVSFESPDR 392 (456) T ss_pred CcHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhcCCCc--------cc---------cceEEeCCCCC Confidence 23443 34443333334444455556566665555543 4332211 00 11222222211 Q ss_pred hHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 555 ETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQA 634 (705) Q Consensus 555 ~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa 634 (705) ....+..+.+..+.++ | .+.. .. .....|+.. +..++ ++.++...+.. +.+ T Consensus 393 ~s~~~~ada~~kl~~~-G--~~~~---~~---~~~~lg~~~------------~~i~~-------~e~~r~~~e~~-~~~ 443 (456) T protein:vir:79 393 VTLGEKYSAASLAKAA-G--ESWA---SI---RRNILNYNA------------DQIKQ-------DDLDRAREQIT-LFA 443 (456) T ss_pred cCHHHHHHHHHHHHhc-C--CChH---HH---HHhcCCCCH------------HHHHH-------HHHHHHHHHHH-HHh Confidence 1112222222222222 1 1110 00 011112110 00000 01111000000 000 Q ss_pred HHHHHHHHHHHHHHH Q lcl|NC_021540. 635 EIQLMPYEAQAEAAK 649 (705) Q Consensus 635 ~~q~~~~~~q~e~a~ 649 (705) ....+..+...++ T Consensus 444 --~~~~~~~~~~~~~ 456 (456) T protein:vir:79 444 --GNPVQRPQEDGSR 456 (456) T ss_pred --hhHhhcCCCCCCC Confidence 0000000000000 No 123 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=98.94 E-value=1.2e-08 Score=63.99 Aligned_cols=601 Identities=10% Similarity=0.013 Sum_probs=202.8 Q ss_pred hhhhhcccccccC-------CCC-CCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHH Q lcl|NC_021540. 5 NEEFLEDTVPSLQ-------EDW-KNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLI 76 (705) Q Consensus 5 ~~~~~~~~~~~~~-------~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v 76 (705) +-+--..+++ +| ..+ +|++ ....+.............-.++-+.+|..+-.. ..++|+ T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~---~~~~r~------- 66 (776) T protein:vir:93 1 MFDLNDKDST-QLVPARTDEGELSPGED---AAQREKPANPLDSEQAVELHSRLLSYYRQELSR---QQDNRA------- 66 (776) T ss_pred CCCccccccc-cccccccccccCCCCCc---ccchhcccCCCCCHHHHHHHHHHHHHHHHHHhh---chHHHH------- Confidence 2222222222 11 111 2221 122222211111111111122333434322111 223332 Q ss_pred HHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhh Q lcl|NC_021540. 77 RKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEET 156 (705) Q Consensus 77 ~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~ 156 (705) +..+- .-|..|+.| .++..+. ....+.-.+++|.|+-.+.+-.| T Consensus 67 -~a~~d------~~fy~G~Qw--------~~~~~~~----------l~~~g~p~~~~N~i~~~i~~v~g----------- 110 (776) T protein:vir:93 67 -EMAVD------EDYYDNIQW--------SQDEIDE----------LKERGQAPTVYNVISQSVNWIIG----------- 110 (776) T ss_pred -HHHHH------HHHhCCCCC--------CHHHHHH----------HHhcCCceEEecchHHHHHHHHH----------- Confidence 11111 134444433 3333221 12234445667777777766555 Q ss_pred hhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEE Q lcl|NC_021540. 157 KVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVT 236 (705) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~ 236 (705) ....+++.+.+.+.+... ......+..+.......+.+..+++.+|.+.+.+|+||.++.+ + T Consensus 111 ~~~~nr~~~~~~p~~~~d-~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~-----------------d 172 (776) T protein:vir:93 111 SEKRGRSDFKVLPRRKDG-GKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQV-----------------Q 172 (776) T ss_pred HHHhCCcceEEecCChhH-HHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEe-----------------e Confidence 233455567777765543 3344444444445566778899999999999999999875321 1 Q ss_pred EechhheeeCCCcc--CChhhCCeEE--EEEeccHHHHHHhcCCcC--cchhhhhhhh---hhccccccccccccccccc Q lcl|NC_021540. 237 ICDYHNVTIDPTCN--GNLDEAKFVI--YSFESSRSDLEKYGIYSN--LEYIKEDSST---STSSDHYSSDTSFTFSDKA 307 (705) Q Consensus 237 ~V~~~~~~~Dp~a~--~d~~da~~~~--~~~~~t~~el~~~g~~~d--~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 307 (705) . ++.=+|-.+ .++.+ +++ ..+..+.++..-.+.... .+.+...+.+ .......+.....++.+.. T Consensus 173 ~----~~~~~~~~~~~~~p~~--i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 246 (776) T protein:vir:93 173 D----ENDGEPIYAGAESWRN--ILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDID 246 (776) T ss_pred c----cCCCCceEeeccChhh--eeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccc Confidence 0 000111111 01111 111 122234445433222222 2222222211 1111111111111111110 Q ss_pred cC----eEEEEEEEEEeee-----cCCC--eeEEEEEEEEC-----------CE-------------------------- Q lcl|NC_021540. 308 RK----KIVVYEYWGYWDI-----DGSG--VTTPIVASWVD-----------DV-------------------------- 339 (705) Q Consensus 308 ~~----~v~v~E~w~k~~~-----~~dg--~~~~~~~~~~g-----------~~-------------------------- 339 (705) .. ...+...|..... ..+. +.++|+-.++. .. T Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~ 326 (776) T protein:vir:93 247 GDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPM 326 (776) T ss_pred ccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheee Confidence 00 0111112211110 0011 12222211110 00 Q ss_pred -----EEecccCCCCCCC--cceEEeeeeeecCccc-CCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc Q lcl|NC_021540. 340 -----MIRLEKNPYPDGK--LPFVVVPYLPVKDSVY-GEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDP 411 (705) Q Consensus 340 -----iL~~~~~p~~~~~--~Pfv~~~~~~~~~~~~-g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~ 411 (705) ++..+...+.++. ||+-.|++.|+.+... ..|+...+...=+..-...|...-.+ ..++..+.+-. T Consensus 327 ~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~------~~~l~~~~~~~ 400 (776) T protein:vir:93 327 MRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKA------LYILSTNKVLM 400 (776) T ss_pred eeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHH------HHhhcCCceee Confidence 1111222222232 3334455555554433 34555555555555555555432222 22333332211 Q ss_pred hh-hhhhcCCccee--ecCCcccc---cccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHH Q lcl|NC_021540. 412 VN-ERKFKMGEDYK--YNPGTNPV---TDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQG 485 (705) Q Consensus 412 ~d-~~~~~pg~~i~--~~~~~~~~---~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~ 485 (705) .+ -+.. ....+. .++++... .++....+...++-...+++++......+..++ |......+....+.+. T Consensus 401 ~~gav~~-~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~t----Gi~~~~~G~~~n~~Sg 475 (776) T protein:vir:93 401 EEGAVDD-IDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVG----GVTDEMLGRTTNAVSG 475 (776) T ss_pred ccccccc-hHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhh----CcChHHhCCCcchhhH Confidence 11 0100 011111 12322211 122233333444455556677776666666554 4433222222222333 Q ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeE-ecCceeeechhhcccceeEEeecc---------- Q lcl|NC_021540. 486 VIGA--SGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRI-TDEEFVQINRDNLVGSFDIKLSIS---------- 552 (705) Q Consensus 486 l~~~--~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri-~~~~~v~i~~~~~~~~~dv~v~~~---------- 552 (705) ..-+ .... ......+.+.+....+.+..++..... .. +.+..+.|...+-..++ |.|+.+ T Consensus 476 ~ai~~~~~~~-~~~~~~~~dn~~~~~~~~~~~~l~li~-----~~~~~~r~~ri~~~~~~~~~-v~in~~~~~nd~~~~~ 548 (776) T protein:vir:93 476 VAIQARQEQG-SVATNKLFDNLRLAFQQHGEKELSLIE-----QYMTEEKQFRITNSRGNPEY-VTVNDGLPENDITRTK 548 (776) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCcceEEEEeecCCCcce-EEecccchhhhhccce Confidence 2111 1111 111222223334444444444443321 11 12234444333222222 222211 Q ss_pred ----chhHH---HHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 553 ----NAETD---AIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQEL 625 (705) Q Consensus 553 ----~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~ 625 (705) ..... ..+.+++..|++.++ .+++.....++..+.+..+++...+..+.......+..+.+.+....+.+.. T Consensus 549 ~dv~v~~~~~~~s~r~~~~~~l~ql~~-~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~q 627 (776) T protein:vir:93 549 ADFIIDEAEWRATMRQAAVAELMEVIG-KMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIARE 627 (776) T ss_pred eeEEEeecccchhHHHHHHHHHHHHHh-hcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHH Confidence 11111 234445555555554 4566667777777888888877655554443222111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021540. 626 QMRIAKLQAEIQLMPYEAQAEAAKAR--KANTEADLNTLDFVEQETGVKQERELE-LMQAQAKGNTQRDIVKTFLDTNKQ 702 (705) Q Consensus 626 q~e~~k~qa~~q~~~~~~q~e~a~a~--~~~~ea~~~~~~~~~q~~~~kq~~e~e-~~~~q~~~~~~~~~~k~~~~~~~q 702 (705) +.+++..+ ++.++++++.+.++++ ...+++...+.+...... ++..+ ...++.+.+....+.... ..... T Consensus 628 q~q~~~~q--~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~----~a~~~~~~a~q~a~qa~~~~~~~~-~~a~~ 700 (776) T protein:vir:93 628 QAQQQQQQ--YNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISR----MAIREGVGAVKDATDAATAIAFMP-ELAGL 700 (776) T ss_pred HHhhHHHH--HHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhh----cchhhhhhhhhhhhhhhhhhhhhh-hhhhh Confidence 11111111 1111222222211221 111111111111110000 00111 111111111111110000 00011 Q ss_pred ccC Q lcl|NC_021540. 703 GNQ 705 (705) Q Consensus 703 ~~~ 705 (705) +.+ T Consensus 701 a~~ 703 (776) T protein:vir:93 701 SDG 703 (776) T ss_pred hhh Confidence 111 No 124 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.86 E-value=1e-08 Score=64.38 Aligned_cols=416 Identities=10% Similarity=-0.043 Sum_probs=165.2 Q ss_pred ccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHH Q lcl|NC_021540. 55 NVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDT 134 (705) Q Consensus 55 ~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~ 134 (705) +-.+...+..-..+-++|.+..+..|+.+.-.|. | .+.+-.|.+.- ..+..+| ..|+--..... T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~--~---------~gf~~~d~~~~----~~~~~i~-~~N~~d~~~~~ 64 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLL--A---------LGVTGPDGEPD----TRASRWW-QANRLDSRQKL 64 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhc--c---------CceecCCCchH----HHHHHHH-HhcChhHHHHH Confidence 1111111111112333566677777775544331 1 11122222211 1222233 23444445567 Q ss_pred HHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccce Q lcl|NC_021540. 135 MVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPIL 214 (705) Q Consensus 135 ~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 214 (705) +++++++.|.|.+.+|.... T Consensus 65 ~~~~a~i~G~ay~~v~~~~~------------------------------------------------------------ 84 (434) T protein:vir:98 65 VWRMAMAQSAGYMLVGAHPT------------------------------------------------------------ 84 (434) T ss_pred HHHHHhhcCceEEEEecCCC------------------------------------------------------------ Confidence 88999999999887753100 Q ss_pred eccCcccccceeeeccCcceEEEechhhe--eeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhcc Q lcl|NC_021540. 215 AIINGYEEQEVIKTVKNQPEVTICDYHNV--TIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSS 292 (705) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~i~~V~~~~~--~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~ 292 (705) .........|.|+.++|.++ +|||... +-.+.+++...+.. + T Consensus 85 ---------~~~~~~~~~~~I~~~~p~~~~~i~D~~~~----~~~~ai~~~~~~~~-----~------------------ 128 (434) T protein:vir:98 85 ---------RTEDNGRPSPLITMEHPSECIVEYDPETG----EPLVGLKVWHNDID-----G------------------ 128 (434) T ss_pred ---------cccccCCceeEEEEeccceeEEEEeCCCC----ceEEEEEEEEeccC-----C------------------ Confidence 00001134567888999995 4555422 22233332211100 0 Q ss_pred ccccccccccccccccCeEEEE--EEEEEeeecCCCeeEEEEE-EEECCEEEecccCCCCCCCcceEEeeeeeecCcccC Q lcl|NC_021540. 293 DHYSSDTSFTFSDKARKKIVVY--EYWGYWDIDGSGVTTPIVA-SWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDSVYG 369 (705) Q Consensus 293 ~~~~~~~~~~~~~~~~~~v~v~--E~w~k~~~~~dg~~~~~~~-~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g 369 (705) .. ...+.++ +++++......+...+.-. +......-...++| .|.+|+++|+..+..+. +| T Consensus 129 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~--~g~vPvv~f~N~~~~~~-~g 192 (434) T protein:vir:98 129 FG-------------YARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHD--LGGMQLVEFARMPDLGE-DP 192 (434) T ss_pred ce-------------EEEEEEeCcEEEEEEeeccccccccccccceecccccccccCC--CCccceEEeccCCCcCc-CC Confidence 00 0011110 0111100000000000000 00111111122333 37889999988777655 69 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc-----------hhhhhhcCCcceeecCCccccccccc Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDP-----------VNERKFKMGEDYKYNPGTNPVTDIIE 437 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~-----------~d~~~~~pg~~i~~~~~~~~~~~i~~ 437 (705) .|.++.++++++.+|..++.+.......+.|+..+. |+ ... .+.....++.++... +.. ..+.. T Consensus 193 ~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~--~~~~q 268 (434) T protein:vir:98 193 EPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK-GHKFAKRTDPATGMTVVDQPFVPSPSAVWASE-GEN--TQFGQ 268 (434) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCcccccccccccchhhhhhhccccccccCC-CCC--ceEEE Confidence 999999999999999999999999998888876553 21 110 011112334433332 111 11222 Q ss_pred ccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 438 HKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNS 517 (705) Q Consensus 438 ~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~ 517 (705) .+...+ ..+...+......+-.+|++++...|...+. .++.++......-........+.|..++++++++++.+ T Consensus 269 ~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n--~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~-- 343 (434) T protein:vir:98 269 LDATDL-SGFLKEHASDVRDMLTISQTPTYLYATDLVN--ISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQ-- 343 (434) T ss_pred ecCcch-HHHHHHHHHHHHHHhcccCCCHHHhccccCC--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 221111 2233444555555556678888888843222 34444544334444444555666666776666655433 Q ss_pred HhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhh Q lcl|NC_021540. 518 VWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLS 597 (705) Q Consensus 518 q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~ 597 (705) .... .++. +..+.-............+.+..|.+.. .+.. .+.+ ..++.. T Consensus 344 --~g~~-------~~~~---------~~~v~w~~~~~~s~~~~ada~~kl~~~g---~~~e----~~~~---~lg~~~-- 393 (434) T protein:vir:98 344 --AGVP-------EDYT---------EAEVRWANPAHVTMAVKADAATKLKSIG---YPLD----VIAE---ELDESP-- 393 (434) T ss_pred --cCCC-------hhhe---------eeeEEecCCCCCCHHHHHHHHHHHHhcC---CcHH----HHHH---hCCCCH-- Confidence 1110 0000 1222222222222222333333333321 1111 1111 111110 Q ss_pred hhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_021540. 598 KMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEI-QLMPYEA 643 (705) Q Consensus 598 ~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~-q~~~~~~ 643 (705) ..++... .+...+...+.....+.........- ......- T Consensus 394 ~e~~r~~------~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 394 ARVRRIV------AGAASQALLAASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred HHHHHHH------HHHHHHHHHHHhhhccCCCCCCCCCCcccCCCCC Confidence 0000000 00000000000000000000000000 0000000 No 125 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=98.80 E-value=4.3e-08 Score=60.98 Aligned_cols=579 Identities=12% Similarity=0.063 Sum_probs=186.2 Q ss_pred hccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHH-HH-------Hhh Q lcl|NC_021540. 54 LNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNY-QF-------NNQ 125 (705) Q Consensus 54 ~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~-~~-------~~~ 125 (705) ++-+ ... ....+-+.-+ +......+..+.........| ...|.-..+|++- +| ... T Consensus 1 ~~~~-~~~-~~~~~~~~~~----~~~~~~~l~~~~~~~~~~~~~----------r~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:10 1 MKNE-INT-TAMKNDHGST----PRFSQRQLLSLCSDIDSQPLW----------RDAANKACAYYDGDQLAPEVIQVLKD 64 (714) T ss_pred CCcC-cCc-ccCCCcchhh----hhhhHHHHHHHHHHHhhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 2211 000 0111111111 111111111111111111111 1111111122210 11 111 Q ss_pred cCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHH Q lcl|NC_021540. 126 LDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVR 204 (705) Q Consensus 126 ~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (705) .+.-.+.+|.|+-.+..-.| ..+.+++.+.+.+.... ........+..+.......+....+++.+|. T Consensus 65 ~g~p~~~~N~i~~~v~~v~g-----------~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~ 133 (714) T protein:vir:10 65 RGQPMTIHNLIAPTVDGVLG-----------MEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYA 133 (714) T ss_pred cCCCcEEeccHHHHHHHHHH-----------HHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHH Confidence 22233344555555555444 45566777888876543 3333444555555555556678899999999 Q ss_pred hhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCc--Ccch Q lcl|NC_021540. 205 YSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYS--NLEY 281 (705) Q Consensus 205 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~--d~~~ 281 (705) +.+.+|.||.++ .++++.|--++.++. ++.+--|=-..+..+.++..-....+ +++. T Consensus 134 ~~~~~G~G~~~~--------------------~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~ 193 (714) T protein:vir:10 134 EQIKAGLSWVEV--------------------RRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDE 193 (714) T ss_pred HhhhcccceEEe--------------------eeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHH Confidence 999999998631 222222222222211 12111110012223334433221111 1222 Q ss_pred hhhhhhh---hhcccccccccc---ccccccccCeEEEEEEEEEeeecCCC--------e--eEEE-EE-----EE---E Q lcl|NC_021540. 282 IKEDSST---STSSDHYSSDTS---FTFSDKARKKIVVYEYWGYWDIDGSG--------V--TTPI-VA-----SW---V 336 (705) Q Consensus 282 ~~~~~~~---~~~~~~~~~~~~---~~~~~~~~~~v~v~E~w~k~~~~~dg--------~--~~~~-~~-----~~---~ 336 (705) +...+.. .......++.+. .........-+.-++.+..++...+. + .+++ +. ++ . T Consensus 194 ~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~ 273 (714) T protein:vir:10 194 AKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN 273 (714) T ss_pred HHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCC Confidence 2222111 111111111111 00000111111222222222222111 1 1211 10 00 1 Q ss_pred CCEEEecccCC-------------------------------CCCC--CcceEEeeeeeecCcccC-CchHHHhhHHHHH Q lcl|NC_021540. 337 DDVMIRLEKNP-------------------------------YPDG--KLPFVVVPYLPVKDSVYG-EADAELLSDNQKL 382 (705) Q Consensus 337 g~~iL~~~~~p-------------------------------~~~~--~~Pfv~~~~~~~~~~~~g-~g~~~~~~d~Q~~ 382 (705) |+.+.....++ ...+ -||+..|++.|+.+.... .|.... T Consensus 274 g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G------- 346 (714) T protein:vir:10 274 GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG------- 346 (714) T ss_pred CCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccce------- Confidence 22221111111 1112 134444555554333321 333332 Q ss_pred HHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCccccccccc-------------ccCccch Q lcl|NC_021540. 383 IGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIE-------------HKYPELP 444 (705) Q Consensus 383 iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~-------------~~~~~i~ 444 (705) +.|.++++-...|+-...+.. +++..- .-..+|++.... .++.+...+.+ ..+.+.+ T Consensus 347 ---~vr~~~d~Qr~~N~~~s~~~~-~l~~~~-~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~ 421 (714) T protein:vir:10 347 ---LISRAIPAQDEVNFRRIKLTW-LLQAKR-VIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDF 421 (714) T ss_pred ---ehhhhhhHHHHHHHHHHHHHH-HHhCCc-eeeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCC Confidence 333333332222111110000 111100 011123332210 11122212222 1122223 Q ss_pred HHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021540. 445 ASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDE 523 (705) Q Consensus 445 ~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~ 523 (705) +-....++++......+-- ..|.....++....+.++..-++.+... .....+-+.++...+.+..++..+... T Consensus 422 ~~~~~~~~llq~~~~~i~~----~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~- 496 (714) T protein:vir:10 422 QVASQQFQVMQESEKLIQD----TMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD- 496 (714) T ss_pred CCcHHHHHHHHHHHHHHHH----hhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 3344455666666655543 4565543333222223432211111111 111222233444444555555444221 Q ss_pred eeEeEecCceeeechh-hccc-ceeEEe------------------eccchhH---HHHHHHHHHHHHHHHhhhchhHHH Q lcl|NC_021540. 524 EVIRITDEEFVQINRD-NLVG-SFDIKL------------------SISNAET---DAIKAQELSFMLQTMGQSLPFDMT 580 (705) Q Consensus 524 ~~iri~~~~~v~i~~~-~~~~-~~dv~v------------------~~~~~~~---~~~~~q~~~~llq~~~~~~~~~~~ 580 (705) -.+.+..+.|... +-.+ ..-+.+ ++...+. .+.+.+.+..|++.+. .+|+... T Consensus 497 ---~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~-~~~p~~~ 572 (714) T protein:vir:10 497 ---DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQ-GLPPQVQ 572 (714) T ss_pred ---HcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHh-hcCchhh Confidence 0122333444321 1111 111122 1111222 2445555566666654 6677777 Q ss_pred HHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_021540. 581 KLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQI-KQLEAQELQMRIAKLQAEIQLMPYEAQ--AEAAKARKANTEA 657 (705) Q Consensus 581 ~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~-~q~~~q~~q~e~~k~qa~~q~~~~~~q--~e~a~a~~~~~ea 657 (705) ..++..++++..++...+..+................ ++.+.+..+..+++.+++++..+.+++ ..++++.+.+.++ T Consensus 573 ~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa 652 (714) T protein:vir:10 573 AVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAA 652 (714) T ss_pred hhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777788899999888777766543322221111111 111111111112222222222222222 2222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H---HHHHHHHH-HhhccC Q lcl|NC_021540. 658 DLNTLDFVEQETGVKQERELELMQAQAKGNTQR----D---IVKTFLDT-NKQGNQ 705 (705) Q Consensus 658 ~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~----~---~~k~~~~~-~~q~~~ 705 (705) .....+...+.. +++.++. ..+..++++ + -++..... .++..+ T Consensus 653 ~~~~~~a~~~~~----~~~~q~~-~~~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q 703 (714) T protein:vir:10 653 QRDNASAQREVA----LTQGQRY-VDALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHH Confidence 111111111000 0111110 011111111 0 01111111 111111 No 126 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.64 E-value=1.6e-07 Score=57.85 Aligned_cols=576 Identities=12% Similarity=0.094 Sum_probs=150.5 Q ss_pred CCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHH----HHHhhcC-CcchHH Q lcl|NC_021540. 59 AYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNY----QFNNQLD-KVKLID 133 (705) Q Consensus 59 ~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~----~~~~~~~-~~~~~~ 133 (705) -+ +.+-.|+.....+...++..+-... ++-|...+.+....++| .+....+ .++++. T Consensus 1 ~~---k~~~~~~~~~~~~~~~~~~~~~~a~---------------~~~~~~~~~~~~~~~~~y~g~~~~~~~~~~s~~~~ 62 (705) T protein:vir:88 1 MA---KRRKIKPMDDEQVLRHLDQLVNDAL---------------DFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVS 62 (705) T ss_pred CC---cccccccCCHHHHHHHHHHHHHHHH---------------hhhhhHHHHHHHHHHHHHhCCCCCcccCCCCcccc Confidence 11 1112233333333333322211110 11111111111111111 1111111 222211 Q ss_pred -------HHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 134 -------TMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 134 -------~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) +|+...|+.-. ..+...+++.+.+.+...........+..-..........+..++.+. T Consensus 63 ~~v~~~v~~~~~~l~~~~--------------~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~da 128 (705) T protein:vir:88 63 RDVQETVDWIMPSLMKVF--------------TSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDT 128 (705) T ss_pred HHHHHHHHHHHHHHHHhh--------------cCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHH Confidence 34444333210 012223445666666555555444444332233344557788899999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC----ChhhCCeE-----------EEEEeccHHHHH Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG----NLDEAKFV-----------IYSFESSRSDLE 271 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~----d~~da~~~-----------~~~~~~t~~el~ 271 (705) +.+|.|+..+++............+ +.-+..-.++.||.+.. +..+..|- ++...++..++ T Consensus 129 l~~g~gi~kv~we~~~~~~~e~~~~---~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~- 204 (705) T protein:vir:88 129 LMMKTGVVKVYVEEVLKPTFERFSG---LSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENF- 204 (705) T ss_pred hhcCCeEEEeccccccchhhhhhcc---CChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHc- Confidence 9999999987653222111111111 11123344555554310 11111110 11111222222 Q ss_pred HhcCCcCc------chh-------hhhhh-----hhhccc-ccc-cccc--------ccccccccCeEEEEEEEEEeeec Q lcl|NC_021540. 272 KYGIYSNL------EYI-------KEDSS-----TSTSSD-HYS-SDTS--------FTFSDKARKKIVVYEYWGYWDID 323 (705) Q Consensus 272 ~~g~~~d~------~~~-------~~~~~-----~~~~~~-~~~-~~~~--------~~~~~~~~~~v~v~E~w~k~~~~ 323 (705) +++.+. .++ ..+.. ...... ..+ .+.. .+..+.+. ...+.+.|..... T Consensus 205 --~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~-~~~~~~~~~~~~~- 280 (705) T protein:vir:88 205 --LVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTG-QLQYNSGDDAEAN- 280 (705) T ss_pred --eecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhcccccccccc-ccccccccccCCc- Confidence 122110 000 00000 000000 000 0000 00000110 1112222211100 Q ss_pred CCCe--eEEEEEEE-ECCEE------EecccCCCCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHH Q lcl|NC_021540. 324 GSGV--TTPIVASW-VDDVM------IRLEKNPYPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMID 392 (705) Q Consensus 324 ~dg~--~~~~~~~~-~g~~i------L~~~~~p~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d 392 (705) ..+ .+.|.-+- -|+.+ +..+.+.... .|+ .||..++.- ...+..+...+.+.-.-+-...+.+.. T Consensus 281 -r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~--~~~~~~PF~~~~~~-p~~~~~~G~g~~~~~~d~Q~~~n~~~~ 356 (705) T protein:vir:88 281 -REVWASECYTLLDVDGDGISELRRILYVGDYIISN--EPWDCRPFADLNAY-RIAHKFHGMSVYDKIRDIQEIRSVLMR 356 (705) T ss_pred -eeEEEEEeeeEecccCCcceeeEEEEEeCcccccc--ccCCCCCEEEecce-eecCccccCChHHHHhHHHHHHHHHHH Confidence 000 01111110 11111 1111111111 121 122221111 111233334455555555555555544 Q ss_pred HHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCccccc-ccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCC Q lcl|NC_021540. 393 AMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPVT-DIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGL 471 (705) Q Consensus 393 ~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~~-~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~ 471 (705) .+.-. -.....+...++ .+.+ .+.......||....- ......+-+.++-.-.+.+++. .+...-....|. T Consensus 357 ~~~d~-~~~~~~~~~~~~-~g~v--~~~d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~----~~~~~~~~~tGi 428 (705) T protein:vir:88 357 NIMDN-IYRTNQGRSVVL-DGQV--NLEDLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLD----RLEADRGKRTGI 428 (705) T ss_pred HHHHH-HHhccCCceecc-cccc--CcccccccCCCeeEEecCCCccccccCCcCcHHHHHHHH----HHHHHHHHhhCC Confidence 43211 111111111121 1111 1222233333322111 1111122222222222333333 222233556665 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHHHHHHHHHHHhcCCceeEeEecCceeeechhhccc Q lcl|NC_021540. 472 TGDSLGTTTAGVQGVIGASGKRELGI-------LRRLANGLT-EVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVG 543 (705) Q Consensus 472 ~~~~~~~~a~~i~~l~~~~~~~~~~~-------~~n~~~~~~-~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~ 543 (705) ..-..+-.+..+.. +...+.+..+ .+.+.+-+. ...+.++.++....-.- ...+..+.|.... T Consensus 429 ~~~~~G~~~~~~~~--~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~----~~~~~~~ri~g~~--- 499 (705) T protein:vir:88 429 TDRTRGLDQNTLHS--NQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKY----QNQEEVFQLRGKW--- 499 (705) T ss_pred chHHcCCCcccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCceEEeeccch--- Confidence 43222211111111 0111111111 111111111 12233333333322110 1122344443322 Q ss_pred ceeEEeec--cchhHH--------H----HHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchh-hhhh-----hc- Q lcl|NC_021540. 544 SFDIKLSI--SNAETD--------A----IKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDL-SKMI-----SK- 602 (705) Q Consensus 544 ~~dv~v~~--~~~~~~--------~----~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~-~~~~-----~~- 602 (705) +.++. -.+..+ . ++.+.+..+++......+......+ ......... .+.. .. T Consensus 500 ---v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~----~~~~~~~~~~~el~e~~~~k~~ 572 (705) T protein:vir:88 500 ---VAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVL----VSEQNLYNILKEVTENAGYKDP 572 (705) T ss_pred ---hccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhh----cChHHHHHHHHHHHHhhhhhhH Confidence 11211 001111 1 1223333333332222221110000 000000011 1111 11 Q ss_pred --ccccchhhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_021540. 603 --YNPEPSPQAQLEIQIKQLEAQELQM--RIAKLQAEIQLMPYEAQAEAAKARKANTEADLN--TLDFVEQETGVKQERE 676 (705) Q Consensus 603 --~~~q~~~~~q~~~q~~q~~~q~~q~--e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~--~~~~~~q~~~~kq~~e 676 (705) ...++....+++ .+.+..+.+.+. +..++|++++..+++++..+++++..+.+++.+ +++.+.+++.. ++++ T Consensus 573 ~~~~~~~~~~e~~~-~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~-~~~e 650 (705) T protein:vir:88 573 DRFWTNPNSPEALQ-AKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVL-QQRE 650 (705) T ss_pred HHHhhhhhhHHHHH-HHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 111111111111 111111111222 222333333333333333333333333332222 22222222111 1122 Q ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHhhccC Q lcl|NC_021540. 677 LELMQAQAKGN-------TQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 677 ~e~~~~q~~~~-------~~~~~~k~~~~~~~q~~~ 705 (705) ++.++++.+.+ +.....+.+++. .|+.+ T Consensus 651 ~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~-~q~~~ 685 (705) T protein:vir:88 651 MALKEAELQLERDRFTWERARNEAEYHLEA-TQARA 685 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 22221111111 111111111110 11111 No 127 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.62 E-value=1.9e-07 Score=57.47 Aligned_cols=563 Identities=12% Similarity=0.081 Sum_probs=181.4 Q ss_pred HHH-HHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHH Q lcl|NC_021540. 44 VAI-IDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQF 122 (705) Q Consensus 44 ~~~-~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~ 122 (705) +|+ .+++++-++..-. ..+...-+|.--.+-..-| ..+..--|.+...+.. .+ T Consensus 1 ma~~~~~~l~~~~~~~~--------------~~~~~~~~~r~~~~~d~~f-----~~~~G~QW~~~~~~~~--~~----- 54 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFD--------------RAHSPQEAVREKCLEATRF-----ARVPGGQWEGATAAGS--EL----- 54 (720) T ss_pred CchHHHHHHHHHHHHHH--------------HHHhhhHHHHHHHHHHHhh-----hccCCCCCCHHHHHHH--HH----- Confidence 221 2233332221100 0111112233222222111 1111223333332210 10 Q ss_pred HhhcCCcc-hHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHH Q lcl|NC_021540. 123 NNQLDKVK-LIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAE 201 (705) Q Consensus 123 ~~~~~~~~-~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (705) ..+..|.+ +..|.|+-.+..-.| ..+.+++.+++.+............+..+.......+....+++. T Consensus 55 ~l~~~~~P~~~~N~i~~~v~~v~g-----------~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~ 123 (720) T protein:vir:35 55 GKHFEKYPKFEINKISTELNRIIS-----------EYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDN 123 (720) T ss_pred HHhhCCCCeEEEccHHHHHHHHHh-----------HHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhH Confidence 11344555 344777766666555 455667778888876655455555555555555556778899999 Q ss_pred HHHhhhhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCc-- Q lcl|NC_021540. 202 SVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNL-- 279 (705) Q Consensus 202 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~-- 279 (705) +|.+.+.+|+||.++.+.+... ..|.. .+..+.+.|.-. ++...-|=-..+..+.++..-....+.+ T Consensus 124 Af~~~i~~G~G~~~v~~d~~~~-------~d~~~---~~~~i~i~~v~~-~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~ 192 (720) T protein:vir:35 124 AFDDGSTGGFGCFRLTTNLVNA-------LDPMD---ERQRICLEPIYD-PARSVWFDPDAKKYDKSDAEWAFCMYSLSA 192 (720) T ss_pred HHHHhhhccceeEEeeeccccc-------CCCCc---ccceeeEecccC-chhheeecccccccChhhhhhhhhhcCCCH Confidence 9999999999998754332210 01100 011112211100 1111111112233344443211111111 Q ss_pred chhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEE--------EEE----ECCEEEecccCC Q lcl|NC_021540. 280 EYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIV--------ASW----VDDVMIRLEKNP 347 (705) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~--------~~~----~g~~iL~~~~~p 347 (705) +.+...+. ...... +.+. ..-.+++ |+ +.+...+.+.+. +++ +|..+...+.++ T Consensus 193 d~~~~~yp-----~~a~~~----~~~~--~~~~~~d-~~--~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 258 (720) T protein:vir:35 193 EKYKAEYN-----KDPATL----MSGI--ERSWDYD-WY--DVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQL 258 (720) T ss_pred HHHHHhCC-----Cccccc----cccc--ccccccc-cc--CCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccH Confidence 11111111 111000 0000 0001111 11 111111222211 111 122222121111 Q ss_pred -------CCCC-----------------------------CcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHH Q lcl|NC_021540. 348 -------YPDG-----------------------------KLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMI 391 (705) Q Consensus 348 -------~~~~-----------------------------~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~ 391 (705) ...+ .+||-.||+.|.- |+... T Consensus 259 ~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~----g~r~~------------------ 316 (720) T protein:vir:35 259 ELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVY----GKRWF------------------ 316 (720) T ss_pred HHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEE----eeeec------------------ Confidence 0000 1122222222211 11110 Q ss_pred HHHHhcCCCc-EEeeccccCchhhhhh----------cCCccee--------------ecCCcc------------cccc Q lcl|NC_021540. 392 DAMARSANGQ-RGMSKNLLDPVNERKF----------KMGEDYK--------------YNPGTN------------PVTD 434 (705) Q Consensus 392 d~~~~~~~~~-~~~~~~av~~~d~~~~----------~pg~~i~--------------~~~~~~------------~~~~ 434 (705) ..+.+. +.+-.++.++-+..+. ..+.++. .++... ..+. T Consensus 317 ----~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~ 392 (720) T protein:vir:35 317 ----IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGN 392 (720) T ss_pred ----cCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccccccccccccccccCcc Confidence 011111 1222222222222111 1111111 011111 0111 Q ss_pred cccccCc----cchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_021540. 435 IIEHKYP----ELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRE-LGILRRLANGLTEVA 509 (705) Q Consensus 435 i~~~~~~----~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~-~~~~~n~~~~~~~~~ 509 (705) +...+.+ +.++-....++++......+ ....|.....++.. +..++..-+..+.. ....-.|-+.++.-. T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i----~~vsGi~~~~lG~~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~ 467 (720) T protein:vir:35 393 IIAPPTPVGYTQPQPLNQAMAALLQQTGADI----QEVTGSSQAMQPMP-SNIAKETVNHLMHRSDMSSFIYLDNMAKSL 467 (720) T ss_pred cccCCCcccccCCCCCchHHHHHHHHHHHHH----HHHhCCChHHcCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 11222222344455444444 33456554444432 22333221111111 111111222233333 Q ss_pred HHHHHHHHHhcCCceeEeEecCceeeechhhcccce-------------------eEEe---eccchhH---HHHHHHHH Q lcl|NC_021540. 510 KKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSF-------------------DIKL---SISNAET---DAIKAQEL 564 (705) Q Consensus 510 ~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~-------------------dv~v---~~~~~~~---~~~~~q~~ 564 (705) +.+..++..+... -.+.+..+.|...+-...+ |+.+ ++..... .+.+.+.+ T Consensus 468 ~~~g~~lL~lI~~----~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~ 543 (720) T protein:vir:35 468 KRAGEVWLSMARE----VYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATV 543 (720) T ss_pred HHHHHHHHHHHHH----HcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHH Confidence 4444444332211 0123334555443322111 1110 1111111 13445555 Q ss_pred HHHHHHHhhhchhHHHHHHH-HHHHhhhccchhhhhhhcccccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 565 SFMLQTMGQSLPFDMTKLIL-GEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQ--IKQLEAQELQMRIAKLQAEIQLMPY 641 (705) Q Consensus 565 ~~llq~~~~~~~~~~~~~il-~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q--~~q~~~q~~q~e~~k~qa~~q~~~~ 641 (705) ..|++.++...|......++ ..+.+...++...+..+.......+..+.... +.++..+.++.++++++++++ ++ T Consensus 544 ~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~--~a 621 (720) T protein:vir:35 544 SVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELV--AA 621 (720) T ss_pred HHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHH--HH Confidence 55555554433434445443 45577888887766666654433333211111 111111222222223333332 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH---HhhccC Q lcl|NC_021540. 642 EAQAEAAKARKANTEADLNTLDFVEQETGVKQER---ELELMQAQAKGNTQRDIVKTFLDT---NKQGNQ 705 (705) Q Consensus 642 ~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~---e~e~~~~q~~~~~~~~~~k~~~~~---~~q~~~ 705 (705) ++...+++++..+++++....+.+..+.+.+.+. .+....+|+...++..+.++.... ++++.+ T Consensus 622 qa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~ 691 (720) T protein:vir:35 622 QGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGD 691 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcch Confidence 2333334444433444333333333222222111 111111111111111121211110 111111 No 128 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=98.59 E-value=2.4e-07 Score=56.92 Aligned_cols=572 Identities=10% Similarity=0.021 Sum_probs=173.8 Q ss_pred HHHHHHHHHHHHHHhhcCC-------------CCEEEE-eCCCcchHHHHHHHHHHHHHHHHhhcCCcc-hHHHHHHHHH Q lcl|NC_021540. 76 IRKQAEWRYSALSEPFLND-------------ENIFSI-APKTWQDREAARQNEAILNYQFNNQLDKVK-LIDTMVRTAV 140 (705) Q Consensus 76 v~~~~e~~~~~l~~~f~~~-------------~~~~~~-~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~-~~~~~~~~al 140 (705) ..+..+.++-.+++-|.-- |.-|.+ .+.-|.++..+.... ..+..|.+ +.+|.|+-.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~-------~~q~~grP~~~~N~i~~~v 73 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL-------DEQFEKYPKFEINKVATEL 73 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHH-------hhhhcCCCceEEcchHHHH Confidence 2223333333333333110 001111 132343333221000 11223333 3446666666 Q ss_pred hcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcc Q lcl|NC_021540. 141 NEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGY 220 (705) Q Consensus 141 ~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 220 (705) ..-.| ..+.+++.+.+.+...+........+..+.......+....+++.+|.+.+.+|.||.++...+ T Consensus 74 ~~v~g-----------~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~ 142 (708) T protein:vir:10 74 NRIIA-----------EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) T ss_pred HHHHH-----------HHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeecc Confidence 66555 4456677788887765544445555555555555567788999999999999999987532111 Q ss_pred cccceeeeccCcc-eEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCc--Ccchhhhhhhhhhccccccc Q lcl|NC_021540. 221 EEQEVIKTVKNQP-EVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYS--NLEYIKEDSSTSTSSDHYSS 297 (705) Q Consensus 221 ~~~~~~~~~~~~~-~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~--d~~~~~~~~~~~~~~~~~~~ 297 (705) .. ...| .+.-..+-...+||.- ..-|=-..+..+.++..-...-+ +.+.+...+.+... .. T Consensus 143 ~~-------e~d~~~~~~~i~i~~~~~p~~-----~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~----~~ 206 (708) T protein:vir:10 143 VN-------EYDPMDDRQRIAIEPIYDPSR-----SVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP----TS 206 (708) T ss_pred cc-------ccCCCCCccccceEEeecchh-----hcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcc----cc Confidence 00 0000 0000001111112210 00000011112333322110011 11111111111100 00 Q ss_pred cccccccccccCeEEEEEEEEEeeecCCCeeEEEEEE--------E----ECCEEEecccCC------------------ Q lcl|NC_021540. 298 DTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVAS--------W----VDDVMIRLEKNP------------------ 347 (705) Q Consensus 298 ~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~--------~----~g~~iL~~~~~p------------------ 347 (705) ....++++. .. -|. ..+...+.+.|+.. + +|+.+...+... T Consensus 207 ~d~~~~~~~------~~-~~~--~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r 277 (708) T protein:vir:10 207 LDVTSMTSW------EY-NWF--GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARR 277 (708) T ss_pred cccccCCCc------cc-ccc--CCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhhee Confidence 001111100 00 121 11112223332221 1 122211111110 Q ss_pred ------------------CCCCCcceEEeeeeeecCcccC-CchH--HHhhHHHHHHHHHHHHHHHH-HHhcCCCcEEee Q lcl|NC_021540. 348 ------------------YPDGKLPFVVVPYLPVKDSVYG-EADA--ELLSDNQKLIGALTRGMIDA-MARSANGQRGMS 405 (705) Q Consensus 348 ------------------~~~~~~Pfv~~~~~~~~~~~~g-~g~~--~~~~d~Q~~iN~~~~~~~d~-~~~~~~~~~~~~ 405 (705) ...+.+||..|++.|.-+.... .|.. ..+...=+-.=...|...-. +...+..+..+. T Consensus 278 ~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~ 357 (708) T protein:vir:10 278 SVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIP 357 (708) T ss_pred eeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCccc Confidence 0112244444444433222210 0111 11111111111111111000 000100000000 Q ss_pred ccccCchhhhhhcCCcceeec-------CCcccccccccccCc----cchHHHHHHHHHHHHHHHHHhCcchHhcCCCcc Q lcl|NC_021540. 406 KNLLDPVNERKFKMGEDYKYN-------PGTNPVTDIIEHKYP----ELPASSYNMLQMFTLEADALSGVKSFSQGLTGD 474 (705) Q Consensus 406 ~~av~~~d~~~~~pg~~i~~~-------~~~~~~~~i~~~~~~----~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~ 474 (705) -........+..+-+..-..+ +.....+.+.....+ +.++-....++++......+.-+ .|.... T Consensus 358 i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~v----sG~~~~ 433 (708) T protein:vir:10 358 IVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEV----TGGSQA 433 (708) T ss_pred ccChhhhhhHHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHH----hCcChh Confidence 000000000111100100000 001111111111111 22233333566666666665444 565544 Q ss_pred ccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccce-------- Q lcl|NC_021540. 475 SLGTTTAGVQGVIGASGKRE-LGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSF-------- 545 (705) Q Consensus 475 ~~~~~a~~i~~l~~~~~~~~-~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~-------- 545 (705) .++ ..+.+++..-++.+.. ....-.|-+.++.-.+.+..++..+... -.+.+..+.|..++-..++ T Consensus 434 ~lG-~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~----~y~~er~~RI~~edg~~~~v~in~~~~ 508 (708) T protein:vir:10 434 MQQ-MPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMARE----VYGSEREVRIVNEDGSDDIAVLSAQVV 508 (708) T ss_pred Hcc-CccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HcCCCcEEEEecCCCCcceEEecceec Confidence 433 2333333211111111 1111111122333334444444433221 1123334555544321111 Q ss_pred -----------eEEe---eccchhH--HHHHHHHHHHHHHHHhhhchhHHH-HH-HHHHHHhhhccchhhhhhhcccccc Q lcl|NC_021540. 546 -----------DIKL---SISNAET--DAIKAQELSFMLQTMGQSLPFDMT-KL-ILGEIAKLRGMPDLSKMISKYNPEP 607 (705) Q Consensus 546 -----------dv~v---~~~~~~~--~~~~~q~~~~llq~~~~~~~~~~~-~~-il~~l~e~~~~~~~~~~~~~~~~q~ 607 (705) |+.+ ++..... ...+.++....|..+.+.+++... .. ++.-+.++..++...+..+....+. T Consensus 509 d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~ 588 (708) T protein:vir:10 509 DRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQL 588 (708) T ss_pred cCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhh Confidence 1100 1111111 123344444444444455555433 33 3445667777777666665554432 Q ss_pred hhhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH Q lcl|NC_021540. 608 SPQAQLEIQIKQLE---AQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQEREL---ELMQ 681 (705) Q Consensus 608 ~~~~q~~~q~~q~~---~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~---e~~~ 681 (705) .+..+........+ .+..++++++++.. ..+++++..+++++..+.+++..+.+.. ..+++.+. +.+. T Consensus 589 ~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~--~~e~qa~~~~~qAe~~ka~a~a~~~~~~----a~q~~~~~~~a~~~a 662 (708) T protein:vir:10 589 LISGIAKPRNEKEQQIVQQAQMAAQSQPNPE--MVLAQAQMVAAQAEAQKATNETAQTQIK----AFTAQQDAMESQANT 662 (708) T ss_pred cccccccccchhhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHH Confidence 22222221111111 11111111111111 1122222222222222222222221111 11111111 1111 Q ss_pred HHHHHHH-----HHHHHHHHHHHHhhccC Q lcl|NC_021540. 682 AQAKGNT-----QRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 682 ~q~~~~~-----~~~~~k~~~~~~~q~~~ 705 (705) .+.-.++ ...+..++.....|..+ T Consensus 663 ~q~~~~a~~~~~~~~~~~~q~l~~~q~~q 691 (708) T protein:vir:10 663 VYKLAQARNIDDKAVMEAIRLLKDVAESQ 691 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 1111111 11111111222222222 No 129 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=98.49 E-value=4.7e-07 Score=55.27 Aligned_cols=578 Identities=13% Similarity=0.075 Sum_probs=182.1 Q ss_pred CCCCC-CC-CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHH-------hhcC Q lcl|NC_021540. 58 GAYKP-KQ-QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFN-------NQLD 127 (705) Q Consensus 58 ~~~~~-~~-~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~-------~~~~ 127 (705) |++-. .. .++-| ...++.-..++..+.+-.-...+| ...|.-.-+|.+ -+|. ...+ T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------R~~a~~d~~fy~G~Qw~~~~~~~l~~~g 66 (714) T protein:vir:81 1 MKNETNTMATKNDN----GATPRFSQRQLQALCSDIDSQPKW----------RDAANKACAYYDGDQLPPEVLQVLKDRG 66 (714) T ss_pred CCcccccccCCCCc----chhHHHHHHHHHHHHHHHHhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHhcC Confidence 43322 11 22223 222332233333333322222222 122222222322 1221 1111 Q ss_pred CcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 128 KVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 128 ~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) .-.+.+|.|+-.+..-. ...+.+++.+.+.+...+ ........+..+.......+....+++.+|.+. T Consensus 67 ~p~~~~N~i~~~v~~v~-----------g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:81 67 QPMTIHNLIAPTVDGVL-----------GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred CCcEEeccHHHHHHHHH-----------hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 11223344444443333 355667778888886643 333344444444444444667888999999999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCcC--cchhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIK 283 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~ 283 (705) +.+|+||..+ +++++.|-.++.++. ++.+--|=-..+..+.++..=..+... .+.+. T Consensus 136 ~~~G~G~~~~--------------------~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~ 195 (714) T protein:vir:81 136 IKAGLSWVEV--------------------RRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAK 195 (714) T ss_pred hhcCcceEEe--------------------ccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHH Confidence 9999998532 112221211111111 122211000112223344322211111 22222 Q ss_pred hhhhh---hh---ccccccccccccccccccCeEEEEEEEEEeeecC------CCe-eEEEEEEE------------ECC Q lcl|NC_021540. 284 EDSST---ST---SSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG------SGV-TTPIVASW------------VDD 338 (705) Q Consensus 284 ~~~~~---~~---~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~------dg~-~~~~~~~~------------~g~ 338 (705) ..+.+ .+ ...+....+.............-++....++... +.. .+.+.+++ .|+ T Consensus 196 ~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~ 275 (714) T protein:vir:81 196 ATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGR 275 (714) T ss_pred HhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCc Confidence 21111 11 1111111111000000011111122111111111 111 11111111 122 Q ss_pred EEEecccCC-------------------------------CCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 339 VMIRLEKNP-------------------------------YPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 339 ~iL~~~~~p-------------------------------~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) .+...+.+| ...+..|| -.|++.|+ || +.+ +.....-- T Consensus 276 ~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~----~g--~~~---~~~g~~~G 346 (714) T protein:vir:81 276 VVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----WG--YRK---DKTGEPYG 346 (714) T ss_pred eEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEE----ee--eee---eccCceee Confidence 222222211 00111122 12333332 22 111 11111122 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCcccccccccc-------------cCccchHHH Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIEH-------------KYPELPASS 447 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~~-------------~~~~i~~~~ 447 (705) +.|.++|+-...|.-...+.. +++..-.+ ..+|++.... ..+.+...+.+. .+.+.++-. T Consensus 347 ~vr~~~d~Qr~~N~~~s~~~~-~l~~~~~~-~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~ 424 (714) T protein:vir:81 347 LISRAIPAQDEVNFRRIKLTW-LLQAKRVI-MDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVA 424 (714) T ss_pred hhhhchhHHHHHHHHHHHHHH-hhcCCcee-eecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCcc Confidence 344444443222211110000 11111001 1233332211 112222222221 112223344 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ...++++......+- ...|.....++....+.++..-++.+... ...-.+-+.++...+.+..++..+... T Consensus 425 ~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~---- 496 (714) T protein:vir:81 425 SQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD---- 496 (714) T ss_pred HHHHHHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 445555555555443 34565544443333334443222222111 111122233344444444444433211 Q ss_pred eEecCceeeechh-hcccc-eeEEee------------------ccchhHH---HHHHHHHHHHHHHHhhhchhHHHHHH Q lcl|NC_021540. 527 RITDEEFVQINRD-NLVGS-FDIKLS------------------ISNAETD---AIKAQELSFMLQTMGQSLPFDMTKLI 583 (705) Q Consensus 527 ri~~~~~v~i~~~-~~~~~-~dv~v~------------------~~~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~i 583 (705) -.+.+..+.|... +-.+. --+.++ +...+.. +.+.+.+..|++.+ ..+|+.....+ T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~-~~~~p~~~~~~ 575 (714) T protein:vir:81 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVI-QGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHH-hhcCchhhhhH Confidence 1122334444321 11110 012221 1112222 23344444555554 45676666667 Q ss_pred HHHHHhhhccchhhhhhhcccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_021540. 584 LGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIK-QLEAQELQMRIAKLQAEIQLMPYEAQA--EAAKARKANTEADLN 660 (705) Q Consensus 584 l~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~-q~~~q~~q~e~~k~qa~~q~~~~~~q~--e~a~a~~~~~ea~~~ 660 (705) +.-+.++..++...+..+.......+.......+. +.+++..+..+++.+++++..+.+++. .++++++.+.++... T Consensus 576 ~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:81 576 LDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77788999998887776665443222111121111 111111111112222222222222222 122222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHhhccC Q lcl|NC_021540. 661 TLDFVEQETGVKQERELELMQAQAKGNTQ-------RDIVKTFLD-TNKQGNQ 705 (705) Q Consensus 661 ~~~~~~q~~~~kq~~e~e~~~~q~~~~~~-------~~~~k~~~~-~~~q~~~ 705 (705) ..+...+.. .++.++... +..+++ .+.+..... .++|..+ T Consensus 656 ~~~a~~~~~----~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q 703 (714) T protein:vir:81 656 NASAQREVA----LTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHH Confidence 111110000 111111110 000000 011111111 1112122 No 130 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=98.49 E-value=4.7e-07 Score=55.27 Aligned_cols=578 Identities=13% Similarity=0.075 Sum_probs=182.1 Q ss_pred CCCCC-CC-CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHH-------hhcC Q lcl|NC_021540. 58 GAYKP-KQ-QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFN-------NQLD 127 (705) Q Consensus 58 ~~~~~-~~-~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~-------~~~~ 127 (705) |++-. .. .++-| ...++.-..++..+.+-.-...+| ...|.-.-+|.+ -+|. ...+ T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------R~~a~~d~~fy~G~Qw~~~~~~~l~~~g 66 (714) T protein:vir:27 1 MKNETNTMATKNDN----GATPRFSQRQLQALCSDIDSQPKW----------RDAANKACAYYDGDQLPPEVLQVLKDRG 66 (714) T ss_pred CCcccccccCCCCc----chhHHHHHHHHHHHHHHHHhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHhcC Confidence 43322 11 22223 222332233333333322222222 122222222322 1221 1111 Q ss_pred CcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 128 KVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 128 ~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) .-.+.+|.|+-.+..-. ...+.+++.+.+.+...+ ........+..+.......+....+++.+|.+. T Consensus 67 ~p~~~~N~i~~~v~~v~-----------g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:27 67 QPMTIHNLIAPTVDGVL-----------GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred CCcEEeccHHHHHHHHH-----------hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 11223344444443333 355667778888886643 333344444444444444667888999999999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCcC--cchhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIK 283 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~ 283 (705) +.+|+||..+ +++++.|-.++.++. ++.+--|=-..+..+.++..=..+... .+.+. T Consensus 136 ~~~G~G~~~~--------------------~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~ 195 (714) T protein:vir:27 136 IKAGLSWVEV--------------------RRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAK 195 (714) T ss_pred hhcCcceEEe--------------------ccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHH Confidence 9999998532 112221211111111 122211000112223344322211111 22222 Q ss_pred hhhhh---hh---ccccccccccccccccccCeEEEEEEEEEeeecC------CCe-eEEEEEEE------------ECC Q lcl|NC_021540. 284 EDSST---ST---SSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG------SGV-TTPIVASW------------VDD 338 (705) Q Consensus 284 ~~~~~---~~---~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~------dg~-~~~~~~~~------------~g~ 338 (705) ..+.+ .+ ...+....+.............-++....++... +.. .+.+.+++ .|+ T Consensus 196 ~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~ 275 (714) T protein:vir:27 196 ATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGR 275 (714) T ss_pred HhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCc Confidence 21111 11 1111111111000000011111122111111111 111 11111111 122 Q ss_pred EEEecccCC-------------------------------CCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 339 VMIRLEKNP-------------------------------YPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 339 ~iL~~~~~p-------------------------------~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) .+...+.+| ...+..|| -.|++.|+ || +.+ +.....-- T Consensus 276 ~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~----~g--~~~---~~~g~~~G 346 (714) T protein:vir:27 276 VVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----WG--YRK---DKTGEPYG 346 (714) T ss_pred eEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEE----ee--eee---eccCceee Confidence 222222211 00111122 12333332 22 111 11111122 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCcccccccccc-------------cCccchHHH Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIEH-------------KYPELPASS 447 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~~-------------~~~~i~~~~ 447 (705) +.|.++|+-...|.-...+.. +++..-.+ ..+|++.... ..+.+...+.+. .+.+.++-. T Consensus 347 ~vr~~~d~Qr~~N~~~s~~~~-~l~~~~~~-~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~ 424 (714) T protein:vir:27 347 LISRAIPAQDEVNFRRIKLTW-LLQAKRVI-MDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVA 424 (714) T ss_pred hhhhchhHHHHHHHHHHHHHH-hhcCCcee-eecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCcc Confidence 344444443222211110000 11111001 1233332211 112222222221 112223344 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ...++++......+- ...|.....++....+.++..-++.+... ...-.+-+.++...+.+..++..+... T Consensus 425 ~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~---- 496 (714) T protein:vir:27 425 SQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD---- 496 (714) T ss_pred HHHHHHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 445555555555443 34565544443333334443222222111 111122233344444444444433211 Q ss_pred eEecCceeeechh-hcccc-eeEEee------------------ccchhHH---HHHHHHHHHHHHHHhhhchhHHHHHH Q lcl|NC_021540. 527 RITDEEFVQINRD-NLVGS-FDIKLS------------------ISNAETD---AIKAQELSFMLQTMGQSLPFDMTKLI 583 (705) Q Consensus 527 ri~~~~~v~i~~~-~~~~~-~dv~v~------------------~~~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~i 583 (705) -.+.+..+.|... +-.+. --+.++ +...+.. +.+.+.+..|++.+ ..+|+.....+ T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~-~~~~p~~~~~~ 575 (714) T protein:vir:27 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVI-QGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHH-hhcCchhhhhH Confidence 1122334444321 11110 012221 1112222 23344444555554 45676666667 Q ss_pred HHHHHhhhccchhhhhhhcccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_021540. 584 LGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIK-QLEAQELQMRIAKLQAEIQLMPYEAQA--EAAKARKANTEADLN 660 (705) Q Consensus 584 l~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~-q~~~q~~q~e~~k~qa~~q~~~~~~q~--e~a~a~~~~~ea~~~ 660 (705) +.-+.++..++...+..+.......+.......+. +.+++..+..+++.+++++..+.+++. .++++++.+.++... T Consensus 576 ~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:27 576 LDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77788999998887776665443222111121111 111111111112222222222222222 122222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHhhccC Q lcl|NC_021540. 661 TLDFVEQETGVKQERELELMQAQAKGNTQ-------RDIVKTFLD-TNKQGNQ 705 (705) Q Consensus 661 ~~~~~~q~~~~kq~~e~e~~~~q~~~~~~-------~~~~k~~~~-~~~q~~~ 705 (705) ..+...+.. .++.++... +..+++ .+.+..... .++|..+ T Consensus 656 ~~~a~~~~~----~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q 703 (714) T protein:vir:27 656 NASAQREVA----LTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHH Confidence 111110000 111111110 000000 011111111 1112122 No 131 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=98.49 E-value=4.7e-07 Score=55.27 Aligned_cols=578 Identities=13% Similarity=0.075 Sum_probs=182.1 Q ss_pred CCCCC-CC-CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHH-------hhcC Q lcl|NC_021540. 58 GAYKP-KQ-QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFN-------NQLD 127 (705) Q Consensus 58 ~~~~~-~~-~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~-------~~~~ 127 (705) |++-. .. .++-| ...++.-..++..+.+-.-...+| ...|.-.-+|.+ -+|. ...+ T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------R~~a~~d~~fy~G~Qw~~~~~~~l~~~g 66 (714) T protein:vir:10 1 MKNETNTMATKNDN----GATPRFSQRQLQALCSDIDSQPKW----------RDAANKACAYYDGDQLPPEVLQVLKDRG 66 (714) T ss_pred CCcccccccCCCCc----chhHHHHHHHHHHHHHHHHhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHhcC Confidence 43322 11 22223 222332233333333322222222 122222222322 1221 1111 Q ss_pred CcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 128 KVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 128 ~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) .-.+.+|.|+-.+..-. ...+.+++.+.+.+...+ ........+..+.......+....+++.+|.+. T Consensus 67 ~p~~~~N~i~~~v~~v~-----------g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:10 67 QPMTIHNLIAPTVDGVL-----------GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred CCcEEeccHHHHHHHHH-----------hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 11223344444443333 355667778888886643 333344444444444444667888999999999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCcC--cchhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIK 283 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~ 283 (705) +.+|+||..+ +++++.|-.++.++. ++.+--|=-..+..+.++..=..+... .+.+. T Consensus 136 ~~~G~G~~~~--------------------~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~ 195 (714) T protein:vir:10 136 IKAGLSWVEV--------------------RRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAK 195 (714) T ss_pred hhcCcceEEe--------------------ccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHH Confidence 9999998532 112221211111111 122211000112223344322211111 22222 Q ss_pred hhhhh---hh---ccccccccccccccccccCeEEEEEEEEEeeecC------CCe-eEEEEEEE------------ECC Q lcl|NC_021540. 284 EDSST---ST---SSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG------SGV-TTPIVASW------------VDD 338 (705) Q Consensus 284 ~~~~~---~~---~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~------dg~-~~~~~~~~------------~g~ 338 (705) ..+.+ .+ ...+....+.............-++....++... +.. .+.+.+++ .|+ T Consensus 196 ~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~ 275 (714) T protein:vir:10 196 ATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGR 275 (714) T ss_pred HhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCc Confidence 21111 11 1111111111000000011111122111111111 111 11111111 122 Q ss_pred EEEecccCC-------------------------------CCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 339 VMIRLEKNP-------------------------------YPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 339 ~iL~~~~~p-------------------------------~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) .+...+.+| ...+..|| -.|++.|+ || +.+ +.....-- T Consensus 276 ~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~----~g--~~~---~~~g~~~G 346 (714) T protein:vir:10 276 VVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----WG--YRK---DKTGEPYG 346 (714) T ss_pred eEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEE----ee--eee---eccCceee Confidence 222222211 00111122 12333332 22 111 11111122 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCcccccccccc-------------cCccchHHH Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIEH-------------KYPELPASS 447 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~~-------------~~~~i~~~~ 447 (705) +.|.++|+-...|.-...+.. +++..-.+ ..+|++.... ..+.+...+.+. .+.+.++-. T Consensus 347 ~vr~~~d~Qr~~N~~~s~~~~-~l~~~~~~-~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~ 424 (714) T protein:vir:10 347 LISRAIPAQDEVNFRRIKLTW-LLQAKRVI-MDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVA 424 (714) T ss_pred hhhhchhHHHHHHHHHHHHHH-hhcCCcee-eecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCcc Confidence 344444443222211110000 11111001 1233332211 112222222221 112223344 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ...++++......+- ...|.....++....+.++..-++.+... ...-.+-+.++...+.+..++..+... T Consensus 425 ~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~---- 496 (714) T protein:vir:10 425 SQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD---- 496 (714) T ss_pred HHHHHHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 445555555555443 34565544443333334443222222111 111122233344444444444433211 Q ss_pred eEecCceeeechh-hcccc-eeEEee------------------ccchhHH---HHHHHHHHHHHHHHhhhchhHHHHHH Q lcl|NC_021540. 527 RITDEEFVQINRD-NLVGS-FDIKLS------------------ISNAETD---AIKAQELSFMLQTMGQSLPFDMTKLI 583 (705) Q Consensus 527 ri~~~~~v~i~~~-~~~~~-~dv~v~------------------~~~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~i 583 (705) -.+.+..+.|... +-.+. --+.++ +...+.. +.+.+.+..|++.+ ..+|+.....+ T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~-~~~~p~~~~~~ 575 (714) T protein:vir:10 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVI-QGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHH-hhcCchhhhhH Confidence 1122334444321 11110 012221 1112222 23344444555554 45676666667 Q ss_pred HHHHHhhhccchhhhhhhcccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_021540. 584 LGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIK-QLEAQELQMRIAKLQAEIQLMPYEAQA--EAAKARKANTEADLN 660 (705) Q Consensus 584 l~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~-q~~~q~~q~e~~k~qa~~q~~~~~~q~--e~a~a~~~~~ea~~~ 660 (705) +.-+.++..++...+..+.......+.......+. +.+++..+..+++.+++++..+.+++. .++++++.+.++... T Consensus 576 ~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:10 576 LDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77788999998887776665443222111121111 111111111112222222222222222 122222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHhhccC Q lcl|NC_021540. 661 TLDFVEQETGVKQERELELMQAQAKGNTQ-------RDIVKTFLD-TNKQGNQ 705 (705) Q Consensus 661 ~~~~~~q~~~~kq~~e~e~~~~q~~~~~~-------~~~~k~~~~-~~~q~~~ 705 (705) ..+...+.. .++.++... +..+++ .+.+..... .++|..+ T Consensus 656 ~~~a~~~~~----~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q 703 (714) T protein:vir:10 656 NASAQREVA----LTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHH Confidence 111110000 111111110 000000 011111111 1112122 No 132 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=98.49 E-value=4.7e-07 Score=55.27 Aligned_cols=578 Identities=13% Similarity=0.075 Sum_probs=182.1 Q ss_pred CCCCC-CC-CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHH-------hhcC Q lcl|NC_021540. 58 GAYKP-KQ-QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFN-------NQLD 127 (705) Q Consensus 58 ~~~~~-~~-~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~-------~~~~ 127 (705) |++-. .. .++-| ...++.-..++..+.+-.-...+| ...|.-.-+|.+ -+|. ...+ T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------R~~a~~d~~fy~G~Qw~~~~~~~l~~~g 66 (714) T protein:vir:99 1 MKNETNTMATKNDN----GATPRFSQRQLQALCSDIDSQPKW----------RDAANKACAYYDGDQLPPEVLQVLKDRG 66 (714) T ss_pred CCcccccccCCCCc----chhHHHHHHHHHHHHHHHHhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHhcC Confidence 43322 11 22223 222332233333333322222222 122222222322 1221 1111 Q ss_pred CcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 128 KVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 128 ~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) .-.+.+|.|+-.+..-. ...+.+++.+.+.+...+ ........+..+.......+....+++.+|.+. T Consensus 67 ~p~~~~N~i~~~v~~v~-----------g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:99 67 QPMTIHNLIAPTVDGVL-----------GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred CCcEEeccHHHHHHHHH-----------hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 11223344444443333 355667778888886643 333344444444444444667888999999999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCcC--cchhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIK 283 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~ 283 (705) +.+|+||..+ +++++.|-.++.++. ++.+--|=-..+..+.++..=..+... .+.+. T Consensus 136 ~~~G~G~~~~--------------------~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~ 195 (714) T protein:vir:99 136 IKAGLSWVEV--------------------RRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAK 195 (714) T ss_pred hhcCcceEEe--------------------ccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHH Confidence 9999998532 112221211111111 122211000112223344322211111 22222 Q ss_pred hhhhh---hh---ccccccccccccccccccCeEEEEEEEEEeeecC------CCe-eEEEEEEE------------ECC Q lcl|NC_021540. 284 EDSST---ST---SSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG------SGV-TTPIVASW------------VDD 338 (705) Q Consensus 284 ~~~~~---~~---~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~------dg~-~~~~~~~~------------~g~ 338 (705) ..+.+ .+ ...+....+.............-++....++... +.. .+.+.+++ .|+ T Consensus 196 ~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~ 275 (714) T protein:vir:99 196 ATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGR 275 (714) T ss_pred HhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCc Confidence 21111 11 1111111111000000011111122111111111 111 11111111 122 Q ss_pred EEEecccCC-------------------------------CCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 339 VMIRLEKNP-------------------------------YPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 339 ~iL~~~~~p-------------------------------~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) .+...+.+| ...+..|| -.|++.|+ || +.+ +.....-- T Consensus 276 ~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~----~g--~~~---~~~g~~~G 346 (714) T protein:vir:99 276 VVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----WG--YRK---DKTGEPYG 346 (714) T ss_pred eEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEE----ee--eee---eccCceee Confidence 222222211 00111122 12333332 22 111 11111122 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCcccccccccc-------------cCccchHHH Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIEH-------------KYPELPASS 447 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~~-------------~~~~i~~~~ 447 (705) +.|.++|+-...|.-...+.. +++..-.+ ..+|++.... ..+.+...+.+. .+.+.++-. T Consensus 347 ~vr~~~d~Qr~~N~~~s~~~~-~l~~~~~~-~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~ 424 (714) T protein:vir:99 347 LISRAIPAQDEVNFRRIKLTW-LLQAKRVI-MDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVA 424 (714) T ss_pred hhhhchhHHHHHHHHHHHHHH-hhcCCcee-eecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCcc Confidence 344444443222211110000 11111001 1233332211 112222222221 112223344 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ...++++......+- ...|.....++....+.++..-++.+... ...-.+-+.++...+.+..++..+... T Consensus 425 ~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~---- 496 (714) T protein:vir:99 425 SQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD---- 496 (714) T ss_pred HHHHHHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 445555555555443 34565544443333334443222222111 111122233344444444444433211 Q ss_pred eEecCceeeechh-hcccc-eeEEee------------------ccchhHH---HHHHHHHHHHHHHHhhhchhHHHHHH Q lcl|NC_021540. 527 RITDEEFVQINRD-NLVGS-FDIKLS------------------ISNAETD---AIKAQELSFMLQTMGQSLPFDMTKLI 583 (705) Q Consensus 527 ri~~~~~v~i~~~-~~~~~-~dv~v~------------------~~~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~i 583 (705) -.+.+..+.|... +-.+. --+.++ +...+.. +.+.+.+..|++.+ ..+|+.....+ T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~-~~~~p~~~~~~ 575 (714) T protein:vir:99 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVI-QGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHH-hhcCchhhhhH Confidence 1122334444321 11110 012221 1112222 23344444555554 45676666667 Q ss_pred HHHHHhhhccchhhhhhhcccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_021540. 584 LGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIK-QLEAQELQMRIAKLQAEIQLMPYEAQA--EAAKARKANTEADLN 660 (705) Q Consensus 584 l~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~-q~~~q~~q~e~~k~qa~~q~~~~~~q~--e~a~a~~~~~ea~~~ 660 (705) +.-+.++..++...+..+.......+.......+. +.+++..+..+++.+++++..+.+++. .++++++.+.++... T Consensus 576 ~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:99 576 LDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77788999998887776665443222111121111 111111111112222222222222222 122222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHhhccC Q lcl|NC_021540. 661 TLDFVEQETGVKQERELELMQAQAKGNTQ-------RDIVKTFLD-TNKQGNQ 705 (705) Q Consensus 661 ~~~~~~q~~~~kq~~e~e~~~~q~~~~~~-------~~~~k~~~~-~~~q~~~ 705 (705) ..+...+.. .++.++... +..+++ .+.+..... .++|..+ T Consensus 656 ~~~a~~~~~----~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q 703 (714) T protein:vir:99 656 NASAQREVA----LTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHH Confidence 111110000 111111110 000000 011111111 1112122 No 133 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=98.49 E-value=4.7e-07 Score=55.27 Aligned_cols=578 Identities=13% Similarity=0.075 Sum_probs=182.1 Q ss_pred CCCCC-CC-CCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHH-HHHH-------hhcC Q lcl|NC_021540. 58 GAYKP-KQ-QVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILN-YQFN-------NQLD 127 (705) Q Consensus 58 ~~~~~-~~-~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n-~~~~-------~~~~ 127 (705) |++-. .. .++-| ...++.-..++..+.+-.-...+| ...|.-.-+|.+ -+|. ...+ T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------R~~a~~d~~fy~G~Qw~~~~~~~l~~~g 66 (714) T protein:vir:32 1 MKNETNTMATKNDN----GATPRFSQRQLQALCSDIDSQPKW----------RDAANKACAYYDGDQLPPEVLQVLKDRG 66 (714) T ss_pred CCcccccccCCCCc----chhHHHHHHHHHHHHHHHHhhHHH----------HHHHHHHHHhhcCCCCCHHHHHHHHhcC Confidence 43322 11 22223 222332233333333322222222 122222222322 1221 1111 Q ss_pred CcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCch-hHHHHHHHHHHHhhchhhhcchHHHHHHHHHhh Q lcl|NC_021540. 128 KVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGE-SIDLINQAVQMYQMNPSILDTMPEALAESVRYS 206 (705) Q Consensus 128 ~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (705) .-.+.+|.|+-.+..-. ...+.+++.+.+.+...+ ........+..+.......+....+++.+|.+. T Consensus 67 ~p~~~~N~i~~~v~~v~-----------g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~ 135 (714) T protein:vir:32 67 QPMTIHNLIAPTVDGVL-----------GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQ 135 (714) T ss_pred CCcEEeccHHHHHHHHH-----------hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHh Confidence 11223344444443333 355667778888886643 333344444444444444667888999999999 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccC-ChhhCCeEEEEEeccHHHHHHhcCCcC--cchhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNG-NLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIK 283 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~ 283 (705) +.+|+||..+ +++++.|-.++.++. ++.+--|=-..+..+.++..=..+... .+.+. T Consensus 136 ~~~G~G~~~~--------------------~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~ 195 (714) T protein:vir:32 136 IKAGLSWVEV--------------------RRNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAK 195 (714) T ss_pred hhcCcceEEe--------------------ccccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHH Confidence 9999998532 112221211111111 122211000112223344322211111 22222 Q ss_pred hhhhh---hh---ccccccccccccccccccCeEEEEEEEEEeeecC------CCe-eEEEEEEE------------ECC Q lcl|NC_021540. 284 EDSST---ST---SSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDG------SGV-TTPIVASW------------VDD 338 (705) Q Consensus 284 ~~~~~---~~---~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~------dg~-~~~~~~~~------------~g~ 338 (705) ..+.+ .+ ...+....+.............-++....++... +.. .+.+.+++ .|+ T Consensus 196 ~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~ 275 (714) T protein:vir:32 196 ATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGR 275 (714) T ss_pred HhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCc Confidence 21111 11 1111111111000000011111122111111111 111 11111111 122 Q ss_pred EEEecccCC-------------------------------CCCCCcce--EEeeeeeecCcccCCchHHHhhHHHHHHHH Q lcl|NC_021540. 339 VMIRLEKNP-------------------------------YPDGKLPF--VVVPYLPVKDSVYGEADAELLSDNQKLIGA 385 (705) Q Consensus 339 ~iL~~~~~p-------------------------------~~~~~~Pf--v~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~ 385 (705) .+...+.+| ...+..|| -.|++.|+ || +.+ +.....-- T Consensus 276 ~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~----~g--~~~---~~~g~~~G 346 (714) T protein:vir:32 276 VVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPF----WG--YRK---DKTGEPYG 346 (714) T ss_pred eEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEE----ee--eee---eccCceee Confidence 222222211 00111122 12333332 22 111 11111122 Q ss_pred HHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-----CCcccccccccc-------------cCccchHHH Q lcl|NC_021540. 386 LTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-----PGTNPVTDIIEH-------------KYPELPASS 447 (705) Q Consensus 386 ~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-----~~~~~~~~i~~~-------------~~~~i~~~~ 447 (705) +.|.++|+-...|.-...+.. +++..-.+ ..+|++.... ..+.+...+.+. .+.+.++-. T Consensus 347 ~vr~~~d~Qr~~N~~~s~~~~-~l~~~~~~-~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~ 424 (714) T protein:vir:32 347 LISRAIPAQDEVNFRRIKLTW-LLQAKRVI-MDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVA 424 (714) T ss_pred hhhhchhHHHHHHHHHHHHHH-hhcCCcee-eecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCcc Confidence 344444443222211110000 11111001 1233332211 112222222221 112223344 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeE Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVI 526 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~i 526 (705) ...++++......+- ...|.....++....+.++..-++.+... ...-.+-+.++...+.+..++..+... T Consensus 425 ~~~~~llq~~~~~i~----~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~---- 496 (714) T protein:vir:32 425 SQQFQVMQESEKLIQ----DTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLD---- 496 (714) T ss_pred HHHHHHHHHHHHHHH----HhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 445555555555443 34565544443333334443222222111 111122233344444444444433211 Q ss_pred eEecCceeeechh-hcccc-eeEEee------------------ccchhHH---HHHHHHHHHHHHHHhhhchhHHHHHH Q lcl|NC_021540. 527 RITDEEFVQINRD-NLVGS-FDIKLS------------------ISNAETD---AIKAQELSFMLQTMGQSLPFDMTKLI 583 (705) Q Consensus 527 ri~~~~~v~i~~~-~~~~~-~dv~v~------------------~~~~~~~---~~~~q~~~~llq~~~~~~~~~~~~~i 583 (705) -.+.+..+.|... +-.+. --+.++ +...+.. +.+.+.+..|++.+ ..+|+.....+ T Consensus 497 ~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~-~~~~p~~~~~~ 575 (714) T protein:vir:32 497 DLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVI-QGLPPQVQAVV 575 (714) T ss_pred HcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHH-hhcCchhhhhH Confidence 1122334444321 11110 012221 1112222 23344444555554 45676666667 Q ss_pred HHHHHhhhccchhhhhhhcccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Q lcl|NC_021540. 584 LGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIK-QLEAQELQMRIAKLQAEIQLMPYEAQA--EAAKARKANTEADLN 660 (705) Q Consensus 584 l~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~-q~~~q~~q~e~~k~qa~~q~~~~~~q~--e~a~a~~~~~ea~~~ 660 (705) +.-+.++..++...+..+.......+.......+. +.+++..+..+++.+++++..+.+++. .++++++.+.++... T Consensus 576 ~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:32 576 LDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77788999998887776665443222111121111 111111111112222222222222222 122222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHhhccC Q lcl|NC_021540. 661 TLDFVEQETGVKQERELELMQAQAKGNTQ-------RDIVKTFLD-TNKQGNQ 705 (705) Q Consensus 661 ~~~~~~q~~~~kq~~e~e~~~~q~~~~~~-------~~~~k~~~~-~~~q~~~ 705 (705) ..+...+.. .++.++... +..+++ .+.+..... .++|..+ T Consensus 656 ~~~a~~~~~----~~~~~~~~~-~~~~a~~a~~~~~~~~~~~~~~~~~~q~~q 703 (714) T protein:vir:32 656 NASAQREVA----LTQGQRYVD-ALNQAHTAEIITGVQNMEQEQDVLQQQMLY 703 (714) T ss_pred HHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHhHhhhhhhhHHHHHHHHH Confidence 111110000 111111110 000000 011111111 1112122 No 134 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=98.43 E-value=6.8e-07 Score=54.40 Aligned_cols=576 Identities=14% Similarity=0.096 Sum_probs=186.7 Q ss_pred HhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHH------------HhhcCCCCEEEEeC Q lcl|NC_021540. 35 NAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALS------------EPFLNDENIFSIAP 102 (705) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~------------~~f~~~~~~~~~~p 102 (705) -|+....|-+- +.|-. + .....+.+.--...++...+|+...+. .-|..|+.| T Consensus 1 ~~~~~~~~~~~------~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw----- 65 (711) T protein:vir:10 1 MAKKQKKSRVE------QLYAK--K--AKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW----- 65 (711) T ss_pred CCccccccccc------chhHH--H--HHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC----- Confidence 11111111110 01100 0 000001011111122222233222111 011122211 Q ss_pred CCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCC----------- Q lcl|NC_021540. 103 KTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEAT----------- 171 (705) Q Consensus 103 ~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~----------- 171 (705) .++..+ .....+.-.+.+|.|+-.+..-.| ..+.+++.+.+.+.+ T Consensus 66 ---~~~~~~----------~l~~~g~p~~~~N~i~~~v~~v~g-----------~~~~nr~~~~v~p~~~~~~~~~~~~~ 121 (711) T protein:vir:10 66 ---PSQVRT----------ERELEQRPCLVNNVLPTFVDQVLG-----------DQRQNRPAIKVSSTEVTRVPDAESGE 121 (711) T ss_pred ---CHHHHH----------HHHhcCCCcEEEcchHHHHHHHhh-----------hHhhCCcceEEecccccchhhhhhhh Confidence 111111 011122224455666665555444 344455566666654 Q ss_pred ----------chhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechh Q lcl|NC_021540. 172 ----------GESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYH 241 (705) Q Consensus 172 ----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~ 241 (705) ..........+..+.......+....+++.+|.+.+.+|.||.++ ++++.+.. T Consensus 122 ~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev-----------------~~d~~~~d 184 (711) T protein:vir:10 122 DTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRV-----------------RSDYLADD 184 (711) T ss_pred ccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEE-----------------EecccCCC Confidence 233333444444444445556678889999999999999998742 11121111 Q ss_pred heeeCCCccC--ChhhCCeEEEEEeccHHHHHHhcCCcCc--chhhhhhhhhhccccccccccccccccccCeEEEEEEE Q lcl|NC_021540. 242 NVTIDPTCNG--NLDEAKFVIYSFESSRSDLEKYGIYSNL--EYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYW 317 (705) Q Consensus 242 ~~~~Dp~a~~--d~~da~~~~~~~~~t~~el~~~g~~~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w 317 (705) .|--++..+. ++.+--|=-..+..+.++..-......+ +.+...+ ...........+ +.-+..| T Consensus 185 ~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~y-----p~~a~~~~~~~~-------~~~~~~~ 252 (711) T protein:vir:10 185 SFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALY-----PDATAEPVYEDS-------VADYDTW 252 (711) T ss_pred CCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhC-----Cchhhhhhhccc-------ccccCcc Confidence 1111111110 1111000001122233332211111111 1111111 000000000000 1112233 Q ss_pred EEeeecCCCe--eEE--------EEEEEECCEEEecccCCCC---------------------------------CC--C Q lcl|NC_021540. 318 GYWDIDGSGV--TTP--------IVASWVDDVMIRLEKNPYP---------------------------------DG--K 352 (705) Q Consensus 318 ~k~~~~~dg~--~~~--------~~~~~~g~~iL~~~~~p~~---------------------------------~~--~ 352 (705) +. .+.+ .+. +++.+.++.......+.-. ++ - T Consensus 253 ~~----~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p 328 (711) T protein:vir:10 253 FT----EKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVE 328 (711) T ss_pred cC----cceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCC Confidence 32 1221 111 1112222222111110000 01 1 Q ss_pred cceEEeeeeeecCcc---cCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCccee----e Q lcl|NC_021540. 353 LPFVVVPYLPVKDSV---YGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYK----Y 425 (705) Q Consensus 353 ~Pfv~~~~~~~~~~~---~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~----~ 425 (705) ||+-.+|+.|.-+.. -+.|....+...=+-.=...|.+.-.+....+ ....+. +-...|.+-. + T Consensus 329 ~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~---~~~~~~------~~~~~gai~~~~~~~ 399 (711) T protein:vir:10 329 IPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVA---LAPKAP------FIGSEGNVEGREDEW 399 (711) T ss_pred CCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHH---hcCCCc------eeecCcccCChHHHH Confidence 222222222222111 12222333222222222222222111111100 000111 1111122110 0 Q ss_pred -cCCccccccccc---------ccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHH- Q lcl|NC_021540. 426 -NPGTNPVTDIIE---------HKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRE- 494 (705) Q Consensus 426 -~~~~~~~~~i~~---------~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~- 494 (705) .....+...+.. ....+.++-....++++......+.. ..|......+..+.+.+...-++.+.. T Consensus 400 ~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~----~tGi~~~~~G~~~n~~Sg~ai~~~q~qg 475 (711) T protein:vir:10 400 EQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKS----TMGMYDASLGAMGNETSGRAIIARQRQG 475 (711) T ss_pred HhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHH----HhCCChHHcCCCccchHHHHHHHHHHHH Confidence 011111111111 22222233344456666666666544 456544333322222333221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeec----------------------- Q lcl|NC_021540. 495 LGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI----------------------- 551 (705) Q Consensus 495 ~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~----------------------- 551 (705) ......|-+.+....+.+..++..+... -.+.+..+.|...+...++ +.++. T Consensus 476 ~~~l~~~~dn~~~~~~~~g~~ll~li~~----~~~~er~~rI~ged~~~~~-v~ln~~~~~~~~G~~~~~nDi~~g~~Dv 550 (711) T protein:vir:10 476 DRGSFAFIDNLTKSIRRVGKILVEMIPH----IYDTERVVRLKFPDETEDF-VKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HcCCCeEEEEecCCCCcce-EEecccccccccccceeeeccceeeeEE Confidence 1112223334444445555555543221 0123334556544322222 11211 Q ss_pred cchhH---HHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 552 SNAET---DAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMR 628 (705) Q Consensus 552 ~~~~~---~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e 628 (705) ..... .+.+.+.+..|+ ++.+.+| .....++..+.++..++...+..........+..+......+. ++.+.+ T Consensus 551 ~i~~~p~~~s~r~~~~~~l~-ql~~~~p-~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~--qq~~~e 626 (711) T protein:vir:10 551 VVTTGPAFATQRIEAAEAMI-QFAQAVP-SAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAI--EEDMPE 626 (711) T ss_pred EEeeccCchhHHHHHHHHHH-HHHhhcc-hhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHH--HHHHHH Confidence 11111 233344444444 4556664 5677777788888889887777766665544333333333322 233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHH---HH Q lcl|NC_021540. 629 IAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELELMQAQAKGNTQRD-----IVKTFLD---TN 700 (705) Q Consensus 629 ~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e~~~~q~~~~~~~~-----~~k~~~~---~~ 700 (705) +++..++++.++++++...++++....+++..+++...+....+.+.......+++ +++.++ +.+.+.+ .+ T Consensus 627 ~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~-~~~~~qq~~~~l~~~qaelq~~q 705 (711) T protein:vir:10 627 QTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQG-GDVVYQQVRELVAQALAEITASQ 705 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 33333334444444444444433333333222222111111222221111111111 111111 1111111 11 Q ss_pred hhccC Q lcl|NC_021540. 701 KQGNQ 705 (705) Q Consensus 701 ~q~~~ 705 (705) .+.+| T Consensus 706 ~~~~q 710 (711) T protein:vir:10 706 ANVTE 710 (711) T ss_pred HHhhc Confidence 12222 No 135 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.31 E-value=1.5e-06 Score=52.58 Aligned_cols=443 Identities=13% Similarity=0.050 Sum_probs=187.2 Q ss_pred cccccCCCCCCHHHHHHHHHHHHH-hhHHh-hHHHHHHHHHHHHhccCCCCC--CCC----------CCCC--CcCCCHH Q lcl|NC_021540. 12 TVPSLQEDWKNKPKVSDLLNDFNN-AKSTK-DTQVAIIDDWLAQLNVTGAYK--PKQ----------QVGR--SSVQPKL 75 (705) Q Consensus 12 ~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~----------~~gr--s~~v~~~ 75 (705) -.| ...+.+ +..+..-+.. -..++ +..+....+-.+||.|.-... +.. .+.+ .+++.+- T Consensus 1 ~~~----~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf 75 (537) T protein:vir:78 1 MTS----PLLNKP-IDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGF 75 (537) T ss_pred CCc----cccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccch Confidence 111 112221 1222222222 12222 223444556778999863211 111 1112 2466666 Q ss_pred HHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchh Q lcl|NC_021540. 76 IRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEE 155 (705) Q Consensus 76 v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~ 155 (705) ....|+.....| ||.+.-+ .+...++ ......++..+. ++-.+.+...++++..+|.+...+||+ T Consensus 76 ~k~Ivd~~~~yl----~G~Pv~~--~~~d~~~----~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~d--- 140 (537) T protein:vir:78 76 FTELVDQLAQYL----LSNGVEV--KVKDEDN----TQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTT--- 140 (537) T ss_pred HHHHHHHHhhhh----cccCcee--ecCcchh----HHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeec--- Confidence 666666665555 6665444 3322222 223445555432 444456778889999999997766652 Q ss_pred hhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceE Q lcl|NC_021540. 156 TKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEV 235 (705) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i 235 (705) ..+.+++ T Consensus 141 -------------------------------------------------------------------------e~~~~~~ 147 (537) T protein:vir:78 141 -------------------------------------------------------------------------SEGKLKF 147 (537) T ss_pred -------------------------------------------------------------------------CCCceEE Confidence 0234677 Q ss_pred EEechhheee--CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEE Q lcl|NC_021540. 236 TICDYHNVTI--DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVV 313 (705) Q Consensus 236 ~~V~~~~~~~--Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 313 (705) ..++|.++|+ |... +...+++.+.....+. . +.....+.. T Consensus 148 ~~i~p~~~~pv~d~~~-----~~~~~~~~y~~~~~~~------~---------------------------~~~~~~~~~ 189 (537) T protein:vir:78 148 QTVDGLTLIPVFDDYG-----VLKMIIRWYSEIRYST------K---------------------------QQSTETIWH 189 (537) T ss_pred EEEccceeEEEEcCCC-----CceeEEEEEeeeeccc------c---------------------------ccCcceEEE Confidence 8889998753 3321 1122222221110000 0 000111223 Q ss_pred EEEEEE-----eeecCCCeeEE-------------EEEEEEC----CEEEe--cccCCCCCCCcceEEeeeeeecCcccC Q lcl|NC_021540. 314 YEYWGY-----WDIDGSGVTTP-------------IVASWVD----DVMIR--LEKNPYPDGKLPFVVVPYLPVKDSVYG 369 (705) Q Consensus 314 ~E~w~k-----~~~~~dg~~~~-------------~~~~~~g----~~iL~--~~~~p~~~~~~Pfv~~~~~~~~~~~~g 369 (705) +|+|.. +...+.|.... ++.++.. +..-. ....|.+.|.+|++.+.. .-+| T Consensus 190 ~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~ 264 (537) T protein:vir:78 190 ADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDG 264 (537) T ss_pred EEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccC Confidence 333321 11111111100 0111100 00000 111222335667666553 3468 Q ss_pred CchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhh--hhcCCcceeecCCcccccccccccCccchHHH Q lcl|NC_021540. 370 EADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNER--KFKMGEDYKYNPGTNPVTDIIEHKYPELPASS 447 (705) Q Consensus 370 ~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~--~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~ 447 (705) .|.+..++++++.+|.+.|.+.+.+...++|.+++....+...... ..+-.+++.+++.. +.+.++..+.-.... T Consensus 265 ~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~---~~v~~l~~~~~~~~~ 341 (537) T protein:vir:78 265 MSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDN---AGMEIQTVSIPYEAR 341 (537) T ss_pred CCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCC---CceeEEEecCCHHHH Confidence 9999999999999999999999999999988777653323222221 22333455554321 123455444444566 Q ss_pred HHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEe Q lcl|NC_021540. 448 YNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIR 527 (705) Q Consensus 448 ~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~ir 527 (705) ...++.+.+.+...|.+.+......+| +|+.| +.-+...........-+.|..+++++++.++.++....... + T Consensus 342 e~~ld~L~~~I~~~s~~~~~~~~~~gn-~SGvA--lk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~--~- 415 (537) T protein:vir:78 342 KAKMDIDVENIYRSGMGFNSTAVGDGN-VTNVV--IKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGE--Y- 415 (537) T ss_pred HHHHHHHHHHHHHhcCCCCCccccccC-CcHHH--HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc--c- Confidence 677899999999888666554433333 34444 44444445455555556666666766666666554321100 0 Q ss_pred EecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHH-----------HHHHhhhchhHHHHHHHHHHHhh--hcc- Q lcl|NC_021540. 528 ITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFM-----------LQTMGQSLPFDMTKLILGEIAKL--RGM- 593 (705) Q Consensus 528 i~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~l-----------lq~~~~~~~~~~~~~il~~l~e~--~~~- 593 (705) ++. ...+..+...+.-.....+.+..+ +... |.........+..+-.+. ... T Consensus 416 ----d~~---------~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~-p~vdd~e~ek~~~ee~~~~~~~~~ 481 (537) T protein:vir:78 416 ----DSN---------DICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVA-PRIGDDETLKLIAEELDLDYNELK 481 (537) T ss_pred ----ccc---------eeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhC-CCCCCHHHHHHHHHHHHhhhhhhh Confidence 000 011111111110000000000000 0000 100000000000000000 000 Q ss_pred --------------chhhhhhhcc----cc------------------cchhhHHHH Q lcl|NC_021540. 594 --------------PDLSKMISKY----NP------------------EPSPQAQLE 614 (705) Q Consensus 594 --------------~~~~~~~~~~----~~------------------q~~~~~q~~ 614 (705) +......... .+ .|... ++. T Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~-~~~ 537 (537) T protein:vir:78 482 DALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAV-PQT 537 (537) T ss_pred hhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccC-CCC Confidence 0000000000 00 00000 000 No 136 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.28 E-value=1.7e-06 Score=52.17 Aligned_cols=579 Identities=13% Similarity=0.015 Sum_probs=173.5 Q ss_pred HHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCc Q lcl|NC_021540. 26 VSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTW 105 (705) Q Consensus 26 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~ 105 (705) .+.-++++..++++....+....+|.+-..-+..++. | ..| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~----G-~QW---------------------------------- 41 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR----V-SQW---------------------------------- 41 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc----C-CCC---------------------------------- Confidence 4444555555555555444443344333332222221 2 122 Q ss_pred chHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHH Q lcl|NC_021540. 106 QDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMY 185 (705) Q Consensus 106 ~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (705) .++..+. + +..|.. ..|.|.-.+..-.| ..+.+++.+.+.+..... .....++..+ T Consensus 42 ~~~~~~~-----l------~~q~rp-~~N~i~~~v~~v~g-----------~e~~nr~d~~v~p~~~~d-~~~Ae~l~~~ 97 (725) T protein:vir:10 42 DDWLSQY-----T------TLQYRG-QFDVVRPVVRKLVS-----------EMRQNPIDVLYRPKDGAS-PDAADVLMGM 97 (725) T ss_pred CHHHHHH-----H------HhcCCC-cccchHHHHHHHHh-----------hHHhCCcceEEecCCcch-HHHHHHHHHH Confidence 2111111 1 112222 23455555544334 334455667777766543 3344444444 Q ss_pred hhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEe-chhheeeCCCccCChhhCCeEEEEEe Q lcl|NC_021540. 186 QMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTIC-DYHNVTIDPTCNGNLDEAKFVIYSFE 264 (705) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V-~~~~~~~Dp~a~~d~~da~~~~~~~~ 264 (705) .......+....+++.+|.+.+.+|+||.++.+.+. .......+.+ ....++.+|. +.-|=-..+. T Consensus 98 ~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~-------~~d~~~~~~~i~~~~i~~~~~------~v~~Dp~a~~ 164 (725) T protein:vir:10 98 YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE-------DQSPTSNNQVIRREPIHSACS------HVIWDSNSKL 164 (725) T ss_pred HHHHHHhcCcchHHhHHHHHHhhcCcceeeeecccc-------CCCCCCCceeeeeeecccCHh------HcccCchhhc Confidence 444455677789999999999999999976432221 0111122221 1111222222 1111112233 Q ss_pred ccHHHHHHhcCCcCcchhh-hhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC------ Q lcl|NC_021540. 265 SSRSDLEKYGIYSNLEYIK-EDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD------ 337 (705) Q Consensus 265 ~t~~el~~~g~~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g------ 337 (705) .+.++..=....+.++... ........... .....+.+. +. .+.-|+. .+.-.+.+.++..... T Consensus 165 ~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a---~~~~~~~~~-~~---~~~~~~~--~~~vrv~E~~~r~~~~~~~~~~ 235 (725) T protein:vir:10 165 MDKSDARHCTVIHSMSQNGWDDFAEKYDLDA---DNIPSFQNP-ND---WVFPWLT--QDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred cChhhhhhhhhhccCCHHHHHHHHHhCCCcc---ccccccccc-cc---ccccccC--CCeEEEEEEEEEEEEeeEEEEe Confidence 3344432111112222110 00000000000 000111110 00 0111211 1111122333222111 Q ss_pred -----CEEEecccCCCCC-------------------------------------CCcceEEeeeeeecCccc-CCch-- Q lcl|NC_021540. 338 -----DVMIRLEKNPYPD-------------------------------------GKLPFVVVPYLPVKDSVY-GEAD-- 372 (705) Q Consensus 338 -----~~iL~~~~~p~~~-------------------------------------~~~Pfv~~~~~~~~~~~~-g~g~-- 372 (705) |.++...+..+.+ ..+|.-.||+.|.-+..+ -.|. T Consensus 236 ~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:10 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred ccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcce Confidence 2222221111000 011221222333222111 0111 Q ss_pred HHHhhHHHHHHHHHHHHH-HHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCcccc----------ccc--cccc Q lcl|NC_021540. 373 AELLSDNQKLIGALTRGM-IDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNPV----------TDI--IEHK 439 (705) Q Consensus 373 ~~~~~d~Q~~iN~~~~~~-~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~~----------~~i--~~~~ 439 (705) ...+...=+-.=...|.. .-.+...+..+.....+..+..+.... .+-++...+- +.+ .+.. T Consensus 316 ~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~-----~~~~~~~~~~~~~~~~~~~~g~~~~~~i~ 390 (725) T protein:vir:10 316 YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH-----MYDGNDDYPYYLLNRTDENNGEMPTQPLA 390 (725) T ss_pred eeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHH-----HHhccCCceeeecccccccCcccccccCc Confidence 112222222222222222 222222323333322222222222211 1112221110 001 1111 Q ss_pred CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 440 YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELG-ILRRLANGLTEVAKKILAMNSV 518 (705) Q Consensus 440 ~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~-~~~n~~~~~~~~~~~~l~li~q 518 (705) ..+.++-....++++......+ ....|.....++....+.++..-++.+.... ..-.|-+.++.-.+.+..++.. T Consensus 391 ~~~~~~~p~~~~~ll~~~~~~i----~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~ 466 (725) T protein:vir:10 391 YYENPEVPQANAYMLEAATAAV----KEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQS 466 (725) T ss_pred ccCCCCchHHHHHHHHHHHHHH----HHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2223333345566666665555 3445655444433332344432222222211 1122223333333444444443 Q ss_pred hcCCceeEeEecCceeeechhhcccceeEEeecc-----c----------hhH----------HHHHHHHHHHHHHHHhh Q lcl|NC_021540. 519 WLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS-----N----------AET----------DAIKAQELSFMLQTMGQ 573 (705) Q Consensus 519 ~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~-----~----------~~~----------~~~~~q~~~~llq~~~~ 573 (705) +... . .+.+..+.|...+-..++ +.++.. + +.. .+.+.+.+..|++.+ + T Consensus 467 lI~~--~--~~~er~~RI~~edg~~~~-v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll-~ 540 (725) T protein:vir:10 467 IVND--I--YDVPRNVTITLEDGSEKE-VQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELL-G 540 (725) T ss_pred HHHH--H--cCCCcEEEEecCCCCcce-eEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHH-H Confidence 3211 0 122334445433322122 122110 0 000 112223333333332 2 Q ss_pred hchhHHHHHHHHHHHh---hhccchhhhhhhcccccchhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 574 SLPFDMTKLILGEIAK---LRGMPDLSKMISKYNPEPSPQA---QLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEA 647 (705) Q Consensus 574 ~~~~~~~~~il~~l~e---~~~~~~~~~~~~~~~~q~~~~~---q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~ 647 (705) .+|+ .......-+.. ++..+...+.......+..+.. +...+..+...+..+++++++++++...++++...+ T Consensus 541 ~~~~-~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~q 619 (725) T protein:vir:10 541 KTPQ-GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQ 619 (725) T ss_pred hccc-cchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Confidence 2222 22221122221 1122222232233322222211 011112222223334444444433332222211111 Q ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHH-----HHHHHHHHHHH---------HHHHH---HH---H-HHHHHHHHHh Q lcl|NC_021540. 648 AKARKANTEADLNTLD-----FVEQETGVK-----QERELELMQAQ---------AKGNT---QR---D-IVKTFLDTNK 701 (705) Q Consensus 648 a~a~~~~~ea~~~~~~-----~~~q~~~~k-----q~~e~e~~~~q---------~~~~~---~~---~-~~k~~~~~~~ 701 (705) +++++.+.+....+.+ ...+..+.+ .++..+.+..+ .+.++ .+ + .++++....+ T Consensus 620 ae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~ 699 (725) T protein:vir:10 620 AELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHK 699 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 1111111121111111 111111111 11111111000 00000 00 0 0111111111 Q ss_pred hccC Q lcl|NC_021540. 702 QGNQ 705 (705) Q Consensus 702 q~~~ 705 (705) |+-+ T Consensus 700 ~~~~ 703 (725) T protein:vir:10 700 QRMD 703 (725) T ss_pred HHhh Confidence 1111 No 137 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.14 E-value=3.8e-06 Score=50.28 Aligned_cols=577 Identities=12% Similarity=0.019 Sum_probs=173.2 Q ss_pred HHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCc Q lcl|NC_021540. 26 VSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTW 105 (705) Q Consensus 26 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~ 105 (705) .+.-++++..++++....+....+|..-+.-+..++. | ..| +..+.... T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~----G-~Qw-~~~~~~~l------------------------- 49 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR----V-SQW-DDWLSQYT------------------------- 49 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhC----C-CCC-CHHHHHHH------------------------- Confidence 5666666777777666666555555554444433432 3 233 22211111 Q ss_pred chHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHH Q lcl|NC_021540. 106 QDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMY 185 (705) Q Consensus 106 ~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (705) +..|.. ..|.|.-.+.+-.| ..+.+++.+.+.+....... ...++..+ T Consensus 50 -------------------~~q~rp-~~N~i~~~i~~v~g-----------~~~~nr~d~~v~P~~~~d~~-~Ae~l~~~ 97 (725) T protein:vir:77 50 -------------------TLQYRG-QFDVVRPVVRKLVS-----------EMRQNPIDVLYRPKDGARPD-AADVLMGM 97 (725) T ss_pred -------------------HhcCCC-ccccHHHHHHHHHh-----------hHHhCCcceEEecCCccHHH-HHHHHHHH Confidence 111222 22444444444333 23345566777776654443 33344444 Q ss_pred hhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEe-chhheeeCCCccCChhhCCeEEEEEe Q lcl|NC_021540. 186 QMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTIC-DYHNVTIDPTCNGNLDEAKFVIYSFE 264 (705) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V-~~~~~~~Dp~a~~d~~da~~~~~~~~ 264 (705) .......+....+++.+|.+.+.+|+||.++.+.+. .......+.. ....++. ++...-|--+.+. T Consensus 98 ~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~-------~~d~~~~~~~i~~~~~~~------~~~~v~~Dp~a~~ 164 (725) T protein:vir:77 98 YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYE-------DQSPTSNNQVIRREPIHS------ACSHVIWDSNSKL 164 (725) T ss_pred HHHHHHhhCchhHHHHHHHHHhhcCcceeeeeeccc-------CCCCCCCceeeEEeeccc------ChhhceeCchhhc Confidence 444445677889999999999999999865321110 0001111100 0111111 1111111122333 Q ss_pred ccHHHHHHhcCCcCcchhhh-hhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCC--eeEEEEEEEE----- Q lcl|NC_021540. 265 SSRSDLEKYGIYSNLEYIKE-DSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSG--VTTPIVASWV----- 336 (705) Q Consensus 265 ~t~~el~~~g~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg--~~~~~~~~~~----- 336 (705) .+.++..-..+.+.++.... ...........++ ..+.+.. . .+.-|+ ..+. +.++++.... T Consensus 165 ~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~---~~~~~~~-~---~~~~~~----~~d~vrv~E~~~r~~~~~~~~ 233 (725) T protein:vir:77 165 MDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDI---PSFQNPN-D---WVFPWL----TQDTIQIAEFYEVVEKKETAF 233 (725) T ss_pred cChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhc---ccccccc-c---cccccc----CCCeeEEEEEEEEEEEeeEEE Confidence 34444332222222221100 0000000001111 1111100 0 011122 1222 2233222111 Q ss_pred ---C---CEEEecccCCC--------CCC-----------------------------CcceEEeeeeeecCccc---CC Q lcl|NC_021540. 337 ---D---DVMIRLEKNPY--------PDG-----------------------------KLPFVVVPYLPVKDSVY---GE 370 (705) Q Consensus 337 ---g---~~iL~~~~~p~--------~~~-----------------------------~~Pfv~~~~~~~~~~~~---g~ 370 (705) + |.++....+-+ ..| .+|.-.||+.|.-+... |. T Consensus 234 ~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~ 313 (725) T protein:vir:77 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) T ss_pred EecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCc Confidence 1 11111111100 000 01111122222221111 11 Q ss_pred chHHHhhHHHHHHHHHHHHHH-HHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCccc---------ccc-c--cc Q lcl|NC_021540. 371 ADAELLSDNQKLIGALTRGMI-DAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNP---------VTD-I--IE 437 (705) Q Consensus 371 g~~~~~~d~Q~~iN~~~~~~~-d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~---------~~~-i--~~ 437 (705) .....+...=+-.=...|... -.+...+..+.....+..+..+.....- -++...+ .++ + .+ T Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:77 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMY-----DGNDDYPYYLLNRTDENSGDLPTQP 388 (725) T ss_pred ccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHH-----HhccCCceecccccccCCCcccccC Confidence 112233333333333333322 2222333333222222222222222211 1111110 000 0 11 Q ss_pred ccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 438 HKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRE-LGILRRLANGLTEVAKKILAMN 516 (705) Q Consensus 438 ~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~-~~~~~n~~~~~~~~~~~~l~li 516 (705) ....+.++-....++++......+ ....|.....++....+.++..-++.+.. ....-.|-+.++.-.+.+..++ T Consensus 389 i~~~~~~~lp~~~~~ll~~~~~~i----~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~l 464 (725) T protein:vir:77 389 LAYYENPEVPQANAYMLEAATSAV----KEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIY 464 (725) T ss_pred ccccCCCCchHHHHHHHHHHHHHH----HHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112233333334556666655555 34456654444333323444332222221 1112223334444445555555 Q ss_pred HHhcCCceeEeEecCceeeechhhcccceeEEeec-----cchh----------H---------H-HHHHHHHHHHHHHH Q lcl|NC_021540. 517 SVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSI-----SNAE----------T---------D-AIKAQELSFMLQTM 571 (705) Q Consensus 517 ~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~-----~~~~----------~---------~-~~~~q~~~~llq~~ 571 (705) ..+... . .+.+..+.|...+-..+ .+.++. .++. . . +.+.+.+..|++. T Consensus 465 L~lI~~--~--~~~~rv~RI~~ed~~~~-~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql- 538 (725) T protein:vir:77 465 QSIVND--I--YDVPRNVTITLEDGSEK-DVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL- 538 (725) T ss_pred HHHHHH--H--cCCCcEEEEecCCCCcc-eeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHH- Confidence 443221 0 12233445544332211 122221 1110 0 0 1122222222222 Q ss_pred hhhchhHHHHHHHHHHHhhhcc---chhhhhhhcccccchhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HH Q lcl|NC_021540. 572 GQSLPFDMTKLILGEIAKLRGM---PDLSKMISKYNPEPSPQAQ---LEIQIKQLEAQELQMRIAKLQAEIQLMP---YE 642 (705) Q Consensus 572 ~~~~~~~~~~~il~~l~e~~~~---~~~~~~~~~~~~q~~~~~q---~~~q~~q~~~q~~q~e~~k~qa~~q~~~---~~ 642 (705) .+.+|+ ....+..-+...... +...+.......+..+... .....++...+..+.+++++++++...+ ++ T Consensus 539 l~~~~~-~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~ 617 (725) T protein:vir:77 539 LGKTPQ-GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) T ss_pred HHhccc-cchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHH Confidence 222221 122111111111111 1111212222111111110 0011111222223333333333332222 22 Q ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-----HHHHHHHHHHH---------HHHHHHH-HHHHHHHHHH---hh Q lcl|NC_021540. 643 AQAEAAKARKANTEADLNT--LDFVEQETGVK-----QERELELMQAQ---------AKGNTQR-DIVKTFLDTN---KQ 702 (705) Q Consensus 643 ~q~e~a~a~~~~~ea~~~~--~~~~~q~~~~k-----q~~e~e~~~~q---------~~~~~~~-~~~k~~~~~~---~q 702 (705) ++++.++++.....+..+. .+...+..+.+ .++..+.++.. .+.+++. ....+++... ++ T Consensus 618 ~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~ 697 (725) T protein:vir:77 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQT 697 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHH Confidence 2222222211111111111 01111111111 01110110000 0000000 0000010110 00 Q ss_pred ccC Q lcl|NC_021540. 703 GNQ 705 (705) Q Consensus 703 ~~~ 705 (705) +++ T Consensus 698 ~~q 700 (725) T protein:vir:77 698 HKQ 700 (725) T ss_pred Hhh Confidence 011 No 138 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.01 E-value=7.2e-06 Score=48.77 Aligned_cols=571 Identities=12% Similarity=-0.008 Sum_probs=170.7 Q ss_pred HHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCc Q lcl|NC_021540. 26 VSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTW 105 (705) Q Consensus 26 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~ 105 (705) .+.-++++..++++....+....+|..-..-+..++. | ..| T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~----G-~Qw---------------------------------- 41 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR----I-SQW---------------------------------- 41 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc----C-CCC---------------------------------- Confidence 4444555555555555544444444333333332322 2 122 Q ss_pred chHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHH Q lcl|NC_021540. 106 QDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMY 185 (705) Q Consensus 106 ~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (705) .++..+. + +..|.. ..|.|.-.+.+-.| ..+.+++.+.+.+...... ....++..+ T Consensus 42 ~~~~~~~-----l------~~q~rp-~~N~i~~~i~~v~g-----------~e~~nr~d~~v~P~~~~d~-~~Ae~l~~~ 97 (725) T protein:vir:92 42 DDWLSQY-----T------TLQYRG-QFDVVRPVVRKLVS-----------EMRQNPIDVLYRPKDGASP-DAADVLMGM 97 (725) T ss_pred CHHHHHH-----H------HhcCCC-cccchHHHHHHHHh-----------hHHhCCcceEEecCCccHH-HHHHHHHHH Confidence 1111111 0 112222 23445444444333 3344556677777665443 333344444 Q ss_pred hhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceEEEechhh--ee--eCCCccCChhhCCeEEE Q lcl|NC_021540. 186 QMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHN--VT--IDPTCNGNLDEAKFVIY 261 (705) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~--~~--~Dp~a~~d~~da~~~~~ 261 (705) .......+....+++.+|.+.+.+|+||.++.+. .... +|++ +. ..|- ..++...-|--+ T Consensus 98 ~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d---------~~~~------d~~~~~~~i~~~~i-~~~~~~V~~Dp~ 161 (725) T protein:vir:92 98 YRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTD---------YEDQ------SPTSNNQVIRREPI-HSACSHVIWDSN 161 (725) T ss_pred HHHHHHhhCchHHHHHHHHHHhhcCcceeeeeec---------ccCC------CCCCCceeeEEeec-cCChhhcccCch Confidence 4444456778899999999999999998653111 1011 1211 11 1110 001111112222 Q ss_pred EEeccHHHHHHhcCCcCcch-----hhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCC--eeEEEEEE Q lcl|NC_021540. 262 SFESSRSDLEKYGIYSNLEY-----IKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSG--VTTPIVAS 334 (705) Q Consensus 262 ~~~~t~~el~~~g~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg--~~~~~~~~ 334 (705) .+..+.++..-..+.+.++. +.+.++. ...++....+..+. ..-|+. .+. +.++++.. T Consensus 162 a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~----~~~~~~~~~~~~~~-------~~~~~~----~d~vrv~e~~~r~ 226 (725) T protein:vir:92 162 SKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDL----DADDIPSFQNPNDW-------VFPWLT----QDTIQIAEFYEVV 226 (725) T ss_pred hhccChhhHHHHHHHhcCCHHHHHHHHhhcCc----chhhhhhcccCCcc-------cccccC----CCeEEEEEEEEEE Confidence 33344444332222222221 1111110 01111111111100 111221 222 22333222 Q ss_pred EE-----------CCEEEecccCCCC-------------------------------------CCCcceEEeeeeeecCc Q lcl|NC_021540. 335 WV-----------DDVMIRLEKNPYP-------------------------------------DGKLPFVVVPYLPVKDS 366 (705) Q Consensus 335 ~~-----------g~~iL~~~~~p~~-------------------------------------~~~~Pfv~~~~~~~~~~ 366 (705) .. +|.++....+-+. ...+|.-.||+.|.-+. T Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~ 306 (725) T protein:vir:92 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGE 306 (725) T ss_pred EEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEee Confidence 11 1222221111000 00111112222232221 Q ss_pred ccC-Cch--HHHhhHHHHHHHHHHHHH-HHHHHhcCCCcEEeeccccCchhhhhhcCCcceeecCCccc---------cc Q lcl|NC_021540. 367 VYG-EAD--AELLSDNQKLIGALTRGM-IDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYNPGTNP---------VT 433 (705) Q Consensus 367 ~~g-~g~--~~~~~d~Q~~iN~~~~~~-~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~~~~~~---------~~ 433 (705) ..+ .|. ...+...=+-.=...|.. .-.+...+...-....+..+..+.... ..-++...+ .+ T Consensus 307 r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~~~~~~~~~~~ 381 (725) T protein:vir:92 307 WGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH-----MYDGNDDYPYYLLNRTDENN 381 (725) T ss_pred eeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHH-----HHhccCccceeecccccccc Confidence 110 111 112222222222222222 122222222222211221111111111 111111110 00 Q ss_pred -cc--ccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_021540. 434 -DI--IEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELG-ILRRLANGLTEVA 509 (705) Q Consensus 434 -~i--~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~-~~~n~~~~~~~~~ 509 (705) .+ .+....+.++-....++++......+. ...|.....++....+.++..-+..+.... ..-.|-+.++.-. T Consensus 382 g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~----~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~ 457 (725) T protein:vir:92 382 GEMPTQPLAYYENPEVPQANAYMLEAATAAVK----EVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAM 457 (725) T ss_pred ccccccCCcccCCCCchHHHHHHHHHHHHHHH----HHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 112222333444456666666666553 445655444433333344433332222211 1222223344444 Q ss_pred HHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeecc---------------chhH---------HHHHHHHHH Q lcl|NC_021540. 510 KKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSIS---------------NAET---------DAIKAQELS 565 (705) Q Consensus 510 ~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~---------------~~~~---------~~~~~q~~~ 565 (705) +.+..++..+... . .+.+..+.|...+-. ...+.++.. .+.. ...+.++.. T Consensus 458 ~~~g~~lL~lI~~--~--~~~~r~~RI~~edg~-~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~ 532 (725) T protein:vir:92 458 RRDGEIYQSIVND--I--YDVPRNVTITLEDGS-EKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNR 532 (725) T ss_pred HHHHHHHHHHHHH--h--cCCCcEEEEecCCCC-cceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHH Confidence 4444444433221 0 122233444333222 111222211 0000 112222222 Q ss_pred HHHHHHhhhchhHHHHHHHHHHHhh---hccchhhhhhhcccccchhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 566 FMLQTMGQSLPFDMTKLILGEIAKL---RGMPDLSKMISKYNPEPSPQAQ---LEIQIKQLEAQELQMRIAKLQAEIQLM 639 (705) Q Consensus 566 ~llq~~~~~~~~~~~~~il~~l~e~---~~~~~~~~~~~~~~~q~~~~~q---~~~q~~q~~~q~~q~e~~k~qa~~q~~ 639 (705) ..+.++.+.+|+ ....+..-+... +..+...+.......+..+..+ ...+..++..+..+++++++++++... T Consensus 533 ~~l~ql~~~~~~-~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~ 611 (725) T protein:vir:92 533 AEILELLGKTPQ-GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQA 611 (725) T ss_pred HHHHHHHHhccc-chhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHH Confidence 222222232322 222222222211 1112222222222222112110 011122222333344444444433222 Q ss_pred HH---HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHH-HHHHHHHHHHH-----------H Q lcl|NC_021540. 640 PY---EAQAEAAKA--RKANTEADLNTLDFVEQETGV-----KQERELELMQAQAK-GNTQRDIVKTF-----------L 697 (705) Q Consensus 640 ~~---~~q~e~a~a--~~~~~ea~~~~~~~~~q~~~~-----kq~~e~e~~~~q~~-~~~~~~~~k~~-----------~ 697 (705) ++ +++++.+++ +....+++....+...+..+. ..+..++++..+++ ++...+..+.. + T Consensus 612 qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l 691 (725) T protein:vir:92 612 QGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLL 691 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHH Confidence 22 222222111 111111111111111111111 11111111100000 00000000000 1 Q ss_pred HHHhhccC Q lcl|NC_021540. 698 DTNKQGNQ 705 (705) Q Consensus 698 ~~~~q~~~ 705 (705) +...++.+ T Consensus 692 ~~~~~~~~ 699 (725) T protein:vir:92 692 KGNEQTHK 699 (725) T ss_pred HHHHHHHH Confidence 11111100 No 139 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=97.97 E-value=8.6e-06 Score=48.34 Aligned_cols=541 Identities=11% Similarity=0.068 Sum_probs=171.3 Q ss_pred eCCCcchHHHHHHHH---------HHHHHHHHhhcCCcchHHHHHHHHH--hcCCeEEEEeecchh-------------- Q lcl|NC_021540. 101 APKTWQDREAARQNE---------AILNYQFNNQLDKVKLIDTMVRTAV--NEGTVIFRTSWCLEE-------------- 155 (705) Q Consensus 101 ~p~~~~D~~~A~~~t---------~~~n~~~~~~~~~~~~~~~~~~~al--~~g~gi~k~~W~~~~-------------- 155 (705) .-.+++|.+.|.-.- ..+.+ |.....+..-.+.-.+..+ -.|+ -|+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~q~~~r~~a~~d~~fy~G~-----QW~~~~~~~l~~~g~p~~~~ 74 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYAD-INYEIEDQPAWRAVADKEMDYADGN-----QLDTELLRRQQALGIPPAVE 74 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHH-HHHHHhccHHHHHHHHHHHHhhcCC-----CCCHHHHHHHHhcCCCcEEE Confidence 222333333222000 01111 1111222111111111111 1121 233222 Q ss_pred -----------hhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccc Q lcl|NC_021540. 156 -----------TKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQE 224 (705) Q Consensus 156 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 224 (705) ...+.+++.+++.+..........+.+..+.......+....+++.+|.+.+.+|+||.++ T Consensus 75 N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~-------- 146 (772) T protein:vir:10 75 DLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEV-------- 146 (772) T ss_pred cchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEe-------- Confidence 2344456677888865444455555555555556667788899999999999999998531 Q ss_pred eeeeccCcceEEEechh--heeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCc--chhhhhhhhh---h---cccc Q lcl|NC_021540. 225 VIKTVKNQPEVTICDYH--NVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNL--EYIKEDSSTS---T---SSDH 294 (705) Q Consensus 225 ~~~~~~~~~~i~~V~~~--~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~--~~~~~~~~~~---~---~~~~ 294 (705) .... +++ ++.+.+- ++.+--|=- ....+.++..-..+.+.+ +.+...+.+. . .... T Consensus 147 ---~~~~-------d~~~~~i~i~~v---~p~~v~~Dp-~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~ 212 (772) T protein:vir:10 147 ---SRES-------DPFKFPYRCRPI---RRDEIHWDM-KCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYG 212 (772) T ss_pred ---cccc-------CCCCCCeEEEee---CcccceecC-CCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhc Confidence 0000 111 1111110 111100000 001244443322222222 1222111110 0 0011 Q ss_pred ccccc--ccccc-ccccC----eEEEEEEEEEee---ecC--CC--eeE-EEEE-----EE---ECCEEEec-------- Q lcl|NC_021540. 295 YSSDT--SFTFS-DKARK----KIVVYEYWGYWD---IDG--SG--VTT-PIVA-----SW---VDDVMIRL-------- 343 (705) Q Consensus 295 ~~~~~--~~~~~-~~~~~----~v~v~E~w~k~~---~~~--dg--~~~-~~~~-----~~---~g~~iL~~-------- 343 (705) ..+-+ ..+.. +.... .......|.... +.. +. +.+ |++. ++ .|+.+... T Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~ 292 (772) T protein:vir:10 213 STWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNI 292 (772) T ss_pred ccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHH Confidence 11100 00000 00000 001111221110 011 11 111 1111 11 12222211 Q ss_pred -----------------------ccCCCCCCC--cceEEeeeeeecCccc-CCchHHHhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021540. 344 -----------------------EKNPYPDGK--LPFVVVPYLPVKDSVY-GEADAELLSDNQKLIGALTRGMIDAMARS 397 (705) Q Consensus 344 -----------------------~~~p~~~~~--~Pfv~~~~~~~~~~~~-g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~ 397 (705) +...+..+. ||+-.|++.|.-+... -.|....++..=+-.=...|...-.+ T Consensus 293 ~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~--- 369 (772) T protein:vir:10 293 ALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKL--- 369 (772) T ss_pred HHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHH--- Confidence 111122222 2333344444322221 12333322222222111122111110 Q ss_pred CCCcEEeeccccCchhhhhhcCCcceee-----cCCcccccccccc-----------cCccchHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 398 ANGQRGMSKNLLDPVNERKFKMGEDYKY-----NPGTNPVTDIIEH-----------KYPELPASSYNMLQMFTLEADAL 461 (705) Q Consensus 398 ~~~~~~~~~~av~~~d~~~~~pg~~i~~-----~~~~~~~~~i~~~-----------~~~~i~~~~~~~l~~~~~~~~~~ 461 (705) .+++...++- ...|.+--. ...+.+...+.+. ...+.+.-....++++......+ T Consensus 370 ---~~~l~~~~~~------~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i 440 (772) T protein:vir:10 370 ---RWGMSVARVE------RTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATI 440 (772) T ss_pred ---HHHHhccccc------ccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHH Confidence 1122211111 112222111 1111221122221 11122333445667777766666 Q ss_pred hCcchHhcCCCccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeech-h Q lcl|NC_021540. 462 SGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRE-LGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINR-D 539 (705) Q Consensus 462 tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~-~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~-~ 539 (705) .-+ .|.....++....+.+.+.-++.+.. ....-.|-+.++...+.+..++..+... -.+.+..+.|.. + T Consensus 441 ~~v----sGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~----~y~~er~~RI~~~d 512 (772) T protein:vir:10 441 ERV----SNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVE----DIGQERTEVVIEGD 512 (772) T ss_pred HHH----hCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HcCCCcEEEEecCC Confidence 443 46554444332222333221111111 1111111122333333344444333211 112233444543 3 Q ss_pred hcccceeEEe----------------eccchhHH----------HHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhcc Q lcl|NC_021540. 540 NLVGSFDIKL----------------SISNAETD----------AIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGM 593 (705) Q Consensus 540 ~~~~~~dv~v----------------~~~~~~~~----------~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~ 593 (705) ....+--+.+ ++..+..+ +.+.+.+..|++.+++ +|+.....++.-+.++..+ T Consensus 513 ~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~-~~P~~~~~~~~~~le~~D~ 591 (772) T protein:vir:10 513 AVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKS-MPPQYQAAVLPFLVSLMDV 591 (772) T ss_pred CCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhc-cChhHHHHHHHHHHhhcCC Confidence 2211111111 11112222 4566666777776654 6777777777777888887 Q ss_pred chhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 594 PDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQ 673 (705) Q Consensus 594 ~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq 673 (705) +...+..+...... .+...+.++ +...+..|..++..+++++..+..++..+.+++.+.+..+..+... T Consensus 592 p~~~ei~~~ir~~~--------~~~~peq~~-~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~-- 660 (772) T protein:vir:10 592 PFKRDVVEAIRAVD--------QQQTPEQIQ-QQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGV-- 660 (772) T ss_pred CChHHHHHHHHHHh--------ccCChHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 76555544432110 000001111 1111111222222223332222222222222222221111111110 Q ss_pred HHHHHHHHHHHH---HH--HHHHHHHHHHHHHhhccC Q lcl|NC_021540. 674 ERELELMQAQAK---GN--TQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 674 ~~e~e~~~~q~~---~~--~~~~~~k~~~~~~~q~~~ 705 (705) +.++.+.++. .+ +......+++ +.+..+ T Consensus 661 --~a~~~a~~aa~~~~q~~q~a~~ad~~l--~~~g~~ 693 (772) T protein:vir:10 661 --QAAFSAMQAGAQIAQMPMIAPIADAVM--QSAGYQ 693 (772) T ss_pred --HHHHHHhhhhhhHHhhhhhhHHHHHHH--Hhcccc Confidence 1111111111 11 1111111111 122222 No 140 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=92.93 E-value=0.0094 Score=31.69 Aligned_cols=575 Identities=13% Similarity=0.096 Sum_probs=154.8 Q ss_pred Ccchhhhhhccccc-ccCC-CCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCCCCcCCCHHHHH Q lcl|NC_021540. 1 MSDINEEFLEDTVP-SLQE-DWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVGRSSVQPKLIRK 78 (705) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~~v~~~v~~ 78 (705) |+.+.+-+...+.. .-.+ .|-+.+...+. ..|+|+--. .+.+| .++++ T Consensus 87 ~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~---------------------t~~~n~~~~---~~~~~-~~~~~----- 136 (763) T protein:vir:95 87 YSALTEPFLGSNKLFKVTPVTWEDVQGARQN---------------------ELVLNYQFR---TKLNR-VSFID----- 136 (763) T ss_pred HHHHHHhhcCCCcEEEEecCCcchHHHHHHH---------------------HHHHHHHHh---hcCch-hhHHH----- Confidence 33333333322222 1111 12222211111 122221100 01111 11111 Q ss_pred HHHHHHHHHHHhhcCCCCEEE------------------EeCCCcchHHHHHHHHHHHH---HHHHhhcCCcchHHHHHH Q lcl|NC_021540. 79 QAEWRYSALSEPFLNDENIFS------------------IAPKTWQDREAARQNEAILN---YQFNNQLDKVKLIDTMVR 137 (705) Q Consensus 79 ~~e~~~~~l~~~f~~~~~~~~------------------~~p~~~~D~~~A~~~t~~~n---~~~~~~~~~~~~~~~~~~ 137 (705) +|+-..|+. +-=+++ -.|.-.++...+ .....-. .....+-+--..+.-... T Consensus 137 --~~~~~~l~~----~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (763) T protein:vir:95 137 --NYVRSVVDD----GTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADA-LQQALQLRTDNPRGYEENVDEAIKESVR 209 (763) T ss_pred --HHHHHHhhc----CcceEEEeeeeeeeeeeeeehhhhhccccchhHHHH-HHHHHHhhhhhhccccccccchhhhhhh Confidence 111111111 111111 001111110000 0000000 000000000001111222 Q ss_pred HHHhcCCeEEEE-----eecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHH---HH---HHHHhh Q lcl|NC_021540. 138 TAVNEGTVIFRT-----SWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEA---LA---ESVRYS 206 (705) Q Consensus 138 ~al~~g~gi~k~-----~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~---~~~~~~ 206 (705) .....|.++..+ .|+.+ +...+.|.++.++... . .-+|....++.++ +. ....+. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~--~~~k~~p~ie~V~p~d--~----------~iDp~a~sD~~Da~~~~~~~~~t~~dL 275 (763) T protein:vir:95 210 FFDETGQATYAVQTGTTTTEVE--VPLANHPTVEMLNPEN--I----------IIDPSCQGDINKAMFAIVSFETCKADL 275 (763) T ss_pred hccccCcceeeecccceeEEEE--EEecCceEEEeecHHH--h----------eecCCCCCchhhCceEeeEEeccHHHH Confidence 233445444433 12211 1122233333221111 0 0111110000000 00 000000 Q ss_pred hhcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhh Q lcl|NC_021540. 207 VANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDS 286 (705) Q Consensus 207 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~ 286 (705) ..-|.++..+. .+.+. .....+...-....+.++-+ .|.+..+.+++..|. T Consensus 276 ~~~~~~y~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~-----~d~~~~~V~v~E~y~--------------------- 326 (763) T protein:vir:95 276 LKEKDRYHNLN-KIDWQ--SSAPVNEPDHATTTPQEFQI-----SDPMRKRVVAYEYWG--------------------- 326 (763) T ss_pred HhccCCccccc-hhcch--hccccccccccccchhhccC-----CCcccceEEEEEeee--------------------- Confidence 00000000000 00000 00000000000000000000 000000111111110 Q ss_pred hhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeecCc Q lcl|NC_021540. 287 STSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVKDS 366 (705) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~~~ 366 (705) ..+.. ++. +.|+ +++-.-|+ .+...+......+..||-+ +|+++.+.. ..+. T Consensus 327 ---------~~d~~---gdg------~~~~-~~v~~~g~------~iL~~~~~p~~~~~~PFv~--~~~~p~~~~-~~G~ 378 (763) T protein:vir:95 327 ---------FWDIE---GNG------VLEP-IVATWIGS------TLIRLEKNPYPDGKLPFVL--IPYMPVKRD-MYGE 378 (763) T ss_pred ---------eeccC---Ccc------eeEE-EEEEEEcC------eeeecccccccCCCcCEEE--ecceeecCc-ccCC Confidence 00000 000 1111 11111111 1222222223334456643 455554333 2333 Q ss_pred ccCCchHHHhhHHHHHHHHHHHHHHHH---HHhcCCCcEEeeccccCc--hhhhhhcCCcceeecCCcccccccccccCc Q lcl|NC_021540. 367 VYGEADAELLSDNQKLIGALTRGMIDA---MARSANGQRGMSKNLLDP--VNERKFKMGEDYKYNPGTNPVTDIIEHKYP 441 (705) Q Consensus 367 ~~g~g~~~~~~d~Q~~iN~~~~~~~d~---~~~~~~~~~~~~~~av~~--~d~~~~~pg~~i~~~~~~~~~~~i~~~~~~ 441 (705) .++.-+.+.-.......|.....+.-. ......+.+. ..+.+.. .......||+........ ..+.+.+ T Consensus 379 gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~-~~d~~~~~pg~v~~v~~g~~~~~~~~~-----~~~p~~~ 452 (763) T protein:vir:95 379 PDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLD-ALNSRRYREGEDYEYNPTQNPAQMIIE-----HKFPELP 452 (763) T ss_pred chHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeeccccc-chhhhcccCCceEEeeCCCChhhhccc-----ccCCCCc Confidence 444555555555555555555433221 2222333332 2222211 122334555544322211 1122334 Q ss_pred cchHHHHHHHHHHHHHHHH----HhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 442 ELPASSYNMLQMFTLEADA----LSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNS 517 (705) Q Consensus 442 ~i~~~~~~~l~~~~~~~~~----~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~ 517 (705) .-+..++++++...+.+.- ..|++....|..++ +....+.+...+.+..+..+.+.+....+.++..+..+.- T Consensus 453 ~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat---~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d 529 (763) T protein:vir:95 453 QSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAA---GIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLA 529 (763) T ss_pred chHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC Confidence 5556666766665565543 34666655553322 2222233333444555667767766666666666655432 Q ss_pred Hh----cCCceeEeEe-----cCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHH-HHhhhchhHHHHHHHHHH Q lcl|NC_021540. 518 VW----LSDEEVIRIT-----DEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQ-TMGQSLPFDMTKLILGEI 587 (705) Q Consensus 518 q~----~~~~~~iri~-----~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq-~~~~~~~~~~~~~il~~l 587 (705) .- ...+..+.+. ++.-|.|+ +.. .+......+.+..+..++. .+.+.+.......+ .++ T Consensus 530 ~~rviRI~g~e~v~v~~~~~~~~~DV~V~---------~~~-as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~-~d~ 598 (763) T protein:vir:95 530 EHEVVRITNEEFVTIKREDLKGNFDLEVD---------IST-AEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEI-ADL 598 (763) T ss_pred CCcEEEEeCCccccccHHHhcCCcceEEe---------ccc-chHHHHHHHHHHHHHHHhccccChHHHHHHHHHH-Hhh Confidence 21 0111222222 22112221 111 1222234444455554442 22222222222211 111 Q ss_pred Hhhhccch-hhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 588 AKLRGMPD-LSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVE 666 (705) Q Consensus 588 ~e~~~~~~-~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~ 666 (705) .....+.. +......+.+...++++.++.+.+++++..+++.+..+++++..+++++...+++.....+.+ ++.+... T Consensus 599 ~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q-~~~e~~~ 677 (763) T protein:vir:95 599 KRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTK-HARDLEK 677 (763) T ss_pred hchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 11111111 111111222333334444444444444444433333333333322222221111111111111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 667 QETGVKQERELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 667 q~~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) ..+....+++++...++.+...+.+..+++... -+.+. T Consensus 678 ~~~~~eaq~~l~~~~a~~~~~~ea~~~~~~~~~-~~~~~ 715 (763) T protein:vir:95 678 MKAQSQGNQQLEITKALTKPRKEGELPPNLSAA-IGYNA 715 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccChhHHHh-hhhcc Confidence 111111111222222222222222221212111 11111 No 141 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=84.22 E-value=0.062 Score=27.20 Aligned_cols=416 Identities=10% Similarity=0.059 Sum_probs=170.8 Q ss_pred hcccccccCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCC----------CCcCCCHHHHH Q lcl|NC_021540. 9 LEDTVPSLQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVG----------RSSVQPKLIRK 78 (705) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----------rs~~v~~~v~~ 78 (705) |..+.| -|.........+--++-.... +++...+..+-++.+| ...+..+-++. T Consensus 1 m~V~~~--------hp~y~a~~~~W~~~rd~~~G~--------~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~ 64 (452) T protein:vir:94 1 MPIETK--------HPEYLAYENDWIDCRVASLGQ--------REVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSK 64 (452) T ss_pred CCCCCc--------CHHHHHHHHHHHHHHHHhcCh--------HHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHH Confidence 222221 123333333322222211111 0111111101112222 22467788888 Q ss_pred HHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcch---HHHHHHHHHhcCCeEEEEeecchh Q lcl|NC_021540. 79 QAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVKL---IDTMVRTAVNEGTVIFRTSWCLEE 155 (705) Q Consensus 79 ~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~~---~~~~~~~al~~g~gi~k~~W~~~~ 155 (705) +++.+...+ |.-++.+++ | + .. . ++|. ...|..+ +..+++.+|..|.+-+-|-|.. T Consensus 65 t~~~~~G~v----f~k~p~~~~-p---~--~l----~-~~~~----D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~-- 123 (452) T protein:vir:94 65 TLSALSGMV----LDQPPVITH-P---D--AM----S-KYFE----DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL-- 123 (452) T ss_pred HHHHHhchh----hcCCceecc-c---H--HH----H-HHHh----cccCCCHHHHHHHHHHHHHhcCeEEEEEeecc-- Confidence 888887766 445555543 1 1 11 1 1121 3445543 6788888889998877664410 Q ss_pred hhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccCcccccceeeeccCcceE Q lcl|NC_021540. 156 TKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIINGYEEQEVIKTVKNQPEV 235 (705) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i 235 (705) ...+|.| T Consensus 124 -------------------------------------------------------------------------~g~rPy~ 130 (452) T protein:vir:94 124 -------------------------------------------------------------------------TGGDPYI 130 (452) T ss_pred -------------------------------------------------------------------------CCCceEE Confidence 0125888 Q ss_pred EEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEE Q lcl|NC_021540. 236 TICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYE 315 (705) Q Consensus 236 ~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E 315 (705) ..++|.+++ ++.-.. +.+....+.+-+ ... .+....|+......+++++ T Consensus 131 ~~~~~~~Ii-~W~~~~---~g~l~~v~lre~-------------------------~~~--~d~~d~f~~~~~~~yRvL~ 179 (452) T protein:vir:94 131 SVYTTENIL-NWEEDE---DGRLLMVVLREF-------------------------YTV--RDTADRYVQNIRVRYRCLE 179 (452) T ss_pred EEechhhhc-Cccccc---cCCeeEEEEEEE-------------------------EEE--ecCCCcccceeEEEEEEEE Confidence 889999987 444211 122211111100 000 0011111222222233322 Q ss_pred EEEEeeecCCCeeEEEEEEEECC--------EEEecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHH Q lcl|NC_021540. 316 YWGYWDIDGSGVTTPIVASWVDD--------VMIRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALT 387 (705) Q Consensus 316 ~w~k~~~~~dg~~~~~~~~~~g~--------~iL~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~ 387 (705) . ++|..+.++--..++ .......+|+ +.+||+++...+.. ...+.+..-.+..++..+.... T Consensus 180 l-------~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l--~~IP~v~~~~~~~~-~~~~~pPLl~LA~ln~~hy~~~ 249 (452) T protein:vir:94 180 L-------VDGLLQITVHETQDGKVWELAKTSTIQNVGVTM--DYIPFFCITPSGLS-MTPAKPPMIDIVDINYSHYRTS 249 (452) T ss_pred E-------eCCeEEEEEEEccCCceeeeccceeecCCCccc--ceeEEEEEcCCCCC-CCCCccchHHHHHHHHHHhcch Confidence 1 112111111000111 1222223332 56777766544433 3357788889999999999989 Q ss_pred HHHHHHHHhcCCCcEEeeccccCchhhhhhcCCcceeec-CCcccccccccccCccc-hHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_021540. 388 RGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYKYN-PGTNPVTDIIEHKYPEL-PASSYNMLQMFTLEADALSGVK 465 (705) Q Consensus 388 ~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~~~-~~~~~~~~i~~~~~~~i-~~~~~~~l~~~~~~~~~~tGv~ 465 (705) +-.-+++..++.|...+... +..+....-++.+|.+. +++. +.+..+..- ......-|+.+.+.+..+ |.. T Consensus 250 sd~~~~l~~~~~P~l~~~g~--~~~~~i~iG~~~~~~lpe~~~~----~~yie~~g~~i~~~~~~l~~le~~m~~~-Ga~ 322 (452) T protein:vir:94 250 ADLEHGRHFTGLPTPWITGA--ESQSTMHIGSTKAWVIPEVAAK----VGFLEFTGQGLQSLEKALSEKQAQLASL-SAR 322 (452) T ss_pred hHHHHHHHHcccceeEeecC--cCCCceEecccccccCCCCCCc----ceEEccCchhHHHHHHHHHHHHHHHHHH-HHH Confidence 98999999999997766532 22233444555555554 2322 334442211 122334455555555433 321 Q ss_pred hHhcCCCccccchHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhcccc Q lcl|NC_021540. 466 SFSQGLTGDSLGTTTAGVQGVIGA-SGKRELGILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGS 544 (705) Q Consensus 466 d~~~G~~~~~~~~~a~~i~~l~~~-~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~ 544 (705) ...+... +++++.......+ ....+..++.++++++ .++|.++..|.....- .-+.+|++-.... T Consensus 323 -ll~~~~~---~~~s~ea~~~~~~~~~s~L~~~a~~~e~al----~~~l~~~a~w~g~~~~------~~v~~n~dF~~~~ 388 (452) T protein:vir:94 323 -LIDNSTR---GSEATETVKLRYMSETASLKSVTRAVEALL----NKAYSCIMDMESMGGT------LNIKLNSAFLDSK 388 (452) T ss_pred -hhccCCC---cchHHHHHHHHHHHhhHHHHHHHHHHHHHH----HHHHHHHHHHcCCCCc------eEEEecccccccc Confidence 2222111 1222222222222 3456677777776655 5666667777654321 1233333321111 Q ss_pred eeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhccchhh---hhhhc--ccccc-------hhhHH Q lcl|NC_021540. 545 FDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGMPDLS---KMISK--YNPEP-------SPQAQ 612 (705) Q Consensus 545 ~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~~~~~---~~~~~--~~~q~-------~~~~q 612 (705) + +.+.++++.++.++.. .... -++..+.+ .++.... +.+.. +.+.| ++.-. T Consensus 389 ~-----------~~~~~~al~~~~~~G~--is~~---t~~~~L~~-~gvl~~~~e~~~i~~E~~~~~~~~~~~~~~~~~~ 451 (452) T protein:vir:94 389 L-----------TAAELKAWVEAYLSGG--ISKE---IYIHALKV-GKVLPPPGESMGVIPDPPAPEPSPSNTPPNPSSK 451 (452) T ss_pred C-----------CHHHHHHHHHHHhcCC--CcHH---HHHHHHHh-CCCCCCccCHHHHHHHhhccCcccCCCCCCCccC Confidence 1 1223333344433221 1110 01111111 1111100 00000 00000 00000 Q ss_pred H Q lcl|NC_021540. 613 L 613 (705) Q Consensus 613 ~ 613 (705) . T Consensus 452 ~ 452 (452) T protein:vir:94 452 A 452 (452) T ss_pred C Confidence 0 No 142 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=78.94 E-value=0.11 Score=25.85 Aligned_cols=443 Identities=12% Similarity=0.020 Sum_probs=163.8 Q ss_pred CCCCCCCCCC-CCcC---CCHHHHHHHHHHHHHHHHhhcCCCCEE-E---EeCCCcchHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_021540. 58 GAYKPKQQVG-RSSV---QPKLIRKQAEWRYSALSEPFLNDENIF-S---IAPKTWQDREAARQNEAILNYQFNNQLDKV 129 (705) Q Consensus 58 ~~~~~~~~~g-rs~~---v~~~v~~~~e~~~~~l~~~f~~~~~~~-~---~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~ 129 (705) |.... | |+.| .+-.....-.|- +++..++|+..+ . +.|.-+. ++ ..+.|-+|+ .. T Consensus 1 ~~~~~----~~~~~V~~~hp~y~a~~~~W~---~ird~~~G~~~~~~r~~yl~~~~~-~~---~e~~Y~~rl----~r-- 63 (489) T protein:vir:78 1 MLTEN----GQGSGVKTKHREWLHYAPKWQ---KVRHALAGELVSYLRNVGLNEPDK-AY---GEARQAEYE----AG-- 63 (489) T ss_pred CccCC----CccCCCCccCHHHHHHHHHHH---HHHHHhcCcccccccCCCCCCCCC-CC---ChHHHHHHH----hc-- Confidence 33322 2 3333 222344444665 477778776531 1 2322111 00 112355553 11 Q ss_pred chHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchh-hhcchHHHHHHHHHhhhh Q lcl|NC_021540. 130 KLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPS-ILDTMPEALAESVRYSVA 208 (705) Q Consensus 130 ~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 208 (705) -+++++++++|..=.|.+ |..+ |..+. ...++-+..+.+ ...++...+...+..... T Consensus 64 A~~~n~~~~tl~~l~G~v---frk~--------p~~~~-----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~ 121 (489) T protein:vir:78 64 GIVYNFTRRTLSGMVGSV---MRKE--------PEINI-----------PKELEYLLKNADGSGVGLIQHAQDTLMEIDS 121 (489) T ss_pred cccCChHHHHHHHHhchh---hcCC--------cceec-----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHHh Confidence 235678888876655533 1111 11111 112222333333 345566777777777777 Q ss_pred cCccceeccCccccc---ceeeeccCcceEEEechhheeeCCCccCChhhC----CeEEEEEeccHHHHHHhcCCcCcch Q lcl|NC_021540. 209 NNRPILAIINGYEEQ---EVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEA----KFVIYSFESSRSDLEKYGIYSNLEY 281 (705) Q Consensus 209 ~g~~~~~~~~~~~~~---~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da----~~~~~~~~~t~~el~~~g~~~d~~~ 281 (705) +|..+.-+-...... ...+...-+|++..++|.+|+ ++.-. ..+. .++..+.... T Consensus 122 ~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii-nW~~~--~v~G~~~Lt~v~lrE~~~--------------- 183 (489) T protein:vir:78 122 VGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV-NWRLT--RVGSVNRVTMVVLRETWE--------------- 183 (489) T ss_pred cCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc-Cceee--eeCCccceeEEEEEEeEE--------------- Confidence 776654322111100 001111236888888888885 33211 1122 1211111000 Q ss_pred hhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEE--EEEEECCE------EEe-cccCCCCCCC Q lcl|NC_021540. 282 IKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPI--VASWVDDV------MIR-LEKNPYPDGK 352 (705) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~--~~~~~g~~------iL~-~~~~p~~~~~ 352 (705) ..+....|+.....+++|++. +.+|.-+.+ +..--|+. ++. .... ..+. T Consensus 184 --------------~~d~~~~f~~~~~~q~RvL~~------~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~--~l~~ 241 (489) T protein:vir:78 184 --------------YNEPGNEFETKYGEQYRVLDI------DSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGES--LRGV 241 (489) T ss_pred --------------eecCCCCccceeEEEEEEEec------CCCcceEEEEEEeecCCcccceeeEEeccCCCC--ccCe Confidence 001111222233334444432 112211111 00001111 111 1111 1245 Q ss_pred cceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhh-h--------hhcCCcce Q lcl|NC_021540. 353 LPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNE-R--------KFKMGEDY 423 (705) Q Consensus 353 ~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~-~--------~~~pg~~i 423 (705) +||+++..... +...+.++.-.+..++..+=...+-.-+.+..++.|...+. |.-+.++. . ..-+...+ T Consensus 242 IPfv~~~~~~~-~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~d~~~~~~~~~~~~~~i~~g~~~~~ 319 (489) T protein:vir:78 242 IPFTFIGATNN-DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGENLTPQAFKEANPNGIKFGSRRGH 319 (489) T ss_pred eeEEEEecCCC-CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCccCCcccccccCccceeeCCcccc Confidence 56665543322 22224555556666655544445556777888888877654 22111111 1 11111122 Q ss_pred eecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 424 KYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLAN 503 (705) Q Consensus 424 ~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~ 503 (705) .+-.+ ....+.++.... .....|..+.+.+.. .|..- .. .+ .+.||++++.-..+....+..++.++++ T Consensus 320 ~lp~~----~~~~~ie~~~~~-~~r~~l~~le~qm~~-lGa~l--~~-~~--~~~Ta~~~~~~~~~~~S~L~~~a~~~e~ 388 (489) T protein:vir:78 320 NLGYG----GSAQLIQAGENN-LARQNMLDKEQQAIQ-IGAQL--IT-PT--QQITAQSARIQRGADTSVMATIARNVSQ 388 (489) T ss_pred cCCCC----CCcceeccCcch-HHHHHHHHHHHHHHH-Hhhhh--cc-CC--cchhHHHHHHHHHHhhHHHHHHHHHHHH Confidence 11111 112333333222 223334433443332 23221 11 11 1355655555445566667777777765 Q ss_pred HHHHHHHHHHHHHHHhcCCc--eeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchh-HHH Q lcl|NC_021540. 504 GLTEVAKKILAMNSVWLSDE--EVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPF-DMT 580 (705) Q Consensus 504 ~~~~~~~~~l~li~q~~~~~--~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~-~~~ 580 (705) ++. ++|.++..|.... ..+ -+.+|++-... ..+.+.++++..+.+... +.. ... T Consensus 389 al~----~~l~~~a~w~G~~~~~~~------~i~~n~dF~~~-----------~~d~~~~~al~~~~~~G~--is~~t~~ 445 (489) T protein:vir:78 389 AYT----DALRWVAVMLGKPEDTEV------EFRLNMDFFLE-----------PMTAQDRAAWMADINAGL--LPATAYY 445 (489) T ss_pred HHH----HHHHHHHHHcCCCCCCce------EEEeecccCcc-----------cCCHHHHHHHHHHHhcCC--CCHHHHH Confidence 554 5555566664431 111 12233221111 111223344444443221 111 111 Q ss_pred HHHHH-HHHhhhccchhhhhhhccc--------ccchhhHHHHHH Q lcl|NC_021540. 581 KLILG-EIAKLRGMPDLSKMISKYN--------PEPSPQAQLEIQ 616 (705) Q Consensus 581 ~~il~-~l~e~~~~~~~~~~~~~~~--------~q~~~~~q~~~q 616 (705) ..+.. .+.+. ....+...+.... .+.++..|+..+ T Consensus 446 ~~L~~~gv~d~-~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 446 AALRKAGVTDW-TDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHhCCCCCc-cHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 11100 01000 0111111111110 000111111100 No 143 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=78.81 E-value=0.11 Score=25.82 Aligned_cols=440 Identities=12% Similarity=0.055 Sum_probs=147.4 Q ss_pred Ccch-----hhhhhcccccc-----cCCCCCCHHHHHHHHHHHHHhhHHhhHHHHHHHHH-HHHhccCC-CCCCC----- Q lcl|NC_021540. 1 MSDI-----NEEFLEDTVPS-----LQEDWKNKPKVSDLLNDFNNAKSTKDTQVAIIDDW-LAQLNVTG-AYKPK----- 63 (705) Q Consensus 1 ~~~~-----~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~----- 63 (705) |-+. -.=+|-.+.|. .++.|.- + .| + .... .+.+ -.|.-.-. ...+. T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~---~----~d---~--g~~~----~k~~g~~YLPk~~~~~~~~~~d~~ 64 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWLR---N----LD---C--VMDN----IKRKKQTYLPNLGAIPPEAKTDPK 64 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhhH---h----hh---h--hhHH----HHHhhhhcCCCCCCccccccCcch Confidence 1100 00011111111 1334421 0 01 1 1111 1111 11211000 00000 Q ss_pred -------------CCCCCCcCCCHHHHHHHHHHHHHHHHhhcCCCCEEEEeCCCcchHHHHHHHHHHHHHHHHhhcCCcc Q lcl|NC_021540. 64 -------------QQVGRSSVQPKLIRKQAEWRYSALSEPFLNDENIFSIAPKTWQDREAARQNEAILNYQFNNQLDKVK 130 (705) Q Consensus 64 -------------~~~grs~~v~~~v~~~~e~~~~~l~~~f~~~~~~~~~~p~~~~D~~~A~~~t~~~n~~~~~~~~~~~ 130 (705) ....|-.+..+-++.+++.++..+ |.-++.++ .+...+ ...++. +-...|.. T Consensus 65 y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl~~l~G~v----frk~p~~~------~~~~~~--l~~l~~---d~D~~G~~ 129 (488) T protein:vir:96 65 VTALAAKIEKDWEDLTWRLANYVNIVNPTMNAITGAV----MRREPEFD------TMDNPV--LIGLRD---NIDGKGNG 129 (488) T ss_pred hhhhhccchhhhHhhhhhccccCchhHHHHHHhcchh----hccCceec------cCCcHH--HHHHHh---ccCCCCCC Confidence 001123456777888888776555 33333333 221111 111111 11233443 Q ss_pred ---hHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhh Q lcl|NC_021540. 131 ---LIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSV 207 (705) Q Consensus 131 ---~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (705) ++..+++.+|..|.+-+=|-. +.... +..+.. T Consensus 130 L~~f~~~~~~~~l~~G~~~ilVD~----------------P~~~~---------------------T~ade~-------- 164 (488) T protein:vir:96 130 IDQECKQALNALQWGSRCGWLVRS----------------HPESA---------------------TMADWN-------- 164 (488) T ss_pred HHHHHHHHHHHHHhcCeEEEEEec----------------CCCcC---------------------CHHHHH-------- Confidence 377888888888888664421 11000 000100 Q ss_pred hcCccceeccCcccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEE--EEEeccHHHHHHhcCCcCcchhhhh Q lcl|NC_021540. 208 ANNRPILAIINGYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVI--YSFESSRSDLEKYGIYSNLEYIKED 285 (705) Q Consensus 208 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~--~~~~~t~~el~~~g~~~d~~~~~~~ 285 (705) ...-+|.+..++|.+++ ++.-. ..+.+..+ .+.+-+..+ T Consensus 165 --------------------~~~~rPy~~~~~a~~Ii-nW~~~--~v~G~~~L~~v~lrE~~~~---------------- 205 (488) T protein:vir:96 165 --------------------KGKKLPTAAFYDALHII-DWEVE--YIDGEEKLTYLSLLEDYQE---------------- 205 (488) T ss_pred --------------------HhcCCcEEEEechhhhc-Cccee--ccCCceeeEEEEEEEEEEe---------------- Confidence 00225788888888875 33211 11222111 111100000 Q ss_pred hhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC---CEEEec-ccCCCCCCCcceEEeeee Q lcl|NC_021540. 286 SSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD---DVMIRL-EKNPYPDGKLPFVVVPYL 361 (705) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g---~~iL~~-~~~p~~~~~~Pfv~~~~~ 361 (705) .| ..++. ++..++++. | .+|..+.++..-.+ ...... ... ..+.+||+++... T Consensus 206 -----------~D-~~~~~--~~~~~~~~~-l------~~g~~~v~~~~~~~~~~e~~~~~~g~~--~l~~IP~v~~~~~ 262 (488) T protein:vir:96 206 -----------RD-GGTYV--SKQRLINHR-L------VDGLCEFQEVTDDEYSDEWTPVLINSK--QSDTIPFFLASSQ 262 (488) T ss_pred -----------cc-CCCcc--cceEEEEEE-E------ECcEEEEEEEecCCcccceEeecCCCc--ccCeeEEEEEecC Confidence 00 00111 111122211 0 02221111111111 111111 111 1244566654333 Q ss_pred eecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCchhhhhhcCCccee-ec-CCccccccccccc Q lcl|NC_021540. 362 PVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNLLDPVNERKFKMGEDYK-YN-PGTNPVTDIIEHK 439 (705) Q Consensus 362 ~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~av~~~d~~~~~pg~~i~-~~-~~~~~~~~i~~~~ 439 (705) .. +...+.++.-.+..++..+=...+-.-+++..+.-|.++..-+-.+........+.++.. .+ +...+.+.+.+.+ T Consensus 263 ~~-~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e 341 (488) T protein:vir:96 263 SN-EWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTLAGRMPYYVKNGDVKVIQ 341 (488) T ss_pred CC-CCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeeecccccccccCCceeecC Confidence 21 222345555566666655544555556666666666665421111111111111222110 01 1101112222222 Q ss_pred CccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021540. 440 YPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVW 519 (705) Q Consensus 440 ~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~ 519 (705) +..+.-....|+.+.+.+.. .|..-...| .+.||++++.-..+....+..++.++++++. ++|.++..| T Consensus 342 -~~~~~l~~~~l~~l~~qm~~-~Ga~l~~~~-----~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~----~~l~~~A~w 410 (488) T protein:vir:96 342 -AQFSPETENKVEKLFEQAVK-VGASLFTQQ-----SNETATGAAIRSGSSTASMATLGNNVEDTVR----NMLRFIMRY 410 (488) T ss_pred -CchhHHHHHHHHHHHHHHHH-HhHhhccCC-----CcchHHHHHHHHHHhhHHHHHHHHHHHHHHH----HHHHHHHHH Confidence 22222223445555555533 333222121 1245665655445666677888877776555 455556666 Q ss_pred cCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHhhhcc------ Q lcl|NC_021540. 520 LSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIAKLRGM------ 593 (705) Q Consensus 520 ~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~e~~~~------ 593 (705) ....--.....+--+.||++-.. +..+.+.++++..+.++.. +... -++..+.. .++ T Consensus 411 ~g~~~~~~~~~~~~~~in~dF~~-----------~~ld~~~~~al~~~~~~G~--Is~~---t~~~~L~~-~gvl~~d~~ 473 (488) T protein:vir:96 411 FEGTNLYVNPDELVFKLNRDYFD-----------VEVNPQMLQVAYAAMMEGN--LPQV---SWFELLKR-ARVVRGDMS 473 (488) T ss_pred cCCCCCCcCccceEEEeccCCCC-----------ccCCHHHHHHHHHHHhcCC--CCHH---HHHHHHHh-CCcCCccCC Confidence 54210000000001222222111 1112233444444444321 1111 01111111 111 Q ss_pred -chhhhhhhcccccchhh Q lcl|NC_021540. 594 -PDLSKMISKYNPEPSPQ 610 (705) Q Consensus 594 -~~~~~~~~~~~~q~~~~ 610 (705) +.....++. +.... T Consensus 474 ~e~~~~~ie~---~g~~~ 488 (488) T protein:vir:96 474 KEEFDEHIAE---LGFGM 488 (488) T ss_pred HHHHHHHHhh---cCCCC Confidence 001111110 00000 No 144 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=75.82 E-value=0.14 Score=25.23 Aligned_cols=258 Identities=9% Similarity=-0.014 Sum_probs=96.5 Q ss_pred ceeccCcccccceeeeccCcceEEEechh-heee-CCCccCChhhCCeEEEEEeccHHHHHHhcCCcCcchhhhhhhhhh Q lcl|NC_021540. 213 ILAIINGYEEQEVIKTVKNQPEVTICDYH-NVTI-DPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSNLEYIKEDSSTST 290 (705) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~i~~V~~~-~~~~-Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~ 290 (705) +...+.... . .++ ..+ ++. .++- .|. .++|-.++...-. .++ .. T Consensus 1 ia~l~~~~~----~---~~~-~~~--~~l~~lL~~~PN--------------~~~t~~~f~~~~~-~~l---------l~ 46 (278) T protein:vir:78 1 MASLPLKMY----E---DYK-VVN--TEVSDLLTVSPN--------------NSLSSFDFINQIE-TIR---------NE 46 (278) T ss_pred CccceeEEE----e---cCc-ccc--cHHHHHHHhcCC--------------CCCCHHHHHHHHH-HHH---------hh Confidence 110000000 0 000 000 111 1110 111 1222222221100 000 00 Q ss_pred ccccccccccccccccccCeEEEEEEE------EEeeecCCCeeEEEEEEEECCEEEecccCCCCCCCcceEEeeeeeec Q lcl|NC_021540. 291 SSDHYSSDTSFTFSDKARKKIVVYEYW------GYWDIDGSGVTTPIVASWVDDVMIRLEKNPYPDGKLPFVVVPYLPVK 364 (705) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~v~v~E~w------~k~~~~~dg~~~~~~~~~~g~~iL~~~~~p~~~~~~Pfv~~~~~~~~ 364 (705) .+..+-.-. .+...+ +.++| +.+..+.+|....+.+...++.... |+... .+++...... T Consensus 47 ~Gna~~~i~----r~~~G~---~~~l~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~-----~~~~e--vih~~~~~~~ 112 (278) T protein:vir:78 47 KGNAYVLIE----RDIYHQ---PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLI-----VHNMD--MLHFKHIVAS 112 (278) T ss_pred cCCEEEEEE----ECCCCc---EEEEEEECCceeEEEEcCCCceEEEEEEcCCceEEE-----Ecccc--EEEECCCCCC Confidence 000000000 000000 11222 2223333443333333333333221 11111 2222222234 Q ss_pred CcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEe-eccccCchhhhh---------hcCCcceeecCCcccccc Q lcl|NC_021540. 365 DSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGM-SKNLLDPVNERK---------FKMGEDYKYNPGTNPVTD 434 (705) Q Consensus 365 ~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~-~~~av~~~d~~~---------~~pg~~i~~~~~~~~~~~ 434 (705) +..+|.|.+..+...-...+...+..+... .+.|..++ ..+.++++.... ...|+++.+.+|.. T Consensus 113 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~---- 186 (278) T protein:vir:78 113 NMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVE---- 186 (278) T ss_pred CCeeeccHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCce---- Confidence 567899999888887777666555443333 23344444 344454433211 12445555544332 Q ss_pred cccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_021540. 435 IIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLAN-GLTEVAKKIL 513 (705) Q Consensus 435 i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~-~~~~~~~~~l 513 (705) +........-....+..+...+.+-...||++...|...+..-.+ +.+. .+.|.+ .+..+.+.+- T Consensus 187 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn---~~~~-----------~~~~~~~~l~P~~~~i~ 252 (278) T protein:vir:78 187 IEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAK---NEEL-----------NRFYLQHTLLPIVKQYE 252 (278) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc---HHHH-----------HHHHHHHHHHHHHHHHH Confidence 233333222334455567788888889999999999654432112 1111 112222 3344444433 Q ss_pred HH-HHHhcCCceeEeEecCceeeechhhc Q lcl|NC_021540. 514 AM-NSVWLSDEEVIRITDEEFVQINRDNL 541 (705) Q Consensus 514 ~l-i~q~~~~~~~iri~~~~~v~i~~~~~ 541 (705) +- ..+.+++... ..+.++.+|.+.+ T Consensus 253 ~~ln~~L~~~~e~---~~g~~~~f~~~~l 278 (278) T protein:vir:78 253 EEFNRKLLTKTDR---EKIGILNLTLNLI 278 (278) T ss_pred HHHHhhcCChhHh---cCCceEEEecccC Confidence 32 2233443221 1234666666655 No 145 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=61.98 E-value=0.34 Score=23.13 Aligned_cols=112 Identities=13% Similarity=0.121 Sum_probs=10.2 Q ss_pred hccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH---HH-HHH Q lcl|NC_021540. 591 RGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKAN--TEADL---NT-LDF 664 (705) Q Consensus 591 ~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~--~ea~~---~~-~~~ 664 (705) +.+..+.+.+.....+... ...+.+.......+...++.+.+.+.+....++....++..... .++.. +. .+. T Consensus 1 Mki~elk~el~~~~~el~~-~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~ 79 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNT-KKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDL 79 (437) T ss_pred CCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111110000000 00000000000000001111111111111111100000000000 00000 00 000 Q ss_pred H--HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cC Q lcl|NC_021540. 665 V--EQETGVK--QERELELMQAQAKGNTQRDIVKTFLDTNKQG--NQ 705 (705) Q Consensus 665 ~--~q~~~~k--q~~e~e~~~~q~~~~~~~~~~k~~~~~~~q~--~~ 705 (705) . ++..... ...+..+... +.+...+..+......... .. T Consensus 80 ~~~e~~~~~~~~e~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 124 (437) T protein:vir:10 80 VAPELEENSADNEEDDPEKLKT--ETKSEAEKDKKTVKDEEKRDAGG 124 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHhHHH Confidence 0 0000000 0000000000 0000101001000100000 00 No 146 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=60.70 E-value=0.37 Score=22.97 Aligned_cols=566 Identities=10% Similarity=0.020 Sum_probs=168.9 Q ss_pred HHHHHHHHHHHHHHhhcCC-------------CCEEEEe-CCCcchHHHHHHHHHHHHHHHH--hhcCCcc-hHHHHHHH Q lcl|NC_021540. 76 IRKQAEWRYSALSEPFLND-------------ENIFSIA-PKTWQDREAARQNEAILNYQFN--NQLDKVK-LIDTMVRT 138 (705) Q Consensus 76 v~~~~e~~~~~l~~~f~~~-------------~~~~~~~-p~~~~D~~~A~~~t~~~n~~~~--~~~~~~~-~~~~~~~~ 138 (705) ..++.+.++..|++-|... +.-|.|. +.-|.++..+. +. .+-.|.+ +..|.|+- T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~---------l~~~~q~~~rP~~~~N~i~~ 71 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAG---------TKLDEQFEKYPKFEINKVAT 71 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHH---------HHhhhhhcCCCceEEcchHH Confidence 1122222222222222110 0002222 22233333331 11 1222333 34466666 Q ss_pred HHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchhhhcchHHHHHHHHHhhhhcCccceeccC Q lcl|NC_021540. 139 AVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPSILDTMPEALAESVRYSVANNRPILAIIN 218 (705) Q Consensus 139 al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 218 (705) .+..-.| ..+.+++.+.+.+...+........+..+.......+....+++.+|.+.+.+|+||.++.. T Consensus 72 ~i~~v~g-----------~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~ 140 (708) T protein:vir:17 72 ELNRIIA-----------EYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) T ss_pred HHHHHHh-----------hHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeee Confidence 6666555 45566677888888554434445555555555555677889999999999999999875422 Q ss_pred cccccceeeeccCcceEEEechhheeeCCCccCChhhCCeEEEEEeccHHHHHHhcCCcC--cchhhhhhhhhhcccccc Q lcl|NC_021540. 219 GYEEQEVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVIYSFESSRSDLEKYGIYSN--LEYIKEDSSTSTSSDHYS 296 (705) Q Consensus 219 ~~~~~~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~~~~~~t~~el~~~g~~~d--~~~~~~~~~~~~~~~~~~ 296 (705) .+.+.. .+ . -.|.++.+-+.- .++...-|=-..+..+.++..-...-+. .+.+...+. .... T Consensus 141 d~~~e~-------d~--~-~~~~~i~i~~~~-~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp-----~~a~ 204 (708) T protein:vir:17 141 MLVNEY-------DP--M-DDRQRIAIEPIY-DPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYG-----KKPP 204 (708) T ss_pred cccccC-------CC--C-CCccccceEeec-cchhheecCccccccChhhhhhhhhhccCCHHHHHHhCc-----cccc Confidence 211100 00 0 011111110000 0000000001112223333321111111 111111111 1100 Q ss_pred ccccccccccccCeEEEEEEEEEeeecCCC--eeEEEEEEE------------ECCEEEecccCC------C-------- Q lcl|NC_021540. 297 SDTSFTFSDKARKKIVVYEYWGYWDIDGSG--VTTPIVASW------------VDDVMIRLEKNP------Y-------- 348 (705) Q Consensus 297 ~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg--~~~~~~~~~------------~g~~iL~~~~~p------~-------- 348 (705) ..... ..+..|..-.+..+. +.+.++..+ +|+.+...+... + T Consensus 205 ~~~~~----------~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~ 274 (708) T protein:vir:17 205 ASLDV----------TSMTSWEYDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEV 274 (708) T ss_pred hhhhh----------hhhccccccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccc Confidence 00000 000111111111222 223222211 122111111110 0 Q ss_pred ----------------------CCCCcceEEeeeeeecCccc-CCchH--HHhhHHHHHHHHHHHHHHHHHHhcCCCcEE Q lcl|NC_021540. 349 ----------------------PDGKLPFVVVPYLPVKDSVY-GEADA--ELLSDNQKLIGALTRGMIDAMARSANGQRG 403 (705) Q Consensus 349 ----------------------~~~~~Pfv~~~~~~~~~~~~-g~g~~--~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~ 403 (705) ..+.+|+-.|++.|.-+... -.|.. ..++..=+-.=...|...-.+... -.. T Consensus 275 ~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~---~a~ 351 (708) T protein:vir:17 275 ARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADT---AAQ 351 (708) T ss_pred eeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHH---HHh Confidence 00113444444443332211 01111 111111111111111111000000 000 Q ss_pred eeccc-c-Cchhhhh--hcCCc-------ceeecCCcccccccc----cccCccchHHHHHHHHHHHHHHHHHhCcchHh Q lcl|NC_021540. 404 MSKNL-L-DPVNERK--FKMGE-------DYKYNPGTNPVTDII----EHKYPELPASSYNMLQMFTLEADALSGVKSFS 468 (705) Q Consensus 404 ~~~~a-v-~~~d~~~--~~pg~-------~i~~~~~~~~~~~i~----~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~ 468 (705) ..+.. + +...... ..-.. ...+++...+...+. .....+.++-....++++......+.- . T Consensus 352 ~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~----~ 427 (708) T protein:vir:17 352 DPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQE----V 427 (708) T ss_pred cCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHH----h Confidence 00000 0 0000000 00000 011111111111111 111123233333344455555554433 3 Q ss_pred cCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeEeEecCceeeechhhccccee- Q lcl|NC_021540. 469 QGLTGDSLGTTTAGVQGVIGASGKREL-GILRRLANGLTEVAKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFD- 546 (705) Q Consensus 469 ~G~~~~~~~~~a~~i~~l~~~~~~~~~-~~~~n~~~~~~~~~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d- 546 (705) .|.....++ ..+.+++..-+..+... ...-.|-+.++.-.+.+..++..+...- .+.+..+.|...+-..++- T Consensus 428 tGi~d~~~G-~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~----y~~~R~~RI~~edg~~~~v~ 502 (708) T protein:vir:17 428 TGGSQAMQQ-MPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREV----YGSEREVRIVNEDGSDDIAV 502 (708) T ss_pred cCCChHHcc-CccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCcEEEEecCCCCcceee Confidence 554433332 22223332211111111 1111122233333344444444332110 1223344454333211111 Q ss_pred ------------------EEe---eccchhH--HHHHHHHHHHHHHHHhhhchhHHH-HH-HHHHHHhhhccchhhhhhh Q lcl|NC_021540. 547 ------------------IKL---SISNAET--DAIKAQELSFMLQTMGQSLPFDMT-KL-ILGEIAKLRGMPDLSKMIS 601 (705) Q Consensus 547 ------------------v~v---~~~~~~~--~~~~~q~~~~llq~~~~~~~~~~~-~~-il~~l~e~~~~~~~~~~~~ 601 (705) +.+ ++...+. ...+.++....+..+.+.+++... .. ++..+.+...++...+..+ T Consensus 503 in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e 582 (708) T protein:vir:17 503 LSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKE 582 (708) T ss_pred ecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHH Confidence 110 0111111 123344444444444444544433 32 2334556677777666555 Q ss_pred cccccchhhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 602 KYNPEPSPQAQLEIQ---IKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEADLNTLDFVEQETGVKQERELE 678 (705) Q Consensus 602 ~~~~q~~~~~q~~~q---~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea~~~~~~~~~q~~~~kq~~e~e 678 (705) ....+..+..+.... ..++..+..+.++++++..+. +++++..+++++.+..+++....+... .+++.+.+ T Consensus 583 ~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~--eaqa~~~~~qAe~~ka~aea~~~q~~a----~q~~~~~~ 656 (708) T protein:vir:17 583 YNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMV--LAQAQMVAAQAEAQKATNETAQTQIKA----FTAQQDAM 656 (708) T ss_pred HHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHH Confidence 544333222211110 111111112222222222222 222222222222222222222221111 11111111 Q ss_pred ---HHHHHHH-----HHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 679 ---LMQAQAK-----GNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 679 ---~~~~q~~-----~~~~~~~~k~~~~~~~q~~~ 705 (705) .+..+.- +++.......+.....|..+ T Consensus 657 ~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q 691 (708) T protein:vir:17 657 ESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQ 691 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 1111111 11111111112222222222 No 147 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=58.40 E-value=0.41 Score=22.68 Aligned_cols=457 Identities=12% Similarity=0.009 Sum_probs=164.7 Q ss_pred CCCCCCCCCCCCcC---CCHHHHHHHHHHHHHHHHhhcCCCCEE----EEeCCC--cchHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021540. 58 GAYKPKQQVGRSSV---QPKLIRKQAEWRYSALSEPFLNDENIF----SIAPKT--WQDREAARQNEAILNYQFNNQLDK 128 (705) Q Consensus 58 ~~~~~~~~~grs~~---v~~~v~~~~e~~~~~l~~~f~~~~~~~----~~~p~~--~~D~~~A~~~t~~~n~~~~~~~~~ 128 (705) |..... -++.| .+-.....-.|- +++..|+|+... .+.|.- ++++ +.|-+|+- T Consensus 1 ~~~~~~---~~~~V~~~hp~y~a~~~~W~---~ird~~~G~~~~~~r~~yl~~~~~~~~e------~~Y~~rl~------ 62 (491) T protein:vir:95 1 MLTANG---QGSGVKTKHREWLHYAPKWQ---KVRHALAGDLVGYLRNVGLNEPDKAYGE------ARQAEYEA------ 62 (491) T ss_pred CcccCC---ccCCCCccCHHHHHHHHHHH---HHHHHhcCcchhhcccCCCcCCCCCCCH------HHHHHHHh------ Confidence 433221 12333 222333344554 566677776542 233321 1222 22554431 Q ss_pred cchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHHHhhchh-hhcchHHHHHHHHHhhh Q lcl|NC_021540. 129 VKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQMYQMNPS-ILDTMPEALAESVRYSV 207 (705) Q Consensus 129 ~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 207 (705) .-+++++++++|..=.|.+ |..+ |..+. ...++-...+.+ ...++...+...+.... T Consensus 63 rA~~~n~~~~tl~~l~G~v---frk~--------p~~~~-----------p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l 120 (491) T protein:vir:95 63 GGIVYNFTRRTLSGMVGSV---MRKE--------PEINI-----------PKELEYLLKNADGSGVGLIQHAQDTLMEID 120 (491) T ss_pred cccCCChHHHHHHHHhchh---hcCC--------ceeec-----------cHHHHHHHhccCCCCCCHHHHHHHHHHHHH Confidence 1235678888876655533 1111 11111 111222333333 34556667777777777 Q ss_pred hcCccceeccCccccc---ceeeeccCcceEEEechhheeeCCCccCChhhCCeEE--EEEeccHHHHHHhcCCcCcchh Q lcl|NC_021540. 208 ANNRPILAIINGYEEQ---EVIKTVKNQPEVTICDYHNVTIDPTCNGNLDEAKFVI--YSFESSRSDLEKYGIYSNLEYI 282 (705) Q Consensus 208 ~~g~~~~~~~~~~~~~---~~~~~~~~~~~i~~V~~~~~~~Dp~a~~d~~da~~~~--~~~~~t~~el~~~g~~~d~~~~ 282 (705) .+|..+.-+-...... ...+...-+|++..++|.+|+ ++.-. ..+.+..+ .+.+-+ T Consensus 121 ~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~Ii-nW~~~--~v~g~~~L~~v~l~E~---------------- 181 (491) T protein:vir:95 121 SVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIV-NWRLT--RVGSVNRVTMVVLRET---------------- 181 (491) T ss_pred HcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhc-Cceee--eeCCceeeeEEEEEEe---------------- Confidence 7776655332111100 000111236888888888885 33211 11111111 111100 Q ss_pred hhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEE--ECCEEEe-------cccCCCCCCCc Q lcl|NC_021540. 283 KEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASW--VDDVMIR-------LEKNPYPDGKL 353 (705) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~--~g~~iL~-------~~~~p~~~~~~ 353 (705) ....+....|+......++|++.. .+|....++.-+ .|+.... .+.+++ +.+ T Consensus 182 -----------~~~~d~~~~f~~~~~~qyRvL~l~------~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l--~~I 242 (491) T protein:vir:95 182 -----------WEYHEPGNEFETKYGEQYRVLDID------TDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLR--GVI 242 (491) T ss_pred -----------EEeecCCCCcccceEEEEEEEeec------CCCceEEEEEEEcCCCcceeeeeeeeecCCCccc--Cee Confidence 000111122333333445555431 122111111111 0111111 111122 445 Q ss_pred ceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCchhhhh-hcCCcc-eeecCCcc Q lcl|NC_021540. 354 PFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKNL-LDPVNERK-FKMGED-YKYNPGTN 430 (705) Q Consensus 354 Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~a-v~~~d~~~-~~pg~~-i~~~~~~~ 430 (705) ||+++.... .+-..+.++.-.+..++..+=...+-.-+.+..++.|...+.-+- .+ .+... ..+.++ +..+.+.. T Consensus 243 Pfv~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~-~~~~~~~~~~~i~~g~~~~~~ 320 (491) T protein:vir:95 243 PFTFIGATN-NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLT-PQSFKEANPNGIKFGSRCGHN 320 (491) T ss_pred EEEEEecCC-CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccC-cchhhccCcceeEecCcCCcC Confidence 665544332 122224455555655554444444555667788888777654221 11 11111 112111 11111111 Q ss_pred -c-ccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 431 -P-VTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTTAGVQGVIGASGKRELGILRRLANGLTEV 508 (705) Q Consensus 431 -~-~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a~~i~~l~~~~~~~~~~~~~n~~~~~~~~ 508 (705) + .....+.++... +.....|......+.. .|.. +...+ .+.||++++.-..+....+..++.++++++.. T Consensus 321 lP~~~~~~~ie~~~~-~~~~~~l~~~e~qm~~-~Ga~---l~~~~--~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~- 392 (491) T protein:vir:95 321 LGYGGSAQLIQAGEN-NLARQNMLDKEQQAIQ-IGAQ---LITPS--QQITAESARIQRGADTSVMATIARNVSQAYTD- 392 (491) T ss_pred CCCCCccceeecCcc-hHHHHHHHHHHHHHHH-HHHH---hccCC--cchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH- Confidence 0 111223332221 1223334444444333 2321 11111 13556555554456666778888777765554 Q ss_pred HHHHHHHHHHhcCCceeEeEecCceeeechhhcccceeEEeeccchhHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHH Q lcl|NC_021540. 509 AKKILAMNSVWLSDEEVIRITDEEFVQINRDNLVGSFDIKLSISNAETDAIKAQELSFMLQTMGQSLPFDMTKLILGEIA 588 (705) Q Consensus 509 ~~~~l~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~dv~v~~~~~~~~~~~~q~~~~llq~~~~~~~~~~~~~il~~l~ 588 (705) +|.++..|....-- ++--+.+|++-... ..+.+.++++..+.++. .++... +...+ T Consensus 393 ---~l~~~a~w~G~~~~----~~v~i~~n~dF~~~-----------~~~~~~~~all~~~~~G--~is~~t----~~~~L 448 (491) T protein:vir:95 393 ---ALRWVAMMLGKPED----SEVEFQLNMDFFLQ-----------PMTAQDRAAWMADINAG--LLPATA----YYAAL 448 (491) T ss_pred ---HHHHHHHHcCCCCC----CceEEEeecccccc-----------cCCHHHHHHHHHHHhcC--CCCHHH----HHHHH Confidence 45556666443100 00012233321111 11122344444444432 111111 11111 Q ss_pred hhhccc-----hhhhhhhcccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021540. 589 KLRGMP-----DLSKMISKYNPEPSPQAQLEIQIKQLEAQELQ 626 (705) Q Consensus 589 e~~~~~-----~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q 626 (705) ...++. .....+++..+......+-.-...++.++..+ T Consensus 449 ~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 449 RKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 111111 11111111110000000000000000000000 No 148 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=55.37 E-value=0.48 Score=22.33 Aligned_cols=470 Identities=10% Similarity=0.045 Sum_probs=170.3 Q ss_pred HhhHHhhHHHHHHHHHHHHhccCCCCCCCCCCC--CCcC--CCH-HHHHHHHHHHHHHHHhhcCCCCE-----EEEeCCC Q lcl|NC_021540. 35 NAKSTKDTQVAIIDDWLAQLNVTGAYKPKQQVG--RSSV--QPK-LIRKQAEWRYSALSEPFLNDENI-----FSIAPKT 104 (705) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--rs~~--v~~-~v~~~~e~~~~~l~~~f~~~~~~-----~~~~p~~ 104 (705) -|+....-. -+++. .-. .-|...|.-.=| -+.| ..+ .....-.|- +++.+|+|..- -.|.|.- T Consensus 1 ~~~~~~~~~-~~~~~--~~~-~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~---~ird~~~G~~~~r~~g~~YLP~~ 73 (535) T protein:vir:80 1 MARKRTTIR-RDVQS--KVL-IPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWR---KIMDCLSGQEAIKAKREEYLPMP 73 (535) T ss_pred CCcchhhhh-hhhhh--hcc-cCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHH---HHHHHhcChHHHHhcccccCCCC Confidence 111110000 00000 000 011111100001 1111 122 233333454 46666766533 2378875 Q ss_pred cchHHHHHHHHHHHHHHHHhhcCCcchHHHHHHHHHhcCCeEEEEeecchhhhhhhcccccccccCCchhHHHHHHHHHH Q lcl|NC_021540. 105 WQDREAARQNEAILNYQFNNQLDKVKLIDTMVRTAVNEGTVIFRTSWCLEETKVTENVPVFQYVEATGESIDLINQAVQM 184 (705) Q Consensus 105 ~~D~~~A~~~t~~~n~~~~~~~~~~~~~~~~~~~al~~g~gi~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (705) +......+-.+.|-+|+-. -+++++++++|..=.|.+-.. . |+.+.+ ..++- T Consensus 74 ~~~~~~~E~~~~Y~~rl~r------A~~~n~~~~tl~~l~G~vfrk---~--------p~~~~p-----------~~l~~ 125 (535) T protein:vir:80 74 SVDSRDEEQRRRYETYLQR------AIFYNVTARTLDGMMGQVFSR---D--------PIRQLP-----------PALEA 125 (535) T ss_pred CcccCCcCCHHHHHHHHhh------ccCCChhHHHHHHHhchhhcC---C--------cceecc-----------HHHHH Confidence 5333222334446665421 345678888887655643210 0 111111 12222 Q ss_pred Hhhchh-hhcchHHHHHHHHHhhhhcCccceeccCcc-ccc---ceeeeccCcceEEEechhheeeCCCccC-C-hhhCC Q lcl|NC_021540. 185 YQMNPS-ILDTMPEALAESVRYSVANNRPILAIINGY-EEQ---EVIKTVKNQPEVTICDYHNVTIDPTCNG-N-LDEAK 257 (705) Q Consensus 185 ~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~---~~~~~~~~~~~i~~V~~~~~~~Dp~a~~-d-~~da~ 257 (705) +..+.+ ....+...+...+.....+|..+.-+-... +.. ...+....+|++..++|.+|+ ++.-.. + ..... T Consensus 126 l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~Ii-nW~~~~v~G~~~Lt 204 (535) T protein:vir:80 126 IVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSII-NWRTKLVGGKSVIS 204 (535) T ss_pred HHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhcc-CccccccCCcccee Confidence 333332 344566677777777777776654331110 000 001122346889989999886 332111 0 00111 Q ss_pred eEEEEEeccHHHHHHhcCCcCcchhhhhhhhhhccccccccccccccccccCeEEEEEEEEEeeecCCCeeEEEEEEEEC Q lcl|NC_021540. 258 FVIYSFESSRSDLEKYGIYSNLEYIKEDSSTSTSSDHYSSDTSFTFSDKARKKIVVYEYWGYWDIDGSGVTTPIVASWVD 337 (705) Q Consensus 258 ~~~~~~~~t~~el~~~g~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~w~k~~~~~dg~~~~~~~~~~g 337 (705) ++..+...+ ..++ .|+......+++++ .+.+|....++-..-+ T Consensus 205 ~v~lrE~~~-----------------------------~~dd--~f~~~~~~q~RvL~------~~~~G~y~v~~~~~~~ 247 (535) T protein:vir:80 205 LVVIQENVL-----------------------------AQDD--GFETTYVQQWRVLQ------LNAEGNYQVERWRRET 247 (535) T ss_pred EEEEEEEEE-----------------------------ecCC--CcccceeEEEEEEE------ecCCceEEEEEEEeec Confidence 221111100 0011 11212222233322 2222221111000000 Q ss_pred --C-------EE-EecccCCCCCCCcceEEeeeeeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhcCCCcEEeecc Q lcl|NC_021540. 338 --D-------VM-IRLEKNPYPDGKLPFVVVPYLPVKDSVYGEADAELLSDNQKLIGALTRGMIDAMARSANGQRGMSKN 407 (705) Q Consensus 338 --~-------~i-L~~~~~p~~~~~~Pfv~~~~~~~~~~~~g~g~~~~~~d~Q~~iN~~~~~~~d~~~~~~~~~~~~~~~ 407 (705) + ++ .....++ .+.+||+++.... -+...+......+..++..+=...+-.-+.+..+..|...+. | T Consensus 248 ~~~~~~~~~~~~~~~~g~~~--l~~IPfv~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~-G 323 (535) T protein:vir:80 248 QEEMYYSYSKHVPTDGNGNP--FKEIPFQFIGPLD-NNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFT-G 323 (535) T ss_pred CCccccccceeecccCCCcc--cCeeEEEEeecCC-CCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeee-c Confidence 0 00 1111122 2445666543222 233345666777888877776666667778888888876654 3 Q ss_pred ccCch--hh-----hhhcCCcceeecCCcccccccccccCccchHHHHHHHHHHHHHHHHHhCcchHhcCCCccccchHH Q lcl|NC_021540. 408 LLDPV--NE-----RKFKMGEDYKYNPGTNPVTDIIEHKYPELPASSYNMLQMFTLEADALSGVKSFSQGLTGDSLGTTT 480 (705) Q Consensus 408 av~~~--d~-----~~~~pg~~i~~~~~~~~~~~i~~~~~~~i~~~~~~~l~~~~~~~~~~tGv~d~~~G~~~~~~~~~a 480 (705) ..+.. +. +..-+...+.+..++. ..+.......++ ...|+...+.+..+ |..-...+ ..+.|| T Consensus 324 ~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~--~~~~e~~~~~~a---~~~l~~~e~qM~~l-Ga~ll~~~----~~~~Ta 393 (535) T protein:vir:80 324 LTKDWVEDVFKDFKVHLGSRAIIPLPQGAT--AGILQITPNSVP---FEAMTHKESQMIAM-GANLLVKS----GGNRTF 393 (535) T ss_pred CchhhhhcCCCCcceEecCcccccCCCCCC--cceeeeccchhH---HHHHHHHHHHHHHH-HHHhhccC----cccccH Confidence 22111 11 1122223333322221 122233333333 23345555554442 32222222 112445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CceeEeEecCceee--echhhcccceeEE----ee Q lcl|NC_021540. 481 AGVQGVIGASGKRELGILRRLANGLTEVAKKILAMNSVWLS----DEEVIRITDEEFVQ--INRDNLVGSFDIK----LS 550 (705) Q Consensus 481 ~~i~~l~~~~~~~~~~~~~n~~~~~~~~~~~~l~li~q~~~----~~~~iri~~~~~v~--i~~~~~~~~~dv~----v~ 550 (705) ++++.-..+....|..++.++++++..+ |.++..|.. +..+.--.+.+|+. +++..+..-+.+. ++ T Consensus 394 ~~a~~~~~~~~S~L~~~a~~le~al~~a----L~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is 469 (535) T protein:vir:80 394 GEAQQEEASEQSILSACTKNVSMAFRKA----LRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAIT 469 (535) T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHH----HHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCC Confidence 5554433445555777777777665554 445555543 23332223444432 2232221100000 00 Q ss_pred ------------c---c-chhHHHHHHHH-HHHHHHHHhhhch-hHHHH---HHHHHHHhhhccch Q lcl|NC_021540. 551 ------------I---S-NAETDAIKAQE-LSFMLQTMGQSLP-FDMTK---LILGEIAKLRGMPD 595 (705) Q Consensus 551 ------------~---~-~~~~~~~~~q~-~~~llq~~~~~~~-~~~~~---~il~~l~e~~~~~~ 595 (705) + . ..+....+.+. ...+-...+...+ ..... .+-..-.......+ T Consensus 470 ~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 470 FKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred HHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCccccccCCC Confidence 0 0 00000111110 0000001111111 11000 00000001111111 No 149 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=29.74 E-value=1.6 Score=19.42 Aligned_cols=126 Identities=13% Similarity=0.164 Sum_probs=15.8 Q ss_pred chhHHHHHHHHHHHhhhc-cchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH Q lcl|NC_021540. 575 LPFDMTKLILGEIAKLRG-MPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQ--AEAAKAR 651 (705) Q Consensus 575 ~~~~~~~~il~~l~e~~~-~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q--~e~a~a~ 651 (705) |- +..+-.++.++.. +......++........ .....+....+....+.++..++.++.......+ .+..+.. T Consensus 1 Mk---i~elk~el~~~~~el~~~~~elr~~~~~~~~-~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~ 76 (437) T protein:vir:10 1 MK---IEKLKKDLATKTAELNTKKAEIRSFTESEDK-TIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDD 76 (437) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1111111111110 00000001100000000 0000111111111222222222222211111111 1111110 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_021540. 652 KANTEADLNTLDF-VEQETGVKQE----RELELMQAQAKGNTQRDIVKTFLDTNKQGNQ 705 (705) Q Consensus 652 ~~~~ea~~~~~~~-~~q~~~~kq~----~e~e~~~~q~~~~~~~~~~k~~~~~~~q~~~ 705 (705) ............. .+.....+.. ...+.............. ............ T Consensus 77 ~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 134 (437) T protein:vir:10 77 SDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDA-GGLQDMKLKVGG 134 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH-HHHhHHHHHHHH Confidence 0000000000000 0111111111 111111111111111000 001111111111 No 150 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=20.35 E-value=2.8 Score=18.14 Aligned_cols=111 Identities=12% Similarity=0.099 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHhhhccchhhhhhhcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021540. 578 DMTKLILGEIAKLRGMPDLSKMISKYNPEPSPQAQLEIQIKQLEAQELQMRIAKLQAEIQLMPYEAQAEAAKARKANTEA 657 (705) Q Consensus 578 ~~~~~il~~l~e~~~~~~~~~~~~~~~~q~~~~~q~~~q~~q~~~q~~q~e~~k~qa~~q~~~~~~q~e~a~a~~~~~ea 657 (705) .+++..+.++.. .+.++...+.... .+.+.............+...++++.. ....+....+. T Consensus 1 ~~l~e~i~e~~~--~l~el~~~~~~~~------~e~r~~~e~~~~~~~~~~~~e~~~~~~---------~l~~ei~~l~e 63 (400) T protein:vir:38 1 MTLDEKLAAVKK--QLDEKRSALPAMK------TELRSLLEGEDSEENLKKAEGVRAKYD---------KAGKEIKDLEE 63 (400) T ss_pred CChHHHHHHHHH--HHHHHHHHHHHHH------HHHHHHHHhhccchHHHHHHHHHHHHH---------HHHHHHHHHHH Confidence 011111111100 0000000000000 000000000000000000000001000 00000000000 Q ss_pred HHHHHHHH----HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------hh----ccC Q lcl|NC_021540. 658 DLNTLDFV----EQE-------TGVKQERELELMQAQAKGNTQRDIVKTFLDTN---------KQ----GNQ 705 (705) Q Consensus 658 ~~~~~~~~----~q~-------~~~kq~~e~e~~~~q~~~~~~~~~~k~~~~~~---------~q----~~~ 705 (705) +....+.. +.. ...+...+........................ .. .+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (400) T protein:vir:38 64 KRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNA 135 (400) T ss_pred HHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhh Confidence 00000000 000 00000000000000000000000000000000 00 000 Done!