Query lcl|NC_021532.1_cdsid_YP_008125986.1 [gene=M610_gp015] [protein=hypothetical protein] [protein_id=YP_008125986.1] [location=complement(16731..18722)] Match_columns 663 No_of_seqs 224 out of 340 Neff 9.6 Searched_HMMs 1612 Date Thu Nov 7 17:09:35 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_15 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_15_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95821 Length: 763 100.0 2E-133 1E-136 748.2 68.1 641 1-663 21-725 (763) 2 protein:vir:8846 Length: 705 # 100.0 5E-121 3E-124 680.1 72.4 645 1-661 8-705 (705) 3 protein:vir:93630 Length: 776 100.0 8E-104 5E-107 585.8 55.8 604 1-663 36-718 (776) 4 protein:vir:108295 Length: 711 100.0 3E-101 2E-104 571.8 65.0 589 1-655 24-711 (711) 5 protein:vir:2764 Length: 714 # 100.0 1.8E-99 1E-102 562.0 64.4 605 1-663 8-713 (714) 6 protein:vir:9950 Length: 714 # 100.0 1.8E-99 1E-102 562.0 64.4 605 1-663 8-713 (714) 7 protein:vir:817 Length: 714 # 100.0 1.8E-99 1E-102 562.0 64.4 605 1-663 8-713 (714) 8 protein:vir:10117 Length: 714 100.0 1.8E-99 1E-102 562.0 64.4 605 1-663 8-713 (714) 9 protein:vir:3296 Length: 714 # 100.0 1.8E-99 1E-102 562.0 64.4 605 1-663 8-713 (714) 10 protein:vir:104437 Length: 714 100.0 1.6E-97 1E-100 551.2 62.7 605 1-663 1-713 (714) 11 protein:vir:80165 Length: 651 100.0 1.6E-96 1E-99 545.7 59.6 599 1-641 14-651 (651) 12 protein:vir:105619 Length: 772 100.0 9E-96 5.6E-99 541.6 57.7 593 1-663 1-708 (772) 13 protein:vir:77597 Length: 725 100.0 3E-90 1.9E-93 511.4 63.4 609 3-663 1-721 (725) 14 protein:vir:100920 Length: 725 100.0 7.8E-90 4.8E-93 509.1 61.0 611 3-663 1-721 (725) 15 protein:vir:9263 Length: 725 # 100.0 1.7E-89 1.1E-92 507.2 61.6 609 3-663 1-722 (725) 16 protein:vir:3520 Length: 720 # 100.0 1.3E-85 7.9E-89 486.0 59.1 603 1-659 1-720 (720) 17 protein:vir:172 Length: 708 # 100.0 2.4E-85 1.5E-88 484.5 58.5 604 1-663 1-704 (708) 18 protein:vir:105520 Length: 706 100.0 7.7E-85 4.8E-88 481.7 55.7 606 1-663 1-699 (706) 19 protein:vir:105429 Length: 708 100.0 1.5E-82 9.6E-86 469.1 59.9 608 1-662 1-708 (708) 20 protein:vir:95449 Length: 584 100.0 4.7E-84 2.9E-87 477.4 44.2 540 1-595 1-584 (584) 21 protein:vir:3139 Length: 599 # 100.0 2.4E-80 1.5E-83 457.0 41.5 548 1-607 1-599 (599) 22 protein:vir:94599 Length: 641 100.0 1.6E-78 9.8E-82 447.1 45.9 594 1-628 18-641 (641) 23 protein:vir:345 Length: 663 # 100.0 1.2E-43 7.4E-47 255.9 41.4 605 1-662 1-663 (663) 24 protein:vir:7321 Length: 556 # 100.0 5.4E-35 3.3E-38 208.5 45.0 518 3-626 1-556 (556) 25 protein:vir:102668 Length: 547 100.0 6E-35 3.7E-38 208.2 43.8 507 3-597 1-547 (547) 26 protein:vir:95315 Length: 559 100.0 3.3E-35 2.1E-38 209.6 41.7 523 3-619 1-559 (559) 27 protein:vir:1538 Length: 535 # 100.0 9.1E-35 5.6E-38 207.2 43.3 509 3-611 1-535 (535) 28 protein:vir:103765 Length: 549 100.0 3.1E-34 1.9E-37 204.3 44.0 515 1-605 1-549 (549) 29 protein:vir:3361 Length: 535 # 100.0 5E-34 3.1E-37 203.2 42.8 504 3-611 1-535 (535) 30 protein:vir:107822 Length: 555 100.0 1.5E-33 9.1E-37 200.6 45.1 521 3-618 1-555 (555) 31 protein:vir:98506 Length: 555 100.0 1.5E-33 9.1E-37 200.6 45.1 521 3-618 1-555 (555) 32 protein:vir:107404 Length: 555 100.0 1.5E-33 9.1E-37 200.6 45.1 521 3-618 1-555 (555) 33 protein:vir:10447 Length: 536 100.0 1.9E-33 1.2E-36 200.0 44.6 511 3-617 1-536 (536) 34 protein:vir:1785 Length: 555 # 100.0 1.8E-33 1.1E-36 200.2 43.2 525 1-620 1-555 (555) 35 protein:vir:2198 Length: 536 # 100.0 4.1E-33 2.5E-36 198.2 44.8 505 3-617 1-536 (536) 36 protein:vir:94572 Length: 535 100.0 9.3E-33 5.8E-36 196.2 44.9 510 1-611 3-535 (535) 37 protein:vir:99672 Length: 532 100.0 7.4E-33 4.6E-36 196.8 43.9 505 3-609 1-532 (532) 38 protein:vir:94709 Length: 522 100.0 1.6E-32 1E-35 194.9 44.9 496 1-603 1-522 (522) 39 protein:vir:100039 Length: 522 100.0 5.7E-33 3.6E-36 197.3 40.4 499 1-612 1-522 (522) 40 protein:vir:8883 Length: 543 # 100.0 1.3E-31 7.8E-35 190.0 39.4 519 1-619 2-543 (543) 41 protein:vir:78696 Length: 542 100.0 6.2E-32 3.8E-35 191.7 35.7 507 1-623 1-542 (542) 42 protein:vir:96988 Length: 516 100.0 2.2E-30 1.4E-33 183.2 40.0 488 1-594 5-516 (516) 43 protein:vir:6322 Length: 510 # 100.0 1.4E-29 8.5E-33 178.8 43.1 486 1-591 1-510 (510) 44 protein:vir:78942 Length: 510 100.0 1.6E-29 9.6E-33 178.5 42.8 487 1-605 1-510 (510) 45 protein:vir:103330 Length: 517 100.0 2.1E-29 1.3E-32 177.8 42.3 495 1-600 1-517 (517) 46 protein:vir:7017 Length: 515 # 100.0 5.4E-29 3.4E-32 175.5 43.5 489 3-587 1-515 (515) 47 protein:vir:80211 Length: 514 100.0 6.3E-28 3.9E-31 169.7 42.7 487 1-601 1-514 (514) 48 protein:vir:105641 Length: 516 100.0 4.4E-28 2.7E-31 170.6 41.8 486 1-587 1-516 (516) 49 protein:vir:103385 Length: 666 99.9 5.6E-28 3.4E-31 170.0 17.1 566 1-602 1-666 (666) 50 protein:vir:96403 Length: 666 99.9 1.5E-27 9.6E-31 167.6 18.4 566 1-602 1-666 (666) 51 protein:vir:3609 Length: 452 # 99.8 2.7E-18 1.7E-21 116.9 32.4 429 1-583 1-452 (452) 52 protein:vir:9871 Length: 429 # 99.8 1.3E-17 8.1E-21 113.1 33.5 416 3-583 1-429 (429) 53 protein:vir:80680 Length: 441 99.8 6.4E-17 4E-20 109.4 36.8 421 1-594 1-441 (441) 54 protein:vir:733 Length: 453 # 99.8 2.2E-17 1.4E-20 111.9 33.8 426 1-594 9-453 (453) 55 protein:vir:3964 Length: 453 # 99.8 1.2E-17 7.4E-21 113.4 31.7 430 1-583 9-453 (453) 56 protein:vir:106639 Length: 481 99.8 3.9E-16 2.4E-19 105.1 39.7 432 1-582 28-481 (481) 57 protein:vir:96494 Length: 501 99.8 6.6E-17 4.1E-20 109.3 35.5 445 1-592 36-501 (501) 58 protein:vir:96240 Length: 511 99.7 1.9E-17 1.2E-20 112.3 30.4 452 1-595 29-511 (511) 59 protein:vir:103951 Length: 511 99.7 1.8E-17 1.1E-20 112.4 30.1 452 1-595 29-511 (511) 60 protein:vir:95806 Length: 440 99.7 7.9E-17 4.9E-20 108.9 33.5 416 11-583 1-440 (440) 61 protein:vir:97171 Length: 512 99.7 3.5E-17 2.2E-20 110.8 31.1 451 1-595 31-512 (512) 62 protein:vir:99522 Length: 470 99.7 1.2E-15 7.3E-19 102.4 39.1 429 1-585 23-470 (470) 63 protein:vir:1236 Length: 483 # 99.7 4.2E-17 2.6E-20 110.4 31.0 433 1-594 29-483 (483) 64 protein:vir:38 Length: 496 # N 99.7 5.7E-17 3.5E-20 109.6 31.6 432 1-553 16-496 (496) 65 protein:vir:2732 Length: 501 # 99.7 2.6E-16 1.6E-19 106.1 35.0 442 1-592 37-501 (501) 66 protein:vir:96839 Length: 474 99.7 6.2E-16 3.9E-19 103.9 36.8 434 1-581 20-474 (474) 67 protein:vir:93747 Length: 472 99.7 5.2E-17 3.2E-20 109.9 30.3 434 1-598 18-472 (472) 68 protein:vir:105292 Length: 478 99.7 2.8E-17 1.7E-20 111.3 28.7 443 1-592 2-478 (478) 69 protein:vir:107112 Length: 478 99.7 4.4E-17 2.8E-20 110.2 29.5 440 1-581 2-478 (478) 70 protein:vir:94101 Length: 474 99.7 4.2E-17 2.6E-20 110.3 29.4 443 1-592 1-474 (474) 71 protein:vir:105889 Length: 474 99.7 4.2E-17 2.6E-20 110.3 29.4 443 1-592 1-474 (474) 72 protein:vir:96179 Length: 468 99.7 2.1E-15 1.3E-18 101.1 38.3 427 1-580 24-468 (468) 73 protein:vir:106571 Length: 499 99.7 6.2E-17 3.8E-20 109.5 29.6 472 1-602 1-499 (499) 74 protein:vir:9306 Length: 511 # 99.7 7.9E-17 4.9E-20 108.9 30.2 452 1-595 29-511 (511) 75 protein:vir:99781 Length: 511 99.7 4.8E-17 3E-20 110.1 28.6 448 1-588 29-511 (511) 76 protein:vir:78805 Length: 511 99.7 6.4E-17 4E-20 109.4 29.2 453 1-592 29-511 (511) 77 protein:vir:96366 Length: 511 99.7 6.4E-17 4E-20 109.4 29.2 453 1-592 29-511 (511) 78 protein:vir:80959 Length: 499 99.7 9.5E-16 5.9E-19 102.9 35.4 435 4-553 1-499 (499) 79 protein:vir:97336 Length: 492 99.7 8.8E-17 5.5E-20 108.6 29.6 434 1-598 38-492 (492) 80 protein:vir:102950 Length: 471 99.7 1.1E-15 6.7E-19 102.6 35.4 429 3-592 1-471 (471) 81 protein:vir:105461 Length: 470 99.7 4.6E-16 2.8E-19 104.7 33.2 433 3-583 1-470 (470) 82 protein:vir:1587 Length: 508 # 99.7 3.7E-16 2.3E-19 105.2 32.1 451 1-562 1-508 (508) 83 protein:vir:4898 Length: 502 # 99.7 1.5E-15 9E-19 101.9 34.7 449 1-592 31-502 (502) 84 protein:vir:2427 Length: 485 # 99.7 7.7E-16 4.8E-19 103.4 32.2 451 1-594 6-485 (485) 85 protein:vir:3028 Length: 500 # 99.7 8E-16 5E-19 103.3 32.2 435 1-551 1-500 (500) 86 protein:vir:9815 Length: 500 # 99.7 8E-16 5E-19 103.3 32.2 435 1-551 1-500 (500) 87 protein:vir:9751 Length: 422 # 99.7 1.5E-15 9.1E-19 101.9 33.2 410 3-559 1-422 (422) 88 protein:vir:9922 Length: 489 # 99.7 1.3E-15 8.4E-19 102.1 33.0 426 1-556 13-489 (489) 89 protein:vir:94805 Length: 492 99.7 6.7E-16 4.2E-19 103.8 30.8 437 1-592 38-492 (492) 90 protein:vir:95899 Length: 474 99.7 8.5E-16 5.2E-19 103.2 30.7 433 1-595 21-474 (474) 91 protein:vir:96266 Length: 474 99.7 8.5E-16 5.2E-19 103.2 30.7 433 1-595 21-474 (474) 92 protein:vir:101494 Length: 527 99.7 4.8E-16 3E-19 104.5 29.3 473 1-564 1-527 (527) 93 protein:vir:102239 Length: 527 99.7 4.9E-16 3.1E-19 104.5 29.3 473 1-564 1-527 (527) 94 protein:vir:94498 Length: 474 99.7 1.8E-14 1.1E-17 96.0 37.3 436 1-583 7-474 (474) 95 protein:vir:97447 Length: 474 99.7 1.8E-14 1.1E-17 96.0 37.3 436 1-583 7-474 (474) 96 protein:vir:79703 Length: 505 99.7 9E-15 5.6E-18 97.6 35.2 441 1-551 1-505 (505) 97 protein:vir:2341 Length: 488 # 99.7 1.7E-15 1E-18 101.6 30.9 459 1-596 5-488 (488) 98 protein:vir:79043 Length: 479 99.7 4.3E-15 2.7E-18 99.3 33.1 432 1-582 16-479 (479) 99 protein:vir:104082 Length: 485 99.7 9E-15 5.6E-18 97.6 34.0 450 1-591 8-485 (485) 100 protein:vir:94742 Length: 409 99.7 4.8E-15 3E-18 99.1 32.4 392 3-532 1-409 (409) 101 protein:vir:94546 Length: 506 99.7 5.2E-16 3.2E-19 104.4 27.1 435 1-585 20-506 (506) 102 protein:vir:98883 Length: 517 99.7 1.7E-15 1E-18 101.6 29.9 468 1-569 1-517 (517) 103 protein:vir:102330 Length: 451 99.7 4.9E-14 3.1E-17 93.5 37.9 427 3-581 1-451 (451) 104 protein:vir:1634 Length: 409 # 99.6 1.4E-14 8.7E-18 96.5 33.5 392 3-532 1-409 (409) 105 protein:vir:78227 Length: 480 99.6 2.2E-15 1.4E-18 100.9 28.2 452 1-597 1-480 (480) 106 protein:vir:5961 Length: 503 # 99.6 3.5E-15 2.2E-18 99.9 29.1 458 1-604 13-503 (503) 107 protein:vir:78083 Length: 537 99.6 9.4E-14 5.8E-17 92.0 36.8 469 1-598 6-537 (537) 108 protein:vir:9568 Length: 410 # 99.6 1.1E-14 6.7E-18 97.1 30.2 392 20-560 1-410 (410) 109 protein:vir:7430 Length: 563 # 99.6 5.4E-16 3.3E-19 104.3 22.7 476 1-551 1-563 (563) 110 protein:vir:95113 Length: 474 99.6 1E-13 6.2E-17 91.9 34.8 436 1-581 7-474 (474) 111 protein:vir:78537 Length: 480 99.6 5.9E-15 3.7E-18 98.6 27.7 452 1-597 1-480 (480) 112 protein:vir:2500 Length: 501 # 99.6 4.8E-15 3E-18 99.1 27.2 458 1-596 16-501 (501) 113 protein:vir:7768 Length: 484 # 99.6 1.9E-14 1.2E-17 95.8 29.7 451 1-601 9-484 (484) 114 protein:vir:4223 Length: 486 # 99.6 6.4E-14 4E-17 92.9 32.3 451 1-591 6-486 (486) 115 protein:vir:99072 Length: 479 99.6 6E-13 3.7E-16 87.6 37.4 450 1-589 2-479 (479) 116 protein:vir:7987 Length: 456 # 99.6 5.5E-13 3.4E-16 87.8 35.4 431 1-581 1-456 (456) 117 protein:vir:8184 Length: 474 # 99.6 9.8E-13 6.1E-16 86.4 36.1 428 1-583 12-474 (474) 118 protein:vir:78907 Length: 518 99.5 6.8E-13 4.2E-16 87.3 34.3 460 1-552 1-518 (518) 119 protein:vir:4782 Length: 522 # 99.5 4.7E-13 2.9E-16 88.2 33.3 469 1-564 1-522 (522) 120 protein:vir:105819 Length: 456 99.5 3.2E-12 2E-15 83.6 36.0 431 1-584 1-456 (456) 121 protein:vir:102602 Length: 456 99.5 3.2E-12 2E-15 83.6 36.0 431 1-584 1-456 (456) 122 protein:vir:99916 Length: 504 99.5 9.8E-13 6.1E-16 86.4 32.5 440 1-590 16-504 (504) 123 protein:vir:98444 Length: 434 99.2 2.7E-10 1.7E-13 73.0 27.2 407 36-582 1-434 (434) 124 protein:vir:94956 Length: 452 98.2 2.1E-06 1.3E-09 51.7 29.4 432 1-555 1-452 (452) 125 protein:vir:3520 Length: 720 # 97.7 2.9E-05 1.8E-08 45.5 31.1 583 1-656 5-720 (720) 126 protein:vir:97265 Length: 513 97.6 3.3E-05 2E-08 45.2 29.4 455 3-575 1-513 (513) 127 protein:vir:78393 Length: 489 97.6 4.5E-05 2.8E-08 44.4 28.1 445 1-561 1-489 (489) 128 protein:vir:8846 Length: 705 # 97.6 4.6E-05 2.9E-08 44.4 36.7 604 1-653 11-705 (705) 129 protein:vir:95014 Length: 491 97.4 7E-05 4.4E-08 43.4 31.9 443 1-558 1-491 (491) 130 protein:vir:817 Length: 714 # 97.2 0.00013 8.2E-08 41.9 24.9 582 1-654 16-714 (714) 131 protein:vir:9950 Length: 714 # 97.2 0.00013 8.2E-08 41.9 24.9 582 1-654 16-714 (714) 132 protein:vir:3296 Length: 714 # 97.2 0.00013 8.2E-08 41.9 24.9 582 1-654 16-714 (714) 133 protein:vir:2764 Length: 714 # 97.2 0.00013 8.2E-08 41.9 24.9 582 1-654 16-714 (714) 134 protein:vir:10117 Length: 714 97.2 0.00013 8.2E-08 41.9 24.9 582 1-654 16-714 (714) 135 protein:vir:172 Length: 708 # 97.2 0.00014 8.7E-08 41.7 25.2 578 1-658 6-708 (708) 136 protein:vir:96783 Length: 488 97.2 0.00014 8.9E-08 41.6 31.8 439 1-545 14-488 (488) 137 protein:vir:80453 Length: 535 97.2 0.00015 9.2E-08 41.6 26.5 444 1-534 1-535 (535) 138 protein:vir:105429 Length: 708 97.1 0.00017 1.1E-07 41.2 27.8 582 1-654 5-708 (708) 139 protein:vir:93630 Length: 776 96.7 0.0004 2.5E-07 39.2 24.4 596 1-663 43-731 (776) 140 protein:vir:100920 Length: 725 95.8 0.0014 8.6E-07 36.3 34.4 587 1-661 4-725 (725) 141 protein:vir:105520 Length: 706 95.6 0.0018 1.1E-06 35.7 29.8 581 8-663 1-690 (706) 142 protein:vir:105619 Length: 772 94.7 0.0036 2.3E-06 34.0 19.3 586 1-663 20-721 (772) 143 protein:vir:104437 Length: 714 93.6 0.007 4.4E-06 32.4 36.9 582 1-654 18-714 (714) 144 protein:vir:95149 Length: 501 92.5 0.011 7E-06 31.3 33.2 455 1-558 1-501 (501) 145 protein:vir:77597 Length: 725 92.4 0.012 7.2E-06 31.2 29.3 585 1-661 4-725 (725) 146 protein:vir:108295 Length: 711 90.5 0.02 1.3E-05 29.9 27.0 564 1-642 28-711 (711) 147 protein:vir:98853 Length: 219 89.0 0.029 1.8E-05 29.0 15.8 196 248-477 1-219 (219) 148 protein:vir:78641 Length: 278 88.8 0.03 1.9E-05 28.9 24.4 266 73-476 1-278 (278) 149 protein:vir:100150 Length: 437 88.1 0.034 2.1E-05 28.6 18.9 417 1-577 1-437 (437) 150 protein:vir:9263 Length: 725 # 85.1 0.055 3.4E-05 27.5 27.4 590 1-656 4-725 (725) 151 protein:vir:95821 Length: 763 77.9 0.12 7.4E-05 25.6 32.3 587 1-663 106-743 (763) 152 protein:vir:1084 Length: 437 # 71.9 0.19 0.00012 24.5 15.0 138 526-663 1-145 (437) 153 protein:vir:4952 Length: 386 # 64.9 0.29 0.00018 23.5 20.6 368 1-547 1-386 (386) 154 protein:vir:1084 Length: 437 # 59.7 0.39 0.00024 22.8 16.3 145 510-663 1-152 (437) 155 protein:vir:93610 Length: 454 51.6 0.58 0.00036 21.9 13.2 418 8-586 1-454 (454) 156 protein:vir:79538 Length: 502 35.7 1.2 0.00075 20.1 29.8 433 1-554 15-502 (502) 157 protein:vir:80128 Length: 466 33.9 1.3 0.00082 19.9 13.1 120 523-663 1-133 (466) 158 protein:vir:98816 Length: 446 31.8 1.5 0.00091 19.7 19.3 407 1-544 1-446 (446) 159 protein:vir:10362 Length: 432 30.3 1.6 0.00098 19.5 23.8 397 3-563 1-432 (432) 160 protein:vir:102118 Length: 409 29.2 1.7 0.001 19.3 23.1 391 10-563 1-409 (409) 161 protein:vir:1326 Length: 457 # 27.3 1.9 0.0012 19.1 13.9 419 1-582 1-457 (457) 162 protein:vir:3843 Length: 397 # 25.5 2 0.0013 18.9 23.0 369 1-553 1-397 (397) 163 protein:vir:96980 Length: 409 24.3 2.2 0.0014 18.7 26.1 389 3-573 1-409 (409) 164 protein:vir:99853 Length: 488 24.2 2.2 0.0014 18.7 34.1 468 3-659 1-488 (488) 165 protein:vir:79063 Length: 491 23.0 2.4 0.0015 18.5 30.3 454 1-645 13-491 (491) 166 protein:vir:3153 Length: 467 # 22.6 2.4 0.0015 18.5 23.0 411 44-582 1-467 (467) 167 protein:vir:94426 Length: 409 22.2 2.5 0.0015 18.4 23.1 361 3-515 1-409 (409) No 1 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=1.9e-133 Score=748.15 Aligned_cols=641 Identities=36% Similarity=0.633 Sum_probs=501.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCce Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN--GEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADI 78 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~--~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~ 78 (663) =.|++++++++|+++++.++.++....+....|.+||+ |+..+++.+|||+||+++|+++|+|++++|+++||++++| T Consensus 21 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~~~ve~~~~~l~~~f~~~~~~ 100 (763) T protein:vir:95 21 TSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVRRQAEWRYSALTEPFLGSNKL 100 (763) T ss_pred CCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHHHHHHHHHHHHHHhhcCCCcE Confidence 67999999999999999999999999988888888765 5666778899999999999999999999999999999999 Q ss_pred EEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccc------------- Q lcl|NC_021532. 79 IKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEA------------- 145 (663) Q Consensus 79 ~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~------------- 145 (663) |.|.|++++|+++|++.|.++||+|.++|+++.++++||+|++++|+||+||||+.+++.+...... T Consensus 101 ~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (763) T protein:vir:95 101 FKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQAD 180 (763) T ss_pred EEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHH Confidence 9999999999999999999999999999999999999999999999999999998665442221110 Q ss_pred -------------------ccc---------Cccccccccc----cccccceeecccceeeeccHHHheeCcccccChhh Q lcl|NC_021532. 146 -------------------VVV---------DEYGNETVVE----QEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDN 193 (663) Q Consensus 146 -------------------~~~---------~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d 193 (663) +.. .+.+...... ......+..+++|++++|+|++|||||+|+.|++| T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~D 260 (763) T protein:vir:95 181 ALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINK 260 (763) T ss_pred HHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCCchhh Confidence 000 0000000000 01111244578999999999999999999888999 Q ss_pred CceEEEEeecCHHHHHHhcC-CcChhhhhhccchhhh---ccccccccccccccccceEEEEEEEEEeeecCCceeEEEE Q lcl|NC_021532. 194 AQFVIHRYETDLSTLKKDGR-YKNLDKLAKTSGEDFD---YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIV 269 (663) Q Consensus 194 ~~~~~~~~~~~~~~l~~~g~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~ 269 (663) |+|+||++++|+++|.++|+ |.+++.+......... ........+.+.|.+.++|+|+|||+++|.++||++++|+ T Consensus 261 a~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~ 340 (763) T protein:vir:95 261 AMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIV 340 (763) T ss_pred CceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccCCcceeEEEE Confidence 99999999999999999875 4445554432221111 1112233456677788999999999999999999999999 Q ss_pred EEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc Q lcl|NC_021532. 270 CAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT 349 (663) Q Consensus 270 ~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 349 (663) ++|+|+++|+.+++||+|+++||++++++|+++++||+|+++.++|+|+.+|+++|+++|+++++++|+|++++|++++. T Consensus 341 v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~ 420 (763) T protein:vir:95 341 ATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDAL 420 (763) T ss_pred EEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred hhhhccCCcceEeCCCCCc---cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_021532. 350 NRKKFLAGANFEFNGTAND---FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATAT 426 (663) Q Consensus 350 d~~~~~p~~vi~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~ 426 (663) +...++||++++++++.+. +.+.+++++++..+.++++....++++|||+++++|.++++.++||++++++++++++ T Consensus 421 d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~ 500 (763) T protein:vir:95 421 NSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASK 500 (763) T ss_pred hhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHH Confidence 8888999999999977553 5566788899999999999999999999999999999988888999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHh Q lcl|NC_021532. 427 RRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTL 506 (663) Q Consensus 427 ~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~ 506 (663) ++..++++|++ +++++|++++.||++||+++++|||+|++|++|+++++.++|||.|.++++.....+.+++..+++.+ T Consensus 501 ~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~~as~~~q~~~~l~~ll~~l 579 (763) T protein:vir:95 501 REMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDISTAEVDNQKSQDLGFMLQTI 579 (763) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecccchHHHHHHHHHHHHHHHh Confidence 99999999987 67999999999999999999999999999999999999999999988887766666777788888889 Q ss_pred ccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 507 GPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKR 586 (663) Q Consensus 507 ~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~ 586 (663) ++.++|.+...++..++++.++++.++.++..++++++.++++.++++.+++++.+.++++++..+++++..+++++..+ T Consensus 580 ~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~ 659 (763) T protein:vir:95 580 GPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKN 659 (763) T ss_pred ccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999988888888888999999888888888888777766666555555555544444444333333332222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHhhhh Q lcl|NC_021532. 587 SKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANL----------EQMLAQRNAGD 656 (663) Q Consensus 587 ~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~----------e~~~~~~~~~~ 656 (663) +++ ++++.+.++..+++....+.+++.+.+.++.+.+. +....+-.... T Consensus 660 ~d~---------------------~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~~~~~~~~~~~~~~~ 718 (763) T protein:vir:95 660 LDY---------------------LEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELPPNLSAAIGYNALTN 718 (763) T ss_pred HHH---------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHHhhhhccccc Confidence 111 11111111111111111112222222222222111 11111222234 Q ss_pred hhccccC Q lcl|NC_021532. 657 TNIGVVE 663 (663) Q Consensus 657 ~~~~~~~ 663 (663) ++++.+. T Consensus 719 ~~~~~~~ 725 (763) T protein:vir:95 719 GEDTGIQ 725 (763) T ss_pred ccCCCcc Confidence 4444444 No 2 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=5e-121 Score=680.14 Aligned_cols=645 Identities=20% Similarity=0.237 Sum_probs=441.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceE Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDS-LISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADII 79 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~ 79 (663) =+|+++++++.|...++.|.++++..++ .+.++.+||+|++++...+|+|+||++.|+++|+|++++|+++||+|+++| T Consensus 8 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~~~~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~ 87 (705) T protein:vir:88 8 KPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVV 87 (705) T ss_pred ccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcccCCCCccccHHHHHHHHHHHHHHHHhhcCCCceE Confidence 5689999999999999999999997665 678889999999999999999999999999999999999999999999999 Q ss_pred EEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC---------- Q lcl|NC_021532. 80 KCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD---------- 149 (663) Q Consensus 80 ~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~---------- 149 (663) +|.|++++|+++|+++|.++||+|.++++++++++++|+|++++|+||++|+|+.+...+......++.. T Consensus 88 ~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~ 167 (705) T protein:vir:88 88 KYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPD 167 (705) T ss_pred EEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhh Confidence 9999999999999999999999999999999999999999999999999999986555443322211111 Q ss_pred ----cccc--ccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhh-hhh Q lcl|NC_021532. 150 ----EYGN--ETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK-LAK 222 (663) Q Consensus 150 ----~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~-~~~ 222 (663) +... .+.....+. ....++++.+++|+|++|||||+|+ +++||+|++|++++|+++|+++||+.+..+ +.. T Consensus 168 ~~~~~~~~~~~~~~~~~~~-~~~~~~~i~i~~V~p~d~~~dp~a~-~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~ 245 (705) T protein:vir:88 168 TSILAQSVDDDGTYTIKIR-KDKKKREIKVLCVKPENFLVDRLAT-CIDDARFLCHREKYTVSDLRLLGVPEDVIEELPY 245 (705) T ss_pred hhcccccccccceeeeEEe-eeeecCceeeeeccHHHceecCCCC-CcccCcEEEEEEeccHHHHHhhcCChhHhhhhhc Confidence 1111 111111122 2234678899999999999999987 699999999999999999999998876432 221 Q ss_pred ccch----hhh---ccccccc-----cccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 223 TSGE----DFD---YDSPDDT-----EFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 223 ~~~~----~~~---~~~~~~~-----~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~ 290 (663) .... ..+ ...++.. ...+.+...++|++||||+++++++||+.++++++|+|+++|+.. | ++++ T Consensus 246 ~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~--~--~~~~ 321 (705) T protein:vir:88 246 DEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNE--P--WDCR 321 (705) T ss_pred ccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccccc--c--CCCC Confidence 1111 010 0001110 111233445679999999999999999999999999999999753 2 3679 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFW 370 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~ 370 (663) ||++++++|+|+++||+|+++.++|+|+.+|+++|+++|+++++++|++++++|++++.+.++++||++++++++ +++. T Consensus 322 PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~~d~~~~~pg~vv~~~~~-~~i~ 400 (705) T protein:vir:88 322 PFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVKSM-NSIT 400 (705) T ss_pred CEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCcccccccCCCeeEEecCC-Cccc Confidence 999999999999999999999999999999999999999999999999999999999989999999999999854 6799 Q ss_pred cccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 371 HGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSL--GSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~--~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) +++++++|++.++|++++.+.++++|||+++++|.++++. +.||++++++++++++++..++++|++++++++|++++ T Consensus 401 ~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~ 480 (705) T protein:vir:88 401 PLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLH 480 (705) T ss_pred cccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999887765 46999999999999999999999999889999999999 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHH-HHHHHHHHHHH---hcc--CCCcchh----HHH Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAA-KSQELSFLLQT---LGP--NEDPKIR----RDI 518 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~-~~q~l~~~~~~---~~~--~~~p~~~----~~~ 518 (663) .||++||+++++|||+| +|++|+|+++.++|++.+.++.+..+.. +.+.+..++++ +.+ .+.+.+. ..+ T Consensus 481 ~li~~~~~~~~~~ri~g-~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~ 559 (705) T protein:vir:88 481 DHAIKYQNQEEVFQLRG-KWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNI 559 (705) T ss_pred HHHHHhCCCceEEeecc-chhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHH Confidence 99999999999999997 6999999999999999988877665433 33344444433 222 1222222 223 Q ss_pred HHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 519 MADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELE-----------NLMLENQMLVASINDKNARANENTIDAELKRS 587 (663) Q Consensus 519 l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~-----------~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~ 587 (663) +..+++..++.+..+ +...+...++.+.+++.. +++++.++.+++.+.++++++..+.+++++++ T Consensus 560 ~~el~e~~~~k~~~~----~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~ 635 (705) T protein:vir:88 560 LKEVTENAGYKDPDR----FWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLA 635 (705) T ss_pred HHHHHHhhhhhhHHH----HhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333344433332222 222221111111111111 11111111111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccc Q lcl|NC_021532. 588 KAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGV 661 (663) Q Consensus 588 ~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~ 661 (663) +.+.++++++..+.+.+...++... +++..+.++...+.+.+++..+.++...-++.+.+.++-...+-- T Consensus 636 e~e~~~~~~~~~~~e~~~~~a~~~~----~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 636 EIELKKQEAVLQQREMALKEAELQL----ERDRFTWERARNEAEYHLEATQARAAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcC Confidence 1111111111000000000000000 000000000000000001100100000000000000000000000 No 3 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=100.00 E-value=8.1e-104 Score=585.78 Aligned_cols=604 Identities=14% Similarity=0.134 Sum_probs=406.3 Q ss_pred CCCcHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 1 MKINKAE---LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIVDP 71 (663) Q Consensus 1 ~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~~~ 71 (663) =+.+.++ +++.|...|++......++.....+..+||+|++|+ ++.+|++++|+|.|.++|+|++|++... T Consensus 36 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~n 115 (776) T protein:vir:93 36 NPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKRG 115 (776) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHhC Confidence 3454444 667777778888777778888888899999999996 4569999999999999999999999865 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) +++ ++|.|++++|++.|++++.+++|+. ..++....++++|+|+++||+||++|+||++.+. T Consensus 116 r~~----~~~~p~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~------------- 177 (776) T protein:vir:93 116 RSD----FKVLPRRKDGGKAAERKTALLKYLS-DVNHTPFERSMAFEETTKAGIGWLESQVQDENDG------------- 177 (776) T ss_pred Ccc----eEEecCChhHHHHHHHHHHHHHHHH-HhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCC------------- Confidence 554 9999999999999999999999976 5688888999999999999999999999864321 Q ss_pred ccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccch---- Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGE---- 226 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~---- 226 (663) ..+..++|+|++|||||+++ .|++||+|++++.|+|+++++++. ....+.+...... T Consensus 178 -----------------~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~ 239 (776) T protein:vir:93 178 -----------------EPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIF-PERAAQLRAAAVDNFET 239 (776) T ss_pred -----------------CceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhc-CCchHHHHHhhhhcccc Confidence 11245679999999999776 489999999999999999999983 2222222111100 Q ss_pred ----hh---h-------ccccccccccccccccceEEEEEEEEEeeec-------------------------------- Q lcl|NC_021532. 227 ----DF---D-------YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD-------------------------------- 260 (663) Q Consensus 227 ----~~---~-------~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~-------------------------------- 260 (663) +. + ...++...+.+.+.++++|+|+|||+|..+. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~ 319 (776) T protein:vir:93 240 WGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRA 319 (776) T ss_pred cchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCce Confidence 00 0 0011122234556677899999999985321 Q ss_pred ---CCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021532. 261 ---GDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNG 337 (663) Q Consensus 261 ---~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~ 337 (663) ..++.++++++|+|+++|+.+++||+|++|||++++++++++++||+|+++.++|+|+++|+++|+++++++ +. T Consensus 320 ~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~---~~ 396 (776) T protein:vir:93 320 VLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS---TN 396 (776) T ss_pred eehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc---CC Confidence 112356788999999999999999999999999999999999999999999999999999999999999874 45 Q ss_pred cEEeeccccCcchhhh---ccCCcceEeCCCCC-ccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhH Q lcl|NC_021532. 338 QVAIRKGALDQTNRKK---FLAGANFEFNGTAN-DFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGST 413 (663) Q Consensus 338 ~~~~~~~~i~~~d~~~---~~p~~vi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~t 413 (663) ++++++|++++.+..+ ++||+++++++++. .+...+.++++++++++++++.+.++++|||+++++|..+++.|++ T Consensus 397 ~~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ 476 (776) T protein:vir:93 397 KVLMEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGV 476 (776) T ss_pred ceeeccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHH Confidence 8999999998766544 78999999998753 3455667789999999999999999999999999999998887766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchhhc-----CCceEEEE Q lcl|NC_021532. 414 ATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKDDL-----SGRIDIDI 484 (663) Q Consensus 414 A~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~~~-----~~~~d~~v 484 (663) | +.+++++|++++..+++||++ +++.+|+++|.||.+||+++++|||+|+ +||.||...+ .++|||.| T Consensus 477 a--i~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v 553 (776) T protein:vir:93 477 A--IQARQEQGSVATNKLFDNLRL-AFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFII 553 (776) T ss_pred H--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEE Confidence 6 788889999999999999976 7889999999999999999999999975 6999986543 36677777 Q ss_pred eecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh---hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHH Q lcl|NC_021532. 485 SISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP---EQAKRMREYEPKPDPVQEKIRQLELENLMLEN 561 (663) Q Consensus 485 ~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~ 561 (663) ..+.+..+. +.+++..|+++++ .++|.+...++..++++++++ +..+.++...+.+++.+.+..+.++++++++. T Consensus 554 ~~~~~~~s~-r~~~~~~l~ql~~-~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~ 631 (776) T protein:vir:93 554 DEAEWRATM-RQAAVAELMEVIG-KMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQ 631 (776) T ss_pred eecccchhH-HHHHHHHHHHHHh-hcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhh Confidence 665443322 3334444444443 356677666666666666655 45555555554444433333222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 562 QMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKH 641 (663) Q Consensus 562 ~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~ 641 (663) +..+.+.+.+.+++...++++. +++++ +++.++++.+...+... ...+ ..+.+++. ..+...+.+ T Consensus 632 ~~~q~q~~~~~a~~~~~qa~a~--~~~ae-----a~~~~aqa~~~~~~a~~--~~~~----a~q~a~qa--~~~~~~~~~ 696 (776) T protein:vir:93 632 QQQQYNDALAIATLEEQQAKAR--KAAAE-----AQVAEAKAKHISRMAIR--EGVG----AVKDATDA--ATAIAFMPE 696 (776) T ss_pred HHHHHHHHHhhhhhhHhhHHHH--HHHHH-----HHHHhhhhhhhhhcchh--hhhh----hhhhhhhh--hhhhhhhhh Confidence 2111111111111111111111 11111 11111111110000000 0000 00000000 000000000 Q ss_pred HHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 642 RANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 642 ~~~~e~~~~~~~~~~~~~~~~~ 663 (663) .........+...++..=.-++ T Consensus 697 ~a~~a~~~~~~a~~~~p~~p~~ 718 (776) T protein:vir:93 697 LAGLSDGILRESGWDDPNTPQP 718 (776) T ss_pred hhhhhhhhhccccccccccccc Confidence 0000000000000000000011 No 4 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=100.00 E-value=2.9e-101 Score=571.79 Aligned_cols=589 Identities=12% Similarity=0.077 Sum_probs=403.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) =+.+.+.++..+...|+.+..+..++.....+..+||+|++|+ ++.+|++++|+|.|++.|++++|+....+.+ T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~ 103 (711) T protein:vir:10 24 NNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPA 103 (711) T ss_pred CcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHhhhHhhCCcc Confidence 5566777899999999999999989888888999999999996 4679999999999999999999999866665 Q ss_pred CCceEEEEeCC----------------------cchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeee Q lcl|NC_021532. 75 TADIIKCTPIT----------------------WEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGW 132 (663) Q Consensus 75 ~~~~~~~~p~~----------------------~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~ 132 (663) ++|.|+. .+|.+.|++++.+++|+.+ .++....++++|+|+++||+||++++| T Consensus 104 ----~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~d~~~~G~G~~ev~~ 178 (711) T protein:vir:10 104 ----IKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEY-NCDAETEYDIAFQGAVESGMGYLRVRS 178 (711) T ss_pred ----eEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHH-hcChhHHHHHHHHHhhhcCcceEEEEe Confidence 9999985 6789999999999999765 567777899999999999999999999 Q ss_pred ccccceecccccccccCccccccccccccccceeecccceeeec-cHHHheeCcccc-cChhhCceEEEEeecCHHHHHH Q lcl|NC_021532. 133 DYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVC-RNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKK 210 (663) Q Consensus 133 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~ 210 (663) ||.++.. ..+.+.+.+| +|.+|||||.++ .|++||+|++++.|+|++++++ T Consensus 179 d~~~~d~---------------------------~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~ 231 (711) T protein:vir:10 179 DYLADDS---------------------------FEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) T ss_pred cccCCCC---------------------------CCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHH Confidence 8754211 1133455556 799999999665 5999999999999999999999 Q ss_pred hcCCcCh--hhhhhccchhhhccccccccccccccccceEEEEEEEEEeeec------CCc------------------- Q lcl|NC_021532. 211 DGRYKNL--DKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD------GDG------------------- 263 (663) Q Consensus 211 ~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~------~~g------------------- 263 (663) + |++. +.+.... ...... +...++|+|.|||++.... ++| T Consensus 232 ~--yp~~a~~~~~~~~---------~~~~~~--~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 298 (711) T protein:vir:10 232 L--YPDATAEPVYEDS---------VADYDT--WFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAG 298 (711) T ss_pred h--CCchhhhhhhccc---------ccccCc--ccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcC Confidence 8 3332 1111111 111111 2234789999999874321 111 Q ss_pred ----------eeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeee--ecCcccCCChHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 264 ----------IAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNS--IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNM 331 (663) Q Consensus 264 ----------~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~--~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~ 331 (663) ..+.++++|+|+++| .+++||+|++|||+++++++ ++++.+++|+++.++|+|+++|+++|++++++ T Consensus 299 ~~~~~~~~~~~~~v~~~~~~G~~~L-~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l 377 (711) T protein:vir:10 299 ISIVRTRKVKTFKTYWRKITGANVL-EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETV 377 (711) T ss_pred chhhhhhhhceeeEEEEEEecceee-cCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHH Confidence 124567788999999 56799999999999887654 57888899999999999999999999999999 Q ss_pred HhcCCCcEEeeccccCcchhh----hccCCcceEeCCCCC---ccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcC Q lcl|NC_021532. 332 AQSNNGQVAIRKGALDQTNRK----KFLAGANFEFNGTAN---DFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGG 404 (663) Q Consensus 332 ~~~~~~~~~~~~~~i~~~d~~----~~~p~~vi~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G 404 (663) ++++++++++++|+|++.+.. +.+||++++++++.. .+.+.+++++|+++++|+++..+.++++|||+++++| T Consensus 378 ~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G 457 (711) T protein:vir:10 378 ALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLG 457 (711) T ss_pred HhcCCCceeecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcC Confidence 999999999999999865432 378999999998754 5778889999999999999999999999999999999 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchhhc---- Q lcl|NC_021532. 405 INSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKDDL---- 476 (663) Q Consensus 405 ~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~~~---- 476 (663) ..+|+.|++| +++++++|++++..+++||++ +++.+|+++|+||.+||+++++|||+|+ +|+.||+..+ T Consensus 458 ~~~n~~Sg~a--i~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~ 534 (711) T protein:vir:10 458 AMGNETSGRA--IIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEES 534 (711) T ss_pred CCccchHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEeccccccccc Confidence 9988766655 888889999999999999976 7889999999999999999999999985 6888886532 Q ss_pred ----------CCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh---hhhhhhhhhhhcchh Q lcl|NC_021532. 477 ----------SGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM---PEQAKRMREYEPKPD 543 (663) Q Consensus 477 ----------~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~---~e~~~~l~~~~~~~~ 543 (663) .++|||.|.++.+.. +..++....|..+.+.+ |...+.++..+++++++ .++++.++...+++. T Consensus 535 G~~~~~nDi~~g~~Dv~i~~~p~~~--s~r~~~~~~l~ql~~~~-p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~ 611 (711) T protein:vir:10 535 GEWVTIHDLNVQKYDVVVTTGPAFA--TQRIEAAEAMIQFAQAV-PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV 611 (711) T ss_pred ccceeeeccceeeeEEEEeeccCch--hHHHHHHHHHHHHHhhc-chhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCccc Confidence 356777777655433 33333333333333333 44444454555555554 456667776665544 Q ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 544 PVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARK--LSSEADMTDLKFVKEDNGYAHLE 621 (663) Q Consensus 544 ~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~--~~~~~~~~~~e~~~~~~~~~~~~ 621 (663) +.++...+.++.+++. ++..++.+.++++++....+++++.+++++++++++. .+.+.+...++. .++.. T Consensus 612 ~~~~~~~~~qq~~~e~--qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~------~aq~~ 683 (711) T protein:vir:10 612 LSKDEREAIEEDMPEQ--TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIED------MAQGG 683 (711) T ss_pred CcchhhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHH Confidence 3322222222111111 1111111111122222222222222223332222111 111111000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021532. 622 QVELEDLRHAQHLEREAMKHRANLEQMLAQRNAG 655 (663) Q Consensus 622 ~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~ 655 (663) .+..++. ..+..+.+.++.....+..+| T Consensus 684 ~~~~qq~------~~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 684 DVVYQQV------RELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHH------HHHHHHHHHHHHHHHHHhhcC Confidence 1111110 011111111111111111111 No 5 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=100.00 E-value=1.8e-99 Score=561.98 Aligned_cols=605 Identities=12% Similarity=0.065 Sum_probs=395.5 Q ss_pred CC--CcHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK--INKA---ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~--~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |. .+.+ ++...+...|........++.....+..+||+|++|+ ++.+|+|++|+|.|.+.|++++|+.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 22 2221 2334455556666666666777778889999999996 46799999999999999999999998 Q ss_pred HhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ..+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||++++++++.. T Consensus 88 ~nr~~----~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~---------- 152 (714) T protein:vir:27 88 KTRTD----LVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF---------- 152 (714) T ss_pred hCCcc----eEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccCCC---------- Confidence 76666 99999987665 789999999999876 56677788999999999999999999874321 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc--c Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT--S 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~--~ 224 (663) .+.+.+++|+|.+|||||.++ .|++||+|++|++|+|+++++++..+ ..+.+... + T Consensus 153 --------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~-~a~~i~~~~~~ 211 (714) T protein:vir:27 153 --------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG-MAQVIDYAIDD 211 (714) T ss_pred --------------------CCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCC-chhhhhhhhhh Confidence 234678999999999999654 69999999999999999999998322 11111100 0 Q ss_pred ch---h---------------hhccccccccccccccccceEEEEEEEEEeee---------------cC---------- Q lcl|NC_021532. 225 GE---D---------------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG---------- 261 (663) Q Consensus 225 ~~---~---------------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~---------- 261 (663) .. + .....++...+.|.+..+++|+|+|||++... ++ T Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~ 291 (714) T protein:vir:27 212 WRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVA 291 (714) T ss_pred hccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHh Confidence 00 0 00111222334556677889999999997431 11 Q ss_pred --------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 262 --------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQ 333 (663) Q Consensus 262 --------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~ 333 (663) ..+.++++++|+|+++|+.+++||+|++|||++++++........+|+++.++|+|+.+|++.|++++++ T Consensus 292 ~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l-- 369 (714) T protein:vir:27 292 SGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL-- 369 (714) T ss_pred hcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh-- Confidence 1235678889999999999999999999999999888876555567899999999999999999998865 Q ss_pred cCCCcEEeeccccCcch----hhhccCCcceEeCCCC-------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 334 SNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 334 ~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) +++ ++++.+|+++..+ ...++||+++.++++. ..+.+.+.+++|+++++++++..+.++++|||++++ T Consensus 370 ~~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~ 448 (714) T protein:vir:27 370 QAK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAF 448 (714) T ss_pred cCC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHH Confidence 455 4567788876543 2348999999998652 346677788999999999999999999999999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------eeeccchhh Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KFVPIRKDD 475 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~v~i~~~~ 475 (663) +|..+|+.|++| +++++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ .++.+|++. T Consensus 449 lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~ 525 (714) T protein:vir:27 449 LGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred cCCCccchhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecccc Confidence 999999988877 888889999999999999965 6899999999999999999999999974 388888765 Q ss_pred c---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhh---hhhhhhhhhhhhcch Q lcl|NC_021532. 476 L---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMR---MPEQAKRMREYEPKP 542 (663) Q Consensus 476 ~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~---~~e~~~~l~~~~~~~ 542 (663) . .++|||.|..+. .+.+..+ .+..|++++ +.++|.....++..++++++ ..++++.+++..+++ T Consensus 526 ~~~~~~nDi~~~~~Dv~i~~~p--~~~t~r~~~~~~l~~l~-~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:27 526 DNGELTNDISRLNTHIALAPVQ--QTPAFKAQLAQRMSEVI-QGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred CcceecccceeeeEEEEEeecc--CchHHHHHHHHHHHHHH-hhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 3 355666655544 3333333 333333333 33555544444443444444 446777887776654 Q ss_pred hhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQ---LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAH 619 (663) Q Consensus 543 ~~~~~q~~q---~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~ 619 (663) ++......+ .++++++++.++++.+.+..+++.++.+++++..++++..+..+++...+.++. ++......+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~-----~~~~~~~~~ 677 (714) T protein:vir:27 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QRYVDALNQ 677 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 432222111 111111111111112222222222222222222222222222222111111111 000000110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +..++ .++..+-++.+. ...++ +.++.++..+-+++ T Consensus 678 a~~a~--~~~~~~~~~~~~-----~~~~~-q~~q~~~~~~~~~~ 713 (714) T protein:vir:27 678 AHTAE--IITGVQNMEQEQ-----DVLQQ-QMLYTLQQRMNEMS 713 (714) T ss_pred HHHHH--HHHhHhhhhhhh-----HHHHH-HHHHHHHHHHHhcC Confidence 10000 011111111110 00000 11222222233333 No 6 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=100.00 E-value=1.8e-99 Score=561.98 Aligned_cols=605 Identities=12% Similarity=0.065 Sum_probs=395.5 Q ss_pred CC--CcHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK--INKA---ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~--~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |. .+.+ ++...+...|........++.....+..+||+|++|+ ++.+|+|++|+|.|.+.|++++|+.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 22 2221 2334455556666666666777778889999999996 46799999999999999999999998 Q ss_pred HhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ..+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||++++++++.. T Consensus 88 ~nr~~----~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~---------- 152 (714) T protein:vir:99 88 KTRTD----LVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF---------- 152 (714) T ss_pred hCCcc----eEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccCCC---------- Confidence 76666 99999987665 789999999999876 56677788999999999999999999874321 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc--c Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT--S 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~--~ 224 (663) .+.+.+++|+|.+|||||.++ .|++||+|++|++|+|+++++++..+ ..+.+... + T Consensus 153 --------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~-~a~~i~~~~~~ 211 (714) T protein:vir:99 153 --------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG-MAQVIDYAIDD 211 (714) T ss_pred --------------------CCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCC-chhhhhhhhhh Confidence 234678999999999999654 69999999999999999999998322 11111100 0 Q ss_pred ch---h---------------hhccccccccccccccccceEEEEEEEEEeee---------------cC---------- Q lcl|NC_021532. 225 GE---D---------------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG---------- 261 (663) Q Consensus 225 ~~---~---------------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~---------- 261 (663) .. + .....++...+.|.+..+++|+|+|||++... ++ T Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~ 291 (714) T protein:vir:99 212 WRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVA 291 (714) T ss_pred hccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHh Confidence 00 0 00111222334556677889999999997431 11 Q ss_pred --------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 262 --------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQ 333 (663) Q Consensus 262 --------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~ 333 (663) ..+.++++++|+|+++|+.+++||+|++|||++++++........+|+++.++|+|+.+|++.|++++++ T Consensus 292 ~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l-- 369 (714) T protein:vir:99 292 SGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL-- 369 (714) T ss_pred hcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh-- Confidence 1235678889999999999999999999999999888876555567899999999999999999998865 Q ss_pred cCCCcEEeeccccCcch----hhhccCCcceEeCCCC-------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 334 SNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 334 ~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) +++ ++++.+|+++..+ ...++||+++.++++. ..+.+.+.+++|+++++++++..+.++++|||++++ T Consensus 370 ~~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~ 448 (714) T protein:vir:99 370 QAK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAF 448 (714) T ss_pred cCC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHH Confidence 455 4567788876543 2348999999998652 346677788999999999999999999999999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------eeeccchhh Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KFVPIRKDD 475 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~v~i~~~~ 475 (663) +|..+|+.|++| +++++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ .++.+|++. T Consensus 449 lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~ 525 (714) T protein:vir:99 449 LGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred cCCCccchhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecccc Confidence 999999988877 888889999999999999965 6899999999999999999999999974 388888765 Q ss_pred c---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhh---hhhhhhhhhhhhcch Q lcl|NC_021532. 476 L---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMR---MPEQAKRMREYEPKP 542 (663) Q Consensus 476 ~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~---~~e~~~~l~~~~~~~ 542 (663) . .++|||.|..+. .+.+..+ .+..|++++ +.++|.....++..++++++ ..++++.+++..+++ T Consensus 526 ~~~~~~nDi~~~~~Dv~i~~~p--~~~t~r~~~~~~l~~l~-~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:99 526 DNGELTNDISRLNTHIALAPVQ--QTPAFKAQLAQRMSEVI-QGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred CcceecccceeeeEEEEEeecc--CchHHHHHHHHHHHHHH-hhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 3 355666655544 3333333 333333333 33555544444443444444 446777887776654 Q ss_pred hhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQ---LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAH 619 (663) Q Consensus 543 ~~~~~q~~q---~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~ 619 (663) ++......+ .++++++++.++++.+.+..+++.++.+++++..++++..+..+++...+.++. ++......+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~-----~~~~~~~~~ 677 (714) T protein:vir:99 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QRYVDALNQ 677 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 432222111 111111111111112222222222222222222222222222222111111111 000000110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +..++ .++..+-++.+. ...++ +.++.++..+-+++ T Consensus 678 a~~a~--~~~~~~~~~~~~-----~~~~~-q~~q~~~~~~~~~~ 713 (714) T protein:vir:99 678 AHTAE--IITGVQNMEQEQ-----DVLQQ-QMLYTLQQRMNEMS 713 (714) T ss_pred HHHHH--HHHhHhhhhhhh-----HHHHH-HHHHHHHHHHHhcC Confidence 10000 011111111110 00000 11222222233333 No 7 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=100.00 E-value=1.8e-99 Score=561.98 Aligned_cols=605 Identities=12% Similarity=0.065 Sum_probs=395.5 Q ss_pred CC--CcHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK--INKA---ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~--~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |. .+.+ ++...+...|........++.....+..+||+|++|+ ++.+|+|++|+|.|.+.|++++|+.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 22 2221 2334455556666666666777778889999999996 46799999999999999999999998 Q ss_pred HhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ..+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||++++++++.. T Consensus 88 ~nr~~----~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~---------- 152 (714) T protein:vir:81 88 KTRTD----LVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF---------- 152 (714) T ss_pred hCCcc----eEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccCCC---------- Confidence 76666 99999987665 789999999999876 56677788999999999999999999874321 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc--c Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT--S 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~--~ 224 (663) .+.+.+++|+|.+|||||.++ .|++||+|++|++|+|+++++++..+ ..+.+... + T Consensus 153 --------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~-~a~~i~~~~~~ 211 (714) T protein:vir:81 153 --------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG-MAQVIDYAIDD 211 (714) T ss_pred --------------------CCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCC-chhhhhhhhhh Confidence 234678999999999999654 69999999999999999999998322 11111100 0 Q ss_pred ch---h---------------hhccccccccccccccccceEEEEEEEEEeee---------------cC---------- Q lcl|NC_021532. 225 GE---D---------------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG---------- 261 (663) Q Consensus 225 ~~---~---------------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~---------- 261 (663) .. + .....++...+.|.+..+++|+|+|||++... ++ T Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~ 291 (714) T protein:vir:81 212 WRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVA 291 (714) T ss_pred hccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHh Confidence 00 0 00111222334556677889999999997431 11 Q ss_pred --------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 262 --------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQ 333 (663) Q Consensus 262 --------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~ 333 (663) ..+.++++++|+|+++|+.+++||+|++|||++++++........+|+++.++|+|+.+|++.|++++++ T Consensus 292 ~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l-- 369 (714) T protein:vir:81 292 SGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL-- 369 (714) T ss_pred hcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh-- Confidence 1235678889999999999999999999999999888876555567899999999999999999998865 Q ss_pred cCCCcEEeeccccCcch----hhhccCCcceEeCCCC-------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 334 SNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 334 ~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) +++ ++++.+|+++..+ ...++||+++.++++. ..+.+.+.+++|+++++++++..+.++++|||++++ T Consensus 370 ~~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~ 448 (714) T protein:vir:81 370 QAK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAF 448 (714) T ss_pred cCC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHH Confidence 455 4567788876543 2348999999998652 346677788999999999999999999999999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------eeeccchhh Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KFVPIRKDD 475 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~v~i~~~~ 475 (663) +|..+|+.|++| +++++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ .++.+|++. T Consensus 449 lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~ 525 (714) T protein:vir:81 449 LGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred cCCCccchhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecccc Confidence 999999988877 888889999999999999965 6899999999999999999999999974 388888765 Q ss_pred c---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhh---hhhhhhhhhhhhcch Q lcl|NC_021532. 476 L---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMR---MPEQAKRMREYEPKP 542 (663) Q Consensus 476 ~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~---~~e~~~~l~~~~~~~ 542 (663) . .++|||.|..+. .+.+..+ .+..|++++ +.++|.....++..++++++ ..++++.+++..+++ T Consensus 526 ~~~~~~nDi~~~~~Dv~i~~~p--~~~t~r~~~~~~l~~l~-~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:81 526 DNGELTNDISRLNTHIALAPVQ--QTPAFKAQLAQRMSEVI-QGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred CcceecccceeeeEEEEEeecc--CchHHHHHHHHHHHHHH-hhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 3 355666655544 3333333 333333333 33555544444443444444 446777887776654 Q ss_pred hhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQ---LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAH 619 (663) Q Consensus 543 ~~~~~q~~q---~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~ 619 (663) ++......+ .++++++++.++++.+.+..+++.++.+++++..++++..+..+++...+.++. ++......+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~-----~~~~~~~~~ 677 (714) T protein:vir:81 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QRYVDALNQ 677 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 432222111 111111111111112222222222222222222222222222222111111111 000000110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +..++ .++..+-++.+. ...++ +.++.++..+-+++ T Consensus 678 a~~a~--~~~~~~~~~~~~-----~~~~~-q~~q~~~~~~~~~~ 713 (714) T protein:vir:81 678 AHTAE--IITGVQNMEQEQ-----DVLQQ-QMLYTLQQRMNEMS 713 (714) T ss_pred HHHHH--HHHhHhhhhhhh-----HHHHH-HHHHHHHHHHHhcC Confidence 10000 011111111110 00000 11222222233333 No 8 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=100.00 E-value=1.8e-99 Score=561.98 Aligned_cols=605 Identities=12% Similarity=0.065 Sum_probs=395.5 Q ss_pred CC--CcHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK--INKA---ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~--~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |. .+.+ ++...+...|........++.....+..+||+|++|+ ++.+|+|++|+|.|.+.|++++|+.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 22 2221 2334455556666666666777778889999999996 46799999999999999999999998 Q ss_pred HhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ..+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||++++++++.. T Consensus 88 ~nr~~----~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~---------- 152 (714) T protein:vir:10 88 KTRTD----LVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF---------- 152 (714) T ss_pred hCCcc----eEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccCCC---------- Confidence 76666 99999987665 789999999999876 56677788999999999999999999874321 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc--c Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT--S 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~--~ 224 (663) .+.+.+++|+|.+|||||.++ .|++||+|++|++|+|+++++++..+ ..+.+... + T Consensus 153 --------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~-~a~~i~~~~~~ 211 (714) T protein:vir:10 153 --------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG-MAQVIDYAIDD 211 (714) T ss_pred --------------------CCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCC-chhhhhhhhhh Confidence 234678999999999999654 69999999999999999999998322 11111100 0 Q ss_pred ch---h---------------hhccccccccccccccccceEEEEEEEEEeee---------------cC---------- Q lcl|NC_021532. 225 GE---D---------------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG---------- 261 (663) Q Consensus 225 ~~---~---------------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~---------- 261 (663) .. + .....++...+.|.+..+++|+|+|||++... ++ T Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~ 291 (714) T protein:vir:10 212 WRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVA 291 (714) T ss_pred hccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHh Confidence 00 0 00111222334556677889999999997431 11 Q ss_pred --------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 262 --------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQ 333 (663) Q Consensus 262 --------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~ 333 (663) ..+.++++++|+|+++|+.+++||+|++|||++++++........+|+++.++|+|+.+|++.|++++++ T Consensus 292 ~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l-- 369 (714) T protein:vir:10 292 SGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL-- 369 (714) T ss_pred hcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh-- Confidence 1235678889999999999999999999999999888876555567899999999999999999998865 Q ss_pred cCCCcEEeeccccCcch----hhhccCCcceEeCCCC-------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 334 SNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 334 ~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) +++ ++++.+|+++..+ ...++||+++.++++. ..+.+.+.+++|+++++++++..+.++++|||++++ T Consensus 370 ~~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~ 448 (714) T protein:vir:10 370 QAK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAF 448 (714) T ss_pred cCC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHH Confidence 455 4567788876543 2348999999998652 346677788999999999999999999999999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------eeeccchhh Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KFVPIRKDD 475 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~v~i~~~~ 475 (663) +|..+|+.|++| +++++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ .++.+|++. T Consensus 449 lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~ 525 (714) T protein:vir:10 449 LGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred cCCCccchhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecccc Confidence 999999988877 888889999999999999965 6899999999999999999999999974 388888765 Q ss_pred c---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhh---hhhhhhhhhhhhcch Q lcl|NC_021532. 476 L---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMR---MPEQAKRMREYEPKP 542 (663) Q Consensus 476 ~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~---~~e~~~~l~~~~~~~ 542 (663) . .++|||.|..+. .+.+..+ .+..|++++ +.++|.....++..++++++ ..++++.+++..+++ T Consensus 526 ~~~~~~nDi~~~~~Dv~i~~~p--~~~t~r~~~~~~l~~l~-~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:10 526 DNGELTNDISRLNTHIALAPVQ--QTPAFKAQLAQRMSEVI-QGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred CcceecccceeeeEEEEEeecc--CchHHHHHHHHHHHHHH-hhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 3 355666655544 3333333 333333333 33555544444443444444 446777887776654 Q ss_pred hhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQ---LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAH 619 (663) Q Consensus 543 ~~~~~q~~q---~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~ 619 (663) ++......+ .++++++++.++++.+.+..+++.++.+++++..++++..+..+++...+.++. ++......+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~-----~~~~~~~~~ 677 (714) T protein:vir:10 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QRYVDALNQ 677 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 432222111 111111111111112222222222222222222222222222222111111111 000000110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +..++ .++..+-++.+. ...++ +.++.++..+-+++ T Consensus 678 a~~a~--~~~~~~~~~~~~-----~~~~~-q~~q~~~~~~~~~~ 713 (714) T protein:vir:10 678 AHTAE--IITGVQNMEQEQ-----DVLQQ-QMLYTLQQRMNEMS 713 (714) T ss_pred HHHHH--HHHhHhhhhhhh-----HHHHH-HHHHHHHHHHHhcC Confidence 10000 011111111110 00000 11222222233333 No 9 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=100.00 E-value=1.8e-99 Score=561.98 Aligned_cols=605 Identities=12% Similarity=0.065 Sum_probs=395.5 Q ss_pred CC--CcHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK--INKA---ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~--~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |. .+.+ ++...+...|........++.....+..+||+|++|+ ++.+|+|++|+|.|.+.|++++|+.. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHH Confidence 22 2221 2334455556666666666777778889999999996 46799999999999999999999998 Q ss_pred HhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ..+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||++++++++.. T Consensus 88 ~nr~~----~~v~p~~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~---------- 152 (714) T protein:vir:32 88 KTRTD----LVVMSDEPDDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF---------- 152 (714) T ss_pred hCCcc----eEEecCCCCchhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcCcceEEeccccCCC---------- Confidence 76666 99999987665 789999999999876 56677788999999999999999999874321 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc--c Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT--S 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~--~ 224 (663) .+.+.+++|+|.+|||||.++ .|++||+|++|++|+|+++++++..+ ..+.+... + T Consensus 153 --------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~-~a~~i~~~~~~ 211 (714) T protein:vir:32 153 --------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPG-MAQVIDYAIDD 211 (714) T ss_pred --------------------CCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCC-chhhhhhhhhh Confidence 234678999999999999654 69999999999999999999998322 11111100 0 Q ss_pred ch---h---------------hhccccccccccccccccceEEEEEEEEEeee---------------cC---------- Q lcl|NC_021532. 225 GE---D---------------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG---------- 261 (663) Q Consensus 225 ~~---~---------------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~---------- 261 (663) .. + .....++...+.|.+..+++|+|+|||++... ++ T Consensus 212 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~ 291 (714) T protein:vir:32 212 WRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVA 291 (714) T ss_pred hccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHh Confidence 00 0 00111222334556677889999999997431 11 Q ss_pred --------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 262 --------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQ 333 (663) Q Consensus 262 --------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~ 333 (663) ..+.++++++|+|+++|+.+++||+|++|||++++++........+|+++.++|+|+.+|++.|++++++ T Consensus 292 ~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l-- 369 (714) T protein:vir:32 292 SGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL-- 369 (714) T ss_pred hcchhhhccccceEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh-- Confidence 1235678889999999999999999999999999888876555567899999999999999999998865 Q ss_pred cCCCcEEeeccccCcch----hhhccCCcceEeCCCC-------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 334 SNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 334 ~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) +++ ++++.+|+++..+ ...++||+++.++++. ..+.+.+.+++|+++++++++..+.++++|||++++ T Consensus 370 ~~~-~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~ 448 (714) T protein:vir:32 370 QAK-RVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAF 448 (714) T ss_pred cCC-ceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHH Confidence 455 4567788876543 2348999999998652 346677788999999999999999999999999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------eeeccchhh Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KFVPIRKDD 475 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~v~i~~~~ 475 (663) +|..+|+.|++| +++++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ .++.+|++. T Consensus 449 lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~ 525 (714) T protein:vir:32 449 LGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEG 525 (714) T ss_pred cCCCccchhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeecccc Confidence 999999988877 888889999999999999965 6899999999999999999999999974 388888765 Q ss_pred c---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhh---hhhhhhhhhhhhcch Q lcl|NC_021532. 476 L---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMR---MPEQAKRMREYEPKP 542 (663) Q Consensus 476 ~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~---~~e~~~~l~~~~~~~ 542 (663) . .++|||.|..+. .+.+..+ .+..|++++ +.++|.....++..++++++ ..++++.+++..+++ T Consensus 526 ~~~~~~nDi~~~~~Dv~i~~~p--~~~t~r~~~~~~l~~l~-~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~ 602 (714) T protein:vir:32 526 DNGELTNDISRLNTHIALAPVQ--QTPAFKAQLAQRMSEVI-QGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTP 602 (714) T ss_pred CcceecccceeeeEEEEEeecc--CchHHHHHHHHHHHHHH-hhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCC Confidence 3 355666655544 3333333 333333333 33555544444443444444 446777887776654 Q ss_pred hhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQ---LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAH 619 (663) Q Consensus 543 ~~~~~q~~q---~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~ 619 (663) ++......+ .++++++++.++++.+.+..+++.++.+++++..++++..+..+++...+.++. ++......+ T Consensus 603 ~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~-----~~~~~~~~~ 677 (714) T protein:vir:32 603 KSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QRYVDALNQ 677 (714) T ss_pred CCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 432222111 111111111111112222222222222222222222222222222111111111 000000110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +..++ .++..+-++.+. ...++ +.++.++..+-+++ T Consensus 678 a~~a~--~~~~~~~~~~~~-----~~~~~-q~~q~~~~~~~~~~ 713 (714) T protein:vir:32 678 AHTAE--IITGVQNMEQEQ-----DVLQQ-QMLYTLQQRMNEMS 713 (714) T ss_pred HHHHH--HHHhHhhhhhhh-----HHHHH-HHHHHHHHHHHhcC Confidence 10000 011111111110 00000 11222222233333 No 10 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=100.00 E-value=1.6e-97 Score=551.24 Aligned_cols=605 Identities=13% Similarity=0.074 Sum_probs=393.9 Q ss_pred CCCc---------H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHH Q lcl|NC_021532. 1 MKIN---------K---AELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSE 62 (663) Q Consensus 1 ~~~~---------~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~ 62 (663) |+-+ . ..+...+...|........++.....++.+||+|++|+ ++.+|+|++|+|.|.+.|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~ 80 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTVD 80 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHH Confidence 2211 1 11222333334444444445566677888999999996 5679999999999999999 Q ss_pred HHHHHHHHhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 63 WQHATIVDPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 63 ~~~~~l~~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) +++|+....+++ ++|.|++++|+ +.|++++.+++|+.+ .++....++++|.|+++||+||+++++|++.. T Consensus 81 ~v~g~~~~nr~~----~~v~pr~~~~~~~~~Ae~l~~~~~~~~~-~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~--- 152 (714) T protein:vir:10 81 GVLGMEAKTRTD----LIVMSDDPNDETEKLAEAINAEFADACR-LGNMNKARSDAYAEQIKAGLSWVEVRRNSEPF--- 152 (714) T ss_pred HHHHHHHhCCcc----eEEecCCCChhhHHHHHHHHHHHHHHHH-hhchhHHHHHHHHHhhhcccceEEeeeccCCC--- Confidence 999999876666 99999988764 689999999999876 56667788999999999999999999986421 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCccc-ccChhhCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC-QDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a-~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) .+.+.+++|+|.+|||||.+ ..|++||+|++|++|+|+++++++.. ...+. T Consensus 153 ---------------------------~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp-~~a~~ 204 (714) T protein:vir:10 153 ---------------------------GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFP-GMAQV 204 (714) T ss_pred ---------------------------CCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcC-Cchhh Confidence 23467899999999999965 46999999999999999999999832 21211 Q ss_pred hhhccc--------h-----hh-------hccccccccccccccccceEEEEEEEEEeee---------------cC--- Q lcl|NC_021532. 220 LAKTSG--------E-----DF-------DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------------DG--- 261 (663) Q Consensus 220 ~~~~~~--------~-----~~-------~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------------~~--- 261 (663) +..... . +. ....++...+.|.+.++++|+|+|||++... ++ T Consensus 205 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~ 284 (714) T protein:vir:10 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNL 284 (714) T ss_pred hhccchhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCH Confidence 111000 0 00 0111222334455667789999999987431 10 Q ss_pred ---------------CceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 262 ---------------DGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRG 326 (663) Q Consensus 262 ---------------~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 326 (663) ..+.++++++|.|.++|+.+++||+|+.|||++++++.......++|+++.++|+|+.+|+++|+ T Consensus 285 ~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~ 364 (714) T protein:vir:10 285 MQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIK 364 (714) T ss_pred HHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHH Confidence 12346778999999999999999999999999998888776666778999999999999999999 Q ss_pred HHHHHHhcCCCcEEeeccccCcchh----hhccCCcceEeCCC-------CCccccccCccccHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 327 IIDNMAQSNNGQVAIRKGALDQTNR----KKFLAGANFEFNGT-------ANDFWHGSYNAIPSSAFDMISLMNNEIESI 395 (663) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~i~~~d~----~~~~p~~vi~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (663) +++++ +++ ++++++|+++..+. ...+||+++.++++ +..+...+++++|+++++++++....++++ T Consensus 365 ~~~~l--~~~-~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~ 441 (714) T protein:vir:10 365 LTWLL--QAK-RVIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDT 441 (714) T ss_pred HHHHH--hCC-ceeeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHh Confidence 99865 344 67888999876432 23799999999763 235677888999999999999999999999 Q ss_pred hCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC-------ee Q lcl|NC_021532. 396 TGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND-------KF 468 (663) Q Consensus 396 tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~-------~~ 468 (663) |||+++++|..+|+.|++| +++++++|++.+..+++||.+ +++.+|+++|+||.+||++++++||+|+ .+ T Consensus 442 tGv~~~~lG~~~na~SGvA--I~~r~~qg~~~l~~~~dnl~~-~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:10 442 MGVYSAFLGQDSGATSGVA--ISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred hCCCHHHcCCCcchhHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCccccee Confidence 9999999999999888887 888889999999999999975 7899999999999999999999999964 36 Q ss_pred eccchhhc---------CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhhhh---hhhhhh Q lcl|NC_021532. 469 VPIRKDDL---------SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP---EQAKRM 535 (663) Q Consensus 469 v~i~~~~~---------~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~---e~~~~l 535 (663) +.+|++.. .+++||.|..+ +.+.++.+ .+..|++++. .++|......+..+++++++| ++++.+ T Consensus 519 ~~~n~~~~~~~~~nDi~~~~~dv~i~~~--p~~~s~r~~~~~~l~ql~~-~~~p~~~~~~~~~~le~~d~p~~~ei~~~i 595 (714) T protein:vir:10 519 IVLNAEGDNGELTNDISRLNTHIALAPV--QQTPAFKAQLAQRMSEVIQ-GLPPQVQAVVLDLWVNLLDVPQKQEFVERI 595 (714) T ss_pred EeeccccCCccccccceeeeEEEEEeec--cCcHHHHHHHHHHHHHHHh-hcCchhhhhHHHHHHHhcCCcCHHHHHHHH Confidence 77775432 24566655544 44444333 3334444443 455655554444455555554 677777 Q ss_pred hhhhcchhhHHHHh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKI---RQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVK 612 (663) Q Consensus 536 ~~~~~~~~~~~~q~---~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~ 612 (663) ++..+++++..... .+.+.++++++.++++.+.+..+++.++.+++++..++++..+..+++.....++. +. T Consensus 596 r~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~-----q~ 670 (714) T protein:vir:10 596 RAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG-----QR 670 (714) T ss_pred HHHcCCCCCccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HH Confidence 77665543322111 11111111111111222222222222222222222222222222222111111111 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 613 EDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 613 ~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) ......++.-+. .++..+.++.+... ..++ .++.++.++.+++ T Consensus 671 ~~~~~~~a~~a~--~l~~~~~~~q~~~~----~~q~--~~q~~~~~~~~~~ 713 (714) T protein:vir:10 671 YVDALNQAHTAE--IITGVQNMEQEQDV----LQQQ--MLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHH--HHHHHHhhhhhHHH----HHHH--HHHHHHHHHHhcC Confidence 000000000000 01111111111100 0111 1223333334444 No 11 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=1.6e-96 Score=545.70 Aligned_cols=599 Identities=15% Similarity=0.165 Sum_probs=404.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHhcCCcC----CccccCCCccccHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLIS----------TWKAEYNGEPY----GNEQKGKSAIVSRDIKKQSEWQHA 66 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~----------~~~~~y~~~~~----~~~~~g~s~~~~~~i~~~v~~~~~ 66 (663) -..+.+.|++.|.+.|+++.++|+....+|. ++++||+|..+ .++..|||++|++.|+++|+|+++ T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~ 93 (651) T protein:vir:80 14 TYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHA 93 (651) T ss_pred hhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHH Confidence 2335677899999999999999998877774 45788887655 345679999999999999999999 Q ss_pred HHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHh---ccchhHHHHHHHHHHHhcCceEEEeeeccccceecccc Q lcl|NC_021532. 67 TIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSR---KFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMG 143 (663) Q Consensus 67 ~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~---~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~ 143 (663) +|++.||++++||+|.|..++ +.|++.+++|++++.. ++++...++.+++|++++|+||+||+||.......... T Consensus 94 ~l~~~~~~~~~~~~~~p~~~~--d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~ 171 (651) T protein:vir:80 94 YLMSATFPNKNWFDVVPAKPG--QDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKV 171 (651) T ss_pred HHHHhhcCCCceeEeccCCch--hHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehhe Confidence 999999999999999996544 4577778888877653 45555667788999999999999999986654433221 Q ss_pred cccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHH---HHhcCCcChhhh Q lcl|NC_021532. 144 EAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTL---KKDGRYKNLDKL 220 (663) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l---~~~g~~~~~~~~ 220 (663) .. .............. ...+..+++|.+++|||++|||||+|+ +++||.|++|+.+ |+.++ .++|+|.+.+.. T Consensus 172 ~~-~~~~~~~~~~~~v~-~~~~~~~~~~~i~~v~p~~~~~dp~a~-~~~d~~~v~~~~~-t~~~l~~l~~~g~~~~~~~~ 247 (651) T protein:vir:80 172 QV-RTPLFEDEPTFEVV-SEEREVKSSPDFEVLDMFDCFYDPNVT-DPNRGAFIRKLTK-TKADILNLLSEGYYYGVDPL 247 (651) T ss_pred ec-cccccccccceeee-ccceeeeceeEEEEecHHHeeecCCCc-Cccccceeeeeee-eHHHHHHHHhcccccchhhH Confidence 10 00000111111111 123446688999999999999999985 7999999998854 55554 456787765433 Q ss_pred hhc---cchhhhccccccccc----cccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEE Q lcl|NC_021532. 221 AKT---SGEDFDYDSPDDTEF----QFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFL 293 (663) Q Consensus 221 ~~~---~~~~~~~~~~~~~~~----~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~ 293 (663) ... .....+......... ..+..+.++|.|||||++++.++++.. .+++++.|+.+|+..++||+++. ||+ T Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~-~~~v~~~g~~il~~~~~~~~~~~-Pf~ 325 (651) T protein:vir:80 248 DVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYH-DVVVTIMGNEVLRFEQNPYWCGR-PFV 325 (651) T ss_pred HHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceE-EEEEEEcCcEEecccccCCCCCC-Cee Confidence 221 111111111111111 112235678999999999999988874 56788889999999999999865 999 Q ss_pred EEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCcccccc Q lcl|NC_021532. 294 VVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGS 373 (663) Q Consensus 294 ~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~ 373 (663) +++|+++||++||+|+++.+.|.|+.+|+++|+++++++++++|+|++++|++.+.+...++||++|+++.+++ +.+++ T Consensus 326 ~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~~-~~~l~ 404 (651) T protein:vir:80 326 IGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHGD-LQPLA 404 (651) T ss_pred eecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCCC-ceeec Confidence 99999999999999999999999999999999999999999999999988876655556689999999987654 44443 Q ss_pred C-ccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 374 Y-NAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSL-GSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN 451 (663) Q Consensus 374 ~-~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~-~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li 451 (663) + .+.+...+++++++.+.++++|||+++++|..+... ..||++|+++++++++++..++++|+++++++|+++++.++ T Consensus 405 ~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~ 484 (651) T protein:vir:80 405 NQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLV 484 (651) T ss_pred cCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 445677889999999999999999999999876554 35999999999999999999999999989999999999999 Q ss_pred HHhcCCceEEEEecC-----eeeccchhhcCCceEEEEeeccc--chhHHHHHHHHHHHHHhccCCCcch---hHHHHHH Q lcl|NC_021532. 452 AEFLEEEEVIRVTND-----KFVPIRKDDLSGRIDIDISISTA--EDNAAKSQELSFLLQTLGPNEDPKI---RRDIMAD 521 (663) Q Consensus 452 ~q~~~~~~~iri~~~-----~~v~i~~~~~~~~~d~~v~~~~~--~~~~~~~q~l~~~~~~~~~~~~p~~---~~~~l~~ 521 (663) ++|++.++++||+|+ .|+.++++++.++++++. +|.+ ..+....+.+.++++.+++.++... ...++.. T Consensus 485 ~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~-~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~ 563 (651) T protein:vir:80 485 QQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVP-IGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVD 563 (651) T ss_pred HHhcCcccceeecccccccccccccCccceeeeeeeee-ccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHH Confidence 999999999999985 477888889988888742 2222 2234445566666776665432211 1233445 Q ss_pred HHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 522 IMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSS 601 (663) Q Consensus 522 ~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~ 601 (663) +++..++++....+.....+ .++..+ +....+++....++ + .++++.++++..+.++++ T Consensus 564 l~~~~g~~~~~~~l~~~~q~----~~~~~~-~~~~~q~~~~~~~a---------~-------~~~~~~~~~~~~~~~~~~ 622 (651) T protein:vir:80 564 LLQHWGFEEPEAYLKQQDQQ----APANPQ-EALLSQAKDVGGQA---------M-------SNMLQNQLQADGGTQMMS 622 (651) T ss_pred HHHHcCCCCcHHhcCCCccc----hhhhhh-HHHHhhHHHHHHHH---------H-------HHHHHHHHHHHHHHHHHH Confidence 55666665544443221111 110000 00000111000000 0 001111111110001111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 602 EADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKH 641 (663) Q Consensus 602 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~ 641 (663) ++...+. +.+.+++....+..++.++++| T Consensus 623 ~~~~~~~-----------~~~~~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 623 EMYGTPN-----------ADQMQQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHHH-----------HHHHHHHHHHHHHHHHHhhccC Confidence 1111111 0111111122222222222222 No 12 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=100.00 E-value=9e-96 Score=541.64 Aligned_cols=593 Identities=14% Similarity=0.119 Sum_probs=392.3 Q ss_pred CCCcH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHH Q lcl|NC_021532. 1 MKINK--------------AELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQ 60 (663) Q Consensus 1 ~~~~~--------------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~ 60 (663) |.+.+ ..+...+...|........++.....+..+||+|++|+ ++.+|++++|+|.|.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~ 80 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPA 80 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHH Confidence 22211 11111222233333334445556667788899999995 56799999999999999 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCC-cchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeecccccee Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPIT-WEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEV 139 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~-~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~ 139 (663) |++++|+....+++ ++|.|+. .+|.+.|++++.+++|+.+ .++....++++|.|+++||+||++++++.+.. T Consensus 81 v~~v~g~~~~nr~d----~~v~Pr~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~-- 153 (772) T protein:vir:10 81 LLSLQGYEAVTRTD----WRVTPNGDVGGQEVADALNYRLNTAER-QSGADRACSEAFRPQIACGIGWVEVSRESDPF-- 153 (772) T ss_pred HHHHHHHHHhcCcc----eEEecCCCchHHHHHHHHHHHHHHHHH-hcChHHHHHHHHHHhhhcCceeEEeccccCCC-- Confidence 99999999876666 9999985 5889999999999999875 56777789999999999999999987753221 Q ss_pred cccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 140 TVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) ...+.+++|+|.+|||||++..|++||+|+++..|||+++++++ |++... T Consensus 154 ----------------------------~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~--fp~~a~ 203 (772) T protein:vir:10 154 ----------------------------KFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALV--FPEHAE 203 (772) T ss_pred ----------------------------CCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHh--CCCchh Confidence 12357889999999999998779999999999999999999987 332221 Q ss_pred hhhccc-----------------hh--------hhccccccccccccccccceEEEEEEEEEeeec------CCc----- Q lcl|NC_021532. 220 LAKTSG-----------------ED--------FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD------GDG----- 263 (663) Q Consensus 220 ~~~~~~-----------------~~--------~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~------~~g----- 263 (663) +..... .+ .....++.....|.+.++++|+|+|||+|.... ++| T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~ 283 (772) T protein:vir:10 204 LIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEY 283 (772) T ss_pred HHHhhhhhcccccCcccccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEee Confidence 111000 00 011122233445667778999999999985421 111 Q ss_pred ----------------------eeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHH Q lcl|NC_021532. 264 ----------------------IAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKT 321 (663) Q Consensus 264 ----------------------~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N 321 (663) ..++++++|+|.++|+.+++||+|+.|||++++++++..+..++|+++.++|+|+.+| T Consensus 284 ~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N 363 (772) T protein:vir:10 284 DPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLN 363 (772) T ss_pred CcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHH Confidence 2467788999999999999999999999999999988877777889999999999999 Q ss_pred HHHHHHHHHHHhcCCCcEEeeccccCcch----hhhccCCcceEeCCCC-----CccccccCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 322 AVIRGIIDNMAQSNNGQVAIRKGALDQTN----RKKFLAGANFEFNGTA-----NDFWHGSYNAIPSSAFDMISLMNNEI 392 (663) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (663) ++.|+++++++.+ ++++++|+|+..+ ....+|++++.++++. ..+...+.+.+|++++++++.....| T Consensus 364 ~~~S~~~~~l~~~---~~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i 440 (772) T protein:vir:10 364 SGVSKLRWGMSVA---RVERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATI 440 (772) T ss_pred HHHHHHHHHHhcc---cccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHH Confidence 9999999988766 5789999998754 2457999999998763 34556678889999999999999999 Q ss_pred HHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC------ Q lcl|NC_021532. 393 ESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND------ 466 (663) Q Consensus 393 ~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~------ 466 (663) +++||++++++|..+|+.|++| ++++++++++.+..+++||.. +++.+|+++|+||.+||+++|+|||+|+ T Consensus 441 ~~vsGv~~~~lG~~~na~SGvA--i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~ 517 (772) T protein:vir:10 441 ERVSNITAGFQGRKGTATSGIQ--EQQQIEQSNQSIGRIMDNFRA-GRTLVGELLLAMIVEDIGQERTEVVIEGDAVTAD 517 (772) T ss_pred HHHhCCCHHHcCCCcchhhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCC Confidence 9999999999999998888877 777789999999999999964 7889999999999999999999999974 Q ss_pred eeeccchhh-------------c-CCceEEEEeecccchhHHHHH-HHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-- Q lcl|NC_021532. 467 KFVPIRKDD-------------L-SGRIDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-- 529 (663) Q Consensus 467 ~~v~i~~~~-------------~-~~~~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-- 529 (663) .++.||... + .+++||.|.. ++.+.++++ .+..+++++ +.++|.+...++..+++++++| T Consensus 518 ~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~--~p~~~t~r~~~~~~m~ql~-~~~~P~~~~~~~~~~le~~D~p~~ 594 (772) T protein:vir:10 518 RVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALED--VPSTNSYRGQQLNAMSEAV-KSMPPQYQAAVLPFLVSLMDVPFK 594 (772) T ss_pred ceEEeccceecccccccceeccceeeeEEEEeec--cccchHHHHHHHHHHHHHH-hccChhHHHHHHHHHHhhcCCCCh Confidence 456676421 1 3455655554 444444443 444455554 4468888888777777777776 Q ss_pred -hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 -EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDL 608 (663) Q Consensus 530 -e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~ 608 (663) ++++.++.+.+++++.+++..+.+ ..+.++++++++++..+.++ +.++.++++ +++++++..... T Consensus 595 ~ei~~~ir~~~~~~~peq~~~~~~q--~~qq~~~~~~~el~~~q~~a-------~~~~~~A~a-----~~~~aqa~~~~~ 660 (772) T protein:vir:10 595 RDVVEAIRAVDQQQTPEQIQQQIDQ--AVQDALAKAGNDIKLRELEI-------KERKADSEI-----SGLNAKAVQIGV 660 (772) T ss_pred HHHHHHHHHHhccCChHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-----HHHHHHHHHHHH Confidence 677777776655444332211111 11111111111111111111 111111111 111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhhhhhccccC Q lcl|NC_021532. 609 KFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQR---NAGDTNIGVVE 663 (663) Q Consensus 609 e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~---~~~~~~~~~~~ 663 (663) + .+.++.+..+.. .+....+......+ ..+..+. .....+.-... T Consensus 661 ~--a~~~a~~aa~~~--~q~~q~a~~ad~~l------~~~g~~~~~~~~~~~~~p~~~ 708 (772) T protein:vir:10 661 Q--AAFSAMQAGAQI--AQMPMIAPIADAVM------QSAGYQRPNPAGDDPNYPIAD 708 (772) T ss_pred H--HHHHHhhhhhhH--HhhhhhhHHHHHHH------HhcccccccccccCCCCCCCC Confidence 0 000111111000 00000001110111 1111000 00000000001 No 13 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=100.00 E-value=3e-90 Score=511.35 Aligned_cols=609 Identities=10% Similarity=0.069 Sum_probs=388.2 Q ss_pred CcHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc------cccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKA-ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN------EQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~------~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |.+. .++..+...|+.+..+..++.....+..+||+|++|.. +.+|++ ++|.|++.|+|++|+....+++ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~~~~nr~d- 77 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPID- 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCC--ccccHHHHHHHHHhhHHhCCcc- Confidence 5544 47888888888888888888888888899999999963 567776 6799999999999999876555 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ++|.|+.++|.+.|+++|.+++|+.+ .++....++++|+|+++||+||++|.+||+++.. +. T Consensus 78 ---~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~-----------~~--- 139 (725) T protein:vir:77 78 ---VLYRPKDGARPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSP-----------TS--- 139 (725) T ss_pred ---eEEecCCccHHHHHHHHHHHHHHHHH-hhCchhHHHHHHHHHhhcCcceeeeeecccCCCC-----------CC--- Confidence 99999999999999999999999865 5777788999999999999999999988754311 00 Q ss_pred ccccccccceeecccceee----eccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhc--CCcChhhhhhccchhh Q lcl|NC_021532. 156 VVEQEVTETVVKKNQPTAR----VCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDG--RYKNLDKLAKTSGEDF 228 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~----~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g--~~~~~~~~~~~~~~~~ 228 (663) +.+.|+ ..+|.+|||||.++ .|++||+|+|++.|+|++++..+. |..+...+.... T Consensus 140 -------------~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~---- 202 (725) T protein:vir:77 140 -------------NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQ---- 202 (725) T ss_pred -------------CceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhccccc---- Confidence 011122 34788899999765 499999999999999999766542 221111111100 Q ss_pred hccccccccccccccccceEEEEEEEEEeeec------------------------------CCce----------eEEE Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD------------------------------GDGI----------AEPI 268 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~------------------------------~~g~----------~~~~ 268 (663) ......+.| .+.++|+|+|||++..+. ..|. .+.+ T Consensus 203 ---~~~~~~~~~--~~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~ 277 (725) T protein:vir:77 203 ---NPNDWVFPW--LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVY 277 (725) T ss_pred ---ccccccccc--cCCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeee Confidence 000011112 245689999999976421 1121 1234 Q ss_pred EEEEECCEEEecccCCCcCCCCCEEEEeeee--ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc Q lcl|NC_021532. 269 VCAWINDVIVRLQSNPYPDGKPPFLVVPFNS--IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL 346 (663) Q Consensus 269 ~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~--~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i 346 (663) ++++.|.++|+ +++||+|+.|||+++++++ +++..|++|+++.++|+|+.+|+++|+++++++++++.++++..|++ T Consensus 278 ~~~~~g~~~l~-~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i 356 (725) T protein:vir:77 278 KSIITCTAVLK-DKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) T ss_pred EeeecCceeec-cCCcCCCCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhh Confidence 45566777765 5789999999999877664 57888889999999999999999999999999999999999999999 Q ss_pred CcchhhhccCCcc-------eEeCCCC---CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHH Q lcl|NC_021532. 347 DQTNRKKFLAGAN-------FEFNGTA---NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATG 416 (663) Q Consensus 347 ~~~d~~~~~p~~v-------i~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~ 416 (663) +.....+..|+++ +..++|. .++..++++++|+++++|++.....|+++|||+++++|..+|++||.| T Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a-- 434 (725) T protein:vir:77 357 AGFEHMYDGNDDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDT-- 434 (725) T ss_pred hHHHHHHHhccCCceecccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHH-- Confidence 8766666666654 3334443 356677889999999999999999999999999999999998877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccch-------------hhcCCc Q lcl|NC_021532. 417 ARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRK-------------DDLSGR 479 (663) Q Consensus 417 i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~-------------~~~~~~ 479 (663) +++++++|.+.+..+++||. .+++.+|+++|+||.+||+++++|||+|+ +++.||. .++.|+ T Consensus 435 i~~rq~qg~~~~~~~~Dnl~-~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~ 513 (725) T protein:vir:77 435 VNQLNMRADLETYVFQDNLA-TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGR 513 (725) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccc Confidence 77888999999999999996 47899999999999999999999999986 4777774 245678 Q ss_pred eEEEEeecccchhHHHHHHHHHHHHHhccCCCc--chhHHHHHHHHHhhhhh---hhhhhhhhhhcc-----hhhHHHHh Q lcl|NC_021532. 480 IDIDISISTAEDNAAKSQELSFLLQTLGPNEDP--KIRRDIMADIMDLMRMP---EQAKRMREYEPK-----PDPVQEKI 549 (663) Q Consensus 480 ~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p--~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~-----~~~~~~q~ 549 (663) |||+|+.+.+..+. +.+.+..|++++. .++| .+...++...+++++.+ +..+.++...+. +...++++ T Consensus 514 ~Dv~v~~~p~~~s~-r~~~~~~l~qll~-~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q 591 (725) T protein:vir:77 514 YECYTDVGPSFQSM-KQQNRAEILELLG-KTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQ 591 (725) T ss_pred eeeEEeeccchHHH-HHHHHHHHHHHHH-hccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHH Confidence 88888776543321 2222333333322 2222 12233344445555544 444444432221 11111111 Q ss_pred -----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 550 -----RQLELENLMLENQMLVASINDKNARANENTIDAE-------LKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGY 617 (663) Q Consensus 550 -----~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~-------~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~ 617 (663) .+.+++++++++.++++++...++++++.+.+.. ..+++++..+++..++..++...+ ....+... T Consensus 592 ~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q--~a~~~~~~ 669 (725) T protein:vir:77 592 WLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSK--QSEFREFL 669 (725) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH--HHHHHHHH Confidence 1111111111111111111111111111111110 000111111111111111111000 00111111 Q ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 618 AHLEQVELEDL-------RHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 618 ~~~~~~~~~~~-------~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) +.+..++.+.. +.....+....+++.+.++.+...+ .+.+=++|+ T Consensus 670 ~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~~~~~~~~~-~~~~~~~~~ 721 (725) T protein:vir:77 670 KTVASFQQDRSEDARANAELLLKGDEQTHKQRMDIANILQSQR-QNQPSGSVA 721 (725) T ss_pred HHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHHHHHHHHHH-hcCCCcCcc Confidence 11111111111 1111111122356666776664444 444455666 No 14 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=100.00 E-value=7.8e-90 Score=509.10 Aligned_cols=611 Identities=11% Similarity=0.063 Sum_probs=383.9 Q ss_pred CcHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKAE-LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |.+.. ++..+...|+.+.....++.....+..+||+|++|+ ++.+|++ ++|.|++.|+|++|+....+++ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp--~~N~i~~~v~~v~g~e~~nr~d- 77 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPID- 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCcc- Confidence 55444 788888888888887778888888899999999996 4567876 6799999999999999876666 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ++|.|+.++|.+.|++++.+++|+.+ .++....++++|.|+++||+||++|.+||+++.. +. T Consensus 78 ---~~v~p~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~-----------~~--- 139 (725) T protein:vir:10 78 ---VLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSP-----------TS--- 139 (725) T ss_pred ---eEEecCCcchHHHHHHHHHHHHHHHH-hcCcchHHhHHHHHHhhcCcceeeeeccccCCCC-----------CC--- Confidence 99999999999999999999999865 5677778899999999999999999988754211 00 Q ss_pred ccccccccceeecccceee----eccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHh--cCCcChhhhhhccchhh Q lcl|NC_021532. 156 VVEQEVTETVVKKNQPTAR----VCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKD--GRYKNLDKLAKTSGEDF 228 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~----~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~--g~~~~~~~~~~~~~~~~ 228 (663) ....++ ..||.+|||||.++ .|++||+|+++..|++++.+... .|..+...+.... T Consensus 140 -------------~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~---- 202 (725) T protein:vir:10 140 -------------NNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQ---- 202 (725) T ss_pred -------------CceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCccccccccc---- Confidence 001112 34788899999664 59999999999999998654321 1222211111100 Q ss_pred hccccccccccccccccceEEEEEEEEEeeec-----------C-------------------Cce----------eEEE Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD-----------G-------------------DGI----------AEPI 268 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~-----------~-------------------~g~----------~~~~ 268 (663) ......+...+.++|+|+|||++.++. | .|. .+.+ T Consensus 203 -----~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~ 277 (725) T protein:vir:10 203 -----NPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVY 277 (725) T ss_pred -----ccccccccccCCCeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEE Confidence 000111222345679999999986421 0 111 1344 Q ss_pred EEEEECCEEEecccCCCcCCCCCEEEEeeee--ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc Q lcl|NC_021532. 269 VCAWINDVIVRLQSNPYPDGKPPFLVVPFNS--IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL 346 (663) Q Consensus 269 ~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~--~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i 346 (663) +++++|.++|+. ++|++|+.|||+++.+++ ++|..|++|+++.++|+|+.+|+++|+++++++++++.+++++.+++ T Consensus 278 ~~~~~g~~~l~~-~~~~~~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i 356 (725) T protein:vir:10 278 KSIITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) T ss_pred EEeecchhhhcC-CCCCCCCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhh Confidence 455678887754 678999999999877665 57888888999999999999999999999999999999999999999 Q ss_pred CcchhhhccCCcceEe-------CCCC---CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHH Q lcl|NC_021532. 347 DQTNRKKFLAGANFEF-------NGTA---NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATG 416 (663) Q Consensus 347 ~~~d~~~~~p~~vi~~-------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~ 416 (663) +.....+.+|+++..+ ++|. .++...+++++|+++++|++.....++++|||+++++|..+|+.|+.| T Consensus 357 ~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a-- 434 (725) T protein:vir:10 357 AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT-- 434 (725) T ss_pred hHHHHHHhccCCceeeecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHH-- Confidence 8666666777766443 2222 356677889999999999999999999999999999999998877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccch-------------hhcCCc Q lcl|NC_021532. 417 ARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRK-------------DDLSGR 479 (663) Q Consensus 417 i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~-------------~~~~~~ 479 (663) +++++++|++.+..+++||.. +++.+|+++|+||.+||+++++|||+|+ +|+.||. .++.|+ T Consensus 435 i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~ 513 (725) T protein:vir:10 435 VNQLNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGR 513 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccc Confidence 788889999999999999965 7899999999999999999999999986 3887775 345678 Q ss_pred eEEEEeecccchhHHHHHHHHHHHHHhccCCCc--chhHHHHHHHHHhhhhh---hhhhhhhhhhcch---hhHHHHhhH Q lcl|NC_021532. 480 IDIDISISTAEDNAAKSQELSFLLQTLGPNEDP--KIRRDIMADIMDLMRMP---EQAKRMREYEPKP---DPVQEKIRQ 551 (663) Q Consensus 480 ~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p--~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~~---~~~~~q~~q 551 (663) |||+|+.+++..+. +.+.+..|++++.. ++| .+...++...+++++.+ +..+.++...++. ++..++..+ T Consensus 514 ~Dv~v~~~p~~~s~-r~~~~~~l~qll~~-~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q 591 (725) T protein:vir:10 514 YECYTDVGPSFQSM-KQQNRSEILELLGK-TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQ 591 (725) T ss_pred eeEEEeeccCcHHH-HHHHHHHHHHHHHh-ccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhH Confidence 88888776543321 22333333333322 222 12333444445555444 4555554432221 111111111 Q ss_pred HHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHH-HHHHHHHHHH Q lcl|NC_021532. 552 LELENLMLENQMLVASIND-----KNARANENTIDAELKRSKAAVEKAKARKLSSEA------DMTDLK-FVKEDNGYAH 619 (663) Q Consensus 552 ~~~~~~q~~~~~~~a~~~~-----~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~------~~~~~e-~~~~~~~~~~ 619 (663) ..++.++++.+++.++..+ .+++++...++++...+++++.+.+.+.....+ .....+ ....+.+.++ T Consensus 592 ~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~~~ 671 (725) T protein:vir:10 592 WLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKT 671 (725) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH Confidence 1111111111111111111 111111111111111111111111111000000 000000 0011111111 Q ss_pred HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 620 LEQVELEDL-------RHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 620 ~~~~~~~~~-------~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) ...++++.. +.....+.+..+++++..+.+...+.+ +.=+.|. T Consensus 672 ~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~~~~~~~~~~q~~~-~~~~~~~ 721 (725) T protein:vir:10 672 VASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQN-QPSGSVA 721 (725) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhhhhcccccccc-CCCcccc Confidence 111111111 111111112223444444444322222 2333343 No 15 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=100.00 E-value=1.7e-89 Score=507.22 Aligned_cols=609 Identities=10% Similarity=0.063 Sum_probs=385.3 Q ss_pred CcHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC------ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKA-ELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG------NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~------~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |.+. .++..+...|+.+..+..++.....+..+||+|++|. ++.+|++ ++|.|.+.|+|++|+....+++ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp--~~N~i~~~i~~v~g~e~~nr~d- 77 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRG--QFDVVRPVVRKLVSEMRQNPID- 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC--cccchHHHHHHHHhhHHhCCcc- Confidence 5543 5788888888888888888888888999999999996 4567876 6799999999999999876655 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ++|.|+.++|.+.|+++|.+++|+.+ .++....++++|+|+++||+||++|.+||+++.. +. T Consensus 78 ---~~v~P~~~~d~~~Ae~l~~~~~~~~~-~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~-----------~~--- 139 (725) T protein:vir:92 78 ---VLYRPKDGASPDAADVLMGMYRTDMR-HNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSP-----------TS--- 139 (725) T ss_pred ---eEEecCCccHHHHHHHHHHHHHHHHH-hhCchHHHHHHHHHHhhcCcceeeeeecccCCCC-----------CC--- Confidence 99999999999999999999999865 6777788999999999999999999988754211 00 Q ss_pred ccccccccceeecccceee---ec-cHHHheeCcccc-cChhhCceEEEEeecCHHHHHHh--cCCcChhhhhhccchhh Q lcl|NC_021532. 156 VVEQEVTETVVKKNQPTAR---VC-RNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKD--GRYKNLDKLAKTSGEDF 228 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~---~v-~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~--g~~~~~~~~~~~~~~~~ 228 (663) +.+.++ +. |+.+|||||.++ .|++||+|+|++.|++++++... .|..+...+.... T Consensus 140 -------------~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~---- 202 (725) T protein:vir:92 140 -------------NNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQ---- 202 (725) T ss_pred -------------CceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcc---- Confidence 111222 22 456799999765 49999999999999999876543 1222211111100 Q ss_pred hccccccccccccccccceEEEEEEEEEeeec-----------C-------------------Cce----------eEEE Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD-----------G-------------------DGI----------AEPI 268 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~-----------~-------------------~g~----------~~~~ 268 (663) . ...+.....+.++|+|+|||++..+. + .|. .+.+ T Consensus 203 ---~--~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~ 277 (725) T protein:vir:92 203 ---N--PNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVY 277 (725) T ss_pred ---c--CCcccccccCCCeEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEe Confidence 0 01111112345789999999975421 1 111 1334 Q ss_pred EEEEECCEEEecccCCCcCCCCCEEEEeeee--ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc Q lcl|NC_021532. 269 VCAWINDVIVRLQSNPYPDGKPPFLVVPFNS--IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL 346 (663) Q Consensus 269 ~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~--~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i 346 (663) +++++|.++|+. ++|++|+.|||+++.+++ ++|..|++|+++.++|+|+.+|+++|+++++++++++.+++++.+++ T Consensus 278 ~~~~~g~~~l~~-~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i 356 (725) T protein:vir:92 278 KSIITCTAVLKD-KQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) T ss_pred eeeecchhhhcC-CCCCCCCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhh Confidence 455678777754 678999999999887665 57888889999999999999999999999999999999999999999 Q ss_pred CcchhhhccCCcceEe-------CCCC---CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHH Q lcl|NC_021532. 347 DQTNRKKFLAGANFEF-------NGTA---NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATG 416 (663) Q Consensus 347 ~~~d~~~~~p~~vi~~-------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~ 416 (663) +.....+.+|+.+..+ ++|. .++...+++++|+++++|++.....++++|||+++++|..+|+.|+.| T Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~a-- 434 (725) T protein:vir:92 357 AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDT-- 434 (725) T ss_pred hHHHHHHhccCccceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHH-- Confidence 8665556666654332 2222 356677889999999999999999999999999999999998877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccch-------------hhcCCc Q lcl|NC_021532. 417 ARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRK-------------DDLSGR 479 (663) Q Consensus 417 i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~-------------~~~~~~ 479 (663) +.+++++|++.+..+++||.. +++.+|+++|+||.+||++++++||+|+ +++.||. .++.++ T Consensus 435 i~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~ 513 (725) T protein:vir:92 435 VNQLNMRADLETYVFQDNLAT-AMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGR 513 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccc Confidence 777888999999999999964 7889999999999999999999999985 5787775 355678 Q ss_pred eEEEEeecccchhHHHHH-HHHHHHHHhccCCCc--chhHHHHHHHHHhhhhh---hhhhhhhhhhcc-----hhhHHHH Q lcl|NC_021532. 480 IDIDISISTAEDNAAKSQ-ELSFLLQTLGPNEDP--KIRRDIMADIMDLMRMP---EQAKRMREYEPK-----PDPVQEK 548 (663) Q Consensus 480 ~d~~v~~~~~~~~~~~~q-~l~~~~~~~~~~~~p--~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~-----~~~~~~q 548 (663) |||+|+.+++. .++.+ .+..|++++ +.++| .+....+...+++++.+ +..+.++...++ +.+.+++ T Consensus 514 ~Dv~v~~~p~~--~s~r~~~~~~l~ql~-~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~ 590 (725) T protein:vir:92 514 YECYTDVGPSF--QSMKQQNRAEILELL-GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQ 590 (725) T ss_pred eeeEEeeccCh--HHHHHHHHHHHHHHH-HhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhh Confidence 88887776543 33333 333333333 22222 12223334444555443 445555433221 1111111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH--HHHHHH------HHHHHHHH-HHHHH Q lcl|NC_021532. 549 IRQLELENLMLENQMLVASINDKNARANENTID---AELKRSKAAVEKAKAR--KLSSEA------DMTDLKFV-KEDNG 616 (663) Q Consensus 549 ~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~---~~~~~~~~~~e~~~~q--~~~~~~------~~~~~e~~-~~~~~ 616 (663) + ..++.++++.++++++..+.+++..+.+++ ++.+..+++.+.++.+ .....+ ..+.++.+ ..+.. T Consensus 591 q--~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~ 668 (725) T protein:vir:92 591 Q--WLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREF 668 (725) T ss_pred H--HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 1 111111222222222211111111111111 1111111111111100 000000 00000000 00111 Q ss_pred HHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 617 YAHLEQVELEDLRHA-------QHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 617 ~~~~~~~~~~~~~~~-------~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) .+...+.++++.... ...+.+..+++.+.++.+...+.++..+-.=+ T Consensus 669 ~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 722 (725) T protein:vir:92 669 LKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPSGSVAE 722 (725) T ss_pred HHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhcchhccCCcccccc Confidence 111111111111111 01111223445555565544444444433222 No 16 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=100.00 E-value=1.3e-85 Score=486.01 Aligned_cols=603 Identities=13% Similarity=0.125 Sum_probs=365.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCcCCc----------cccCCCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN--GEPYGN----------EQKGKSAIVSRDIKKQSEWQHATI 68 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~--~~~~~~----------~~~g~s~~~~~~i~~~v~~~~~~l 68 (663) |-=....++..+...|+.+..+..++.+...+..+||+ |++|+. +.+|+|++++|.|.+.|++++|+. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 33334478999999999999988888877777778875 889963 246889999999999999999999 Q ss_pred HHhhcCCCceEEEEeCCcc-hHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 69 VDPFVSTADIIKCTPITWE-DTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 69 ~~~~~~~~~~~~~~p~~~~-D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ...+++ ++|.|++++ |.+.|++++.+++|+.+ .++....++++|.++++||+||+++.+||+++..+.. T Consensus 81 ~~nr~d----~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~----- 150 (720) T protein:vir:35 81 RHNRIT----VKFRPGDKTASEALANKLNGLFRADYE-ETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMD----- 150 (720) T ss_pred HhCCCc----eEEEcCCCcchHHHHHHHHHHHHHHHH-hcCchHHHhHHHHHhhhccceeEEeeecccccCCCCc----- Confidence 876666 999999664 89999999999999875 5677778899999999999999999998765421110 Q ss_pred cCccccccccccccccceeecccceee--eccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhcc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTAR--VCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 224 (663) ....+.++ .+|+.+|||||.++ .|++||+|+++..|+|+++++++ |++.....+. T Consensus 151 -------------------~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~--yp~~a~~~~~- 208 (720) T protein:vir:35 151 -------------------ERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAE--YNKDPATLMS- 208 (720) T ss_pred -------------------ccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHh--CCCccccccc- Confidence 00112233 35678999999876 48999999999999999999987 3322111111 Q ss_pred chhhhccccccccccccccccceEEEEEEEEEeeec---------CCc-----------------------------e-- Q lcl|NC_021532. 225 GEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD---------GDG-----------------------------I-- 264 (663) Q Consensus 225 ~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~---------~~g-----------------------------~-- 264 (663) .......+.+. ..+.|+|+|||.+..+. ..| + T Consensus 209 ------~~~~~~~~d~~--~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 280 (720) T protein:vir:35 209 ------GIERSWDYDWY--DVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKR 280 (720) T ss_pred ------ccccccccccc--CCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeE Confidence 11111112222 34579999999875321 000 0 Q ss_pred eEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEee Q lcl|NC_021532. 265 AEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI--PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIR 342 (663) Q Consensus 265 ~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~ 342 (663) .++++.++.|..+| ..++|+||+.|||+++.+++. ++..+++|+++.++|+|+.+|++.|.+++++..+ +.+++ T Consensus 281 ~~v~~~~~~g~~~l-~~~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~---~~~~~ 356 (720) T protein:vir:35 281 RRVYVSVVDGEGFL-EKAQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQD---TGSIP 356 (720) T ss_pred EEEEEEeeccchhc-ccCCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcC---Ccccc Confidence 11222333444444 457888889999998887655 6777778999999999999999999999988665 56677 Q ss_pred ccccCcch---hhhccCCcc----eEe-----CCC-----CCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCC Q lcl|NC_021532. 343 KGALDQTN---RKKFLAGAN----FEF-----NGT-----ANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGI 405 (663) Q Consensus 343 ~~~i~~~d---~~~~~p~~v----i~~-----~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~ 405 (663) .|+++..+ ..+..|+++ +.+ ++| +..+...+++++|++.+++++.....|+++|||+++++|. T Consensus 357 ~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~ 436 (720) T protein:vir:35 357 IVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPM 436 (720) T ss_pred ccCcchHHHHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCc Confidence 77765432 233445443 111 121 1345677788999999999999999999999999999998 Q ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchh------- Q lcl|NC_021532. 406 NSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKD------- 474 (663) Q Consensus 406 ~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~------- 474 (663) .+| .||.| +++++++|++.+..+++||.. .++.+|+++|+||.+||+++|+|||+|+ +++.+|.. T Consensus 437 ~sn-~SG~A--i~~rq~qg~~~~~~~~Dnl~~-~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g 512 (720) T protein:vir:35 437 PSN-IAKET--VNHLMHRSDMSSFIYLDNMAK-SLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTG 512 (720) T ss_pred ccc-hHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCC Confidence 776 46555 788889999999999999965 6889999999999999999999999984 46555421 Q ss_pred ------hc-CCceEEEEeecccchhHHHHHH-HHHHHHHhccCCCcc--hhHHHHHHHHHhhhhh---hhhhhhhhhhcc Q lcl|NC_021532. 475 ------DL-SGRIDIDISISTAEDNAAKSQE-LSFLLQTLGPNEDPK--IRRDIMADIMDLMRMP---EQAKRMREYEPK 541 (663) Q Consensus 475 ------~~-~~~~d~~v~~~~~~~~~~~~q~-l~~~~~~~~~~~~p~--~~~~~l~~~~~l~~~~---e~~~~l~~~~~~ 541 (663) ++ .|+|||+|+.+.. +.+..++ +..+++++ +.++|. ....++..+++++++| +.++.++...+. T Consensus 513 ~~v~~NDi~~g~yDv~v~~~p~--~~s~req~~~~m~qll-~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~ 589 (720) T protein:vir:35 513 QVVAMNDLSSGRYDVTVDVGPS--YTARRDATVSVLTNLL-AGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLT 589 (720) T ss_pred ceeeeecceeeeeEEEEecccC--cccHHHHHHHHHHHHH-HhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcch Confidence 22 3667776665443 3333333 33333333 333332 2233444556666665 455555544332 Q ss_pred hhhHHHHhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH----H Q lcl|NC_021532. 542 PDPVQEKIRQLEL--ENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE--KAKARKLSSEADMTDLKFVK----E 613 (663) Q Consensus 542 ~~~~~~q~~q~~~--~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e--~~~~q~~~~~~~~~~~e~~~----~ 613 (663) ....++...+.++ +.++++.++++++.+..++.. .++++++++++++.. ++++.+.+..++..+++... + T Consensus 590 ~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l--~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~ 667 (720) T protein:vir:35 590 QGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVL--MQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASA 667 (720) T ss_pred hcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2111111111111 111111111111111111111 111111111111111 11111111111111111100 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_021532. 614 DNGYAHLEQVELEDLRHAQHLE----------REAMKHRANLEQMLAQRNAGDTNI 659 (663) Q Consensus 614 ~~~~~~~~~~~~~~~~~~~~~~----------~e~~k~~~~~e~~~~~~~~~~~~~ 659 (663) ..+.+.. ..++++....++ .+..++..+.++..+..-..+..| T Consensus 668 ~~~~q~~---i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~~~~~~~~~~~~~ 720 (720) T protein:vir:35 668 DSAKRAE---IREALKMLHQFQKEQGDASRADAELILKATDTQHKQNRDAAKNHSI 720 (720) T ss_pred HHHHHHH---HHHHHHHHHHHHHhcchHHHHHHHHhhcccchhhhhhHHHhhccCC Confidence 0000000 001111111111 111112222222222333333344 No 17 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=100.00 E-value=2.4e-85 Score=484.52 Aligned_cols=604 Identities=12% Similarity=0.107 Sum_probs=367.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhcCCcCCc------ccc----CCCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWK--AEYNGEPYGN------EQK----GKSAIVSRDIKKQSEWQHATI 68 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~y~~~~~~~------~~~----g~s~~~~~~i~~~v~~~~~~l 68 (663) |--..+.++..+...|..+..+..++...+.... +||+|++|+. +.+ |+|++|+|.|++.|++++|+. T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 6555667888888888888777777666655443 6899999973 333 579999999999999999999 Q ss_pred HHhhcCCCceEEEEeCCc-chHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 69 VDPFVSTADIIKCTPITW-EDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 69 ~~~~~~~~~~~~~~p~~~-~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ...+++ ++|.|+++ +|.+.|+++|.+++|+.+ .++....++++|.++++||+||+++..||.++.-... T Consensus 81 ~~nr~d----~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~----- 150 (708) T protein:vir:17 81 RNNRIT----VKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMD----- 150 (708) T ss_pred hhCCcc----eEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhcccceeeeeecccccCCCCC----- Confidence 877666 99999965 589999999999999875 5667778999999999999999999877654321000 Q ss_pred cCccccccccccccccceeeccc-ceeeeccHHHheeCccccc-ChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQ-PTARVCRNEDIYLDPTCQD-NLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~dp~a~~-d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) ...+. +....+|+.+|||||.++. |++||+|++++.|+|+++++++ |++... T Consensus 151 ------------------~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~--yp~~a~------ 204 (708) T protein:vir:17 151 ------------------DRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAE--YGKKPP------ 204 (708) T ss_pred ------------------CccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHh--Cccccc------ Confidence 00111 1222357789999997754 9999999999999999999987 332110 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEeeec---------CC---------------------c----------ee Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVD---------GD---------------------G----------IA 265 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~---------~~---------------------g----------~~ 265 (663) ...+.......++.| ...++|+|+|||++..+. .. | .. T Consensus 205 ~~~~~~~~~~~~~~~--~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~ 282 (708) T protein:vir:17 205 ASLDVTSMTSWEYDW--FDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRR 282 (708) T ss_pred hhhhhhhhccccccc--cCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEE Confidence 000001111111222 234789999999875321 00 1 11 Q ss_pred EEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeec Q lcl|NC_021532. 266 EPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI--PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRK 343 (663) Q Consensus 266 ~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~ 343 (663) +++++++.|..+|. .++|++|+.|||+++++++. ++....+|+++.++|+|+.+|+++|+++++++++++.+++++. T Consensus 283 ~v~~~~~~g~~~l~-~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~ 361 (708) T protein:vir:17 283 RVYVSVVDGDGFLE-KPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGM 361 (708) T ss_pred EEEEEeeccccccc-CCCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeech Confidence 34556677777774 57899999999999887755 6666568899999999999999999999999999999999998 Q ss_pred cccCcchhh--------------hccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCccc Q lcl|NC_021532. 344 GALDQTNRK--------------KFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGS 409 (663) Q Consensus 344 ~~i~~~d~~--------------~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~ 409 (663) +++...... +..++.+-.+..++..+..++++++|++++++++.....|+++|||+++++|..+| T Consensus 362 ~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn- 440 (708) T protein:vir:17 362 EQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN- 440 (708) T ss_pred hhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccc- Confidence 876422111 12244554455666677778899999999999999999999999999999998665 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchh----------- Q lcl|NC_021532. 410 LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKD----------- 474 (663) Q Consensus 410 ~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~----------- 474 (663) +||.| +++++++|++.+..+++||.. +++.+|+++|+||.+||+++|+|||+|+ +++.+|.. T Consensus 441 ~SG~A--i~~rq~qg~~~~~~~~Dnl~~-~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~ 517 (708) T protein:vir:17 441 IAQET--VNNLMNRADMASFIYLDNMAK-SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVA 517 (708) T ss_pred hHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCcccee Confidence 46555 778889999999999999965 7889999999999999999999999975 46666531 Q ss_pred --hc-CCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcc--hhHHHHHHHHHhhhhh---hhhhhhhhhhcchhhHH Q lcl|NC_021532. 475 --DL-SGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPK--IRRDIMADIMDLMRMP---EQAKRMREYEPKPDPVQ 546 (663) Q Consensus 475 --~~-~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~--~~~~~l~~~~~l~~~~---e~~~~l~~~~~~~~~~~ 546 (663) ++ .|+|||.|+.+.+ +.++.++....|..+.+.++|. ....++..+++++++| ++++.++...+.+...+ T Consensus 518 ~nDi~~g~~Dv~v~~~p~--~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~ 595 (708) T protein:vir:17 518 LNDLSVGRYDVTVDVGPS--YTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAK 595 (708) T ss_pred eccceeeeeeEEEecccC--chhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhcccccc Confidence 22 2566666655433 3333333332222222222221 2233334444555544 55666655443322111 Q ss_pred ---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 547 ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQV 623 (663) Q Consensus 547 ---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~ 623 (663) +...++.++.++.++ .+++....+++++..+.+++++++++++. +.+++..+++....+...+..+.. T Consensus 596 ~~~~e~~q~~~q~qq~~q--~q~~~~~~eaqa~~~~~qAe~~ka~aea~-------~~q~~a~q~~~~~~~a~~~a~q~~ 666 (708) T protein:vir:17 596 PRNEKEQQIVQQAQMAAQ--SQPNPEMVLAQAQMVAAQAEAQKATNETA-------QTQIKAFTAQQDAMESQANTVYKL 666 (708) T ss_pred CcchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111111111 11111111112211111222222221111 111111111111111100000000 Q ss_pred HH-HHHHHHHHH-HHHHHHH-HHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 624 EL-EDLRHAQHL-EREAMKH-RANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 624 ~~-~~~~~~~~~-~~e~~k~-~~~~e~~~~~~~~~~~~~~~~~ 663 (663) ++ ..++..++. ..+.++. +...+...+....| -++ T Consensus 667 ~q~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~p~~-----~~~ 704 (708) T protein:vir:17 667 AQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQS-----PAD 704 (708) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccC-----chh Confidence 00 000000000 0000000 00001000001111 111 No 18 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=100.00 E-value=7.7e-85 Score=481.72 Aligned_cols=606 Identities=12% Similarity=0.094 Sum_probs=361.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCcCCcc------c----cCCCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEY--NGEPYGNE------Q----KGKSAIVSRDIKKQSEWQHATI 68 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y--~~~~~~~~------~----~g~s~~~~~~i~~~v~~~~~~l 68 (663) |-=++.+++..+...|+.+..+..++.+...+..+|| .|++|... + .|+|++++|.|++.|++++|++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 7777888999999999999999888888877777777 48899742 2 3789999999999999999999 Q ss_pred HHhhcCCCceEEEEeC-CcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 69 VDPFVSTADIIKCTPI-TWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 69 ~~~~~~~~~~~~~~p~-~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ...+.+ ++|.|+ +.+|.+.|++++.+++|+.+ .++....++++|.|+++||+||++++.|++++..+.+ T Consensus 81 ~~nr~~----~~v~P~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~----- 150 (706) T protein:vir:10 81 RNNRIS----VKFRPGDNAASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMD----- 150 (706) T ss_pred HhCCCc----eEEecCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCC----- Confidence 877666 999995 56688999999999999865 5677788999999999999999999888765321111 Q ss_pred cCccccccccccccccceeecccceeeec-cHH-HheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhcc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVC-RNE-DIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~-~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 224 (663) ....+.+++| +|+ +|||||.++ .|++||+|++++.|||+++++++.... ...+.... T Consensus 151 -------------------~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~-~~~~~~~~ 210 (706) T protein:vir:10 151 -------------------ERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKA-PTSLDRVG 210 (706) T ss_pred -------------------CCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCC-hhhhhhhc Confidence 1122334443 455 799999665 599999999999999999999973221 11111100 Q ss_pred c-hhhhccccccccccccccccceEEEEEEEEEeeecC-------------------Cce----------eEEEEEEEEC Q lcl|NC_021532. 225 G-EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDG-------------------DGI----------AEPIVCAWIN 274 (663) Q Consensus 225 ~-~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~-------------------~g~----------~~~~~~~~~g 274 (663) + +........+.........++.+.+..||++....+ .|. .+.++.+++| T Consensus 211 ~~~~~~d~~~~d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g 290 (706) T protein:vir:10 211 SVSWQYDWFTPDVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDG 290 (706) T ss_pred cccccccccCCCcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecc Confidence 0 000000000000000011122333445666543221 111 1235566788 Q ss_pred CEEEecccCCCcCCCCCEEEEeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhh Q lcl|NC_021532. 275 DVIVRLQSNPYPDGKPPFLVVPFNSI--PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRK 352 (663) Q Consensus 275 ~~~l~~~~~p~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~ 352 (663) ..+|+ +++||+|++|||++++.++. +++..++|+++.++|+|+.+|+++|+++++++++.+-...+..+.++..... T Consensus 291 ~~~l~-~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~ 369 (706) T protein:vir:10 291 DGFLE-KPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQH 369 (706) T ss_pred ccccc-cCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHH Confidence 88884 68999999999998887665 7778888999999999999999999999999888765544443333211112 Q ss_pred hccC-----------------CcceEeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHH Q lcl|NC_021532. 353 KFLA-----------------GANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTAT 415 (663) Q Consensus 353 ~~~p-----------------~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~ 415 (663) +..+ |.++.. ...+..++++.+|++++++++.....|+++|||+++++|..+| .|+.| T Consensus 370 ~~~~~~~~~~~l~~~~~~~~~g~i~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn-~SG~A- 444 (706) T protein:vir:10 370 WEGRNRKRPAFLPLRTVTDKTGNVVAP---ANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN-VARET- 444 (706) T ss_pred hhhcccccccchhcccccCCCCccccc---ccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc-hHHHH- Confidence 2222 222221 1233455677899999999999999999999999999998766 46655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchh-------------hc-C Q lcl|NC_021532. 416 GARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKD-------------DL-S 477 (663) Q Consensus 416 ~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~-------------~~-~ 477 (663) +++++++|++.+..+++||.. +++.+|+++|+||.+||+++|+|||+|+ +++.||.. ++ . T Consensus 445 -i~~rq~qg~~~~~~~~Dnl~~-~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~ 522 (706) T protein:vir:10 445 -VNSLLNRSDMASFIYLDNMAK-SLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLST 522 (706) T ss_pred -HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeeccee Confidence 888889999999999999965 7889999999999999999999999974 57777631 22 3 Q ss_pred CceEEEEeecccchh--HHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh---hhhhhhhhhhcchhhHHHH---h Q lcl|NC_021532. 478 GRIDIDISISTAEDN--AAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP---EQAKRMREYEPKPDPVQEK---I 549 (663) Q Consensus 478 ~~~d~~v~~~~~~~~--~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~~~~~~~q---~ 549 (663) |+|||+|+.+....+ .+..+.|+++++.+.+.. ..++.++..+++++++| ++++.++...+.+...++. + T Consensus 523 g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~--~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~e 600 (706) T protein:vir:10 523 GRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQD--PMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQE 600 (706) T ss_pred eeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcc--hhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhH Confidence 566776665443332 222223333333332221 24455555556666655 5566665544433222211 1 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 550 RQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLR 629 (663) Q Consensus 550 ~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 629 (663) +++.++.+++++++++.++ .+++++..+.++++++ ++++..+.+.+...++........+....+ . T Consensus 601 q~~~~q~qq~q~~q~~~~~--~~~~aq~~~~qA~~~k-------~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~-----~ 666 (706) T protein:vir:10 601 QAIVQQAQQAQATQPDPNM--LLAQAQMVVAQAEAQK-------SQNETVQTQIKAFTAQQDAMESQANTVYKL-----A 666 (706) T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----H Confidence 1111111111111111111 1111111111111111 111111111111111100000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHhhhhhhcccc-C Q lcl|NC_021532. 630 HAQHLEREAMKHRANLEQML-AQRNAGDTNIGVV-E 663 (663) Q Consensus 630 ~~~~~~~e~~k~~~~~e~~~-~~~~~~~~~~~~~-~ 663 (663) ...++...+ ..+.-+++ ..+..|....-.+ | T Consensus 667 ~a~~~~~~~---~~q~~q~l~~~~a~q~~~~~~~~~ 699 (706) T protein:vir:10 667 QARNIDDKA---VMETLRLLKEVAASQQQTIPSPPS 699 (706) T ss_pred HHHHHHHHH---HHHHHHHHHHHHHhccCCCCCCCC Confidence 000111111 00111111 1111111111111 1 No 19 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=100.00 E-value=1.5e-82 Score=469.10 Aligned_cols=608 Identities=12% Similarity=0.103 Sum_probs=367.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCcCCc------ccc----CCCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN--GEPYGN------EQK----GKSAIVSRDIKKQSEWQHATI 68 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~--~~~~~~------~~~----g~s~~~~~~i~~~v~~~~~~l 68 (663) |-=+.+.++..++..|+.+..+..++.+...+..+||+ |++|+. +++ |+|++|+|.|++.|++++|+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 77677789999999999998888888877777777764 899963 333 679999999999999999999 Q ss_pred HHhhcCCCceEEEEeCCc-chHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 69 VDPFVSTADIIKCTPITW-EDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 69 ~~~~~~~~~~~~~~p~~~-~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) ...+++ ++|.|+++ +|.+.|++++.+++|+.+ .++....++++|.|+++||+||+++..||+++.-+... T Consensus 81 ~~nr~d----~~v~P~~~~~d~~~Ae~l~~~~~~~~~-~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~---- 151 (708) T protein:vir:10 81 RNNRIT----VKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD---- 151 (708) T ss_pred HhCCcc----eEEEcCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCC---- Confidence 876665 99999975 489999999999999875 56677789999999999999999998776543211100 Q ss_pred cCccccccccccccccceeecccc-eeeeccHHHheeCcccc-cChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQ-DNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~-~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) ..+.+ .....|+.+|||||.++ .|++||+|+++++|+|+++++++...+.. T Consensus 152 -------------------~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~-------- 204 (708) T protein:vir:10 152 -------------------RQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPP-------- 204 (708) T ss_pred -------------------ccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcc-------- Confidence 01112 22334567999999776 49999999999999999999987322211 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEeee---------cCCc-------------------------------ee Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDV---------DGDG-------------------------------IA 265 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~---------~~~g-------------------------------~~ 265 (663) ...+.....+..+.| ...+.|+|.|||.+..+ ..+| .. T Consensus 205 ~~~d~~~~~~~~~~~--~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~ 282 (708) T protein:vir:10 205 TSLDVTSMTSWEYNW--FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRR 282 (708) T ss_pred cccccccCCCccccc--cCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeE Confidence 111111111111222 23456888888876321 0111 11 Q ss_pred EEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee--cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeec Q lcl|NC_021532. 266 EPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI--PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRK 343 (663) Q Consensus 266 ~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~--~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~ 343 (663) +++++++.|..+| ..++|++|+.|||+++++++. .+..+++|+++.++|+|+.+|+++|+++++++++.+...+++. T Consensus 283 ~v~~~~~~g~~~l-e~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~ 361 (708) T protein:vir:10 283 RVYVSVVDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGM 361 (708) T ss_pred EEEEEeecchhhh-ccCCCCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccCh Confidence 2445566777777 457899999999998887664 6777778999999999999999999999999999999889888 Q ss_pred cccCcchhhhccC----Ccce----------EeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCccc Q lcl|NC_021532. 344 GALDQTNRKKFLA----GANF----------EFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGS 409 (663) Q Consensus 344 ~~i~~~d~~~~~p----~~vi----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~ 409 (663) +++......+... ..+. .+..++..+..++++.+|++++++++.....|+++||++++++|..+| T Consensus 362 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn- 440 (708) T protein:vir:10 362 EQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSN- 440 (708) T ss_pred hhhhhHHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccc- Confidence 8775332221111 1111 112233455566788999999999999999999999999999997554 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----eeeccchh----------- Q lcl|NC_021532. 410 LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----KFVPIRKD----------- 474 (663) Q Consensus 410 ~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----~~v~i~~~----------- 474 (663) .||+| |++++++|++.+..+++||.. +++.+|+++|+||.+||+++|+|||+|+ +++.+|.. T Consensus 441 ~SG~a--I~~rq~qg~~~l~~~~Dnl~~-~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~ 517 (708) T protein:vir:10 441 IAQET--VNNLMNRADMASFIYLDNMAK-SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVA 517 (708) T ss_pred hHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceee Confidence 45555 888889999999999999964 7889999999999999999999999975 46666532 Q ss_pred --hc-CCceEEEEeecccch--hHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh---hhhhhhhhhhcchhhHH Q lcl|NC_021532. 475 --DL-SGRIDIDISISTAED--NAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP---EQAKRMREYEPKPDPVQ 546 (663) Q Consensus 475 --~~-~~~~d~~v~~~~~~~--~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~---e~~~~l~~~~~~~~~~~ 546 (663) ++ .|+|||.|..+.... +.+..+.|+++++.+.+.. | ....++..+++++++| ++++.++...+.+.+.. T Consensus 518 ~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~-~-~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~ 595 (708) T protein:vir:10 518 LNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTD-P-MRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAK 595 (708) T ss_pred eeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCc-h-hhHHHHHHHHHhcCCcChHHHHHHHHHhhccccccc Confidence 11 356676666543322 2222233333333333321 1 2333344445555555 56666665544322211 Q ss_pred H---HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 547 E---KIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQV 623 (663) Q Consensus 547 ~---q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~ 623 (663) + ...++.++.++++.+ +++....+++++..+.+++++++++++.+ .+++..+++....+...+..+.. T Consensus 596 ~~~~ee~q~~~~~q~~~q~--q~~~~~~e~qa~~~~~qAe~~ka~a~a~~-------~~~~a~q~~~~~~~a~~~a~q~~ 666 (708) T protein:vir:10 596 PRNEKEQQIVQQAQMAAQS--QPNPEMVLAQAQMVAAQAEAQKATNETAQ-------TQIKAFTAQQDAMESQANTVYKL 666 (708) T ss_pred ccchhhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHH Confidence 1 111111111111111 11111111122111222222222222111 11111111111111111111000 Q ss_pred HH-HHHHHHHHHH-HHHHHH-HHHHHHHHHHHhhhhhhcccc Q lcl|NC_021532. 624 EL-EDLRHAQHLE-REAMKH-RANLEQMLAQRNAGDTNIGVV 662 (663) Q Consensus 624 ~~-~~~~~~~~~~-~e~~k~-~~~~e~~~~~~~~~~~~~~~~ 662 (663) +. ..++.....+ .+.++- +...++..+....+....+-- T Consensus 667 ~~a~~~~~~~~~~~~q~l~~~q~~q~~~~~~~p~~~~~~~p~ 708 (708) T protein:vir:10 667 AQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS 708 (708) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCchhccCC Confidence 00 0000000000 000000 000010000000111111111 No 20 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=4.7e-84 Score=477.40 Aligned_cols=540 Identities=15% Similarity=0.121 Sum_probs=397.3 Q ss_pred CCCc---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC----CccccCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKIN---------KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPY----GNEQKGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~---------~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~----~~~~~g~s~~~~~~i~~~v~~~~~~ 67 (663) |-.+ .+.+.+.|..+|+++++.|++.+..|.+.++||.+.-. ..+-..|++++.|+++.+|++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 3222 45556999999999999999999999999999875321 2344568899999999999999999 Q ss_pred HHHhhcCCCceEEEEeCCcchHHH--HHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDS--AEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEA 145 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~--Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~ 145 (663) |++.+|++++|++|+|.-++|+++ ++.+..++...+. +++...+++.+|+|++++|+|+++|+|.....+.... T Consensus 81 l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~-e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~--- 156 (584) T protein:vir:95 81 YFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCR-ESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDG--- 156 (584) T ss_pred HHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhh-hccHHHHHHHHHHhhccCCceEEEEeEeecceeeecc--- Confidence 999999999999999999998877 7777888777764 5678888999999999999999999997654433311 Q ss_pred cccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhc-----CCcChhhh Q lcl|NC_021532. 146 VVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDG-----RYKNLDKL 220 (663) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g-----~~~~~~~~ 220 (663) +.......|.+++++|++|||||+|. +++|+.||+ +.++|+++|.++. .+-+.+.+ T Consensus 157 -----------------~~v~~~~~prieriSP~d~~~Dpsa~-~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v 217 (584) T protein:vir:95 157 -----------------TLVPDYIGPRLVRISPLDIVFNPLAT-SISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEAL 217 (584) T ss_pred -----------------ccccccccceEEeeChhheeecCCCC-Cccchhhhh-hhhhhHHHHHHHHhhcCccccchHHH Confidence 11223567899999999999999995 899999999 6668999998873 23333322 Q ss_pred hh---c-------cchhhhcc---ccccccccccccccceEEEEEEEEEe-eecCCceeEEE-EEEEECCEEEecccCCC Q lcl|NC_021532. 221 AK---T-------SGEDFDYD---SPDDTEFQFSDAPRKKLIIYEYWGNY-DVDGDGIAEPI-VCAWINDVIVRLQSNPY 285 (663) Q Consensus 221 ~~---~-------~~~~~~~~---~~~~~~~~~~d~~~~~v~v~E~w~~~-~~~~~g~~~~~-~~~~~g~~~l~~~~~p~ 285 (663) .. . ..++.+.. ..+.....+.......|+++|+|+.+ +...++...++ ++++.|+++|+.+.||| T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~ 297 (584) T protein:vir:95 218 KRREEICRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPT 297 (584) T ss_pred HHHHHhccCCCCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCC Confidence 21 1 11111111 01111123334445579999999954 44445555544 55668899999999999 Q ss_pred cCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCC Q lcl|NC_021532. 286 PDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGT 365 (663) Q Consensus 286 ~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~ 365 (663) |++++||++.+|+|+++++||+|+.+.+.|+|+.+|+++|+++||++++++|. .++.++..+ ...+||++++.... T Consensus 298 ~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv---~k~~~~~~~-~~~~pg~~~~~~~~ 373 (584) T protein:vir:95 298 WFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPP---LKIIGEVEE-FVWGPGAEIHLDQG 373 (584) T ss_pred CCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcc---eeeccccch-hcccCCceeecCCC Confidence 99999999999999999999999999999999999999999999999999983 344444333 34679999998765 Q ss_pred CCccccccCcc-ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 366 ANDFWHGSYNA-IPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLM 444 (663) Q Consensus 366 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~ 444 (663) + ..++++++. -....++.++++++.+++.|||+.+++|.++.+ ..||+++++++++++..++.+++.|.+.++++++ T Consensus 374 ~-~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~-~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~ 451 (584) T protein:vir:95 374 G-DVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPG-EKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVL 451 (584) T ss_pred C-CcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccch-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 455555442 112355679999999999999999999988554 7899999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCceEEEEecCe-----eeccchhhcCCceEEEEeecccchh-HHHHHHHHHHHH-HhccCCCcchhHH Q lcl|NC_021532. 445 RKWMAYNAEFLEEEEVIRVTNDK-----FVPIRKDDLSGRIDIDISISTAEDN-AAKSQELSFLLQ-TLGPNEDPKIRRD 517 (663) Q Consensus 445 ~~~~~li~q~~~~~~~iri~~~~-----~v~i~~~~~~~~~d~~v~~~~~~~~-~~~~q~l~~~~~-~~~~~~~p~~~~~ 517 (663) ..++.+..+|++.+.+||++|++ |++|.|+++.++|+++..++++... ++..+.+.++++ .+++.+.|..... T Consensus 452 ~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~ 531 (584) T protein:vir:95 452 NAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGK 531 (584) T ss_pred HHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHH Confidence 99999999999999999999875 8999999999999999988876654 445666777776 6777777877766 Q ss_pred HHHH-HHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 518 IMAD-IMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 518 ~l~~-~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) .+.. ++++.+++... +.. ++...++ |++.++...+.++. .++++ ++.++-+- T Consensus 532 ~l~~~ladl~~~p~~~-----~~~-~~~~~~~--Q~~~q~~~~~~q~~------~~~~~------------~~~~~~~~ 584 (584) T protein:vir:95 532 ALATFVDDVTGLQGYE-----IFR-PNVAVAE--QAETQSLVAQAQED------LQLQA------------QMPAEGAI 584 (584) T ss_pred HHHHHHHHHhCCCccc-----ccC-CCcccch--hHHHHhhhHHHHHH------HHHHH------------hhhhccCC Confidence 6665 45677766311 111 1111111 11111110000000 00000 00000000 No 21 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=2.4e-80 Score=457.05 Aligned_cols=548 Identities=13% Similarity=0.100 Sum_probs=397.7 Q ss_pred CC-------------CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccCCCccccHHHHHHHHH Q lcl|NC_021532. 1 MK-------------INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NEQKGKSAIVSRDIKKQSEW 63 (663) Q Consensus 1 ~~-------------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~~g~s~~~~~~i~~~v~~ 63 (663) |+ .+...++++|...|+++.+.|+..+..|++.++|.+.+-.. ..-.-|++++.|+++..|+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~ 80 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLM 80 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHH Confidence 33 34555778899999999999999999999999998754322 11234678999999999999 Q ss_pred HHHHHHHhhcCCCceEEEEeCCcchH--HHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 64 QHATIVDPFVSTADIIKCTPITWEDT--DSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 64 ~~~~l~~~~~~~~~~~~~~p~~~~D~--~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) ++++++..+|+|++||+|+|..++|+ +.++....++...+. +++...+++.++.|.+++|++|.++.|....-+. T Consensus 81 l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~-e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~-- 157 (599) T protein:vir:31 81 ITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVE-ASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVT-- 157 (599) T ss_pred HHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhh-hcchHHHHHHHHhhhcccCceeEeeeEEEcceee-- Confidence 99999999999999999999999865 557888888888874 5788888999999999999999999875322111 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHh---cCCc--C Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKD---GRYK--N 216 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~---g~~~--~ 216 (663) + +. .+...+..|.+++|+|+||||||+|. +++|+.|++ |.+.|+++|..+ ++++ + T Consensus 158 --------~---------d~-~v~~~~~~P~~ervsP~Di~~Dp~A~-si~d~~fiv-Rs~~Tk~~L~~l~~~~~~~~y~ 217 (599) T protein:vir:31 158 --------A---------EN-QVIKNYSGTVTERLSPSDVFWDVTAD-SLPKAAKCI-RQLYTLGSLKREIEEGTFPLMS 217 (599) T ss_pred --------c---------cc-ccccccccceEEeecccceeeCCCCC-CCCcceeee-ehhhhHHHHHHHhccCCccccc Confidence 0 00 23445678999999999999999995 799998877 888999999875 2222 2 Q ss_pred hhhhh---hccchhhhccccccccccccccccc-------------eEEEEEEEE-EeeecCCceeEEEEEEEECC-EEE Q lcl|NC_021532. 217 LDKLA---KTSGEDFDYDSPDDTEFQFSDAPRK-------------KLIIYEYWG-NYDVDGDGIAEPIVCAWIND-VIV 278 (663) Q Consensus 217 ~~~~~---~~~~~~~~~~~~~~~~~~~~d~~~~-------------~v~v~E~w~-~~~~~~~g~~~~~~~~~~g~-~~l 278 (663) ++.+. +...+..+...+......+.|.++. -|+++|||+ .++..+|+..+.++++|+|+ +++ T Consensus 218 ~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~li 297 (599) T protein:vir:31 218 MEDFQKLREERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIG 297 (599) T ss_pred hHHHHHHHhhccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEe Confidence 22221 2222222222222233333343332 388999998 88888999999999999996 777 Q ss_pred ecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCc Q lcl|NC_021532. 279 RLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGA 358 (663) Q Consensus 279 ~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~ 358 (663) +...||||||+.||++.+|.|+++++||+|+...+.|+|..+|.++|+++|++.++..| +....+.+.+.|..+ .||+ T Consensus 298 R~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p-~l~~~~dl~~eD~~~-~P~~ 375 (599) T protein:vir:31 298 RKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHP-SLKKVGDVREKGMRG-GPNH 375 (599) T ss_pred ecccCCCCCCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcc-cccccccccccCccC-CCCc Confidence 99999999999999999999999999999999999999999999999999999999987 444455566666554 5999 Q ss_pred ceEeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 359 NFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAEN 438 (663) Q Consensus 359 vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~ 438 (663) +|++...++. +++++++-...+..+++++...+++.||++.+++|.++.+ ..||+++++++++|+.+++.+++.|++. T Consensus 376 v~~~~d~~~v-q~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag-~~TA~~is~l~naa~~~~~~~vr~~e~~ 453 (599) T protein:vir:31 376 VFEVEETGDV-QYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAG-EKTKFEVQLLDQGQNKVFRRKVKKFERE 453 (599) T ss_pred ceeecCCCcc-ccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccc-hhhHHHHHHHHhhhhhhHHHHHHHHHHH Confidence 9998776544 5555554445566679999999999999999999988776 5799999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCceEEEEecCe-----eeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccC---- Q lcl|NC_021532. 439 LVKPLMRKWMAYNAEFLEEEEVIRVTNDK-----FVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPN---- 509 (663) Q Consensus 439 ~~~~l~~~~~~li~q~~~~~~~iri~~~~-----~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~---- 509 (663) +++||+++++++.++|++++.+|||++++ |+.|+++++++++++...++.... ++.+.+..+++.+.++ T Consensus 454 ~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~--ere~~~q~l~~il~~~~~q~ 531 (599) T protein:vir:31 454 LLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFA--EKANTLQNLNAILGGPLGAA 531 (599) T ss_pred HHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHH--HHHHHHHHHHHHhcccCCCc Confidence 99999999999999999999999999986 999999999999998655543322 2444444444544444 Q ss_pred CCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 510 EDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKA 589 (663) Q Consensus 510 ~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~ 589 (663) +.|++....+. .+.+..+.+..++..+.+...+++|++...+ |+++++.+ .. ++-++.- T Consensus 532 ~~P~~~~k~l~------~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~-------Q~~lq~~~--~~------~~~~~~~ 590 (599) T protein:vir:31 532 LAPHMSRTKLF------NAVEYLGDLDAYGIFTFGIGVQEDQQLARMA-------QKSTQQTE--ET------ALTQEEV 590 (599) T ss_pred cchhhHHHHHH------HHHHHHHhccccccCCCchhHHHHHHHHHHH-------HHHHHHhH--hh------hhhhhhc Confidence 44544443322 2333455555555544433332222111111 11110000 00 0000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 590 AVEKAKARKLSSEADMTD 607 (663) Q Consensus 590 ~~e~~~~q~~~~~~~~~~ 607 (663) . -..++..+ T Consensus 591 ~---------~~~~~~~~ 599 (599) T protein:vir:31 591 G---------GPTTDTGQ 599 (599) T ss_pred C---------CCCcccCC Confidence 0 00000000 No 22 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=1.6e-78 Score=447.11 Aligned_cols=594 Identities=16% Similarity=0.165 Sum_probs=393.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC--------------ccccCCCccccHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG--------------NEQKGKSAIVSRDIKKQSEWQHA 66 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~--------------~~~~g~s~~~~~~i~~~v~~~~~ 66 (663) =|+.++.+++.|.+.|+.+++.|+.++++|.+.++||...... ....+|++++.+.+...++++++ T Consensus 18 ~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s 97 (641) T protein:vir:94 18 RKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVA 97 (641) T ss_pred hcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhh Confidence 7889999999999999999999999999999888888643221 12244789999999999999999 Q ss_pred HHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccc Q lcl|NC_021532. 67 TIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAV 146 (663) Q Consensus 67 ~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~ 146 (663) +|++.||++++||+|.|++++|++.|++++.++++.+ .+++++.+++++++|++..|+||++++|+......... + T Consensus 98 ~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l-~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~---~ 173 (641) T protein:vir:94 98 YFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKL-EAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKR---T 173 (641) T ss_pred HHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHH-hhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhh---h Confidence 9999999999999999999999999999999999988 46788899999999999999999999998665432211 1 Q ss_pred ccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEE-eecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 147 VVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHR-YETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~-~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) .+.. +. .........+......+.++.|+|++||+||++. .+++.|++++ +.+|+.+|+..||+ +.+.+..... T Consensus 174 ~~~~-~~-~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~--~~~~~f~~~r~t~~t~~~l~~eg~~-~~d~v~~~~~ 248 (641) T protein:vir:94 174 FVET-GD-IFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGG--KNTGTFVRLRHTREELHELVTSGYY-DLDLTQVEQY 248 (641) T ss_pred cccc-hh-hcccccccceecccceeeEEecchhheeecCCCC--cccccceehhhhHHHHHHHHhcCCC-Chhhcchhhc Confidence 1100 00 0000011112223455677889999999999875 3455666554 56677777777775 3332222111 Q ss_pred hhhh-ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcc Q lcl|NC_021532. 226 EDFD-YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKL 304 (663) Q Consensus 226 ~~~~-~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~ 304 (663) .+.. ...+...+.++.+ ..+++++|||+.++.++.... .+++++.|+++|+.++++|+++ +||++++|.+.++++ T Consensus 249 ~~~~~~~~d~~~d~~~~~--~~~~~~~e~~gd~~~d~~~~~-~~~~~~~g~~il~~~~~~~~d~-~Pf~~~r~~~~~~~~ 324 (641) T protein:vir:94 249 VDYKFADPDTPKDVNGTD--TSGWDIIEYYGPLLVEGVQFW-CVHAVFYGKQLIRLSDSKYWCG-SPFVTTTLLPDRDSV 324 (641) T ss_pred cccccccccccccccccc--ccccceeeeeeeeccCCCcee-eEEEEEeCCEEeecccccccCc-CCeEEecceecCCcc Confidence 1111 0011111222222 335678999986554443322 3568889999999999998765 599999999999999 Q ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCcc-ccHHHHH Q lcl|NC_021532. 305 HGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNA-IPSSAFD 383 (663) Q Consensus 305 ~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~-~~~~~~~ 383 (663) ||+|+++.+.|.|+.+|++.+.+++++.++++|++++.++++.........||+++.++..+ .+.++.++. .....++ T Consensus 325 YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~-~v~pl~~~~~~~~~~~~ 403 (641) T protein:vir:94 325 YGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHG-SLQPIDMGRQDFVVTYQ 403 (641) T ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCC-cceeecCCccccchhHH Confidence 99999999999999999999999999999999999988877655555678899999887543 456654433 2334567 Q ss_pred HHHHHHHHHHHHhCCChHHcCCCcccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021532. 384 MISLMNNEIESITGTKSFSGGINSGSLG-STATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIR 462 (663) Q Consensus 384 ~~~~~~~~~~~~tGi~~~~~G~~~~~~~-~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ir 462 (663) +++++...+++.+|+...++|.++...+ .||+++++++++++.++..++++|++.++.+|++.++.+++++++.+.++| T Consensus 404 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R 483 (641) T protein:vir:94 404 EAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIR 483 (641) T ss_pred HHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhh Confidence 7899999999999999998887654333 499999999999999999999999999999999999999999999999999 Q ss_pred EecC-----eeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHHHhccCCCcchh----HH-HHHHHHHhhhhhh Q lcl|NC_021532. 463 VTND-----KFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQTLGPNEDPKIR----RD-IMADIMDLMRMPE 530 (663) Q Consensus 463 i~~~-----~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~~~~~~~~p~~~----~~-~l~~~~~l~~~~e 530 (663) +.|. .|+++.|+++.+++++. ..+. ........+.|.++++.+++. |.+. .. ++..+++.++++. T Consensus 484 ~~~~~~~~~~~~~~~p~~L~~~~~iv-~l~~~q~~~~~~~i~~l~~~~~~~a~~--P~v~d~~d~~~~~~~~~~~~g~~~ 560 (641) T protein:vir:94 484 MYVPEEQMDGFFEVSPEYLHYPYKFL-ALGANYVVERERMVTDLLQLLDISGRV--PQIGQSLDYALILEDLLRQMRFTD 560 (641) T ss_pred hhchhhhcccCCCCCccceeeeeeEe-ecchhHHHHHHHHHHHHHHHHHHhhcC--hhhhhcCCHHHHHHHHHHHhCCCC Confidence 9985 58899999999888873 2222 223334445566666666542 3221 22 3455566667666 Q ss_pred hhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 531 QAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKF 610 (663) Q Consensus 531 ~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~ 610 (663) ....++....++++....+++ .+++.++++++-......+ +.+.+..+.++. +..+-...+.+.+..++ T Consensus 561 p~~~ir~~~~~~~~~~~~~~~----~q~~~~~~a~~~~~~~~~~-----a~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 629 (641) T protein:vir:94 561 PMRYIKKAEAPPAAPPIAPAE----PGALPPEMMNSVGGGLNDQ-----AIAGMTPEDVSD--LASRIGIDTSDVAPEAM 629 (641) T ss_pred chhhccCccCchhHHHHHHHH----HHHHHHHHHHHHHhhhHHH-----HHHHhhHHHHHH--HHHhhcCCchhhhHHHH Confidence 666655433222211110000 0000000000000000000 000100111110 00000000011100000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 611 VKEDNGYAHLEQVELEDL 628 (663) Q Consensus 611 ~~~~~~~~~~~~~~~~~~ 628 (663) .+ ...+ .-.+++ T Consensus 630 ~~---~~~~---~~~~~~ 641 (641) T protein:vir:94 630 AA---ATQQ---ITSGAL 641 (641) T ss_pred hc---cccc---ccccCC Confidence 00 0000 000000 No 23 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=100.00 E-value=1.2e-43 Score=255.90 Aligned_cols=605 Identities=11% Similarity=0.070 Sum_probs=309.7 Q ss_pred CCCc--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCCccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKIN--------KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~--------~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |.=. -+.+..--...+..++...++|-+.-......|.+..+..-..+ . .+|.+...|..|+|++ T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~-~--r~nl~~sni~~i~P~i---- 73 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAE-T--RWNLFSTNIQTQMASL---- 73 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCccc-c--ccchhhhhHHHHhhhh---- Confidence 3221 13344344444555555555555544444555555554332222 1 4899999999999986 Q ss_pred cCCCceEEEEeCCcc-hHHHHHHHHHHHhHH----HHh-ccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccc Q lcl|NC_021532. 73 VSTADIIKCTPITWE-DTDSAEQNELLLNTQ----FSR-KFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAV 146 (663) Q Consensus 73 ~~~~~~~~~~p~~~~-D~~~Ae~~~~~~~~~----~~~-~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~ 146 (663) ++..|.+.|.|+..+ |...+...+++++.. +.. ..+....+...++|+++||+|++++.|..+.+++ .+.+. T Consensus 74 Yar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~--~~~~~ 151 (663) T protein:vir:34 74 YGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEV--AGVDA 151 (663) T ss_pred hcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchh--ccccc Confidence 588899999999887 444565555555443 322 2335556778899999999999999886554432 22222 Q ss_pred ccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccch Q lcl|NC_021532. 147 VVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGE 226 (663) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~ 226 (663) ..|+.+.....+..-........+.++++|++.+|.+||. +.|++++|++.+.||+++++.+. |..+.+....+... T Consensus 152 ~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pA--r~W~ev~wva~r~~mtk~e~~~r-f~~~~~~~~~a~~~ 228 (663) T protein:vir:34 152 ILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPA--RVWHEVRWLAFRNLLDMREFNAR-FDADGSRNLWASVP 228 (663) T ss_pred cCCCccccchhcccccchhhcccceeeeeechhhcccchh--hccccccceeeeccCCHHHHHHh-hcCChhhhhhhhcc Confidence 3333332221111111223334667899999999999995 46999999999999999999876 44444443332221 Q ss_pred hhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECC--EEEecccCCCcCCCC---CEEEEeeeeec Q lcl|NC_021532. 227 DFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIND--VIVRLQSNPYPDGKP---PFLVVPFNSIP 301 (663) Q Consensus 227 ~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~--~~l~~~~~p~~~~~~---Pf~~~~~~~~~ 301 (663) ...... .......+.+.++.+|||+|.|.. .++++++.| .+|+.++-|....-| ||..+++. .. T Consensus 229 ~~~~~~--~~~~~~~~~~~~~a~VwEIWdK~~--------~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~-~~ 297 (663) T protein:vir:34 229 KVGKPK--DGKDGQSCHPWDRAEVWEIWDKGG--------RKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANW-TT 297 (663) T ss_pred CcCCcc--ccCCCCCcchhcCcceeEEEecCC--------cEEEEEEcCcceecccCCCCCCCCCCCCCccccccee-cC Confidence 110000 011112233445789999998842 345555554 567755444322112 44433333 34 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc-hhh-hccCCcceEe-------CCCC--Cccc Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT-NRK-KFLAGANFEF-------NGTA--NDFW 370 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~-d~~-~~~p~~vi~~-------~~~~--~~~~ 370 (663) ++.++...+-...+.++++|.++.. +..+.....++++++.|+.... +.+ ...-+..+.+ ..+| +.+. T Consensus 298 ds~ipvpd~~~y~~~~~E~n~~t~R-in~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~ 376 (663) T protein:vir:34 298 DKVVPRPDFVLAQDLYKEIDLVSTR-ITLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVD 376 (663) T ss_pred CCeecCCcHHHHHHHHHHHHHHHHH-HHHHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhh Confidence 4666666666899999999987665 5567778899999986654311 111 1111222222 2233 5677 Q ss_pred cccCccccHHHHHHHH---HHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 371 HGSYNAIPSSAFDMIS---LMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKW 447 (663) Q Consensus 371 ~~~~~~~~~~~~~~~~---~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 447 (663) ++|.+++.+.+..+.+ .+...++++||++|++.|.. ..+.||++.+...+.++.+++.+.+.+.+ +.+++++.. T Consensus 377 ~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~--~a~ETatAQ~IKsq~gS~RIqe~qdevqR-~arDi~ql~ 453 (663) T protein:vir:34 377 WFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGAS--DPRETAMAQGVKAKFGSIRLQRLQDEVAR-FASDIQRLK 453 (663) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhccc--CcchhhHHHHHHHHHHhHHHHHHHHHHHH-HHHHHHHHH Confidence 7887777666665554 56788889999999999953 24789999999999999999999999965 688999999 Q ss_pred HHHHHHhcCCceEEEEecCeee---ccch-------hhcCCceEEEEeecc-cchhH-HHHHHH-------HHHHHHhcc Q lcl|NC_021532. 448 MAYNAEFLEEEEVIRVTNDKFV---PIRK-------DDLSGRIDIDISIST-AEDNA-AKSQEL-------SFLLQTLGP 508 (663) Q Consensus 448 ~~li~q~~~~~~~iri~~~~~v---~i~~-------~~~~~~~d~~v~~~~-~~~~~-~~~q~l-------~~~~~~~~~ 508 (663) -+.|.+.++-+.+-+++|.+.. +|.+ +.+ ..|.++|..++ +.... ...+.+ ..+++.+++ T Consensus 454 AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~-r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~p 532 (663) T protein:vir:34 454 AEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRF-SMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAP 532 (663) T ss_pred HHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCC-cceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999888878788875432 2222 222 34666665543 22222 112222 222222221 Q ss_pred C--CCcchhHHHHHHHHH--hhhhhhhhhhhhhhhcchhhHHHHhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 509 N--EDPKIRRDIMADIMD--LMRMPEQAKRMREYEPKPDPVQEKIRQLELE--NLMLENQMLVASINDKNARANENTIDA 582 (663) Q Consensus 509 ~--~~p~~~~~~l~~~~~--l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~--~~q~~~~~~~a~~~~~~a~~q~~~~~~ 582 (663) . ..|...+ .+..++. ..++.. ...+..+.. .+.....+.+.+ +++...++.++.+..++.+++.+.+++ T Consensus 533 l~~q~p~~~p-~l~Ellk~~~~~f~~-~~qie~ai~---~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeA 607 (663) T protein:vir:34 533 LAQQVPGSAP-FLLQMLKWSVSGLRG-SSTIEGVLD---KAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKV 607 (663) T ss_pred HHHhhhhhHH-HHHHHHHHHhhcCCh-hhhHHHHHH---HHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHH Confidence 1 1122222 1111111 111111 111111110 000000000000 000111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccc Q lcl|NC_021532. 583 ELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVV 662 (663) Q Consensus 583 ~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~ 662 (663) + ++++.++.+. ++++.+++. ++.+.++.. ..+..++..+. ...+.+ +.+..|=|-- T Consensus 608 q---~e~q~~~~~~----------ql~~~~~~~--k~~~~a~~~-~~~a~q~~~~~----~~~r~~----~~~a~~~~~~ 663 (663) T protein:vir:34 608 Q---AEVQGDLLRI----------QAETQANET--KERQQAEWN-VREAAQKNLIS----QAARAM----NPQARNGGMP 663 (663) T ss_pred H---HHHHHHHHHH----------HHHHHHHHH--HHHHHHHHH-HHHHHHhhHHH----HHHHhh----chhhhcCCCC Confidence 0 1111111111 111110000 000000000 00111111110 001111 1111121222 No 24 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=5.4e-35 Score=208.46 Aligned_cols=518 Identities=12% Similarity=0.084 Sum_probs=297.0 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCc-CCccccC---CCccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEP-YGNEQKG---KSAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~-~~~~~~g---~s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |.+ ...+.|.+.|+..++.|++++..|+++.+|.. +.- ......| .+.++.+.....++.+.+.|+..+|+ T Consensus 1 m~~-~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (556) T protein:vir:73 1 MAE-TEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSP 79 (556) T ss_pred CCh-hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCC Confidence 555 45777889999999999999999999999963 211 1112222 35677888888999999999999998 Q ss_pred CCceEEEEeCCcchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) +.+||++.+..++..+.+++ .+..|...+. .++++..++.++.+.+..|+|++.+..+.+ T Consensus 80 ~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------------- 145 (556) T protein:vir:73 80 ARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFN-KSNLYQSLPVMYASLGTFGTGAMAVMEDDQ------------- 145 (556) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeeeeecCC------------- Confidence 79999999876554433322 4444444454 467888889999999999999997755421 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..+++.+||+.+++.+.++. ++++..+|...+.+.....++ .+...... T Consensus 146 --------------------~~~r~~~~~l~~~~~~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l---~~~v~~~~ 199 (556) T protein:vir:73 146 --------------------DVIRTMPFPIGSYYLANSPRGSVDT---CIRQFSMTVRQMVQEFGLDNV---STSVKGMW 199 (556) T ss_pred --------------------ceEEEEEeecceeEEeeCCCCCeEE---EEEEEeccHHHHHHHcCcccC---CHHHHHHH Confidence 1234567899999999988776644 788899999999876332222 11111111 Q ss_pred hccccccccccccccccceEEEEEE-EEEeeecC---CceeEEEEEE-EE----CCEEEecccCCCcCCCCCEEEEeeee Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEY-WGNYDVDG---DGIAEPIVCA-WI----NDVIVRLQSNPYPDGKPPFLVVPFNS 299 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~-w~~~~~~~---~g~~~~~~~~-~~----g~~~l~~~~~p~~~~~~Pf~~~~~~~ 299 (663) +. +....+|.|+.| |.+.+.+. ++.-.++..+ |. ++++++ ++.| ..+||++..|.. T Consensus 200 ~~-----------~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~--esg~--~e~P~~~~Rw~~ 264 (556) T protein:vir:73 200 EN-----------GTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLR--ESGF--DEFPILAPRWEV 264 (556) T ss_pred hc-----------CCccceEEEEEEEeccccccccccCcccceEEEEEEEecCCCceecc--cCCc--ccCCceeeeeee Confidence 10 011124566554 33332222 1112223322 32 235664 5666 568999999999 Q ss_pred ecCcccCCC-hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCC--CccccccCc- Q lcl|NC_021532. 300 IPFKLHGEA-NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTA--NDFWHGSYN- 375 (663) Q Consensus 300 ~~~~~~g~g-~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~--~~~~~~~~~- 375 (663) .++..||+| ++....+..+.+|++.+.++.++...++|++.++.+... ...+..||+++.+..++ +.+.++... T Consensus 265 ~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~--~~~~~~pgg~~~~~~~~~~~~i~p~~~~~ 342 (556) T protein:vir:73 265 NGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKN--QRVSLLPGDVTYLDVISGQDGFKPAYLVN 342 (556) T ss_pred cCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccc--cceeeccCccccccCCCCccceeeecccc Confidence 999999999 799999999999999999999999999999999887532 34567899977554332 334443221 Q ss_pred cccHHHHHHHHHHHHHHHHHhCCChH-HcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 376 AIPSSAFDMISLMNNEIESITGTKSF-SGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEF 454 (663) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~tGi~~~-~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~ 454 (663) +--..+.+.++.+.+.|....-.+-+ +++. .++..-||++|+.+.+.....|..++.+|...++.|++.+.+.++.+. T Consensus 343 ~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~-~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~ 421 (556) T protein:vir:73 343 PNTADLLADIQDTRQTINSAYFVDLFMMLQN-INTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARK 421 (556) T ss_pred ccHHHHHHHHHHHHHHHHHHhhcchhhhhcc-CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 11234556677777777766554432 2233 233346999999999999999999999999899999999999988885 Q ss_pred cCCceEEEEecCeeeccchhhcCC-ceEEEEeecccchhHHHHHH---HHHHHHHhcc--CCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 455 LEEEEVIRVTNDKFVPIRKDDLSG-RIDIDISISTAEDNAAKSQE---LSFLLQTLGP--NEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 455 ~~~~~~iri~~~~~v~i~~~~~~~-~~d~~v~~~~~~~~~~~~q~---l~~~~~~~~~--~~~p~~~~~~l~~~~~l~~~ 528 (663) .- ++--|+.+.+ .+++.+. +++....+... +..+++.+++ +++|++.. .-+. T Consensus 422 g~------------lP~~P~~l~~~~i~v~yi--s~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d--------~id~ 479 (556) T protein:vir:73 422 NM------------LPEPPDVLQGMPLRIEYI--SVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALD--------KLDV 479 (556) T ss_pred CC------------CCCCchhhcCceeEEEee--cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHh--------cCCH Confidence 21 2223334432 3334333 34444333332 2333333322 23454321 1122 Q ss_pred hhhhhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_021532. 529 PEQAKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK-ARKLSSEAD 604 (663) Q Consensus 529 ~e~~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~-~q~~~~~~~ 604 (663) .+..+.+....+-|...- ++.+++.+++++++ +.++ +.+.++.+ ++.....+.+...... .+.+...+. T Consensus 480 d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~q-q~~~-----~~~~~~~a-~~~~~~~~~~~~~~~~~l~~~~~~~g 552 (556) T protein:vir:73 480 DQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQA-QAAQ-----AMAMGQAA-AQGAKTLSETQTSDPSALTAIANAAG 552 (556) T ss_pred HHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHH-HHHH-----HHHHHHHH-HHHHHHhhhccCCCHHHHHHHHHhhc Confidence 222222222222111100 00000000000000 0000 00000000 0000000000000000 000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 605 MTDLKFVKEDNGYAHLEQVELE 626 (663) Q Consensus 605 ~~~~e~~~~~~~~~~~~~~~~~ 626 (663) +-.| T Consensus 553 ------------------~~~~ 556 (556) T protein:vir:73 553 ------------------APQQ 556 (556) T ss_pred ------------------CCCC Confidence 0000 No 25 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=6e-35 Score=208.20 Aligned_cols=507 Identities=8% Similarity=0.033 Sum_probs=297.8 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCC-cc------ccCCCccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPYG-NE------QKGKSAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~~-~~------~~g~s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |+.. .|.+.|+..++.|+.++..|+++.+|.. +.... .. .+..+.++.+.....++.+.+.|+..+ T Consensus 1 ~~~~----~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~l 76 (547) T protein:vir:10 1 MENS----KIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSL 76 (547) T ss_pred CCHH----HHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhh Confidence 5554 4556788889999999999999999874 11111 11 112355677888889999999999999 Q ss_pred cC-CCceEEEEeCCcchHHH---H---HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccc Q lcl|NC_021532. 73 VS-TADIIKCTPITWEDTDS---A---EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEA 145 (663) Q Consensus 73 ~~-~~~~~~~~p~~~~D~~~---A---e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~ 145 (663) |+ +.+||++.+...+..+. . +..+..|...+. .++++..++.++.+.++.|+|++.+..|.+. T Consensus 77 tPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~--------- 146 (547) T protein:vir:10 77 TSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSNFNLEANETYIDLCGYGNAIMVEEEDEDE--------- 146 (547) T ss_pred cCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEeccCCCC--------- Confidence 98 78999998754432222 2 223344434344 4678888899999999999999988654211 Q ss_pred cccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 146 VVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) .+.+.+..+++.+||+..++.+.+++ ++++..+|..++.+..-...+ ++... T Consensus 147 ----------------------~~~~r~~~~pl~~~~v~~d~~G~v~~---i~r~~~~t~~qi~~~fg~~~l---~~~v~ 198 (547) T protein:vir:10 147 ----------------------EGSVVFQSSPIQDSYFEEDSRGQVVN---FYRVFRWTPAQIYDRFGDEGT---PEAII 198 (547) T ss_pred ----------------------CCceeEEEeecceEEEeeCCCcCeee---eeeeeeccHHHHHHhcCcccC---CHHHH Confidence 13345778999999999988776655 678899999999886322222 11111 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEee-ecCCc--------eeEEEEEEE--EC--CEEEecccCCCcCCCCCE Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-VDGDG--------IAEPIVCAW--IN--DVIVRLQSNPYPDGKPPF 292 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~~~~g--------~~~~~~~~~--~g--~~~l~~~~~p~~~~~~Pf 292 (663) .... . ..+....++.++.|.+... .+.+. .-..+..+| .+ .++++ ++.| ..+|| T Consensus 199 ~~~~---~------~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~--esg~--~e~P~ 265 (547) T protein:vir:10 199 KKAK---E------ASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGE--EGGY--YEMPA 265 (547) T ss_pred HHHh---c------CCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCceeeee--cCCc--ccCCe Confidence 1110 0 0111123466666544332 11110 011122222 22 34554 5556 46899 Q ss_pred EEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccc Q lcl|NC_021532. 293 LVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHG 372 (663) Q Consensus 293 ~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~ 372 (663) ++..|...+|..||+|+++...+..+.+|++.+.++.++.++++|+++++.+.+.. ..+..||+++.+.+ .+.++++ T Consensus 266 ~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~--~~~~~pgg~~~~~~-~~~v~pl 342 (547) T protein:vir:10 266 YAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLIS--DIDLGASGLTVVRD-MESMKPF 342 (547) T ss_pred eeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc--cceecCCeeeecCC-cccceee Confidence 99999999999999999999999999999999999999999999999988665432 35567999887653 4566666 Q ss_pred cCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 373 SYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNA 452 (663) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~ 452 (663) ....-.......++.+.+.|....=++.+.+. ++..-||++|+.+.+.....|..++.+|...++.|++.+.+.++. T Consensus 343 ~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~---~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~ 419 (547) T protein:vir:10 343 ESRARFDVSSIQLTDLRSAVRRIYYVDQLQMK---DSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRF 419 (547) T ss_pred ecccchHHHHHHHHHHHHHHHHHhhhhhhhcC---CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 66555555677788888887765433333322 234579999999999999999999999998899999999999887 Q ss_pred HhcCCceEEEEecCeeeccchhhc-C-CceEEEEeecccchhHHHHHHHHH---HHHHhcc--CCCcchhHHHHHHHHHh Q lcl|NC_021532. 453 EFLEEEEVIRVTNDKFVPIRKDDL-S-GRIDIDISISTAEDNAAKSQELSF---LLQTLGP--NEDPKIRRDIMADIMDL 525 (663) Q Consensus 453 q~~~~~~~iri~~~~~v~i~~~~~-~-~~~d~~v~~~~~~~~~~~~q~l~~---~~~~~~~--~~~p~~~~~~l~~~~~l 525 (663) +..- +-++ |+.+ . +..++.|+.-.++....+...+.. +++.+++ +++|++. +. T Consensus 420 r~g~-----------lP~~-p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vl--------d~ 479 (547) T protein:vir:10 420 RAGK-----------LGEL-PSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVL--------DI 479 (547) T ss_pred hcCC-----------CCCC-chhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh--------hc Confidence 7422 1122 2222 1 122445554456666655554433 3333322 2344321 11 Q ss_pred hhhhhhhhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 MRMPEQAKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKAR 597 (663) Q Consensus 526 ~~~~e~~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q 597 (663) -+..+..+.+....+-|...- ++..++.++++++ ++.+++ .+.++.+ .++...+....+-..+-| T Consensus 480 id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~--~q~~~q----aa~~~~~-g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 480 PDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQT--QQKAEQ----AAIAEAE-GNAMEAQGKGQAALKENQ 547 (547) T ss_pred CCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHH--HHHHHH----HHHHHHH-HHHHHhhcCcccchhccC Confidence 122222233222222211100 0000000000000 000000 0000000 000000000000000000 No 26 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=3.3e-35 Score=209.60 Aligned_cols=523 Identities=12% Similarity=0.101 Sum_probs=295.4 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcC-Cccc---cCCCccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPY-GNEQ---KGKSAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~-~~~~---~g~s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |. +.+++.|.+.|+..++.|++++..|+++.+|.. +.-. .... +..+.++.+.....++.+.+.|+..+|+ T Consensus 1 m~-~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp 79 (559) T protein:vir:95 1 MA-ETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) T ss_pred CC-hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Confidence 43 344778899999999999999999999999963 2211 1111 2235677888888999999999999998 Q ss_pred CCceEEEEeCCcchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) +.+||++.+..++..+.+++ .+..|...+. .++++..++.++.+.+..|+|++.+.+|.+ T Consensus 80 ~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~Gta~l~~~~d~~------------- 145 (559) T protein:vir:95 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSNLYQSLPQLYGSLGTYSTGAMAVLDDDE------------- 145 (559) T ss_pred CCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeeEeecCCC------------- Confidence 79999998765543332222 2333433343 467888889999999999999988765421 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +...+..+++.+||+..++.+.++. ++++..+|..++.+......+ .+...... T Consensus 146 --------------------~~~r~~~~~l~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l---~~~~~~~~ 199 (559) T protein:vir:95 146 --------------------DIIRTMPFPIGSYYLANSPRGSVDT---CFRKFSMTVRQLVQEFGLNNV---SESVKSMW 199 (559) T ss_pred --------------------ceeEEEEeecCeEEEeeCCCCCeEE---EEEeEecCHHHHHHHcCcccC---CHHHHHHH Confidence 1234567899999999988776644 688899999999876332222 11111111 Q ss_pred hccccccccccccccccceEEEEEEEE-EeeecCCce---eEEEE-EEEE---C-CEEEecccCCCcCCCCCEEEEeeee Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWG-NYDVDGDGI---AEPIV-CAWI---N-DVIVRLQSNPYPDGKPPFLVVPFNS 299 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~-~~~~~~~g~---~~~~~-~~~~---g-~~~l~~~~~p~~~~~~Pf~~~~~~~ 299 (663) +. +...+.|.|+.|-+ +.+.+.++. -..+. ++|. + .++++ ++.| .++||++..|.+ T Consensus 200 ~~-----------~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~--esg~--~e~P~~~~Rw~~ 264 (559) T protein:vir:95 200 ES-----------GTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLR--ESGF--DEFPIMAPRWEV 264 (559) T ss_pred hc-----------CCCCCeEEEEEEEeccccccccccccccceEEEEEEEecCCCceeee--cCCc--ccCCccceeeee Confidence 10 01123477776533 333222211 11122 2222 2 35664 4555 468999999999 Q ss_pred ecCcccCCC-hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCC--CccccccC-c Q lcl|NC_021532. 300 IPFKLHGEA-NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTA--NDFWHGSY-N 375 (663) Q Consensus 300 ~~~~~~g~g-~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~--~~~~~~~~-~ 375 (663) .++..||+| ++....+..+.+|++.+..+.++...++|++.++.+... ...+..||++..+.+.+ +.+.++.. + T Consensus 265 ~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~--~~~~l~pgg~~~~~~~~~~~~i~p~~~~~ 342 (559) T protein:vir:95 265 NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--QRASLLPGDITYIDQITGQDGFRPAYLVN 342 (559) T ss_pred cCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccc--cceeeeccceeeeCCCCCcccceeecccc Confidence 999999999 799999999999999999999999999999999877643 33557799988776543 23333321 1 Q ss_pred cccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021532. 376 AIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFL 455 (663) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~ 455 (663) +-...+...++.+.+.|....-.+-+.+=...++..-||++|+.+.+.....|..++.+|...++.|++.+.+.++.+.. T Consensus 343 ~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g 422 (559) T protein:vir:95 343 PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKN 422 (559) T ss_pred cchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 22233445567777777776655432221122333459999999999999999999999998999999999999988853 Q ss_pred CCceEEEEecCeeeccchhhcCC-ceEEEEeecccchhHHHHH---HHHHHHHHhcc--CCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 456 EEEEVIRVTNDKFVPIRKDDLSG-RIDIDISISTAEDNAAKSQ---ELSFLLQTLGP--NEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 456 ~~~~~iri~~~~~v~i~~~~~~~-~~d~~v~~~~~~~~~~~~q---~l~~~~~~~~~--~~~p~~~~~~l~~~~~l~~~~ 529 (663) - ++.-|+.+.+ .+++.+. +++....+.. .+..+++.+++ +++|++. +.-+.. T Consensus 423 ~------------lP~~p~~l~~~~i~v~~i--s~La~aqk~~~~~~i~~~~~~~~~laq~~Pevl--------d~id~d 480 (559) T protein:vir:95 423 M------------LPPPPDVMEGMPLKVEYI--SVMAQAQKSIGLSSLASTVNFIGQLAQVKPEAL--------DKLNVD 480 (559) T ss_pred C------------CCCCcccccCcceEEEee--cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhh--------hcCCHH Confidence 1 2223333332 3344443 4444433332 22233332222 2345432 111222 Q ss_pred hhhhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMT 606 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~ 606 (663) +..+.+....+-|...- ++..++.++++++++++++++...+.++..+...+++.. ....++...... T Consensus 481 ~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~---------~~~~l~~~~~~~ 551 (559) T protein:vir:95 481 QAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTS---------DPSVLSAMANAV 551 (559) T ss_pred HHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCC---------ChhHHHHHHHhh Confidence 22222222222211100 001111001100000000000000000000000000000 000000000000 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_021532. 607 DLKFVKEDNGYAH 619 (663) Q Consensus 607 ~~e~~~~~~~~~~ 619 (663) .-.-. +++ T Consensus 552 ~~~~~-----~~~ 559 (559) T protein:vir:95 552 SGQGG-----QSQ 559 (559) T ss_pred cCccc-----cCC Confidence 00000 000 No 27 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=9.1e-35 Score=207.22 Aligned_cols=509 Identities=11% Similarity=0.011 Sum_probs=291.7 Q ss_pred CcHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----cccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 3 INKAE----LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN----EQKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 3 ~~~~~----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |.+++ -.+.+++.|+..++.|+.++..|++..+|..-..... ..+....++.......++.+.+.|+..+|+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 44444 2466778899999999999999999999975332211 111123456777778899999999999999 Q ss_pred CCceEEEEeCCc-------chHHHH------HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 75 TADIIKCTPITW-------EDTDSA------EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 75 ~~~~~~~~p~~~-------~D~~~A------e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) +.+||++.+... ++.+.+ +..+..|...+. .++++..++.++.+.+..|+|++.+..+.. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------ 153 (535) T protein:vir:15 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE-SNSYRVTLFECLKQLIVAGNALLYLPEPEG------ 153 (535) T ss_pred CCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceeEEeecCCC------ Confidence 999999987532 222212 122333333343 577888899999999999999988764311 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) +...++.++..++|+.+++.+.++. ++++..+|...|.+. +-. T Consensus 154 ---------------------------~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~-~~~------ 196 (535) T protein:vir:15 154 ---------------------------SYNPMKLYRLSSYVVQRDAYGNVLQ---IVTRDQIAFGALPED-VRS------ 196 (535) T ss_pred ---------------------------CceeeEEEEcCeeEEeeCCCCCeeE---EEEeEeecHHHHHHH-HhH------ Confidence 1224566888899999887766654 788999998887543 111 Q ss_pred hccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeec Q lcl|NC_021532. 222 KTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIP 301 (663) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~ 301 (663) +..... ......+.|.||++.++. .+++...++ ..+.+..+...++.|++..+||++..|...+ T Consensus 197 -----~~~~~~-------~~~~~~~~v~v~~~v~~~--~~~~~~~~~--~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ 260 (535) T protein:vir:15 197 -----AVEKAG-------GEKKMDEMVDVYTHVYLD--EESGDYLKY--EEVEDVEIDGSDATYPTDAMPYIPVRMVRID 260 (535) T ss_pred -----hhhccc-------cccCCCCceeEEEEEEEe--cCCCcEEEE--EEeeCccccccccccccccCCceeeeeeecC Confidence 000000 011223468888876543 233333333 2344444443456677788999999999999 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCccccccC--cccc Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGSY--NAIP 378 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~~--~~~~ 378 (663) +..||+|+++...+..+.+|++.+..+.+...+++|+++++++.+. +.+.....+|.++.-. .+.+.+++. .+-. T Consensus 261 ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~v~g~--~~~v~~~~~~~~~~~ 338 (535) T protein:vir:15 261 GESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGR--REDIDFLQLEKQADF 338 (535) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCceeeecCC--cccceeeecccccch Confidence 9999999999999999999999999999999999999999776654 3333344444444322 234444443 2334 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) +.....++.+.+.|.... ..+ +++.. ++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+.. T Consensus 339 ~~~~~~i~~~~~~I~~af-~~~-~~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g--- 412 (535) T protein:vir:15 339 TVAKAVSDQIEARLSYAF-MLN-SAVQR-TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATS--- 412 (535) T ss_pred hHHHHHHHHHHHHHHHHH-hhh-hcccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 567778888888887765 222 22222 233469999999999999999999999999999999999999887642 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMRE 537 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~ 537 (663) .+.++..+ .+++.+.++.+ ..+....+.+..+++.++. +.|.+.-. .-+..+..+.+.. T Consensus 413 --------~lP~~p~~----~v~~~yis~La~aqr~~~~~~l~~~~~~la~-~~P~~ld~-------~id~d~~~~~~a~ 472 (535) T protein:vir:15 413 --------QIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCISAWAA-LAPMQGDP-------DINLAVIKLRIAN 472 (535) T ss_pred --------CCCCCCcc----ceeEEEecHHHHHHHHHHHHHHHHHHHHHHh-cChhhhhc-------cCCHHHHHHHHHH Confidence 22223222 23444443322 1222333344445555544 33332110 0112222222222 Q ss_pred hhcchh-hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 538 YEPKPD-PVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFV 611 (663) Q Consensus 538 ~~~~~~-~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~ 611 (663) ..+-+. ..-..+.+.++.++++ +++.+.+++..+..+..+ ..+...- +.+++.....-++.- T Consensus 473 ~~Gvp~~~i~~~~eev~~~~~q~-----~~~~~~~~~a~~~g~~~~--~~~~~~p-----~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 473 AIGIDTSGILLTDEQKQALMMQD-----AAQTGIENAAATGGAGVG--ALATSSP-----EAMQGAAAQAGLDAT 535 (535) T ss_pred HcCCChhhhcCCHHHHHHHHHHH-----HHHHHHHHHHHHHHhhcc--chhccCh-----HHHHHHHhccCCCCC Confidence 222110 0000000000000000 000000000000000000 0000000 011111110000000 No 28 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=3.1e-34 Score=204.30 Aligned_cols=515 Identities=14% Similarity=0.072 Sum_probs=303.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CC----cCCccccCC---CccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GE----PYGNEQKGK---SAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~----~~~~~~~g~---s~~~~~~i~~~v~~~~~~l~~ 70 (663) |+-+.+.+++.|++.|+..++.|++++..|+++.+|.. |. .......|+ +.++.+.-...++.+.+.|+. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999854 21 111222332 345667777789999999999 Q ss_pred hhcC-CCceEEEEeCCcchHHHHHH---HH---HHHhHHHH-hccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 71 PFVS-TADIIKCTPITWEDTDSAEQ---NE---LLLNTQFS-RKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 71 ~~~~-~~~~~~~~p~~~~D~~~Ae~---~~---~~~~~~~~-~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+|+ +.+||++.+...+..+.+++ +. ..+...+. ..++++..++.++.+.+..|+|++.+..+.. T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~------- 153 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVG------- 153 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCC------- Confidence 9998 68999998866554443322 22 23222221 3577888899999999999999988753311 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) +...+..++..++|+..++.+.++. ++++..+|...+.+..-... +.+ T Consensus 154 --------------------------~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~---l~~ 201 (549) T protein:vir:10 154 --------------------------KGIVYRNVPMQRLWFAENNSGLIDK---THVQWELTLRQAAQRFGREN---LSP 201 (549) T ss_pred --------------------------CeeEEEEEEcCeEEEeeCCCCCeEE---EEEEeecCHHHHHHhcCccc---CCH Confidence 1234567899999999987776644 78899999999987632221 221 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEE-eeec---CCceeEEEEEE---EECCEEEecccCCCcCCCCCEEEE Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGN-YDVD---GDGIAEPIVCA---WINDVIVRLQSNPYPDGKPPFLVV 295 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~-~~~~---~~g~~~~~~~~---~~g~~~l~~~~~p~~~~~~Pf~~~ 295 (663) ......+ ..+.+.|.||.+=+. .+.+ .++.-..+..+ ..++++|+ ++.| ..+||++. T Consensus 202 ~v~~~~~------------~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~--esg~--~e~P~~~~ 265 (549) T protein:vir:10 202 SMQSTLE------------KDPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQ--NSGF--RTFPFAIG 265 (549) T ss_pred HHHHHhh------------cCCCceEEEEEEeecCCCCCccccccccCceEEEEEEecCCEeec--cCCc--ccCCccee Confidence 1111111 012345677654221 1111 12221122222 23456765 4555 46899999 Q ss_pred eeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCC--CC-Cccccc Q lcl|NC_021532. 296 PFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNG--TA-NDFWHG 372 (663) Q Consensus 296 ~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~--~~-~~~~~~ 372 (663) .|...+|..||+|+++...+..+.+|++.+..+.++..+++|+++++.+.+.. ..+..||++..+.. ++ ..+.++ T Consensus 266 Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~--~~~l~pgg~~~~~~~~~~~~~~~pl 343 (549) T protein:vir:10 266 RFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD--GFDLRSGALNWGGLNDKGEEMVKPL 343 (549) T ss_pred eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc--cceeccCCccccccCCCCccceeee Confidence 99999999999999999999999999999999999999999999998765432 23457888765432 22 223444 Q ss_pred cCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 373 SYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNA 452 (663) Q Consensus 373 ~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~ 452 (663) ....-.+.+..+++.+.+.|....-++-+.+-. ++..-||++|+.+.+.....|..++.++...++.|++.+.+.++. T Consensus 344 ~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~--~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~ 421 (549) T protein:vir:10 344 LTGKQAQIGIEFAQDTRQTINQWFYVTLFQILV--DSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILA 421 (549) T ss_pred ccccchhHHHHHHHHHHHHHHHHHhhhhhhhhc--CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 333334556777888888888766544433322 344579999999999999999999999988899999999999887 Q ss_pred HhcCCceEEEEecCeeeccchhhcC-CceEEEEeecccchhHHHHHHHHHHHHH---hcc--CCCcchhHHHHHHHHHhh Q lcl|NC_021532. 453 EFLEEEEVIRVTNDKFVPIRKDDLS-GRIDIDISISTAEDNAAKSQELSFLLQT---LGP--NEDPKIRRDIMADIMDLM 526 (663) Q Consensus 453 q~~~~~~~iri~~~~~v~i~~~~~~-~~~d~~v~~~~~~~~~~~~q~l~~~~~~---~~~--~~~p~~~~~~l~~~~~l~ 526 (663) +..- + +--|+++. ...++.|..-+++....+...+..+++. +++ +++|++. +.- T Consensus 422 r~g~-----------l-P~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~l--------d~i 481 (549) T protein:vir:10 422 EAGQ-----------L-PDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAA--------KVP 481 (549) T ss_pred hcCC-----------C-CCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHH--------hcC Confidence 7421 2 22233332 2233444433456665555554433332 222 2444331 112 Q ss_pred hhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 527 RMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADM 605 (663) Q Consensus 527 ~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~ 605 (663) +..+..+.+....+-|...--...+.++..++.+++++++ ++.+.+ . ..+.+......+++....+.. T Consensus 482 d~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~---~~~~~a-----~---~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 482 NGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQ---QMLAAA-----P---VAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred CHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHH---HHHHHH-----H---HHHHHHHhhhhhcCCCcccCC Confidence 2223333333332222110000000000000000000000 000000 0 000000011111110000000 No 29 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=5e-34 Score=203.16 Aligned_cols=504 Identities=11% Similarity=0.036 Sum_probs=292.0 Q ss_pred CcHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc----ccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 3 INKAEL----LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE----QKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 3 ~~~~~~----~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~----~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |.+.++ .+.+++.|+..++.|+.++..|++..+|..-...... .+....++.......++.+.+.|+..+|+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 666663 5777888999999999999999999999754322211 11123456677777899999999999999 Q ss_pred CCceEEEEeCCcc-------hHHHH------HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 75 TADIIKCTPITWE-------DTDSA------EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 75 ~~~~~~~~p~~~~-------D~~~A------e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) +.+||++.+...+ +.+.+ +..+..|...+ ..++++..++.++.+.+..|+|++.+..+.. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~------ 153 (535) T protein:vir:33 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI-ESNSYRVTLFECLKQLIVAGNALLYLPEPEG------ 153 (535) T ss_pred CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCceeEEeecCCC------ Confidence 9999999875421 11212 12233333334 3577888899999999999999998765411 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) +...++.++..++|+.+++.+.++. ++++..+|..+|.+.. ..... . T Consensus 154 ---------------------------~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~~-~~~~~--~ 200 (535) T protein:vir:33 154 ---------------------------SYNPMKLYRLSSYVVQRDAYGNVLQ---IVTRDQIAFGALPEDV-RSAVE--K 200 (535) T ss_pred ---------------------------CceeeEEEEcCeeEEeeCCCCCeeE---EEeeEeecHHHHHHHh-hhhhc--c Confidence 1234567888999999887766555 7889999999885531 11000 0 Q ss_pred hccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeec Q lcl|NC_021532. 222 KTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIP 301 (663) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~ 301 (663) ... .....+.+.+|.|.++. ..+|...++ ..+.+..+...++.|+++.+||++..|...+ T Consensus 201 ----~~~------------~k~~~~~~~v~~~v~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ 260 (535) T protein:vir:33 201 ----SGG------------EKKMDEMVDVYTHVYLD--EESGDYLKY--EEVEDVEIDGSDATYPTDAMPYIPVRMVRID 260 (535) T ss_pred ----ccc------------ccccccCCeEEEEEEee--CCCCcEEEE--EEEeCccccccccccccccCCceeeeeeecC Confidence 000 01112346677665432 223333333 3444544544556677788999999999999 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-chhhhccCCcceEeCCCCCccccccC--cccc Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ-TNRKKFLAGANFEFNGTANDFWHGSY--NAIP 378 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~-~d~~~~~p~~vi~~~~~~~~~~~~~~--~~~~ 378 (663) +..||+|+++...+..+.+|++.+..+.+...+++|+++++++.+.. .+.....+|.++.- ..+.+.+++. .+-. T Consensus 261 ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~v~g--~~~~v~~~~~~~~~~~ 338 (535) T protein:vir:33 261 GESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPG--RREDIDFLQLEKQADF 338 (535) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCceeeecC--Ccccceeeecccccch Confidence 99999999999999999999999999999999999999997776543 33333344444332 2234444443 2334 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) +.....++.+.+.|.... ..+ +++.. ++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+.. T Consensus 339 ~~~~~~i~~~~~~I~~af-~~~-~~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g--- 412 (535) T protein:vir:33 339 TVAKAVSDQIEARLSYAF-MLN-SAVQR-TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATS--- 412 (535) T ss_pred hHHHHHHHHHHHHHHHHH-hhh-hcccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 567778888888887765 222 22222 233469999999999999999999999999999999999999887642 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHH---HHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKS---QELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM 535 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~---q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l 535 (663) .+.++..+ .+++.+.+ ++....+. +.+..+++.++. +.|.+.-. .-+..+..+.+ T Consensus 413 --------~lP~~p~~----~v~~~yis--~La~aqr~~~~~~l~~~~~~la~-~~P~~~d~-------~id~d~~~~~~ 470 (535) T protein:vir:33 413 --------QIPELPKE----AVEPTIST--GLEAIGRGQDLDKLERCISAWAA-LAPMQGDP-------DINLAVIKLRI 470 (535) T ss_pred --------CCCCCCcc----ceeEEEec--HHHHHHHHHHHHHHHHHHHHHHh-hChhhhhc-------cCCHHHHHHHH Confidence 22223222 24444443 33333333 344445555544 33322110 01122222222 Q ss_pred hhhhcchhh-H---HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDP-V---QEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFV 611 (663) Q Consensus 536 ~~~~~~~~~-~---~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~ 611 (663) ....+-+.. . +++.+++.+++ ++++.+.+++.+ ..+..+. .+....+.. +..++..-++.- T Consensus 471 a~~~Gvp~~~i~~~~ee~~~~~~q~-------~~~~~~~~~~~~-~g~~~~~--~~~~~~~~~-----~~~~~~~g~~~~ 535 (535) T protein:vir:33 471 ANAIGIDTSGILLTDEQKQALMMQD-------AAQTGVENAAAA-GGAGVGA--LATSSPEAM-----QGAAAKAGLNAT 535 (535) T ss_pred HHHcCCCHhHhcCCHHHHHHHHHHH-------HHHHHHHHHHHh-hhhhhcc--hhhcCChhH-----HHHHHhccCCCC Confidence 222221100 0 00000100000 000000000000 0000000 000001000 000111011100 No 30 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=1.5e-33 Score=200.60 Aligned_cols=521 Identities=11% Similarity=0.073 Sum_probs=305.5 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCC-ccccCC---CccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPYG-NEQKGK---SAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~~-~~~~g~---s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |......+.|.+.|+..++.|+.++..|+++.+|.. +.-.. ....|+ ..++.......++.+.+.|+..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888899999999999999999999999999973 22211 222222 4467778888899999999999998 Q ss_pred CCceEEEEeCCcchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) +.+||++.+..++..+.+++ .+..+...+. .++++..++.++.+.+..|+|++.+..|.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------------- 146 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD------------- 146 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC------------- Confidence 79999999876654433322 3334433343 577888889999999999999987654321 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +...+..+++.++++..++.+.++. ++++..+|...+.++....++ ++...... T Consensus 147 --------------------~~~rf~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l---~~~~~~~~ 200 (555) T protein:vir:10 147 --------------------AVVYHHSLTAGEYAIAADNQGRVNT---LYREFQITVAQMVREFGKDKC---STTVQSLF 200 (555) T ss_pred --------------------ceEEEEEeecceeEEeeCCCCCEEE---EEEEEeccHHHHHHhcCcccC---CHHHHHHH Confidence 1224556889999999887776633 778899999999887322222 11111111 Q ss_pred hccccccccccccccccceEEEEEEEEEee-ecC---CceeEEEE-EEEE----CCEEEecccCCCcCCCCCEEEEeeee Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-VDG---DGIAEPIV-CAWI----NDVIVRLQSNPYPDGKPPFLVVPFNS 299 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~~~---~g~~~~~~-~~~~----g~~~l~~~~~p~~~~~~Pf~~~~~~~ 299 (663) +. +.....|+|+.|.+..+ .+. ++....+. ++|. |.+++ .++.| ..+||++..|.+ T Consensus 201 ~~-----------~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~ 265 (555) T protein:vir:10 201 DR-----------GALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWAL 265 (555) T ss_pred hc-----------CCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeee Confidence 10 01113578888765432 211 11111222 2222 23555 35556 468999999999 Q ss_pred ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCC--Ccccc-ccCcc Q lcl|NC_021532. 300 IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTA--NDFWH-GSYNA 376 (663) Q Consensus 300 ~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~--~~~~~-~~~~~ 376 (663) .+|..||+|+++...+..+.+|++.+..+.++...++|++.++.+... +..+..||++..+.++. +.+.+ ..... T Consensus 266 ~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~ 343 (555) T protein:vir:10 266 VGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNL 343 (555) T ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc--ccceeccccccccccCCCCcceeccccccc Confidence 999999999999999999999999999999999999999999887642 23577899987665432 22222 12222 Q ss_pred ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 377 IPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 377 ~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) ..+.+.+.++.+.+.|....=.+-+.++...++..-||++|..+.+.....|..++.++...++.|++++.+.++.+.. T Consensus 344 d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g- 422 (555) T protein:vir:10 344 DLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN- 422 (555) T ss_pred chHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 3355667788888888776533323333334455679999999999999999999999998999999999999888752 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHH---HHHHhcc--CCCcchhHHHHHHHHHhhhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSF---LLQTLGP--NEDPKIRRDIMADIMDLMRMPEQ 531 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~---~~~~~~~--~~~p~~~~~~l~~~~~l~~~~e~ 531 (663) .++.-|+.+.+ .++.|..-+++....+...+.. +++.+++ +++|.+. +.-+..+. T Consensus 423 -----------~lP~~P~~l~~-~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vl--------d~id~d~~ 482 (555) T protein:vir:10 423 -----------ILPPPPQEMQG-VDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVL--------DKFDADRW 482 (555) T ss_pred -----------CCCCCchhhcC-ceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhh--------hcCCHHHH Confidence 22333444433 2344444345555555544333 3333322 2344321 11122222 Q ss_pred hhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 532 AKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDL 608 (663) Q Consensus 532 ~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~ 608 (663) .+.+....+-|...- ++..++.++++++++++.++.+..+.++..+....+..-. ..... T Consensus 483 ~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~-------~~~~~---------- 545 (555) T protein:vir:10 483 ADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSK-------QNALT---------- 545 (555) T ss_pred HHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCc-------chhHH---------- Confidence 222222222111100 0011111111111000000000000000000000000000 00000 Q ss_pred HHHHHHHHHH Q lcl|NC_021532. 609 KFVKEDNGYA 618 (663) Q Consensus 609 e~~~~~~~~~ 618 (663) .......... T Consensus 546 ~~~~~~~~~~ 555 (555) T protein:vir:10 546 DVTRAFSGYT 555 (555) T ss_pred HHHhhhccCC Confidence 0000000000 No 31 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=1.5e-33 Score=200.60 Aligned_cols=521 Identities=11% Similarity=0.073 Sum_probs=305.5 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCC-ccccCC---CccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPYG-NEQKGK---SAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~~-~~~~g~---s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |......+.|.+.|+..++.|+.++..|+++.+|.. +.-.. ....|+ ..++.......++.+.+.|+..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888899999999999999999999999999973 22211 222222 4467778888899999999999998 Q ss_pred CCceEEEEeCCcchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) +.+||++.+..++..+.+++ .+..+...+. .++++..++.++.+.+..|+|++.+..|.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------------- 146 (555) T protein:vir:98 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD------------- 146 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC------------- Confidence 79999999876654433322 3334433343 577888889999999999999987654321 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +...+..+++.++++..++.+.++. ++++..+|...+.++....++ ++...... T Consensus 147 --------------------~~~rf~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l---~~~~~~~~ 200 (555) T protein:vir:98 147 --------------------AVVYHHSLTAGEYAIAADNQGRVNT---LYREFQITVAQMVREFGKDKC---STTVQSLF 200 (555) T ss_pred --------------------ceEEEEEeecceeEEeeCCCCCEEE---EEEEEeccHHHHHHhcCcccC---CHHHHHHH Confidence 1224556889999999887776633 778899999999887322222 11111111 Q ss_pred hccccccccccccccccceEEEEEEEEEee-ecC---CceeEEEE-EEEE----CCEEEecccCCCcCCCCCEEEEeeee Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-VDG---DGIAEPIV-CAWI----NDVIVRLQSNPYPDGKPPFLVVPFNS 299 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~~~---~g~~~~~~-~~~~----g~~~l~~~~~p~~~~~~Pf~~~~~~~ 299 (663) +. +.....|+|+.|.+..+ .+. ++....+. ++|. |.+++ .++.| ..+||++..|.+ T Consensus 201 ~~-----------~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~ 265 (555) T protein:vir:98 201 DR-----------GALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWAL 265 (555) T ss_pred hc-----------CCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeee Confidence 10 01113578888765432 211 11111222 2222 23555 35556 468999999999 Q ss_pred ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCC--Ccccc-ccCcc Q lcl|NC_021532. 300 IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTA--NDFWH-GSYNA 376 (663) Q Consensus 300 ~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~--~~~~~-~~~~~ 376 (663) .+|..||+|+++...+..+.+|++.+..+.++...++|++.++.+... +..+..||++..+.++. +.+.+ ..... T Consensus 266 ~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~ 343 (555) T protein:vir:98 266 VGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNL 343 (555) T ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc--ccceeccccccccccCCCCcceeccccccc Confidence 999999999999999999999999999999999999999999887642 23577899987665432 22222 12222 Q ss_pred ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 377 IPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 377 ~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) ..+.+.+.++.+.+.|....=.+-+.++...++..-||++|..+.+.....|..++.++...++.|++++.+.++.+.. T Consensus 344 d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g- 422 (555) T protein:vir:98 344 DLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN- 422 (555) T ss_pred chHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 3355667788888888776533323333334455679999999999999999999999998999999999999888752 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHH---HHHHhcc--CCCcchhHHHHHHHHHhhhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSF---LLQTLGP--NEDPKIRRDIMADIMDLMRMPEQ 531 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~---~~~~~~~--~~~p~~~~~~l~~~~~l~~~~e~ 531 (663) .++.-|+.+.+ .++.|..-+++....+...+.. +++.+++ +++|.+. +.-+..+. T Consensus 423 -----------~lP~~P~~l~~-~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vl--------d~id~d~~ 482 (555) T protein:vir:98 423 -----------ILPPPPQEMQG-VDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVL--------DKFDADRW 482 (555) T ss_pred -----------CCCCCchhhcC-ceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhh--------hcCCHHHH Confidence 22333444433 2344444345555555544333 3333322 2344321 11122222 Q ss_pred hhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 532 AKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDL 608 (663) Q Consensus 532 ~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~ 608 (663) .+.+....+-|...- ++..++.++++++++++.++.+..+.++..+....+..-. ..... T Consensus 483 ~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~-------~~~~~---------- 545 (555) T protein:vir:98 483 ADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSK-------QNALT---------- 545 (555) T ss_pred HHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCc-------chhHH---------- Confidence 222222222111100 0011111111111000000000000000000000000000 00000 Q ss_pred HHHHHHHHHH Q lcl|NC_021532. 609 KFVKEDNGYA 618 (663) Q Consensus 609 e~~~~~~~~~ 618 (663) .......... T Consensus 546 ~~~~~~~~~~ 555 (555) T protein:vir:98 546 DVTRAFSGYT 555 (555) T ss_pred HHHhhhccCC Confidence 0000000000 No 32 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=1.5e-33 Score=200.60 Aligned_cols=521 Identities=11% Similarity=0.073 Sum_probs=305.5 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCC-ccccCC---CccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPYG-NEQKGK---SAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~~-~~~~g~---s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |......+.|.+.|+..++.|+.++..|+++.+|.. +.-.. ....|+ ..++.......++.+.+.|+..+|+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 888888899999999999999999999999999973 22211 222222 4467778888899999999999998 Q ss_pred CCceEEEEeCCcchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) +.+||++.+..++..+.+++ .+..+...+. .++++..++.++.+.+..|+|++.+..|.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~------------- 146 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KSNTYRALHSMYEELGAFGTASSIVLPDFD------------- 146 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCceEEEEecCCC------------- Confidence 79999999876654433322 3334433343 577888889999999999999987654321 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +...+..+++.++++..++.+.++. ++++..+|...+.++....++ ++...... T Consensus 147 --------------------~~~rf~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l---~~~~~~~~ 200 (555) T protein:vir:10 147 --------------------AVVYHHSLTAGEYAIAADNQGRVNT---LYREFQITVAQMVREFGKDKC---STTVQSLF 200 (555) T ss_pred --------------------ceEEEEEeecceeEEeeCCCCCEEE---EEEEEeccHHHHHHhcCcccC---CHHHHHHH Confidence 1224556889999999887776633 778899999999887322222 11111111 Q ss_pred hccccccccccccccccceEEEEEEEEEee-ecC---CceeEEEE-EEEE----CCEEEecccCCCcCCCCCEEEEeeee Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-VDG---DGIAEPIV-CAWI----NDVIVRLQSNPYPDGKPPFLVVPFNS 299 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~~~---~g~~~~~~-~~~~----g~~~l~~~~~p~~~~~~Pf~~~~~~~ 299 (663) +. +.....|+|+.|.+..+ .+. ++....+. ++|. |.+++ .++.| ..+||++..|.+ T Consensus 201 ~~-----------~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl--~esgy--~e~P~i~~Rw~~ 265 (555) T protein:vir:10 201 DR-----------GALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTL--RESGY--RSFRALCPRWAL 265 (555) T ss_pred hc-----------CCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEeccCCcccc--ccCCc--ccCCceeeeeee Confidence 10 01113578888765432 211 11111222 2222 23555 35556 468999999999 Q ss_pred ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCC--Ccccc-ccCcc Q lcl|NC_021532. 300 IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTA--NDFWH-GSYNA 376 (663) Q Consensus 300 ~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~--~~~~~-~~~~~ 376 (663) .+|..||+|+++...+..+.+|++.+..+.++...++|++.++.+... +..+..||++..+.++. +.+.+ ..... T Consensus 266 ~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~--~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~ 343 (555) T protein:vir:10 266 VGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN--QDISTVPGGLSYVDAAAPNGGIRTAFEVNL 343 (555) T ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc--ccceeccccccccccCCCCcceeccccccc Confidence 999999999999999999999999999999999999999999887642 23577899987665432 22222 12222 Q ss_pred ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 377 IPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 377 ~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) ..+.+.+.++.+.+.|....=.+-+.++...++..-||++|..+.+.....|..++.++...++.|++++.+.++.+.. T Consensus 344 d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g- 422 (555) T protein:vir:10 344 DLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEAN- 422 (555) T ss_pred chHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 3355667788888888776533323333334455679999999999999999999999998999999999999888752 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHH---HHHHhcc--CCCcchhHHHHHHHHHhhhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSF---LLQTLGP--NEDPKIRRDIMADIMDLMRMPEQ 531 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~---~~~~~~~--~~~p~~~~~~l~~~~~l~~~~e~ 531 (663) .++.-|+.+.+ .++.|..-+++....+...+.. +++.+++ +++|.+. +.-+..+. T Consensus 423 -----------~lP~~P~~l~~-~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vl--------d~id~d~~ 482 (555) T protein:vir:10 423 -----------ILPPPPQEMQG-VDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVL--------DKFDADRW 482 (555) T ss_pred -----------CCCCCchhhcC-ceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhh--------hcCCHHHH Confidence 22333444433 2344444345555555544333 3333322 2344321 11122222 Q ss_pred hhhhhhhhcchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 532 AKRMREYEPKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDL 608 (663) Q Consensus 532 ~~~l~~~~~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~ 608 (663) .+.+....+-|...- ++..++.++++++++++.++.+..+.++..+....+..-. ..... T Consensus 483 ~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~-------~~~~~---------- 545 (555) T protein:vir:10 483 ADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSK-------QNALT---------- 545 (555) T ss_pred HHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCc-------chhHH---------- Confidence 222222222111100 0011111111111000000000000000000000000000 00000 Q ss_pred HHHHHHHHHH Q lcl|NC_021532. 609 KFVKEDNGYA 618 (663) Q Consensus 609 e~~~~~~~~~ 618 (663) .......... T Consensus 546 ~~~~~~~~~~ 555 (555) T protein:vir:10 546 DVTRAFSGYT 555 (555) T ss_pred HHHhhhccCC Confidence 0000000000 No 33 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=1.9e-33 Score=200.01 Aligned_cols=511 Identities=11% Similarity=0.034 Sum_probs=293.5 Q ss_pred CcHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKAE---LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |.++. .+..+++.|+..++.|+.++..|++..+|..-.... ...+....++.+.....++.+.+.|+..+|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC Confidence 66655 467888999999999999999999999997532211 11111235677777888999999999999999 Q ss_pred CceEEEEeCCcc-------hHHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWE-------DTDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~-------D~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+..++ +...++ ..+..+...+. .++++..++.++.+.+..|+|++.+.=+ . T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~ly~~e~--~------ 151 (536) T protein:vir:10 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNSYRVTLFEALKQLVVAGNVLLYLPEP--E------ 151 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEEeeC--C------ Confidence 999999765433 122222 23334433343 5778888999999999999999876311 0 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ..+...++.++..++|+..++.+.++. ++++..+|...|.+.... + .+.. T Consensus 152 ------------------------~~~~~~~~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~fg~-~--~~~~ 201 (536) T protein:vir:10 152 ------------------------GSNYNPMKLYRLSSYVVQRDAFGNVLQ---MVTRDQIAFGALPEDIRK-A--VEGQ 201 (536) T ss_pred ------------------------CCceeeEEEEEcCeEEEeeCCCCCeeE---EeeeeeccHHHHHHhhhh-h--hccc Confidence 011123567888999999887766655 788999999998765211 0 0000 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) .. .....+.|+||+|-++. ++++...+|+ .+++..+...+..|+...+||++..|.+.+| T Consensus 202 ---------~~-------~~~~~~~v~v~~~V~~~--~~~~~~~~~~--e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~g 261 (536) T protein:vir:10 202 ---------GG-------EKKADETIDVYTHIYLD--EASGEYLRYE--EVEGMEVQGSDGTYPKEACPYIPIRMVRLDG 261 (536) T ss_pred ---------cc-------ccCcccceEEEEEEEEe--cCCCcEEEEE--eecCccccccccccccccCCceeeeeeecCC Confidence 00 01123468888765543 2333334433 2344333223445566789999999999999 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCcccccc--CccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGS--YNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~--~~~~~~ 379 (663) ..||+|+++...+..+.+|++.+..+.+...++++.++++++.+. +.+.....+|.++.-.+ +.+..++ .....+ T Consensus 262 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~--~~v~~~~~~~~~~~~ 339 (536) T protein:vir:10 262 ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRP--EDISFLQLEKQADFT 339 (536) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCc--ccceeeeccccccch Confidence 999999999999999999999999999999999999999777654 44445667777664333 3343333 333445 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+.+.|....=+. +++. .++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.... T Consensus 340 ~~~~~i~~~~~rI~~af~~~--~l~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g---- 412 (536) T protein:vir:10 340 VAKAVSDAIEARLSFAFMLN--SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ---- 412 (536) T ss_pred HHHHHHHHHHHHHHHHHhhh--hccc-CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC---- Confidence 67788888888888776333 2221 2333469999999999999999999999999999999999998886532 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) .+.++..+. +.++++++.+ ..+....+.+..+++.+++ +.|.+... .-+.....+.+... T Consensus 413 -------~lP~~p~~~----v~~~~vs~l~~l~r~~~~~~l~~~~~~la~-~~P~~ld~-------~id~d~~~~~~a~~ 473 (536) T protein:vir:10 413 -------QIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCVTAWAA-LAPMRDDP-------DINLAMIKLRIANA 473 (536) T ss_pred -------CCCCCChhh----ccceEEecHHHHHHHHHHHHHHHHHHHHHh-hchhhhcc-------cCCHHHHHHHHHHH Confidence 222333322 2344433322 2333344445555555443 33332100 00111112222211 Q ss_pred hcc-hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPK-PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGY 617 (663) Q Consensus 539 ~~~-~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~ 617 (663) .+- |...-....+.++..+++ .++++.+++.+.+.+.+ ..++....+. +++..+..-+ +-+. T Consensus 474 ~Gv~p~~~irt~eev~~~r~q~---~~~~~~~~~a~~~~~~~----~~~~~~~~~~-----~~~~~~~~g~-----~~~~ 536 (536) T protein:vir:10 474 IGIDTSGILLTEEQKQQKMAQQ---SMQMGMDNGAAALAQGM----AAQATASPEA-----MAAAADSVGL-----QPGI 536 (536) T ss_pred cCCCchhhcCCHHHHHHHHHHH---HHHHHHHHHHHHHHHHH----HHHHhcCchh-----HHhhhhcccc-----CCCC Confidence 110 110000000000000000 00000000000000000 0000000000 0000000000 0000 No 34 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=1.8e-33 Score=200.16 Aligned_cols=525 Identities=14% Similarity=0.106 Sum_probs=282.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC---C-ccccCCCccccHHHHHHHHHHHHHHHHhhcC-C Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPY---G-NEQKGKSAIVSRDIKKQSEWQHATIVDPFVS-T 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~---~-~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~-~ 75 (663) || ..+++.|+..++.|+.++..|++..+|..-... + ....-...++.+.....++.+.+.|+..+|+ + T Consensus 1 m~-------~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~ 73 (555) T protein:vir:17 1 MK-------HSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVN 73 (555) T ss_pred Ch-------hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCC Confidence 33 236678999999999999999999999753221 1 1111124566777778899999999999998 6 Q ss_pred CceEEEEeCCcchH-------HHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWEDT-------DSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~D~-------~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+..++-. ..++ ..+..+...+. .++++..++.++.+.+..|+|++.+.=+ T Consensus 74 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~ly~~~~--------- 143 (555) T protein:vir:17 74 TSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIA-ESSDRVHLEMAMKHLIVTGNALLYQGKK--------- 143 (555) T ss_pred CcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCeEEEEecCC--------- Confidence 89999998644311 1111 13333433343 5678888999999999999999754210 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhc-CCcChhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDG-RYKNLDKLA 221 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g-~~~~~~~~~ 221 (663) .++.++..++++..++.+.++. ++++..+|...|.+.. .+.-.+... T Consensus 144 -----------------------------~~~~~pl~~y~v~~d~~G~vd~---v~rk~~~t~~ql~~~fg~~~l~~~~~ 191 (555) T protein:vir:17 144 -----------------------------NLKLYPLDRFVVSRDGEGNVME---IVTEEQIDRSLLPEEFQKVGGLEGAP 191 (555) T ss_pred -----------------------------ceeEEEcCeEEEeeCCCcCeeE---EEeeeeecHHHHHHHhhhccccchhh Confidence 1345677788888877665544 7889999999988762 111111111 Q ss_pred hccchhhhcccccc-ccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEe--cccCCCcCCCCCEEEEeee Q lcl|NC_021532. 222 KTSGEDFDYDSPDD-TEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVR--LQSNPYPDGKPPFLVVPFN 298 (663) Q Consensus 222 ~~~~~~~~~~~~~~-~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~--~~~~p~~~~~~Pf~~~~~~ 298 (663) .......+....-. ............+.+|.++.+. +|...+ ..-+++..+. ..++|| ..+||++..|. T Consensus 192 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~----~~~~~~--~~e~~~~~v~~~l~e~g~--~e~P~i~~Rw~ 263 (555) T protein:vir:17 192 DSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRK----DGQVKW--HQECDGKVIPGSNSSAPY--THNPWIPLRFN 263 (555) T ss_pred hhhhccccchhhhhhhhcccccCCCcceeEeeccccc----CCeeEE--EEecCceeccccccccCc--ccCCeeeeeee Confidence 10000000000000 0000111222346666655432 232222 2224454432 346666 57999999999 Q ss_pred eecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCc--c Q lcl|NC_021532. 299 SIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYN--A 376 (663) Q Consensus 299 ~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~--~ 376 (663) ..+|..||+|+++...+..+.+|++.+..+.++..+++|+++++++.+.........+++.+.. +..+.+.+++.. . T Consensus 264 ~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~~~~g~v~~-g~~~~v~~~~~~~~~ 342 (555) T protein:vir:17 264 IVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLALAANGAIIQ-GRPDDVSVVQANKAA 342 (555) T ss_pred ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcceeecCCCceeec-CCcccceeeeccccc Confidence 9999999999999999999999999999999999999999999777654333333233334432 223445555433 2 Q ss_pred ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 377 IPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 377 ~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) .-+.....++.+.+.|.+...+. + ..++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+..- T Consensus 343 ~~~~~~~~i~~~~~~I~~aFm~~----~-~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~ 417 (555) T protein:vir:17 343 DFRTVLEMIQKLEQRISDAFLML----Q-VRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRK 417 (555) T ss_pred hhhHHHHHHHHHHHHHHHHHhhc----C-CCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 23456677888888887765332 1 123334699999999999999999999999989999999999999887532 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM 535 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l 535 (663) +-++..+.. .+++.++.. +......+.+..+++.+++..+|.. +++.-+..+.++.+ T Consensus 418 -----------lP~~p~~~v----~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~-------~~d~id~d~~~~~~ 475 (555) T protein:vir:17 418 -----------LPQLPKDLV----QPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEI-------AMKYINPTEFIKRL 475 (555) T ss_pred -----------CCCCCHhhh----ccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchh-------HhhcCCHHHHHHHH Confidence 223332222 233443322 2233333444555555554432211 11222222223322 Q ss_pred hhhhcc-hhhH---HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPK-PDPV---QEKIRQLELENLMLENQMLVASINDKNARANEN-TIDAELKRSKAAVEKAKARKLSSEADMTDLKF 610 (663) Q Consensus 536 ~~~~~~-~~~~---~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~-~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~ 610 (663) ....+- |... +++..++.++++ .++++++...+.+++... ..++.++. ....+..+...-.++.. T Consensus 476 a~~~Gv~p~~ivrs~eev~~~rq~~~---~~~~q~~~~~qa~~~~~~~~~~~~~~~-------~~~~~~~a~~~~~a~~~ 545 (555) T protein:vir:17 476 AAAQGIDTLQLINSPETMKQLGDQQK---QDMVQASLINQAGQLAKTPMAEQAMQL-------IQQQQEGAQDAGAAESE 545 (555) T ss_pred HHHcCCChhhhcCCHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhhhhhhhHHhc-------cccchhhhhHHHHHHhh Confidence 222211 1100 001111100000 000000000010110000 00000000 00000000000000000 Q ss_pred HHHHHHHHHH Q lcl|NC_021532. 611 VKEDNGYAHL 620 (663) Q Consensus 611 ~~~~~~~~~~ 620 (663) -....+...+ T Consensus 546 ~~~~~~~~~~ 555 (555) T protein:vir:17 546 TSSAEAQAGA 555 (555) T ss_pred cCCcccccCC Confidence 0000000000 No 35 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=4.1e-33 Score=198.16 Aligned_cols=505 Identities=11% Similarity=0.057 Sum_probs=292.3 Q ss_pred CcHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKAE---LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |.++. .+..+++.|+..++.|+.++..|++..+|..-.... ...+....++.+.....++.+.+.|+..+|++ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC Confidence 66654 467888999999999999999999999997532211 11111235677777888999999999999999 Q ss_pred CceEEEEeCCcc-------hHHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWE-------DTDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~-------D~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+..++ +...++ ..+..+...+. .++++..++.++.+.+..|+|++.+.=+ . T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~ly~~e~--~------ 151 (536) T protein:vir:21 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNSYRVTLFEALKQLVVAGNVLLYLPEP--E------ 151 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCcEeEEEeeC--C------ Confidence 999999765433 122222 23334433343 5778888999999999999999876311 0 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ..+...++.++..++|+..++.+.++. ++++..+|...|.+... .+. ... T Consensus 152 ------------------------~~~~~~f~~~pl~~~~v~~d~~G~vd~---i~r~~~~t~~~l~~~fg-~~~--~~~ 201 (536) T protein:vir:21 152 ------------------------GSNYNPMKLYRLSSYVVQRDAFGNVLQ---MVTRDQIAFGALPEDIR-KAV--EGQ 201 (536) T ss_pred ------------------------CCceeeEEEEEcCeEEEeeCCCCCeeE---EeeeeeccHHHHHHhhh-hhh--ccc Confidence 011123567888899999887665555 78899999999877521 100 000 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) .. .....+.|+||.|-++. ++++...+|+- +++..+.-.+..|+...+||++..|.+.+| T Consensus 202 ---------~~-------~~~~~~~v~v~~~v~~~--~~~~~~~~~~e--~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~g 261 (536) T protein:vir:21 202 ---------GG-------EKKADETIDVYTHIYLD--EDSGEYLRYEE--VEGMEVQGSDGTYPKEACPYIPIRMVRLDG 261 (536) T ss_pred ---------cc-------ccccccceeEEEEEEEe--cCCCcEEEEec--cCCeeeccccCccccccCCeeeeeeeecCC Confidence 00 01123467787654432 22333333332 233333233445566789999999999999 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCcccccc--CccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGS--YNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~--~~~~~~ 379 (663) ..||+|+++...+..+.+|++.+..+.+...++++.++++++.+. +.+.....+|.++.-.+ +.+..++ .....+ T Consensus 262 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~--~~v~~~~~~~~~~~~ 339 (536) T protein:vir:21 262 ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRP--EDISFLQLEKQADFT 339 (536) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCc--ccceeeeccccccch Confidence 999999999999999999999999999999999999998777654 44445667777664333 3343333 333445 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+.+.|....=+. +++. .++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.... T Consensus 340 ~~~~~i~~~~~rI~~af~~~--~l~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g---- 412 (536) T protein:vir:21 340 VAKAVSDAIEARLSFAFMLN--SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ---- 412 (536) T ss_pred HHHHHHHHHHHHHHHHHhhh--hccc-CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCC---- Confidence 67788888888888776333 2221 2333469999999999999999999999999999999999998886532 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcch-----hHH-HHHHHHHhhhh-hhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKI-----RRD-IMADIMDLMRM-PEQ 531 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~-----~~~-~l~~~~~l~~~-~e~ 531 (663) .+.++..+. +.++++++.+ ..+....+.+..+++.+++ +.|.+ ... .+..+++..++ | T Consensus 413 -------~lP~~p~~~----v~~~~vs~l~~l~r~~~~~~l~~~~~~la~-~~Pe~ld~~id~d~~~~~~a~~~Gv~p-- 478 (536) T protein:vir:21 413 -------QIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCVTAWAA-LAPMRDDPDINLAMIKLRIANAIGIDT-- 478 (536) T ss_pred -------CCCCCChhh----ccceEEecHHHHHHHHHHHHHHHHHHHHHh-hchhhhcccCCHHHHHHHHHHHcCCCh-- Confidence 222333222 2344433322 2333344445555554443 33322 111 11112222222 1 Q ss_pred hhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 532 AKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFV 611 (663) Q Consensus 532 ~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~ 611 (663) ...++ .++..+ ++.+++++ +++.+++.+.+.+.+ ..++....+ .+++..+..-+ T Consensus 479 ~~~ir----t~eev~----~~r~q~~~------~~~~~~~a~~~~~~~----~~~~~~~~~-----~~~~~~~~~g~--- 532 (536) T protein:vir:21 479 SGILL----TEEQKQ----QKMAQQSM------QMGMDNGAAALAQGM----AAQATASPE-----AMAAAADSVGL--- 532 (536) T ss_pred hhhcC----CHHHHH----HHHHHHHH------HHHHHHHHHHHHHHH----HHHHhcChh-----hHHhhhhcccc--- Confidence 01111 111110 00000000 000000000000000 000000000 00000000000 Q ss_pred HHHHHH Q lcl|NC_021532. 612 KEDNGY 617 (663) Q Consensus 612 ~~~~~~ 617 (663) +-+. T Consensus 533 --~~~~ 536 (536) T protein:vir:21 533 --QPGI 536 (536) T ss_pred --CCCC Confidence 0000 No 36 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=9.3e-33 Score=196.20 Aligned_cols=510 Identities=11% Similarity=0.023 Sum_probs=287.8 Q ss_pred CCCcHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 1 MKINKAELL-SALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 1 ~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) -.+.++.+. ..+++.|+..++.|+.++..|++..+|..-.... ........++.......++.+.+.|+..+|++ T Consensus 3 ~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 82 (535) T protein:vir:94 3 SSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALFPM 82 (535) T ss_pred chhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhcCC Confidence 122244454 3477779999999999999999999997532111 11122244677777888999999999999999 Q ss_pred CceEEEEeCCcc-------hHHHHH---HH---HHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWE-------DTDSAE---QN---ELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~-------D~~~Ae---~~---~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+.... +.+.++ .+ +..+...+ ..++++..++.++.+.+..|+|++.+..+.+. T Consensus 83 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~------ 155 (535) T protein:vir:94 83 QTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI-ESNSYRVTLFETLKQLVVAGNALLYIPEPEGT------ 155 (535) T ss_pred CCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc------ Confidence 999999775321 222221 22 22232223 35788888999999999999999988654211 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ...++.++..++++..++.+.++. ++++..++.+.|.... .. T Consensus 156 ---------------------------~~~f~~~pl~~y~v~~d~~G~vd~---i~r~~~~~~~~l~~~~--------~~ 197 (535) T protein:vir:94 156 ---------------------------YNPMKLYRLSSYVVQRDAFGTVLQ---IVTLDKTAYAALPEDV--------RN 197 (535) T ss_pred ---------------------------ccceEEEEcCeEEEeeCCCCCeEE---EEeeeeccHHHhhHHH--------HH Confidence 113456778889998877666654 6778888888874431 10 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ...... .......|.||+|-++. ++++...+ ...+++..+...++.+++..+||++..|...+| T Consensus 198 ----~~~~~~--------~~~~~~~v~v~~~v~~~--~~~~~~~~--~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~g 261 (535) T protein:vir:94 198 ----SMDSSQ--------EHKGDEMIDVYTHIYLD--EESGEYLK--YEEIDGVEVEGTDASYPVDACPYIPVRMVRIDG 261 (535) T ss_pred ----HHHhcc--------ccCCCceeEEEEEEEee--CCCCcEEE--EEEecCeeeccccccCccccCCceeeeeeecCC Confidence 110000 01223457888765432 22333333 234556555434455566789999999999999 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-CcchhhhccCCcceEeCCCCCccccccCc--cccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQTNRKKFLAGANFEFNGTANDFWHGSYN--AIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~d~~~~~p~~vi~~~~~~~~~~~~~~~--~~~~ 379 (663) ..||+|+++...+..+.+|++.+..+.+...+++|.++++++.+ ++.+.....+|.++... .+.+.+++.. ...+ T Consensus 262 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~--~~~v~~~~~~~~~~~~ 339 (535) T protein:vir:94 262 ESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGR--PEDISFLQLEKAADFS 339 (535) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCceeecCC--cccceeeecccccchh Confidence 99999999999999999999999999999999999999876655 44444455566655432 2334444333 3345 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) ....+++.+.+.|....=+ + +++. .++..-||++|..+.+.....|..++.+|...++.|++++.+.++.+.. T Consensus 340 ~~~~~i~~~~~rI~~af~~-~-~~~~-~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g---- 412 (535) T protein:vir:94 340 VARAVSEQIEGRLSYAFML-N-SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATN---- 412 (535) T ss_pred HHHHHHHHHHHHHHHHHhH-h-hhcc-CCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCC---- Confidence 5677788888888765511 1 1221 2333469999999999999999999999999999999999999887642 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) .+-++..+. ++++++++.+ ..+....+.+..+++.+++ +.|.+.-. .-+..+.++.+... T Consensus 413 -------~lP~~p~~~----v~~~~vs~la~l~r~~~~~~l~~~~~~laq-~~P~~ld~-------~id~d~~~~~~a~~ 473 (535) T protein:vir:94 413 -------QIPELPKEA----VEPTISTGMEALGRGQDLDKLERCIAAWSA-LAPMQGDP-------DINIATIKLRIANA 473 (535) T ss_pred -------CCCCCChhh----ccceEeehHHHHHHHHHHHHHHHHHHHHHh-hChHHhhh-------cCCHHHHHHHHHHH Confidence 122232222 2344433222 2233333444445554443 44432110 11222223333322 Q ss_pred hcch-hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPKP-DPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFV 611 (663) Q Consensus 539 ~~~~-~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~ 611 (663) .+-+ ...-..+.+.++..++++++++++....+.+++... + +.. ....++...+++.+.-- T Consensus 474 ~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~-----~--~~~-----~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 474 IGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGT-----M--ATA-----SPENMKAAAAQAGMAPN 535 (535) T ss_pred hCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-----c--ccc-----ChHHHHHHHHHhccCCC Confidence 2221 100000000000000000000000000000000000 0 000 00000000011100000 No 37 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=7.4e-33 Score=196.75 Aligned_cols=505 Identities=11% Similarity=0.027 Sum_probs=293.3 Q ss_pred CcHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC-ccccCC---CccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 3 INKAEL----LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG-NEQKGK---SAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 3 ~~~~~~----~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~-~~~~g~---s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |.+.++ ...+++.|+..++.|+.++..|++..+|..-.... ....|. ..++.......++.+.+.|+..+|+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 80 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcC Confidence 665553 57788889999999999999999999987532211 111222 3466677777899999999999998 Q ss_pred -CCceEEEEeCCcch-------HHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 75 -TADIIKCTPITWED-------TDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 75 -~~~~~~~~p~~~~D-------~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) +.+||++.+..++- .+.++ ..+..+...+ ..++++..++.++.+.+..|+|++.+.++..... T Consensus 81 p~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~-- 157 (532) T protein:vir:99 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-ESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG-- 157 (532) T ss_pred CCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCcEeEEecccccccC-- Confidence 58999998854321 11121 2233333334 3578888899999999999999998876532110 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) ....++.++..++++..++.+.+++ ++++..++.+.|-+. + T Consensus 158 ----------------------------~~~~f~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~--------~ 198 (532) T protein:vir:99 158 ----------------------------QSNAPKLYKLHNFVVERDAYDNVLQ---IVTEDKIARAALPED--------V 198 (532) T ss_pred ----------------------------cccceEEEEcCeEEEeeCCCCCeee---EeeeeeecHHhcChH--------H Confidence 1123566788899999887766655 667777776665221 1 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI 300 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~ 300 (663) .. ..... .....+..+|.||+|.++.+ ++. .+.+..++.+..+-..++-|++..+||++..|... T Consensus 199 ~~----~~~~~-------~~~~~p~~~v~v~~~v~~~~---~~~-~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~ 263 (532) T protein:vir:99 199 RK----SLEDA-------QGDQNPSEEVTIYTHVYRDP---EAM-VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKM 263 (532) T ss_pred HH----Hhhcc-------ccccCCCcceEEEEEEEecC---CCC-eeEEEEeecCceecccccccccccCCceeeeeeec Confidence 10 00000 00112334688888776632 221 12223344554333335555557799999999999 Q ss_pred cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCccccccCc--cc Q lcl|NC_021532. 301 PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGSYN--AI 377 (663) Q Consensus 301 ~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~~~--~~ 377 (663) +|..||+|++....+..+.+|++.+..+.+...+++|.++++++.+. +.+.....+|.++.-. .+.+.+++.. .. T Consensus 264 ~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~--~~~i~~~~~~~~~~ 341 (532) T protein:vir:99 264 PNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR--KQDVEVFQLEKYND 341 (532) T ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCCcceecCC--cccceeeecccccc Confidence 99999999999999999999999999999999999999999776654 4444455666655322 2344444432 33 Q ss_pred cHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 378 PSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 378 ~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) .+.+...++.+.+.|....= .+. +. ..++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+.. T Consensus 342 ~~~~~~~i~~~~~rI~~af~-~~~-~~-~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g-- 416 (532) T protein:vir:99 342 FQVAKATADDIEKRLSYAFM-LNS-AV-QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATS-- 416 (532) T ss_pred hhHHHHHHHHHHHHHHHHHh-hhh-cc-cCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC-- Confidence 45567778888888877552 121 11 12333459999999999999999999999999999999999999888732 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeec-ccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISIS-TAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMR 536 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~-~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~ 536 (663) .+ +--|+++.+. ++ .++ .++++.++.+.+..+++.+++..++.. +..+..+.++.+. T Consensus 417 ---------~l-P~~p~~~~~~-~i--v~~is~Laraq~~~~l~~~~~~laq~~p~~~---------d~id~d~~~~~~a 474 (532) T protein:vir:99 417 ---------KI-PNLPKEAVEP-AI--ATGLEALGRGHDLNKLNVFIDYMIKLAGLQD---------DDINLLDVKMRLA 474 (532) T ss_pred ---------CC-CCCChhhccc-ce--eecchHHHHHHHHHHHHHHHHHHHhhcchhh---------hhCCHHHHHHHHH Confidence 22 2223333221 22 222 345666666677777777766544321 1122222222222 Q ss_pred hhhcc-hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 537 EYEPK-PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLK 609 (663) Q Consensus 537 ~~~~~-~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e 609 (663) ...+- +...-....+.++..++.++++.++ ++..++.+...++.+. ..+++-.+..+ T Consensus 475 ~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~---------------~a~~~~~~~~~~~~~~-~~~~~~~~~~~ 532 (532) T protein:vir:99 475 NSLGMDTTGLILTQQDKQAKMAEASTAAGMV---------------TAGQQMGAAGGQAAAA-MMQQQAGMPTQ 532 (532) T ss_pred HHhCCChhhccCCHHHHHHHHHHHHHHHHHH---------------HHHHHHHHHHHHhcch-hHHhhcCCCCC Confidence 22221 1100000000000000000000000 0000000000000000 00000000000 No 38 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=1.6e-32 Score=194.89 Aligned_cols=496 Identities=12% Similarity=0.043 Sum_probs=288.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----ccccCCCccccHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTA 76 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~ 76 (663) |....-.-...+++.|+..++.|+.++..|++..+|..-.... ........++.+.....++.+.+.|+..+|++. T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~~ 80 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQS 80 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCCCC Confidence 6654444478899999999999999999999999997532211 111112346677777889999999999999999 Q ss_pred ceEEEEeCCc-------chHHHH------HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccc Q lcl|NC_021532. 77 DIIKCTPITW-------EDTDSA------EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMG 143 (663) Q Consensus 77 ~~~~~~p~~~-------~D~~~A------e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~ 143 (663) +||++.+... ++...+ +..+..|...+ ..++++..++.++.+.+..|+|++.+.=+ .. T Consensus 81 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~l~~~~~--~~------ 151 (522) T protein:vir:94 81 PWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYM-ETNSFRVPLFEALKQLIVSGNCLLYIPEP--EQ------ 151 (522) T ss_pred cccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCcEeEeeecc--CC------ Confidence 9999987532 222222 22333333334 35678888999999999999999765311 00 Q ss_pred cccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc Q lcl|NC_021532. 144 EAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT 223 (663) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~ 223 (663) .....++.+|..++++..++.+.+++ ++++..++.+.|-.. + . T Consensus 152 ------------------------~~~~~~~~~pl~~y~v~~d~~G~vd~---i~r~~~~~~~~l~~~-----~---~-- 194 (522) T protein:vir:94 152 ------------------------GTYSPMRMYRLVSYVVQRDAFGNILQ---IVTIDKVAFSALPED-----V---K-- 194 (522) T ss_pred ------------------------CceeeEEEEEcceEEEeeCCCcCeEE---EeeeeeccHHhcchH-----H---H-- Confidence 01123566788889998877665554 677778887665321 0 0 Q ss_pred cchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 224 SGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 224 ~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) ..... . .....++|.||++.++.+ ++. .+|+ -+.+..+...++-|++..+||++..|.+.+|. T Consensus 195 --~~~~~--~-------~~~p~~~v~v~~~v~~~~---~~~-~~~~--~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge 257 (522) T protein:vir:94 195 --SQLNA--D-------DYEPDTELEVYTHIYRQD---DEY-LRYE--EVEGIEVTGTDGSYPLTACPYIPVRMVRLDGE 257 (522) T ss_pred --HHHhc--c-------cCCccceEEEEEEEEeeC---Cce-eEEe--eccCceecccCCCCccccCCceeeeeeecCCC Confidence 00000 0 011235688988877632 221 2222 23343443344556667899999999999999 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCccccccC--ccccHH Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGSY--NAIPSS 380 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~~--~~~~~~ 380 (663) .||+|+++.+.+..+.+|++.+..+.+...+++|+++++++.+. +.+.....+|.++. +..+.+.+++. ++..+. T Consensus 258 ~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v~--g~~~~v~~~~~~~~~~~~~ 335 (522) T protein:vir:94 258 DYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVA--GRVEDINFLQLTKGQDFTI 335 (522) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCceeec--CCcccceeeecccccchhH Confidence 99999999999999999999999999999999999999776654 44444555555543 22334444442 233455 Q ss_pred HHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceE Q lcl|NC_021532. 381 AFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEV 460 (663) Q Consensus 381 ~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ 460 (663) +...++.+.+.|....-+. +++.. ++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+..- T Consensus 336 ~~~~i~~~~~rI~~af~~~--~~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~---- 408 (522) T protein:vir:94 336 AKSVADAIEQRLGWAFLLN--SAVQR-NAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGM---- 408 (522) T ss_pred HHHHHHHHHHHHHHHHhhh--hhccC-CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC---- Confidence 6788899999888877544 23322 2334699999999999999999999999999999999999998877532 Q ss_pred EEEecCeeeccchhhcCCceEEEEeecccchhHHHHH---HHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhh Q lcl|NC_021532. 461 IRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQ---ELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMRE 537 (663) Q Consensus 461 iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q---~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~ 537 (663) +-++..+ .+.+++. +++....+.+ .+..+++.++. +.|..... .-+..+.++.+.. T Consensus 409 -------lP~~p~~----~v~v~~~--s~La~~qr~~~~~~l~~~~~~ia~-l~P~~~~~-------~id~d~~~~~~a~ 467 (522) T protein:vir:94 409 -------IPDLPKE----AVEPTVS--TGLEALGRGQDLEKLTQAVNMMTG-LQPLSQDP-------DINLPTLKLRLLN 467 (522) T ss_pred -------CCCCCcc----cEEeeEe--cHHHHHHHHHHHHHHHHHHHHHHh-ccchhhhh-------cCCHHHHHHHHHH Confidence 2222222 1344444 3444444443 44444444443 44432110 0112222222222 Q ss_pred hhcc-hhhHHHHhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_021532. 538 YEPK-PDPVQEKIRQLELENLM-LENQMLVASINDKNARANENTIDAELKRSKAAVEKA-KARKLSSEA 603 (663) Q Consensus 538 ~~~~-~~~~~~q~~q~~~~~~q-~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~-~~q~~~~~~ 603 (663) ..+- +...-....+.++..++ +.++.+++. +..+.+...+... ++-+...++ T Consensus 468 ~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~--------------~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 468 ALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQG--------------ASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred HcCCChhhccCCHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHhhhhhhcccchhhhcC Confidence 2221 11000000000000000 000000000 0000000000000 000000000 No 39 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=5.7e-33 Score=197.34 Aligned_cols=499 Identities=13% Similarity=0.046 Sum_probs=285.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCcCCccccCC---CccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN---GEPYGNEQKGK---SAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~---~~~~~~~~~g~---s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) || +++.|+..++.|++++..|++..+|.. +........++ ..++.......++.+.+.|+..+|+ T Consensus 1 m~---------~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 71 (522) T protein:vir:10 1 MK---------ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLP 71 (522) T ss_pred Cc---------hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 44 556788888889999999999998875 22221111221 2356677777889999999999998 Q ss_pred -CCceEEEEeCCcchHH------------HHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 75 -TADIIKCTPITWEDTD------------SAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 75 -~~~~~~~~p~~~~D~~------------~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) +.+||++.+...+..+ .-+..+..+...+ ..++++..++.++.+.+..|+|++.+.=+ T Consensus 72 p~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~~~-------- 142 (522) T protein:vir:10 72 PQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYI-AASNDRVAVHQALKHLIVGGNALIFMGKD-------- 142 (522) T ss_pred CCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCceeEEEcCC-------- Confidence 5899999875432111 1122333443334 35788888999999999999999764211 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) .++.++..++++..++.+.++. ++++.++|...+....-... +. T Consensus 143 ------------------------------~~~~~pl~~y~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~---~~ 186 (522) T protein:vir:10 143 ------------------------------GLKTFPLTRYVINRDGDGNVLE---IVTKELISRKVLDIELPEPK---PN 186 (522) T ss_pred ------------------------------CceEEEcceEEEeeCCCCCeeE---EEeeeeccHHHHHHhcchhc---cc Confidence 1245677889999887766654 78899999999987522111 11 Q ss_pred hccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeec Q lcl|NC_021532. 222 KTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIP 301 (663) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~ 301 (663) ... . ......+.|.||+|.+... +.|...++ ....+.++...++.+++..+||++..|...+ T Consensus 187 ~~~----~----------~~~~~~~~v~v~~~v~p~~--~~~~~~~~--~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ 248 (522) T protein:vir:10 187 TGI----D----------ESSTTNDDVTIYTYVKLDK--SSGRWVWH--QEAFDKIIPDSRSTAPKNASPWLPLRFNTVD 248 (522) T ss_pred hhh----h----------cccCCCCceEEEEEEEeec--cCCceEEE--EccCCccccccccccccccCCceeeeeeecC Confidence 000 0 0011234588888766532 22222222 2244555544455556678999999999999 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc-hhhhccCCcceEeCCCCCccccccCc--ccc Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT-NRKKFLAGANFEFNGTANDFWHGSYN--AIP 378 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~-d~~~~~p~~vi~~~~~~~~~~~~~~~--~~~ 378 (663) |..||+|+++...+..+.+|++.+..+.++..+++|.++++++.+... +.....+|.++ ... .+.+.+++.. ... T Consensus 249 ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v-~g~-~~~v~~~~~~~~~d~ 326 (522) T protein:vir:10 249 GEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIV-QGR-PEDVAVIQVGKTADF 326 (522) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCccee-cCC-Cccceeecccccccc Confidence 999999999999999999999999999999999999999976665433 33233333332 222 2334444432 334 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) +.+..+++.+.+.|.+.. +++...++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+.. T Consensus 327 ~~~~~~i~~~~~ri~~aF-----l~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g--- 398 (522) T protein:vir:10 327 STAANMATAIEKRLLEAF-----LVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSN--- 398 (522) T ss_pred hHHHHHHHHHHHHHHHHH-----hhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcC--- Confidence 556777888888887653 344444554569999999999999999999999999999999999999887632 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) .++--|+++... . .|+.-.++++.++.+.+..+++.++...+|.... +.-+..+.++.+... T Consensus 399 ---------~lP~~p~~~~~~-~-~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~-------~~id~d~~~~~~a~~ 460 (522) T protein:vir:10 399 ---------QIPKLPKDIVRP-T-IVAGVNALGRGQDRESLTAFVGTIAQTLGPEALM-------QYLNPLEAIKRLAAA 460 (522) T ss_pred ---------CCCCCCcccccc-c-cccchhHHHHHHHHHHHHHHHHHHHHhhCchhhh-------hcCCHHHHHHHHHHH Confidence 122222232211 1 1222235666666677777777776554432211 111222222222222 Q ss_pred hcch-hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPKP-DPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVK 612 (663) Q Consensus 539 ~~~~-~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~ 612 (663) .+-+ ...-..+.+.+..+++.+.+++++....+.++..... + ....+..+...+ ++...++ T Consensus 461 ~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~----~------~~~~~~~~~~~~---~~~~~~~ 522 (522) T protein:vir:10 461 QGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSP----L------MDPTKNPQLMDE---EQPPMEE 522 (522) T ss_pred hCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc----c------cCccccHHHHHH---hCCCCCC Confidence 2211 0000000000000000000000000000000000000 0 000000000000 0000000 No 40 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=1.3e-31 Score=189.98 Aligned_cols=519 Identities=10% Similarity=0.021 Sum_probs=288.4 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc---cc-CCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 1 MKINKAEL-LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE---QK-GKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 1 ~~~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~---~~-g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) -+++++.+ ...+++.|+..++.|+.++..|++..+|..-...... .+ ....++.......++.+.+.|+..+|++ T Consensus 2 ~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 81 (543) T protein:vir:88 2 AETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFPL 81 (543) T ss_pred cccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Confidence 22333444 3556678999999999999999999999864322111 11 1124667777788999999999999999 Q ss_pred CceEEEEeCCcc-------hHHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWE-------DTDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~-------D~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+.... +.+.++ ..+..|...+ ..++++..++.++.+.+..|+|++.+.=+. .. T Consensus 82 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~snf~~~~~~~~~~L~~~G~a~ly~~~~~--~~---- 154 (543) T protein:vir:88 82 QSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYM-EANSYRVTLFELIRQLALAGTALIYLPPPD--AS---- 154 (543) T ss_pred CcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCceeeeeccCc--cc---- Confidence 999999875321 112111 2223333334 357788889999999999999998653111 00 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ..+...+..++...+++..++.+.++. ++++..++...|.... .. T Consensus 155 ------------------------~~~~~~~~~~pl~~y~v~~d~~G~v~~---i~r~~~~~~~~l~~~~--------~~ 199 (543) T protein:vir:88 155 ------------------------SNSYNPMKLYTLHNHVVQRDAFGNVLQ---IVTLDKVAYAALPEDV--------RN 199 (543) T ss_pred ------------------------cceecceEEeEcceEEEeeCCCCCeee---eeeeeeccHHHHhHHh--------hH Confidence 000112345677788888776655433 7778889988875431 10 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ...... ...+.++|.||++-++. ++.+...++. -+.+..+...++.|+...+||++..|...++ T Consensus 200 ----~v~~~~--------~~~p~~~~~v~~~V~pr--~~~~~~~~~~--~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~g 263 (543) T protein:vir:88 200 ----SLSGGQ--------EYKPEQELEVYTHIYID--DESGDFLSYQ--EIEGVEVDGSDGQYPQDALPWIAVRWTKRDG 263 (543) T ss_pred ----HHHHHh--------hcCCccceEEEEEEEee--cCCCcccccc--cccCeeeecCCCccccccCCceeeeeeecCC Confidence 000000 01122457787754332 2222222222 2345555555667777889999999999999 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC-cchhhhccCCcceEeCCCCCccccccC--ccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD-QTNRKKFLAGANFEFNGTANDFWHGSY--NAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~-~~d~~~~~p~~vi~~~~~~~~~~~~~~--~~~~~ 379 (663) ..||+|+++...+..+.+|++.+..+.++..+++|+++++++.+. +.+.....+|.++. . ..+.+.+++. .+... T Consensus 264 e~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g~~v~-g-~~~~v~~~~~~~~~~~~ 341 (543) T protein:vir:88 264 EHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVA-G-RKADIEFLQLEKTADFT 341 (543) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCceeec-C-CCCcceeeecccccchh Confidence 999999999999999999999999999999999999999777654 33333333333332 2 2234444433 33445 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .....++.+.+.|....=+. . ++.. ++..-||++|+.+.+.....|..++.+|...++.|++++.+.++.+..- T Consensus 342 ~~~~~i~~~~~rI~~af~~~-~-~~~~-~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~--- 415 (543) T protein:vir:88 342 VAKSVADAIEARLSYVFMLN-S-AVQR-SGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQ--- 415 (543) T ss_pred HHHHHHHHHHHHHHHHHhhh-h-hccC-CCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCC--- Confidence 67788899999888766333 2 2222 2334699999999999999999999999999999999999998887532 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) +-++..+ .+.+++.++. ++.+....+.+..+++.++...+|.+. +.-+..+.++.+... T Consensus 416 --------lP~~p~~----~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vl--------d~id~d~~~~~~a~~ 475 (543) T protein:vir:88 416 --------IPNLPQE----AVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGD--------PDLNVNNIKLRLANA 475 (543) T ss_pred --------CCCCchh----ceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhh--------ccCCHHHHHHHHHHH Confidence 2233322 2344444332 344555556666666666544433321 111222222222222 Q ss_pred hcc-hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPK-PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGY 617 (663) Q Consensus 539 ~~~-~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~ 617 (663) .+- +...-....+.++.+++++ +++++..+ +.++.....+.. ....+ +. +++...+. .+..=..- T Consensus 476 ~Gv~~~~i~r~~~e~~~~~~q~~---~q~~~~~~-~~~~~~~~~~~~---~~~~~-~~-~~~~~~~~-----~~~~p~~~ 541 (543) T protein:vir:88 476 IGIDTAGLLLTEAEKAQAQSQEM---LKQGGLNA-AAGIGSGVAAQA---TASPE-AM-ESAMDTAG-----VQPGPIAT 541 (543) T ss_pred hCCChhhhcCCHHHHHHHHHHHH---HHHHHHHH-HHHHhhchhhhh---ccChH-HH-HHHhhhcC-----CCCCCCCC Confidence 221 1100000000000000000 00000000 000000000000 00000 00 00000000 00000000 Q ss_pred HH Q lcl|NC_021532. 618 AH 619 (663) Q Consensus 618 ~~ 619 (663) +- T Consensus 542 ~~ 543 (543) T protein:vir:88 542 QV 543 (543) T ss_pred CC Confidence 00 No 41 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=6.2e-32 Score=191.69 Aligned_cols=507 Identities=15% Similarity=0.090 Sum_probs=279.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC---C-ccccCCCccccHHHHHHHHHHHHHHHHhhcC-C Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPY---G-NEQKGKSAIVSRDIKKQSEWQHATIVDPFVS-T 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~---~-~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~-~ 75 (663) || ...++.|+..++.|+.++..|++..+|..-... + ........++.+.....++.+.+.|+..+|+ + T Consensus 1 mk-------~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 73 (542) T protein:vir:78 1 MK-------GLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQ 73 (542) T ss_pred Ch-------hHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 33 234567888888899999999999999753211 1 1111123456677778899999999999998 6 Q ss_pred CceEEEEeCCcc--------hHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 76 ADIIKCTPITWE--------DTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 76 ~~~~~~~p~~~~--------D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) .+||++.+...+ ++..+++ .+..+...+. .++++..++.++.+.+..|+|++.+.-+ T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~~~~-------- 144 (542) T protein:vir:78 74 TSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIA-ESSDRVQLTAAMKHLIVTGNVLVFAGKK-------- 144 (542) T ss_pred CccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhhCeEEEEecCC-------- Confidence 899999875332 1111111 2334433343 5778888999999999999999764210 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) .++.++..++++..++.+.++. ++++..+|..+|.+..-...+ . T Consensus 145 ------------------------------~~~~~pl~~y~v~~d~~G~vd~---v~r~~~~t~~ql~~~fg~~~l---~ 188 (542) T protein:vir:78 145 ------------------------------TLKVYPLDRYVIERDGDGNVIE---IITRELVDRSLLPAEFQKQSL---L 188 (542) T ss_pred ------------------------------CceEEecceeEEeeCCCCCeEE---EeeeeecCHHHHHHhhccccC---c Confidence 1345777889998887766655 788999999999876321111 1 Q ss_pred hccchhhhccccccccccccccccceEEEEEEEEEee-e-------cCCceeEEEEEEEECCEEE--ecccCCCcCCCCC Q lcl|NC_021532. 222 KTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-V-------DGDGIAEPIVCAWINDVIV--RLQSNPYPDGKPP 291 (663) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~-------~~~g~~~~~~~~~~g~~~l--~~~~~p~~~~~~P 291 (663) ...... ..+....++.++++++..+ . ...+...++ .-+++..+ ...+++| ..+| T Consensus 189 ~~~~~~------------~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~--~e~~g~~v~~~~~e~g~--~~~P 252 (542) T protein:vir:78 189 EGKDSN------------AVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWH--QECDGKEIKGSRSSSPL--KHSP 252 (542) T ss_pred hHHHhh------------ccccCCCeEEEEEEeecccCCccccccccCCCeEEEE--EEecccccccccccccc--ccCC Confidence 100000 0011123344544433221 1 112222222 22334333 2344444 6799 Q ss_pred EEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-CcchhhhccCCcceEeCCCCCccc Q lcl|NC_021532. 292 FLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQTNRKKFLAGANFEFNGTANDFW 370 (663) Q Consensus 292 f~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~d~~~~~p~~vi~~~~~~~~~~ 370 (663) |++..|...++..||+|+++...+..+.+|++.+..+.++..+++|.++++++.+ ++.+.....+|.++.- .++.+. T Consensus 253 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~~g~iv~g--~~~~v~ 330 (542) T protein:vir:78 253 WLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLARAGTGAIIQG--RAEDVS 330 (542) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCceeecC--Ccccee Confidence 9999999999999999999999999999999999999999999999999977655 4444445566655432 234444 Q ss_pred cccCc--cccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 371 HGSYN--AIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 371 ~~~~~--~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) +++.. .-.......++.+.+.|.+..-+. ...++..-||++|+.+.+.....|..++.+|...++.|++++.+ T Consensus 331 ~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~-----~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~ 405 (542) T protein:vir:78 331 VVQANKGADFRTVQEMIRDLSQRISDAFLIL-----NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKL 405 (542) T ss_pred eeecccccchhHHHHHHHHHHHHHHHHhccc-----ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 44432 234556788888888888765332 22233345999999999999999999999998889999999999 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeeccc-chhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTA-EDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMR 527 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~ 527 (663) .++.+..- +-++ |+++ +++.+.++.+ ..+....+.+..+++.+++.++|.... +.-+ T Consensus 406 ~il~r~g~-----------lP~~-p~~l---v~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~-------~~id 463 (542) T protein:vir:78 406 HLMQRSKQ-----------LPSL-PKGL---VMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQ-------QFID 463 (542) T ss_pred HHHHhcCC-----------CCCC-chhc---eeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHH-------hcCC Confidence 98887532 1122 2222 3344443322 223333344455555555443332211 1112 Q ss_pred hhhhhhhhhhhhcchh-hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 528 MPEQAKRMREYEPKPD-PVQEKIRQLELENLMLENQMLVASINDKNARANEN-TIDAELKRSKAAVEKAKARKLSSEADM 605 (663) Q Consensus 528 ~~e~~~~l~~~~~~~~-~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~-~~~~~~~~~~~~~e~~~~q~~~~~~~~ 605 (663) ..+.++.+....+-+. ..-....+.++++++++.+++++....+.+.+... .......+..+.... +-...+. T Consensus 464 ~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~---~~~~~~~-- 538 (542) T protein:vir:78 464 PTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQQINAPGQE---APAGPQT-- 538 (542) T ss_pred HHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhhcCCCCcC---CCCCCcc-- Confidence 2222222222222110 00000000000000000000111110000000000 000000000000000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 606 TDLKFVKEDNGYAHLEQV 623 (663) Q Consensus 606 ~~~e~~~~~~~~~~~~~~ 623 (663) -++. T Consensus 539 --------------~~~~ 542 (542) T protein:vir:78 539 --------------GEDL 542 (542) T ss_pred --------------cccC Confidence 0000 No 42 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=2.2e-30 Score=183.16 Aligned_cols=488 Identities=10% Similarity=0.028 Sum_probs=276.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc--ccCCCccccHHHHHHHHHHHHHHHHhhcC-CCc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE--QKGKSAIVSRDIKKQSEWQHATIVDPFVS-TAD 77 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~--~~g~s~~~~~~i~~~v~~~~~~l~~~~~~-~~~ 77 (663) |++...--.+.+++.|+..++.|+.++..|+++.+|..-...+.. .++...++...-...++.+.+.|+..+|+ +.+ T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 84 (516) T protein:vir:96 5 IDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRS 84 (516) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcccchHHHHHHHHHHHHHhhhcCCCCc Confidence 555555556889999999999999999999999999864332211 12223466777777899999999999998 579 Q ss_pred eEEEEeCCcch-------HHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccc Q lcl|NC_021532. 78 IIKCTPITWED-------TDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGE 144 (663) Q Consensus 78 ~~~~~p~~~~D-------~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~ 144 (663) ||++.+..... .+.++ ..+..+...+. .++++..++.++.+.+..|+|++.+. .. T Consensus 85 WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~~L~~~G~a~l~~d--~~--------- 152 (516) T protein:vir:96 85 FFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELE-QRQFRPAVVEAFKHLIVAGSCMLYKP--SK--------- 152 (516) T ss_pred ccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHCeEeEEec--CC--------- Confidence 99998753221 12222 23344433343 47888889999999999999997652 10 Q ss_pred ccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhcc Q lcl|NC_021532. 145 AVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 224 (663) ..++.+|..++++..++.+.+++ ++++.+++..+|.+.. .++ .... T Consensus 153 --------------------------~~~~~~pl~~y~v~~d~~G~v~~---i~rr~~~~~~~l~~~~--~~~---~~~~ 198 (516) T protein:vir:96 153 --------------------------GAISAIPMHHYVVNRDTNGDLLD---IILLQEKALRTFDPAT--RAV---VEVG 198 (516) T ss_pred --------------------------CCEEEEEcCeEEEeeCCCCCeee---ehhhhHhhHHHHHHhh--hhh---hhhh Confidence 01345677888888887766655 5677778877765431 110 0000 Q ss_pred chhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcc Q lcl|NC_021532. 225 GEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKL 304 (663) Q Consensus 225 ~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~ 304 (663) .... .......|.||.|=.+ ..++...++.. ..|.+++. ++-|+...+||++..|...+|.. T Consensus 199 ~~~~------------~~~~~~~v~v~~~v~~---~~~~~~~~~~~-~d~~~~~~--es~~~~~e~P~~~~Rw~~~~ge~ 260 (516) T protein:vir:96 199 LKGK------------KCKEDDSVKLYTHAKY---LGDGFWELKQS-ADDIPVGK--VSKIKSEKLPFIPLTWKRSYGED 260 (516) T ss_pred hhhh------------hcCCCCceEEEEeeee---eCCceeEEEEE-eCceeecc--ccccccccCCeeeeeeeecCCCC Confidence 0000 0011234566554333 33454333322 22334544 45555567999999999999999 Q ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCcc--ccHHHH Q lcl|NC_021532. 305 HGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNA--IPSSAF 382 (663) Q Consensus 305 ~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~--~~~~~~ 382 (663) ||+|+++...+.-+.+|++.+..+.+...+++|.++++++.+...+.....+.+.+... ..+.+.+++... -.+.+. T Consensus 261 YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g-~~~~v~~~q~~~~~d~~~~~ 339 (516) T protein:vir:96 261 WGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTG-VEEDIHIVQLGKYADLTPIS 339 (516) T ss_pred cccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecC-CcccceeeecCcccchhHHH Confidence 99999999999999999999999999999999999997776654333322232333222 234455554433 235567 Q ss_pred HHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021532. 383 DMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIR 462 (663) Q Consensus 383 ~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ir 462 (663) ..++.+.+.|....=++. ....++..-||++|..+.+.....|..++.++...++.|++.+.+..+. T Consensus 340 ~~i~~~~~rI~~af~~~~---l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~---------- 406 (516) T protein:vir:96 340 AVLEVYTRRIGVVFMMET---MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG---------- 406 (516) T ss_pred HHHHHHHHHHHHHHhhhh---hccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC---------- Confidence 778888888877542221 1122333459999999999999999999999998899999877653321 Q ss_pred EecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhccC--CCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 463 VTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGPN--EDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 463 i~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~~--~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) +. + |+ +.+++.+..+. ++.+....+.+..+++.++.. .+|.+. +.-+..+.++.+.... T Consensus 407 ---p~---l-p~---~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~--------d~id~d~~~~~~a~~~ 468 (516) T protein:vir:96 407 ---ES---F-TS---DLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVL--------AAVKWPDYMDWVRGQI 468 (516) T ss_pred ---CC---C-cc---ccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHH--------hcCCHHHHHHHHHHHh Confidence 11 1 11 11233333322 234444444555555554432 233221 2222222333333222 Q ss_pred cchhhHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQ---EKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKA 594 (663) Q Consensus 540 ~~~~~~~---~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~ 594 (663) +-|...- ++..++.+++++++++ +.+..+.+++...++ +.+.+.+ T Consensus 469 Gvp~~~irs~eev~~~~~~~~~~q~~---~~~a~~~~~~~~~~~-------~~~~~~~ 516 (516) T protein:vir:96 469 SAELPFLKSAEEMAQEQEAQMQAQQA---QMLEEGVAKAVPGVI-------QQELKEA 516 (516) T ss_pred CCCccccCCHHHHHHHHHHHHHHHHH---HHHHHHhhhhhhHHh-------hcccccC Confidence 2221110 0000000000000000 000000000000000 0000000 No 43 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=1.4e-29 Score=178.82 Aligned_cols=486 Identities=11% Similarity=0.016 Sum_probs=274.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC---ccccCC-CccccHHHHHHHHHHHHHHHHhhcCC- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG---NEQKGK-SAIVSRDIKKQSEWQHATIVDPFVST- 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~---~~~~g~-s~~~~~~i~~~v~~~~~~l~~~~~~~- 75 (663) || +.+.+.|+..+ |+.++..|+++.+|..-.... ...+++ ...+.......++.+.+.|+..+|+. T Consensus 1 mk-------~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) T protein:vir:63 1 MK-------TTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) T ss_pred Ch-------hHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCC Confidence 33 33445555443 677888899888887632111 111111 23566777778899999999999985 Q ss_pred CceEEEEeCCc-------chHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITW-------EDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~-------~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+... ++.+.+++ .+..+...+ ..++++..++.++.+.+..|++++.+. .+ T Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~Li~~G~a~l~~~--~~------- 141 (510) T protein:vir:63 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLAVLTQVIKLLIVTGNALLYRD--SD------- 141 (510) T ss_pred CcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEEEEc--CC------- Confidence 69999987532 12222222 333333334 357888889999999999999987752 10 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ...+..++..++++..++.+.+++ ++++..+|..+|-+.. -.. T Consensus 142 ---------------------------~~~~~~~pl~~y~v~~d~~G~vd~---i~rr~~~t~~~l~e~~-~~~------ 184 (510) T protein:vir:63 142 ---------------------------AATVVAWSLRSYAVRRDATGRWMD---IVLKQRYKSKDLDEEY-KQD------ 184 (510) T ss_pred ---------------------------CcEEEEEEcceeEEeeCCCcCeeE---EEeeeeccHHHHhHHh-hhh------ Confidence 012456788889998887776655 6888999988874421 000 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEE-ECCEEEecccCCCcCCCCCEEEEeeeeec Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAW-INDVIVRLQSNPYPDGKPPFLVVPFNSIP 301 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~-~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~ 301 (663) ..... ....+.+.|.||.+-++.+..+ ...+.+++ +++..+ ..++-|++..+||++..|...+ T Consensus 185 -----~~~~~-------~~~~~~~~v~v~~~V~~~~~~~---~~~~sv~~e~dg~~~-~~~~~~~~~e~P~~~~Rw~~~~ 248 (510) T protein:vir:63 185 -----LMRAG-------RNLSGSGSVDLYTHVQRKKGTA---MEYAELYHEIDGVRV-GKEGRWPIHLCPYIVPTWNLAP 248 (510) T ss_pred -----hhccc-------cccCCCcceEEEEEEEeecCCC---ceEEEEEEEecCcee-ccccccccccCceeeeeeeecC Confidence 00000 0011224577887766543221 22222233 344333 2356677788999999999999 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-chhhhccCCcceEeCCCCCccccccCc--ccc Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ-TNRKKFLAGANFEFNGTANDFWHGSYN--AIP 378 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~-~d~~~~~p~~vi~~~~~~~~~~~~~~~--~~~ 378 (663) |..||.|+++...+.-+.+|++.+..+.....+++|.++++++.+.. .+.....+|.++. +..+.+.+++.. ... T Consensus 249 ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~--g~~~~v~~~~~~~~~d~ 326 (510) T protein:vir:63 249 GEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVP--GGAEAVRAYERGDYNKM 326 (510) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeec--CCcccceeeecCcccch Confidence 99999999999999999999999999999999999999998776543 3333444454532 223444444433 233 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) +.+...++.+.+.|....= ..+...++..-||++|+.+.+.....|..++.++...++.|++++.+.++.... T Consensus 327 ~~~~~~i~~~~~rI~~af~----~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g--- 399 (510) T protein:vir:63 327 AAIQQSLQAVVVRLNQAFM----YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL--- 399 (510) T ss_pred HHHHHHHHHHHHHHHHHHH----hhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc--- Confidence 5567788888888877631 112223333459999999999999999999999999999999999988886532 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcc-hhHHHHHHHHHhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPK-IRRDIMADIMDLMRMPEQAKRMRE 537 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~-~~~~~l~~~~~l~~~~e~~~~l~~ 537 (663) ..++.++.+... .|+.-.++....+.+.+..+.+.++...++. +.+. -+..+.++.+.. T Consensus 400 ---------l~p~p~~~~~~~---~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~--------id~d~~~~~~a~ 459 (510) T protein:vir:63 400 ---------LQGLITKQHKPA---IETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR--------ISLPKMMDTIWA 459 (510) T ss_pred ---------CCCCCchhcccc---eecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhcc--------CCHHHHHHHHHH Confidence 223344433321 1222345566666666666555554332221 1110 112222222222 Q ss_pred hhcc-hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 538 YEPK-PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAV 591 (663) Q Consensus 538 ~~~~-~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~ 591 (663) ..+- |...-....+.++..+++.++.+++. ++++...+...++-...+.. T Consensus 460 ~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~----~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 460 AFSVDTSQFYKSADELQAEAEQQRQQAAQAQ----AAQETLLEGASDMTNALAGV 510 (510) T ss_pred HhCCChhHhcCCHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhcccccCC Confidence 2221 11110000000000000000000000 00000000000000000011 No 44 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=1.6e-29 Score=178.53 Aligned_cols=487 Identities=10% Similarity=0.013 Sum_probs=274.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC---ccccCC-CccccHHHHHHHHHHHHHHHHhhcCC- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG---NEQKGK-SAIVSRDIKKQSEWQHATIVDPFVST- 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~---~~~~g~-s~~~~~~i~~~v~~~~~~l~~~~~~~- 75 (663) ||- .+.+.|+..+ |+.++..|+++.+|..-.... ...+++ ...+.......++.+.+.|+..+|+. T Consensus 1 mk~-------~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 71 (510) T protein:vir:78 1 MKS-------TAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) T ss_pred Chh-------HHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCC Confidence 443 3344444433 667888899888887632211 111111 22456667778899999999999984 Q ss_pred CceEEEEeCCcch-------HHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 76 ADIIKCTPITWED-------TDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 76 ~~~~~~~p~~~~D-------~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .+||++.+..... .+.++ ..+..+...+ ..++++..++.++.|.+..|++++.+. .. T Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~~--~~------- 141 (510) T protein:vir:78 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLAVLTQVIKLLIVTGNALLYRN--SD------- 141 (510) T ss_pred CcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhhCeEEEEEe--CC------- Confidence 6999998754321 11121 1223333334 357888889999999999999887542 10 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) ...++.+|..++++..++.+.+++ ++++..+|..+|.+... .+. . T Consensus 142 ---------------------------~~~~~~~pl~~y~v~~d~~G~vd~---i~rr~~~t~~~l~~~~~-~~~--~-- 186 (510) T protein:vir:78 142 ---------------------------EATVVAWSLRSYAVRRDATGRWMD---IVLKQRYKSKDLDDVYK-QDL--M-- 186 (510) T ss_pred ---------------------------CCeEEEEEcceeEEeeCCCcCeeE---EEeeeeccHHHHHHHhh-HHh--h-- Confidence 002455777888888877666655 78889999999876521 100 0 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) .. . ....+.+.|.||.+.++.+..+.+...+|.- +++..+ ..++-|++.++||++..|...+| T Consensus 187 ---~~----~-------~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e--~dg~~i-~~~~~~~~~e~P~~~~Rw~~~~g 249 (510) T protein:vir:78 187 ---RA----G-------RNLSGSGSVDLYTHVQRRKGTAMDYAEMYHE--IDGVRV-GETGRWPIHLCPYIVPTWNLAPG 249 (510) T ss_pred ---hh----h-------hccCCCceEEEEEEEEeecCCCCcEEEEEEE--ecCeee-ccccccccccCCeeeeeeeecCC Confidence 00 0 0112334688888887754332222222222 344333 24566777889999999999999 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCc--cccHH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYN--AIPSS 380 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~--~~~~~ 380 (663) ..||+|+++...+.-+.+|++.+..+.+...+++|.++++++.+...+.....+.+.+... ..+.+.+++.. ..... T Consensus 250 e~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g-~~~~v~~~~~~~~~d~~~ 328 (510) T protein:vir:78 250 EHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPG-GAEAVRAYERGDYNKMAA 328 (510) T ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecC-CcccccccccCcccchHH Confidence 9999999999999999999999999999999999999998876644333332232333222 23445554433 33355 Q ss_pred HHHHHHHHHHHHHHHhCCChHHcC-CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 381 AFDMISLMNNEIESITGTKSFSGG-INSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 381 ~~~~~~~~~~~~~~~tGi~~~~~G-~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) +...++.+.+.|.... ++. ...++..-||++|+.+.+.....|..++.++...++.|++++.+.++.... T Consensus 329 ~~~~i~~~~~rI~~aF-----~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g---- 399 (510) T protein:vir:78 329 IQQSLQAVVVRLNQAF-----MYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL---- 399 (510) T ss_pred HHHHHHHHHHHHHHHH-----hhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcc---- Confidence 6777888888887753 222 223333459999999999999999999999999999999999998886532 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCc-chhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDP-KIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p-~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) ..++.++.... ..|+.-.++.+..+.+.+..+.+.++...++ ++.+. -+..+..+.+... T Consensus 400 --------l~p~p~~~~~~---~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~--------id~d~~~~~~a~~ 460 (510) T protein:vir:78 400 --------LQGLITKQHKP---AIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR--------ISLPKMMDTIWAA 460 (510) T ss_pred --------CCCCCcccccc---eeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhhc--------CCHHHHHHHHHHH Confidence 12333333222 1133234566666666666666655433321 11111 1112222222222 Q ss_pred hcc-hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPK-PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADM 605 (663) Q Consensus 539 ~~~-~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~ 605 (663) .+- |...-....+.++..++ .+++ +++++.++ ++-++++- .... ....+ T Consensus 461 ~Gv~p~~ivrs~eev~a~~~~-----~~~q-~~~~~~~~----~a~~~~~~---~~~~-----~~~g~ 510 (510) T protein:vir:78 461 FSVDTSQFYKSADELQAEAEE-----QRRQ-AAQAQAAQ----ETLLEGAS---DMTN-----ALAGV 510 (510) T ss_pred hCCChhhhcCCHHHHHHHHHH-----HHHH-HHHHHHHH----HHHHHhhh---hhcc-----cCCCC Confidence 221 11110000000000000 0000 00000000 00000000 0000 00000 No 45 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=2.1e-29 Score=177.85 Aligned_cols=495 Identities=12% Similarity=0.023 Sum_probs=283.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc--cCCCccccHHHHHHHHHHHHHHHHhhcC-CCc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ--KGKSAIVSRDIKKQSEWQHATIVDPFVS-TAD 77 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~--~g~s~~~~~~i~~~v~~~~~~l~~~~~~-~~~ 77 (663) |.|.-..-.+.+.+.|+..++.|+.++..|++..+|..-....... .....++.......++.+.+.|+..+|+ +.+ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 80 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLFPAQRS 80 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 7777666677899999999999999999999999997643221111 1123466677777889999999999998 579 Q ss_pred eEEEEeCCcchH-------HHH------HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccc Q lcl|NC_021532. 78 IIKCTPITWEDT-------DSA------EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGE 144 (663) Q Consensus 78 ~~~~~p~~~~D~-------~~A------e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~ 144 (663) ||++.+...+.. ..+ +..+..+...+ ..++++..++.++.+.+..|+|++.+. . T Consensus 81 WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~ly~~--~---------- 147 (517) T protein:vir:10 81 FFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYG-ESLQFRPAVVEAFKHLIVTGNVMMYHP--D---------- 147 (517) T ss_pred cccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEEe--C---------- Confidence 999987543211 111 22233333333 457888899999999999999987541 0 Q ss_pred ccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhcc Q lcl|NC_021532. 145 AVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 224 (663) ....++.++..++++..++.+.+++ ++++..++...|.+... ... T Consensus 148 ------------------------~~~~~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~~~~~-~~~------- 192 (517) T protein:vir:10 148 ------------------------KTSPIQAVPLHHYCVRRDNNGTVLD---IVFLQEKALETFEPSIR-MAI------- 192 (517) T ss_pred ------------------------CCCcEEEEEcCeEEEeeCCCcCeEE---EEeeeeccHHHHHHHhh-hhc------- Confidence 0112455777889999887776655 67888999998876521 110 Q ss_pred chhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcc Q lcl|NC_021532. 225 GEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKL 304 (663) Q Consensus 225 ~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~ 304 (663) ..... .. ...+.+.|+||.|-++ ..+|...+|.. +++..+ ..++-|+...+||++..|...+|.. T Consensus 193 ~~~~~--------~~-~~~~~~~v~v~~~v~~---~~~~~~~~~~~--~d~~~~-~~~s~y~~~e~P~~~~Rw~~~~ge~ 257 (517) T protein:vir:10 193 QASRK--------GK-QYKDKDNVKLYTHAKR---TKDGKYLIRQS--ADDVPV-GKESTVTEDKSPFLILTWKRSYGED 257 (517) T ss_pred chhhh--------hh-ccCCcCceEEEEEEEE---eCCCceEEEEE--eCceee-ccccccccccCCeeeeeeeecCCCC Confidence 00000 00 0112245777765444 23444333222 355443 3466777788999999999999999 Q ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhh-hccCCcceEeCCCCCccccccCc--cccHHH Q lcl|NC_021532. 305 HGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRK-KFLAGANFEFNGTANDFWHGSYN--AIPSSA 381 (663) Q Consensus 305 ~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~-~~~p~~vi~~~~~~~~~~~~~~~--~~~~~~ 381 (663) ||+|+++...+.-+.+|++.+..+.+...+++|+++++++.+...+.. ...+|.++. +..+.+.+++.. .-.+.+ T Consensus 258 YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g~~~~--g~~~~v~~~~~~~~~d~~~~ 335 (517) T protein:vir:10 258 YGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLH--GVEGDIHIVQLGKYADYTPI 335 (517) T ss_pred cccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCcccccc--CCcccceeeecccccchhHH Confidence 999999999999999999999999999999999999988776543322 222333322 122344444432 234566 Q ss_pred HHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEE Q lcl|NC_021532. 382 FDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVI 461 (663) Q Consensus 382 ~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~i 461 (663) ...++.+.+.|....=++. ++. .++..-||++|..+.+.....|..++.+|...++.|+..+.+..+......+ T Consensus 336 ~~~i~~~~~rI~~af~~~~--l~~-~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~--- 409 (517) T protein:vir:10 336 QAVLNDYRQRIGRVFMMEA--MTR-RDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSK--- 409 (517) T ss_pred HHHHHHHHHHHHHHHhhhh--hhc-cCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCC--- Confidence 7788888888887663322 221 1222459999999999999999999999999999999988876654322111 Q ss_pred EEecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhccC--CCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 462 RVTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGPN--EDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 462 ri~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~~--~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) ...+.+.++. ++.+....+.+..+++.+++. .+|.+.. .-+..+..+.+... T Consensus 410 -----------------~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~--------~id~d~~~~~~a~~ 464 (517) T protein:vir:10 410 -----------------NVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQ--------AIKWPDFTDWVQGQ 464 (517) T ss_pred -----------------CccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh--------cCCHHHHHHHHHHH Confidence 1223333332 233444444455555544332 2332221 11222222222222 Q ss_pred hcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLS 600 (663) Q Consensus 539 ~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~ 600 (663) .+-|...--...+ .++.+++.+ ++++.++.+++ ...+.....+ .-+...+-.+ T Consensus 465 ~Gvp~~~irs~~e--v~~~~~~~~--~~~~~~~~~~~---ag~~~~~~~~--~~~~~~~~~~ 517 (517) T protein:vir:10 465 ISANFPFFKTQDE--LNAEAQAQQ--EQEATKYAAEQ---AGKAIPDMVK--NGQINPQGGQ 517 (517) T ss_pred hCCChhhcCCHHH--HHHHHHHHH--HHHHHHHHHHH---HHHHHHHHHh--CCCCCCCCCC Confidence 2211110000000 000000000 00000000000 0000000000 0000000000 No 46 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=5.4e-29 Score=175.55 Aligned_cols=489 Identities=12% Similarity=0.045 Sum_probs=278.8 Q ss_pred CcHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccC--CCccccHHHHHHHHHHHHHHHHhhcC- Q lcl|NC_021532. 3 INKAEL-----LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKG--KSAIVSRDIKKQSEWQHATIVDPFVS- 74 (663) Q Consensus 3 ~~~~~~-----~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g--~s~~~~~~i~~~v~~~~~~l~~~~~~- 74 (663) |.+..+ .+.|.+.|+..++.|+.++..|++..+|..-...+....+ ...++.......++.+.+.|+..+|+ T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 80 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcccccccccchHHHHHHHHHHHHHHhhcCC Confidence 444443 5778888999999999999999999999865333222222 22356777777899999999999998 Q ss_pred CCceEEEEeCCcch-------HHHHH------HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 75 TADIIKCTPITWED-------TDSAE------QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 75 ~~~~~~~~p~~~~D-------~~~Ae------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) +.+||++.+..... .+.++ ..+..+...+ ..++++..++.++.+.+..|+|++.+ |.. T Consensus 81 ~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--d~~------ 151 (515) T protein:vir:70 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL-EQRQFRPAIVEVFKHLIVAGNCLLYK--PSK------ 151 (515) T ss_pred CCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-HhcCchHHHHHHHHHHHhHCeEEEEE--eCC------ Confidence 57999998654322 22222 2333333334 35788888999999999999999765 210 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) + .++.+|..++++..++.+.++. ++++..+|..+|.+..- ... .. T Consensus 152 ---------------------------~--~~~~~pl~~y~v~~d~~G~v~~---i~rr~~~t~~~l~~~f~-~~~--~~ 196 (515) T protein:vir:70 152 ---------------------------G--AMSAVPMHHYVVNRDTNGDLMD---VILLQEKALRTFDPATR-MAI--EV 196 (515) T ss_pred ---------------------------C--CeEEEEcCeEEEeeCCCcCeeE---EEeeeeccHHHHHHhhh-hhh--hh Confidence 0 1345677889998887776655 78889999999877521 110 00 Q ss_pred hccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeec Q lcl|NC_021532. 222 KTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIP 301 (663) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~ 301 (663) ..... . ....+.|.+|.+= ...++|...++.. +++..+ ..++-|++..+||++..|...+ T Consensus 197 ~~~~~-----~---------~~~~~~v~i~~~v---~~~~~~~~~~~~e--~d~~~~-~~es~y~~~e~P~~~~Rw~~~~ 256 (515) T protein:vir:70 197 GMKGK-----K---------CKEDDNVKLYTHA---QYAGEGFWKINQS--ADDIPV-GKESRIKSEKLPFIPLTWKRSY 256 (515) T ss_pred hhhhh-----h---------cCCCCceEEEEEE---EecCCCceEEEEe--cCceee-ccccccccccCCceeeeeeecC Confidence 00000 0 0012345665432 2234555544333 244433 3456677788999999999999 Q ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCcc--ccH Q lcl|NC_021532. 302 FKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNA--IPS 379 (663) Q Consensus 302 ~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~--~~~ 379 (663) |..||+|++....+.-+.+|++.+..+.+...+++|.++++++.+...+.....+.+.+..+ ..+.+.+++... -.+ T Consensus 257 ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~iv~g-~~~~v~~~~~~~~~d~~ 335 (515) T protein:vir:70 257 GEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITG-VAEDIHIVQLGKYADLT 335 (515) T ss_pred CCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCceeecC-CcccceeeecCcccchh Confidence 99999999999999999999999999999999999999998877654443333332333222 234444554332 235 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+.+.|....=++..... ++..-||++|..+.+.....|..++.++...++.|+..+.+. . T Consensus 336 ~~~~~i~~~~~rI~~af~~~~l~~r---d~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~---~------ 403 (515) T protein:vir:70 336 PISAVLEVYTRRIGVIFMMETMTRR---DAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ---E------ 403 (515) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhcc---CCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH---h------ Confidence 5677788888888776544332222 233459999999988888899999999988888888654321 1 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhcc--CCCcchhHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGP--NEDPKIRRDIMADIMDLMRMPEQAKRMR 536 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~--~~~p~~~~~~l~~~~~l~~~~e~~~~l~ 536 (663) .+ +..|..+ .++.+..+. ++.+....+.+..+++.++. ..+|.+ ++..+..+..+.+. T Consensus 404 -------~~-p~~P~~~---v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~--------~~~id~d~~~~~~a 464 (515) T protein:vir:70 404 -------AG-DSFTSEL---VDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPA--------QRAIRWGDYMDWVR 464 (515) T ss_pred -------hC-CCCChhh---cccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhH--------HhhCCHHHHHHHHH Confidence 11 1122222 223333322 33444455555555555432 222222 11122222222222 Q ss_pred hhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 537 EYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRS 587 (663) Q Consensus 537 ~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~ 587 (663) .....+...-....+.++..++.+.+++++.+.++..++..-.....++++ T Consensus 465 ~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 465 GQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 222222111111111111100000000000000000000000000000000 No 47 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=99.96 E-value=6.3e-28 Score=169.72 Aligned_cols=487 Identities=10% Similarity=-0.033 Sum_probs=269.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc----CCccc-cCC-CccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEP----YGNEQ-KGK-SAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~----~~~~~-~g~-s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |+-+-+.+ |. +.-|+.++..|+++.+|..-.. ..... .++ ...+...-...++.+.+.|+..+|+ T Consensus 1 m~~~~~~l-------~~--k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 71 (514) T protein:vir:80 1 MRQQASAM-------WA--EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFP 71 (514) T ss_pred CccchHHH-------HH--HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcC Confidence 44333222 11 2236678888888888865321 11111 111 2234566667889999999999998 Q ss_pred -CCceEEEEeCC-------cchHHHHHH------HHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 75 -TADIIKCTPIT-------WEDTDSAEQ------NELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 75 -~~~~~~~~p~~-------~~D~~~Ae~------~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) +.+||++.+.. .+|.+.+++ .+..+...+ ..++++..++.++.+.+..|+|++.+. ... T Consensus 72 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~~--~~~---- 144 (514) T protein:vir:80 72 PGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRL-FVNASLSKLHRILKLLVVTGNALFYRE--PGT---- 144 (514) T ss_pred CCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEEEEEe--cCC---- Confidence 57999998742 122333333 223333334 357888889999999999999997752 100 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) -.+..++..++++..++.+.+++ ++++.+++..+|.... T Consensus 145 ------------------------------~~~~~~pl~~y~v~~d~~G~v~~---i~rr~~~~~~~l~~~~-------- 183 (514) T protein:vir:80 145 ------------------------------GKMLVWTMQSYTVRRTSHGDPAV---VVLRQQMPFRELTPEI-------- 183 (514) T ss_pred ------------------------------CcEEEEEcCeEEEeeCCCcCeEE---EEeeeeecHHHhhhhh-------- Confidence 01345677889998887776655 6788899988875421 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI 300 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~ 300 (663) .. .... ......+.++|.||.|.++.+..+.+...+|.. +++..+ ..++-|++.++||++..|... T Consensus 184 ~~----~~~~-------~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e--~~g~~i-~~es~y~~~e~P~i~~Rw~~~ 249 (514) T protein:vir:80 184 QA----DAQA-------KQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHE--LEGKRV-GPESSYPAHLCPYVPVAWNVP 249 (514) T ss_pred hh----hhhh-------hhccCCCCCceEEEEEEEeecCCCCeEEEEEEe--ccceee-cccCccccccCCeeeeeeEec Confidence 00 0000 001112334688888877765433333333221 344333 346677777899999999999 Q ss_pred cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCc--ccc Q lcl|NC_021532. 301 PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYN--AIP 378 (663) Q Consensus 301 ~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~--~~~ 378 (663) +|..||.|++....+..+.+|++.+..+.....+++|.++++++.+...+.....+.+.+..+ ..+.+.+++.. ... T Consensus 250 ~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g-~~~~v~~~~~~~~~d~ 328 (514) T protein:vir:80 250 DGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPG-QVGSVASYERGDYNKI 328 (514) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecC-CCccceeeecCcccch Confidence 999999999999999999999999999999999999999998876654443333333333322 23445554433 234 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCC-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGIN-SGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~-~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) +.+...++.+.+.|.... ++... .++..-||++|..+.+.....|..++.++...++.|+..+.+.++.... T Consensus 329 ~~~~~~i~~~~~rI~~aF-----ml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~-- 401 (514) T protein:vir:80 329 AQASASVESIVMRLNRAF-----MYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGN-- 401 (514) T ss_pred HHHHHHHHHHHHHHHHHH-----hhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhc-- Confidence 455677888888887643 22211 2332359999999999999999999999999999999999888775421 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhccC--CCcchhHHHHHHHHHhhhhhhhhhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGPN--EDPKIRRDIMADIMDLMRMPEQAKR 534 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~~--~~p~~~~~~l~~~~~l~~~~e~~~~ 534 (663) .+.+.++..+. +.++++++. .+.+....+.+..+++.++.. .+|.+ ++.-+..+.++. T Consensus 402 -------~g~lP~~p~~l----~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v--------~d~id~d~~~~~ 462 (514) T protein:vir:80 402 -------GGMLLGIAQGV----YRPSIITGIPALTRNIETANILRATQEASAIVPALVQL--------SKRFDPEKLVER 462 (514) T ss_pred -------cCCCCCCCchh----hcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhh--------hhcCCHHHHHHH Confidence 01122222222 234444332 334444444444444443321 11221 122222223333 Q ss_pred hhhhhcchhh-HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 535 MREYEPKPDP-VQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSS 601 (663) Q Consensus 535 l~~~~~~~~~-~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~ 601 (663) +....+-|.. .-....+ ..+.++ +.+++++++++.++.. ..+.+ ++... -. T Consensus 463 ~a~~~Gvp~~~i~~~~e~---~~~~~~--~~~~~~~~~~~~~~~~-~~~~~---~~~~~-------~~ 514 (514) T protein:vir:80 463 IFANNSVDLSTLSKDPDV---VAAEAE--QEAALAQQQLDVASGA-LAAET---SAGVL-------TS 514 (514) T ss_pred HHHHhCCCHhhccCCHHH---HHHHHH--HHHHHHHHHHHHHHHH-HHHhh---hcccc-------CC Confidence 3222222210 0000000 000000 0000000000000000 00000 00000 00 No 48 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=99.96 E-value=4.4e-28 Score=170.56 Aligned_cols=486 Identities=12% Similarity=0.045 Sum_probs=273.0 Q ss_pred CCCc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc--cCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 1 MKIN----KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ--KGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 1 ~~~~----~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~--~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |+-+ -.--.+.+.+.|+..+..|++++..|+++.+|..-....... ++...++...-...++.+.+.|+..+|+ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 80 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFP 80 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccccccccchHHHHHHHHHHHHHhhhcC Confidence 4433 333457899999999999999999999999998643322211 2223466777777889999999999998 Q ss_pred -CCceEEEEeCCcch-------HHHH------HHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 75 -TADIIKCTPITWED-------TDSA------EQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 75 -~~~~~~~~p~~~~D-------~~~A------e~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) +.+||++.+..... .+.+ +..+..+...+ ..++++..++.++.+.+..|+|++.+ |..+ T Consensus 81 p~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~~~~~~~~~L~~~G~a~l~~--d~~~---- 153 (516) T protein:vir:10 81 AQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKEL-EQRQFRPAVVEAFKHLIVAGSCMLYK--PSKG---- 153 (516) T ss_pred CCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHH-HhcCcHHHHHHHHHHHHhHCeEeEEe--cCCC---- Confidence 57999998654321 1111 22333443334 35788888999999999999998754 2100 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) .++.++..++++..++.+.+.+ ++++..++...|.+.. .++ T Consensus 154 -------------------------------~~~~~pl~~y~v~~d~~G~v~~---ivrr~~~~~~~l~e~~--~~~--- 194 (516) T protein:vir:10 154 -------------------------------AISAIPMHHYVVNRDTNGDLLD---IILLQEKSLRTFDPAT--RAV--- 194 (516) T ss_pred -------------------------------CeEEEEcCeEEEeeCCCCCeEE---EeeeecccHHHHHHHh--hhh--- Confidence 1345677889988887766655 6777888888876542 111 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI 300 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~ 300 (663) ........ .......+.+|.|=.+ +.++...++ .-.++..+ ..++-|+...+||++..|... T Consensus 195 ~~~~~~~~------------~~~~~~~~~i~t~v~~---~~~~~~~~~--~~~d~~~~-~~~s~~~~~e~P~~~~Rw~~~ 256 (516) T protein:vir:10 195 VEVGLKGK------------KCKEDDSIKLYTHAKY---LGEGFWELK--QSADDIPV-GKVSKIKSEKLPFIPLTWKRS 256 (516) T ss_pred hhhhhhhh------------ccCCCCceEEEEEEEe---cCCCceEEE--EeeCceee-ccccccccccCCeeeeeeeec Confidence 00000000 0011234555543222 334433222 22344433 234555556899999999999 Q ss_pred cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCcc--cc Q lcl|NC_021532. 301 PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNA--IP 378 (663) Q Consensus 301 ~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~--~~ 378 (663) +|..||.|+++...+..+.+|++.+..+.....+++|.++++++.+...+.....+.+.+.. +..+.+.+++... -. T Consensus 257 ~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~~~~-g~~~~v~~~q~~~~~d~ 335 (516) T protein:vir:10 257 YGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVT-GVEEDIHIVQLGKYADL 335 (516) T ss_pred CCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccCCCceeec-CCcccceeeecCcccch Confidence 99999999999999999999999999999999999999999777664433322222223322 2234445554433 23 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) +.+...++.+.+.|....=++... .-++..-||++|..+.+.....|..++.++...++.|++.+.+..+ T Consensus 336 ~~~~~~i~~~~~rI~~af~~~~l~---~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~------- 405 (516) T protein:vir:10 336 TPISAVLEVYTRRIGVVFMMETMT---RRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA------- 405 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhh---ccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh------- Confidence 556777888888887654333221 1123345999999999999999999999998889999887654211 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecc-cchhHHHHHHHHHHHHHhccC--CCcchhHH-----HHHHHHHhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISIST-AEDNAAKSQELSFLLQTLGPN--EDPKIRRD-----IMADIMDLMRMPE 530 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~-~~~~~~~~q~l~~~~~~~~~~--~~p~~~~~-----~l~~~~~l~~~~e 530 (663) .+ ++ |+.+. ++++..+. ++.+....+.+..+++.++.. .+|.+... .+...++..+.+- T Consensus 406 ------~p---~~-P~~lv---~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~ 472 (516) T protein:vir:10 406 ------GD---SF-TSDLV---DPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL 472 (516) T ss_pred ------CC---CC-Chhhc---CcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh Confidence 11 11 22221 22232222 334444445555555554432 23322111 1111222222220 Q ss_pred hhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 531 QAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRS 587 (663) Q Consensus 531 ~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~ 587 (663) . +...++..+ ++..++++.++ .+.+..+.++++......+++.+ T Consensus 473 --~----~irs~eev~----~~r~~~~~~q~---~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 473 --P----FLKSAEEME----QEQEAQMQAQQ---AQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred --h----ccCCHHHHH----HHHHHHHHHHH---HHHHHHHhhhcccchhhhhhhcC Confidence 0 111111110 10000000000 00000011111111111111111 No 49 >protein:vir:103385 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024736;genbank:gi:48697078;genbank:GeneID:2846053 Probab=99.92 E-value=5.6e-28 Score=170.01 Aligned_cols=566 Identities=15% Similarity=0.115 Sum_probs=312.9 Q ss_pred CCCcH--------------HHHH-HHHHHHHHHH----HHHHHHHHHHHHHHHHHhc---CCc-CC-cc-----ccC--- Q lcl|NC_021532. 1 MKINK--------------AELL-SALKADMKAA----DVLKQEQDSLISTWKAEYN---GEP-YG-NE-----QKG--- 48 (663) Q Consensus 1 ~~~~~--------------~~~~-~~l~~~~~~~----~~~~~~~~~~~~~~~~~y~---~~~-~~-~~-----~~g--- 48 (663) |-++- ++++ .-|++-++-+ -+|-+++-.....|-+|.- +.. .+ +. .+- T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~ 80 (666) T protein:vir:10 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVV 80 (666) T ss_pred CCcCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhhHHHhHHhhhhccCCCceeeecccccccCcceee Confidence 44431 1222 2222222222 1345555444444544532 211 11 00 000 Q ss_pred CCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHH-hHHHHhccchhHHHHHHHHHHHhcCceE Q lcl|NC_021532. 49 KSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLL-NTQFSRKFDRFNFMSKAVKVLDREGTLV 127 (663) Q Consensus 49 ~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~G~g~ 127 (663) ...+|+|.|..+|++++++|.++|.++.|.+.|.. +|+..+.||++..+| +|.....+....++ +++|+++|...- T Consensus 81 ~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL--~L~D~~KYN~~~ 157 (666) T protein:vir:10 81 NKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELIL--CLQDAAKYNLVG 157 (666) T ss_pred ccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHHH--HHhhhhhcceee Confidence 23578899999999999999999999999999985 889999999999888 45444444444444 899999999988 Q ss_pred EEeeeccccceecccccccccCccccccccccccccc-eeecccceeeeccHHHheeCccc-ccCh-hhCceEEEEeecC Q lcl|NC_021532. 128 VQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTET-VVKKNQPTARVCRNEDIYLDPTC-QDNL-DNAQFVIHRYETD 204 (663) Q Consensus 128 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~~dp~a-~~d~-~d~~~~~~~~~~~ 204 (663) +++.|-.- .. +++.+.-...++..... |.-+..-++++++|.+.||||+. ..++ ....|.++...++ T Consensus 158 ~ET~Ws~I----E~------~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~ 227 (666) T protein:vir:10 158 WETEWSHI----ET------YDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLN 227 (666) T ss_pred eeeccccc----cc------cchhhhhhcCCCceeecccchhhhhhhhccccccccccCCCCCCchhhhhhhhhHHHHHH Confidence 77777321 00 00000000001111110 11112236789999999999963 3444 3567999988888 Q ss_pred HHHHHHhcCCcC---------hhhhh-hcc--chh-------------------hhccccccccccccccccceEEEEE- Q lcl|NC_021532. 205 LSTLKKDGRYKN---------LDKLA-KTS--GED-------------------FDYDSPDDTEFQFSDAPRKKLIIYE- 252 (663) Q Consensus 205 ~~~l~~~g~~~~---------~~~~~-~~~--~~~-------------------~~~~~~~~~~~~~~d~~~~~v~v~E- 252 (663) +-.|++.-.+-. +...+ ..+ +.+ .+.+.+...+...+ ...++|-|-| T Consensus 228 R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~s-S~~~rvpvneq 306 (666) T protein:vir:10 228 RIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETS-STNRRVPVNEQ 306 (666) T ss_pred HHHHHHHHhhhhcchhhhHHHHHHHHHhhhccccccccCCccCccccccchhhccchhhcCccccccc-ccccccccccc Confidence 888776422211 11100 000 000 00000000000000 0112232221 Q ss_pred -------EEEEe---eec-----CCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHH Q lcl|NC_021532. 253 -------YWGNY---DVD-----GDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ 317 (663) Q Consensus 253 -------~w~~~---~~~-----~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 317 (663) .|.|+ |+. .+...-|..+.+.|+.++...+--..++.+|.-+.-...+.-..-..|+.+..+|.| T Consensus 307 g~Y~k~~~Y~RI~PSDF~~~~P~~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDG~G~QTQ~~~E~~~P~Q 386 (666) T protein:vir:10 307 GVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKAVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQ 386 (666) T ss_pred cceeeeeeeeeeccccceecCCCCCcceeeeeeeeccceeEeeehhhhccchhhhhhhhhhhhccccccccccccccchh Confidence 22222 111 122233444555667788765544456677765444444444445678899999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCC-------CCccccccCccccHH-HHHHHHHHH Q lcl|NC_021532. 318 KVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGT-------ANDFWHGSYNAIPSS-AFDMISLMN 389 (663) Q Consensus 318 ~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~ 389 (663) +..++++|..+....+.+..+.+++++.+...+.....|-..|.+++. ++...++||...... +.+-.+.+. T Consensus 387 ~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a~~iNSP~~~~KIP~~~~sL~N~~~~~~Y~~IPFD~RG~E~~~Q~A~~l~ 466 (666) T protein:vir:10 387 SATTELWNAYIQGARRAVMDRALYNPSMIRANDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLT 466 (666) T ss_pred hhhhHHhhhhhhhhhhhhhhhhccChhhhhhhcccCCCCCcccceeehhhcccchhhhhccCCccccchhHHHhhhHHHH Confidence 999999999999999999999999988876444333333322322221 234556676655432 344456777 Q ss_pred HHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeee Q lcl|NC_021532. 390 NEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFV 469 (663) Q Consensus 390 ~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v 469 (663) +.-++++|++...+|+---+ +++-.+-.-.|..+..|++..+-.+.+.++.++-+.+.-.+.+|.++..+|.-+.++.+ T Consensus 467 ~~~r~L~GMN~~~~GQFQKG-NKt~~E~~~~MG~a~NR~RLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~ 545 (666) T protein:vir:10 467 DWQRELSGMNSATRGQFQKG-NKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGV 545 (666) T ss_pred hhHHHhhccCCccccccccc-CcceeehhhhcCCcccceehhhHHhhhhhhhhHHHHHhhhhhhccccchhcccccCcee Confidence 88899999999999964222 34555666778889999999988888888888888888889999999999998888888 Q ss_pred ccchhhcCCceEEEEeecc---cchhHHHHHHHHHHHHHhccC------CCcchhHHHHHHHHHhhhhhhhhhhhhhhhc Q lcl|NC_021532. 470 PIRKDDLSGRIDIDISIST---AEDNAAKSQELSFLLQTLGPN------EDPKIRRDIMADIMDLMRMPEQAKRMREYEP 540 (663) Q Consensus 470 ~i~~~~~~~~~d~~v~~~~---~~~~~~~~q~l~~~~~~~~~~------~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~ 540 (663) .|+-+.+.. .-+...+++ +..+.....++..++++++.. .+|+ .+.+++++++|.+.....++-....+ T Consensus 546 ~vDi~~L~~-~~L~F~~~DG~TP~SK~ASs~~lT~~LQMI~sS~~~~~A~G~~-~P~M~AH~~QLGGVRG~E~Y~daalP 623 (666) T protein:vir:10 546 RVDIKELQD-LGLKFELGDGLTPASKLASSDFLTALLQMIMSSETTLQAFGTQ-VPGMIAHLAQLGGVRGFEKYADAALP 623 (666) T ss_pred eeeHHHHhh-hhheeeeccCCCchhhhhhhHHHHHHHHHHhhhhhhHhhhccc-chHHHHHHHHhccccchhhhhhccCC Confidence 888776652 112233333 334555566777777776543 2332 35677888888775554444433333 Q ss_pred chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 541 KPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSE 602 (663) Q Consensus 541 ~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~ 602 (663) +=.+.- -++++.++.-++.+++.+.+.+++ +.++-..+.+- .+ T Consensus 624 ~~~~~~----~~~Q~LQ~~~LQ~~~QSA~Q~~A~-------------Q~~L~~~Q~~P--Sq 666 (666) T protein:vir:10 624 QWQITY----GMQQQLQQMLLQLQQQSAMQLQAR-------------QGELSNDQSQP--SQ 666 (666) T ss_pred cccccc----chhHHHHHHHHHHhhhhhcccccc-------------cccCcccccCC--CC Confidence 211111 111111111111111111111111 00000000000 00 No 50 >protein:vir:96403 Length: 666 # NCBI annotation: hypothetical protein # Family: family:all:11276 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218810;genbank:gi:147917327;genbank:GeneID:5142606 Probab=99.92 E-value=1.5e-27 Score=167.57 Aligned_cols=566 Identities=14% Similarity=0.114 Sum_probs=310.8 Q ss_pred CCCcH--------------HHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHHhc---CCc-CC-cc-----ccC--- Q lcl|NC_021532. 1 MKINK--------------AELL-SALKADMKAAD----VLKQEQDSLISTWKAEYN---GEP-YG-NE-----QKG--- 48 (663) Q Consensus 1 ~~~~~--------------~~~~-~~l~~~~~~~~----~~~~~~~~~~~~~~~~y~---~~~-~~-~~-----~~g--- 48 (663) |-++- ++++ .-|++-++-++ +|-+++-.....|-+|.- +.. .+ +. .+- T Consensus 1 maispsepninsfvytqrvdellkahlkkildfsktnkanyiqKMD~ID~AYARY~~~~~N~~LlG~~~~~~A~V~C~V~ 80 (666) T protein:vir:96 1 MAISPSEPNINSFVYTQRVDELLKAHLKKILDFSKTNKANYIQKMDLIDKAYARYITAQENNELLGYNQNIAAKVRCQVV 80 (666) T ss_pred CccCCCCCcchhhhhHHHHHHHHHHHHHHHhhhhccchhhHHHHhhHHHHhHHhhhhccCCCceeeecccccccccceee Confidence 44431 1222 22222222221 345555444444554542 211 11 00 000 Q ss_pred CCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHH-hHHHHhccchhHHHHHHHHHHHhcCceE Q lcl|NC_021532. 49 KSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLL-NTQFSRKFDRFNFMSKAVKVLDREGTLV 127 (663) Q Consensus 49 ~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~G~g~ 127 (663) ...+|+|.|..+|++++++|.++|.++.|.+.|.. +|+..+.||++..+| +|.....+....++ +++|+++|...- T Consensus 81 ~~~~V~PIViSQV~S~~~YLT~VF~SG~Pi~PVVS-~P~~K~~AE~LE~ii~DH~t~~~~~~~LiL--~L~D~~KYN~~~ 157 (666) T protein:vir:96 81 NKATVNPIVISQVQSMTAYLTEVFASGYPILPVVS-TPDKKEQAEALEGIIQDHMTMTSSIPELIL--CLQDAAKYNLVG 157 (666) T ss_pred ccccCCchhhhhHHHHHHHHHHHHhcCCccceeec-CCchhHHHHHHHHHHHhhhhhhhhHHHHHH--HHhhhhhcceee Confidence 23578899999999999999999999999999985 889999999999888 45444444444444 899999999977 Q ss_pred EEeeeccccceecccccccccCccccccccccccccc-eeecccceeeeccHHHheeCccc-ccCh-hhCceEEEEeecC Q lcl|NC_021532. 128 VQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTET-VVKKNQPTARVCRNEDIYLDPTC-QDNL-DNAQFVIHRYETD 204 (663) Q Consensus 128 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~~dp~a-~~d~-~d~~~~~~~~~~~ 204 (663) +++.|--- .. +++.+.-...++..... |.-+..-++++++|.+.||||+. ..++ ....|.++...++ T Consensus 158 ~ET~Ws~I----E~------~~~~~~i~~~~~~K~TlrR~~r~~~KIrRLN~RN~~~D~~~~~~~VA~~G~~~G~~~L~~ 227 (666) T protein:vir:96 158 WETEWSNI----ET------YDPQKEITDLEPGKTTLRRNYRHVNKIRRLNLRNVHWDPIPDIPNVATEGSFLGETTLLN 227 (666) T ss_pred eeeccccc----cc------cchhhhhhcCCCceeeeccchhhhhhhhccccccccccCCCCCCchhhhhhhhhhHHHHH Confidence 77776310 00 01111000001111110 11112236789999999999963 3444 3567999988888 Q ss_pred HHHHHHhcCCcC---------hhhhh-hcc--chh-------------------hhccccccccccccccccceEEEE-- Q lcl|NC_021532. 205 LSTLKKDGRYKN---------LDKLA-KTS--GED-------------------FDYDSPDDTEFQFSDAPRKKLIIY-- 251 (663) Q Consensus 205 ~~~l~~~g~~~~---------~~~~~-~~~--~~~-------------------~~~~~~~~~~~~~~d~~~~~v~v~-- 251 (663) +-.|++.-.+-. +...+ ..+ +.+ .+.+.+...+...+ ...++|-|- T Consensus 228 R~~LKK~LN~LT~EKkltykkvV~~Al~~s~~~sD~T~~P~IS~vY~~~~~~SDi~WD~~G~~~T~~s-S~~~rvpvneq 306 (666) T protein:vir:96 228 RIQLKKYLNYLTNEKKLTYKKVVNEALKSSFQGSDWTDNPQISPVYQEMEMASDINWDRFGGFETETS-STNRRVPVNEQ 306 (666) T ss_pred HHHHHHHHhhhhcchhhhHHHHHHHHHhhhccccccccCCcccccccccchhhccchhhcCccccccc-ccccccccccc Confidence 888776422211 11100 000 000 00000000000000 111223222 Q ss_pred ------EEEEEe---eec-----CCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHH Q lcl|NC_021532. 252 ------EYWGNY---DVD-----GDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ 317 (663) Q Consensus 252 ------E~w~~~---~~~-----~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 317 (663) ..|.|+ |+. .+...-|..+.+.|+.++...+--..++.+|.-+.-...+.-..-..|+.+..+|.| T Consensus 307 g~Y~k~~mY~RI~PSDF~~~~P~~N~~QIWK~v~IN~~~iIS~~~~I~AY~~~~~~~~~~LEDGmG~QTQ~~~E~~~P~Q 386 (666) T protein:vir:96 307 GVYCKHTMYLRIIPSDFEMNVPNRNQVQIWKAVMINRDAIISFEPYIGAYGSFGMGLAFALEDGMGLQTQGYGEMAAPLQ 386 (666) T ss_pred cceeeeeeeeeeccccceecCCCCCcceeeeeeeeccceeEeeehhhcccchhhhhhhhhhhhccccccccccccccchh Confidence 123232 111 122233444555667788765544456677765444444444445678899999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCC-------CCccccccCccccHH-HHHHHHHHH Q lcl|NC_021532. 318 KVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGT-------ANDFWHGSYNAIPSS-AFDMISLMN 389 (663) Q Consensus 318 ~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~ 389 (663) +..+++++..+....+.+..+.+++++.+...+.....|-..|.+++. ++...++||...... +.+-.+.+. T Consensus 387 ~A~t~L~N~~~~~aRRAV~DRAl~~~S~i~a~~iNSP~~~~KIP~~~~sL~N~~m~~~Y~~IPFD~RG~E~~~Q~A~~l~ 466 (666) T protein:vir:96 387 SATTELWNAYIQGARRAVMDRALYNPSMIRANDINSPIPQIKIPVVPQSLVNGTMDQAYRQIPFDSRGMETVMQNALMLT 466 (666) T ss_pred hhhhHHhhhhhhhhhhhhhhhhhcchhhhhhhcccCCCCCcccceeehhhhccchhhhhccCCccccchhHHHhhhHHHh Confidence 999999999999999999999999988876444333333322322221 234556676655432 344456777 Q ss_pred HHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeee Q lcl|NC_021532. 390 NEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFV 469 (663) Q Consensus 390 ~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v 469 (663) +.-++++|++...+|+---+ +++-.+-.-.|..+..+++..+-.+.+.++.++-+.+.-.+.+|.++..+|.-+.++.+ T Consensus 467 ~~~r~L~GMN~~~~GQFQKG-NKt~~E~~~~MG~a~NRmRLPALiLEH~~F~~iK~~L~LNl~~YG~DT~ViS~RtG~~~ 545 (666) T protein:vir:96 467 DWQRELSGMNSATRGQFQKG-NKTRAEFDTIMGNAENRMRLPALILEHRMFTKIKEQLKLNLLMYGEDTEVISPRTGKGV 545 (666) T ss_pred hhHHHhhccCCccccccccc-CcceeehhhhcCCcccceehhhHHHhhhhhhhHHHHHhhhhhhccccchhcccccCcee Confidence 88899999999999964222 34555666778889999999988888888888888888889999999999998888888 Q ss_pred ccchhhcCCceEEEEeeccc---chhHHHHHHHHHHHHHhccC------CCcchhHHHHHHHHHhhhhhhhhhhhhhhhc Q lcl|NC_021532. 470 PIRKDDLSGRIDIDISISTA---EDNAAKSQELSFLLQTLGPN------EDPKIRRDIMADIMDLMRMPEQAKRMREYEP 540 (663) Q Consensus 470 ~i~~~~~~~~~d~~v~~~~~---~~~~~~~q~l~~~~~~~~~~------~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~ 540 (663) .|+-+.+.. .-+...++++ ..+.....++..++++++.. .+|+ .+.+++++++|.+.....++.....+ T Consensus 546 ~vDi~~L~~-~~L~F~~~DGlTP~SKlASs~~lT~~LQMI~sS~~~~~A~G~~-~P~M~AHl~QLGGVRG~E~Y~~~ALP 623 (666) T protein:vir:96 546 RVDIKELQD-LGLKFELGDGLTPASKLASSDFLTALLQMIMSSETTLQAFGTQ-VPGMIAHLAQLGGVRGFEKYANAALP 623 (666) T ss_pred eeeHHHHhh-hhheeeeccCCCchhhhhhhHHHHHHHHHHhcchhhHhhhccc-chHHHHHHHHhccccchhhcccccCc Confidence 888776652 1122333333 34555566777777776543 2332 35677778887765444443222222 Q ss_pred chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 541 KPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSE 602 (663) Q Consensus 541 ~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~ 602 (663) +=+ -.--+.++.++.-++.+++.+.+.+++ +.++-..+.+- .+ T Consensus 624 qwq----itygm~Q~LQ~~~LQ~~~QSA~Q~~A~-------------Q~~L~~~Q~~P--Sq 666 (666) T protein:vir:96 624 QWQ----ITYGMQQQLQQMLLQLQQQSAMQLQAR-------------QGELSNDQSQP--SQ 666 (666) T ss_pred chh----hhhhhhHHHHHHHHHHhhhhccccccc-------------cccCcccccCC--CC Confidence 100 000000011111111111111111111 00000000000 00 No 51 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.79 E-value=2.7e-18 Score=116.92 Aligned_cols=429 Identities=12% Similarity=0.056 Sum_probs=198.9 Q ss_pred CCCc---------HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----cccCC--CccccHHHHHHHHHH Q lcl|NC_021532. 1 MKIN---------KAEL-LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN----EQKGK--SAIVSRDIKKQSEWQ 64 (663) Q Consensus 1 ~~~~---------~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~~~g~--s~~~~~~i~~~v~~~ 64 (663) ||.. ++++ .+.|.+.++ .+...........+||.|.+.-+ +..++ .+++.|.....|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~----~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~ 76 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFME----KHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTF 76 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHH----HHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHH Confidence 3321 2222 223333222 24445556677889999976322 12222 245556655566655 Q ss_pred HHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccc Q lcl|NC_021532. 65 HATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGE 144 (663) Q Consensus 65 ~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~ 144 (663) ++++ ++..+. |.+ +|++ ..+.++.++. .|+....+..+.++++++|.|++.+++|.+ T Consensus 77 ~~~l----~g~~~~--~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------- 133 (452) T protein:vir:36 77 TGYF----NGIPVK--KSH---SDKE----ILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQDED--------- 133 (452) T ss_pred hhhh----cccCce--eec---CChh----HHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEecCC--------- Confidence 5544 454433 332 2322 2344556553 466767788899999999999999887621 Q ss_pred ccccCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 145 AVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) +.+.+..++|.++++ |+.... ..-++ .+.+... T Consensus 134 ------------------------g~~~i~~~~p~~~~~v~d~~~~~---~~~~~-i~~~~~~----------------- 168 (452) T protein:vir:36 134 ------------------------TQTNVVYNSPENMFMVYDDTVKQ---EPLFA-VRYGVDE----------------- 168 (452) T ss_pred ------------------------CeeEEEEEcccceEEEEcCCCCC---ceEEE-EEEEEec----------------- Confidence 345667788877753 332111 11111 1222100 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) +....+|+|.. +. .+++...++........|.+.|.+|++.+ ++ T Consensus 169 -----------------------~~~~~~~vyt~-----~~---i~~~~~~~~~~~~~~~~~~~~g~iPvv~~-----~n 212 (452) T protein:vir:36 169 -----------------------DKKLQGEVYTL-----LE---TIKISGENDEISFGEGTYNPYPDLPVVEF-----YF 212 (452) T ss_pred -----------------------CceEEEEEEec-----Ce---EEEEEEcCCceEEecceeccCCcccEEEe-----cC Confidence 00122344422 10 11111112222222233444467777644 33 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCC----ccccccCcccc Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTAN----DFWHGSYNAIP 378 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~----~~~~~~~~~~~ 378 (663) ...|.|.+..++++++.+|.+.+.+.+.+...++|.+.+--..++.++.....+++++.+..++. ...++..+.-. T Consensus 213 ~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 292 (452) T protein:vir:36 213 NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSD 292 (452) T ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchhhhhhhhcceEEecCCCCccCCcceeEeecCCH Confidence 45689999999999999999999999999999998777643344444455566677777755432 23444434434 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) ......+..+.+.|...|++++.+.+..++. |+.| +..+............+.|. .+++.++++++.+.... T Consensus 293 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~-Sg~A--l~~~~~~l~~k~~~~~~~~~-~~l~~~~~li~~~~~~~---- 364 (452) T protein:vir:36 293 SQTENLLDRLTKLIFQTTMVANISDESFGSS-SGVS--LAYKLQAMSNLALSFQRKFQ-SSLNSRYKLFCELSTNV---- 364 (452) T ss_pred HHHHHHHHHHHHHHHHHhCccccCcccccCC-cHHH--HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcc---- Confidence 5667778899999999999998877654432 4444 43333333333333444443 24444555544443321 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) |. ..+.. +..+...-..+....+..+. ++.++..++ ...+... +....+.... T Consensus 365 ------~~---~~~~~----~i~i~f~~~~p~d~~~~a~~----~~k~~g~iS----~et~~~~--~~~~~d~~~E---- 417 (452) T protein:vir:36 365 ------SN---KDSWK----DIEYTFTRNEPKDIKEQAET----ANILMGITS----QETALSV--ISVIPDVQAE---- 417 (452) T ss_pred ------CC---ccccc----cceEEeCCCCCcCHHHHHHH----HHHHhccCC----hHHHHHh--CCCCCCHHHH---- Confidence 11 01111 11222222222222222222 222222222 1111111 1111111111 Q ss_pred hcchhhHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPKPDPVQEKIRQLELENLMLENQMLVAS-INDKNARANENTIDAE 583 (663) Q Consensus 539 ~~~~~~~~~q~~q~~~~~~q~~~~~~~a~-~~~~~a~~q~~~~~~~ 583 (663) ..+...++.+. .+..+.. .-..-.......-+.+ T Consensus 418 ----------~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 418 ----------MEKIKKEEAST-AIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred ----------HHHHHHHHHHH-HHHHhhccCCCCcccccCccccCC Confidence 11111111000 0000000 0000000000000000 No 52 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.77 E-value=1.3e-17 Score=113.14 Aligned_cols=416 Identities=12% Similarity=0.031 Sum_probs=195.6 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----cccCC--CccccHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN----EQKGK--SAIVSRDIKKQSEWQHATIVDPFVSTA 76 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~~~g~--s~~~~~~i~~~v~~~~~~l~~~~~~~~ 76 (663) |+.+.|...+.+. ........++.+||.|++.-+ +.+++ .+++.|-.+..|+..++++ ++.. T Consensus 1 l~~~~l~~~i~~~--------~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~g~~ 68 (429) T protein:vir:98 1 MTKDLLSELIQKH--------RSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYF----IGVP 68 (429) T ss_pred CCHHHHHHHHHHH--------HHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhh----cccC Confidence 8887777666542 234455677888999986322 22232 2456666666666666555 4443 Q ss_pred ceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccc Q lcl|NC_021532. 77 DIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETV 156 (663) Q Consensus 77 ~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 156 (663) +. +.+ +++ ...+.++.++. .|+....+..+.++++++|.|++.++++.+ T Consensus 69 ~~--~~~---~~~----~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------------------- 117 (429) T protein:vir:98 69 VQ--TSH---ENK----QVSNYLELLDG-YNDQDDNNAELSKICSIYGHGYELVFNDEN--------------------- 117 (429) T ss_pred ce--eec---CCh----HHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEecCC--------------------- Confidence 33 332 222 23335556554 466666778899999999999999877521 Q ss_pred cccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccc Q lcl|NC_021532. 157 VEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPD 234 (663) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 234 (663) +.+.+..++|.++++ |.... .-...+.+.+.+. T Consensus 118 ------------g~~~~~~~~p~~~~~v~dd~~~----~~~~~~i~~~~~~----------------------------- 152 (429) T protein:vir:98 118 ------------AEAGITYLTPLEAFIVYDDSIR----QKPLFAVRYFYNK----------------------------- 152 (429) T ss_pred ------------CcEEEEEEcccceEEEEeCCCC----CceEEEEEEEEec----------------------------- Confidence 335566788877753 22111 1111222222110 Q ss_pred cccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC--CEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHH Q lcl|NC_021532. 235 DTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN--DVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEM 312 (663) Q Consensus 235 ~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g--~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~ 312 (663) ..+..+++|... . .+.+..+ +..+ ....|.+.|.+|++. .+++.+|.|.+.. T Consensus 153 -----------~~~~~~~~~~~~-----~----~~~~~~~~~~~~~-~~~~~~~~g~vPvv~-----~~n~~~g~sd~e~ 206 (429) T protein:vir:98 153 -----------GGVLEGSYSDAS-----N----ITYFKDGEKGIEI-GESEPHPFDGVPMIE-----YVENEERQSLLAS 206 (429) T ss_pred -----------CceEEEEEEeCc-----e----EEEEEecCCceEe-cccccccCCccceEE-----ecCCCCCCCcHHH Confidence 011222333210 0 0000111 1111 122233346667664 3456679999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCC---ccccccCccccHHHHHHHHHHH Q lcl|NC_021532. 313 IGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTAN---DFWHGSYNAIPSSAFDMISLMN 389 (663) Q Consensus 313 ~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 389 (663) +++.++.+|.+.+.+.+.+...++|.+.+--...+.........++++.+.++++ ...++..+.-...+...++.+. T Consensus 207 v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 286 (429) T protein:vir:98 207 VVTLINAFNKAISEKANDVEYFADAYLKILGAELDDETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLE 286 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCcchhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHH Confidence 9999999999999999999999998766532223333334455567777654332 2344444443455667789999 Q ss_pred HHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeee Q lcl|NC_021532. 390 NEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFV 469 (663) Q Consensus 390 ~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v 469 (663) +.+...|++++.+.+..++ .|+.| +...............+.|.+ ++ +.++.++..+... .+.. T Consensus 287 ~~i~~~s~~p~~~~~~~gn-~Sg~A--l~~~~~~l~~k~~~~~~~~~~-~l----~~~~~li~~~~~~------~~~~-- 350 (429) T protein:vir:98 287 NLIFRTAMVANISDESFGT-ASGIA--LRYRLQAMDNLAKTKERKFMS-GM----NRRYKLIASYPTS------KIGP-- 350 (429) T ss_pred HHHHHHhCccccCcccccc-chHHH--HHHHHHHHHHHHHHHHHHHHH-HH----HHHHHHHHHHhcc------CCCc-- Confidence 9999999999876654332 23333 333333333333333344432 33 3344444444221 1111 Q ss_pred ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHh Q lcl|NC_021532. 470 PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKI 549 (663) Q Consensus 470 ~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~ 549 (663) .+.. +..+...-..+....+..+ .+..+++.++ ...+... +....+....+ T Consensus 351 -~d~~----~i~v~f~~~~p~~~~~~a~----~~~kl~g~is----~et~~~~--l~~v~d~~~E~-------------- 401 (429) T protein:vir:98 351 -KDWI----GIKYKFTRNLPANLLEESQ----IAGNLAGIVS----EETQVGV--LSIVENPQKEI-------------- 401 (429) T ss_pred -cccc----cceEEeCCCCCcCHHHHHH----HHHHHhccCc----hHHHHHh--CCCCCCHHHHH-------------- Confidence 1111 1122222112222222222 2222222221 1211111 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 550 RQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 550 ~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) .+...++.+. .+.++. ....+ ....... T Consensus 402 ~ri~~E~~~~-~~~~~~---~~~~~--~~~~~~~ 429 (429) T protein:vir:98 402 ERKNSDKSTL-ISRQAG---GLNGQ--NTTTILE 429 (429) T ss_pred HHHHHHHHHH-HHHHHh---hhcCC--CCCCCCC Confidence 1111111000 000000 00000 0000001 No 53 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.77 E-value=6.4e-17 Score=109.36 Aligned_cols=421 Identities=10% Similarity=0.010 Sum_probs=189.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc----cc---CCCccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE----QK---GKSAIVSRDIKKQSEWQHATIVDPFV 73 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~----~~---g~s~~~~~~i~~~v~~~~~~l~~~~~ 73 (663) |.=+..+++..|...+.. ......++.+||.|++.-+. .. ..-+++.|-+.-.|+...+.+. T Consensus 1 ~~~~~~~~i~~l~~~~~~-------~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~---- 69 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQR-------LSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD---- 69 (441) T ss_pred CCccHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc---- Confidence 555555567777776554 23345667789999864211 00 1123444555555554444331 Q ss_pred CCCceEEEEe-CCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 74 STADIIKCTP-ITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 74 ~~~~~~~~~p-~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) +.+ +.++++ .+..++. .|+.......++.+++++|.|++.++.|. T Consensus 70 -------~~g~~~~d~~--------~l~~i~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~------------------ 115 (441) T protein:vir:80 70 -------WLGWTNGDGY--------GLDGVYA-ANRLATASCDVHLDALIFGLSFVAIIPHG------------------ 115 (441) T ss_pred -------cccccCCChH--------HHHHHHH-hcCHHHHHHHHHHHHhhcCeeEEEEEeCC------------------ Confidence 111 122322 1344443 46777778889999999999999886541 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) .+.|.+..++|.+++ ||+... .-...+++.... .. T Consensus 116 ---------------~g~~~i~~~~p~~~~~i~d~~~~----~~~~~~~~~~~~---------~~--------------- 152 (441) T protein:vir:80 116 ---------------DGTVSVRPQSPKNCTGKFSADGS----RLDAGLVVQQTC---------DP--------------- 152 (441) T ss_pred ---------------CCceEEEEEccceEEEEEeCCCC----ceeEEEEEEEEe---------cC--------------- Confidence 234566778888875 455321 111111111100 00 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC-CEEEecccCCCcCCCCCEEEEeeeeecCcccCCCh Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN-DVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEAN 309 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g-~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~ 309 (663) .+...+.|.. +. .+.....| +......+.|.+.|++|++.++..+..+.+||.|- T Consensus 153 ----------------~~~~~~vy~~-----~~---~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~ 208 (441) T protein:vir:80 153 ----------------EVVEAELLLP-----DV---IVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSE 208 (441) T ss_pred ----------------ceEEEEEEec-----Ce---EEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCccc Confidence 0011122211 00 01111111 11222334444558899999998889999999986 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc--chhhhccCCcceEeCCCCCc--cccccCccccHHHHH Q lcl|NC_021532. 310 A-EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQ--TNRKKFLAGANFEFNGTAND--FWHGSYNAIPSSAFD 383 (663) Q Consensus 310 ~-~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~--~d~~~~~p~~vi~~~~~~~~--~~~~~~~~~~~~~~~ 383 (663) + +.++++++.+|+..+.+.+.+...+.|...+ .|+ .+. .+.....+++++.+..+++. +.....+. +.... T Consensus 209 l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i-~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~ 285 (441) T protein:vir:80 209 ITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWV-TGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGSFPV--NSPTP 285 (441) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcCceeee-ecCCccccccchhhhcccccccCCCCCCCCcceeEecCc--cchHH Confidence 5 5699999999999999999999999987655 343 221 22334567888776544322 22222222 12333 Q ss_pred HHHHHHHHH---HHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceE Q lcl|NC_021532. 384 MISLMNNEI---ESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEV 460 (663) Q Consensus 384 ~~~~~~~~~---~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ 460 (663) .+..++..+ -.++++++...|..++.. .++.++...............+.|.. -++.++.++..++... T Consensus 286 ~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~-~Sg~Al~~~~~~l~~k~~~~~~~f~~-----~l~~~~~l~~~~~~~~-- 357 (441) T protein:vir:80 286 YSDQMRLLAQLTAGEAAVPERYFGFITSNP-PSGEALAAEESRLVKRAERRQTSFGQ-----GWLSVGFLAAKALDSR-- 357 (441) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhcCC-- Confidence 444454444 445888888888655431 12222333322222222333333432 2334444555443211 Q ss_pred EEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhc Q lcl|NC_021532. 461 IRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEP 540 (663) Q Consensus 461 iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~ 540 (663) +. ....+ .++.+..+-..+....+..+.+..+ .+....+ .....+. .+.+..+ T Consensus 358 ----~~-----~~~~~-~~i~~~f~~~~~~~~~e~ad~~~kl---~~~g~~~-~s~~~~~---~~l~~~~---------- 410 (441) T protein:vir:80 358 ----VD-----EADFF-GDVGLRWRDASTPTRAATADAVTKL---VGAGILP-ADSRTVL---EMLGLDD---------- 410 (441) T ss_pred ----Cc-----ccccc-eeeeEEeCCCCCcCHHHHHHHHHHH---HhcCccc-ccHHHHH---HhCCCCH---------- Confidence 00 00000 1122222222222222222222222 2222211 1111111 1111110 Q ss_pred chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 541 KPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKA 594 (663) Q Consensus 541 ~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~ 594 (663) ++ ..+++.++.+. +...++. .. .. ..+..+. T Consensus 411 --~e----~~~~~~e~~e~--~~~~~~~---~~---------~~---~~~~~~~ 441 (441) T protein:vir:80 411 --VQ----VEAVMRHRAES--SDPLAVL---AG---------AI---SRQTNEV 441 (441) T ss_pred --HH----HHHHHHHHHHH--HHHHHHH---hh---------hh---hcccccC Confidence 00 00111110000 0000000 00 00 0000011 No 54 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.76 E-value=2.2e-17 Score=111.92 Aligned_cols=426 Identities=11% Similarity=0.036 Sum_probs=188.0 Q ss_pred CCCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc----ccCC--CccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021532. 1 MKIN-KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE----QKGK--SAIVSRDIKKQSEWQHATIVDPFV 73 (663) Q Consensus 1 ~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~----~~g~--s~~~~~~i~~~v~~~~~~l~~~~~ 73 (663) |-++ ++++.+.. +.+....++.......++.+||.|.+.-+. ..++ .+++.|-....|+..++++ + T Consensus 9 ~~~~~~~~~~~~~---i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l----~ 81 (453) T protein:vir:73 9 MTYSRDEEITDKV---VNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYF----N 81 (453) T ss_pred eeccccccCCHHH---HHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhhhh----c Confidence 3333 23333222 222333345556667788999999764221 1222 3566666666666666554 4 Q ss_pred CCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccc Q lcl|NC_021532. 74 STADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGN 153 (663) Q Consensus 74 ~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~ 153 (663) +..+ .|.+ +|+. ..+.++.++. .|+.......+.++++++|.|++.++++.+ T Consensus 82 g~~~--~~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~------------------ 133 (453) T protein:vir:73 82 GIPI--KKTH---DDKS----VLEAMQLFDN-LNDMEDEESELAKIACVYGRAYELMYQNES------------------ 133 (453) T ss_pred ccCc--eeec---CChH----HHHHHHHHHH-hcChhHHHHHHHHHHHhcCeEEEEEEeCCC------------------ Confidence 4432 3333 3332 2334555553 466767778899999999999999987631 Q ss_pred ccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcc Q lcl|NC_021532. 154 ETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYD 231 (663) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 231 (663) +.|.+..++|..+++ |+.. ......+.+.+.+. T Consensus 134 ---------------~~~~i~~~~p~~~~~v~dd~~----~~~~~~~i~~~~~~-------------------------- 168 (453) T protein:vir:73 134 ---------------TESEVIYCSPLNVFMVYDDSI----KQKPLFAVYYGFDE-------------------------- 168 (453) T ss_pred ---------------CceEEEEEcccceEEEEeCCC----CceeEEEEEEEEec-------------------------- Confidence 234566677777653 2221 11111111111110 Q ss_pred ccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHH Q lcl|NC_021532. 232 SPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAE 311 (663) Q Consensus 232 ~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 311 (663) . .....++|.. +. .+++...++........|.+.|.+|++.+ .++.+|.|.+. T Consensus 169 ---~-----------~~~~~~vyt~-----~~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~s~~~ 221 (453) T protein:vir:73 169 ---E-----------GNLSGTVYTL-----LE---TISITGKAGEVKFGESTYNVYSDLPIVEY-----NFNEERQSIFE 221 (453) T ss_pred ---C-----------ceEEEEEEeC-----Ce---EEEEEecCCceEEccceeccCCceeEEEe-----cCCCCCCcchh Confidence 0 0112233322 10 11111111111111222333466777644 34557899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeC----------CCCCccccccCccccHHH Q lcl|NC_021532. 312 MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFN----------GTANDFWHGSYNAIPSSA 381 (663) Q Consensus 312 ~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~----------~~~~~~~~~~~~~~~~~~ 381 (663) .++++++.+|...+.+.+.+...++|.+.+--..++..+......+.++... +++..+.++..+.-...+ T Consensus 222 ~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~ 301 (453) T protein:vir:73 222 PVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQT 301 (453) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHH Confidence 9999999999999999999999999877663222333333333333322211 111223444433334556 Q ss_pred HHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEE Q lcl|NC_021532. 382 FDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVI 461 (663) Q Consensus 382 ~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~i 461 (663) ...++.+...|...|++++.+.+..++ .|+.| +...............+.|.+ +++.++ .++..+... T Consensus 302 ~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~A--l~~~~~~l~~ka~~~~~~~~~-~l~~~~----~li~~~~~~---- 369 (453) T protein:vir:73 302 ENLLNRLERSIFQFTMAANISDENFGN-SSGVA--LAYKLQAMSNLALSFQRKFQS-ALNRRY----SLWSSLSTN---- 369 (453) T ss_pred HHHHHHHHHHHHHHhCCcccCcccccC-ccHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHH----HHHHHHHhc---- Confidence 777888999999999999877664433 24444 333322333333333334432 333444 444433211 Q ss_pred EEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcc Q lcl|NC_021532. 462 RVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPK 541 (663) Q Consensus 462 ri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~ 541 (663) .|. ..+ +. +..+..+-..+....+..+. +..+...++ ...+... +.... T Consensus 370 --~~~---~~~---~~-~i~v~f~~~~p~~~~~~a~~----~~k~~giis----~et~~~~--~~~~~------------ 418 (453) T protein:vir:73 370 --ASN---KDA---WK-DIEYTFTRNEPKDIKEQAET----ANILKGITS----EETALSV--ISVIP------------ 418 (453) T ss_pred --cCC---ccc---cc-cceEEeCCCCCCCHHHHHHH----HHHHhccCc----HHHHHHh--CCCCC------------ Confidence 111 011 11 12222222222222222222 222221111 1111111 11111 Q ss_pred hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 542 PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKA 594 (663) Q Consensus 542 ~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~ 594 (663) ++ +....+...++.+. ..+++. ..+.+.+.....+ T Consensus 419 -d~-~~E~~ri~~E~~~~------------~~~~~~----~~~~~~~~~~~~~ 453 (453) T protein:vir:73 419 -DV-QAEMEKIKKKKLLQ------------LSLTRT----SNLVRMKQMRGNL 453 (453) T ss_pred -CH-HHHHHHHHHHHHHH------------HHHHHh----ccCCcchhhhcCC Confidence 11 01111111110000 000000 0000000000000 No 55 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.76 E-value=1.2e-17 Score=113.37 Aligned_cols=430 Identities=12% Similarity=0.054 Sum_probs=197.0 Q ss_pred CCCcHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----cccCC--CccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAE-LL-SALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN----EQKGK--SAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~-~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~~~g~--s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |.|..++ +. ..|...+. .+......++++.+||.|++.-+ +..++ .+++.|-..-.|+...+++ T Consensus 9 ~~~p~d~~~~~~~l~~~i~----~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l---- 80 (453) T protein:vir:39 9 MTFPKDEPITNEVVTKFME----KHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYF---- 80 (453) T ss_pred eEcCCCCCCCHHHHHHHHH----HHHHHHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhhhh---- Confidence 5555422 22 22222222 24445556778888999875322 12222 3566666666677666655 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) ++..+. |.+ +|++. .+.++.++. .|+....+..+.++++++|.|++.+++|.. T Consensus 81 ~g~~~~--~~~---~d~~~----~~~l~~i~~-~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~----------------- 133 (453) T protein:vir:39 81 NGIPVK--KSH---SDKET----LSKLQEFDN-LNDMEDEESELAKMACIYGRAFELLYQNEE----------------- 133 (453) T ss_pred cccCce--ecc---CChHH----HHHHHHHHH-hcChhHHHHHHHHHHhhcCeEEEEEEecCC----------------- Confidence 443322 222 33332 234555554 466667788899999999999999987621 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) +.+.+..++|.+++ ||+... ....+.+ +.+.. T Consensus 134 ----------------g~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-r~~~~-------------------------- 167 (453) T protein:vir:39 134 ----------------TQTNVIYNTPENMFMVYDDTIK---QEPLFAV-RYGYD-------------------------- 167 (453) T ss_pred ----------------CceEEEEEcccceEEEecCCCC---CeEEEEE-EEEEe-------------------------- Confidence 33556777887775 333221 1111222 11100 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA 310 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~ 310 (663) .+.+.++|+|.. +. .+++...++..-...+.|.+.|.+|++.+ ++..+|.|.+ T Consensus 168 --------------~~~~~~~~~yt~-----~~---i~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~sd~ 220 (453) T protein:vir:39 168 --------------DDYKLYGEVYTK-----ET---TYALNGTMGFYNMTEQAPNPFDDLPVVEF-----YFNEERMSIF 220 (453) T ss_pred --------------CCeEEEEEEEeC-----Ce---EEEEEecCCceeeecccccCCCceeEEEe-----cCCCCCCcch Confidence 001234455532 11 11111122221111222333356676643 3456799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCC-----CCccccccCccccHHHHHHH Q lcl|NC_021532. 311 EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGT-----ANDFWHGSYNAIPSSAFDMI 385 (663) Q Consensus 311 ~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~-----~~~~~~~~~~~~~~~~~~~~ 385 (663) +.++++++.+|++++.+.+.+...++|.+.+--..++..+......++++.+.++ +....++..+.-...+...+ T Consensus 221 e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~ 300 (453) T protein:vir:39 221 ESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLL 300 (453) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHH Confidence 9999999999999999999999999987776433444444445566666655321 22334444443345566778 Q ss_pred HHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEec Q lcl|NC_021532. 386 SLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTN 465 (663) Q Consensus 386 ~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~ 465 (663) ..+...|..+|++++...+..++. |+.| +...............+.|.. +++.++++++.+.... | T Consensus 301 ~~l~~~I~~~s~~p~~~~~~~gn~-Sg~A--l~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~----------~ 366 (453) T protein:vir:39 301 DRLTKLIFQTTMVANISDESFGSS-SGVS--LAYKLQAMSNLALSFQRKFQS-SLNSRYKLYCELSTNV----------S 366 (453) T ss_pred HHHHHHHHHHhCCcccccccccCC-hHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcc----------C Confidence 889999999999988776644332 4444 433333333333333444432 3444444444433221 1 Q ss_pred CeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhH Q lcl|NC_021532. 466 DKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPV 545 (663) Q Consensus 466 ~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~ 545 (663) . ..+.. +..+..+-+.+....+..+ .+..+++.++. ..+... +....+....++.+. T Consensus 367 ~---~~~~~----~i~v~f~~~~p~~~~~~a~----~~~kl~g~is~----et~l~~--l~~v~D~~~E~~ri~------ 423 (453) T protein:vir:39 367 N---KEAWK----DIEYTFTRNEPKDIKEQAE----TANILMGITSQ----ETALSV--ISVIPDVQAEMEKIK------ 423 (453) T ss_pred C---ccccc----cceEEeCCCCCcCHHHHHH----HHHHHhccCCh----HHHHHh--CCCCCCHHHHHHHHH------ Confidence 1 01111 1112222222222222222 22222222221 111111 111111111111110 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 546 QEKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 546 ~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) .++..........................+ T Consensus 424 --------~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 424 --------KEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred --------HHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 000000000000000000000000000000 No 56 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.76 E-value=3.9e-16 Score=105.08 Aligned_cols=432 Identities=13% Similarity=0.052 Sum_probs=192.6 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc--------cccCCC--ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN--------EQKGKS--AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~--------~~~g~s--~~~~~~i~~~v~~~~~~l~~ 70 (663) +.++.+.|...|... .......++++.+||.|+.... +..+++ ++..|.....|+...+++ T Consensus 28 ~~~~~~~i~~~i~~~-------~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l-- 98 (481) T protein:vir:10 28 ELLKEENLRNFISRH-------QTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYL-- 98 (481) T ss_pred hhcCHHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhh-- Confidence 555555554444432 2334556788899999875321 112222 355666666666655544 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. |.+ +|.+..+ .+..++. .|+....+.++.++++++|.|++.++++.+ T Consensus 99 --~g~~~~--~~~---~d~~~~~----~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~~~~d~d--------------- 151 (481) T protein:vir:10 99 --TGNPIT--ITH---QDNQTND----KIIELND-LNDADEVNSDLALNLSIYGRAYEIVYRDFE--------------- 151 (481) T ss_pred --ccCCce--Eec---CChhHHH----HHHHHHH-hcChhHHHHHHHHHHHhcCeEEEEEEeCCC--------------- Confidence 443332 332 3333333 3444443 356666778899999999999999887521 Q ss_pred cccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.|.+..++|..+|+ |+.... .. ..+.+.+... T Consensus 152 ------------------g~~~i~~~~p~~~~~v~d~~~~~---~~-~~~i~~~~~~----------------------- 186 (481) T protein:vir:10 152 ------------------DRDTFKVLDPKSTFVVYDQTLDK---KV-VAGVRYFEKQ----------------------- 186 (481) T ss_pred ------------------CeEEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEEe----------------------- Confidence 335567788888763 332211 11 1111222110 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) +.....+..+|+|.. + ..+++...++..-...+.|.+.|.+|++.+ +++.+|.| T Consensus 187 -------------~~~~~~~~~~~~y~~-----~---~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~~ 240 (481) T protein:vir:10 187 -------------DKDKVPVQHVEVYTT-----D---KIYYIEIKGGTYHRVEEVEHYYNDVPIIEY-----LNDQFKQG 240 (481) T ss_pred -------------eCCCceEEEEEEEec-----C---eEEEEEecCCceeecccccccCCceeEEEe-----ecCCCCCC Confidence 000112344555533 1 112222223322222233333466676543 34567899 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCcchhhhccCCcceEeCC--------CCCccccccCccccH Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQTNRKKFLAGANFEFNG--------TANDFWHGSYNAIPS 379 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~~d~~~~~p~~vi~~~~--------~~~~~~~~~~~~~~~ 379 (663) .+..++++++.+|...+.+.+.+...++|.+.+.... .+..+...+..++.+.+.. ++....++..+.-.. T Consensus 241 ~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 320 (481) T protein:vir:10 241 DFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVA 320 (481) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHH Confidence 9999999999999999999999998998877663322 2333333444454443321 112233444333335 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+...+..+|++++...|..++..|+.| +...............+.|.. +++.++ .++..+..- T Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~-~l~~~~----~li~~~~~~-- 391 (481) T protein:vir:10 321 GVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGES--MKYKLFGLEQVRAIKERLFKK-GLMKRY----KLLLNNVNL-- 391 (481) T ss_pred HHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHH----HHHHHHHhc-- Confidence 56677889999999999999988775433334443 322222222222333333422 333344 444433211 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) .+.. ..+ ..++.+...-..+....+..+.+. .+...++ ...+... +....+....++.+. T Consensus 392 ----~~~~--~~~----~~~i~v~f~~~~~~~~~~~a~~~~----kl~g~is----~et~~~~--l~~i~d~~~E~~ri~ 451 (481) T protein:vir:10 392 ----TGLK--QHN----YAELTITFTPNLPKSMMESINAFN----ALSGGVS----ESTRLSL--LDFIDNPKEELEKMQ 451 (481) T ss_pred ----cCCC--ccc----cceeeEEeCCCCCcCHHHHHHHHH----HHhccCC----hHHHHHh--CCCCCCHHHHHHHHH Confidence 1100 000 001122222122222222222222 2222222 1111111 111111111111111 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKN-ARANENTIDA 582 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~-a~~q~~~~~~ 582 (663) . ++.+......+....... ........+- T Consensus 452 ~--------------E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 452 E--------------EEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred H--------------HHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 0 000000000000000000 0000000000 No 57 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.76 E-value=6.6e-17 Score=109.30 Aligned_cols=445 Identities=12% Similarity=0.060 Sum_probs=196.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc------ccCCC--ccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE------QKGKS--AIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~------~~g~s--~~~~~~i~~~v~~~~~~l~~~~ 72 (663) ..++..+++..+-..++ ......+.++.+||.|+..... ..+++ +++.|-..-.|+...+++ T Consensus 36 ~~~~~~~~i~~~i~~~~------~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl---- 105 (501) T protein:vir:96 36 LMVNNWELLKNFINHHK------LRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL---- 105 (501) T ss_pred ccCChHHHHHHHHHHHH------HHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhh---- Confidence 22222222222222222 1223456788899999754221 11222 455666555666555544 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) ++..+ .|..... ...+....+++.++. .|+....+..+.++++++|.|++.++++.. T Consensus 106 ~g~p~--~~~~~~~---~~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~ded----------------- 162 (501) T protein:vir:96 106 AGNPI--RVEYDDN---DDNSQNDDAIKRIGR-INDLDSLNRTLIRDLSQTGRAYEVIYRSEY----------------- 162 (501) T ss_pred cccCe--eEeeCCc---cchhHHHHHHHHHHH-hcCHHHHHHHHHHHHhhcCeEEEEEEEcCC----------------- Confidence 45443 3433222 223455666766664 467777788899999999999999987631 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) +.+.+..++|..+| ||+...+ ...+.++ .+..... T Consensus 163 ----------------g~~~i~~~~p~~~~~v~d~~~~~---~~~~~v~-~~~~~~~----------------------- 199 (501) T protein:vir:96 163 ----------------DETRIKRLSPLETFVIYDNSLED---NSIAAVR-YYNRGTL----------------------- 199 (501) T ss_pred ----------------CceEEEEEccceeEEEEcCCCCC---ceEEEEE-EEEeecC----------------------- Confidence 23556778888775 3332211 1112221 1100000 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA 310 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~ 310 (663) ...+.++++|.. +.+ +... .++........|.+.|.+|++.+ +++.+|.|.+ T Consensus 200 --------------~~~~~~~~vyt~-----~~i---~~~~-~~~~~~~~~~~~~~~g~vPvv~~-----~nn~~g~sd~ 251 (501) T protein:vir:96 200 --------------QSAKDVVEIYTD-----EHI---YTLD-ASDDFNEISVTTHAFGTVPITEY-----LNNIDGIGDY 251 (501) T ss_pred --------------CCcEEEEEEEcC-----CcE---EEEe-eCCCceeccccccCCCccceEEe-----cCCccCCCch Confidence 001234555532 111 1111 11111122233334477787654 3456799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc--chhhhccCCcceEeCCCC--------CccccccCccccH Q lcl|NC_021532. 311 EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ--TNRKKFLAGANFEFNGTA--------NDFWHGSYNAIPS 379 (663) Q Consensus 311 ~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~--~d~~~~~p~~vi~~~~~~--------~~~~~~~~~~~~~ 379 (663) ..++++++.+|...+.+.+.+...++|.+.+ .|.. .. .........+++.+...+ ..+.++..+.-.. T Consensus 252 e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 330 (501) T protein:vir:96 252 ETELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVS 330 (501) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHH Confidence 9999999999999999999999999887665 3332 21 112233444555554321 1233333333335 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...+..+...|..+|++++.+.|..++..|+.| +...............+.|.. +++.++++++.++...... T Consensus 331 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~~~~-- 405 (501) T protein:vir:96 331 GAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEA--LKYKLFGLDQDRVDTQSQFTK-GLKRRYRLAARIGSLVNEF-- 405 (501) T ss_pred HHHHHHHHHHHHHHHHhCCcccCcccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcccc-- Confidence 56778899999999999999988775443344444 333333333333344444533 4444555554443321110 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) ..++. .++.+..+-..+....+..+ .+..+...++ ...+... +....+....++.+. T Consensus 406 ---------~~~d~----~~i~i~f~~~~p~n~~e~ad----~~~kl~g~iS----~et~~~~--l~~v~D~~~E~~ri~ 462 (501) T protein:vir:96 406 ---------KDFDE----SLLKITFTPNLPKSLNEQVS----ILTGLGGQVS----QETALSL--SGLVESPNEELDKIN 462 (501) T ss_pred ---------ccccc----ccceEEeCCCCCcCHHHHHH----HHHHHhccCc----hHHHHHh--CCCCCCHHHHHHHHH Confidence 00010 01122222222222222222 2222222221 1211111 111111111111111 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) .+.+ +........+.............+.+....+-..| T Consensus 463 ~E~~--------------~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 463 KEMS--------------EIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHH--------------HhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 0000 00000000000000000000000000000000000 No 58 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.74 E-value=1.9e-17 Score=112.27 Aligned_cols=452 Identities=10% Similarity=0.021 Sum_probs=194.6 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcCCcc------c--cCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL-LSALKADMKAADVLKQE-QDSLISTWKAEYNGEPYGNE------Q--KGKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~-~~~l~~~~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~~------~--~g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+. .......+..+...+.. +...++++.+||.|++.-+. . +...+++.|-..-.|+...+++ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:96 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred cccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhh-- Confidence 22221111 11111222222222222 33457778899998765321 1 2223556666666666665544 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+.+ . .+|++. .+.++.++. .|+.......+.+++.++|.|+..+++|.. T Consensus 107 --~g~p~~~--~---~~~~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~i~G~a~~~vy~ded--------------- 159 (511) T protein:vir:96 107 --LGNPIQY--Q---DDDKDV----LEAIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD--------------- 159 (511) T ss_pred --ccCCcee--e---cCchHH----HHHHHHHHh-hcCHHHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 4543333 2 233332 345556554 467777778899999999999999887632 Q ss_pred cccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..++|.++|+ |.+... . ...+.+.+.+... . T Consensus 160 ------------------~~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~~----------------d---- 197 (511) T protein:vir:96 160 ------------------DETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeeec----------------c---- Confidence 235567788888774 432211 1 1222222211000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEE-----ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIV-----RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l-----~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+++|.. ++ .+.++..++..+ .....|.+.+..|++. .+++ T Consensus 198 -------------~~~~~~~~~~~iyt~-----~~---i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~-----~~nn 251 (511) T protein:vir:96 198 -------------KTDEDEVFTVDLFTS-----HG---VYRYLTSRTNGLKLTPRENGFESHSFERMPITE-----FSNN 251 (511) T ss_pred -------------ccccceEEEEEEEeC-----Cc---EEEEEecCCCcccccccccccccccCCceeeEE-----ecCC Confidence 000112333455532 11 111111221111 1111222334555543 3445 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--cCcchhhhccCCcceEeCC------------CCCcc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA--LDQTNRKKFLAGANFEFNG------------TANDF 369 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~--i~~~d~~~~~p~~vi~~~~------------~~~~~ 369 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+++ .|. .+..+......+.++.+.+ ++..+ T Consensus 252 ~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (511) T protein:vir:96 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCccCCchhhcccccccceecccccccccccccCCCCcce Confidence 67899999999999999999999999998888876654 332 2222222233333333321 12223 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+.+.|..+|++++.+.+..++..|+.| +..+............+.|.+ +++.++++++. T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~ 407 (511) T protein:vir:96 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 407 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 444444444566778899999999999999987765433334444 444444444444445555543 44445555554 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++....... ... ++. +..+..+-+.+....+..+. +..++..++. ..+... +.... T Consensus 408 ~~~~~~~~~----------~~~---d~~-~i~~~f~~~~p~n~~e~~~~----~~kl~G~iS~----et~l~~--l~~v~ 463 (511) T protein:vir:96 408 ILKNTWSID----------ANK---DFN-TVRYVYNRNLPKSLIEELKA----YIDSGGKISQ----TTLMSL--FSFFQ 463 (511) T ss_pred HHHhhcCcc----------ccc---ccc-cceEEeCCCCCCCHHHHHHH----HHHHhccCCh----HHHHHh--CCCCC Confidence 433211100 000 010 11222222222222222222 2222222221 111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) +....++.+.. ++... ....+...............+ ...+-..++.. T Consensus 464 D~~~E~~ri~~--------------E~~~~-~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 511 (511) T protein:vir:96 464 DPELEVKKIEE--------------DEKES-IKKAQKGIYKDPRDINDDEQD---DDTKDTVDKKE 511 (511) T ss_pred CHHHHHHHHHH--------------HHHHH-HHHHhhccccCCCCCCCCCCC---CcccccccccC Confidence 11111111110 00000 000000000000000000000 00000000000 No 59 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.74 E-value=1.8e-17 Score=112.36 Aligned_cols=452 Identities=10% Similarity=0.019 Sum_probs=193.0 Q ss_pred CCCcHHHHHH-HHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcCCcc------cc--CCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLS-ALKADMKAADVLKQE-QDSLISTWKAEYNGEPYGNE------QK--GKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~~~-~l~~~~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~~------~~--g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+... .....+..+-..+.. ....++++.+||.|++.-+. .. ...+++.|-.+-.|+...+++ T Consensus 29 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:10 29 YTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred ccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhh-- Confidence 2222222111 111222222222222 33456778889999765321 12 223556666666667666654 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. +. .+|++. .+.++.++. .|+.......+.+++.++|.|+..+++|.. T Consensus 107 --~g~p~~--~~---~~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~i~G~ay~~vy~ded--------------- 159 (511) T protein:vir:10 107 --LGNPIQ--YQ---DDDKDV----LEAIEAFND-LNDVESHNRSLGLDLSIYGKAYEIMIRNQD--------------- 159 (511) T ss_pred --cccCce--ee---cCchHH----HHHHHHHHh-hcCHHHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 443333 32 233332 244555554 466666778899999999999998887521 Q ss_pred cccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..++|.++|+ |++... . ...+.+.+.+... . T Consensus 160 ------------------g~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~~----------------d---- 197 (511) T protein:vir:10 160 ------------------DETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeeec----------------c---- Confidence 235567788887763 332211 1 1222222211000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEE-----ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIV-----RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l-----~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+|+|.. ++ .+.++..++..+ .....|.+.+..|++ +-+++ T Consensus 198 -------------~~~~~~~~~~~iyt~-----~~---i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-----~f~nn 251 (511) T protein:vir:10 198 -------------KTDEDEVFTVDLFTS-----HG---VYRYLTSRTNGLKLTPRENGFESHSFERMPIT-----EFSNN 251 (511) T ss_pred -------------cCccceEEEEEEEeC-----Cc---EEEEEecCCCcccccccccccccccCcceeEE-----EecCC Confidence 000112334455532 11 111112221111 111222233445554 34445 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--cCcchhhhccCCcceEeCCC------------CCcc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA--LDQTNRKKFLAGANFEFNGT------------ANDF 369 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~--i~~~d~~~~~p~~vi~~~~~------------~~~~ 369 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+++ .|. .+..+......+.++.+.+. +... T Consensus 252 ~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 330 (511) T protein:vir:10 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-eccccCCchhhccchhccceecccccccccccccCCCCcce Confidence 67899999999999999999999999998888876654 332 22222223333444433221 1223 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+...|..+|++++.+.+..++..|+.| +..+...........-+.|.+ +++.+++++.. T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~-~l~~~~~li~~ 407 (511) T protein:vir:10 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 407 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 444434334566778899999999999999987764333334444 444433444444444444533 34444455444 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++....... .+.++. ++.+..+-..+....+..+.+. .+...++ ...+... +.... T Consensus 408 ~~~~~~~~~-------------~~~d~~-~i~i~f~~~~p~d~~~~~~~~~----kl~G~iS----~et~~~~--l~~v~ 463 (511) T protein:vir:10 408 ILKNTRSID-------------ANKDFN-TVRYVYNRNLPKSLIEELKAYI----DSGGKIS----QTTLMSL--FSFFQ 463 (511) T ss_pred HHHhhCCcc-------------cccccc-eeeEEeCCCCCcCHHHHHHHHH----HHhccCc----HHHHHHh--CCCCC Confidence 433211000 000110 1122222222222222222222 2222221 1111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) +....++. +..++... +..............-........+-..++.. T Consensus 464 d~~~E~~r--------------i~~E~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 464 DPELEVKK--------------IEEDEKES----IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CHHHHHHH--------------HHHHHHHH----HHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 11111111 11000000 00000000000000000000000000000000 No 60 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.74 E-value=7.9e-17 Score=108.85 Aligned_cols=416 Identities=10% Similarity=-0.015 Sum_probs=185.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc------ccCCC--ccccHHHHHHHHHHHHHHHHhhcCCCceEEEE Q lcl|NC_021532. 11 ALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE------QKGKS--AIVSRDIKKQSEWQHATIVDPFVSTADIIKCT 82 (663) Q Consensus 11 ~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~------~~g~s--~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~ 82 (663) +|... .....+.++++.+||.|++.-.. ..+++ +++.|-....|+...+. +++..+.+ . T Consensus 1 ~~~~~-------~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~----l~g~~~~~--~ 67 (440) T protein:vir:95 1 MLAAF-------LGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGY----VIGNPVSI--G 67 (440) T ss_pred ChhhH-------HHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhh----eeccCceE--e Confidence 22211 22234557778889999765321 12232 45566666666655554 45655443 3 Q ss_pred eCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccccc Q lcl|NC_021532. 83 PITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVT 162 (663) Q Consensus 83 p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (663) ....++.+..+ .+..++ ..|+.......+.++++++|.|++.+++|. T Consensus 68 ~~~~~~~~~~~----~l~~~~-~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~---------------------------- 114 (440) T protein:vir:95 68 VMEGGSADQLS----TIKDIE-WQNDINALNSDLAFDASVYGRAYEYHFRDK---------------------------- 114 (440) T ss_pred eCCCccHHHHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCeEEEEEEecC---------------------------- Confidence 33333333332 233434 346666677789999999999999988752 Q ss_pred cceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccc Q lcl|NC_021532. 163 ETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQF 240 (663) Q Consensus 163 ~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (663) .+.|.+..++|.++++ |+...+ ...+.+ +.+.. .+ T Consensus 115 -----~~~~~i~~~~p~~~~~~~d~~~~~---~~~~~i-~~~~~----------~~------------------------ 151 (440) T protein:vir:95 115 -----DKVDRVVLISPLEMFVIRDLTVEQ---NIIAAV-HLPIY----------AD------------------------ 151 (440) T ss_pred -----CCceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEe----------cC------------------------ Confidence 1345677788888874 443211 112222 22211 00 Q ss_pred cccccceEEEEEEEEEeeecCCceeEEEEEEEEC-CEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHH Q lcl|NC_021532. 241 SDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN-DVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKV 319 (663) Q Consensus 241 ~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g-~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~ 319 (663) ...+++|.. +++..+ .+.-.+ +........|.+.|.+|++.+ +++.+|.|.++.++++++. T Consensus 152 -------~~~~~vyt~-----~~~~~~-~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~sd~e~v~~lida 213 (440) T protein:vir:95 152 -------KVNMTVYTK-----DKVITY-KPYSNNSVRLVVDDVKKHSYNDVPVVEW-----WNNRFRMGDYESEISLIDA 213 (440) T ss_pred -------ceEEEEEeC-----CeEEEE-EEecCCccceeecceeeccCceeeEEEe-----eCCCCCCCchhhhHHHHHH Confidence 001122211 111000 000000 011111122223355666543 4456799999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCcEEeecccc-----CcchhhhccCCcceEeCC--------CCCccccccCccccHHHHHHHH Q lcl|NC_021532. 320 KTAVIRGIIDNMAQSNNGQVAIRKGAL-----DQTNRKKFLAGANFEFNG--------TANDFWHGSYNAIPSSAFDMIS 386 (663) Q Consensus 320 ~N~~~~~~~~~~~~~~~~~~~~~~~~i-----~~~d~~~~~p~~vi~~~~--------~~~~~~~~~~~~~~~~~~~~~~ 386 (663) +|..++.+.+++...++|.+++ .|.. +.+........+.+.+.. ++..+.++..+.-...+...++ T Consensus 214 ~~~~~s~~~~~~~~~~~~~~v~-~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~ 292 (440) T protein:vir:95 214 YDAGQSDTANYMSDLNDAMLLV-KGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKN 292 (440) T ss_pred HHHHHHHHHHHHHHhhcceeee-ecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHH Confidence 9999999999999999887654 3321 222222333333333221 1122444444433455677889 Q ss_pred HHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC Q lcl|NC_021532. 387 LMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND 466 (663) Q Consensus 387 ~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~ 466 (663) .+...|...|++++...+.-++..|+.| +..+..............|.+ +++.+++++..++.... +. T Consensus 293 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~-~l~~~~~li~~~~~~~~---------~~ 360 (440) T protein:vir:95 293 RLANDIHRFSRIPNLDDDRFNSTSSGIA--LLYKMIGLEQVRKDKETYFTK-ALRRRYELISNIHKAIN---------GP 360 (440) T ss_pred HHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcC---------Cc Confidence 9999999999999987765433344444 444433333344444445533 34444444443332111 00 Q ss_pred eeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHH Q lcl|NC_021532. 467 KFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQ 546 (663) Q Consensus 467 ~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~ 546 (663) .++. .++.+...-..+....+..+. +..++..++ ...+... +... +.. + T Consensus 361 ---~~~~----~~v~i~f~~~~p~~~~~~ad~----~~kl~g~iS----~et~~~~--l~~~-d~~-------------~ 409 (440) T protein:vir:95 361 ---VIEA----NKLTFTFHPNIPQDVWTEIKA----YIEAGGEIS----QETLMEN--ASFT-DYK-------------T 409 (440) T ss_pred ---cccc----ccceEEeCCCCCCCHHHHHHH----HHHHhccCc----HHHHHHh--CCCC-CcH-------------H Confidence 0110 112222222222222222222 222222222 1111111 1110 000 0 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 547 EKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 547 ~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) +..+...++.. ...+....-........+.+ T Consensus 410 -E~~ri~~E~~~-----~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 410 -EHSRILKQGGS-----SDLEIGQIVGDADVGQADTE 440 (440) T ss_pred -HHHHHHHHHHH-----hhhhHHhhccCCCCCCcCCC Confidence 00000000000 00000000000000000000 No 61 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.74 E-value=3.5e-17 Score=110.82 Aligned_cols=451 Identities=10% Similarity=0.020 Sum_probs=195.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcCCcc------ccCC--CccccHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQ-EQDSLISTWKAEYNGEPYGNE------QKGK--SAIVSRDIKKQSEWQHATIVDP 71 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~------~~g~--s~~~~~~i~~~v~~~~~~l~~~ 71 (663) |.-.+.++.... ..+..+...+. .+...+.++.+||.|++.-+. ..++ .+++.|-..-.|+...+++ T Consensus 31 ~~~~e~~~~~~~-~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl--- 106 (512) T protein:vir:97 31 YDGTESDLLQNI-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF--- 106 (512) T ss_pred cCchhhhhhhhH-HHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhh--- Confidence 322232222211 22222222222 233456778889998765321 1122 3456666666666665554 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) ++..+. |. .+|++ ..+.++.++. .|+.......+.++++++|.|++.++++.. T Consensus 107 -~g~p~~--~~---~~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~i~G~ay~~vy~ded---------------- 159 (512) T protein:vir:97 107 -LGNPIQ--CQ---DDDKD----VLEAIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD---------------- 159 (512) T ss_pred -cccCce--ec---cCChH----HHHHHHHHHh-hcCHHHHHHHHHHHHHhcCeEEEEEEeCCC---------------- Confidence 443322 32 23333 2345566554 466767778899999999999999887532 Q ss_pred ccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) +.+.+..++|.++| ||++... .. ..+.+.+.+.. ++ + T Consensus 160 -----------------~~~~i~~~~p~~~~~iyd~~~~~---~~-~~~vr~~~~~~------~~----------~---- 198 (512) T protein:vir:97 160 -----------------DETRLYKSDAMSTFVIYDNTIER---NS-IAGVRYLRTKP------ID----------K---- 198 (512) T ss_pred -----------------CceEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEeee------cc----------c---- Confidence 23556778888886 4443221 11 22222221100 00 0 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEE-----ecccCCCcCCCCCEEEEeeeeecCcc Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIV-----RLQSNPYPDGKPPFLVVPFNSIPFKL 304 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l-----~~~~~p~~~~~~Pf~~~~~~~~~~~~ 304 (663) .....+..+|+|.. +++ ++++..++... ...+.|.+.|..|++.+ .++. T Consensus 199 -------------~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~nn~ 252 (512) T protein:vir:97 199 -------------TDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNE 252 (512) T ss_pred -------------cccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccccCcccceEee-----cCCC Confidence 00112334455532 111 11111111110 11122333455666543 3456 Q ss_pred cCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc--CcchhhhccCCcceEeCC-------------CCCcc Q lcl|NC_021532. 305 HGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL--DQTNRKKFLAGANFEFNG-------------TANDF 369 (663) Q Consensus 305 ~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i--~~~d~~~~~p~~vi~~~~-------------~~~~~ 369 (663) +|.|.++.++++++.+|...|.+.+.+...++|.+++ .|.. +..+......+.++...+ ++... T Consensus 253 ~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 331 (512) T protein:vir:97 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDG 331 (512) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcce Confidence 7899999999999999999999999999888887654 3322 222222223333332211 12224 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+...|..+|++++.+.|..++..|+.| +...............+.|.+ +++.++++++. T Consensus 332 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~A--l~~~~~~l~~ka~~k~~~f~~-~l~~~~~li~~ 408 (512) T protein:vir:97 332 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 408 (512) T ss_pred EEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 444444334556778899999999999999988775433334444 443333444444444445533 34445555544 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++....... .. .++. ++.+..+-+.+....+..+.+. .+...++ ...+... +.... T Consensus 409 ~~~~~~~~~----------~~---~d~~-~i~~~f~~~~p~~~~e~~~~~~----kl~giiS----~et~~~~--l~~v~ 464 (512) T protein:vir:97 409 ILKNTRSID----------AN---KDFN-TVRYVYNRNLPKSLIEELKAYI----DSGGKIS----QTTLMSL--FSFFQ 464 (512) T ss_pred HHHhcCCcc----------cc---cccc-cceEEeCCCCCcCHHHHHHHHH----HHhccCc----hHHHHHh--CCCCC Confidence 433211100 00 0111 1222222222222222222222 2222222 1211111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) +....++ ++..++... +...+. .........-........+-...+.+ T Consensus 465 d~~~E~e--------------ri~~E~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 465 DPELEVK--------------KIEEDEKES-IKKAQK---GIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CHHHHHH--------------HHHHHHHHH-HHHHhh---cccCCCCCCCCCCCCCCccccccccC Confidence 1111111 111000000 000000 00000000000000000000000000 No 62 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.74 E-value=1.2e-15 Score=102.43 Aligned_cols=429 Identities=13% Similarity=0.042 Sum_probs=188.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC---ccccCC--CccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG---NEQKGK--SAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~---~~~~g~--s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) .++..++|...|.... ......+++..+||.|++.- ....++ .+++.|-....|+..++++ ++. T Consensus 23 ~~~~~~~i~~~i~~~~-------~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l----~g~ 91 (470) T protein:vir:99 23 EKLTSNELLGFIAYNE-------TVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYVVDVYNGYF----CGI 91 (470) T ss_pred CCcCHHHHHHHHHHHH-------HhhHHHHHHHHHHhccccccccCcccccCCcceeecchHHHHHHHHhhhh----ccC Confidence 4455554444443321 22234467778899997532 122222 2455566555666555544 454 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ...+.+ .+|.+..+ .+..++. .|+....+..++++++++|.+++.++++.+ T Consensus 92 p~~~~~----~~d~~~~~----~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d-------------------- 142 (470) T protein:vir:99 92 EPKLAL----LNDSSKID----EIARWNR-QENFFDTINEISKQCDIFGRSIASIYQGED-------------------- 142 (470) T ss_pred CeeEee----CCchhHHH----HHHHHHH-hcCHhHHHHHHHHHHHhcCeeEEEEEeCCC-------------------- Confidence 433322 23333222 2333343 466777778899999999999999887521 Q ss_pred ccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccc Q lcl|NC_021532. 156 VVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSP 233 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~ 233 (663) +.|.+..++|..+|+ |+.... ...+ +.+.+.... . T Consensus 143 -------------g~~~i~~~~p~~~~~i~d~~~~~---~~~~-~vr~~~~~~-------------------~------- 179 (470) T protein:vir:99 143 -------------ARPHLMYSSPNHAFIIYDDTVQR---QPLA-FVHYQIDNS-------------------N------- 179 (470) T ss_pred -------------CeEEEEEEccceeEEEEcCCCCc---ceEE-EEEEEEEec-------------------C------- Confidence 235567788888753 332111 1111 112221100 0 Q ss_pred ccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC-C-EEEecccCCCcCCCCCEEEEeeeeecCcccCCChHH Q lcl|NC_021532. 234 DDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN-D-VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAE 311 (663) Q Consensus 234 ~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g-~-~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 311 (663) ......+++|.. + ..+.....+ + ........|.+.|.+|++.+ .++.+|.|.+. T Consensus 180 -----------~~~~~~~~~~~~-----~---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~sd~e 235 (470) T protein:vir:99 180 -----------NWTDAYGVIQYA-----D---KFYKFKGYDIEEDTNAAGYAINPYGLVPAVEF-----FENEERQGIFD 235 (470) T ss_pred -----------CeeEEEEEEEec-----C---eEEEEEecccccccccccccccCCCccceEee-----cCCCCCCcchH Confidence 000111122211 0 001000001 0 01111122333356676643 44567999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcch----hhhccCCcceEeCCC----CCccccccCccccHHHHH Q lcl|NC_021532. 312 MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTN----RKKFLAGANFEFNGT----ANDFWHGSYNAIPSSAFD 383 (663) Q Consensus 312 ~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d----~~~~~p~~vi~~~~~----~~~~~~~~~~~~~~~~~~ 383 (663) .++++++.+|...+.+.+.+...++|.+.+.-......+ ......++++.+.+. +..+.++..+.....+.. T Consensus 236 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 315 (470) T protein:vir:99 236 SIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQEN 315 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHH Confidence 999999999999999999999999987776433332221 112233444544321 223455554444455667 Q ss_pred HHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEE Q lcl|NC_021532. 384 MISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRV 463 (663) Q Consensus 384 ~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri 463 (663) .++.+...|-..||+++.+.+..++..|+.| +..............-+.|.. +++.++++++.++....... T Consensus 316 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--i~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~~~----- 387 (470) T protein:vir:99 316 LIQHLTDFIFMMAMVPNIQDKNFAGNSSGVA--LQYKLFAMKNKADSKERKFDK-SLMQLYRIVLATLFNNKQDQ----- 387 (470) T ss_pred HHHHHHHHHHHHhCCccccccccccCchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCcc----- Confidence 8899999999999999887665433334444 333333333333334444432 34444444444332211100 Q ss_pred ecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchh Q lcl|NC_021532. 464 TNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPD 543 (663) Q Consensus 464 ~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~ 543 (663) .+ + .++.+..+-..+....+..+.+. .+...++ ...+... +... +....++ T Consensus 388 -------~~---~-~~i~v~f~~~~p~~~~e~a~~~~----kl~giis----~et~l~~--l~~v-d~~~E~e------- 438 (470) T protein:vir:99 388 -------EL---W-SELDFKFTRNLPEDMASAIDNAK----NAEGIVS----KKTQLGM--IPDI-EPDAEMK------- 438 (470) T ss_pred -------cc---c-ccceEEeCCCCCcCHHHHHHHHH----HHhccCC----HHHHHHh--CCCC-CHHHHHH------- Confidence 00 0 11222222222222222222222 2222222 1111111 1111 1000011 Q ss_pred hHHHHhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 544 PVQEKIRQLELENLML--ENQMLVASINDKNARANENTIDAELK 585 (663) Q Consensus 544 ~~~~q~~q~~~~~~q~--~~~~~~a~~~~~~a~~q~~~~~~~~~ 585 (663) ++..++... ...+............ ..+-+ T Consensus 439 -------ri~~E~~~~~~~~~~~~~~~d~~~~d~-----~~ee~ 470 (470) T protein:vir:99 439 -------QIAKEKADAIKQTQQLSMPIDILKRDN-----NAEEE 470 (470) T ss_pred -------HHHHHHHHHHHHHHhhcCCCCcCCCCC-----CccCC Confidence 111000000 0000000000000000 00000 No 63 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.73 E-value=4.2e-17 Score=110.38 Aligned_cols=433 Identities=10% Similarity=0.039 Sum_probs=196.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-------------ccCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-------------QKGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-------------~~g~s~~~~~~i~~~v~~~~~~ 67 (663) +..+.+.+-..|.+.++. +.++...+.+..+||.|++.-+. .+...+++.|..+..|+..+++ T Consensus 29 ~~~~~e~~~~~i~~~i~~----~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~ 104 (483) T protein:vir:12 29 TNNKPETLEEMIVRYIKQ----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 104 (483) T ss_pred cCCchhhHHHHHHHHHHH----HHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhh Confidence 555555555555444443 44455667888899999753211 1111346667777777766665 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+. +. .+|.+..+ .++.++. ++.......+.++++++|.|++.+++|.+ T Consensus 105 l----~G~p~~--~~---~~d~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~y~~v~~d~d------------ 157 (483) T protein:vir:12 105 I----VGKPIA--FK---HTDDEVVK----RIDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLDEE------------ 157 (483) T ss_pred h----cccCce--ec---cCChHHHH----HHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEEEEcCC------------ Confidence 5 443322 32 24444333 3444443 45556677788999999999999987632 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.|.+.+++|.++|+ |++..+. ..+.+ +.+..... T Consensus 158 ---------------------~~~~i~~~~p~~~~~v~d~~~~~~---~~~~i-r~~~~~~~------------------ 194 (483) T protein:vir:12 158 ---------------------GEFKLFRVPAEQGIPIWTDKEHEE---LEAFI-RMYKLENE------------------ 194 (483) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCCCc---eEEEE-EEEEeecc------------------ Confidence 335677889988764 4433222 12222 22211000 Q ss_pred hhhhccccccccccccccccceEEEEE---EEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYE---YWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E---~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ..+++|. +++. ..++.+. ........+...++.. |.+.|..|++.+ ++ T Consensus 195 --------------------~~~~~y~~~~v~~~-~~~~~~~-~~~~~~~~~~~~~~~~--~~~~g~vPvv~~-----~n 245 (483) T protein:vir:12 195 --------------------TKVEYWDKVTVNYY-VYENGSL-IPDYSNNLENSKTHFS--TGSWGKIPFIPF-----KN 245 (483) T ss_pred --------------------eEEEEEecCeEEEE-EEeCCee-eecccccccccccccc--cCCCCccceEEe-----cC Confidence 0011111 0000 0011100 0000000011112222 223355666533 44 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc-ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ-TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~-~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +.+|.|.+..++++++.+|...|.+.+.+...++|.+.+ .|.- +. .+ ......++++.+..+++ ..++..+.-.. T Consensus 246 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~ 323 (483) T protein:vir:12 246 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLRYYGAIKVSDNGG-VDTIQVEVPVE 323 (483) T ss_pred CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcccchhHHHhhhhccccccCCCCc-ceEEeecCCHH Confidence 567899999999999999999999999999999987655 3432 11 11 12234556666655443 45554444445 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+.+.|...|++++.+.+..++..|+.| +..+............+.|.. ++ +.++.++..+.... T Consensus 324 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~f~~-~l----~~~~~li~~~~~~~- 395 (483) T protein:vir:12 324 NSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKV-AI----QELLWFVFEHFDIK- 395 (483) T ss_pred HHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH--HHHHHHHHHHHHHHHHHHHHH-HH----HHHHHHHHHHhcCC- Confidence 67778899999999999999887664433334444 433333333333334444432 33 34444444443211 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) + ++ .++++..+-..+....+..+. ++.+.+.+ ........ +....+....++.+ T Consensus 396 -----~-~~---------~~i~v~f~~~~p~~~~~~a~~----~~kl~Gii----S~et~~~~--~~~v~d~~~E~~ri- 449 (483) T protein:vir:12 396 -----G-EH---------KDVDISFNYNKVANTELQVQT----AQQSMGIV----SHETVLEN--HPFVEDLQAELERI- 449 (483) T ss_pred -----C-cc---------ceeeEEeCCCCCCCHHHHHHH----HHHHhccC----chHHHHHh--CCCCCCHHHHHHHH- Confidence 1 11 112222222222222222222 22222211 11111111 11111111111110 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRS-KAAVEKA 594 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~-~~~~e~~ 594 (663) ..++.+. ...+. ..... ..+...+.. .-+.++. T Consensus 450 -------------~~E~~~~-~~~~~-~~~~~-------~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 450 -------------EQEQMEY-NKQLP-NLDDG-------GADGAQQQERSNNKESE 483 (483) T ss_pred -------------HHHHHHH-Hhhcc-ccccc-------ccCCcccCCCCCcccCC Confidence 0000000 00000 00000 000000000 0000000 No 64 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.73 E-value=5.7e-17 Score=109.64 Aligned_cols=432 Identities=13% Similarity=0.075 Sum_probs=196.6 Q ss_pred CCCcHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcCCcCCcc-----ccCCC----ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAA-DVLKQEQDSLISTWKAEYNGEPYGNE-----QKGKS----AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~g~s----~~~~~~i~~~v~~~~~~l~~ 70 (663) |-+. +.++...++. .....++......|++||.|.+.... ..|+. .++.|. ...++..+.. T Consensus 16 ~~~~-----~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~----~k~i~~~~a~ 86 (496) T protein:vir:38 16 MGLL-----KALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNL----PKVTAKYMSK 86 (496) T ss_pred hccc-----hhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecch----HHHHHHHHhh Confidence 2211 1111111111 11234455667788999998653211 11211 122333 3334444445 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) .+++..+.+.+ +|.+.++ +|+.++. .+++...+.+++.+++.+|.||++++||.. T Consensus 87 ~l~~~p~~i~~-----~d~~~~e----~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~--------------- 141 (496) T protein:vir:38 87 LLFNEKVKINI-----DDKAAEE----FVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGN--------------- 141 (496) T ss_pred hhhCCcceEee-----CChHHHH----HHHHHHh-ccCHHHHHHHHHHHHhhhCcEEEEEEEcCC--------------- Confidence 55654444433 4444444 4555553 467778888999999999999999999731 Q ss_pred cccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) +.|.+.+|+|..||+=..-...+..+-|+ ..+ +.+ T Consensus 142 ------------------~~~~i~~v~~~~~~P~~~~~~~~~~~~f~--~~~-~~~------------------------ 176 (496) T protein:vir:38 142 ------------------KNVKVSFATADCMYPLSNDSENVDECVIA--NSF-HKN------------------------ 176 (496) T ss_pred ------------------CcEEEEEEcccceEEEEecCCcEEEEEEE--EEE-EeC------------------------ Confidence 34678889999988422111233333222 111 100 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEE-------CCEE--------EecccCCCcCCCCCEEEE Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWI-------NDVI--------VRLQSNPYPDGKPPFLVV 295 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~-------g~~~--------l~~~~~p~~~~~~Pf~~~ 295 (663) ...+..+|+|...+ +.+..+ ..+|. |..+ +..........++||+.+ T Consensus 177 --------------~~~y~~le~h~~~~--~~~~I~--~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~ 238 (496) T protein:vir:38 177 --------------NKYYTLLEWNEWQG--DVYTVT--TELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYI 238 (496) T ss_pred --------------CeEEEEEEEEEEeC--ceEEEE--EEEEecCCccccCccccccccccccccceeecCCCcceEEEe Confidence 01122333332211 001000 00010 0000 000000011245667665 Q ss_pred ee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchh------hhccC-CcceEe-C Q lcl|NC_021532. 296 PF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNR------KKFLA-GANFEF-N 363 (663) Q Consensus 296 ~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~------~~~~p-~~vi~~-~ 363 (663) +. ....++.+|.|.+.+++++++.+|..++.+.+.+.. +.+++.++.+.+..... ..+.+ ..++.. . T Consensus 239 ~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~ 317 (496) T protein:vir:38 239 KPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQ 317 (496) T ss_pred cCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCccccCCCCccceEEEee Confidence 43 224567899999999999999999999999988875 57678887766532111 01111 111111 1 Q ss_pred C--CC--CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 364 G--TA--NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENL 439 (663) Q Consensus 364 ~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~ 439 (663) . ++ ..+....+.-........++.+...+...+|++....|...++ ..||+++.......-.......+.| +.. T Consensus 318 ~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g-~~tAtei~~~~~~l~~~~~~~~~~~-~~~ 395 (496) T protein:vir:38 318 GDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLI-EQG 395 (496) T ss_pred cCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccc-cchHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 1 11 1233222222224456778888888888999999999876544 3467777655444444444555556 446 Q ss_pred HHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHH Q lcl|NC_021532. 440 VKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIM 519 (663) Q Consensus 440 ~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l 519 (663) ++.+++.++.+..-+..- .|. .+.. ....+...-+......+..+.+..+.+ ++.++.. .. T Consensus 396 l~~l~~~il~~~~~~~~~------~g~---~~~~----~~i~v~f~d~i~~d~~~~~~~~~~~~~--~GiiS~e----t~ 456 (496) T protein:vir:38 396 IKEMIVSILEVGKFIEAY------SGE---VVEL----DTITVDFDDSIAQDEDTTINRYTNAKN--QGMIPLK----IA 456 (496) T ss_pred HHHHHHHHHHHHHHHHhh------cCC---CCCc----cceEEEeCCCCCCCHHHHHHHHHHHHh--cCCCCHH----HH Confidence 777777777766533210 000 0000 111222222222222222222222211 1222211 11 Q ss_pred HHHHHhhhhh--hhhhhhhhhhcchh---hHHH---HhhHHH Q lcl|NC_021532. 520 ADIMDLMRMP--EQAKRMREYEPKPD---PVQE---KIRQLE 553 (663) Q Consensus 520 ~~~~~l~~~~--e~~~~l~~~~~~~~---~~~~---q~~q~~ 553 (663) +..+.+.. +..+.++.+..+.. +... .....+ T Consensus 457 --l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 457 --LQRAWNITEAEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred --HHhcCCCChHHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 01111111 11111111110000 0000 000000 No 65 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.73 E-value=2.6e-16 Score=106.06 Aligned_cols=442 Identities=13% Similarity=0.063 Sum_probs=195.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcCCccc------cCCC--ccccHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQE-QDSLISTWKAEYNGEPYGNEQ------KGKS--AIVSRDIKKQSEWQHATIVDP 71 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~~~------~g~s--~~~~~~i~~~v~~~~~~l~~~ 71 (663) +-.+-+. |.+.+. .+.. ....++++.+||.|+...... ++++ +++.|-.+-.|+...+++ T Consensus 37 ~~~~~~~----l~~~i~----~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl--- 105 (501) T protein:vir:27 37 MVNNWEL----LKNFIN----HHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYL--- 105 (501) T ss_pred ccccHHH----HHHHHH----HHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhh--- Confidence 2222222 222222 2222 234577888999997533211 2222 455555555555555544 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) +++.+.+.+ +|....+....+++.++. .|+....+..+.+++.++|.|++.++++.+ T Consensus 106 -~g~p~~~~~-----~d~~~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~vy~ded---------------- 162 (501) T protein:vir:27 106 -AGNPIRVEY-----DDNDNNSQNDDTIKRIGR-INDIDSHNRTLIRDLSQTGRAYEVIYRNEY---------------- 162 (501) T ss_pred -cccCeeEec-----CCccchHHHHHHHHHHHH-hcChhHHHHHHHHHHhhCCeEEEEEEeCCC---------------- Confidence 554433322 222333445566666554 467777788899999999999999987632 Q ss_pred ccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) +.|.+..++|.++|+ |+...+ +..+.++ .+..... T Consensus 163 -----------------~~~~i~~~~p~~~~~v~d~~~~~---~~~~~ir-~~~~~~~---------------------- 199 (501) T protein:vir:27 163 -----------------DETRIKRLNPLETFVIYDNSLED---NSIAAVR-YYNRGTL---------------------- 199 (501) T ss_pred -----------------CceEEEEEccceeEEEecCCCCC---ceEEEEE-EEEeeec---------------------- Confidence 235567788888763 443221 1222221 2111000 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCCh Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEAN 309 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~ 309 (663) ...+.++++|.. +. .+++...++. ......|.+.|.+|++.+ +++.+|.|. T Consensus 200 ---------------~~~~~~~~vyt~-----~~---v~~~~~~~~~-~~~~~~~~~~g~vPvv~~-----~nn~~g~sd 250 (501) T protein:vir:27 200 ---------------QNAKDVVEIYTN-----EH---IYTLDASDDF-NEISVTTHAFGTVPITEF-----LNNVDGIGD 250 (501) T ss_pred ---------------CCcEEEEEEEeC-----Ce---EEEEEeCCce-eeccccccCCCcccEEEe-----cCCCCCCCc Confidence 001334555533 11 1111112221 122233334467777644 345679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Ccc--hhhhccCCcceEeCCCC--------CccccccCcccc Q lcl|NC_021532. 310 AEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQT--NRKKFLAGANFEFNGTA--------NDFWHGSYNAIP 378 (663) Q Consensus 310 ~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~--d~~~~~p~~vi~~~~~~--------~~~~~~~~~~~~ 378 (663) +..++++++.+|...+.+.+.+...++|.+.+ .|.. +.. ........+.+.+..++ ..+.++..+.-. T Consensus 251 ~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 329 (501) T protein:vir:27 251 YETELYLIDLYDSAESDTANHMSDMADAILAI-YGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV 329 (501) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCH Confidence 99999999999999999999999888877665 3332 211 11222333444443221 123344444434 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) ..+...++.+...|..+|++++.+.|..++..|+.| +...............+.|.+ +++.++++++.++..... T Consensus 330 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~-~l~~~~~li~~~~~~~~~-- 404 (501) T protein:vir:27 330 SGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEA--LKYKLFGLDQDRVDTQSQFTQ-GLKRRYRLAARIGSLVNE-- 404 (501) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccc-- Confidence 556778899999999999999887775433334444 333322333333334444533 344444544443322110 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) ...++.. .+.+..+-..+....+..+ .+..+++.++ ...+... +....+....++.+ T Consensus 405 ---------~~~~d~~----~i~v~f~~~~p~n~~e~ad----~~~kl~g~iS----~et~l~~--l~~v~D~~~E~eri 461 (501) T protein:vir:27 405 ---------FKDFDES----LLKITFTPNLPKSLNEQVS----ILTGLGGQVS----QETALSL--SGLVESPNEELDKI 461 (501) T ss_pred ---------ccccccc----cceEEeCCCCCcCHHHHHH----HHHHHhccCc----HHHHHHh--CCCCCCHHHHHHHH Confidence 0011100 1112222122222222222 2222222222 1221111 11111111111111 Q ss_pred hcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_021532. 539 EPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTI-DAELKRSKAAVE 592 (663) Q Consensus 539 ~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~-~~~~~~~~~~~e 592 (663) . .++.+......+............... ...... +-+.| T Consensus 462 ~--------------~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~-e~~~~ 501 (501) T protein:vir:27 462 N--------------KEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDF-ERAYE 501 (501) T ss_pred H--------------HHHHhhhHhhhcCccccccccccCCCCCCccccc-cccCC Confidence 1 111000000000000000000000000 000000 00000 No 66 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.73 E-value=6.2e-16 Score=103.95 Aligned_cols=434 Identities=9% Similarity=0.057 Sum_probs=195.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc-----------cCC--CccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ-----------KGK--SAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-----------~g~--s~~~~~~i~~~v~~~~~~ 67 (663) |+.+.+.....|...++. +..+.....++.+||.|++.-+.. ..+ .+++.|-..-.|+..++ T Consensus 20 ~~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~- 94 (474) T protein:vir:96 20 IKPKYETQEEMIIRLIND----HKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVA- 94 (474) T ss_pred hhhccCChHHHHHHHHHH----HHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhh- Confidence 333333333444444433 334556677888999998631110 111 13444544444554444 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) .+++..+.+ . .+|++..+.+ +.++. ++.......+.+++.++|.|++.+++|.. T Consensus 95 ---~l~g~p~~~--~---~~d~~~~~~l----~~~~~--n~~~~~~~~~~~~~~~~G~~~~~~y~d~~------------ 148 (474) T protein:vir:96 95 ---YAVANPVTF--S---SDDDKSLKTI----QEVLN--HKWDDKLVDILTAASNKGIEWLQPYIDEN------------ 148 (474) T ss_pred ---hhcccCcee--e---cCchHHHHHH----HHHHh--cCHHHHHHHHHHHHHhcCeeEEEEEecCC------------ Confidence 445554333 2 2444444443 34332 45555667788999999999999987632 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.+.+..++|..+| +|++.. .+..++ .+.+..... .. T Consensus 149 ---------------------~~~~i~~~~p~~~~~v~d~~~~---~~~~~~-vr~~~~~~~----------~~------ 187 (474) T protein:vir:96 149 ---------------------GEFKTFRVPAEQAIPIWTNKER---DTLKAF-IRYYRLDGA----------ER------ 187 (474) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCC---CceEEE-EEEEeecCc----------eE------ Confidence 23566778888887 333322 222222 222211000 00 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEeeecCCceeEE-EE-E-EEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEP-IV-C-AWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~-~~-~-~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) . .. +. ..+|. +|.. .+.+.... .. . ...+... ....|...|.+|++.++ + T Consensus 188 --~--~~-------yt---~~~v~---~~~~---~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~iPvv~~~-----n 240 (474) T protein:vir:96 188 --V--EY-------WT---DSDVT---YYEY---QDGILIPDYYHGEEHIQSHYY--VGNKRVSWGRVPFIPFK-----N 240 (474) T ss_pred --E--EE-------Ee---CCeEE---EEEe---cCCceeecccccccccccccc--ccccccCCCceeEEEec-----c Confidence 0 00 00 00111 1111 00000000 00 0 0000111 12234445677776554 4 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc-ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQ-TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~-~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +.+|.|.+..++++++.+|.+.|.+.+.+...++|.+.+ .|. ... .+ ......++++.+.+.++...++..+.-.. T Consensus 241 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~ 319 (474) T protein:vir:96 241 NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYIL-KGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTIQIEVPVQ 319 (474) T ss_pred CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccccchhhhhhcCceEEecCCCCceeEEeecCChH Confidence 567899999999999999999999999999999987655 332 211 11 22345567787765555567766555556 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .....++.+.+.+-..|++++...+..++..|+.| +..+............+.|.+ +++.++ .+|..++.. T Consensus 320 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~~-~l~~~~----~~i~~~~~~-- 390 (474) T protein:vir:96 320 SSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTLT-ALQELL----QYIIDFYKL-- 390 (474) T ss_pred HHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHH----HHHHHHhCC-- Confidence 67788899999999999999887664433334444 433333333333333344432 344444 444444321 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) .++.. ++.+..+-+.+....+.. +++.. ++.+ ....+... +....+....++.+. T Consensus 391 ----------~~~~~----~i~i~f~~~~p~~~~e~~----~~~~~-ag~i----S~et~~~~--~~~v~d~~~E~~ri~ 445 (474) T protein:vir:96 391 ----------NIKVQ----DVEITFNFNVMVNELEQS----QIGVQ-SQYL----SKETVVTN--HPWVDDPVAELERIE 445 (474) T ss_pred ----------Ccccc----eeeEEeccCCCcCHHHHH----HHHHh-cCCC----chHHHHHh--CCCCCCHHHHHHHHH Confidence 01111 112222222222221222 22222 1111 12221111 111111111111111 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTID 581 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~ 581 (663) .+.....+.. ..+...........++ .-+ T Consensus 446 ~E~~e~~~~~------------~~~~~~~~~~~~d~~~-e~~ 474 (474) T protein:vir:96 446 QDNIDFNKQL------------PPLEGDANGRAQDNES-ETN 474 (474) T ss_pred HHHHHHHhcc------------cccccccccccCCCcc-cCC Confidence 1000000000 0000000000000000 000 No 67 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.73 E-value=5.2e-17 Score=109.86 Aligned_cols=434 Identities=10% Similarity=0.047 Sum_probs=193.1 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------c--cCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------Q--KGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~--~g~s~~~~~~i~~~v~~~~~~ 67 (663) |..+.+.+-..|.+.++. +.++...+.++.+||.|++.-+. . +...+++.|-.+..|+...++ T Consensus 18 ~~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~ 93 (472) T protein:vir:93 18 TNNKPETLEEMIVRYIKQ----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 93 (472) T ss_pred ecCchhhHHHHHHHHHHH----HHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhh Confidence 444444444444443333 44555667788899999753111 1 111245667666677766665 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+ .|. .+|.+..+ .++.++. ++....+..+.++++++|.|++.+++|.+ T Consensus 94 l----~g~~~--~~~---~~d~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~d------------ 146 (472) T protein:vir:93 94 I----VGKPI--AFK---HTDDEVVK----RIDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLDEE------------ 146 (472) T ss_pred h----cccCe--eec---cCChHHHH----HHHHHHh--ccHHHHHHHHHHHHhhcCeEEEEEEECCC------------ Confidence 5 44332 232 24444333 3444442 45566677788999999999999887521 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.+.+.+++|..+++ |++..+ +..+.+ +.+.+... T Consensus 147 ---------------------~~~~i~~~~p~~~~~i~d~~~~~---~~~~~i-r~~~~~~~------------------ 183 (472) T protein:vir:93 147 ---------------------GEFKLFRVPAEQGIPIWTDKEHE---ELEAFI-RMYKLENE------------------ 183 (472) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEeecc------------------ Confidence 335677788888864 433222 222222 22211000 Q ss_pred hhhhccccccccccccccccceEEEEE---EEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYE---YWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E---~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ..+++|. +|.. ...+++. ........+...+.. .|.+.+.+|++.++ + T Consensus 184 --------------------~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~vPvv~~~-----n 234 (472) T protein:vir:93 184 --------------------TKVEYWDKVTVNYY-VYENGSL-IPDYSNNLENSKTHF--STGSWGKIPFIPFK-----N 234 (472) T ss_pred --------------------eeEEEEecCeEEEE-EEecCee-eeccccccccccccc--ccCCCCCcceEEec-----C Confidence 0011110 1000 0011100 000000011111222 23334667776443 4 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc--ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ--TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~--~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +.+|.|.+..++++++.+|.+.+.+.+.+...+.|.+++ .|.-.. .+ ......++++.+..+++ ..++..+.-.. T Consensus 235 n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~ 312 (472) T protein:vir:93 235 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLRYYGAIKVSDNGG-VDTIQVEVPVE 312 (472) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEe-ecCCcccchhhHHHHhhccccccCCCCc-ceeEeecCCHH Confidence 567999999999999999999999999999999987665 333211 11 11234556666655543 44444343345 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+.+.|...|++++.+.+..++..|+.| +...............+.|.. + ++.++.++..+.... T Consensus 313 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~~~~-~----l~~~~~li~~~~~~~- 384 (472) T protein:vir:93 313 NSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKV-A----IQELLWFVFEHFDIK- 384 (472) T ss_pred HHHHHHHHHHHHHHHHhCCCCCCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHHhCCC- Confidence 67778899999999999999887765433334444 333322333333333344432 3 344444444443211 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) + ++ .++++..+-+.+....+..+ .+..+.+.++ ...+... +....+....++.+. T Consensus 385 -----~-~~---------~~i~v~f~~~~p~~~~~~~~----~~~k~~giis----~et~l~~--l~~~~d~~~E~~ri~ 439 (472) T protein:vir:93 385 -----G-EH---------KDVDISFNYNKVANTELQVQ----TAQQSMGIVS----HETVLEN--HPFVEDLQAELERIE 439 (472) T ss_pred -----c-cc---------ceeeEEeCCCCCCCHHHHHH----HHHHHhccCc----hHHHHHh--CCCCCCHHHHHHHHH Confidence 0 11 01222222122211111221 1222222221 1111111 111111111111110 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARK 598 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~ 598 (663) . ++.+. .+.+.. ... . ..+..... +. ..+.. ++ T Consensus 440 ~--------------E~~~~-~~~~~~-~~~--~-----~~d~~~~~-~~-~~~~~-~e 472 (472) T protein:vir:93 440 Q--------------EQMEY-NKQLPN-LDD--G-----GADGAQQQ-ER-SNNKE-SE 472 (472) T ss_pred H--------------HHHHH-HHhccC-cCc--c-----cCCCCCCC-CC-CCccc-CC Confidence 0 00000 000000 000 0 00000000 00 00000 00 No 68 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.72 E-value=2.8e-17 Score=111.32 Aligned_cols=443 Identities=12% Similarity=0.078 Sum_probs=193.8 Q ss_pred CCCcH-------HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc-----------cCC--Cccc Q lcl|NC_021532. 1 MKINK-------AELLSALKA-------DMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ-----------KGK--SAIV 53 (663) Q Consensus 1 ~~~~~-------~~~~~~l~~-------~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-----------~g~--s~~~ 53 (663) |.++. +++++.|.. .+......+..+.+.+..+.+||.|++.-+.. .++ .+++ T Consensus 2 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 81 (478) T protein:vir:10 2 ISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMY 81 (478) T ss_pred ccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccceec Confidence 22211 222222221 12223333445566778888999987642211 112 2355 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeec Q lcl|NC_021532. 54 SRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWD 133 (663) Q Consensus 54 ~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d 133 (663) .|...-.|+..+++ +++..+.+ . .+|++..+. +..++. ++....+..+.++++++|.|++.+++| T Consensus 82 ~n~~~~ivd~~~~~----l~g~~~~~--~---~~~d~~~~~----l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~~~~d 146 (478) T protein:vir:10 82 TNYHQNLVDQKVAY----AVANPVTF--G---VDNDKALKQ----IQHTLN--HKWDDKLVDILTAASNKGIEWVQPYVD 146 (478) T ss_pred cchHHHHHHHHHhh----hccCCeee--e---cCChHHHHH----HHHHHh--cCHHHHHHHHHHHHHhcCeEEEEEEec Confidence 56555555555554 45544333 2 244443333 333343 456667778899999999999999876 Q ss_pred cccceecccccccccCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHh Q lcl|NC_021532. 134 YEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKD 211 (663) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~ 211 (663) .+ +.+.+.+++|..+|+ |++..+ +..+++ +.+.... T Consensus 147 ~~---------------------------------g~~~~~~~~p~~~~~i~d~~~~~---~~~~~v-~~~~~~~----- 184 (478) T protein:vir:10 147 EE---------------------------------GEFKTFRVPAEQAVPIWTNKERD---ELQAFI-RVYELDG----- 184 (478) T ss_pred CC---------------------------------CeeEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEecC----- Confidence 31 235667788888774 443322 222222 2221100 Q ss_pred cCCcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEE-EEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 212 GRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIV-CAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 212 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~-~~~~g~~~l~~~~~p~~~~~~ 290 (663) . . .... ....++..|++- .+........ ..-..... .....|.+.|.+ T Consensus 185 ---~-----------~---------~~~~--y~~~~i~~~~~~-----~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v 233 (478) T protein:vir:10 185 ---A-----------E---------RVEY--WTKDDVTYYELK-----EGQLIPDFYRSDDHIQPHY-YQGNKLMSWGRV 233 (478) T ss_pred ---c-----------e---------EEEE--EeCCeEEEEEEc-----CCeeeccccccccccccce-ecccccccCCcc Confidence 0 0 0000 000111111110 0000000000 00000111 112234444677 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc-ch-hhhccCCcceEeCCC-C Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ-TN-RKKFLAGANFEFNGT-A 366 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~-~d-~~~~~p~~vi~~~~~-~ 366 (663) |++.+ +++.+|.|.+..++++++.+|...+.+.+.+...++|.+.+ .|.- +. .+ ......++++.+.+. + T Consensus 234 Pvv~~-----~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (478) T protein:vir:10 234 PFIPF-----KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKYYKAISVAGESG 307 (478) T ss_pred ceEEe-----ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccchhhhhhhhcceEEecCCCC Confidence 76644 44668999999999999999999999999999999886654 4432 11 11 223345566666533 3 Q ss_pred CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 367 NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRK 446 (663) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~ 446 (663) +...++..+.-.......++.+...|...|++++.+.+..++..|+.| +...............+.|. ..++. T Consensus 308 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~-----~~l~~ 380 (478) T protein:vir:10 308 SGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTL-----TALQE 380 (478) T ss_pred CcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHH Confidence 345555444434556778899999999999999887664433334444 43333333333333333443 23344 Q ss_pred HHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhh Q lcl|NC_021532. 447 WMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLM 526 (663) Q Consensus 447 ~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~ 526 (663) ++.++..++.. .++.. +..+..+-+.+....+..+ .++.+++.++ ...+...+ . T Consensus 381 ~~~li~~~~g~------------~~~~~----~i~i~f~~~~p~d~~e~a~----~~~kl~g~iS----~et~~~~l--~ 434 (478) T protein:vir:10 381 LLQYIIDFYRL------------DVKVQ----DIEITFNFNVMVNELENSQ----IAMNSTGLLS----KETILSNH--A 434 (478) T ss_pred HHHHHHHHhCC------------Ccccc----cceEEecCCCCCCHHHHHH----HHHHHhCCCC----hHHHHHhC--C Confidence 44555555321 01111 1222222222222222222 2222333222 11111111 1 Q ss_pred hhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 527 RMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 527 ~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) ...+....++.+.. ++.... +.+..-.... ..+.+.+..-.+.| T Consensus 435 ~v~D~~~E~~ri~~--------------E~~~~~-~~~~~~~~~~-------~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 435 WVEDPVAEMERIEQ--------------ENIELN-QQLPDIEEGL-------NGEQQRQSENNQPE 478 (478) T ss_pred CCCCHHHHHHHHHH--------------HHHHHH-hhcccccccc-------CCCCCCCCCCCCCC Confidence 11111111111110 000000 0000000000 00000000000000 No 69 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.72 E-value=4.4e-17 Score=110.23 Aligned_cols=440 Identities=11% Similarity=0.094 Sum_probs=192.4 Q ss_pred CCC----c---HHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccCCC--ccc Q lcl|NC_021532. 1 MKI----N---KAELLSALKAD-------MKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKGKS--AIV 53 (663) Q Consensus 1 ~~~----~---~~~~~~~l~~~-------~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g~s--~~~ 53 (663) |.+ + .++++..+... ++..-..+..+...+..+.+||.|++.-+. .++++ +++ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 81 (478) T protein:vir:10 2 ISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMY 81 (478) T ss_pred ccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceec Confidence 111 0 12222222211 222222344556667888999999763221 11222 345 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeec Q lcl|NC_021532. 54 SRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWD 133 (663) Q Consensus 54 ~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d 133 (663) .|-....|+..++++ ++..+. +. .+|++..+ .++.++. ++....+.++.+++.++|.|++.+++| T Consensus 82 ~n~~k~ivd~~~~yl----~g~p~~--~~---~~~~~~~~----~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d 146 (478) T protein:vir:10 82 TNYHQNLVDQKVAYA----VANPVT--FG---VDNDKALK----QIQHTLN--HKWDDKLVDILTAASNKGIEWVQPYVD 146 (478) T ss_pred cchHHHHHHHHhhhh----cccCce--ee---cCChHHHH----HHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEEEec Confidence 565555555555544 454433 32 24444333 3444443 566667778899999999999999876 Q ss_pred cccceecccccccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHh Q lcl|NC_021532. 134 YEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKD 211 (663) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~ 211 (663) .+ +.+.+..++|.+++ ||+...+ +..+++ +.+..... T Consensus 147 ~~---------------------------------~~~~~~~~~p~~~~~v~d~~~~~---~~~~~i-r~~~~~~~---- 185 (478) T protein:vir:10 147 EE---------------------------------GEFKTFRVPAEQAVPIWTNKERD---ELQAFI-RVYELDGA---- 185 (478) T ss_pred CC---------------------------------CceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEeeeCc---- Confidence 32 23556778888875 3433222 222222 22211000 Q ss_pred cCCcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEE-EEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 212 GRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIV-CAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 212 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~-~~~~g~~~l~~~~~p~~~~~~ 290 (663) .. . .. | ...+|..|.+... ........ ....... ......|...|.+ T Consensus 186 ------~~--------~--~~-------y---~~~~i~~~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~~~~~~g~v 233 (478) T protein:vir:10 186 ------ER--------V--EY-------W---TKDDVTFYELKEG-----QLIPDFYRSEDHIQPH-YYQGNKLMSWGRV 233 (478) T ss_pred ------eE--------E--EE-------E---eCCcEEEEEecCC-----eeeccccccccccccc-eecccccccCCcc Confidence 00 0 00 0 0011221111100 00000000 0000111 1122334555677 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc-ch-hhhccCCcceEeCCC-C Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQ-TN-RKKFLAGANFEFNGT-A 366 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~-~d-~~~~~p~~vi~~~~~-~ 366 (663) |++.+. ++..|.|.+..++++++.+|.+.|.+.+.+...++|.+.+ .|. .+. .+ .......+++.+.++ + T Consensus 234 Pvv~~~-----n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (478) T protein:vir:10 234 PFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL-KGYEGEDMKDFMHNLKYYKAISVAGESG 307 (478) T ss_pred eEEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceee-ecCCcccccchhhhhhhCceeEecCCCC Confidence 766544 3557899999999999999999999999999988886654 343 111 11 222344556666543 3 Q ss_pred CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 367 NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRK 446 (663) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~ 446 (663) ..+.++..+.-...+...++.+.+.|...|++++.+.+..++..|+.| +..+............+.|. ..++. T Consensus 308 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--i~~~~~~l~~k~~~~~~~~~-----~~l~~ 380 (478) T protein:vir:10 308 SGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTL-----TALQE 380 (478) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHH Confidence 345565555445667778899999999999999877664333334433 33333333333333333443 33444 Q ss_pred HHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhh Q lcl|NC_021532. 447 WMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLM 526 (663) Q Consensus 447 ~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~ 526 (663) ++.++.++.... ++.. ++.+..+-+.+....+.. +.+..++..++ ...+.. .+. T Consensus 381 ~~~li~~~~~~~------------~d~~----~i~i~f~~~~p~~~~e~~----~~~~~~~g~iS----~et~i~--~~~ 434 (478) T protein:vir:10 381 LLQYIIDFYRLD------------VRVQ----DIEITFNFNVMVNELENS----QIAMNSTGLLS----KETILG--NHS 434 (478) T ss_pred HHHHHHHHhCCC------------cccc----cceEEeCCCCCCCHHHHH----HHHHHHhCCCC----hHHHHH--hCC Confidence 555555554210 1111 122222222222111111 22222222221 111111 111 Q ss_pred hhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHH Q lcl|NC_021532. 527 RMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQML--V-ASINDKNARANENTID 581 (663) Q Consensus 527 ~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~--~-a~~~~~~a~~q~~~~~ 581 (663) ...+....++.+.. ++.+...+.. . .....+..+......+ T Consensus 435 ~v~d~~~E~~ri~~--------------E~~~~~~~~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 435 WVQDPVAEMERIEQ--------------ENIELNQQLPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred CCCCHHHHHHHHHH--------------HHHHHHHhccccCCCCcccccccCcCCCCC Confidence 11111111111110 0000000000 0 0000000000000000 No 70 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.72 E-value=4.2e-17 Score=110.35 Aligned_cols=443 Identities=10% Similarity=0.051 Sum_probs=194.2 Q ss_pred CCCcH-------HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------------cCC--ccccCCC--ccc Q lcl|NC_021532. 1 MKINK-------AEL-LSALKADMKAADVLKQEQDSLISTWKAEYNGE---------------PYG--NEQKGKS--AIV 53 (663) Q Consensus 1 ~~~~~-------~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~---------------~~~--~~~~g~s--~~~ 53 (663) |.|.+ .++ .+.|...++.-...+........+|..+...- .+. ...++++ +++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 44321 111 12222222222222222222222322211100 000 1122233 455 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeec Q lcl|NC_021532. 54 SRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWD 133 (663) Q Consensus 54 ~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d 133 (663) .|-....|+..+++ +++..+.+.+.+ |....+....+++.++. .|+.......+.++++++|.|+..++.| T Consensus 81 ~n~~~~ivd~~~~y----l~g~pv~~~~~~----~~~~~e~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:94 81 NSFDSEIVDTRVGY----LHGVPVTYDLDE----NAEKNEKLKKFITNFAI-RNSVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred cchHHHHHHhHhhh----eeccceeEeeCC----CCcchHHHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 66655556655554 455544444432 22333455566666553 4566667788999999999999888654 Q ss_pred cccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcC Q lcl|NC_021532. 134 YEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGR 213 (663) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~ 213 (663) .+ +.+.+.+++|.++|+=.+-. . +.-+.++ .+....+ T Consensus 152 ~~---------------------------------~~~~~~~i~p~~~~~v~d~~--~-~~~~~i~-~~~~~~~------ 188 (474) T protein:vir:94 152 TN---------------------------------GDIRIKNIDPYNVIFVGDNI--L-EPTYSLR-YFYEKDD------ 188 (474) T ss_pred CC---------------------------------CeeEEEEEcccceEEEEcCC--C-ceEEEEE-EEEEeeC------ Confidence 21 23566778888775332111 1 1112222 1111000 Q ss_pred CcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC---CEEEecccCCCcCCCC Q lcl|NC_021532. 214 YKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN---DVIVRLQSNPYPDGKP 290 (663) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g---~~~l~~~~~p~~~~~~ 290 (663) .....+..+++|... .+.+|.+ +......+.|.+.|.+ T Consensus 189 -----------------------------~~~~~~~~~~~y~~~----------~~~~~~~~~~~~~~~~~~~~~~~g~v 229 (474) T protein:vir:94 189 -----------------------------DNGTDYVYAEFYDNA----------YYYVFRGEGIDALQEVGRYEHLFDYN 229 (474) T ss_pred -----------------------------CCceEEEEEEEEcCc----------eEEEEeecCCCcccccccccCCCCcc Confidence 000112233444221 0111111 1111122223333566 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCcchhhhccCCcceEeCCCCCcc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQTNRKKFLAGANFEFNGTANDF 369 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~~d~~~~~p~~vi~~~~~~~~~ 369 (663) |++.+ +++.+|.|.++.++++++.+|...|.+.+.+...++|.+.+ .|. ++.++.......+.+.+.+++... T Consensus 230 Pvv~~-----~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~i~~~~~~~~~ 303 (474) T protein:vir:94 230 PLFGV-----PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQETQKSGAFELFDKDMDV 303 (474) T ss_pred ceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhhhhcceeEecCCCCce Confidence 76643 45667999999999999999999999999999999887766 443 333333344556666665555566 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+.+.|...|++++.+.+..++..|+.| +..+............+.|.+ +++.++++++. T Consensus 304 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~ 380 (474) T protein:vir:94 304 KYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG--MKLKLMALENKCMTFERKMTA-MLRYQFKVILS 380 (474) T ss_pred eEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 666655445667788899999999999999987764333334444 444433444444444445533 44555555555 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++..-. ......+.. ++.+..+-..+....+..+.+ ..+.+.++ ...+... +.... T Consensus 381 ~l~~~~----------~~~~~~~~~----~i~~~f~~~~p~d~~e~a~~~----~kl~g~iS----~et~~~~--l~~v~ 436 (474) T protein:vir:94 381 ALKRKG----------YNLDDDSYL----NLIFKFTRNIPVNKLEESQVL----INLKGQVS----ERTRLGQ--SQLVD 436 (474) T ss_pred HHhhcc----------CCCCccccc----cceEEeCCCCCCCHHHHHHHH----HHHhccCc----hHHHHHh--CCCCC Confidence 433211 111011110 112222222222222222222 22222111 1111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) +....++.+ ..++.+.. +..... ........ ....+.+ T Consensus 437 d~~~E~eri--------------~~E~~e~~--~~~~~~--~~~~~~~~-------~~~~~s~ 474 (474) T protein:vir:94 437 DVDYELDEM--------------EKESLEFN--DKLPDI--DEGDANDK-------SQNNQSE 474 (474) T ss_pred CHHHHHHHH--------------HHHHHHHH--hhcccc--cCCCcCCC-------CccccCC Confidence 111111100 00000000 000000 00000000 0000000 No 71 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.72 E-value=4.2e-17 Score=110.35 Aligned_cols=443 Identities=10% Similarity=0.051 Sum_probs=194.2 Q ss_pred CCCcH-------HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------------cCC--ccccCCC--ccc Q lcl|NC_021532. 1 MKINK-------AEL-LSALKADMKAADVLKQEQDSLISTWKAEYNGE---------------PYG--NEQKGKS--AIV 53 (663) Q Consensus 1 ~~~~~-------~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~---------------~~~--~~~~g~s--~~~ 53 (663) |.|.+ .++ .+.|...++.-...+........+|..+...- .+. ...++++ +++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 44321 111 12222222222222222222222322211100 000 1122233 455 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeec Q lcl|NC_021532. 54 SRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWD 133 (663) Q Consensus 54 ~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d 133 (663) .|-....|+..+++ +++..+.+.+.+ |....+....+++.++. .|+.......+.++++++|.|+..++.| T Consensus 81 ~n~~~~ivd~~~~y----l~g~pv~~~~~~----~~~~~e~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~~~d 151 (474) T protein:vir:10 81 NSFDSEIVDTRVGY----LHGVPVTYDLDE----NAEKNEKLKKFITNFAI-RNSVDDEDSEIGKMAAICGYGARLAYID 151 (474) T ss_pred cchHHHHHHhHhhh----eeccceeEeeCC----CCcchHHHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEeC Confidence 66655556655554 455544444432 22333455566666553 4566667788999999999999888654 Q ss_pred cccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcC Q lcl|NC_021532. 134 YEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGR 213 (663) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~ 213 (663) .+ +.+.+.+++|.++|+=.+-. . +.-+.++ .+....+ T Consensus 152 ~~---------------------------------~~~~~~~i~p~~~~~v~d~~--~-~~~~~i~-~~~~~~~------ 188 (474) T protein:vir:10 152 TN---------------------------------GDIRIKNIDPYNVIFVGDNI--L-EPTYSLR-YFYEKDD------ 188 (474) T ss_pred CC---------------------------------CeeEEEEEcccceEEEEcCC--C-ceEEEEE-EEEEeeC------ Confidence 21 23566778888775332111 1 1112222 1111000 Q ss_pred CcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC---CEEEecccCCCcCCCC Q lcl|NC_021532. 214 YKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN---DVIVRLQSNPYPDGKP 290 (663) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g---~~~l~~~~~p~~~~~~ 290 (663) .....+..+++|... .+.+|.+ +......+.|.+.|.+ T Consensus 189 -----------------------------~~~~~~~~~~~y~~~----------~~~~~~~~~~~~~~~~~~~~~~~g~v 229 (474) T protein:vir:10 189 -----------------------------DNGTDYVYAEFYDNA----------YYYVFRGEGIDALQEVGRYEHLFDYN 229 (474) T ss_pred -----------------------------CCceEEEEEEEEcCc----------eEEEEeecCCCcccccccccCCCCcc Confidence 000112233444221 0111111 1111122223333566 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCcchhhhccCCcceEeCCCCCcc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQTNRKKFLAGANFEFNGTANDF 369 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~~d~~~~~p~~vi~~~~~~~~~ 369 (663) |++.+ +++.+|.|.++.++++++.+|...|.+.+.+...++|.+.+ .|. ++.++.......+.+.+.+++... T Consensus 230 Pvv~~-----~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~i~~~~~~~~~ 303 (474) T protein:vir:10 230 PLFGV-----PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQETQKSGAFELFDKDMDV 303 (474) T ss_pred ceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhhhhcceeEecCCCCce Confidence 76643 45667999999999999999999999999999999887766 443 333333344556666665555566 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+.+.|...|++++.+.+..++..|+.| +..+............+.|.+ +++.++++++. T Consensus 304 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~ 380 (474) T protein:vir:10 304 KYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG--MKLKLMALENKCMTFERKMTA-MLRYQFKVILS 380 (474) T ss_pred eEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 666655445667788899999999999999987764333334444 444433444444444445533 44555555555 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++..-. ......+.. ++.+..+-..+....+..+.+ ..+.+.++ ...+... +.... T Consensus 381 ~l~~~~----------~~~~~~~~~----~i~~~f~~~~p~d~~e~a~~~----~kl~g~iS----~et~~~~--l~~v~ 436 (474) T protein:vir:10 381 ALKRKG----------YNLDDDSYL----NLIFKFTRNIPVNKLEESQVL----INLKGQVS----ERTRLGQ--SQLVD 436 (474) T ss_pred HHhhcc----------CCCCccccc----cceEEeCCCCCCCHHHHHHHH----HHHhccCc----hHHHHHh--CCCCC Confidence 433211 111011110 112222222222222222222 22222111 1111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) +....++.+ ..++.+.. +..... ........ ....+.+ T Consensus 437 d~~~E~eri--------------~~E~~e~~--~~~~~~--~~~~~~~~-------~~~~~s~ 474 (474) T protein:vir:10 437 DVDYELDEM--------------EKESLEFN--DKLPDI--DEGDANDK-------SQNNQSE 474 (474) T ss_pred CHHHHHHHH--------------HHHHHHHH--hhcccc--cCCCcCCC-------CccccCC Confidence 111111100 00000000 000000 00000000 0000000 No 72 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.72 E-value=2.1e-15 Score=101.07 Aligned_cols=427 Identities=10% Similarity=0.062 Sum_probs=192.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc-----------cC--CCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ-----------KG--KSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-----------~g--~s~~~~~~i~~~v~~~~~~ 67 (663) ..++.+.|...+ ..+.++......+.+||.|++.-+.. +. ..+++.|-....|+..+++ T Consensus 24 ~~~~~~~i~~~i--------~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~ 95 (468) T protein:vir:96 24 YETQEEMILRLI--------TKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAY 95 (468) T ss_pred ccCcHHHHHHHH--------HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh Confidence 333333332222 22344455678888999997532111 11 1245666666666665555 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+.+. .+|.+..+. +..++. ++....+.++..++.++|.|++.+++|.+ T Consensus 96 l----~g~p~~~~-----~~d~~~~~~----l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~------------ 148 (468) T protein:vir:96 96 A----VANPVTYG-----TEDEKSLKT----IQEVLN--HKWDDKLVDILTAASNKGVEWIQPYVDEQ------------ 148 (468) T ss_pred h----ccCCceec-----cCChHHHHH----HHHHHh--cCHHHHHHHHHHHHhhcCeEEEEEEEcCC------------ Confidence 4 45443332 244443333 333342 45666677889999999999999988632 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.+.+..++|..+|+ |+... .+..++++ .+.. .. .. T Consensus 149 ---------------------~~~~i~~~~p~~~~~v~~~~~~---~~~~~~ir-~~~~-~~---------~~------- 186 (468) T protein:vir:96 149 ---------------------GEFKTFRVPAEQAIPIWTNKER---DELKAFIR-LYEL-DG---------GE------- 186 (468) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCC---CceEEEEE-EEEe-cC---------ce------- Confidence 235567788888763 33222 22222222 2211 00 00 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCccc Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLH 305 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~ 305 (663) ..+ . + ....+..|.++. +......................|.+.|++|++.+ +++.+ T Consensus 187 -~~~--~-------~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~-----~n~~~ 243 (468) T protein:vir:96 187 -RVE--Y-------W---TANDVTFYELKD-----GQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPF-----KNNPQ 243 (468) T ss_pred -EEE--E-------E---eCCeEEEEEEcC-----CceeecccccccccccceeeccccccCCcccEEEe-----cCCCC Confidence 000 0 0 001122221110 00000000000000011112233445577777754 34556 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc-h-hhhccCCcceEeCCC-CCccccccCccccHHHH Q lcl|NC_021532. 306 GEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT-N-RKKFLAGANFEFNGT-ANDFWHGSYNAIPSSAF 382 (663) Q Consensus 306 g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~-d-~~~~~p~~vi~~~~~-~~~~~~~~~~~~~~~~~ 382 (663) |.|.+..++++++.+|...|.+.+.+...++|.+.+.-...+.. + ......++++.+.+. ++.+.++..+.-..... T Consensus 244 g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~~ 323 (468) T protein:vir:96 244 EVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDTIQIDVPVQSAK 323 (468) T ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcceEEeecCChHHHH Confidence 89999999999999999999999999999998766532222211 1 222344667777543 23456666555456677 Q ss_pred HHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021532. 383 DMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIR 462 (663) Q Consensus 383 ~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ir 462 (663) ..++.+...+...|++++.+.+..++..|+.| +...............+.|. ..++.++.+|..++.. T Consensus 324 ~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~k~~~~~-----~~l~~~~~li~~~~g~----- 391 (468) T protein:vir:96 324 EYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIA--LKFMYSNLDLKANKLKNKTL-----TALQELLQYIIDFYKL----- 391 (468) T ss_pred HHHHHHHHHHHHHhCcccccccccccchHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhCC----- Confidence 78899999999999999887654333334433 33333333333333333442 2334444555554321 Q ss_pred EecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcch Q lcl|NC_021532. 463 VTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKP 542 (663) Q Consensus 463 i~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~ 542 (663) .++.. ++.++.+-+.+....+..+ .+... +.+ ........ +....+....++.+..+ T Consensus 392 -------~~d~~----~i~i~f~~~~p~d~~e~a~----~~~~~-g~i----S~et~i~~--l~~v~D~~~E~~ri~~E- 448 (468) T protein:vir:96 392 -------SIKVQ----DVEITFNFNVMVNELEQSQ----IGVNS-QYL----SKETVVTN--HPWVDDPVAEMERIDQE- 448 (468) T ss_pred -------Ccccc----eeeEEecCCCCcCHHHHHH----HHHhc-CCC----chHHHHHh--CCCCCCHHHHHHHHHHH- Confidence 11111 1122222222222212221 22221 111 11111111 11111111111111110 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQLELENLMLENQMLVASINDKNARANENTI 580 (663) Q Consensus 543 ~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~ 580 (663) +.+.. +.+... .......-. T Consensus 449 -------------~~~~~--~~~~~~---~~~~~~~~~ 468 (468) T protein:vir:96 449 -------------ELALP--SIEEGL---NGKENNEPT 468 (468) T ss_pred -------------HHHHH--HHhhcc---CCCCCCCCC Confidence 00000 000000 000000000 No 73 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.72 E-value=6.2e-17 Score=109.46 Aligned_cols=472 Identities=10% Similarity=0.025 Sum_probs=201.7 Q ss_pred CCCc-HHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----cccC--CCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKIN-KAELLSALK----ADMKAADVLKQEQDSLISTWKAEYNGEPYGN----EQKG--KSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~~~-~~~~~~~l~----~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~~~g--~s~~~~~~i~~~v~~~~~~l~ 69 (663) |.+- +++++..+. ..++.....++.+...+.++.+||.|.+.-+ +.++ ..+++.|-.+..|+..++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l- 79 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNVGFM- 79 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHhhhh- Confidence 5554 444444432 1133333445556667788899999975321 1122 23455666666666555544 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) ++..+. +.+ +|.+..+. ++.++ ..|+....+..+.+++.++|.|+..++++...... T Consensus 80 ---~g~p~~--~~~---~~~~~~~~----l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~---------- 136 (499) T protein:vir:10 80 ---TGNPVK--YVA---EKGKNIDD----ILEVF-NQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPIS---------- 136 (499) T ss_pred ---cccCce--eec---CChhHHHH----HHHHH-hhcCHhHHHHHHHHHHHhcCceEEEEEeccccccc---------- Confidence 454332 332 33333333 33334 33566666788999999999999999876322110 Q ss_pred ccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) ...............+.+..|+|.+.|+-.+- ....-...+.+.+.+.+. T Consensus 137 ------~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d--~~~~~~~~~i~~~~~~~~---------------------- 186 (499) T protein:vir:10 137 ------VRDELGNEKLTPNTELKIEVIDPRATVVVCDD--TVEHDPLFAVFTQEKKDL---------------------- 186 (499) T ss_pred ------ccccccccccccccceEEEEEcccceEEEecC--CCCcceEEEEEEEEEeec---------------------- Confidence 00011111112223345666777766532110 011111111121111000 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEE-------CCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWI-------NDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~-------g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) .....+..+|.|... .+ ++.+.. +..++...++++ |.+|++.+ .+ T Consensus 187 -------------~~~~~~~~~~iyt~~-----~i---~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~-----~n 238 (499) T protein:vir:10 187 -------------EGNTNGYSITVYMPQ-----RI---VEYRTKTTMEVSANDPIVYDGENLF--GAVPIIEF-----RN 238 (499) T ss_pred -------------CCCceEEEEEEEeCC-----eE---EEEEecCCccccCcceecccccCCC--CccceEEe-----cC Confidence 000123334444321 11 111111 112333333433 56676643 34 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-c-hhhhccCCcceEeCCC-CCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ-T-NRKKFLAGANFEFNGT-ANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~-~-d~~~~~p~~vi~~~~~-~~~~~~~~~~~~~~ 379 (663) +.+|.|.+..++++++.+|.+.|.+.+.+...++|.+++--..++. . .......++++.+..+ +....++..+.-.. T Consensus 239 ~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~ 318 (499) T protein:vir:10 239 NEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDET 318 (499) T ss_pred CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCHH Confidence 5678999999999999999999999999999999877764222221 1 1222345666655432 23355555554456 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .+...++.+...|..+|++++.+.+.-++..|+.| +..+............+.|. ..++.++.++..+.. T Consensus 319 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~A--l~~~~~~l~~k~~~k~~~~~-----~~l~~~~~li~~~~~--- 388 (499) T protein:vir:10 319 QVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEA--MKFKLFGLENLLSIKQRYFF-----DGLRRRLKLIQTIVN--- 388 (499) T ss_pred HHHHHHHHHHHHHHHHhCcccCCchhhcccchHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHh--- Confidence 67788899999999999998776543222334444 44333344444444444443 233444444444432 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) +.|. ..+.. ++.+..+-..+....+..+.+ +.++..++ ...+... +....+....++.+. T Consensus 389 ---~~~~---~~d~~----~i~i~f~~~~p~n~~e~~~~~----~kl~g~iS----~et~~~~--l~~v~d~~~E~~ri~ 448 (499) T protein:vir:10 389 ---IKGA---NDDAS----GCKISLVANIPSNLSDVVNNV----KNADGIIP----RKYTYSW--LPDVDNPQDVIDEMN 448 (499) T ss_pred ---ccCC---ccccc----cceEEeCCCCCCCHHHHHHHH----HHHhccCC----hHHHHHh--CCCCCCHHHHHHHHH Confidence 1111 11111 112222222222222222222 22222222 1111111 111111000011100 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH----HHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANEN--TIDAELK----RSKAAVEKAKARKLSSE 602 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~--~~~~~~~----~~~~~~e~~~~q~~~~~ 602 (663) . ++... ....+.. ........ ..+.+.. ..+...+..+..+-.+. T Consensus 449 ~--------------E~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 449 Q--------------QDAET-IKKNQEA---LRGQDPDRLELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred H--------------HHHHH-HHHHHhh---hccCCCCCCCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 0 00000 0000000 00000000 0000000 00000000000000000 No 74 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.72 E-value=7.9e-17 Score=108.86 Aligned_cols=452 Identities=10% Similarity=0.027 Sum_probs=194.1 Q ss_pred CCCcHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcCCccc--------cCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAE-LLSALKADMKAADVLKQ-EQDSLISTWKAEYNGEPYGNEQ--------KGKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~-~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~~--------~g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+ ........+..+...+. .....+.++.+||.|.+.-+.. +...+++.|-..-.|+...+++ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:93 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred ccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhh-- Confidence 2222111 11111222333333232 2345577788999997653211 1123456666666666555544 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. +. .+|++. .+.++.++. .|+.......+.+++.++|.|+..++++.. T Consensus 107 --~g~p~~--~~---~~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~ay~~vy~de~--------------- 159 (511) T protein:vir:93 107 --LGNPIQ--YQ---DDDKDV----LEVIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD--------------- 159 (511) T ss_pred --cccCee--ec---cCChHH----HHHHHHHHh-hcCHhHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 443322 22 233332 344555443 466667778899999999999999887632 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..++|.++| ||+.... . ...+.+.+.+... . T Consensus 160 ------------------~~~~i~~~~p~~~~~vydd~~~~---~-~~~~vr~~~~~~~----------------~---- 197 (511) T protein:vir:93 160 ------------------DETRLYKSDAMSTFVIYDNTIER---N-SIAGVRYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEccceeEEEEcCCCCC---c-eEEEEEEEEeeec----------------c---- Confidence 23556778888876 4433221 1 1222233221000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEE-----ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIV-----RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l-----~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+|+|.. +.+ ++++..++..+ .....|.+.|..|++.+ +++ T Consensus 198 -------------~~~~~~~~~~~iyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~nn 251 (511) T protein:vir:93 198 -------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNN 251 (511) T ss_pred -------------ccccceEEEEEEEeC-----CcE---EEEEecCCCccccccccccccccCCCccceEEe-----cCC Confidence 000112334455532 111 11111111110 11112223356666543 345 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc--CcchhhhccCCcceEeCC------------CCCcc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL--DQTNRKKFLAGANFEFNG------------TANDF 369 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i--~~~d~~~~~p~~vi~~~~------------~~~~~ 369 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+++- |.. +..+......+.++...+ ++... T Consensus 252 ~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (511) T protein:vir:93 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-GNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeee-cCcccCchhhcccccccceecccccccccccccCCCCcce Confidence 578999999999999999999999999988888766543 322 222222223333333221 12233 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-...+...+..+...|..+|++++...+..++..|+.| +..+............+.|.+ +++.++++++. T Consensus 331 ~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~-~l~~~~~li~~ 407 (511) T protein:vir:93 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 407 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 344433334566778889999999999999987764433334444 444444444444444455543 44555555554 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++........ ..++. ++.+..+-..+....+..+. +..+...++ ...+... +.... T Consensus 408 ~l~~~~~~~~-------------~~d~~-~i~~~f~~~~p~n~~e~~~~----~~kl~g~iS----~et~~~~--l~~v~ 463 (511) T protein:vir:93 408 ILKNTWSIDA-------------NKDFN-TVRYVYNRNLPKSLIEELKA----YIDSGGKIS----QTTLMSL--FSFFQ 463 (511) T ss_pred HHHhccCccc-------------ccccc-cceEEeCCCCCCCHHHHHHH----HHHHhccCc----hHHHHHh--CCCCC Confidence 4332211100 00110 11222222222222222222 222222222 1111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) + + .....++..++.. +....+.... ......-........+-...+.+ T Consensus 464 d-------------~-~~E~~ri~~E~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 464 D-------------P-ELEVKKIEEDEKE-SIKKAQKGIY---KDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred C-------------H-HHHHHHHHHHHHH-HHHHHhhhcc---cCCCCCCCCCCCCcccccccccC Confidence 1 0 0001111100000 0000000000 00000000000000000000000 No 75 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.71 E-value=4.8e-17 Score=110.06 Aligned_cols=448 Identities=9% Similarity=0.023 Sum_probs=194.5 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcCCccc--------cCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL-LSALKADMKAADVLKQE-QDSLISTWKAEYNGEPYGNEQ--------KGKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~-~~~l~~~~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~~~--------~g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+. +......+..+...+.. ....++++.+||.|.+.-+.. +...+++.|-..-.|+...+++ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:99 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred cccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhh-- Confidence 22222111 11111222332222222 334577788999997653211 1223466666666666665554 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. |. .+|++. .+.++.++. .|+.......+.+++.++|.|++.+++|.. T Consensus 107 --~g~p~~--~~---~~d~~~----~~~l~~~~~-~n~~~~~~~~~~~~~~i~G~a~~~vy~ded--------------- 159 (511) T protein:vir:99 107 --LGNPIQ--YQ---DDDKDV----LEAIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD--------------- 159 (511) T ss_pred --cccCce--ee---cCchHH----HHHHHHHHh-hcCHhHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 444333 32 233332 345555554 466667778899999999999999987632 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+.+++|.++| +|++... ..-+.+ +.+..... . T Consensus 160 ------------------~~~~i~~~~p~~~~~vyd~~~~~---~~~~~v-r~~~~~~~----------------~---- 197 (511) T protein:vir:99 160 ------------------DETRLYKSDAMSTFVIYDNTIER---NSIAGV-RYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEccceeEEEEcCCCCC---ceEEEE-EEEEeeec----------------c---- Confidence 23566778888886 3443211 111222 22211000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEE-----ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIV-----RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l-----~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+|+|.. +++ +.+...++..+ .....|.+.|..|++.+ +++ T Consensus 198 -------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~nn 251 (511) T protein:vir:99 198 -------------KTDEDEVFTVDLFTS-----HGV---YRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNN 251 (511) T ss_pred -------------cCccceEEEEEEEeC-----CcE---EEEEecCCccccccccccccccCCCCccceEEe-----cCC Confidence 000112334455532 111 11111111110 01122223355665544 345 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--cCcchhhhccCCcceEeC------------CCCCcc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA--LDQTNRKKFLAGANFEFN------------GTANDF 369 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~--i~~~d~~~~~p~~vi~~~------------~~~~~~ 369 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+.+ .|. .+..+......++++... .++..+ T Consensus 252 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 330 (511) T protein:vir:99 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhh-ccCcccCchhhcccccccceecccccccccccccCCCCcce Confidence 67899999999999999999999999998888876654 332 222222222223333221 112234 Q ss_pred ccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 370 WHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) .++..+.-.......+..+.+.|..+|++++.+.+..++..|+.| +..+............+.|.+ +++.++++++. T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~A--lk~~~~~l~~ka~~k~~~~~~-~l~~~~~li~~ 407 (511) T protein:vir:99 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLET 407 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 444444334566778899999999999999987764333334444 444443444444444455543 44455555555 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~ 529 (663) ++....... . +.++. ++.+..+-+.+....+..+. +..+...++ ...+... +.... T Consensus 408 ~~~~~~~~~----------~---~~~~~-~i~i~f~~~~p~n~~e~~~~----~~kl~GiiS----~et~l~~--l~~v~ 463 (511) T protein:vir:99 408 ILKNTRSID----------V---SKDFN-TVRYVYNRNLPKSLIEELKA----YIDSGGKIS----QTTLMSL--FSFFQ 463 (511) T ss_pred HHHhcCCcc----------c---ccccc-cceEEeCCCCCcCHHHHHHH----HHHHhccCC----HHHHHHh--CCCCC Confidence 443321100 0 00110 11222222222222222222 222222222 1111111 11111 Q ss_pred hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHH Q lcl|NC_021532. 530 EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDK----NARANENTIDAELKRSK 588 (663) Q Consensus 530 e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~----~a~~q~~~~~~~~~~~~ 588 (663) +....++.+. .++.... ...+...... .........+....+.+ T Consensus 464 D~~~E~~ri~--------------~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 464 DPELEVKKIE--------------EDEKESI-KKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred CHHHHHHHHH--------------HHHHHHH-HHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 1111111111 0000000 0000000000 00000000000000000 No 76 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.71 E-value=6.4e-17 Score=109.36 Aligned_cols=453 Identities=9% Similarity=0.015 Sum_probs=192.2 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcCCccc--------cCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL-LSALKADMKAADVLKQ-EQDSLISTWKAEYNGEPYGNEQ--------KGKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~-~~~l~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~~--------~g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+. .......+..+-..+. .....++++.+||.|++.-+.. +...+++.|-.+..|+...+++ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:78 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred ccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhh-- Confidence 22211111 1111111222222221 2334567788899998653211 1223566676666777666654 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. +. .+|++.. +.++.++. .|+.......+.++++++|.|+..+++|.. T Consensus 107 --~g~p~~--~~---~~d~~~~----~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--------------- 159 (511) T protein:vir:78 107 --LGNPIQ--YQ---DDDKDVL----EAIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD--------------- 159 (511) T ss_pred --cccCce--ee---cCchHHH----HHHHHHHh-hcChhHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 443333 22 2333332 34555553 456666777899999999999999887531 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..++|.++| +|+.... .. ..+.+.+.+... . T Consensus 160 ------------------g~~~i~~~~p~~~~~v~dd~~~~---~~-~~~vr~~~~~~~----------------~---- 197 (511) T protein:vir:78 160 ------------------DETRLYKSDAMSTFIIYDNTVER---NS-IAGVRYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEeeec----------------c---- Confidence 23566778888887 3433211 11 222222211000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCE---EE--ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDV---IV--RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~---~l--~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+|+|.. +++ +.++..++. +. .....|.+.+..|++. -+++ T Consensus 198 -------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-----~~n~ 251 (511) T protein:vir:78 198 -------------KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITE-----FSNN 251 (511) T ss_pred -------------ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEE-----ecCC Confidence 000112334455532 111 112221111 10 1112233335556553 3445 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecc-ccCcchhhhccCCcceEeCC------------CCCccc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKG-ALDQTNRKKFLAGANFEFNG------------TANDFW 370 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~-~i~~~d~~~~~p~~vi~~~~------------~~~~~~ 370 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+.+--. ..+..+......+.++...+ ++.... T Consensus 252 ~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:78 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGG 331 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCccee Confidence 67899999999999999999999999998888876654221 12222222222333332211 112233 Q ss_pred cccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 371 HGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAY 450 (663) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~l 450 (663) ++..+.-.......+..+.+.|..+|++++.+.+..++..|+.| +...............+.|.+ +++.++++++.+ T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~-~l~~~~~li~~~ 408 (511) T protein:vir:78 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLETI 408 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 44433334556778888999999999999987775433334444 443333334444444445533 444555555554 Q ss_pred HHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhh Q lcl|NC_021532. 451 NAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPE 530 (663) Q Consensus 451 i~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e 530 (663) +....... .. .++. ++.+..+-..+....+..+. +..+...++ ...+... +....+ T Consensus 409 ~~~~~~~~----------~~---~~~~-~i~~~f~~~~p~n~~e~~d~----~~kl~G~iS----~et~l~~--l~~v~d 464 (511) T protein:vir:78 409 LKNTRSID----------AN---KDFN-TVRYVYNRNLPKSLIEELKA----YIDSGGKIS----QTTLMSL--FSFFQD 464 (511) T ss_pred HHhcCCCc----------cc---cccc-cceEEeCCCCCcCHHHHHHH----HHHHhccCC----hHHHHHh--CCCCCC Confidence 43211100 00 0110 11222222222222222222 222222222 1111111 111111 Q ss_pred hhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 531 QAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 531 ~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) ....++.+.. ++... ....+...............+-+.+....+.+ T Consensus 465 ~~~El~ri~~--------------E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 465 PELEVKKIEE--------------DEKES-IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHH--------------HHHHH-HHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 1111111110 00000 00000000000000000000000000000000 No 77 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.71 E-value=6.4e-17 Score=109.36 Aligned_cols=453 Identities=9% Similarity=0.015 Sum_probs=192.2 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcCCccc--------cCCCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL-LSALKADMKAADVLKQ-EQDSLISTWKAEYNGEPYGNEQ--------KGKSAIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~~-~~~l~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~~--------~g~s~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|...+. .......+..+-..+. .....++++.+||.|++.-+.. +...+++.|-.+..|+...+++ T Consensus 29 ~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-- 106 (511) T protein:vir:96 29 YTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-- 106 (511) T ss_pred ccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhh-- Confidence 22211111 1111111222222221 2334567788899998653211 1223566676666777666654 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++..+. +. .+|++.. +.++.++. .|+.......+.++++++|.|+..+++|.. T Consensus 107 --~g~p~~--~~---~~d~~~~----~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--------------- 159 (511) T protein:vir:96 107 --LGNPIQ--YQ---DDDKDVL----EAIEAFND-LNDVESHNRSLGLDLSIYGKAYELMIRNQD--------------- 159 (511) T ss_pred --cccCce--ee---cCchHHH----HHHHHHHh-hcChhHHHHHHHHHHHhcCeeEEEEEeCCC--------------- Confidence 443333 22 2333332 34555553 456666777899999999999999887531 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+.+..++|.++| +|+.... .. ..+.+.+.+... . T Consensus 160 ------------------g~~~i~~~~p~~~~~v~dd~~~~---~~-~~~vr~~~~~~~----------------~---- 197 (511) T protein:vir:96 160 ------------------DETRLYKSDAMSTFIIYDNTVER---NS-IAGVRYLRTKPI----------------D---- 197 (511) T ss_pred ------------------CceEEEEEcccceEEEEcCCCCC---ce-EEEEEEEEeeec----------------c---- Confidence 23566778888887 3433211 11 222222211000 0 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCE---EE--ecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDV---IV--RLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~---~l--~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +.....+..+|+|.. +++ +.++..++. +. .....|.+.+..|++. -+++ T Consensus 198 -------------~~~~~~~~~~~vyt~-----~~i---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-----~~n~ 251 (511) T protein:vir:96 198 -------------KTDEDEVFTVDLFTS-----HGV---YRYLTNRTNGLKLTPRENSFESHSFERMPITE-----FSNN 251 (511) T ss_pred -------------ccccceEEEEEEEeC-----CcE---EEEEecCCCcccccccccccccCcCcccceEE-----ecCC Confidence 000112334455532 111 112221111 10 1112233335556553 3445 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecc-ccCcchhhhccCCcceEeCC------------CCCccc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKG-ALDQTNRKKFLAGANFEFNG------------TANDFW 370 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~-~i~~~d~~~~~p~~vi~~~~------------~~~~~~ 370 (663) .+|.|.++.++++++.+|...|.+.+.+...++|.+.+--. ..+..+......+.++...+ ++.... T Consensus 252 ~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:96 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGG 331 (511) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCccee Confidence 67899999999999999999999999998888876654221 12222222222333332211 112233 Q ss_pred cccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 371 HGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAY 450 (663) Q Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~l 450 (663) ++..+.-.......+..+.+.|..+|++++.+.+..++..|+.| +...............+.|.+ +++.++++++.+ T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~-~l~~~~~li~~~ 408 (511) T protein:vir:96 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA--MKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLETI 408 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 44433334556778888999999999999987775433334444 443333334444444445533 444555555554 Q ss_pred HHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhh Q lcl|NC_021532. 451 NAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPE 530 (663) Q Consensus 451 i~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e 530 (663) +....... .. .++. ++.+..+-..+....+..+. +..+...++ ...+... +....+ T Consensus 409 ~~~~~~~~----------~~---~~~~-~i~~~f~~~~p~n~~e~~d~----~~kl~G~iS----~et~l~~--l~~v~d 464 (511) T protein:vir:96 409 LKNTRSID----------AN---KDFN-TVRYVYNRNLPKSLIEELKA----YIDSGGKIS----QTTLMSL--FSFFQD 464 (511) T ss_pred HHhcCCCc----------cc---cccc-cceEEeCCCCCcCHHHHHHH----HHHHhccCC----hHHHHHh--CCCCCC Confidence 43211100 00 0110 11222222222222222222 222222222 1111111 111111 Q ss_pred hhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 531 QAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 531 ~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) ....++.+.. ++... ....+...............+-+.+....+.+ T Consensus 465 ~~~El~ri~~--------------E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 465 PELEVKKIEE--------------DEKES-IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHHHHHH--------------HHHHH-HHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 1111111110 00000 00000000000000000000000000000000 No 78 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.71 E-value=9.5e-16 Score=102.93 Aligned_cols=435 Identities=12% Similarity=0.083 Sum_probs=199.0 Q ss_pred cHHHHHHHHHHHHHH---------HH-----HHHHHHHHHHHHHHHHhcCCcCCccc-----cC----CCccccHHHHHH Q lcl|NC_021532. 4 NKAELLSALKADMKA---------AD-----VLKQEQDSLISTWKAEYNGEPYGNEQ-----KG----KSAIVSRDIKKQ 60 (663) Q Consensus 4 ~~~~~~~~l~~~~~~---------~~-----~~~~~~~~~~~~~~~~y~~~~~~~~~-----~g----~s~~~~~~i~~~ 60 (663) =-+-|...|+.-++. +. ....++......|++||.|+...... .+ +..+..|. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~---- 76 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNL---- 76 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecch---- Confidence 011122222222211 11 12344556678889999987432111 11 12233343 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) ...++..+.+.+++-.+.+.+ +|.+ .+++++.++. .+++...+.+++.+|+.+|.||++++||.. T Consensus 77 ~~~iv~~~a~~l~~ep~~i~~-----~d~~----~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~----- 141 (499) T protein:vir:80 77 PKVTAKYMSKLLFNEKVKINI-----DDET----AEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGN----- 141 (499) T ss_pred HHHHHHHHHHhhhCCcceEee-----CCHH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCcEEEEEEECCC----- Confidence 333344444445554444444 3444 4444555553 466777788899999999999999999731 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) +.+.+..|+|..||+-..-.+.+..|-|+-. .++++ T Consensus 142 ----------------------------~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~---~~~~~------------- 177 (499) T protein:vir:80 142 ----------------------------KNVKVSFATADCMYPLSNDSENVDECLIANS---FHKNN------------- 177 (499) T ss_pred ----------------------------CcEEEEEEcCCceEEEEecCCCeEEEEEEEE---EeecC------------- Confidence 3467888999998853211233444433211 11100 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEE-------CCEE----EecccCC---C- Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWI-------NDVI----VRLQSNP---Y- 285 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~-------g~~~----l~~~~~p---~- 285 (663) ..++.+|+|.+.+.. .+.-......|. |..+ +.....| + T Consensus 178 -------------------------~~y~~lE~h~~~~~~-~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~ 231 (499) T protein:vir:80 178 -------------------------KYYKLLEWNEWKGEK-EEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLP 231 (499) T ss_pred -------------------------eEEEEEEEEEecccc-eeeEEEEEEEEeccCccccCcccchhhhccCcCCceeec Confidence 012223332221100 000000000110 1100 0000001 1 Q ss_pred cCCCCCEEEEeee----eecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchh------hhcc Q lcl|NC_021532. 286 PDGKPPFLVVPFN----SIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNR------KKFL 355 (663) Q Consensus 286 ~~~~~Pf~~~~~~----~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~------~~~~ 355 (663) ..+++||+.++.. ...++++|.|++..++++.+.+|...+.+.+.+.. +..++.++.+.+..... ..+. T Consensus 232 ~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~ 310 (499) T protein:vir:80 232 SLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQYFD 310 (499) T ss_pred CCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-cccceecchhhhhccCCCCCCcccCCC Confidence 1246777766542 24577899999999999999999999999988865 56678887776642111 1111 Q ss_pred CC-cceEe---CCC--CCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 356 AG-ANFEF---NGT--ANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRM 429 (663) Q Consensus 356 p~-~vi~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~ 429 (663) ++ .++.. .++ +..+..+.+.-........++.+...+....|++....|...++ ..||+++....+..-.... T Consensus 311 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l~~~~~ 389 (499) T protein:vir:80 311 STDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKN 389 (499) T ss_pred cccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCccc-chhHHHHHHHHHHHHHHHH Confidence 11 11211 111 11233333333334456778888888889999999988876544 3578888765444444445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhc-c Q lcl|NC_021532. 430 NIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLG-P 508 (663) Q Consensus 430 ~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~-~ 508 (663) .+.+.|. ..++.+.+.++.+..-|.-- .|.. .. ...+.++..-+......+..+. .++..+ + T Consensus 390 ~~~~~~~-~~l~~l~~~il~~~~~~~~~------~~~~---~~----~~~v~v~f~d~i~~d~~~~~~~---~~~~~~~G 452 (499) T protein:vir:80 390 SHSQLIE-QGIKEMIVSILEVGKLIKAY------DGDT---VE----LDTITVDFDDSIAQDEDTTINR---YTTAKNQG 452 (499) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHhccc------cCCC---CC----ccceEEEeCCCCCCCHHHHHHH---HHHHHHcC Confidence 5556663 45666667776665443210 0000 00 0111222222222222222222 222221 2 Q ss_pred CCCcchhHHHHHHHHHhhhhhh--hhhhhhhhhcc-------hhhHHHHhhHHH Q lcl|NC_021532. 509 NEDPKIRRDIMADIMDLMRMPE--QAKRMREYEPK-------PDPVQEKIRQLE 553 (663) Q Consensus 509 ~~~p~~~~~~l~~~~~l~~~~e--~~~~l~~~~~~-------~~~~~~q~~q~~ 553 (663) .+.... . ++.+-+..+ ..+.+.++..+ +++.- ...+.+ T Consensus 453 i~S~et----~--l~~~~~~~d~ea~~el~~i~~E~~~~~~~~d~~g-~~ge~e 499 (499) T protein:vir:80 453 MIPLKI----A--LQRAWNITEAEADEWAEMLAKEKQAEIPNNDMTG-IFGEEE 499 (499) T ss_pred CCCHHH----H--HhhcCCCChHHHHHHHHHHHHHhhcCCCCCCccc-cCCCCC Confidence 222111 0 111111111 11111111100 00000 000000 No 79 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.71 E-value=8.8e-17 Score=108.60 Aligned_cols=434 Identities=10% Similarity=0.045 Sum_probs=191.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-------------ccCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-------------QKGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-------------~~g~s~~~~~~i~~~v~~~~~~ 67 (663) +..+.+.+-..|.+.+.. +..+...+.+..+||.|++.-+. .+...+++.|-.+..|+..+++ T Consensus 38 ~~~~~~~~~~~i~~~i~~----~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y 113 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQ----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 113 (492) T ss_pred CCCchhhHHHHHHHHHHH----HHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhh Confidence 222222223333333332 34455667778889999753211 1112346677777777766665 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+ .+. .+|.+.. +.++.++. |+....+.++.++++++|.|+..++++.+ T Consensus 114 l----~g~p~--~~~---~~d~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G~a~~~v~~d~d------------ 166 (492) T protein:vir:97 114 I----VGKPI--AFK---HTDDEVV----KRIDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLDEE------------ 166 (492) T ss_pred h----cccCc--eec---cCchHHH----HHHHHHHh--ccHHHHHHHHHHHHhhcCeEEEEEEecCC------------ Confidence 5 44332 232 2444433 34444442 45666777889999999999998876521 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.+.+.+++|.++|+ |++..+ +..+ +.+.+..... T Consensus 167 ---------------------g~~~~~~~~p~~~~~i~d~~~~~---~~~~-~vr~~~~~~~------------------ 203 (492) T protein:vir:97 167 ---------------------GEFKLFRVPAEQGIPIWTDKEHE---ELEA-FIRMYKLENE------------------ 203 (492) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCCC---ceEE-EEEEEeeccc------------------ Confidence 335677788888864 433222 1222 2222211000 Q ss_pred hhhhccccccccccccccccceEEEEE---EEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYE---YWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E---~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ..+++|. +++ +...+++. ........+...++..++ ..|..|++.+ .+ T Consensus 204 --------------------~~~~~y~~~~v~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~--~~g~vPvv~~-----~n 254 (492) T protein:vir:97 204 --------------------TKVEYWDKVTVNY-YVYENGSL-IPDYSNNLENSKTHFSTG--SWGKIPFIPF-----KN 254 (492) T ss_pred --------------------eeEEEEecCeEEE-EEEecCee-eecccccccccccccccC--CCCCcceEEe-----cC Confidence 0011111 000 00011110 000000011112222223 3356676644 34 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc--ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ--TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~--~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +.+|.|.++.++++++.+|.+.|.+.+.+...+.|.+.+ .|.-.. .+ .......+++.+..+++ ..++..+.-.. T Consensus 255 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~ 332 (492) T protein:vir:97 255 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYYGAIKVSDNGG-VDTIQVEVPVE 332 (492) T ss_pred CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccchhHHHHHhhccceecCCCCc-ceeEeccCCHH Confidence 557899999999999999999999999999998886655 342211 11 12234556676665544 44444343345 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .....++.+.+.|...|++++.+.+..++..|+.| +..+............+.|.. +++ .++.++..+... T Consensus 333 ~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~ka~~~~~~f~~-~l~----~~~~li~~~~~~-- 403 (492) T protein:vir:97 333 NSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKV-AIQ----ELLWFVFEHFDI-- 403 (492) T ss_pred HHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH--HHHHHHHHHHHHHHHHHHHHH-HHH----HHHHHHHHHhcC-- Confidence 56778899999999999999877664333334444 433333333333334444432 333 344444444321 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) .+ ++. ++++..+-..+....+..+ .++.+++.++ ...+... +....+....++.+ T Consensus 404 ----~~-~~~---------~i~v~f~~~~p~~~~e~a~----~~~kl~G~iS----~et~l~~--l~~v~d~~~Eleri- 458 (492) T protein:vir:97 404 ----KG-EHK---------DVDISFNYNKVANTELQVQ----TAQQSMGIVS----HETVLEN--HPFVEDLQAELERI- 458 (492) T ss_pred ----Cc-ccc---------eeeEEecCCCCCCHHHHHH----HHHHHhccCc----hHHHHHh--CCCCCCHHHHHHHH- Confidence 11 110 1222222222221222222 2222222221 1211111 11111111111110 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARK 598 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~ 598 (663) ..+..+. .+.++. .. ....+...+. .+..... ++ T Consensus 459 -------------~~E~~~~-~~~~~~-~~-------~~~~~~~~~~--~~~~~~~-~e 492 (492) T protein:vir:97 459 -------------EQEQTEY-NKQLPN-LD-------DGGADSAQQQ--ERSNNKE-SE 492 (492) T ss_pred -------------HHHHHHH-HHhhhc-cc-------cCCCCCCccc--ccccccc-cC Confidence 0000000 000000 00 0000000000 0000000 00 No 80 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.71 E-value=1.1e-15 Score=102.63 Aligned_cols=429 Identities=11% Similarity=0.046 Sum_probs=196.3 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc-------------------cCC--CccccHHHHHHH Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ-------------------KGK--SAIVSRDIKKQS 61 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-------------------~g~--s~~~~~~i~~~v 61 (663) |+-+.+...|...+.. +.++...+.+..+||.|++.-+.. +++ .++..|-....| T Consensus 1 ~~~e~~~~~i~~~~~~----~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 76 (471) T protein:vir:10 1 MEIEVIKKIISSQMVK----HGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLL 76 (471) T ss_pred CCHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHH Confidence 7777776666666544 344555678888899987521110 111 135555555555 Q ss_pred HHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 62 EWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 62 ~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) +..++++ ++..+. |. .+|.+.. +.++.++. ++.......+.+++.++|.|+..+++|.++ T Consensus 77 d~~~~yl----~G~p~~--~~---~~~~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~----- 136 (471) T protein:vir:10 77 DQKKAYA----LTYPPT--FD---VDDKKVN----DMIVDVLG--DDYERISKQLCVNAGNAGIAWLHVWKDASD----- 136 (471) T ss_pred Hhhhhhh----cccCce--ec---cCChHHH----HHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEEEeeCCC----- Confidence 5555544 554433 32 2333333 34455443 555666777889999999999999886321 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) +.+.+..++|..+| ||++. .+-...+.+.+.+.... T Consensus 137 ---------------------------g~~~~~~~~p~~~~~i~d~~~----~~~~~~~ir~~~~~~~~----------- 174 (471) T protein:vir:10 137 ---------------------------NSFRYACVDSKEVIPIYSKSL----DKKSIGVLRVYSSIDET----------- 174 (471) T ss_pred ---------------------------CeeEEEEEcccceEEEEcCCC----CCceEEEEEEEEeeccC----------- Confidence 23556778888775 33321 11112222233221110 Q ss_pred hhhccchhhhccccccccccccccccceEEEEEEEEEe-----eecCCceeE-E-----E-EEEEECCEEEecccCCCcC Q lcl|NC_021532. 220 LAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-----DVDGDGIAE-P-----I-VCAWINDVIVRLQSNPYPD 287 (663) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-----~~~~~g~~~-~-----~-~~~~~g~~~l~~~~~p~~~ 287 (663) ....+..+|+|... -..+.+... . . ......+.......-|... T Consensus 175 ------------------------~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (471) T protein:vir:10 175 ------------------------DGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDF 230 (471) T ss_pred ------------------------CCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCC Confidence 00112233333210 000000000 0 0 0000111222222223333 Q ss_pred CCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-c--CcchhhhccCCcceEeCC Q lcl|NC_021532. 288 GKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-L--DQTNRKKFLAGANFEFNG 364 (663) Q Consensus 288 ~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i--~~~d~~~~~p~~vi~~~~ 364 (663) |.+|++.+ +++..|.|.+..++++++.+|.+.|.+.+.+...++|.+++ .|. . ..........++.+.+.. T Consensus 231 g~iPvv~~-----~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~ 304 (471) T protein:vir:10 231 GLVPFIPF-----KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVL-TNYGGQDKQEFLEDLKRYKMIKMDN 304 (471) T ss_pred CceeEEEe-----ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccchhHHHhhcCCeEEecC Confidence 55665533 44567899999999999999999999999999999986655 332 1 112223344556666643 Q ss_pred CC----CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 365 TA----NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLV 440 (663) Q Consensus 365 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~ 440 (663) .+ ....++..+.-...+...++.+.+.|-..|++++...+..++ .|++| +..+...........-+.|.+ T Consensus 305 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn-~Sg~A--lk~~~~~l~~k~~~~~~~~~~--- 378 (471) T protein:vir:10 305 DGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGN-SSGVA--LKFLYSLLELKAGNMETQFRS--- 378 (471) T ss_pred CCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccC-ccHHH--HHHHHHHHHHHHHHHHHHHHH--- Confidence 22 234555545445667788899999999999998886664433 24444 444433333333333344432 Q ss_pred HHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHH Q lcl|NC_021532. 441 KPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMA 520 (663) Q Consensus 441 ~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~ 520 (663) .++.++.++..+... .+ + .++.+..+-..+....+..+ .++.++..+ ....+. T Consensus 379 --~l~~~~~li~~~~~~------~d--~---------~~i~i~f~~~~p~n~~e~~~----~~~kl~g~i----S~et~~ 431 (471) T protein:vir:10 379 --GYATLVKMILKHLGL------SD--K---------LKIKQTWTRNSINNDTEMAQ----VVSTLATIT----SRENVA 431 (471) T ss_pred --HHHHHHHHHHHHhcc------CC--C---------ceeEEEeCCCCCCCHHHHHH----HHHHHhccC----chHHHH Confidence 334444555554321 11 1 11222222222222222222 222222211 111111 Q ss_pred HHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 521 DIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 521 ~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) .. +....+....+ .+ ++++......+.. .... .-...+.+ T Consensus 432 ~~--~p~v~D~~~E~--------------er------------i~~E~~~~~~~~~-~~~~---~~~~~e~~ 471 (471) T protein:vir:10 432 KS--NPIVEDWQDEL--------------RL------------QKAEQEGRSEKLY-DMEE---VEHESEVE 471 (471) T ss_pred Hh--CCCCCCHHHHH--------------HH------------HHHHHHHHHhccc-ccCC---CCCccccC Confidence 11 11111000000 00 0000000000000 0000 00000000 No 81 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.71 E-value=4.6e-16 Score=104.67 Aligned_cols=433 Identities=10% Similarity=0.034 Sum_probs=195.2 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc---------------ccCC--CccccHHHHHHHHHHH Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE---------------QKGK--SAIVSRDIKKQSEWQH 65 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~---------------~~g~--s~~~~~~i~~~v~~~~ 65 (663) |.-+.+...|...... +...........+||.|++--+. ..++ .+++.|.....|+..+ T Consensus 1 ~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~ 76 (470) T protein:vir:10 1 MELDALKKLIQNTSTS----RNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (470) T ss_pred CchHHHHHHHHHHHHH----HHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhh Confidence 6666666655555444 45566667788899998652111 0111 2345555555555444 Q ss_pred HHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccc Q lcl|NC_021532. 66 ATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEA 145 (663) Q Consensus 66 ~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~ 145 (663) + .++++.+.+ . .+|.+..+.+.++ +. ++....+..+.++++++|.|+..+++|.. T Consensus 77 ~----yl~G~p~~~--~---~~d~~~~~~l~~~----~~--~~~~~~~~~l~~~~~~~G~a~~~~y~d~~---------- 131 (470) T protein:vir:10 77 G----YVASVFPDI--D---VGKDADNKKIIDV----LG--DDRALTLNGLLVDSSNAGRAWLHYWIDED---------- 131 (470) T ss_pred h----heeccceee--e---cCchHHHHHHHHH----Hh--hhHHHHHHHHHHHHhhcCeeEEEEEecCC---------- Confidence 4 445654333 2 2444444444444 32 23445566788899999999999988632 Q ss_pred cccCccccccccccccccceeecccceeeeccHHHheeC--cccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc Q lcl|NC_021532. 146 VVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLD--PTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT 223 (663) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~d--p~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~ 223 (663) +.+.+.+++|..+|+- ++.. ....++ .+.+.+.+. + T Consensus 132 -----------------------~~~~~~~~~p~~~~~v~d~~~~---~~~~a~-ir~y~~~~~------~--------- 169 (470) T protein:vir:10 132 -----------------------GNFRYGIIQPDQITPIYATTLD---NKLLGI-LRSYKQLDP------D--------- 169 (470) T ss_pred -----------------------CceEEEEEcccceEEEEcCCCC---CceEEE-EEEEEeeec------C--------- Confidence 2345667888877633 2211 111222 222221100 0 Q ss_pred cchhhhccccccccccccccccceEEEEEEEEEe-------eecCCceeEEEEEEEE--C---CEEEecccCCCcCCCCC Q lcl|NC_021532. 224 SGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-------DVDGDGIAEPIVCAWI--N---DVIVRLQSNPYPDGKPP 291 (663) Q Consensus 224 ~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-------~~~~~g~~~~~~~~~~--g---~~~l~~~~~p~~~~~~P 291 (663) ....+..+|+|... ...+......+..... . ...-.....|...|..| T Consensus 170 --------------------~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (470) T protein:vir:10 170 --------------------SGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (470) T ss_pred --------------------CceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeee Confidence 00112333443210 0000000000000000 0 00000111122224444 Q ss_pred EEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc--chhhhccCCcceEeCCCC--- Q lcl|NC_021532. 292 FLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ--TNRKKFLAGANFEFNGTA--- 366 (663) Q Consensus 292 f~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~--~d~~~~~p~~vi~~~~~~--- 366 (663) ++ +.+++.+|.|.++.++++++.+|.+.|.+.+.+...++|.+++--...+. ....+...++.+.+...+ T Consensus 230 vv-----~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 304 (470) T protein:vir:10 230 FI-----EFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGD 304 (470) T ss_pred EE-----EeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCc Confidence 44 44455679999999999999999999999999999999877764322221 223334455566664322 Q ss_pred -CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 367 -NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMR 445 (663) Q Consensus 367 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~ 445 (663) ....++..+.-.......++.+.+.|...|++++...+..+ ..|++| +..+...........-+.|. ..++ T Consensus 305 ~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~g-n~Sg~A--lk~~~~~l~~k~~~~~~~~~-----~~l~ 376 (470) T protein:vir:10 305 NSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESS-NASGVA--IKMLYSHLELKAAKTQTYFE-----HAIN 376 (470) T ss_pred CceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccc-cchHHH--HHHHHHHHHHHHHHHHHHHH-----HHHH Confidence 23455554554566778889999999999999988766433 234444 44444444444444444443 3344 Q ss_pred HHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHh Q lcl|NC_021532. 446 KWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDL 525 (663) Q Consensus 446 ~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l 525 (663) .++.+|..++.- .+.++ .+..+..+-..+....+..+ +++.+++.+ ....+... + T Consensus 377 ~~~~~i~~~l~~------~~~d~---------~~i~i~f~~~~p~d~~e~~~----~~~~~~g~i----S~et~l~~--~ 431 (470) T protein:vir:10 377 ELVRAIMRYLNF------SDADK---------RHISQHWTRTKVEDSLTKAQ----IVSTVANYS----SKEAVAKA--N 431 (470) T ss_pred HHHHHHHHHhcc------cCccc---------ceeeEEeccCCCCCHHHHHH----HHHHHhccC----cHHHHHHh--C Confidence 455555554421 11111 01122222222222222222 222222211 11111110 1 Q ss_pred hhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 MRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 526 ~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) ....+....++ +...++.+. .....+....... ..-+.+ T Consensus 432 p~v~D~~~E~e--------------ri~~E~~e~--~~~~~~~~~~~~~---~~dde~ 470 (470) T protein:vir:10 432 PIVDDWQQELK--------------DLAKDKEEN--DPYSNQADELNGK---GVNDEQ 470 (470) T ss_pred CCCCCHHHHHH--------------HHHHHHHHH--HHhhccccccCCC---CCCCCC Confidence 11111111111 111110000 0000000000000 000000 No 82 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.70 E-value=3.7e-16 Score=105.20 Aligned_cols=451 Identities=13% Similarity=0.047 Sum_probs=210.1 Q ss_pred CCCcHHHHHHHHHHH---------HHHH-----HHHHHHHHHHHHHHHHHhcCCcCCcc---ccC----CCccccHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKAD---------MKAA-----DVLKQEQDSLISTWKAEYNGEPYGNE---QKG----KSAIVSRDIKK 59 (663) Q Consensus 1 ~~~~~~~~~~~l~~~---------~~~~-----~~~~~~~~~~~~~~~~~y~~~~~~~~---~~g----~s~~~~~~i~~ 59 (663) |+|=.. |-..+++- ++.+ -..-.++....+.|++||.|++.... ..| +..+..|.-.. T Consensus 1 m~~~~~-~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~ 79 (508) T protein:vir:15 1 MGLIQR-IKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKT 79 (508) T ss_pred CChHHH-HHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHH Confidence 555211 11111110 0110 01123455667888999998753221 112 11223344333 Q ss_pred HHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeecccccee Q lcl|NC_021532. 60 QSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEV 139 (663) Q Consensus 60 ~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~ 139 (663) .++ .+.+.+|+-.+.+.|. +|+ ...+.|+.++. .|++...+.+++.+++..|.|+++++||. T Consensus 80 i~~----~~A~lv~~e~~~i~v~----~~~----~~~e~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~----- 141 (508) T protein:vir:15 80 AAR----RIASVVFNEKAEIHVK----DNN----EADKFLNDVLE-DNDFKNKFEEALEKGVALGGFAMRPYIDG----- 141 (508) T ss_pred HHH----HHHhhhhCCCceEEeC----Cch----HHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEEeC----- Confidence 333 3333344433344432 222 22344555553 46777778889999999999999999972 Q ss_pred cccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 140 TVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) +.+.++.|++..||+-..-..++..|-|+..... + + T Consensus 142 -----------------------------~~~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~-~--~------------ 177 (508) T protein:vir:15 142 -----------------------------NHIKIAWVRADQFYPLQSNTNDISEAAIASRTQR-T--E------------ 177 (508) T ss_pred -----------------------------CeeEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEe-e--c------------ Confidence 2356788899888852111223444433221111 0 0 Q ss_pred hhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC------CEEEecccCC--------- Q lcl|NC_021532. 220 LAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN------DVIVRLQSNP--------- 284 (663) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g------~~~l~~~~~p--------- 284 (663) +.....++.+|+|.+.+ ++.|.-+ ..+|.+ |..+....-| T Consensus 178 ----------------------~~~~~~yt~lE~h~~~~-~~~~~I~--n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~ 232 (508) T protein:vir:15 178 ----------------------SNQTKYYTLLEFHQWQD-NGSYQIT--NELYKSDSPDIVGNQVPLSTLPVYKELAPQV 232 (508) T ss_pred ----------------------CCCceEEEEEEEEEEec-CcceEEE--EEEEecCCchhcCcccchhhcccccCCCcce Confidence 00001233444443221 1111111 111111 0111000001 Q ss_pred --CcCCCCCEEEEee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchh--hhccC Q lcl|NC_021532. 285 --YPDGKPPFLVVPF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNR--KKFLA 356 (663) Q Consensus 285 --~~~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~--~~~~p 356 (663) ....++||+.++. ....++++|.|++.++++.++.+|..++++.+.+ ..+.+++.++++.+..+.. ..+.+ T Consensus 233 ~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v~~~~l~~d~~~~~~~~~ 311 (508) T protein:vir:15 233 TISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAVQPGMLRFDDEHKPTFDT 311 (508) T ss_pred EecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-HhcccceeechHHhcCCCCCccccCC Confidence 0123466765543 1234678999999999999999999999999988 5778889998888753221 12223 Q ss_pred Ccc-eE-eCCC---CCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 357 GAN-FE-FNGT---ANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNI 431 (663) Q Consensus 357 ~~v-i~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~ 431 (663) +.- +. ++.. +..+..+++.-....+...++.+...+....|++....|..+++ ..||+++....+..-.....+ T Consensus 312 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~-~~TAtei~s~~~~~~~t~~~~ 390 (508) T protein:vir:15 312 EQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDG-VKTATEVVSNNSMTYQTRSSY 390 (508) T ss_pred CCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCc-cccHHHHHHHHHHHHHHHHHH Confidence 322 21 2211 12233333333334567778888889999999999998876654 358999887666665666667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHHHhc-c Q lcl|NC_021532. 432 VRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQTLG-P 508 (663) Q Consensus 432 ~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~~~~-~ 508 (663) .+.|. ..++.+.+.++.+..-+.--. ......+.......+++.|+=+. .....+..+. .++..+ + T Consensus 391 ~~~~~-~al~~lv~~il~l~~~~~~~~-------~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~---~~~~v~aG 459 (508) T protein:vir:15 391 LTMVE-KAIDELCQSIFELANAGALFD-------DGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEE---DAKVLAIG 459 (508) T ss_pred HHHHH-HHHHHHHHHHHHHHHHhcccc-------ccccccccccccCCcceEEEeCCCCCCCHHHHHHH---HHHHHhcC Confidence 77774 467777777777665432111 00001111111112233333222 2222222222 222222 2 Q ss_pred CCCcchhHHHHHHHHHhhhhh--hhhhhhhhhhc---chhhHHHHhhHHHHHHHHHHHH Q lcl|NC_021532. 509 NEDPKIRRDIMADIMDLMRMP--EQAKRMREYEP---KPDPVQEKIRQLELENLMLENQ 562 (663) Q Consensus 509 ~~~p~~~~~~l~~~~~l~~~~--e~~~~l~~~~~---~~~~~~~q~~q~~~~~~q~~~~ 562 (663) .+.+... ++.+-+.. +..+.+..+.. ...+.........- -.- + T Consensus 460 i~s~e~~------i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g--~~g--e 508 (508) T protein:vir:15 460 ALSKQTF------LQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNG--GDG--E 508 (508) T ss_pred CCCHHHH------HHhcCCCChHHHHHHHHHHHHhccccCccccccccCCC--CCC--C Confidence 2222111 11111211 11111111111 00000000000000 000 0 No 83 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.70 E-value=1.5e-15 Score=101.93 Aligned_cols=449 Identities=12% Similarity=0.059 Sum_probs=193.0 Q ss_pred CCCcHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc------cccCC--CccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAE---LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN------EQKGK--SAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~------~~~g~--s~~~~~~i~~~v~~~~~~l~ 69 (663) +...++. ..+.|.+.++. .........+++.+||.|+.... ...++ .+++.|-..-.|+...+++ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~---h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl- 106 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINH---HKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL- 106 (502) T ss_pred ccchhhhccccHHHHHHHHHH---HHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhh- Confidence 1111110 01122222222 11222345677889999965321 11222 2455666566666655554 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) +++.. .|.. +|....+...++++.++. .|+....+..+.+++.++|.|++.+++|.+ T Consensus 107 ---~g~p~--~~~~---~d~~~~~~~~~~l~~~~~-~N~~~~~~~~~~~~~~~~G~a~~~v~~ded-------------- 163 (502) T protein:vir:48 107 ---AGNPI--RVEY---DDNEDNSQNDDAIKRIGR-INDIDTHNRNLIRDLSQTGRAYEVIYRSEY-------------- 163 (502) T ss_pred ---cccCe--eEec---CCccchhHHHHHHHHHHh-hcCHhHHHHHHHHHHhhcCeEEEEEEeCCC-------------- Confidence 44433 3333 222233455666666554 467777788899999999999999887521 Q ss_pred ccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) +.+.+..++|.++|+ |+.... +..+.+ +.+..... T Consensus 164 -------------------g~~~i~~~~p~~~~~vydd~~~~---~~~~~i-r~~~~~~~-------------------- 200 (502) T protein:vir:48 164 -------------------DETRIKRLSPLETFVIYDNSLED---NSIAAV-RYYNRGTL-------------------- 200 (502) T ss_pred -------------------CceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEEeec-------------------- Confidence 235567788888764 332211 122222 21110000 Q ss_pred hhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE 307 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~ 307 (663) ...+.++|+|... ..++....++.. .....|...|..|++.+ +++..|. T Consensus 201 -----------------~~~~~~~~iyt~~--------~i~~~~~~~~~~-~~~~~~~~~g~vPvv~~-----~nn~~g~ 249 (502) T protein:vir:48 201 -----------------QNAKDVVEIYTNQ--------HIYTLDASDSFN-EISVTPHAFGTVPITEF-----LNNADGI 249 (502) T ss_pred -----------------CCcEEEEEEEeCC--------eEEEEEeCCcee-eccceecCCCccceEEe-----cCCCCCC Confidence 0012345555331 112222222222 22233334467777644 3456789 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc--chhhhccCCcceEeCCC--------CCccccccCccc Q lcl|NC_021532. 308 ANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ--TNRKKFLAGANFEFNGT--------ANDFWHGSYNAI 377 (663) Q Consensus 308 g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~--~d~~~~~p~~vi~~~~~--------~~~~~~~~~~~~ 377 (663) |.++.++++++.+|...+.+.+.+...++|.+.+.-..... .........+.+.+..+ +..+.++..+.- T Consensus 250 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~ 329 (502) T protein:vir:48 250 GDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYD 329 (502) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCC Confidence 99999999999999999999999999988876653322211 11222223333433221 123344444433 Q ss_pred cHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 378 PSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 378 ~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) .......+..+...|...|++++.+.|..++..|+.| +...............+.|.+ +++.++++++.++..... T Consensus 330 ~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~- 405 (502) T protein:vir:48 330 VSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEA--LKYKLFGLDQDRVDTQSQFTQ-GLKRRYRLAARIGSLVNE- 405 (502) T ss_pred HHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccc- Confidence 3556677899999999999999887765433334544 333332333333333344432 334444444443322110 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMRE 537 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~ 537 (663) ...++. .++.+..+-..+....+..+ .+..+++.++ ...+...+ ....+....++. T Consensus 406 ----------~~~~d~----~~i~i~f~~~~p~d~~e~a~----~~~kl~g~iS----~et~l~~l--~~v~D~~~E~~r 461 (502) T protein:vir:48 406 ----------FKDFDE----SRLKITFTPNLPKSLYEQVS----ILNDLGGQVS----QETALSLS--GLVENPTEELDK 461 (502) T ss_pred ----------cccccc----ccceEEeCCCCCcCHHHHHH----HHHHHhccCc----HHHHHHhC--CCCCCHHHHHHH Confidence 000110 01122222222222222222 2222222222 11111111 111110011111 Q ss_pred hhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 538 YEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 538 ~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) +..++.+......................+......+--.| T Consensus 462 --------------i~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 462 --------------INEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred --------------HHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 11111000000000000000000000000000000000000 No 84 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.69 E-value=7.7e-16 Score=103.43 Aligned_cols=451 Identities=12% Similarity=0.044 Sum_probs=185.7 Q ss_pred CCCcHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc----c---ccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL----LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN----E---QKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~~~~~~~----~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~----~---~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) +..++.+. +..|.+.| ..+......+.+||.|++.-+ . +...-.++.|-..-.|+...+.|. T Consensus 6 ~~~~~~~~~~~~~~~L~~~~-------~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAF-------EDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCcccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhc Confidence 44433333 33333333 223455566778999986411 0 001112344555555565555441 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) .+ . |. ..++....+.+ +.++. .|+.......+..+++++|.|++.|+.+..... T Consensus 79 ---~~--g---~~--~~~~~~~~~~l----~~i~~-~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~----------- 132 (485) T protein:vir:24 79 ---VE--G---FR--LGDADEADEEL----WQWWQ-ANNLDIEAPLGYTDAYVHGRSYITISRPDPQID----------- 132 (485) T ss_pred ---cC--c---ee--cCCCchhHHHH----HHHHH-hcChhHHHHHHHHHHhhcCceEEEEecCCcccc----------- Confidence 11 1 11 22333333333 34343 466666678899999999999999987632110 Q ss_pred ccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) .....+.|.+..++|.+++ ||++.. .-.+.+++.+.+ T Consensus 133 --------------~~~~~~~~~i~~~~p~~~~~i~D~~~~----~~~~~~~~~~~~----------------------- 171 (485) T protein:vir:24 133 --------------LGWDPNVPLIRVEPPTRMYAEIDPRIG----RPAKAIRVAYDA----------------------- 171 (485) T ss_pred --------------cccCCCcceEEEeccceeEEEeeCCcC----ceeEEEEEEEee----------------------- Confidence 1112344567778888885 444321 111111111100 Q ss_pred hhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE 307 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~ 307 (663) ....+..+++|.. + +.+..+..++........|.+.|.+|++.++..+..++.||. T Consensus 172 ----------------~~~~~~~~~~y~~-----~---~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~ 227 (485) T protein:vir:24 172 ----------------EGNEIQAATLYTP-----N---ETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGT 227 (485) T ss_pred ----------------cCCeEEEEEEEcC-----C---cEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCc Confidence 0001222333321 1 111112222222222223444578999999988888999999 Q ss_pred ChHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc----cC--c---chhhhccCCcceEeCCCCCccccccCccc Q lcl|NC_021532. 308 ANAE-MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA----LD--Q---TNRKKFLAGANFEFNGTANDFWHGSYNAI 377 (663) Q Consensus 308 g~~~-~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~----i~--~---~d~~~~~p~~vi~~~~~~~~~~~~~~~~~ 377 (663) |-+. .++++++.+|+..+.+..++...+.|...+- |. +. + .......+|.++...++. ....+++. T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~q~~~- 303 (485) T protein:vir:24 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDPETGQTLFDAYLARILAFEDAE--GKIQQFSA- 303 (485) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc-cCCccccccccccccchhhhcccceeccCCCC--ceEEeecc- Confidence 9886 5899999999999999999999998876542 21 11 1 111234566666554322 22222222 Q ss_pred cHHHHHHHHHHHHHHHHH---hCCChHHcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 378 PSSAFDMISLMNNEIESI---TGTKSFSGGINSGS-LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAE 453 (663) Q Consensus 378 ~~~~~~~~~~~~~~~~~~---tGi~~~~~G~~~~~-~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q 453 (663) +.+...+..++..+..+ +++++...|..+.. .|+.| +..............-+.|.+ +++.+++ ++.. T Consensus 304 -~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A--l~~~~~~l~~ka~~~~~~f~~-~l~~~~~----l~~~ 375 (485) T protein:vir:24 304 -AELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEA--IRAAESRLIKKVERKNAIFGG-AWEEAMR----LAYR 375 (485) T ss_pred -cchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHH----HHHH Confidence 12334555555555555 67888888854322 23333 333332333333333444433 3333444 4433 Q ss_pred hcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhh Q lcl|NC_021532. 454 FLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQA 532 (663) Q Consensus 454 ~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~ 532 (663) +.... + ...+ +. .+.+...-.......+....+..+.+... ..+....+..+ .++. +.. T Consensus 376 ~~~~~------~---~~~d---~~-~i~v~f~~~~~~s~~~~ad~~~kl~~~g~----~~~s~et~~~~---l~~~~d~~ 435 (485) T protein:vir:24 376 LMKGG------D---VPPD---ML-RMETVWRDPSTPTYAAKADAATKLYGNGQ----GVIPRERARKD---MGYSIAER 435 (485) T ss_pred HhcCC------C---Cccc---cc-eeeEEecCCCCCCHHHHHHHHHHHHhccc----ccCCHHHHHhh---CCCCHhHH Confidence 32110 0 0000 00 11222221112222222222222222111 11112222111 1111 101 Q ss_pred hhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_021532. 533 KRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDA-ELKRSKAAVEKA 594 (663) Q Consensus 533 ~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~-~~~~~~~~~e~~ 594 (663) +.++....+..... ......+-......... ....+. .-+.+....+-+ T Consensus 436 ~e~~~~~ee~~~~~-----------~~~~~~~~~~~~~~~~~--~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 436 EEMRRWDEEEAAMG-----------LGLLGTMVDADPTVPGS--PNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred HHHHHHHHHHhhhh-----------hhHHHhhcccCCCCCCC--CCCCCCCCCccCCCCCCCC Confidence 11111000000000 00000000000000000 000000 000000000000 No 85 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.69 E-value=8e-16 Score=103.34 Aligned_cols=435 Identities=11% Similarity=0.040 Sum_probs=203.2 Q ss_pred CCCcHHHHHHHHHH--------HHHH----H-HHHHHHHHHHHHHHHHHhcCCcCCccccC-------CCccccHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKA--------DMKA----A-DVLKQEQDSLISTWKAEYNGEPYGNEQKG-------KSAIVSRDIKKQ 60 (663) Q Consensus 1 ~~~~~~~~~~~l~~--------~~~~----~-~~~~~~~~~~~~~~~~~y~~~~~~~~~~g-------~s~~~~~~i~~~ 60 (663) |+|-+. |-..++. .+.. . -....++....+.|+.||.|+.......+ +.....|.- T Consensus 1 m~~~~~-~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~--- 76 (500) T protein:vir:30 1 MGVIQK-IKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIA--- 76 (500) T ss_pred CchHHH-HHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchH--- Confidence 555321 1111111 1111 0 01234456678889999998755432221 112222332 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) ..++..+.+.+|+-.+.+.+ +|+. .+++++.++. .|++...+.+++.+++..|.++++++||. T Consensus 77 -~~i~~~~A~lv~~e~~~i~~-----~d~~----~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~------ 139 (500) T protein:vir:30 77 -RTAAKKIASLVFNEQAEIKV-----DDDA----ANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG------ 139 (500) T ss_pred -HHHHHHHhhhhcCCcceEec-----CChH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC------ Confidence 23333334444554444444 3444 4445555553 46777788899999999999999999962 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) +.|.++.|++..||+-..-...+..+-++++ ...+... +. T Consensus 140 ----------------------------~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~-~~~~~~~--~~--------- 179 (500) T protein:vir:30 140 ----------------------------DKVRVAFVQAPVFLPLQSNTQDVSSAAVVIK-SVKTING--KE--------- 179 (500) T ss_pred ----------------------------CceEEEEEcCCeeEEEEEcCCCeEEEEEEEE-EeeeecC--Cc--------- Confidence 2356778888888752111112222222211 1111000 00 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCcee-E--EEEEE---EECCEE--------EecccCCCc Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIA-E--PIVCA---WINDVI--------VRLQSNPYP 286 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~-~--~~~~~---~~g~~~--------l~~~~~p~~ 286 (663) ..++.+|+|...+ +.+.. + .++.. ..|..+ |........ T Consensus 180 -------------------------~~yt~lE~h~~~~--~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 232 (500) T protein:vir:30 180 -------------------------VYYTLIEFHEWQS--SDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTD 232 (500) T ss_pred -------------------------eEEEEEEEEEEeC--CceeEEEEEEEecccccccCcccccccccCCcCcceEecc Confidence 0122333332211 00000 0 00000 001100 000000011 Q ss_pred CCCCCEEEEee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcch-hhh-------- Q lcl|NC_021532. 287 DGKPPFLVVPF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTN-RKK-------- 353 (663) Q Consensus 287 ~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d-~~~-------- 353 (663) ..++||+++.. ....++++|.|++.++++..+.+|...+++.+.+.. +..++.++.+.+.... ... T Consensus 233 ~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~ 311 (500) T protein:vir:30 233 VTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPR 311 (500) T ss_pred CCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcc Confidence 23456665432 224578899999999999999999999999988865 6668888887764221 111 Q ss_pred ccCCc-ceE-eCCC---CCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHH Q lcl|NC_021532. 354 FLAGA-NFE-FNGT---ANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRR 428 (663) Q Consensus 354 ~~p~~-vi~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l 428 (663) +.++. ++. ++.. +..+..+.+.-....+...++.+...+....|++....|...++ ..||+++....+..-... T Consensus 312 ~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:30 312 FESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMR 390 (500) T ss_pred cCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHH Confidence 11111 111 2211 12233333222234466778888888888899999988876654 358999887666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHH Q lcl|NC_021532. 429 MNIVRNIAENLVKPLMRKWMAYNAEF--LEEEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQ 504 (663) Q Consensus 429 ~~~~~~~~~~~~~~l~~~~~~li~q~--~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~ 504 (663) ..+.+.+. ..++.|.+.++.+..-+ +... .....++.|+=+. .....+..+..+.+.+ T Consensus 391 ~~~~~~~~-~al~~lv~~il~~~~~~~~~~~~-----------------~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~ 452 (500) T protein:vir:30 391 NSIVALVE-QSLKELVISIFEIAKAYDLYQSE-----------------VPSMDNISISLDDGVFTDRDAELDYWIKVVN 452 (500) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHhhcCCC-----------------CCCCcceEEEeCCCCCCCHHHHHHHHHHHHH Confidence 66777774 46677777777655432 2211 0112223333222 2222222222222211 Q ss_pred HhccCCCcchhHHHHHHHHHhhhhhh--hhhhhhhhhcch-------hhHHHHhhH Q lcl|NC_021532. 505 TLGPNEDPKIRRDIMADIMDLMRMPE--QAKRMREYEPKP-------DPVQEKIRQ 551 (663) Q Consensus 505 ~~~~~~~p~~~~~~l~~~~~l~~~~e--~~~~l~~~~~~~-------~~~~~q~~q 551 (663) ++.+++... ++.+-+..+ ..+.+.+.+.+. ++....-.+ T Consensus 453 --aGi~s~~~~------i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 453 --AGFGTREMA------IQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred --cCCCCHHHH------HHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 122222211 111212211 111111111100 000000000 No 86 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.69 E-value=8e-16 Score=103.34 Aligned_cols=435 Identities=11% Similarity=0.040 Sum_probs=203.2 Q ss_pred CCCcHHHHHHHHHH--------HHHH----H-HHHHHHHHHHHHHHHHHhcCCcCCccccC-------CCccccHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKA--------DMKA----A-DVLKQEQDSLISTWKAEYNGEPYGNEQKG-------KSAIVSRDIKKQ 60 (663) Q Consensus 1 ~~~~~~~~~~~l~~--------~~~~----~-~~~~~~~~~~~~~~~~~y~~~~~~~~~~g-------~s~~~~~~i~~~ 60 (663) |+|-+. |-..++. .+.. . -....++....+.|+.||.|+.......+ +.....|.- T Consensus 1 m~~~~~-~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~--- 76 (500) T protein:vir:98 1 MGVIQK-IKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIA--- 76 (500) T ss_pred CchHHH-HHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchH--- Confidence 555321 1111111 1111 0 01234456678889999998755432221 112222332 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) ..++..+.+.+|+-.+.+.+ +|+. .+++++.++. .|++...+.+++.+++..|.++++++||. T Consensus 77 -~~i~~~~A~lv~~e~~~i~~-----~d~~----~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~------ 139 (500) T protein:vir:98 77 -RTAAKKIASLVFNEQAEIKV-----DDDA----ANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG------ 139 (500) T ss_pred -HHHHHHHhhhhcCCcceEec-----CChH----HHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC------ Confidence 23333334444554444444 3444 4445555553 46777788899999999999999999962 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKL 220 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~ 220 (663) +.|.++.|++..||+-..-...+..+-++++ ...+... +. T Consensus 140 ----------------------------~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~-~~~~~~~--~~--------- 179 (500) T protein:vir:98 140 ----------------------------DKVRVAFVQAPVFLPLQSNTQDVSSAAVVIK-SVKTING--KE--------- 179 (500) T ss_pred ----------------------------CceEEEEEcCCeeEEEEEcCCCeEEEEEEEE-EeeeecC--Cc--------- Confidence 2356778888888752111112222222211 1111000 00 Q ss_pred hhccchhhhccccccccccccccccceEEEEEEEEEeeecCCcee-E--EEEEE---EECCEE--------EecccCCCc Q lcl|NC_021532. 221 AKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIA-E--PIVCA---WINDVI--------VRLQSNPYP 286 (663) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~-~--~~~~~---~~g~~~--------l~~~~~p~~ 286 (663) ..++.+|+|...+ +.+.. + .++.. ..|..+ |........ T Consensus 180 -------------------------~~yt~lE~h~~~~--~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 232 (500) T protein:vir:98 180 -------------------------VYYTLIEFHEWQS--SDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTD 232 (500) T ss_pred -------------------------eEEEEEEEEEEeC--CceeEEEEEEEecccccccCcccccccccCCcCcceEecc Confidence 0122333332211 00000 0 00000 001100 000000011 Q ss_pred CCCCCEEEEee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcch-hhh-------- Q lcl|NC_021532. 287 DGKPPFLVVPF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTN-RKK-------- 353 (663) Q Consensus 287 ~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d-~~~-------- 353 (663) ..++||+++.. ....++++|.|++.++++..+.+|...+++.+.+.. +..++.++.+.+.... ... T Consensus 233 ~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~ 311 (500) T protein:vir:98 233 VTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPR 311 (500) T ss_pred CCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcc Confidence 23456665432 224578899999999999999999999999988865 6668888887764221 111 Q ss_pred ccCCc-ceE-eCCC---CCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHH Q lcl|NC_021532. 354 FLAGA-NFE-FNGT---ANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRR 428 (663) Q Consensus 354 ~~p~~-vi~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l 428 (663) +.++. ++. ++.. +..+..+.+.-....+...++.+...+....|++....|...++ ..||+++....+..-... T Consensus 312 ~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g-~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:98 312 FESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKS-MKTATEIVSENSDTYQMR 390 (500) T ss_pred cCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCc-cccHHHHHHHHHHHHHHH Confidence 11111 111 2211 12233333222234466778888888888899999988876654 358999887666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHH Q lcl|NC_021532. 429 MNIVRNIAENLVKPLMRKWMAYNAEF--LEEEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQ 504 (663) Q Consensus 429 ~~~~~~~~~~~~~~l~~~~~~li~q~--~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~ 504 (663) ..+.+.+. ..++.|.+.++.+..-+ +... .....++.|+=+. .....+..+..+.+.+ T Consensus 391 ~~~~~~~~-~al~~lv~~il~~~~~~~~~~~~-----------------~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~ 452 (500) T protein:vir:98 391 NSIVALVE-QSLKELVISIFEIAKAYDLYQSE-----------------VPSMDNISISLDDGVFTDRDAELDYWIKVVN 452 (500) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHhhcCCC-----------------CCCCcceEEEeCCCCCCCHHHHHHHHHHHHH Confidence 66777774 46677777777655432 2211 0112223333222 2222222222222211 Q ss_pred HhccCCCcchhHHHHHHHHHhhhhhh--hhhhhhhhhcch-------hhHHHHhhH Q lcl|NC_021532. 505 TLGPNEDPKIRRDIMADIMDLMRMPE--QAKRMREYEPKP-------DPVQEKIRQ 551 (663) Q Consensus 505 ~~~~~~~p~~~~~~l~~~~~l~~~~e--~~~~l~~~~~~~-------~~~~~q~~q 551 (663) ++.+++... ++.+-+..+ ..+.+.+.+.+. ++....-.+ T Consensus 453 --aGi~s~~~~------i~~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 453 --AGFGTREMA------IQKVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred --cCCCCHHHH------HHhcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 122222211 111212211 111111111100 000000000 No 87 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.68 E-value=1.5e-15 Score=101.91 Aligned_cols=410 Identities=12% Similarity=0.048 Sum_probs=180.2 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCCccccHHHHHH---HHHHHHHHHHhhcCCCceE Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQ---SEWQHATIVDPFVSTADII 79 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~---v~~~~~~l~~~~~~~~~~~ 79 (663) |+...|- .|...+.+ .........+||.|++.-+ .-| .-+++..+.. +-......+.++... . T Consensus 1 m~~~~i~-~L~~~~~~-------~~~r~~~~~~yy~g~~~~~-~~~--~~~p~~~~~~~~~v~nw~~~~Vd~~a~r---l 66 (422) T protein:vir:97 1 MNYMGMG-YLRRKLAL-------FKTGVDKRYRYYAMDDRDD-TRS--IVMPNNVREMYRSVLEWTAKGVDSLADR---I 66 (422) T ss_pred CChHHHH-HHHHHHHH-------HHHHHHHHHHHHhcCCChh-hcC--ccccHHHHHHHHhhcchhHHHHHHHHhc---c Confidence 5554443 33333333 2234566778999976421 111 1122222211 111112222222211 1 Q ss_pred EEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccccccc Q lcl|NC_021532. 80 KCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQ 159 (663) Q Consensus 80 ~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (663) .|.+.+-+|.+ +..++. .|+.......++++++++|+|++.|+.+..+ T Consensus 67 ~~~Gf~~~d~~--------l~~~w~-~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~----------------------- 114 (422) T protein:vir:97 67 IFREFTNDDFN--------AWEIFK-ANNPDIFFDTAIQSALIASCCFVYIMPGAED----------------------- 114 (422) T ss_pred ccceeeCCchh--------HHHHHH-hcChHHHHHHHHHHHHHhcceeEEEeeCCCC----------------------- Confidence 22233333432 234454 4666667778999999999999998754211 Q ss_pred ccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccccc Q lcl|NC_021532. 160 EVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTE 237 (663) Q Consensus 160 ~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (663) +.|.+..++|.+++ |||... .+ ..+.+.+ . .+. + T Consensus 115 ---------~~p~i~~~sp~~~~~i~D~~~~-~~----~~a~~~~-~----------~~~-------------------~ 150 (422) T protein:vir:97 115 ---------GLPKMQVIEASKATGILDPTTF-LL----TEGYAIL-E----------SDS-------------------N 150 (422) T ss_pred ---------CeeEEEEechhhEEEEEeCCCC-cc----eeeEEEE-E----------ecC-------------------C Confidence 23456778888775 555321 11 1111110 0 000 0 Q ss_pred ccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH-HHHHHH Q lcl|NC_021532. 238 FQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA-EMIGDN 316 (663) Q Consensus 238 ~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~-~~~~d~ 316 (663) . .+....+|. ++. +.++.++......++|+ |.+|+++++..+..+++||.|-+ +.++++ T Consensus 151 -------~-~~~~~~~~~------~~~----~~~~~~~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l 210 (422) T protein:vir:97 151 -------G-NPTLEAYFT------DKD----IWYYPKKGKPYNIKNPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYH 210 (422) T ss_pred -------C-cEEEEEEEc------Cce----EEEEcCCCccccccCCC--CCcceEEecccCCCccccCccccchhHHHH Confidence 0 000010110 000 00001111111225655 67899999999999999999976 889999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEeecccc---CcchhhhccCCcceEeCCC--CCccccccCcccc-HHHHHHHHHHHH Q lcl|NC_021532. 317 QKVKTAVIRGIIDNMAQSNNGQVAIRKGAL---DQTNRKKFLAGANFEFNGT--ANDFWHGSYNAIP-SSAFDMISLMNN 390 (663) Q Consensus 317 Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i---~~~d~~~~~p~~vi~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~ 390 (663) |+.+|+.+..++......+.|+..+ .|.- +..+......+.++.+..+ +..+...+++.-. ..+...+..+.. T Consensus 211 ~da~~r~~~~~~~~~e~~a~pqr~i-~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~ 289 (422) T protein:vir:97 211 QKAAKRTLERAEVTAEFYSFPQKYV-LGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYAS 289 (422) T ss_pred HHHHHHHHHHHHHHHHHhcchhhhh-cccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHH Confidence 9999999999999999999988654 2221 1122333455677766432 2223333333222 223445555666 Q ss_pred HHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeec Q lcl|NC_021532. 391 EIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVP 470 (663) Q Consensus 391 ~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~ 470 (663) .+-.+||+|....|..++. +.+|.++......-........+.|.. ..+.++++++ ...+.... T Consensus 290 ~~a~~s~lP~~~lg~~~~N-psSa~Ai~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~----~~~~~~~~---------- 353 (422) T protein:vir:97 290 LFAGGSGLTLDDLGFPSDN-PSSVESIKAAHENLRAAGRKAQRSFSS-GFLNVAYIAV----CLRDEFPY---------- 353 (422) T ss_pred HHhcccCCCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH----HHhcCCcc---------- Confidence 6666789999999976532 123333443222222222333334432 2333444433 32221110 Q ss_pred cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhh Q lcl|NC_021532. 471 IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIR 550 (663) Q Consensus 471 i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~ 550 (663) .+..+. +..+.-.-.......+..+. +..+..+.+..+.......+...+-+.+.......+.+.... T Consensus 354 -~~~~~~-~~~~~w~p~~~~~~~s~a~~-aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~~~~~~~~~d--------- 421 (422) T protein:vir:97 354 -LRNQFM-DTVIKWEPLFEADANMLTLV-GDGAIKLNQAIPGFMDADVIRDLTGVKGADKPIPAITEVTTD--------- 421 (422) T ss_pred -cchhhc-cceEEEccCCCCChHHHHHH-HHHHHHHHhhccccccHHHHHHHcCCCchhHHHHHHHhhhcc--------- Confidence 011111 11111110001111111111 111222222222222233332222111111111111111100 Q ss_pred HHHHHHHHH Q lcl|NC_021532. 551 QLELENLML 559 (663) Q Consensus 551 q~~~~~~q~ 559 (663) . T Consensus 422 --------~ 422 (422) T protein:vir:97 422 --------G 422 (422) T ss_pred --------C Confidence 0 No 88 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.68 E-value=1.3e-15 Score=102.11 Aligned_cols=426 Identities=13% Similarity=0.020 Sum_probs=191.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----ccCC--CccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----QKGK--SAIVSRDIKKQSEWQHATIVDPFV 73 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----~~g~--s~~~~~~i~~~v~~~~~~l~~~~~ 73 (663) ++++.+++...|.+.. ..+...++++.+||.|++.-+. ..++ .+++.|..+-.|+...+++ + T Consensus 13 ~~~~~~~~~~~i~~~~-------~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l----~ 81 (489) T protein:vir:99 13 SKLWIDQLKNYISRFK-------AEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYM----L 81 (489) T ss_pred CCCCHHHHHHHHHHHH-------HHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhh----c Confidence 8888888777766542 2233457788899998763211 1122 2466676666666666554 4 Q ss_pred CCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccc Q lcl|NC_021532. 74 STADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGN 153 (663) Q Consensus 74 ~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~ 153 (663) +..+. |.+ +|+. ..+.++.++. .|+.......+.++++++|.|+..+++..... T Consensus 82 g~~~~--~~~---~d~~----~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d---------------- 135 (489) T protein:vir:99 82 GVPVE--YKN---ENKD----LQAAIDLMSV-RNNEDYHNVKIKTDLSIYGRAYELLTVEKIDD---------------- 135 (489) T ss_pred cCCce--eec---CChh----HHHHHHHHHh-hcChhHHHHHHHHHHhhCCeEEEEEeeccCcC---------------- Confidence 43332 332 3333 3445656554 35655667789999999999998887642110 Q ss_pred ccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcc Q lcl|NC_021532. 154 ETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYD 231 (663) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 231 (663) ..+.+.+..++|.++|+ |+... .+..+.++ ++.. +. T Consensus 136 -------------~~~~~~i~~~~p~~~~~v~dd~~~---~~~~~~i~-~~~~-~~------------------------ 173 (489) T protein:vir:99 136 -------------KKTEVKLYQLPAEQTFVIYDDTYQ---RNSLMAVH-FYDI-DY------------------------ 173 (489) T ss_pred -------------CCcceEEEEEcccceEEEEcCCCC---CceEEEEE-EEEE-ec------------------------ Confidence 12446677888888753 32221 11222222 2210 00 Q ss_pred ccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC--C-EEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 232 SPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN--D-VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 232 ~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g--~-~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) .....+.++++|.. +.+..+. ....+ + .+.. ..|.+.|.+|++.++ ++..|.| T Consensus 174 -----------~~~~~~~~~~~y~~-----~~i~~~~-~~~~~~~~~~~~~--~~~~~~g~vPvv~~~-----n~~~~~s 229 (489) T protein:vir:99 174 -----------GSGKRKQIIKAYTS-----DTIYTYE-DYNLETKGMRLKD--YEGHFFKGVPVNEYA-----NNEERTG 229 (489) T ss_pred -----------CCCceEEEEEEEeC-----CcEEEEE-ecCCCcccceecc--cccccCCceeEEEee-----cCCCCCC Confidence 00011334444422 1111111 11001 1 1222 223333667776543 3456889 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Ccch------hhhccCCc------------ceEeCCCCC-- Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQTN------RKKFLAGA------------NFEFNGTAN-- 367 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~d------~~~~~p~~------------vi~~~~~~~-- 367 (663) .+..++++++.+|...+.+.+.+...+++.+.+ .|.. ...+ .....+++ ++.+.+.+. T Consensus 230 ~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (489) T protein:vir:99 230 AYESVLDNIDAYDLSQSELANFQQDSVNALLVI-AGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPN 308 (489) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhh-ccCCcccccchhhhhhcccccccccccccccccceeeeeccccCcc Confidence 999999999999999999999998888877654 3321 1111 11111221 222222211 Q ss_pred ----ccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 368 ----DFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPL 443 (663) Q Consensus 368 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l 443 (663) ...++..+.-.......+..+...+...||+++.+.+..++..|+.| +...............+.|. .+++.+ T Consensus 309 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~~-~~l~~~ 385 (489) T protein:vir:99 309 GVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGES--MKYKLMASDNYREKQERLFK-KGLMRR 385 (489) T ss_pred ccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 12223323333455667888899999999999876543222224444 33333333333333444443 244444 Q ss_pred HHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHH Q lcl|NC_021532. 444 MRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIM 523 (663) Q Consensus 444 ~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~ 523 (663) +++++.++..... ..+......++.+..+-+.+....+..+.+.. +...++... .... T Consensus 386 ~~li~~~~~~~~~-------------~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~k----l~giis~et----~~~~- 443 (489) T protein:vir:99 386 LRLAANIWAIKGN-------------EATTYSLVNDTSIVFTPNLPQNDNEIVTAAQN----LYGIVSDQT----IFEI- 443 (489) T ss_pred HHHHHHHHhhcCC-------------ccccccccccceEEeCCCCCcCHHHHHHHHHH----HhccCCHHH----HHHh- Confidence 4544444322110 00100000112222222222222222222222 222222211 1111 Q ss_pred Hhhhh--hhhhhhhhhhhcchhhHH------------HHhhHHHHHH Q lcl|NC_021532. 524 DLMRM--PEQAKRMREYEPKPDPVQ------------EKIRQLELEN 556 (663) Q Consensus 524 ~l~~~--~e~~~~l~~~~~~~~~~~------------~q~~q~~~~~ 556 (663) +..+ ++....++.+..+.+... .+..+.+.++ T Consensus 444 -l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 444 -LNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred -cCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 1111 011111111110000000 0000000000 No 89 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.68 E-value=6.7e-16 Score=103.78 Aligned_cols=437 Identities=11% Similarity=0.065 Sum_probs=190.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccCC--CccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKGK--SAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g~--s~~~~~~i~~~v~~~~~~ 67 (663) ..-+.+.+...|.+.+.. +.++...+.++.+||.|++.-+. ...+ .+++.|-.+..|+..+++ T Consensus 38 ~~~~~~~~~~~i~~~i~~----~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y 113 (492) T protein:vir:94 38 TNNKPETLEEMIVRYIKQ----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 113 (492) T ss_pred cCCchhhHHHHHHHHHHH----HHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhh Confidence 222222233333333332 33445667788899998752111 1111 245666666666766665 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+. | +.+|.+..+. ++.++. ++.......+.++++++|.|++.+++|.+ T Consensus 114 l----~G~p~~--~---~~~d~~~~~~----l~~~~~--n~~~~~~~~~~~~a~~~G~a~~~v~~d~d------------ 166 (492) T protein:vir:94 114 I----VGKPIA--F---KHTDDEVVKR----IDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLDEE------------ 166 (492) T ss_pred h----cccCce--e---ccCchHHHHH----HHHHHh--ccHHHHHHHHHHHHhhCCeEEEEEEecCC------------ Confidence 4 444333 2 2244444333 444442 45566677899999999999999887521 Q ss_pred cCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSG 225 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 225 (663) +.+.+.+++|.++| ||++..+. ..+. .+.+.. ++ . T Consensus 167 ---------------------g~~~~~~~~p~~~~~v~d~~~~~~---~~a~-ir~~~~-~~-------~---------- 203 (492) T protein:vir:94 167 ---------------------GEFKLFRVPAEQGIPIWTDKEHEE---LEAF-IRMYKL-EN-------E---------- 203 (492) T ss_pred ---------------------CceEEEEEcccceEEEEcCCCCCc---eEEE-EEEEee-cc-------c---------- Confidence 33567778888875 34433222 2222 222211 00 0 Q ss_pred hhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCccc Q lcl|NC_021532. 226 EDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLH 305 (663) Q Consensus 226 ~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~ 305 (663) ......+ ...|..+++ .+++. ........+...++..++ +.|..|++.+ .++.+ T Consensus 204 ----------~~~~~y~--~~~v~~~~~------~~~~~-~~~~~~~~~~~~~~~~~~--~~g~vPvv~~-----~nn~~ 257 (492) T protein:vir:94 204 ----------TKVEYWD--KVTVNYYVY------ENGSL-IPDYSNNLENSKTHFSTG--SWGKIPFIPF-----KNNDL 257 (492) T ss_pred ----------eeEEEEe--cCeEEEEEE------ecCee-eecccccccccccccccc--CCCccceEEe-----cCCCC Confidence 0000000 001111111 11110 000000011112222233 3356676643 34557 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc--h-hhhccCCcceEeCCCCCccccccCccccHHHH Q lcl|NC_021532. 306 GEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT--N-RKKFLAGANFEFNGTANDFWHGSYNAIPSSAF 382 (663) Q Consensus 306 g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~--d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~ 382 (663) |.|.++.++++++.+|.+.|.+.+.+...++|.+.+ .|.-... + .......+++.+..+++ ..++..+.-...+. T Consensus 258 ~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~ 335 (492) T protein:vir:94 258 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-KNYDDQELPEFKRLLRYYGAIKVSDNGG-VDTIQVEVPVENSK 335 (492) T ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcccchhhHHHHhhccceecCCCCc-ceeEeccCCHHHHH Confidence 899999999999999999999999999999987665 3432111 1 12234455666655443 44444343345567 Q ss_pred HHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021532. 383 DMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIR 462 (663) Q Consensus 383 ~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ir 462 (663) ..++.+.+.|..+|++++.+.+.-++..|+.| +...............+.|.. +++.+ +.++..+... T Consensus 336 ~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~f~~-~l~~~----~~li~~~~~~----- 403 (492) T protein:vir:94 336 KYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA--LEFLYTNLNLKADKLARKAKV-AIQEL----LWFVFEHFDI----- 403 (492) T ss_pred HHHHHHHHHHHHHhCCcCCCccccccCchHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHH----HHHHHHHhcC----- Confidence 78899999999999999877664333334444 333333333333444444432 33444 4444444321 Q ss_pred EecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcch Q lcl|NC_021532. 463 VTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKP 542 (663) Q Consensus 463 i~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~ 542 (663) .+ ++ .++.+..+-+.+....+..+ .+..+.+.++ ...+... +....+....++.+ T Consensus 404 -~~-~~---------~~i~v~f~~~~p~~~~e~~~----~~~kl~giiS----~et~~~~--l~~v~d~~~E~eri---- 458 (492) T protein:vir:94 404 -KG-EH---------KDVDISFNYNKVANTELQVQ----TAQQSMGIVS----HETVLEN--HPFVEDLQAELERI---- 458 (492) T ss_pred -Cc-cc---------ceeeEEecCCCCCCHHHHHH----HHHHHhccCc----hHHHHHh--CCCCCCHHHHHHHH---- Confidence 11 11 11222222222222222222 2222222221 1221111 11111111111110 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE 592 (663) Q Consensus 543 ~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e 592 (663) ..++.+. ++.++.......... . .+. +....+.| T Consensus 459 ----------~~E~~~~-~~~~~~~~~~~~~~~-~--~~~--~~~~~e~e 492 (492) T protein:vir:94 459 ----------EQEQMEY-NKQLPNLDDGGADSA-Q--QQE--RSNNKESE 492 (492) T ss_pred ----------HHHHHHH-HhhccccccccCCCC-c--ccc--CCccccCC Confidence 0000000 000000000000000 0 000 00000000 No 90 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.67 E-value=8.5e-16 Score=103.22 Aligned_cols=433 Identities=10% Similarity=0.068 Sum_probs=191.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccC--CCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKG--KSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g--~s~~~~~~i~~~v~~~~~~ 67 (663) +..+-+.+-+.|...++. +..+.....+..+||.|++.-+. ... ..+++.|...-.|+..+++ T Consensus 21 ~~~~~~~~~~~i~~~i~~----~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y 96 (474) T protein:vir:95 21 MKPKVETQEEMIIRLINN----HKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSY 96 (474) T ss_pred ccccccchHHHHHHHHHH----HHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhh Confidence 333333333333333333 44455667778889998752111 111 1245666666666665555 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+. +.+ +|.+.. +.++.++. ++....+..+.++++++|.|+..++++.. T Consensus 97 l----~g~p~~--~~~---~~~~~~----~~l~~~~~--n~~~~~~~~l~~~~~~~G~~~~~~~~d~~------------ 149 (474) T protein:vir:95 97 V----AGKPVT--YAH---DDDKVL----DVIHQVLD--TRWDNKLIDILTAASNKGIDWLQVYINED------------ 149 (474) T ss_pred h----cccCce--ecc---CChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEeeeCCC------------ Confidence 4 454333 322 333332 34444443 56667778899999999999999887632 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) +.+.+..++|.++|+-.+-. ...+.. .+.+.+.... T Consensus 150 ---------------------~~~~i~~~~p~~~~~v~d~~-~~~~~~-a~ir~~~~~~--------------------- 185 (474) T protein:vir:95 150 ---------------------GELKLFRVPAEQAIPIWTDK-EREQLN-AFIRIFTFNG--------------------- 185 (474) T ss_pred ---------------------CceEEEEEcccceEEEEcCC-CCCceE-EEEEEEeecC--------------------- Confidence 23556778888887433211 111222 2222221100 Q ss_pred hhccccccccccccccccceEEEEEEEEEe-----eecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-----DVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-----~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ...+|+|... ...+.+... ....+.........|...+.+|++ +.++ T Consensus 186 --------------------~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv-----~~~n 237 (474) T protein:vir:95 186 --------------------ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFI-----AFKN 237 (474) T ss_pred --------------------eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceE-----EecC Confidence 0112222110 001111000 000111111111222233455555 3445 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc-ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ-TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~-~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +..|.|.+..++++++.+|.+.|.+.+.+...++|.+.+ .|.- +. .+ .......+++.+.+++ ...++..+.-.. T Consensus 238 n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~l~~~~~~~ 315 (474) T protein:vir:95 238 NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLKYYKAINVSSDG-GVETIQVEVPVA 315 (474) T ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhhccceeeccCCC-ceeEEeccCCHH Confidence 567899999999999999999999999999999887654 4431 11 11 1223344566666544 345555454456 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .....++.+...|-..|++++...+..++..|+.| +..+...........-+.|. ..++.++.+|..+... T Consensus 316 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~-----~~l~~~~~~i~~~~g~-- 386 (474) T protein:vir:95 316 STKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIA--LKFLYTNLNLKANKLKNKAN-----VALQELMQFILDFNKI-- 386 (474) T ss_pred HHHHHHHHHHHHHHHHhCCcCccccccccccHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhCC-- Confidence 67788999999999999999887654333334444 33333333333333333443 2334444555554321 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) .++.. ++.+..+-..+....+..+ ++... +.+ ....+... +.... T Consensus 387 ----------~~d~~----~i~i~f~~~~p~~~~e~a~----~~~~~-gii----S~et~~~~--lp~v~---------- 431 (474) T protein:vir:95 387 ----------KLDAK----EIEITFNFNVMVNDLEQSQ----IGAQS-QYL----SKETLVRH--HPWVD---------- 431 (474) T ss_pred ----------Ccccc----eeeEEecCCCccCHHHHHH----HHHHc-CCC----ChHHHHHh--CCCCC---------- Confidence 01111 1222222222222222222 11111 111 11111111 11111 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) ++ .....+...++.+. .+.+.. .. ........+..+....+.+ T Consensus 432 ---D~-~~E~eri~~E~~~~-~~~~~~-~~-------~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 432 ---DP-KAELERLDEEQLEL-NKQLPN-LD-------DGGADGAQQQQQSENNQSK 474 (474) T ss_pred ---CH-HHHHHHHHHHHHHH-Hhhccc-cc-------cccCCCCCCcCCCCccccC Confidence 10 00000100000000 000000 00 0000000000000000000 No 91 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.67 E-value=8.5e-16 Score=103.22 Aligned_cols=433 Identities=10% Similarity=0.068 Sum_probs=191.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccC--CCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKG--KSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g--~s~~~~~~i~~~v~~~~~~ 67 (663) +..+-+.+-+.|...++. +..+.....+..+||.|++.-+. ... ..+++.|...-.|+..+++ T Consensus 21 ~~~~~~~~~~~i~~~i~~----~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y 96 (474) T protein:vir:96 21 MKPKVETQEEMIIRLINN----HKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSY 96 (474) T ss_pred ccccccchHHHHHHHHHH----HHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhh Confidence 333333333333333333 44455667778889998752111 111 1245666666666665555 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) + ++..+. +.+ +|.+.. +.++.++. ++....+..+.++++++|.|+..++++.. T Consensus 97 l----~g~p~~--~~~---~~~~~~----~~l~~~~~--n~~~~~~~~l~~~~~~~G~~~~~~~~d~~------------ 149 (474) T protein:vir:96 97 V----AGKPVT--YAH---DDDKVL----DVIHQVLD--TRWDNKLIDILTAASNKGIDWLQVYINED------------ 149 (474) T ss_pred h----cccCce--ecc---CChHHH----HHHHHHHh--ccHHHHHHHHHHHHhhCCeEEEEeeeCCC------------ Confidence 4 454333 322 333332 34444443 56667778899999999999999887632 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) +.+.+..++|.++|+-.+-. ...+.. .+.+.+.... T Consensus 150 ---------------------~~~~i~~~~p~~~~~v~d~~-~~~~~~-a~ir~~~~~~--------------------- 185 (474) T protein:vir:96 150 ---------------------GELKLFRVPAEQAIPIWTDK-EREQLN-AFIRIFTFNG--------------------- 185 (474) T ss_pred ---------------------CceEEEEEcccceEEEEcCC-CCCceE-EEEEEEeecC--------------------- Confidence 23556778888887433211 111222 2222221100 Q ss_pred hhccccccccccccccccceEEEEEEEEEe-----eecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecC Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-----DVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPF 302 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-----~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~ 302 (663) ...+|+|... ...+.+... ....+.........|...+.+|++ +.++ T Consensus 186 --------------------~~~~~vy~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~vPvv-----~~~n 237 (474) T protein:vir:96 186 --------------------ETKVEYWTAETVTYYVYENGGLIP---DFYYGDEHIQTHFSTGSWERVPFI-----AFKN 237 (474) T ss_pred --------------------eeEEEEEeCCeEEEEEEcCCceee---ccccccccccCcccccCCCccceE-----EecC Confidence 0112222110 001111000 000111111111222233455555 3445 Q ss_pred cccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc-ch-hhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 303 KLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ-TN-RKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 303 ~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~-~d-~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +..|.|.+..++++++.+|.+.|.+.+.+...++|.+.+ .|.- +. .+ .......+++.+.+++ ...++..+.-.. T Consensus 238 n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~-~g~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~l~~~~~~~ 315 (474) T protein:vir:96 238 NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL-RGYEGEDLSEFMEGLKYYKAINVSSDG-GVETIQVEVPVA 315 (474) T ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-cCCCcccccchhhhhhccceeeccCCC-ceeEEeccCCHH Confidence 567899999999999999999999999999999887654 4431 11 11 1223344566666544 345555454456 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEE 459 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~ 459 (663) .....++.+...|-..|++++...+..++..|+.| +..+...........-+.|. ..++.++.+|..+... T Consensus 316 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~~~~~~~-----~~l~~~~~~i~~~~g~-- 386 (474) T protein:vir:96 316 STKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIA--LKFLYTNLNLKANKLKNKAN-----VALQELMQFILDFNKI-- 386 (474) T ss_pred HHHHHHHHHHHHHHHHhCCcCccccccccccHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhCC-- Confidence 67788999999999999999887654333334444 33333333333333333443 2334444555554321 Q ss_pred EEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_021532. 460 VIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYE 539 (663) Q Consensus 460 ~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~ 539 (663) .++.. ++.+..+-..+....+..+ ++... +.+ ....+... +.... T Consensus 387 ----------~~d~~----~i~i~f~~~~p~~~~e~a~----~~~~~-gii----S~et~~~~--lp~v~---------- 431 (474) T protein:vir:96 387 ----------KLDAK----EIEITFNFNVMVNDLEQSQ----IGAQS-QYL----SKETLVRH--HPWVD---------- 431 (474) T ss_pred ----------Ccccc----eeeEEecCCCccCHHHHHH----HHHHc-CCC----ChHHHHHh--CCCCC---------- Confidence 01111 1222222222222222222 11111 111 11111111 11111 Q ss_pred cchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 540 PKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 540 ~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~ 595 (663) ++ .....+...++.+. .+.+.. .. ........+..+....+.+ T Consensus 432 ---D~-~~E~eri~~E~~~~-~~~~~~-~~-------~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 432 ---DP-KAELERLDEEQLEL-NKQLPN-LD-------DGGADGAQQQQQSENNQSK 474 (474) T ss_pred ---CH-HHHHHHHHHHHHHH-Hhhccc-cc-------cccCCCCCCcCCCCccccC Confidence 10 00000100000000 000000 00 0000000000000000000 No 92 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.67 E-value=4.8e-16 Score=104.55 Aligned_cols=473 Identities=12% Similarity=0.069 Sum_probs=213.9 Q ss_pred CCCcHHHH--HHHHHHH---HHHHHHHHHH-HHHHHHHHHHHhcCCcCCc-------cccCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL--LSALKAD---MKAADVLKQE-QDSLISTWKAEYNGEPYGN-------EQKGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~--~~~l~~~---~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~-------~~~g~s~~~~~~i~~~v~~~~~~ 67 (663) |--++... .+.+..- |-+...-.++ ++..++.|.+||+|+...+ ..+++-++..|.... T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~-------- 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEK-------- 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHH-------- Confidence 22222111 1111000 0000122233 4456788999999874322 233345566665422 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) +++...-|-+.+....+...++....+++.+++.+| ....+.+.-+++++.|-|++++.||.... T Consensus 73 ----~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~-l~~~~~~~~r~~~vlGDg~f~l~wD~~k~---------- 137 (527) T protein:vir:10 73 ----LIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDREN-WEQKFESLKRWTEIRGDYVLLLIGDDEKD---------- 137 (527) T ss_pred ----hhCCcceeeccCccccccchhHHHHHHHHHHHHHhh-hHHHHHHHHHhhhhhcceeEEEeeccCCC---------- Confidence 233333344555555566667777888888887654 44455667789999999999999985431 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEe----ecCHHHHHHhcCC-cChhhhhh Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRY----ETDLSTLKKDGRY-KNLDKLAK 222 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~----~~~~~~l~~~g~~-~~~~~~~~ 222 (663) ...+|.++.++|.-+|.- .|-++..++-.++ |-..++-++ ++. ..+-++ T Consensus 138 -------------------~~~R~~v~~~DP~~~f~~----ed~d~~~~v~~v~~~~~~~~P~d~~~-~~~~ar~~~~-- 191 (527) T protein:vir:10 138 -------------------EGSRLSLHEVDPSTYFPY----EDPRYPGQVLGVYLVDEYPHPDSEKK-NEKCARVQKY-- 191 (527) T ss_pred -------------------cCCCceEeecCcceeeee----ecCCCCCceeeEEEeeeccCCccccc-cceehhhhhh-- Confidence 112455666776655532 2333444443332 111111111 000 000000 Q ss_pred ccchhhhccccccccccccccccceEEEEE-EEEE--ee-ecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeee Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYE-YWGN--YD-VDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFN 298 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E-~w~~--~~-~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~ 298 (663) -+...+.....++ -++++.+ .|.- ++ .+.-..-+-.+.+.+++.+++..++|+ +.+|+++++-. T Consensus 192 ------~~~l~~~g~~~~~----G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~ 259 (527) T protein:vir:10 192 ------MKTLDDDGKPVPG----GAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGH 259 (527) T ss_pred ------hhhcCcccccccC----cceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCC Confidence 0000000000011 1233322 3431 11 111111112344567888888877777 66899999999 Q ss_pred eecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--cC---cchhhhccCCcceEeCCCCCcccccc Q lcl|NC_021532. 299 SIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA--LD---QTNRKKFLAGANFEFNGTANDFWHGS 373 (663) Q Consensus 299 ~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~--i~---~~d~~~~~p~~vi~~~~~~~~~~~~~ 373 (663) |.+++.||+|-+.+++++.+.+|+..+....++..+++|.+.. .|+ ++ ..+.....||++|....++ .+..+. T Consensus 260 p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-tg~~~vd~~G~~~~~~VgPG~iweL~e~a-k~~~v~ 337 (527) T protein:vir:10 260 PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-DSAPPRDSRGNMVPWTISPLGMVEHGQNN-KIYRVN 337 (527) T ss_pred CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-cccccccccCCcCccccCCceeEecCCCc-ceeecc Confidence 9999999999999999999999999999999999998887665 332 11 1122335688899876543 344444 Q ss_pred CccccHHHHHHHHHHHHHHHHHhCCChHHcCC--CcccchhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 374 YNAIPSSAFDMISLMNNEIESITGTKSFSGGI--NSGSLGSTATGARG--ALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 374 ~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~--~~~~~~~tA~~i~~--~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) ..+-...+...+..+...+.+++|++....|. .++..|+.|-.++. +..... +-+.+.....+.+...+..+++. T Consensus 338 ~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~-rk~L~~~~vqrq~~~~~~~~~L~ 416 (527) T protein:vir:10 338 GVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCA-EQELELKSVLKQFFYNLVTQWLP 416 (527) T ss_pred chhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhHHHHHH Confidence 33333446677888889999999999999993 24444555533322 111110 00111111111111112222222 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchh--HHHHHHHHHH--------------HHHhccCCCcc Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDN--AAKSQELSFL--------------LQTLGPNEDPK 513 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~--~~~~q~l~~~--------------~~~~~~~~~p~ 513 (663) ....+.. .+......+.++-+....+ .+.-+++..+ |..++.--+|+ T Consensus 417 aye~v~~-----------------~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E 479 (527) T protein:vir:10 417 AYEGVGI-----------------DDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTE 479 (527) T ss_pred Hhhhccc-----------------CCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChH Confidence 2111111 1111112233333332211 1111111111 11111111221 Q ss_pred hh-HHHHHHHH-Hhhhhhhhhhhhh----hhhcchhh-HHHHhhHHHHHHHHHHHHHH Q lcl|NC_021532. 514 IR-RDIMADIM-DLMRMPEQAKRMR----EYEPKPDP-VQEKIRQLELENLMLENQML 564 (663) Q Consensus 514 ~~-~~~l~~~~-~l~~~~e~~~~l~----~~~~~~~~-~~~q~~q~~~~~~q~~~~~~ 564 (663) .. ..+....+ +.....+...... ...+-++. ..++..... + T Consensus 480 ~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~----------~ 527 (527) T protein:vir:10 480 EDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQP----------L 527 (527) T ss_pred HHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCC----------C Confidence 11 00111000 0000000000000 00000000 000000000 0 No 93 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.67 E-value=4.9e-16 Score=104.50 Aligned_cols=473 Identities=12% Similarity=0.068 Sum_probs=213.8 Q ss_pred CCCcHHHH--HHHHHHH---HHHHHHHHHH-HHHHHHHHHHHhcCCcCCc-------cccCCCccccHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAEL--LSALKAD---MKAADVLKQE-QDSLISTWKAEYNGEPYGN-------EQKGKSAIVSRDIKKQSEWQHAT 67 (663) Q Consensus 1 ~~~~~~~~--~~~l~~~---~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~-------~~~g~s~~~~~~i~~~v~~~~~~ 67 (663) |--++... .+.+..- |-+...-.++ ++..++.|.+||+|+...+ ..+++-++..|.... T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~-------- 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEK-------- 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHH-------- Confidence 22222111 1111000 0000122233 4456788999999874322 233345566665422 Q ss_pred HHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccc Q lcl|NC_021532. 68 IVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVV 147 (663) Q Consensus 68 l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~ 147 (663) +++...-|-+.+....+...++....+++.+++.+| ....+.+.-+++++.|-|++++.||.... T Consensus 73 ----~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~-l~~~~~~~~r~~~vlGDg~f~l~wD~~k~---------- 137 (527) T protein:vir:10 73 ----LIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDREN-WEQKFESLKRWTEIRGDYVLLLIGDDEKD---------- 137 (527) T ss_pred ----hhCCcceeeccCccccccchhHHHHHHHHHHHHHhh-hHHHHHHHHHhhhhhcceeEEEeeccCCC---------- Confidence 233333344555555556667777888888887654 44455667789999999999999985431 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEe----ecCHHHHHHhcCC-cChhhhhh Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRY----ETDLSTLKKDGRY-KNLDKLAK 222 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~----~~~~~~l~~~g~~-~~~~~~~~ 222 (663) ...+|.++.++|.-+|.- .|-++..++-.++ |-..++-++ ++. ..+-++ T Consensus 138 -------------------~~~R~~v~~~DP~~~f~~----ed~d~~~~v~~v~~~~~~~~P~d~~~-~~~~ar~~~~-- 191 (527) T protein:vir:10 138 -------------------EGSRLSLHEVDPSTYFPY----EDPRYPGQVLGVYLVDEYPHPDSEKK-NEKCARVQKY-- 191 (527) T ss_pred -------------------cCCCceEeecCcceeeee----ecCCCCCceeeEEEeeeccCCccccc-cceehhhhhh-- Confidence 112455666676655532 2333444443332 111111111 000 000000 Q ss_pred ccchhhhccccccccccccccccceEEEEE-EEEE--ee-ecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeee Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYE-YWGN--YD-VDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFN 298 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E-~w~~--~~-~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~ 298 (663) -+...+.....++ -++++.+ .|.- ++ .+.-..-+-.+.+.+++.+++..++|+ +.+|+++++-. T Consensus 192 ------~~~l~~~g~~~~~----G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~ 259 (527) T protein:vir:10 192 ------MKTLDDDGKPVPG----GAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGH 259 (527) T ss_pred ------hhhcCcccccccC----cceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCC Confidence 0000000000011 1233322 3431 11 111111112344567888888877777 66899999999 Q ss_pred eecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc--cC---cchhhhccCCcceEeCCCCCcccccc Q lcl|NC_021532. 299 SIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA--LD---QTNRKKFLAGANFEFNGTANDFWHGS 373 (663) Q Consensus 299 ~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~--i~---~~d~~~~~p~~vi~~~~~~~~~~~~~ 373 (663) |.+++.||+|-+.+++++.+.+|+..+....++..+++|.+.. .|+ ++ ..+.....||++|....++ .+..+. T Consensus 260 p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~-tg~~~vd~~G~~~~~~VgPG~iweL~e~a-k~~~v~ 337 (527) T protein:vir:10 260 PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYAT-DSAPPRDSRGNMVPWTISPLGMVEHGQNN-KIYRVN 337 (527) T ss_pred CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeee-cccccccccCCcCccccCCceeEecCCCc-ceeecc Confidence 9999999999999999999999999999999999998887665 332 11 1122335688899876543 344444 Q ss_pred CccccHHHHHHHHHHHHHHHHHhCCChHHcCC--CcccchhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 374 YNAIPSSAFDMISLMNNEIESITGTKSFSGGI--NSGSLGSTATGARG--ALDATATRRMNIVRNIAENLVKPLMRKWMA 449 (663) Q Consensus 374 ~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~--~~~~~~~tA~~i~~--~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~ 449 (663) ..+-...+...+..+...+.+++|++....|. .++..|+.|-.++. +..... +-+.+.....+.+...+..+++. T Consensus 338 ~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~-rk~L~~~~Vqrq~~~~~~~~~L~ 416 (527) T protein:vir:10 338 GVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCA-EQELELKSVLKQFFYNLVTQWLP 416 (527) T ss_pred chhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhHHHHHH Confidence 33333446677888889999999999999993 24444555533322 111110 00111111111111112222222 Q ss_pred HHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchh--HHHHHHHHHH--------------HHHhccCCCcc Q lcl|NC_021532. 450 YNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDN--AAKSQELSFL--------------LQTLGPNEDPK 513 (663) Q Consensus 450 li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~--~~~~q~l~~~--------------~~~~~~~~~p~ 513 (663) ....+.. .+......+.++-+....+ .+.-+++..+ |..++.--+|+ T Consensus 417 aye~v~~-----------------~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E 479 (527) T protein:vir:10 417 AYEGVGI-----------------DDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTE 479 (527) T ss_pred Hhhhccc-----------------CCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchH Confidence 2111111 1111112233333332211 1111111111 11111111222 Q ss_pred hh-HHHHHHHHH-hhhhhhhhhhhh----hhhcchhh-HHHHhhHHHHHHHHHHHHHH Q lcl|NC_021532. 514 IR-RDIMADIMD-LMRMPEQAKRMR----EYEPKPDP-VQEKIRQLELENLMLENQML 564 (663) Q Consensus 514 ~~-~~~l~~~~~-l~~~~e~~~~l~----~~~~~~~~-~~~q~~q~~~~~~q~~~~~~ 564 (663) .. ..+....+. .....+...... ...+-++. ..++..... + T Consensus 480 ~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~----------~ 527 (527) T protein:vir:10 480 EDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQP----------L 527 (527) T ss_pred HHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCC----------C Confidence 11 011111000 000000000000 00000000 000000000 0 No 94 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.67 E-value=1.8e-14 Score=96.00 Aligned_cols=436 Identities=12% Similarity=0.061 Sum_probs=194.9 Q ss_pred CCCc---HHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccCCC--ccccHHH Q lcl|NC_021532. 1 MKIN---KAELLSALKA-------DMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKGKS--AIVSRDI 57 (663) Q Consensus 1 ~~~~---~~~~~~~l~~-------~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g~s--~~~~~~i 57 (663) |.++ .+++.+.+.. .+++....+..+.....++.+||.|++.-+. ..+++ +++.|.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~ 86 (474) T protein:vir:94 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFH 86 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchH Confidence 4333 2233333322 2333334455556667888999999753211 12222 3566666 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccc Q lcl|NC_021532. 58 KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDE 137 (663) Q Consensus 58 ~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~ 137 (663) ...|+..++++ ++..+. +. .+|++. ...++.+++ ++....+..+.++++++|.|++.+++|.. T Consensus 87 k~Ivd~~~~~l----~g~p~~--~~---~~d~~~----~~~l~~~~~--n~~~~~~~e~~~~~~~~G~~~~~~~~d~~-- 149 (474) T protein:vir:94 87 QNLVDQKVSYV----ASKPVT--YS---CEDENV----LKVIHDVLD--TRWDNKLIDILTATSNKGIDWLQVYINEN-- 149 (474) T ss_pred HHHHHHHHhhh----hcCCce--ec---cCcHHH----HHHHHHHHh--ccHHHHHHHHHHHHhhcCceEEEEEecCC-- Confidence 66666666555 444333 22 234333 334445443 56667778899999999999999887521 Q ss_pred eecccccccccCccccccccccccccceeecccceeeeccHHHheeC--cccccChhhCceEEEEeecCHHHHHHhcCCc Q lcl|NC_021532. 138 EVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLD--PTCQDNLDNAQFVIHRYETDLSTLKKDGRYK 215 (663) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~d--p~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~ 215 (663) +.+.+..++|..+|+- ++.. .+..+++ +.+.... T Consensus 150 -------------------------------~~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-r~~~~~~--------- 185 (474) T protein:vir:94 150 -------------------------------GEMKLFRVPAEQAIPIWVDKER---EELKSFI-RYYKFNN--------- 185 (474) T ss_pred -------------------------------CeeEEEEEcccceEEEEcCCCC---CceEEEE-EEEEecC--------- Confidence 2356677888888743 3222 1222222 2221100 Q ss_pred ChhhhhhccchhhhccccccccccccccccceEEEEEEEEE-----eeecCCceeEEEEEEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 216 NLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~ 290 (663) ...+++|.. +-..+.+.. .....-.+..... ..|-..|.+ T Consensus 186 --------------------------------~~~~~~yt~~~~~~y~~~~~~~~-~~~~~~~~~~~~~--~~~~~~g~v 230 (474) T protein:vir:94 186 --------------------------------EEKVEFWTDTTVTYYVLENGGLI-PDYYYGANHVQSH--FSNGNWGRV 230 (474) T ss_pred --------------------------------eEEEEEEeCCeEEEEEEcCCccc-cccccCcCccccc--ccccCCCcc Confidence 001122211 000111100 0000000111111 112233556 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc-h-hhhccCCcceEeCCCCCc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT-N-RKKFLAGANFEFNGTAND 368 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~-d-~~~~~p~~vi~~~~~~~~ 368 (663) |++.+ .++.+|.|.+..++++++.+|.+.+.+.+.+...+.|.+++.-...+.. + ......++++.+.++++ T Consensus 231 Pvv~~-----~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~- 304 (474) T protein:vir:94 231 PFIAF-----KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG- 304 (474) T ss_pred ceEEe-----cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc- Confidence 66543 4456799999999999999999999999999999998776643222221 1 11223455676665543 Q ss_pred cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) +.++..+.-...+...++.+...|...|++++.+.+.-++..|+.| +..+............+.| ...++.++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~-----~~~l~~~~ 377 (474) T protein:vir:94 305 VETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKA-----TVAIQELI 377 (474) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHH-----HHHHHHHH Confidence 4555544444566778899999999999999877653332334443 3333222323323333333 33344455 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~ 528 (663) .+|..++... .++. ++.+..+-+.+....+..+ .+... +.++ ...+.. .+... T Consensus 378 ~li~~~~~~~-------~d~~---------~i~v~f~~~~p~~~~e~a~----~~~~~-g~iS----~et~l~--~l~~v 430 (474) T protein:vir:94 378 SFIIDFNNLK-------TDVK---------DIEISFNFNRMMNDAEQSQ----IIAQS-QYLS----RETLVK--SSPLV 430 (474) T ss_pred HHHHHHhCCC-------cccc---------eeeEEeccCcccCHHHHHH----HHHHc-CCCC----HHHHHH--hCCCC Confidence 5555554211 0110 1222222222222222222 22222 2222 121111 11111 Q ss_pred hhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 529 PEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 529 ~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) .+....++.+..+..... +.........+.... ...+....+.+ T Consensus 431 ~D~~~E~eri~~E~~~~~---------~~~~~~~~~~~~~~~--~~~~~~~~~~e 474 (474) T protein:vir:94 431 DDYKAELERIEQEQMEYN---------KQLPNLDDGGADGAQ--QQEGSNNKESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHH---------hhccccCCCCCCCcc--cCCCCcccccC Confidence 111111111111000000 000000000000000 00000000000 No 95 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.67 E-value=1.8e-14 Score=96.00 Aligned_cols=436 Identities=12% Similarity=0.061 Sum_probs=194.9 Q ss_pred CCCc---HHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccCCC--ccccHHH Q lcl|NC_021532. 1 MKIN---KAELLSALKA-------DMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKGKS--AIVSRDI 57 (663) Q Consensus 1 ~~~~---~~~~~~~l~~-------~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g~s--~~~~~~i 57 (663) |.++ .+++.+.+.. .+++....+..+.....++.+||.|++.-+. ..+++ +++.|.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~ 86 (474) T protein:vir:97 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFH 86 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchH Confidence 4333 2233333322 2333334455556667888999999753211 12222 3566666 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccc Q lcl|NC_021532. 58 KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDE 137 (663) Q Consensus 58 ~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~ 137 (663) ...|+..++++ ++..+. +. .+|++. ...++.+++ ++....+..+.++++++|.|++.+++|.. T Consensus 87 k~Ivd~~~~~l----~g~p~~--~~---~~d~~~----~~~l~~~~~--n~~~~~~~e~~~~~~~~G~~~~~~~~d~~-- 149 (474) T protein:vir:97 87 QNLVDQKVSYV----ASKPVT--YS---CEDENV----LKVIHDVLD--TRWDNKLIDILTATSNKGIDWLQVYINEN-- 149 (474) T ss_pred HHHHHHHHhhh----hcCCce--ec---cCcHHH----HHHHHHHHh--ccHHHHHHHHHHHHhhcCceEEEEEecCC-- Confidence 66666666555 444333 22 234333 334445443 56667778899999999999999887521 Q ss_pred eecccccccccCccccccccccccccceeecccceeeeccHHHheeC--cccccChhhCceEEEEeecCHHHHHHhcCCc Q lcl|NC_021532. 138 EVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLD--PTCQDNLDNAQFVIHRYETDLSTLKKDGRYK 215 (663) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~d--p~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~ 215 (663) +.+.+..++|..+|+- ++.. .+..+++ +.+.... T Consensus 150 -------------------------------~~~~i~~~~p~~~~~v~d~~~~---~~~~~~i-r~~~~~~--------- 185 (474) T protein:vir:97 150 -------------------------------GEMKLFRVPAEQAIPIWVDKER---EELKSFI-RYYKFNN--------- 185 (474) T ss_pred -------------------------------CeeEEEEEcccceEEEEcCCCC---CceEEEE-EEEEecC--------- Confidence 2356677888888743 3222 1222222 2221100 Q ss_pred ChhhhhhccchhhhccccccccccccccccceEEEEEEEEE-----eeecCCceeEEEEEEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 216 NLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~ 290 (663) ...+++|.. +-..+.+.. .....-.+..... ..|-..|.+ T Consensus 186 --------------------------------~~~~~~yt~~~~~~y~~~~~~~~-~~~~~~~~~~~~~--~~~~~~g~v 230 (474) T protein:vir:97 186 --------------------------------EEKVEFWTDTTVTYYVLENGGLI-PDYYYGANHVQSH--FSNGNWGRV 230 (474) T ss_pred --------------------------------eEEEEEEeCCeEEEEEEcCCccc-cccccCcCccccc--ccccCCCcc Confidence 001122211 000111100 0000000111111 112233556 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc-h-hhhccCCcceEeCCCCCc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT-N-RKKFLAGANFEFNGTAND 368 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~-d-~~~~~p~~vi~~~~~~~~ 368 (663) |++.+ .++.+|.|.+..++++++.+|.+.+.+.+.+...+.|.+++.-...+.. + ......++++.+.++++ T Consensus 231 Pvv~~-----~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~- 304 (474) T protein:vir:97 231 PFIAF-----KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGG- 304 (474) T ss_pred ceEEe-----cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc- Confidence 66543 4456799999999999999999999999999999998776643222221 1 11223455676665543 Q ss_pred cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) +.++..+.-...+...++.+...|...|++++.+.+.-++..|+.| +..+............+.| ...++.++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--l~~~~~~l~~k~~~k~~~~-----~~~l~~~~ 377 (474) T protein:vir:97 305 VETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKA-----TVAIQELI 377 (474) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH--HHHHHHHHHHHHHHHHHHH-----HHHHHHHH Confidence 4555544444566778899999999999999877653332334443 3333222323323333333 33344455 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~ 528 (663) .+|..++... .++. ++.+..+-+.+....+..+ .+... +.++ ...+.. .+... T Consensus 378 ~li~~~~~~~-------~d~~---------~i~v~f~~~~p~~~~e~a~----~~~~~-g~iS----~et~l~--~l~~v 430 (474) T protein:vir:97 378 SFIIDFNNLK-------TDVK---------DIEISFNFNRMMNDAEQSQ----IIAQS-QYLS----RETLVK--SSPLV 430 (474) T ss_pred HHHHHHhCCC-------cccc---------eeeEEeccCcccCHHHHHH----HHHHc-CCCC----HHHHHH--hCCCC Confidence 5555554211 0110 1222222222222222222 22222 2222 121111 11111 Q ss_pred hhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 529 PEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 529 ~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) .+....++.+..+..... +.........+.... ...+....+.+ T Consensus 431 ~D~~~E~eri~~E~~~~~---------~~~~~~~~~~~~~~~--~~~~~~~~~~e 474 (474) T protein:vir:97 431 DDYKAELERIEQEQMEYN---------KQLPNLDDGGADGAQ--QQEGSNNKESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHH---------hhccccCCCCCCCcc--cCCCCcccccC Confidence 111111111111000000 000000000000000 00000000000 No 96 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.66 E-value=9e-15 Score=97.58 Aligned_cols=441 Identities=12% Similarity=0.060 Sum_probs=207.5 Q ss_pred CCCcHHHHHHHHHHHHH-----HH---------HHHHHHHHHHHHHHHHHhcCCcCCcc---ccCCC----ccccHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMK-----AA---------DVLKQEQDSLISTWKAEYNGEPYGNE---QKGKS----AIVSRDIKK 59 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~-----~~---------~~~~~~~~~~~~~~~~~y~~~~~~~~---~~g~s----~~~~~~i~~ 59 (663) |+|-.. |-..+++-+. .. -..-+++....+.|+.||.|++..+. ..|+. .+..|. T Consensus 1 m~~~~~-ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl--- 76 (505) T protein:vir:79 1 MAFWDT-LKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNV--- 76 (505) T ss_pred CchHHH-HHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecch--- Confidence 444311 1111111100 00 01123455566788899998765332 12221 122232 Q ss_pred HHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeecccccee Q lcl|NC_021532. 60 QSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEV 139 (663) Q Consensus 60 ~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~ 139 (663) ...++..+.+.+++-.+.+.+ +|.+ .++.|+.++. .+++...+.+++.+++..|.++++++||. T Consensus 77 -~~~i~~~~A~ll~~e~~~i~~-----~d~~----~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~~~k~~~D~----- 140 (505) T protein:vir:79 77 -TKLASAKLASLIFNEQCQVTV-----SDET----ANDFLDDVFQ-QNDFYTTFEEKLEEWIALGSGCVRPYVDS----- 140 (505) T ss_pred -HHHHHHHHHhhhcCCCceeec-----CChH----HHHHHHHHHH-hccHHHHHHHHHHHHhhcCCeEEEEEEeC----- Confidence 233333344444554444443 3433 4445566553 46677778889999999999999999962 Q ss_pred cccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 140 TVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) +.+.++.|++..||+-..-...+.++-|+.+ +...+. . T Consensus 141 -----------------------------~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~--~~~~~~------~----- 178 (505) T protein:vir:79 141 -----------------------------GKIKLAWATADQVYPLQADTNQVNELAIASR--TTEVEN------H----- 178 (505) T ss_pred -----------------------------CceEEEEEcCCeeEEEEEcCCCeEEEEEEEE--EEEecC------C----- Confidence 2356778888888753211123444433322 111000 0 Q ss_pred hhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC------CEEEecccCC--------- Q lcl|NC_021532. 220 LAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN------DVIVRLQSNP--------- 284 (663) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g------~~~l~~~~~p--------- 284 (663) ...-++++|+|...+ +.+.-. ...|.+ |..+...+-| T Consensus 179 ------------------------~~~~yt~lE~h~~~~--~~~~I~--n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~ 230 (505) T protein:vir:79 179 ------------------------RTIYYTLLEFHQWDH--GDYVIT--NELYRSEAAETVGINVPLNSLEQYEGLEPQV 230 (505) T ss_pred ------------------------cceEEEEEEEEEecC--ceEEEE--EEEEecCCCCccCcccchhhcccccccCcce Confidence 000133455543321 111111 111111 0001111111 Q ss_pred --CcCCCCCEEEEee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchh------- Q lcl|NC_021532. 285 --YPDGKPPFLVVPF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNR------- 351 (663) Q Consensus 285 --~~~~~~Pf~~~~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~------- 351 (663) ....+++|+.++. ....++++|.|++.++++..+.+|..++++.+.+.. +..++.++++.+..... T Consensus 231 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~ 309 (505) T protein:vir:79 231 KITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKK-GQRRLIVPAEWLKTGSSYGGQASE 309 (505) T ss_pred eecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcccCCCCccccc Confidence 1124556766542 224467899999999999999999999999988865 55578887776532110 Q ss_pred -h--hccCCcce--EeC--CCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHH Q lcl|NC_021532. 352 -K--KFLAGANF--EFN--GTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDAT 424 (663) Q Consensus 352 -~--~~~p~~vi--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~ 424 (663) . .+.++..+ .+. ++...+..+.+.-........++.+.+.+...+|++....|..+++ ..||+++....+.. T Consensus 310 ~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~TAtei~s~~~~l 388 (505) T protein:vir:79 310 THPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSG-IQTATEVVTNNSQT 388 (505) T ss_pred ccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccc-cchHHHHHHHHhHH Confidence 0 01122221 111 2222344444332334567778888888889999999999876654 45898887665555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeeccc--chhHHHHHHHHHH Q lcl|NC_021532. 425 ATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTA--EDNAAKSQELSFL 502 (663) Q Consensus 425 ~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~--~~~~~~~q~l~~~ 502 (663) -.....+.+.+. ..++.|.+.++.+..-|.-..- | ...-. .....+++.|+=+.+ ....+..+..+++ T Consensus 389 ~~t~~~~~~~~~-~al~~li~~i~~~~~~~~~~~~-----g--~~~~~--~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~ 458 (505) T protein:vir:79 389 YQTRSSYITQVE-KTIKALTYAILELASVPSFYAD-----G--QARWT--GDVDSLDITINFNDGVFVDQESKRAADLQA 458 (505) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccccc-----c--ccccc--CCCCceeEEEEeCCCCCCCHHHHHHHHHHH Confidence 555566666674 4667777777777665532110 0 00000 001123344433332 2222222222222 Q ss_pred HHHhccCCCcchhHHHHHHHHHhhhhhh--hhhhhhhhhcchh---h-HHHHhhH Q lcl|NC_021532. 503 LQTLGPNEDPKIRRDIMADIMDLMRMPE--QAKRMREYEPKPD---P-VQEKIRQ 551 (663) Q Consensus 503 ~~~~~~~~~p~~~~~~l~~~~~l~~~~e--~~~~l~~~~~~~~---~-~~~q~~q 551 (663) .+ +..+.+... ++...++.+ ..+.+..+..+.. | +...... T Consensus 459 v~--~Gi~s~e~~------l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 459 VQ--AQVMPKKQF------LMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred HH--cCCCCHHHH------HHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 11 112222111 111222211 1111111111100 0 0000000 No 97 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.66 E-value=1.7e-15 Score=101.59 Aligned_cols=459 Identities=12% Similarity=0.069 Sum_probs=192.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccC--------CCccccHHHHHHHHHHHHHHH-Hh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKG--------KSAIVSRDIKKQSEWQHATIV-DP 71 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g--------~s~~~~~~i~~~v~~~~~~l~-~~ 71 (663) =.++.+.++..|...+.. .......+.+||.|++. ....| .-.++.|-..-.|+.+...|. .. T Consensus 5 ~~~d~~~~i~~L~~~~~~-------~~~r~~~~~~Yy~g~~~-i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~G 76 (488) T protein:vir:23 5 ESIDPEKLRDQLLDAFEN-------KQNELKSSKAYYDAERR-PDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEG 76 (488) T ss_pred cCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcccc-hhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccc Confidence 455666666666655444 23445777789998763 11111 112344555555555554442 11 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) |+-+.+. .+.....+|.+..+.+ +.++ ..|+.......+.++++++|.|++.|+....... T Consensus 77 f~~~~~~-~~~~~~~~d~~~~~~l----~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~------------- 137 (488) T protein:vir:23 77 FRIPSAN-GEEPESGGENDPASEL----WDWW-QANNLDIEATLGHTDALIYGTAYITISMPDPEVD------------- 137 (488) T ss_pred eeccCCc-ccccccccchhHHHHH----HHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCcccc------------- Confidence 2211110 1111123344444433 3334 3566667778899999999999999875421100 Q ss_pred ccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) .....+.+.+..++|.+++ |||... ...+.+++.. . . . T Consensus 138 ------------~~~~~~~~~i~~~~p~~~~~~~d~~~~----~~~~~~~~~~-~----------~--------~----- 177 (488) T protein:vir:23 138 ------------FDVDPEVPLIRVEPPTALYAEVDPRTR----KVLYAIRAIY-G----------A--------D----- 177 (488) T ss_pred ------------cCCCCCcceEEEeccceeEEEEecCCC----ceEEEEEEEE-e----------c--------C----- Confidence 0111234556778888775 454311 1112222111 0 0 0 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCCh Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEAN 309 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~ 309 (663) ...+..+++|.. +. .++++-.++...-....|...|.+|++.+...+..+..+|.|- T Consensus 178 ---------------~~~~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~ 234 (488) T protein:vir:23 178 ---------------GNEIVSATLYLP-----DT---TMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSE 234 (488) T ss_pred ---------------CCcEEEEEEEec-----Cc---EEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccc Confidence 001122223321 11 1111112222211223455568899999988888999999998 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-C--------cchhhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 310 AE-MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-D--------QTNRKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 310 ~~-~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~--------~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +. .++++++.+|+..+.+.+.+...+.|...+- |.. + ........+|.++....+ ......+.+..+ T Consensus 235 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g-~~~~~~q~~~~~- 311 (488) T protein:vir:23 235 ISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF-GAKPEELGINAETGQRMFDAYMARILAFEGG-EGAHAEQFSAAE- 311 (488) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh-CCCcccccccccccchhhhhhhhhhccCCCC-CCceeEecCCCC- Confidence 85 6899999999999999999998888765441 221 1 111223345666655433 333333433322 Q ss_pred HHHHHHHHHHHH---HHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 380 SAFDMISLMNNE---IESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 380 ~~~~~~~~~~~~---~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) +.+.+..++.. +-.+|++++...|..+.. +.++.++...............+.|. .-++.++.++..+.. T Consensus 312 -~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~-----~~l~~~~~l~~~~~~ 384 (488) T protein:vir:23 312 -LRNFVDALDALDRKAASYSGLPPQYLSSSSDN-PASAEAIKAAESRLVKKVERKNKIFG-----GAWEQAMRLAYKMVK 384 (488) T ss_pred -hHHHHHHHHHHHHHHhcccCCCHHHhccccCc-chHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhc Confidence 23344444444 445688888888854321 11222233332222222223333332 233444455555432 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh-hhhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM-PEQAKRM 535 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~-~e~~~~l 535 (663) ... .. .++. ++.+...-.......+..+.+..+.+..... +....+...+ ++ ++..+.+ T Consensus 385 ~~~-----------~~-~~~~-~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~----~s~et~~~~l---~~~~d~~~~~ 444 (488) T protein:vir:23 385 GGD-----------IP-TEYY-RMETVWRDPSTPTYAAKADAAAKLFANGAGL----IPRERGWVDM---GYTIVEREQM 444 (488) T ss_pred CCC-----------cc-hhhc-cceEEecCCCCCCHHHHHHHHHHHHhccccc----CCHHHHHHhC---CCCchHHHHH Confidence 110 00 0111 1222222222222222333333222211111 1222222111 11 0101111 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKA 596 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~ 596 (663) +... + .+..+.. ..+.+............ ..... ....-+-..+ T Consensus 445 ~~~~-----------~--~~~~~~~-~~~~~~~~~~~~~~~~~--~~~~~-~~~~~e~~~a 488 (488) T protein:vir:23 445 RQWL-----------E--QDQKQGL-GLIGSLYGASTPEGKPG--EAPVG-EPPAPEPDAA 488 (488) T ss_pred HHHH-----------H--HHHHHHH-HHHHHHhccCCCcccCC--CCCCC-CCCCCCCCCC Confidence 1000 0 0000000 00000000000000000 00000 0000000000 No 98 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.66 E-value=4.3e-15 Score=99.34 Aligned_cols=432 Identities=10% Similarity=0.056 Sum_probs=190.5 Q ss_pred CCCcHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-------------ccCCC--ccccHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAE-LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-------------QKGKS--AIVSRDIKKQSEWQ 64 (663) Q Consensus 1 ~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-------------~~g~s--~~~~~~i~~~v~~~ 64 (663) .+++... +.+.|... ...+ +.+.+.++.+||.|++.-+. ...++ +++.|-.+-.|+.. T Consensus 16 ~~~~~~~~~~~~i~~~----~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~ 89 (479) T protein:vir:79 16 LKKESTINLVKVIEHY----ILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQK 89 (479) T ss_pred cccCChhHHHHHHHHH----Hhhh--hHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHH Confidence 2222222 22222222 1111 33557788899998753211 11122 35555555556655 Q ss_pred HHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccc Q lcl|NC_021532. 65 HATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGE 144 (663) Q Consensus 65 ~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~ 144 (663) ++.+ ++..+. +.+ +|.+ ...+++.++. |+......++.++++++|.|++.+++|.. T Consensus 90 ~~~l----~g~p~~--~~~---~~~~----~~~~~~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------- 145 (479) T protein:vir:79 90 VGYS----VGNPIV--FNA---DDDN----LTKLLNDLLG--EEFDDTITELYLNASNKGVEWLHPYINRK--------- 145 (479) T ss_pred Hhhh----hcCCce--ecc---CCHH----HHHHHHHHHh--cCHHHHHHHHHHHHHhcCeEEEEEEeCCC--------- Confidence 5544 454433 322 3332 2345555443 56677778899999999999999987632 Q ss_pred ccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 145 AVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) +.+.+..++|..+| ||+.... ...+. .+.+..... T Consensus 146 ------------------------~~~~i~~~~p~~~~~v~d~~~~~---~~~~~-ir~y~~~~~--------------- 182 (479) T protein:vir:79 146 ------------------------GEFKYVIIPAEEAIPIWDSKRQR---ELVAF-IRFYYIEDI--------------- 182 (479) T ss_pred ------------------------CceEEEEEccceeEEEEeCCCCC---ceEEE-EEEEEEeec--------------- Confidence 33567778888875 3333221 12222 222221100 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEe-----eecCCceeEEE----EEEE--ECCEEEecccCCCcCCCCC Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-----DVDGDGIAEPI----VCAW--INDVIVRLQSNPYPDGKPP 291 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-----~~~~~g~~~~~----~~~~--~g~~~l~~~~~p~~~~~~P 291 (663) ..+.+..+|+|... ...+++..... .... ...........|.+.|..| T Consensus 183 ---------------------~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 241 (479) T protein:vir:79 183 ---------------------DGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVP 241 (479) T ss_pred ---------------------CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCccc Confidence 00011222332110 00111100000 0000 0011111122233335566 Q ss_pred EEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc-ch-hhhccCCcceEeCCCCCc Q lcl|NC_021532. 292 FLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ-TN-RKKFLAGANFEFNGTAND 368 (663) Q Consensus 292 f~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~-~d-~~~~~p~~vi~~~~~~~~ 368 (663) |+. .+++.+|.|.+..++++++.+|...+.+.+.+...++|.+.+ .|.- .. .+ ......++++.+.++++ T Consensus 242 vv~-----~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~- 314 (479) T protein:vir:79 242 FIP-----FKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVL-KEYPGTSLQEFIDNIRYYKSIKVDGGGG- 314 (479) T ss_pred EEE-----ecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeee-ecCCccccccchhhhhhccceecCCCCc- Confidence 654 345667999999999999999999999999999999987665 3421 11 11 12234566777765543 Q ss_pred cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) ..++..+.-.......++.+...|...|++++...+..++ .|++| +...............+.|.+ +++.+++ T Consensus 315 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn-~Sg~A--i~~~~~~l~~k~~~~~~~~~~-~l~~~~~--- 387 (479) T protein:vir:79 315 VDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGD-KSGVA--LKFLYSLLDLKCSKTEKKFKK-AIRELLW--- 387 (479) T ss_pred ceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccc-hhHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHH--- Confidence 5555544444567778899999999999999987775433 24444 433333333333333334432 3444444 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~ 528 (663) ++..+..- .+. ..++.. ++.+...-+.+....+..+. +..+.+.++ ...+.. .+... T Consensus 388 -li~~~~~~------~~~--~~~~~~----~i~i~f~~~~p~~~~~~a~~----~~kl~g~iS----~et~l~--~l~~v 444 (479) T protein:vir:79 388 -FVCEYLKI------SGN--KSYDYK----TVQITFNHSMIINEAEKIDM----AAKSTGIVS----DETIVS--NHPWV 444 (479) T ss_pred -HHHHHHhc------cCC--Cccccc----cceEEeCCCCCcCHHHHHHH----HHHHhccCc----HHHHHH--hCCCC Confidence 44443211 111 011111 11222222222222222222 222222221 111111 11111 Q ss_pred hhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 529 PEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDA 582 (663) Q Consensus 529 ~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~ 582 (663) .+....++ +...++.+.. ....... ........+. T Consensus 445 ~d~~~E~~--------------ri~~E~~~~~--~~~~~~~---~~~~~~~~e~ 479 (479) T protein:vir:79 445 EDVNDELE--------------RLKKQEDTQK--EYDDLIP---NNQDGVIDET 479 (479) T ss_pred CCHHHHHH--------------HHHHHHHHHH--HHHhccC---cccCCCcCcC Confidence 11111111 1110000000 0000000 0000000000 No 99 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.65 E-value=9e-15 Score=97.58 Aligned_cols=450 Identities=11% Similarity=0.036 Sum_probs=187.8 Q ss_pred CC--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-ccC------CCccccHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 1 MK--INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-QKG------KSAIVSRDIKKQSEWQHATIVDP 71 (663) Q Consensus 1 ~~--~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-~~g------~s~~~~~~i~~~v~~~~~~l~~~ 71 (663) |. -+.+.++..|...+.. +....+++.+||.|++.-+. .+. .-.++.|-.+-.|+...+.|. T Consensus 8 ~~~~~~~~~~~~~l~~~~~~-------~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~-- 78 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFED-------STQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQA-- 78 (485) T ss_pred CCCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhc-- Confidence 32 2233345555555443 33456778899999875211 111 112234555555565555441 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) +.+ |. .++|.+..+.+. .++ ..|+.......+.++++++|.|++.|+.+.... T Consensus 79 -~~g-----~~--~~~~~~~~~~~~----~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~-------------- 131 (485) T protein:vir:10 79 -VEG-----FR--FGDADEADEELW----QWW-QANNLDIEAPLGYTDAYVHGRSYITISRPDPQI-------------- 131 (485) T ss_pred -ccc-----ee--cCCCchhHHHHH----HHH-HhcCHhHHHHHHHHHHhhcCceEEEEeeCCccc-------------- Confidence 111 21 234444443333 334 346666677789999999999999987652210 Q ss_pred ccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) ......+.+.+..++|.+++ +||... ...+.+++.+.. T Consensus 132 -----------~~~~~~~~~~i~~~~p~~~~~~~D~~~~----~~~~~~~~~~~~------------------------- 171 (485) T protein:vir:10 132 -----------DLGWDPNTPIIRVEPPTRMYAEIDPRIG----RVSKAIRVAYDA------------------------- 171 (485) T ss_pred -----------ccccCCCeeEEEEEccceeEEEEcCCCC----ceeEEEEEEEee------------------------- Confidence 01112345667788888875 555321 111111111000 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCCh Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEAN 309 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~ 309 (663) ....+..+++|.. + ..+.....++........|.+.|.+|++.++..+..+..||.|- T Consensus 172 --------------~~~~~~~~~~y~~-----~---~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~ 229 (485) T protein:vir:10 172 --------------EGNEIQAATLYTP-----N---DIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSE 229 (485) T ss_pred --------------CCCeEEEEEEEeC-----C---eEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccc Confidence 0011223333322 1 01111112222222233455558899999999999999999998 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cC-----c---chhhhccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 310 AE-MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LD-----Q---TNRKKFLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 310 ~~-~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~-----~---~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) +. .++++++.+|+..+.+.......+.|...+- |. .+ + .......+|.++...++ .....+++.. T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--d~k~~q~~~~-- 304 (485) T protein:vir:10 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIF-GIKPEEIGVDPETGQTLFDAYLARILAFEDA--EGKIQQFSAA-- 304 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHh-cCCcccccccccccchhhhhcccceeccCCC--CceEEeeccc-- Confidence 86 5899999999999999999998998876542 21 11 1 11123345666655322 2223333322 Q ss_pred HHHHHHHHHHHHHHHH---hCCChHHcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021532. 380 SAFDMISLMNNEIESI---TGTKSFSGGINSGS-LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFL 455 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~---tGi~~~~~G~~~~~-~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~ 455 (663) .....++.++..++.+ |++++...|..+.. .|+.| +...............+.|. .-++.++.++..+. T Consensus 305 ~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~A--l~~~~~~l~~k~~~k~~~f~-----~~l~~~~~l~~~~~ 377 (485) T protein:vir:10 305 ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEA--IRAAESRLIKKVERKNSIFG-----GAWEEAMRLAYRMM 377 (485) T ss_pred chHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHh Confidence 2334455555555555 77787888754321 23333 33322222222233333332 22334444444432 Q ss_pred CCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhhhh Q lcl|NC_021532. 456 EEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQAKR 534 (663) Q Consensus 456 ~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~~~ 534 (663) .... ... ++ .++.+...-.......+..+.+..+.+. +. +.+....+..+ .++. +..+. T Consensus 378 ~~~~-----------~~~-~~-~~i~v~w~~~~~~~~~~~ada~~kl~~a-g~---~~~s~et~~~~---lg~~~~~~~~ 437 (485) T protein:vir:10 378 KGGD-----------VPP-DM-LRMETVWRDPSTPTYAAKADAASKLYNG-GT---GVIPRERARKD---MGYSIAEREE 437 (485) T ss_pred CCCC-----------Ccc-cc-eeeeEEecCCCCCCHHHHHHHHHHHHhc-cc---cCCCHHHHHHh---CCCCHhHHHH Confidence 2110 000 00 0112222211222222222222222211 11 11222222211 1111 00111 Q ss_pred hhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 535 MREYEPKPDPVQEKIRQLELENLMLENQMLVASINDK--NARANENTIDAELKRSKAAV 591 (663) Q Consensus 535 l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~--~a~~q~~~~~~~~~~~~~~~ 591 (663) ++....+... ........+-...... +...+.+.......-..-.+ T Consensus 438 ~~~~~ee~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 438 MRRWDEEEAA-----------MGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHHHHHHHH-----------HHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 1110000000 0000000000000000 00000000000000000000 No 100 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.65 E-value=4.8e-15 Score=99.09 Aligned_cols=392 Identities=10% Similarity=-0.007 Sum_probs=183.2 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc--------cccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN--------EQKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~--------~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |+...| ..|.+.+.+ +......+.+||.|++.-. +-+...+++.|-+.-.|+++...+. T Consensus 1 ~~~~~i-~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----- 67 (409) T protein:vir:94 1 MTEKGI-GYLRFKLSV-------HKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----- 67 (409) T ss_pred CCHHHH-HHHHHHHHH-------HhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc----- Confidence 555444 444444333 2234566778999976321 1111222344444445554443321 Q ss_pred CCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNE 154 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 154 (663) |.+.+.+|.+ +..++. .|+.......++++++++|.+++.|+=+. T Consensus 68 ------~~Gf~~~d~~--------l~~i~~-~N~ld~~~~~~~~~aliyG~sf~~v~~~~-------------------- 112 (409) T protein:vir:94 68 ------FREFENDDFT--------VNEIFE-ENNPDIFFDSAVLSSLIASCSFTYISKGE-------------------- 112 (409) T ss_pred ------cCcccCCchH--------HHHHHH-hcChhHHHHHHHHHHHHhcceeEEEecCC-------------------- Confidence 2222223321 344554 56666677889999999999999885321 Q ss_pred cccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccc Q lcl|NC_021532. 155 TVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDS 232 (663) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~ 232 (663) .+.|.+..++|.+++ |||...+ ...+.+.+-. + . T Consensus 113 -------------dg~~~i~~~sp~~~~~i~D~~~~~-----~~~a~~~~~~--d-------~----------------- 148 (409) T protein:vir:94 113 -------------NDAVRLQVIEAVNATGIIDPITGL-----LTEGYAVLER--D-------E----------------- 148 (409) T ss_pred -------------CCceEEEEeccceEEEEEecCCCc-----eeeeEEEEEe--c-------C----------------- Confidence 123456677787664 5553211 1111111100 0 0 Q ss_pred cccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH-H Q lcl|NC_021532. 233 PDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA-E 311 (663) Q Consensus 233 ~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~-~ 311 (663) . .......+|.. + +.+..+..++ .....+||+ |.+|++.++..++.++++|.|-+ + T Consensus 149 -----~-------~~~~~~~~~~~-----~---~~~~~~~~~~-~~~~~~n~~--g~vPvV~f~n~~~~~~~~G~s~I~e 205 (409) T protein:vir:94 149 -----N-------NNVVLEAHFLP-----D---RTDYYYRDSR-NNISIANPT--GHPLLVPIIHRPDAVRPFGRSRITR 205 (409) T ss_pred -----C-------CceEEEEEEec-----C---cEEEEEecCc-eeEeeeCCC--CCcceEEeccccccccccCccccch Confidence 0 00011111111 0 0000001111 112235665 78999999999999999999976 7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEe---eccccCcchhhhccCCcceEeCCC--CCccccccCcccc-HHHHHHH Q lcl|NC_021532. 312 MIGDNQKVKTAVIRGIIDNMAQSNNGQVAI---RKGALDQTNRKKFLAGANFEFNGT--ANDFWHGSYNAIP-SSAFDMI 385 (663) Q Consensus 312 ~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~---~~~~i~~~d~~~~~p~~vi~~~~~--~~~~~~~~~~~~~-~~~~~~~ 385 (663) .++++|+.+|+.+..++......++|+..+ ++++ ++.+.....++.++.+... +..+...+++.-. ..+...+ T Consensus 206 ~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~-~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l 284 (409) T protein:vir:94 206 SGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA-EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQL 284 (409) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC-cccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHH Confidence 899999999999999999999999987554 2222 2233344556777766432 2223332332221 2234455 Q ss_pred HHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEec Q lcl|NC_021532. 386 SLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTN 465 (663) Q Consensus 386 ~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~ 465 (663) ..+...+-.+||+|....|..++. +.+|.++......-........+.|.. ..+.++++++.+.-..-.. T Consensus 285 ~~~~~~~a~~t~lP~~~lg~~~~N-psSa~Al~a~~~~L~~~a~~k~~~fg~-~~~~~~rla~~i~~~~~~~-------- 354 (409) T protein:vir:94 285 RTAAAGFAGETGLTLDDLGFVSDN-PSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYLAACLRDDAPYL-------- 354 (409) T ss_pred HHHHHHHhhhcCCCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhCCCCcc-------- Confidence 555666667789999999965432 223333443222222222333333433 2344444443332221100 Q ss_pred CeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhh Q lcl|NC_021532. 466 DKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQA 532 (663) Q Consensus 466 ~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~ 532 (663) +.++. +..+.-.-.......+..+. +..+..+.+..++......+.. +.++.+.. T Consensus 355 -------~~~~~-~~~v~W~p~~~~~~~~~a~~-aDa~~Kl~~ag~~~~~~~~~~~---~lG~~~~d 409 (409) T protein:vir:94 355 -------REQFR-KTKPKWEPLFEADASMLSLI-GDGAIKLNQAIPEFINKDTIRD---LTGIEGGE 409 (409) T ss_pred -------ccccc-cceEEeccCCCcchHHHHHH-HHHHHHHHHhcccccchhHHHH---HcCCCCCC Confidence 01110 11111110001111111211 1222222222222223333333 23332211 No 101 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.65 E-value=5.2e-16 Score=104.37 Aligned_cols=435 Identities=10% Similarity=0.021 Sum_probs=191.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc-------cccCC--CccccHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN-------EQKGK--SAIVSRDIKKQSEWQHATIVDP 71 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~-------~~~g~--s~~~~~~i~~~v~~~~~~l~~~ 71 (663) =++..++|...|.+.. ..+...++++.+||.|++... ...++ .+++.|..+..|+...+++ T Consensus 20 ~~l~~~~i~~li~~~~-------~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l--- 89 (506) T protein:vir:94 20 ENLTPNKIMKFITHHF-------NYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYS--- 89 (506) T ss_pred hcCCHHHHHHHHHHHH-------HHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhh--- Confidence 1233444444433321 122345677888999976421 12233 3456666666666666554 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) ++..+ .|.+ +|+. ..+.++.++. .|+....+..+.++++++|.+++.++++.+ T Consensus 90 -~G~p~--~~~~---~d~~----~~~~l~~~~~-~N~~~~~~~~~~~~~~~~G~a~~~v~~ded---------------- 142 (506) T protein:vir:94 90 -VGNPI--NVKL---PDDG----SNSGFDTFNK-ANDVDAENYDLFLDMSRYGRAYEYVYRGED---------------- 142 (506) T ss_pred -cccCc--eeec---Ccch----HHHHHHHHHh-ccCHhHHHHHHHHHHHhcCeEEEEEEecCC---------------- Confidence 44432 3333 2222 2345666554 466767778899999999999999988632 Q ss_pred ccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhh Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFD 229 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 229 (663) +.+.+.+++|..+|+ |+... .....+.+.+..... + T Consensus 143 -----------------~~~~i~~~~p~~~~~v~dd~~~----~~~~~~v~~~~~~~~------~--------------- 180 (506) T protein:vir:94 143 -----------------NEEHLAKLDPLDTFVIYSTDVD----PKPIMAVRYHQIELV------D--------------- 180 (506) T ss_pred -----------------CeeEEEEEcccceEEEecCCCC----CceEEEEEEEeeeec------c--------------- Confidence 235567788887753 33211 112222222221000 0 Q ss_pred ccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC----CEEEecccCCCcCCCCCEEEEeeeeecCccc Q lcl|NC_021532. 230 YDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN----DVIVRLQSNPYPDGKPPFLVVPFNSIPFKLH 305 (663) Q Consensus 230 ~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g----~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~ 305 (663) .......+..+++|... .+.++.+ ..+....++| .|.+|++.+ +++.. T Consensus 181 -----------~~~~~~~~~~~~~yt~~----------~~~~~~~~~~~~~~~~~~~~~--~g~vPvv~~-----~n~~~ 232 (506) T protein:vir:94 181 -----------DNQVSTINYVPETWTAD----------TYTLYNPTPIMGKMQVDTTKP--ITTFPVVEF-----KNSNF 232 (506) T ss_pred -----------CCceeEEEEEEEEEeCc----------eEEEeccccCccceecccccc--CCccceEEe-----cCCCC Confidence 00000012223333211 1112221 1222222333 356676644 33445 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Ccc----------------------h----hhhccCCc Q lcl|NC_021532. 306 GEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQT----------------------N----RKKFLAGA 358 (663) Q Consensus 306 g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~----------------------d----~~~~~p~~ 358 (663) |.|.++.++++++.+|...|.+.+.+.-.++|.+++ .|.. ... + .....-++ T Consensus 233 ~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (506) T protein:vir:94 233 RLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLII-QGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDAN 311 (506) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHH-hcCccccccchhccccccccccccccccccchhHHHhhhhhcC Confidence 789999999999999999999999888777665443 1110 000 0 00111123 Q ss_pred ceEeCCCC--------CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 359 NFEFNGTA--------NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMN 430 (663) Q Consensus 359 vi~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~ 430 (663) .+.+.+++ ..+.++..+.-.......+..+...|...|++++...+..++..|+.| +..+.......... T Consensus 312 ~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A--ik~~~~~l~~k~~~ 389 (506) T protein:vir:94 312 MLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVA--MQYKVLGTVELAST 389 (506) T ss_pred eeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHH--HHHHHHHHHHHHHH Confidence 34443322 123344444445667788899999999999999876554333334444 44443344444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCC Q lcl|NC_021532. 431 IVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNE 510 (663) Q Consensus 431 ~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~ 510 (663) .-+.|.+ +++.++++++.++...... ..++.. +..+..+-+.+....+..+ .+..+++.+ T Consensus 390 k~~~~~~-~l~~~~~li~~~~~~~~~~-----------~~~d~~----~i~i~f~~~~p~d~~e~a~----~~~kl~g~i 449 (506) T protein:vir:94 390 KRRMFER-GLYARYQIISDIENSIHGD-----------WTFDPQ----ELTFTFRDNLPADNISQIK----ALVQAGATL 449 (506) T ss_pred HHHHHHH-HHHHHHHHHHHHHHhcCCc-----------cccccc----cceEEeCCCCCcCHHHHHH----HHHHHhccC Confidence 4444533 4555555555544332110 011111 1122222222222222222 222222222 Q ss_pred CcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 511 DPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVAS--INDKNARANENTIDAELK 585 (663) Q Consensus 511 ~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~--~~~~~a~~q~~~~~~~~~ 585 (663) + ...+... +....+....++.+. .++.+......... ....+........+.+.+ T Consensus 450 S----~et~~~~--lp~v~d~~~E~~ri~--------------~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 450 P----QKYLYQQ--LPGVTNPQDIVDMMK--------------EQSANGDYSFDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred C----hHHHHHh--CCCCCCHHHHHHHHH--------------HHHHHHhhcchhhcCCCcccCccccccccccCCC Confidence 2 2222111 111111111111111 00000000000000 000000000000000000 No 102 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.65 E-value=1.7e-15 Score=101.61 Aligned_cols=468 Identities=12% Similarity=0.062 Sum_probs=206.8 Q ss_pred CCCcHHHHHHHHHHH--------HHHHH-----HHHHHHHHHHHHHHHHhcCCcCCcccc-------CCCccccHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKAD--------MKAAD-----VLKQEQDSLISTWKAEYNGEPYGNEQK-------GKSAIVSRDIKKQ 60 (663) Q Consensus 1 ~~~~~~~~~~~l~~~--------~~~~~-----~~~~~~~~~~~~~~~~y~~~~~~~~~~-------g~s~~~~~~i~~~ 60 (663) |+|-. .|-..+++- ++... ..-.++......|+.||.|+++....+ .+.....|.-+ T Consensus 1 m~~~~-~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~-- 77 (517) T protein:vir:98 1 MKVIQ-RIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRK-- 77 (517) T ss_pred CchHH-HHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHH-- Confidence 55532 121111111 11100 112334556777889999987643221 11222222221 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCC--cchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccce Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPIT--WEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEE 138 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~--~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~ 138 (663) .++..+.+.+|+-.+.+.|-... ..+.......++.++.++. .|++...+.+++.+++..|.|++|++||. T Consensus 78 --~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~---- 150 (517) T protein:vir:98 78 --LSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ-HNKFIKNLSDYLEPTFALGGLTVRPYVDN---- 150 (517) T ss_pred --HHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHH-hccHHHHHHHHHHHHhhhCCEEEEEEEeC---- Confidence 22222223334444445554211 1112223345667777664 56777788899999999999999999972 Q ss_pred ecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChh Q lcl|NC_021532. 139 VTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLD 218 (663) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~ 218 (663) +.+.++.|++..||+-..-...+..|-+++ ..+.+... ...+|-.++ T Consensus 151 ------------------------------~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~-~~~~~~~~--~~~~Yt~lE 197 (517) T protein:vir:98 151 ------------------------------GEIEFSWALANAFYPLRSNSNGISEGVMKS-VTTKVIGN--KTVYYTLLE 197 (517) T ss_pred ------------------------------CeeEEEEEcCCeeEEEEecCCCeEEEEEEE-EEEEeecC--CceEEEEEE Confidence 234578888888885211111222232221 22222111 000111000 Q ss_pred hhhhccchhhhccccccccccccccccceEEE-EEEEEEeeecCCceeEEEEEEEECCEEEecccC-CCc-CCCCCEEEE Q lcl|NC_021532. 219 KLAKTSGEDFDYDSPDDTEFQFSDAPRKKLII-YEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSN-PYP-DGKPPFLVV 295 (663) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v-~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~-p~~-~~~~Pf~~~ 295 (663) .+.+....+. + ....| .+.|.......-|....-.-+|.+ | .+. .+. ..+++|+++ T Consensus 198 -----------~H~~~~~~~~--~---~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~---l--~~~~~~~g~~~Plf~y~ 256 (517) T protein:vir:98 198 -----------FHEWEKTEEG--E---SLYVITNELYKSDNEGEIGKRIPLEELYEG---M--QEKTYIQGLSRPLFNYL 256 (517) T ss_pred -----------EEecCceecc--C---CcEEEEEEEEecCCCccccccccccccccC---C--CcceeECCCCcceEEEe Confidence 0000000000 0 00011 122211100000110000000110 0 000 001 123345443 Q ss_pred ee----eeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcch-hhhccCC-------cceE-e Q lcl|NC_021532. 296 PF----NSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTN-RKKFLAG-------ANFE-F 362 (663) Q Consensus 296 ~~----~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d-~~~~~p~-------~vi~-~ 362 (663) +. ..+.++++|.|++.++++..+.+|..++++++.+.+ +..++.++.+.+.... .....++ .++. + T Consensus 257 ~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~ 335 (517) T protein:vir:98 257 KPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSI 335 (517) T ss_pred cCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeec Confidence 22 123367899999999999999999999999988877 5668889888874221 1111111 1211 1 Q ss_pred CCCC--CccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 363 NGTA--NDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLV 440 (663) Q Consensus 363 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~ 440 (663) ..+. ..+..+++.-....+.+.++.+.+.+....|++....|..+.+ .+||++|....+..-.....+.+.+. ..+ T Consensus 336 ~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~-~kTATEi~s~~~~~~~t~~~~~~~~~-~aL 413 (517) T protein:vir:98 336 RMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS-MKTATEIVSENDLTYRTRNDHVYEVE-QFI 413 (517) T ss_pred cCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 2211 1222223222334677788888899999999999999987655 36899998765555555566666674 456 Q ss_pred HHHHHHHHHHHHHh--cCCceEEEEecCeeeccchhhcCCceEEEEee--cccchhHHHHHHHHHHHHHhccCCCcchhH Q lcl|NC_021532. 441 KPLMRKWMAYNAEF--LEEEEVIRVTNDKFVPIRKDDLSGRIDIDISI--STAEDNAAKSQELSFLLQTLGPNEDPKIRR 516 (663) Q Consensus 441 ~~l~~~~~~li~q~--~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~--~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~ 516 (663) +.+.+.++.+..-| +... .....++.|.= +......+..+.+.++.+ ++.+++.... T Consensus 414 ~~lv~~i~~l~~~~~~~~~~-----------------~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~--aG~ms~~~~i 474 (517) T protein:vir:98 414 KGLVISVLELAKTYKLFGGE-----------------IPSAEHIGVDFDDGVFQDRSALLRFYGQAKT--FGFIPTVEAI 474 (517) T ss_pred HHHHHHHHHHHHHHhhcCCC-----------------CCCCcceEEEcCCCCCCCHHHHHHHHHHHHh--cCCCCHHHHH Confidence 67777776655433 2111 01122333332 222222233332222221 1223322211 Q ss_pred HHHHHHHHhhhhhh--hhhhhhhhh---cchhhHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 517 DIMADIMDLMRMPE--QAKRMREYE---PKPDPVQEKIRQLELENLMLENQMLVASIN 569 (663) Q Consensus 517 ~~l~~~~~l~~~~e--~~~~l~~~~---~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~ 569 (663) . .+-++.+ ..+.+.++. ...++....+.+. ..+--+.+ T Consensus 475 ~------~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~---------~~~~gd~e 517 (517) T protein:vir:98 475 Q------RIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQ---------KRMFGDEE 517 (517) T ss_pred H------HhCCCChHHHHHHHHHHHHhccccCCCCcccccc---------CCCCCCCC Confidence 1 1112111 111111111 1111100000000 00000000 No 103 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.65 E-value=4.9e-14 Score=93.54 Aligned_cols=427 Identities=9% Similarity=0.031 Sum_probs=196.1 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-----------ccCCC--ccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-----------QKGKS--AIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-----------~~g~s--~~~~~~i~~~v~~~~~~l~ 69 (663) |+-+.|...+... ..+......+.+||.|++.-+. ..+++ +++.|..+..|+..+++ T Consensus 1 l~~~~i~~~i~~~--------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~y-- 70 (451) T protein:vir:10 1 MELEKIRAIISAD--------AARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASY-- 70 (451) T ss_pred CCHHHHHHHHHHH--------HHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhh-- Confidence 6666666655442 2334456778889999753211 11112 45556666666655554 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) +++..+.+.+ .+|.+..+ ++++++ .++.......+.++++++|.|+..+++|.+.... T Consensus 71 --l~G~p~~~~~----~~~~~~~~----~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~---------- 128 (451) T protein:vir:10 71 --MFTYPVLFDI----DNNKELNE----KVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGE---------- 128 (451) T ss_pred --eecccceeec----CCcHHHHH----HHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccc---------- Confidence 4554433321 23333333 344433 2556666677889999999999999886432111 Q ss_pred ccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) ....+.+.+..++|.++|+ |.+..+ +..+.+ +.+....+- . T Consensus 129 ---------------~~~~~~~~~~~i~p~~~~~vydd~~~~---~~~~~i-r~~~~~~~~--~---------------- 171 (451) T protein:vir:10 129 ---------------QVTNQTFKYGVVNTEEIIPIYRNGIER---ELEAVI-RYYIQLEDV--K---------------- 171 (451) T ss_pred ---------------cccccceeEEEEcccceEEEEcCCCCC---ceEEEE-EEEEeeecc--c---------------- Confidence 1122345567788888863 332211 222222 222111110 0 Q ss_pred hhccccccccccccccccceEEEEEEEEEeeecCCceeEEEE--EEEECCEEEecccCCCcCCCCCEEEEeeeeecCccc Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIV--CAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLH 305 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~--~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~ 305 (663) +-.....+..+|+|.. +.+..+.. .-..++.++ ...-|...|.+|++. ..++.. T Consensus 172 -------------~~~~~~~~~~~e~yt~-----~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~vPvv~-----~~nn~~ 227 (451) T protein:vir:10 172 -------------GQIQKQAYTYVEFWTD-----KILDKYKFFGVSCCGSQIE-HITVQHRFNSVPFVE-----FSNNIK 227 (451) T ss_pred -------------ccccceEEEEEEEEeC-----CeEEEEEecccCccccccc-cccccCCCCeeeEEE-----eccCCC Confidence 0000112334455532 11111100 001112222 112222334555543 344556 Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-C--cchhhhccCCcceEeCCC----CCccccccCcccc Q lcl|NC_021532. 306 GEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-D--QTNRKKFLAGANFEFNGT----ANDFWHGSYNAIP 378 (663) Q Consensus 306 g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~--~~d~~~~~p~~vi~~~~~----~~~~~~~~~~~~~ 378 (663) |.|.++.++++++.+|.+.|.+.+.+.-.++|.+++ .|.- . .+.......++++.+.+. +....++..+.-. T Consensus 228 ~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~ 306 (451) T protein:vir:10 228 KQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYIL-ENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPT 306 (451) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCH Confidence 889999999999999999999999999999987655 3321 1 122233455566666432 2345555555555 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) ..+...++.+...|...|++++.+.+..++ .|+.| +..+...........-+.|. ..++.++.++..+.... T Consensus 307 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn-~Sg~A--lk~~~~~l~~k~~~k~~~f~-----~~l~~~~~li~~~~~~~ 378 (451) T protein:vir:10 307 EARKIILEILKKQIYESGQGLQQDTENFGN-ASGVA--LKFFYRKLELKSGLLETEFR-----TSFDKLIKAILYFLGVT 378 (451) T ss_pred HHHHHHHHHHHHHHHHHhCccccccccccc-ccHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHhCCC Confidence 667788999999999999999876553332 24444 33333233333333333342 33345555555554211 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREY 538 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~ 538 (663) ++ .++.+..+-+.+....+..+ .+..+... +....+... +....+....+ T Consensus 379 --------d~---------~~i~i~f~~~~p~n~~e~~~----~~~kl~g~----iS~et~~~~--~p~v~d~~~e~--- 428 (451) T protein:vir:10 379 --------DY---------KKIQQTYTRNMMSNDLEDAD----IATKSVGI----IPTKIILRH--HPWVDDVEEAE--- 428 (451) T ss_pred --------Cc---------cceeEEecCCCCCCHHHHHH----HHHHHhcc----CchHHHHHh--CCCCCCHHHHH--- Confidence 01 01222222222222222222 22222221 112222111 11111100000 Q ss_pred hcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 539 EPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTID 581 (663) Q Consensus 539 ~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~ 581 (663) +....++. .+.++.+.... ..-+ T Consensus 429 -----------~~~~ee~~-~~~~~~~~~~~--------~~~~ 451 (451) T protein:vir:10 429 -----------KLYLEEKK-IQASKVSDDYN--------NFTE 451 (451) T ss_pred -----------HHHHHHHH-HHHHHHHhhcC--------CCCC Confidence 00000000 00000000000 0000 No 104 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.64 E-value=1.4e-14 Score=96.54 Aligned_cols=392 Identities=10% Similarity=0.015 Sum_probs=181.0 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc--------cccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN--------EQKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~--------~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |+...| ..|.+.+.+ .......+.+||.|++.-. +-+.+-..+.|-+.-.|+++...+. T Consensus 1 ~~~~~i-~~L~~~~~~-------~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----- 67 (409) T protein:vir:16 1 MTEKGI-GYLRFKLSV-------HKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----- 67 (409) T ss_pred CCHHHH-HHHHHHHHH-------HhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcc----- Confidence 555444 444444333 2244566778999876421 0111112333444444454443321 Q ss_pred CCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNE 154 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 154 (663) |.+.+.+|.. +..++. .|+.......+.++++++|.|++.|+=+. T Consensus 68 ------~~Gf~~~d~~--------l~~i~~-~N~ld~~~~~~~~~al~yG~sf~~v~~~~-------------------- 112 (409) T protein:vir:16 68 ------FREFENDDFT--------VNEIFE-ENNPDIFFDSTVLSALIASCSFTYISKGE-------------------- 112 (409) T ss_pred ------cccccCcchH--------HHHHHH-hcChhHHHHHHHHHHHHhCceeEEEecCC-------------------- Confidence 2222233322 344453 56766677889999999999999875221 Q ss_pred cccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccc Q lcl|NC_021532. 155 TVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDS 232 (663) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~ 232 (663) .+.|.+..++|.+++ |||...+ + ..+.+.+- .+ T Consensus 113 -------------dg~~~i~~~sP~~~~~i~D~~~~~-~----~~a~~~~~-----------~d---------------- 147 (409) T protein:vir:16 113 -------------NDAVRLQVIEATNATGIIDPITGL-L----TEGYAVLE-----------RD---------------- 147 (409) T ss_pred -------------CCceEEEEEcccceEEEeeccccc-c----eeeeEEEE-----------ec---------------- Confidence 123556777887664 5553221 1 01111110 00 Q ss_pred cccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH-H Q lcl|NC_021532. 233 PDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA-E 311 (663) Q Consensus 233 ~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~-~ 311 (663) .. .. ...+.+|.. + +.+.. +-++..-...++|+ |.+|++.++..++.++++|.|-+ + T Consensus 148 ----~~------~~-~~~~~~~~~-----~---~~~~~-~~~~~~~~~~~~~~--g~vPvV~f~n~~~~~~~~G~seI~~ 205 (409) T protein:vir:16 148 ----EN------NN-VVLEAHFLP-----D---RTDYY-YRDSRNNISIANPT--GNPLLVPIIHRPDAVRPFGRSRITR 205 (409) T ss_pred ----CC------Cc-eEEEEEEec-----C---cEEEE-EecCccccceecCC--CCcceEEecccccccccCCccccch Confidence 00 00 001111110 0 00000 00111111234555 78999999999999999999865 8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEEe---eccccCcchhhhccCCcceEeCCC--CCccccccCcccc-HHHHHHH Q lcl|NC_021532. 312 MIGDNQKVKTAVIRGIIDNMAQSNNGQVAI---RKGALDQTNRKKFLAGANFEFNGT--ANDFWHGSYNAIP-SSAFDMI 385 (663) Q Consensus 312 ~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~---~~~~i~~~d~~~~~p~~vi~~~~~--~~~~~~~~~~~~~-~~~~~~~ 385 (663) .++++|+.+|+.+..++......++|+..+ ++++ ++.+.....++.++.+... +..+...+++.-. ..+...+ T Consensus 206 ~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~-~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l 284 (409) T protein:vir:16 206 SGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA-EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQL 284 (409) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC-CccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHH Confidence 899999999999999999999999988654 2221 2233344556777776432 2223332333221 2344555 Q ss_pred HHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEec Q lcl|NC_021532. 386 SLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTN 465 (663) Q Consensus 386 ~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~ 465 (663) ..+...+-.+||+|....|..++. +.+|.++......-........+.|.. ..+.++++++.+.-..-. T Consensus 285 ~~~~~~~a~~s~lP~~~lg~~~~N-psSa~Ai~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~~~~~~~~~--------- 353 (409) T protein:vir:16 285 RTAAAGFAGETGLTLDDLGFVSDN-PSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYLAACLRDDVPY--------- 353 (409) T ss_pred HHHHHHHhhhcCCCHHHcccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCc--------- Confidence 666666667889999999965432 223333443222222222223333432 233344433333211100 Q ss_pred CeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhh Q lcl|NC_021532. 466 DKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQA 532 (663) Q Consensus 466 ~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~ 532 (663) .++.+. +..+.-.-.......+..+.. .....+.+..++......+... .++.... T Consensus 354 ------~~~~~~-~~~v~W~~~~~~~~~s~a~~a-Da~~Kl~~a~~~~~~~~v~~~~---~g~~~~d 409 (409) T protein:vir:16 354 ------LREQFS-KTKPKWEPLFEADASMLSLIG-DGAIKLNQAIPEFINKDTIRDL---TGIKGAE 409 (409) T ss_pred ------cchhhc-cceEEecCCCCcchhhHHHHH-HHHHHHHhhcccccchhHHHHh---ccCCCCC Confidence 011110 011111100011111122221 2222222222222222333322 2332211 No 105 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.63 E-value=2.2e-15 Score=100.94 Aligned_cols=452 Identities=13% Similarity=0.045 Sum_probs=187.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCC--------CccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGK--------SAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~--------s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |-. .++++..|.+.+.+ +.....++.+||+|++. ++..|. -+++.|-..-.|+...+.+. T Consensus 1 ~~t-~~~~i~~L~~~~~~-------~~~r~~~l~~Yy~G~~~-i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~--- 68 (480) T protein:vir:78 1 MTT-YHEHVERLQGLLAR-------DLPNLLEAEAYRNGTRR-LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--- 68 (480) T ss_pred CCC-HHHHHHHHHHHHHH-------HHHHHHHHHHHHhcccc-ccccccccchhHhhhhhhcchHHHHHHHHHhhhc--- Confidence 332 44455556555432 33446677889999764 222221 12445555555555555441 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) +.+ | +.++|.+..+.+ ..+++ .|+....+..++++++++|.|++.|+-..... T Consensus 69 ~~g-----~--~~~~d~~~~~~l----~~i~~-~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~--------------- 121 (480) T protein:vir:78 69 IEG-----F--RISEDSEGLEEL----WNWWQ-ANDLDEESVLGHDDSLTFGRSYITVSHPDVES--------------- 121 (480) T ss_pred cCc-----e--ecCCCchhHHHH----HHHHH-hcCHHHHHHHHHHHHhhcCceEEEEecCcccc--------------- Confidence 111 1 233444443333 34443 46666777889999999999998875210000 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) ....+.+.+..++|..+| |||...+. ..+.++ ++.+.++ T Consensus 122 ------------~d~~g~~~i~~~~p~~~~~~~D~~~~~~---~~~~i~-~~~~~~~----------------------- 162 (480) T protein:vir:78 122 ------------GDPAGIPLIRVESPLYMYAELDPRNTRR---VTRAVR-LYTTRDD----------------------- 162 (480) T ss_pred ------------CCCCCeeEEEEEcccceEEEEcCCCccc---eEEEEE-EEEeecC----------------------- Confidence 011245667788888876 55543222 122221 1111000 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECC----EEEecccCCCcCCCCCEEEEeeeeecCcccC Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIND----VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHG 306 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~----~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g 306 (663) . ..+..+++|.. +. .+++...++ .+......|...|.+|++.++..+..+.++| T Consensus 163 -------~-------~~~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G 220 (480) T protein:vir:78 163 -------V-------AVPDRATLYLP-----DE---TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) T ss_pred -------C-------CceEEEEEEeC-----Ce---EEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccC Confidence 0 01122333322 00 111111111 0111222334457899999999899999999 Q ss_pred CChHHH-HHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Ccc--h----hhhccCCcceEeCCCCCccccccCcccc Q lcl|NC_021532. 307 EANAEM-IGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQT--N----RKKFLAGANFEFNGTANDFWHGSYNAIP 378 (663) Q Consensus 307 ~g~~~~-~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~~--d----~~~~~p~~vi~~~~~~~~~~~~~~~~~~ 378 (663) .|-+.. ++++++.+|+.++.+.+.+...+.|...+ .|.- +.. + ......|.++...+ .......++... T Consensus 221 ~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 297 (480) T protein:vir:78 221 RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGVTTDELTNDGENTTLDIYYGRILTLAS--EAAKISEFKAAE 297 (480) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hcCCccccccccccchhhhhhhhhccCCC--CCceEEecCccC Confidence 998875 89999999999999999999888887654 2321 110 0 01223344443332 223333333321 Q ss_pred -HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 379 -SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 379 -~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) ..+...+......+-.++|+++...|..+.. +.++.++...............+.|. .-++.++.++..+... T Consensus 298 ~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n-~~Sg~Alk~~~~~l~~ka~~~~~~f~-----~~l~~~~~l~~~~~g~ 371 (480) T protein:vir:78 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFG-----GAWERAMRIAMQIMGR 371 (480) T ss_pred HHHHHHHHHHHHHHHhcccCCChHHhccccCc-chHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHcCC Confidence 2233444445555555688988888864321 11222233322222222222233332 2233444555554321 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhhhhhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQAKRMR 536 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~~~l~ 536 (663) .. ..++. ++++...-.......+....+..+.+.... .+....+... .++. +..+.++ T Consensus 372 ~~-------------~~~~~-~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~----~~s~et~~~~---lg~~~d~~~~~~ 430 (480) T protein:vir:78 372 EV-------------TEEYT-RLETVWRDPSTPTVAAKADAVSKLYANGQG----PIPKEQARID---LGYTATQREQMR 430 (480) T ss_pred Cc-------------cccce-eeeEEecCCCCCCHHHHHHHHHHHHHhccc----cCCHHHHHhc---CCCCHhHHHHHH Confidence 10 00110 122222211122222233333333222111 1222222211 1211 1011111 Q ss_pred hhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--HHHHHHHHHHHHHHHH Q lcl|NC_021532. 537 EYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANE--NTID--AELKRSKAAVEKAKAR 597 (663) Q Consensus 537 ~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~--~~~~--~~~~~~~~~~e~~~~q 597 (663) +.. .++.+.....+.+... ..+.... ...+ -+.+.+..+.=..+.+ T Consensus 431 ~~~--------------~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 431 DWD--------------KQETEDMIDTLYSTTK-AQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHH--------------HHHHHHHHHHhhcccc-ccCCCCCCCCCCCCCCccccccCCCCcccCC Confidence 000 0000000000000000 0000000 0000 0000000000000000 No 106 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.63 E-value=3.5e-15 Score=99.85 Aligned_cols=458 Identities=10% Similarity=0.011 Sum_probs=187.3 Q ss_pred CCCc------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc--------------ccCCC--ccccHHHH Q lcl|NC_021532. 1 MKIN------KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE--------------QKGKS--AIVSRDIK 58 (663) Q Consensus 1 ~~~~------~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~--------------~~g~s--~~~~~~i~ 58 (663) |.+. -..+.......+++....+ ....+.+..+||.|++.-+. .++++ +++.|-.. T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~ 90 (503) T protein:vir:59 13 EELNEIIVESAKEIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHK 90 (503) T ss_pred HhHHHhhhhhhhhccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHH Confidence 1110 0000011111111111111 23456788889998753111 11122 34555555 Q ss_pred HHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccce Q lcl|NC_021532. 59 KQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEE 138 (663) Q Consensus 59 ~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~ 138 (663) ..|+...+++ ++... .+. .+|++.. ++++.++. ++.......+.++++++|.|++.+++|.+ T Consensus 91 ~ivd~~~~yl----~g~~~--~~~---~~d~~~~----~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d~d--- 152 (503) T protein:vir:59 91 LFVDQKTQYL----VGEPV--TFT---SDNKTLL----EYVNELAD--DDFDDILNETVKNMSNKGIEYWHPFVDEE--- 152 (503) T ss_pred HHHHHHHhhh----hcCCe--eec---cCcHHHH----HHHHHHHh--cCHHHHHHHHHHHHhhCCeEEEEEeecCC--- Confidence 5566555544 44332 232 2444333 34555443 56667778899999999999999988632 Q ss_pred ecccccccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcC Q lcl|NC_021532. 139 VTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKN 216 (663) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~ 216 (663) +.+.+..++|..+| ||+... ....++ .+.+.+... T Consensus 153 ------------------------------g~~~i~~~~p~~~~~i~d~~~~---~~~~~~-ir~~~~~~~--------- 189 (503) T protein:vir:59 153 ------------------------------GEFDYVIFPAEEMIVVYKDNTR---RDILFA-LRYYSYKGI--------- 189 (503) T ss_pred ------------------------------CceEEEEEccceeEEEEeCCCC---CceEEE-EEEEEEecC--------- Confidence 23567788888876 333221 122222 222221100 Q ss_pred hhhhhhccchhhhccccccccccccccccceEEEEEEEEEe-----eecCCceeEEEEEE-EECCEEEecccCCCcCCCC Q lcl|NC_021532. 217 LDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNY-----DVDGDGIAEPIVCA-WINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~-----~~~~~g~~~~~~~~-~~g~~~l~~~~~p~~~~~~ 290 (663) ....+..+|+|... ...+++........ ......+.....|...+.+ T Consensus 190 ---------------------------~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 242 (503) T protein:vir:59 190 ---------------------------MGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRV 242 (503) T ss_pred ---------------------------CCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCcc Confidence 00012233333221 00111100000000 0000001112223344566 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc-ch-hhhccCCcceEeCCCCC Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQ-TN-RKKFLAGANFEFNGTAN 367 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~-~d-~~~~~p~~vi~~~~~~~ 367 (663) ||+.+ .++.+|.|.+..++++++.+|.+.+.+.+.+...++|.+.+ .|. ... .+ ......++++.+..+++ T Consensus 243 Piv~~-----~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (503) T protein:vir:59 243 PIIPF-----KNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVL-KNYDGENPKEFTANLRYHSVIKVSGDGG 316 (503) T ss_pred ceEEe-----cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEe-ecCCccccchhhhhhhcccceeccCCCc Confidence 66544 45567999999999999999999999999999999987665 332 111 11 12344556676665544 Q ss_pred ccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 368 DFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKW 447 (663) Q Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 447 (663) ..++..+.-.......++.+.+.|...+++++...+..++..|++| +...............+.|. .+++.+++++ T Consensus 317 -~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A--i~~~~~~l~~k~~~~~~~~~-~~l~~~~~~i 392 (503) T protein:vir:59 317 -VDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPA--LENLYALLDLKANMAERKIR-AGLRLFFWFF 392 (503) T ss_pred -ceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHH--HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 4555444434566778899999999999998876553333334444 33333333333333333442 2344444444 Q ss_pred HHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhh Q lcl|NC_021532. 448 MAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMR 527 (663) Q Consensus 448 ~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~ 527 (663) +.++...... .+. .. .++.+...-..+....+..+.+..+.+ +..++ ...+... +.. T Consensus 393 ~~~~~~~~~~---------~~~-----~~-~~i~i~f~~~~p~d~~~~~~~~~kl~~--~GiiS----~et~l~~--l~~ 449 (503) T protein:vir:59 393 AEYLRNTGKG---------DFN-----PD-KELTMTFTRTRIQNDSEIVQSLVQGVT--GGIMS----KETAVAR--NPF 449 (503) T ss_pred HHHHHhccCc---------ccc-----cc-cceeEEeCCCCCCCHHHHHHHHHHHHh--CCCCc----hHHHHHh--CCC Confidence 4433221110 000 00 012222222222222222222222211 11111 1111111 111 Q ss_pred hhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 528 MPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEAD 604 (663) Q Consensus 528 ~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~ 604 (663) ..+....++ +...++... .+........ .....+.. +-....-+..+... .++. T Consensus 450 v~d~~~E~~--------------ri~~E~~~~--~~~~~~~~~~----~~~~~~~~-~~~~~~~~~~~~~~--g~~~ 503 (503) T protein:vir:59 450 VQDPEEELA--------------RIEEEMNQY--AEMQGNLLDD----EGGDDDLE-EDDPNAGAAESGGA--GQVS 503 (503) T ss_pred CCCHHHHHH--------------HHHHHHHHH--HhhhccccCc----cCCCCCCC-cCCCCCCcccCCCC--CCcC Confidence 110000000 000000000 0000000000 00000000 00000000000000 0000 No 107 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.62 E-value=9.4e-14 Score=92.00 Aligned_cols=469 Identities=10% Similarity=0.039 Sum_probs=196.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc--------------cCC--CccccHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ--------------KGK--SAIVSRDIKKQSEWQ 64 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~--------------~g~--s~~~~~~i~~~v~~~ 64 (663) .+|.-+.+-+.|..++.... .+.+.+.....++||.|++.-+.. ..+ .+++.|-....|+.. T Consensus 6 ~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~ 83 (537) T protein:vir:78 6 LNKPIDQLGGLLNTEITTYM--ASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQL 83 (537) T ss_pred ccccHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHH Confidence 67777777777777766643 233445667778899987532111 011 134555544455555 Q ss_pred HHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccc Q lcl|NC_021532. 65 HATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGE 144 (663) Q Consensus 65 ~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~ 144 (663) +++ ++++.+. |.+-..++ +....+++.++. ++......+...++.++|.|+..+++|.. T Consensus 84 ~~y----l~G~Pv~--~~~~d~~~----~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~de~--------- 142 (537) T protein:vir:78 84 AQY----LLSNGVE--VKVKDEDN----TQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTTSE--------- 142 (537) T ss_pred hhh----hcccCce--eecCcchh----HHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeecCC--------- Confidence 444 4565433 33322222 233444555442 45556667888999999999999988632 Q ss_pred ccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhcc Q lcl|NC_021532. 145 AVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 224 (663) +.+.+.+++|.++|+=.+-.. +...+++.......+. T Consensus 143 ------------------------~~~~~~~i~p~~~~pv~d~~~---~~~~~~~~y~~~~~~~---------------- 179 (537) T protein:vir:78 143 ------------------------GKLKFQTVDGLTLIPVFDDYG---VLKMIIRWYSEIRYST---------------- 179 (537) T ss_pred ------------------------CceEEEEEccceeEEEEcCCC---CceeEEEEEeeeeccc---------------- Confidence 234566788888753221111 1111222111110000 Q ss_pred chhhhccccccccccccccccceEEEEEEEEE-----eeecCCceeE-------------EEEEEEEC----CEEEeccc Q lcl|NC_021532. 225 GEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAE-------------PIVCAWIN----DVIVRLQS 282 (663) Q Consensus 225 ~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~-------------~~~~~~~g----~~~l~~~~ 282 (663) .+.....+..+|+|.. +...+.+... .++..+.. +..-.... T Consensus 180 ----------------~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 243 (537) T protein:vir:78 180 ----------------KQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGY 243 (537) T ss_pred ----------------cccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccc Confidence 0000111223333321 1111111110 00111100 00000111 Q ss_pred CCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc--chhhhccCCcce Q lcl|NC_021532. 283 NPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ--TNRKKFLAGANF 360 (663) Q Consensus 283 ~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~--~d~~~~~p~~vi 360 (663) .+.+ +||-.+|+.+-.++-+|.|.++.++++++.+|.+.|.+.+.+...++|.+++--..++. ......+-.+++ T Consensus 244 ~~~~---~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i 320 (537) T protein:vir:78 244 QVLG---RSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMI 320 (537) T ss_pred cccc---cCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCce Confidence 1111 23333334444556678999999999999999999999999999999776653222222 112233444567 Q ss_pred EeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 361 EFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLV 440 (663) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~ 440 (663) .+.+.++.+.++..+.-.......++.+.+.|...|.+++......++ .|+.| +..+...........-+.| T Consensus 321 ~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn-~SGvA--lk~~~~~l~~ka~~ke~~f----- 392 (537) T protein:vir:78 321 GVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGN-VTNVV--IKSRYTLLAMKARKMETSL----- 392 (537) T ss_pred eecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccC-CcHHH--HHHHHhhHHHHHHHHHHHH----- Confidence 776555556776666555667788999999999888776654332222 24444 3333333333333333334 Q ss_pred HHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHH Q lcl|NC_021532. 441 KPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMA 520 (663) Q Consensus 441 ~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~ 520 (663) +..++.++.+|..++.- .|.. .++.. ..++..+-..+....+..+.+..+.+ .. .+....+. T Consensus 393 ~~~l~~~~~~i~~~~~~------~~~~--~~d~~----~i~i~f~~~~P~n~~e~a~~~~~l~~---~g---iiS~eT~l 454 (537) T protein:vir:78 393 RKVLRWCADMVVSDIAL------RGLG--EYDSN----DICFEIEPHVLANELDIATTRKTEAE---TE---ALKIGNIM 454 (537) T ss_pred HHHHHHHHHHHHHHHhh------cCCc--ccccc----eeeEEeccCCCCCHHHHHHHHHHHHh---cC---cchHHHHH Confidence 23344444444444321 1110 01111 12222232222222222222211110 00 01111111 Q ss_pred HHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H Q lcl|NC_021532. 521 DIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVE-------K 593 (663) Q Consensus 521 ~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e-------~ 593 (663) ..+..-.-++..+..++ +..............++++......+.+......... . T Consensus 455 ~~~p~vdd~e~ek~~~e------------------e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 516 (537) T protein:vir:78 455 TVAPRIGDDETLKLIAE------------------ELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDP 516 (537) T ss_pred HhCCCCCCHHHHHHHHH------------------HHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCc Confidence 00000000000000000 0000000000000000000000000000000000000 0 Q ss_pred ----------------HHHHH Q lcl|NC_021532. 594 ----------------AKARK 598 (663) Q Consensus 594 ----------------~~~q~ 598 (663) +--+. T Consensus 517 ~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 517 NQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred cCCCCCCCCCCCCCCccCCCC Confidence 00000 No 108 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.61 E-value=1.1e-14 Score=97.14 Aligned_cols=392 Identities=13% Similarity=0.044 Sum_probs=176.2 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcCCc--------cccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHH Q lcl|NC_021532. 20 DVLKQEQDSLISTWKAEYNGEPYGN--------EQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDS 91 (663) Q Consensus 20 ~~~~~~~~~~~~~~~~~y~~~~~~~--------~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~ 91 (663) -+++ ......+.+||.|++.-. +-+....++.|-+.-.|+++...+. |.+.+.+|.. T Consensus 1 l~~~---~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~-----------~~Gf~~~d~~- 65 (410) T protein:vir:95 1 MNLY---QSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI-----------FRAFANDDFN- 65 (410) T ss_pred CCcc---hhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc-----------cccccCCCch- Confidence 2222 233555678999976421 1111222344554445554444331 2222333332 Q ss_pred HHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccc Q lcl|NC_021532. 92 AEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQP 171 (663) Q Consensus 92 Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (663) +..++. .|+.......+.++++++|+|++.|+=+. .+.| T Consensus 66 -------l~~i~~-~N~ld~~~~~~~~~al~~G~sf~~v~~~~---------------------------------d~~~ 104 (410) T protein:vir:95 66 -------VTEIFD-RNNPDIFFDSAILSALIGSCSFVYISKGE---------------------------------DDEV 104 (410) T ss_pred -------HHHHHh-hcChHHHHHHHHHHHHHhCceeEEEecCC---------------------------------CCce Confidence 344453 57777777889999999999998874211 1235 Q ss_pred eeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccccccccceEE Q lcl|NC_021532. 172 TARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLI 249 (663) Q Consensus 172 ~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~ 249 (663) .+..++|.+++ |||... . ...+.+.+-. +.. .... T Consensus 105 ~i~~~sP~~~~~i~Dp~~~----~-~~~al~~~~~----------------------------~~~----------~~~~ 141 (410) T protein:vir:95 105 RLQVIESSNATGVIDPITG----L-LVEGYAVLAR----------------------------DDY----------NRPT 141 (410) T ss_pred EEEEEcccceEEEEeCCCC----c-eEEEEEEEEe----------------------------cCC----------CeEE Confidence 56778888775 555211 1 1111111100 000 0011 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCCh-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEAN-AEMIGDNQKVKTAVIRGII 328 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~-~~~~~d~Q~~~N~~~~~~~ 328 (663) ...+|.. + .+.++.++..-...++|+ |.+|++.++..++.++++|.|- .+.++++|+.+|+.+..++ T Consensus 142 ~~~~~~~-----~-----~~~~~~~~~~~~~~~~~~--g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~ 209 (410) T protein:vir:95 142 LEAYFEP-----N-----ATHFIPKDGEPYSVTNET--GIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERAD 209 (410) T ss_pred EEEEEeC-----C-----cEEEEeeCCccccccCCC--CCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHH Confidence 1112210 0 000111111112235555 7899999999999999999984 5899999999999999999 Q ss_pred HHHHhcCCCcEEeecccc---CcchhhhccCCcceEeCCCCC--ccccccCcccc-HHHHHHHHHHHHHHHHHhCCChHH Q lcl|NC_021532. 329 DNMAQSNNGQVAIRKGAL---DQTNRKKFLAGANFEFNGTAN--DFWHGSYNAIP-SSAFDMISLMNNEIESITGTKSFS 402 (663) Q Consensus 329 ~~~~~~~~~~~~~~~~~i---~~~d~~~~~p~~vi~~~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~tGi~~~~ 402 (663) ......++|+..+ .|.- +..+.....+++++.+..+.+ .+...+++.-. ..+...+..+...+-.+||+|... T Consensus 210 ~~~e~~a~pqr~i-~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~ 288 (410) T protein:vir:95 210 ITAEFYSWPQKYI-LGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDD 288 (410) T ss_pred HHHHHhcchhhee-eccCCCCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHH Confidence 9999999988654 1221 122233455677777654322 23332332221 223455556666666778999999 Q ss_pred cCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-eEEEEecCeeeccchhhcCCceE Q lcl|NC_021532. 403 GGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE-EVIRVTNDKFVPIRKDDLSGRID 481 (663) Q Consensus 403 ~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~-~~iri~~~~~v~i~~~~~~~~~d 481 (663) .|..++. +.+|.++......-........+.|.. ..+.++++++.+.-..-..+ ...++. -.|-++ .+ T Consensus 289 lg~~~~N-psSa~Al~a~~~~L~~ka~~k~~~fg~-~l~~~~rla~~i~~~~~~~~~~~~~~~-v~W~p~--------~d 357 (410) T protein:vir:95 289 LGFVSDN-PSSVEAIKASHENLRLAGRKAQRSLGA-GLLNVAYVAACLRDEFRYTRSQFVRTA-VKWEPL--------FE 357 (410) T ss_pred hccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCCcccccceee-EEeeec--------CC Confidence 9965432 123333443222222222333344433 33444554444332211100 000000 001110 00 Q ss_pred EEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHH Q lcl|NC_021532. 482 IDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLE 560 (663) Q Consensus 482 ~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~ 560 (663) . + .....+....+.-+.+. .+.......+...+-+..- +..+...+ .+.++.+ T Consensus 358 ~----~-~~s~a~~aDa~~Kl~~a----~~g~~~~~~~~~~lg~~~~-~~~~~~~~----------------e~~~~g~ 410 (410) T protein:vir:95 358 A----D-ANTMTMIGDGVVKLNQA----LPGYINAETIRDLTGIAGD-MSAKPVVS----------------EGGSNGE 410 (410) T ss_pred c----c-hhhHHHHHHHHHHHHHh----ccCCccHHHHHHhcCCChH-HHHHHHHH----------------HHHhCCC Confidence 0 0 00111112222212221 1111222222222211110 00000000 0000000 No 109 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.61 E-value=5.4e-16 Score=104.29 Aligned_cols=476 Identities=13% Similarity=0.090 Sum_probs=212.5 Q ss_pred CCCcHHHH---HHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcCCcc--ccCCCccc--cHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAEL---LSALKADMKAADVLKQ-EQDSLISTWKAEYNGEPYGNE--QKGKSAIV--SRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~---~~~l~~~~~~~~~~~~-~~~~~~~~~~~~y~~~~~~~~--~~g~s~~~--~~~i~~~v~~~~~~l~~~~ 72 (663) |--+.... ...+...-..+....+ .+...+..|.+||.|.+|.++ -+|+-... .|.-+..|++++ .| T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~r~~V~~~~-----~~ 75 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPSGRKIVEAVH-----RF 75 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCceeeeccchHHHHHHHHH-----Hh Confidence 32221111 0001111111111222 255667888999999999866 35654443 334556666643 23 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) ++..-.+.|.|.. +|+...+..+.+++.++..++-..+ +...-+++++.|-|++++-||.+.. T Consensus 76 Lg~~~~~~Ve~~~-~de~~~~avq~~Lr~~~~~e~l~~~-~~~~~r~a~vlGDgvf~l~wDp~K~--------------- 138 (563) T protein:vir:74 76 LGVGFDYLVEPDM-GDEGIRQSLNAYFRTTFKREAIKAK-FTSNKRWGLIRGDAHFYIHADPNKK--------------- 138 (563) T ss_pred cCCCcEEecCccc-cCcchHHHHHHHHHHHHHHhhhHHH-HHHHHHhhhhhcceeEEEeeccccc--------------- Confidence 4555556677766 3555556678899998877655544 4557789999999999999985321 Q ss_pred cccccccccccceeecccceeeeccHHHhee-C-ccc---------------ccChhhCceEEEEeecCHHHHHHhcCCc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIYL-D-PTC---------------QDNLDNAQFVIHRYETDLSTLKKDGRYK 215 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-d-p~a---------------~~d~~d~~~~~~~~~~~~~~l~~~g~~~ 215 (663) ...++.+..|+|.-+|. + |+. ..+ ..+.++++...++. T Consensus 139 --------------~g~R~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd--~~~~~~r~~~~~~~--------- 193 (563) T protein:vir:74 139 --------------AGERISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDD--PSKKLARRRTFRRV--------- 193 (563) T ss_pred --------------cCCCceEeecCCceeeeccCCCCcccceeeecccCCCCCcc--hhccceeeeeeeee--------- Confidence 01122333344433321 1 100 000 01112222111100 Q ss_pred ChhhhhhccchhhhccccccccccccccccceEEEEEEEE-----EeeecCCceeEEEEEEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 216 NLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWG-----NYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~-----~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~ 290 (663) .++. ..+.. .-..-.|.|. ....+..-....-.-++...+.++...-|.+.+.+ T Consensus 194 -----------------lnde-g~~~~---~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~i 252 (563) T protein:vir:74 194 -----------------RNDE-GMFTG---RISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQL 252 (563) T ss_pred -----------------eCCC-CCccc---eeeeccchhccccccccCccchhhhcccchhhhhhhhchhhhccccccCc Confidence 0000 00000 0001122232 11111111111111233333333333335555789 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cC----cchhhhccCCcceEeCCC Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LD----QTNRKKFLAGANFEFNGT 365 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~----~~d~~~~~p~~vi~~~~~ 365 (663) ||++++-.|.+++.||.|-...+..+.+.+|...+-.--++..+.+|.+..+... ++ .....+..||.+|+.-.+ T Consensus 253 Piv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~ 332 (563) T protein:vir:74 253 PLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGN 332 (563) T ss_pred cEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCC Confidence 9999999999999999999999999999999999998888899999877765322 11 111223568999887544 Q ss_pred CC--ccccccC-ccccHHHHHHHHHH-HHHHHHHhCCChHHcC--CCcccchhHHHHHHHH-HHHHHHHHHH-HHHHHHH Q lcl|NC_021532. 366 AN--DFWHGSY-NAIPSSAFDMISLM-NNEIESITGTKSFSGG--INSGSLGSTATGARGA-LDATATRRMN-IVRNIAE 437 (663) Q Consensus 366 ~~--~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~tGi~~~~~G--~~~~~~~~tA~~i~~~-~~~~~~~l~~-~~~~~~~ 437 (663) .. -+..+.. +++ ..+..=+..+ ...+.+++|++....| -.+...|+.|=.++.- .-++..+-+. +..-+ + T Consensus 333 ~~~g~l~~v~g~~~l-~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~m-r 410 (563) T protein:vir:74 333 RNDNYFERVSGVQDV-SPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVM-D 410 (563) T ss_pred ccccceeeecchhhh-HHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHH-H Confidence 22 2222222 222 1122223333 3467899999999999 3455566666554431 1111111111 22223 3 Q ss_pred HHHHHHHHHHHHHHHHhcC--------------CceEEEEecCeeeccchhhcCCce----EEEE-----------eecc Q lcl|NC_021532. 438 NLVKPLMRKWMAYNAEFLE--------------EEEVIRVTNDKFVPIRKDDLSGRI----DIDI-----------SIST 488 (663) Q Consensus 438 ~~~~~l~~~~~~li~q~~~--------------~~~~iri~~~~~v~i~~~~~~~~~----d~~v-----------~~~~ 488 (663) .+....++++|.+++.-+- ....+.|+=+...++|......+. ..-| .++. T Consensus 411 ~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~ 490 (563) T protein:vir:74 411 QFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGW 490 (563) T ss_pred HHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCC Confidence 3455566777766665221 112233332234444443211000 0000 0000 Q ss_pred --cchhHH----HHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhh-----hhhhhh----hcchhhHHHHhhH Q lcl|NC_021532. 489 --AEDNAA----KSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQA-----KRMREY----EPKPDPVQEKIRQ 551 (663) Q Consensus 489 --~~~~~~----~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~-----~~l~~~----~~~~~~~~~q~~q 551 (663) .....+ .......++..-+..-+|... .++.-.++++.. .-+..+ .-.|+-.+-.... T Consensus 491 ~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~-----~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 491 EYPEVDDQGNALTDDDIADMLLAEAEADASLGL-----SAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred CCCcHHHHHhhcCHHHHHHHHHHHhhccCcccc-----eecccCCCCcccccccCCchhHcCCcccCCccccccCCCC Confidence 000000 011111111110000111111 011111111100 000000 0000100000000 No 110 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.60 E-value=1e-13 Score=91.86 Aligned_cols=436 Identities=11% Similarity=0.070 Sum_probs=189.8 Q ss_pred CCCc---HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc-----------cccCCC--ccccHHH Q lcl|NC_021532. 1 MKIN---KAELLSALK-------ADMKAADVLKQEQDSLISTWKAEYNGEPYGN-----------EQKGKS--AIVSRDI 57 (663) Q Consensus 1 ~~~~---~~~~~~~l~-------~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~-----------~~~g~s--~~~~~~i 57 (663) |-++ .++..+.+. ..++.....+.........+.+||.|++.-+ ...+++ +++.|.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~ 86 (474) T protein:vir:95 7 MPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFH 86 (474) T ss_pred cCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchH Confidence 3333 111222221 1122222223445566778889999875211 112222 4556666 Q ss_pred HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccc Q lcl|NC_021532. 58 KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDE 137 (663) Q Consensus 58 ~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~ 137 (663) ...|+..++++ ++..+. +. .+|++. .+.++.+++ ++....+..+.+++.++|.|++.+++|.. T Consensus 87 ~~Ivd~~~~~l----~g~p~~--~~---~~d~~~----~~~l~~~~~--n~~~~~~~e~~~~~~~~G~~~~~v~~d~~-- 149 (474) T protein:vir:95 87 QNLVDQKVSYV----ASKPVT--YS---CEDESV----LKIIHDVLD--TRWDNKLIDILTATSNKGIDWLQVYINEN-- 149 (474) T ss_pred HHHHHHHHhhh----ccCCce--ec---cCchHH----HHHHHHHHh--ccHHHHHHHHHHHHhhcCcEEEEEEecCC-- Confidence 66666655544 454433 22 234333 334455443 45666777889999999999999887521 Q ss_pred eecccccccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCc Q lcl|NC_021532. 138 EVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYK 215 (663) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~ 215 (663) +.+.+..++|.++| ||+...+ +..+++ +.+..... T Consensus 150 -------------------------------~~~~i~~~~p~~~~~v~d~~~~~---~~~~~i-~~~~~~~~-------- 186 (474) T protein:vir:95 150 -------------------------------GEMKLFRVPAEQAIPIWVDKERE---ELKSFI-RYYKFNNE-------- 186 (474) T ss_pred -------------------------------CceEEEEEcccceEEEEcCCCCC---ceEEEE-EEEEEcCe-------- Confidence 23556778888776 3433221 222222 22211000 Q ss_pred ChhhhhhccchhhhccccccccccccccccceEEEEEEEEE-----eeecCCceeEEEEEEEECCEEEecccCCCcCCCC Q lcl|NC_021532. 216 NLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKP 290 (663) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~ 290 (663) ..+++|.. +-..+.+. ... ...+.........|...|.+ T Consensus 187 ---------------------------------~~~~~y~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~g~i 230 (474) T protein:vir:95 187 ---------------------------------EKVEFWTDTTVTYYVLENGGL-IPD--YYYGANHIQSHFSNGNWGRV 230 (474) T ss_pred ---------------------------------eEEEEEeCCeEEEEEEcCCcc-ccc--cccCcccccccccccCCCcc Confidence 01112211 00011110 000 00010011111222334566 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-ch-hhhccCCcceEeCCCCCc Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ-TN-RKKFLAGANFEFNGTAND 368 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~-~d-~~~~~p~~vi~~~~~~~~ 368 (663) |++.+ .++..|.|.++.++++++.+|.+.+.+.+.+...+.|.+++.-...+. .+ ......++++.+.++++ T Consensus 231 Pvv~~-----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~- 304 (474) T protein:vir:95 231 PFIAF-----KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGG- 304 (474) T ss_pred ceEee-----cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCc- Confidence 66644 345678999999999999999999999999999999876653222221 11 12234456676655543 Q ss_pred cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) ..++..+.-...+...+..+...|...+++++.+.|..++..|+.| +..+............+.|. ..++.++ T Consensus 305 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A--lk~~~~~l~~k~~~k~~~~~-----~~l~~~~ 377 (474) T protein:vir:95 305 VETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIA--LKFLYGNLDLKANKLKNKAT-----VAIQELI 377 (474) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHH Confidence 4555544444566777899999999999999877664333334444 43333333333333333443 2334444 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~ 528 (663) .+|..+... .++.. ++.+..+-+.+....+..+ .+... +.++ ...+.. .+... T Consensus 378 ~li~~~~g~------------~~d~~----~i~v~f~~~~p~d~~e~a~----~~~~~-g~iS----~et~i~--~l~~v 430 (474) T protein:vir:95 378 GFIIDFNNL------------KMDVK----DIEISFNFNRMMNDAEQSQ----IIAQS-QYLS----RETLVK--SSPLV 430 (474) T ss_pred HHHHHHhCC------------Ccccc----eeeEEeccCCCcCHHHHHH----HHHhc-CCCc----hHHHHH--hCCCC Confidence 455554321 01111 1122222222222222222 22221 1111 111111 11111 Q ss_pred hhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 529 PEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTID 581 (663) Q Consensus 529 ~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~ 581 (663) .+....++.+..+......+ .............+.++.....-+ T Consensus 431 ~d~~~E~~ri~~E~~~~~~~---------~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 431 DDYKAELERIEQEQMEYNKQ---------LPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred CCHHHHHHHHHHHHHHHHhc---------ccccccccCCCCcCCCCCccCCCC Confidence 11111111111000000000 000000000000000000000000 No 111 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.60 E-value=5.9e-15 Score=98.58 Aligned_cols=452 Identities=13% Similarity=0.039 Sum_probs=186.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccC--------CCccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKG--------KSAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g--------~s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |-. ..+++..|...+.+ +.....++.+||+|++.- +..| .-+++.|-..-.|+...+.+. T Consensus 1 ~~t-~~d~i~~L~~~~~~-------~~~r~~~~~~Yy~G~~~i-~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~--- 68 (480) T protein:vir:78 1 MTT-YHEHVERLQGLLAR-------DLPNLLEAEAYRNGTRRL-KTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--- 68 (480) T ss_pred CCC-HHHHHHHHHHHHHH-------HHHHHHHHHHHHhccccc-hhcccccchhhhhhhhhcchHHHHHHHHHhhhc--- Confidence 433 33455555555433 334567778899997631 1111 112445555555555555441 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) +.+ | +.++|.+.. +.++.++. .|+.......++++++++|.|++.|+=.... . T Consensus 69 ~~g---~----~~~~d~~~~----~~l~~i~~-~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~----------~----- 121 (480) T protein:vir:78 69 IEG---F----RISEDSEGL----EELWNWWQ-ANDLDEESVLGHDDSLTFGRAYITVSHPDVE----------S----- 121 (480) T ss_pred cCc---e----ecCCCchhH----HHHHHHHH-hcCHHHHHHHHHHHHhhcCceEEEeecCccc----------c----- Confidence 111 1 123444333 33444453 4677677788999999999999887521000 0 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) ....+.+.+..++|.+++ |||...+ ...+.++ .+...++ T Consensus 122 ------------~d~~~~~~i~~~~p~~~~~i~D~~~~~---~~~~~i~-~~~~~d~----------------------- 162 (480) T protein:vir:78 122 ------------GDPAGIPLIRVESPLYMYAELDPRNTR---RVTRAVR-LYTTRDD----------------------- 162 (480) T ss_pred ------------CCCCCeeEEEEEcccceEEEEcCCCcc---ceEEEEE-EEEeecC----------------------- Confidence 011244667888998876 5554322 1222222 2211000 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECC----EEEecccCCCcCCCCCEEEEeeeeecCcccC Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIND----VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHG 306 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~----~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g 306 (663) . ..+..+++|.. +. .+.....++ .+......|...|.+|++.++..+..+..|| T Consensus 163 -------~-------~~~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G 220 (480) T protein:vir:78 163 -------V-------AVPDRATLYLP-----DE---TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) T ss_pred -------C-------cceEEEEEEeC-----Ce---EEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccC Confidence 0 00122333321 00 011111111 1111222334457899999999989999999 Q ss_pred CChHHH-HHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-Cc---ch---hhhccCCcceEeCCCCCccccccCcccc Q lcl|NC_021532. 307 EANAEM-IGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-DQ---TN---RKKFLAGANFEFNGTANDFWHGSYNAIP 378 (663) Q Consensus 307 ~g~~~~-~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~~---~d---~~~~~p~~vi~~~~~~~~~~~~~~~~~~ 378 (663) .|-+.. ++++++.+|+.++.+...+...++|...+ .|.- +. +. .....+|.++...+ .......++... T Consensus 221 ~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 297 (480) T protein:vir:78 221 RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGVTTDELTNDGENTTLDIYYGRILTLAS--EAAKISEFKAAE 297 (480) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh-hCCCccccccccccchhhhhhhhhccCCC--CCceEEecCccC Confidence 998874 89999999999999999999889887654 2321 11 10 11123344443332 223333333322 Q ss_pred -HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 379 -SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 379 -~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) ..+.+.+......+-.++++++...|..+.. +.++.++......-.......-+.|. .-++.++.++..+... T Consensus 298 ~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~-----~~l~~~~rl~~~~~~~ 371 (480) T protein:vir:78 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFG-----GAWERAMRIAMQIMGR 371 (480) T ss_pred HHHHHHHHHHHHHHHhcccCCCHHHhccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHcCC Confidence 2233444555555555788888888854321 11222233322222222222333332 2234444555554321 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhhhhhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQAKRMR 536 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~~~l~ 536 (663) .. . .++ .++.+...-.......+....+..+.+... +.+....+.. +.++. +..+.++ T Consensus 372 ~~------------~-~~~-~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~----~~~s~et~~~---~lg~~~d~~~e~~ 430 (480) T protein:vir:78 372 EV------------T-EEY-TRLETVWRDPSTPTVAAKADAVSKLYANGQ----GPIPKEQARI---DLGYTATQREQMR 430 (480) T ss_pred Cc------------c-ccc-eeeeEEecCCCCCCHHHHHHHHHHHHHhcc----cCCCHHHHHh---cCCCCHhHHHHHH Confidence 10 0 000 011222211111222222222222222111 1112222211 11211 1111111 Q ss_pred hhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--H--HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 537 EYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANEN--T--IDAELKRSKAAVEKAKAR 597 (663) Q Consensus 537 ~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~--~--~~~~~~~~~~~~e~~~~q 597 (663) +... ++.+.....+.+... ..+.++.. . ...+.+.+..+.=..+.+ T Consensus 431 ~~~~--------------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 431 DWDK--------------QETEDMIDTLYSTTK-AQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHH--------------HHHHHHHHHhhcccc-CCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 1000 000000000000000 00000000 0 000000000000000000 No 112 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.60 E-value=4.8e-15 Score=99.06 Aligned_cols=458 Identities=11% Similarity=0.004 Sum_probs=179.9 Q ss_pred CCC-----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc-ccCCC--------ccccHHHHHHHHHHHH Q lcl|NC_021532. 1 MKI-----NKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE-QKGKS--------AIVSRDIKKQSEWQHA 66 (663) Q Consensus 1 ~~~-----~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~-~~g~s--------~~~~~~i~~~v~~~~~ 66 (663) +++ +.+.++..+...+.. +.......+++.+||.|++.... .++.+ ..+.|-.+-.|+.+.+ T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~----~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~ 91 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRL----HISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQ 91 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHh Confidence 444 444444444444443 44455667888899999864211 11111 1233343444444443 Q ss_pred HHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccc Q lcl|NC_021532. 67 TIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAV 146 (663) Q Consensus 67 ~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~ 146 (663) .+ ++. +..-.|.+..+.+ ..++ ..|+.......++.+++++|.|++.|+.+.+ T Consensus 92 ~l---~~~--------gf~~~d~~~~~~l----~~i~-~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~----------- 144 (501) T protein:vir:25 92 NL---SVV--------GYRNALAKENDPA----WEMW-QRNRMDARQAEVHRPALTYGASYVTVTPTDE----------- 144 (501) T ss_pred hh---ccc--------ceecCCccchHHH----HHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCC----------- Confidence 32 112 1111222222222 3334 3466666677899999999999998876421 Q ss_pred ccCccccccccccccccceeecccceeeeccHHHhe--e-CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc Q lcl|NC_021532. 147 VVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--L-DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT 223 (663) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~-dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~ 223 (663) .+.+..++|.+++ | ||..... ..++++ .+....+ .+ T Consensus 145 -----------------------~~~i~~~sp~~~~~iy~D~~~~~~---~~~ai~-~~~~~~~-------~~------- 183 (501) T protein:vir:25 145 -----------------------GPVFRTRSPRQILAVYADPSVDAW---PQYALE-TWVAQKD-------AK------- 183 (501) T ss_pred -----------------------CCeEEEeccccEEEEEecCCCCcc---eeEEEE-EEeeccc-------cC------- Confidence 1345667887775 2 5543211 122222 2211100 00 Q ss_pred cchhhhccccccccccccccccceEEEE---EEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeee Q lcl|NC_021532. 224 SGEDFDYDSPDDTEFQFSDAPRKKLIIY---EYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSI 300 (663) Q Consensus 224 ~~~~~~~~~~~~~~~~~~d~~~~~v~v~---E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~ 300 (663) .......++. ..+..+ ..|......+............++.. .....|-+.+..||+.++-.+. T Consensus 184 ----------~~~~~~~y~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~vPiv~f~N~~~ 250 (501) T protein:vir:25 184 ----------PHRRGVLYDD--TYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVI-EHGATFEGKPVCPVVRFVNGRD 250 (501) T ss_pred ----------cceeEEEecC--eeEEEEecCceeeeecccccccccccccccccccc-ccccccCCccceeeEeccCccc Confidence 0000000000 000000 00000000000000111111111111 1111233346678877666555 Q ss_pred cCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCcchhhhccCCcceEeCCCCCccccccCcccc- Q lcl|NC_021532. 301 PFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQTNRKKFLAGANFEFNGTANDFWHGSYNAIP- 378 (663) Q Consensus 301 ~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~- 378 (663) . +.+|.|.++.++++++.+|+..+.+.......+.|...+ .|. .+..+.....++.++...++. ....+++... T Consensus 251 ~-~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i-~G~~~~~~~~~~~~~~~i~~~~~~~--~~~~q~~~~~~ 326 (501) T protein:vir:25 251 A-DDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVI-SGWTGSKAEVLKASALRVWTFEDPE--VKAQAFPPASV 326 (501) T ss_pred c-CccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHH-hCCCCCccchhhhcccceeccCCCC--ceEEEecccCh Confidence 4 346899999999999999999999999988888876443 333 223333455677777654322 2223333221 Q ss_pred HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021532. 379 SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEE 458 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~ 458 (663) ..+...+..+...+-..|++|+...|..++..|+.| +......-........+.|.+ . ++.++.|+..+.... T Consensus 327 ~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~A--l~~~~~~l~~ka~~k~~~f~~-~----l~~~~rl~~~~~~~~ 399 (501) T protein:vir:25 327 EPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEA--LAAAEANQQRKLAAKRESFGE-S----WEQLLRLAAEMDDDP 399 (501) T ss_pred HHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHH--HHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHHhCCC Confidence 223444555555666678899888885433233333 333222222223333334432 2 233444444443321 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEe--ecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhh-hhhhh Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDIS--ISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPE-QAKRM 535 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~--~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e-~~~~l 535 (663) .- . ..+++.+. -..+....+..+.+..+.+ ..++. ..+. ..+.++.. ..+.+ T Consensus 400 ~~------------~----~~~~i~v~w~~~~~~s~~~~ada~~kl~~---~gis~----et~~--~~~~g~~~~~ie~~ 454 (501) T protein:vir:25 400 DT------------A----ADSGAEVLWRDTEARSFGAVVDGITKLAS---AGIPI----EHLL--SMVPGMTQQTIQAI 454 (501) T ss_pred cc------------c----cceeeeEEecCCCCCCHHHHHHHHHHHHh---cCCCH----HHHH--HHcCCCCHHHHHHH Confidence 10 0 01122222 1112222222222222221 11221 1111 11122210 00000 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHHH-HHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASIND-KNARANENTIDAELKRS-KAAVE-KAKA 596 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~-~~a~~q~~~~~~~~~~~-~~~~e-~~~~ 596 (663) +. ...++....+...... ..........+...+.. +.+.. ..-+ T Consensus 455 ~~-----------------~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 455 KD-----------------SLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HH-----------------HHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 00 0000000000000000 00000000000000000 00000 0000 No 113 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.59 E-value=1.9e-14 Score=95.78 Aligned_cols=451 Identities=11% Similarity=0.050 Sum_probs=183.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccC--------CCccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKG--------KSAIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g--------~s~~~~~~i~~~v~~~~~~l~~~~ 72 (663) =.++.++++..|...+..- .....++.+||.|++.-. ..| +-.++.|-.+-.|+...+.+. T Consensus 9 ~~~~~~~~~~~l~~~~~~~-------~~rl~~l~~Yy~G~~~i~-~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~--- 77 (484) T protein:vir:77 9 ENVDPEKAREEMLNLFTER-------TQDLGDNTAYYESERRPD-AVGVTVPQQMQKLLAHVGYPRLYIDAIAARQE--- 77 (484) T ss_pred CCCCHHHHHHHHHHHHHHH-------HHHHHHHHHHHhccccch-hcccccchhHHhhhhhcCcHHHHHHHHHhhhc--- Confidence 3455666777777766542 233456778999976421 111 112234444445555554431 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) +++ | . .+++.+..+ .++.++ ..|+.......+.++++++|.|++.|+++..... T Consensus 78 ~~g---~--~--~~~~~~~~~----~l~~i~-~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~-------------- 131 (484) T protein:vir:77 78 LEG---F--R--LGGADKADE----QLWDWW-QANDLDIESTLGHTDSLVHGRSYITISKPDPNID-------------- 131 (484) T ss_pred cCc---e--e--cCCcchhHH----HHHHHH-HhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcc-------------- Confidence 111 1 1 233333322 334444 3567766778899999999999999987632210 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) .......|.+..++|.++| |||... ...+.+ +.+.+ + T Consensus 132 -----------~~~~~~~~~i~~~~p~~~~~~~D~~~~----~~~~a~-~~~~~--~----------------------- 170 (484) T protein:vir:77 132 -----------PGVDPEVPIIRVEPPTNLYAQIDPRTR----QVMRAI-RAIED--E----------------------- 170 (484) T ss_pred -----------cccccccceEEEeccceeEEEecCCCC----ceEEEE-EEEEe--e----------------------- Confidence 0112234556778888876 454311 111121 11111 0 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA 310 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~ 310 (663) . ...+..++.|.. +. .+...-.++........|-+.|.+|++.++..+..+.++|.|.+ T Consensus 171 ---~----------~~~~~~~~~y~~-----~~---~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i 229 (484) T protein:vir:77 171 ---E----------GNEVIGATLYLP-----NN---TVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEI 229 (484) T ss_pred ---c----------CCcEEEEEEEec-----Ce---EEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccc Confidence 0 000122222221 00 00001111111111222334578999999888889999999988 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc-C-----cc---hhhhccCCcceEeCCCCCccccccCccccHH Q lcl|NC_021532. 311 E-MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL-D-----QT---NRKKFLAGANFEFNGTANDFWHGSYNAIPSS 380 (663) Q Consensus 311 ~-~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i-~-----~~---d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~ 380 (663) . .++++++.+|+..+.+.......+.|...+- |.- + +. ......+|.++...++ .....+++..+ T Consensus 230 ~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~q~~~~~-- 304 (484) T protein:vir:77 230 TPELRSVTDAAARTLMLMQATAELMGVPQRLLF-GVKGEELGVDPETGQTLFDAYLARILAFEDH--ESKAQQFSAAE-- 304 (484) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh-CCCcchhcccccccchhhhhhhhhhcccCCC--CceeEeecCCC-- Confidence 6 5899999999999999999998888776542 221 1 00 0112234445443322 22223333221 Q ss_pred HHHHHHHHHHHHHHH---hCCChHHcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 381 AFDMISLMNNEIESI---TGTKSFSGGINSGS-LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 381 ~~~~~~~~~~~~~~~---tGi~~~~~G~~~~~-~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) ....+..++..+..+ +++++...|..+.. .|+.| +......-........+.|.+ -++.++.++..+.. T Consensus 305 ~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A--l~~~~~~l~~ka~~k~~~f~~-----~l~~~~~l~~~~~~ 377 (484) T protein:vir:77 305 LRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEA--IRSSESRLVKTVERKNKIFGG-----AWEQAMRVAYKVMN 377 (484) T ss_pred hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHH--HHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhC Confidence 233445555555554 67888888854322 22222 332222222222222333332 23344444444322 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhhhhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQAKRM 535 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~~~l 535 (663) .. .... ++. ++.+...-.......+....+..+.+. +. +.+....+... .++. +..+.+ T Consensus 378 ~~-----------~~~~-~~~-~i~v~w~~~~~~s~~~~ad~~~kl~~~-g~---gi~s~et~~~~---l~~~~~~~~e~ 437 (484) T protein:vir:77 378 GG-----------DIPP-EYY-RMESIWRDPSTPTYAAKADAATKLYNN-GQ---GVIPKERARID---MGYSITEREEM 437 (484) T ss_pred CC-----------Cccc-ccc-cceEEecCCCCCCHHHHHHHHHHHHhc-cC---CCCCHHHHHhc---CCCChhHHHHH Confidence 11 0000 000 112222111122222222222222111 11 11112222111 1111 000111 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSS 601 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~ 601 (663) +.+..+ +..+. ++.+.+..... .+...........+.+......+. + T Consensus 438 ~~~~~e-------------e~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~ 484 (484) T protein:vir:77 438 RKWDEE-------------EQAQG-LGLMGTMFGTD---PSGGGNPDNPETPEPQPNPAEEAA--A 484 (484) T ss_pred HHHHHH-------------HHHHH-HHHHhhhcccc---ccCCCCCCCCCcccccCCCccccC--C Confidence 000000 00000 00000000000 000000000000000000000000 0 No 114 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.59 E-value=6.4e-14 Score=92.91 Aligned_cols=451 Identities=10% Similarity=0.031 Sum_probs=185.3 Q ss_pred CCCcH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCcc----cc---CCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINK----AELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNE----QK---GKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~~~~----~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~----~~---g~s~~~~~~i~~~v~~~~~~l~ 69 (663) .-+++ ..++..|...|.. +......+.+||+|++.-+. .. .+-.++.|-.+-.|+.+.+.+. T Consensus 6 ~~~~e~~~~~~~~~~l~~~~~~-------~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~ 78 (486) T protein:vir:42 6 PGMEEIEDPAVVREEMISAFED-------ASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQA 78 (486) T ss_pred CCCCCcccHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhc Confidence 33332 3345555555433 33455667789999863211 00 0112234444445554444431 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) +.+ |. .+++....+.+ +.++. .|+.......+..+++++|.|++.|+.+.... T Consensus 79 ---~~g-----~~--~~~~~~~~~~~----~~i~~-~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~------------ 131 (486) T protein:vir:42 79 ---VEG-----FR--LGDADEADEEL----WQWWQ-ANNLDIEAPLGYTDAYVHGRSFITISKPDPQL------------ 131 (486) T ss_pred ---ccc-----ee--cCCCchhHHHH----HHHHH-hcChhHHHHHHHHHHhhcCceEEEEecCCccc------------ Confidence 111 11 22333333333 33343 46666667789999999999999987642110 Q ss_pred ccccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGED 227 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 227 (663) ......+.+.+..++|.+++ |||... ...+.+ +.+.+ + T Consensus 132 -------------~~~~~~~~~~i~~~~p~~~~~i~d~~~~----~~~~~~-~~~~~--~-------------------- 171 (486) T protein:vir:42 132 -------------DLGWDQNVPIIRVEPPTRMHAEIDPRIN----RVSKAI-RVAYD--K-------------------- 171 (486) T ss_pred -------------ccccCCCeeEEEEecccceEEEEeCCCC----CeEEEE-EEEEe--c-------------------- Confidence 00112344567778888876 554321 111111 11110 0 Q ss_pred hhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC Q lcl|NC_021532. 228 FDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE 307 (663) Q Consensus 228 ~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~ 307 (663) ....+..+++|.. + ..++.+..++......+.|...|.+|++.++..+..+..+|. T Consensus 172 ----------------~~~~~~~~~~y~~-----~---~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~ 227 (486) T protein:vir:42 172 ----------------EGNEIQAATLYTP-----M---ETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGT 227 (486) T ss_pred ----------------CCCeEEEEEEEcC-----C---cEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCc Confidence 0011233444421 1 011111122222112233445578999999988999999999 Q ss_pred ChHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcEEee---ccccCcch-----hhhccCCcceEeCCCCCccccccCcccc Q lcl|NC_021532. 308 ANAE-MIGDNQKVKTAVIRGIIDNMAQSNNGQVAIR---KGALDQTN-----RKKFLAGANFEFNGTANDFWHGSYNAIP 378 (663) Q Consensus 308 g~~~-~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~---~~~i~~~d-----~~~~~p~~vi~~~~~~~~~~~~~~~~~~ 378 (663) |-+. .++++++.+|+..+.+..+....+.|...+- ...+...+ .....+|.++....+ .....+++.. T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~q~~~~- 304 (486) T protein:vir:42 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFEDA--EGKIQQFSAA- 304 (486) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCCC--CceEEeeccc- Confidence 9987 5889999999999999999888888876542 11111111 112235555544322 2223333332 Q ss_pred HHHHHHHHHHHHHHHHH---hCCChHHcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 379 SSAFDMISLMNNEIESI---TGTKSFSGGINSGS-LGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEF 454 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~~---tGi~~~~~G~~~~~-~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~ 454 (663) .....++.++..+..+ +++++...|..+.. .|+.| +...............+.|.. .+ +.++.++..+ T Consensus 305 -~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A--l~~~~~~l~~ka~~~~~~f~~-~l----~~~~~l~~~~ 376 (486) T protein:vir:42 305 -ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEA--IRAAESRLIKKVERKNLMFGG-AW----EEAMRIAYRI 376 (486) T ss_pred -CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH--HHHHHHHHHHHHHHHHHHHHH-HH----HHHHHHHHHH Confidence 2334556666665555 77888887754322 12222 333322222222333334432 23 3334444443 Q ss_pred cCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhh-hhhh Q lcl|NC_021532. 455 LEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMP-EQAK 533 (663) Q Consensus 455 ~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~-e~~~ 533 (663) ..... +.. ++. ++.+...-.......+..+.+..+.+... ..+....+..+ .++. +..+ T Consensus 377 ~~~~~-----------~~~-d~~-~i~v~w~~~~~~s~~~~ad~~~kl~~~~~----g~~s~et~~~~---lg~~~d~~~ 436 (486) T protein:vir:42 377 MKGGD-----------VPP-DML-RMETVWRDPSTPTYAAKADAATKLYGNGQ----GVIPRERARID---MGYSVKERE 436 (486) T ss_pred hcCCC-----------ccc-cce-eeeEEecCCCCCCHHHHHHHHHHHHhccc----CCCCHHHHHhc---CCCChhHHH Confidence 32110 000 110 12222221222222222332222222111 11122222111 1111 1111 Q ss_pred hhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 534 RMREYEPKPDPVQEKIRQLELENLMLENQMLVAS---INDKNARANENTIDAELKRSKAAV 591 (663) Q Consensus 534 ~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~---~~~~~a~~q~~~~~~~~~~~~~~~ 591 (663) .++.+..+..... +.....+-.. ...+.+..+....+-...++.... T Consensus 437 e~~~~~~e~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 437 EMRRWDEEEAAMG-----------LGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred HHHHHHHHHHHHH-----------HHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 1111100000000 0000000000 000000000000000000000000 No 115 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.58 E-value=6e-13 Score=87.60 Aligned_cols=450 Identities=9% Similarity=0.012 Sum_probs=177.0 Q ss_pred CC-----CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc--cCCCc--------cccHHHHHHHHHHH Q lcl|NC_021532. 1 MK-----INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ--KGKSA--------IVSRDIKKQSEWQH 65 (663) Q Consensus 1 ~~-----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~--~g~s~--------~~~~~i~~~v~~~~ 65 (663) .+ |+.+++...|..++- ..+.......+++.+||.|++.-+.. +.+.. .+.|-.+-.|+.+. T Consensus 2 ~~~p~~~l~~~~~~~~~~~~l~---~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 78 (479) T protein:vir:99 2 IDLPDEDLSSEGLAKYLETKVF---PKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFA 78 (479) T ss_pred ccCCcccCChhHHHHHHHHHHH---HHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHH Confidence 34 455555554443221 22334556677888999998753221 11111 12233333334333 Q ss_pred HHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccc Q lcl|NC_021532. 66 ATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEA 145 (663) Q Consensus 66 ~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~ 145 (663) +.+ + +.+.+..|.+..+.+.. ++. .|+.......+..+++++|.|++.++..... T Consensus 79 ~~l----~-------~~gf~~~d~~~~~~~~~----i~~-~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~--------- 133 (479) T protein:vir:99 79 QQL----I-------VDGYRKTGTNENAKGWD----TWR-LNQMDKQQFWLNRAVLTFGYAFIKVTSGISP--------- 133 (479) T ss_pred hhc----c-------cccccCCCchhhHHHHH----HHH-hcChhHHHHHHHHHHhhcCceEEEEecCCCC--------- Confidence 322 1 22223334333443333 343 3555566678889999999999887642110 Q ss_pred cccCccccccccccccccceeecccceeeeccHHHhee--CcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhc Q lcl|NC_021532. 146 VVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYL--DPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKT 223 (663) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~--dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~ 223 (663) ....+.+.+.+++|.+++. |....+ . +..+.. + ++. T Consensus 134 -------------------~d~~g~~~i~~~~p~~~~~iydd~~~~---~--~~~~~~--~--------~~~-------- 171 (479) T protein:vir:99 134 -------------------LDGTTVARIKCIDPRDAFAIWEDPYWD---E--WPKYLL--E--------RQP-------- 171 (479) T ss_pred -------------------cCCCCceEEEEechhheEEEecCCccc---c--eeeEEE--e--------ecC-------- Confidence 0112345677788888753 221111 0 111000 0 000 Q ss_pred cchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 224 SGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 224 ~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) .. ...+|... ..+.....++........|-..|.+|++.+...+..+ T Consensus 172 ---------------------~~---~~~~~~~~--------~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~- 218 (479) T protein:vir:99 172 ---------------------NG---QYWWWTEE--------DYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLR- 218 (479) T ss_pred ---------------------ce---eEEEEecc--------eEEEEEecCCceeeccccccCCCCcceEEeecCCCcC- Confidence 00 00111110 0000011111111111223334788999888887764 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc------hhhhccCCcceEeCCCCCccccccCccc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT------NRKKFLAGANFEFNGTANDFWHGSYNAI 377 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~------d~~~~~p~~vi~~~~~~~~~~~~~~~~~ 377 (663) .+|.|.++.++++++.+|+..+.+...+...+.|...+- |....+ .......++++...++. .....++.. T Consensus 219 ~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~~~q~~~~ 295 (479) T protein:vir:99 219 GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT-GLMLPEGANADQEKMRFAQESMLISQNEK--ASFGAIPAA 295 (479) T ss_pred cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc-CCCcccccccchhccccccccceeecCCC--ceEEEeccc Confidence 479999999999999999999999999999999875542 322111 11122334555544332 233333322 Q ss_pred c-HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021532. 378 P-SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE 456 (663) Q Consensus 378 ~-~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~ 456 (663) . ..+...++.+...+-..||+++...|..+|. |+.| +...............+.|. .-++.++.++..+.. T Consensus 296 ~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~-Sg~A--l~~~~~~l~~ka~~~~~~f~-----~al~~~~~l~~~~~~ 367 (479) T protein:vir:99 296 PLDGLLNAYKESLLEFLALAQLPPHIAGQIVNV-AADA--LAAGTRQTMQKLFEKQATWK-----ASHNQTMRLVNKIEG 367 (479) T ss_pred chHHHHHHHHHHHHHHhccCCCCHHHcccccch-HHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHcC Confidence 1 2233444444555555678888988865442 3333 33322222222223333332 223344444444332 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhh-hhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPE-QAK 533 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e-~~~ 533 (663) ... +.+ .+++.+.=.. .....+..+.+..+.+. ..++. ..+... +.++.. ..+ T Consensus 368 ~~~------------~~~----~~~i~~~w~~~~~~s~~~~ad~~~kl~~a--g~is~----et~l~~--l~gv~~~~~e 423 (479) T protein:vir:99 368 RTE------------EAT----DLDFTITWQDVTIQSLAQFADAWAKMVES--LKIPA----EGVWDM--IPNLDQSTVN 423 (479) T ss_pred CCc------------ccc----ceeeeEEecCCCCCCHHHHHHHHHHHHhc--CCCCH----HHHHHh--cCCCCHHHHH Confidence 110 000 1122222111 11122222222222221 11211 111111 111110 001 Q ss_pred hhhhhhcchhhHHHHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 534 RMREYEPKPDPVQEKIRQLELENLMLENQM-LVASINDKNARANENTIDAELKRSKA 589 (663) Q Consensus 534 ~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~-~~a~~~~~~a~~q~~~~~~~~~~~~~ 589 (663) .++...............+......++... ........++.. ..-.-+++-+..+ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 479 (479) T protein:vir:99 424 GWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANN-KTGEPASLNKSGA 479 (479) T ss_pred HHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCC-CCcchhccCCCCC Confidence 111000000000000000000000000000 000000000000 0000111111111 No 116 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.56 E-value=5.5e-13 Score=87.78 Aligned_cols=431 Identities=16% Similarity=0.064 Sum_probs=180.8 Q ss_pred CCCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCc------cccCCC---ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKIN-KAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGN------EQKGKS---AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~------~~~g~s---~~~~~~i~~~v~~~~~~l~~ 70 (663) |.-. .++++..|.+.+.. .......+.+||.|++.-+ ....++ .++.|-.+..|+...+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhc- Confidence 3332 34466666665443 3345677788999876321 111111 1344555555565555442 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) . ++ |.....+|.+..+.+.++ +. .|+.......+..+++++|.|++.++-+. T Consensus 73 ---~-~g---~~~~~~~d~~~~~~~~~~----~~-~n~~d~~~~~~~~~a~~~G~a~~~~~~~e---------------- 124 (456) T protein:vir:79 73 ---P-NG---ITVGGSADSDLALRARRI----WR-DNRMDSVCKQWVKYGLDFGESYLTCWRRD---------------- 124 (456) T ss_pred ---c-CC---eecCCCCCccHHHHHHHH----HH-hcChhHHHHHHHHHHhhcCeeEEEEeeCC---------------- Confidence 2 22 222223344434433333 33 35666667789999999999998776432 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) .+.+.+..++|.+++ |||.... ...+.+ +.+.+.++ ...... T Consensus 125 -----------------dg~~~i~~~~p~~~~~i~d~~~~~---~~~~~~-~~~~~~d~--------~~~~~~------- 168 (456) T protein:vir:79 125 -----------------DGTATITADSPETMVVSVDPLQPW---RIRSAM-RWWRDLDA--------ESDFAI------- 168 (456) T ss_pred -----------------CCceEEEEeccceeEEEEcCCCCC---ceEEEE-EEEEecCC--------ceeEEE------- Confidence 133456778888775 4443321 111122 22211100 000000 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) .+.+ ...+.++.+|.... + ......+..++......+-|...+.+|++.+ .+..|.| T Consensus 169 ----------~~~~--~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~~~g 225 (456) T protein:vir:79 169 ----------VWSG--DGWQKFARPCFVQS-S----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMG 225 (456) T ss_pred ----------EEcC--CceEEEEEEEEeec-c----ccceeeeccCCceeecccccCCCCceeEEEe------cCCCCCc Confidence 0000 01112222221110 0 0111112222222222223334466777643 2467889 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-------------cCcchhhhccCCcceEeCCCCCccccccCc Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-------------LDQTNRKKFLAGANFEFNGTANDFWHGSYN 375 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-------------i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~ 375 (663) .++.++++++.+|+..+.++..+...+.|...+ .|. ++..+.....+|+++...++. .+..++.. T Consensus 226 d~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~-~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~-~~~q~~~~ 303 (456) T protein:vir:79 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGV-DIWESQTN 303 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHH-hcCCcccccccccccccchhhhhhhhccccccCCCCc-ceeeeccc Confidence 999999999999999998887777777765443 121 111112223456666554433 23333322 Q ss_pred cccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021532. 376 AIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFL 455 (663) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~ 455 (663) ++ ..+...+......+-..||+++...|...+..|+.| +......-.......-+.|. .-++.++.++..+. T Consensus 304 ~~-~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~A--l~~~~~~l~~k~~~~~~~f~-----~~l~~~~~l~~~~~ 375 (456) T protein:vir:79 304 DF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEG--AHNIEKGFLFKCEDRLSIAK-----IGLEAILVKALQIE 375 (456) T ss_pred Ch-HHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHH--HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhc Confidence 22 345556777777777889999999885433334443 33322222222222333343 23344455555543 Q ss_pred CCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh Q lcl|NC_021532. 456 EEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM 535 (663) Q Consensus 456 ~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l 535 (663) ..... ..+.+.-.-.......+..+.+..+ .+..++.. ... ..+.++.. T Consensus 376 g~~~~-----------------~~i~v~w~~~~~~s~~~~ada~~kl---~~~G~~~~---~~~---~~~lg~~~----- 424 (456) T protein:vir:79 376 GESVE-----------------DTVDVSFESPDRVTLGEKYSAASLA---KAAGESWA---SIR---RNILNYNA----- 424 (456) T ss_pred CCCcc-----------------ccceEEeCCCCCcCHHHHHHHHHHH---HhcCCChH---HHH---HhcCCCCH----- Confidence 32110 0112222111111122222222221 11112111 111 11111110 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTID 581 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~ 581 (663) + +..+.++++...+.. ..+....+ ..+..... T Consensus 425 -------~----~i~~~e~~r~~~e~~-~~~~~~~~--~~~~~~~~ 456 (456) T protein:vir:79 425 -------D----QIKQDDLDRAREQIT-LFAGNPVQ--RPQEDGSR 456 (456) T ss_pred -------H----HHHHHHHHHHHHHHH-HHhhhHhh--cCCCCCCC Confidence 0 000111111111111 00000000 00000000 No 117 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.55 E-value=9.8e-13 Score=86.41 Aligned_cols=428 Identities=11% Similarity=0.072 Sum_probs=182.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCC--------ccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKS--------AIVSRDIKKQSEWQHATIVDPF 72 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s--------~~~~~~i~~~v~~~~~~l~~~~ 72 (663) |.-++..++..|.+.|..-. .......+||.|++.- +.-|.. ..+.|-..-.|+++...+. T Consensus 12 l~~~~~~~~~~L~~~~~~~~-------~~~~~~~~Yy~G~~~~-~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~--- 80 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLR-------WKNLLRTSYYENKRTI-QYVGTLIPPQYFNLGLVLGWTGKAVDALARRCN--- 80 (474) T ss_pred CChhHHHHHHHHHHHHHHHh-------hHHHHHHHHhccCCCh-hhccccccHHHHHHHhhcChHHHHHHHHHhhhc--- Confidence 77777777777777766532 3356667899998642 211211 0122222222332222211 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) +++ |. .| +++... ..+..++ ..|+.......+..+++++|++++.|+.+.+ T Consensus 81 ~~G---f~-~~---d~~~~~----~~l~~iw-~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d----------------- 131 (474) T protein:vir:81 81 LEG---FV-WP---DGDLDS----LGGTEVV-DDNHLLSEIDSAIVAAMQHGPAFLINTVGED----------------- 131 (474) T ss_pred ccc---eE-CC---CCCccc----hHHHHHH-HhcChhHHHHHHHHHHHhhCceeEEEecCCC----------------- Confidence 111 21 12 211111 1133444 3566666677899999999999998875421 Q ss_pred cccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) ....|.+..++|.+++ |||... .+ .+.+.+...+ .+ + T Consensus 132 --------------~~~~~~i~~~sp~~~~~~~D~~~~-~~---~~al~~~~~~----------~~--------g----- 170 (474) T protein:vir:81 132 --------------DEPEALIHVKDASEATGEWNRRRR-GL---NNLLSIIDKD----------KE--------G----- 170 (474) T ss_pred --------------CCceeEEEEeccceEEEEEeCCCC-cc---eeeeEEEEEc----------CC--------C----- Confidence 1123556778888876 676422 11 1111111000 00 0 Q ss_pred cccccccccccccccceEEEEEEEEE-----eeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCccc Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLH 305 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~ 305 (663) ..+...+|.. +..+ ++...| .....++|+ | .|++.++..++-...+ T Consensus 171 ----------------~~~~~~ly~~~~~~~~~~~-~~~~~w---------~~~~~~~~~--g-vPvV~~~n~~~~~~~~ 221 (474) T protein:vir:81 171 ----------------KVLSLALYLDNETVTAQRD-KATLKW---------QVDRDEHVY--G-VPAQVLPYKPAPKRPF 221 (474) T ss_pred ----------------cEEEEEEEeCCcEEEEEEc-Ccccee---------eeccCCCCC--C-cceEEecccccccCcC Confidence 0011111210 0000 110111 112234554 5 5899999999888999 Q ss_pred CCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc----Cc-----chhhhccCCcceEeCCCCCc------- Q lcl|NC_021532. 306 GEANA-EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL----DQ-----TNRKKFLAGANFEFNGTAND------- 368 (663) Q Consensus 306 g~g~~-~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i----~~-----~d~~~~~p~~vi~~~~~~~~------- 368 (663) |.|-+ +.++++|+.+|+.+..++......+.|+..+- |.- .. .+......+.++.+..+.+. T Consensus 222 G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~ 300 (474) T protein:vir:81 222 GQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL-GADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLAR 300 (474) T ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-cCChhhcccccccccchhhhhHHHHhcCCCccccccccccc Confidence 98855 89999999999999999999999999886542 221 11 11222334556555433221 Q ss_pred cccccCcccc-HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIP-SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKW 447 (663) Q Consensus 369 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 447 (663) ....+++... ..+...+..+...+-..||++...+|..+...+.+|.++......-........+.|.. ..+.+++++ T Consensus 301 ~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~-~l~~~~rla 379 (474) T protein:vir:81 301 ADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTP-ALRKAFIRA 379 (474) T ss_pred ccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 1112222211 12333445555555667899999999543222234444544332333333334444543 334444444 Q ss_pred HHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHh Q lcl|NC_021532. 448 MAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDL 525 (663) Q Consensus 448 ~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l 525 (663) +.+.-.+--+ .+..+. +.+.+.=.. .....+....+..+.+. ++ +......+..+ T Consensus 380 ~~i~~~~~~~------------~~~~~~----~~~~v~W~d~~~~s~a~~aDa~~Kl~~a-~~---~~~~~~~~~~~--- 436 (474) T protein:vir:81 380 LAMKNKVAID------------EIPDEW----KSIDAKWRDPRYLSKSAQADAGMKQLAA-VP---WLAETEVGLEL--- 436 (474) T ss_pred HHHhCCCCcc------------ccchhh----ccceeEecCCCccCHHHHHHHHHHHHhc-cc---CCCcHHHHHhh--- Confidence 4332111000 000000 112221111 11112222222222221 11 11112222221 Q ss_pred hhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 MRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAE 583 (663) Q Consensus 526 ~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~ 583 (663) .++.. + +..+...+..+++...+ +....++... ...+| T Consensus 437 lg~t~------------~----~i~~~~~~~~~~~~~~~---~~~l~~~~~~-~~~aq 474 (474) T protein:vir:81 437 IGLTP------------Q----QARRAMADKRRVQGRGT---LQALIDRSNN-GATAQ 474 (474) T ss_pred cCCCH------------H----HHHHHHHHHHHHhHHHH---HHHHHhcCCC-CCCCC Confidence 12210 0 00000000000000000 0000000000 00000 No 118 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.54 E-value=6.8e-13 Score=87.27 Aligned_cols=460 Identities=12% Similarity=0.051 Sum_probs=196.5 Q ss_pred CCCcHHHHHHHHHHHHHHH-------------HHHHHHHHHH-----HHHHHHHhcCCcCCccccCCCccccHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAA-------------DVLKQEQDSL-----ISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSE 62 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~-------------~~~~~~~~~~-----~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~ 62 (663) |.+ .+-++..++.+ ..+.+-.... ...|..++|+...++.... ..+..|.- . T Consensus 1 ~~~-----~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~-~~~~~~l~----~ 70 (518) T protein:vir:78 1 MGV-----WSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHD-KLMNSGTG----N 70 (518) T ss_pred Ccc-----hhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccc-ccccCChH----H Confidence 221 11122221111 1111100001 1123334455443322111 11222221 1 Q ss_pred HHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 63 WQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 63 ~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .++.-+.+.+|+-.+.+.|.+-...|. +.+++.++.++. .+++...+.+.+.+++..|.++++++|+. T Consensus 71 ~i~~~~A~ll~~e~~~i~v~~~~~~d~---e~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-------- 138 (518) T protein:vir:78 71 EIVVVAAEYISGKPLSIDVTGVNGSKD---ENLTKQLKEALR-IDNFDSKSVKIVELAGGSGVSAVKINILN-------- 138 (518) T ss_pred HHHHHHHHhhcCCCceEEecCccccCc---HHHHHHHHHHHH-hccHHHHHHHHHHHhhccCceEEEEEEEC-------- Confidence 122223333455555666654333332 345666777664 46777778889999999999999999861 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAK 222 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 222 (663) +.+.++.|++..||+..+- +++-.|-|+ ... ...+ +..+|.-++. .+ T Consensus 139 --------------------------~~~~i~~v~ad~~~P~~~~-g~~~~~~f~--~~~-~~~~--k~~~y~~lE~-he 185 (518) T protein:vir:78 139 --------------------------GRPSISVHSSSQFWIDFKN-NEPFRFNFF--EEI-PTSN--KADIYYLVES-RE 185 (518) T ss_pred --------------------------CeeEEEEEcCCeeEEEeec-CcEEEEEEE--EEe-ecCC--cceeEEEEEe-ec Confidence 3356788889888875432 233333222 111 1110 0001110000 00 Q ss_pred ccchhhhccccccccccccccccceEEEEEEEEEeeecCCce-------eEEEEEEEECCEEEecccCCCc-CCCCCEEE Q lcl|NC_021532. 223 TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGI-------AEPIVCAWINDVIVRLQSNPYP-DGKPPFLV 294 (663) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~-------~~~~~~~~~g~~~l~~~~~p~~-~~~~Pf~~ 294 (663) ........+... .-...++.| +.+ .++++ .+....++.-..+.. +..+. ..+.||+. T Consensus 186 --~~~~~~~~~~~~---------~~~I~n~ly-~~~-~~~~v~~~~~~~~~~l~~~~~~~~~~e--~~~~~tg~~~~~~~ 250 (518) T protein:vir:78 186 --IKQWDKEGKKLS---------GGFVTYSVI-KID-GDKTTPISAERLPEQITSYLHTNDIQL--NHSVSIGLKSMGAY 250 (518) T ss_pred --cccccceeeccc---------ceeEEEEEe-eec-CcccccccccccccccccccccccCcc--ceeeccCCccceEE Confidence 000000000000 000011111 110 00000 000000000000000 00011 12456766 Q ss_pred Eeeee-----ecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhh-------hccCC--cce Q lcl|NC_021532. 295 VPFNS-----IPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRK-------KFLAG--ANF 360 (663) Q Consensus 295 ~~~~~-----~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~-------~~~p~--~vi 360 (663) +.+.+ .+++++|.|++.++++.++.+|...+++.+.+.. +.+++.++++.+...... .+..+ ... T Consensus 251 ~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~ 329 (518) T protein:vir:78 251 LINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFM 329 (518) T ss_pred eeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCccccccCCCCceEE Confidence 64443 3467889999999999999999999999999865 788899988877422110 11111 122 Q ss_pred EeCC----CCCc---cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 361 EFNG----TAND---FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVR 433 (663) Q Consensus 361 ~~~~----~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~ 433 (663) .++. +++. ++.+++.-........++.+...+....|++....|.++ + ..||+++....+..-..+..+.. T Consensus 330 ~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~-~-~~TATei~s~~~~~~~t~~~~~~ 407 (518) T protein:vir:78 330 QFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGN-R-EVKATEIWSLQDATVRKIEKKKR 407 (518) T ss_pred EecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccc-c-cccHHHHHHHHHHHHHHHHHHHH Confidence 2221 1121 333333322345677788888888889999999888643 2 46888888766665555566666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeec--ccchhHHHHHHHHHHHHHhccCCC Q lcl|NC_021532. 434 NIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISIS--TAEDNAAKSQELSFLLQTLGPNED 511 (663) Q Consensus 434 ~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~--~~~~~~~~~q~l~~~~~~~~~~~~ 511 (663) .+. ..++.+...++.+..-|+..... .......++.|.=+ ......+..+.+..+.+ ++.+. T Consensus 408 ~~e-~al~~l~~~i~~l~~~~~~~~~~-------------~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~--aGimS 471 (518) T protein:vir:78 408 LIQ-NVYEQMLWDFLYLLTGGTNNKEK-------------AIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS--ALAMS 471 (518) T ss_pred HHH-HHHHHHHHHHHHHHHhhcCcccc-------------ccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh--cCCCC Confidence 664 45666666666665544321100 00011223333322 22222333332222211 12222 Q ss_pred cchhHHHHHHHHHhhhhhhhhhhhhhhh---c-----chhhHHH-HhhHH Q lcl|NC_021532. 512 PKIRRDIMADIMDLMRMPEQAKRMREYE---P-----KPDPVQE-KIRQL 552 (663) Q Consensus 512 p~~~~~~l~~~~~l~~~~e~~~~l~~~~---~-----~~~~~~~-q~~q~ 552 (663) +......+. ..+.. .+..+.++.+. . .|+++.- ..++- T Consensus 472 ~e~~i~~~~--~~~~d-eea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 472 VEEKVKLIH--PKWED-EEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHHHhC--CCCCH-HHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 221111100 00000 11111111111 1 0110000 00000 No 119 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.54 E-value=4.7e-13 Score=88.19 Aligned_cols=469 Identities=12% Similarity=0.025 Sum_probs=201.2 Q ss_pred CCCcH---HHHHHHHHH----HHHHHH-----HHHHHHHHHHHHHHHHhcCCcCCccccC-------CCccccHHHHHHH Q lcl|NC_021532. 1 MKINK---AELLSALKA----DMKAAD-----VLKQEQDSLISTWKAEYNGEPYGNEQKG-------KSAIVSRDIKKQS 61 (663) Q Consensus 1 ~~~~~---~~~~~~l~~----~~~~~~-----~~~~~~~~~~~~~~~~y~~~~~~~~~~g-------~s~~~~~~i~~~v 61 (663) |.|=. .-|-..+.. .+.... ....++...+..|+.||.|+......++ +.....|. . T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl----~ 76 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPI----A 76 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecch----H Confidence 54421 111111111 111111 1145566678888999998755332211 11122222 2 Q ss_pred HHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 62 EWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 62 ~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) ..++..+.+.+|+-.+.+.+ +|+... ++++.++. .+++...+.+++.+++-.|.++++.+||. T Consensus 77 ~~i~~~~A~lv~~e~~~i~v-----~d~~~~----~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~------- 139 (522) T protein:vir:47 77 RTASKKIASLVYNEQATITT-----KNEILQ----KFLDDMLT-NDRFNKNFERYLESCLALGGLAMRPYIDG------- 139 (522) T ss_pred HHHHHHHhhhhcCCcceeec-----CChHHH----HHHHHHHh-hcchHHHHHHHHHHhhccCCEEEEEEEcC------- Confidence 22222233333443333333 344444 44555443 46777778889999999999999999972 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLA 221 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~ 221 (663) +.+++..|++..||+-..-...+..|-++.+..... .. ...||. ..+.. T Consensus 140 ---------------------------~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~-~~--~~~~yt-~lE~h 188 (522) T protein:vir:47 140 ---------------------------DKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSE-GR--KNVYYT-LVEFH 188 (522) T ss_pred ---------------------------CceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeec-cc--ceeEEE-EEEEe Confidence 235677788888875321111233333332222111 00 000111 00000 Q ss_pred h-ccchhhhccccccccccccccccceEEEEEEEEEee-ecCCceeEEEEEE--EEC---CEEEecccCCCcCCCCCEEE Q lcl|NC_021532. 222 K-TSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYD-VDGDGIAEPIVCA--WIN---DVIVRLQSNPYPDGKPPFLV 294 (663) Q Consensus 222 ~-~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~-~~~~g~~~~~~~~--~~g---~~~l~~~~~p~~~~~~Pf~~ 294 (663) + ..+.... .... .......|-..+++.. .+.-|.......+ |.+ .+.+. ...+++|++ T Consensus 189 e~~~~~~~~--------~~~~-~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~------~~~~Plf~y 253 (522) T protein:vir:47 189 EWVTADGQE--------TGST-NDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFE------NLSRPLFTY 253 (522) T ss_pred eeccccccc--------cccc-ccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeC------CCCcceEEE Confidence 0 0000000 0000 0000011111111111 0000111000000 000 01110 113445554 Q ss_pred Ee----eeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhh---------hccCCcce- Q lcl|NC_021532. 295 VP----FNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRK---------KFLAGANF- 360 (663) Q Consensus 295 ~~----~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~---------~~~p~~vi- 360 (663) +. -....++++|.|++.++++..+.+|..++++++-+..+-. ++.+++..+...... .+.++.-+ T Consensus 254 ~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f 332 (522) T protein:vir:47 254 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQR-RVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVY 332 (522) T ss_pred ecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-eeecchHHhccCCCCCCcccccccccCcccceE Confidence 32 2234578999999999999999999999999988875544 788877776432111 11112211 Q ss_pred -EeC---CCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 361 -EFN---GTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIA 436 (663) Q Consensus 361 -~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~ 436 (663) .++ +++..+..+++.-....+...+..+...+....|++.-..|..+.+ ..||+++....+..-.....+.+.+. T Consensus 333 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~ 411 (522) T protein:vir:47 333 MQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVE 411 (522) T ss_pred eecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1122344444333334566778888888888999998888876554 46899998766666666667777775 Q ss_pred HHHHHHHHHHHHHHHHHhc--CCceEEEEecCeeeccchhhcCCceEEEEeecc--cchhHHHHHHHHHHHHHhccCCCc Q lcl|NC_021532. 437 ENLVKPLMRKWMAYNAEFL--EEEEVIRVTNDKFVPIRKDDLSGRIDIDISIST--AEDNAAKSQELSFLLQTLGPNEDP 512 (663) Q Consensus 437 ~~~~~~l~~~~~~li~q~~--~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~--~~~~~~~~q~l~~~~~~~~~~~~p 512 (663) ..++.|...++.+..-+. ... . ....++.|+=+. .....+..+..+++.+ ++.+.+ T Consensus 412 -~al~~lv~~i~~l~~~~~~~~~~-------------~----~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~--aG~~s~ 471 (522) T protein:vir:47 412 -QSIKELCVSMCELGKAVGVYSGE-------------I----PELDDISVNLDDGVFTDRHAELDYWAKMVA--AGFSTK 471 (522) T ss_pred -HHHHHHHHHHHHHHhhhhhccCC-------------C----CCcceeEEEcCCCCCCCHHHHHHHHHHHHh--cCCCCH Confidence 456777777776664321 110 0 012223333222 2222222222222211 122222 Q ss_pred chhHHHHHHHHHhhhhh--hhhhhhhhhhcc---hhhHHHHhhHHHHHHHHHHHHHH Q lcl|NC_021532. 513 KIRRDIMADIMDLMRMP--EQAKRMREYEPK---PDPVQEKIRQLELENLMLENQML 564 (663) Q Consensus 513 ~~~~~~l~~~~~l~~~~--e~~~~l~~~~~~---~~~~~~q~~q~~~~~~q~~~~~~ 564 (663) ..... .+-++. +..+.+..+..+ ..+...-.-.+.-++.+.--++- T Consensus 472 e~~i~------~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 472 KRAIG------KTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHHH------hcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 21111 111111 111111111111 00000000000000000000000 No 120 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.52 E-value=3.2e-12 Score=83.60 Aligned_cols=431 Identities=15% Similarity=0.060 Sum_probs=177.4 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----cc--ccCCC---ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NE--QKGKS---AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~--~~g~s---~~~~~~i~~~v~~~~~~l~~ 70 (663) |-- ..++++..|...+.. +......+.+||.|++.- .+ ...++ +++.|-..-.|+...+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 322 244566666555433 345567788899998631 11 11112 2455666666666666543 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++++ .+ + ..+|.+..+.+.+ ++. .|+.......+..+++++|.+++.++-+. T Consensus 73 ----~~~~-~~-~-~~~d~~~~~~~~~----i~~-~N~~d~~~~~~~~~a~i~G~ay~~v~~d~---------------- 124 (456) T protein:vir:10 73 ----PNGI-TV-G-GSADSDLALRARR----IWR-DNRMDSVCKQWVKYGLDFGESYLTCWRRD---------------- 124 (456) T ss_pred ----cCCe-ec-C-CCCCcchHHHHHH----HHH-hcChhhHHHHHHHHHhhcCeeEEEEeeCC---------------- Confidence 1222 11 1 2233333333333 333 45666667788999999999998776431 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) .+.+.+..++|.+++ +||..... ..++++ ++.+.+ .. .. T Consensus 125 -----------------~g~~~i~~~~p~~~~~i~d~~~~~~---~~~~i~-~~~~~d--------~~----~~------ 165 (456) T protein:vir:10 125 -----------------DGTATITADSPETMVVSVDPLQPWR---IRAAMR-WWRDLD--------AE----SD------ 165 (456) T ss_pred -----------------CCceEEEEEccceeEEEEcCCCCcc---eEEEEE-EEEecC--------Cc----ee------ Confidence 134566778888865 45533221 111221 221100 00 00 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) ...-+.+ ...+..+..+..... ......++.++........|...+.+|++++ .+..|.| T Consensus 166 -------~~~~~~~--~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~g 225 (456) T protein:vir:10 166 -------FAIVWSG--DGWQKFARPCFVQSS-----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMG 225 (456) T ss_pred -------EEEEEec--cceeEEEEEEEEeec-----ccceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCc Confidence 0000000 000111111110000 0111222233333222233333355666533 2457899 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc------cC-------cchhhhccCCcceEeCCCCCccccccCc Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA------LD-------QTNRKKFLAGANFEFNGTANDFWHGSYN 375 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~------i~-------~~d~~~~~p~~vi~~~~~~~~~~~~~~~ 375 (663) .++.++++++.+|+..+.++..+...+.|...+ .|. ++ ..+.....+|.++...++++ +..++.. T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i-~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~-~~q~~~~ 303 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD-IWESQAN 303 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhh-hccCcccccccccccccchhhhhhhhccccccCCCCcc-eEEeccc Confidence 999999999999999998887777777665433 111 11 11112234555655544332 2233222 Q ss_pred cccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021532. 376 AIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFL 455 (663) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~ 455 (663) .+ ..+...+..+...+-.+||+|+...|..++..|+.| +......-........+.|.+ +++.+++ ++.... T Consensus 304 ~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~A--i~~~~~~l~~k~~~~~~~f~~-~l~~~~r----l~~~~~ 375 (456) T protein:vir:10 304 DF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEG--AHNIEKGFLFKCEDRLSIAKI-GLEAILV----KALQIE 375 (456) T ss_pred Ch-hHHHHHHHHHHHHHHhccCCChHHhcccccChHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHH----HHHHhc Confidence 22 334455666666666778999998885433234443 433322323333333344433 2333333 433322 Q ss_pred CCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh Q lcl|NC_021532. 456 EEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM 535 (663) Q Consensus 456 ~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l 535 (663) .... . ..+++...-.......+..+.+..+.+ ..+++ ...+. ++.++.. T Consensus 376 g~~~---------------~--~~~~v~w~~~~~~~~~~~ada~~kl~~---~gi~~---~~~~~---~~lg~~~----- 424 (456) T protein:vir:10 376 GESV---------------E--DTVDVSFESPDRVTLGEKYSAASLAKA---AGESW---ASIRR---NILNYNA----- 424 (456) T ss_pred CCCc---------------c--cceeEEecCCCCcCHHHHHHHHHHHHH---cCCCh---HHHHH---hhCCCCH----- Confidence 2110 0 112222221112222222222222211 11211 11111 1111110 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAEL 584 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~ 584 (663) +. ..+.++++...+.. ..+....+.. +.++-. T Consensus 425 -------~~----i~~~e~er~~~e~~-~~~~~~~~~~-----~~~~~~ 456 (456) T protein:vir:10 425 -------DQ----IKQDDLDRAREQIT-LFAGNPVQRP-----QEDGSR 456 (456) T ss_pred -------HH----HHHHHHHHHHHHHH-HHhhhhhhcC-----CCCCCC Confidence 00 00000000000000 0000000000 000000 No 121 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.52 E-value=3.2e-12 Score=83.60 Aligned_cols=431 Identities=15% Similarity=0.060 Sum_probs=177.4 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC----cc--ccCCC---ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG----NE--QKGKS---AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~----~~--~~g~s---~~~~~~i~~~v~~~~~~l~~ 70 (663) |-- ..++++..|...+.. +......+.+||.|++.- .+ ...++ +++.|-..-.|+...+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII- 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc- Confidence 322 244566666555433 345567788899998631 11 11112 2455666666666666543 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++++ .+ + ..+|.+..+.+.+ ++. .|+.......+..+++++|.+++.++-+. T Consensus 73 ----~~~~-~~-~-~~~d~~~~~~~~~----i~~-~N~~d~~~~~~~~~a~i~G~ay~~v~~d~---------------- 124 (456) T protein:vir:10 73 ----PNGI-TV-G-GSADSDLALRARR----IWR-DNRMDSVCKQWVKYGLDFGESYLTCWRRD---------------- 124 (456) T ss_pred ----cCCe-ec-C-CCCCcchHHHHHH----HHH-hcChhhHHHHHHHHHhhcCeeEEEEeeCC---------------- Confidence 1222 11 1 2233333333333 333 45666667788999999999998776431 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) .+.+.+..++|.+++ +||..... ..++++ ++.+.+ .. .. T Consensus 125 -----------------~g~~~i~~~~p~~~~~i~d~~~~~~---~~~~i~-~~~~~d--------~~----~~------ 165 (456) T protein:vir:10 125 -----------------DGTATITADSPETMVVSVDPLQPWR---IRAAMR-WWRDLD--------AE----SD------ 165 (456) T ss_pred -----------------CCceEEEEEccceeEEEEcCCCCcc---eEEEEE-EEEecC--------Cc----ee------ Confidence 134566778888865 45533221 111221 221100 00 00 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) ...-+.+ ...+..+..+..... ......++.++........|...+.+|++++ .+..|.| T Consensus 166 -------~~~~~~~--~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~g 225 (456) T protein:vir:10 166 -------FAIVWSG--DGWQKFARPCFVQSS-----SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMG 225 (456) T ss_pred -------EEEEEec--cceeEEEEEEEEeec-----ccceeeeecCCceeeccccCCCCCceeEEEe------cCCCCCc Confidence 0000000 000111111110000 0111222233333222233333355666533 2457899 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc------cC-------cchhhhccCCcceEeCCCCCccccccCc Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA------LD-------QTNRKKFLAGANFEFNGTANDFWHGSYN 375 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~------i~-------~~d~~~~~p~~vi~~~~~~~~~~~~~~~ 375 (663) .++.++++++.+|+..+.++..+...+.|...+ .|. ++ ..+.....+|.++...++++ +..++.. T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i-~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~-~~q~~~~ 303 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD-IWESQAN 303 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhh-hccCcccccccccccccchhhhhhhhccccccCCCCcc-eEEeccc Confidence 999999999999999998887777777665433 111 11 11112234555655544332 2233222 Q ss_pred cccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021532. 376 AIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFL 455 (663) Q Consensus 376 ~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~ 455 (663) .+ ..+...+..+...+-.+||+|+...|..++..|+.| +......-........+.|.+ +++.+++ ++.... T Consensus 304 ~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~A--i~~~~~~l~~k~~~~~~~f~~-~l~~~~r----l~~~~~ 375 (456) T protein:vir:10 304 DF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEG--AHNIEKGFLFKCEDRLSIAKI-GLEAILV----KALQIE 375 (456) T ss_pred Ch-hHHHHHHHHHHHHHHhccCCChHHhcccccChHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHH----HHHHhc Confidence 22 334455666666666778999998885433234443 433322323333333344433 2333333 433322 Q ss_pred CCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh Q lcl|NC_021532. 456 EEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM 535 (663) Q Consensus 456 ~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l 535 (663) .... . ..+++...-.......+..+.+..+.+ ..+++ ...+. ++.++.. T Consensus 376 g~~~---------------~--~~~~v~w~~~~~~~~~~~ada~~kl~~---~gi~~---~~~~~---~~lg~~~----- 424 (456) T protein:vir:10 376 GESV---------------E--DTVDVSFESPDRVTLGEKYSAASLAKA---AGESW---ASIRR---NILNYNA----- 424 (456) T ss_pred CCCc---------------c--cceeEEecCCCCcCHHHHHHHHHHHHH---cCCCh---HHHHH---hhCCCCH----- Confidence 2110 0 112222221112222222222222211 11211 11111 1111110 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAEL 584 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~ 584 (663) +. ..+.++++...+.. ..+....+.. +.++-. T Consensus 425 -------~~----i~~~e~er~~~e~~-~~~~~~~~~~-----~~~~~~ 456 (456) T protein:vir:10 425 -------DQ----IKQDDLDRAREQIT-LFAGNPVQRP-----QEDGSR 456 (456) T ss_pred -------HH----HHHHHHHHHHHHHH-HHhhhhhhcC-----CCCCCC Confidence 00 00000000000000 0000000000 000000 No 122 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.51 E-value=9.8e-13 Score=86.41 Aligned_cols=440 Identities=10% Similarity=0.007 Sum_probs=176.6 Q ss_pred CCCcHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCC--------ccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAE--LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKS--------AIVSRDIKKQSEWQHATIVD 70 (663) Q Consensus 1 ~~~~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s--------~~~~~~i~~~v~~~~~~l~~ 70 (663) -.|++++ .+..|...+... .....+..+||.|++.- +..|.+ ..+.|-.+-.|+.+...+ T Consensus 16 ~~l~~~e~~~i~~L~~~~~~~-------~~r~~~l~~YY~G~~~i-~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl-- 85 (504) T protein:vir:99 16 PELNDDVVDKVNGLYQQLVDR-------TPRNLLRASFYDGKYAI-RQIGNLIPPEYLRTATVLGWSAKAVDTLARRC-- 85 (504) T ss_pred CCCCHHHHHHHHHHHHHHHHH-------hHHHHHHHHHHhccccc-hhccccccHHHHHHhhccCcHHHHHHHHHhhh-- Confidence 3344444 456665555442 23456667899998642 222211 112222222233322221 Q ss_pred hhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 71 PFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 71 ~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) ++++ | . .+++.+..+ .+..++. .|+.......+..+++++|.|++.|+-+.+ T Consensus 86 -~~~G---f--~--~~d~~~~~~----~l~~i~~-~N~ld~~~~~~~~~a~iyG~af~~v~~~~d--------------- 137 (504) T protein:vir:99 86 -NLES---F--V--WPDGDYGSI----GGPDVWD-ENFFATKANNAMVSSLIHGPAFLINTEGGA--------------- 137 (504) T ss_pred -ccce---e--e--CCCCChhhH----HHHHHHH-hcChhhHHHHHHHHHHhhCceeEEEecCCC--------------- Confidence 1111 1 1 123332222 2344443 466666677899999999999988853211 Q ss_pred cccccccccccccceeecccceeeeccHHHhe--eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) ....+.+..++|.++| |||... ...+.+++...+ . T Consensus 138 ----------------~~~~~~I~~~sP~~~~~iyD~~~~----~~~~a~~~~~~d----------~------------- 174 (504) T protein:vir:99 138 ----------------GEPDSLIHVKSAMQATGEWNSRRN----AMDSLLSITSRD----------A------------- 174 (504) T ss_pred ----------------CCceeEEEEeccceeEEEEeCCCC----ceeEEEEEEEec----------C------------- Confidence 0123456778888875 666422 111111111000 0 Q ss_pred hccccccccccccccccceEEEEEEEEE-----eeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGN-----YDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~-----~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) +. .....++|.. +..+++|. ......++|+ | .|++.++..+..++ T Consensus 175 ------~g----------~~~~~~~y~~~~~~~~~~~~~~~-----------~~~~~~~~~~--g-vPvV~~~n~~~~~~ 224 (504) T protein:vir:99 175 ------EG----------HPTGIALYEDGVTVTADMDDDGD-----------WHADVRTHKL--G-VPVEVLPYKPREDR 224 (504) T ss_pred ------CC----------eEEEEEEEcCCcEEEEEEcCCce-----------eeeccccCCC--C-cceEEecccccCcc Confidence 00 0111222211 00111111 0112224444 5 69999888888889 Q ss_pred ccCCChH-HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecccc----C-----cchhhhccCCcceEeCCCCCc----- Q lcl|NC_021532. 304 LHGEANA-EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGAL----D-----QTNRKKFLAGANFEFNGTAND----- 368 (663) Q Consensus 304 ~~g~g~~-~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i----~-----~~d~~~~~p~~vi~~~~~~~~----- 368 (663) ++|.|-+ +.++++++.+|+.++.++......+.|+..+ -|.- . .........++++.+....+. T Consensus 225 ~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i-~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 303 (504) T protein:vir:99 225 PLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLIL-LGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAAR 303 (504) T ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-ccCCccccccccccccchhhhhhhhhhcCCCccccccccC Confidence 9998855 6899999999999999998898888887544 1221 1 111222344556655433221 Q ss_pred --cccccCcccc-HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 --FWHGSYNAIP-SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMR 445 (663) Q Consensus 369 --~~~~~~~~~~-~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~ 445 (663) ....+++.-. ..+...+..+...+-.+||+|+..+|..++..+.+|.++......-........+.|.. ..+.+++ T Consensus 304 ~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~-~l~~~~r 382 (504) T protein:vir:99 304 ARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSP-AFRRSMI 382 (504) T ss_pred ccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 2222222221 12233344444445555999999999655432334444544333333333444445543 3444555 Q ss_pred HHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHh Q lcl|NC_021532. 446 KWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDL 525 (663) Q Consensus 446 ~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l 525 (663) +++.+....-..+ .++. +..+.-.-.......+....+..+.+. ...+......+...+.+ T Consensus 383 la~~~~~~~~~~~-------~~~~---------~~~v~w~d~~~~s~a~~aDa~~Kl~~a---g~~l~~~~~~l~~~lg~ 443 (504) T protein:vir:99 383 RALAIKNGLDRIP-------PEWK---------TIDSKFRSPLYLSKAAQADAGAKMLGA---GPEWLKETEVGLELLGL 443 (504) T ss_pred HHHHHhcCCCccc-------cccc---------cceeEecCCCccCHHHHHHHHHHHHhh---ccccccchHHHHhhcCC Confidence 5444332211000 0000 011111101111111222222222111 11011111111111100 Q ss_pred h-----------hhh---hhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 M-----------RMP---EQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAA 590 (663) Q Consensus 526 ~-----------~~~---e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~ 590 (663) . +.. .....+......+........+...+.+. . .......+=.+. . T Consensus 444 ~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~---~-------------~~~~~~~~p~~~--~ 504 (504) T protein:vir:99 444 TPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPA---N-------------EPPAALGRPTLV--G 504 (504) T ss_pred CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCC---C-------------CCCccCCCcccC--C Confidence 0 000 00000000000000000000000000000 0 000000000000 0 No 123 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.18 E-value=2.7e-10 Score=73.04 Aligned_cols=407 Identities=10% Similarity=-0.002 Sum_probs=161.5 Q ss_pred Hhc-CCcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHH Q lcl|NC_021532. 36 EYN-GEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMS 114 (663) Q Consensus 36 ~y~-~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~ 114 (663) |.- +.....+ ..+-..+.|-.+-.|+.+.+.+. +.+ |. -.|.+.-+.+ ..++. .|+...... T Consensus 1 ~l~~~~~~~~~-~~~~~~v~n~~~~ivd~~~~~l~---~~g---f~-----~~d~~~~~~~----~~i~~-~N~~d~~~~ 63 (434) T protein:vir:98 1 MLPKNAEQAFL-DFQRKARTNFCGLIANASVHRLL---ALG---VT-----GPDGEPDTRA----SRWWQ-ANRLDSRQK 63 (434) T ss_pred CCCCCccHHHH-HhhhhhhccchHHHHHHHHhhhc---cCc---ee-----cCCCchHHHH----HHHHH-hcChhHHHH Confidence 211 1000000 01112345666666666665442 221 22 1222222222 33343 466666778 Q ss_pred HHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHhe--eCcccccChh Q lcl|NC_021532. 115 KAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY--LDPTCQDNLD 192 (663) Q Consensus 115 ~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~--~dp~a~~d~~ 192 (663) .+..+++++|.|++.|+.+..... ......|.+..++|..++ |||... T Consensus 64 ~~~~~a~i~G~ay~~v~~~~~~~~--------------------------~~~~~~~~I~~~~p~~~~~i~D~~~~---- 113 (434) T protein:vir:98 64 LVWRMAMAQSAGYMLVGAHPTRTE--------------------------DNGRPSPLITMEHPSECIVEYDPETG---- 113 (434) T ss_pred HHHHHHhhcCceEEEEecCCCccc--------------------------ccCCceeEEEEeccceeEEEEeCCCC---- Confidence 899999999999999976532110 011234567778888865 554322 Q ss_pred hCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccccccccceEEEE--EEEEEeeecCCceeEEEEE Q lcl|NC_021532. 193 NAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIY--EYWGNYDVDGDGIAEPIVC 270 (663) Q Consensus 193 d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~--E~w~~~~~~~~g~~~~~~~ 270 (663) ...+.+++...+.+. . ....+.++ ++++.......+...+--. T Consensus 114 ~~~~ai~~~~~~~~~------------------~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (434) T protein:vir:98 114 EPLVGLKVWHNDIDG------------------F-----------------GYARVFFDDTSFPYRTRERTGARLPWGPD 158 (434) T ss_pred ceEEEEEEEEeccCC------------------c-----------------eEEEEEEeCcEEEEEEeeccccccccccc Confidence 222333322111000 0 00001110 0111000000000000000 Q ss_pred EEE-CCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc-cCc Q lcl|NC_021532. 271 AWI-NDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA-LDQ 348 (663) Q Consensus 271 ~~~-g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~-i~~ 348 (663) .+. ....-...++ +.|..|++.+.-.+..+. +|.|.++.++++++.+|+.++.+.......+.|+..+- |. ... T Consensus 159 ~~~~~~~~~~~~~h--~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~ 234 (434) T protein:vir:98 159 SWVYTGTADSGDVH--DLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK-GHKFAK 234 (434) T ss_pred cceecccccccccC--CCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccc Confidence 011 1111111223 347789988877776655 69999999999999999999999999999988875542 21 110 Q ss_pred -c----------hhhhccCCcceEeCCCCCccccccCccc-cHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHH Q lcl|NC_021532. 349 -T----------NRKKFLAGANFEFNGTANDFWHGSYNAI-PSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATG 416 (663) Q Consensus 349 -~----------d~~~~~p~~vi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~ 416 (663) . ......+++++...+ ......+++.. ...+...+......+-.+|++++...|...+..|+.| T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~A-- 310 (434) T protein:vir:98 235 RTDPATGMTVVDQPFVPSPSAVWASEG--ENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADT-- 310 (434) T ss_pred ccccccccchhhhhhhccccccccCCC--CCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHH-- Confidence 0 011123444443322 12223333221 1223344444555555668888888885322223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHH Q lcl|NC_021532. 417 ARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKS 496 (663) Q Consensus 417 i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~ 496 (663) +......-........+.|.+ +++.+++ ++.... |. ..+. .++.+..+-.......+.. T Consensus 311 l~~~~~~l~~k~~~k~~~f~~-~l~~~~r----l~~~~~---------g~-----~~~~--~~~~v~w~~~~~~s~~~~a 369 (434) T protein:vir:98 311 IGALDILHVAKVREHIASFSE-GLESVLA----LAAAQA---------GV-----PEDY--TEAEVRWANPAHVTMAVKA 369 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHH----HHHHhc---------CC-----Chhh--eeeeEEecCCCCCCHHHHH Confidence 333322223333333444433 3333444 433321 11 0000 0122222222222223333 Q ss_pred HHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_021532. 497 QELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKN-ARA 575 (663) Q Consensus 497 q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~-a~~ 575 (663) +.+..+.+. .++. ..+.. +.++.. ..++ +.+.+. ..+.+.+.....+ .+. T Consensus 370 da~~kl~~~---g~~~----e~~~~---~lg~~~--~e~~--------------r~~~e~---~~~~~~~~~~~~~~~~~ 420 (434) T protein:vir:98 370 DAATKLKSI---GYPL----DVIAE---ELDESP--ARVR--------------RIVAGA---ASQALLAASLLPAPGAP 420 (434) T ss_pred HHHHHHHhc---CCcH----HHHHH---hCCCCH--HHHH--------------HHHHHH---HHHHHHHHhhhccCCCC Confidence 333332221 1221 11111 111110 0000 000000 0000000000000 000 Q ss_pred -------HHHHHHH Q lcl|NC_021532. 576 -------NENTIDA 582 (663) Q Consensus 576 -------q~~~~~~ 582 (663) .....+- T Consensus 421 ~~g~~~~~~~~~dg 434 (434) T protein:vir:98 421 SAGNVPDSGGAVDG 434 (434) T ss_pred CCCCCCcccCCCCC Confidence 0000011 No 124 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.24 E-value=2.1e-06 Score=51.67 Aligned_cols=432 Identities=13% Similarity=0.096 Sum_probs=184.6 Q ss_pred CCCc-HHHHHHHHHHHHHHHHHHHHHH--HHH-HHHHHHHhcCCcCC-ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 1 MKIN-KAELLSALKADMKAADVLKQEQ--DSL-ISTWKAEYNGEPYG-NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 1 ~~~~-~~~~~~~l~~~~~~~~~~~~~~--~~~-~~~~~~~y~~~~~~-~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |..+ ...........|+-.......- .+. -..|.-.+.++... -+.+=...+++|.++.+++.++|.+. .. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf----~k 76 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVL----DQ 76 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhh----cC Confidence 7755 3444555555555544332211 000 01111111111110 01111235678999999999988764 33 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ++.+++ | . .+..+... -..++....+.+++..++.+|.|++-|.|... T Consensus 77 ~p~~~~-p-----~----~l~~~~~D--~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~-------------------- 124 (452) T protein:vir:94 77 PPVITH-P-----D----AMSKYFED--QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT-------------------- 124 (452) T ss_pred Cceecc-c-----H----HHHHHHhc--ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC-------------------- Confidence 333322 1 1 22222111 12344455677899999999999998866411 Q ss_pred ccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccc Q lcl|NC_021532. 156 VVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDD 235 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 235 (663) ..+|++..++|.++. +.+... ...-.++..+.... ..+ T Consensus 125 ------------g~rPy~~~~~~~~Ii-~W~~~~-~g~l~~v~lre~~~----------------------------~~d 162 (452) T protein:vir:94 125 ------------GGDPYISVYTTENIL-NWEEDE-DGRLLMVVLREFYT----------------------------VRD 162 (452) T ss_pred ------------CCceEEEEechhhhc-Cccccc-cCCeeEEEEEEEEE----------------------------Eec Confidence 135788888888885 332111 11111111111000 000 Q ss_pred ccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECC--------EEEecccCCCcCCCCCEEEEeeeeecCcccCC Q lcl|NC_021532. 236 TEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIND--------VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE 307 (663) Q Consensus 236 ~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~--------~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~ 307 (663) ....++......+++++.. +|..+.++....++ .....+.+| .+.+||+++..... +...|. T Consensus 163 ~~d~f~~~~~~~yRvL~l~-------~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--l~~IP~v~~~~~~~-~~~~~~ 232 (452) T protein:vir:94 163 TADRYVQNIRVRYRCLELV-------DGLLQITVHETQDGKVWELAKTSTIQNVGVT--MDYIPFFCITPSGL-SMTPAK 232 (452) T ss_pred CCCcccceeEEEEEEEEEe-------CCeEEEEEEEccCCceeeeccceeecCCCcc--cceeEEEEEcCCCC-CCCCCc Confidence 0111222222233333211 12111110000111 122222333 35678876543332 233477 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCcccc-HHHHHHHH Q lcl|NC_021532. 308 ANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIP-SSAFDMIS 386 (663) Q Consensus 308 g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~-~~~~~~~~ 386 (663) ++.-.+..++..+....+-.-+++..+..|...+. |. +..+.....|+.+|.+-..+....++.+..-+ .....-|+ T Consensus 233 pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-g~-~~~~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~ 310 (452) T protein:vir:94 233 PPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT-GA-ESQSTMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALS 310 (452) T ss_pred cchHHHHHHHHHHhcchhHHHHHHHHcccceeEee-cC-cCCCceEecccccccCCCCCCcceEEccCchhHHHHHHHHH Confidence 78889999988888888888889999999866553 32 33344556777777665323346666654332 22344455 Q ss_pred HHHHHHHHHhCCChHHcCCCcccchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEec Q lcl|NC_021532. 387 LMNNEIESITGTKSFSGGINSGSLGSTATGA-RGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTN 465 (663) Q Consensus 387 ~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i-~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~ 465 (663) .+.+.|..+ |.. ...+... +++++.. ..........|..++.++.+ .+..++.++..|...+.-+. T Consensus 311 ~le~~m~~~-Ga~-ll~~~~~---~~~s~ea~~~~~~~~~s~L~~~a~~~e~-----al~~~l~~~a~w~g~~~~~~--- 377 (452) T protein:vir:94 311 EKQAQLASL-SAR-LIDNSTR---GSEATETVKLRYMSETASLKSVTRAVEA-----LLNKAYSCIMDMESMGGTLN--- 377 (452) T ss_pred HHHHHHHHH-HHH-hhccCCC---cchHHHHHHHHHHHhhHHHHHHHHHHHH-----HHHHHHHHHHHHcCCCCceE--- Confidence 666666443 332 2222111 1222222 22222335667777777754 24567777878765432111 Q ss_pred CeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhh----hhhhhhhh-c Q lcl|NC_021532. 466 DKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQ----AKRMREYE-P 540 (663) Q Consensus 466 ~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~----~~~l~~~~-~ 540 (663) +.+|++-... .-.. +.+.++++.+... .+....+...+.-.++.+. ...+.+.. + T Consensus 378 ---v~~n~dF~~~-----------~~~~---~~~~al~~~~~~G---~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~ 437 (452) T protein:vir:94 378 ---IKLNSAFLDS-----------KLTA---AELKAWVEAYLSG---GISKEIYIHALKVGKVLPPPGESMGVIPDPPAP 437 (452) T ss_pred ---EEeccccccc-----------cCCH---HHHHHHHHHHhcC---CCcHHHHHHHHHhCCCCCCccCHHHHHHHhhcc Confidence 1222111100 0011 2223333332221 1222223222222222111 00000000 0 Q ss_pred chhhHHHHhhHHHHH Q lcl|NC_021532. 541 KPDPVQEKIRQLELE 555 (663) Q Consensus 541 ~~~~~~~q~~q~~~~ 555 (663) .+.+........... T Consensus 438 ~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 438 EPSPSNTPPNPSSKA 452 (452) T ss_pred CcccCCCCCCCccCC Confidence 000000000000000 No 125 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=97.68 E-value=2.9e-05 Score=45.48 Aligned_cols=583 Identities=11% Similarity=0.026 Sum_probs=135.3 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--H-------------Hh--cCCcC-C-c------------cccCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWK--A-------------EY--NGEPY-G-N------------EQKGK 49 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~-------------~y--~~~~~-~-~------------~~~g~ 49 (663) .+=.=+.+.+.+..++.....+|.+...+.++|+ . -+ .|.+. . + +.++| T Consensus 5 ~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~~~nr 84 (720) T protein:vir:35 5 LQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEYRHNR 84 (720) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHHHhCC Confidence 3333344566666666665555555544444432 0 11 12221 0 0 11223 Q ss_pred Cc--cccHHH--HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH--HHHh- Q lcl|NC_021532. 50 SA--IVSRDI--KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK--VLDR- 122 (663) Q Consensus 50 s~--~~~~~i--~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~--d~~~- 122 (663) +. +++..- -..+..++..+++.+... +.-+.. .+.+....+.....-.++..++.. |... T Consensus 85 ~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~---------~~~~~~----~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~ 151 (720) T protein:vir:35 85 ITVKFRPGDKTASEALANKLNGLFRADYEE---------TDGGEA----CDNAFDDGSTGGFGCFRLTTNLVNALDPMDE 151 (720) T ss_pred CceEEEcCCCcchHHHHHHHHHHHHHHHHh---------cCchHH----HhHHHHHhhhccceeEEeeecccccCCCCcc Confidence 22 222100 112222333333333221 111111 111111111000000000000000 0000 Q ss_pred cCceE--------EEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccccc--Chh Q lcl|NC_021532. 123 EGTLV--------VQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQD--NLD 192 (663) Q Consensus 123 ~G~g~--------~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~--d~~ 192 (663) .+..+ ..|+||+.. .+.++++.++.....++..-.....|++-.+.. ++. T Consensus 152 ~~~i~i~~v~~~~~~v~~Dp~a--------------------~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~ 211 (720) T protein:vir:35 152 RQRICLEPIYDPARSVWFDPDA--------------------KKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIE 211 (720) T ss_pred cceeeEecccCchhheeecccc--------------------cccChhhhhhhhhhcCCCHHHHHHhCCCcccccccccc Confidence 01111 124454332 122333334333333332222333344322110 010 Q ss_pred hCceEEEEeecCHHHHH-HhcCCcC--hhhhh----hccchhhhcccccc------ccccccccc-cceE-EEEEEEEEe Q lcl|NC_021532. 193 NAQFVIHRYETDLSTLK-KDGRYKN--LDKLA----KTSGEDFDYDSPDD------TEFQFSDAP-RKKL-IIYEYWGNY 257 (663) Q Consensus 193 d~~~~~~~~~~~~~~l~-~~g~~~~--~~~~~----~~~~~~~~~~~~~~------~~~~~~d~~-~~~v-~v~E~w~~~ 257 (663) +..+ ..|.+.+.+. ...++.. ...+. ...+.-..+..+.. ......... ++.+ +..-+|+.+ T Consensus 212 ~~~~---~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~ 288 (720) T protein:vir:35 212 RSWD---YDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVV 288 (720) T ss_pred cccc---ccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEee Confidence 0000 0011100000 0000000 00000 00000000000000 000000111 1111 111234332 Q ss_pred eecCCcee----------EEEEEEEECCEEEecccCCCcCCC-CCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 258 DVDGDGIA----------EPIVCAWINDVIVRLQSNPYPDGK-PPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRG 326 (663) Q Consensus 258 ~~~~~g~~----------~~~~~~~~g~~~l~~~~~p~~~~~-~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 326 (663) .++.+. -+++.+|.-.. |.+|. .+|-++....++-..+-+ ..++++.. T Consensus 289 --~g~~~l~~~~~~p~~~fP~vP~~g~r~--------~~d~~~~~~G~vr~~kd~Q~~~N~-----------~~s~~~~~ 347 (720) T protein:vir:35 289 --DGEGFLEKAQRIPGEHIPLIPVYGKRW--------FIDDIERVEGHIAKAMDAQRLYNL-----------QVSMLADS 347 (720) T ss_pred --ccchhcccCCCCCCCccceEEEEeeee--------ccCCCcccceeeecchhHHHHHHH-----------HHHHHHHH Confidence 111111 12222222111 11221 123344444444443321 11111111 Q ss_pred HHHH---HHhcCCCcEEeeccccC----------cchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHH- Q lcl|NC_021532. 327 IIDN---MAQSNNGQVAIRKGALD----------QTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEI- 392 (663) Q Consensus 327 ~~~~---~~~~~~~~~~~~~~~i~----------~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 392 (663) +.-+ +...+...+..-++.+. +.+.....+|.++............+.++....++++-......+ T Consensus 348 ~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vs 427 (720) T protein:vir:35 348 ATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVT 427 (720) T ss_pred HHcCCccccccCcchHHHHHHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHh Confidence 1000 00000001000000000 001111223444333333333333344444444443333222222 Q ss_pred ---HHHhCCChHHcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhcCCceE Q lcl|NC_021532. 393 ---ESITGTKSFSGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN--------AEFLEEEEV 460 (663) Q Consensus 393 ---~~~tGi~~~~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li--------~q~~~~~~~ 460 (663) .+..|.+.-..|..-+.... .+......+.+-....+. ..+.+..++-..+ ..--..+.+ T Consensus 428 Gi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~--------~g~~lL~lI~~~y~~er~~RI~~ed~~~~~ 499 (720) T protein:vir:35 428 GSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKSLKR--------AGEVWLSMAREVYGSDRQVRIVNADGTDDI 499 (720) T ss_pred CCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHcCCCcEEEEecCCCCcce Confidence 23444432223321111111 111111222221111111 1111222222221 111112333 Q ss_pred EEEec-------Ceee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccC-CCcchhHHHHHHHHHh---h Q lcl|NC_021532. 461 IRVTN-------DKFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPN-EDPKIRRDIMADIMDL---M 526 (663) Q Consensus 461 iri~~-------~~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~-~~p~~~~~~l~~~~~l---~ 526 (663) +.++. +..+ .|+.-.+.-.++......+. +.+..+.+++++..+.+. ....+...++...+++ . T Consensus 500 v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~--req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~ 577 (720) T protein:vir:35 500 ALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTAR--RDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLD 577 (720) T ss_pred EeechhhhccCCCceeeeecceeeeeEEEEecccCcccH--HHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHH Confidence 33321 1111 12221111112222221111 223344455555544332 1111111111111100 0 Q ss_pred hhhhhhhhh-------hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHH Q lcl|NC_021532. 527 RMPEQAKRM-------REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARAN-------ENTIDAELKRSKAAVE 592 (663) Q Consensus 527 ~~~e~~~~l-------~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q-------~~~~~~~~~~~~~~~e 592 (663) .+.+..+.. +...++ .+...+..+.+.++++++++++++.+.+.+++.+ ..++++...+++++.+ T Consensus 578 e~~erirk~~~~~~~~~~~~~e-~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~ 656 (720) T protein:vir:35 578 EFKEYNRKQLLTQGVVKPRNTE-EEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVA 656 (720) T ss_pred HHHHHHHhhcchhcccCccChh-HHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111100 000110 0001111111222222223333332222222222 2222222222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHH-HHHHHHH-HHH-HHHHHHHHHHHHHHHhhhh Q lcl|NC_021532. 593 KAKARKLSSEADMTDLKFVKEDNGYAHLEQ--VELED-LRHAQHL-ERE-AMKHRANLEQMLAQRNAGD 656 (663) Q Consensus 593 ~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~--~~~~~-~~~~~~~-~~e-~~k~~~~~e~~~~~~~~~~ 656 (663) +++..+..+++...+.... .++++.... .++.. .+..+++ +++ ..++ .+++..+++..= T Consensus 657 ~a~~~~~~aq~~~~~q~~i--~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~---~~~~~~~~~~~~ 720 (720) T protein:vir:35 657 EAKMVQILASADSAKRAEI--REALKMLHQFQKEQGDASRADAELILKATDTQH---KQNRDAAKNHSI 720 (720) T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcchHHHHHHHHhhcccchhh---hhhHHHhhccCC Confidence 3333222222221111111 111111111 11000 1111111 111 1111 233333444444 No 126 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.65 E-value=3.3e-05 Score=45.15 Aligned_cols=455 Identities=10% Similarity=0.010 Sum_probs=183.3 Q ss_pred CcHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-------Cc-c----------ccCCCccccHHHHHHHH Q lcl|NC_021532. 3 INKAELL--SALKADMKAADVLKQEQDSLISTWKAEYNGEPY-------GN-E----------QKGKSAIVSRDIKKQSE 62 (663) Q Consensus 3 ~~~~~~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~-------~~-~----------~~g~s~~~~~~i~~~v~ 62 (663) |.+.++- +....+ +......|+..++.|.|+.. ++ + .+=+..+++|.+..+++ T Consensus 1 m~~~~~~~v~~~h~~-------y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~ 73 (513) T protein:vir:97 1 MADKDPKSPATTSGA-------YDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLD 73 (513) T ss_pred CCCCCCCCCCcCCHH-------HHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHH Confidence 4444322 111112 22334445555666655311 11 1 11124678999999999 Q ss_pred HHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHH-HHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecc Q lcl|NC_021532. 63 WQHATIVDPFVSTADIIKCTPITWEDTDSAEQNEL-LLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTV 141 (663) Q Consensus 63 ~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~ 141 (663) .++|.+.+. + |.. ++.. ...+.. ++..+=-..++...++.+++..++.+|.+++-|.+...... . T Consensus 74 ~l~G~vf~k----~------p~~-~~~~-p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~--~ 139 (513) T protein:vir:97 74 TLSGKPFSE----P------IKL-NEDV-PKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPR--E 139 (513) T ss_pred HHhhhhhhc----C------ccc-CcCc-hHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCc--c Confidence 999877532 1 211 1111 112222 33332112344555678899999999999887755311000 0 Q ss_pred cccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhh---CceEEEEeecCHHHHHHhcCCcChh Q lcl|NC_021532. 142 MGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDN---AQFVIHRYETDLSTLKKDGRYKNLD 218 (663) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d---~~~~~~~~~~~~~~l~~~g~~~~~~ 218 (663) .+.. ....+......+|++..++|+++. +.... .+.. ..++..+.... T Consensus 140 ---------~~~~----~T~Ade~~~~~rPy~~~~~~e~Ii-nW~~~-~v~G~~~L~~v~l~E~~~-------------- 190 (513) T protein:vir:97 140 ---------DGQP----RTLADDRREGLRPYWVMIKPECLL-FARSE-VINGVEVLQHVRIIEHYM-------------- 190 (513) T ss_pred ---------chhH----HhHHHHHhhccCceEEEecHhhhc-Cccee-ccCcceeeeeEEEEEEEe-------------- Confidence 0000 001111222345778788888774 22111 1111 11111111000 Q ss_pred hhhhccchhhhccccccccccccccccceEEEEE-----EEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEE Q lcl|NC_021532. 219 KLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYE-----YWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFL 293 (663) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E-----~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~ 293 (663) .. .. ++.....++++++ .|-+......+..++ ++......+ .+.+||+ T Consensus 191 --------------~~-Dg--f~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~--------~~~~~g~~~--l~~IP~v 243 (513) T protein:vir:97 191 --------------EQ-DG--FAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEW--------ALADEWATG--LNYVPLV 243 (513) T ss_pred --------------ec-CC--CcceEEEEEEEEeCceEEEEEeecCCCccccce--------EEecCCCCc--CCceeEE Confidence 00 00 1111112233322 221110000000010 122222233 3567887 Q ss_pred EEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCc-chhhhccCCcceEeCCCCCccccc Q lcl|NC_021532. 294 VVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQ-TNRKKFLAGANFEFNGTANDFWHG 372 (663) Q Consensus 294 ~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~-~d~~~~~p~~vi~~~~~~~~~~~~ 372 (663) ++.... -+...|.++.-.+..+...+=...+-.-+++..+..|...+. |.-+. .+.....|+.++.+-..+....++ T Consensus 244 ~~~~~~-~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~-G~~~~~~~~i~iG~~~~~~lpe~~~~~~yi 321 (513) T protein:vir:97 244 TFYADR-QGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACS-GASGEDSDPVVVGPNKVLYNPDPAGRFYYV 321 (513) T ss_pred EEecCC-CCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeee-cCCcCCCCceEeeccccccCCCCCCcceee Confidence 665432 233346677777777776665566667788888888876663 32221 123345566666554223446666 Q ss_pred cCcccc-HHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 373 SYNAIP-SSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN 451 (663) Q Consensus 373 ~~~~~~-~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li 451 (663) .+.... .....-+..+.+.|.. .|.. ..... ..+.||++.+....+....|..++.++.+ .++.++.++ T Consensus 322 e~~g~~i~~~~~~l~~le~qm~~-~Ga~-ll~~~---~~~~Ta~a~~~~~~~~~S~L~~~a~~le~-----al~~~l~~~ 391 (513) T protein:vir:97 322 EHTGQAIAAGRTDLKDLEEQMAG-YGAE-FLKRK---TGGQTATARALDSAEATSDLSAMTGLFED-----ALAQALDIT 391 (513) T ss_pred ccCchhHHHHHHHHHHHHHHHHH-HHHH-hhccC---CccccHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 655432 2234456666666643 4443 22221 22467777777777778888888888754 345667777 Q ss_pred HHhcCCc-eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhc----------------cCCCcch Q lcl|NC_021532. 452 AEFLEEE-EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLG----------------PNEDPKI 514 (663) Q Consensus 452 ~q~~~~~-~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~----------------~~~~p~~ 514 (663) ..|+..+ .-.. +.|++ +|... . . . .+.+.++++.+. ..++|.. T Consensus 392 a~wlg~~~~~~~------v~in~-----dF~~~----~-~-~---~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~ 451 (513) T protein:vir:97 392 ADWLRLGPNGGT------VELVK-----DYDLE----E-M-D---APGLQALQVAREKRDISRKTYLNGLRLRGVLPEDF 451 (513) T ss_pred HHHhCCCCCccE------EEecc-----ccCcc----c-C-C---HHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccC Confidence 7776421 1111 22222 11110 0 0 0 011111121111 1122222 Q ss_pred hHHHHHHHHHhhhhhhhhhhhhhhhcch----hhHHHHhh---HH--HHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_021532. 515 RRDIMADIMDLMRMPEQAKRMREYEPKP----DPVQEKIR---QL--ELENLMLENQML-VASINDKNARA 575 (663) Q Consensus 515 ~~~~l~~~~~l~~~~e~~~~l~~~~~~~----~~~~~q~~---q~--~~~~~q~~~~~~-~a~~~~~~a~~ 575 (663) ....... +....+.+..... ++..+... +. +-...-++-+.. ..-.. --... T Consensus 452 d~~~~~e--------~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 513 (513) T protein:vir:97 452 DEDEDWE--------ELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGN-PGGES 513 (513) T ss_pred CHHHHHH--------HHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCccccCCC-CCCCC Confidence 1111100 0111111000000 00000000 00 000000000000 00000 00000 No 127 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.56 E-value=4.5e-05 Score=44.43 Aligned_cols=445 Identities=12% Similarity=0.058 Sum_probs=173.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC-C---------ccccC--------CCccccHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPY-G---------NEQKG--------KSAIVSRDIKKQSE 62 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~-~---------~~~~g--------~s~~~~~~i~~~v~ 62 (663) |-..+- -.++.+.-...+......|+..++.|.|+.. . ++..+ ...+++|.+..+++ T Consensus 1 ~~~~~~-----~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~ 75 (489) T protein:vir:78 1 MLTENG-----QGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLS 75 (489) T ss_pred CccCCC-----ccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHH Confidence 211100 0011111122234445556666777766421 1 11111 12457888888999 Q ss_pred HHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 63 WQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 63 ~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .++|.+. ..++.+++ | ..+..++..+=-..++....+.+++..++.+|.+++-|.+.... T Consensus 76 ~l~G~vf----rk~p~~~~-p---------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~------ 135 (489) T protein:vir:78 76 GMVGSVM----RKEPEINI-P---------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETG------ 135 (489) T ss_pred HHhchhh----cCCcceec-c---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCC------ Confidence 8888764 33344331 1 12444444432233445556788999999999999887653110 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChh---hCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLD---NAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~---d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) .....+......+|++..++|+++. +.... .+. ...++..+.... T Consensus 136 ---------------~~T~ade~~~~~rPy~~~~~~~~Ii-nW~~~-~v~G~~~Lt~v~lrE~~~--------------- 183 (489) T protein:vir:78 136 ---------------AATAAEQNAGLLNPTIAFYTTENIV-NWRLT-RVGSVNRVTMVVLRETWE--------------- 183 (489) T ss_pred ---------------CcCHHHHHHhcCCcEEEEechhhhc-Cceee-eeCCccceeEEEEEEeEE--------------- Confidence 0111112223346888888888884 22111 111 111221111000 Q ss_pred hhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEE--EEEEECCE------EE-ecccCCCcCCCC Q lcl|NC_021532. 220 LAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPI--VCAWINDV------IV-RLQSNPYPDGKP 290 (663) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~--~~~~~g~~------~l-~~~~~p~~~~~~ 290 (663) ..+....++.....++++++. +.+|..+.+ +..-.|+. ++ ..+.++ .+.+ T Consensus 184 -------------~~d~~~~f~~~~~~q~RvL~~------~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~--l~~I 242 (489) T protein:vir:78 184 -------------YNEPGNEFETKYGEQYRVLDI------DSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESL--RGVI 242 (489) T ss_pred -------------eecCCCCccceeEEEEEEEec------CCCcceEEEEEEeecCCcccceeeEEeccCCCCc--cCee Confidence 000111222223334444431 112211110 00001111 11 111222 2556 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC--cchhhhccCC-------cceE Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD--QTNRKKFLAG-------ANFE 361 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~--~~d~~~~~p~-------~vi~ 361 (663) ||+++.... -+...+.++.-.+..+....=...+-.-+++..+..|...+. |.-+ ........+. ..+. T Consensus 243 Pfv~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~-G~d~~~~~~~~~~~~~~i~~g~~~~~~ 320 (489) T protein:vir:78 243 PFTFIGATN-NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIY-PGENLTPQAFKEANPNGIKFGSRRGHN 320 (489) T ss_pred eEEEEecCC-CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeee-cCccCCcccccccCccceeeCCccccc Confidence 776554322 122234555666666643332233445667777778776542 2211 1111111122 2221 Q ss_pred eCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 362 FNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVK 441 (663) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~ 441 (663) + +.++...++.+....- ....|..+.+.|.. .|..-.. .+. +.||++.+.-..+....|..++.++.+ T Consensus 321 l-p~~~~~~~ie~~~~~~-~r~~l~~le~qm~~-lGa~l~~----~~~-~~Ta~~~~~~~~~~~S~L~~~a~~~e~---- 388 (489) T protein:vir:78 321 L-GYGGSAQLIQAGENNL-ARQNMLDKEQQAIQ-IGAQLIT----PTQ-QITAQSARIQRGADTSVMATIARNVSQ---- 388 (489) T ss_pred C-CCCCCcceeccCcchH-HHHHHHHHHHHHHH-Hhhhhcc----CCc-chhHHHHHHHHHHhhHHHHHHHHHHHH---- Confidence 1 1122233444433322 23334444444432 2332221 112 467877877777778888888888864 Q ss_pred HHHHHHHHHHHHhcCCc--eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHH Q lcl|NC_021532. 442 PLMRKWMAYNAEFLEEE--EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIM 519 (663) Q Consensus 442 ~l~~~~~~li~q~~~~~--~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l 519 (663) .+..++.++..|...+ .-+.| .+|++ |.. .... .+.+.+++..+... .+....+ T Consensus 389 -al~~~l~~~a~w~G~~~~~~~~i------~~n~d-----F~~------~~~d---~~~~~al~~~~~~G---~is~~t~ 444 (489) T protein:vir:78 389 -AYTDALRWVAVMLGKPEDTEVEF------RLNMD-----FFL------EPMT---AQDRAAWMADINAG---LLPATAY 444 (489) T ss_pred -HHHHHHHHHHHHcCCCCCCceEE------Eeecc-----cCc------ccCC---HHHHHHHHHHHhcC---CCCHHHH Confidence 3456777777775421 11111 11211 111 0001 12233333332221 1222222 Q ss_pred HHHHHhhhhhh--hhhhhhhhhcchhhHHH-HhhHHHHHHHHHHH Q lcl|NC_021532. 520 ADIMDLMRMPE--QAKRMREYEPKPDPVQE-KIRQLELENLMLEN 561 (663) Q Consensus 520 ~~~~~l~~~~e--~~~~l~~~~~~~~~~~~-q~~q~~~~~~q~~~ 561 (663) ...+.-.++.+ ..+...++..++.+..- -....++..++.+. T Consensus 445 ~~~L~~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 445 YAALRKAGVTDWTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHhCCCCCccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 22222211110 00111111111100000 00000000000000 No 128 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=97.55 E-value=4.6e-05 Score=44.35 Aligned_cols=604 Identities=10% Similarity=-0.002 Sum_probs=127.7 Q ss_pred CCCcHHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHhcCCcCCccccCCCccccHHHHHHHHHHHHHHHH-------- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAA-D-VLKQEQDSLISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVD-------- 70 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~-~-~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~-------- 70 (663) =+-..-.++....++.+.. . .+..++...+++|..-.++.....+.+=.++.|.+.|-...-.+...+.. T Consensus 11 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~~~~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~~~ 90 (705) T protein:vir:88 11 DDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNERPGKSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYE 90 (705) T ss_pred CHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcccCCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEe Confidence 2222222222222222222 2 12234555566665554443322222222333333222222222211110 Q ss_pred hhcCCCc--------eE--EEEeCCcchHHHHHHHHHHH-------hHHHHhccch-hHHHH--------HH-------H Q lcl|NC_021532. 71 PFVSTAD--------II--KCTPITWEDTDSAEQNELLL-------NTQFSRKFDR-FNFMS--------KA-------V 117 (663) Q Consensus 71 ~~~~~~~--------~~--~~~p~~~~D~~~Ae~~~~~~-------~~~~~~~~~~-~~~~~--------~~-------~ 117 (663) ++..+|. .+ .|.-.+......-....+++ +..|+..... .+.+. .. + T Consensus 91 p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~ 170 (705) T protein:vir:88 91 PDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSI 170 (705) T ss_pred eCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhc Confidence 0000000 00 00001111111111222211 1111110000 00000 00 0 Q ss_pred HHHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceE Q lcl|NC_021532. 118 KVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFV 197 (663) Q Consensus 118 ~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~ 197 (663) .+-...|.|.+-|.+......-.+..+.|.+..+..+.. ..++.+..+........+-..-.+++|-.....+.. T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~-a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~---- 245 (705) T protein:vir:88 171 LAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRL-ATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPY---- 245 (705) T ss_pred ccccccccceeeeEEeeeeecCceeeeeccHHHceecCC-CCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhc---- Confidence 011123333333333222111111111111111110000 001111111100000000011222222111100000 Q ss_pred EEEeecCHHHHHHhcCCcChhhhhhccchhhhcccc-ccccccccccc-----cceEEEEEEEEEeeecCCceeEEEEEE Q lcl|NC_021532. 198 IHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSP-DDTEFQFSDAP-----RKKLIIYEYWGNYDVDGDGIAEPIVCA 271 (663) Q Consensus 198 ~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~-----~~~v~v~E~w~~~~~~~~g~~~~~~~~ 271 (663) +....+....-..+.+ ..+.... .......+ ......+.... .+...+. +|++...-|+-+.. ... T Consensus 246 -~~~~~~~~~~e~~~~~-~~d~~~~---~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~-~~~~~~~~g~~il~--~~~ 317 (705) T protein:vir:88 246 -DEYEFSDSQPERLVRD-NFDMTGQ---LQYNSGDDAEANREVWASECYTLLDVDGDGIS-ELRRILYVGDYIIS--NEP 317 (705) T ss_pred -ccccchhhhhhhcccc-ccccccc---cccccccccCCceeEEEEEeeeEecccCCcce-eeEEEEEeCccccc--ccc Confidence 0000000000000000 0000000 00000000 00000000000 0000111 23333222221111 000 Q ss_pred EECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHH---HHHhcCCCcEEeeccccCc Q lcl|NC_021532. 272 WINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIID---NMAQSNNGQVAIRKGALDQ 348 (663) Q Consensus 272 ~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~---~~~~~~~~~~~~~~~~i~~ 348 (663) .+.-||.+ +||...+-. ..|..++.-+.+.-.-.-...|.....+-. .-.....+.+ ...+-++. T Consensus 318 --------~~~~PF~~--~~~~p~~~~-~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v-~~~d~~~~ 385 (705) T protein:vir:88 318 --------WDCRPFAD--LNAYRIAHK-FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQV-NLEDLLTN 385 (705) T ss_pred --------CCCCCEEE--ecceeecCc-cccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecccccc-Cccccccc Confidence 11223322 111111111 112222222222222223333333222111 0111112211 11222221 Q ss_pred --chhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHH----HHHHhCCChHHcCCCcccchhHHHHHHHHHH Q lcl|NC_021532. 349 --TNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNE----IESITGTKSFSGGINSGSLGSTATGARGALD 422 (663) Q Consensus 349 --~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~ 422 (663) .......+++.+.+.+.+ +.++-...+.+++...... -....|++..+++.. ++..+. ..+....+ T Consensus 386 ~pg~vv~~~~~~~i~~~~~~------~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~-~Ta~~i-~~~~~~~~ 457 (705) T protein:vir:88 386 EAAGIVRVKSMNSITPLETP------QLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSN-QAAMSV-NQLMTAAE 457 (705) T ss_pred CCCeeEEecCCCccccccCC------cCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccch-hhHHHH-HHHHHHHH Confidence 111112233333333211 2223223343444433333 345668765544321 111222 22221112 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHH-- Q lcl|NC_021532. 423 A-TATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE---EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKS-- 496 (663) Q Consensus 423 ~-~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~---~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~-- 496 (663) . ....+..+.+.+-+.+++.++.++..+...--. ....+.|...++.. ...-..++.+............ T Consensus 458 ~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~----~~~v~v~v~~~~~~~eq~~a~l~~ 533 (705) T protein:vir:88 458 QQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRE----RSDLTVTVGIGNMNKDQQMLHLMR 533 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhcc----CCceEEeeccccchHHHHHHHHHH Confidence 2 222223333343333444455544443321100 00112222111100 0000011111111111110000 Q ss_pred ------------------------HHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchh-hHHHHhhH Q lcl|NC_021532. 497 ------------------------QELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPD-PVQEKIRQ 551 (663) Q Consensus 497 ------------------------q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~-~~~~q~~q 551 (663) ..+..++..++.. -....+.. -..+......+...+.... ...+...+ T Consensus 534 ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k----~~~~~~~~---~~~~e~~~~~~~~~q~e~~~~~~~~~~q 606 (705) T protein:vir:88 534 IWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYK----DPDRFWTN---PNSPEALQAKAIREQKEAQPKPEDIKAQ 606 (705) T ss_pred HHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhh----hHHHHhhh---hhhHHHHHHHHhhhhhhhhHHHHHHHHH Confidence 0111111111000 00000000 0000000000000000000 11111233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Q lcl|NC_021532. 552 LELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVEL--EDLR 629 (663) Q Consensus 552 ~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~--~~~~ 629 (663) ++.++.+++.+..+++++..+.+++..+++.+.++++.++++...+..+++......+.. ..++..+++. +..+ T Consensus 607 ~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e----~e~~~~e~e~~~e~~q 682 (705) T protein:vir:88 607 ADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFT----WERARNEAEYHLEATQ 682 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH Confidence 444444555444444444333333333333333333333222211111111111110000 0111111111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 630 HAQHLEREAMKHRANLEQMLAQRN 653 (663) Q Consensus 630 ~~~~~~~e~~k~~~~~e~~~~~~~ 653 (663) .. +.+.+..+...+..-..++|. T Consensus 683 ~~-~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 683 AR-AAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred HH-HHHHHHHhHHHHHHHHHHhcC Confidence 11 011111111111111111111 No 129 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.42 E-value=7e-05 Score=43.35 Aligned_cols=443 Identities=12% Similarity=0.056 Sum_probs=174.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcC------Cc----cccC--------CCccccHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPY------GN----EQKG--------KSAIVSRDIKKQSE 62 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~------~~----~~~g--------~s~~~~~~i~~~v~ 62 (663) |-..+ =-.++.+.-...+......|+..++.|.|+.. ++ +..+ ...+++|.+..+++ T Consensus 1 ~~~~~-----~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~ 75 (491) T protein:vir:95 1 MLTAN-----GQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLS 75 (491) T ss_pred CcccC-----CccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHH Confidence 11110 00011111122234445556666677766421 11 1111 23567889999999 Q ss_pred HHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccc Q lcl|NC_021532. 63 WQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVM 142 (663) Q Consensus 63 ~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~ 142 (663) .++|.+. ..+|.+++ | ..+..++..+--..++....+.+++..++.+|.+++-|.+.... T Consensus 76 ~l~G~vf----rk~p~~~~-p---------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~------ 135 (491) T protein:vir:95 76 GMVGSVM----RKEPEINI-P---------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETA------ 135 (491) T ss_pred HHhchhh----cCCceeec-c---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCc------ Confidence 8888764 33333321 1 12344444432233445556788999999999999887553110 Q ss_pred ccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChh---hCceEEEEeecCHHHHHHhcCCcChhh Q lcl|NC_021532. 143 GEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLD---NAQFVIHRYETDLSTLKKDGRYKNLDK 219 (663) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~---d~~~~~~~~~~~~~~l~~~g~~~~~~~ 219 (663) .....+.+....+|++..++|+++. +.... .+. ...++..+... T Consensus 136 ---------------~~T~Ade~~~~~rPy~~~~~~~~Ii-nW~~~-~v~g~~~L~~v~l~E~~---------------- 182 (491) T protein:vir:95 136 ---------------AATAAEQNAGLLNPTIAFYTTENIV-NWRLT-RVGSVNRVTMVVLRETW---------------- 182 (491) T ss_pred ---------------ccCHHHHHHhcCCcEEEEechhhhc-Cceee-eeCCceeeeEEEEEEeE---------------- Confidence 0011111222346888888888874 22111 111 11111111100 Q ss_pred hhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEE--CCEE------E-ecccCCCcCCCC Q lcl|NC_021532. 220 LAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWI--NDVI------V-RLQSNPYPDGKP 290 (663) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~--g~~~------l-~~~~~p~~~~~~ 290 (663) ...+....++......++|++.. .+|..+.++.... |+.. + ..+.+++ +.+ T Consensus 183 ------------~~~d~~~~f~~~~~~qyRvL~l~------~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l--~~I 242 (491) T protein:vir:95 183 ------------EYHEPGNEFETKYGEQYRVLDID------TDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLR--GVI 242 (491) T ss_pred ------------EeecCCCCcccceEEEEEEEeec------CCCceEEEEEEEcCCCcceeeeeeeeecCCCccc--Cee Confidence 00111122333334455555431 1221111111111 1111 1 1122222 456 Q ss_pred CEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeecc-ccCcchhhhccCCcceEeC------ Q lcl|NC_021532. 291 PFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKG-ALDQTNRKKFLAGANFEFN------ 363 (663) Q Consensus 291 Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~-~i~~~d~~~~~p~~vi~~~------ 363 (663) ||+++.... .+...+.++.-.+..+....=...+-.-+++..+..|...+.-+ ..+........+.++. +. T Consensus 243 Pfv~~~~~~-~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~-~g~~~~~~ 320 (491) T protein:vir:95 243 PFTFIGATN-NDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIK-FGSRCGHN 320 (491) T ss_pred EEEEEecCC-CCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeE-ecCcCCcC Confidence 776554332 22223455566666654332223333456677777776654211 1111111122222221 11 Q ss_pred -CCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 364 -GTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKP 442 (663) Q Consensus 364 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~ 442 (663) +.+.....+.+.+.. .....|..+.+.|.. .|..- ... + .+.||++.+.-..+....|..++.++.+ . T Consensus 321 lP~~~~~~~ie~~~~~-~~~~~l~~~e~qm~~-~Ga~l---~~~-~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~-a--- 389 (491) T protein:vir:95 321 LGYGGSAQLIQAGENN-LARQNMLDKEQQAIQ-IGAQL---ITP-S-QQITAESARIQRGADTSVMATIARNVSQ-A--- 389 (491) T ss_pred CCCCCccceeecCcch-HHHHHHHHHHHHHHH-HHHHh---ccC-C-cchhHHHHHHHHHHhhHHHHHHHHHHHH-H--- Confidence 112233334333322 123334444444433 23321 111 1 2467877877777778888888888864 2 Q ss_pred HHHHHHHHHHHhcCCc--eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHH Q lcl|NC_021532. 443 LMRKWMAYNAEFLEEE--EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMA 520 (663) Q Consensus 443 l~~~~~~li~q~~~~~--~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~ 520 (663) ++.+|.++..|.... .-+. +.+|++ |.. .... .+.+.+++..+... .+....+. T Consensus 390 -l~~~l~~~a~w~G~~~~~~v~------i~~n~d-----F~~------~~~~---~~~~~all~~~~~G---~is~~t~~ 445 (491) T protein:vir:95 390 -YTDALRWVAMMLGKPEDSEVE------FQLNMD-----FFL------QPMT---AQDRAAWMADINAG---LLPATAYY 445 (491) T ss_pred -HHHHHHHHHHHcCCCCCCceE------EEeecc-----ccc------ccCC---HHHHHHHHHHHhcC---CCCHHHHH Confidence 455677777775421 0001 111211 110 0001 12233344433221 12222232 Q ss_pred HHHHhhhhhh--hhhhhhhhhcch------hhHHHHhhHHHHHHHH Q lcl|NC_021532. 521 DIMDLMRMPE--QAKRMREYEPKP------DPVQEKIRQLELENLM 558 (663) Q Consensus 521 ~~~~l~~~~e--~~~~l~~~~~~~------~~~~~q~~q~~~~~~q 558 (663) ..+.-.++.+ ..+.+.++..+. .+..-...+..++..+ T Consensus 446 ~~L~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 446 AALRKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 2222222221 001111111110 0000000000000000 No 130 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=582 Identities=10% Similarity=0.048 Sum_probs=159.8 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--c Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--I 52 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~ 52 (663) .+- --..++..+..++.....+|.+...+.++|..-.| |.+.- .+.++|.. + T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v 95 (714) T protein:vir:81 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVV 95 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHHhCCcceEE Confidence 222 23356667788888888888888888887743222 32210 11223322 2 Q ss_pred ccHH---HHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCceEE Q lcl|NC_021532. 53 VSRD---IKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGTLVV 128 (663) Q Consensus 53 ~~~~---i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~g~~ 128 (663) .++. -...+..++..+++.+.. .+.-+..-.+...+.+ .. .-+. +...+. |. ..| -+ T Consensus 96 ~p~~~~~~~~~~Ae~l~~~~~~~~~---------~~~~~~~~s~af~~~~----~~-G~G~--~~~~~~~d~-~~~--~i 156 (714) T protein:vir:81 96 MSDEPDDETEKLAEAINAEFADACR---------LGNMNKARSDAYAEQI----KA-GLSW--VEVRRNSDP-FGP--EF 156 (714) T ss_pred ecCCCCchhHHHHHHHHHHHHHHHH---------hhchhHHHHHHHHHhh----hc-Ccce--EEeccccCC-CCC--Ce Confidence 2211 111122222233322222 1111111111111111 00 0011 000011 11 111 12 Q ss_pred EeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccc--------ccChhhCc---eE Q lcl|NC_021532. 129 QTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC--------QDNLDNAQ---FV 197 (663) Q Consensus 129 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a--------~~d~~d~~---~~ 197 (663) ++. .+....+++|+.. .+.++++.++.....++..-....+|++-.. ..++.+.. .. T Consensus 157 ~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:81 157 KVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred EEE--------ecchhheeecccc----ccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 221 1122233333332 2344555554443333333333444654210 00000000 00 Q ss_pred EEEeecCHHHHHH------hcCCcChhhhh----hc------------cchhhhcccccccc-----ccccccccce-EE Q lcl|NC_021532. 198 IHRYETDLSTLKK------DGRYKNLDKLA----KT------------SGEDFDYDSPDDTE-----FQFSDAPRKK-LI 249 (663) Q Consensus 198 ~~~~~~~~~~l~~------~g~~~~~~~~~----~~------------~~~~~~~~~~~~~~-----~~~~d~~~~~-v~ 249 (663) .....-..++... .++.++..++. ++ ++....++..+... .........+ -+ T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:81 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0000000000000 00000000000 00 00000000000000 0000000111 12 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHH------ Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ------ 317 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q------ 317 (663) ++.+|+ .++++....-+-|-++++- -.|+| .|. ||-++....++-+.+.+.....+--++ T Consensus 305 v~~~~~----~g~~~L~~~~~p~p~~~fp---~vp~~g~~~~~~g~-~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 376 (714) T protein:vir:81 305 IREAWF----VGPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTGE-PYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIM 376 (714) T ss_pred EEEEEE----ecCcccccCCCCCCCCcee---EEEEeeeeeeccCc-eeehhhhchhHHHHHHHHHHHHHHhhcCCceee Confidence 333332 1111111000111111100 00111 122 454444444443332210110000000 Q ss_pred --HHHHHHHHHHHHHHHhcCCC--cEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 318 --KVKTAVIRGIIDNMAQSNNG--QVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIE 393 (663) Q Consensus 318 --~~~N~~~~~~~~~~~~~~~~--~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (663) ..++..-+..... .+.| -+.++.+. .....++..+.+.+....+.. .-.+.+.....++..-..-. T Consensus 377 ~~~a~~~~d~~~~e~---~arp~~vi~~~p~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:81 377 DEDATQLSDNDLMEQ---IERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ecCcccccHHHHHHh---ccCCCCceeecccc-----cccCCCCccccccCCCCccHH--HHHHHHHHHHHHHHhhCCCh Confidence 0000000011111 1221 11111111 111223344555443211100 01111222222333222223 Q ss_pred HHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhcC--CceE Q lcl|NC_021532. 394 SITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA---------YNAEFLE--EEEV 460 (663) Q Consensus 394 ~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~---------li~q~~~--~~~~ 460 (663) ...|...- ..| .+.+. .-.+...+ ...+..+.+.+.. ..+.++.++-. ++-+.-. ..++ T Consensus 447 ~~lG~~~na~SG---vAi~~rq~qg~~~l----~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:81 447 AFLGQDSGATSG---VAISNLVEQGATTL----AEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHcCCCccchhH---HHHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceE Confidence 34444321 222 11111 11111111 1111111111111 11112222211 1111111 1234 Q ss_pred EEEecCe--eec---cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHH Q lcl|NC_021532. 461 IRVTNDK--FVP---IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIM 523 (663) Q Consensus 461 iri~~~~--~v~---i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~ 523 (663) +-|.... -+. |..-.+.-..+......+ .+.+..+.|+++++.+.|.+...+.. .+...+- T Consensus 519 v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t--~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir 596 (714) T protein:vir:81 519 IVLNAEGDNGELTNDISRLNTHIALAPVQQTPA--FKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR 596 (714) T ss_pred EeeccccCcceecccceeeeEEEEEeeccCchH--HHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHH Confidence 4443111 000 111111111222222221 23456666777777666654322111 1222221 Q ss_pred HhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 524 DLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEA 603 (663) Q Consensus 524 ~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~ 603 (663) +..+.... ....++.+.+.+.++.+++.++++.+.++++++.+..+++++++++++.+..++++.+...++...... T Consensus 597 ~~~~~~~~---~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~ 673 (714) T protein:vir:81 597 AALGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHcCCCCC---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12121111 122222333333444444555555566666666666666666666666555555544433332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 604 DMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 604 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) ...+++..+ .++..+..++...-.++++ .+ ..++++.+... T Consensus 674 ~~~~a~~a~---~~~~~~~~~~~~~~~~~q~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:81 674 ALNQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHH---HHHhHhhhhhhhHHHHHHH-HH------HHHHHHHhcCC Confidence 111111111 1111111111111111111 01 11111111111 No 131 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=582 Identities=10% Similarity=0.048 Sum_probs=159.8 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--c Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--I 52 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~ 52 (663) .+- --..++..+..++.....+|.+...+.++|..-.| |.+.- .+.++|.. + T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v 95 (714) T protein:vir:99 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVV 95 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHHhCCcceEE Confidence 222 23356667788888888888888888887743222 32210 11223322 2 Q ss_pred ccHH---HHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCceEE Q lcl|NC_021532. 53 VSRD---IKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGTLVV 128 (663) Q Consensus 53 ~~~~---i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~g~~ 128 (663) .++. -...+..++..+++.+.. .+.-+..-.+...+.+ .. .-+. +...+. |. ..| -+ T Consensus 96 ~p~~~~~~~~~~Ae~l~~~~~~~~~---------~~~~~~~~s~af~~~~----~~-G~G~--~~~~~~~d~-~~~--~i 156 (714) T protein:vir:99 96 MSDEPDDETEKLAEAINAEFADACR---------LGNMNKARSDAYAEQI----KA-GLSW--VEVRRNSDP-FGP--EF 156 (714) T ss_pred ecCCCCchhHHHHHHHHHHHHHHHH---------hhchhHHHHHHHHHhh----hc-Ccce--EEeccccCC-CCC--Ce Confidence 2211 111122222233322222 1111111111111111 00 0011 000011 11 111 12 Q ss_pred EeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccc--------ccChhhCc---eE Q lcl|NC_021532. 129 QTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC--------QDNLDNAQ---FV 197 (663) Q Consensus 129 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a--------~~d~~d~~---~~ 197 (663) ++. .+....+++|+.. .+.++++.++.....++..-....+|++-.. ..++.+.. .. T Consensus 157 ~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:99 157 KVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred EEE--------ecchhheeecccc----ccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 221 1122233333332 2344555554443333333333444654210 00000000 00 Q ss_pred EEEeecCHHHHHH------hcCCcChhhhh----hc------------cchhhhcccccccc-----ccccccccce-EE Q lcl|NC_021532. 198 IHRYETDLSTLKK------DGRYKNLDKLA----KT------------SGEDFDYDSPDDTE-----FQFSDAPRKK-LI 249 (663) Q Consensus 198 ~~~~~~~~~~l~~------~g~~~~~~~~~----~~------------~~~~~~~~~~~~~~-----~~~~d~~~~~-v~ 249 (663) .....-..++... .++.++..++. ++ ++....++..+... .........+ -+ T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:99 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0000000000000 00000000000 00 00000000000000 0000000111 12 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHH------ Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ------ 317 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q------ 317 (663) ++.+|+ .++++....-+-|-++++- -.|+| .|. ||-++....++-+.+.+.....+--++ T Consensus 305 v~~~~~----~g~~~L~~~~~p~p~~~fp---~vp~~g~~~~~~g~-~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 376 (714) T protein:vir:99 305 IREAWF----VGPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTGE-PYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIM 376 (714) T ss_pred EEEEEE----ecCcccccCCCCCCCCcee---EEEEeeeeeeccCc-eeehhhhchhHHHHHHHHHHHHHHhhcCCceee Confidence 333332 1111111000111111100 00111 122 454444444443332210110000000 Q ss_pred --HHHHHHHHHHHHHHHhcCCC--cEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 318 --KVKTAVIRGIIDNMAQSNNG--QVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIE 393 (663) Q Consensus 318 --~~~N~~~~~~~~~~~~~~~~--~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (663) ..++..-+..... .+.| -+.++.+. .....++..+.+.+....+.. .-.+.+.....++..-..-. T Consensus 377 ~~~a~~~~d~~~~e~---~arp~~vi~~~p~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:99 377 DEDATQLSDNDLMEQ---IERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ecCcccccHHHHHHh---ccCCCCceeecccc-----cccCCCCccccccCCCCccHH--HHHHHHHHHHHHHHhhCCCh Confidence 0000000011111 1221 11111111 111223344555443211100 01111222222333222223 Q ss_pred HHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhcC--CceE Q lcl|NC_021532. 394 SITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA---------YNAEFLE--EEEV 460 (663) Q Consensus 394 ~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~---------li~q~~~--~~~~ 460 (663) ...|...- ..| .+.+. .-.+...+ ...+..+.+.+.. ..+.++.++-. ++-+.-. ..++ T Consensus 447 ~~lG~~~na~SG---vAi~~rq~qg~~~l----~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:99 447 AFLGQDSGATSG---VAISNLVEQGATTL----AEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHcCCCccchhH---HHHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceE Confidence 34444321 222 11111 11111111 1111111111111 11112222211 1111111 1234 Q ss_pred EEEecCe--eec---cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHH Q lcl|NC_021532. 461 IRVTNDK--FVP---IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIM 523 (663) Q Consensus 461 iri~~~~--~v~---i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~ 523 (663) +-|.... -+. |..-.+.-..+......+ .+.+..+.|+++++.+.|.+...+.. .+...+- T Consensus 519 v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t--~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir 596 (714) T protein:vir:99 519 IVLNAEGDNGELTNDISRLNTHIALAPVQQTPA--FKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR 596 (714) T ss_pred EeeccccCcceecccceeeeEEEEEeeccCchH--HHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHH Confidence 4443111 000 111111111222222221 23456666777777666654322111 1222221 Q ss_pred HhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 524 DLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEA 603 (663) Q Consensus 524 ~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~ 603 (663) +..+.... ....++.+.+.+.++.+++.++++.+.++++++.+..+++++++++++.+..++++.+...++...... T Consensus 597 ~~~~~~~~---~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~ 673 (714) T protein:vir:99 597 AALGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHcCCCCC---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12121111 122222333333444444555555566666666666666666666666555555544433332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 604 DMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 604 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) ...+++..+ .++..+..++...-.++++ .+ ..++++.+... T Consensus 674 ~~~~a~~a~---~~~~~~~~~~~~~~~~~q~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:99 674 ALNQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHH---HHHhHhhhhhhhHHHHHHH-HH------HHHHHHHhcCC Confidence 111111111 1111111111111111111 01 11111111111 No 132 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=582 Identities=10% Similarity=0.048 Sum_probs=159.8 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--c Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--I 52 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~ 52 (663) .+- --..++..+..++.....+|.+...+.++|..-.| |.+.- .+.++|.. + T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v 95 (714) T protein:vir:32 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVV 95 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHHhCCcceEE Confidence 222 23356667788888888888888888887743222 32210 11223322 2 Q ss_pred ccHH---HHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCceEE Q lcl|NC_021532. 53 VSRD---IKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGTLVV 128 (663) Q Consensus 53 ~~~~---i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~g~~ 128 (663) .++. -...+..++..+++.+.. .+.-+..-.+...+.+ .. .-+. +...+. |. ..| -+ T Consensus 96 ~p~~~~~~~~~~Ae~l~~~~~~~~~---------~~~~~~~~s~af~~~~----~~-G~G~--~~~~~~~d~-~~~--~i 156 (714) T protein:vir:32 96 MSDEPDDETEKLAEAINAEFADACR---------LGNMNKARSDAYAEQI----KA-GLSW--VEVRRNSDP-FGP--EF 156 (714) T ss_pred ecCCCCchhHHHHHHHHHHHHHHHH---------hhchhHHHHHHHHHhh----hc-Ccce--EEeccccCC-CCC--Ce Confidence 2211 111122222233322222 1111111111111111 00 0011 000011 11 111 12 Q ss_pred EeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccc--------ccChhhCc---eE Q lcl|NC_021532. 129 QTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC--------QDNLDNAQ---FV 197 (663) Q Consensus 129 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a--------~~d~~d~~---~~ 197 (663) ++. .+....+++|+.. .+.++++.++.....++..-....+|++-.. ..++.+.. .. T Consensus 157 ~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:32 157 KVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred EEE--------ecchhheeecccc----ccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 221 1122233333332 2344555554443333333333444654210 00000000 00 Q ss_pred EEEeecCHHHHHH------hcCCcChhhhh----hc------------cchhhhcccccccc-----ccccccccce-EE Q lcl|NC_021532. 198 IHRYETDLSTLKK------DGRYKNLDKLA----KT------------SGEDFDYDSPDDTE-----FQFSDAPRKK-LI 249 (663) Q Consensus 198 ~~~~~~~~~~l~~------~g~~~~~~~~~----~~------------~~~~~~~~~~~~~~-----~~~~d~~~~~-v~ 249 (663) .....-..++... .++.++..++. ++ ++....++..+... .........+ -+ T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:32 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0000000000000 00000000000 00 00000000000000 0000000111 12 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHH------ Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ------ 317 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q------ 317 (663) ++.+|+ .++++....-+-|-++++- -.|+| .|. ||-++....++-+.+.+.....+--++ T Consensus 305 v~~~~~----~g~~~L~~~~~p~p~~~fp---~vp~~g~~~~~~g~-~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 376 (714) T protein:vir:32 305 IREAWF----VGPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTGE-PYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIM 376 (714) T ss_pred EEEEEE----ecCcccccCCCCCCCCcee---EEEEeeeeeeccCc-eeehhhhchhHHHHHHHHHHHHHHhhcCCceee Confidence 333332 1111111000111111100 00111 122 454444444443332210110000000 Q ss_pred --HHHHHHHHHHHHHHHhcCCC--cEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 318 --KVKTAVIRGIIDNMAQSNNG--QVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIE 393 (663) Q Consensus 318 --~~~N~~~~~~~~~~~~~~~~--~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (663) ..++..-+..... .+.| -+.++.+. .....++..+.+.+....+.. .-.+.+.....++..-..-. T Consensus 377 ~~~a~~~~d~~~~e~---~arp~~vi~~~p~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:32 377 DEDATQLSDNDLMEQ---IERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ecCcccccHHHHHHh---ccCCCCceeecccc-----cccCCCCccccccCCCCccHH--HHHHHHHHHHHHHHhhCCCh Confidence 0000000011111 1221 11111111 111223344555443211100 01111222222333222223 Q ss_pred HHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhcC--CceE Q lcl|NC_021532. 394 SITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA---------YNAEFLE--EEEV 460 (663) Q Consensus 394 ~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~---------li~q~~~--~~~~ 460 (663) ...|...- ..| .+.+. .-.+...+ ...+..+.+.+.. ..+.++.++-. ++-+.-. ..++ T Consensus 447 ~~lG~~~na~SG---vAi~~rq~qg~~~l----~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:32 447 AFLGQDSGATSG---VAISNLVEQGATTL----AEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHcCCCccchhH---HHHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceE Confidence 34444321 222 11111 11111111 1111111111111 11112222211 1111111 1234 Q ss_pred EEEecCe--eec---cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHH Q lcl|NC_021532. 461 IRVTNDK--FVP---IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIM 523 (663) Q Consensus 461 iri~~~~--~v~---i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~ 523 (663) +-|.... -+. |..-.+.-..+......+ .+.+..+.|+++++.+.|.+...+.. .+...+- T Consensus 519 v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t--~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir 596 (714) T protein:vir:32 519 IVLNAEGDNGELTNDISRLNTHIALAPVQQTPA--FKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR 596 (714) T ss_pred EeeccccCcceecccceeeeEEEEEeeccCchH--HHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHH Confidence 4443111 000 111111111222222221 23456666777777666654322111 1222221 Q ss_pred HhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 524 DLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEA 603 (663) Q Consensus 524 ~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~ 603 (663) +..+.... ....++.+.+.+.++.+++.++++.+.++++++.+..+++++++++++.+..++++.+...++...... T Consensus 597 ~~~~~~~~---~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~ 673 (714) T protein:vir:32 597 AALGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHcCCCCC---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12121111 122222333333444444555555566666666666666666666666555555544433332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 604 DMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 604 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) ...+++..+ .++..+..++...-.++++ .+ ..++++.+... T Consensus 674 ~~~~a~~a~---~~~~~~~~~~~~~~~~~q~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:32 674 ALNQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHH---HHHhHhhhhhhhHHHHHHH-HH------HHHHHHHhcCC Confidence 111111111 1111111111111111111 01 11111111111 No 133 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=582 Identities=10% Similarity=0.048 Sum_probs=159.8 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--c Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--I 52 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~ 52 (663) .+- --..++..+..++.....+|.+...+.++|..-.| |.+.- .+.++|.. + T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v 95 (714) T protein:vir:27 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVV 95 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHHhCCcceEE Confidence 222 23356667788888888888888888887743222 32210 11223322 2 Q ss_pred ccHH---HHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCceEE Q lcl|NC_021532. 53 VSRD---IKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGTLVV 128 (663) Q Consensus 53 ~~~~---i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~g~~ 128 (663) .++. -...+..++..+++.+.. .+.-+..-.+...+.+ .. .-+. +...+. |. ..| -+ T Consensus 96 ~p~~~~~~~~~~Ae~l~~~~~~~~~---------~~~~~~~~s~af~~~~----~~-G~G~--~~~~~~~d~-~~~--~i 156 (714) T protein:vir:27 96 MSDEPDDETEKLAEAINAEFADACR---------LGNMNKARSDAYAEQI----KA-GLSW--VEVRRNSDP-FGP--EF 156 (714) T ss_pred ecCCCCchhHHHHHHHHHHHHHHHH---------hhchhHHHHHHHHHhh----hc-Ccce--EEeccccCC-CCC--Ce Confidence 2211 111122222233322222 1111111111111111 00 0011 000011 11 111 12 Q ss_pred EeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccc--------ccChhhCc---eE Q lcl|NC_021532. 129 QTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC--------QDNLDNAQ---FV 197 (663) Q Consensus 129 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a--------~~d~~d~~---~~ 197 (663) ++. .+....+++|+.. .+.++++.++.....++..-....+|++-.. ..++.+.. .. T Consensus 157 ~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:27 157 KVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred EEE--------ecchhheeecccc----ccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 221 1122233333332 2344555554443333333333444654210 00000000 00 Q ss_pred EEEeecCHHHHHH------hcCCcChhhhh----hc------------cchhhhcccccccc-----ccccccccce-EE Q lcl|NC_021532. 198 IHRYETDLSTLKK------DGRYKNLDKLA----KT------------SGEDFDYDSPDDTE-----FQFSDAPRKK-LI 249 (663) Q Consensus 198 ~~~~~~~~~~l~~------~g~~~~~~~~~----~~------------~~~~~~~~~~~~~~-----~~~~d~~~~~-v~ 249 (663) .....-..++... .++.++..++. ++ ++....++..+... .........+ -+ T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:27 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0000000000000 00000000000 00 00000000000000 0000000111 12 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHH------ Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ------ 317 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q------ 317 (663) ++.+|+ .++++....-+-|-++++- -.|+| .|. ||-++....++-+.+.+.....+--++ T Consensus 305 v~~~~~----~g~~~L~~~~~p~p~~~fp---~vp~~g~~~~~~g~-~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 376 (714) T protein:vir:27 305 IREAWF----VGPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTGE-PYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIM 376 (714) T ss_pred EEEEEE----ecCcccccCCCCCCCCcee---EEEEeeeeeeccCc-eeehhhhchhHHHHHHHHHHHHHHhhcCCceee Confidence 333332 1111111000111111100 00111 122 454444444443332210110000000 Q ss_pred --HHHHHHHHHHHHHHHhcCCC--cEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 318 --KVKTAVIRGIIDNMAQSNNG--QVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIE 393 (663) Q Consensus 318 --~~~N~~~~~~~~~~~~~~~~--~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (663) ..++..-+..... .+.| -+.++.+. .....++..+.+.+....+.. .-.+.+.....++..-..-. T Consensus 377 ~~~a~~~~d~~~~e~---~arp~~vi~~~p~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:27 377 DEDATQLSDNDLMEQ---IERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ecCcccccHHHHHHh---ccCCCCceeecccc-----cccCCCCccccccCCCCccHH--HHHHHHHHHHHHHHhhCCCh Confidence 0000000011111 1221 11111111 111223344555443211100 01111222222333222223 Q ss_pred HHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhcC--CceE Q lcl|NC_021532. 394 SITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA---------YNAEFLE--EEEV 460 (663) Q Consensus 394 ~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~---------li~q~~~--~~~~ 460 (663) ...|...- ..| .+.+. .-.+...+ ...+..+.+.+.. ..+.++.++-. ++-+.-. ..++ T Consensus 447 ~~lG~~~na~SG---vAi~~rq~qg~~~l----~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:27 447 AFLGQDSGATSG---VAISNLVEQGATTL----AEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHcCCCccchhH---HHHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceE Confidence 34444321 222 11111 11111111 1111111111111 11112222211 1111111 1234 Q ss_pred EEEecCe--eec---cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHH Q lcl|NC_021532. 461 IRVTNDK--FVP---IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIM 523 (663) Q Consensus 461 iri~~~~--~v~---i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~ 523 (663) +-|.... -+. |..-.+.-..+......+ .+.+..+.|+++++.+.|.+...+.. .+...+- T Consensus 519 v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t--~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir 596 (714) T protein:vir:27 519 IVLNAEGDNGELTNDISRLNTHIALAPVQQTPA--FKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR 596 (714) T ss_pred EeeccccCcceecccceeeeEEEEEeeccCchH--HHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHH Confidence 4443111 000 111111111222222221 23456666777777666654322111 1222221 Q ss_pred HhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 524 DLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEA 603 (663) Q Consensus 524 ~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~ 603 (663) +..+.... ....++.+.+.+.++.+++.++++.+.++++++.+..+++++++++++.+..++++.+...++...... T Consensus 597 ~~~~~~~~---~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~ 673 (714) T protein:vir:27 597 AALGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHcCCCCC---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12121111 122222333333444444555555566666666666666666666666555555544433332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 604 DMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 604 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) ...+++..+ .++..+..++...-.++++ .+ ..++++.+... T Consensus 674 ~~~~a~~a~---~~~~~~~~~~~~~~~~~q~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:27 674 ALNQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHH---HHHhHhhhhhhhHHHHHHH-HH------HHHHHHHhcCC Confidence 111111111 1111111111111111111 01 11111111111 No 134 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=582 Identities=10% Similarity=0.048 Sum_probs=159.8 Q ss_pred CCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--c Q lcl|NC_021532. 1 MKI-NKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--I 52 (663) Q Consensus 1 ~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~ 52 (663) .+- --..++..+..++.....+|.+...+.++|..-.| |.+.- .+.++|.. + T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v 95 (714) T protein:vir:10 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLVV 95 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhHHHhCCcceEE Confidence 222 23356667788888888888888888887743222 32210 11223322 2 Q ss_pred ccHH---HHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCceEE Q lcl|NC_021532. 53 VSRD---IKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGTLVV 128 (663) Q Consensus 53 ~~~~---i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~g~~ 128 (663) .++. -...+..++..+++.+.. .+.-+..-.+...+.+ .. .-+. +...+. |. ..| -+ T Consensus 96 ~p~~~~~~~~~~Ae~l~~~~~~~~~---------~~~~~~~~s~af~~~~----~~-G~G~--~~~~~~~d~-~~~--~i 156 (714) T protein:vir:10 96 MSDEPDDETEKLAEAINAEFADACR---------LGNMNKARSDAYAEQI----KA-GLSW--VEVRRNSDP-FGP--EF 156 (714) T ss_pred ecCCCCchhHHHHHHHHHHHHHHHH---------hhchhHHHHHHHHHhh----hc-Ccce--EEeccccCC-CCC--Ce Confidence 2211 111122222233322222 1111111111111111 00 0011 000011 11 111 12 Q ss_pred EeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccc--------ccChhhCc---eE Q lcl|NC_021532. 129 QTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTC--------QDNLDNAQ---FV 197 (663) Q Consensus 129 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a--------~~d~~d~~---~~ 197 (663) ++. .+....+++|+.. .+.++++.++.....++..-....+|++-.. ..++.+.. .. T Consensus 157 ~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:10 157 KVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred EEE--------ecchhheeecccc----ccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 221 1122233333332 2344555554443333333333444654210 00000000 00 Q ss_pred EEEeecCHHHHHH------hcCCcChhhhh----hc------------cchhhhcccccccc-----ccccccccce-EE Q lcl|NC_021532. 198 IHRYETDLSTLKK------DGRYKNLDKLA----KT------------SGEDFDYDSPDDTE-----FQFSDAPRKK-LI 249 (663) Q Consensus 198 ~~~~~~~~~~l~~------~g~~~~~~~~~----~~------------~~~~~~~~~~~~~~-----~~~~d~~~~~-v~ 249 (663) .....-..++... .++.++..++. ++ ++....++..+... .........+ -+ T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:10 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0000000000000 00000000000 00 00000000000000 0000000111 12 Q ss_pred EEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHH------ Q lcl|NC_021532. 250 IYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ------ 317 (663) Q Consensus 250 v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q------ 317 (663) ++.+|+ .++++....-+-|-++++- -.|+| .|. ||-++....++-+.+.+.....+--++ T Consensus 305 v~~~~~----~g~~~L~~~~~p~p~~~fp---~vp~~g~~~~~~g~-~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~ 376 (714) T protein:vir:10 305 IREAWF----VGPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTGE-PYGLISRAIPAQDEVNFRRIKLTWLLQAKRVIM 376 (714) T ss_pred EEEEEE----ecCcccccCCCCCCCCcee---EEEEeeeeeeccCc-eeehhhhchhHHHHHHHHHHHHHHhhcCCceee Confidence 333332 1111111000111111100 00111 122 454444444443332210110000000 Q ss_pred --HHHHHHHHHHHHHHHhcCCC--cEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 318 --KVKTAVIRGIIDNMAQSNNG--QVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIE 393 (663) Q Consensus 318 --~~~N~~~~~~~~~~~~~~~~--~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (663) ..++..-+..... .+.| -+.++.+. .....++..+.+.+....+.. .-.+.+.....++..-..-. T Consensus 377 ~~~a~~~~d~~~~e~---~arp~~vi~~~p~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~tGv~~ 446 (714) T protein:vir:10 377 DEDATQLSDNDLMEQ---IERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDTMGVYS 446 (714) T ss_pred ecCcccccHHHHHHh---ccCCCCceeecccc-----cccCCCCccccccCCCCccHH--HHHHHHHHHHHHHHhhCCCh Confidence 0000000011111 1221 11111111 111223344555443211100 01111222222333222223 Q ss_pred HHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhcC--CceE Q lcl|NC_021532. 394 SITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA---------YNAEFLE--EEEV 460 (663) Q Consensus 394 ~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~---------li~q~~~--~~~~ 460 (663) ...|...- ..| .+.+. .-.+...+ ...+..+.+.+.. ..+.++.++-. ++-+.-. ..++ T Consensus 447 ~~lG~~~na~SG---vAi~~rq~qg~~~l----~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~ 518 (714) T protein:vir:10 447 AFLGQDSGATSG---VAISNLVEQGATTL----AEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQT 518 (714) T ss_pred HHcCCCccchhH---HHHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceE Confidence 34444321 222 11111 11111111 1111111111111 11112222211 1111111 1234 Q ss_pred EEEecCe--eec---cchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHH Q lcl|NC_021532. 461 IRVTNDK--FVP---IRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIM 523 (663) Q Consensus 461 iri~~~~--~v~---i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~ 523 (663) +-|.... -+. |..-.+.-..+......+ .+.+..+.|+++++.+.|.+...+.. .+...+- T Consensus 519 v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t--~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir 596 (714) T protein:vir:10 519 IVLNAEGDNGELTNDISRLNTHIALAPVQQTPA--FKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR 596 (714) T ss_pred EeeccccCcceecccceeeeEEEEEeeccCchH--HHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHH Confidence 4443111 000 111111111222222221 23456666777777666654322111 1222221 Q ss_pred HhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 524 DLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEA 603 (663) Q Consensus 524 ~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~ 603 (663) +..+.... ....++.+.+.+.++.+++.++++.+.++++++.+..+++++++++++.+..++++.+...++...... T Consensus 597 ~~~~~~~~---~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~ 673 (714) T protein:vir:10 597 AALGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVD 673 (714) T ss_pred HHcCCCCC---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12121111 122222333333444444555555566666666666666666666666555555544433332222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 604 DMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 604 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) ...+++..+ .++..+..++...-.++++ .+ ..++++.+... T Consensus 674 ~~~~a~~a~---~~~~~~~~~~~~~~~~~q~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:10 674 ALNQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHH---HHHhHhhhhhhhHHHHHHH-HH------HHHHHHHhcCC Confidence 111111111 1111111111111111111 01 11111111111 No 135 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=97.18 E-value=0.00014 Score=41.71 Aligned_cols=578 Identities=12% Similarity=0.039 Sum_probs=136.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH---------h--cCCc---CCc---------------cccCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLI--STWKAE---------Y--NGEP---YGN---------------EQKGK 49 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~--~~~~~~---------y--~~~~---~~~---------------~~~g~ 49 (663) .| --+.+.+.+..+++....++.++..+. .+|-.- . .|.. .++ +.++| T Consensus 6 ~~-~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e~~nr 84 (708) T protein:vir:17 6 EK-KHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNR 84 (708) T ss_pred HH-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhHhhCC Confidence 33 346667777777776666666654432 333111 1 1111 111 12223 Q ss_pred Cc--cccHH-H-HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHH----HH Q lcl|NC_021532. 50 SA--IVSRD-I-KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKV----LD 121 (663) Q Consensus 50 s~--~~~~~-i-~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d----~~ 121 (663) +. +++.. - -..+..++..+++.+... +.-+.. .+.+....+.....-.+++.++..+ .. T Consensus 85 ~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~---------~~~~~~----~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~ 151 (708) T protein:vir:17 85 ITVKFRPGDREASEELANKLNGLFRADYEE---------TDGGEA----CDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) T ss_pred cceEEecCCCcchHHHHHHHHHHHHHHHHh---------cCchhH----HhHHHHHhhhcccceeeeeecccccCCCCCC Confidence 22 22210 1 112223333333333221 111111 1111111110000001111111110 00 Q ss_pred hcCce-------EEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhC Q lcl|NC_021532. 122 REGTL-------VVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNA 194 (663) Q Consensus 122 ~~G~g-------~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~ 194 (663) -.|.- +..|+||+. ..+.++++.++.....++..-....+|++-... + .+. T Consensus 152 ~~~i~i~~~~~~~~~v~~Dp~--------------------a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~-~-~~~ 209 (708) T protein:vir:17 152 RQRIAIEPIYDPSRSVWFDPD--------------------AKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPA-S-LDV 209 (708) T ss_pred ccccceEeeccchhheecCcc--------------------ccccChhhhhhhhhhccCCHHHHHHhCccccch-h-hhh Confidence 01111 112334422 233455556655555555444455566543211 0 111 Q ss_pred ceEE--EEeecCHHHHH-HhcCCcChh--hhh---h-ccchhhhccccccc-------cccccccccceE-EEEEEEEEe Q lcl|NC_021532. 195 QFVI--HRYETDLSTLK-KDGRYKNLD--KLA---K-TSGEDFDYDSPDDT-------EFQFSDAPRKKL-IIYEYWGNY 257 (663) Q Consensus 195 ~~~~--~~~~~~~~~l~-~~g~~~~~~--~~~---~-~~~~~~~~~~~~~~-------~~~~~d~~~~~v-~v~E~w~~~ 257 (663) .... ...|.+.+.+. ...|++..+ .+. + ..+.-......... .....-...+.+ +...+|+.+ T Consensus 210 ~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~ 289 (708) T protein:vir:17 210 TSMTSWEYDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVV 289 (708) T ss_pred hhhccccccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEee Confidence 1000 00111111000 000000000 000 0 00000000000000 000000011111 222234332 Q ss_pred eecCCceeE----------EEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 258 DVDGDGIAE----------PIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGI 327 (663) Q Consensus 258 ~~~~~g~~~----------~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~ 327 (663) .|+.+.+ +++.+|+-...++..+. ||-++....++-..+-+ ..-....+-.+.... T Consensus 290 --~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~-------~yG~vr~~kd~Q~~~N~-----~~S~~~~~~a~~~~~ 355 (708) T protein:vir:17 290 --DGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIER-------VEGHIAKAMDPQRLYNL-----QVSMLADTAAQDPGQ 355 (708) T ss_pred --cccccccCCCCCCCCccceEEEecccccccCCCc-------ccchhhhchhHHHHHHH-----HHHHHHHHHHhcCCc Confidence 2222211 22222221111111111 23333333344333211 000000000000000 Q ss_pred HHHHHhcCCCcEEe-------eccccCcchhh-----hccCC--cceEeCCCCCccccccCccccHH----HHHHHHHHH Q lcl|NC_021532. 328 IDNMAQSNNGQVAI-------RKGALDQTNRK-----KFLAG--ANFEFNGTANDFWHGSYNAIPSS----AFDMISLMN 389 (663) Q Consensus 328 ~~~~~~~~~~~~~~-------~~~~i~~~d~~-----~~~p~--~vi~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 389 (663) .+.+...+...+.. +.+++...... +..+| ...+.. +-+.++.... ....++..- T Consensus 356 ~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~-------~~~~~~~~~~llq~~~~~i~~~t 428 (708) T protein:vir:17 356 IPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAGYTQ-------PAVMNQALAALLQQTSADIQEVT 428 (708) T ss_pred ceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcccCC-------CccccHHHHHHHHHHHHHHHHhc Confidence 11111111111100 00011000000 01111 111111 1122222222 222333332 Q ss_pred HHHHHHhCCChHHcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHhcCCce Q lcl|NC_021532. 390 NEIESITGTKSFSGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN---------AEFLEEEE 459 (663) Q Consensus 390 ~~~~~~tGi~~~~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li---------~q~~~~~~ 459 (663) ..-....|.+.-..|..-+.... ..+.+..++.+-....+.. .+.+..++-.+. -+- ..++ T Consensus 429 Gi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~--------g~~lL~lI~~~y~~~R~~RI~~ed-g~~~ 499 (708) T protein:vir:17 429 GGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRA--------GEVWLSMAREVYGSEREVRIVNED-GSDD 499 (708) T ss_pred CCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHcCCCcEEEEecCC-CCcc Confidence 22233445443334422111111 1111122222211111111 111111111111 111 1223 Q ss_pred EEEEec-------Ceee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccC--CCcchhHHHHHHHHHh-- Q lcl|NC_021532. 460 VIRVTN-------DKFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPN--EDPKIRRDIMADIMDL-- 525 (663) Q Consensus 460 ~iri~~-------~~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~--~~p~~~~~~l~~~~~l-- 525 (663) .+.|.+ +.++ .|+.-.+.-.++......+- +....+.|+.++..+.+. +.+.+...++ ..+++ T Consensus 500 ~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~--r~~~~~~l~qll~~~~~~~~~~~~~~~l~l-~~~D~p~ 576 (708) T protein:vir:17 500 IAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTAR--RDATVSVLTNVLSSMLPADPMRPAIQGIIL-DNIDGEG 576 (708) T ss_pred eeeecceeccCCCccceeeccceeeeeeEEEecccCchhH--HHHHHHHHHHHHHhcCCccchhHHHHHHHH-HhcCCCC Confidence 333321 1111 12221111112222222221 223455666666665542 1222211111 11110 Q ss_pred -hhhhhhhhhh-------hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 -MRMPEQAKRM-------REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKAR 597 (663) Q Consensus 526 -~~~~e~~~~l-------~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q 597 (663) ..+.+..+.. +...+...+..++..+.+++++++.+.+++++..+.++++++.++++.. +++.+.++..+ T Consensus 577 ~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~--~q~~a~q~~~~ 654 (708) T protein:vir:17 577 LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQ--TQIKAFTAQQD 654 (708) T ss_pred hHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Confidence 1111111110 1111111111122222333333333333333333333333333322221 11111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhh Q lcl|NC_021532. 598 KLSSEADMTDLKFVKEDNGYAHLEQV-ELEDLRHAQHLEREAMKHRANLEQMLAQRN-AGDTN 658 (663) Q Consensus 598 ~~~~~~~~~~~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~-~~~~~ 658 (663) .++++ ....+..++.........+ ..+.++..+.. +....++.-+.. ...-. T Consensus 655 ~~~a~--~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~-------q~q~~~a~p~~~~~~~~~ 708 (708) T protein:vir:17 655 AMESQ--ANTVYKLAQARNIDDKAVMEAIRLLKDVAES-------QQQQFQSPPQSPADLMPS 708 (708) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-------HHHHHhccccCchhccCC Confidence 11111 1111111111111111111 11111111100 111111111111 00011 No 136 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=97.17 E-value=0.00014 Score=41.65 Aligned_cols=439 Identities=10% Similarity=0.015 Sum_probs=166.0 Q ss_pred CCCc-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCcC-------C-----------ccccCCCccccHHHHHH Q lcl|NC_021532. 1 MKIN-KAELLSALKADMKAADVLKQEQDSL-ISTWKAEYNGEPY-------G-----------NEQKGKSAIVSRDIKKQ 60 (663) Q Consensus 1 ~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~y~~~~~-------~-----------~~~~g~s~~~~~~i~~~ 60 (663) |=.+ ...........|+.....-...++. -..|.-...+... . ...+.+-.+++|....+ T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~t 93 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPT 93 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHH Confidence 3322 2333344444444332221111111 1111100000000 0 00011235678999999 Q ss_pred HHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceec Q lcl|NC_021532. 61 SEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVT 140 (663) Q Consensus 61 v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~ 140 (663) ++.++|.+. .-+|.++. ++.. .+..++..+=-..++....+.+++..++.+|.+++-|.+..+. T Consensus 94 l~~l~G~vf----rk~p~~~~----~~~~----~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~---- 157 (488) T protein:vir:96 94 MNAITGAVM----RREPEFDT----MDNP----VLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPES---- 157 (488) T ss_pred HHHhcchhh----ccCceecc----CCcH----HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCc---- Confidence 998888764 22222221 1111 1444444432233445556788999999999999887543110 Q ss_pred ccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhh---CceEEEEeecCHHHHHHhcCCcCh Q lcl|NC_021532. 141 VMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDN---AQFVIHRYETDLSTLKKDGRYKNL 217 (663) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d---~~~~~~~~~~~~~~l~~~g~~~~~ 217 (663) ..+.+.+....+|++..++|+++. +.... .+.. ..++..+...+ + T Consensus 158 ------------------~T~ade~~~~~rPy~~~~~a~~Ii-nW~~~-~v~G~~~L~~v~lrE~~~--~---------- 205 (488) T protein:vir:96 158 ------------------ATMADWNKGKKLPTAAFYDALHII-DWEVE-YIDGEEKLTYLSLLEDYQ--E---------- 205 (488) T ss_pred ------------------CCHHHHHHhcCCcEEEEechhhhc-Cccee-ccCCceeeEEEEEEEEEE--e---------- Confidence 011112223446888888888884 22111 1111 11121111000 0 Q ss_pred hhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEEC---CEEEec-ccCCCcCCCCCEE Q lcl|NC_021532. 218 DKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWIN---DVIVRL-QSNPYPDGKPPFL 293 (663) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g---~~~l~~-~~~p~~~~~~Pf~ 293 (663) .++ +.+. +...++++. .. +|..+.++....+ ..++.. +.. ..+..||+ T Consensus 206 -----~D~------------~~~~--~~~~~~~~~---l~----~g~~~v~~~~~~~~~~e~~~~~~g~~--~l~~IP~v 257 (488) T protein:vir:96 206 -----RDG------------GTYV--SKQRLINHR---LV----DGLCEFQEVTDDEYSDEWTPVLINSK--QSDTIPFF 257 (488) T ss_pred -----ccC------------CCcc--cceEEEEEE---EE----CcEEEEEEEecCCcccceEeecCCCc--ccCeeEEE Confidence 000 0011 111122211 10 2222221111111 111111 122 22556777 Q ss_pred EEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeC-----CCCCc Q lcl|NC_021532. 294 VVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFN-----GTAND 368 (663) Q Consensus 294 ~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~-----~~~~~ 368 (663) ++.... -+...|.++.-.+..++...=...+-.-+++..+.-|.++..-+..+........+.++.... .+.+. T Consensus 258 ~~~~~~-~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~g~ 336 (488) T protein:vir:96 258 LASSQS-NEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTLAGRMPYYVKNGD 336 (488) T ss_pred EEecCC-CCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeeecccccccccCCc Confidence 554332 222335566666666654333333334555556666666543222222212222233321111 11112 Q ss_pred cccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 369 FWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWM 448 (663) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~ 448 (663) ..+.+.. ..+-..+.|+.+.+.|.. .|..-...| .+.||++.+.-..+....|..++.++.+ .++.++ T Consensus 337 ~~~~e~~-~~~l~~~~l~~l~~qm~~-~Ga~l~~~~-----~~~Ta~~~~~~~~~~~S~L~~~a~~le~-----al~~~l 404 (488) T protein:vir:96 337 VKVIQAQ-FSPETENKVEKLFEQAVK-VGASLFTQQ-----SNETATGAAIRSGSSTASMATLGNNVED-----TVRNML 404 (488) T ss_pred eeecCCc-hhHHHHHHHHHHHHHHHH-HhHhhccCC-----CcchHHHHHHHHHHhhHHHHHHHHHHHH-----HHHHHH Confidence 2222222 111123445555555533 343322222 1367887877777778888888888864 245667 Q ss_pred HHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhh Q lcl|NC_021532. 449 AYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRM 528 (663) Q Consensus 449 ~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~ 528 (663) .++..|....-- ..+ .....+.+...-..... -.+.+.+++..+... .+....+...+.-.++ T Consensus 405 ~~~A~w~g~~~~---------~~~----~~~~~~~in~dF~~~~l-d~~~~~al~~~~~~G---~Is~~t~~~~L~~~gv 467 (488) T protein:vir:96 405 RFIMRYFEGTNL---------YVN----PDELVFKLNRDYFDVEV-NPQMLQVAYAAMMEG---NLPQVSWFELLKRARV 467 (488) T ss_pred HHHHHHcCCCCC---------CcC----ccceEEEeccCCCCccC-CHHHHHHHHHHHhcC---CCCHHHHHHHHHhCCc Confidence 777777542100 000 00111111111011000 011223333332211 1222222222222111 Q ss_pred --hhh--hhhhhhhhcchhhH Q lcl|NC_021532. 529 --PEQ--AKRMREYEPKPDPV 545 (663) Q Consensus 529 --~e~--~~~l~~~~~~~~~~ 545 (663) ++. .+...+++...-.+ T Consensus 468 l~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 468 VRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred CCccCCHHHHHHHHhhcCCCC Confidence 000 01111111000000 No 137 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=97.16 E-value=0.00015 Score=41.58 Aligned_cols=444 Identities=12% Similarity=0.043 Sum_probs=172.9 Q ss_pred CCCcHHHHHHHHHHHH-------------------HHHHHHHHHHHHHHHHHHHHhcCCcC-------Cc---------- Q lcl|NC_021532. 1 MKINKAELLSALKADM-------------------KAADVLKQEQDSLISTWKAEYNGEPY-------GN---------- 44 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~-------------------~~~~~~~~~~~~~~~~~~~~y~~~~~-------~~---------- 44 (663) |--++.++-...+..- ..-...+..+...|+..++.+-|+.. ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 3333333222222110 00001122233334444444443210 00 Q ss_pred ------cccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH Q lcl|NC_021532. 45 ------EQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK 118 (663) Q Consensus 45 ------~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (663) +.+=...+++|.++.+++.++|.+. ..++.+++ | ..+..++..+=-..++...++.+++. T Consensus 81 E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf----rk~p~~~~-p---------~~l~~l~~d~D~~G~~L~~f~~~~~~ 146 (535) T protein:vir:80 81 EQRRRYETYLQRAIFYNVTARTLDGMMGQVF----SRDPIRQL-P---------PALEAIVEDIDGEGVSLDQQAKKALG 146 (535) T ss_pred CCHHHHHHHHhhccCCChhHHHHHHHhchhh----cCCcceec-c---------HHHHHHHhccCCCCCCHHHHHHHHHH Confidence 0011236789999999999998764 22333332 2 23444444322123345556788999 Q ss_pred HHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHhe-eCcccccChhhCceE Q lcl|NC_021532. 119 VLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY-LDPTCQDNLDNAQFV 197 (663) Q Consensus 119 d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~dp~a~~d~~d~~~~ 197 (663) .++.+|.+++-|.+..... ...+.+......+|++..++|+++. |+-..........++ T Consensus 147 ~~l~~G~~~iLVD~P~~~~--------------------~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v 206 (535) T protein:vir:80 147 YTMGFGRAAIFTDYPNVGR--------------------PVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLV 206 (535) T ss_pred HHHhcCeEEEEEeecCCCC--------------------cccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEE Confidence 9999999998875531100 0011122233456888888888874 221111001111222 Q ss_pred EEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEE-ECC- Q lcl|NC_021532. 198 IHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAW-IND- 275 (663) Q Consensus 198 ~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~-~g~- 275 (663) ..+...+ ..+ ..++......+++++. +.+|....++-.. .++ T Consensus 207 ~lrE~~~----------------------------~~d--d~f~~~~~~q~RvL~~------~~~G~y~v~~~~~~~~~~ 250 (535) T protein:vir:80 207 VIQENVL----------------------------AQD--DGFETTYVQQWRVLQL------NAEGNYQVERWRRETQEE 250 (535) T ss_pred EEEEEEE----------------------------ecC--CCcccceeEEEEEEEe------cCCceEEEEEEEeecCCc Confidence 1111100 000 1112222222333321 1222211100000 000 Q ss_pred --------EEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccC Q lcl|NC_021532. 276 --------VIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALD 347 (663) Q Consensus 276 --------~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~ 347 (663) +....+.++ .+.+||+++... .-+...|.++...+..++..+=...+-.-+++..+..|...+ .|..+ T Consensus 251 ~~~~~~~~~~~~~g~~~--l~~IPfv~~~~~-~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i-~G~~~ 326 (535) T protein:vir:80 251 MYYSYSKHVPTDGNGNP--FKEIPFQFIGPL-DNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFF-TGLTK 326 (535) T ss_pred cccccceeecccCCCcc--cCeeEEEEeecC-CCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeee-ecCch Confidence 011112222 356677755422 223344666777777776655444555566777888876554 23222 Q ss_pred cc-------hhhhccCCcceEeCCCCCccccccC--ccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHH Q lcl|NC_021532. 348 QT-------NRKKFLAGANFEFNGTANDFWHGSY--NAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGAR 418 (663) Q Consensus 348 ~~-------d~~~~~p~~vi~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~ 418 (663) .. ......++.+|.+-.+ ....++.+ ..++ .+.++.+.+.|.. .|..-...+ + .+.||++.+ T Consensus 327 ~~~~~~~~~~~i~iG~~~~~~lP~~-~~~~~~e~~~~~~a---~~~l~~~e~qM~~-lGa~ll~~~--~--~~~Ta~~a~ 397 (535) T protein:vir:80 327 DWVEDVFKDFKVHLGSRAIIPLPQG-ATAGILQITPNSVP---FEAMTHKESQMIA-MGANLLVKS--G--GNRTFGEAQ 397 (535) T ss_pred hhhhcCCCCcceEecCcccccCCCC-CCcceeeeccchhH---HHHHHHHHHHHHH-HHHHhhccC--c--ccccHHHHH Confidence 11 1122334444444322 22333332 2332 2334445555543 233322222 1 235677776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CceEEEEe-cCeeec--cchhhcC--------CceE--- Q lcl|NC_021532. 419 GALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLE---EEEVIRVT-NDKFVP--IRKDDLS--------GRID--- 481 (663) Q Consensus 419 ~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~---~~~~iri~-~~~~v~--i~~~~~~--------~~~d--- 481 (663) .-..+....|..++.++.+ . ++.+|.++..|.. ++.-+.|+ +.+|.. +++..+. |.+. T Consensus 398 ~~~~~~~S~L~~~a~~le~-a----l~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et 472 (535) T protein:vir:80 398 QEEASEQSILSACTKNVSM-A----FRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKE 472 (535) T ss_pred HHHHHHhHHHHHHHHHHHH-H----HHHHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHH Confidence 6555666677888888754 3 4456667777753 22223332 233432 2333221 1000 Q ss_pred E--EE-eecccchhHHHHHHHHHHHHH------hcc-CCCc--chhHHHHHHHHHhhhhhhhhhh Q lcl|NC_021532. 482 I--DI-SISTAEDNAAKSQELSFLLQT------LGP-NEDP--KIRRDIMADIMDLMRMPEQAKR 534 (663) Q Consensus 482 ~--~v-~~~~~~~~~~~~q~l~~~~~~------~~~-~~~p--~~~~~~l~~~~~l~~~~e~~~~ 534 (663) + .+ ..+.........+....+... .++ ..++ ..++..-.. .-.+-.+.+.. T Consensus 473 ~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~--~~~~~~~~~~~ 535 (535) T protein:vir:80 473 MRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLN--NGNGGGNQAGN 535 (535) T ss_pred HHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCccc--CCccccccCCC Confidence 0 00 000000000000111101000 111 1110 000000000 00000111111 No 138 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=97.10 E-value=0.00017 Score=41.24 Aligned_cols=582 Identities=10% Similarity=0.025 Sum_probs=134.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHh-----------c----CCcC--------------CccccCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWK--AEY-----------N----GEPY--------------GNEQKGK 49 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~y-----------~----~~~~--------------~~~~~g~ 49 (663) -+-.-+++++.+..+......++..+..+.++|+ .-. . |.+. +.+.++| T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~~~nr 84 (708) T protein:vir:10 5 LEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNR 84 (708) T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHHHhCC Confidence 3333356666777777766666666655554442 111 1 1111 0011222 Q ss_pred Cc--cccHH-H-HHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHH----HH Q lcl|NC_021532. 50 SA--IVSRD-I-KKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKV----LD 121 (663) Q Consensus 50 s~--~~~~~-i-~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d----~~ 121 (663) .. +++.. - -..+..++..+++.+.. .+.-+..-.+...+.+.- ...-.+.+.++..+ .. T Consensus 85 ~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~---------~~~~~~~~s~Af~d~i~~----G~Gw~~~~~d~~~e~d~~~~ 151 (708) T protein:vir:10 85 ITVKFRPGDREASEELANKLNGLFRADYE---------ETDGGEACDNAFDDAATG----GFGCFRLTSMLVNEYDPMDD 151 (708) T ss_pred cceEEEcCCCCchHHHHHHHHHHHHHHHH---------hcCchHHHHHHHHhhhhc----ccceeeeeeccccccCCCCC Confidence 22 11110 0 01122233333333322 111111111111111100 00000000000000 00 Q ss_pred hcCceE-------EEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccc-cChh- Q lcl|NC_021532. 122 REGTLV-------VQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ-DNLD- 192 (663) Q Consensus 122 ~~G~g~-------~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~-~d~~- 192 (663) -.|..+ ..|+||+. ..+.++++.++.....++..-....+|++-... .++. T Consensus 152 ~~~i~i~~~~~p~~~v~~Dp~--------------------a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~ 211 (708) T protein:vir:10 152 RQRIAIEPIYDPSRSVWFDPD--------------------AKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTS 211 (708) T ss_pred ccccceEEeecchhhcccCcc--------------------ccccChhhhhhhhhccCCCHHHHHHhCCCCccccccccc Confidence 011111 12334322 223345555555444444444445556543111 0110 Q ss_pred hCceEEEEeecCHHHHHHhcCCc-C--hhhhhh----ccchhhhcccccc-------ccccccccccceEE-EEEEEEEe Q lcl|NC_021532. 193 NAQFVIHRYETDLSTLKKDGRYK-N--LDKLAK----TSGEDFDYDSPDD-------TEFQFSDAPRKKLI-IYEYWGNY 257 (663) Q Consensus 193 d~~~~~~~~~~~~~~l~~~g~~~-~--~~~~~~----~~~~~~~~~~~~~-------~~~~~~d~~~~~v~-v~E~w~~~ 257 (663) ++.|. ..|.+.+.+.-..||. . ...+.. .++.-........ ......-...+.+. +..+|+.. T Consensus 212 ~~~~~--~~~~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~ 289 (708) T protein:vir:10 212 MTSWE--YNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVV 289 (708) T ss_pred CCCcc--ccccCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEee Confidence 00000 0011100000000100 0 000000 0000000000000 00000000111111 11222221 Q ss_pred eecCCcee----------EEEEEEEECCEEEecccCCCcCCC-CCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 258 DVDGDGIA----------EPIVCAWINDVIVRLQSNPYPDGK-PPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRG 326 (663) Q Consensus 258 ~~~~~g~~----------~~~~~~~~g~~~l~~~~~p~~~~~-~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~ 326 (663) .++.+. -+++.+|.-.. |.+|. .+|-++....++-+.+- ........+..+... T Consensus 290 --~g~~~le~~~~~p~~~fP~vP~~g~r~--------~~d~~~~~yG~vr~~kd~Q~~~N-----~~~S~~~~~~a~~~~ 354 (708) T protein:vir:10 290 --DGDGFLEKPRRIPGEHIPLIPVYGKRW--------FIDDIERVEGHIAKAMDPQRLYN-----LQVSMLADTAAQDPG 354 (708) T ss_pred --cchhhhccCCCCCCCceeeEEEeeeee--------ccCCCcccceeecccchhHHHHH-----HHHHHHHHHHHhcCC Confidence 111111 12222222111 11221 11333333334433221 111111111111111 Q ss_pred HHHHHHhcCCCcEEeecc-------ccCcchhhhccCCcceEeCCCCCccccccCcc----ccHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 327 IIDNMAQSNNGQVAIRKG-------ALDQTNRKKFLAGANFEFNGTANDFWHGSYNA----IPSSAFDMISLMNNEIESI 395 (663) Q Consensus 327 ~~~~~~~~~~~~~~~~~~-------~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 395 (663) ..+.+...+...+...-+ .+...+.....+|.++........+.+...++ +.+.....++.+-..-... T Consensus 355 ~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~ 434 (708) T protein:vir:10 355 QIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAM 434 (708) T ss_pred cccccChhhhhhHHHHHhhccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhH Confidence 122222222221111100 00001111123344433322222222222332 2222233333332223345 Q ss_pred hCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHh--c---CCceEEEEec-- Q lcl|NC_021532. 396 TGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN---AEF--L---EEEEVIRVTN-- 465 (663) Q Consensus 396 tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li---~q~--~---~~~~~iri~~-- 465 (663) .|...-..|..-+.. ...+...+ ...+..+.+.. +...+.+..++-..+ +.+ . ..++++.|.+ T Consensus 435 lG~~sn~SG~aI~~r--q~qg~~~l----~~~~Dnl~~~~-~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~ 507 (708) T protein:vir:10 435 QQMPSNIAQETVNNL--MNRADMAS----FIYLDNMAKSL-KRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQV 507 (708) T ss_pred ccCccchHHHHHHHH--HHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEeccee Confidence 554332334211111 11111110 11111111111 111122222222222 000 0 1223444421 Q ss_pred -----C-----------ee-eccchhhcCCceEE-------EEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHH Q lcl|NC_021532. 466 -----D-----------KF-VPIRKDDLSGRIDI-------DISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMAD 521 (663) Q Consensus 466 -----~-----------~~-v~i~~~~~~~~~d~-------~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~ 521 (663) + +| |.|+.......++- .+....++.... . ..++.++...++......+... T Consensus 508 ~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~-~---~~~~~~~l~~~D~p~~~ei~er 583 (708) T protein:vir:10 508 VDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM-R---PAIQGIILDNIDGEGLDDFKEY 583 (708) T ss_pred ccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchh-h---HHHHHHHHHhcCCcChHHHHHH Confidence 1 11 11211100100000 000000111111 1 1112222222332222333322 Q ss_pred HHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 522 IMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSS 601 (663) Q Consensus 522 ~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~ 601 (663) +-........ .+...+...+..++.++++++++++++.+++++..+.++++++..+++... ++.+.++..+..+. T Consensus 584 ir~~~~~~~~---~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~--~~~a~q~~~~~~~a 658 (708) T protein:vir:10 584 NRNQLLISGI---AKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQT--QIKAFTAQQDAMES 658 (708) T ss_pred HHHhhccccc---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH Confidence 2111111110 111111112222222233333333333333333333333333332222211 11111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH--HHHHHHHhh Q lcl|NC_021532. 602 EADMTDLKFVKEDNGYAHLEQVE-LEDLRHAQHLEREAMKHRANL--EQMLAQRNA 654 (663) Q Consensus 602 ~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~e~~k~~~~~--e~~~~~~~~ 654 (663) ++ ...+..++.......+.++ .+.++..+..+ .++.++ ...+.+-.. T Consensus 659 ~~--~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q----~~~~~~~p~~~~~~~p~ 708 (708) T protein:vir:10 659 QA--NTVYKLAQARNIDDKAVMEAIRLLKDVAESQ----QQQFQSPPQSPADLMPS 708 (708) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhH----HHHHhccccCchhccCC Confidence 11 1111111111111111111 11111111110 011111 111122222 No 139 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=96.69 E-value=0.0004 Score=39.18 Aligned_cols=596 Identities=11% Similarity=0.034 Sum_probs=90.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCCccccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYGNEQKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) +.=--..|++.++.++.....+|.+...+.++|..-.| |.+ ..+-++ +-+.|...+.....+-. T Consensus 43 ~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p--~~~~N~---i~~~i~~v~g~~~~nr~ 117 (776) T protein:vir:93 43 AVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQA--PTVYNV---ISQSVNWIIGSEKRGRS 117 (776) T ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCc--eEEecc---hHHHHHHHHHHHHhCCc Confidence 33334455555555555555555555555555422111 111 000000 00000000000000000 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHh----------cCceEEEeeecccccee Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDR----------EGTLVVQTGWDYEDEEV 139 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~----------~G~g~~~~~~d~~~~~~ 139 (663) .+-+.++ .+-..+-.+....+-..+-+..+-......++.+++.-++- .|-.+ .+.+ T Consensus 118 ~~~~~p~-----~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~-~~~~------- 184 (776) T protein:vir:93 118 DFKVLPR-----RKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPI-YAGA------- 184 (776) T ss_pred ceEEecC-----ChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCce-Eeec------- Confidence 0000000 00000001100000000000000000000001111000000 01111 1110 Q ss_pred cccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHH-----H-Hhc- Q lcl|NC_021532. 140 TVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTL-----K-KDG- 212 (663) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l-----~-~~g- 212 (663) ....++++|... .+.++.+.++.....++..-....+|++-. ..+.+.....+..+ ...+. . ..+ T Consensus 185 -~~p~~i~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 256 (776) T protein:vir:93 185 -ESWRNILWDSTY----RRLDMDDCRYIFRVKWVDLDVMLAIFPERA--AQLRAAAVDNFETW-GTDDIDGDDAMDSPEY 256 (776) T ss_pred -cChhheeecccc----ccCCHHHHhhhhhhccCCHHHHHHhcCCch--HHHHHhhhhccccc-chhccccccccccccc Confidence 011111111111 112222223222222222222222332210 00100000000000 00000 0 000 Q ss_pred -------------CCcChhhhhhcc-------------chhhhcccccccc--------ccccccccceEE-EEEEEEEe Q lcl|NC_021532. 213 -------------RYKNLDKLAKTS-------------GEDFDYDSPDDTE--------FQFSDAPRKKLI-IYEYWGNY 257 (663) Q Consensus 213 -------------~~~~~~~~~~~~-------------~~~~~~~~~~~~~--------~~~~d~~~~~v~-v~E~w~~~ 257 (663) ...+...+.+.. ..+......+... .+........+. ++.+| T Consensus 257 ~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~--- 333 (776) T protein:vir:93 257 ERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAI--- 333 (776) T ss_pred ccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEE--- Confidence 000000000000 0000000000000 000000000000 11011 Q ss_pred eecCCceeEEEEEEEECCEEEec--ccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHH----HHHH--- Q lcl|NC_021532. 258 DVDGDGIAEPIVCAWINDVIVRL--QSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVI----RGII--- 328 (663) Q Consensus 258 ~~~~~g~~~~~~~~~~g~~~l~~--~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~----~~~~--- 328 (663) ..|+.+.....+.|-++++-.. -..+.++..+||.++....++-+.+.. ....+.+ .+|+.. ...+ T Consensus 334 -~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~-~~s~~~~---~l~~~~~~~~~gav~~~ 408 (776) T protein:vir:93 334 -MTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNK-RLSKALY---ILSTNKVLMEEGAVDDI 408 (776) T ss_pred -EecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHH-HHHHHHH---hhcCCceeeccccccch Confidence 0111111000011111100000 000001111222222222222211110 0000110 011000 0000 Q ss_pred HHHH-hcCCCcEEeeccccCcchhhhccCCcc--eEeCCCCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCC Q lcl|NC_021532. 329 DNMA-QSNNGQVAIRKGALDQTNRKKFLAGAN--FEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGI 405 (663) Q Consensus 329 ~~~~-~~~~~~~~~~~~~i~~~d~~~~~p~~v--i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~ 405 (663) +.+. ..+.|..++ ...||+. +.+....... ...-.+.+.....++..-..-....|...- +. T Consensus 409 d~~~~~~~rp~~vi-----------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n--~~ 473 (776) T protein:vir:93 409 DEFRREAARPDAVM-----------TVKNGKLGAVKMDVDRDLA--PAHLELASRSIQMIQQVGGVTDEMLGRTTN--AV 473 (776) T ss_pred HHHHHhcccCCcee-----------eeCCccccccccccCcCcc--HHHHHHHHHHHHHHHHhhCcChHHhCCCcc--hh Confidence 0000 000110000 0111110 1111100000 000011111222222222111222232211 11 Q ss_pred Ccccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---Hh-----cCCceEEEEecCeeec-cchhh Q lcl|NC_021532. 406 NSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNA---EF-----LEEEEVIRVTNDKFVP-IRKDD 475 (663) Q Consensus 406 ~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~---q~-----~~~~~~iri~~~~~v~-i~~~~ 475 (663) ++.+.+. ...+. ......+..+.+.+. ...+.++.++....- .| -...++|.|....+.. +..-. T Consensus 474 Sg~ai~~~~~~~~----~~~~~~~dn~~~~~~-~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~ 548 (776) T protein:vir:93 474 SGVAIQARQEQGS----VATNKLFDNLRLAFQ-QHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTK 548 (776) T ss_pred hHHHHHHHHHHHH----HHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccce Confidence 1111110 00000 001111111222111 112222222222221 00 0112344432110000 00000 Q ss_pred cCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHH-----HHHhh-hhhhhhhhhhhhhcchhhHHHHh Q lcl|NC_021532. 476 LSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMAD-----IMDLM-RMPEQAKRMREYEPKPDPVQEKI 549 (663) Q Consensus 476 ~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~-----~~~l~-~~~e~~~~l~~~~~~~~~~~~q~ 549 (663) +.-.++......+.-.. . ..+|++++..+.+.+.+.+...++.. .-++. .+..........+....+.+++. T Consensus 549 ~dv~v~~~~~~~s~r~~-~-~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~ 626 (776) T protein:vir:93 549 ADFIIDEAEWRATMRQA-A-VAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAR 626 (776) T ss_pred eeEEEeecccchhHHHH-H-HHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHH Confidence 00011111111111100 0 11122222222222211111110000 00000 00000000000011111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 550 RQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLR 629 (663) Q Consensus 550 ~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 629 (663) .+.+++.++.+.+.+.+++..+++++.+..++++..+++++..+.++.....++...+++... +. ....+... T Consensus 627 qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~------~~-~~~~~~a~ 699 (776) T protein:vir:93 627 EQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAAT------AI-AFMPELAG 699 (776) T ss_pred HHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhh------hh-hhhhhhhh Confidence 111122222222222222222233322222222222222211111110000000000000000 00 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc--ccC Q lcl|NC_021532. 630 HAQHLEREAMKHRANLEQMLAQRNAGDTNIG--VVE 663 (663) Q Consensus 630 ~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~--~~~ 663 (663) ........... ..... .+...+..... ..+ T Consensus 700 ~a~~~~~~a~~--~~p~~--p~~~~~~~~~~~~~~~ 731 (776) T protein:vir:93 700 LSDGILRESGW--DDPNT--PQPASAASGMPPAPAQ 731 (776) T ss_pred hhhhhhccccc--ccccc--ccccccccCCCCCCCC Confidence 00000000000 00000 00000000000 000 No 140 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=95.84 E-value=0.0014 Score=36.27 Aligned_cols=587 Identities=12% Similarity=-0.013 Sum_probs=127.9 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCCc------------cccCCCc--ccc- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYGN------------EQKGKSA--IVS- 54 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~~------------~~~g~s~--~~~- 54 (663) =+.+=+.+.+.+..++.....+|.++..+.++|..-.| |.+.-+ +.++|.. +++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~rp~~N~i~~~v~~v~g~e~~nr~d~~v~p~ 83 (725) T protein:vir:10 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLYRPK 83 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHHhCCcceEEecC Confidence 66667888999999999999999998888887743222 222100 1122221 111 Q ss_pred ----HHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHH-HHH-----HhcC Q lcl|NC_021532. 55 ----RDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAV-KVL-----DREG 124 (663) Q Consensus 55 ----~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~-~d~-----~~~G 124 (663) -.++.....++.+..+. ++ . ....++.-...+..++ -| .++..++. .|. .+.. T Consensus 84 ~~~d~~~Ae~l~~~~~~~~~~-~~---~----~~~~s~Af~~~i~~G~-G~--------~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:10 84 DGASPDAADVLMGMYRTDMRH-NT---A----KIAVNIAVREQIEAGV-GA--------WRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred CcchHHHHHHHHHHHHHHHHh-cC---c----chHHhHHHHHHhhcCc-ce--------eeeeccccCCCCCCCceeeee Confidence 12222333333222211 00 0 0000111100011111 01 00000000 000 0000 Q ss_pred c----eEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHh---eeC-cccccCh----- Q lcl|NC_021532. 125 T----LVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDI---YLD-PTCQDNL----- 191 (663) Q Consensus 125 ~----g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~---~~d-p~a~~d~----- 191 (663) . -+..|+||+.. .+.++++.++.....++......+| |+. .....++ T Consensus 147 ~~i~~~~~~v~~Dp~a--------------------~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 206 (725) T protein:vir:10 147 EPIHSACSHVIWDSNS--------------------KLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPND 206 (725) T ss_pred eecccCHhHcccCchh--------------------hccChhhhhhhhhhccCCHHHHHHHHHhCCCccccccccccccc Confidence 0 01113444322 1223333332222222211111111 110 0000000 Q ss_pred -------hhCceEEE---EeecCHHHHHHhcCCcChhhhhhccchhhhcccccccccccc-ccccceEEEEEEEEEee-- Q lcl|NC_021532. 192 -------DNAQFVIH---RYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFS-DAPRKKLIIYEYWGNYD-- 258 (663) Q Consensus 192 -------~d~~~~~~---~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~v~v~E~w~~~~-- 258 (663) .++-.+++ +.+... .+... .++...++......+............+. -..+..-+.-.+|+... T Consensus 207 ~~~~~~~~~~vrv~E~~~r~~~~~-~~~~~-~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~ 284 (725) T protein:vir:10 207 WVFPWLTQDTIQIAEFYEVVEKKE-TAFIY-QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT 284 (725) T ss_pred ccccccCCCeEEEEEEEEEEEEee-EEEEe-ccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecch Confidence 00101111 110000 00000 00000000000000000000000000000 00011011112232210 Q ss_pred --ec----CCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC--Ch-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 259 --VD----GDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE--AN-AEMIGDNQKVKTAVIRGIID 329 (663) Q Consensus 259 --~~----~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~--g~-~~~~~d~Q~~~N~~~~~~~~ 329 (663) ++ ..|..-+++.+|.-...+ ...||++ -++....++-..+-+ |- .+.+.-.+..........++ T Consensus 285 ~~l~~~~~~~~~~fP~vP~~g~r~~~--~g~~~~~-----G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~ 357 (725) T protein:vir:10 285 AVLKDKQLIAGEHIPIVPVFGEWGFV--EDKEVYE-----GVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIA 357 (725) T ss_pred hhhcCCCCCCCCceeEEEEEeeeecc--CCcceee-----eeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhh Confidence 00 011112233333221111 1123322 333344444333211 10 01110000000000000000 Q ss_pred ---HHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHH----HHHHHHHHHHHHHHhCCChH- Q lcl|NC_021532. 330 ---NMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSA----FDMISLMNNEIESITGTKSF- 401 (663) Q Consensus 330 ---~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~tGi~~~- 401 (663) .....+++...+..+.+...+ ...+...+.+. ...+.|+-...+ ...++..-..-.+..|...- T Consensus 358 ~~e~~~~~~~~~~~~~~~~~~~~~--g~~~~~~i~~~------~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~ 429 (725) T protein:vir:10 358 GFEHMYDGNDDYPYYLLNRTDENN--GEMPTQPLAYY------ENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ 429 (725) T ss_pred HHHHHHhccCCceeeecccccccC--cccccccCccc------CCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchh Confidence 001112222222211111000 01122222221 122233222223 33333333323455566432 Q ss_pred HcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhcCCceEEEEecC------- Q lcl|NC_021532. 402 SGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN--------AEFLEEEEVIRVTND------- 466 (663) Q Consensus 402 ~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li--------~q~~~~~~~iri~~~------- 466 (663) ..|..-.. ....+...+ ...+..+...+ +...+.+..++-..+ ..--..++++.|+.. T Consensus 430 ~SG~ai~~--rq~qg~~~l----~~~~Dnl~~~~-~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G 502 (725) T protein:vir:10 430 VAYDTVNQ--LNMRADLET----YVFQDNLATAM-RRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATG 502 (725) T ss_pred hHHHHHHH--HHHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEecccccccccc Confidence 22321111 111111111 11111111111 111112222222222 111123345544321 Q ss_pred eee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhc--------------cCCCcchhHHHHHHHHHhhhhh Q lcl|NC_021532. 467 KFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLG--------------PNEDPKIRRDIMADIMDLMRMP 529 (663) Q Consensus 467 ~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~--------------~~~~p~~~~~~l~~~~~l~~~~ 529 (663) +++ .+.. .+.-.++......+--. .. ...|++++..+. ..++......+...+ +.. T Consensus 503 ~~v~~Ndi~g-~~Dv~v~~~p~~~s~r~-~~-~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~eri----rkq 575 (725) T protein:vir:10 503 ERQVLNDIRG-RYECYTDVGPSFQSMKQ-QN-RSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYA----NKQ 575 (725) T ss_pred chhhhhcccc-ceeEEEeeccCcHHHHH-HH-HHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHH----Hhh Confidence 110 1211 22223333333222111 11 112222222222 112111111111111 000 Q ss_pred hhhhh-hhhhhcchhhH-----HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_021532. 530 EQAKR-MREYEPKPDPV-----QEKIRQLELENLMLENQMLVASINDKNARANE--NTIDAELKRSKAAVEKAKAR---- 597 (663) Q Consensus 530 e~~~~-l~~~~~~~~~~-----~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~--~~~~~~~~~~~~~~e~~~~q---- 597 (663) ..... .....+...+. +.++.++..+..+++.+.++++++.++++++. .+.++...+.+++...++.. T Consensus 576 ~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~ 655 (725) T protein:vir:10 576 LIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFN 655 (725) T ss_pred hhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000 00000110000 00111111111111111112222222222221 22222111111111111100 Q ss_pred -----H--HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHH-HHHHHhhhhhhccc Q lcl|NC_021532. 598 -----K--LSSEADMTDLKFVKEDNGYAHLEQVEL----EDLRHAQHLEREAMKHRANLEQ-MLAQRNAGDTNIGV 661 (663) Q Consensus 598 -----~--~~~~~~~~~~e~~~~~~~~~~~~~~~~----~~~~~~~~~~~e~~k~~~~~e~-~~~~~~~~~~~~~~ 661 (663) + ...++......++++. ..+.+..++. +..+++++++.. .-+.. ..++....=.+--. T Consensus 656 q~~~~q~~~~~~~~~~~~~~q~~~-~~~~~~~ae~~~~~~~~~~~~~~~~~-----~~~~~q~~~~~~~~~~~~~~ 725 (725) T protein:vir:10 656 NMDLSKQSEFREFLKTVASFQQDR-SEDARANAELLLKGNEQTHKQRMDIA-----NILQSQRQNQPSGSVAETPQ 725 (725) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHH-HHHHHHhhHHHHHHHHHHHHHHhhhh-----hccccccccCCCcccccCCC Confidence 0 0001111111111111 1111111111 111111222111 11111 11111111000000 No 141 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=95.60 E-value=0.0018 Score=35.66 Aligned_cols=581 Identities=10% Similarity=-0.003 Sum_probs=172.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCCcccc----------------HHHHHHHHHHHHHHHHh Q lcl|NC_021532. 8 LLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKSAIVS----------------RDIKKQSEWQHATIVDP 71 (663) Q Consensus 8 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~----------------~~i~~~v~~~~~~l~~~ 71 (663) -++..+..++++..+..........|+.....+-...-..| ..|-. +.+.|.|. ..+.. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G-~QW~~~~~~~l~~~~q~~grP~~~~N~i~----~~v~~ 75 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPG-GQWEGATVAGTKLDEQFEKYPKFEINKVA----TELNR 75 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC-ccCCHHHHHHHHhhhhhcCCCceEecchH----HHHHH Confidence 22223444555555544444443344333322222111123 12322 22223333 33333 Q ss_pred hcCCCceEEEEeC-CcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCc Q lcl|NC_021532. 72 FVSTADIIKCTPI-TWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDE 150 (663) Q Consensus 72 ~~~~~~~~~~~p~-~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~ 150 (663) .++....-...+. .|.+.+.-+.+.++++.++..-.. ..-...+..++...+..+ .++|-. T Consensus 76 v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~a~s~Af~d~i~~-G~G~~e---------------- 137 (706) T protein:vir:10 76 IISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATG-GFGCFR---------------- 137 (706) T ss_pred HhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHH-hcCchHHHHHHHHHHhhc-CcceEE---------------- Confidence 3333222233332 456666566788888888754322 222345666666555433 222210 Q ss_pred cccccccccccccceeecccceeeeccHHHheeCcccccChhhC-ceEEE---EeecCHHHHHHhcC--CcChhhhhhcc Q lcl|NC_021532. 151 YGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNA-QFVIH---RYETDLSTLKKDGR--YKNLDKLAKTS 224 (663) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~-~~~~~---~~~~~~~~l~~~g~--~~~~~~~~~~~ 224 (663) ++..+. .-.+|.++=.+.... .+.+. +-|++ .+..+.++..-..+ .-+.+++.... T Consensus 138 -------------v~~d~~----~~~d~~~~~~~i~i~-~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~f 199 (706) T protein:vir:10 138 -------------LTTSFV----NEYDPMDERQRIAVE-PIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEY 199 (706) T ss_pred -------------eeeccc----cccCCCCCCccceee-eeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhc Confidence 000000 001111111111000 00000 00111 11223333211111 12223332222 Q ss_pred ch---hhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEeccc------------------- Q lcl|NC_021532. 225 GE---DFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQS------------------- 282 (663) Q Consensus 225 ~~---~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~------------------- 282 (663) +. +.+.....+....| ...+.|++.|||.+.....+- .++.....++....... T Consensus 200 p~~~~~~~~~~~~~~~~d~--~~~d~~~~~eyy~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 275 (706) T protein:vir:10 200 DKAPTSLDRVGSVSWQYDW--FTPDVVYIAKYYEVRKESVDV--ISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGR 275 (706) T ss_pred CCChhhhhhhccccccccc--cCCCcceecccccccceeEEE--EEeeccccCCceeeccchhhhhHHHHhhCCchhhhh Confidence 21 11111111111111 233568888988764321111 11111111111100000 Q ss_pred CC------Cc---------CC--CCCEEEEeeeeecCcc---cCCChHHHHHHHHHHHHHHHH-HHHHHHHhcCCCcEEe Q lcl|NC_021532. 283 NP------YP---------DG--KPPFLVVPFNSIPFKL---HGEANAEMIGDNQKVKTAVIR-GIIDNMAQSNNGQVAI 341 (663) Q Consensus 283 ~p------~~---------~~--~~Pf~~~~~~~~~~~~---~g~g~~~~~~d~Q~~~N~~~~-~~~~~~~~~~~~~~~~ 341 (663) .+ || ++ -||.-.+|+.|..+.. .|.+....++..=+..=...| .+...+...+..+... T Consensus 276 ~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~ 355 (706) T protein:vir:10 276 RSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQT 355 (706) T ss_pred cccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcc Confidence 00 00 01 1333566666665543 244444455444444444555 4455555666666667 Q ss_pred eccccCcchhhhccCCcceE--------eCCCC---Cccccc----cCccccHHHHHHHHHHHHHHHHHhCCChHHcCCC Q lcl|NC_021532. 342 RKGALDQTNRKKFLAGANFE--------FNGTA---NDFWHG----SYNAIPSSAFDMISLMNNEIESITGTKSFSGGIN 406 (663) Q Consensus 342 ~~~~i~~~d~~~~~p~~vi~--------~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~ 406 (663) ..++++..+.... .+..-. +++.+ +.+... ...+.|......++++......+ ....|.. T Consensus 356 ~~~~~~~i~~~~~-~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i----~~vsGi~ 430 (706) T protein:vir:10 356 PIVDMEQIRGLEQ-HWEGRNRKRPAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADI----QEVTGSS 430 (706) T ss_pred cccchhHHHHHHH-HhhhcccccccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHH----HHHhCCC Confidence 7666543333222 122211 11111 111110 11112223334555555555544 3345654 Q ss_pred cccchhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCce-E-- Q lcl|NC_021532. 407 SGSLGSTATGARGA--LDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRI-D-- 481 (663) Q Consensus 407 ~~~~~~tA~~i~~~--~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~-d-- 481 (663) .... |..++++.. ...-......+...| +.+ +...+.+.+++..+... . .+.+..+.|..++-..++ . T Consensus 431 ~~~l-G~~sn~SG~Ai~~rq~qg~~~~~~~~-Dnl-~~~~~~~g~~lL~li~~--~--y~~~R~~RI~~ed~~~~~v~in 503 (706) T protein:vir:10 431 QAMQ-QMPSNVARETVNSLLNRSDMASFIYL-DNM-AKSLKRAGEIWLSMARE--I--YGSDREVRIVHEDGTDDIALMN 503 (706) T ss_pred HHHc-CCccchHHHHHHHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHH--H--cCCCcEEEEecCCCCccceeec Confidence 4333 222333332 111222222233222 332 23334444444443211 0 123344445433321110 0 Q ss_pred --------------EEEeecc-----cchhHHHH--HHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhc Q lcl|NC_021532. 482 --------------IDISIST-----AEDNAAKS--QELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEP 540 (663) Q Consensus 482 --------------~~v~~~~-----~~~~~~~~--q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~ 540 (663) .+|+.+. ..+..... ++....|..+.+.+ |...+ +...++.+ ..+.+ .+ + T Consensus 504 ~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~-~p~~~-~~~~l~~~-----~~~~~-d~-p 574 (706) T protein:vir:10 504 AAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGM-LPQDP-MRPALMGI-----IIDNM-EG-E 574 (706) T ss_pred cceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhc-CCcch-hhHHHHHH-----HHhhc-Cc-c Confidence 0111110 00111111 11111111111110 11111 11111100 11110 00 1 Q ss_pred chhhHHHHhh-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 541 KPDPVQEKIR-QLELEN-LMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYA 618 (663) Q Consensus 541 ~~~~~~~q~~-q~~~~~-~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~ 618 (663) ..+...+... ++.++. .+.. ...+++..+++++++..+.+.++..+++++.+ .+++...++.+..+...+ T Consensus 575 ~~~e~~e~irk~~~~q~~~~~~-~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~-------~qA~~~k~~a~~~q~~~~ 646 (706) T protein:vir:10 575 GLDDFKAFNRRQLLTQGIVKPR-NQQEQAIVQQAQQAQATQPDPNMLLAQAQMVV-------AQAEAQKSQNETVQTQIK 646 (706) T ss_pred chHHHHHHHHHhhcccCCcccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHH Confidence 1111111111 111000 0000 00001111111111111112222111111111 111111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccC Q lcl|NC_021532. 619 HLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVVE 663 (663) Q Consensus 619 ~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~~ 663 (663) . ..++.+..+.+.++......+.....++..++.+...++...+ T Consensus 647 a-~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~q 690 (706) T protein:vir:10 647 A-FTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKEVAASQ 690 (706) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 0 1111111111111111111111111222223333234555555 No 142 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=94.73 E-value=0.0036 Score=33.96 Aligned_cols=586 Identities=9% Similarity=0.015 Sum_probs=155.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--cc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--IV 53 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~~ 53 (663) +++.-+.+.. +..++.....+|.+...+.++|..-.| |.+.- .+.++|.. ++ T Consensus 20 ~~~~~~~~~~-~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~d~~v~ 98 (772) T protein:vir:10 20 TPLTVDEYAD-INYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPALLSLQGYEAVTRTDWRVT 98 (772) T ss_pred cccCHHHHHH-HHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHHHHhcCcceEEe Confidence 7777777665 666788888888888888887743222 32210 11222322 22 Q ss_pred cH-HHH-HHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEee Q lcl|NC_021532. 54 SR-DIK-KQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTG 131 (663) Q Consensus 54 ~~-~i~-~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~ 131 (663) +. ... ..+..++..+++.+.. .+.-+.. .+.+....+.. .-++..+ .+.+-...+ -+++. T Consensus 99 Pr~~~~d~~~Ae~l~~~~~~~~~---------~~~~~~~----~s~Af~~~i~~-G~Gw~e~--~~~~d~~~~--~i~i~ 160 (772) T protein:vir:10 99 PNGDVGGQEVADALNYRLNTAER---------QSGADRA----CSEAFRPQIAC-GIGWVEV--SRESDPFKF--PYRCR 160 (772) T ss_pred cCCCchHHHHHHHHHHHHHHHHH---------hcChHHH----HHHHHHHhhhc-CceeEEe--ccccCCCCC--CeEEE Confidence 21 011 1222333333333322 1111111 22221111110 0011000 011111111 11111 Q ss_pred eccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccC-hhhC--ceEEE-----Eeec Q lcl|NC_021532. 132 WDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDN-LDNA--QFVIH-----RYET 203 (663) Q Consensus 132 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d-~~d~--~~~~~-----~~~~ 203 (663) . +....+++|+.. ..++++.++.....++..-....+|++-...-+ ..+. .|... ..+. T Consensus 161 ~--------v~p~~v~~Dp~a-----~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (772) T protein:vir:10 161 P--------IRRDEIHWDMKC-----GDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGG 227 (772) T ss_pred e--------eCcccceecCCC-----CCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccc Confidence 1 112223333322 124555555444444444444455665321100 0000 00000 0000 Q ss_pred CHHHHHH-------------hcCCcChhh--hhh--ccchhh-h-cccccc--cccccc-----------ccccce---E Q lcl|NC_021532. 204 DLSTLKK-------------DGRYKNLDK--LAK--TSGEDF-D-YDSPDD--TEFQFS-----------DAPRKK---L 248 (663) Q Consensus 204 ~~~~l~~-------------~g~~~~~~~--~~~--~~~~~~-~-~~~~~~--~~~~~~-----------d~~~~~---v 248 (663) +...+.. .++..+..+ +.+ +-.... . ....+. ..+.-. -...++ - T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~ 307 (772) T protein:vir:10 228 TSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVS 307 (772) T ss_pred cccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeee Confidence 0000000 001110000 000 000000 0 000000 000000 000011 1 Q ss_pred EEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCC--ChHHHHHHHHHHH Q lcl|NC_021532. 249 IIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGE--ANAEMIGDNQKVK 320 (663) Q Consensus 249 ~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~--g~~~~~~d~Q~~~ 320 (663) +|+.+|+ .|+++.+..-+-|.++++- -.||| .| .||-++....++.+.+.+ |-...+.-.++.+ T Consensus 308 rv~~~~~----~g~~~L~~~~~p~~~~~fP---~vP~~g~r~~~~g-~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~~~ 379 (772) T protein:vir:10 308 RVRRSYW----LGPHCLHDGPTPYTHRHFP---YVPFFGFREDATG-IPYGYVRGMKYAQDSLNSGVSKLRWGMSVARVE 379 (772) T ss_pred EEEEEEE----ecceeeccCCCCCCCCccc---eEEEeeeEeccCC-cccchhhhhhhHHHHHHHHHHHHHHHHhccccc Confidence 2222221 1222221111111122110 01111 11 244334444444333321 1111111111100 Q ss_pred HH------HHHHHHHHHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHHHH Q lcl|NC_021532. 321 TA------VIRGIIDNMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEIES 394 (663) Q Consensus 321 N~------~~~~~~~~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (663) .. .-....+.++. .+.-+.++.|.. ..||+.+.+++....+. ..-.+.+.....++.+-..-.+ T Consensus 380 ~~~gav~~~d~~~~e~~ar-p~~vi~~~~~~~-------~~~~~~~~~~~~~~~~~--~~~~llq~~~~~i~~vsGv~~~ 449 (772) T protein:vir:10 380 RTKGAVAMTDAQFRRQIAR-PDADIVLDENHM-------AKPGARFDVKRDYTLTD--QHFQMLQDNRATIERVSNITAG 449 (772) T ss_pred ccCCCccchhHHHHHhccC-CCCeEEeCCccc-------cCCCCCccccCCccccH--HHHHHHHHHHHHHHHHhCCCHH Confidence 00 00000111100 111222322222 23556565554322110 1112222223334443333334 Q ss_pred HhCCChHHcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHhcCCceEEEE Q lcl|NC_021532. 395 ITGTKSFSGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN----------AEFLEEEEVIRV 463 (663) Q Consensus 395 ~tGi~~~~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li----------~q~~~~~~~iri 463 (663) ..|...... ++-+... .-.+.. .....+..+.+... ...+.++.++-..+ ....+.++++.| T Consensus 450 ~lG~~~na~--SGvAi~~rq~qg~~----~l~~~~Dnl~~~~~-~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~i 522 (772) T protein:vir:10 450 FQGRKGTAT--SGIQEQQQIEQSNQ----SIGRIMDNFRAGRT-LVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVL 522 (772) T ss_pred HcCCCcchh--hHHHHHHHHHHHHH----HHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEe Confidence 555432211 1111110 110010 01111111111111 11112222222221 122334566665 Q ss_pred ec-------Ceee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhh---hhhh Q lcl|NC_021532. 464 TN-------DKFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLM---RMPE 530 (663) Q Consensus 464 ~~-------~~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~---~~~e 530 (663) .. +..+ .|..-...-..+.... .+..+....+.+++++..+.|.+.+.+...++ ..+++. .+.+ T Consensus 523 n~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~--~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~l-e~~D~p~~~ei~~ 599 (772) T protein:vir:10 523 NEPQRDPQTGAAYLSNDLLRTRIKVALEDVPS--TNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLV-SLMDVPFKRDVVE 599 (772) T ss_pred ccceecccccccceeccceeeeEEEEeecccc--chHHHHHHHHHHHHHHhccChhHHHHHHHHHH-hhcCCCChHHHHH Confidence 42 1111 1222221111222222 23445566666666666555554333222111 111110 1111 Q ss_pred hhhhhhhh----hcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 531 QAKRMREY----EPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMT 606 (663) Q Consensus 531 ~~~~l~~~----~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~ 606 (663) ..+.+... +.+..+.++.+.+++.++++.+..++.++.+...+++++.++++.....+++..++++.+. .. T Consensus 600 ~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~-----~~ 674 (772) T protein:vir:10 600 AIRAVDQQQTPEQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQ-----IA 674 (772) T ss_pred HHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-----HH Confidence 11111111 1111222222222333333444444555544444444444443333332222211111100 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhhhhhhccccC Q lcl|NC_021532. 607 DLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQML--AQRNAGDTNIGVVE 663 (663) Q Consensus 607 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~--~~~~~~~~~~~~~~ 663 (663) ++ .+......+.++... .+.. .........- .+....+..-..++ T Consensus 675 q~---------~q~a~~ad~~l~~~g-~~~~--~~~~~~~~~p~~~~~a~~~~~~~~~~ 721 (772) T protein:vir:10 675 QM---------PMIAPIADAVMQSAG-YQRP--NPAGDDPNYPIADQTAAMNIRSPYIQ 721 (772) T ss_pred hh---------hhhhHHHHHHHHhcc-cccc--cccccCCCCCCCCCccCCCCCccCCC Confidence 00 000011111111000 0000 0000000000 00000000000111 No 143 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=93.59 E-value=0.007 Score=32.39 Aligned_cols=582 Identities=12% Similarity=0.070 Sum_probs=160.4 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCC--------------ccccCCCc--cc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYG--------------NEQKGKSA--IV 53 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~--------------~~~~g~s~--~~ 53 (663) =++..+. +..+..++.....+|.+...+.++|..-.| |.+.- ...++|.. +. T Consensus 18 ~~~~~~~-l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~ 96 (714) T protein:vir:10 18 PRFSQRQ-LLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTVDGVLGMEAKTRTDLIVM 96 (714) T ss_pred hhhhHHH-HHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHHHHHhCCcceEEe Confidence 3455554 455566788888888888888888743322 32210 11223322 22 Q ss_pred cH-------HHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHH-HHHhcCc Q lcl|NC_021532. 54 SR-------DIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVK-VLDREGT 125 (663) Q Consensus 54 ~~-------~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~G~ 125 (663) +. .++..+..++.. +.. .+.-+.. .+.+....+.. .-+...+ .+. |. .- T Consensus 97 pr~~~~~~~~~Ae~l~~~~~~----~~~---------~~~~~~~----~s~af~~~~~~-G~G~~~~--~~d~d~---~~ 153 (714) T protein:vir:10 97 SDDPNDETEKLAEAINAEFAD----ACR---------LGNMNKA----RSDAYAEQIKA-GLSWVEV--RRNSEP---FG 153 (714) T ss_pred cCCCChhhHHHHHHHHHHHHH----HHH---------hhchhHH----HHHHHHHhhhc-ccceEEe--eeccCC---CC Confidence 21 122233322222 221 1111111 11111111100 0010000 000 11 11 Q ss_pred eEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccc--------cChhhCce- Q lcl|NC_021532. 126 LVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQ--------DNLDNAQF- 196 (663) Q Consensus 126 g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~--------~d~~d~~~- 196 (663) |-+++. .+....+++|+.. .+.++++.++.....++..-....+|++.... .++.+... T Consensus 154 ~~i~i~--------~v~p~~v~~Dp~a----~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~ 221 (714) T protein:vir:10 154 PEFKVS--------TVSRNEVFWDWLS----READLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVT 221 (714) T ss_pred CCeEEE--------ecChhheeecccc----ccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhh Confidence 222221 1122233333332 23444455544443333333344456552210 00000000 Q ss_pred --EEEEeecCHHHHHH------hcCCcChhhhh----h------------ccchhhhcccccccc-----ccccccccce Q lcl|NC_021532. 197 --VIHRYETDLSTLKK------DGRYKNLDKLA----K------------TSGEDFDYDSPDDTE-----FQFSDAPRKK 247 (663) Q Consensus 197 --~~~~~~~~~~~l~~------~g~~~~~~~~~----~------------~~~~~~~~~~~~~~~-----~~~~d~~~~~ 247 (663) ..+...-..++... .++..+...+. + .++.-..++..+... .........+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~ 301 (714) T protein:vir:10 222 EGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGR 301 (714) T ss_pred hhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccc Confidence 00000000000000 00000000000 0 000000000000000 0000011111 Q ss_pred E-EEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCc------CCCCCEEEEeeeeecCcccCCChHHHHHHHHHHH Q lcl|NC_021532. 248 L-IIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYP------DGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVK 320 (663) Q Consensus 248 v-~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~------~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~ 320 (663) + +|+.+|+- |+.+.....+.|-++.+- ..|+| .| .||-++....++.+.+.+..... -..+ T Consensus 302 ~~rv~~~~~~----g~~~L~~~~~p~p~~~fp---~vP~~g~~~~~~g-~~~G~vr~~~d~Qr~~N~~~s~~----~~~l 369 (714) T protein:vir:10 302 VSRIREAWFV----GPHFIVDRPCSAPQGMFP---LVPFWGYRKDKTG-EPYGLISRAIPAQDEVNFRRIKL----TWLL 369 (714) T ss_pred eeeEEEEEEe----cchhhhcCCCCCCCCcee---eEEecceeeeccC-ccceehhhhhhHHHHHHHHHHHH----HHHH Confidence 1 23333321 111111000111111110 01221 12 25544444444433332111110 0011 Q ss_pred HHHHHHHH----------HHHHhcCCCc--EEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHHHHHHHH Q lcl|NC_021532. 321 TAVIRGII----------DNMAQSNNGQ--VAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAFDMISLM 388 (663) Q Consensus 321 N~~~~~~~----------~~~~~~~~~~--~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 388 (663) |... .+. ......+.|. +.++.+. .....++..+.+.+....+.. .-.+.+.....++.. T Consensus 370 ~~~~-~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~llq~~~~~i~~~ 441 (714) T protein:vir:10 370 QAKR-VIMDEDATQLSDNDLMEQLERPDGIIKLNPVR-----KNQKSVADVFRVEQDFQVASQ--QFQVMQESEKLIQDT 441 (714) T ss_pred hCCc-eeeccccccccHHHHHHhccCCCCeEEecccc-----cccCCccccccccCCCCCcHH--HHHHHHHHHHHHHHh Confidence 1100 000 0001112221 1121111 111223344555442211110 011112222333333 Q ss_pred HHHHHHHhCCChHH-cCCCcccchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHhcC--CceEEE Q lcl|NC_021532. 389 NNEIESITGTKSFS-GGINSGSLGST-ATGARGALDATATRRMNIVRNIAENLVKPLM--RKWMAYNAEFLE--EEEVIR 462 (663) Q Consensus 389 ~~~~~~~tGi~~~~-~G~~~~~~~~t-A~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~--~~~~~li~q~~~--~~~~ir 462 (663) -..-....|...-. .|..-++.... .+.......+-....+...+.+-. ++...+ +.++.++-+... ..+++. T Consensus 442 tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~-li~~~~~~~rv~RI~~e~~~~~~~~~~~ 520 (714) T protein:vir:10 442 MGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLA-YLLDDLKKRRNHAVVINRDDRQRRQTIV 520 (714) T ss_pred hCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHcCCCcEEEEeccCCCcccceeEe Confidence 22223444544221 12110010111 111111222211121222221111 010000 111112222111 123333 Q ss_pred EecCe--ee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhH------------HHHHHHHHh Q lcl|NC_021532. 463 VTNDK--FV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRR------------DIMADIMDL 525 (663) Q Consensus 463 i~~~~--~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~------------~~l~~~~~l 525 (663) +.... -. .|..-.+.-..+..... +..+.+..+.|+++++.+.|.+...+.. .++..+... T Consensus 521 ~n~~~~~~~~~nDi~~~~~dv~i~~~p~~--~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~ 598 (714) T protein:vir:10 521 LNAEGDNGELTNDISRLNTHIALAPVQQT--PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAA 598 (714) T ss_pred eccccCCccccccceeeeEEEEEeeccCc--HHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHH Confidence 32110 00 11111111122222222 2335566777777777766654332111 122222222 Q ss_pred hhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 MRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADM 605 (663) Q Consensus 526 ~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~ 605 (663) .+.+.. .+..++.+.+.+.++.+++.++.+++.++++++.+..+++++++++++.....+++.+.+.++........ T Consensus 599 ~~~~~~---~~~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~ 675 (714) T protein:vir:10 599 LGTPKS---PDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDAL 675 (714) T ss_pred cCCCCC---ccccCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222111 12222333333334444444555556666666666666666555555555444444333222211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021532. 606 TDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNA 654 (663) Q Consensus 606 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~ 654 (663) .+++... .++.....++.....++++ .+ ..++++.+... T Consensus 676 ~~a~~a~---~l~~~~~~~q~~~~~~q~~-~q------~~~~~~~~~~~ 714 (714) T protein:vir:10 676 NQAHTAE---IITGVQNMEQEQDVLQQQM-LY------TLQQRMNEMSL 714 (714) T ss_pred HHHHHHH---HHHHHHhhhhhHHHHHHHH-HH------HHHHHHHhcCC Confidence 1111111 0111111111110011111 00 01111111111 No 144 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=92.48 E-value=0.011 Score=31.27 Aligned_cols=455 Identities=11% Similarity=0.059 Sum_probs=167.4 Q ss_pred CC-Cc-HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhcCCcCCcc------ccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_021532. 1 MK-IN-KAELLSALKADMKAADVLKQEQDSL---ISTWKAEYNGEPYGNE------QKGKSAIVSRDIKKQSEWQHATIV 69 (663) Q Consensus 1 ~~-~~-~~~~~~~l~~~~~~~~~~~~~~~~~---~~~~~~~y~~~~~~~~------~~g~s~~~~~~i~~~v~~~~~~l~ 69 (663) |- .+ ...........|+-.......-... -..|.=...+..++.+ .+-.-.+++|.+..+++.++|.+. T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 32 11 2223333333333332222111000 0111111111111111 111236789999999999998775 Q ss_pred HhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 70 DPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 70 ~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) + .+|.+++ | ..+..++..+--..++...++.+++..++.+|.+++-|.+..... T Consensus 81 ~----k~p~~~~-p---------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~------------ 134 (501) T protein:vir:95 81 M----RDPVVKV-P---------ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEA------------ 134 (501) T ss_pred c----CCcceeC-c---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCC------------ Confidence 2 2333321 1 234444444322233455567889999999999998875431100 Q ss_pred ccccccccccccccceeecccceeeeccHHHhe-eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQPTARVCRNEDIY-LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +. ...+.+.+.....|++..++|+++. |+-..........++..+...+ T Consensus 135 ~~------~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~------------------------ 184 (501) T protein:vir:95 135 EG------GASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWC------------------------ 184 (501) T ss_pred cc------cccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEe------------------------ Confidence 00 0011122333456888888888884 2211111111112221111100 Q ss_pred hccccccccccccccccceEEE----------EEEEEEeeec-CCceeEEEEEEEECC--EEEecccCCCcCCCCCEEEE Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLII----------YEYWGNYDVD-GDGIAEPIVCAWIND--VIVRLQSNPYPDGKPPFLVV 295 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v----------~E~w~~~~~~-~~g~~~~~~~~~~g~--~~l~~~~~p~~~~~~Pf~~~ 295 (663) ..+. .++......+++ ++.|.+-... .+|.....-.+.... .....+.++ .+..||+++ T Consensus 185 ----~~d~--~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~--l~~IPfv~~ 256 (501) T protein:vir:95 185 ----AADD--GFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKR--LTEIPFMFI 256 (501) T ss_pred ----ecCC--CcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccCCCc--CCeeeEEEE Confidence 0000 111111122222 2223221100 000000000000000 011112222 245676643 Q ss_pred eeeeecCccc--CCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccccCcc------hhhhccCCcceEeCCCCC Q lcl|NC_021532. 296 PFNSIPFKLH--GEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGALDQT------NRKKFLAGANFEFNGTAN 367 (663) Q Consensus 296 ~~~~~~~~~~--g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~i~~~------d~~~~~p~~vi~~~~~~~ 367 (663) . ..++.+ +.++.-.+..+.-.+=...+-.-+++..++.|...+ .|.-+.. ....+.++..+.+- .+. T Consensus 257 ~---~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i-~G~~~~~~~~~~~~~i~~G~~~~~~lP-~~~ 331 (501) T protein:vir:95 257 G---SENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVL-IGLTEEWVTNVLKGSVNFGSRGGIPLP-VGA 331 (501) T ss_pred e---cCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeee-eCCcccccccCCCCceeecccccccCC-CCC Confidence 2 223322 344555555553332112233466677777776554 3322211 11122333333322 223 Q ss_pred ccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 368 DFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKW 447 (663) Q Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~ 447 (663) ...++.+.+.. .....|+.+.+.|..+ |.. ...+ +..+.||++.+....+....|..++.++.+ . ++.+ T Consensus 332 ~~~~ie~~~~~-i~~~~l~~l~~~m~~~-Ga~-ll~~---~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~-a----l~~~ 400 (501) T protein:vir:95 332 DAKLLQASENT-MLKEAMDTKERQMVAL-GAK-LVEQ---KEVQRTATEAELEAASEGSTLSSATKNVSA-A----FEWA 400 (501) T ss_pred ceeEEecChhh-HHHHHHHHHHHHHHHH-HHh-hccC---CccchhHHHHHHHHHHHhHHHHHHHHHHHH-H----HHHH Confidence 34444443221 1234455555556443 432 2222 222467777777666777788888888854 3 4556 Q ss_pred HHHHHHhcCCc-eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhh Q lcl|NC_021532. 448 MAYNAEFLEEE-EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLM 526 (663) Q Consensus 448 ~~li~q~~~~~-~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~ 526 (663) |.++..|.... .-.. +.|+++ |.. ..-.. +.+.+++...... .+....+...+.-. T Consensus 401 l~~~a~w~g~~~~~~~------v~i~~d-----f~~------~~~~~---~~~~al~~~~~~G---~is~~t~~~~L~~~ 457 (501) T protein:vir:95 401 LKWAARWVGQADSGVK------FELNTD-----FDI------ARMTP---DERRSLVEEWQKG---AITFEEMRTGLRKA 457 (501) T ss_pred HHHHHHHcCCCCCceE------EEEecc-----ccc------ccCCH---HHHHHHHHHHhCC---CCcHHHHHHHHHhC Confidence 77777775321 1011 122221 100 00011 1122222222111 12222222222222 Q ss_pred hhhhh-----hhhhhhhhcchhhHH---HHhhH----HHHHHHH Q lcl|NC_021532. 527 RMPEQ-----AKRMREYEPKPDPVQ---EKIRQ----LELENLM 558 (663) Q Consensus 527 ~~~e~-----~~~l~~~~~~~~~~~---~q~~q----~~~~~~q 558 (663) ++... .+.++.....++... ..... -..-..+ T Consensus 458 ~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 458 GVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred CCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 22110 011110000000000 00000 0000000 No 145 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=92.38 E-value=0.012 Score=31.19 Aligned_cols=585 Identities=12% Similarity=0.022 Sum_probs=128.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCCc------------cccCCCc--ccc- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYGN------------EQKGKSA--IVS- 54 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~~------------~~~g~s~--~~~- 54 (663) -+.+=+.+.+.+..++.....+|.++..+.++|..-.| |.+.-+ +.++|.. +++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~~~~nr~d~~v~P~ 83 (725) T protein:vir:77 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLYRPK 83 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHhcCCCccccHHHHHHHHHhhHHhCCcceEEecC Confidence 66677889999999999999999999888887743222 222100 1112221 111 Q ss_pred ----HHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHH-----H-HHHHhc- Q lcl|NC_021532. 55 ----RDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKA-----V-KVLDRE- 123 (663) Q Consensus 55 ----~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~-----~-~d~~~~- 123 (663) -.++.....++.+..+. ++ -+..-.+...+.| .......++..++ | .+..+. T Consensus 84 ~~~d~~~Ae~l~~~~~~~~~~-~~------------~~~a~s~Af~~~i----~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:77 84 DGARPDAADVLMGMYRTDMRH-NT------------AKIAVNIAVREQI----EAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred CccHHHHHHHHHHHHHHHHHh-hC------------chhHHHHHHHHHh----hcCcceeeeeecccCCCCCCCceeeEE Confidence 12222233333322211 01 1111111111110 0000000000000 0 000000 Q ss_pred ---CceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHH---Hhe--------------- Q lcl|NC_021532. 124 ---GTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNE---DIY--------------- 182 (663) Q Consensus 124 ---G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~---~~~--------------- 182 (663) ...+..|+||+.... .++++.++.....+ +++. .|+ T Consensus 147 ~~~~~~~~~v~~Dp~a~~--------------------~D~sDar~~~~~~~---~~~d~~~~~~~~~~~~~~~~~~~~~ 203 (725) T protein:vir:77 147 EPIHSACSHVIWDSNSKL--------------------MDKSDARHCTVIHS---MSQNGWEDFAEKYDLDADDIPSFQN 203 (725) T ss_pred eecccChhhceeCchhhc--------------------cChhhHHHHHHHhc---CCHHHHHHHHhhCCcchhhcccccc Confidence 011222455543221 12222222111111 1111 111 Q ss_pred -----eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccccc-cccceEEEEEEEEE Q lcl|NC_021532. 183 -----LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSD-APRKKLIIYEYWGN 256 (663) Q Consensus 183 -----~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~~~v~v~E~w~~ 256 (663) .+......+.-+.| +++.+...--+.. .++...+...+...+..............- ..+..-+.-.+|+. T Consensus 204 ~~~~~~~~~~~d~vrv~E~-~~r~~~~~~~~~~--~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~ 280 (725) T protein:vir:77 204 PNDWVFPWLTQDTIQIAEF-YEVVEKKETAFIY--QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSI 280 (725) T ss_pred cccccccccCCCeeEEEEE-EEEEEEeeEEEEe--cCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEee Confidence 11000000000111 1111110000000 000000000000000000000000000000 00000011122332 Q ss_pred ee----e-cCC---ceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC--C-hHHHHHHHHHHHHHHHH Q lcl|NC_021532. 257 YD----V-DGD---GIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE--A-NAEMIGDNQKVKTAVIR 325 (663) Q Consensus 257 ~~----~-~~~---g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~--g-~~~~~~d~Q~~~N~~~~ 325 (663) .. + +.+ |-.-+++.+|.-...+ ...||++| ++....++-..+-+ | ..+.+.-.+........ T Consensus 281 ~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~--~g~~~~~G-----~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~ 353 (725) T protein:vir:77 281 ITCTAVLKDKQLIAGEHIPIVPVFGEWGFV--EDKEVYEG-----VVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWP 353 (725) T ss_pred ecCceeeccCCcCCCCccceEEEeeeeecc--CCcccccc-----hhhhhhhHHHHHHHHHHHHHHHHHhccccccccch Confidence 10 0 000 0011222222111111 11233222 22222333332211 1 11111111111100011 Q ss_pred HHHHH---HHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHHH----HHHHHHHHHHHHHhCC Q lcl|NC_021532. 326 GIIDN---MAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSAF----DMISLMNNEIESITGT 398 (663) Q Consensus 326 ~~~~~---~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~tGi 398 (663) ..++. ....++....+....+...+ ...|.+.+...+ ..+.|+-...++ ..++..-..-.+..|. T Consensus 354 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~i~~~~------~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~ 425 (725) T protein:vir:77 354 EQIAGFEHMYDGNDDYPYYLLNRTDENS--GDLPTQPLAYYE------NPEVPQANAYMLEAATSAVKEVATLGVDTEAV 425 (725) T ss_pred hhhhHHHHHHHhccCCceecccccccCC--CcccccCccccC------CCCchHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 11111 11111111111111111000 011212222211 223333222233 3333333333445555 Q ss_pred Ch-HHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhcCCceEEEEecC--- Q lcl|NC_021532. 399 KS-FSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN--------AEFLEEEEVIRVTND--- 466 (663) Q Consensus 399 ~~-~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li--------~q~~~~~~~iri~~~--- 466 (663) .. ...|..-+.... .+...+ ...+..+.+... ...+.+..++-..+ ..--..++++.|+.. T Consensus 426 ~~n~~SG~ai~~rq~--qg~~~~----~~~~Dnl~~~~~-~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~ 498 (725) T protein:vir:77 426 NGGQVAFDTVNQLNM--RADLET----YVFQDNLATAMR-RDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVD 498 (725) T ss_pred CchhhHHHHHHHHHH--HHHHHH----HHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccc Confidence 53 233422211111 111111 111111111111 11111222222211 111123345555421 Q ss_pred ----eee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhc--------------cCCCcchhHHHHHHHHH- Q lcl|NC_021532. 467 ----KFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLG--------------PNEDPKIRRDIMADIMD- 524 (663) Q Consensus 467 ----~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~--------------~~~~p~~~~~~l~~~~~- 524 (663) .++ .+.+ .+.-.++......+-- ... ...|++++..++ ..++......+...+-. T Consensus 499 ~~~G~~~~~NDi~g-~~Dv~v~~~p~~~s~r-~~~-~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq 575 (725) T protein:vir:77 499 LATGEKQVLNDIRG-RYECYTDVGPSFQSMK-QQN-RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQ 575 (725) T ss_pred cccchhHhhhhhcc-ceeeEEeeccchHHHH-HHH-HHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhh Confidence 111 1211 1112222222221111 111 111112222211 11211111111111100 Q ss_pred --hhhh-----hhhhhhhhhhhcchhhHHHHhhHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 525 --LMRM-----PEQAKRMREYEPKPDPVQEKIRQLE-------LENLMLENQMLVASINDKNARANENTIDAELKRSKAA 590 (663) Q Consensus 525 --l~~~-----~e~~~~l~~~~~~~~~~~~q~~q~~-------~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~ 590 (663) .... ++..+...+. ...++.+.+.+ ..+.++++++.+++....+.++.+.+.+++...++.. T Consensus 576 ~~~~~~~q~~~~~e~q~~~~~----qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~ 651 (725) T protein:vir:77 576 LIQMGVKKPETPEEQQWLVEA----QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIA 651 (725) T ss_pred hhhhhccCCCChhhHHHHHHH----HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000000000 00011111111 1122222222222222222222222223322222221 Q ss_pred HH--HH--HHHHHHHHHHHHHHHHHHH--HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccc Q lcl|NC_021532. 591 VE--KA--KARKLSSEADMTDLKFVKE--DNGYAHLE-QVELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGV 661 (663) Q Consensus 591 ~e--~~--~~q~~~~~~~~~~~e~~~~--~~~~~~~~-~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~ 661 (663) .. ++ .++....++..+...++.+ ....+.++ ..+.+...+.++++.++. . ..+..+|....=.+--+ T Consensus 652 ~~~~q~~~~q~a~~~~~~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~q~~~~~~~--~--~~~~~~~~~~~~~~~~~ 725 (725) T protein:vir:77 652 EIFNNMDLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGDEQTHKQRMDIANI--L--QSQRQNQPSGSVAETPQ 725 (725) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhhHHHhhHHHHHHH--H--HHHHhcCCCcCcccCCC Confidence 11 11 0011111111111111111 11111111 111111122222222211 1 11111121111111111 No 146 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=90.53 E-value=0.02 Score=29.86 Aligned_cols=564 Identities=12% Similarity=0.077 Sum_probs=141.5 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcC--------------CccccCCCcc--c Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPY--------------GNEQKGKSAI--V 53 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~--------------~~~~~g~s~~--~ 53 (663) -+-.=+.+.+.+..++.....++.....+.++|..-.| |.+. +...++|..+ . T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~ 107 (711) T protein:vir:10 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVS 107 (711) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHhhhHhhCCcceEEe Confidence 44444556666777777777777777666666633222 2211 0112223222 2 Q ss_pred cH---------------------------HHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhc Q lcl|NC_021532. 54 SR---------------------------DIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRK 106 (663) Q Consensus 54 ~~---------------------------~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~ 106 (663) +. .++...+.++.... . .+.-+. ..+.+....+.. T Consensus 108 p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~----~---------~~~~~~----~~s~af~d~~~~- 169 (711) T protein:vir:10 108 STEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE----Y---------NCDAET----EYDIAFQGAVES- 169 (711) T ss_pred cccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHH----H---------hcChhH----HHHHHHHHhhhc- Confidence 21 22333333322221 1 011111 122222111110 Q ss_pred cchh-HHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCc Q lcl|NC_021532. 107 FDRF-NFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDP 185 (663) Q Consensus 107 ~~~~-~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp 185 (663) .-+. ++..++..+--..|-=+++... ....+++|+. ..+.++++.++.....++..-....+|++- T Consensus 170 G~G~~ev~~d~~~~d~~~~e~~i~~v~---------~p~~v~~Dp~----a~~~D~sDar~~~~~~~~~~~~~~~~yp~~ 236 (711) T protein:vir:10 170 GMGYLRVRSDYLADDSFEQDLIIEAIQ---------NQFSVTIDPD----AKKRDRSDMNWCLIDDTMSKEKFKALYPDA 236 (711) T ss_pred CcceEEEEecccCCCCCCCCeEEeeec---------ChhheeeCcc----ccccChhhhcceeeeecCCHHHHHHhCCch Confidence 0111 0000000000011211111100 1112222222 223444555554444444444445556542 Q ss_pred ccccChhhCceEEEEeecCHHHH--HHhcCCcCh-hhhhhccchh-hhccc-----cccccccccccccceEE-EEEEEE Q lcl|NC_021532. 186 TCQDNLDNAQFVIHRYETDLSTL--KKDGRYKNL-DKLAKTSGED-FDYDS-----PDDTEFQFSDAPRKKLI-IYEYWG 255 (663) Q Consensus 186 ~a~~d~~d~~~~~~~~~~~~~~l--~~~g~~~~~-~~~~~~~~~~-~~~~~-----~~~~~~~~~d~~~~~v~-v~E~w~ 255 (663) .. ..++...-..+-.|.+.+.+ .+.++.... ..+.....+. ..... .............+.+. .-.+|+ T Consensus 237 a~-~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~ 315 (711) T protein:vir:10 237 TA-EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWR 315 (711) T ss_pred hh-hhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEE Confidence 11 11110000000001111000 000000000 0000000000 00000 00000000000011111 111232 Q ss_pred EeeecCCce----------eEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHH- Q lcl|NC_021532. 256 NYDVDGDGI----------AEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVI- 324 (663) Q Consensus 256 ~~~~~~~g~----------~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~- 324 (663) .. .|+.+ .-+++++|+-...++. ...||-++....++-+.+.. ....+. +.+|... T Consensus 316 ~~--~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~-------~~~~~G~vr~~~d~Qr~~N~-~~s~~~---~~l~~~~~ 382 (711) T protein:vir:10 316 KI--TGANVLEGPVEIPSTTIPVIPVWGKSLIIKK-------KEIFRSIIRHSKDAQRMANY-WDSAAT---ETVALAPK 382 (711) T ss_pred EE--ecceeecCCCCCCCCcccEEEEeeeeecccc-------ccccchhhhhhhhhHHHHHH-HHHHHH---HHHHhcCC Confidence 21 11111 1122222221111111 11244444444444443321 111111 2222110 Q ss_pred ------HHHHH----HHHhc-C--CCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHH----HHHHHH Q lcl|NC_021532. 325 ------RGIID----NMAQS-N--NGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSA----FDMISL 387 (663) Q Consensus 325 ------~~~~~----~~~~~-~--~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 387 (663) ...++ .+... + ++-+.+..|.. +++.+.+.+.. +.++-.... ...++. T Consensus 383 ~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~---------~~~~~~~~~~~------~~~~~~~~ll~~~~~~i~~ 447 (711) T protein:vir:10 383 APFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQ---------GDPGPRRQPPA------AVPAAELTLGQNSVEKIKS 447 (711) T ss_pred CceeecCcccCChHHHHHhccccCCCeeEeccccc---------CcCCccccCCC------CCCHHHHHHHHHHHHHHHH Confidence 01111 11111 1 22222333221 11223222211 122222222 233333 Q ss_pred HHHHHHHHhCCChH-HcCCCcccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhcCC Q lcl|NC_021532. 388 MNNEIESITGTKSF-SGGINSGSLGS-TATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN--------AEFLEE 457 (663) Q Consensus 388 ~~~~~~~~tGi~~~-~~G~~~~~~~~-tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li--------~q~~~~ 457 (663) .-..-....|...- ..| .+.+. ...+... ....+..+.+.+. ...+.++.++.... ..-... T Consensus 448 ~tGi~~~~~G~~~n~~Sg---~ai~~~q~qg~~~----l~~~~dn~~~~~~-~~g~~ll~li~~~~~~er~~rI~ged~~ 519 (711) T protein:vir:10 448 TMGMYDASLGAMGNETSG---RAIIARQRQGDRG----SFAFIDNLTKSIR-RVGKILVEMIPHIYDTERVVRLKFPDET 519 (711) T ss_pred HhCCChHHcCCCccchHH---HHHHHHHHHHHHH----HHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCeEEEEecCCCC Confidence 32222444555421 122 11111 1111111 1112222222221 12222333333322 111134 Q ss_pred ceEEEEecCe-------ee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHH---H Q lcl|NC_021532. 458 EEVIRVTNDK-------FV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIM---D 524 (663) Q Consensus 458 ~~~iri~~~~-------~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~---~ 524 (663) ++++.|+... .+ .|..-.+.-.+++.....+--.. ....|+++++++ |...+.+...++ .++ . T Consensus 520 ~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~--~~~~l~ql~~~~-p~~~~~~~~~il-~~~d~p~ 595 (711) T protein:vir:10 520 EDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIE--AAEAMIQFAQAV-PSAAAVMADLIA-QNMDWPG 595 (711) T ss_pred cceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHH--HHHHHHHHHhhc-chhhhHHHHHHH-HhcCCCC Confidence 5667775321 11 12233222234444433332221 122344444443 222222111111 110 0 Q ss_pred hhhhhhhhhhhhhhhcc----hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 525 LMRMPEQAKRMREYEPK----PDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLS 600 (663) Q Consensus 525 l~~~~e~~~~l~~~~~~----~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~ 600 (663) ...+.+..+.+...... +...++...+++++..+++.+..+++....+++++..++++...+++++.++++++... T Consensus 596 ~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~ 675 (711) T protein:vir:10 596 ADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAM 675 (711) T ss_pred HHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111111110 01111111111111112222222222222233333333333333333333333322222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 601 SEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHR 642 (663) Q Consensus 601 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~ 642 (663) ............++. . .+. .+.+.+.+....++.++ T Consensus 676 ~~~~aq~~~~~~qq~-~---~~l--~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 676 IEDMAQGGDVVYQQV-R---ELV--AQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHHHHHHHH-H---HHH--HHHHHHHHHHHHHhhcC Confidence 221111111111110 0 000 00111111111111111 No 147 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=89.01 E-value=0.029 Score=29.02 Aligned_cols=196 Identities=8% Similarity=-0.031 Sum_probs=72.2 Q ss_pred EEEE---EEEEEeee---cCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHH Q lcl|NC_021532. 248 LIIY---EYWGNYDV---DGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKT 321 (663) Q Consensus 248 v~v~---E~w~~~~~---~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N 321 (663) |++. ++||.... +..|. ..++.-+.||+ +..+...+..||.|++..+...=.... T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~----~~~~~~~eilH---------------~r~~~~~~~~~Glspi~~a~~~i~~~~ 61 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSE----IYEYNKNDVIF---------------IKLYDPMQQVYGSPDYVGGITSALLNS 61 (219) T ss_pred CceeecCeEEEEEecceecCCce----eEEeccccEEE---------------ecCCCCCCCcceecHHHHHHHHHHHHH Confidence 1211 12222110 00010 11112222222 222112345678887666543332222 Q ss_pred HHHHHHHHHHHhcCCCcEEe--eccccCcchhhh-------cc---C-CcceEeCCCC----CccccccCccccHHHHHH Q lcl|NC_021532. 322 AVIRGIIDNMAQSNNGQVAI--RKGALDQTNRKK-------FL---A-GANFEFNGTA----NDFWHGSYNAIPSSAFDM 384 (663) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~--~~~~i~~~d~~~-------~~---p-~~vi~~~~~~----~~~~~~~~~~~~~~~~~~ 384 (663) ...+-........+.|..++ ..+.++++.... .. . +.++...+++ ....++...+.-....+. T Consensus 62 aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~ 141 (219) T protein:vir:98 62 DATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANI 141 (219) T ss_pred HHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHH Confidence 21111111223344555433 333566533221 11 1 1223333322 123333333333334455 Q ss_pred HHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEe Q lcl|NC_021532. 385 ISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVT 464 (663) Q Consensus 385 ~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~ 464 (663) -......|-.+.||+..++|....+.+ +.+.+.+ ....|....+.|+...+-.-+-.++.-+..+++ T Consensus 142 rk~~~~eIa~~fgVPp~~lG~~~~~~~-~~sn~eq-----------~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~~~~- 208 (219) T protein:vir:98 142 KNISAQDVLTSHRFPPGLSGIIPVNTA-GLGDPLK-----------IREAYQADEVLPLQEIIAESINSDYEIKSALKV- 208 (219) T ss_pred HHhhHHHHHHHhCCCHHHcccccCCCC-CccCHHH-----------HHHHHHHHHHHHHHHHHHHHhhhhhcCCCccEE- Confidence 556678888899999999996432211 1111211 111233333445444443333222111222222 Q ss_pred cCeeeccchhhcC Q lcl|NC_021532. 465 NDKFVPIRKDDLS 477 (663) Q Consensus 465 ~~~~v~i~~~~~~ 477 (663) +|.....+|.+ T Consensus 209 --~F~~~~~~d~~ 219 (219) T protein:vir:98 209 --NFKQPEKRDKN 219 (219) T ss_pred --eecCcccccCC Confidence 12222233333 No 148 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=88.83 E-value=0.03 Score=28.94 Aligned_cols=266 Identities=9% Similarity=0.091 Sum_probs=114.5 Q ss_pred cCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccc Q lcl|NC_021532. 73 VSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYG 152 (663) Q Consensus 73 ~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~ 152 (663) .++=|+--+.-....+...+.. ++.--...-...+.+..++.+.+++|.|++.+..+.. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~l----L~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~----------------- 59 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDL----LTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY----------------- 59 (278) T ss_pred CccceeEEEecCcccccHHHHH----HHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC----------------- Confidence 2333332222222222222222 2211111234566778889999999999987643210 Q ss_pred cccccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcc Q lcl|NC_021532. 153 NETVVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYD 231 (663) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 231 (663) +.| .+..++|..+-+.+. . T Consensus 60 ----------------G~~~~l~~l~~~~v~v~~~----------------------------~---------------- 79 (278) T protein:vir:78 60 ----------------HQPSKLFLLNPDVVEMLIE----------------------------N---------------- 79 (278) T ss_pred ----------------CcEEEEEEECCceeEEEEc----------------------------C---------------- Confidence 100 111222221110000 0 Q ss_pred ccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHH Q lcl|NC_021532. 232 SPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAE 311 (663) Q Consensus 232 ~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~ 311 (663) +. . ..|+.+. .++|.. .++..+.|++. ......+.++|.|++. T Consensus 80 ---~~---------~-----~~~y~~~-~~~g~~----~~~~~~evih~---------------~~~~~~~~~~G~s~~~ 122 (278) T protein:vir:78 80 ---QS---------R-----ELYYSIH-AATGNK----LIVHNMDMLHF---------------KHIVASNMVQGISPID 122 (278) T ss_pred ---CC---------c-----eEEEEEE-cCCceE----EEEccccEEEE---------------CCCCCCCCeeeccHHH Confidence 00 0 0111111 112211 12222233332 2221235568999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcEE-eeccccCcchhhh---------ccCCcceEeCCCCCccccccCccccHHH Q lcl|NC_021532. 312 MIGDNQKVKTAVIRGIIDNMAQSNNGQVA-IRKGALDQTNRKK---------FLAGANFEFNGTANDFWHGSYNAIPSSA 381 (663) Q Consensus 312 ~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~-~~~~~i~~~d~~~---------~~p~~vi~~~~~~~~~~~~~~~~~~~~~ 381 (663) .+...-...+...+..+... ...|..+ ...+.++++.... ...|+++.+. ++..+..+..++.-... T Consensus 123 ~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~ 199 (278) T protein:vir:78 123 VLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDI 199 (278) T ss_pred HHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecC-CCceEEEccCChhHHHH Confidence 88776666555444433222 2234443 4444555433211 1245566553 34445555544444445 Q ss_pred HHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCceE Q lcl|NC_021532. 382 FDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN-AEFLEEEEV 460 (663) Q Consensus 382 ~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li-~q~~~~~~~ 460 (663) .+..+...+.+-...||++...|...++.-.++ .+ ..+.|.+..+.|+.+.+-.-+ .+.+++... T Consensus 200 ~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~---~~-----------~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~ 265 (278) T protein:vir:78 200 VASENLTRERVANVFQLPSVFLNARSNTNFAKN---EE-----------LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDR 265 (278) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH---HH-----------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHh Confidence 566677888888999999999996543321121 11 112333333445444443322 233332211 Q ss_pred EEEecCeeeccchhhc Q lcl|NC_021532. 461 IRVTNDKFVPIRKDDL 476 (663) Q Consensus 461 iri~~~~~v~i~~~~~ 476 (663) ..+-|+.+|.+.+ T Consensus 266 ---~~g~~~~f~~~~l 278 (278) T protein:vir:78 266 ---EKIGILNLTLNLI 278 (278) T ss_pred ---cCCceEEEecccC Confidence 0112333333333 No 149 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=88.12 E-value=0.034 Score=28.61 Aligned_cols=417 Identities=13% Similarity=0.033 Sum_probs=154.8 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceE Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNG-EPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADII 79 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~-~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~ 79 (663) ||-.++-+++.+...+..+.+.--. .+.-..|..+.-+ ...+........+..+.|+..|+-|...+... | + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~l-----p-~ 73 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATL-----P-L 73 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhC-----c-e Confidence 9999999999998887765332100 0011112111111 11111112223455566777777666655422 2 2 Q ss_pred EEEeCCcchHHHHHHHHHHHhHHHH----hccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 80 KCTPITWEDTDSAEQNELLLNTQFS----RKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 80 ~~~p~~~~D~~~Ae~~~~~~~~~~~----~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) ++.-+..+.. .-.....-+.+++. ..-..+...+.++.+++++|.|++.+..+. T Consensus 74 ~~~~~~~~g~-~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--------------------- 131 (437) T protein:vir:10 74 NLYQTKPDGT-RVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA--------------------- 131 (437) T ss_pred eEEEEcCCCc-eeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC--------------------- Confidence 3322222111 00111111122222 222456677788999999999998765431 Q ss_pred ccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccc Q lcl|NC_021532. 156 VVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPD 234 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 234 (663) +.| .+..++|..+-+.+ +. T Consensus 132 -------------g~~~~L~~l~p~~v~i~~----------------------------~~------------------- 151 (437) T protein:vir:10 132 -------------GVLIGLELMLPQRTTVKR----------------------------LT------------------- 151 (437) T ss_pred -------------CcEEEEEEEcCcceEEEE----------------------------CC------------------- Confidence 000 01112221110000 00 Q ss_pred cccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHH Q lcl|NC_021532. 235 DTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIG 314 (663) Q Consensus 235 ~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~ 314 (663) . ..+ . |++ . ..+|.. .++..+.|||+ ..+. .+.++|.|++..+. T Consensus 152 ---------~-g~~-~--y~~-~--~~~g~~----~~~~~~dIih~---------------r~~~-~d~~~G~spi~~~~ 195 (437) T protein:vir:10 152 ---------S-GAL-Q--YTY-R--NVDGTV----STLAEDDVFHV---------------RGFS-LDGLMGLTPIQYAR 195 (437) T ss_pred ---------C-CeE-E--EEE-E--ecCceE----EEEccccEEEe---------------cCcC-CCCcccccHHHHHH Confidence 0 000 0 111 1 112211 12223334433 2222 22468999998888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhhhc------------cCCcceEeCCCCCccccccCccccHHH Q lcl|NC_021532. 315 DNQKVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRKKF------------LAGANFEFNGTANDFWHGSYNAIPSSA 381 (663) Q Consensus 315 d~Q~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~~~------------~p~~vi~~~~~~~~~~~~~~~~~~~~~ 381 (663) +.-.............+...+.|..++ -++.++++..... ..|+++.+.. +....++...+..... T Consensus 196 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~l~~~~~d~q~ 274 (437) T protein:vir:10 196 EVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEA-GMKYQAITMNPGDVQL 274 (437) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceeccC-CceEEeccCChhhHHH Confidence 776666655555555555556666554 3445554332111 1244555443 3345555544444455 Q ss_pred HHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCceE Q lcl|NC_021532. 382 FDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN-AEFLEEEEV 460 (663) Q Consensus 382 ~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li-~q~~~~~~~ 460 (663) .+...+....+-.+.||+....|....+.. ..+.+.+. ...|....+.|+...+-.-+ ...++.... T Consensus 275 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-~~sn~e~~-----------~~~f~~~tl~P~~~~ie~~l~~kll~~~e~ 342 (437) T protein:vir:10 275 LETRAFNIEEICRWYRVPPFMVGHSEKSTS-WGTGIEQQ-----------TLGFLTFTLRPWLTRIEQAARRSLLRPGER 342 (437) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCccc-ccchHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhccCcccc Confidence 666667788899999999999996543321 22222221 11222223334333332222 222221110 Q ss_pred EEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhc Q lcl|NC_021532. 461 IRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEP 540 (663) Q Consensus 461 iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~ 540 (663) ..-|+.+|.+.+. . ....++.+....+... +-+.| .-......+..++.-...+. ... T Consensus 343 ----~~~~~~fd~~~ll-------~----~d~~~r~~~~~~~~~~--G~~T~----NE~R~~~gl~pi~gg~~~~~-~~~ 400 (437) T protein:vir:10 343 ----DQFYAEFSVEGLL-------R----ADSAGRAAFYSTMTQN--GLMTR----DECRAKENLPPMGGNAAVLT-VQS 400 (437) T ss_pred ----CceEEEEechhhh-------c----cCHHHHHHHHHHHHhC--CCcCH----HHHHHHhCCCCCCCCcceEe-ecC Confidence 0112222221110 0 0011222221111110 11111 11111111222111100000 000 Q ss_pred chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 541 KPDPVQEKIRQLELENLMLENQMLVASINDKNARANE 577 (663) Q Consensus 541 ~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~ 577 (663) ...+....-.+......+..............+..++ T Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 401 ALLPIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred cccchhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 0000000000000000000000000000000000000 No 150 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=85.12 E-value=0.055 Score=27.48 Aligned_cols=590 Identities=11% Similarity=-0.007 Sum_probs=128.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------CCcCCc------------cccCCCc--ccc- Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYN-----------GEPYGN------------EQKGKSA--IVS- 54 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~-----------~~~~~~------------~~~g~s~--~~~- 54 (663) =+.+=+.+.+.+..++.....+|.++..+.++|..-.| |.+.-+ +.++|.. +++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~~N~i~~~i~~v~g~e~~nr~d~~v~P~ 83 (725) T protein:vir:92 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPIDVLYRPK 83 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcccchHHHHHHHHhhHHhCCcceEEecC Confidence 55567888999999999999999999888887743222 222101 1112211 111 Q ss_pred ----HHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCc----- Q lcl|NC_021532. 55 ----RDIKKQSEWQHATIVDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGT----- 125 (663) Q Consensus 55 ----~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~----- 125 (663) -.++.....++.+..+. +.-+..-.+...+.| .......++..++..+--..+- T Consensus 84 ~~~d~~~Ae~l~~~~~~~~~~-------------~~~~~a~s~Af~~~i----~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:92 84 DGASPDAADVLMGMYRTDMRH-------------NTAKIAVNVAVREQI----ESGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred CccHHHHHHHHHHHHHHHHHh-------------hCchHHHHHHHHHHh----hcCcceeeeeecccCCCCCCCceeeEE Confidence 12222223333222211 001111111111111 0000000000000000000010 Q ss_pred -----eEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHhe------------------ Q lcl|NC_021532. 126 -----LVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIY------------------ 182 (663) Q Consensus 126 -----g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~------------------ 182 (663) .+-.|+||+.... .++++.++.....++..-....|+ T Consensus 147 ~~i~~~~~~V~~Dp~a~~--------------------~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 206 (725) T protein:vir:92 147 EPIHSACSHVIWDSNSKL--------------------MDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPND 206 (725) T ss_pred eeccCChhhcccCchhhc--------------------cChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCc Confidence 1112444433221 122222221111111100000110 Q ss_pred --eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccccccccc-ccccceEEEEEEEEEee- Q lcl|NC_021532. 183 --LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFS-DAPRKKLIIYEYWGNYD- 258 (663) Q Consensus 183 --~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~v~v~E~w~~~~- 258 (663) .+......+.-|.| +++.+... .+... .++...+...+...+............+. -..+..-+.-.+|+... T Consensus 207 ~~~~~~~~d~vrv~e~-~~r~~~~~-~~~~~-~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g 283 (725) T protein:vir:92 207 WVFPWLTQDTIQIAEF-YEVVEKKE-TAFIY-QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITC 283 (725) T ss_pred ccccccCCCeEEEEEE-EEEEEEee-eEEee-cCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecc Confidence 00000000000111 01111000 00000 00000000000000000000000000000 00000001112232210 Q ss_pred ---ec----CCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCC--ChH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 259 ---VD----GDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGE--ANA-EMIGDNQKVKTAVIRGII 328 (663) Q Consensus 259 ---~~----~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~--g~~-~~~~d~Q~~~N~~~~~~~ 328 (663) .+ ..|..-+++.+|+-...+ ...||++ -++....++-..+-+ |-. +.+.-............+ T Consensus 284 ~~~l~~~~~~~~~~~P~vP~~g~r~~~--~g~~~~~-----G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i 356 (725) T protein:vir:92 284 TAVLKDKQLIAGEHIPIVPVFGEWGFV--EDKEVYE-----GVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI 356 (725) T ss_pred hhhhcCCCCCCCCceeeEEEEeeeecc--CCccccc-----ceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhh Confidence 00 011112233333221111 1123322 233333344333211 111 111000000000000011 Q ss_pred H---HHHhcCCCcEEeeccccCcchhhhccCCcceEeCCCCCccccccCccccHHH----HHHHHHHHHHHHHHhCCChH Q lcl|NC_021532. 329 D---NMAQSNNGQVAIRKGALDQTNRKKFLAGANFEFNGTANDFWHGSYNAIPSSA----FDMISLMNNEIESITGTKSF 401 (663) Q Consensus 329 ~---~~~~~~~~~~~~~~~~i~~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~tGi~~~ 401 (663) + .....++....+....+...+ ...|...+.+ ....+.|+-...+ ...++..-..-.+..|...- T Consensus 357 ~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~i~~------~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n 428 (725) T protein:vir:92 357 AGFEHMYDGNDDYPYYLLNRTDENN--GEMPTQPLAY------YENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGG 428 (725) T ss_pred hHHHHHHhccCccceeecccccccc--ccccccCCcc------cCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCch Confidence 0 001111211111111110000 0111112222 1222333222223 33333332222344454321 Q ss_pred -HcCCCcccchh-HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEecC----- Q lcl|NC_021532. 402 -SGGINSGSLGS-TATGARGALDA--------TATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVTND----- 466 (663) Q Consensus 402 -~~G~~~~~~~~-tA~~i~~~~~~--------~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~~~----- 466 (663) ..|..-+.... ..++...+..+ +...|..+...|.. ..++++ +-+. ..++++.|+.. T Consensus 429 ~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~---~r~~RI----~~ed-g~~~~v~in~~~~~~~ 500 (725) T protein:vir:92 429 QVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV---PRNVTI----TLED-GSEKEVQLMAEVVDLA 500 (725) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---CcEEEE----ecCC-CCcceEEecccccccc Confidence 12211111000 01111111111 11111111111110 111221 1111 23455555321 Q ss_pred --eee---ccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhcc---CCCc-------chhHHHHHHHHHhhhhhhh Q lcl|NC_021532. 467 --KFV---PIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGP---NEDP-------KIRRDIMADIMDLMRMPEQ 531 (663) Q Consensus 467 --~~v---~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~---~~~p-------~~~~~~l~~~~~l~~~~e~ 531 (663) +++ .+.+ .+.-.++......+-- .+....|++++..+.+ .... .........+++..+.... T Consensus 501 ~G~~~~~Ndi~g-~~Dv~v~~~p~~~s~r--~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~ 577 (725) T protein:vir:92 501 TGERQVLNDIRG-RYECYTDVGPSFQSMK--QQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI 577 (725) T ss_pred ccchhhhhcccc-ceeeEEeeccChHHHH--HHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhc Confidence 111 2221 2233333433322221 2233333444443321 1100 0000011111111110000 Q ss_pred hh-hhhhhhcchhhHHHHhhHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------- Q lcl|NC_021532. 532 AK-RMREYEPKPDPVQEKIRQLEL-------ENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKA------- 596 (663) Q Consensus 532 ~~-~l~~~~~~~~~~~~q~~q~~~-------~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~------- 596 (663) .. ......+...+...+..+.+. .++++...+.+++++..+++..+.+.++...+.+++..+++. T Consensus 578 ~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~ 657 (725) T protein:vir:92 578 QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNM 657 (725) T ss_pred hhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000111111111111111111 111111112222222222222222222211111111111100 Q ss_pred ----HHHHHHHHHHHHHHH--HHHHHHHHHHH-HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhh Q lcl|NC_021532. 597 ----RKLSSEADMTDLKFV--KEDNGYAHLEQ-VELEDLRHAQHLEREA-MKHRANLEQMLAQRNAGD 656 (663) Q Consensus 597 ----q~~~~~~~~~~~e~~--~~~~~~~~~~~-~~~~~~~~~~~~~~e~-~k~~~~~e~~~~~~~~~~ 656 (663) +...++......++. .+..++..++. ++.++..+++++..++ ++.+........-.+.=+ T Consensus 658 ~~~q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 725 (725) T protein:vir:92 658 DLSKQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPSGSVAETPQ 725 (725) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhcchhccCCccccccCCC Confidence 000011111111111 11111111111 1222222222332222 222211111111111111 No 151 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=77.94 E-value=0.12 Score=25.64 Aligned_cols=587 Identities=9% Similarity=0.008 Sum_probs=135.8 Q ss_pred CCCcHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhc-CCc----CCcc-ccCCCccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021532. 1 MKINKAELLSALKADMKAA-DVLKQEQDSLISTWKAEYN-GEP----YGNE-QKGKSAIVSRDIKKQSEWQHATIVDPFV 73 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~y~-~~~----~~~~-~~g~s~~~~~~i~~~v~~~~~~l~~~~~ 73 (663) ++..+.+.+.+...-++.. ....+.+.......++.+. |.. |... .+- ....+. ...+ T Consensus 106 ~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~--------~~~~~~--~~~~----- 170 (763) T protein:vir:95 106 VTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRK--------EKQEVP--VFSL----- 170 (763) T ss_pred CCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeee--------eeeeeh--hhhh----- Confidence 5555555554443332221 1111111111111111111 110 0000 000 000000 0000 Q ss_pred CCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeecccccee--ccccc----ccc Q lcl|NC_021532. 74 STADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEV--TVMGE----AVV 147 (663) Q Consensus 74 ~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~--~~~~~----~~~ 147 (663) +...+.. .-...++.+..........+-+....+........-.|.++..+........+ ..... .+. T Consensus 171 -----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~ 244 (763) T protein:vir:95 171 -----FPIQTQE-QADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLN 244 (763) T ss_pred -----ccccchh-HHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeec Confidence 0000000 00111111221111111100000111222233334445555554321111111 00000 111 Q ss_pred cCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEE-eecCHH--HHHHhcCCcChhhhhhcc Q lcl|NC_021532. 148 VDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHR-YETDLS--TLKKDGRYKNLDKLAKTS 224 (663) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~-~~~~~~--~l~~~g~~~~~~~~~~~~ 224 (663) +..+..+.....++.+..+.... ...+..++ .+-.|..+. ...+.+ ......-+. ........ T Consensus 245 p~d~~iDp~a~sD~~Da~~~~~~---~~~t~~dL----------~~~~~~y~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 310 (763) T protein:vir:95 245 PENIIIDPSCQGDINKAMFAIVS---FETCKADL----------LKEKDRYHNLNKIDWQSSAPVNEPDHA-TTTPQEFQ 310 (763) T ss_pred HHHheecCCCCCchhhCceEeeE---EeccHHHH----------HhccCCccccchhcchhcccccccccc-ccchhhcc Confidence 11111111100111111111000 01111111 110000000 000000 000000000 00000000 Q ss_pred chhh-hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCc Q lcl|NC_021532. 225 GEDF-DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFK 303 (663) Q Consensus 225 ~~~~-~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~ 303 (663) ..+. .....-..-+...|...+. +. +|++.-.-++-+.+ .+..-...+.-||-+ +|+...+- ...|. T Consensus 311 ~~d~~~~~V~v~E~y~~~d~~gdg--~~-~~~~v~~~g~~iL~------~~~~p~~~~~~PFv~--~~~~p~~~-~~~G~ 378 (763) T protein:vir:95 311 ISDPMRKRVVAYEYWGFWDIEGNG--VL-EPIVATWIGSTLIR------LEKNPYPDGKLPFVL--IPYMPVKR-DMYGE 378 (763) T ss_pred CCCcccceEEEEEeeeeeccCCcc--ee-EEEEEEEEcCeeee------cccccccCCCcCEEE--ecceeecC-cccCC Confidence 0000 0000000000001111111 12 22322222221111 111111122334432 22222211 12333 Q ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHH---HHhcCCCcEEeeccccC--cchhhhccCCcceEeCCCCCccccccCcccc Q lcl|NC_021532. 304 LHGEANAEMIGDNQKVKTAVIRGIIDN---MAQSNNGQVAIRKGALD--QTNRKKFLAGANFEFNGTANDFWHGSYNAIP 378 (663) Q Consensus 304 ~~g~g~~~~~~d~Q~~~N~~~~~~~~~---~~~~~~~~~~~~~~~i~--~~d~~~~~p~~vi~~~~~~~~~~~~~~~~~~ 378 (663) .++..+.+.-...-...|.....+--. ......+.+ -..+.+. +.......||+.+.-........ +.++-+ T Consensus 379 gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav-~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p--~~~~~~ 455 (763) T protein:vir:95 379 PDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGML-DALNSRRYREGEDYEYNPTQNPAQMIIEHKFP--ELPQSA 455 (763) T ss_pred chHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccc-cchhhhcccCCceEEeeCCCChhhhcccccCC--CCcchH Confidence 333333333333333334333222111 111122222 2222221 11122344555443222222222 334555 Q ss_pred HHHHHHHHHHHHHH----HHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021532. 379 SSAFDMISLMNNEI----ESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEF 454 (663) Q Consensus 379 ~~~~~~~~~~~~~~----~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~ 454 (663) ....++++.....+ ....|++....|..+++.++ .+..........++.+.+.+ +...+.++.++..+.-.- T Consensus 456 ~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~---l~qa~~~~~~~~~r~~~~~~-k~l~~~~l~Li~q~~d~~ 531 (763) T protein:vir:95 456 LTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRG---VLDAASKREMAILRRLAKGM-SEIGNKIIAMNAVFLAEH 531 (763) T ss_pred HHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHH---HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhCCCC Confidence 66666555555544 34457776666644333322 23333334445566666666 345566666665543221 Q ss_pred ----cCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHH-HhccCCCcchhH---------HHHH Q lcl|NC_021532. 455 ----LEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQ-TLGPNEDPKIRR---------DIMA 520 (663) Q Consensus 455 ----~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~-~~~~~~~p~~~~---------~~l~ 520 (663) ...+..+.|+...+.. .-++. .++.. .+.-......-.+|..++. .+.+.+...+.. .+.. T Consensus 532 rviRI~g~e~v~v~~~~~~~--~~DV~--V~~~~-as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~ 606 (763) T protein:vir:95 532 EVVRITNEEFVTIKREDLKG--NFDLE--VDIST-AEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAH 606 (763) T ss_pred cEEEEeCCccccccHHHhcC--CcceE--Eeccc-chHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHH Confidence 0111233333221110 00000 00100 0111111111122222111 111111111100 1111 Q ss_pred HHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 521 DIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLS 600 (663) Q Consensus 521 ~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~ 600 (663) .+.....-++ ..+.++...+..+.+.+.+..+++.+..++++...++++....+++..+..+++ +.+..+. T Consensus 607 ~lr~~q~~~d------~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q--~~~e~~~- 677 (763) T protein:vir:95 607 DLRTWQPQPD------PVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTK--HARDLEK- 677 (763) T ss_pred HHHhcCCCcc------chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH- Confidence 1111111111 111111111222223333333333333333333333333322222222222222 1111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhh---hhhcccc-------C Q lcl|NC_021532. 601 SEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANL-EQMLAQRNAG---DTNIGVV-------E 663 (663) Q Consensus 601 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~-e~~~~~~~~~---~~~~~~~-------~ 663 (663) .+ ...+.+.+....++..+.+.+ +....+++.... .+. .... ...++ +.-+-.+ + T Consensus 678 -~~--~~~eaq~~l~~~~a~~~~~~e-a~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 743 (763) T protein:vir:95 678 -MK--AQSQGNQQLEITKALTKPRKE-GELPPNLSAAIG---YNALTNGE-DTGIQSVSERDIAAEANPAYSLG 743 (763) T ss_pred -HH--HHHHHHHHHHHHHHHHHHHHH-hccChhHHHhhh---hccccccc-CCCccchhhcccCccccccccCC Confidence 00 001101000001100000000 000001111110 000 1000 00111 1111111 1 No 152 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=71.91 E-value=0.19 Score=24.54 Aligned_cols=138 Identities=12% Similarity=0.154 Sum_probs=13.4 Q ss_pred hhhhhhhhhhhhhhcchhhHHHHhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 526 MRMPEQAKRMREYEPKPDPVQEKIRQL--ELENLMLENQMLVASINDKNARANENTI--DAELKRSKAAVEKAKARKLSS 601 (663) Q Consensus 526 ~~~~e~~~~l~~~~~~~~~~~~q~~q~--~~~~~q~~~~~~~a~~~~~~a~~q~~~~--~~~~~~~~~~~e~~~~q~~~~ 601 (663) |.+.++.+.+.+...+-........+. .......+.+....+......+...... +...+..+...++.+...... T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~ 80 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLV 80 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223332222222111100000000000 0000000001111111100000000000 000000000000000000000 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHhhhhhhccccC Q lcl|NC_021532. 602 EADMTDLKFVKEDN-GYAHLEQVELEDLRHAQHLEREAMKHRANLE--QMLAQRNAGDTNIGVVE 663 (663) Q Consensus 602 ~~~~~~~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e--~~~~~~~~~~~~~~~~~ 663 (663) ..+........... ..+...+.................+...... ...............+. T Consensus 81 ~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (437) T protein:vir:10 81 APELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFA 145 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhH Confidence 00000000000000 0000000000000000000000000000000 00111111111112222 No 153 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=64.91 E-value=0.29 Score=23.51 Aligned_cols=368 Identities=11% Similarity=0.058 Sum_probs=133.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCC-----ccccCCCccccHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYG-----NEQKGKSAIVSRDIKKQSEWQHATIVDPFVST 75 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~-----~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~ 75 (663) |++= +.+. ....+.......+.++..+.-.+ .....+..+..+.|...|+-|...+... T Consensus 1 M~~f-----~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~---- 64 (386) T protein:vir:49 1 MPIF-----NITN-------LATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATA---- 64 (386) T ss_pred Cchh-----hhhc-------cCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhC---- Confidence 5551 1111 11111111112222222111111 1111122344556666665554444321 Q ss_pred CceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccc Q lcl|NC_021532. 76 ADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNET 155 (663) Q Consensus 76 ~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 155 (663) | +++.-.. +. .++.. -...-..+..+..++.+.+++|.|++.+.++.. T Consensus 65 -p-~~~~~~~------~~---~l~~~-PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-------------------- 112 (386) T protein:vir:49 65 -K-ITTSRKQ------LQ---GIVDN-PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDN-------------------- 112 (386) T ss_pred -c-eeeccch------hh---hhhhc-cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC-------------------- Confidence 1 2222111 11 12211 111224566778889999999999988765421 Q ss_pred ccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccc Q lcl|NC_021532. 156 VVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPD 234 (663) Q Consensus 156 ~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 234 (663) +.+ .+..++|..+-+.. +.+ T Consensus 113 -------------g~~~~l~~i~~~~v~v~~----------------------------~~~------------------ 133 (386) T protein:vir:49 113 -------------GRDMKWEYLRPSQVSFNR----------------------------LDN------------------ 133 (386) T ss_pred -------------CcEEEEEEecCceeEEEE----------------------------cCC------------------ Confidence 000 11122221110000 000 Q ss_pred cccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHH Q lcl|NC_021532. 235 DTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIG 314 (663) Q Consensus 235 ~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~ 314 (663) . ....|.++ ......|.. .++..+.|+| +.++...+.++|.|++..+. T Consensus 134 -~----------~~~~y~~~--~~~~~~~~~----~~~~~~evih---------------~~~~~~~~~~~G~s~l~~~~ 181 (386) T protein:vir:49 134 -Q----------NGLYYNIT--FDDPHIAPK----QHVPQNDILH---------------FRLLSVDGGLTSVSPLMALG 181 (386) T ss_pred -C----------ceEEEEEE--EcCccccce----eEEccccEEE---------------ecCCCCCCccccccHHHHHH Confidence 0 00111111 110111111 1111222332 33333446678999999888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhh---------hccCCcceEeCCCCCccccccCccccHHHHHH Q lcl|NC_021532. 315 DNQKVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRK---------KFLAGANFEFNGTANDFWHGSYNAIPSSAFDM 384 (663) Q Consensus 315 d~Q~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~---------~~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~ 384 (663) +.-...+...+.........+.|..++ -++..+++... ....|+++.+.. +.....+...+......+. T Consensus 182 ~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~-g~~~~~l~~~~~d~~~~e~ 260 (386) T protein:vir:49 182 REFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDD-LEDFTPLEIKSNVAQLLSQ 260 (386) T ss_pred HHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecCC-CceEEEccCChhHHHHHHH Confidence 877776666666666666667777655 33444432211 123456655543 3344455444443445566 Q ss_pred HHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEEEe Q lcl|NC_021532. 385 ISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIRVT 464 (663) Q Consensus 385 ~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~iri~ 464 (663) .++....+-.+.||+....|...... .++..+.+ +...++.++++.+..-+.+.+.. T Consensus 261 ~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~---------------~~~~~i~~~l~~i~~~~~~~l~~------- 317 (386) T protein:vir:49 261 ADWTTGQFAKVYGIPESIVGGDGDQQ-SSLEMIYN---------------IYFKSVSRYLRPFVSEMSKKLSC------- 317 (386) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCcc-chHHHHHH---------------HHHHHHHHHHHHHHHHHHHHhcc------- Confidence 77788888899999999999643221 12211111 11112233333332222211110 Q ss_pred cCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHH-HHhhhhhhhhhh-hhhhhcch Q lcl|NC_021532. 465 NDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADI-MDLMRMPEQAKR-MREYEPKP 542 (663) Q Consensus 465 ~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~-~~l~~~~e~~~~-l~~~~~~~ 542 (663) .+.++...+ +. .........+..++. +..+.+.-.+.++... .....++..... .....+.. T Consensus 318 ---~~~~~~~~~-------~~----~d~~~~~~~~~~l~~--~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~~gGd 381 (386) T protein:vir:49 318 ---EVDVDISPA-------VD----PTGSNYISLINSMVK--SGTLAQNQGLYILQQAEILPKELPDGKNPNRTSLKGGE 381 (386) T ss_pred ---hhcccchhh-------hc----cCHHHHHHHHHHHHh--CCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCCCCCC Confidence 001110000 00 000011111111110 1111121111111100 000000000000 00000000 Q ss_pred hhHHH Q lcl|NC_021532. 543 DPVQE 547 (663) Q Consensus 543 ~~~~~ 547 (663) +..+. T Consensus 382 ~~~~~ 386 (386) T protein:vir:49 382 INEQD 386 (386) T ss_pred CCCCC Confidence 00000 No 154 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=59.70 E-value=0.39 Score=22.84 Aligned_cols=145 Identities=12% Similarity=0.043 Sum_probs=13.3 Q ss_pred CCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH Q lcl|NC_021532. 510 EDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANEN--TIDAELKRS 587 (663) Q Consensus 510 ~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~--~~~~~~~~~ 587 (663) |........+. ++. .++.+...++.............+ +....+...+..++...+.+.... .....++.. T Consensus 1 Mki~elk~el~---~~~--~el~~~~~elr~~~~~~~~~~~el--~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~ 73 (437) T protein:vir:10 1 MKIEKLKKDLA---TKT--AELNTKKAEIRSFTESEDKTIDEV--KAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEK 73 (437) T ss_pred CCHHHHHHHHH---HHH--HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11100000000 000 001111111110000000000000 111111111111111111111100 000111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccc Q lcl|NC_021532. 588 KAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQVEL-----EDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNIGVV 662 (663) Q Consensus 588 ~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~-----~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~~~~ 662 (663) +....................+..+.....+...+... ..............+...+.... ....-...+-.- T Consensus 74 ~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 151 (437) T protein:vir:10 74 RDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADK--KVTAFADYLKTG 151 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHh--hhhhhHHHHHhh Confidence 00000000000000000000000000000000000000 00000000000000000000000 000000000000 Q ss_pred C Q lcl|NC_021532. 663 E 663 (663) Q Consensus 663 ~ 663 (663) + T Consensus 152 e 152 (437) T protein:vir:10 152 E 152 (437) T ss_pred h Confidence 0 No 155 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=51.63 E-value=0.58 Score=21.90 Aligned_cols=418 Identities=12% Similarity=0.119 Sum_probs=139.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhcCC-cCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEE Q lcl|NC_021532. 8 LLSALKADMKAADVLKQEQDSLISTWK----AEYNGE-PYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCT 82 (663) Q Consensus 8 ~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~y~~~-~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~ 82 (663) +.+.++..-+..+..+.-....|.... +-+.|- ..+........+....|+..|+-|..++.. =| +++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~-----lp-~~~~ 74 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAK-----MR-LRLM 74 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhcc-----Cc-eEEE Confidence 222222111111111000001111111 111110 001111122234445566666655544431 12 3333 Q ss_pred eCCcchHHHHHHHHHHHhHHHHh---ccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccccccc Q lcl|NC_021532. 83 PITWEDTDSAEQNELLLNTQFSR---KFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQ 159 (663) Q Consensus 83 p~~~~D~~~Ae~~~~~~~~~~~~---~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (663) -++.+ ....+.....+..++.. .-..+...+.++.+++++|.|++.+-++.. T Consensus 75 ~~~~~-g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~------------------------ 129 (454) T protein:vir:93 75 QTDAQ-GIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR------------------------ 129 (454) T ss_pred EeccC-CccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC------------------------ Confidence 33322 22222233332222222 123456777888899999999988765411 Q ss_pred ccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccc Q lcl|NC_021532. 160 EVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEF 238 (663) Q Consensus 160 ~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (663) +.+ .+.. ++|+... .+ + +.+ T Consensus 130 ---------G~~~~L~~-------i~~~~v~------v~----~-----------~~~---------------------- 150 (454) T protein:vir:93 130 ---------GQIKELRI-------LDWNRVE------PL----V-----------ADD---------------------- 150 (454) T ss_pred ---------CcEEEEEE-------EcCcceE------EE----E-----------cCC---------------------- Confidence 110 0111 2222110 00 0 000 Q ss_pred cccccccceEEEEEEEEEeeecCC-ceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHH Q lcl|NC_021532. 239 QFSDAPRKKLIIYEYWGNYDVDGD-GIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQ 317 (663) Q Consensus 239 ~~~d~~~~~v~v~E~w~~~~~~~~-g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q 317 (663) . +.|+++....+ |.. ...++..+.|||.. +....+.+||.|++..+.... T Consensus 151 ------g------~~~y~~~~~~~~~~~--~~~~~~~~eViH~k---------------~~~~~~~~~G~sp~~~~~~~i 201 (454) T protein:vir:93 151 ------G------EVFYRITPDRNCGIT--EAVTVPAREVIHDR---------------FNCFFHPLIGLPPVYAAGLAA 201 (454) T ss_pred ------C------cEEEEEEeccccccc--eeEEecCcceEEec---------------cCCCCCCceeccHHHHHHHHH Confidence 0 01111111100 000 11122233344332 112335568999998888777 Q ss_pred HHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhhhc-----------cCCcceEeCCCCCccccccCccccHHHHHHH Q lcl|NC_021532. 318 KVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRKKF-----------LAGANFEFNGTANDFWHGSYNAIPSSAFDMI 385 (663) Q Consensus 318 ~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~~~-----------~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~ 385 (663) .................+.|..++ -++.++++..... ..|+++.+.. +....++...+......+.. T Consensus 202 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-g~~~~~l~~~~~d~q~le~~ 280 (454) T protein:vir:93 202 TQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSN-GAKYNPTTFSPVDSQTVEQL 280 (454) T ss_pred HHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccC-CceEEEcccChhHHHHHHHH Confidence 666665555555555555665544 3455654432211 1344555543 33455555444444455666 Q ss_pred HHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCceEEEEe Q lcl|NC_021532. 386 SLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAY-NAEFLEEEEVIRVT 464 (663) Q Consensus 386 ~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~l-i~q~~~~~~~iri~ 464 (663) .+....+-.+.||+....|....+.. +.+.+. ...|....+.|+.+.+-.. ....++.. T Consensus 281 ~~~~~~Ia~~fgVPp~~lg~~~~~t~---sn~e~~-----------~~~f~~~~l~P~~~~ie~~ln~~L~~~~------ 340 (454) T protein:vir:93 281 KMTAEIVCSVFRVPAYKIGVGQPPSS---DNVEAL-----------EQQYYSQCLQTLIESIELLLDEALETGE------ 340 (454) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCCcc---hhHHHH-----------HHHHHHHHHHHHHHHHHHHHHHhhcCCC------ Confidence 67888888999999999996433321 112211 1122222333333333211 11222211 Q ss_pred cCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhh---------- Q lcl|NC_021532. 465 NDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKR---------- 534 (663) Q Consensus 465 ~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~---------- 534 (663) +.++.++.+.+. . + ...++...+..+... ..+.+ .-......+..++.-.+. T Consensus 341 -~~~~~f~~~~ll-------~-~---D~~~r~~~~~~~~~~--G~~T~----NE~R~~~gl~pi~ggD~~~~~~~~~~~~ 402 (454) T protein:vir:93 341 -NESTEFDVTTLL-------R-M---DSERRMKTLGDAVKN--TLLTP----NEARKRENLPPLAGGDALYLQQQNYSLE 402 (454) T ss_pred -CcEEEeechhhh-------c-c---CHHHHHHHHHHHHhC--CCcCH----HHHHHHhCCCCCCCCCeeeeccCccchH Confidence 112333322110 0 0 011122111111110 01111 111111111111110000 Q ss_pred -hhhhhcchhhHHH--HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 535 -MREYEPKPDPVQE--KIRQLELENLMLENQMLVASINDKNARANENTIDAELKR 586 (663) Q Consensus 535 -l~~~~~~~~~~~~--q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~ 586 (663) +.......++... +...........+.-+...+ ...........-..+. T Consensus 403 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e---~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 403 ALSRRDAREDPFASSGKTASVPQAVAASDGNKAITE---TEHDAVKAMFRGILKK 454 (454) T ss_pred hhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccC---CccchhhhhhhhhhcC Confidence 0000000000000 00000000000000000000 0000000000000000 No 156 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=35.68 E-value=1.2 Score=20.12 Aligned_cols=433 Identities=11% Similarity=0.002 Sum_probs=160.2 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHhcCCcCCccccCCCccccHHHHHHHHHH Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLK----------------QEQDSLISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQ 64 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~----------------~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~ 64 (663) ....+. ........|+.+...+ ........+.++++.+++ .+...|+.+ T Consensus 15 ~~~~R~-~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~--------------~a~~av~~~ 79 (502) T protein:vir:79 15 WKAARL-RSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHD--------------LVIGVFDKL 79 (502) T ss_pred HHHHHH-hhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcCh--------------HHHHHHHHH Confidence 000000 1111122233322111 001111222233333222 222334433 Q ss_pred HHHHHHhhcCCCceEEEEeCCcc---hHHHHHHHHHHHhHHHHh-----ccchhHHHHHHHHHHHhcCceEEEeeecccc Q lcl|NC_021532. 65 HATIVDPFVSTADIIKCTPITWE---DTDSAEQNELLLNTQFSR-----KFDRFNFMSKAVKVLDREGTLVVQTGWDYED 136 (663) Q Consensus 65 ~~~l~~~~~~~~~~~~~~p~~~~---D~~~Ae~~~~~~~~~~~~-----~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~ 136 (663) +.+.+.+ +.-.+...|...+ +.+.++.+..+-+.+... ..+++....-+++..+..|-+++...|+... T Consensus 80 ~~nvVG~---ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~ 156 (502) T protein:vir:79 80 EERVVGK---NGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRIN 156 (502) T ss_pred HHhhccC---CceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccC Confidence 3333321 0111222232221 233344444443333221 1234444455788889999999998775321 Q ss_pred ceecccccccccCccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcC Q lcl|NC_021532. 137 EEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKN 216 (663) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~ 216 (663) ....+. . -.-.+..|+|..+ ..| ..+..++.. |. T Consensus 157 ~~~~g~-------------------~------~~l~lq~iepd~l-~~~-----~~~~~~i~~------------GV--- 190 (502) T protein:vir:79 157 SLTPSA-------------------G------VHFWLEALEPDFI-PMT-----SDESNRLNQ------------GV--- 190 (502) T ss_pred ccCCCc-------------------c------cceEEEEecchhc-CCC-----CCCCCeeEe------------ee--- Confidence 100000 0 0002334444433 011 111111110 00 Q ss_pred hhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEe Q lcl|NC_021532. 217 LDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVP 296 (663) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~ 296 (663) + .|... +.+-||......+++...-+..+-+. . ++|+- T Consensus 191 --------------------e---~d~~G---r~~aY~i~~~hPgd~~~~~~~rvpA~-~---------------vlH~f 228 (502) T protein:vir:79 191 --------------------F---VDDWG---RPEKYLVYKSRPVSGRQMETKEVDAE-R---------------MLHLK 228 (502) T ss_pred --------------------E---ECCCC---ceEEEEEeecCCCCCcccceeEechh-h---------------eEEee Confidence 0 00011 13345554433333211111111111 1 22222 Q ss_pred eeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEeeccc---cC--------cchhhhccCCcceE-eCC Q lcl|NC_021532. 297 FNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIRKGA---LD--------QTNRKKFLAGANFE-FNG 364 (663) Q Consensus 297 ~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~~~~---i~--------~~d~~~~~p~~vi~-~~~ 364 (663) -...++..-|.|.+-.++..-+.+..+....+.....++.-.+++..+. .. ........||.++. +.+ T Consensus 229 ~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~p 308 (502) T protein:vir:79 229 FVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKP 308 (502) T ss_pred cccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccCC Confidence 2235666777777766666666666666666655555554444433211 00 01123467888775 454 Q ss_pred CCCccccccCccccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 365 TANDFWHGSYNAIPSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLM 444 (663) Q Consensus 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~ 444 (663) |..+....++...+....++..+...+-.-.||+-..+-.+-+ .+-+.+++-+...-..+......|...++++++ T Consensus 309 -Ge~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s---~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~ 384 (502) T protein:vir:79 309 -GEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN---GTYSAQRQELVESTDGYLILQDWFIGAVTRPMY 384 (502) T ss_pred -CceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4556666554444455666677777777778999666544332 244445555555556666666677778999999 Q ss_pred HHHHHHHHHhc--------CCceEEEE--ecCeeeccchhh-cCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcc Q lcl|NC_021532. 445 RKWMAYNAEFL--------EEEEVIRV--TNDKFVPIRKDD-LSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPK 513 (663) Q Consensus 445 ~~~~~li~q~~--------~~~~~iri--~~~~~v~i~~~~-~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~ 513 (663) +.++....--. +...+.+. .++.+..|||-. .++.. ..|. .++.+. ....... ..+|. T Consensus 385 ~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~-~~i~--~Gl~t~------~~~~a~~--G~D~~ 453 (502) T protein:vir:79 385 RAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWK-IQIR--GGAATE------SDWVRAG--GRNPD 453 (502) T ss_pred HHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHH-HHHH--cCCCCH------HHHHHHc--CCCHH Confidence 98887554321 11111111 122222233211 00000 0000 000000 0011111 12232 Q ss_pred hhHHHHHHHHHh---hhhhhhh-hhh----hhhhcchhhHHHHhhHHHH Q lcl|NC_021532. 514 IRRDIMADIMDL---MRMPEQA-KRM----REYEPKPDPVQEKIRQLEL 554 (663) Q Consensus 514 ~~~~~l~~~~~l---~~~~e~~-~~l----~~~~~~~~~~~~q~~q~~~ 554 (663) .....+..-.+. .+++-.. ... .......+...+-..+.+. T Consensus 454 ~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 454 DVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 221111110000 0111000 000 0000000000000000000 No 157 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=33.86 E-value=1.3 Score=19.91 Aligned_cols=120 Identities=13% Similarity=0.098 Sum_probs=10.1 Q ss_pred HHhhhhhhhhhhhhhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 523 MDLMRMPEQAKRMREYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARAN-------ENTIDAELKRSKAAVEKAK 595 (663) Q Consensus 523 ~~l~~~~e~~~~l~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q-------~~~~~~~~~~~~~~~e~~~ 595 (663) +-|..+. .+.+.++...++.++.++....+.+.. +...+.+....+.+.+... T Consensus 1 ~~~~~~~--------------------l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~ 60 (466) T protein:vir:80 1 MALRQLM--------------------LAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLE 60 (466) T ss_pred CchHHHH--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Confidence 1111100 000111111111112111111111000 0000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------hhhccccC Q lcl|NC_021532. 596 ARKLSSEADMTDLKFVKEDNGYAHLEQVELEDLRHAQHLEREAMKHRANLEQMLAQRNAG------DTNIGVVE 663 (663) Q Consensus 596 ~q~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~------~~~~~~~~ 663 (663) ......+.....++.+-. .......+................................. .......+ T Consensus 61 ~~~~el~e~~~~l~~ei~-~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (466) T protein:vir:80 61 GEKTELEEKKSKLEGEIK-ELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIA 133 (466) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHH Confidence 000000000000000000 00000000000000000000000000000000000000000 00000011 No 158 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=31.79 E-value=1.5 Score=19.66 Aligned_cols=407 Identities=12% Similarity=0.075 Sum_probs=154.1 Q ss_pred CCCc--HHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCCc----cccHHHH---HHHHHHHHHH Q lcl|NC_021532. 1 MKIN--KAEL---LSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKSA----IVSRDIK---KQSEWQHATI 68 (663) Q Consensus 1 ~~~~--~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~----~~~~~i~---~~v~~~~~~l 68 (663) |+|- +.+. ..++...-+. ..-...|. +....++..|... -++..+. .+|-+.+... T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~-----------~~~~~g~~-~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~R 68 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEH-----------LGLATSYL-SEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSI 68 (446) T ss_pred CcccccCCCchhhhhhhhhcccc-----------chhhcccC-CcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHH Confidence 5553 3332 2333221100 11111222 3333455555432 1222221 3455555555 Q ss_pred HHhhcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceeccccccccc Q lcl|NC_021532. 69 VDPFVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVV 148 (663) Q Consensus 69 ~~~~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~ 148 (663) ....++-+ +.|.| .|.+.|+.+.+.+... . +..+...+.|++.+|+.+.++-|+.....- T Consensus 69 k~av~~~~--w~V~p---~~~~~a~~v~~~l~~~-----~-~~~~~~~~ldai~~G~s~~Eivw~~~~g~~--------- 128 (446) T protein:vir:98 69 ALSVLNKV--GPYQH---GDKRIKKFIDDQLRNR-----A-KTWISHCVKSIMTYGFSLSEQIYAHGARDN--------- 128 (446) T ss_pred HHHhhcCC--ceecC---ccHHHHHHHHHHHhhc-----C-chhHHHHHHHHHhhCceeeeEEEeeccccc--------- Confidence 55544322 45666 4566666655555332 1 222233467999999999999996422100 Q ss_pred CccccccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 149 DEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) ...++..+....+..++ .+++|..-. -+..+ ..+.. T Consensus 129 -------------~p~~~~d~~~~~~~~~~-r~~~~~~~~-~~~~~-------~~~~~---------------------- 164 (446) T protein:vir:98 129 -------------MPATVLDDIVNYHPLQV-MLIANDNGR-IVDGD-------TVTAS---------------------- 164 (446) T ss_pred -------------ccchhhccccccccccc-eeeeccCCc-ccccc-------ccchh---------------------- Confidence 00000000000000000 011110000 00000 00000 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) .+.. -.|...+...-+. ........|+ +.+.|.. -|+++.+.+..+++||.| T Consensus 165 ----------~~~~---------~~~~~~~~~~~~~-~~~~~~~~g~------~~~iP~~--kfi~~~~~~~~~~p~G~g 216 (446) T protein:vir:98 165 ----------QYKS---------GYWVPLPPYRIGD-PPKKVDVVGS------HVRLPSH--KRLFINYNTKGNNPWGTS 216 (446) T ss_pred ----------hccc---------ccccCcccchhhh-hhhhcccCcc------ccccccc--ceEEEEecCCCCCccccc Confidence 0000 0000000000000 0000001111 1122222 367888888999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcE--EeeccccCcch---------------hhhc-----cCCcceE---eC Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQV--AIRKGALDQTN---------------RKKF-----LAGANFE---FN 363 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~--~~~~~~i~~~d---------------~~~~-----~p~~vi~---~~ 363 (663) +++.+--+=-..|...+-...-+...+.|-. .++.|+.+.+. ...+ ..++++. .. T Consensus 217 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~ 296 (446) T protein:vir:98 217 CLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSK 296 (446) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccC Confidence 9999999988888887777777777666543 45666543211 1110 1223332 12 Q ss_pred CCCCccccccCcc-ccHHHHHHHHHHHHHHHHHhCCChHHcCCCcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 364 GTANDFWHGSYNA-IPSSAFDMISLMNNEIESITGTKSFSGGINSGSL-GSTATGARGALDATATRRMNIVRNIAENLVK 441 (663) Q Consensus 364 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~-~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~ 441 (663) +.+..+..+.... .+.....++++.+..|..........+|++.... |..++.+.. +--...+..-++.+.+++.+ T Consensus 297 P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~--~V~~d~~~aDa~~i~~tln~ 374 (446) T protein:vir:98 297 EQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQL--ELFDGKINSIFDTVIHAFTE 374 (446) T ss_pred CCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHH--HHHHHHHHHHHHHHHHHHHH Confidence 4444555544332 2234667889998888765544444445332211 111111111 00111222333444444444 Q ss_pred HHHHHHHHHHHHhcCCceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHH Q lcl|NC_021532. 442 PLMRKWMAYNAEFLEEEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMAD 521 (663) Q Consensus 442 ~l~~~~~~li~q~~~~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~ 521 (663) .|+.-++.++ |.......++..+ .+ ..............+.+..+. .++-..++. .. . T Consensus 375 ~Li~~l~~lN--f~~~~~~~~~~~~------------~~--~~~~~e~eDl~~~a~~~~~L~-~~G~~~p~~--~~---~ 432 (446) T protein:vir:98 375 QVIGNLIRLN--FDPALYPLASNTG------------YI--TRLPGRATDLAALVEAIKQMH-DMGFLVDGD--KD---H 432 (446) T ss_pred HHHHHHHHhC--CCccccccccccc------------cc--eeccCChhhHHHHHHHHHHHH-hCCcccccc--HH---H Confidence 4444444433 2221111111100 00 111111111111122222111 111111111 11 1 Q ss_pred HHHhhhhhhhhhhhhhhhcchhh Q lcl|NC_021532. 522 IMDLMRMPEQAKRMREYEPKPDP 544 (663) Q Consensus 522 ~~~l~~~~e~~~~l~~~~~~~~~ 544 (663) +-+.-++|+.. ++. T Consensus 433 ire~~giP~~~---------~~~ 446 (446) T protein:vir:98 433 IRSITGLPDAI---------SST 446 (446) T ss_pred HHHHhCcCCCC---------CCC Confidence 12222333221 111 No 159 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=30.32 E-value=1.6 Score=19.49 Aligned_cols=397 Identities=15% Similarity=0.056 Sum_probs=142.5 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHh-cCCcCCccccCCCccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQE--------QDSLISTWKAEY-NGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFV 73 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~--------~~~~~~~~~~~y-~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~ 73 (663) |-.|..+..+... +.+-.-.++ +.......+.+. .....+........+..+.|+..|+-|...+... T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~l-- 77 (432) T protein:vir:10 1 MPDEKKLGLLGQL-KAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAM-- 77 (432) T ss_pred CCCCcccchhhhh-HhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhC-- Confidence 5555544444322 111110111 000000111111 1011111111112334455666666555544322 Q ss_pred CCCceEEEEeCCcchHHHHHHHHHHHhHHHHh----ccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccC Q lcl|NC_021532. 74 STADIIKCTPITWEDTDSAEQNELLLNTQFSR----KFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVD 149 (663) Q Consensus 74 ~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~----~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~ 149 (663) | +++.-++.+... +....-+.+++.. .-..+...+.++.+++++|+|++.+.++. T Consensus 78 ---p-~~~y~~~~~g~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~--------------- 136 (432) T protein:vir:10 78 ---P-LTMYMRTPDGRK--EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD--------------- 136 (432) T ss_pred ---c-eeEEEecCCCcc--cccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC--------------- Confidence 2 233222222211 1112222233322 22355667778889999999998765420 Q ss_pred ccccccccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhh Q lcl|NC_021532. 150 EYGNETVVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDF 228 (663) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 228 (663) +.+ ....++|..+-+... .+. T Consensus 137 -------------------g~~~~L~~l~~~~v~v~~~-----~~g---------------------------------- 158 (432) T protein:vir:10 137 -------------------GRIESLQYLANDRLTITTD-----TKG---------------------------------- 158 (432) T ss_pred -------------------CcEEEEEEEcCCceEEEEc-----CCC---------------------------------- Confidence 000 011122211111000 000 Q ss_pred hccccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCC Q lcl|NC_021532. 229 DYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEA 308 (663) Q Consensus 229 ~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g 308 (663) + .+|++. ..+|.. ..+..+.++|. .+++.+ .++|.| T Consensus 159 ------------------~-----~~y~~~-~~~g~~----~~~~~~~iih~---------------~~~~~d-g~~G~s 194 (432) T protein:vir:10 159 ------------------N-----TAYRYR-RTDGQM----IDIPKQQIWKI---------------MGYSLD-GENGLS 194 (432) T ss_pred ------------------c-----EEEEEE-ecCceE----EEEcCccEEEe---------------cCCCCC-Cccccc Confidence 0 001110 112211 11222333332 222222 367989 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEee-ccccCcchhhh--------ccCCcceEeCCCCCccccccCccccH Q lcl|NC_021532. 309 NAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAIR-KGALDQTNRKK--------FLAGANFEFNGTANDFWHGSYNAIPS 379 (663) Q Consensus 309 ~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~-~~~i~~~d~~~--------~~p~~vi~~~~~~~~~~~~~~~~~~~ 379 (663) ++..+...-.................+.|..++. ++.++++.... ...|+++.+. ++..+..+...+.-. T Consensus 195 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~-~g~~~~~l~~~~~d~ 273 (432) T protein:vir:10 195 AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLE-GGMDVKSLGLNPVDA 273 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhhCCCceecC-CCceEEEccCChHHH Confidence 9888776555444433333333444455555543 45555433221 1235555553 334555565555444 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCc Q lcl|NC_021532. 380 SAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN-AEFLEEE 458 (663) Q Consensus 380 ~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li-~q~~~~~ 458 (663) ...+...+....|-.+.||+....|....+...+++.+.+. ...|.+..+.|+.+.+-.-+ ...++.. T Consensus 274 q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~-----------~~~f~~~tl~P~~~~ie~~ln~kL~~~~ 342 (432) T protein:vir:10 274 QLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQ-----------QLGFLSMTLSPWLRRIEQSIALNLLSPA 342 (432) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhhcCcc Confidence 55666778888899999999999996543333333333221 12222333444444433322 2333322 Q ss_pred eEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhh--- Q lcl|NC_021532. 459 EVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRM--- 535 (663) Q Consensus 459 ~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l--- 535 (663) .. ..-++.+|.+.+. -....++.+....++.. .-+.+ .-......+..+++-...+ T Consensus 343 ~~----~~~~~~fd~~~ll-----------~~d~~~r~~~~~~~~~~--G~~T~----NE~R~~~glppi~g~~~~~~~~ 401 (432) T protein:vir:10 343 ER----RRYFADFDTSALL-----------RADSAARSSYYSQLVNN--GLMTR----DEAREIEGLPKLGGNAAVLTVQ 401 (432) T ss_pred cc----CceEEEeechhhh-----------ccCHHHHHHHHHHHHhC--CCCCH----HHHHHHhCCCCCCCCcceEeec Confidence 10 0112222221110 00111222222222111 11111 1111112222222111100 Q ss_pred ------hhh--hcchhhHHHHhhHHHHHHHHHHHHH Q lcl|NC_021532. 536 ------REY--EPKPDPVQEKIRQLELENLMLENQM 563 (663) Q Consensus 536 ------~~~--~~~~~~~~~q~~q~~~~~~q~~~~~ 563 (663) ... +..+++......+ ++-..++ T Consensus 402 ~~~~pl~~~~~~~~~~~~~~~~~~-----~~~~~~~ 432 (432) T protein:vir:10 402 SAMVPLDSIGLQASPEPASGLGNQ-----QQDKVSK 432 (432) T ss_pred CcccchhhhcccCCCCCCCCCCCc-----ccccccC Confidence 000 0000000000000 0000000 No 160 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=29.20 E-value=1.7 Score=19.35 Aligned_cols=391 Identities=12% Similarity=0.076 Sum_probs=138.7 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcch Q lcl|NC_021532. 10 SALKADMKAADVLKQEQDSL-ISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWED 88 (663) Q Consensus 10 ~~l~~~~~~~~~~~~~~~~~-~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D 88 (663) -.++..+++ +..-.+. ...+..+.-+........+.+.+....|+..|+.|-..+... | +++.-.. ++ T Consensus 1 m~f~~~~~~----~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l-----p-~~~~~~~-~~ 69 (409) T protein:vir:10 1 MLFRKGFKN----QSQEISIDDKKILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKL-----P-IKIYQKK-DG 69 (409) T ss_pred CcccccccC----cCCCCCCChHHHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhC-----c-eEEEEec-CC Confidence 001111111 0000000 011111111111111111223445556666666554444322 2 2332121 11 Q ss_pred HHHHH--HHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccccccccccccee Q lcl|NC_021532. 89 TDSAE--QNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVV 166 (663) Q Consensus 89 ~~~Ae--~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (663) .+... -+..+++.--...-..+..+..++.+++++|.|++.+.++.. T Consensus 70 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~------------------------------- 118 (409) T protein:vir:10 70 IKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKN------------------------------- 118 (409) T ss_pred eeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC------------------------------- Confidence 11111 122222221112224566777889999999999988765421 Q ss_pred ecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccccccccccccc Q lcl|NC_021532. 167 KKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPR 245 (663) Q Consensus 167 ~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 245 (663) +.+ .+..++|..+-+-. ++ .|... .. T Consensus 119 --G~~~~L~~i~~~~V~v~~------~~-----------------~~~~~----------------------------~~ 145 (409) T protein:vir:10 119 --GEIKGLYPLKSDGMKIFV------DD-----------------TGLLN----------------------------SE 145 (409) T ss_pred --CcEEEEEEEcCCceEEEE------cC-----------------Ccccc----------------------------cc Confidence 000 11112221110000 00 00000 00 Q ss_pred ceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHH Q lcl|NC_021532. 246 KKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIR 325 (663) Q Consensus 246 ~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~ 325 (663) ..+ +++.. .+.|... ++..+.||+. ..+. ++.++|.|++..+.+.-...+...+ T Consensus 146 ~~~-----~y~~~-~~~g~~~----~~~~~evih~---------------r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~ 199 (409) T protein:vir:10 146 NNV-----WYLYT-DDLGQRH----KFMSDEILHF---------------KGLT-ADGLAGLSVIELLNHLIENGKSSET 199 (409) T ss_pred ceE-----EEEEE-eCCceeE----EeccccEEEe---------------cCcC-CCCcccccHHHHHHHHHHHHHHHHH Confidence 001 11110 1112111 1222333333 1111 2346799999888887766666655 Q ss_pred HHHHHHHhcCCCcEEee-ccccCcchhhh-------c-----cCCcceEeCCCCCccccccCccccHHHHHHHHHHHHHH Q lcl|NC_021532. 326 GIIDNMAQSNNGQVAIR-KGALDQTNRKK-------F-----LAGANFEFNGTANDFWHGSYNAIPSSAFDMISLMNNEI 392 (663) Q Consensus 326 ~~~~~~~~~~~~~~~~~-~~~i~~~d~~~-------~-----~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 392 (663) .....+...+.|..++. ++.++++.... . ..|+++.+. ++..+.++..++.-....+..++....+ T Consensus 200 ~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~I 278 (409) T protein:vir:10 200 YLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLP-IGYKFEPISQKLVDAQFLENSQLTIRQI 278 (409) T ss_pred HHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecC-CCceEEEccCChhhHHHHHHHHHHHHHH Confidence 55555666666665543 34455432211 1 134455443 3344555555554455566677788889 Q ss_pred HHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCceEEEEecCeeecc Q lcl|NC_021532. 393 ESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAY-NAEFLEEEEVIRVTNDKFVPI 471 (663) Q Consensus 393 ~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~l-i~q~~~~~~~iri~~~~~v~i 471 (663) -.+.||+....|....+...+ +.+. ...|....+.|+.+.+-.- ....++.... ..+-++.+ T Consensus 279 a~~fgVPp~~lg~~~~~~~~~---~e~~-----------~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~---~~~~~~~f 341 (409) T protein:vir:10 279 ASVFGVKMHQLNDLDRATHSN---ITEQ-----------NREFYIDTLQSILNMYELEINYKLFLISEI---KNGFYSKF 341 (409) T ss_pred HHHhCCCHHHcCCCCCCcccc---HHHH-----------HHHHHHHHHHHHHHHHHHHHHHhhcCchhc---cCCcEEEE Confidence 999999999999654332111 1111 1122222334444333221 1222222110 01112233 Q ss_pred chhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcchhhHHHHhhH Q lcl|NC_021532. 472 RKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKPDPVQEKIRQ 551 (663) Q Consensus 472 ~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~~~~~~q~~q 551 (663) |.+.+. -....++.+.+..+.. .+-+.|.- ...++.+..++.-.+......-.+ ...-..+ T Consensus 342 d~~~ll-----------~~d~~~~~~~~~~~~~--~G~~T~NE----~R~~lgl~p~~ggD~~~~~~n~~~--~~~~~~~ 402 (409) T protein:vir:10 342 NVDTIL-----------RADIKTRYESYKEAIQ--NGFKTPNE----IRELEEDEPLEGGDVLLINGNMIP--VKMAGEQ 402 (409) T ss_pred echhhh-----------ccCHHHHHHHHHHHHh--CCCcCHHH----HHHHhCCCCCCCcCeeeeccCccc--hhhcccc Confidence 222210 0011122222222211 11122211 111222222221111100000000 0000000 Q ss_pred HHHHHHHHHHHH Q lcl|NC_021532. 552 LELENLMLENQM 563 (663) Q Consensus 552 ~~~~~~q~~~~~ 563 (663) ...-.++ T Consensus 403 -----~~kgGe~ 409 (409) T protein:vir:10 403 -----YSKGGEK 409 (409) T ss_pred -----ccccCCC Confidence 0000000 No 161 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=27.32 E-value=1.9 Score=19.11 Aligned_cols=419 Identities=15% Similarity=0.114 Sum_probs=136.7 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHH-HH-HHHH---HHHHHhcCCcCC-ccccCCCccccHHHHHHHHHHHHHHHHhhcC Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQE-QD-SLIS---TWKAEYNGEPYG-NEQKGKSAIVSRDIKKQSEWQHATIVDPFVS 74 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~-~~-~~~~---~~~~~y~~~~~~-~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~ 74 (663) |-+ +..-+.+.....-. .. ..|. .+.-.+.+..++ ...-....+....|+..|+-|..++... T Consensus 1 Mg~--------~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~l--- 69 (457) T protein:vir:13 1 MGF--------WSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATL--- 69 (457) T ss_pred Cch--------hhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccC--- Confidence 332 11112221111100 00 0000 000001111111 0111112233344555555554444321 Q ss_pred CCceEEEEeCCcchHHHHHHHHHHHhHHHHhcc---chhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 75 TADIIKCTPITWEDTDSAEQNELLLNTQFSRKF---DRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 75 ~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~---~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) | +++.-+..++.+. .....+...+...+ ..++.+..++.+++++|+|++.+.++. T Consensus 70 --p-~~~~~~~~~~~~~--~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~----------------- 127 (457) T protein:vir:13 70 --P-LSTYSKRGGSRKE--IVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQG----------------- 127 (457) T ss_pred --c-eEEEEecCCcccc--cccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC----------------- Confidence 2 3443333222222 22222333333322 245567778889999999998875431 Q ss_pred ccccccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) +.| .+.. ++|+... ++... . T Consensus 128 -----------------g~~~~l~~-------l~p~~v~--------v~~~~---------------------~------ 148 (457) T protein:vir:13 128 -----------------PNIVGLDV-------LDPTKIH--------VHMVM---------------------V------ 148 (457) T ss_pred -----------------CcEEEEEE-------EccCceE--------EEEec---------------------C------ Confidence 000 0111 2222110 00000 0 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA 310 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~ 310 (663) +. . . ...|+.+.+..+|. +.....+..+.||+. .++...+.++|.|++ T Consensus 149 --~~---~-----~------~~~~~~y~~~~~~~-~~~~~~~~~~diih~---------------~~~~~~~~~~G~s~i 196 (457) T protein:vir:13 149 --DG---L-----R------RKVFEAYDIDADGN-EVLLGWFTPRDVLHI---------------PGMMLPGDFVGCSPI 196 (457) T ss_pred --CC---c-----c------ceeEEEEEEecCCc-eeeEEeeCccceEEe---------------cCCCCCCccccccHH Confidence 00 0 0 00111111121221 111111222233332 222334557899988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhhhc------------cCCcceEeCCCCCccccccCccc Q lcl|NC_021532. 311 EMIGDNQKVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRKKF------------LAGANFEFNGTANDFWHGSYNAI 377 (663) Q Consensus 311 ~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~~~------------~p~~vi~~~~~~~~~~~~~~~~~ 377 (663) ..+.+.=.............+...+.|..++ -++.++++..... ..|+++.+. ++....++...+. T Consensus 197 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~ 275 (457) T protein:vir:13 197 SYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAKFSKVAMSPD 275 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChh Confidence 7777654444444333334444555555444 3455554432111 124455554 3344555554444 Q ss_pred cHHHHHHHHHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcC Q lcl|NC_021532. 378 PSSAFDMISLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA-YNAEFLE 456 (663) Q Consensus 378 ~~~~~~~~~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~-li~q~~~ 456 (663) -....+...+....+-.+.||+..+.|....+.. +++.+.+.. ..|.+..+.|+.+.+-. +....++ T Consensus 276 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~-~~sn~eq~~-----------~~f~~~tl~P~~~~ie~~ln~~L~~ 343 (457) T protein:vir:13 276 EAQFLQTRQFQVPEIARIFGVPPHLISDATNSTS-WGSGLAEQN-----------IAFTMFSLRPWLERIEAGFNRLLFA 343 (457) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccc-ccchHHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhcC Confidence 3445556667888888999999999996543321 222222211 12222233444333322 2223332 Q ss_pred CceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhh--hhh Q lcl|NC_021532. 457 EEEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQ--AKR 534 (663) Q Consensus 457 ~~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~--~~~ 534 (663) +..- ...|+.++.+.+.. + ...++.+....+... ..+.| .-......+..+++- ... T Consensus 344 ~~~~----~~~~i~fd~~~l~~----------~-D~~~r~~~~~~~~~~--G~~T~----NE~R~~~gl~Pi~~g~~d~~ 402 (457) T protein:vir:13 344 ETAD----RFRFVKFNLDEIKR----------G-APKERMELWSLGLQN--GIYSI----DEVRAAEDMTPLPDGLGEKY 402 (457) T ss_pred cccc----CceeEEeechhhhc----------c-CHHHHHHHHHHHHhC--CCcCH----HHHHHHhCCCCCCCCcccce Confidence 2210 11234444332210 0 011111111111110 11111 111111111111110 000 Q ss_pred hh-----------hhhcchhhHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 535 MR-----------EYEPKPDPVQEKIRQLE-LENLMLENQMLVASINDKNARANENTIDA 582 (663) Q Consensus 535 l~-----------~~~~~~~~~~~q~~q~~-~~~~q~~~~~~~a~~~~~~a~~q~~~~~~ 582 (663) .. +.++.+.+.+.+....+ .+..+.... . ..+.+-......++ T Consensus 403 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~---d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 403 RVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGK--P---DDEGATEEDDEDDA 457 (457) T ss_pred eeccccccccccccccccCCCCCCCCCccccCCCCCCCCC--C---ccccCCCCcccccC Confidence 00 00000000000000000 000000000 0 00000000000000 No 162 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=25.54 E-value=2 Score=18.88 Aligned_cols=369 Identities=11% Similarity=0.066 Sum_probs=134.0 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEE Q lcl|NC_021532. 1 MKINKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIK 80 (663) Q Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~ 80 (663) |..=+ ..... .+........|..+..+...+........+..+.|+..|+-|..++... . T Consensus 1 M~~f~--------~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~--------p 60 (397) T protein:vir:38 1 MPLLK--------LNKSH----SQGFSLNDPDWVNFLTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMV--------R 60 (397) T ss_pred Ccchh--------hhhcc----cCcccCCchhhhhhhcCCcCCceechHHhhccHHHHHHHHHHHHHHhhC--------c Confidence 33311 11000 0000000122333433221111112223455556666676665555322 1 Q ss_pred EEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccc Q lcl|NC_021532. 81 CTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQE 160 (663) Q Consensus 81 ~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (663) |. -++.. ...++.. -...-..+.++..++.+.+++|.|++.+-++.. T Consensus 61 ~~---~~~~~----~~~l~~~-PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~------------------------- 107 (397) T protein:vir:38 61 YT---SESDR----SQSIISN-PSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTN------------------------- 107 (397) T ss_pred cc---ccccH----HHHHHhc-CCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC------------------------- Confidence 11 11111 1111111 111123456677888899999999987655411 Q ss_pred cccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccccccc Q lcl|NC_021532. 161 VTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQ 239 (663) Q Consensus 161 ~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (663) +.+ .+.. ++|+... +. . +.+ T Consensus 108 --------g~~~~l~~-------l~~~~v~-------i~---~-----------~~~----------------------- 128 (397) T protein:vir:38 108 --------GVDLSWEY-------LRPSQVQ-------PM---L-----------LQD----------------------- 128 (397) T ss_pred --------CcEEEEEE-------EcCceeE-------EE---E-----------cCC----------------------- Confidence 000 1111 2222110 00 0 000 Q ss_pred ccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHH Q lcl|NC_021532. 240 FSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKV 319 (663) Q Consensus 240 ~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~ 319 (663) .. ...|. +..+....|... ++--+. .+++.+....+..||.|++..+...-.. T Consensus 129 -----~~-~~~y~--~~~~~~~~~~~~----~~~~~e---------------iih~~~~~~~~~~~G~s~i~~~~~~i~~ 181 (397) T protein:vir:38 129 -----GS-GLIYN--INFDEPAIGYME----NVPAAD---------------VIHIRLLSKNGGKTGISPLSALINEQQI 181 (397) T ss_pred -----Cc-eEEEE--EEecccccccee----EecCcc---------------EEEecCCCCCCccccccHHHHHHHHHHH Confidence 00 00111 111111111111 111111 2233333445567899999988887776 Q ss_pred HHHHHHHHHHHHHhcCCCcEEee-ccccCcchhhh-----------ccCCcceEeCCCCCccccccCccccHHHHHHHHH Q lcl|NC_021532. 320 KTAVIRGIIDNMAQSNNGQVAIR-KGALDQTNRKK-----------FLAGANFEFNGTANDFWHGSYNAIPSSAFDMISL 387 (663) Q Consensus 320 ~N~~~~~~~~~~~~~~~~~~~~~-~~~i~~~d~~~-----------~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (663) .+.........+...+.|..++. ++.+++++... ...|+++.+. ++..+..+..++.-....+..++ T Consensus 182 ~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~ 260 (397) T protein:vir:38 182 KDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVID-ALEDYKPLEVKGNIASLLNQVDW 260 (397) T ss_pred HHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecC-CCceEEecCCChhHHHHHHHHHH Confidence 66666655555566666665543 34444332111 1134444333 33344444444444445666778 Q ss_pred HHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCceEEEEecC Q lcl|NC_021532. 388 MNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMA-YNAEFLEEEEVIRVTND 466 (663) Q Consensus 388 ~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~-li~q~~~~~~~iri~~~ 466 (663) ....+-.+.||+....|...+.. +. +.+. .. .+ ...+.|+...+-. +..+++++-. +.+ T Consensus 261 ~~~~Ia~afgVp~~~lg~~~~~~--~~--~e~~----~~-------~~-~~~l~P~~~~ie~~ln~~l~~~~~-~~~--- 320 (397) T protein:vir:38 261 TRDQIAKVYGVPDSYLNGQGDQQ--SS--ITQI----SG-------QY-AKSLNRYVQAIVGELNDKLHANIS-ANI--- 320 (397) T ss_pred HHHHHHHHhCCCHHHhCCCCCcc--cH--HHHH----HH-------HH-HHHHHHHHHHHHHHHHHhccChhc-ccc--- Confidence 88888899999999999654322 11 1111 00 11 1123344443322 2223332210 111 Q ss_pred eeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhh-----hh------hhh Q lcl|NC_021532. 467 KFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPE-----QA------KRM 535 (663) Q Consensus 467 ~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e-----~~------~~l 535 (663) ++.+ .. + ..++.+.+..++.. ..+.|.-.+.. +.+..++. .. ... T Consensus 321 ------------~~~~--~~-d---~~~~~~~~~~~~~~--G~~t~nE~R~~----lg~~p~~~~d~~~~~~~~~~~~~~ 376 (397) T protein:vir:38 321 ------------RFAI--DA-M---GDQYASTISSSVKG--GTIAGNQARFI----LQNSGYLAKDLPDPEKEPQQAIQL 376 (397) T ss_pred ------------cccc--cC-C---HHHHHHHHHHHHhC--CCcCHHHHHHH----hCCCCCCCCccccccccccccccc Confidence 1111 11 0 11222222211111 11122111111 11111100 00 000 Q ss_pred hhhhcch---hhHHHHhhHHH Q lcl|NC_021532. 536 REYEPKP---DPVQEKIRQLE 553 (663) Q Consensus 536 ~~~~~~~---~~~~~q~~q~~ 553 (663) ....+.. ....++....+ T Consensus 377 ~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 377 IQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred cccccCCCCCCCCCCCCCCCC Confidence 0000000 00000000000 No 163 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=24.30 E-value=2.2 Score=18.71 Aligned_cols=389 Identities=12% Similarity=0.129 Sum_probs=144.1 Q ss_pred CcHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCcCCccc-cC---CCccccHHHHHHHHHHHHHHHHhhcCCCc Q lcl|NC_021532. 3 INKAELLSALKADMK-AADVLKQEQDSLISTWKAEYNGEPYGNEQ-KG---KSAIVSRDIKKQSEWQHATIVDPFVSTAD 77 (663) Q Consensus 3 ~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-~g---~s~~~~~~i~~~v~~~~~~l~~~~~~~~~ 77 (663) |+++-|++.++..+- .+..... + ..+....|..+. -| ...+..+.|+..|+-|..++.. =| T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~---~------~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~-----lp 66 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSA---S------KLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMAS-----LP 66 (409) T ss_pred CccccchhhhhhHHhhhhhcccc---c------cccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhh-----Cc Confidence 888999888887632 2221110 0 111111221110 00 1123345555556655444432 12 Q ss_pred eEEEEeCC-cchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccc Q lcl|NC_021532. 78 IIKCTPIT-WEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETV 156 (663) Q Consensus 78 ~~~~~p~~-~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 156 (663) +++.-.+ ..+...+ .+++.--...-+.+...+.++.+++++|.|++.+.++.. T Consensus 67 -~~~~~~~~~~~~~l~----~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~--------------------- 120 (409) T protein:vir:96 67 -LKMYEDYKVVNTEVS----DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY--------------------- 120 (409) T ss_pred -eEEeecccccchhHH----HHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC--------------------- Confidence 2332222 2222222 223221111224566677889999999999987654310 Q ss_pred cccccccceeecccceeeeccHHHhe-eCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccccc Q lcl|NC_021532. 157 VEQEVTETVVKKNQPTARVCRNEDIY-LDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDD 235 (663) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~v~~~~~~-~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 235 (663) +.| -.++ ++|.... .. . +. + T Consensus 121 ------------G~~-------~~L~~l~~~~v~------v~---~------------~~-------------------~ 141 (409) T protein:vir:96 121 ------------HQP-------SKLFLLNPDVVE------ML---I------------EN-------------------Q 141 (409) T ss_pred ------------CcE-------EEEEEEcCceeE------EE---E------------eC-------------------C Confidence 110 0111 2222110 00 0 00 0 Q ss_pred ccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHH Q lcl|NC_021532. 236 TEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGD 315 (663) Q Consensus 236 ~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d 315 (663) . . +.||++. ..+|. ..++..+.|+|. ..++-.+.++|.|++..+.+ T Consensus 142 --------~-~-----~~~y~~~-~~~g~----~~~~~~~evih~---------------r~~~~~~~~~G~s~l~~~~~ 187 (409) T protein:vir:96 142 --------S-R-----ELYYSIH-AATGN----KLIVHNMDMLHF---------------KHIVASNMVQGISPIDVLKN 187 (409) T ss_pred --------C-c-----EEEEEEE-cCCce----EEEEccccEEEe---------------CCCCCCCccccccHHHHHHH Confidence 0 0 0111111 11111 112222233332 22222455789999988887 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhhh---------ccCCcceEeCCCCCccccccCccccHHHHHHH Q lcl|NC_021532. 316 NQKVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRKK---------FLAGANFEFNGTANDFWHGSYNAIPSSAFDMI 385 (663) Q Consensus 316 ~Q~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~~---------~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~ 385 (663) .....+...+..+.+. ...+.+++ ..+.++++.... ...|+++.+. ++..+.++...+......+.. T Consensus 188 ~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~ 264 (409) T protein:vir:96 188 TTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHHHHHHHHHhc--CCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHH Confidence 7776666554433322 22233444 334455433221 1245555443 344555555554444556666 Q ss_pred HHHHHHHHHHhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhcCCceEEEEe Q lcl|NC_021532. 386 SLMNNEIESITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYN-AEFLEEEEVIRVT 464 (663) Q Consensus 386 ~~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li-~q~~~~~~~iri~ 464 (663) .+....+-.+.||++...|...++.-.+ +.+ ..+.|....+.|+.+.+-.-+ ...++.... . T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~s~---~e~-----------~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~---~ 327 (409) T protein:vir:96 265 NLTRERVANVFQLPSIFLNARSNTNFAK---NEE-----------LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDR---E 327 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCccc---HHH-----------HHHHHHHHHHHHHHHHHHHHHHhhcCCcccc---c Confidence 6777888899999999999643321111 111 112233333455554443322 223322111 0 Q ss_pred cCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhh--hhcch Q lcl|NC_021532. 465 NDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMRE--YEPKP 542 (663) Q Consensus 465 ~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~--~~~~~ 542 (663) .+.++.+|.+.+. -....++.+.+..++.. +-+.| .-......+..++.-...+.. +.+-. T Consensus 328 ~g~~i~fd~~~ll-----------~~d~~~~~e~~~~~~~~--G~~T~----NE~R~~~g~~pi~ggD~~~~~~n~~~~~ 390 (409) T protein:vir:96 328 KNRYFKFNVKSYL-----------RADSATQAEVYFKAVRS--GYYTI----NDIREWEDLPPVEGGDKPLISGDLYPID 390 (409) T ss_pred CcceEEeechhhh-----------ccCHHHHHHHHHHHHhC--CCCCH----HHHHHHhCCCCCCCcceeeecccccccc Confidence 0112222221110 00011222222111111 11111 111111112222111110000 00000 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQLELENLMLENQMLVASINDKNA 573 (663) Q Consensus 543 ~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a 573 (663) .....+. .. . --.. -.... T Consensus 391 ~~~~~~~-~~-----~----gG~~--n~~e~ 409 (409) T protein:vir:96 391 TPLELRK-SL-----K----GGDK--NVNES 409 (409) T ss_pred cchhhcc-cc-----c----CCCC--CcCCC Confidence 0000000 00 0 0000 00000 No 164 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=24.21 E-value=2.2 Score=18.70 Aligned_cols=468 Identities=13% Similarity=0.037 Sum_probs=154.0 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CcCCccc-cCCCccccHHHHH--HHHHHHHHHHHhhcCC Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNG----EPYGNEQ-KGKSAIVSRDIKK--QSEWQHATIVDPFVST 75 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~----~~~~~~~-~g~s~~~~~~i~~--~v~~~~~~l~~~~~~~ 75 (663) +++..+.+++...... .+.+..|..| +...+.. .|.+.-++..+.. +|.+.+.......++- T Consensus 1 v~~~~l~~e~at~~~~-----------~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~ 69 (488) T protein:vir:99 1 MEKPALGREIATSGDG-----------RDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSR 69 (488) T ss_pred CCccchhHHHHHHHhh-----------hhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcC Confidence 5555555554422111 1111222211 1111111 1112112222222 4455555555555442 Q ss_pred CceEEEEeCCc--chHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccc Q lcl|NC_021532. 76 ADIIKCTPITW--EDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGN 153 (663) Q Consensus 76 ~~~~~~~p~~~--~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~ 153 (663) -+.|+|-+. .|.+.|+.+.+. +.. .++..++.+++ |++.+|++++++.|...+.. T Consensus 70 --~w~i~p~~~~~~~~~~ae~v~~~----l~~-~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~--------------- 126 (488) T protein:vir:99 70 --EWKVEAGGDRPIDQAAAEHLEQQ----LQR-VGWDRVTSKML-FGVFYGYAVSELIYGRDDRY--------------- 126 (488) T ss_pred --CceEEcCCCChHHHHHHHHHHHH----HhC-CCHHHHHHHHH-hhhhhcceeEEEEEeecCCe--------------- Confidence 256777542 333444444444 332 34555666555 89999999999988632100 Q ss_pred ccccccccccceeecccceeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhcccc Q lcl|NC_021532. 154 ETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSP 233 (663) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~ 233 (663) ..-..+..+++..|.+|+... ..+ T Consensus 127 --------------~~~~~l~~r~~~~f~~d~~~~-------l~~----------------------------------- 150 (488) T protein:vir:99 127 --------------ITLEAIKVRNRRRFRYDQDGG-------LRL----------------------------------- 150 (488) T ss_pred --------------eeEeeeeeecccceeecCCCc-------eEE----------------------------------- Confidence 000022334444444443210 000 Q ss_pred ccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHH Q lcl|NC_021532. 234 DDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMI 313 (663) Q Consensus 234 ~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~ 313 (663) + .+ +++ .+ ..|++.. +-|++..+.+..+++||.|+.+.+ T Consensus 151 ---------------~-----~~----~~~---------~~-------g~~lp~~-~~~i~~~~~~~~g~p~g~gLl~~~ 189 (488) T protein:vir:99 151 ---------------L-----TP----NNM---------FE-------GEPCPAP-YFWHFSTGADNDDEPYGLGLAHWL 189 (488) T ss_pred ---------------e-----cc----CCC---------CC-------ccccccC-ceEEEEeecCCCCCcccchHHHHH Confidence 0 00 000 00 0122110 124555667778899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcEEee--ccccCcchhh-------hccCCcceEeCCCCCccccccCcccc-HHHHH Q lcl|NC_021532. 314 GDNQKVKTAVIRGIIDNMAQSNNGQVAIR--KGALDQTNRK-------KFLAGANFEFNGTANDFWHGSYNAIP-SSAFD 383 (663) Q Consensus 314 ~d~Q~~~N~~~~~~~~~~~~~~~~~~~~~--~~~i~~~d~~-------~~~p~~vi~~~~~~~~~~~~~~~~~~-~~~~~ 383 (663) ..+=-..+...+.....+...+.|..+.- ...-++.+.. ....+++..+ |.+..+..+....-. ..... T Consensus 190 ~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~vi-P~~~~ie~~ea~~~~~~~~~~ 268 (488) T protein:vir:99 190 YWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIM-PAGMQAELLEAGRSGTADYKT 268 (488) T ss_pred HHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEe-cCCceeEEeecCCCChHHHHH Confidence 99888888877777777777777755432 2111222211 1222332222 334556665543333 33567 Q ss_pred HHHHHHHHHHHH-hCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021532. 384 MISLMNNEIESI-TGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEEEEVIR 462 (663) Q Consensus 384 ~~~~~~~~~~~~-tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~~~~ir 462 (663) ++++.+..|..+ .|=+-.+.+. ++ +..++.+.. +--...+..-++.+.+++-+.+...++.++ +...+.- T Consensus 269 li~~~d~~Isk~iLGqtlts~~~--~G-s~a~~~vh~--~v~~d~~~aDa~~i~~tln~~li~~l~~~N---~~~~~~p- 339 (488) T protein:vir:99 269 LHDTMDATIAKVGLGQVASTQGT--PG-RLGNDDLQA--DVRLDLVKADADLICESFNLGPARWLTEWN---FPGAQPP- 339 (488) T ss_pred HHHHHHHHHHHHHhhhhhccccc--cc-chhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---cCCcCCc- Confidence 888888777543 3433221111 00 111111111 111122233334444443334444444443 2111110 Q ss_pred EecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhhhhhhhhhcch Q lcl|NC_021532. 463 VTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQAKRMREYEPKP 542 (663) Q Consensus 463 i~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~~~l~~~~~~~ 542 (663) .+. +............+.+..+..+.+-.+ ....+.+ .-+++.....-....+.+ T Consensus 340 ----------------~~~--~~~~e~edl~~~a~~~~~l~~~~G~~i----~~~~i~e---~~Gip~~~~~~~~~~~~~ 394 (488) T protein:vir:99 340 ----------------RVY--RVIEEPEDITAKAERDEKVFRMSGFRP----TRGYVQE---TYGVEVESTQAEATAPTP 394 (488) T ss_pred ----------------eeE--ecCCCcccHHHHHHHHHHHHhhcCCCC----CHHHHHH---HcCCCCcccccccccCCC Confidence 011 111111111122222222222212112 1222222 222221110000000000 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 543 DPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDNGYAHLEQ 622 (663) Q Consensus 543 ~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~~~~~~~~ 622 (663) .......... .... ..+..++.. ..+...+.-+.+.....+... - .. +.+......-.... T Consensus 395 ~~~~~~~~~~-~~~~----~~~~~~~~~----~~~~~~~~~~~~i~~~l~~a~--s------~e--e~~~~L~~l~~~~d 455 (488) T protein:vir:99 395 STEFAEGDQP-SDPA----AAMAPQLAE----AMQPVVGNWTTQLRTLIEQAS--S------LE--DLRERLLDLAPQLS 455 (488) T ss_pred cccCCCCCCC-CCch----HHHHHHHHH----HHHHHHHHHHHHHHHHHHhcC--C------HH--HHHHHHHHHhccCC Confidence 0000000000 0000 000000000 000000000000000000000 0 00 00000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_021532. 623 VELEDLRHAQHLEREAMKHRANLEQMLAQRNAGDTNI 659 (663) Q Consensus 623 ~~~~~~~~~~~~~~e~~k~~~~~e~~~~~~~~~~~~~ 659 (663) -..-+......+....+.=+.+. .... ....+| T Consensus 456 ~~~l~~~l~~a~~~a~l~G~~~~--~~e~--~~~~~~ 488 (488) T protein:vir:99 456 LDQYAQAMAEGLEAAHLAGRNDV--QEEL--DGREQI 488 (488) T ss_pred HHHHHHHHHHHHHHHHHhhhhhH--hhhh--cccCCC Confidence 00000000001111111101000 0000 001111 No 165 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=23.03 E-value=2.4 Score=18.54 Aligned_cols=454 Identities=11% Similarity=0.034 Sum_probs=150.5 Q ss_pred CCCc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcC----CcCCccccCCCccccHHHHH--HHHHHHHHHHHh Q lcl|NC_021532. 1 MKIN--KAELLSALKADMKAADVLKQEQDSLISTWKAE-YNG----EPYGNEQKGKSAIVSRDIKK--QSEWQHATIVDP 71 (663) Q Consensus 1 ~~~~--~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-y~~----~~~~~~~~g~s~~~~~~i~~--~v~~~~~~l~~~ 71 (663) ++.+ +.++...+...- ..+..+ +.| ...-++.+|++.-++..+.. +|-+.+...... T Consensus 13 ~~~~~~~~~~~~~ia~~~--------------~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~Rk~a 78 (491) T protein:vir:79 13 VKFGEPDKSLSSQIATRA--------------RSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRKAA 78 (491) T ss_pred ccccccchhHHHHHhhhc--------------cccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHHHHH Confidence 3322 233333332110 011111 111 11112334444323333321 344444444444 Q ss_pred hcCCCceEEEEeCCcchHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcc Q lcl|NC_021532. 72 FVSTADIIKCTPITWEDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEY 151 (663) Q Consensus 72 ~~~~~~~~~~~p~~~~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~ 151 (663) .++- -+.|.|-.. |.+.|+..++.++. .++..++.++ .|++.+|++++++.|+.++.. T Consensus 79 v~~~--~w~i~~~~~-~~~~a~~i~e~l~~-----~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~------------- 136 (491) T protein:vir:79 79 VKAL--EWGLDRGKA-KSRVAKSIADVFAD-----LDLSRIATEM-LDAVLYGYQPMEITWGKVGNY------------- 136 (491) T ss_pred HhCC--CcEEecCCC-CHHHHHHHHHHHhc-----CCHHHHHHHH-HHhhhhcceeEEEEEeecCCe------------- Confidence 4432 356777544 34556666655543 2444555555 489999999999999642110 Q ss_pred ccccccccccccceeecccc-eeeeccHHHheeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhc Q lcl|NC_021532. 152 GNETVVEQEVTETVVKKNQP-TARVCRNEDIYLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDY 230 (663) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 230 (663) ..| .+..+++..|.+|+... .++ T Consensus 137 -----------------~~~~~l~~r~~~~f~~d~~~~-------l~l-------------------------------- 160 (491) T protein:vir:79 137 -----------------IVPIDVVGKPADWFVYDPENQ-------LRF-------------------------------- 160 (491) T ss_pred -----------------eeEEeeeeecccceeeccCCc-------eEE-------------------------------- Confidence 000 23334444444443210 000 Q ss_pred cccccccccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChH Q lcl|NC_021532. 231 DSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANA 310 (663) Q Consensus 231 ~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~ 310 (663) +. .++ ..++ .|++.. =|+++++....++.||.|++ T Consensus 161 ------------------~~---------~~~---------~~~g-------~~lp~~--k~i~~~~~~~~g~p~g~gLl 195 (491) T protein:vir:79 161 ------------------RS---------KEH---------WVQG-------EELPAR--KFLVPRQEATYLNPYGFPDL 195 (491) T ss_pred ------------------ee---------cCC---------CCCc-------eeecCC--CeEEEEecCCCCCcccchhH Confidence 00 000 0000 122211 26777777888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEE--eeccccCcchhh-------hccCCcceEeCCCCCccccccCccc---c Q lcl|NC_021532. 311 EMIGDNQKVKTAVIRGIIDNMAQSNNGQVA--IRKGALDQTNRK-------KFLAGANFEFNGTANDFWHGSYNAI---P 378 (663) Q Consensus 311 ~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~--~~~~~i~~~d~~-------~~~p~~vi~~~~~~~~~~~~~~~~~---~ 378 (663) +.+..+=-..+...+....-+...+.|..+ ++.++-+. +.. ....++.+.+ +.+..+..+..... . T Consensus 196 ~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~-ek~~l~~al~~~~~~a~~vi-P~~~~ie~~ea~~~~g~~ 273 (491) T protein:vir:79 196 SMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDA-ETNLLLDRLEDMVQDAVAVI-PDDSSIEIKEAAGKSGSA 273 (491) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHH-HHHHHHHHHHHHhcCeEEEe-cCCceeEEEeccCCCCCh Confidence 999999999998888888888887777543 44444332 211 1222222222 33455666544322 2 Q ss_pred HHHHHHHHHHHHHHHH-HhCCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021532. 379 SSAFDMISLMNNEIES-ITGTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNAEFLEE 457 (663) Q Consensus 379 ~~~~~~~~~~~~~~~~-~tGi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~~~~ 457 (663) .....++++.+..|.. +.|=+-.+.+.. +..++.+.. +-....+..-.+.+.+++- .++..++.++ |... T Consensus 274 ~~y~~li~~~d~~Isk~iLGqtlTt~~~g----s~a~~~vh~--~v~~~i~~~D~~~i~~tln-~li~~l~~~N--~~~~ 344 (491) T protein:vir:79 274 DVYERLLHFCRGEVSIALLGQNQTTEATS----TRASAQAGL--EVTDDIRDGDKAIVVEAMN-MLIRWICDLN--FDGA 344 (491) T ss_pred hHHHHHHHHHHHHHHHHHhhhhhccCccc----chhhHHHHH--HHHHHHHHHHHHHHHHHHH-HHHHHHHHhc--CCCC Confidence 2356788888777654 333321111100 111111111 0011112222223332222 1332222222 2222 Q ss_pred ceEEEEecCeeeccchhhcCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhh--hhh Q lcl|NC_021532. 458 EEVIRVTNDKFVPIRKDDLSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQA--KRM 535 (663) Q Consensus 458 ~~~iri~~~~~v~i~~~~~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~--~~l 535 (663) +.. + |....+++ ......+.+.. +..++ ..+....+.+ .-+++... +.. T Consensus 345 ~~p-~-----f~~~e~ee---------------~~~~~a~~~~~-L~~~G----~~i~~~~~~e---~~Gip~~~~~e~~ 395 (491) T protein:vir:79 345 ARP-V-----FDMWEQEQ---------------VDEIQAGRDEK-LTRAG----ARFTPAYFKR---AYNLQDGDLDERP 395 (491) T ss_pred Ccc-e-----EeecCcCc---------------hhHHHHHHHHH-HHhCC----CccCHHHHHH---HhCCCCCCCCccc Confidence 111 1 10001110 00111111111 11111 1122222222 22222111 100 Q ss_pred hhhhcchhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 536 REYEPKPDPVQEKIRQLELENLMLENQMLVASINDKNARANENTIDAELKRSKAAVEKAKARKLSSEADMTDLKFVKEDN 615 (663) Q Consensus 536 ~~~~~~~~~~~~q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~~~~~~~~~~e~~~~q~~~~~~~~~~~e~~~~~~ 615 (663) ........+............ + ...........+..-+...+.-+.+-...++ ++. ... +.+.... T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~-~---~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~--~~~------s~~--e~~~~L~ 461 (491) T protein:vir:79 396 LPVSAVDAVGAASFAEFEAPD-Q---DALDAALNALSARDLNADAQALVAPLLKRIA--NGA------SAD--ELLGMLA 461 (491) T ss_pred cCcCcccccccccccccCCCC-C---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hcC------CHH--HHHHHHH Confidence 000000000000000000000 0 0000000000000000000000000000000 000 000 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 616 GYAHLEQVELEDLRHAQHLEREAMKHRANL 645 (663) Q Consensus 616 ~~~~~~~~~~~~~~~~~~~~~e~~k~~~~~ 645 (663) ..-....-..-+......+....+.=+.+. T Consensus 462 ~l~~~~d~~~l~~~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 462 ELYPSLDTDALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 000000000000000000111111111101 No 166 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=22.58 E-value=2.4 Score=18.47 Aligned_cols=411 Identities=12% Similarity=0.051 Sum_probs=119.1 Q ss_pred ccccCCCccccHHHHHHHHHHHHHHHHhhcCCCceEEEEeCCcch-HHHH-HHHHHHHhHHHHhccc------------h Q lcl|NC_021532. 44 NEQKGKSAIVSRDIKKQSEWQHATIVDPFVSTADIIKCTPITWED-TDSA-EQNELLLNTQFSRKFD------------R 109 (663) Q Consensus 44 ~~~~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D-~~~A-e~~~~~~~~~~~~~~~------------~ 109 (663) ++.=-+ ..+.|+..|+-+..++... + +.+.++...+ ...+ .....+.++++....+ . T Consensus 1 l~~l~~---~n~~v~~ci~~ia~~ia~~--p----~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~ 71 (467) T protein:vir:31 1 MAELLE---HNETHAKCVHAKSRYVAGF--G----INIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATA 71 (467) T ss_pred Chhhhh---cCHHHHHHHHHHHHhhhcC--C----eEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHH Confidence 111000 1233444444444433211 2 4555554322 1222 2333333344332221 2 Q ss_pred hHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCccccccccccccccceeecccceeeeccHHHheeCccccc Q lcl|NC_021532. 110 FNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVVEQEVTETVVKKNQPTARVCRNEDIYLDPTCQD 189 (663) Q Consensus 110 ~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~dp~a~~ 189 (663) ...+..++.+++++|.|++++.++...+. ..+..|+|..+-+.+ T Consensus 72 ~~~~~~~~~~l~l~Gn~~i~~~r~~~G~~--------------------------------~~l~~l~~~~v~~~~---- 115 (467) T protein:vir:31 72 TNVLQTAWTDYEAIGWLTIEILTQTDGTP--------------------------------TGLAYVPGHTIRKRM---- 115 (467) T ss_pred HHHHHHHHHHHHhcCCeEEEEEECCCCcE--------------------------------EEEEEeCCceeEeee---- Confidence 34556788899999999998766421100 012223332221111 Q ss_pred ChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccccccccccccceEEEEEEEEEeeecCCceeEEEE Q lcl|NC_021532. 190 NLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDTEFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIV 269 (663) Q Consensus 190 d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~ 269 (663) +...|+.. . ..... ++. ..+.......--+.+.......++. .... T Consensus 116 --d~~~~~~~-~-----~~~~~-~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 161 (467) T protein:vir:31 116 --DERGFVQL-L-----EEKEK-YFG------------------------VAGDRYQTNGNGDLDPVFVDADDGS-TGTS 161 (467) T ss_pred --ecceeEee-c-----CCcee-eEE------------------------eccccceeecccceeeeeeeecccc-ccce Confidence 01111100 0 00000 000 0000000000000000000000000 0001 Q ss_pred EEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEe--eccccC Q lcl|NC_021532. 270 CAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDNQKVKTAVIRGIIDNMAQSNNGQVAI--RKGALD 347 (663) Q Consensus 270 ~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~Q~~~N~~~~~~~~~~~~~~~~~~~~--~~~~i~ 347 (663) ..+-.+.||| +..+...+.+||.|.+..+...-...+....-....+...+.|..++ ..+.++ T Consensus 162 ~~~~~~diih---------------~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~ 226 (467) T protein:vir:31 162 VSNPANELIF---------------KRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELT 226 (467) T ss_pred eEeccccEEE---------------ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCC Confidence 1111122222 22222345678888888776654444333333333334444555443 344565 Q ss_pred cchhhhcc---------CC--------------cceEeCCCCCc-----cccccCccccH---HHHHHHHHHHHHHHHHh Q lcl|NC_021532. 348 QTNRKKFL---------AG--------------ANFEFNGTAND-----FWHGSYNAIPS---SAFDMISLMNNEIESIT 396 (663) Q Consensus 348 ~~d~~~~~---------p~--------------~vi~~~~~~~~-----~~~~~~~~~~~---~~~~~~~~~~~~~~~~t 396 (663) ++.....+ ++ .+..+ +++.. +...+....++ ............|-.+. T Consensus 227 ~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l-~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~f 305 (467) T protein:vir:31 227 EKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNL-ADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVH 305 (467) T ss_pred HHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccc-cCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHh Confidence 44322110 11 11111 11111 11112222222 23445556677788899 Q ss_pred CCChHHcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCceEEEEecCeeeccchhh Q lcl|NC_021532. 397 GTKSFSGGINSGSLGSTATGARGALDATATRRMNIVRNIAENLVKPLMRKWMAYNA-EFLEEEEVIRVTNDKFVPIRKDD 475 (663) Q Consensus 397 Gi~~~~~G~~~~~~~~tA~~i~~~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~-q~~~~~~~iri~~~~~v~i~~~~ 475 (663) ||+....|....+ +.++.+.+. ..+|.+..+.|+.+.+-..+- .+++.. .+.. +.++.++... T Consensus 306 gVpp~~lG~~~~~--~~~s~~e~~-----------~~~f~~~~l~P~~~~ie~~ln~~l~~~~--~~~~-~~~i~f~~~~ 369 (467) T protein:vir:31 306 DVPPVIAGVVESG--AFSTDAEEQ-----------RKEFAEETIQPKQHDFGELLYELVHKQG--LDAP-DWTIEFELAK 369 (467) T ss_pred CCCHHHcccCCCC--CcccCHHHH-----------HHHHHHHHHHHHHHHHHHHHHHhhcchh--hccC-CceEEEecch Confidence 9999999964322 122222211 112222223444333332221 222110 0000 1111121111 Q ss_pred cCCceEEEEeecccchhHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHhhhhhhhh-----hhhhhhhcchhhH---HH Q lcl|NC_021532. 476 LSGRIDIDISISTAEDNAAKSQELSFLLQTLGPNEDPKIRRDIMADIMDLMRMPEQA-----KRMREYEPKPDPV---QE 547 (663) Q Consensus 476 ~~~~~d~~v~~~~~~~~~~~~q~l~~~~~~~~~~~~p~~~~~~l~~~~~l~~~~e~~-----~~l~~~~~~~~~~---~~ 547 (663) + .......+.+....+.. .+.+.+.- +.....+..+++-. ...........+. .. T Consensus 370 l-----------~~~d~~~~~~~~~~~~~--~G~~T~NE----~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (467) T protein:vir:31 370 P-----------DTKLQDVEIASQRVQAM--QGLLTVNE----LRDEFGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGD 432 (467) T ss_pred h-----------hccCHHHHHHHHHHHHh--CCCcCHHH----HHHHhCCCCCCcccccCCcccccccccccCCCCcccC Confidence 1 00011111111111111 00111111 11111111111100 0000000000000 00 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021532. 548 KIRQLELENLMLENQMLVASINDKNARANENTIDA 582 (663) Q Consensus 548 q~~q~~~~~~q~~~~~~~a~~~~~~a~~q~~~~~~ 582 (663) +..+....+......-+++....++--+..++++- T Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 433 QIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred cCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 00000000000000001111110000000000100 No 167 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=22.15 E-value=2.5 Score=18.41 Aligned_cols=361 Identities=12% Similarity=0.135 Sum_probs=137.7 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCCccc----cCCCccccHHHHHHHHHHHHHHHHhhcCCCce Q lcl|NC_021532. 3 INKAELLSALKADMKAADVLKQEQDSLISTWKAEYNGEPYGNEQ----KGKSAIVSRDIKKQSEWQHATIVDPFVSTADI 78 (663) Q Consensus 3 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~----~g~s~~~~~~i~~~v~~~~~~l~~~~~~~~~~ 78 (663) |+++-|.+.++..+-.-.. ..- .. ..+.-..|..+. -....+..+.|+..|+-|..++.. -| T Consensus 1 ~~~~~~~~~~k~~~~~~~~--~~~---~~---~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~-----lp- 66 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWI--DQS---AS---KLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMAS-----LP- 66 (409) T ss_pred CcccccchhhhhHHhhhhh--cCC---cc---cccccccccCccccccchhhhhccHHHHHHHHHHHHhhhh-----Cc- Confidence 8889998888886422110 000 00 111101111110 001233445566666655554432 12 Q ss_pred EEEEeCCc-chHHHHHHHHHHHhHHHHhccchhHHHHHHHHHHHhcCceEEEeeeccccceecccccccccCcccccccc Q lcl|NC_021532. 79 IKCTPITW-EDTDSAEQNELLLNTQFSRKFDRFNFMSKAVKVLDREGTLVVQTGWDYEDEEVTVMGEAVVVDEYGNETVV 157 (663) Q Consensus 79 ~~~~p~~~-~D~~~Ae~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~G~g~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 157 (663) +++.-.+. .+...+. +++.--...-+.+...+.++.+++++|.|++.+.++.. T Consensus 67 ~~~~~~~~~~~~~~~~----lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---------------------- 120 (409) T protein:vir:94 67 LKMYEDYKVVNTEVSD----LLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---------------------- 120 (409) T ss_pred eeEeecccccchhHHH----HHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---------------------- Confidence 23332222 2222222 23221122234566677888999999999987654310 Q ss_pred ccccccceeecccceeeeccHHHh-eeCcccccChhhCceEEEEeecCHHHHHHhcCCcChhhhhhccchhhhccccccc Q lcl|NC_021532. 158 EQEVTETVVKKNQPTARVCRNEDI-YLDPTCQDNLDNAQFVIHRYETDLSTLKKDGRYKNLDKLAKTSGEDFDYDSPDDT 236 (663) Q Consensus 158 ~~~~~~~~~~~~~~~~~~v~~~~~-~~dp~a~~d~~d~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 236 (663) +.| + .+ +++|+... .. . +. + T Consensus 121 -----------G~~----~---~L~~l~~~~v~------v~---~------------~~-------------------~- 141 (409) T protein:vir:94 121 -----------HQP----S---KLFLLNPDVVE------ML---I------------EN-------------------Q- 141 (409) T ss_pred -----------CcE----E---EEEEEcCceeE------EE---E------------eC-------------------C- Confidence 111 0 11 12222110 00 0 00 0 Q ss_pred cccccccccceEEEEEEEEEeeecCCceeEEEEEEEECCEEEecccCCCcCCCCCEEEEeeeeecCcccCCChHHHHHHH Q lcl|NC_021532. 237 EFQFSDAPRKKLIIYEYWGNYDVDGDGIAEPIVCAWINDVIVRLQSNPYPDGKPPFLVVPFNSIPFKLHGEANAEMIGDN 316 (663) Q Consensus 237 ~~~~~d~~~~~v~v~E~w~~~~~~~~g~~~~~~~~~~g~~~l~~~~~p~~~~~~Pf~~~~~~~~~~~~~g~g~~~~~~d~ 316 (663) .. ..||.+. ..+|. ..++..+.|+|. ..+...+.++|.|++..+.+. T Consensus 142 --------~~-----~~~y~~~-~~~g~----~~~~~~~dvih~---------------r~~~~~~~~~G~s~l~~~~~~ 188 (409) T protein:vir:94 142 --------SR-----ELYYSIH-AATGN----KLIVHNMDMLHF---------------KHIVASNMVQGISPIDVLKNT 188 (409) T ss_pred --------Cc-----EEEEEEE-cCCce----EEEEccccEEEe---------------cCCCCCCccccccHHHHHHHH Confidence 00 0111111 11111 112222334433 111223557899999888887 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcEEe-eccccCcchhhh---------ccCCcceEeCCCCCccccccCccccHHHHHHHH Q lcl|NC_021532. 317 QKVKTAVIRGIIDNMAQSNNGQVAI-RKGALDQTNRKK---------FLAGANFEFNGTANDFWHGSYNAIPSSAFDMIS 386 (663) Q Consensus 317 Q~~~N~~~~~~~~~~~~~~~~~~~~-~~~~i~~~d~~~---------~~p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ 386 (663) ....+......+.+.. ..+.+++ ..+.++++.... ...|+++.+. ++..+.++...+......+... T Consensus 189 i~~~~~~~~~~~~~~~--~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 265 (409) T protein:vir:94 189 TDFDNAVRTFNLTEMQ--KPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred HHHHHHHHHHHHHhcC--CCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHH Confidence 7776665444332222 2233333 334454432111 2345555543 3344555554444445556666 Q ss_pred HHHHHHHHHhCCChHHcCCCcccchhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCceEEEEe Q lcl|NC_021532. 387 LMNNEIESITGTKSFSGGINSGSLGSTATGARGA-LDATATRRMNIVRNIAENLVKPLMRKWMAYNAEF-LEEEEVIRVT 464 (663) Q Consensus 387 ~~~~~~~~~tGi~~~~~G~~~~~~~~tA~~i~~~-~~~~~~~l~~~~~~~~~~~~~~l~~~~~~li~q~-~~~~~~iri~ 464 (663) .....+-.+.||++...|...++ +.+.+.+. ..--...+..++..+...+-+.|+ ..+ .+....|++. T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~---~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll-------~~~~~~~~~~i~fd 335 (409) T protein:vir:94 266 LTRERVANVFQLPSVFLNARSNT---NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLL-------TKTDREKNRYFKFN 335 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCC---CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhC-------CcccccCcceEEee Confidence 77788888999999999964332 11112211 111222333444444322221111 000 0111223321 Q ss_pred c--------------------CeeeccchhhcCC--------ceEEE-Eeeccc-chhHHHHHHHHHHHHHhccCCCcch Q lcl|NC_021532. 465 N--------------------DKFVPIRKDDLSG--------RIDID-ISISTA-EDNAAKSQELSFLLQTLGPNEDPKI 514 (663) Q Consensus 465 ~--------------------~~~v~i~~~~~~~--------~~d~~-v~~~~~-~~~~~~~q~l~~~~~~~~~~~~p~~ 514 (663) - ..++++| +++. ..|.- +..... .......+ ...-+..-+-.- T Consensus 336 ~~~ll~~d~~~~~~~~~~~~~~G~~T~N--E~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~-----~~~kGG~~n~~e 408 (409) T protein:vir:94 336 VKSYLRADSATQAEVYFKAVRSGYYTIN--DIREWEDLPPVEGGDKPLISGDLYPIDTPLELR-----KSLKGGDKNVNE 408 (409) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHH--HHHHHhCCCCCCCcCeEeecccccccccchhhc-----ccccCCCCCcCC Confidence 0 0122222 1110 11211 111111 00000000 000011000000 Q ss_pred h Q lcl|NC_021532. 515 R 515 (663) Q Consensus 515 ~ 515 (663) . T Consensus 409 ~ 409 (409) T protein:vir:94 409 S 409 (409) T ss_pred C Confidence 0 Done!