Query lcl|NC_019406.1_cdsid_YP_006988272.1 [gene=D866_gp411] [protein=putative portal protein] [protein_id=YP_006988272.1] [location=19957..21942] Match_columns 661 No_of_seqs 120 out of 152 Neff 6.5 Searched_HMMs 1612 Date Thu Nov 7 17:37:25 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97265 Length: 513 100.0 5E-152 3E-155 850.1 51.7 489 1-597 1-513 (513) 2 protein:vir:80453 Length: 535 100.0 1E-148 6E-152 832.0 49.8 499 1-597 1-535 (535) 3 protein:vir:95149 Length: 501 100.0 1E-147 9E-151 825.6 48.1 467 21-567 1-501 (501) 4 protein:vir:95014 Length: 491 100.0 7E-147 4E-150 822.0 49.8 472 6-576 1-491 (491) 5 protein:vir:78393 Length: 489 100.0 3E-146 2E-149 818.6 49.0 471 6-574 1-489 (489) 6 protein:vir:94956 Length: 452 100.0 5E-145 3E-148 811.7 50.2 450 21-563 1-452 (452) 7 protein:vir:96783 Length: 488 100.0 1E-138 6E-142 777.1 46.0 455 1-549 1-488 (488) 8 protein:vir:106571 Length: 499 99.9 1E-26 6.5E-30 163.0 32.9 474 1-591 1-499 (499) 9 protein:vir:5961 Length: 503 # 99.9 5.8E-25 3.6E-28 153.5 35.7 478 1-608 1-503 (503) 10 protein:vir:105889 Length: 474 99.9 1E-24 6.4E-28 152.1 34.3 452 9-578 1-474 (474) 11 protein:vir:94101 Length: 474 99.9 1E-24 6.4E-28 152.1 34.3 452 9-578 1-474 (474) 12 protein:vir:79043 Length: 479 99.9 2.3E-24 1.4E-27 150.2 34.8 469 1-571 1-479 (479) 13 protein:vir:102330 Length: 451 99.9 1.9E-24 1.1E-27 150.7 33.4 443 9-581 1-451 (451) 14 protein:vir:94805 Length: 492 99.9 1.3E-23 8.1E-27 146.0 37.0 459 1-576 21-492 (492) 15 protein:vir:99522 Length: 470 99.9 2.3E-23 1.4E-26 144.7 36.4 450 1-582 1-470 (470) 16 protein:vir:105292 Length: 478 99.9 1.8E-23 1.1E-26 145.3 35.4 464 1-578 1-478 (478) 17 protein:vir:97336 Length: 492 99.9 7.4E-24 4.6E-27 147.4 33.3 463 1-588 21-492 (492) 18 protein:vir:106639 Length: 481 99.9 5.2E-23 3.2E-26 142.8 37.5 454 1-577 1-481 (481) 19 protein:vir:97171 Length: 512 99.9 1.5E-23 9.4E-27 145.7 34.1 469 1-589 1-512 (512) 20 protein:vir:9871 Length: 429 # 99.9 1.8E-23 1.1E-26 145.2 33.9 424 4-580 1-429 (429) 21 protein:vir:93747 Length: 472 99.9 9.8E-23 6.1E-26 141.3 37.7 459 1-582 1-472 (472) 22 protein:vir:95806 Length: 440 99.9 5.5E-23 3.4E-26 142.6 35.7 424 25-580 1-440 (440) 23 protein:vir:103951 Length: 511 99.9 5.9E-23 3.6E-26 142.5 35.1 469 1-585 1-511 (511) 24 protein:vir:107112 Length: 478 99.9 5.4E-23 3.4E-26 142.6 34.7 466 1-583 1-478 (478) 25 protein:vir:9306 Length: 511 # 99.9 9.2E-23 5.7E-26 141.4 35.8 469 1-589 1-511 (511) 26 protein:vir:95113 Length: 474 99.9 1.3E-22 7.9E-26 140.6 36.2 458 1-580 1-474 (474) 27 protein:vir:96240 Length: 511 99.9 1.5E-22 9.2E-26 140.3 36.1 469 1-585 1-511 (511) 28 protein:vir:3964 Length: 453 # 99.9 5.1E-23 3.2E-26 142.8 33.5 439 1-574 8-453 (453) 29 protein:vir:1236 Length: 483 # 99.9 1.5E-22 9E-26 140.3 35.4 463 1-585 1-483 (483) 30 protein:vir:2732 Length: 501 # 99.9 1.7E-22 1.1E-25 139.9 35.8 463 1-585 9-501 (501) 31 protein:vir:102950 Length: 471 99.9 1.8E-22 1.1E-25 139.9 35.6 453 21-580 1-471 (471) 32 protein:vir:96179 Length: 468 99.9 1.8E-22 1.1E-25 139.9 34.2 456 1-580 1-468 (468) 33 protein:vir:4898 Length: 502 # 99.9 2.9E-22 1.8E-25 138.7 34.2 464 1-585 1-502 (502) 34 protein:vir:94498 Length: 474 99.9 5.7E-22 3.5E-25 137.0 35.4 450 8-576 1-474 (474) 35 protein:vir:97447 Length: 474 99.9 5.7E-22 3.5E-25 137.0 35.4 450 8-576 1-474 (474) 36 protein:vir:3609 Length: 452 # 99.9 1.1E-21 6.9E-25 135.5 36.4 440 1-585 1-452 (452) 37 protein:vir:99781 Length: 511 99.9 6.1E-22 3.8E-25 136.9 34.7 468 1-585 1-511 (511) 38 protein:vir:78083 Length: 537 99.9 9.2E-22 5.7E-25 135.9 35.4 506 4-607 1-537 (537) 39 protein:vir:96366 Length: 511 99.9 9E-22 5.6E-25 136.0 34.9 462 1-585 10-511 (511) 40 protein:vir:78805 Length: 511 99.9 9E-22 5.6E-25 136.0 34.9 462 1-585 10-511 (511) 41 protein:vir:9922 Length: 489 # 99.9 2.9E-21 1.8E-24 133.2 37.1 449 1-569 1-489 (489) 42 protein:vir:96494 Length: 501 99.9 1.4E-21 8.6E-25 134.9 34.4 460 1-585 15-501 (501) 43 protein:vir:98444 Length: 434 99.9 1.6E-21 9.8E-25 134.6 34.4 411 57-577 1-434 (434) 44 protein:vir:78537 Length: 480 99.9 1.1E-21 6.7E-25 135.5 33.2 463 16-603 1-480 (480) 45 protein:vir:96839 Length: 474 99.9 4.8E-22 3E-25 137.5 31.2 463 1-578 1-474 (474) 46 protein:vir:105461 Length: 470 99.9 2.1E-21 1.3E-24 134.0 33.5 453 21-581 1-470 (470) 47 protein:vir:96266 Length: 474 99.9 2.2E-21 1.4E-24 133.8 33.5 458 1-583 1-474 (474) 48 protein:vir:95899 Length: 474 99.9 2.2E-21 1.4E-24 133.8 33.5 458 1-583 1-474 (474) 49 protein:vir:78227 Length: 480 99.9 2.1E-21 1.3E-24 133.9 33.3 466 16-599 1-480 (480) 50 protein:vir:104082 Length: 485 99.9 8.8E-21 5.5E-24 130.5 35.8 457 1-580 1-485 (485) 51 protein:vir:105819 Length: 456 99.9 5E-21 3.1E-24 131.9 34.0 441 1-562 1-456 (456) 52 protein:vir:102602 Length: 456 99.9 5E-21 3.1E-24 131.9 34.0 441 1-562 1-456 (456) 53 protein:vir:94546 Length: 506 99.9 1.3E-21 8.3E-25 135.0 30.5 449 1-587 19-506 (506) 54 protein:vir:733 Length: 453 # 99.9 6.8E-21 4.2E-24 131.2 34.2 438 2-579 1-453 (453) 55 protein:vir:2500 Length: 501 # 99.9 1.7E-20 1E-23 129.0 34.6 479 4-597 1-501 (501) 56 protein:vir:2341 Length: 488 # 99.8 5.5E-21 3.4E-24 131.7 30.4 468 5-592 1-488 (488) 57 protein:vir:99072 Length: 479 99.8 2.2E-20 1.4E-23 128.3 33.6 470 1-609 1-479 (479) 58 protein:vir:2427 Length: 485 # 99.8 1.1E-19 6.9E-23 124.5 34.4 461 7-605 1-485 (485) 59 protein:vir:7768 Length: 484 # 99.8 4E-20 2.5E-23 127.0 31.8 462 1-601 1-484 (484) 60 protein:vir:80680 Length: 441 99.8 1.3E-19 7.9E-23 124.2 33.9 427 4-591 1-441 (441) 61 protein:vir:9751 Length: 422 # 99.8 1.4E-19 8.8E-23 123.9 32.8 408 18-560 1-422 (422) 62 protein:vir:7987 Length: 456 # 99.8 1.2E-19 7.5E-23 124.3 31.5 441 1-575 1-456 (456) 63 protein:vir:9568 Length: 410 # 99.8 7.9E-19 4.9E-22 119.8 34.2 392 23-551 1-410 (410) 64 protein:vir:4223 Length: 486 # 99.8 1E-18 6.2E-22 119.3 34.3 459 1-593 1-486 (486) 65 protein:vir:94742 Length: 409 99.8 1.3E-18 8.2E-22 118.6 31.6 392 18-532 1-409 (409) 66 protein:vir:101494 Length: 527 99.8 6.2E-17 3.8E-20 109.4 34.8 492 1-580 1-527 (527) 67 protein:vir:102239 Length: 527 99.8 6.6E-17 4.1E-20 109.3 34.8 492 1-580 1-527 (527) 68 protein:vir:1634 Length: 409 # 99.7 2.7E-17 1.7E-20 111.4 31.6 391 18-532 1-409 (409) 69 protein:vir:99916 Length: 504 99.7 2.3E-16 1.4E-19 106.3 34.5 464 4-616 1-504 (504) 70 protein:vir:38 Length: 496 # N 99.7 2.3E-14 1.4E-17 95.3 39.5 444 1-563 15-496 (496) 71 protein:vir:80959 Length: 499 99.6 1.8E-13 1.1E-16 90.5 39.6 449 1-569 15-499 (499) 72 protein:vir:8184 Length: 474 # 99.6 1.3E-14 8.3E-18 96.7 32.7 443 4-572 1-474 (474) 73 protein:vir:1587 Length: 508 # 99.5 3.1E-12 1.9E-15 83.7 35.8 442 1-564 17-508 (508) 74 protein:vir:98883 Length: 517 99.5 1.1E-12 7E-16 86.1 32.6 468 1-580 1-517 (517) 75 protein:vir:79703 Length: 505 99.5 1E-11 6.3E-15 80.8 38.1 444 1-567 14-505 (505) 76 protein:vir:7430 Length: 563 # 99.4 1.8E-11 1.1E-14 79.5 32.9 516 1-611 1-563 (563) 77 protein:vir:9815 Length: 500 # 99.4 3.9E-11 2.4E-14 77.6 38.8 453 1-561 1-500 (500) 78 protein:vir:3028 Length: 500 # 99.4 3.9E-11 2.4E-14 77.6 38.8 453 1-561 1-500 (500) 79 protein:vir:78907 Length: 518 99.3 2.8E-10 1.8E-13 72.9 36.8 451 21-566 1-518 (518) 80 protein:vir:93630 Length: 776 99.2 2.5E-10 1.5E-13 73.3 28.6 611 1-661 1-765 (776) 81 protein:vir:4782 Length: 522 # 99.2 8.1E-10 5E-13 70.4 34.9 471 1-574 1-522 (522) 82 protein:vir:105619 Length: 772 99.0 4.1E-09 2.5E-12 66.6 30.0 605 1-661 3-735 (772) 83 protein:vir:8846 Length: 705 # 98.8 2.8E-08 1.7E-11 62.0 29.4 570 1-653 1-705 (705) 84 protein:vir:80165 Length: 651 98.8 3.2E-08 2E-11 61.6 35.3 549 1-640 3-651 (651) 85 protein:vir:94599 Length: 641 98.6 2.1E-07 1.3E-10 57.2 34.4 570 1-655 1-641 (641) 86 protein:vir:345 Length: 663 # 98.3 1.3E-06 8E-10 52.9 30.9 570 1-655 1-663 (663) 87 protein:vir:100920 Length: 725 98.2 2.7E-06 1.7E-09 51.1 22.0 582 21-657 1-725 (725) 88 protein:vir:105520 Length: 706 98.1 4E-06 2.5E-09 50.2 30.6 584 23-652 1-706 (706) 89 protein:vir:95821 Length: 763 98.0 6E-06 3.7E-09 49.2 33.6 588 1-661 1-740 (763) 90 protein:vir:105429 Length: 708 97.9 9.8E-06 6.1E-09 48.0 27.5 569 23-657 1-708 (708) 91 protein:vir:108295 Length: 711 97.9 1.1E-05 7E-09 47.7 30.0 584 1-643 1-711 (711) 92 protein:vir:104437 Length: 714 97.7 2.3E-05 1.4E-08 46.0 32.7 577 1-643 1-714 (714) 93 protein:vir:95449 Length: 584 97.7 3.2E-05 2E-08 45.2 25.3 535 1-606 1-584 (584) 94 protein:vir:172 Length: 708 # 97.5 4.9E-05 3.1E-08 44.2 31.5 571 23-657 1-708 (708) 95 protein:vir:9263 Length: 725 # 97.5 5.8E-05 3.6E-08 43.8 22.5 568 21-648 1-725 (725) 96 protein:vir:78393 Length: 489 97.4 1.1E-06 6.9E-10 53.2 6.6 439 49-584 1-489 (489) 97 protein:vir:79233 Length: 526 97.4 8.5E-05 5.3E-08 42.9 30.3 484 1-644 1-526 (526) 98 protein:vir:99853 Length: 488 97.3 9.3E-05 5.8E-08 42.7 28.6 462 25-652 1-488 (488) 99 protein:vir:2198 Length: 536 # 97.2 0.00014 8.4E-08 41.8 35.3 498 1-649 1-536 (536) 100 protein:vir:1785 Length: 555 # 97.2 0.00015 9.3E-08 41.5 31.2 526 11-648 1-555 (555) 101 protein:vir:79063 Length: 491 97.1 0.00016 9.7E-08 41.4 27.8 465 1-638 1-491 (491) 102 protein:vir:10447 Length: 536 97.1 0.00017 1E-07 41.3 36.2 498 1-649 1-536 (536) 103 protein:vir:9950 Length: 714 # 97.1 0.00017 1.1E-07 41.2 34.7 590 1-661 1-713 (714) 104 protein:vir:10117 Length: 714 97.1 0.00017 1.1E-07 41.2 34.7 590 1-661 1-713 (714) 105 protein:vir:817 Length: 714 # 97.1 0.00017 1.1E-07 41.2 34.7 590 1-661 1-713 (714) 106 protein:vir:2764 Length: 714 # 97.1 0.00017 1.1E-07 41.2 34.7 590 1-661 1-713 (714) 107 protein:vir:3296 Length: 714 # 97.1 0.00017 1.1E-07 41.2 34.7 590 1-661 1-713 (714) 108 protein:vir:99232 Length: 526 97.0 0.00022 1.3E-07 40.7 29.8 486 1-644 1-526 (526) 109 protein:vir:1986 Length: 512 # 96.8 0.0003 1.9E-07 39.9 24.6 467 1-637 1-512 (512) 110 protein:vir:95149 Length: 501 96.8 0.00028 1.7E-07 40.1 14.7 423 89-583 1-501 (501) 111 protein:vir:78161 Length: 355 96.6 0.00049 3.1E-07 38.7 25.5 322 253-618 1-355 (355) 112 protein:vir:94572 Length: 535 96.4 0.00063 3.9E-07 38.1 33.7 506 1-643 1-535 (535) 113 protein:vir:98816 Length: 446 96.3 0.00076 4.7E-07 37.7 28.0 396 19-537 1-446 (446) 114 protein:vir:77597 Length: 725 96.3 0.00077 4.8E-07 37.6 29.9 571 21-648 1-725 (725) 115 protein:vir:107880 Length: 491 96.3 0.00082 5.1E-07 37.5 28.4 461 1-640 1-491 (491) 116 protein:vir:1538 Length: 535 # 96.1 0.001 6.4E-07 37.0 35.7 506 1-644 1-535 (535) 117 protein:vir:78942 Length: 510 95.8 0.0014 8.5E-07 36.3 32.1 483 11-645 1-510 (510) 118 protein:vir:7017 Length: 515 # 95.7 0.0015 9.5E-07 36.0 31.3 482 1-624 7-515 (515) 119 protein:vir:94709 Length: 522 95.7 0.0016 9.8E-07 36.0 35.5 486 1-626 1-522 (522) 120 protein:vir:103860 Length: 528 95.7 0.0016 9.9E-07 35.9 33.6 478 1-645 1-528 (528) 121 protein:vir:79538 Length: 502 95.6 0.0017 1.1E-06 35.7 35.1 450 9-582 1-502 (502) 122 protein:vir:7321 Length: 556 # 95.6 0.0018 1.1E-06 35.6 31.4 518 1-657 1-556 (556) 123 protein:vir:102668 Length: 547 95.5 0.002 1.2E-06 35.4 33.2 500 21-634 1-547 (547) 124 protein:vir:96783 Length: 488 95.5 0.002 1.2E-06 35.4 16.2 443 39-570 1-488 (488) 125 protein:vir:107742 Length: 537 95.4 0.0021 1.3E-06 35.3 29.0 459 1-595 47-537 (537) 126 protein:vir:100039 Length: 522 95.2 0.0026 1.6E-06 34.8 32.0 484 21-642 1-522 (522) 127 protein:vir:10321 Length: 495 95.1 0.0027 1.7E-06 34.7 32.1 445 9-583 1-495 (495) 128 protein:vir:8883 Length: 543 # 94.9 0.0032 2E-06 34.3 34.0 512 1-652 1-543 (543) 129 protein:vir:80040 Length: 461 94.9 0.0033 2E-06 34.2 28.0 427 1-560 1-461 (461) 130 protein:vir:105641 Length: 516 94.8 0.0034 2.1E-06 34.1 28.6 488 1-610 1-516 (516) 131 protein:vir:78696 Length: 542 94.8 0.0034 2.1E-06 34.1 32.0 504 11-651 1-542 (542) 132 protein:vir:80211 Length: 514 94.5 0.0043 2.7E-06 33.5 32.6 482 11-624 1-514 (514) 133 protein:vir:95315 Length: 559 94.4 0.0044 2.7E-06 33.5 31.6 519 21-651 1-559 (559) 134 protein:vir:6322 Length: 510 # 94.2 0.0052 3.2E-06 33.1 33.3 479 11-617 1-510 (510) 135 protein:vir:96068 Length: 765 93.9 0.0061 3.8E-06 32.7 26.3 525 1-661 43-643 (765) 136 protein:vir:3361 Length: 535 # 93.8 0.0065 4E-06 32.6 36.1 505 1-647 1-535 (535) 137 protein:vir:107404 Length: 555 93.7 0.0066 4.1E-06 32.6 34.2 511 1-631 1-555 (555) 138 protein:vir:107822 Length: 555 93.7 0.0066 4.1E-06 32.6 34.2 511 1-631 1-555 (555) 139 protein:vir:98506 Length: 555 93.7 0.0066 4.1E-06 32.6 34.2 511 1-631 1-555 (555) 140 protein:vir:108215 Length: 469 92.7 0.01 6.4E-06 31.5 32.4 438 17-603 1-469 (469) 141 protein:vir:107662 Length: 427 92.3 0.012 7.3E-06 31.2 23.4 393 21-559 1-427 (427) 142 protein:vir:96988 Length: 516 91.3 0.017 1E-05 30.4 31.0 487 1-624 1-516 (516) 143 protein:vir:99672 Length: 532 91.0 0.018 1.1E-05 30.2 34.5 503 1-652 1-532 (532) 144 protein:vir:104338 Length: 422 90.0 0.023 1.4E-05 29.6 26.6 391 21-553 1-422 (422) 145 protein:vir:5249 Length: 437 # 89.0 0.029 1.8E-05 29.0 28.0 413 34-570 1-437 (437) 146 protein:vir:3139 Length: 599 # 87.3 0.039 2.4E-05 28.3 16.1 530 12-651 1-599 (599) 147 protein:vir:79647 Length: 435 86.8 0.043 2.7E-05 28.1 23.5 406 1-564 5-435 (435) 148 protein:vir:79511 Length: 448 81.2 0.088 5.4E-05 26.4 26.7 417 1-590 1-448 (448) 149 protein:vir:103765 Length: 549 80.8 0.092 5.7E-05 26.3 32.6 500 21-631 1-549 (549) 150 protein:vir:95254 Length: 488 79.4 0.1 6.5E-05 26.0 28.2 442 1-613 1-488 (488) 151 protein:vir:77981 Length: 448 78.3 0.12 7.2E-05 25.7 25.2 414 1-590 1-448 (448) 152 protein:vir:3843 Length: 397 # 77.7 0.12 7.6E-05 25.6 25.4 380 1-580 1-397 (397) 153 protein:vir:103330 Length: 517 77.4 0.12 7.7E-05 25.5 32.6 475 21-654 1-517 (517) 154 protein:vir:3520 Length: 720 # 76.1 0.14 8.6E-05 25.3 30.7 572 21-661 1-719 (720) 155 protein:vir:4854 Length: 386 # 74.9 0.15 9.5E-05 25.1 23.4 369 1-556 1-386 (386) 156 protein:vir:63755 Length: 547 73.6 0.17 0.0001 24.8 27.2 484 1-628 9-547 (547) 157 protein:vir:95542 Length: 548 51.9 0.57 0.00035 21.9 34.4 485 21-661 1-540 (548) 158 protein:vir:3989 Length: 392 # 43.0 0.86 0.00054 20.9 29.4 373 1-569 2-392 (392) 159 protein:vir:1023 Length: 392 # 43.0 0.86 0.00054 20.9 29.4 373 1-569 2-392 (392) 160 protein:vir:6382 Length: 553 # 42.9 0.87 0.00054 20.9 36.1 455 1-588 9-553 (553) 161 protein:vir:99563 Length: 862 39.8 1 0.00062 20.6 29.5 542 1-661 52-681 (862) 162 protein:vir:3420 Length: 533 # 33.3 1.4 0.00085 19.8 33.4 449 25-592 1-533 (533) 163 protein:vir:4952 Length: 386 # 32.3 1.4 0.00089 19.7 28.3 369 21-556 1-386 (386) 164 protein:vir:103219 Length: 201 31.1 1.5 0.00094 19.6 11.2 188 331-553 1-201 (201) 165 protein:vir:4337 Length: 434 # 29.0 1.7 0.0011 19.3 28.0 412 1-595 1-434 (434) 166 protein:vir:7407 Length: 392 # 27.8 1.8 0.0011 19.2 29.9 373 1-569 2-392 (392) 167 protein:vir:100882 Length: 383 25.5 2.1 0.0013 18.9 26.0 363 1-564 1-383 (383) 168 protein:vir:389 Length: 530 # 23.9 2.2 0.0014 18.7 35.4 446 9-576 1-530 (530) 169 protein:vir:81072 Length: 432 23.8 2.3 0.0014 18.6 28.2 399 27-588 1-432 (432) No 1 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=5e-152 Score=850.08 Aligned_cols=489 Identities=19% Similarity=0.233 Sum_probs=411.4 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |+- .+..+|+++||+|.+|+++|++|||||+|+++||++|++||||+++|++++|++||+||+||| T Consensus 1 m~~--------------~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 66 (513) T protein:vir:97 1 MAD--------------KDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLN 66 (513) T ss_pred CCC--------------CCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCC Confidence 332 334569999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHhchhhccCcccc-ccchhhH-hhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIR-NLPNTGA-ITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~-~~p~~l~-~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) +|++||++|+|+||+|||+++ ++|+.+. +|.+|||| +|++|++|++++++++|.+| T Consensus 67 ~~~~tl~~l~G~vf~k~p~~~~~~p~~~~~~l~~d~D~----------------------~G~~L~~f~~~~~~~~l~~G 124 (513) T protein:vir:97 67 MVEQTLDTLSGKPFSEPIKLNEDVPKAIEETILPDVDL----------------------QGNNLDVFARQWFREGMAKA 124 (513) T ss_pred hHHHHHHHHhhhhhhcCcccCcCchHHHHHHHhhccCC----------------------CCCCHHHHHHHHHHHHHhcC Confidence 999999999999999999995 5788876 46667665 79999999999999999999 Q ss_pred CEEEEEeccCC------------CchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccccee Q lcl|NC_019406. 159 RFGALVDVAPS------------SDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWI 226 (661) Q Consensus 159 r~gvLVD~P~a------------~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i 226 (661) |||||||||.+ ++.+.++|||+++|.|++||||+++.++|+.+|++|||+|++..+ T Consensus 125 ~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~------------ 192 (513) T protein:vir:97 125 LCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ------------ 192 (513) T ss_pred eEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec------------ Confidence 99999999974 446778999999999999999999999999999999999987521 Q ss_pred eeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccccee Q lcl|NC_019406. 227 GREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYT 306 (661) Q Consensus 227 ~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~ 306 (661) ++|..+.+.|+|+|++|. |+++...+++.....+++. T Consensus 193 --------------------------------------Dgf~~~~~~q~rvL~~g~-----~~v~r~~~~~~~~~~e~~~ 229 (513) T protein:vir:97 193 --------------------------------------DGFAEVCKRRIRVLEPGL-----VQLWEPVKKSNAQKEEWAL 229 (513) T ss_pred --------------------------------------CCCcceEEEEEEEEeCce-----EEEEEeecCCCccccceEE Confidence 246677888888888763 3333333333333445566 Q ss_pred eccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEeccccee Q lcl|NC_019406. 307 PMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVW 386 (661) Q Consensus 307 p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~ 386 (661) +..++++|++|||||+++.++++++++|||++||+||++|||++|||++|||+++||+||++|++++|.++|+||++++| T Consensus 230 ~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~i~iG~~~~~ 309 (513) T protein:vir:97 230 ADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDPVVVGPNKVL 309 (513) T ss_pred ecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCceEeeccccc Confidence 67788999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV 466 (661) Q Consensus 387 ~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a 466 (661) .+|.++++++||||+|+++.+++++|++|++||+++||+|+.. +++++||++++++++++||+|++||.+|++|++++ T Consensus 310 ~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll~~--~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~ 387 (513) T protein:vir:97 310 YNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFLKR--KTGGQTATARALDSAEATSDLSAMTGLFEDALAQA 387 (513) T ss_pred cCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHhhcc--CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9997789999999999999999999999999999999999974 35689999999999999999999999999999999 Q ss_pred HHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCH----HHHHHHH Q lcl|NC_019406. 467 VRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTL----EEFTIKM 542 (661) Q Consensus 467 L~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~----Eee~~~l 542 (661) |+|||+|+|.. ..+++|+||+||....++++++++|++++++|.||++||+++|+|+|||+++++. |+++++| T Consensus 388 l~~~a~wlg~~---~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~ 464 (513) T protein:vir:97 388 LDITADWLRLG---PNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEI 464 (513) T ss_pred HHHHHHHhCCC---CCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhh Confidence 99999999963 3468899999999999999999999999999999999999999999999988884 4455555 Q ss_pred hccCC--CCCCchhhhhhc-C-C--ccccCCCcchhhhhcCChhhHHHHHHHhccCCCchh Q lcl|NC_019406. 543 NDPKS--FIGQPDAIAMRR-G-Y--VSRQQELDQQRAARDADFQQQELEQAERHLEIDEEK 597 (661) Q Consensus 543 ~~~~~--~l~~ddae~~~~-g-~--~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~ 597 (661) ++.+. +++.+++..++. + . .+-..+-.+.++..++- +-|--+. T Consensus 465 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~ 513 (513) T protein:vir:97 465 SEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGG------------GNPGGES 513 (513) T ss_pred hhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCccccC------------CCCCCCC Confidence 55543 223333332221 1 0 00122222223322221 1111111 No 2 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=100.00 E-value=9.7e-149 Score=832.04 Aligned_cols=499 Identities=22% Similarity=0.318 Sum_probs=414.1 Q ss_pred CC-------------CCCCcccccccccc-ccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC--- Q lcl|NC_019406. 1 MA-------------GLSPNSANIRRTKR-GAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG--- 63 (661) Q Consensus 1 ~~-------------~~~~~~~~~~~~~~-~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~--- 63 (661) || -++|-|+- -|-- .++-.||+++||+|.+|+++|++|||||+|+++||++|++|||+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~ 78 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAP--PTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSR 78 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCc--CCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccC Confidence 33 22222221 0111 23334599999999999999999999999999999999999999874 Q ss_pred --CChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCC Q lcl|NC_019406. 64 --FDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGT 141 (661) Q Consensus 64 --E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~ 141 (661) |++++|++||+||+|||+|++||++|+|+||+|||+++ +|+.|++|.+|||| +|+ T Consensus 79 ~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~-~p~~l~~l~~d~D~----------------------~G~ 135 (535) T protein:vir:80 79 DEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQ-LPPALEAIVEDIDG----------------------EGV 135 (535) T ss_pred CcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCccee-ccHHHHHHHhccCC----------------------CCC Confidence 56778999999999999999999999999999999995 89999999888766 699 Q ss_pred CHHHHHHHHHHHHHhhCCEEEEEeccCC-------CchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeec Q lcl|NC_019406. 142 SHQGFAKTVALEQVAMGRFGALVDVAPS-------SDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVD 214 (661) Q Consensus 142 sL~~fa~~~~~~~L~~Gr~gvLVD~P~a-------~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~ 214 (661) +|++|++++++++|.+||||||||||.+ ++.+.+.|||+++|+|++||||+++.++|+.+|+||||+|++.++ T Consensus 136 ~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~ 215 (535) T protein:vir:80 136 SLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQ 215 (535) T ss_pred CHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEec Confidence 9999999999999999999999999974 446779999999999999999999999999999999999988653 Q ss_pred cccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEe Q lcl|NC_019406. 215 EHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYV 294 (661) Q Consensus 215 ~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~ 294 (661) + ++|..+.+.|+|+|+++.+|.+. +++|+ T Consensus 216 d-------------------------------------------------d~f~~~~~~q~RvL~~~~~G~y~--v~~~~ 244 (535) T protein:vir:80 216 D-------------------------------------------------DGFETTYVQQWRVLQLNAEGNYQ--VERWR 244 (535) T ss_pred C-------------------------------------------------CCcccceeEEEEEEEecCCceEE--EEEEE Confidence 3 35566677777788887776654 34443 Q ss_pred cC----cccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecC Q lcl|NC_019406. 295 ED----PLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPEL 370 (661) Q Consensus 295 ~~----~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl 370 (661) .. ......+.+.+..++++|++|||||+|+.++++++++|||++||+|||+|||++|||++|||++++|+||++|+ T Consensus 245 ~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~ 324 (535) T protein:vir:80 245 RETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGL 324 (535) T ss_pred eecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecC Confidence 21 12223345556778899999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC------CceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHH Q lcl|NC_019406. 371 DDSD------ASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALR 444 (661) Q Consensus 371 ~~~~------~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d 444 (661) +++| +.+|+||++++|.+|. +++++|+|++|.++. +++|++|++||+++||+|+... .+++|+++++++ T Consensus 325 ~~~~~~~~~~~~~i~iG~~~~~~lP~-~~~~~~~e~~~~~~a--~~~l~~~e~qM~~lGa~ll~~~--~~~~Ta~~a~~~ 399 (535) T protein:vir:80 325 TKDWVEDVFKDFKVHLGSRAIIPLPQ-GATAGILQITPNSVP--FEAMTHKESQMIAMGANLLVKS--GGNRTFGEAQQE 399 (535) T ss_pred chhhhhcCCCCcceEecCcccccCCC-CCCcceeeeccchhH--HHHHHHHHHHHHHHHHHhhccC--cccccHHHHHHH Confidence 8765 2469999999999996 789999999999887 5789999999999999999753 578999999999 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_019406. 445 EANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFV 524 (661) Q Consensus 445 ~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~ 524 (661) ++++||+|++||.+|++|+++||+|||+|+|+.. +.+++.|+||+||....++++++++|++++++|.||++||+++|+ T Consensus 400 ~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~-~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~ 478 (535) T protein:vir:80 400 EASEQSILSACTKNVSMAFRKALRWANQFQTGIV-NDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLR 478 (535) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCcc-CCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHH Confidence 9999999999999999999999999999999754 456789999999999999999999999999999999999999999 Q ss_pred hcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchh Q lcl|NC_019406. 525 KNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEK 597 (661) Q Consensus 525 r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~ 597 (661) |+|||++++++|+++.+|++++..+++.-....-++ ..-+...+.+ + -...+-+++. T Consensus 479 r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~-~~g~~~~~~~----~-----------~~~~~~~~~~ 535 (535) T protein:vir:80 479 RAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAA-SGGTNKAKLN----N-----------GNGGGNQAGN 535 (535) T ss_pred hCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCC-CCCCCcCccc----C-----------CccccccCCC Confidence 999999999999999999988665543211111101 0011111111 0 0122222222 No 3 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=100.00 E-value=1.5e-147 Score=825.60 Aligned_cols=467 Identities=22% Similarity=0.352 Sum_probs=400.0 Q ss_pred CC-ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC-----CCCChHHHHHHHhhhcccchHHHHHHHHhchhh Q lcl|NC_019406. 21 FT-HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP-----KGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIF 94 (661) Q Consensus 21 ~~-V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~-----~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vF 94 (661) |+ |+++||+|.+|+++|++|||||+|+++||++|++|||++ ++|+++.|++||+||+|||+|++|+++|+|+|| T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 77 999999999999999999999999999999999999986 556678999999999999999999999999999 Q ss_pred ccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCC----- Q lcl|NC_019406. 95 RRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPS----- 169 (661) Q Consensus 95 rk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a----- 169 (661) +|||+++ +|+.|++|.+|+|| +|++|++|++++++++|.+||||||||||.+ T Consensus 81 ~k~p~~~-~p~~l~~l~~d~D~----------------------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~ 137 (501) T protein:vir:95 81 MRDPVVK-VPALLNPLVANATG----------------------SGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGG 137 (501) T ss_pred cCCccee-CcHHHHHHHhccCC----------------------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCccc Confidence 9999995 99999999888876 6999999999999999999999999999964 Q ss_pred ----CchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcch Q lcl|NC_019406. 170 ----SDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGL 245 (661) Q Consensus 170 ----~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~ 245 (661) ++++.++|||+++|+|++||||+++.+||+.+|+||||+|++.++++ T Consensus 138 ~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~----------------------------- 188 (501) T protein:vir:95 138 ASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADD----------------------------- 188 (501) T ss_pred ccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCC----------------------------- Confidence 34577899999999999999999999999999999999998864333 Q ss_pred hhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcc----------cccccceee-ccCCccc Q lcl|NC_019406. 246 AERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPL----------GQARDVYTP-MVRGRTL 314 (661) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~----------~~~~~~~~p-~~~g~~L 314 (661) +|..+.+.|+|+|+++.+|.++|+++....... +.....+.| ..+|++| T Consensus 189 --------------------~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l 248 (501) T protein:vir:95 189 --------------------GFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRL 248 (501) T ss_pred --------------------CcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccCCCcC Confidence 455566777777777777777666554433211 111233444 4678999 Q ss_pred ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----ceeEecccceeecC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----SEYHIGPGRVWVVD 389 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~~l~iGs~~~~~lp 389 (661) ++|||||+++.++++++++|||++||+|||+|||++|||++|||++++|+||++|+++++. .+++||++++|.+| T Consensus 249 ~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP 328 (501) T protein:vir:95 249 TEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLP 328 (501) T ss_pred CeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccccccCC Confidence 9999999999999999999999999999999999999999999999999999999987653 46999999999999 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRY 469 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~ 469 (661) + +++++||||+|.++. +++|++|++||+++||+|++. +.+++||++++++++++||+|++||.+|++|+++||+| T Consensus 329 ~-~~~~~~ie~~~~~i~--~~~l~~l~~~m~~~Ga~ll~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~ 403 (501) T protein:vir:95 329 V-GADAKLLQASENTML--KEAMDTKERQMVALGAKLVEQ--KEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKW 403 (501) T ss_pred C-CCceeEEecChhhHH--HHHHHHHHHHHHHHHHhhccC--CccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 5 789999999998875 899999999999999999964 45789999999999999999999999999999999999 Q ss_pred HHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhcc--CC Q lcl|NC_019406. 470 WLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDP--KS 547 (661) Q Consensus 470 ~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~--~~ 547 (661) ||+|+|+.+ .+++|+||+||....++++++++|++++++|.||++||+++|+|+||+++++++++++.+.+.. .+ T Consensus 404 ~a~w~g~~~---~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~ 480 (501) T protein:vir:95 404 AARWVGQAD---SGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMA 480 (501) T ss_pred HHHHcCCCC---CceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCccc Confidence 999999752 4578999999999999999999999999999999999999999999999988777655443322 21 Q ss_pred CCCCchhhhhhcCCccc-cCC Q lcl|NC_019406. 548 FIGQPDAIAMRRGYVSR-QQE 567 (661) Q Consensus 548 ~l~~ddae~~~~g~~~~-~~~ 567 (661) ...+.+....-+|..+. -.| T Consensus 481 ~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 481 LATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred ccccCCCCCCCcccccccCCC Confidence 21222222223333332 111 No 4 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=100.00 E-value=6.5e-147 Score=822.02 Aligned_cols=472 Identities=24% Similarity=0.363 Sum_probs=410.4 Q ss_pred CccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC-CChHHHHHHHhhhcccchHHH Q lcl|NC_019406. 6 PNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG-FDDEDYANYLDRAAFYNMTSQ 84 (661) Q Consensus 6 ~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~-E~~~~Y~~rl~rA~~~n~~~~ 84 (661) --+|| ...+||+++||+|.+|+++|++|||||+|++. +..++.|||++++ +++++|++||+||+|||+|++ T Consensus 1 ~~~~~-------~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~-~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~ 72 (491) T protein:vir:95 1 MLTAN-------GQGSGVKTKHREWLHYAPKWQKVRHALAGDLV-GYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRR 72 (491) T ss_pred CcccC-------CccCCCCccCHHHHHHHHHHHHHHHHhcCcch-hhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHH Confidence 12333 67789999999999999999999999999654 4457789999876 667789999999999999999 Q ss_pred HHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEE Q lcl|NC_019406. 85 TQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALV 164 (661) Q Consensus 85 tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLV 164 (661) ||++|+|+||+|||+++ +|+.|++|.+|||| +|++|++|++++++++|.+||||||| T Consensus 73 tl~~l~G~vfrk~p~~~-~p~~l~~l~~d~D~----------------------~G~~L~~f~~~~~~~~l~~G~~~ilV 129 (491) T protein:vir:95 73 TLSGMVGSVMRKEPEIN-IPKELEYLLKNADG----------------------SGVGLIQHAQDTLMEIDSVGRGGLLV 129 (491) T ss_pred HHHHHhchhhcCCceee-ccHHHHHHHhccCC----------------------CCCCHHHHHHHHHHHHHHcCeEEEEE Confidence 99999999999999995 99999999888766 69999999999999999999999999 Q ss_pred eccCCC------chhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 165 DVAPSS------DPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 165 D~P~a~------~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) |||.++ +++.++|||+++|+|++||||+++.+||+.+|++|||+|++.+.+ T Consensus 130 D~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d----------------------- 186 (491) T protein:vir:95 130 DAPETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHE----------------------- 186 (491) T ss_pred ecCCCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeec----------------------- Confidence 999754 467899999999999999999999999999999999999875422 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccc---cccceeeccCCcccc Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQ---ARDVYTPMVRGRTLP 315 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~---~~~~~~p~~~g~~L~ 315 (661) ..++|+.+.++|+|+|+++.+|.+ ++++|+++..+. ..++++|..++++|+ T Consensus 187 ------------------------~~~~f~~~~~~qyRvL~l~~~g~~--~~~v~r~~~~g~~~~~~~~~~~~~g~~~l~ 240 (491) T protein:vir:95 187 ------------------------PGNEFETKYGEQYRVLDIDTDGNY--RQRLFRFDAEGGAQEEVVEIYPDLGESLRG 240 (491) T ss_pred ------------------------CCCCcccceEEEEEEEeecCCCce--EEEEEEEcCCCcceeeeeeeeecCCCcccC Confidence 345788888899999998877764 555555543333 345667778888999 Q ss_pred eeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--------CceeEecccceee Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--------ASEYHIGPGRVWV 387 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--------~~~l~iGs~~~~~ 387 (661) +|||||+|+.++++++++|||++||+|||+|||++|||++|||++++|+||++|.++.. +..+++|++++|. T Consensus 241 ~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~ 320 (491) T protein:vir:95 241 VIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGHN 320 (491) T ss_pred eeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCcC Confidence 99999999999999999999999999999999999999999999999999999976432 2358999999999 Q ss_pred cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVV 467 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL 467 (661) +|. +++++|+|++|.++ .+++|++|++||+++||+|++. +.++||++++++++++||+|++||.+|++|+++|| T Consensus 321 lP~-~~~~~~ie~~~~~~--~~~~l~~~e~qm~~~Ga~l~~~---~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l 394 (491) T protein:vir:95 321 LGY-GGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLITP---SQQITAESARIQRGADTSVMATIARNVSQAYTDAL 394 (491) T ss_pred CCC-CCccceeecCcchH--HHHHHHHHHHHHHHHHHHhccC---CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 996 78999999998775 5999999999999999999964 34799999999999999999999999999999999 Q ss_pred HHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC Q lcl|NC_019406. 468 RYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS 547 (661) Q Consensus 468 ~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~ 547 (661) +|||+|+|+++ ..++.|+||+||....++++++++|++++++|.||++||+++|+|+||++ .++|+++++|+++++ T Consensus 395 ~~~a~w~G~~~--~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~--~~~e~~~~~ie~~~~ 470 (491) T protein:vir:95 395 RWVAMMLGKPE--DSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTD--WTDEDILNAIEDAPL 470 (491) T ss_pred HHHHHHcCCCC--CCceEEEeecccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCC--ccHHHHHHHHHhcCC Confidence 99999999864 45788999999999999999999999999999999999999999999984 578999999999988 Q ss_pred CCCCchhhhhhcCCccccCCCcch-hhhhc Q lcl|NC_019406. 548 FIGQPDAIAMRRGYVSRQQELDQQ-RAARD 576 (661) Q Consensus 548 ~l~~ddae~~~~g~~~~~~~~~q~-~~~~e 576 (661) .++.-- ....+++|+ +...| T Consensus 471 ~~~~~~---------~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 471 PSGAVT---------QVAGEIPQAAQQQQE 491 (491) T ss_pred CCCccc---------cccccchhhhhhccC Confidence 877431 223355555 11112 No 5 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=100.00 E-value=2.8e-146 Score=818.56 Aligned_cols=471 Identities=24% Similarity=0.363 Sum_probs=409.5 Q ss_pred CccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCC-ChHHHHHHHhhhcccchHHH Q lcl|NC_019406. 6 PNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGF-DDEDYANYLDRAAFYNMTSQ 84 (661) Q Consensus 6 ~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E-~~~~Y~~rl~rA~~~n~~~~ 84 (661) --++| ...+||+++||+|.+|+++|++|||||+|++.++. +..|||+++.+ +++.|++||+||+|||+|++ T Consensus 1 ~~~~~-------~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~-r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~ 72 (489) T protein:vir:78 1 MLTEN-------GQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYL-RNVGLNEPDKAYGEARQAEYEAGGIVYNFTRR 72 (489) T ss_pred CccCC-------CccCCCCccCHHHHHHHHHHHHHHHHhcCcccccc-cCCCCCCCCCCCChHHHHHHHhccccCChHHH Confidence 12233 67789999999999999999999999999876555 44799998865 46779999999999999999 Q ss_pred HHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEE Q lcl|NC_019406. 85 TQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALV 164 (661) Q Consensus 85 tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLV 164 (661) ||++|+|+||+|||+++ +|+.|++|.+|||| +|++|++|++++++++|.+||||||| T Consensus 73 tl~~l~G~vfrk~p~~~-~p~~l~~l~~d~D~----------------------~G~~L~~f~~~~~~~~l~~G~~~ilV 129 (489) T protein:vir:78 73 TLSGMVGSVMRKEPEIN-IPKELEYLLKNADG----------------------SGVGLIQHAQDTLMEIDSVGRGGLLV 129 (489) T ss_pred HHHHHhchhhcCCccee-ccHHHHHHHhccCC----------------------CCCCHHHHHHHHHHHHHhcCeEEEEE Confidence 99999999999999995 89999999888776 69999999999999999999999999 Q ss_pred eccCCC------chhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 165 DVAPSS------DPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 165 D~P~a~------~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) |||.++ +.+.++|||+++|+|++||||+++.+||+.+|+||||+|++.+.+ T Consensus 130 D~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d----------------------- 186 (489) T protein:vir:78 130 DAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNE----------------------- 186 (489) T ss_pred eeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeec----------------------- Confidence 999753 467899999999999999999999999999999999999886432 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc---ceeeccCCcccc Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD---VYTPMVRGRTLP 315 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~---~~~p~~~g~~L~ 315 (661) ..++|.++.+.|+|+|+++.+|. |++++|++...+.... .+.|..+|++|+ T Consensus 187 ------------------------~~~~f~~~~~~q~RvL~~~~~g~--~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~ 240 (489) T protein:vir:78 187 ------------------------PGNEFETKYGEQYRVLDIDSDGN--YRQRLFRFDAEGGAQEDVVEIYPDLGESLRG 240 (489) T ss_pred ------------------------CCCCccceeEEEEEEEecCCCcc--eEEEEEEeecCCcccceeeEEeccCCCCccC Confidence 34578888899999999887764 5555666555454433 345677889999 Q ss_pred eeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--------CceeEecccceee Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--------ASEYHIGPGRVWV 387 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--------~~~l~iGs~~~~~ 387 (661) +|||||+|+.++++++++|||++||+|||+|||++|||++|||++++|+||++|.++.. +..+++|++++|. T Consensus 241 ~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~~~~~~ 320 (489) T protein:vir:78 241 VIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGSRRGHN 320 (489) T ss_pred eeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeCCccccc Confidence 99999999999999999999999999999999999999999999999999999986433 2358999999999 Q ss_pred cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVV 467 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL 467 (661) +|. +++++|+|++|.++ .+++|++|++||+++||+|++. ++++|+++++++++++||+|++||.+|++|+++|| T Consensus 321 lp~-~~~~~~ie~~~~~~--~r~~l~~le~qm~~lGa~l~~~---~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l 394 (489) T protein:vir:78 321 LGY-GGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLITP---TQQITAQSARIQRGADTSVMATIARNVSQAYTDAL 394 (489) T ss_pred CCC-CCCcceeccCcchH--HHHHHHHHHHHHHHHhhhhccC---CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 996 78999999988665 5999999999999999999963 35799999999999999999999999999999999 Q ss_pred HHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC Q lcl|NC_019406. 468 RYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS 547 (661) Q Consensus 468 ~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~ 547 (661) +|||+|+|+++ +.++.|+||+||....+|++++++|++++++|.||++||+++|+|+||+++ ++|+++++|+++++ T Consensus 395 ~~~a~w~G~~~--~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~--~~e~~~~ei~~~~~ 470 (489) T protein:vir:78 395 RWVAVMLGKPE--DTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDW--TDADIKDAVADQPL 470 (489) T ss_pred HHHHHHcCCCC--CCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCc--cHHHHHHHHhhcCC Confidence 99999999864 457899999999999999999999999999999999999999999999864 67999999999988 Q ss_pred CCCCchhhhhhcCCccccCCCcchhhh Q lcl|NC_019406. 548 FIGQPDAIAMRRGYVSRQQELDQQRAA 574 (661) Q Consensus 548 ~l~~ddae~~~~g~~~~~~~~~q~~~~ 574 (661) .++..+.-.++++ .|+++. T Consensus 471 ~~~~~~~g~~~~~--------~q~~~~ 489 (489) T protein:vir:78 471 PVATEVQGEIPQS--------AQQQEK 489 (489) T ss_pred CcccCCcccCCCC--------cccccC Confidence 7776555444422 222111 No 6 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=100.00 E-value=4.9e-145 Score=811.73 Aligned_cols=450 Identities=28% Similarity=0.448 Sum_probs=395.7 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccc Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVI 100 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i 100 (661) |+|+++||+|.+++++|++|||||+|+++||++|++||||+++|++++|++||+||+|||+|++||++|+|+||+|||++ T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p~~ 80 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPPVI 80 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhccccee Q lcl|NC_019406. 101 RNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYT 180 (661) Q Consensus 101 ~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~ 180 (661) + +|+.|+++ |+|| +|++|++|++++++++|.+||||||||||.+ +.|||+ T Consensus 81 ~-~p~~l~~~--~~D~----------------------~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~-----g~rPy~ 130 (452) T protein:vir:94 81 T-HPDAMSKY--FEDQ----------------------SGIQFYEVFTRAVEETLLMGRVGVFIDRPLT-----GGDPYI 130 (452) T ss_pred c-ccHHHHHH--Hhcc----------------------cCCCHHHHHHHHHHHHHhcCeEEEEEeeccC-----CCceEE Confidence 5 89998887 3444 6999999999999999999999999999964 679999 Q ss_pred Eeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecc Q lcl|NC_019406. 181 VGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARP 260 (661) Q Consensus 181 ~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~ 260 (661) +.|+|++||||+++.+|| |++|+|||++.+.+++ T Consensus 131 ~~~~~~~Ii~W~~~~~g~---l~~v~lre~~~~~d~~------------------------------------------- 164 (452) T protein:vir:94 131 SVYTTENILNWEEDEDGR---LLMVVLREFYTVRDTA------------------------------------------- 164 (452) T ss_pred EEechhhhcCccccccCC---eeEEEEEEEEEEecCC------------------------------------------- Confidence 999999999999988765 8999999987653321 Q ss_pred cccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc--ccccceeeccCCcccceeeEEEEecCCCCCCccccchhH Q lcl|NC_019406. 261 SRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG--QARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLD 338 (661) Q Consensus 261 ~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~--~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLld 338 (661) +.++.++.++||++.|++| .|++++|+....+ ..+++..+..+|++|++|||||+++.++++++++|||++ T Consensus 165 --d~f~~~~~~~yRvL~l~~g-----~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pPLl~ 237 (452) T protein:vir:94 165 --DRYVQNIRVRYRCLELVDG-----LLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPPMID 237 (452) T ss_pred --CcccceeEEEEEEEEEeCC-----eEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCCCCCCCccchHH Confidence 1122344455555555554 3556666543333 335677888899999999999999999999999999999 Q ss_pred HHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHH Q lcl|NC_019406. 339 IVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQ 418 (661) Q Consensus 339 LA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~q 418 (661) ||+||++|||++|||++|||++++|+||++|+++.+ +++||++++|.+|++|++++||||+|.++++++++|++|++| T Consensus 238 LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~--~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~ 315 (452) T protein:vir:94 238 IVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS--TMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQ 315 (452) T ss_pred HHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC--ceEecccccccCCCCCCcceEEccCchhHHHHHHHHHHHHHH Confidence 999999999999999999999999999999998765 699999999999977899999999999999999999999999 Q ss_pred HHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCC Q lcl|NC_019406. 419 IAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALD 498 (661) Q Consensus 419 M~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~ld 498 (661) |+++||+|+... +.+++|+++++++++++||+|++||.+|++|++++|+|||+|+|.+ .++.|+||+||....++ T Consensus 316 m~~~Ga~ll~~~-~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~----~~~~v~~n~dF~~~~~~ 390 (452) T protein:vir:94 316 LASLSARLIDNS-TRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMG----GTLNIKLNSAFLDSKLT 390 (452) T ss_pred HHHHHHHhhccC-CCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC----CceEEEeccccccccCC Confidence 999999999764 3567788899999999999999999999999999999999999973 36789999999999999 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCcc Q lcl|NC_019406. 499 ARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVS 563 (661) Q Consensus 499 a~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~ 563 (661) ++++++|++++++|.||++||+++|+|+|||+.+...+.+.+++..+++...+ ...+.|++- T Consensus 391 ~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~---~~~~~~~~~ 452 (452) T protein:vir:94 391 AAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSN---TPPNPSSKA 452 (452) T ss_pred HHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCC---CCCCCccCC Confidence 99999999999999999999999999999999988888888887776654432 223333322 No 7 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=100.00 E-value=1e-138 Score=777.06 Aligned_cols=455 Identities=18% Similarity=0.246 Sum_probs=382.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC-----CChHHHHHHH-- Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG-----FDDEDYANYL-- 73 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~-----E~~~~Y~~rl-- 73 (661) |. ..+-|. --.=.|.|+++||+|.+|+++|++++|| |+.+||++|++||||+++ |++..|+.|+ T Consensus 1 ~~----~~~~~~---~~~~~m~V~~~hp~y~a~~~~W~~~~d~--g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~ 71 (488) T protein:vir:96 1 ML----KCLYIK---HRGFFMLTPIYHPDYLVNAPQWLRNLDC--VMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAK 71 (488) T ss_pred Cc----eeEEEe---ecceeecccccCHHHHHHhhhhhHhhhh--hhHHHHHhhhhcCCCCCCccccccCcchhhhhhcc Confidence 21 122232 3445689999999999999999999985 667899999999999875 3444444444 Q ss_pred ----------hhhcccchHHHHHHHHhchhhccCcccccc-chhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCC Q lcl|NC_019406. 74 ----------DRAAFYNMTSQTQAGMVGQIFRRPPVIRNL-PNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTS 142 (661) Q Consensus 74 ----------~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~-p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~s 142 (661) +||+|||+|++|+++|+|+||+|||+++.. |+.|++|.+|||| +|++ T Consensus 72 ~~~~y~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~----------------------~G~~ 129 (488) T protein:vir:96 72 IEKDWEDLTWRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDG----------------------KGNG 129 (488) T ss_pred chhhhHhhhhhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCC----------------------CCCC Confidence 389999999999999999999999999632 3568777777665 7999 Q ss_pred HHHHHHHHHHHHHhhCCEEEEEeccC-----CCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 143 HQGFAKTVALEQVAMGRFGALVDVAP-----SSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 143 L~~fa~~~~~~~L~~Gr~gvLVD~P~-----a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) |++|++++++++|.+||||||||||. +++++.++|||++.|+|++||||+++.+||+.+|++|||+|++.+.++ T Consensus 130 L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~- 208 (488) T protein:vir:96 130 IDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDG- 208 (488) T ss_pred HHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccC- Confidence 99999999999999999999999996 466778999999999999999999999999999999999998754221 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) +.|..+.++++++|++| .|+++++..++ T Consensus 209 -----------------------------------------------~~~~~~~~~~~~~l~~g-----~~~v~~~~~~~ 236 (488) T protein:vir:96 209 -----------------------------------------------GTYVSKQRLINHRLVDG-----LCEFQEVTDDE 236 (488) T ss_pred -----------------------------------------------CCcccceEEEEEEEECc-----EEEEEEEecCC Confidence 23455678888888765 35555555443 Q ss_pred ccccccceee-ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe--cCCCCC Q lcl|NC_019406. 298 LGQARDVYTP-MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP--ELDDSD 374 (661) Q Consensus 298 ~~~~~~~~~p-~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~--Gl~~~~ 374 (661) . . .+++| ..+|++|++|||||+|+.++++++++|||++||+|||+|||++|||++|+|++++|++++. |++.++ T Consensus 237 ~--~-~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~ 313 (488) T protein:vir:96 237 Y--S-DEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTM 313 (488) T ss_pred c--c-cceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCccc Confidence 2 2 34455 4678899999999999999999999999999999999999999999999999999999975 333332 Q ss_pred Cc-----eeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhh Q lcl|NC_019406. 375 AS-----EYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQ 449 (661) Q Consensus 375 ~~-----~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~ 449 (661) .. ++.+|+...+..| .+.++|+|++++++ .+++|++|++||+++||+|++. ++++||++++++++++| T Consensus 314 ~~~~~~~g~~~~~~~~~~~~--~g~~~~~e~~~~~l--~~~~l~~l~~qm~~~Ga~l~~~---~~~~Ta~~~~~~~~~~~ 386 (488) T protein:vir:96 314 ASEMNPLGFTLAGRMPYYVK--NGDVKVIQAQFSPE--TENKVEKLFEQAVKVGASLFTQ---QSNETATGAAIRSGSST 386 (488) T ss_pred ccccccceeeeccccccccc--CCceeecCCchhHH--HHHHHHHHHHHHHHHhHhhccC---CCcchHHHHHHHHHHhh Confidence 22 2334444333333 46899999988876 5999999999999999999963 35799999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcC Q lcl|NC_019406. 450 SLLLNVIMALEDGMTSVVRYWLMFRDIPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNG 527 (661) Q Consensus 450 S~L~~~A~~le~Al~~aL~~~A~w~G~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~g 527 (661) |+|++||.+|++|+++||+|||+|+|+.++ ++.+++|+||+||....+|++++++|++++++|.||++||+++|+|+| T Consensus 387 S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~g 466 (488) T protein:vir:96 387 ASMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRAR 466 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCC Confidence 999999999999999999999999998763 356789999999999999999999999999999999999999999999 Q ss_pred CCCccCCHHHHHHHHhccCCCC Q lcl|NC_019406. 528 IIPSTQTLEEFTIKMNDPKSFI 549 (661) Q Consensus 528 vl~~~~~~Eee~~~l~~~~~~l 549 (661) ||+++.++|+|+++|++++.++ T Consensus 467 vl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 467 VVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred cCCccCCHHHHHHHHhhcCCCC Confidence 9999999999999999987776 No 8 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.94 E-value=1e-26 Score=163.01 Aligned_cols=474 Identities=10% Similarity=0.007 Sum_probs=255.3 Q ss_pred CCCCCCccc--cccccccccccCC---ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhh Q lcl|NC_019406. 1 MAGLSPNSA--NIRRTKRGAQQFT---HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDR 75 (661) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~---V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~r 75 (661) ||=..+-++ ||. +++ +..---.+....++++++.+.|.|.+.+..+ +......-..|+ T Consensus 1 ~~~~~~~~~~~~~~-------~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~--------~~~~~~~~~~ki-- 63 (499) T protein:vir:10 1 MAVVIDKDLLDDVN-------EPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKH--------EFDNATVEAANV-- 63 (499) T ss_pred CccchhhhHHhhhh-------cCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcC--------CcCcCCCCccee-- Confidence 665555444 322 122 1111123456678899999999997654321 111111111222 Q ss_pred hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 76 AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 76 A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) + .|+.+.+|+..+|.+|.+||+++. ++ .+..+.|+++ ++.++++.+...+.+.++ T Consensus 64 -~-~n~~~~Iv~~~~~~l~g~p~~~~~-~~-----------------~~~~~~l~~~-----~~~n~~~~~~~~~~~~~~ 118 (499) T protein:vir:10 64 -M-VNHAKYITDMNVGFMTGNPVKYVA-EK-----------------GKNIDDILEV-----FNQIDIHKHDIELEKDLS 118 (499) T ss_pred -e-cchHHHHHHHHhhhhcccCceeec-CC-----------------hhHHHHHHHH-----HhhcCHhHHHHHHHHHHH Confidence 2 599999999999999999999852 11 1122334443 346789999999999999 Q ss_pred hhCCEEEEEeccCCCch-----------hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 156 AMGRFGALVDVAPSSDP-----------TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 156 ~~Gr~gvLVD~P~a~~~-----------~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .+|+++++|-....+.. ....++.+..++|.+++-... +..++..+..|+.. .. T Consensus 119 ~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~-d~~~~~~~~~i~~~--~~------------ 183 (499) T protein:vir:10 119 VFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCD-DTVEHDPLFAVFTQ--EK------------ 183 (499) T ss_pred hcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEec-CCCCcceEEEEEEE--EE------------ Confidence 99999999965432211 112234456666665432211 11111111111111 00 Q ss_pred eeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccc Q lcl|NC_019406. 225 WIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDV 304 (661) Q Consensus 225 ~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~ 304 (661) .+.. ..+.++.+.++.++. ++++.....+. ..... T Consensus 184 -----------------------------------~~~~----~~~~~~~~~iyt~~~----i~~~~~~~~~~--~~~~~ 218 (499) T protein:vir:10 184 -----------------------------------KDLE----GNTNGYSITVYMPQR----IVEYRTKTTME--VSAND 218 (499) T ss_pred -----------------------------------eecC----CCceEEEEEEEeCCe----EEEEEecCCcc--ccCcc Confidence 0000 011222333333321 22222111110 11111 Q ss_pred eeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC--ceeEecc Q lcl|NC_019406. 305 YTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA--SEYHIGP 382 (661) Q Consensus 305 ~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~--~~l~iGs 382 (661) .......++|+.||||.+.... .. .+=|.++-.|-=+.=...|++.+.+.+.++|+++++|...++. ....+.. T Consensus 219 ~~~~~~~~~~g~vPvv~~~n~~--~~--~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~ 294 (499) T protein:vir:10 219 PIVYDGENLFGAVPIIEFRNNE--ER--QGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKR 294 (499) T ss_pred eecccccCCCCccceEEecCCC--CC--CCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhh Confidence 2222334679999999875422 22 2223333333333444668899999999999999999753321 1122344 Q ss_pred cceeecC-CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 383 GRVWVVD-KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 383 ~~~~~lp-~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) +..+.++ .++++++||+.+. +.+..+..++.+.++|+.+..-.- ....-+++.||++.+................+. T Consensus 295 ~~~~~~~~~~~~d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~ 373 (499) T protein:vir:10 295 GAIEAPPREEGADIEWLTKSF-DETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFF 373 (499) T ss_pred cceeccCCCCCCcceEEeccC-CHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444443 3467899999876 457788899999999988754221 111224577999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCCC--cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 461 DGMTSVVRYWLMFRDIPLTD--TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 461 ~Al~~aL~~~A~w~G~~~~~--~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) .++.+++++++.|++..... ...+.|.+++.... . ..+.++.+.++ +|.||++|.+..| |...+.++| T Consensus 374 ~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~-n-~~e~~~~~~kl--~g~iS~et~~~~l------~~v~d~~~E 443 (499) T protein:vir:10 374 DGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPS-N-LSDVVNNVKNA--DGIIPRKYTYSWL------PDVDNPQDV 443 (499) T ss_pred HHHHHHHHHHHHHHhccCCccccccceEEeCCCCCC-C-HHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHH Confidence 99999999999998754322 22445555544332 2 25567777776 6899999997543 333446777 Q ss_pred HHHHhccCCCC-C-CchhhhhhcCCccccCCC-cchhhhhcCChhhHHHHHHHhcc Q lcl|NC_019406. 539 TIKMNDPKSFI-G-QPDAIAMRRGYVSRQQEL-DQQRAARDADFQQQELEQAERHL 591 (661) Q Consensus 539 ~~~l~~~~~~l-~-~ddae~~~~g~~~~~~~~-~q~~~~~e~d~~q~~~~~~e~~~ 591 (661) .++|.++.... . ..+.....++......+. +.+++....+-++....-|=|++ T Consensus 444 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 444 IDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred HHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 78886652210 0 000000000000001111 11111111111222222222222 No 9 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.93 E-value=5.8e-25 Score=153.45 Aligned_cols=478 Identities=11% Similarity=0.001 Sum_probs=258.7 Q ss_pred CCCCCCcccccccccc---ccccCCccccCHH-H-----HHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKR---GAQQFTHLVVHPE-Y-----EYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~V~~~hPe-y-----~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) ||.+-|---|.-.-.- -.....+...-.+ + ....+++.++.+-|.|...+..+...+...........++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 8877664433211100 0111111111111 1 11246788888889898776554433333333333222222 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) - .|.+ .|+++.+|+..+|++|.+||+++.-.+. ..++++.+ ..++++.....+. T Consensus 81 ~-~ri~-~n~~~~ivd~~~~yl~g~~~~~~~~d~~----------------------~~~~l~~~--~~n~~~~~~~~~~ 134 (503) T protein:vir:59 81 N-NRTS-HAWHKLFVDQKTQYLVGEPVTFTSDNKT----------------------LLEYVNEL--ADDDFDDILNETV 134 (503) T ss_pred c-ceee-cchHHHHHHHHHhhhhcCCeeeccCcHH----------------------HHHHHHHH--HhcCHHHHHHHHH Confidence 1 1223 7999999999999999999998521111 12222222 2468999999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|.... ..+|-+..++|.+++-..-+...+ .+..+ +|.+.....+ ....-++..|.. T Consensus 135 ~~~~~~G~~~~~v~~d~------dg~~~i~~~~p~~~~~i~d~~~~~--~~~~~-ir~~~~~~~~---~~~~~~~evy~~ 202 (503) T protein:vir:59 135 KNMSNKGIEYWHPFVDE------EGEFDYVIFPAEEMIVVYKDNTRR--DILFA-LRYYSYKGIM---GEETQKAELYTD 202 (503) T ss_pred HHHhhCCeEEEEEeecC------CCceEEEEEccceeEEEEeCCCCC--ceEEE-EEEEEEecCC---CceEEEEEEEeC Confidence 99999999999997653 246888899998876532222211 12111 1111111000 001111222222 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccc--------ccc Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQ--------ARD 303 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~--------~~~ 303 (661) +.|. .+ ....+.... ... T Consensus 203 ~~i~-----------------------------------------------------~~-~~~~~~~~~~~~~~~~~~~~ 228 (503) T protein:vir:59 203 THVY-----------------------------------------------------YY-EKIDGVYQMDYSYGENNPRP 228 (503) T ss_pred CcEE-----------------------------------------------------EE-EEcCCccccccccccccccc Confidence 2111 11 011110000 000 Q ss_pred ceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc--eeEec Q lcl|NC_019406. 304 VYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS--EYHIG 381 (661) Q Consensus 304 ~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~--~l~iG 381 (661) .......-++++.|||+.+... .+.. +=|.++..|-=+.=+..|++.+.+.+.+.|+++++|.+..+.. ...+. T Consensus 229 ~~~~~~~~~~~~~vPiv~~~nn--~~~~--sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~ 304 (503) T protein:vir:59 229 HMTKGGQAIGWGRVPIIPFKNN--EEMV--SDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLR 304 (503) T ss_pred ceeecceeccCCccceEEecCC--CCCC--cchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhh Confidence 0111112356899999988542 2222 2222222222233345688888899999999999998654422 13355 Q ss_pred ccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) ...++.+++ +++++|+..+. +.+..+..++.+++.|..++.-.- .....+++.||++..............+...+. T Consensus 305 ~~~~~~~~~-~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~ 382 (503) T protein:vir:59 305 YHSVIKVSG-DGGVDTLRAEI-PVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIR 382 (503) T ss_pred cccceeccC-CCcceeEeccC-CHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 566777775 56899998765 457888899999999888753221 112234678999999998888888889999999 Q ss_pred HHHHHHHHHHHHHcCCCCCC----cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHH Q lcl|NC_019406. 461 DGMTSVVRYWLMFRDIPLTD----TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLE 536 (661) Q Consensus 461 ~Al~~aL~~~A~w~G~~~~~----~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~E 536 (661) .+|.+++++++.+++..... ...+.|.+++ -.+.. ..+.++++++++++|.||++|.+.. +|...+++ T Consensus 383 ~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~-~~p~d-~~~~~~~~~kl~~~GiiS~et~l~~------l~~v~d~~ 454 (503) T protein:vir:59 383 AGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTR-TRIQN-DSEIVQSLVQGVTGGIMSKETAVAR------NPFVQDPE 454 (503) T ss_pred HHHHHHHHHHHHHHHhccCcccccccceeEEeCC-CCCCC-HHHHHHHHHHHHhCCCCchHHHHHh------CCCCCCHH Confidence 99999999999998753322 1234555543 22333 2467889999999999999999754 33334567 Q ss_pred HHHHHHhccCCCCCCchhhhhhcCCccccC-CCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhh Q lcl|NC_019406. 537 EFTIKMNDPKSFIGQPDAIAMRRGYVSRQQ-ELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTS 608 (661) Q Consensus 537 ee~~~l~~~~~~l~~ddae~~~~g~~~~~~-~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~ 608 (661) +|.++|+++... ....+........ ...+++..++.+ .++ ...+|+++ T Consensus 455 ~E~~ri~~E~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~-~~~~g~~~ 503 (503) T protein:vir:59 455 EELARIEEEMNQ-----YAEMQGNLLDDEGGDDDLEEDDPNAG------------------AAE-SGGAGQVS 503 (503) T ss_pred HHHHHHHHHHHH-----HHhhhccccCccCCCCCCCcCCCCCC------------------ccc-CCCCCCcC Confidence 777777654210 0000101000000 000000011100 000 00011111 No 10 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.92 E-value=1e-24 Score=152.07 Aligned_cols=452 Identities=8% Similarity=-0.030 Sum_probs=247.0 Q ss_pred ccccccccccccCCccccCHHHH--------HHHHHHHHHHHHhcchHHH-HhCCcccCCCCCCC-ChHHHHHHHhhh-- Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYE--------YYRPDWAKIRDAIAGEREI-KAQGVKYLKAPKGF-DDEDYANYLDRA-- 76 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~--------a~~~~W~~irD~~~G~~~v-r~~g~~YLPk~~~E-~~~~Y~~rl~rA-- 76 (661) .|+..+.---.+.+++ |+.. ...+++.....-|.|.... +......+++.... ....++....++ T Consensus 1 ~~~~~~~~~~~~~~~~---~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:10 1 MTLYKLIDDIEAQGIL---PKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHhhccccCCC---HHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 3333332222222211 1111 1122222333333332111 11111111111100 000111111111 Q ss_pred -cccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 77 -AFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 77 -~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) .-.|+.+.+|+..+|.+|.+||+++.-++ .+.-..+.++++++... ++++.....+.+.++ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~-----------------~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~ 139 (474) T protein:vir:10 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDEN-----------------AEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAA 139 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCC-----------------CcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHh Confidence 45899999999999999999999852111 11223455556666433 589999999999999 Q ss_pred hhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhh Q lcl|NC_019406. 156 AMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQ 235 (661) Q Consensus 156 ~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi 235 (661) .+|+|+++|-... ..+|.+..++|.+++-+. ++.+ +.+.-|+... .. T Consensus 140 ~~G~a~~~~~~d~------~~~~~~~~i~p~~~~~v~-d~~~--~~~~~i~~~~--~~---------------------- 186 (474) T protein:vir:10 140 ICGYGARLAYIDT------NGDIRIKNIDPYNVIFVG-DNIL--EPTYSLRYFY--EK---------------------- 186 (474) T ss_pred hcCeEEEEEEeCC------CCeeEEEEEcccceEEEE-cCCC--ceEEEEEEEE--Ee---------------------- Confidence 9999999995432 235888899998875442 2211 1121111110 00 Q ss_pred cchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccc Q lcl|NC_019406. 236 RTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLP 315 (661) Q Consensus 236 ~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~ 315 (661) +.. .-..++.+.++... .+|. +.....+... .....-++++ T Consensus 187 -------------------------~~~----~~~~~~~~~~y~~~----~~~~---~~~~~~~~~~---~~~~~~~~~g 227 (474) T protein:vir:10 187 -------------------------DDD----NGTDYVYAEFYDNA----YYYV---FRGEGIDALQ---EVGRYEHLFD 227 (474) T ss_pred -------------------------eCC----CceEEEEEEEEcCc----eEEE---EeecCCCccc---ccccccCCCC Confidence 000 01122333333221 1222 2222111111 1111236799 Q ss_pred eeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCCCcc Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKESGIP 395 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ga~~ 395 (661) .||||.+.. +.. +.+=|.++..|-=+.-...|++.+.+.+.+.|+++++|....+.....+...+++.++++++++ T Consensus 228 ~vPvv~~~n--~~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~ 303 (474) T protein:vir:10 228 YNPLFGVPN--NKE--MIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDV 303 (474) T ss_pred ccceEEecC--CCC--CCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCce Confidence 999998753 222 2334556666655666678999999999999999999976444322233344556665568899 Q ss_pred eEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019406. 396 GIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFR 474 (661) Q Consensus 396 ~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~ 474 (661) +|+..+. +.+..+..++.+++.|...+.-.- ....-+++.||++.+................+..++.+.+++++.++ T Consensus 304 ~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 382 (474) T protein:vir:10 304 KYLTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSAL 382 (474) T ss_pred eEEeccC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999765 568889999999999998763221 11122357899999888888888888888899999999999999998 Q ss_pred CCCCCC-----cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCC Q lcl|NC_019406. 475 DIPLTD-----TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFI 549 (661) Q Consensus 475 G~~~~~-----~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l 549 (661) +....+ ...+.+.+++.. +.. ..+.++++.++ .|.||++|.+..| +.+ .+.++|.++|+++.. T Consensus 383 ~~~~~~~~~~~~~~i~~~f~~~~-p~d-~~e~a~~~~kl--~g~iS~et~~~~l---~~v---~d~~~E~eri~~E~~-- 450 (474) T protein:vir:10 383 KRKGYNLDDDSYLNLIFKFTRNI-PVN-KLEESQVLINL--KGQVSERTRLGQS---QLV---DDVDYELDEMEKESL-- 450 (474) T ss_pred hhccCCCCccccccceEEeCCCC-CCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHHHHHHHHH-- Confidence 754221 123444443322 222 24566666665 5999999997644 333 356778888876532 Q ss_pred CCchhhhhhc---CCccccCCCcchhhhhcCC Q lcl|NC_019406. 550 GQPDAIAMRR---GYVSRQQELDQQRAARDAD 578 (661) Q Consensus 550 ~~ddae~~~~---g~~~~~~~~~q~~~~~e~d 578 (661) +.+...++ |..+..+ ...++| T Consensus 451 --e~~~~~~~~~~~~~~~~~------~~~~s~ 474 (474) T protein:vir:10 451 --EFNDKLPDIDEGDANDKS------QNNQSE 474 (474) T ss_pred --HHHhhcccccCCCcCCCC------ccccCC Confidence 11111111 1111111 122223 No 11 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.92 E-value=1e-24 Score=152.07 Aligned_cols=452 Identities=8% Similarity=-0.030 Sum_probs=247.0 Q ss_pred ccccccccccccCCccccCHHHH--------HHHHHHHHHHHHhcchHHH-HhCCcccCCCCCCC-ChHHHHHHHhhh-- Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYE--------YYRPDWAKIRDAIAGEREI-KAQGVKYLKAPKGF-DDEDYANYLDRA-- 76 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~--------a~~~~W~~irD~~~G~~~v-r~~g~~YLPk~~~E-~~~~Y~~rl~rA-- 76 (661) .|+..+.---.+.+++ |+.. ...+++.....-|.|.... +......+++.... ....++....++ T Consensus 1 ~~~~~~~~~~~~~~~~---~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (474) T protein:vir:94 1 MTLYKLIDDIEAQGIL---PKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNN 77 (474) T ss_pred CchHHHHhhccccCCC---HHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccc Confidence 3333332222222211 1111 1122222333333332111 11111111111100 000111111111 Q ss_pred -cccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 77 -AFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 77 -~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) .-.|+.+.+|+..+|.+|.+||+++.-++ .+.-..+.++++++... ++++.....+.+.++ T Consensus 78 ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~-----------------~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~ 139 (474) T protein:vir:94 78 KLNNSFDSEIVDTRVGYLHGVPVTYDLDEN-----------------AEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAA 139 (474) T ss_pred ccccchHHHHHHhHhhheeccceeEeeCCC-----------------CcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHh Confidence 45899999999999999999999852111 11223455556666433 589999999999999 Q ss_pred hhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhh Q lcl|NC_019406. 156 AMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQ 235 (661) Q Consensus 156 ~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi 235 (661) .+|+|+++|-... ..+|.+..++|.+++-+. ++.+ +.+.-|+... .. T Consensus 140 ~~G~a~~~~~~d~------~~~~~~~~i~p~~~~~v~-d~~~--~~~~~i~~~~--~~---------------------- 186 (474) T protein:vir:94 140 ICGYGARLAYIDT------NGDIRIKNIDPYNVIFVG-DNIL--EPTYSLRYFY--EK---------------------- 186 (474) T ss_pred hcCeEEEEEEeCC------CCeeEEEEEcccceEEEE-cCCC--ceEEEEEEEE--Ee---------------------- Confidence 9999999995432 235888899998875442 2211 1121111110 00 Q ss_pred cchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccc Q lcl|NC_019406. 236 RTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLP 315 (661) Q Consensus 236 ~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~ 315 (661) +.. .-..++.+.++... .+|. +.....+... .....-++++ T Consensus 187 -------------------------~~~----~~~~~~~~~~y~~~----~~~~---~~~~~~~~~~---~~~~~~~~~g 227 (474) T protein:vir:94 187 -------------------------DDD----NGTDYVYAEFYDNA----YYYV---FRGEGIDALQ---EVGRYEHLFD 227 (474) T ss_pred -------------------------eCC----CceEEEEEEEEcCc----eEEE---EeecCCCccc---ccccccCCCC Confidence 000 01122333333221 1222 2222111111 1111236799 Q ss_pred eeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCCCcc Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKESGIP 395 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ga~~ 395 (661) .||||.+.. +.. +.+=|.++..|-=+.-...|++.+.+.+.+.|+++++|....+.....+...+++.++++++++ T Consensus 228 ~vPvv~~~n--~~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~i~~~~~~~~~ 303 (474) T protein:vir:94 228 YNPLFGVPN--NKE--MIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMIQETQKSGAFELFDKDMDV 303 (474) T ss_pred ccceEEecC--CCC--CCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhhhhhhhcceeEecCCCCce Confidence 999998753 222 2334556666655666678999999999999999999976444322233344556665568899 Q ss_pred eEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019406. 396 GIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFR 474 (661) Q Consensus 396 ~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~ 474 (661) +|+..+. +.+..+..++.+++.|...+.-.- ....-+++.||++.+................+..++.+.+++++.++ T Consensus 304 ~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 382 (474) T protein:vir:94 304 KYLTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSAL 382 (474) T ss_pred eEEeccC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999765 568889999999999998763221 11122357899999888888888888888899999999999999998 Q ss_pred CCCCCC-----cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCC Q lcl|NC_019406. 475 DIPLTD-----TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFI 549 (661) Q Consensus 475 G~~~~~-----~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l 549 (661) +....+ ...+.+.+++.. +.. ..+.++++.++ .|.||++|.+..| +.+ .+.++|.++|+++.. T Consensus 383 ~~~~~~~~~~~~~~i~~~f~~~~-p~d-~~e~a~~~~kl--~g~iS~et~~~~l---~~v---~d~~~E~eri~~E~~-- 450 (474) T protein:vir:94 383 KRKGYNLDDDSYLNLIFKFTRNI-PVN-KLEESQVLINL--KGQVSERTRLGQS---QLV---DDVDYELDEMEKESL-- 450 (474) T ss_pred hhccCCCCccccccceEEeCCCC-CCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHHHHHHHHH-- Confidence 754221 123444443322 222 24566666665 5999999997644 333 356778888876532 Q ss_pred CCchhhhhhc---CCccccCCCcchhhhhcCC Q lcl|NC_019406. 550 GQPDAIAMRR---GYVSRQQELDQQRAARDAD 578 (661) Q Consensus 550 ~~ddae~~~~---g~~~~~~~~~q~~~~~e~d 578 (661) +.+...++ |..+..+ ...++| T Consensus 451 --e~~~~~~~~~~~~~~~~~------~~~~s~ 474 (474) T protein:vir:94 451 --EFNDKLPDIDEGDANDKS------QNNQSE 474 (474) T ss_pred --HHHhhcccccCCCcCCCC------ccccCC Confidence 11111111 1111111 122223 No 12 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.92 E-value=2.3e-24 Score=150.20 Aligned_cols=469 Identities=10% Similarity=0.005 Sum_probs=252.2 Q ss_pred CCC--CCCccccccccccccccCCccccCHHHH-HHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAG--LSPNSANIRRTKRGAQQFTHLVVHPEYE-YYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~V~~~hPey~-a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~ 77 (661) |.. +.|+-.=+..+..-.....+....--.. ...++++++.+.|.|...+..+...+ ..........++... | . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~-~~~~~~~~~~~~~~~-k-i 77 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYY-LLDGAKVDDFTKVNN-K-A 77 (479) T ss_pred CCCceecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccccccc-ccccccccccccCcc-e-e Confidence 211 1111111111111111111111111111 13567888899998876554332111 111111111111111 1 3 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhh Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAM 157 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~ 157 (661) -.|+.+.+|+.++|.+|.+||+++.-.+.. .++++.+ ..++++.....+.+.++.+ T Consensus 78 ~~~~~~~Ivd~~~~~l~g~p~~~~~~~~~~----------------------~~~~~~~--~~n~~~~~~~~~~~~~~~~ 133 (479) T protein:vir:79 78 INNYHKLLVDQKVGYSVGNPIVFNADDDNL----------------------TKLLNDL--LGEEFDDTITELYLNASNK 133 (479) T ss_pred ecchHHHHHHHHHhhhhcCCceeccCCHHH----------------------HHHHHHH--HhcCHHHHHHHHHHHHHhc Confidence 379999999999999999999995322222 2222222 2468999999999999999 Q ss_pred CCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) |+++++|-.+. ..+|.+..++|.+++-. +++.... .+...+........+ .+...++..|....|.-| T Consensus 134 G~~~~~v~~d~------~~~~~i~~~~p~~~~~v-~d~~~~~-~~~~~ir~y~~~~~~----~~~~~~~e~y~~~~i~~~ 201 (479) T protein:vir:79 134 GVEWLHPYINR------KGEFKYVIIPAEEAIPI-WDSKRQR-ELVAFIRFYYIEDID----GNKIKRVEYYTENDITYF 201 (479) T ss_pred CeEEEEEEeCC------CCceEEEEEccceeEEE-EeCCCCC-ceEEEEEEEEEeecC----CceEEEEEEEeCCcEEEE Confidence 99999997653 24678888899886543 2221111 122211111111100 112223333333333222 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcc--cccccceeeccCCcccc Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPL--GQARDVYTPMVRGRTLP 315 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~--~~~~~~~~p~~~g~~L~ 315 (661) +...... .... ....... +...........-++++ T Consensus 202 ~~~~~~~------------------------------------------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (479) T protein:vir:79 202 IERGNSF------------------------------------------IQEF-LYDEYGKMTDIQEGHFRINNKEQGWG 238 (479) T ss_pred EecCCcc------------------------------------------cccc-cccccccccccccccccccccccCCC Confidence 1110000 0000 0000000 00000111112235799 Q ss_pred eeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCCCCC Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDKESG 393 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~~ga 393 (661) .||||.+... .+. .+-|.++..|-=+.=...|++.+.+.+.+.|+++++|.+...... -.+....++.++ +++ T Consensus 239 ~vPvv~~~nn--~~g--~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~-~~~ 313 (479) T protein:vir:79 239 KVPFIPFKNN--EKC--VSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVD-GGG 313 (479) T ss_pred cccEEEecCC--CCC--CcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecC-CCC Confidence 9999988543 222 233555555544555577889999999999999999975433221 123334456666 467 Q ss_pred cceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 394 IPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMF 473 (661) Q Consensus 394 ~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w 473 (661) +++|++.+. +.+..+..++.+++.|...+.-.--...+.++.|+++..................+..++.+++++++.| T Consensus 314 ~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 392 (479) T protein:vir:79 314 GVDKLEINI-PVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEY 392 (479) T ss_pred cceEEeccC-CHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999876 5688899999999999877532211223456789999999988888889999999999999999999999 Q ss_pred cCCCCC---CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCC Q lcl|NC_019406. 474 RDIPLT---DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIG 550 (661) Q Consensus 474 ~G~~~~---~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~ 550 (661) ++.... +..++.|.+++ ..+.. ..+.++++.++ .|.||.+|.+..| +.+ .+.++|.++|+++... T Consensus 393 ~~~~~~~~~~~~~i~i~f~~-~~p~~-~~~~a~~~~kl--~g~iS~et~l~~l---~~v---~d~~~E~~ri~~E~~~-- 460 (479) T protein:vir:79 393 LKISGNKSYDYKTVQITFNH-SMIIN-EAEKIDMAAKS--TGIVSDETIVSNH---PWV---EDVNDELERLKKQEDT-- 460 (479) T ss_pred HhccCCCccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCcHHHHHHhC---CCC---CCHHHHHHHHHHHHHH-- Confidence 876532 23344555533 22222 24566666665 5999999997543 333 3456777777665221 Q ss_pred CchhhhhhcCCccccCCCcch Q lcl|NC_019406. 551 QPDAIAMRRGYVSRQQELDQQ 571 (661) Q Consensus 551 ~ddae~~~~g~~~~~~~~~q~ 571 (661) +.+..+.-+.......+++ T Consensus 461 --~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 461 --QKEYDDLIPNNQDGVIDET 479 (479) T ss_pred --HHHHHhccCcccCCCcCcC Confidence 0111110001111112222 No 13 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.92 E-value=1.9e-24 Score=150.69 Aligned_cols=443 Identities=11% Similarity=0.049 Sum_probs=250.6 Q ss_pred ccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHH Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAG 88 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~ 88 (661) +||.-+..- --.+....++.....+.|.|...+..+...++.+... ..++.- .|.+ .|+++.+++. T Consensus 1 l~~~~i~~~---------i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~~~-~ki~-~n~~~~Ivd~ 66 (451) T protein:vir:10 1 MELEKIRAI---------ISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDEN---PLRNAD-NRIS-HNFHEILVDE 66 (451) T ss_pred CCHHHHHHH---------HHHHHHHHHHHHHHHHHhcccCccccccccccccccc---cccccc-cccc-cchHHHHHHh Confidence 333322211 0123345667788888898976554443333332221 111111 1222 5999999999 Q ss_pred HhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccC Q lcl|NC_019406. 89 MVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAP 168 (661) Q Consensus 89 l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~ 168 (661) .+|.+|.+||+++. ++ |. +....+..+ ..++++.....+.+.++.+|+++++|-... T Consensus 67 ~~~yl~G~p~~~~~-~~-------~~---------~~~~~~~~~------~~n~~~~~~~~~~~~~~~~G~a~~~~y~de 123 (451) T protein:vir:10 67 KASYMFTYPVLFDI-DN-------NK---------ELNEKVTDV------LGNEFTRKAKNLAIEASNCGSAWLHYWIDE 123 (451) T ss_pred hhhheecccceeec-CC-------cH---------HHHHHHHHH------hccCHHHHHHHHHHHHhhcCeEEEEEeecC Confidence 99999999999841 11 00 011112222 257899999999999999999999986543 Q ss_pred CCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchh Q lcl|NC_019406. 169 SSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLA 246 (661) Q Consensus 169 a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~ 246 (661) .... ....++-+..++|++++----+.+.+ .+. .-||.+.. T Consensus 124 ~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~--~~~-~~ir~~~~---------------------------------- 166 (451) T protein:vir:10 124 EYSGEQVTNQTFKYGVVNTEEIIPIYRNGIER--ELE-AVIRYYIQ---------------------------------- 166 (451) T ss_pred CcccccccccceeEEEEcccceEEEEcCCCCC--ceE-EEEEEEEe---------------------------------- Confidence 2111 11224446667787754221111111 111 11111111 Q ss_pred hhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCC Q lcl|NC_019406. 247 ERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS 326 (661) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~ 326 (661) .....+....+++.++.++... .+|.+.....+..+ ........-+.++.||||.+... T Consensus 167 -------------~~~~~~~~~~~~~~~~e~yt~~----~~~~~~~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~nn- 225 (451) T protein:vir:10 167 -------------LEDVKGQIQKQAYTYVEFWTDK----ILDKYKFFGVSCCG---SQIEHITVQHRFNSVPFVEFSNN- 225 (451) T ss_pred -------------eecccccccceEEEEEEEEeCC----eEEEEEecccCccc---cccccccccCCCCeeeEEEeccC- Confidence 1111122223344444444332 23333322222111 11111122357999999988542 Q ss_pred CCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCC----CCCcceEeec Q lcl|NC_019406. 327 NAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDK----ESGIPGIIEF 400 (661) Q Consensus 327 ~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~----~ga~~~ylE~ 400 (661) ... .+=|.++..|-=++=...|++.+.+.+.+.|+++++|+...+... -.+...+++.++. ++++++||.. T Consensus 226 -~~~--~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~ 302 (451) T protein:vir:10 226 -IKK--QSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQI 302 (451) T ss_pred -CCC--CCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEee Confidence 222 233455555555555678999999999999999999986543221 2233444554442 3578999997 Q ss_pred CchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC Q lcl|NC_019406. 401 KGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTD 480 (661) Q Consensus 401 ~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~ 480 (661) +. +.+..+..++.+++.|...+.-.--...+.++-||++..................+..++.+.+++++.++|..+ T Consensus 303 ~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~d-- 379 (451) T protein:vir:10 303 EI-PTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVTD-- 379 (451) T ss_pred cC-CHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-- Confidence 75 568889999999999998754221122344678999999999999999999999999999999999999999764 Q ss_pred cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcC Q lcl|NC_019406. 481 TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRG 560 (661) Q Consensus 481 ~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g 560 (661) ..++.+.+++.. +.. ..+.++++.++ .|.||++|++..| |-..+++++..++.++.. +..+.+++. T Consensus 380 ~~~i~i~f~~~~-p~n-~~e~~~~~~kl--~g~iS~et~~~~~------p~v~d~~~e~~~~~ee~~----~~~~~~~~~ 445 (451) T protein:vir:10 380 YKKIQQTYTRNM-MSN-DLEDADIATKS--VGIIPTKIILRHH------PWVDDVEEAEKLYLEEKK----IQASKVSDD 445 (451) T ss_pred ccceeEEecCCC-CCC-HHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHH----HHHHHHHhh Confidence 345555554432 222 34567777776 4899999996443 333345566656644311 001111100 Q ss_pred CccccCCCcchhhhhcCChhh Q lcl|NC_019406. 561 YVSRQQELDQQRAARDADFQQ 581 (661) Q Consensus 561 ~~~~~~~~~q~~~~~e~d~~q 581 (661) +.. +- . T Consensus 446 ~~~-~~--------------~ 451 (451) T protein:vir:10 446 YNN-FT--------------E 451 (451) T ss_pred cCC-CC--------------C Confidence 000 00 0 No 14 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.92 E-value=1.3e-23 Score=146.05 Aligned_cols=459 Identities=12% Similarity=0.048 Sum_probs=252.7 Q ss_pred CCCCCCcccccc-ccccccccCCc-----cccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHh Q lcl|NC_019406. 1 MAGLSPNSANIR-RTKRGAQQFTH-----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLD 74 (661) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~V-----~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~ 74 (661) |--++|---++- .+..-....+. ..--..+....+++.++.+.|.|...+..+..++..... . +..+-. T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~---~--~~~~~~ 95 (492) T protein:vir:94 21 LYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA---V--DPLKPD 95 (492) T ss_pred eecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc---c--cccccc Confidence 333333333331 01111111111 111123445567888888988887554333222211111 1 111111 Q ss_pred hhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 75 RAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 75 rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) .-.-.|+.+.+++..+|.+|.+||+++.-.+ +..+.++.+. +++++.....+.+.+ T Consensus 96 ~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~------------------~~~~~l~~~~------~n~~~~~~~~~~~~a 151 (492) T protein:vir:94 96 DRMITNFHANLVDQKVSYIVGKPIAFKHTDD------------------EVVKRIDEVL------GNRFDDKLHSVLTGA 151 (492) T ss_pred cccccchHHHHHHHHHhhhcccCceeccCch------------------HHHHHHHHHH------hccHHHHHHHHHHHH Confidence 1134699999999999999999999852111 1112233222 467899999999999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|+++++|.... ..+|-+..++|.+++-.--+...+. +.. -+|.+.. ++. .++..| T Consensus 152 ~~~G~a~~~v~~d~------dg~~~~~~~~p~~~~~v~d~~~~~~--~~a-~ir~~~~-~~~-------~~~~~y----- 209 (492) T protein:vir:94 152 SNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWTDKEHEE--LEA-FIRMYKL-ENE-------TKVEYW----- 209 (492) T ss_pred hhCCeEEEEEEecC------CCceEEEEEcccceEEEEcCCCCCc--eEE-EEEEEee-ccc-------eeEEEE----- Confidence 99999999997643 2357788899988654211111111 111 0111110 000 000001 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) ....+++.. ... +..++.. ......+.+...-+++ T Consensus 210 ---------------------------------~~~~v~~~~-~~~---~~~~~~~--------~~~~~~~~~~~~~~~~ 244 (492) T protein:vir:94 210 ---------------------------------DKVTVNYYV-YEN---GSLIPDY--------SNNLENSKTHFSTGSW 244 (492) T ss_pred ---------------------------------ecCeEEEEE-Eec---Ceeeecc--------ccccccccccccccCC Confidence 001111111 000 0011100 0111112222233679 Q ss_pred ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDKES 392 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~~g 392 (661) +.||+|.+.. +.+. .+=|.++..|-=+.-...|++.+.+.+.+.|+++++|++..+... -.++...++.++ ++ T Consensus 245 g~vPvv~~~n--n~~~--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~-~~ 319 (492) T protein:vir:94 245 GKIPFIPFKN--NDLE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVS-DN 319 (492) T ss_pred CccceEEecC--CCCC--CCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhhccceecC-CC Confidence 9999998854 2222 233444555444555577999999999999999999987655332 234455666676 46 Q ss_pred CcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 393 GIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWL 471 (661) Q Consensus 393 a~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A 471 (661) ++++|+..+. +.+..+..++.+++.|..++.-.-.. ..-+++.||++..................+..++.+++++++ T Consensus 320 ~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~ 398 (492) T protein:vir:94 320 GGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 398 (492) T ss_pred CcceeEeccC-CHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7899988654 45778888999999998876422111 122356799999999999999999999999999999999999 Q ss_pred HHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC Q lcl|NC_019406. 472 MFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ 551 (661) Q Consensus 472 ~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ 551 (661) .++|.+. +..++.|..++... .. ..+.++++.++ .|.||++|.+..| +.++ +.++|.++|+++.. T Consensus 399 ~~~~~~~-~~~~i~v~f~~~~p-~~-~~e~~~~~~kl--~giiS~et~~~~l---~~v~---d~~~E~eri~~E~~---- 463 (492) T protein:vir:94 399 EHFDIKG-EHKDVDISFNYNKV-AN-TELQVQTAQQS--MGIVSHETVLENH---PFVE---DLQAELERIEQEQM---- 463 (492) T ss_pred HHhcCCc-ccceeeEEecCCCC-CC-HHHHHHHHHHH--hccCchHHHHHhC---CCCC---CHHHHHHHHHHHHH---- Confidence 9999765 34556666543222 22 34567777766 4899999997544 4333 45677777765421 Q ss_pred chhhhhhc---CCccccCCCcch-hhhhc Q lcl|NC_019406. 552 PDAIAMRR---GYVSRQQELDQQ-RAARD 576 (661) Q Consensus 552 ddae~~~~---g~~~~~~~~~q~-~~~~e 576 (661) +.++.+++ +..+...+.+++ +.+.| T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 464 EYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHhhccccccccCCCCccccCCccccCC Confidence 11111111 111111100000 00111 No 15 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.91 E-value=2.3e-23 Score=144.74 Aligned_cols=450 Identities=11% Similarity=0.030 Sum_probs=248.0 Q ss_pred CCCCCCcccccccccc----ccccCCccccCHHH----HHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKR----GAQQFTHLVVHPEY----EYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~~~V~~~hPey----~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |+-++-|..++.-... ...+.+.+...--+ ...+++++++.+.|.|...+..+ ........+ | T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~-------~~~~~~~~~--k 71 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTA-------PEKETGADN--R 71 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccC-------cccccCCcc--e Confidence 9988888876652211 22333333211111 23467899999999997554322 111111122 1 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) .-.|+.+.+++.++|++|.+||+++. ++ |. .....|+.+. +.++++.+.+.+++ T Consensus 72 ----i~~n~~~~Ivd~~~~~l~g~p~~~~~-~~-------d~---------~~~~~l~~~~-----~~n~~~~~~~~~~~ 125 (470) T protein:vir:99 72 ----IVVNSAKYVVDVYNGYFCGIEPKLAL-LN-------DS---------SKIDEIARWN-----RQENFFDTINEISK 125 (470) T ss_pred ----eecchHHHHHHHHhhhhccCCeeEee-CC-------ch---------hHHHHHHHHH-----HhcCHhHHHHHHHH Confidence 34699999999999999999999852 21 10 0112223222 35799999999999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++.+|+++++|-... ..+|.+..++|.+++-. +++..++..+..| |.+... + T Consensus 126 ~~~~~G~~~~~v~~d~------dg~~~i~~~~p~~~~~i-~d~~~~~~~~~~v--r~~~~~-~----------------- 178 (470) T protein:vir:99 126 QCDIFGRSIASIYQGE------DARPHLMYSSPNHAFII-YDDTVQRQPLAFV--HYQIDN-S----------------- 178 (470) T ss_pred HHHhcCeeEEEEEeCC------CCeEEEEEEccceeEEE-EcCCCCcceEEEE--EEEEEe-c----------------- Confidence 9999999999995432 24688888999886422 1221111111111 111100 0 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) . .....+ ..++... .+|++.....+.. ...-....+ T Consensus 179 --------------------------------~--~~~~~~-~~~~~~~----~~~~~~~~~~~~~-----~~~~~~~~~ 214 (470) T protein:vir:99 179 --------------------------------N--NWTDAY-GVIQYAD----KFYKFKGYDIEED-----TNAAGYAIN 214 (470) T ss_pred --------------------------------C--CeeEEE-EEEEecC----eEEEEEecccccc-----ccccccccc Confidence 0 000011 1111111 1222221111110 111112236 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC---ce-eEecccceeec Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA---SE-YHIGPGRVWVV 388 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~---~~-l~iGs~~~~~l 388 (661) +++.||||.+.. +.+. .+=+.++..|-=+.=+..|++.+++.+.++|+++++|...... +. ..+.....+.+ T Consensus 215 ~~g~vPvv~~~n--~~~g--~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~ 290 (470) T protein:vir:99 215 PYGLVPAVEFFE--NEER--QGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYV 290 (470) T ss_pred CCCccceEeecC--CCCC--CcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeee Confidence 799999998753 2222 2223344444333334678889999999999999999753321 11 23344555555 Q ss_pred CC----CCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 389 DK----ESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 389 p~----~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) +. .+++++|+..+. .....+..++.+++.|..++.-.- ....-+++.||++..............+-..+..++ T Consensus 291 ~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 369 (470) T protein:vir:99 291 SQLDPDTNPQIGFIAKPD-ADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSL 369 (470) T ss_pred cCCCCCCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 42 456899998654 557778888999999887753221 111123567999999888888888888899999999 Q ss_pred HHHHHHHHHHcCCCCCC---cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHH Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTD---TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTI 540 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~---~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~ 540 (661) .+.+++++.+++..... ..++.+.+++... .. ..+.++++.++ .|.||++|.+..| +.+ +.++|.+ T Consensus 370 ~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p-~~-~~e~a~~~~kl--~giis~et~l~~l---~~v----d~~~E~e 438 (470) T protein:vir:99 370 MQLYRIVLATLFNNKQDQELWSELDFKFTRNLP-ED-MASAIDNAKNA--EGIVSKKTQLGMI---PDI----EPDAEMK 438 (470) T ss_pred HHHHHHHHHHHhccCCcccccccceEEeCCCCC-cC-HHHHHHHHHHH--hccCCHHHHHHhC---CCC----CHHHHHH Confidence 99999999998754322 2244454433222 22 34566777766 4899999997654 222 4556677 Q ss_pred HHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhH Q lcl|NC_019406. 541 KMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQ 582 (661) Q Consensus 541 ~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~ 582 (661) +|+++.. +..+..+.. ....+......+ -++| T Consensus 439 ri~~E~~----~~~~~~~~~----~~~~d~~~~d~~--~ee~ 470 (470) T protein:vir:99 439 QIAKEKA----DAIKQTQQL----SMPIDILKRDNN--AEEE 470 (470) T ss_pred HHHHHHH----HHHHHHHhh----cCCCCcCCCCCC--ccCC Confidence 7765521 111111100 000011100111 1111 No 16 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.91 E-value=1.8e-23 Score=145.26 Aligned_cols=464 Identities=11% Similarity=0.040 Sum_probs=251.7 Q ss_pred CCCCC-Ccc---cc--ccc--cccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGLS-PNS---AN--IRR--TKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~~-~~~---~~--~~~--~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |+-++ |+- .| +.+ .+.-...-.+..-...+....+++.++...|.|...+..+.. ++.. ....++.+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~----~~~~-~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPP----KRDV-NGDYDETK 75 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccc----cccc-cccccccc Confidence 66542 111 11 111 111112223444455566667788888888888654432211 1111 11111111 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) -..-.-.|+.+.+++..+|++|.+||+++.-.+ +..+.++.++ +++++.....+.+ T Consensus 76 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d------------------~~~~~l~~~~------~n~~~~~~~~~~~ 131 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDND------------------KALKQIQHTL------NHKWDDKLVDILT 131 (478) T ss_pred ccceeccchHHHHHHHHHhhhccCCeeeecCCh------------------HHHHHHHHHH------hcCHHHHHHHHHH Confidence 111245699999999999999999999852111 1223344433 3578999999999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++.+|+++++|.+.. ..+|-+..++|.+++--- ++ +....+..+ ++.+. .++ ..++..|... T Consensus 132 ~~~~~G~~~~~~~~d~------~g~~~~~~~~p~~~~~i~-d~-~~~~~~~~~-v~~~~-~~~-------~~~~~~y~~~ 194 (478) T protein:vir:10 132 AASNKGIEWVQPYVDE------EGEFKTFRVPAEQAVPIW-TN-KERDELQAF-IRVYE-LDG-------AERVEYWTKD 194 (478) T ss_pred HHHhcCeEEEEEEecC------CCeeEEEEEcccceEEEE-cC-CCCCceEEE-EEEEE-ecC-------ceEEEEEeCC Confidence 9999999999996542 235778888888764211 11 001112211 11111 000 0111222111 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) .|..|. ... +...+.. ... .......+......+ T Consensus 195 ~i~~~~---------------------------------------~~~---~~~~~~~--~~~--~~~~~~~~~~~~~~~ 228 (478) T protein:vir:10 195 DVTYYE---------------------------------------LKE---GQLIPDF--YRS--DDHIQPHYYQGNKLM 228 (478) T ss_pred eEEEEE---------------------------------------EcC---Ceeeccc--ccc--ccccccceecccccc Confidence 111110 000 0000000 000 000111111122236 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecC- Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVD- 389 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp- 389 (661) +++.||||.+.. +.+. .+=|.++-.|-=+.=...|++.+.+.+.+.|+++++|.+.++... ..+....++.++ T Consensus 229 ~~~~vPvv~~~n--~~~g--~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 304 (478) T protein:vir:10 229 SWGRVPFIPFKN--NPQE--VSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAG 304 (478) T ss_pred cCCccceEEecc--CCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecC Confidence 799999998843 3332 233444444444444577888889999999999999986554221 122223344343 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVR 468 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~ 468 (661) .+|++++|+..+. +.+..+..++.+++.|..++.-.-.. ..-+++.||++.+................+..++.++++ T Consensus 305 ~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 383 (478) T protein:vir:10 305 ESGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQ 383 (478) T ss_pred CCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4578999998665 56888899999999998875322111 112357799999988888888888889999999999999 Q ss_pred HHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCC Q lcl|NC_019406. 469 YWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSF 548 (661) Q Consensus 469 ~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~ 548 (661) +++.+.|... +..++.+.+++ ..+.+ ..+.++++.++ +|.||++|.+..| +.+ .+.+++.++|+++... T Consensus 384 li~~~~g~~~-~~~~i~i~f~~-~~p~d-~~e~a~~~~kl--~g~iS~et~~~~l---~~v---~D~~~E~~ri~~E~~~ 452 (478) T protein:vir:10 384 YIIDFYRLDV-KVQDIEITFNF-NVMVN-ELENSQIAMNS--TGLLSKETILSNH---AWV---EDPVAEMERIEQENIE 452 (478) T ss_pred HHHHHhCCCc-ccccceEEecC-CCCCC-HHHHHHHHHHH--hCCCChHHHHHhC---CCC---CCHHHHHHHHHHHHHH Confidence 9999999654 44456666543 22332 24456666665 8999999997544 433 3467778888765321 Q ss_pred CC--CchhhhhhcCCccccCCCcchhhhhcCC Q lcl|NC_019406. 549 IG--QPDAIAMRRGYVSRQQELDQQRAARDAD 578 (661) Q Consensus 549 l~--~ddae~~~~g~~~~~~~~~q~~~~~e~d 578 (661) .. ..+......+..+.++ ...+++ T Consensus 453 ~~~~~~~~~~~~~~~~~~~~------~~~~~~ 478 (478) T protein:vir:10 453 LNQQLPDIEEGLNGEQQRQS------ENNQPE 478 (478) T ss_pred HHhhccccccccCCCCCCCC------CCCCCC Confidence 11 1111111111111111 111111 No 17 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.91 E-value=7.4e-24 Score=147.39 Aligned_cols=463 Identities=11% Similarity=0.046 Sum_probs=251.0 Q ss_pred CCCCCCcccccc-ccccccccCCc-----cccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHh Q lcl|NC_019406. 1 MAGLSPNSANIR-RTKRGAQQFTH-----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLD 74 (661) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~V-----~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~ 74 (661) |--++|--.|+- .+-.-...++. ..--..+....+++.++.+.|.|...+-.+..++.... ....++... T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~---~~~~~~~~~- 96 (492) T protein:vir:97 21 LYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG---AVDPLKPDD- 96 (492) T ss_pred eeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccc---ccccccccc- Confidence 223333333321 11111111111 11112345567788888888888654322211111111 011111111 Q ss_pred hhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 75 RAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 75 rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) | .-.|+.+.+|+.++|++|.+||+++.-.+ ...+.++++ .+++++.....+.+.+ T Consensus 97 r-i~~n~~k~Ivd~~~~yl~g~p~~~~~~d~----------------------~~~~~l~~~--~~n~~~~~~~~~~~~~ 151 (492) T protein:vir:97 97 R-MITNFHANLVDQKVSYIVGKPIAFKHTDD----------------------EVVKRIDEV--LGNRFDDKLHSVLTGA 151 (492) T ss_pred c-cccchHHHHHHHHhhhhcccCceeccCch----------------------HHHHHHHHH--HhccHHHHHHHHHHHH Confidence 1 23699999999999999999999852111 112223333 1468889999999999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|+|+++|-... ..+|-+..++|++++-.--+...+ .+... +|.+.. ++. .++..|. T Consensus 152 ~~~G~a~~~v~~d~------dg~~~~~~~~p~~~~~i~d~~~~~--~~~~~-vr~~~~-~~~-------~~~~~y~---- 210 (492) T protein:vir:97 152 SNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWTDKEHE--ELEAF-IRMYKL-ENE-------TKVEYWD---- 210 (492) T ss_pred hhcCeEEEEEEecC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEee-ccc-------eeEEEEe---- Confidence 99999999997542 235778888888765431111111 11111 111100 000 0000010 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) ...+++.. .+.+ .....+ ........+...-++| T Consensus 211 ----------------------------------~~~v~~~~-~~~~---~~~~~~--------~~~~~~~~~~~~~~~~ 244 (492) T protein:vir:97 211 ----------------------------------KVTVNYYV-YENG---SLIPDY--------SNNLENSKTHFSTGSW 244 (492) T ss_pred ----------------------------------cCeEEEEE-EecC---eeeecc--------cccccccccccccCCC Confidence 00111110 0000 000000 0111112222334679 Q ss_pred ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDKES 392 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~~g 392 (661) +.||||.+... .+. .+=|.++-.|-=+.=...|++.+.+.+.+.|+++++|.+..+... -.++...++.++. + T Consensus 245 g~vPvv~~~nn--~~g--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-~ 319 (492) T protein:vir:97 245 GKIPFIPFKNN--DLE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSD-N 319 (492) T ss_pred CCcceEEecCC--CCC--CCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCC-C Confidence 99999988542 222 222333333333444467888999999999999999987654332 2355566777774 6 Q ss_pred CcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 393 GIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWL 471 (661) Q Consensus 393 a~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A 471 (661) ++++|+..+. +.+..+..++.+++.|..++.-.-.. ..-+++.||++.+................+..++.+.+++++ T Consensus 320 ~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~ 398 (492) T protein:vir:97 320 GGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 398 (492) T ss_pred CcceeEeccC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7899998653 56788888999999998875422111 122356799999999888888889999999999999999999 Q ss_pred HHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC Q lcl|NC_019406. 472 MFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ 551 (661) Q Consensus 472 ~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ 551 (661) .++|.+. +..++.|..++ -.+.+ ..+.++++.++ +|.||++|.+..| +.+ .++++|.++|+++.... . T Consensus 399 ~~~~~~~-~~~~i~v~f~~-~~p~~-~~e~a~~~~kl--~G~iS~et~l~~l---~~v---~d~~~Eleri~~E~~~~-~ 466 (492) T protein:vir:97 399 EHFDIKG-EHKDVDISFNY-NKVAN-TELQVQTAQQS--MGIVSHETVLENH---PFV---EDLQAELERIEQEQTEY-N 466 (492) T ss_pred HHhcCCc-ccceeeEEecC-CCCCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHHHHHHHHHHH-H Confidence 9999765 34556666543 22222 24567777776 6999999996543 333 34567777876653200 0 Q ss_pred chhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHH Q lcl|NC_019406. 552 PDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAE 588 (661) Q Consensus 552 ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e 588 (661) ...+...++..+...+.+++ .+ +.+| T Consensus 467 ~~~~~~~~~~~~~~~~~~~~-----~~------~~~e 492 (492) T protein:vir:97 467 KQLPNLDDGGADSAQQQERS-----NN------KESE 492 (492) T ss_pred HhhhccccCCCCCCcccccc-----cc------cccC Confidence 00111112222211111111 11 1111 No 18 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.91 E-value=5.2e-23 Score=142.76 Aligned_cols=454 Identities=12% Similarity=0.026 Sum_probs=246.3 Q ss_pred CCCCCCccccccccccc---------cccCCccccC---HHH-HHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRG---------AQQFTHLVVH---PEY-EYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDE 67 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~---------~~~~~V~~~h---Pey-~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~ 67 (661) |.-.+=|-.|+.+...+ .+.++..... ..+ ....++|+++.+.|.|.... ++.+. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~------i~~~~--~~~~ 72 (481) T protein:vir:10 1 MTVYTINNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTD------ILAGE--RRLQ 72 (481) T ss_pred CeeEeeehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc------cccCc--cccc Confidence 33333333333322221 1122221111 122 23457788888888886321 11111 0111 Q ss_pred HHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHH Q lcl|NC_019406. 68 DYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFA 147 (661) Q Consensus 68 ~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa 147 (661) .+..+..+-.-.|+++.+++.++|.+|.+||+++.-.+. ..+.++++. +-++++.++ T Consensus 73 ~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~------------------~~~~l~~~~-----~~n~~~~~~ 129 (481) T protein:vir:10 73 KYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITHQDNQ------------------TNDKIIELN-----DLNDADEVN 129 (481) T ss_pred cccccccceeecchHHHHHHHHHhhhccCCceEecCChh------------------HHHHHHHHH-----HhcChhHHH Confidence 122222223457999999999999999999998521111 112333333 236899999 Q ss_pred HHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee Q lcl|NC_019406. 148 KTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG 227 (661) Q Consensus 148 ~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~ 227 (661) +.+.+.++.+|+++++|-... ..+|-+..++|.+++-+- ++.... .+... ++.+... T Consensus 130 ~~~~~~~~~~G~~~~~~~~d~------dg~~~i~~~~p~~~~~v~-d~~~~~-~~~~~-i~~~~~~-------------- 186 (481) T protein:vir:10 130 SDLALNLSIYGRAYEIVYRDF------EDRDTFKVLDPKSTFVVY-DQTLDK-KVVAG-VRYFEKQ-------------- 186 (481) T ss_pred HHHHHHHHhcCeEEEEEEeCC------CCeEEEEEEcccceEEEE-cCCCCC-ceEEE-EEEEEEe-------------- Confidence 999999999999999996542 246788889998875432 111111 11111 1111100 Q ss_pred eechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 228 REGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 228 ~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) +.. ...++++.++..+ .++++. .++.+.... T Consensus 187 ---------------------------------~~~-----~~~~~~~~~y~~~----~i~~~~---~~~~~~~~~---- 217 (481) T protein:vir:10 187 ---------------------------------DKD-----KVPVQHVEVYTTD----KIYYIE---IKGGTYHRV---- 217 (481) T ss_pred ---------------------------------eCC-----CceEEEEEEEecC----eEEEEE---ecCCceeec---- Confidence 000 0111222222221 122221 111111101 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc---e------e Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS---E------Y 378 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~---~------l 378 (661) ...-++++.||||.+... .+. .+=+.++..|-=+.-+..|++.+.+.+.+.|+++++|....+.+ . + T Consensus 218 ~~~~~~~g~vPvv~~~n~--~~g--~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 293 (481) T protein:vir:10 218 EEVEHYYNDVPIIEYLND--QFK--QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMI 293 (481) T ss_pred ccccccCCceeEEEeecC--CCC--CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccce Confidence 111256899999987542 222 23344444444455566799999999999999999986433221 1 1 Q ss_pred EecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHH Q lcl|NC_019406. 379 HIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIM 457 (661) Q Consensus 379 ~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~ 457 (661) .+..+.....+.++++++|+..+. +.+..+..++.+.+.|..++.-.- .....+++.||++................. T Consensus 294 ~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 372 (481) T protein:vir:10 294 HLEPGTNANGSEGKAEVKYVYKQY-DVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKER 372 (481) T ss_pred eccccccccCCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 222222223334467889998765 457788888999999888754221 111223567899988888888888899999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCC---cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCC Q lcl|NC_019406. 458 ALEDGMTSVVRYWLMFRDIPLTD---TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQT 534 (661) Q Consensus 458 ~le~Al~~aL~~~A~w~G~~~~~---~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~ 534 (661) .+..++.+++++++++++..... ..++.+.+++. .+.+ .++.++++.++ .|.||.+|.+..| +.+ .+ T Consensus 373 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~-~~~~-~~~~a~~~~kl--~g~is~et~~~~l---~~i---~d 442 (481) T protein:vir:10 373 LFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPN-LPKS-MMESINAFNAL--SGGVSESTRLSLL---DFI---DN 442 (481) T ss_pred HHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCC-CCcC-HHHHHHHHHHH--hccCChHHHHHhC---CCC---CC Confidence 99999999999999998765432 23445555432 2222 24566777766 5899999997543 333 34 Q ss_pred HHHHHHHHhccCCCCCCchhhhhhcCCccccCC-CcchhhhhcC Q lcl|NC_019406. 535 LEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQE-LDQQRAARDA 577 (661) Q Consensus 535 ~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~-~~q~~~~~e~ 577 (661) +.+|.++|++|..... ...++.++.+.... -+-+ ..|+ T Consensus 443 ~~~E~~ri~~E~~~~~---~~~~~~~~~~~~~~~~~~d--d~~g 481 (481) T protein:vir:10 443 PKEELEKMQEEEAQRE---KQADKRGYGEAFENHLNVD--DSNG 481 (481) T ss_pred HHHHHHHHHHHHHHHH---hhhhhccCCccCCCCCCCC--CCCC Confidence 6777778876532111 01111121111110 0001 1111 No 19 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.91 E-value=1.5e-23 Score=145.69 Aligned_cols=469 Identities=12% Similarity=0.064 Sum_probs=247.3 Q ss_pred CCCC----------------CCccccccccccccccCCccc-----cC--HHHHHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MAGL----------------SPNSANIRRTKRGAQQFTHLV-----VH--PEYEYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~~~----------------~~~~~~~~~~~~~~~~~~V~~-----~h--Pey~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |.-. -|--+|+.-.-...+.-.+.. .. -......++++++.+.|.|...+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~---- 76 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE---- 76 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccc---- Confidence 1111 134444443222222211111 01 11233467888999999886543111 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) + ......|+.. .| +-.|+.+-+++.++|.+|.+||+++.-.+. ..+.++++- T Consensus 77 -~---~~~~~~~~~~-~k-i~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~----------------------~~~~l~~~~ 128 (512) T protein:vir:97 77 -L---TRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQCQDDDKD----------------------VLEAIEAFN 128 (512) T ss_pred -c---CcccccccCc-ce-eecchHHHHHHHHhhhhcccCceeccCChH----------------------HHHHHHHHH Confidence 1 1111112211 12 347999999999999999999998521111 122233332 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +-++++.....+.+.++.+|+++++|-... ..+|-+..++|.+++-.- ++......+.-|+. T Consensus 129 -~~n~~~~~~~~~~~~~~i~G~ay~~vy~de------d~~~~i~~~~p~~~~~iy-d~~~~~~~~~~vr~---------- 190 (512) T protein:vir:97 129 -DLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIY-DNTIERNSIAGVRY---------- 190 (512) T ss_pred -hhcCHHHHHHHHHHHHHhcCeEEEEEEeCC------CCceEEEEEcccceEEEE-cCCCCCceEEEEEE---------- Confidence 336899999999999999999999997542 235777888888765431 11111111111111 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) |... ... ......++.+.++..+ .+|++.. ..+. T Consensus 191 -----------~~~~----------------------------~~~--~~~~~~~~~~~vyt~~----~i~~~~~-~~~~ 224 (512) T protein:vir:97 191 -----------LRTK----------------------------PID--KTDEDEVFTVDLFTSH----GVYRYLT-SRTN 224 (512) T ss_pred -----------EEee----------------------------ecc--ccccceEEEEEEEeCC----cEEEEEe-cCCC Confidence 1000 000 0011222333333222 1222221 1111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE 377 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~ 377 (661) .. ...........++++.||||.+.. +.+. .+=|.++..|-=+.-...|++.+.+.+.+.|+++++|....+... T Consensus 225 ~~-~~~~~~~~~~~~~~g~vPvv~~~n--n~~~--~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (512) T protein:vir:97 225 GL-KLTPRENGFESHSFERMPITEFSN--NERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (512) T ss_pred cc-cccccccccccccCcccceEeecC--CCCC--CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchh Confidence 11 111112223357899999998753 2222 233555555555555678999999999999999999965433221 Q ss_pred eE-eccccee-------------ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHH Q lcl|NC_019406. 378 YH-IGPGRVW-------------VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSA 442 (661) Q Consensus 378 l~-iGs~~~~-------------~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~ 442 (661) +. ...+..+ .-+.++++++|+..+ .+.+..+..++.+.+.|...+.-. +....-+++.||++.+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~-~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~ 378 (512) T protein:vir:97 300 VRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK 378 (512) T ss_pred hhhhhhcccccccccchhhcccccCCCCCcceEEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHH Confidence 11 0001111 112346789999865 456778888899999988765322 1111123577999999 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHH Q lcl|NC_019406. 443 LREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPID 517 (661) Q Consensus 443 ~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~e 517 (661) ................+..++.+.+++++.+++.... +-..+.+.+++... .. ..+.++++.++ .|.||++ T Consensus 379 ~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p-~~-~~e~~~~~~kl--~giiS~e 454 (512) T protein:vir:97 379 YKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP-KS-LIEELKAYIDS--GGKISQT 454 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCC-cC-HHHHHHHHHHH--hccCchH Confidence 9988888888888999999999999999998754321 12234555543222 22 34567777776 5999999 Q ss_pred HHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHh Q lcl|NC_019406. 518 ALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAER 589 (661) Q Consensus 518 t~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~ 589 (661) |++..| +.+ .+.++|.++|+++... .....+..........+......+++ ....||. T Consensus 455 t~~~~l---~~v---~d~~~E~eri~~E~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 512 (512) T protein:vir:97 455 TLMSLF---SFF---QDPELEVKKIEEDEKE----SIKKAQKGIYKDPRDINDDEQDDDTK----DTVDKKE 512 (512) T ss_pred HHHHhC---CCC---CCHHHHHHHHHHHHHH----HHHHHhhcccCCCCCCCCCCCCCCcc----ccccccC Confidence 997554 333 3466777788765221 01111100000111111111111111 1111111 No 20 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.91 E-value=1.8e-23 Score=145.21 Aligned_cols=424 Identities=10% Similarity=0.016 Sum_probs=246.7 Q ss_pred CCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHH Q lcl|NC_019406. 4 LSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTS 83 (661) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~ 83 (661) ++++-++ .---.+....+++.++++.|.|...+ |.+...... .-..|+ -.|+.+ T Consensus 1 l~~~~l~--------------~~i~~~~~~~~r~~~l~~yy~g~~~i-------l~~~~~~~~-~~~~ki----~~n~~~ 54 (429) T protein:vir:98 1 MTKDLLS--------------ELIQKHRSFNLSYSAYKQLYEGDHAI-------LQQKQKEQY-KPDNRL----VVNFAK 54 (429) T ss_pred CCHHHHH--------------HHHHHHHHHHHHHHHHHHHhcccccc-------ccccccccC-CCccee----ecchHH Confidence 2222211 00012345568899999999997544 222221111 111122 369999 Q ss_pred HHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEE Q lcl|NC_019406. 84 QTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGAL 163 (661) Q Consensus 84 ~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvL 163 (661) .+|+..+|.+|.+||+++.-.+. ..+.++++ .+.++++.++..+.+.++.+|+++++ T Consensus 55 ~ivd~~~~~l~g~~~~~~~~~~~----------------------~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~ 111 (429) T protein:vir:98 55 YIVDTFNGYFIGVPVQTSHENKQ----------------------VSNYLELL-DGYNDQDDNNAELSKICSIYGHGYEL 111 (429) T ss_pred HHHHHHhhhhcccCceeecCChH----------------------HHHHHHHH-HhhcCHhHHHHHHHHHHhhcCeEEEE Confidence 99999999999999998521111 11222233 23578999999999999999999999 Q ss_pred EeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhc Q lcl|NC_019406. 164 VDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRA 243 (661) Q Consensus 164 VD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~ 243 (661) |-... ..+|-+..++|.+++--- ++......+..|+ .+..+ T Consensus 112 v~~d~------~g~~~~~~~~p~~~~~v~-dd~~~~~~~~~i~---------------------~~~~~----------- 152 (429) T protein:vir:98 112 VFNDE------NAEAGITYLTPLEAFIVY-DDSIRQKPLFAVR---------------------YFYNK----------- 152 (429) T ss_pred EEecC------CCcEEEEEEcccceEEEE-eCCCCCceEEEEE---------------------EEEec----------- Confidence 96542 235778888888764211 1111111111111 00000 Q ss_pred chhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEe Q lcl|NC_019406. 244 GLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFG 323 (661) Q Consensus 244 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~ 323 (661) + .++....+... +++.+..+..+. .......++++.||||.+. T Consensus 153 ---------------------~-----~~~~~~~~~~~-------~~~~~~~~~~~~----~~~~~~~~~~g~vPvv~~~ 195 (429) T protein:vir:98 153 ---------------------G-----GVLEGSYSDAS-------NITYFKDGEKGI----EIGESEPHPFDGVPMIEYV 195 (429) T ss_pred ---------------------C-----ceEEEEEEeCc-------eEEEEEecCCce----EecccccccCCccceEEec Confidence 0 01111111111 011111111111 1111224679999999875 Q ss_pred cCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCC---CcceEeec Q lcl|NC_019406. 324 SMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKES---GIPGIIEF 400 (661) Q Consensus 324 ~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~g---a~~~ylE~ 400 (661) . +.+ +.+-|.++..|-=+.-+..|++.+.+.+.++|+++++|.+..+...-.+-..+++.++..+ ++++|+.. T Consensus 196 n--~~~--g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 271 (429) T protein:vir:98 196 E--NEE--RQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELDDETLKSLRDTRIINLKDTDAQQLTVEFLQK 271 (429) T ss_pred C--CCC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCcchhhhHhhCceeeccCCCCCCcceeEEee Confidence 3 223 3344666666666778888999999999999999999986544322233334566666543 46899987 Q ss_pred CchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC Q lcl|NC_019406. 401 KGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTD 480 (661) Q Consensus 401 ~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~ 480 (661) +. +.+..+..++.+.+.|.....-.--...+.++.||++.+.............-..+..++.+++++++.+++..... T Consensus 272 ~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 350 (429) T protein:vir:98 272 PD-ADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGP 350 (429) T ss_pred cC-CHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 65 56778888899999888765321112234467799999888888888888888999999999999999998765432 Q ss_pred --cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhh Q lcl|NC_019406. 481 --TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMR 558 (661) Q Consensus 481 --~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~ 558 (661) ..++.|..++ -.+.. ..+.++++.++ +|.||++|.+..| +.+ .++++|.++|+++.. +..+.++ T Consensus 351 ~d~~~i~v~f~~-~~p~~-~~~~a~~~~kl--~g~is~et~~~~l---~~v---~d~~~E~~ri~~E~~----~~~~~~~ 416 (429) T protein:vir:98 351 KDWIGIKYKFTR-NLPAN-LLEESQIAGNL--AGIVSEETQVGVL---SIV---ENPQKEIERKNSDKS----TLISRQA 416 (429) T ss_pred cccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHHHHHHHHH----HHHHHHH Confidence 2234455543 22333 34567777776 7899999997544 333 346677777776522 1111111 Q ss_pred cCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 559 RGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 559 ~g~~~~~~~~~q~~~~~e~d~~ 580 (661) +....+ ..+.|++ T Consensus 417 -~~~~~~--------~~~~~~~ 429 (429) T protein:vir:98 417 -GGLNGQ--------NTTTILE 429 (429) T ss_pred -hhhcCC--------CCCCCCC Confidence 111111 1111222 No 21 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.90 E-value=9.8e-23 Score=141.25 Aligned_cols=459 Identities=12% Similarity=0.063 Sum_probs=248.1 Q ss_pred CCCCCCcccccc-ccccccccCCcc-----ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHh Q lcl|NC_019406. 1 MAGLSPNSANIR-RTKRGAQQFTHL-----VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLD 74 (661) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~V~-----~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~ 74 (661) |-=.-|-+.+|. .+..-..++++. .--..+....++|.++.+.|.|...+-.+...|...... ..++ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~--~~ 75 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV---DPLK--PD 75 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccc---cccc--cc Confidence 332223333332 112222222211 111234556688888899998975543322222221111 1111 11 Q ss_pred hhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 75 RAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 75 rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) .-+-.|+.+.+|+..+|++|.+||+++.-.+ ...+.++++ -.++++.....+.+.+ T Consensus 76 ~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~----------------------~~~~~l~~~--~~n~~~~~~~~~~~~~ 131 (472) T protein:vir:93 76 DRMITNFHANLVDQKVSYIVGKPIAFKHTDD----------------------EVVKRIDEV--LGNRFDDKLHSVLTGA 131 (472) T ss_pred cccccchHHHHHHHHhhhhcccCeeeccCCh----------------------HHHHHHHHH--HhccHHHHHHHHHHHH Confidence 1123599999999999999999999852111 111222223 1468899999999999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|+++++|.... ..+|-+..++|.+++-.--+...+ .+..+ +|.+.. ++.. ++..| T Consensus 132 ~~~G~~~~~v~~d~------d~~~~i~~~~p~~~~~i~d~~~~~--~~~~~-ir~~~~-~~~~-------~~~~~----- 189 (472) T protein:vir:93 132 SNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWTDKEHE--ELEAF-IRMYKL-ENET-------KVEYW----- 189 (472) T ss_pred hhcCeEEEEEEECC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEe-ecce-------eEEEE----- Confidence 99999999997643 235778888888876532111111 11111 111110 0000 00000 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) ....+++.. ...+ ..+.. .........+....++| T Consensus 190 ---------------------------------~~~~~~~~~-~~~~---~~~~~--------~~~~~~~~~~~~~~~~~ 224 (472) T protein:vir:93 190 ---------------------------------DKVTVNYYV-YENG---SLIPD--------YSNNLENSKTHFSTGSW 224 (472) T ss_pred ---------------------------------ecCeEEEEE-EecC---eeeec--------ccccccccccccccCCC Confidence 001111110 1110 00000 00111222233445779 Q ss_pred ceeeEEEEecCCCCCCccc-cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee--EecccceeecCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEK-PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY--HIGPGRVWVVDKE 391 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~-pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l--~iGs~~~~~lp~~ 391 (661) +.||||.+... .+..+. -++.+|-+ +.=...|++.+.+.+.++|+++++|.+..+.... .++...++.++. T Consensus 225 ~~vPvv~~~nn--~~g~s~~e~v~~liD---a~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~- 298 (472) T protein:vir:93 225 GKIPFIPFKNN--DLEISDIFMYKTLID---AYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD- 298 (472) T ss_pred CCcceEEecCC--CCCCCchhhhHHHHH---HHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCC- Confidence 99999988542 222211 12322221 2223677888899999999999999876543221 244556666774 Q ss_pred CCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 392 SGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYW 470 (661) Q Consensus 392 ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~ 470 (661) +++++|+..+. +.+..+..++.+++.|+.++.-.- ....-+++.||++.+................+..++.++++++ T Consensus 299 ~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li 377 (472) T protein:vir:93 299 NGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFV 377 (472) T ss_pred CCcceeEeecC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67899998654 457788888999998887753221 1112235679999998888888888999999999999999999 Q ss_pred HHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCC Q lcl|NC_019406. 471 LMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIG 550 (661) Q Consensus 471 A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~ 550 (661) +.++|.+. +...+.|..++. .+.+ ..+.++++.++ +|.||++|.+..| +.+ .++++|.++|+++.. T Consensus 378 ~~~~~~~~-~~~~i~v~f~~~-~p~~-~~~~~~~~~k~--~giis~et~l~~l---~~~---~d~~~E~~ri~~E~~--- 443 (472) T protein:vir:93 378 FEHFDIKG-EHKDVDISFNYN-KVAN-TELQVQTAQQS--MGIVSHETVLENH---PFV---EDLQAELERIEQEQM--- 443 (472) T ss_pred HHHhCCCc-ccceeeEEeCCC-CCCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHHHHHHHHH--- Confidence 99999764 334555655432 2222 24566777765 6899999986543 333 346677777765421 Q ss_pred Cchhhhhh---cCCccccCCCcchhhhhcCChhhH Q lcl|NC_019406. 551 QPDAIAMR---RGYVSRQQELDQQRAARDADFQQQ 582 (661) Q Consensus 551 ~ddae~~~---~g~~~~~~~~~q~~~~~e~d~~q~ 582 (661) +.++.++ .+..+..++- ..+.|-++| T Consensus 444 -~~~~~~~~~~~~~~d~~~~~-----~~~~~~~~e 472 (472) T protein:vir:93 444 -EYNKQLPNLDDGGADGAQQQ-----ERSNNKESE 472 (472) T ss_pred -HHHHhccCcCcccCCCCCCC-----CCCCcccCC Confidence 1111111 1111111110 111111111 No 22 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.90 E-value=5.5e-23 Score=142.61 Aligned_cols=424 Identities=9% Similarity=-0.017 Sum_probs=235.9 Q ss_pred ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccc Q lcl|NC_019406. 25 VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLP 104 (661) Q Consensus 25 ~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p 104 (661) ...--.....++|+++.+.|.|....-.... +........+ | +-.|+.+.+|+..+|.+|.+||+++.-. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~----~~~~~~~~~~-----k-i~~n~~~~ivd~~~~~l~g~~~~~~~~~ 70 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGH----RRLDDEKADY-----R-VRHKWGGYISSFATGYVIGNPVSIGVME 70 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCccccccc----ccccccCCcc-----e-eecchHHHHHHhhhhheeccCceEeeCC Confidence 2222334578899999999988633211110 0011111111 1 4579999999999999999999985211 Q ss_pred hhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeec Q lcl|NC_019406. 105 NTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYA 184 (661) Q Consensus 105 ~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~ 184 (661) .. ..+....++.+ .+.++++.....+.+.++.+|+++++|-... ..+|-+..++ T Consensus 71 ~~---------------~~~~~~~l~~~-----~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~------~~~~~i~~~~ 124 (440) T protein:vir:95 71 GG---------------SADQLSTIKDI-----EWQNDINALNSDLAFDASVYGRAYEYHFRDK------DKVDRVVLIS 124 (440) T ss_pred Cc---------------cHHHHHHHHHH-----HHhcCHhHHHHHHHHHHhhcCeEEEEEEecC------CCceEEEEEc Confidence 10 01111222222 2467999999999999999999999996432 2357788888 Q ss_pred hhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccC Q lcl|NC_019406. 185 AENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFT 264 (661) Q Consensus 185 p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~ 264 (661) |.+++--. ++......+-.|+. +. .. T Consensus 125 p~~~~~~~-d~~~~~~~~~~i~~--~~-----------------------------------------------~~---- 150 (440) T protein:vir:95 125 PLEMFVIR-DLTVEQNIIAAVHL--PI-----------------------------------------------YA---- 150 (440) T ss_pred ccceEEEE-cCCCCCceEEEEEE--EE-----------------------------------------------ec---- Confidence 98764321 11111111111100 00 00 Q ss_pred CCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHH Q lcl|NC_019406. 265 SSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNL 344 (661) Q Consensus 265 ~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl 344 (661) .....-++.. ..++++.....+..+.... ....++++.||||.+... .+ +.+-+.++..|-= T Consensus 151 ------~~~~~~vyt~----~~~~~~~~~~~~~~~~~~~----~~~~~~~g~vPvv~~~n~--~~--g~sd~e~v~~lid 212 (440) T protein:vir:95 151 ------DKVNMTVYTK----DKVITYKPYSNNSVRLVVD----DVKKHSYNDVPVVEWWNN--RF--RMGDYESEISLID 212 (440) T ss_pred ------CceEEEEEeC----CeEEEEEEecCCccceeec----ceeeccCceeeEEEeeCC--CC--CCCchhhhHHHHH Confidence 0001111111 1122222222221111111 122367999999987542 22 2344555666655 Q ss_pred HHHhhhhhHHHHHHHhcCceeEEecCCCCC---Ccee-Eecccceeec--------CCCCCcceEeecCchhHHHHHHHH Q lcl|NC_019406. 345 KHYRTYAELEHGRFFTALPTYYAPELDDSD---ASEY-HIGPGRVWVV--------DKESGIPGIIEFKGEGLKTLERAL 412 (661) Q Consensus 345 ~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~---~~~l-~iGs~~~~~l--------p~~ga~~~ylE~~g~~i~a~~~~L 412 (661) +.-...|++.+.+.+.+.|+++++|..... .+.. .+-....+.+ ..++++++|+..+. +.+..+..+ T Consensus 213 a~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~-~~~~~~~~~ 291 (440) T protein:vir:95 213 AYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQY-DVNGTEAYK 291 (440) T ss_pred HHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecC-CHHHHHHHH Confidence 666677888999999999999999963221 1100 0111111111 23457899998764 568888899 Q ss_pred HHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC---CcceEEEEe Q lcl|NC_019406. 413 NEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT---DTATLRYEI 488 (661) Q Consensus 413 ~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~---~~~~~~v~l 488 (661) +.+++.|..+..-. +....-+++.||++.+.............-..+..++.+++++++.+++...+ +...+.+.+ T Consensus 292 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f 371 (440) T protein:vir:95 292 NRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTF 371 (440) T ss_pred HHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEe Confidence 99999888764311 11112235679999888888888888888889999999999999999865432 233445555 Q ss_pred ccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCC Q lcl|NC_019406. 489 DATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQEL 568 (661) Q Consensus 489 n~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~ 568 (661) ++ ..+.+ ..+.++++.++ +|.||++|.+..| +.++ .+.|.++|.++...-..+-.+ ..|..+. T Consensus 372 ~~-~~p~~-~~~~ad~~~kl--~g~iS~et~~~~l---~~~d----~~~E~~ri~~E~~~~~~~~~~--~~~~~~~---- 434 (440) T protein:vir:95 372 HP-NIPQD-VWTEIKAYIEA--GGEISQETLMENA---SFTD----YKTEHSRILKQGGSSDLEIGQ--IVGDADV---- 434 (440) T ss_pred CC-CCCCC-HHHHHHHHHHH--hccCcHHHHHHhC---CCCC----cHHHHHHHHHHHHHhhhhHHh--hccCCCC---- Confidence 43 33333 35677788776 6899999997654 3332 234556665543211100000 0010000 Q ss_pred cchhhhhcCChh Q lcl|NC_019406. 569 DQQRAARDADFQ 580 (661) Q Consensus 569 ~q~~~~~e~d~~ 580 (661) .+.|-| T Consensus 435 ------~~~~~e 440 (440) T protein:vir:95 435 ------GQADTE 440 (440) T ss_pred ------CCcCCC Confidence 001111 No 23 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.90 E-value=5.9e-23 Score=142.47 Aligned_cols=469 Identities=12% Similarity=0.055 Sum_probs=239.8 Q ss_pred CCCC------CCccccccccccccccCCcc-------------ccCHHH----HHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MAGL------SPNSANIRRTKRGAQQFTHL-------------VVHPEY----EYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~~~------~~~~~~~~~~~~~~~~~~V~-------------~~hPey----~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |.-. +--+-||+.-=.-.+|.... ....-. ...+++++++.+-|.|...+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~------ 74 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNL------ 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc------ Confidence 1110 00111222111111222111 111111 123577888888888864431 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) +.+......++... | .-.|+.+-+++.++|.+|.+||+++.-.+. ..+.++++- T Consensus 75 --~~~~~~~~~~~~~~-k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~----------------------~~~~l~~~~ 128 (511) T protein:vir:10 75 --VELTRRKEEYMADN-R-VAHDYASYISDFINGYFLGNPIQYQDDDKD----------------------VLEAIEAFN 128 (511) T ss_pred --cccCcccccccCcc-e-eecchHHHHHHHHhhhhcccCceeecCchH----------------------HHHHHHHHH Confidence 11111112222211 2 226999999999999999999998522111 122233332 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +-++++.....+.+.++.+|+++++|-... ..+|-+..++|.+++-.--+.+. ...+..|+.- ... T Consensus 129 -~~n~~~~~~~~~~~~~~i~G~ay~~vy~de------dg~~~i~~~~p~~~~~vydd~~~-~~~~~~vr~~--~~~---- 194 (511) T protein:vir:10 129 -DLNDVESHNRSLGLDLSIYGKAYEIMIRNQ------DDETRLYKSDAMSTFVIYDNTIE-RNSIAGVRYL--RTK---- 194 (511) T ss_pred -hhcCHHHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEEcCCCC-CceEEEEEEE--Eee---- Confidence 236899999999999999999999996542 23567777888776432111111 1111111110 000 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) ... ....+.++++.++..+ .++++.. ..+. T Consensus 195 -------------------------------------------~~d--~~~~~~~~~~~iyt~~----~i~~~~~-~~~~ 224 (511) T protein:vir:10 195 -------------------------------------------PID--KTDEDEVFTVDLFTSH----GVYRYLT-SRTN 224 (511) T ss_pred -------------------------------------------ecc--cCccceEEEEEEEeCC----cEEEEEe-cCCC Confidence 000 0011222223233222 1222221 1111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE 377 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~ 377 (661) ... ..........++++.||||.+... .+. .+=|.++..|-=+.-...|++.+.++..+.|+++++|....+... T Consensus 225 ~~~-~~~~~~~~~~~~~~~vPvv~f~nn--~~g--~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (511) T protein:vir:10 225 GLK-LTPRENGFESHSFERMPITEFSNN--ERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred ccc-ccccccccccccCcceeEEEecCC--CCC--CCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh Confidence 111 111111223467999999987532 222 222344444433444577888899999999999999964332211 Q ss_pred e-Eecccceeec------------CCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHH Q lcl|NC_019406. 378 Y-HIGPGRVWVV------------DKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSAL 443 (661) Q Consensus 378 l-~iGs~~~~~l------------p~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~ 443 (661) + ....+..+.+ ..++++++||..+ .+.+..+..++.+.+.|..+..-. +....-+++-||++.+. T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~-~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:10 300 VRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hccchhccceecccccccccccccCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 1 1111222211 2346789999864 355777888889999888764311 11112235779999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019406. 444 REANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDA 518 (661) Q Consensus 444 d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et 518 (661) ............-..+..++.+.+++++.+++.... +-.++.|.+++.. +.. ..+.++++.++ .|.||++| T Consensus 379 ~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~-p~d-~~~~~~~~~kl--~G~iS~et 454 (511) T protein:vir:10 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS--GGKISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCC-CcC-HHHHHHHHHHH--hccCcHHH Confidence 988888888888899999999999999998764321 1224555554322 222 24567777777 48999999 Q ss_pred HHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 519 LYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 519 ~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) ++..| +.+ .+.++|.++|+++... ..+..+.........++......+++-+-+++| T Consensus 455 ~~~~l---~~v---~d~~~E~~ri~~E~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 455 LMSLF---SFF---QDPELEVKKIEEDEKE----SIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhC---CCC---CCHHHHHHHHHHHHHH----HHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 97554 333 3456777888765321 011111000000111111111111110000111 No 24 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.90 E-value=5.4e-23 Score=142.65 Aligned_cols=466 Identities=12% Similarity=0.049 Sum_probs=254.0 Q ss_pred CCCC-----CCccccc-ccccc--ccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGL-----SPNSANI-RRTKR--GAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~-----~~~~~~~-~~~~~--~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |+-+ .|.+.+. .+..- -...--+..-...+....+++.++.+.|.|...+..+..++-... ....++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~---~~~~~~~- 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNG---DYDETKP- 76 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccc---ccccccc- Confidence 7665 2222211 11100 011112334444566677888888998988765433322111100 0000110 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) ..=+-.|+.+.+++..+|.+|.+||+++.-.+ +..+.++.++ +++++.....+.+ T Consensus 77 -~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~------------------~~~~~l~~~~------~n~~~~~~~~~~~ 131 (478) T protein:vir:10 77 -DWRMYTNYHQNLVDQKVAYAVANPVTFGVDND------------------KALKQIQHTL------NHKWDDKLVDILT 131 (478) T ss_pred -cceeccchHHHHHHHHhhhhcccCceeecCCh------------------HHHHHHHHHH------hccHHHHHHHHHH Confidence 00123699999999999999999999852111 1223333332 3688899999999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++.+|+++++|.+.. ..+|-+..++|.+++-.--+...| .+..+ ++.+.. ++ ..++..|... T Consensus 132 ~~~~~G~~~~~v~~d~------~~~~~~~~~~p~~~~~v~d~~~~~--~~~~~-ir~~~~-~~-------~~~~~~y~~~ 194 (478) T protein:vir:10 132 AASNKGIEWVQPYVDE------EGEFKTFRVPAEQAVPIWTNKERD--ELQAF-IRVYEL-DG-------AERVEYWTKD 194 (478) T ss_pred HHhhCCeEEEEEEecC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEee-eC-------ceEEEEEeCC Confidence 9999999999997653 235778888998865331111111 12222 111111 00 0112222221 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) .|..|+ ... +...+.. +.... ............+ T Consensus 195 ~i~~~~---------------------------------------~~~---~~~~~~~-~~~~~---~~~~~~~~~~~~~ 228 (478) T protein:vir:10 195 DVTFYE---------------------------------------LKE---GQLIPDF-YRSED---HIQPHYYQGNKLM 228 (478) T ss_pred cEEEEE---------------------------------------ecC---Ceeeccc-ccccc---ccccceecccccc Confidence 111111 000 0000000 00000 0000111112236 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeec-C Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVV-D 389 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~l-p 389 (661) .++.||||.+... .. +.+-|.++..|-=+.-...|++.+.+.+.+.|+++++|.+.++... ..+....++.+ + T Consensus 229 ~~g~vPvv~~~n~--~~--g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 304 (478) T protein:vir:10 229 SWGRVPFIPFKNN--PQ--EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAG 304 (478) T ss_pred cCCcceEEEeccC--CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecC Confidence 7999999988542 22 2333555555555666678888899999999999999986554221 12223334444 3 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVR 468 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~ 468 (661) .+|++++|+..+. +.+..+..++.+++.|..++.-.- ....-+++.||++..................++.++.++++ T Consensus 305 ~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 383 (478) T protein:vir:10 305 ESGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQ 383 (478) T ss_pred CCCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4578899998765 567888889999999888753221 11122357899999999999999999999999999999999 Q ss_pred HHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCC Q lcl|NC_019406. 469 YWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSF 548 (661) Q Consensus 469 ~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~ 548 (661) +++.+.|... +..++.|++++ ..+.. ..+.++++.++ +|.||++|++..+ +.+ .+..++.++|+++... T Consensus 384 li~~~~~~~~-d~~~i~i~f~~-~~p~~-~~e~~~~~~~~--~g~iS~et~i~~~---~~v---~d~~~E~~ri~~E~~~ 452 (478) T protein:vir:10 384 YIIDFYRLDV-RVQDIEITFNF-NVMVN-ELENSQIAMNS--TGLLSKETILGNH---SWV---QDPVAEMERIEQENIE 452 (478) T ss_pred HHHHHhCCCc-ccccceEEeCC-CCCCC-HHHHHHHHHHH--hCCCChHHHHHhC---CCC---CCHHHHHHHHHHHHHH Confidence 9999999764 44456666643 22222 23455665554 7999999996433 333 3456666677655321 Q ss_pred CCCchhhhhhcCCccccCCCcchhhhhcCChhhHH Q lcl|NC_019406. 549 IGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQE 583 (661) Q Consensus 549 l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~ 583 (661) .. ........+..+.++ +.++.| |.+ T Consensus 453 ~~-~~~~~~~~~~~d~~~------~~~~d~--~~e 478 (478) T protein:vir:10 453 LN-QQLPDIEEGLNDEQQ------RQSEDN--QSE 478 (478) T ss_pred HH-HhccccCCCCccccc------ccCcCC--CCC Confidence 00 000000111111111 111111 000 No 25 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.90 E-value=9.2e-23 Score=141.39 Aligned_cols=469 Identities=12% Similarity=0.070 Sum_probs=244.9 Q ss_pred CCCC------CCccccc----cccccccccCCcc----ccCHH----H-----HHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MAGL------SPNSANI----RRTKRGAQQFTHL----VVHPE----Y-----EYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~~~------~~~~~~~----~~~~~~~~~~~V~----~~hPe----y-----~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |.-. +--+-|| ++.+...-++... ...++ + ....++++++.+.|.|...+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~----- 75 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV----- 75 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccc----- Confidence 1110 0001112 2222222221100 00111 1 2346678888888888654311 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) +..... ..++.. .| .-.|+.+.+++.++|.+|.+||+++.-.+ ...+.++++- T Consensus 76 --~~~~~~-~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~----------------------~~~~~l~~~~ 128 (511) T protein:vir:93 76 --ELTRRK-EEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDDDK----------------------DVLEVIEAFN 128 (511) T ss_pred --ccCcCc-ccccCc-ce-eecchHHHHHHHHhhhhcccCeeeccCCh----------------------HHHHHHHHHH Confidence 111111 111111 12 34799999999999999999999852111 1222233332 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +-++++.+...+.+.++.+|+++++|.... ..+|-+..++|++++-.--+.+.+ ..+..|+. T Consensus 129 -~~n~~~~~~~~~~~~~~~~G~ay~~vy~de------~~~~~i~~~~p~~~~~vydd~~~~-~~~~~vr~---------- 190 (511) T protein:vir:93 129 -DLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIYDNTIER-NSIAGVRY---------- 190 (511) T ss_pred -hhcCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEEcCCCCC-ceEEEEEE---------- Confidence 346899999999999999999999997543 235777888888764321111111 11111111 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) |.. ... .+...+.++++.++..+ .++++. ...+. T Consensus 191 -----------~~~----------------------------~~~--~~~~~~~~~~~~iyt~~----~i~~~~-~~~~~ 224 (511) T protein:vir:93 191 -----------LRT----------------------------KPI--DKTDEDEVFTVDLFTSH----GVYRYL-TSRTN 224 (511) T ss_pred -----------EEe----------------------------eec--cccccceEEEEEEEeCC----cEEEEE-ecCCC Confidence 000 000 00111223333333332 122322 11111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE 377 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~ 377 (661) .... .........++++.||||.+.. +.+. .+-|.++-.|-=+.-...|++.+.+++.+.|+++++|....+... T Consensus 225 ~~~~-~~~~~~~~~~~~g~vPvv~~~n--n~~g--~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:93 225 GLKL-TPRENGFESHSFERMPITEFSN--NERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred cccc-ccccccccccCCCccceEEecC--CCCC--CCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchh Confidence 1111 1111122346799999998753 2232 233445555544555688899999999999999999964333211 Q ss_pred e-Eeccccee------------ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHH Q lcl|NC_019406. 378 Y-HIGPGRVW------------VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSAL 443 (661) Q Consensus 378 l-~iGs~~~~------------~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~ 443 (661) + .......+ ..+.++++++||..+. ..+..+..++.+++.|..+..-. +....-+++.||++.+. T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:93 300 VRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcccccccceecccccccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 1 11111111 1123467899998654 46778888999999988765322 11111235779999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC-----cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019406. 444 REANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTD-----TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDA 518 (661) Q Consensus 444 d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~-----~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et 518 (661) ...............+..++.+.+++++.+++..... -..+.+.+++ -.+.. .++.++++.++ .|.||++| T Consensus 379 ~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~-~~p~n-~~e~~~~~~kl--~g~iS~et 454 (511) T protein:vir:93 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNR-NLPKS-LIEELKAYIDS--GGKISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCC-CCCCC-HHHHHHHHHHH--hccCchHH Confidence 9888888888999999999999999999887653211 1234454432 22222 34577777777 68999999 Q ss_pred HHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHh Q lcl|NC_019406. 519 LYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAER 589 (661) Q Consensus 519 ~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~ 589 (661) ++..| |-..+.++|.++|+++.. +.....+.........+..+....+++ ....||. T Consensus 455 ~~~~l------~~v~d~~~E~~ri~~E~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 511 (511) T protein:vir:93 455 LMSLF------SFFQDPELEVKKIEEDEK----ESIKKAQKGIYKDPRDINDDEQDDDTK----DTVDKKE 511 (511) T ss_pred HHHhC------CCCCCHHHHHHHHHHHHH----HHHHHHhhhcccCCCCCCCCCCCCccc----ccccccC Confidence 97543 333356777888876532 111111111111111111111111111 1111111 No 26 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.90 E-value=1.3e-22 Score=140.63 Aligned_cols=458 Identities=10% Similarity=0.027 Sum_probs=238.1 Q ss_pred CCCCC--Ccc-cccc-ccccccccCCcc-----ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLS--PNS-ANIR-RTKRGAQQFTHL-----VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~--~~~-~~~~-~~~~~~~~~~V~-----~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |...- |+- -+++ .+.....+..++ .---.+....+++.+..+.|.|...+-.+-. | ........... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~-~---~~~~~~~~~~~ 76 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMK-K---VDVYGNIDYDK 76 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccc-c---ccccccccccc Confidence 21100 000 0000 011111111111 1111345566677778888888655432211 1 00000001111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) ...| .-.|+.+.+++..+|++|.+||+++.-.+.. .+.++.++ .++++.....+. T Consensus 77 ~~~k-i~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~------------------~~~l~~~~------~n~~~~~~~e~~ 131 (474) T protein:vir:95 77 PDWR-ITTNFHQNLVDQKVSYVASKPVTYSCEDESV------------------LKIIHDVL------DTRWDNKLIDIL 131 (474) T ss_pred ccce-eccchHHHHHHHHHhhhccCCceeccCchHH------------------HHHHHHHH------hccHHHHHHHHH Confidence 1112 2369999999999999999999985211111 12233332 357888999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|..+. ..+|-+..++|.+++-.--+...+ .+..+ ++.+.. ++. .++..|.. T Consensus 132 ~~~~~~G~~~~~v~~d~------~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~-i~~~~~-~~~-------~~~~~y~~ 194 (474) T protein:vir:95 132 TATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPIWVDKERE--ELKSF-IRYYKF-NNE-------EKVEFWTD 194 (474) T ss_pred HHHhhcCcEEEEEEecC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEE-cCe-------eEEEEEeC Confidence 99999999999997653 235778888888876432111111 11111 111110 000 00111111 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc----ccccceee Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG----QARDVYTP 307 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~----~~~~~~~p 307 (661) . . ++.+.... +... .......+ T Consensus 195 ~--------------------------------------~---------------~~~~~~~~-~~~~~~~~~~~~~~~~ 220 (474) T protein:vir:95 195 T--------------------------------------T---------------VTYYVLEN-GGLIPDYYYGANHIQS 220 (474) T ss_pred C--------------------------------------e---------------EEEEEEcC-CccccccccCcccccc Confidence 0 1 11111111 1000 00011111 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~ 385 (661) ...-++++.||||.+.....+ . +=|.++-.|-=+.=...|++.+.+.+.++|+++++|.+.++... -.+....+ T Consensus 221 ~~~~~~~g~iPvv~~~nn~~g--~--sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~ 296 (474) T protein:vir:95 221 HFSNGNWGRVPFIAFKNNPEE--V--SDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKA 296 (474) T ss_pred cccccCCCccceEeecCCCCC--C--CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccce Confidence 122357899999987543222 1 21222222221222356777778888999999999986554222 22344456 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) +.+++ +++++|+..+ .+....+..|+.++++|...+.-. +...+.+++-||++.+................+..++. T Consensus 297 i~~~~-~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:95 297 INVDG-DGGVETIQVE-VPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred eeccC-CCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66664 6789999876 467888999999999998775322 11122335679999999988888888999999999999 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +++++++.++|... +..++.|.+++ ....-..+.+++ +.++|.||++|++..| +...+++++.++|.+ T Consensus 375 ~~~~li~~~~g~~~-d~~~i~v~f~~--~~p~d~~e~a~~---~~~~g~iS~et~i~~l------~~v~d~~~E~~ri~~ 442 (474) T protein:vir:95 375 ELIGFIIDFNNLKM-DVKDIEISFNF--NRMMNDAEQSQI---IAQSQYLSRETLVKSS------PLVDDYKAELERIEQ 442 (474) T ss_pred HHHHHHHHHhCCCc-ccceeeEEecc--CCCcCHHHHHHH---HHhcCCCchHHHHHhC------CCCCCHHHHHHHHHH Confidence 99999999999754 44555565543 222112334444 4457999999997433 333446677778776 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) +...-. ........+..+..++.++. .+.+.+ T Consensus 443 E~~~~~-~~~~~~~~~~~d~~~~~~~~---~~~~~~ 474 (474) T protein:vir:95 443 EQMEYN-KQLPNLDDGGADGAQQQERS---NDKESE 474 (474) T ss_pred HHHHHH-hcccccccccCCCCcCCCCC---ccCCCC Confidence 531100 00000111111111111111 010111 No 27 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.90 E-value=1.5e-22 Score=140.26 Aligned_cols=469 Identities=12% Similarity=0.067 Sum_probs=242.1 Q ss_pred CCCC------CCccccc----cccccccccCCcccc----CH----HH-----HHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MAGL------SPNSANI----RRTKRGAQQFTHLVV----HP----EY-----EYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~~~------~~~~~~~----~~~~~~~~~~~V~~~----hP----ey-----~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |.-. +--+-|| ++.+...-.+..... .+ .+ ....++++++.+.|.|....- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~------ 74 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL------ 74 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc------ Confidence 1110 0001112 222222111110000 11 11 224567888888888864431 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) +........++... | .-.|+.+.+++.++|.+|.+||+++.-.+. ..+.+.++- T Consensus 75 --~~~~~~~~~~~~~~-k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~----------------------~~~~l~~~~ 128 (511) T protein:vir:96 75 --VELTRRKEEYMADN-R-VAHDYASYISDFINGYFLGNPIQYQDDDKD----------------------VLEAIEAFN 128 (511) T ss_pred --cccCcCcccccCcc-e-eecchHHHHHHHHHhhhccCCceeecCchH----------------------HHHHHHHHH Confidence 11111111222111 2 337999999999999999999999521111 122233332 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +-++++.+...+.+.++.+|+++++|-... ..+|-+..++|.+++-.- ++....+.+..|+. T Consensus 129 -~~n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~~p~~~~~vy-dd~~~~~~~~~vr~---------- 190 (511) T protein:vir:96 129 -DLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIY-DNTIERNSIAGVRY---------- 190 (511) T ss_pred -hhcCHHHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEE-cCCCCCceEEEEEE---------- Confidence 346899999999999999999999997542 235667777887754321 11111111111111 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) |. ..... .....+++++-++..+ .++++. ...+. T Consensus 191 -----------~~----------------------------~~~~d--~~~~~~~~~~~iyt~~----~i~~~~-~~~~~ 224 (511) T protein:vir:96 191 -----------LR----------------------------TKPID--KTDEDEVFTVDLFTSH----GVYRYL-TSRTN 224 (511) T ss_pred -----------EE----------------------------eeecc--ccccceEEEEEEEeCC----cEEEEE-ecCCC Confidence 00 00000 0011222333333222 122221 11111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE 377 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~ 377 (661) .... .........++++.||||.+.. +.+. .+-|.++-.|-=+.-...|++.+.++..+.|+++++|....+... T Consensus 225 ~~~~-~~~~~~~~~~~~~~vPvv~~~n--n~~g--~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (511) T protein:vir:96 225 GLKL-TPRENGFESHSFERMPITEFSN--NERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred cccc-cccccccccccCCceeeEEecC--CCCC--CCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchh Confidence 1111 1111122346799999998753 2222 233444444444555678889999999999999999954332111 Q ss_pred e-Eecccceeec------------CCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHH Q lcl|NC_019406. 378 Y-HIGPGRVWVV------------DKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSAL 443 (661) Q Consensus 378 l-~iGs~~~~~l------------p~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~ 443 (661) + ....+..+.+ ...+++++||..+. +.+..+..++.+.+.|..+..-. +....-+++.||++.+. T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:96 300 VRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcccccccceecccccccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 1 1111111111 22367899998654 45777888999999988765322 11112235779999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019406. 444 REANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDA 518 (661) Q Consensus 444 d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et 518 (661) ...............+..++.+.+++++.+++.... +-..+.|.+++.. +.. ..+.++++.++ .|.||++| T Consensus 379 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~-p~n-~~e~~~~~~kl--~G~iS~et 454 (511) T protein:vir:96 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNL-PKS-LIEELKAYIDS--GGKISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCC-CCC-HHHHHHHHHHH--hccCChHH Confidence 988888888899999999999999999998765321 1224555554322 222 24567777776 69999999 Q ss_pred HHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 519 LYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 519 ~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) ++..| +.+ .+.++|.++|+++... ..+..+.........++......+++-+-+++| T Consensus 455 ~l~~l---~~v---~D~~~E~~ri~~E~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 455 LMSLF---SFF---QDPELEVKKIEEDEKE----SIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhC---CCC---CCHHHHHHHHHHHHHH----HHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 97544 333 3467788888776321 111111111111111111111111110000111 No 28 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.90 E-value=5.1e-23 Score=142.79 Aligned_cols=439 Identities=10% Similarity=-0.019 Sum_probs=244.9 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) +.=++++.-++... +..---.+....++++.+.+-|.|...+..+. .. ...... .| . -.| T Consensus 8 ~~~~p~d~~~~~~~--------l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~-----~~-~~~~~~--~k---i-~~n 67 (453) T protein:vir:39 8 LMTFPKDEPITNEV--------VTKFMEKHRLEVARYEYLKNMYRGIMAIDAEP-----TK-DLWKPD--NR---L-TVN 67 (453) T ss_pred ceEcCCCCCCCHHH--------HHHHHHHHHHHHHHHHHHHHHhhccCchhcCC-----Cc-cccCcc--ce---e-ecc Confidence 22222222222210 11111134556678899999999976553322 11 111111 12 2 359 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+.+|+.++|++|.+||+++.-.+. ....|+.+. .-++++.....+.+.++.+|++ T Consensus 68 ~~~~ivd~~~~~l~g~~~~~~~~d~~------------------~~~~l~~i~-----~~N~~~~~~~~~~~~~~~~G~~ 124 (453) T protein:vir:39 68 FTKYIVDTFTGYFNGIPVKKSHSDKE------------------TLSKLQEFD-----NLNDMEDEESELAKMACIYGRA 124 (453) T ss_pred hHHHHHHHHhhhhcccCceeccCChH------------------HHHHHHHHH-----HhcChhHHHHHHHHHHhhcCeE Confidence 99999999999999999998521111 111222222 2479999999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++|-... ..+|-+..++|.+++-+--+.. +...+--|+.. . T Consensus 125 ~~~v~~d~------~g~~~i~~~~p~~~~~v~d~~~-~~~~~~~ir~~--~----------------------------- 166 (453) T protein:vir:39 125 FELLYQNE------ETQTNVIYNTPENMFMVYDDTI-KQEPLFAVRYG--Y----------------------------- 166 (453) T ss_pred EEEEEecC------CCceEEEEEcccceEEEecCCC-CCeEEEEEEEE--E----------------------------- Confidence 99996543 2357788888888754432111 11111101000 0 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv 320 (661) .. .....+.++..+ +++.+ ..+..+.... ....++++.|||| T Consensus 167 ---------------------~~------~~~~~~~~yt~~----~i~~~---~~~~~~~~~~----~~~~~~~g~vPvv 208 (453) T protein:vir:39 167 ---------------------DD------DYKLYGEVYTKE----TTYAL---NGTMGFYNMT----EQAPNPFDDLPVV 208 (453) T ss_pred ---------------------eC------CeEEEEEEEeCC----eEEEE---EecCCceeee----cccccCCCceeEE Confidence 00 001111122221 11111 1111111111 1123679999999 Q ss_pred EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecC-----CCCCcc Q lcl|NC_019406. 321 FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVD-----KESGIP 395 (661) Q Consensus 321 ~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp-----~~ga~~ 395 (661) .+... .+. .+=|.++-.|-=+.=+..|++.+.+.+.+.|+++++|....+.....+=.+.++.++ .+++++ T Consensus 209 ~~~n~--~~g--~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (453) T protein:vir:39 209 EFYFN--EER--MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYGESSEAKNVDV 284 (453) T ss_pred EecCC--CCC--CcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchhhhhhhhcceeeecCCCCCCCCCce Confidence 88542 222 233444444444555677889999999999999999965433211111112233222 235688 Q ss_pred eEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019406. 396 GIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRD 475 (661) Q Consensus 396 ~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G 475 (661) +|+..+. +.+..+..++.+.+.|..+..-.-....+.++.|+++.+................+..++.+++++++.+.+ T Consensus 285 ~~lt~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~ 363 (453) T protein:vir:39 285 KFLEKPD-SDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELST 363 (453) T ss_pred eEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9998654 457788888999998887653221122344577999988888888888888889999999999999999987 Q ss_pred CCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCch Q lcl|NC_019406. 476 IPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPD 553 (661) Q Consensus 476 ~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~dd 553 (661) .... +..++.|..++.. +.. ..+.++++.++ +|.||++|.+..| +.++ +.+++.++|+.+.......+ T Consensus 364 ~~~~~~~~~~i~v~f~~~~-p~~-~~~~a~~~~kl--~g~is~et~l~~l---~~v~---D~~~E~~ri~~E~~~~~~~~ 433 (453) T protein:vir:39 364 NVSNKEAWKDIEYTFTRNE-PKD-IKEQAETANIL--MGITSQETALSVI---SVIP---DVQAEMEKIKKEEASTAIFD 433 (453) T ss_pred ccCCccccccceEEeCCCC-CcC-HHHHHHHHHHH--hccCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHH Confidence 5432 2234455554322 222 24456666665 7899999997544 4433 46777888877644322111 Q ss_pred hhhhhcCCccccCCCcchhhh Q lcl|NC_019406. 554 AIAMRRGYVSRQQELDQQRAA 574 (661) Q Consensus 554 ae~~~~g~~~~~~~~~q~~~~ 574 (661) ... ..+.+...++.+++..+ T Consensus 434 ~~~-~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 434 KDK-QPSEKGTDTVVPETNEE 453 (453) T ss_pred Hhc-cCCCCCCCCCCCCcCCC Confidence 111 11222223333333211 No 29 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.90 E-value=1.5e-22 Score=140.30 Aligned_cols=463 Identities=12% Similarity=0.031 Sum_probs=246.4 Q ss_pred CCCCCCcccccccccccccc------------CCc-----cccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQ------------FTH-----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG 63 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~------------~~V-----~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~ 63 (661) ||----..-||--+-.-+.. ... ..--..+....+++.++.+.|.|...+-.+...|.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~ 80 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV 80 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Confidence 54433333343211111111 100 0001123445677888888888865443332222221111 Q ss_pred CChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCH Q lcl|NC_019406. 64 FDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSH 143 (661) Q Consensus 64 E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL 143 (661) +..+...-.-.|+.+.+|+..+|.+|.+||+++.-.+ +..+.++.+. .+++ T Consensus 81 -----~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~------------------~~~~~l~~~~------~n~~ 131 (483) T protein:vir:12 81 -----DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD------------------EVVKRIDEVL------GNRF 131 (483) T ss_pred -----cccccccccccchHHHHHHHHhhhhcccCceeccCCh------------------HHHHHHHHHH------hccH Confidence 1111111123699999999999999999999852111 1112233322 3678 Q ss_pred HHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccc Q lcl|NC_019406. 144 QGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQN 223 (661) Q Consensus 144 ~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~ 223 (661) +.....+.+.++.+|+++++|-... ..+|-+..++|.+++-.--+...+. .+-.|+. +.. ++.. T Consensus 132 ~~~~~~~~~~~~~~G~~y~~v~~d~------d~~~~i~~~~p~~~~~v~d~~~~~~-~~~~ir~--~~~-~~~~------ 195 (483) T protein:vir:12 132 DDKLHSVLTGASNKGIEWLHPYLDE------EGEFKLFRVPAEQGIPIWTDKEHEE-LEAFIRM--YKL-ENET------ 195 (483) T ss_pred HHHHHHHHHHHhhCCeEEEEEEEcC------CCceEEEEEcccceEEEEcCCCCCc-eEEEEEE--EEe-ecce------ Confidence 8889999999999999999997542 2457888889988653211111111 1111111 100 0000 Q ss_pred ceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc Q lcl|NC_019406. 224 PWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD 303 (661) Q Consensus 224 ~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~ 303 (661) ++..|.. ..+++.. .+.+ ..+..+ ..... T Consensus 196 -~~~~y~~--------------------------------------~~v~~~~-~~~~---~~~~~~--------~~~~~ 224 (483) T protein:vir:12 196 -KVEYWDK--------------------------------------VTVNYYV-YENG---SLIPDY--------SNNLE 224 (483) T ss_pred -EEEEEec--------------------------------------CeEEEEE-EeCC---eeeecc--------ccccc Confidence 0011100 0111110 0000 000000 00111 Q ss_pred ceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEec Q lcl|NC_019406. 304 VYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIG 381 (661) Q Consensus 304 ~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iG 381 (661) ...+...-++|+.||||.+... .+.. +=|.++..|-=+.=...|++.+.+.+.+.|+++++|.+..+... -.+. T Consensus 225 ~~~~~~~~~~~g~vPvv~~~nn--~~g~--sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~ 300 (483) T protein:vir:12 225 NSKTHFSTGSWGKIPFIPFKNN--DLEI--SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLR 300 (483) T ss_pred ccccccccCCCCccceEEecCC--CCCC--CchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhh Confidence 1222233467999999988542 2221 22333333322222357888889999999999999987665322 1244 Q ss_pred ccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) ..+++.++. +++++|+..+. +.+..+..++.+++.|...+.-.-.. ..-+++.||++.+.............-..+. T Consensus 301 ~~~~~~~~~-~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~ 378 (483) T protein:vir:12 301 YYGAIKVSD-NGGVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAK 378 (483) T ss_pred hccccccCC-CCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 445666664 67899998754 55788888999999888775322111 1223567999998888888888899999999 Q ss_pred HHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHH Q lcl|NC_019406. 461 DGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTI 540 (661) Q Consensus 461 ~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~ 540 (661) .++.+.+++++.++|.+. +...+.|..++ -.+.+ ..+.++++.++ +|.||++|.+..+ +.+ .+.++|.+ T Consensus 379 ~~l~~~~~li~~~~~~~~-~~~~i~v~f~~-~~p~~-~~~~a~~~~kl--~GiiS~et~~~~~---~~v---~d~~~E~~ 447 (483) T protein:vir:12 379 VAIQELLWFVFEHFDIKG-EHKDVDISFNY-NKVAN-TELQVQTAQQS--MGIVSHETVLENH---PFV---EDLQAELE 447 (483) T ss_pred HHHHHHHHHHHHHhcCCC-ccceeeEEeCC-CCCCC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHHHHHH Confidence 999999999999999765 34556666543 22222 24566777776 6999999996543 333 34567777 Q ss_pred HHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 541 KMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 541 ~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) +|+++.. +.++..++........-.++....+++ | | T Consensus 448 ri~~E~~----~~~~~~~~~~~~~~d~~~~~~~~~~~e--~---e 483 (483) T protein:vir:12 448 RIEQEQM----EYNKQLPNLDDGGADGAQQQERSNNKE--S---E 483 (483) T ss_pred HHHHHHH----HHHhhcccccccccCCcccCCCCCccc--C---C Confidence 7766521 111111111000000001111111111 1 1 No 30 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.90 E-value=1.7e-22 Score=139.90 Aligned_cols=463 Identities=11% Similarity=-0.003 Sum_probs=244.8 Q ss_pred CCCCC-------CccccccccccccccCCccc---cCHH---H-HHHHHHHHHHHHHhcch-HHHHhCCcccCCCCCCCC Q lcl|NC_019406. 1 MAGLS-------PNSANIRRTKRGAQQFTHLV---VHPE---Y-EYYRPDWAKIRDAIAGE-REIKAQGVKYLKAPKGFD 65 (661) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~V~~---~hPe---y-~a~~~~W~~irD~~~G~-~~vr~~g~~YLPk~~~E~ 65 (661) --|++ +-.+|+.-......+-..+. ..-. + ....++|++..+-|.|. +.+... ...+. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~-------~~~~~ 81 (501) T protein:vir:27 9 STGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQF-------GRRKD 81 (501) T ss_pred ccchhhhhhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc-------CccCc Confidence 12222 33444443333333222111 1111 1 23457788888888884 233211 11111 Q ss_pred hHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHH Q lcl|NC_019406. 66 DEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQG 145 (661) Q Consensus 66 ~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~ 145 (661) . ++ ...-.-.|+.+.+++.++|.+|.+||+++..+.. ..+.+.+++.++- +-++++. T Consensus 82 ~--~~--~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~------------------~~~~~~~~l~~~~-~~n~~~~ 138 (501) T protein:vir:27 82 R--EM--ADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDND------------------NNSQNDDTIKRIG-RINDIDS 138 (501) T ss_pred c--cc--ccceeccchHHHHHHHHhhhhcccCeeEecCCcc------------------chHHHHHHHHHHH-HhcChhH Confidence 1 10 1111347999999999999999999998522110 1122333444443 3469999 Q ss_pred HHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccce Q lcl|NC_019406. 146 FAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPW 225 (661) Q Consensus 146 fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~ 225 (661) +...+.+.++.+|+++++|-... + .+|-+..++|.+++-.- ++....+.+-.| |.+.. T Consensus 139 ~~~~~~~~~~~~G~a~~~vy~de-d-----~~~~i~~~~p~~~~~v~-d~~~~~~~~~~i--r~~~~------------- 196 (501) T protein:vir:27 139 HNRTLIRDLSQTGRAYEVIYRNE-Y-----DETRIKRLNPLETFVIY-DNSLEDNSIAAV--RYYNR------------- 196 (501) T ss_pred HHHHHHHHHhhCCeEEEEEEeCC-C-----CceEEEEEccceeEEEe-cCCCCCceEEEE--EEEEe------------- Confidence 99999999999999999995432 1 35777888887764221 111111111111 00000 Q ss_pred eeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccce Q lcl|NC_019406. 226 IGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVY 305 (661) Q Consensus 226 i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~ 305 (661) ... .. .++.+.++..+ .++++ ..++.. . T Consensus 197 ----------------------------------~~~-~~-----~~~~~~vyt~~----~v~~~---~~~~~~-----~ 224 (501) T protein:vir:27 197 ----------------------------------GTL-QN-----AKDVVEIYTNE----HIYTL---DASDDF-----N 224 (501) T ss_pred ----------------------------------eec-CC-----cEEEEEEEeCC----eEEEE---EeCCce-----e Confidence 000 00 11112222221 12222 111111 1 Q ss_pred eeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc--------- Q lcl|NC_019406. 306 TPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS--------- 376 (661) Q Consensus 306 ~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~--------- 376 (661) .....-++++.||||.+.. +.+ +.+-|.++..|-=+.-...|++.+.+.+.+.|+++++|....... T Consensus 225 ~~~~~~~~~g~vPvv~~~n--n~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~ 300 (501) T protein:vir:27 225 EISVTTHAFGTVPITEFLN--NVD--GIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRT 300 (501) T ss_pred eccccccCCCcccEEEecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhc Confidence 1112235799999998753 222 233355555555555567788999999999999999996433211 Q ss_pred -eeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHH Q lcl|NC_019406. 377 -EYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLN 454 (661) Q Consensus 377 -~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~ 454 (661) .+.+...........+++++|+..+- +.+..+..++.+++.|..++.-. +....-+++.||++.+............ T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~ 379 (501) T protein:vir:27 301 RLMQLKPPKSADGKEGTVKAEYLTKSY-DVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVD 379 (501) T ss_pred CceeecccccccCCCCCcceeeeeccC-CHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHH Confidence 12222222222233456888987664 44667777888888888765422 1111223567999999998888888889 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCCC----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCC Q lcl|NC_019406. 455 VIMALEDGMTSVVRYWLMFRDIPLT----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIP 530 (661) Q Consensus 455 ~A~~le~Al~~aL~~~A~w~G~~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~ 530 (661) ....+..++.+.+++++.+++.... +...+.|..++.. +.. .++.++++.++ +|.||++|++..| | T Consensus 380 ~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~-p~n-~~e~ad~~~kl--~g~iS~et~l~~l------~ 449 (501) T protein:vir:27 380 TQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNL-PKS-LNEQVSILTGL--GGQVSQETALSLS------G 449 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCC-CcC-HHHHHHHHHHH--hccCcHHHHHHhC------C Confidence 9999999999999999999875432 1223555554322 222 24567777776 6899999996533 3 Q ss_pred ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 531 STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 531 ~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) -..++++|.++|+++.... +...+++++.+..............|-.++..| T Consensus 450 ~v~D~~~E~eri~~E~~e~---~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 450 LVESPNEELDKINKEVSEI---DFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred CCCCHHHHHHHHHHHHHhh---hHhhhcCccccccccccCCCCCCccccccccCC Confidence 3334677788886652211 111122221111111111111111111111111 No 31 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.90 E-value=1.8e-22 Score=139.86 Aligned_cols=453 Identities=10% Similarity=0.032 Sum_probs=241.7 Q ss_pred CCccccCH-------HHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhh---hcccchHHHHHHHHh Q lcl|NC_019406. 21 FTHLVVHP-------EYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDR---AAFYNMTSQTQAGMV 90 (661) Q Consensus 21 ~~V~~~hP-------ey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~r---A~~~n~~~~tv~~l~ 90 (661) |++...-- .+....+++....+.|.|.+.+..+...+.++............+.+ =.-.|+.+.+++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 66544322 22344577888888898876665443222222211111111111111 145799999999999 Q ss_pred chhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCC Q lcl|NC_019406. 91 GQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSS 170 (661) Q Consensus 91 G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~ 170 (661) |.+|.+||++.. ++ .+..+.++.+. .++++...+.+.+.++.+|+++++|=+... T Consensus 81 ~yl~G~p~~~~~-~~---------------------~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~- 135 (471) T protein:vir:10 81 AYALTYPPTFDV-DD---------------------KKVNDMIVDVL--GDDYERISKQLCVNAGNAGIAWLHVWKDAS- 135 (471) T ss_pred hhhcccCceecc-CC---------------------hHHHHHHHHHH--hcCHHHHHHHHHHHHhhCCeEEEEEEeeCC- Confidence 999999999852 11 11122233331 468899999999999999999998844321 Q ss_pred chhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhh Q lcl|NC_019406. 171 DPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQG 250 (661) Q Consensus 171 ~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~ 250 (661) ..+|-+..++|.+++----+. ....+..+ ||.+...+... .....++..|..+.+ T Consensus 136 ----~g~~~~~~~~p~~~~~i~d~~--~~~~~~~~-ir~~~~~~~~~--~~~~~~~~vy~~~~~---------------- 190 (471) T protein:vir:10 136 ----DNSFRYACVDSKEVIPIYSKS--LDKKSIGV-LRVYSSIDETD--GKNYTVYEYWNDKEC---------------- 190 (471) T ss_pred ----CCeeEEEEEcccceEEEEcCC--CCCceEEE-EEEEEeeccCC--CceeEEEEEEeCCcE---------------- Confidence 235778888888764221111 11111111 11111111100 011111211211111 Q ss_pred hhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc--ccccccceeeccCCcccceeeEEEEecCCCC Q lcl|NC_019406. 251 SARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP--LGQARDVYTPMVRGRTLPFIPFVFFGSMSNA 328 (661) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~--~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~ 328 (661) +++. ... +........+.... .+..+.......-.+.++.||||.+.... T Consensus 191 ----------------------~~y~-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-- 242 (471) T protein:vir:10 191 ----------------------SFYR-HEK---EKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKNNE-- 242 (471) T ss_pred ----------------------EEEE-ecC---CcccccccccccccccccccccccccccccCCCCceeEEEeccCC-- Confidence 1100 000 00000000000000 00011111111223579999999884322 Q ss_pred CCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCC----CCCcceEeecCc Q lcl|NC_019406. 329 ADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDK----ESGIPGIIEFKG 402 (661) Q Consensus 329 ~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~----~ga~~~ylE~~g 402 (661) .. .+-|.++-.|-=+.=...|++.+.+.+.+.|+++++|.+.+.... -.+-...++.++. .+++++|+..+. T Consensus 243 ~~--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~ 320 (471) T protein:vir:10 243 IE--TNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDI 320 (471) T ss_pred CC--CCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecC Confidence 21 122333333222222356778888899999999999975433211 1122233444432 346899999765 Q ss_pred hhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcc Q lcl|NC_019406. 403 EGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTA 482 (661) Q Consensus 403 ~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~ 482 (661) +.+..+..++.+++.|...+.-.-....+.++-|+++.+........-....-..+..++.+.+++++.++|..+ .. T Consensus 321 -~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d--~~ 397 (471) T protein:vir:10 321 -PTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSD--KL 397 (471) T ss_pred -ChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--Cc Confidence 468889999999999988753221122345678999998888888888888888999999999999999999764 34 Q ss_pred eEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCc Q lcl|NC_019406. 483 TLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYV 562 (661) Q Consensus 483 ~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~ 562 (661) ++.|.+++ ..+.. ..+.++.+.++ +|.||.+|.+..+ |...+.++|.++|+++... .++ ... T Consensus 398 ~i~i~f~~-~~p~n-~~e~~~~~~kl--~g~iS~et~~~~~------p~v~D~~~E~eri~~E~~~----~~~----~~~ 459 (471) T protein:vir:10 398 KIKQTWTR-NSINN-DTEMAQVVSTL--ATITSRENVAKSN------PIVEDWQDELRLQKAEQEG----RSE----KLY 459 (471) T ss_pred eeEEEeCC-CCCCC-HHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHH----HHh----ccc Confidence 55565543 23333 24566666665 6899999996433 3334567777888764110 010 000 Q ss_pred cccCCCcchhhhhcCChh Q lcl|NC_019406. 563 SRQQELDQQRAARDADFQ 580 (661) Q Consensus 563 ~~~~~~~q~~~~~e~d~~ 580 (661) ++.....+.+.+ T Consensus 460 ------~~~~~~~~~e~~ 471 (471) T protein:vir:10 460 ------DMEEVEHESEVE 471 (471) T ss_pred ------ccCCCCCccccC Confidence 111001110111 No 32 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.89 E-value=1.8e-22 Score=139.86 Aligned_cols=456 Identities=10% Similarity=0.044 Sum_probs=245.9 Q ss_pred CCCC-CCcc-cccc-ccccccccCCcc-----ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGL-SPNS-ANIR-RTKRGAQQFTHL-----VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~-~~~~-~~~~-~~~~~~~~~~V~-----~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |+-. .||- .=+. .+.+..-...++ .-.-.+....++..+..+.|.|...+-.....+..+ .....++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~---~~~~~~~~~ 77 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVK---GEIDPFKPD 77 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccc---ccccccccc Confidence 5544 2211 0011 111111111111 111223445566777778888875443222111111 111122211 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) . | .-.|+.+.+++..+|.+|.+||+++. ++ .+..+.|+.++ +++++.....+.+ T Consensus 78 ~-k-i~~n~~~~Iv~~~~~~l~g~p~~~~~-~d-----------------~~~~~~l~~~~------~n~~~~~~~~~~~ 131 (468) T protein:vir:96 78 W-R-MYTNYHQNLVDQKVAYAVANPVTYGT-ED-----------------EKSLKTIQEVL------NHKWDDKLVDILT 131 (468) T ss_pred c-c-cccchHHHHHHHHHhhhccCCceecc-CC-----------------hHHHHHHHHHH------hcCHHHHHHHHHH Confidence 1 1 23799999999999999999999852 11 11223334332 3578888899999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++.+|+++++|.... ..+|.+..++|.+++-.--+...+ .+..+ ++.+... +. .++..|... T Consensus 132 ~~~~~G~~~~~v~~d~------~~~~~i~~~~p~~~~~v~~~~~~~--~~~~~-ir~~~~~-~~-------~~~~~~~~~ 194 (468) T protein:vir:96 132 AASNKGVEWIQPYVDE------QGEFKTFRVPAEQAIPIWTNKERD--ELKAF-IRLYELD-GG-------ERVEYWTAN 194 (468) T ss_pred HHhhcCeEEEEEEEcC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEec-Cc-------eEEEEEeCC Confidence 9999999999997653 235788889998865321111111 11111 1111110 00 001111111 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) .++++. .. .+..+..+. .. ..............+ T Consensus 195 --------------------------------------~~~~~~-~~---~~~~~~~~~---~~-~~~~~~~~~~~~~~~ 228 (468) T protein:vir:96 195 --------------------------------------DVTFYE-LK---DGQLIPDYY---QG-EEHVQAHYYVGNKSM 228 (468) T ss_pred --------------------------------------eEEEEE-Ec---CCceeeccc---cc-ccccccceeeccccc Confidence 111111 00 011111110 00 001111122222346 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee--EecccceeecC- Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY--HIGPGRVWVVD- 389 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l--~iGs~~~~~lp- 389 (661) +++.||||.+.... . +.+=|.++..|-=+.-...|++.+.+.+.++|+++++|...++.... .+....++.++ T Consensus 229 ~~~~iPvv~~~n~~--~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~ 304 (468) T protein:vir:96 229 SWNRVPFIPFKNNP--Q--EVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDG 304 (468) T ss_pred cCCcccEEEecCCC--C--CCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecC Confidence 79999999885432 2 22335555555445556778888889999999999999865442221 22223344444 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVR 468 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~ 468 (661) .++++++|+..+. +.+..+..++.+++++..++.-. +.....+++.||++.+................+..++.++++ T Consensus 305 d~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~ 383 (468) T protein:vir:96 305 DGSGGVDTIQIDV-PVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQ 383 (468) T ss_pred CCCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3457899999776 45888888999999998875321 111122357899999999999999999999999999999999 Q ss_pred HHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCC Q lcl|NC_019406. 469 YWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSF 548 (661) Q Consensus 469 ~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~ 548 (661) +++.+.|... +..++.|.+++ -.+.. ..+.++ .+.++|.||++|.+..| |...+.++|.++|+++.. T Consensus 384 li~~~~g~~~-d~~~i~i~f~~-~~p~d-~~e~a~---~~~~~g~iS~et~i~~l------~~v~D~~~E~~ri~~E~~- 450 (468) T protein:vir:96 384 YIIDFYKLSI-KVQDVEITFNF-NVMVN-ELEQSQ---IGVNSQYLSKETVVTNH------PWVDDPVAEMERIDQEEL- 450 (468) T ss_pred HHHHHhCCCc-ccceeeEEecC-CCCcC-HHHHHH---HHHhcCCCchHHHHHhC------CCCCCHHHHHHHHHHHHH- Confidence 9999999764 34455555542 12222 223333 34567999999996433 333346778888876522 Q ss_pred CCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 549 IGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 549 l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) .....++++.. .+.+. ++ T Consensus 451 ----~~~~~~~~~~~----~~~~~------~~ 468 (468) T protein:vir:96 451 ----ALPSIEEGLNG----KENNE------PT 468 (468) T ss_pred ----HHHHHhhccCC----CCCCC------CC Confidence 11111111111 11111 11 No 33 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.89 E-value=2.9e-22 Score=138.66 Aligned_cols=464 Identities=11% Similarity=0.015 Sum_probs=242.2 Q ss_pred CC---------CCC-------CccccccccccccccCCccc------cCHHH-HHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MA---------GLS-------PNSANIRRTKRGAQQFTHLV------VHPEY-EYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~---------~~~-------~~~~~~~~~~~~~~~~~V~~------~hPey-~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |+ |+. +--+|+.-......+...+. -.-.+ ....++++.+.+.|.|.. ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~------~~i 74 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGEN------HDV 74 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC------ccc Confidence 21 111 22233322222222221111 00111 233567888888888841 111 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) +.+...... . +..+-.-.|+.+.+++.++|.+|.+||+++...+ +....+.++++++- T Consensus 75 ~~~~~~~~~--~--~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~------------------~~~~~~~~~l~~~~ 132 (502) T protein:vir:48 75 LKSGRRKDN--E--MADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN------------------EDNSQNDDAIKRIG 132 (502) T ss_pred ccccccccc--c--cccceeecchHHHHHHHHhhhhcccCeeEecCCc------------------cchhHHHHHHHHHH Confidence 222211111 1 1111244799999999999999999999852111 11233444455554 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) . -++++.+...+.+.++.+|+++++|-... ..+|-+..++|.+++-- +++....+.+- T Consensus 133 ~-~N~~~~~~~~~~~~~~~~G~a~~~v~~de------dg~~~i~~~~p~~~~~v-ydd~~~~~~~~-------------- 190 (502) T protein:vir:48 133 R-INDIDTHNRNLIRDLSQTGRAYEVIYRSE------YDETRIKRLSPLETFVI-YDNSLEDNSIA-------------- 190 (502) T ss_pred h-hcCHhHHHHHHHHHHhhcCeEEEEEEeCC------CCceEEEEEcccceEEE-EcCCCCCceEE-------------- Confidence 3 36999999999999999999999996532 12466777777764311 01000000000 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) .|+.|. . ...... ++.+.++..+ .+|++. ..+. T Consensus 191 -------~ir~~~----------------------------~-~~~~~~-----~~~~~iyt~~----~i~~~~--~~~~ 223 (502) T protein:vir:48 191 -------AVRYYN----------------------------R-GTLQNA-----KDVVEIYTNQ----HIYTLD--ASDS 223 (502) T ss_pred -------EEEEEE----------------------------E-eecCCc-----EEEEEEEeCC----eEEEEE--eCCc Confidence 000000 0 000000 1112222221 122221 1111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-- Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-- 375 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-- 375 (661) .......-+.++.||||.+.. +.+.. +-|.++..|-=+.-+..|++.+.+.+.+.|+++++|...... T Consensus 224 ------~~~~~~~~~~~g~vPvv~~~n--n~~g~--sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 293 (502) T protein:vir:48 224 ------FNEISVTPHAFGTVPITEFLN--NADGI--GDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGM 293 (502) T ss_pred ------eeeccceecCCCccceEEecC--CCCCC--CchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccc Confidence 111112236789999998753 33322 334444444445556778889999999999999999643221 Q ss_pred ceeEecccceeec--------CCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHH Q lcl|NC_019406. 376 SEYHIGPGRVWVV--------DKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREA 446 (661) Q Consensus 376 ~~l~iGs~~~~~l--------p~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~ 446 (661) ....+.....+.+ ..++++++|+..+. +.+..+..++.+.++|..++.-.- ....-+++.||++.+.... T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 372 (502) T protein:vir:48 294 QASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSY-DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLF 372 (502) T ss_pred chhhhhhcceeeccccccccccccCcceeEeeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHH Confidence 1111222222322 22456899998764 457788889999999987753221 1111235779999999988 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHH Q lcl|NC_019406. 447 NEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYEN 522 (661) Q Consensus 447 ~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~e 522 (661) ............+..++.+++++++.+++.... +...+.+.+++ ..+.. ..+.++++.++ +|.||++|++.. T Consensus 373 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~-~~p~d-~~e~a~~~~kl--~g~iS~et~l~~ 448 (502) T protein:vir:48 373 GLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTP-NLPKS-LYEQVSILNDL--GGQVSQETALSL 448 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCcHHHHHHh Confidence 888888889999999999999999999875422 22335555533 22332 24567777776 589999999765 Q ss_pred HHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 523 FVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 523 L~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) | +.+. +.++|.+||+++..... ....+.+..+....-...+.+.+.|=.++..| T Consensus 449 l---~~v~---D~~~E~~ri~~E~~~~~---~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 449 S---GLVE---NPTEELDKINEESSKID---FKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred C---CCCC---CHHHHHHHHHHHHHhhh---hhcccccccccccccCCCccCCCCcCcCCCCC Confidence 4 4433 45677888875532111 11111111100000000011111111111111 No 34 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.89 E-value=5.7e-22 Score=137.05 Aligned_cols=450 Identities=11% Similarity=0.014 Sum_probs=233.5 Q ss_pred cccccc---ccccc----ccCC-ccccCHH--------HHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 8 SANIRR---TKRGA----QQFT-HLVVHPE--------YEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 8 ~~~~~~---~~~~~----~~~~-V~~~hPe--------y~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) -.|+=| ++.-+ .+.+ =....++ +....++..++.+.|.|...+..+-..+.+....+ .. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~~--~ 76 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--YD--K 76 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccc--cc--c Confidence 122221 11110 0000 0000111 22344566677777777655533221121111111 10 0 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) ...|.+ .|+.+.+++.++|.+|.+||+++.-.+ ...++++.+ ..++++.....+. T Consensus 77 ~~~ki~-~n~~k~Ivd~~~~~l~g~p~~~~~~d~----------------------~~~~~l~~~--~~n~~~~~~~e~~ 131 (474) T protein:vir:94 77 PDWRIT-TNFHQNLVDQKVSYVASKPVTYSCEDE----------------------NVLKVIHDV--LDTRWDNKLIDIL 131 (474) T ss_pred Ccceee-cchHHHHHHHHHhhhhcCCceeccCcH----------------------HHHHHHHHH--HhccHHHHHHHHH Confidence 112222 699999999999999999999852111 112222222 1367889999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|..+. ..+|.+..++|.+++-.--+...+ .+..+ +|.+... + ..++..|.. T Consensus 132 ~~~~~~G~~~~~~~~d~------~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~-ir~~~~~-~-------~~~~~~yt~ 194 (474) T protein:vir:94 132 TATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPIWVDKERE--ELKSF-IRYYKFN-N-------EEKVEFWTD 194 (474) T ss_pred HHHhhcCceEEEEEecC------CCeeEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEec-C-------eEEEEEEeC Confidence 99999999999997543 235888889998876442111111 11111 1111100 0 001111111 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc----ccceee Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA----RDVYTP 307 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~----~~~~~p 307 (661) . . ++.+.... +..... ...... T Consensus 195 ~--------------------------------------~---------------~~~y~~~~-~~~~~~~~~~~~~~~~ 220 (474) T protein:vir:94 195 T--------------------------------------T---------------VTYYVLEN-GGLIPDYYYGANHVQS 220 (474) T ss_pred C--------------------------------------e---------------EEEEEEcC-CccccccccCcCcccc Confidence 1 1 11111111 000000 001111 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~ 385 (661) ...-++++.||||.+.....+- +=|.++-.|-=+.=...|++.+.+-+.+.|+++++|.+.++... -.+....+ T Consensus 221 ~~~~~~~g~vPvv~~~nn~~g~----sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~ 296 (474) T protein:vir:94 221 HFSNGNWGRVPFIAFKNNPEEV----SDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKA 296 (474) T ss_pred cccccCCCccceEEecCCcCCC----CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccce Confidence 1223569999999885432221 11222222221222245666777778899999999986543222 12334455 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) +.++ ++++++|+..+. +.+..+..++.+.+.|...+.-. +....-+++.||++...............-..+..++. T Consensus 297 i~~~-~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:94 297 INVD-GDGGVETIQVEV-PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred eecc-CCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666 467899998764 56888888999999888775322 11122235679999998888888888999999999999 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +.+++++.++|... +...+.|.+++.- +.. ..+.+ ..+.++|.||++|++..| +.+ .++++|.++|++ T Consensus 375 ~~~~li~~~~~~~~-d~~~i~v~f~~~~-p~~-~~e~a---~~~~~~g~iS~et~l~~l---~~v---~D~~~E~eri~~ 442 (474) T protein:vir:94 375 ELISFIIDFNNLKT-DVKDIEISFNFNR-MMN-DAEQS---QIIAQSQYLSRETLVKSS---PLV---DDYKAELERIEQ 442 (474) T ss_pred HHHHHHHHHhCCCc-ccceeeEEeccCc-ccC-HHHHH---HHHHHcCCCCHHHHHHhC---CCC---CCHHHHHHHHHH Confidence 99999999999765 3345555554321 111 22333 344567999999997544 333 346677777776 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcch-hhhhc Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQ-RAARD 576 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~-~~~~e 576 (661) +...-. .......++..+..++.+++ ..+.| T Consensus 443 E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 443 EQMEYN-KQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHH-hhccccCCCCCCCcccCCCCcccccC Confidence 532100 00000111111111111111 11111 No 35 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.89 E-value=5.7e-22 Score=137.05 Aligned_cols=450 Identities=11% Similarity=0.014 Sum_probs=233.5 Q ss_pred cccccc---ccccc----ccCC-ccccCHH--------HHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 8 SANIRR---TKRGA----QQFT-HLVVHPE--------YEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 8 ~~~~~~---~~~~~----~~~~-V~~~hPe--------y~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) -.|+=| ++.-+ .+.+ =....++ +....++..++.+.|.|...+..+-..+.+....+ .. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~~--~ 76 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--YD--K 76 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccc--cc--c Confidence 122221 11110 0000 0000111 22344566677777777655533221121111111 10 0 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) ...|.+ .|+.+.+++.++|.+|.+||+++.-.+ ...++++.+ ..++++.....+. T Consensus 77 ~~~ki~-~n~~k~Ivd~~~~~l~g~p~~~~~~d~----------------------~~~~~l~~~--~~n~~~~~~~e~~ 131 (474) T protein:vir:97 77 PDWRIT-TNFHQNLVDQKVSYVASKPVTYSCEDE----------------------NVLKVIHDV--LDTRWDNKLIDIL 131 (474) T ss_pred Ccceee-cchHHHHHHHHHhhhhcCCceeccCcH----------------------HHHHHHHHH--HhccHHHHHHHHH Confidence 112222 699999999999999999999852111 112222222 1367889999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|..+. ..+|.+..++|.+++-.--+...+ .+..+ +|.+... + ..++..|.. T Consensus 132 ~~~~~~G~~~~~~~~d~------~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~-ir~~~~~-~-------~~~~~~yt~ 194 (474) T protein:vir:97 132 TATSNKGIDWLQVYINE------NGEMKLFRVPAEQAIPIWVDKERE--ELKSF-IRYYKFN-N-------EEKVEFWTD 194 (474) T ss_pred HHHhhcCceEEEEEecC------CCeeEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEec-C-------eEEEEEEeC Confidence 99999999999997543 235888889998876442111111 11111 1111100 0 001111111 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc----ccceee Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA----RDVYTP 307 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~----~~~~~p 307 (661) . . ++.+.... +..... ...... T Consensus 195 ~--------------------------------------~---------------~~~y~~~~-~~~~~~~~~~~~~~~~ 220 (474) T protein:vir:97 195 T--------------------------------------T---------------VTYYVLEN-GGLIPDYYYGANHVQS 220 (474) T ss_pred C--------------------------------------e---------------EEEEEEcC-CccccccccCcCcccc Confidence 1 1 11111111 000000 001111 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~ 385 (661) ...-++++.||||.+.....+- +=|.++-.|-=+.=...|++.+.+-+.+.|+++++|.+.++... -.+....+ T Consensus 221 ~~~~~~~g~vPvv~~~nn~~g~----sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~ 296 (474) T protein:vir:97 221 HFSNGNWGRVPFIAFKNNPEEV----SDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKA 296 (474) T ss_pred cccccCCCccceEEecCCcCCC----CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccce Confidence 1223569999999885432221 11222222221222245666777778899999999986543222 12334455 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) +.++ ++++++|+..+. +.+..+..++.+.+.|...+.-. +....-+++.||++...............-..+..++. T Consensus 297 i~~~-~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 374 (474) T protein:vir:97 297 INVD-GDGGVETIQVEV-PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQ 374 (474) T ss_pred eecc-CCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666 467899998764 56888888999999888775322 11122235679999998888888888999999999999 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +.+++++.++|... +...+.|.+++.- +.. ..+.+ ..+.++|.||++|++..| +.+ .++++|.++|++ T Consensus 375 ~~~~li~~~~~~~~-d~~~i~v~f~~~~-p~~-~~e~a---~~~~~~g~iS~et~l~~l---~~v---~D~~~E~eri~~ 442 (474) T protein:vir:97 375 ELISFIIDFNNLKT-DVKDIEISFNFNR-MMN-DAEQS---QIIAQSQYLSRETLVKSS---PLV---DDYKAELERIEQ 442 (474) T ss_pred HHHHHHHHHhCCCc-ccceeeEEeccCc-ccC-HHHHH---HHHHHcCCCCHHHHHHhC---CCC---CCHHHHHHHHHH Confidence 99999999999765 3345555554321 111 22333 344567999999997544 333 346677777776 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcch-hhhhc Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQ-RAARD 576 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~-~~~~e 576 (661) +...-. .......++..+..++.+++ ..+.| T Consensus 443 E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 443 EQMEYN-KQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHH-hhccccCCCCCCCcccCCCCcccccC Confidence 532100 00000111111111111111 11111 No 36 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.88 E-value=1.1e-21 Score=135.45 Aligned_cols=440 Identities=13% Similarity=0.022 Sum_probs=245.0 Q ss_pred CCCCCCccccccccccccccCCcccc---CHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVV---HPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~---hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~ 77 (661) |---.|.-.+.. ....++.+.- --.+....+++..+..-|.|...+..+.. ...... .. | . T Consensus 1 ~~~~~~~~~~~~----~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~------~~~~~~--~~---k-i 64 (452) T protein:vir:36 1 MKYKPPKLMTFS----KDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPA------KDSWKP--DN---R-L 64 (452) T ss_pred CcccCceeEEcC----CccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcc------ccccCc--cc---e-e Confidence 444344333333 2222221111 11334556778888888888765533221 111111 11 2 2 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhh Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAM 157 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~ 157 (661) -.|+.+.+|+..+|.+|.+||++..-.+. ..+.++++ +.-++++.....+.+.++.+ T Consensus 65 ~~n~~~~ivd~~~~~l~g~~~~~~~~d~~----------------------~~~~l~~~-~~~n~~~~~~~~~~~~~~~~ 121 (452) T protein:vir:36 65 AVNFTKYIVDTFTGYFNGIPVKKSHSDKE----------------------ILTKLQEF-DNLNDMEDEESELAKMACIY 121 (452) T ss_pred ecchHHHHHHHHhhhhcccCceeecCChh----------------------HHHHHHHH-HhhcChhHHHHHHHHHHHhc Confidence 36999999999999999999998521111 11122222 13478999999999999999 Q ss_pred CCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) |+++++|- +.. ..+|-+..++|.+++-.- ++......+-. ++.+.. T Consensus 122 G~~~~~v~-~d~-----~g~~~i~~~~p~~~~~v~-d~~~~~~~~~~--i~~~~~------------------------- 167 (452) T protein:vir:36 122 GRAFEFLY-QDE-----DTQTNVVYNSPENMFMVY-DDTVKQEPLFA--VRYGVD------------------------- 167 (452) T ss_pred CeEEEEEE-ecC-----CCeeEEEEEcccceEEEE-cCCCCCceEEE--EEEEEe------------------------- Confidence 99999884 322 236778888888765332 11111111100 010000 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccccee Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFI 317 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~I 317 (661) .. ..+.+.++..+ .+|++. ....+. ......-++++.| T Consensus 168 ------------------------~~-------~~~~~~vyt~~----~i~~~~---~~~~~~----~~~~~~~~~~g~i 205 (452) T protein:vir:36 168 ------------------------ED-------KKLQGEVYTLL----ETIKIS---GENDEI----SFGEGTYNPYPDL 205 (452) T ss_pred ------------------------cC-------ceEEEEEEecC----eEEEEE---EcCCce----EEecceeccCCcc Confidence 00 00111122111 122221 111111 1111122568999 Q ss_pred eEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCC----CC Q lcl|NC_019406. 318 PFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKE----SG 393 (661) Q Consensus 318 Pfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~----ga 393 (661) |||.+.... . +.+-|.++..|-=+.=...|++.+.+.+.+.|+++++|........-.+-.+.++.++.. ++ T Consensus 206 Pvv~~~n~~--~--g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (452) T protein:vir:36 206 PVVEFYFNE--E--RMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKNIRSNRVINYYADGEGKNV 281 (452) T ss_pred cEEEecCCC--C--CCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchhhhhhhhcceEEecCCCCccCC Confidence 999875422 2 223355555554455557788899999999999999997654432222333445655543 24 Q ss_pred cceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 394 IPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMF 473 (661) Q Consensus 394 ~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w 473 (661) +++|+..+. ..+..+..++.+.+.|...+.-.-....+.++-||++...............-..+..++.+++++++.+ T Consensus 282 ~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 360 (452) T protein:vir:36 282 DVKFLEKPD-SDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCEL 360 (452) T ss_pred cceeEeecC-CHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 689998765 4677888899999998877532211223446779999888877777788888888999999999999998 Q ss_pred cCCCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC Q lcl|NC_019406. 474 RDIPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ 551 (661) Q Consensus 474 ~G~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ 551 (661) ++.... +..++.|.+++. .+.. ..+.++++.++ +|.||.+|.+..+ +.+ .+.+++.++|+++.. T Consensus 361 ~~~~~~~~~~~~i~i~f~~~-~p~d-~~~~a~~~~k~--~g~iS~et~~~~~---~~~---~d~~~E~~ri~~E~~---- 426 (452) T protein:vir:36 361 STNVSNKDSWKDIEYTFTRN-EPKD-IKEQAETANIL--MGITSQETALSVI---SVI---PDVQAEMEKIKKEEA---- 426 (452) T ss_pred HhccCCccccccceEEeCCC-CCcC-HHHHHHHHHHH--hccCChHHHHHhC---CCC---CCHHHHHHHHHHHHH---- Confidence 875422 223445555432 2222 23456666665 6899999997544 333 346777888876532 Q ss_pred chhhhhhcCC---ccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 552 PDAIAMRRGY---VSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 552 ddae~~~~g~---~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) ..++..+++. .....+..++ ..| T Consensus 427 ~~~~~~~~~~~~~~~~~~~~~~~-----------~~e 452 (452) T protein:vir:36 427 STAIFDKDKQPSEKGTDTVVSET-----------NEE 452 (452) T ss_pred HHHHHHhhccCCCCcccccCccc-----------cCC Confidence 1122222221 1111111111 111 No 37 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.88 E-value=6.1e-22 Score=136.90 Aligned_cols=468 Identities=12% Similarity=0.042 Sum_probs=243.5 Q ss_pred CCC------CCCccccccccccccccCCcccc--------CH----HH-----HHHHHHHHHHHHHhcchHHHHhCCccc Q lcl|NC_019406. 1 MAG------LSPNSANIRRTKRGAQQFTHLVV--------HP----EY-----EYYRPDWAKIRDAIAGEREIKAQGVKY 57 (661) Q Consensus 1 ~~~------~~~~~~~~~~~~~~~~~~~V~~~--------hP----ey-----~a~~~~W~~irD~~~G~~~vr~~g~~Y 57 (661) |.- .+--+-||+.-=.-.+|...... .+ .+ ...+++++++.+.|.|...+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~----- 75 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV----- 75 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccc----- Confidence 110 11111222211111222221111 11 11 1235678888888888644311 Q ss_pred CCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhcc Q lcl|NC_019406. 58 LKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA 137 (661) Q Consensus 58 LPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d 137 (661) +. ......++.. .| .-.|+.+.+++.++|.+|.+||+++.-.+. ..+.++++- T Consensus 76 --~~-~~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~----------------------~~~~l~~~~ 128 (511) T protein:vir:99 76 --EL-TRRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDDDKD----------------------VLEAIEAFN 128 (511) T ss_pred --cc-CcccccccCc-ce-eecchHHHHHHHHHhhhcccCceeecCchH----------------------HHHHHHHHH Confidence 11 1111111111 12 447999999999999999999999522211 122333332 Q ss_pred CCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 138 KDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 138 l~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +-++++.+...+.+.++.+|+++++|-... ..+|.+..++|.+++-.--+.+.+ +.+..|+. +... T Consensus 129 -~~n~~~~~~~~~~~~~~i~G~a~~~vy~de------d~~~~i~~~~p~~~~~vyd~~~~~-~~~~~vr~--~~~~---- 194 (511) T protein:vir:99 129 -DLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFVIYDNTIER-NSIAGVRY--LRTK---- 194 (511) T ss_pred -hhcCHhHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEccceeEEEEcCCCCC-ceEEEEEE--EEee---- Confidence 235899999999999999999999996532 235778888888765321111111 11111111 0000 Q ss_pred ccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 218 TPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 218 ~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) .. .......++++.++..+ .++++.. ..+. T Consensus 195 -------------------------------------------~~--~~~~~~~~~~~~vyt~~----~i~~~~~-~~~~ 224 (511) T protein:vir:99 195 -------------------------------------------PI--DKTDEDEVFTVDLFTSH----GVYRYLT-SRTN 224 (511) T ss_pred -------------------------------------------ec--ccCccceEEEEEEEeCC----cEEEEEe-cCCc Confidence 00 00111223333333322 1222221 1111 Q ss_pred ccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce Q lcl|NC_019406. 298 LGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE 377 (661) Q Consensus 298 ~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~ 377 (661) .. ...........++++.||||.+.. +.+ +.+-|.++-.|-=+.-...|++.+.+++.+.|+++++|....+... T Consensus 225 ~~-~~~~~~~~~~~~~~g~vPvv~~~n--n~~--g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:99 225 GL-KLTPRENGFESHSFERMPITEFSN--NER--RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred cc-cccccccccccCCCCccceEEecC--CCC--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchh Confidence 11 111111223346799999998753 222 2333455555444566778889999999999999999854322111 Q ss_pred e--------Eeccccee-----ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHH Q lcl|NC_019406. 378 Y--------HIGPGRVW-----VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSAL 443 (661) Q Consensus 378 l--------~iGs~~~~-----~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~ 443 (661) + -......+ ....++++++||..+- +.+..+..++.+++.|..++.-. +....-+++.||++.+. T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~ 378 (511) T protein:vir:99 300 VRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcccccccceecccccccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 1 11111111 1123467899998754 46778888999999988765322 11112235779999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019406. 444 REANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDA 518 (661) Q Consensus 444 d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et 518 (661) ...............+..++.+.+++++.+++.... +-..+.|.+++ -.+.. .++.++++.++ .|.||++| T Consensus 379 ~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~-~~p~n-~~e~~~~~~kl--~GiiS~et 454 (511) T protein:vir:99 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNR-NLPKS-LIEELKAYIDS--GGKISQTT 454 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCCHHH Confidence 988888888888899999999999999998864321 11234444432 22333 34567777776 48999999 Q ss_pred HHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCC-cchhhhhcCChhhHHHH Q lcl|NC_019406. 519 LYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQEL-DQQRAARDADFQQQELE 585 (661) Q Consensus 519 ~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~-~q~~~~~e~d~~q~~~~ 585 (661) .+..| |...+.++|.++|+++...- .+..+....+....-..- +...+..++| ..| T Consensus 455 ~l~~l------~~v~D~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~e 511 (511) T protein:vir:99 455 LMSLF------SFFQDPELEVKKIEEDEKES-IKKAQKNMYQDPRNINDDEQDDSTKDSID----KKE 511 (511) T ss_pred HHHhC------CCCCCHHHHHHHHHHHHHHH-HHHHhhcccccCCCCCCCCCCCCCcCccc----ccC Confidence 97654 33335677888887653210 000000000000000000 0001112222 111 No 38 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.88 E-value=9.2e-22 Score=135.90 Aligned_cols=506 Identities=11% Similarity=0.002 Sum_probs=257.9 Q ss_pred CCCccccccccccccccCCccccC---------HHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHh Q lcl|NC_019406. 4 LSPNSANIRRTKRGAQQFTHLVVH---------PEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLD 74 (661) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~V~~~h---------Pey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~ 74 (661) .||--+| |.+...- -......++...+++-|.|.+.+..+...+.-.........++. . T Consensus 1 ~~~~~~~----------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~--n 68 (537) T protein:vir:78 1 MTSPLLN----------KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYAS--N 68 (537) T ss_pred CCccccc----------ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccc--c Confidence 4444333 2221111 11223456667778888888766444322211111111111210 0 Q ss_pred hhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 75 RAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 75 rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) .=.-.|+.+.+++..+|.+|.+||+++.-.. .. .++...+..+ -+++.+.....+...+ T Consensus 69 nki~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~---------------~~----~e~~~~l~~~--~~~~~~~~~~el~~~~ 127 (537) T protein:vir:78 69 VKISHGFFTELVDQLAQYLLSNGVEVKVKDE---------------DN----TQLDEILQEY--FDEDFQATIDTLVTNA 127 (537) T ss_pred cccccchHHHHHHHHhhhhcccCceeecCcc---------------hh----HHHHHHHHHH--hhccHHHHHHHHHHHH Confidence 0144799999999999999999999852111 01 1222222222 1467777788888999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|++++++-.... ..+.+..++|++++=. +++.+. ....+++.................++..|+...| T Consensus 128 s~~G~ay~~~y~de~------~~~~~~~i~p~~~~pv-~d~~~~--~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i 198 (537) T protein:vir:78 128 SKKGFEGIFARTTSE------GKLKFQTVDGLTLIPV-FDDYGV--LKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAV 198 (537) T ss_pred hhcCeeEEEeeecCC------CceEEEEEccceeEEE-EcCCCC--ceeEEEEEeeeeccccccCcceEEEEEEEcCCcE Confidence 999999999865532 2466778888875421 122221 1112222222222222222223333444444433 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) .-|+...-+... .+ .+.....+..+-.......................+++ T Consensus 199 ~~y~~~~~~~~~-------------------~~---------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (537) T protein:vir:78 199 CYYIQDDEGVST-------------------TY---------KLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSY 250 (537) T ss_pred EEEEecCCcccc-------------------cc---------cccccccccccceeeeccccccccccccccccccccCC Confidence 322211000000 00 00000000000000101111111111111122234679 Q ss_pred ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee--EecccceeecCCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY--HIGPGRVWVVDKES 392 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l--~iGs~~~~~lp~~g 392 (661) +.|||+.+... .... +-|.++-.|-=+.=...|++.+.+...+.|+++++|...++...+ .+....++.++.++ T Consensus 251 g~iPvv~f~nn--~~~~--sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~ 326 (537) T protein:vir:78 251 SKFPFQLLYNN--KDGM--SDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDN 326 (537) T ss_pred cceeEEEeccC--ccCC--CchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCC Confidence 99999988543 2222 223333333222223568888888999999999999754432211 12233455666667 Q ss_pred CcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 393 GIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLM 472 (661) Q Consensus 393 a~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~ 472 (661) +++.|+..+. +.++.+..++.+++.|...+--........++.|+++.++........-...-+-+..+|.+.+++++. T Consensus 327 ~~v~~l~~~~-~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~ 405 (537) T protein:vir:78 327 AGMEIQTVSI-PYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVS 405 (537) T ss_pred CceeEEEecC-CHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8899998765 558888899999999998863332234466788999998887777666677777788888888888988 Q ss_pred HcCCCC---CCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhcc---- Q lcl|NC_019406. 473 FRDIPL---TDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDP---- 545 (661) Q Consensus 473 w~G~~~---~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~---- 545 (661) +++... -+...+.|.+++... .. ..+.++.+.++.+.|.||++|.+..+ +.+++ .|.+ ..+.++ T Consensus 406 ~~~~~~~~~~d~~~i~i~f~~~~P-~n-~~e~a~~~~~l~~~giiS~eT~l~~~---p~vdd---~e~e-k~~~ee~~~~ 476 (537) T protein:vir:78 406 DIALRGLGEYDSNDICFEIEPHVL-AN-ELDIATTRKTEAETEALKIGNIMTVA---PRIGD---DETL-KLIAEELDLD 476 (537) T ss_pred HHhhcCCcccccceeeEEeccCCC-CC-HHHHHHHHHHHHhcCcchHHHHHHhC---CCCCC---HHHH-HHHHHHHHhh Confidence 876542 234455666654322 22 23567777888899999999997543 55543 2222 111111 Q ss_pred -----------CCCC-C-CchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhh Q lcl|NC_019406. 546 -----------KSFI-G-QPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGST 607 (661) Q Consensus 546 -----------~~~l-~-~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~ 607 (661) .... + .++.+.+.+|....+++.|-+...+++| |--.---=+..|-+| T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~--------------~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 477 YNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVAD--------------PNVVPPTDPNAVPQT 537 (537) T ss_pred hhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCC--------------CCCCCCCCCccCCCC Confidence 1111 1 1333334444444433333333344443 210000001111122 No 39 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.88 E-value=9e-22 Score=135.96 Aligned_cols=462 Identities=13% Similarity=0.076 Sum_probs=238.4 Q ss_pred CCCCCCccccccccccccccCCcccc--------CH----HH-----HHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVV--------HP----EY-----EYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG 63 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~--------hP----ey-----~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~ 63 (661) -++++ -||+--=.-.+|+..... .+ ++ ....++++++.+.|.|...+- +... T Consensus 10 ~~~~~---~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il--------~~~~ 78 (511) T protein:vir:96 10 DTDLR---GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL--------VELT 78 (511) T ss_pred hhhhh---hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccc--------cccC Confidence 22222 222211111122211110 11 11 123567888888888865431 1111 Q ss_pred CChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCH Q lcl|NC_019406. 64 FDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSH 143 (661) Q Consensus 64 E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL 143 (661) .....++.. .| .-.|+.+.+++.++|.+|.+||+++.-.+. ..+.+.++ .+-+++ T Consensus 79 ~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~----------------------~~~~l~~~-~~~n~~ 133 (511) T protein:vir:96 79 RRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDDDKD----------------------VLEAIEAF-NDLNDV 133 (511) T ss_pred cccccccCc-ce-eecchHHHHHHHHhhhhcccCceeecCchH----------------------HHHHHHHH-HhhcCh Confidence 111111111 12 336999999999999999999998521111 11222333 134689 Q ss_pred HHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccc Q lcl|NC_019406. 144 QGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQN 223 (661) Q Consensus 144 ~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~ 223 (661) +.+...+.+.++.+|+++++|-... ..+|-+..++|.+++-.--+.+. ...+..| T Consensus 134 ~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~~p~~~~~v~dd~~~-~~~~~~v------------------ 188 (511) T protein:vir:96 134 ESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFIIYDNTVE-RNSIAGV------------------ 188 (511) T ss_pred hHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEcccceEEEEcCCCC-CceEEEE------------------ Confidence 9999999999999999999996542 23577778888876432111111 1111111 Q ss_pred ceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc Q lcl|NC_019406. 224 PWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD 303 (661) Q Consensus 224 ~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~ 303 (661) +.|... ... ....+.++.+.++..+ .++++.. ..+. +.... T Consensus 189 ---r~~~~~----------------------------~~~--~~~~~~~~~~~vyt~~----~i~~~~~-~~~~-~~~~~ 229 (511) T protein:vir:96 189 ---RYLRTK----------------------------PID--KTDEDEVFTVDLFTSH----GVYRYLT-NRTN-GLKLT 229 (511) T ss_pred ---EEEEee----------------------------ecc--ccccceEEEEEEEeCC----cEEEEEe-cCCC-ccccc Confidence 111000 000 0011222233333322 1222221 1111 11111 Q ss_pred ceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee----- Q lcl|NC_019406. 304 VYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY----- 378 (661) Q Consensus 304 ~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l----- 378 (661) ........++++.||+|.+... .+. .+=+.++..|-=+.-...|++.+.+++.+.|+++++|....+...+ T Consensus 230 ~~~~~~~~~~~g~vPvv~~~n~--~~g--~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~ 305 (511) T protein:vir:96 230 PRENSFESHSFERMPITEFSNN--ERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKE 305 (511) T ss_pred ccccccccCcCcccceEEecCC--CCC--CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhccccc Confidence 1222344578999999987532 222 2223444444334445678889999999999999999543221111 Q ss_pred ---Eecccceee-----cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhh Q lcl|NC_019406. 379 ---HIGPGRVWV-----VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQ 449 (661) Q Consensus 379 ---~iGs~~~~~-----lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~ 449 (661) .+.....+. ....+++++||..+- ..+..+..++.++++|..+..-. +....-+++.||++.+....... T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~ 384 (511) T protein:vir:96 306 ANVLFLEPTVYVDAEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 384 (511) T ss_pred ccceeccccceeccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHH Confidence 111111111 112357899998653 45777888889999888764311 11112235679999998888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_019406. 450 SLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFV 524 (661) Q Consensus 450 S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~ 524 (661) ......-..+..++.+.+++++.+++.... +-.++.|.+++... .. ..+.++++.++ .|.||++|.+..| T Consensus 385 ~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p-~n-~~e~~d~~~kl--~G~iS~et~l~~l- 459 (511) T protein:vir:96 385 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP-KS-LIEELKAYIDS--GGKISQTTLMSLF- 459 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCC-cC-HHHHHHHHHHH--hccCChHHHHHhC- Confidence 888888888999999999999999864321 12245555544222 22 34577777777 4899999997543 Q ss_pred hcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcc----hhhhhcCChhhHHHH Q lcl|NC_019406. 525 KNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQ----QRAARDADFQQQELE 585 (661) Q Consensus 525 r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q----~~~~~e~d~~q~~~~ 585 (661) +.+ .+.++|.++|+++... .....+.........++. .++..++| +.| T Consensus 460 --~~v---~d~~~El~ri~~E~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----e~~ 511 (511) T protein:vir:96 460 --SFF---QDPELEVKKIEEDEKE----SIKKAQKGIYKDPRDINDDEQDDDTKDTVD----KKE 511 (511) T ss_pred --CCC---CCHHHHHHHHHHHHHH----HHHHHhhccccCCCCCCCCCCCCCccCccc----ccC Confidence 333 3467788888765221 000000000000000000 11112211 111 No 40 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.88 E-value=9e-22 Score=135.96 Aligned_cols=462 Identities=13% Similarity=0.076 Sum_probs=238.4 Q ss_pred CCCCCCccccccccccccccCCcccc--------CH----HH-----HHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVV--------HP----EY-----EYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG 63 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~--------hP----ey-----~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~ 63 (661) -++++ -||+--=.-.+|+..... .+ ++ ....++++++.+.|.|...+- +... T Consensus 10 ~~~~~---~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il--------~~~~ 78 (511) T protein:vir:78 10 DTDLR---GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL--------VELT 78 (511) T ss_pred hhhhh---hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccc--------cccC Confidence 22222 222211111122211110 11 11 123567888888888865431 1111 Q ss_pred CChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCH Q lcl|NC_019406. 64 FDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSH 143 (661) Q Consensus 64 E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL 143 (661) .....++.. .| .-.|+.+.+++.++|.+|.+||+++.-.+. ..+.+.++ .+-+++ T Consensus 79 ~~~~~~~~~-~k-i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~----------------------~~~~l~~~-~~~n~~ 133 (511) T protein:vir:78 79 RRKEEYMAD-NR-VAHDYASYISDFINGYFLGNPIQYQDDDKD----------------------VLEAIEAF-NDLNDV 133 (511) T ss_pred cccccccCc-ce-eecchHHHHHHHHhhhhcccCceeecCchH----------------------HHHHHHHH-HhhcCh Confidence 111111111 12 336999999999999999999998521111 11222333 134689 Q ss_pred HHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccc Q lcl|NC_019406. 144 QGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQN 223 (661) Q Consensus 144 ~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~ 223 (661) +.+...+.+.++.+|+++++|-... ..+|-+..++|.+++-.--+.+. ...+..| T Consensus 134 ~~~~~~~~~~~~~~G~a~~~vy~d~------dg~~~i~~~~p~~~~~v~dd~~~-~~~~~~v------------------ 188 (511) T protein:vir:78 134 ESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKSDAMSTFIIYDNTVE-RNSIAGV------------------ 188 (511) T ss_pred hHHHHHHHHHHHhcCeeEEEEEeCC------CCceEEEEEcccceEEEEcCCCC-CceEEEE------------------ Confidence 9999999999999999999996542 23577778888876432111111 1111111 Q ss_pred ceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc Q lcl|NC_019406. 224 PWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD 303 (661) Q Consensus 224 ~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~ 303 (661) +.|... ... ....+.++.+.++..+ .++++.. ..+. +.... T Consensus 189 ---r~~~~~----------------------------~~~--~~~~~~~~~~~vyt~~----~i~~~~~-~~~~-~~~~~ 229 (511) T protein:vir:78 189 ---RYLRTK----------------------------PID--KTDEDEVFTVDLFTSH----GVYRYLT-NRTN-GLKLT 229 (511) T ss_pred ---EEEEee----------------------------ecc--ccccceEEEEEEEeCC----cEEEEEe-cCCC-ccccc Confidence 111000 000 0011222233333322 1222221 1111 11111 Q ss_pred ceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee----- Q lcl|NC_019406. 304 VYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY----- 378 (661) Q Consensus 304 ~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l----- 378 (661) ........++++.||+|.+... .+. .+=+.++..|-=+.-...|++.+.+++.+.|+++++|....+...+ T Consensus 230 ~~~~~~~~~~~g~vPvv~~~n~--~~g--~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~ 305 (511) T protein:vir:78 230 PRENSFESHSFERMPITEFSNN--ERR--KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKE 305 (511) T ss_pred ccccccccCcCcccceEEecCC--CCC--CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhccccc Confidence 1222344578999999987532 222 2223444444334445678889999999999999999543221111 Q ss_pred ---Eecccceee-----cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhh Q lcl|NC_019406. 379 ---HIGPGRVWV-----VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQ 449 (661) Q Consensus 379 ---~iGs~~~~~-----lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~ 449 (661) .+.....+. ....+++++||..+- ..+..+..++.++++|..+..-. +....-+++.||++.+....... T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~ 384 (511) T protein:vir:78 306 ANVLFLEPTVYVDAEGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 384 (511) T ss_pred ccceeccccceeccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHH Confidence 111111111 112357899998653 45777888889999888764311 11112235679999998888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_019406. 450 SLLLNVIMALEDGMTSVVRYWLMFRDIPLT-----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFV 524 (661) Q Consensus 450 S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~ 524 (661) ......-..+..++.+.+++++.+++.... +-.++.|.+++... .. ..+.++++.++ .|.||++|.+..| T Consensus 385 ~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p-~n-~~e~~d~~~kl--~G~iS~et~l~~l- 459 (511) T protein:vir:78 385 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP-KS-LIEELKAYIDS--GGKISQTTLMSLF- 459 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCC-cC-HHHHHHHHHHH--hccCChHHHHHhC- Confidence 888888888999999999999999864321 12245555544222 22 34577777777 4899999997543 Q ss_pred hcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcc----hhhhhcCChhhHHHH Q lcl|NC_019406. 525 KNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQ----QRAARDADFQQQELE 585 (661) Q Consensus 525 r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q----~~~~~e~d~~q~~~~ 585 (661) +.+ .+.++|.++|+++... .....+.........++. .++..++| +.| T Consensus 460 --~~v---~d~~~El~ri~~E~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----e~~ 511 (511) T protein:vir:78 460 --SFF---QDPELEVKKIEEDEKE----SIKKAQKGIYKDPRDINDDEQDDDTKDTVD----KKE 511 (511) T ss_pred --CCC---CCHHHHHHHHHHHHHH----HHHHHhhccccCCCCCCCCCCCCCccCccc----ccC Confidence 333 3467788888765221 000000000000000000 11112211 111 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=99.88 E-value=2.9e-21 Score=133.15 Aligned_cols=449 Identities=11% Similarity=0.026 Sum_probs=238.3 Q ss_pred CCC-----C-CCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHh Q lcl|NC_019406. 1 MAG-----L-SPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLD 74 (661) Q Consensus 1 ~~~-----~-~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~ 74 (661) |.. + .+...++..+..- |.. | -....++|+++++.|.|...+..+. ....+.... .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-----i~~-~--~~~~~~r~~~~~~yy~g~~~i~~~~-----~~~~~~~~~--~k-- 63 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNY-----ISR-F--KAEQLERLKELKRYYLGDNNIKYRP-----AKTDKYAAD--NR-- 63 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHH-----HHH-H--HHHHHHHHHHHHHHhcccCcccccc-----ccccccCCc--ce-- Confidence 111 0 0111111100000 111 1 1235678999999999976553321 111111111 12 Q ss_pred hhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 75 RAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 75 rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) .-.|+.+.+|+.++|.+|.+||+++.-.+ ...++++++- +-++++.+...+.+.+ T Consensus 64 --i~~n~~~~iv~~~~~~l~g~~~~~~~~d~----------------------~~~~~l~~~~-~~n~~~~~~~~~~~~~ 118 (489) T protein:vir:99 64 --IASDFAKYITVFEQGYMLGVPVEYKNENK----------------------DLQAAIDLMS-VRNNEDYHNVKIKTDL 118 (489) T ss_pred --eecchHHHHHHHHhhhhccCCceeecCCh----------------------hHHHHHHHHH-hhcChhHHHHHHHHHH Confidence 24799999999999999999999852111 1223333332 2368999999999999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|+++++|-..+.. ....+|.+..++|.+++-.- ++......+-.|+. |..+ T Consensus 119 ~~~G~~~~~v~~~~~~--d~~~~~~i~~~~p~~~~~v~-dd~~~~~~~~~i~~---------------------~~~~-- 172 (489) T protein:vir:99 119 SIYGRAYELLTVEKID--DKKTEVKLYQLPAEQTFVIY-DDTYQRNSLMAVHF---------------------YDID-- 172 (489) T ss_pred hhCCeEEEEEeeccCc--CCCcceEEEEEcccceEEEE-cCCCCCceEEEEEE---------------------EEEe-- Confidence 9999999988643211 12457888888998864321 11111111111111 0000 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) .... .......++..+ .+|++.....+..+ ........+++ T Consensus 173 --------------------------~~~~-----~~~~~~~~y~~~----~i~~~~~~~~~~~~----~~~~~~~~~~~ 213 (489) T protein:vir:99 173 --------------------------YGSG-----KRKQIIKAYTSD----TIYTYEDYNLETKG----MRLKDYEGHFF 213 (489) T ss_pred --------------------------cCCC-----ceEEEEEEEeCC----cEEEEEecCCCccc----ceecccccccC Confidence 0000 011112222222 12222211111111 11111223679 Q ss_pred ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc------eeEecccc---- Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS------EYHIGPGR---- 384 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~------~l~iGs~~---- 384 (661) +.||||.+.... . +.+-|.++..|-=++-...|++.+.+.+.++|+++++|......+ ...+.++. T Consensus 214 g~vPvv~~~n~~--~--~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (489) T protein:vir:99 214 KGVPVNEYANNE--E--RTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAI 289 (489) T ss_pred CceeEEEeecCC--C--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhccccccccccc Confidence 999999886422 2 223344555555566667788999999999999999997533211 11111111 Q ss_pred --------eeecCC------CCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhh Q lcl|NC_019406. 385 --------VWVVDK------ESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQ 449 (661) Q Consensus 385 --------~~~lp~------~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~ 449 (661) .+.+.. .+.+++||..+- +.+..+..|+.+++.|+.++.-. +.....+++.||++.+....... T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 368 (489) T protein:vir:99 290 SIGFKKAQVLILDDNPNPNGVKPQAYFLKKEY-DTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASD 368 (489) T ss_pred ccccccceeeeeccccCccccccceeeeeecC-ChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHH Confidence 111111 134567777543 55777888899999998875322 11112235779999888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc------ceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHH Q lcl|NC_019406. 450 SLLLNVIMALEDGMTSVVRYWLMFRDIPLTDT------ATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENF 523 (661) Q Consensus 450 S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~------~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL 523 (661) .........+..++.+++++++.+++...... .++.|.+++ -.+.. ..+.++++.++ .|.||++|.+..| T Consensus 369 ~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~-~~p~d-~~~~~~~~~kl--~giis~et~~~~l 444 (489) T protein:vir:99 369 NYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTP-NLPQN-DNEIVTAAQNL--YGIVSDQTIFEIL 444 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCCHHHHHHhc Confidence 88888889999999999999999987543211 124444432 22222 24567777776 4899999997654 Q ss_pred HhcCCCCccCCHHHHHHHHhccCCC-CCCchhhhhh--cCCccccCCCc Q lcl|NC_019406. 524 VKNGIIPSTQTLEEFTIKMNDPKSF-IGQPDAIAMR--RGYVSRQQELD 569 (661) Q Consensus 524 ~r~gvl~~~~~~Eee~~~l~~~~~~-l~~ddae~~~--~g~~~~~~~~~ 569 (661) ..+ ++.+.++|.++|+++... ..+++..... ++..+..++.| T Consensus 445 ---~~v-~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 445 ---NTV-TGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred ---CCC-CchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 222 223567777788655221 1111111110 01111111111 No 42 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.88 E-value=1.4e-21 Score=134.92 Aligned_cols=460 Identities=10% Similarity=0.019 Sum_probs=238.3 Q ss_pred CCCC-CCccccccccccccccCCcc------ccCHHH-HHHHHHHHHHHHHhcch-HHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGL-SPNSANIRRTKRGAQQFTHL------VVHPEY-EYYRPDWAKIRDAIAGE-REIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~V~------~~hPey-~a~~~~W~~irD~~~G~-~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) ..++ -+..+|+.=......+-..+ .-.-.| ....++|+.+.+.|.|. ..+..+ +...+. + T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~------~~~~~~---~-- 83 (501) T protein:vir:96 15 VLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKS------GRRKDN---E-- 83 (501) T ss_pred ccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCc------cccCcc---c-- Confidence 0010 01122222111112211111 111112 13346788888888884 222111 111110 1 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +...=+-.|+.+.+|+.++|.+|.+||+++. ++ +. ..+.+.++++++ .+-++++..+..++ T Consensus 84 ~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~-~~-------~~----------~~~~~~~~l~~~-~~~n~~~~~~~~~~ 144 (501) T protein:vir:96 84 MADKRAVHNYGRMISKFKTGYLAGNPIRVEY-DD-------ND----------DNSQNDDAIKRI-GRINDLDSLNRTLI 144 (501) T ss_pred cccceeecchHHHHHHHHhhhhcccCeeEee-CC-------cc----------chhHHHHHHHHH-HHhcCHHHHHHHHH Confidence 1111245899999999999999999999852 11 00 112223333333 23579999999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|= +.. ..+|.+..++|.+++-.--+.+.+ ..+-.|+. +.. T Consensus 145 ~~~~~~G~a~~~v~-~de-----dg~~~i~~~~p~~~~~v~d~~~~~-~~~~~v~~--~~~------------------- 196 (501) T protein:vir:96 145 RDLSQTGRAYEVIY-RSE-----YDETRIKRLSPLETFVIYDNSLED-NSIAAVRY--YNR------------------- 196 (501) T ss_pred HHHhhcCeEEEEEE-EcC-----CCceEEEEEccceeEEEEcCCCCC-ceEEEEEE--EEe------------------- Confidence 99999999999983 322 235778888888764331111111 11111110 000 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG 311 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g 311 (661) ....+. +..+.++..+ .++++. .++.. ....... T Consensus 197 -----------------------------~~~~~~-----~~~~~vyt~~----~i~~~~---~~~~~-----~~~~~~~ 230 (501) T protein:vir:96 197 -----------------------------GTLQSA-----KDVVEIYTDE----HIYTLD---ASDDF-----NEISVTT 230 (501) T ss_pred -----------------------------ecCCCc-----EEEEEEEcCC----cEEEEe---eCCCc-----eeccccc Confidence 000000 0111112111 122221 11111 1111223 Q ss_pred cccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc--e--------eEec Q lcl|NC_019406. 312 RTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS--E--------YHIG 381 (661) Q Consensus 312 ~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~--~--------l~iG 381 (661) ++++.||+|.+... .+ +.+-|.++-.|-=+.=...|++.+.+.+.+.|+++++|....... . +.+. T Consensus 231 ~~~g~vPvv~~~nn--~~--g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~ 306 (501) T protein:vir:96 231 HAFGTVPITEYLNN--ID--GIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLK 306 (501) T ss_pred cCCCccceEEecCC--cc--CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeec Confidence 67999999987532 22 223344444433333346688888999999999999997543321 1 2222 Q ss_pred ccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) +..+......+++++|+..+. +.+..+..++.+.+.|..++.-. +....-+++.||++...............-..+. T Consensus 307 ~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~ 385 (501) T protein:vir:96 307 PPKSADGKEGTVKAEYLTKSY-DVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFT 385 (501) T ss_pred ccccccccccCcceeeEeccC-CHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222223456788987654 33567777888888888775322 1111123567999998888888888888889999 Q ss_pred HHHHHHHHHHHHHcCCCCC----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHH Q lcl|NC_019406. 461 DGMTSVVRYWLMFRDIPLT----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLE 536 (661) Q Consensus 461 ~Al~~aL~~~A~w~G~~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~E 536 (661) .++.+++++++.+++.... +...+.|..++ ..+.. ..+.++++.++ .|.||++|++..| +.+ .+++ T Consensus 386 ~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~-~~p~n-~~e~ad~~~kl--~g~iS~et~~~~l---~~v---~D~~ 455 (501) T protein:vir:96 386 KGLKRRYRLAARIGSLVNEFKDFDESLLKITFTP-NLPKS-LNEQVSILTGL--GGQVSQETALSLS---GLV---ESPN 455 (501) T ss_pred HHHHHHHHHHHHHHHhcccccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCchHHHHHhC---CCC---CCHH Confidence 9999999999999865321 22235555543 23333 34567777777 4899999997544 333 3466 Q ss_pred HHHHHHhccCCCCC--C-chhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 537 EFTIKMNDPKSFIG--Q-PDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 537 ee~~~l~~~~~~l~--~-ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) +|.++|+.+..... . .+......| +-...+...++|=.++..| T Consensus 456 ~E~~ri~~E~~~~~~~~~~~~~~~~~~------~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 456 EELDKINKEMSEIDFKGYSNDFNEHVG------KYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHHHHHHHHHhhccccccchhhccc------ccCCcCCCCCCCccccccC Confidence 77778865532110 0 000001111 1111123344443333333 No 43 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.88 E-value=1.6e-21 Score=134.63 Aligned_cols=411 Identities=12% Similarity=0.098 Sum_probs=223.6 Q ss_pred cCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhc Q lcl|NC_019406. 57 YLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRF 136 (661) Q Consensus 57 YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~ 136 (661) |||+-- .+.|+.+.++++ .|+.+-+|+.++++++-...+.. |.+ .. ..++++. T Consensus 1 ~l~~~~---~~~~~~~~~~~v-~n~~~~ivd~~~~~l~~~gf~~~-----------d~~------~~---~~~~~i~--- 53 (434) T protein:vir:98 1 MLPKNA---EQAFLDFQRKAR-TNFCGLIANASVHRLLALGVTGP-----------DGE------PD---TRASRWW--- 53 (434) T ss_pred CCCCCc---cHHHHHhhhhhh-ccchHHHHHHHHhhhccCceecC-----------CCc------hH---HHHHHHH--- Confidence 888754 477887776654 49999999999998875443321 111 11 2233332 Q ss_pred cCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCc-hhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecc Q lcl|NC_019406. 137 AKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSD-PTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDE 215 (661) Q Consensus 137 dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~-~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~ 215 (661) +-|+++..+..+.+.++.||+++++|....... .....+|.+..++|++++ --++...++ +...+ +-+... . T Consensus 54 --~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~-~i~D~~~~~--~~~ai-~~~~~~-~ 126 (434) T protein:vir:98 54 --QANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECI-VEYDPETGE--PLVGL-KVWHND-I 126 (434) T ss_pred --HhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeE-EEEeCCCCc--eEEEE-EEEEec-c Confidence 347999999999999999999999997643221 112346777788998764 222322221 21111 101000 0 Q ss_pred ccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEec Q lcl|NC_019406. 216 HATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVE 295 (661) Q Consensus 216 ~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~ 295 (661) + +.....+| ..+. ..++... ... T Consensus 127 ~-------------------------------------------------~~~~~~~~-----~~~~--~~~~~~~-~~~ 149 (434) T protein:vir:98 127 D-------------------------------------------------GFGYARVF-----FDDT--SFPYRTR-ERT 149 (434) T ss_pred C-------------------------------------------------CceEEEEE-----EeCc--EEEEEEe-ecc Confidence 0 00000000 0000 0000000 000 Q ss_pred Ccccc------cccceeeccCCcccceeeEEEEecC-CC-CCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEE Q lcl|NC_019406. 296 DPLGQ------ARDVYTPMVRGRTLPFIPFVFFGSM-SN-AADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYA 367 (661) Q Consensus 296 ~~~~~------~~~~~~p~~~g~~L~~IPfv~~~~~-~~-~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i 367 (661) ..... ......+...-++++.||+|.|-.. .. .+ +..=+.++-.|-=+.=+..|+...+..+.++|++++ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~--g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i 227 (434) T protein:vir:98 150 GARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGED--PEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWI 227 (434) T ss_pred ccccccccccceecccccccccCCCCccceEEeccCCCcCcC--CcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh Confidence 00000 0011112222357999999976422 11 12 222244444444455556778899999999999999 Q ss_pred ecCCCCCCc-----e------eEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCc Q lcl|NC_019406. 368 PELDDSDAS-----E------YHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKS 433 (661) Q Consensus 368 ~Gl~~~~~~-----~------l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~ 433 (661) +|.+..+.. . +..+.+..|.+| +.++++.|+++..++.+.+.|+.+..++....- ..+- . .. T Consensus 228 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~-~-~~ 303 (434) T protein:vir:98 228 KGHKFAKRTDPATGMTVVDQPFVPSPSAVWASE--GENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYA-T-DL 303 (434) T ss_pred cCCCcccccccccccchhhhhhhccccccccCC--CCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhc-c-cc Confidence 997654311 1 223444555555 456888899999999998888888887754321 1111 1 12 Q ss_pred cchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCC Q lcl|NC_019406. 434 VSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGL 513 (661) Q Consensus 434 ~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~ 513 (661) ++.||++..............+-..+..++.+++++++.+.|.+. +...+.+.. ++..+.. .++.++++.++..+| T Consensus 304 ~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~-~~~~~~v~w-~~~~~~s-~~~~ada~~kl~~~g- 379 (434) T protein:vir:98 304 VNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPE-DYTEAEVRW-ANPAHVT-MAVKADAATKLKSIG- 379 (434) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCh-hheeeeEEe-cCCCCCC-HHHHHHHHHHHHhcC- Confidence 467999999888888888888888888899999999999998754 223444444 3344444 356888999998877 Q ss_pred CCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcC Q lcl|NC_019406. 514 LPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDA 577 (661) Q Consensus 514 Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~ 577 (661) +|+++++..| |+.+++ .+.+.++.+++..- .....++.|.. ...+.+.+..+..+ T Consensus 380 ~~~e~~~~~l---g~~~~e--~~r~~~e~~~~~~~---~~~~~~~~~~~-~~g~~~~~~~~~dg 434 (434) T protein:vir:98 380 YPLDVIAEEL---DESPAR--VRRIVAGAASQALL---AASLLPAPGAP-SAGNVPDSGGAVDG 434 (434) T ss_pred CcHHHHHHhC---CCCHHH--HHHHHHHHHHHHHH---HHhhhccCCCC-CCCCCCcccCCCCC Confidence 6888886543 443221 12222222111000 00001111111 01112222222222 No 44 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.87 E-value=1.1e-21 Score=135.52 Aligned_cols=463 Identities=11% Similarity=0.045 Sum_probs=234.8 Q ss_pred cccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhc Q lcl|NC_019406. 16 RGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFR 95 (661) Q Consensus 16 ~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFr 95 (661) =+|..--|..---.+....+++.++.+-|.|...++. ++.. -...++... .-.|+.+-+|+.++++++- T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~-----~~~~---~~~~~~~~~---~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIG---APPELAYLD---VQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchh-----cccc---cchhhhhhh---hhcchHHHHHHHHHhhhcc Confidence 0111111333334566677888889999988765432 3322 222333221 3369999999999999976 Q ss_pred cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhc Q lcl|NC_019406. 96 RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAP 175 (661) Q Consensus 96 k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g 175 (661) ....+.+ |. ...+.+..++ +-++++..+..+++.++.+|+|+++|........-.. T Consensus 70 ~g~~~~~----------d~------~~~~~l~~i~--------~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~ 125 (480) T protein:vir:78 70 EGFRISE----------DS------EGLEELWNWW--------QANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) T ss_pred CceecCC----------Cc------hhHHHHHHHH--------HhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCC Confidence 5544321 11 1122233333 3578999999999999999999999974322222234 Q ss_pred ccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhh Q lcl|NC_019406. 176 AKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARAD 255 (661) Q Consensus 176 ~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~ 255 (661) .+|.+..++|++++-- ++... .+.++..+. .+... T Consensus 126 ~~~~i~~~~p~~~~~i-~D~~~-~~~~~~~i~-~~~~~------------------------------------------ 160 (480) T protein:vir:78 126 GIPLIRVESPLYMYAE-LDPRN-TRRVTRAVR-LYTTR------------------------------------------ 160 (480) T ss_pred CeeEEEEEcccceEEE-EcCCC-ccceEEEEE-EEEee------------------------------------------ Confidence 5577888888876521 11111 111111111 00000 Q ss_pred heecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCCCCcccc Q lcl|NC_019406. 256 ALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNAADCEKP 334 (661) Q Consensus 256 ~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~~~~~~p 334 (661) .+. ..+.++.++..+. ++.+. ...+....+. +.....-+.++.||||.|... ..+...+.+ T Consensus 161 -------d~~----~~~~~~~~y~~~~----~~~~~-~~~~~~~~~~--~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s 222 (480) T protein:vir:78 161 -------DDV----AVPDRATLYLPDE----TVPLR-RNGGLNDQWV--VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) T ss_pred -------cCC----cceEEEEEEeCCe----EEEEE-ecCCCccccc--ccccccccCCCCcceEEeecccccCCccCcc Confidence 000 0112223333321 11111 1111111110 000111256899999976422 111112222 Q ss_pred chh-HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC------CceeEecccceeecCCCCCcceEeecCchhHHH Q lcl|NC_019406. 335 PLL-DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD------ASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKT 407 (661) Q Consensus 335 PLl-dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~------~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a 407 (661) =|. +|..|.=+.=+..|+...++.+.++|+++++|.+... ...+....+..|.++ |..++|.++++..++. T Consensus 223 di~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 300 (480) T protein:vir:78 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA--SEAAKISEFKAAELRN 300 (480) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCC--CCCceEEecCccCHHH Confidence 222 2333333333456678889999999999999976432 112333445555555 4568899999999999 Q ss_pred HHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-Ccce Q lcl|NC_019406. 408 LERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-DTAT 483 (661) Q Consensus 408 ~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-~~~~ 483 (661) +.+.|+.+..++....- .-+ ......+-||.+..............+-..+..+|.+++++++.+.|.... +... T Consensus 301 ~~~~l~~~i~~~~~~~~~p~~~f-g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~ 379 (480) T protein:vir:78 301 FAEEMEVFRKEAASITGLPPQYL-SSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) T ss_pred HHHHHHHHHHHHhcccCCCHHHh-ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcccccee Confidence 99988888888864321 111 111111247888877766666666666777778899999999999985432 1223 Q ss_pred EEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCC Q lcl|NC_019406. 484 LRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGY 561 (661) Q Consensus 484 ~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~ 561 (661) +.+....- .+.. .++.++++.+++++| .+|+++++..| |+.++.. +++++..+++ ++.+.+.. T Consensus 380 i~v~w~~~-~~~s-~~~~ad~~~kl~~~g~~~~s~et~~~~l---g~~~d~~--~e~~~~~~~~--------~~~~~~~~ 444 (480) T protein:vir:78 380 LETVWRDP-STPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQR--EQMRDWDKQE--------TEDMIDTL 444 (480) T ss_pred eeEEecCC-CCCC-HHHHHHHHHHHHHhcccCCCHHHHHhcC---CCCHhHH--HHHHHHHHHH--------HHHHHHHh Confidence 44443322 2233 357788899999876 68999986543 6654322 2222222111 01111100 Q ss_pred ccccCCCcchh---hhhcCChhhHHHHHHHhccCCCchhHHHhhh Q lcl|NC_019406. 562 VSRQQELDQQR---AARDADFQQQELEQAERHLEIDEEKLRISAK 603 (661) Q Consensus 562 ~~~~~~~~q~~---~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~ 603 (661) ....++-++.+ ...+.+ .|.|+.+.+. +|+. .| T Consensus 445 ~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~--~~~~--~~ 480 (480) T protein:vir:78 445 YSTTKAQADATPKPTVTETK-----TETQTSPSGF--NRTK--TR 480 (480) T ss_pred hccccCCCccccCCCCCCCC-----CccCCCcccC--CCcC--CC Confidence 00000000000 011111 1222222222 1111 11 No 45 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.87 E-value=4.8e-22 Score=137.48 Aligned_cols=463 Identities=9% Similarity=0.002 Sum_probs=243.9 Q ss_pred CCCC-CCccc-ccccc-ccccccCCc--cccCH---HHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGL-SPNSA-NIRRT-KRGAQQFTH--LVVHP---EYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~-~~~~~-~~~~~-~~~~~~~~V--~~~hP---ey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |+-+ -|+-. .+... .+...+... ....- .+....++..++.+.|.|...+..+..++.-+ .. .+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~---~~--~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNK---GE--IDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhccc---cc--ccccc Confidence 6655 23211 11111 000001100 00000 12334566677777787865554433222111 11 11111 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) -..=.-.|+.+.+++..+|.+|.+||+++ .++ .+..+.+.+++ .+++......+.+ T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~-~~d-----------------~~~~~~l~~~~------~n~~~~~~~~~~~ 131 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFS-SDD-----------------DKSLKTIQEVL------NHKWDDKLVDILT 131 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceee-cCc-----------------hHHHHHHHHHH------hcCHHHHHHHHHH Confidence 11112369999999999999999999985 221 11223444443 2467777888889 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++.+|+++++|..+. ..++.+..++|.+++-.--+...+ .+..+ ++.+. .++ ..++..|... T Consensus 132 ~~~~~G~~~~~~y~d~------~~~~~i~~~~p~~~~~v~d~~~~~--~~~~~-vr~~~-~~~-------~~~~~~yt~~ 194 (474) T protein:vir:96 132 AASNKGIEWLQPYIDE------NGEFKTFRVPAEQAIPIWTNKERD--TLKAF-IRYYR-LDG-------AERVEYWTDS 194 (474) T ss_pred HHHhcCeeEEEEEecC------CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEe-ecC-------ceEEEEEeCC Confidence 9999999999997653 246888899998876442221111 11111 11111 000 0011111111 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) .|..| ....+ ... ....+.... ..........-+ T Consensus 195 ~v~~~---------------------------------------~~~~~---~~~-~~~~~~~~~---~~~~~~~~~~~~ 228 (474) T protein:vir:96 195 DVTYY---------------------------------------EYQDG---ILI-PDYYHGEEH---IQSHYYVGNKRV 228 (474) T ss_pred eEEEE---------------------------------------EecCC---cee-ecccccccc---cccccccccccc Confidence 11111 01000 000 000000000 000000111235 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc--eeEecccceeecCC Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS--EYHIGPGRVWVVDK 390 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~--~l~iGs~~~~~lp~ 390 (661) .++.||||.+.... .. .+=|.++-.|-=+.=...|++.+.+...+.|+++++|.+.++.. ...+....++.+++ T Consensus 229 ~~g~iPvv~~~nn~--~g--~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~ 304 (474) T protein:vir:96 229 SWGRVPFIPFKNNP--QE--MSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDG 304 (474) T ss_pred CCCceeEEEeccCC--CC--CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecC Confidence 79999999885432 22 22233333332222235567788888899999999998654422 22455667888887 Q ss_pred CCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 391 ESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRY 469 (661) Q Consensus 391 ~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~ 469 (661) +|++++|+..+. +.+..+..++.++++|.....-. +....-+++.||++.+.............-..+..++.+.+++ T Consensus 305 ~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (474) T protein:vir:96 305 DGSGVDTIQIEV-PVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQY 383 (474) T ss_pred CCCceeEEeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 788999999754 55788888999999998765322 1111223567999988888888888888888999999999999 Q ss_pred HHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCC Q lcl|NC_019406. 470 WLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFI 549 (661) Q Consensus 470 ~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l 549 (661) ++.+.|... +..++.|.+++. .+. +..++. ..+.++|.||++|++..| +. ..++++|.++|+++.. T Consensus 384 i~~~~~~~~-~~~~i~i~f~~~-~p~--~~~e~~--~~~~~ag~iS~et~~~~~---~~---v~d~~~E~~ri~~E~~-- 449 (474) T protein:vir:96 384 IIDFYKLNI-KVQDVEITFNFN-VMV--NELEQS--QIGVQSQYLSKETVVTNH---PW---VDDPVAELERIEQDNI-- 449 (474) T ss_pred HHHHhCCCc-ccceeeEEeccC-CCc--CHHHHH--HHHHhcCCCchHHHHHhC---CC---CCCHHHHHHHHHHHHH-- Confidence 999999764 344556655432 222 222222 234568999999997543 33 3356778888876532 Q ss_pred CCchhhhhhcCCccccCCCcchhhhhcCC Q lcl|NC_019406. 550 GQPDAIAMRRGYVSRQQELDQQRAARDAD 578 (661) Q Consensus 550 ~~ddae~~~~g~~~~~~~~~q~~~~~e~d 578 (661) +..+.+.....+. .-.++....|-| T Consensus 450 --e~~~~~~~~~~~~--~~~~~d~~~e~~ 474 (474) T protein:vir:96 450 --DFNKQLPPLEGDA--NGRAQDNESETN 474 (474) T ss_pred --HHHhccccccccc--ccccCCCcccCC Confidence 1111111110010 001111111222 No 46 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.87 E-value=2.1e-21 Score=133.97 Aligned_cols=453 Identities=10% Similarity=0.022 Sum_probs=244.8 Q ss_pred CCccccC-------HHHHHHHHHHHHHHHHhcchHHHHhCCcc-cCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhch Q lcl|NC_019406. 21 FTHLVVH-------PEYEYYRPDWAKIRDAIAGEREIKAQGVK-YLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQ 92 (661) Q Consensus 21 ~~V~~~h-------Pey~a~~~~W~~irD~~~G~~~vr~~g~~-YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~ 92 (661) |++.... ..+....++++.+.+.|.|.+.+..+-.. |-....+...... +...=.-.|+.+.+++..+|. T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~--~~~~ki~~n~~k~Iv~~~~~y 78 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLR--SADNRIPSNFYQLLVDQEAGY 78 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccc--cCCcccccchHHHHHHhhhhh Confidence 7654432 23455668888889999998766543211 1111111111111 111122489999999999999 Q ss_pred hhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCch Q lcl|NC_019406. 93 IFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDP 172 (661) Q Consensus 93 vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~ 172 (661) +|.+||+++. .+ .+..+++++++. .+.......+.+.++.+|+++++|=+.. T Consensus 79 l~G~p~~~~~-~d-----------------~~~~~~l~~~~~------~~~~~~~~~l~~~~~~~G~a~~~~y~d~---- 130 (470) T protein:vir:10 79 VASVFPDIDV-GK-----------------DADNKKIIDVLG------DDRALTLNGLLVDSSNAGRAWLHYWIDE---- 130 (470) T ss_pred eeccceeeec-Cc-----------------hHHHHHHHHHHh------hhHHHHHHHHHHHHhhcCeeEEEEEecC---- Confidence 9999999852 11 011234444432 3455566678889999999999985432 Q ss_pred hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhh Q lcl|NC_019406. 173 TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSA 252 (661) Q Consensus 173 ~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~ 252 (661) ..++-+..++|.+++=.--+.+.+ .+..+ +|.+...+.+. .....++..|..+.+.-|+... T Consensus 131 --~~~~~~~~~~p~~~~~v~d~~~~~--~~~a~-ir~y~~~~~~~--~~~~~~~e~yt~~~~~~~~~~~----------- 192 (470) T protein:vir:10 131 --DGNFRYGIIQPDQITPIYATTLDN--KLLGI-LRSYKQLDPDS--GKYFTVHEYWTDKEAQFFRTNA----------- 192 (470) T ss_pred --CCceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEEeeecCC--ceEEEEEEEEcCCcEEEEEeec----------- Confidence 234667778887765432221111 12221 22221111110 0111111222222111111000 Q ss_pred hhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecC---cccccccceeeccCCcccceeeEEEEecCCCCC Q lcl|NC_019406. 253 RADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVED---PLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAA 329 (661) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~---~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~ 329 (661) . .......+... ......+........+.++.||||.+.... . T Consensus 193 ----------------------------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~--~ 238 (470) T protein:vir:10 193 ----------------------------T----DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNK--Y 238 (470) T ss_pred ----------------------------C----cceeccccccccccccccccccccccccccCCCeeeEEEeecCC--C Confidence 0 00000000000 000001111112234679999999886432 2 Q ss_pred CccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccceeecCC----CCCcceEeecCch Q lcl|NC_019406. 330 DCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRVWVVDK----ESGIPGIIEFKGE 403 (661) Q Consensus 330 ~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~~~lp~----~ga~~~ylE~~g~ 403 (661) +.+-|.++-.|-=+.=...|++.+.+.+.+.|+++++|....+... ..+....++.++. .+++++|+..+.. T Consensus 239 --g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~ 316 (470) T protein:vir:10 239 --RLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIP 316 (470) T ss_pred --CCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCC Confidence 2233444444443444466778888889999999999976443211 1222334455543 2467899997664 Q ss_pred hHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcce Q lcl|NC_019406. 404 GLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTAT 483 (661) Q Consensus 404 ~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~ 483 (661) .+..+..|+.++++|...+.-.-....+.++.|+++...............-..+..++.+++++++.++|....+... T Consensus 317 -~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~ 395 (470) T protein:vir:10 317 -VEARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRH 395 (470) T ss_pred -hHHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccce Confidence 5888999999999999876433222334567899999888777777777788888889999999999999987766667 Q ss_pred EEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCcc Q lcl|NC_019406. 484 LRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVS 563 (661) Q Consensus 484 ~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~ 563 (661) +.+++++..... ..+.++.+.++ +|.||.+|.+..+ +. ..++++|.++|+++.. +..+-. . T Consensus 396 i~i~f~~~~p~d--~~e~~~~~~~~--~g~iS~et~l~~~---p~---v~D~~~E~eri~~E~~-------e~~~~~-~- 456 (470) T protein:vir:10 396 ISQHWTRTKVED--SLTKAQIVSTV--ANYSSKEAVAKAN---PI---VDDWQQELKDLAKDKE-------ENDPYS-N- 456 (470) T ss_pred eeEEeccCCCCC--HHHHHHHHHHH--hccCcHHHHHHhC---CC---CCCHHHHHHHHHHHHH-------HHHHhh-c- Confidence 777766533322 23345555443 7999999996543 33 3356777888876411 111100 0 Q ss_pred ccCCCcchhhhhcCChhh Q lcl|NC_019406. 564 RQQELDQQRAARDADFQQ 581 (661) Q Consensus 564 ~~~~~~q~~~~~e~d~~q 581 (661) .+. +-...+.|=+| T Consensus 457 ~~~----~~~~~~~dde~ 470 (470) T protein:vir:10 457 QAD----ELNGKGVNDEQ 470 (470) T ss_pred ccc----ccCCCCCCCCC Confidence 000 00011111111 No 47 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.87 E-value=2.2e-21 Score=133.84 Aligned_cols=458 Identities=11% Similarity=0.039 Sum_probs=234.4 Q ss_pred CCCCC--Cccccc--cccccccccCC-----ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLS--PNSANI--RRTKRGAQQFT-----HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~--~~~~~~--~~~~~~~~~~~-----V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |...- |+.-|= .=+++.....+ |..---.+....++...+.+-|.|.+.+..+. .........++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~-----~~~~~~~~~~~~ 75 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQA-----YKQDLHGNIDYT 75 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc-----chhhhccccccc Confidence 22110 111000 00111111111 11111223445566777777788875543321 111100011111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-..-+-.|+.+.+++..+|.+|.+||+++.-.+ +..+.++.++ +++++.....+. T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~------------------~~~~~l~~~~------~n~~~~~~~~l~ 131 (474) T protein:vir:96 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD------------------KVLDVIHQVL------DTRWDNKLIDIL 131 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceeccCCh------------------HHHHHHHHHH------hccHHHHHHHHH Confidence 1111134699999999999999999999852111 1123333332 367889999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|-.... .+|-+..++|++++=.--+...+ .+..+ +|.+.. + ...++..|.. T Consensus 132 ~~~~~~G~~~~~~~~d~~------~~~~i~~~~p~~~~~v~d~~~~~--~~~a~-ir~~~~-~-------~~~~~~vy~~ 194 (474) T protein:vir:96 132 TAASNKGIDWLQVYINED------GELKLFRVPAEQAIPIWTDKERE--QLNAF-IRIFTF-N-------GETKVEYWTA 194 (474) T ss_pred HHHhhCCeEEEEeeeCCC------CceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEee-c-------CeeEEEEEeC Confidence 999999999999976432 35677788888765221111111 11111 111100 0 0001111111 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc---c-cccceee Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG---Q-ARDVYTP 307 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~---~-~~~~~~p 307 (661) ..| + . +++..+... . ....... T Consensus 195 ~~i--------------------------------------~---------------~-~~~~~~~~~~~~~~~~~~~~~ 220 (474) T protein:vir:96 195 ETV--------------------------------------T---------------Y-YVYENGGLIPDFYYGDEHIQT 220 (474) T ss_pred CeE--------------------------------------E---------------E-EEEcCCceeeccccccccccC Confidence 111 0 0 011111000 0 0001111 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~ 385 (661) ....+.++.||||.+.....+ .+-|.++-.|-=+.=...|++.+.+.+.+.|+++++|+...+... -.+....+ T Consensus 221 ~~~~~~~~~vPvv~~~nn~~~----~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~ 296 (474) T protein:vir:96 221 HFSTGSWERVPFIAFKNNPEE----VSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKA 296 (474) T ss_pred cccccCCCccceEEecCCCCC----CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccce Confidence 122357999999988543221 122222222222222466777888889999999999986544222 22333445 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) +.++ ++++++|+..+. +.+..+..++.+.+.+..++.-.- ....-+++.||++.+.............-..+..++. T Consensus 297 i~~~-~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:96 297 INVS-SDGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred eecc-CCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5566 467899998764 457888999999999988753221 1112235678888888877777777888889999999 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +.+++++.+.|... +..++.+.+++.. +.. ..+.++ .+.++|.||++|++..| +. ..++++|.++|++ T Consensus 375 ~~~~~i~~~~g~~~-d~~~i~i~f~~~~-p~~-~~e~a~---~~~~~giiS~et~~~~l---p~---v~D~~~E~eri~~ 442 (474) T protein:vir:96 375 ELMQFILDFNKIKL-DAKEIEITFNFNV-MVN-DLEQSQ---IGAQSQYLSKETLVRHH---PW---VDDPKAELERLDE 442 (474) T ss_pred HHHHHHHHHhCCCc-ccceeeEEecCCC-ccC-HHHHHH---HHHHcCCCChHHHHHhC---CC---CCCHHHHHHHHHH Confidence 99999999999754 4455666654322 222 122333 34468999999996443 33 3356777788876 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHH Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQE 583 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~ 583 (661) +.... .+..+.+.++....-. .....+++ |-| T Consensus 443 E~~~~-~~~~~~~~~~~~~~~~----~~~~~~~~--e~~ 474 (474) T protein:vir:96 443 EQLEL-NKQLPNLDDGGADGAQ----QQQQSENN--QSK 474 (474) T ss_pred HHHHH-HhhccccccccCCCCC----CcCCCCcc--ccC Confidence 53210 0001111111111100 00011111 001 No 48 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.87 E-value=2.2e-21 Score=133.84 Aligned_cols=458 Identities=11% Similarity=0.039 Sum_probs=234.4 Q ss_pred CCCCC--Cccccc--cccccccccCC-----ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLS--PNSANI--RRTKRGAQQFT-----HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~--~~~~~~--~~~~~~~~~~~-----V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |...- |+.-|= .=+++.....+ |..---.+....++...+.+-|.|.+.+..+. .........++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~-----~~~~~~~~~~~~ 75 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQA-----YKQDLHGNIDYT 75 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc-----chhhhccccccc Confidence 22110 111000 00111111111 11111223445566777777788875543321 111100011111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-..-+-.|+.+.+++..+|.+|.+||+++.-.+ +..+.++.++ +++++.....+. T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~------------------~~~~~l~~~~------~n~~~~~~~~l~ 131 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD------------------KVLDVIHQVL------DTRWDNKLIDIL 131 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceeccCCh------------------HHHHHHHHHH------hccHHHHHHHHH Confidence 1111134699999999999999999999852111 1123333332 367889999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +.++.+|+++++|-.... .+|-+..++|++++=.--+...+ .+..+ +|.+.. + ...++..|.. T Consensus 132 ~~~~~~G~~~~~~~~d~~------~~~~i~~~~p~~~~~v~d~~~~~--~~~a~-ir~~~~-~-------~~~~~~vy~~ 194 (474) T protein:vir:95 132 TAASNKGIDWLQVYINED------GELKLFRVPAEQAIPIWTDKERE--QLNAF-IRIFTF-N-------GETKVEYWTA 194 (474) T ss_pred HHHhhCCeEEEEeeeCCC------CceEEEEEcccceEEEEcCCCCC--ceEEE-EEEEee-c-------CeeEEEEEeC Confidence 999999999999976432 35677788888765221111111 11111 111100 0 0001111111 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc---c-cccceee Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG---Q-ARDVYTP 307 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~---~-~~~~~~p 307 (661) ..| + . +++..+... . ....... T Consensus 195 ~~i--------------------------------------~---------------~-~~~~~~~~~~~~~~~~~~~~~ 220 (474) T protein:vir:95 195 ETV--------------------------------------T---------------Y-YVYENGGLIPDFYYGDEHIQT 220 (474) T ss_pred CeE--------------------------------------E---------------E-EEEcCCceeeccccccccccC Confidence 111 0 0 011111000 0 0001111 Q ss_pred ccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~iGs~~~ 385 (661) ....+.++.||||.+.....+ .+-|.++-.|-=+.=...|++.+.+.+.+.|+++++|+...+... -.+....+ T Consensus 221 ~~~~~~~~~vPvv~~~nn~~~----~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~ 296 (474) T protein:vir:95 221 HFSTGSWERVPFIAFKNNPEE----VSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKA 296 (474) T ss_pred cccccCCCccceEEecCCCCC----CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccce Confidence 122357999999988543221 122222222222222466777888889999999999986544222 22333445 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhc-ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLM-PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll-~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) +.++ ++++++|+..+. +.+..+..++.+.+.+..++.-.- ....-+++.||++.+.............-..+..++. T Consensus 297 i~~~-~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~ 374 (474) T protein:vir:95 297 INVS-SDGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQ 374 (474) T ss_pred eecc-CCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5566 467899998764 457888999999999988753221 1112235678888888877777777888889999999 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +.+++++.+.|... +..++.+.+++.. +.. ..+.++ .+.++|.||++|++..| +. ..++++|.++|++ T Consensus 375 ~~~~~i~~~~g~~~-d~~~i~i~f~~~~-p~~-~~e~a~---~~~~~giiS~et~~~~l---p~---v~D~~~E~eri~~ 442 (474) T protein:vir:95 375 ELMQFILDFNKIKL-DAKEIEITFNFNV-MVN-DLEQSQ---IGAQSQYLSKETLVRHH---PW---VDDPKAELERLDE 442 (474) T ss_pred HHHHHHHHHhCCCc-ccceeeEEecCCC-ccC-HHHHHH---HHHHcCCCChHHHHHhC---CC---CCCHHHHHHHHHH Confidence 99999999999754 4455666654322 222 122333 34468999999996443 33 3356777788876 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHH Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQE 583 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~ 583 (661) +.... .+..+.+.++....-. .....+++ |-| T Consensus 443 E~~~~-~~~~~~~~~~~~~~~~----~~~~~~~~--e~~ 474 (474) T protein:vir:95 443 EQLEL-NKQLPNLDDGGADGAQ----QQQQSENN--QSK 474 (474) T ss_pred HHHHH-HhhccccccccCCCCC----CcCCCCcc--ccC Confidence 53210 0001111111111100 00011111 001 No 49 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.87 E-value=2.1e-21 Score=133.93 Aligned_cols=466 Identities=12% Similarity=0.035 Sum_probs=231.7 Q ss_pred cccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhc Q lcl|NC_019406. 16 RGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFR 95 (661) Q Consensus 16 ~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFr 95 (661) =+|..--|..---.+....+++.++.+-|.|...++ |++ ..-...++++ -.-.|+.+-+|+.++++++- T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~-----~~~---~~~~~~~~~~---~~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-----TIG---IGAPPELAYL---DVQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccc---cccchhHhhh---hhhcchHHHHHHHHHhhhcc Confidence 011111122222345566777888889998875543 232 2233344433 24468999999999999865 Q ss_pred cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhc Q lcl|NC_019406. 96 RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAP 175 (661) Q Consensus 96 k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g 175 (661) ...++.+ |. ...+.+..++ +-++++..+..+++.++.||+|+++|........-.. T Consensus 70 ~g~~~~~----------d~------~~~~~l~~i~--------~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~ 125 (480) T protein:vir:78 70 EGFRISE----------DS------EGLEELWNWW--------QANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPA 125 (480) T ss_pred CceecCC----------Cc------hhHHHHHHHH--------HhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCC Confidence 5444321 11 1122223333 3579999999999999999999999975432222234 Q ss_pred ccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhh Q lcl|NC_019406. 176 AKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARAD 255 (661) Q Consensus 176 ~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~ 255 (661) .+|.+..++|.+++-.- +.. ....++..+. .+.. T Consensus 126 g~~~i~~~~p~~~~~~~-D~~-~~~~~~~~i~-~~~~------------------------------------------- 159 (480) T protein:vir:78 126 GIPLIRVESPLYMYAEL-DPR-NTRRVTRAVR-LYTT------------------------------------------- 159 (480) T ss_pred CeeEEEEEcccceEEEE-cCC-CccceEEEEE-EEEe------------------------------------------- Confidence 56778888888775322 110 0111111110 0000 Q ss_pred heecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEec-CCCCCCcccc Q lcl|NC_019406. 256 ALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGS-MSNAADCEKP 334 (661) Q Consensus 256 ~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~-~~~~~~~~~p 334 (661) ..+ ...+.++.++..+. ++.+. ...+....+... ....-+.++.||||.|.. ...+..-+.+ T Consensus 160 ------~~~----~~~~~~~~~y~~~~----~~~~~-~~~~~~~~~~~~--~~~~~~~~g~vPvv~f~n~~~~~~~~G~s 222 (480) T protein:vir:78 160 ------RDD----VAVPDRATLYLPDE----TVPLR-RNGGLNDQWVVD--GDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) T ss_pred ------ecC----CCceEEEEEEeCCe----EEEEE-ecCCCccccccc--cccccCCCCCcceEEeecccccCCccCcc Confidence 000 01122333333321 11111 111111111000 011125689999997642 1222111222 Q ss_pred chh-HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC------ceeEecccceeecCCCCCcceEeecCchhHHH Q lcl|NC_019406. 335 PLL-DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA------SEYHIGPGRVWVVDKESGIPGIIEFKGEGLKT 407 (661) Q Consensus 335 PLl-dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~------~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a 407 (661) =|. +|-.|.=+.=+..|+...++.+.++|++++.|.+.... ..+....+..|.++ |++++|.++++..++. T Consensus 223 ~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 300 (480) T protein:vir:78 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA--SEAAKISEFKAAELRN 300 (480) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCC--CCCceEEecCccCHHH Confidence 222 23232222234556788899999999999999864331 11233344455554 4578899999999999 Q ss_pred HHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-Ccce Q lcl|NC_019406. 408 LERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT-DTAT 483 (661) Q Consensus 408 ~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~-~~~~ 483 (661) +.+.|+.+..++....- .-+ ......+-||.+.+............+-..+..+|.+++++++.+.|.... +... T Consensus 301 ~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~ 379 (480) T protein:vir:78 301 FAEEMEVFRKEAASITGLPPQYL-SSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) T ss_pred HHHHHHHHHHHHhcccCCChHHh-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcccccee Confidence 98888888888764321 111 001111247777777655555555556666677899999999999985321 1123 Q ss_pred EEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCC Q lcl|NC_019406. 484 LRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGY 561 (661) Q Consensus 484 ~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~ 561 (661) +.+.... -.... .++.++++.+++++| .+|++|++..| |+.++.. +++++..+++ ++.+.+.. T Consensus 380 i~v~f~~-~~~~s-~~~~ad~~~kl~~~g~~~~s~et~~~~l---g~~~d~~--~~~~~~~~e~--------~~~~~~~~ 444 (480) T protein:vir:78 380 LETVWRD-PSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQR--EQMRDWDKQE--------TEDMIDTL 444 (480) T ss_pred eeEEecC-CCCCC-HHHHHHHHHHHHHhccccCCHHHHHhcC---CCCHhHH--HHHHHHHHHH--------HHHHHHHh Confidence 3443322 12223 357888899999876 78999986553 6654422 2222111111 11111010 Q ss_pred ccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHH Q lcl|NC_019406. 562 VSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLR 599 (661) Q Consensus 562 ~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~ 599 (661) ....++..+..+..++. ....|.++...+.--.+++ T Consensus 445 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 445 YSTTKAQADATPKPTVT--ETKTETQTSPSGFNRTKTR 480 (480) T ss_pred hccccccCCCCCCCCCC--CCCCccccccCCCCcccCC Confidence 00000000000000000 0011222222222111222 No 50 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.87 E-value=8.8e-21 Score=130.52 Aligned_cols=457 Identities=11% Similarity=0.039 Sum_probs=244.0 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.|=-|+-..+. ....-++.--..+....++++++.+-|.|...++ |+++ +..+.|+.. ++ ..| T Consensus 1 ~~~~i~~~~~~~-----~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~-----~~~~---~~~~~~~~~--~~-~~n 64 (485) T protein:vir:10 1 MTAPLPGQEEIE-----DPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPE-----AIGV---TVPIQMQSL--LA-HVG 64 (485) T ss_pred CCCCCCCCCCCC-----CHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcch-----hcCC---CCChhhhhh--hh-hcC Confidence 777666554433 1112244445677788889999999999975442 3443 333444432 22 359 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+-+|+.++++++-...++.+ | ....+.+..|+ +-++++.+...+.+.++.||+| T Consensus 65 ~~~~ivd~~~~~l~~~g~~~~~----------~------~~~~~~~~~i~--------~~N~~d~~~~~~~~~a~i~G~a 120 (485) T protein:vir:10 65 YPRLYVDSIAERQAVEGFRFGD----------A------DEADEELWQWW--------QANNLDIEAPLGYTDAYVHGRS 120 (485) T ss_pred cHHHHHHHHHhhhcccceecCC----------C------chhHHHHHHHH--------HhcCHhHHHHHHHHHHhhcCce Confidence 9999999999998755443311 0 01112233333 3579999999999999999999 Q ss_pred EEEEeccCCCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 161 GALVDVAPSSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 161 gvLVD~P~a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) +++|-....... ....+|.+..++|++++-.. +...++ ....+++. .. T Consensus 121 y~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~-D~~~~~-~~~~~~~~--~~-------------------------- 170 (485) T protein:vir:10 121 YITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEI-DPRIGR-VSKAIRVA--YD-------------------------- 170 (485) T ss_pred EEEEeeCCcccccccCCCeeEEEEEccceeEEEE-cCCCCc-eeEEEEEE--Ee-------------------------- Confidence 999976532211 12346778888888764222 222211 11111110 00 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) .. ..++.++.++..+ .+|++. ..+. +.. .....-++++.|| T Consensus 171 -----------------------~~-----~~~~~~~~~y~~~----~~~~~~--~~~~-~~~----~~~~~~~~~g~vP 211 (485) T protein:vir:10 171 -----------------------AE-----GNEIQAATLYTPN----DIFGWY--RVEN-EWQ----EWFNNPHGLGVVP 211 (485) T ss_pred -----------------------eC-----CCeEEEEEEEeCC----eEEEEE--EcCC-ceE----EeccccCCCCccc Confidence 00 0011122222221 122221 1111 111 1111126789999 Q ss_pred EEEEecC-CCCCCccc----cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----c---eeEecccce Q lcl|NC_019406. 319 FVFFGSM-SNAADCEK----PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----S---EYHIGPGRV 385 (661) Q Consensus 319 fv~~~~~-~~~~~~~~----pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~---~l~iGs~~~ 385 (661) +|.|... ..+..-+. +++.+|. =+.=...|+...++++.++|++++.|.+.... + .+..+.+.. T Consensus 212 vv~~~n~~~~~~~~G~s~i~~~v~~li---Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i 288 (485) T protein:vir:10 212 VVPIPNRTRLSDLYGTSEITPELRSMT---DAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARI 288 (485) T ss_pred EEEeccccccCCCCCccchhHHHHHHH---HHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccce Confidence 9976532 11111122 2333332 12223566888899999999999999754321 1 134455667 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) |.+|+ ++++|.|++..+++.+.+.|+.+..++....- .-+ ......+-||++.+............+-..+..+ T Consensus 289 ~~~~~--~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~f-g~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~ 365 (485) T protein:vir:10 289 LAFED--AEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADNPASAEAIRAAESRLIKKVERKNSIFGGA 365 (485) T ss_pred eccCC--CCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHh-ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77764 46778899999988888888877777754311 112 1111123578888888777777777777888889 Q ss_pred HHHHHHHHHHHcCCCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 463 MTSVVRYWLMFRDIPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 463 l~~aL~~~A~w~G~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) |.++++++..+.+.... +...+.|...+ -.+.. .++..+++.+++++| .+|++++++. .|+.++.. + + T Consensus 366 l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~-~~~~~-~~~~ada~~kl~~ag~~~~s~et~~~~---lg~~~~~~--~-~ 437 (485) T protein:vir:10 366 WEEAMRLAYRMMKGGDVPPDMLRMETVWRD-PSTPT-YAAKADAASKLYNGGTGVIPRERARKD---MGYSIAER--E-E 437 (485) T ss_pred HHHHHHHHHHHhCCCCCcccceeeeEEecC-CCCCC-HHHHHHHHHHHHhccccCCCHHHHHHh---CCCCHhHH--H-H Confidence 99999999888864321 11233444322 22233 356788899999977 8999999754 37765432 2 2 Q ss_pred HHHHhccCCCCC--Cchh----hhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 539 TIKMNDPKSFIG--QPDA----IAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 539 ~~~l~~~~~~l~--~dda----e~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) .+++.++....+ .-++ ....++..+..++-++....+.+|-- T Consensus 438 ~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 438 MRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 222222110000 0000 00111111111111111112222211 No 51 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.86 E-value=5e-21 Score=131.88 Aligned_cols=441 Identities=10% Similarity=0.060 Sum_probs=247.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.-+||.- -+..--..+....++++++++-|.|...+ .|+|+. -.+.|+.+..+++ .| T Consensus 1 ~~~~t~~~-------------~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~---~~~~~~~~~~k~~-~n 58 (456) T protein:vir:10 1 MTASTPAE-------------WLPVLTKRIDDGMSRVRLLARYSNGDAPL-----PELTRN---TSAAWRSFQREAR-TN 58 (456) T ss_pred CCCCCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcCcc---cChhhhhhhhhhh-cc Confidence 32222211 01111224556778889999999986433 345443 3345655555555 69 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+-+|+..+|++|.+++++..-++ .+ ..+ .++++. +-++++.+...+++.++.||++ T Consensus 59 ~~~~ivd~~~~~l~~~~~~~~~~~d--------~~------~~~---~~~~i~-----~~N~~d~~~~~~~~~a~i~G~a 116 (456) T protein:vir:10 59 WGLMVRDSVADRIIPNGITVGGSAD--------SD------LAL---RARRIW-----RDNRMDSVCKQWVKYGLDFGES 116 (456) T ss_pred hHHHHHHHHHhhhccCCeecCCCCC--------cc------hHH---HHHHHH-----HhcChhhHHHHHHHHHhhcCee Confidence 9999999999999999987732111 10 111 233332 2368999999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++|-... + ..|-+..++|.+++-. ++...++..+-.|+.. ...++ ..-+...|... T Consensus 117 y~~v~~d~-~-----g~~~i~~~~p~~~~~i-~d~~~~~~~~~~i~~~--~~~d~------~~~~~~~~~~~-------- 173 (456) T protein:vir:10 117 YLTCWRRD-D-----GTATITADSPETMVVS-VDPLQPWRIRAAMRWW--RDLDA------ESDFAIVWSGD-------- 173 (456) T ss_pred EEEEeeCC-C-----CceEEEEEccceeEEE-EcCCCCcceEEEEEEE--EecCC------ceeEEEEEecc-------- Confidence 99985432 1 2577888888876532 2222222222222111 11000 00111111100 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee-ccCCcccceeeE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP-MVRGRTLPFIPF 319 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p-~~~g~~L~~IPf 319 (661) ...+.|+........ + . ....+. .....+ ....+.++.+|+ T Consensus 174 ---------------------------~~~~~~~~~~~~~~~--~-~-~~~~~~-------~~~~~~~~~~~~~~~~~pv 215 (456) T protein:vir:10 174 ---------------------------GWQKFARPCFVQSSS--R-R-RLVTRI-------SDSWVPVGDAVVTGSPPPV 215 (456) T ss_pred ---------------------------ceeEEEEEEEEeecc--c-c-eeeeec-------CCceeeccccCCCCCceeE Confidence 011111111000000 0 0 000000 011111 112356788999 Q ss_pred EEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCC------CCce------eEecccceee Q lcl|NC_019406. 320 VFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDS------DASE------YHIGPGRVWV 387 (661) Q Consensus 320 v~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~------~~~~------l~iGs~~~~~ 387 (661) +++. |.+..+ -+.++-.|.=+.-+..||.....++.++|++++.|.+.. +++. +..+.+..|. T Consensus 216 v~~~---N~~g~g--d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~ 290 (456) T protein:vir:10 216 VVYQ---NPDGMG--EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWE 290 (456) T ss_pred EEec---CCCCCc--hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhcccccc Confidence 8874 333332 244445555555556677778899999999999996432 2211 2334455666 Q ss_pred cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV 466 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a 466 (661) ++ +++++++ ++...++.+.+.++.+..++....--.... +...+|.||++.+............+-..+..++.++ T Consensus 291 ~~-~~~~~~q--~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) T protein:vir:10 291 LP-PGVDIWE--SQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) T ss_pred CC-CCcceEE--ecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66 4566554 557778888899999988887542211100 0112456899888888888888888888899999999 Q ss_pred HHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccC Q lcl|NC_019406. 467 VRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPK 546 (661) Q Consensus 467 L~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~ 546 (661) +++++.+.|..+.. .+.+... +-.+.. .++.++++.++.++|.+|+++++..| |+.+++. .+.+.+++.++. T Consensus 368 ~rl~~~~~g~~~~~--~~~v~w~-~~~~~~-~~~~ada~~kl~~~gi~~~~~~~~~l---g~~~~~i-~~~e~er~~~e~ 439 (456) T protein:vir:10 368 LVKALQIEGESVED--TVDVSFE-SPDRVT-LGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLDRAREQI 439 (456) T ss_pred HHHHHHhcCCCccc--ceeEEec-CCCCcC-HHHHHHHHHHHHHcCCChHHHHHhhC---CCCHHHH-HHHHHHHHHHHH Confidence 99999998865432 3344332 223333 36788999999999999999986544 6655443 345666776654 Q ss_pred CCCCC-chhhhhhcCCc Q lcl|NC_019406. 547 SFIGQ-PDAIAMRRGYV 562 (661) Q Consensus 547 ~~l~~-ddae~~~~g~~ 562 (661) ..++. .-....++|.+ T Consensus 440 ~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 440 TLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHhhhhhhcCCCCCCC Confidence 43321 12222333333 No 52 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.86 E-value=5e-21 Score=131.88 Aligned_cols=441 Identities=10% Similarity=0.060 Sum_probs=247.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.-+||.- -+..--..+....++++++++-|.|...+ .|+|+. -.+.|+.+..+++ .| T Consensus 1 ~~~~t~~~-------------~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~---~~~~~~~~~~k~~-~n 58 (456) T protein:vir:10 1 MTASTPAE-------------WLPVLTKRIDDGMSRVRLLARYSNGDAPL-----PELTRN---TSAAWRSFQREAR-TN 58 (456) T ss_pred CCCCCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcCcc---cChhhhhhhhhhh-cc Confidence 32222211 01111224556778889999999986433 345443 3345655555555 69 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+-+|+..+|++|.+++++..-++ .+ ..+ .++++. +-++++.+...+++.++.||++ T Consensus 59 ~~~~ivd~~~~~l~~~~~~~~~~~d--------~~------~~~---~~~~i~-----~~N~~d~~~~~~~~~a~i~G~a 116 (456) T protein:vir:10 59 WGLMVRDSVADRIIPNGITVGGSAD--------SD------LAL---RARRIW-----RDNRMDSVCKQWVKYGLDFGES 116 (456) T ss_pred hHHHHHHHHHhhhccCCeecCCCCC--------cc------hHH---HHHHHH-----HhcChhhHHHHHHHHHhhcCee Confidence 9999999999999999987732111 10 111 233332 2368999999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++|-... + ..|-+..++|.+++-. ++...++..+-.|+.. ...++ ..-+...|... T Consensus 117 y~~v~~d~-~-----g~~~i~~~~p~~~~~i-~d~~~~~~~~~~i~~~--~~~d~------~~~~~~~~~~~-------- 173 (456) T protein:vir:10 117 YLTCWRRD-D-----GTATITADSPETMVVS-VDPLQPWRIRAAMRWW--RDLDA------ESDFAIVWSGD-------- 173 (456) T ss_pred EEEEeeCC-C-----CceEEEEEccceeEEE-EcCCCCcceEEEEEEE--EecCC------ceeEEEEEecc-------- Confidence 99985432 1 2577888888876532 2222222222222111 11000 00111111100 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee-ccCCcccceeeE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP-MVRGRTLPFIPF 319 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p-~~~g~~L~~IPf 319 (661) ...+.|+........ + . ....+. .....+ ....+.++.+|+ T Consensus 174 ---------------------------~~~~~~~~~~~~~~~--~-~-~~~~~~-------~~~~~~~~~~~~~~~~~pv 215 (456) T protein:vir:10 174 ---------------------------GWQKFARPCFVQSSS--R-R-RLVTRI-------SDSWVPVGDAVVTGSPPPV 215 (456) T ss_pred ---------------------------ceeEEEEEEEEeecc--c-c-eeeeec-------CCceeeccccCCCCCceeE Confidence 011111111000000 0 0 000000 011111 112356788999 Q ss_pred EEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCC------CCce------eEecccceee Q lcl|NC_019406. 320 VFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDS------DASE------YHIGPGRVWV 387 (661) Q Consensus 320 v~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~------~~~~------l~iGs~~~~~ 387 (661) +++. |.+..+ -+.++-.|.=+.-+..||.....++.++|++++.|.+.. +++. +..+.+..|. T Consensus 216 v~~~---N~~g~g--d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~ 290 (456) T protein:vir:10 216 VVYQ---NPDGMG--EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWE 290 (456) T ss_pred EEec---CCCCCc--hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhcccccc Confidence 8874 333332 244445555555556677778899999999999996432 2211 2334455666 Q ss_pred cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV 466 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a 466 (661) ++ +++++++ ++...++.+.+.++.+..++....--.... +...+|.||++.+............+-..+..++.++ T Consensus 291 ~~-~~~~~~q--~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) T protein:vir:10 291 LP-PGVDIWE--SQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) T ss_pred CC-CCcceEE--ecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66 4566554 557778888899999988887542211100 0112456899888888888888888888899999999 Q ss_pred HHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccC Q lcl|NC_019406. 467 VRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPK 546 (661) Q Consensus 467 L~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~ 546 (661) +++++.+.|..+.. .+.+... +-.+.. .++.++++.++.++|.+|+++++..| |+.+++. .+.+.+++.++. T Consensus 368 ~rl~~~~~g~~~~~--~~~v~w~-~~~~~~-~~~~ada~~kl~~~gi~~~~~~~~~l---g~~~~~i-~~~e~er~~~e~ 439 (456) T protein:vir:10 368 LVKALQIEGESVED--TVDVSFE-SPDRVT-LGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLDRAREQI 439 (456) T ss_pred HHHHHHhcCCCccc--ceeEEec-CCCCcC-HHHHHHHHHHHHHcCCChHHHHHhhC---CCCHHHH-HHHHHHHHHHHH Confidence 99999998865432 3344332 223333 36788999999999999999986544 6655443 345666776654 Q ss_pred CCCCC-chhhhhhcCCc Q lcl|NC_019406. 547 SFIGQ-PDAIAMRRGYV 562 (661) Q Consensus 547 ~~l~~-ddae~~~~g~~ 562 (661) ..++. .-....++|.+ T Consensus 440 ~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 440 TLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHhhhhhhcCCCCCCC Confidence 43321 12222333333 No 53 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=99.86 E-value=1.3e-21 Score=135.03 Aligned_cols=449 Identities=13% Similarity=0.047 Sum_probs=237.1 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) ...++++ .|... |+ +| .....++|+.+.+.|.|....--....+.+. .....+ | .-.| T Consensus 19 ~~~l~~~--~i~~l--------i~-~~--~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~---~~~~~~-----k-i~~n 76 (506) T protein:vir:94 19 LENLTPN--KIMKF--------IT-HH--FNYQRPRLEMLDDYYQGYNLKILDKQSRRHE---DGKADH-----R-ATHS 76 (506) T ss_pred hhcCCHH--HHHHH--------HH-HH--HHHHHHHHHHHHHHhcCCCcccccccccccc---ccCCcc-----e-eecc Confidence 1222222 11100 11 11 2446788999999999975321111111111 111111 1 2369 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+.+++..+|.+|.+||++..-.+... +.+..+. +-++++.....+.+.++.+|++ T Consensus 77 ~~~~Iv~~~~~~l~G~p~~~~~~d~~~~------------------~~l~~~~-----~~N~~~~~~~~~~~~~~~~G~a 133 (506) T protein:vir:94 77 FAKYIADFQTSYSVGNPINVKLPDDGSN------------------SGFDTFN-----KANDVDAENYDLFLDMSRYGRA 133 (506) T ss_pred hHHHHHHHhhhhhcccCceeecCcchHH------------------HHHHHHH-----hccCHhHHHHHHHHHHHhcCeE Confidence 9999999999999999999852111111 1222222 3468999999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++|.... ..+|-+..++|.+++----+.+.+ .+... |+.|... T Consensus 134 ~~~v~~de------d~~~~i~~~~p~~~~~v~dd~~~~--~~~~~--------------------v~~~~~~-------- 177 (506) T protein:vir:94 134 YEYVYRGE------DNEEHLAKLDPLDTFVIYSTDVDP--KPIMA--------------------VRYHQIE-------- 177 (506) T ss_pred EEEEEecC------CCeeEEEEEcccceEEEecCCCCC--ceEEE--------------------EEEEeee-------- Confidence 99997542 236778888888764322111111 11111 1111000 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv 320 (661) ..+... .....+...++... +++.+..+..+. .......++++.|||| T Consensus 178 --------------------~~~~~~-~~~~~~~~~~yt~~-------~~~~~~~~~~~~----~~~~~~~~~~g~vPvv 225 (506) T protein:vir:94 178 --------------------LVDDNQ-VSTINYVPETWTAD-------TYTLYNPTPIMG----KMQVDTTKPITTFPVV 225 (506) T ss_pred --------------------eccCCc-eeEEEEEEEEEeCc-------eEEEeccccCcc----ceeccccccCCccceE Confidence 000000 01111111122111 122222221111 1112223679999999 Q ss_pred EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc------------------------ Q lcl|NC_019406. 321 FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS------------------------ 376 (661) Q Consensus 321 ~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~------------------------ 376 (661) .+.... . +.+-+.++-.|-=+.=...|++-+.+.+.+.|.++++|....+.. T Consensus 226 ~~~n~~--~--~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (506) T protein:vir:94 226 EFKNSN--F--RLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKL 301 (506) T ss_pred EecCCC--C--CCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchh Confidence 875322 1 123344444443344456778888888899999999886432211 Q ss_pred ----------eeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh-cccccCccchhHHHHHHHH Q lcl|NC_019406. 377 ----------EYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL-MPGMSKSVSESDNQSALRE 445 (661) Q Consensus 377 ----------~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl-l~~~~~~~~eTataa~~d~ 445 (661) .+.+.++........+++++||..+. ..+..+..++.+.+.|...+.-. +....-+++.||++.+... T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~ 380 (506) T protein:vir:94 302 ELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTY-DVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKV 380 (506) T ss_pred HHHhhhhhcCeeeecccccccCccccccceeeeecC-CHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHH Confidence 12222222222223456888988765 45788888999999998775322 1111224577999999888 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC----CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_019406. 446 ANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT----DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYE 521 (661) Q Consensus 446 ~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~ 521 (661) ..........-..+..++.+++++++.+++.... +..++.|.+++ ..+.+ ..+.++++.++ +|.||++|.+. T Consensus 381 ~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~-~~p~d-~~e~a~~~~kl--~g~iS~et~~~ 456 (506) T protein:vir:94 381 LGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRD-NLPAD-NISQIKALVQA--GATLPQKYLYQ 456 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCC-CCCcC-HHHHHHHHHHH--hccCChHHHHH Confidence 8888888888889999999999999998765332 12234444433 22222 24566777766 69999999975 Q ss_pred HHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHH Q lcl|NC_019406. 522 NFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQA 587 (661) Q Consensus 522 eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~ 587 (661) .| |...+..+|.++|.++...- +....+.+.....++.+ .-.|-+ ..|.+ T Consensus 457 ~l------p~v~d~~~E~~ri~~E~~~~---~~~~~~~~~~~~~~~~~-----~~~~~~--~~e~~ 506 (506) T protein:vir:94 457 QL------PGVTNPQDIVDMMKEQSANG---DYSFDQNGVISNDGQTN-----TTATQT--DEEVR 506 (506) T ss_pred hC------CCCCCHHHHHHHHHHHHHHH---hhcchhhcCCCcccCcc-----cccccc--ccCCC Confidence 43 43344667777887653210 00011111111111111 111111 11111 No 54 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.86 E-value=6.8e-21 Score=131.15 Aligned_cols=438 Identities=10% Similarity=-0.017 Sum_probs=237.2 Q ss_pred CCCCCccccccccccccccCC---ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 2 AGLSPNSANIRRTKRGAQQFT---HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~---V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) ..+-|+..=+-. ...+++ |..---.+....++..++.+-|.|...+..+.. +.+.. .. .|. - T Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~----~~~~~--~~-----~ki-~ 65 (453) T protein:vir:73 1 MNLKPIKLMTYS---RDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA----KDSWK--PD-----NRL-T 65 (453) T ss_pred Cccccceeeecc---ccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC----CCccC--cc-----cee-e Confidence 334444432221 011111 211112345556788888999999876654321 11111 11 122 3 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) .|+.+.+|+..+|.+|.+||+++.-.+.. ...+..+. +.++++.....+.+.++.+| T Consensus 66 ~n~~~~ivd~~~~~l~g~~~~~~~~d~~~------------------~~~l~~~~-----~~n~~~~~~~~~~~~~~~~G 122 (453) T protein:vir:73 66 NNFAKYIVDTFVGYFNGIPIKKTHDDKSV------------------LEAMQLFD-----NLNDMEDEESELAKIACVYG 122 (453) T ss_pred cchHHHHHHHhhhhhcccCceeecCChHH------------------HHHHHHHH-----HhcChhHHHHHHHHHHHhcC Confidence 59999999999999999999985211111 11222221 34789999999999999999 Q ss_pred CEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 159 RFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 159 r~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) +++++|-... ..+|-+..++|.+++----+.+ ++..+- .++... T Consensus 123 ~~~~~v~~d~------~~~~~i~~~~p~~~~~v~dd~~-~~~~~~--~i~~~~--------------------------- 166 (453) T protein:vir:73 123 RAYELMYQNE------STESEVIYCSPLNVFMVYDDSI-KQKPLF--AVYYGF--------------------------- 166 (453) T ss_pred eEEEEEEeCC------CCceEEEEEcccceEEEEeCCC-CceeEE--EEEEEE--------------------------- Confidence 9999995432 2356777788877643221111 111110 000000 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) +. .+... ..++..+ .++++. ..+. .........++++.|| T Consensus 167 ----------------------~~--~~~~~-----~~vyt~~----~i~~~~--~~~~-----~~~~~~~~~~~~g~vP 206 (453) T protein:vir:73 167 ----------------------DE--EGNLS-----GTVYTLL----ETISIT--GKAG-----EVKFGESTYNVYSDLP 206 (453) T ss_pred ----------------------ec--CceEE-----EEEEeCC----eEEEEE--ecCC-----ceEEccceeccCCcee Confidence 00 00011 1111111 122211 1111 1111122236789999 Q ss_pred EEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEeccccee----------ec Q lcl|NC_019406. 319 FVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVW----------VV 388 (661) Q Consensus 319 fv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~----------~l 388 (661) ||.+... .+ +.+-+.++..|-=+.=+..|+..+.+.+.+.|+++++|....+...-.+....++ .. T Consensus 207 vv~~~n~--~~--g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (453) T protein:vir:73 207 IVEYNFN--EE--RQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGT 282 (453) T ss_pred EEEecCC--CC--CCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccc Confidence 9987532 22 1222334444433444567788888889999999999985443221111111111 11 Q ss_pred CCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 389 DKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVR 468 (661) Q Consensus 389 p~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~ 468 (661) ...+++++|++.+. +.+..+..++.+.+.|..+..-.--...+.++-||++.+.............-..+..++.++++ T Consensus 283 ~~~~~d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 361 (453) T protein:vir:73 283 NAAKVDVKFLDKPD-SDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYS 361 (453) T ss_pred cccCceeEEeeecC-CHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12356789998765 34667888889999888764322112223456799998888877778888888889999999999 Q ss_pred HHHHHcCCCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccC Q lcl|NC_019406. 469 YWLMFRDIPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPK 546 (661) Q Consensus 469 ~~A~w~G~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~ 546 (661) +++.+++.... +...+.|.+++.. +.. ..+.++++.++. |.||.+|+++.+ +.+ .++++|.++|+++. T Consensus 362 li~~~~~~~~~~~~~~~i~v~f~~~~-p~~-~~~~a~~~~k~~--giis~et~~~~~---~~~---~d~~~E~~ri~~E~ 431 (453) T protein:vir:73 362 LWSSLSTNASNKDAWKDIEYTFTRNE-PKD-IKEQAETANILK--GITSEETALSVI---SVI---PDVQAEMEKIKKKK 431 (453) T ss_pred HHHHHHhccCCccccccceEEeCCCC-CCC-HHHHHHHHHHHh--ccCcHHHHHHhC---CCC---CCHHHHHHHHHHHH Confidence 99998764432 2234455554322 222 345677777774 899999996543 333 34677777887641 Q ss_pred CCCCCchhhhhhcCCccccCCCcchhhhhcCCh Q lcl|NC_019406. 547 SFIGQPDAIAMRRGYVSRQQELDQQRAARDADF 579 (661) Q Consensus 547 ~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~ 579 (661) . +.....+.+. .++...--.|| T Consensus 432 ~----~~~~~~~~~~-------~~~~~~~~~~~ 453 (453) T protein:vir:73 432 L----LQLSLTRTSN-------LVRMKQMRGNL 453 (453) T ss_pred H----HHHHHHHhcc-------CCcchhhhcCC Confidence 1 1111111111 11111111222 No 55 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.86 E-value=1.7e-20 Score=129.01 Aligned_cols=479 Identities=13% Similarity=0.051 Sum_probs=243.0 Q ss_pred CCCcccccccccccc------ccCC-------ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHH Q lcl|NC_019406. 4 LSPNSANIRRTKRGA------QQFT-------HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYA 70 (661) Q Consensus 4 ~~~~~~~~~~~~~~~------~~~~-------V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~ 70 (661) .| --.|.--.+.++ ..|+ +..-.+.|....++++++.+-|.|...+ .|||+. -.+.|+ T Consensus 1 ~~-~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~-----~~~~~~---~~~~~~ 71 (501) T protein:vir:25 1 MT-VPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGR-----PEVPEG---ASDEVK 71 (501) T ss_pred Cc-ccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcccc---CChhhh Confidence 11 011111111111 1111 2223456667778888888888885433 344433 334565 Q ss_pred HHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHH Q lcl|NC_019406. 71 NYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTV 150 (661) Q Consensus 71 ~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~ 150 (661) ....++ -.|+.+-+|+.+++++|-...++. |+ ...+.+..++ +-|+++....++ T Consensus 72 ~~~~~~-v~n~~~~ivd~~a~~l~~~gf~~~-------------d~----~~~~~l~~i~--------~~N~~d~~~~~~ 125 (501) T protein:vir:25 72 ELAKLS-VKNVLSLVRDSFAQNLSVVGYRNA-------------LA----KENDPAWEMW--------QRNRMDARQAEV 125 (501) T ss_pred hhHhhh-hcChHHHHHHHHHhhhcccceecC-------------Cc----cchHHHHHHH--------HhcChhHHHHHH Confidence 543333 348999999999998875443321 11 0111122222 357899999999 Q ss_pred HHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhc-cceeeccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 151 ALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIV-DWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 151 ~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~Ii-nW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) .+.++.||+++++| |+..+ + |.+..++|.+++ =|.- ... +.... ..++.+....+. ....++..| T Consensus 126 ~~~a~i~G~ay~~v-~~de~----~--~~i~~~sp~~~~~iy~D-~~~-~~~~~-~ai~~~~~~~~~----~~~~~~~~y 191 (501) T protein:vir:25 126 HRPALTYGASYVTV-TPTDE----G--PVFRTRSPRQILAVYAD-PSV-DAWPQ-YALETWVAQKDA----KPHRRGVLY 191 (501) T ss_pred HHHHhhcCceEEEE-ecCCC----C--CeEEEeccccEEEEEec-CCC-Cccee-EEEEEEeecccc----CcceeEEEe Confidence 99999999999998 33222 1 567788888764 2211 111 11111 111111111000 001111111 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEee--cccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILE--LQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~--~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) .+. .+|....-. ........+.. .........+.... T Consensus 192 ~~~--------------------------------------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 230 (501) T protein:vir:25 192 DDT--------------------------------------YMYELDLGEVVLGDAGGGQATQ---QPVNVREVTDVIEH 230 (501) T ss_pred cCe--------------------------------------eEEEEecCceeeeecccccccc---cccccccccccccc Confidence 111 111110000 00000000000 00111111121111 Q ss_pred ccCCcccceeeEEEEecCC--CCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccce Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFGSMS--NAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRV 385 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~~~~--~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~ 385 (661) ...-++++.||||.+-... +++.. +=+.++-.|.=+.=+..|+...+.++.++|++|+.|++..+.+.+.+..++. T Consensus 231 ~~~~~~~~~vPiv~f~N~~~~~~~g~--sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~i 308 (501) T protein:vir:25 231 GATFEGKPVCPVVRFVNGRDADDMIV--GEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVLKASALRV 308 (501) T ss_pred ccccCCccceeeEeccCccccCcccc--chhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchhhhcccce Confidence 2223679999999874322 23323 2233333333344456677889999999999999999877666567777778 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) |.+++ ++++|.++++..++.+.+.|+.+..+|....- ..+.. ..++-||++.+............+-..+..+ T Consensus 309 ~~~~~--~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~--~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 384 (501) T protein:vir:25 309 WTFED--PEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTG--KMINVSAEALAAAEANQQRKLAAKRESFGES 384 (501) T ss_pred eccCC--CCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhcc--ccCChHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88774 45778888898899899999999998865431 11111 2345689988888888777778888888899 Q ss_pred HHHHHHHHHHHcCCCCCC-cceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHH Q lcl|NC_019406. 463 MTSVVRYWLMFRDIPLTD-TATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIK 541 (661) Q Consensus 463 l~~aL~~~A~w~G~~~~~-~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~ 541 (661) |.+++++++...|..... ...+.+.. ++..+.. .++.++++.++.++| ||.++++.+| -|+-+++ .++.++. T Consensus 385 l~~~~rl~~~~~~~~~~~~~~~i~v~w-~~~~~~s-~~~~ada~~kl~~~g-is~et~~~~~--~g~~~~~--ie~~~~~ 457 (501) T protein:vir:25 385 WEQLLRLAAEMDDDPDTAADSGAEVLW-RDTEARS-FGAVVDGITKLASAG-IPIEHLLSMV--PGMTQQT--IQAIKDS 457 (501) T ss_pred HHHHHHHHHHHhCCCccccceeeeEEe-cCCCCCC-HHHHHHHHHHHHhcC-CCHHHHHHHc--CCCCHHH--HHHHHHH Confidence 999999999999854321 12233322 2333333 367888999999887 7999986544 3552222 1222222 Q ss_pred HhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchh Q lcl|NC_019406. 542 MNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEK 597 (661) Q Consensus 542 l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~ 597 (661) .+++... +. ...+..+.+.... .....++.+.+ .+-++.+..+- T Consensus 458 ~~e~~~~-~~--~~~~~~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~~g~ 501 (501) T protein:vir:25 458 LRGGEVK-SL--VDKLLSNEPAPVP-PPPPQAAAQAL--------NEGGVNGNGGA 501 (501) T ss_pred HHHHhHH-HH--HHHhhccCcCCCC-CCCCCCCcccc--------ccccCCCCCCC Confidence 2222110 11 0011111111111 01111111111 01111111111 No 56 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.85 E-value=5.5e-21 Score=131.68 Aligned_cols=468 Identities=10% Similarity=0.018 Sum_probs=239.2 Q ss_pred CCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHH Q lcl|NC_019406. 5 SPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQ 84 (661) Q Consensus 5 ~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~ 84 (661) -|++-|+.. ..-+..--..+....++++++.+-|.|...++. +| ......++.++ ...|+.+- T Consensus 1 ~~~~~~~d~------~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~-----~~---~~~~~~~~~~~---~~~n~~~~ 63 (488) T protein:vir:23 1 MAETESIDP------EKLRDQLLDAFENKQNELKSSKAYYDAERRPDA-----IG---LAVPLDMRKYL---AHVGYPRT 63 (488) T ss_pred CCcccCCCH------HHHHHHHHHHHHHHHHHHHHHHHHHhcccchhh-----cC---cccchhhhhhh---hhcchHHH Confidence 233333331 112344457777888999999999998654432 33 23334444442 23699999 Q ss_pred HHHHHhchhhccCccccccchhhHhh-hhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEE Q lcl|NC_019406. 85 TQAGMVGQIFRRPPVIRNLPNTGAIT-GRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGAL 163 (661) Q Consensus 85 tv~~l~G~vFrk~p~i~~~p~~l~~l-~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvL 163 (661) +|+.++.+++-.-..+- .|.....- ..|. .+.+.+..++ +-++++...+.+.+.++.||+++++ T Consensus 64 ivd~~a~~l~~~Gf~~~-~~~~~~~~~~~d~------~~~~~l~~i~--------~~N~~~~~~~~~~~~a~i~G~a~~~ 128 (488) T protein:vir:23 64 YVDAIAERQELEGFRIP-SANGEEPESGGEN------DPASELWDWW--------QANNLDIEATLGHTDALIYGTAYIT 128 (488) T ss_pred HHHHHHHhhhccceecc-CCcccccccccch------hHHHHHHHHH--------HhcChhHHHHHHHHHHhhcCceEEE Confidence 99998876643322210 01000000 0000 1111122222 4568999999999999999999999 Q ss_pred EeccCCCc--hhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhh Q lcl|NC_019406. 164 VDVAPSSD--PTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGR 241 (661) Q Consensus 164 VD~P~a~~--~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~ 241 (661) |....... .-....|.+..++|.+++-+-- ...+. .+.-+++. .. T Consensus 129 v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d-~~~~~-~~~~~~~~--~~----------------------------- 175 (488) T protein:vir:23 129 ISMPDPEVDFDVDPEVPLIRVEPPTALYAEVD-PRTRK-VLYAIRAI--YG----------------------------- 175 (488) T ss_pred EecCCcccccCCCCCcceEEEeccceeEEEEe-cCCCc-eEEEEEEE--Ee----------------------------- Confidence 97643111 1122335667778877665532 22111 11111110 00 Q ss_pred hcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEE Q lcl|NC_019406. 242 RAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVF 321 (661) Q Consensus 242 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~ 321 (661) . +++ .+++.-++..+. ++.+ ..+..+ +.......+.++.||||. T Consensus 176 --------------------~-~~~----~~~~~~~y~~~~----~~~~---~~~~~~----~~~~~~~~h~~g~vPvv~ 219 (488) T protein:vir:23 176 --------------------A-DGN----EIVSATLYLPDT----TMTW---LRAEGE----WEAPTSTPHGLEMVPVIP 219 (488) T ss_pred --------------------c-CCC----cEEEEEEEecCc----EEEE---EecCCc----eEeccccccCCCCcceEE Confidence 0 000 011111222211 1111 111111 111112236799999997 Q ss_pred EecC-CCCCCccccchh-HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC--------ceeEecccceeecCCC Q lcl|NC_019406. 322 FGSM-SNAADCEKPPLL-DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA--------SEYHIGPGRVWVVDKE 391 (661) Q Consensus 322 ~~~~-~~~~~~~~pPLl-dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~--------~~l~iGs~~~~~lp~~ 391 (661) |-.. ..+..-+.+=|. ++-.|.=++=+..|+...++++.++|++++.|++.... ..+..+.+..|.++ + T Consensus 220 f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~-~ 298 (488) T protein:vir:23 220 ISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFE-G 298 (488) T ss_pred eccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCC-C Confidence 6422 111111222221 22233333445667888999999999999999754331 12455667788887 4 Q ss_pred CCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 392 SGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVR 468 (661) Q Consensus 392 ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~ 468 (661) |.+++|.++++.+++...+.|+.+..++....- .-+ ..+...+-||.+.+............+-..+..++.++++ T Consensus 299 g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~ 377 (488) T protein:vir:23 299 GEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYL-SSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMR 377 (488) T ss_pred CCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 678999999999999999988888888764321 111 1111112478888887777777777777788889999999 Q ss_pred HHHHHcCCCCCC--cceEEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 469 YWLMFRDIPLTD--TATLRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 469 ~~A~w~G~~~~~--~~~~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) +++.++|..... ...+.+.... -.+.. .++.++++.+++++| .+|++|++..| |.+++.. ++ .+++.+ T Consensus 378 l~~~~~~~~~~~~~~~~i~v~f~~-~~~~s-~~~~ada~~kl~~~g~~~~s~et~~~~l---~~~~d~~--~~-~~~~~~ 449 (488) T protein:vir:23 378 LAYKMVKGGDIPTEYYRMETVWRD-PSTPT-YAAKADAAAKLFANGAGLIPRERGWVDM---GYTIVER--EQ-MRQWLE 449 (488) T ss_pred HHHHHhcCCCcchhhccceEEecC-CCCCC-HHHHHHHHHHHHhcccccCCHHHHHHhC---CCCchHH--HH-HHHHHH Confidence 999998854311 1233333322 12222 356788999999976 79999997665 6654432 21 222211 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccC Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLE 592 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~ 592 (661) +...-.......+.... ..++.+++.+..+.+ .-|=.+| T Consensus 450 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-------~~e~~~a 488 (488) T protein:vir:23 450 QDQKQGLGLIGSLYGAS--TPEGKPGEAPVGEPP-------APEPDAA 488 (488) T ss_pred HHHHHHHHHHHHHhccC--CCcccCCCCCCCCCC-------CCCCCCC Confidence 10000000000111010 011111111111111 0001111 No 57 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.85 E-value=2.2e-20 Score=128.34 Aligned_cols=470 Identities=10% Similarity=0.031 Sum_probs=231.9 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCCh-HHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDD-EDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~-~~Y~~rl~rA~~~ 79 (661) |--+ |.. ||....-.. .-...-.+.+....+++++.++-|.|...+ +..+.+.. +.++....+++ . T Consensus 1 ~~~~-p~~-~l~~~~~~~--~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i--------~~~~~~~~~~~~~~~~~~~~-~ 67 (479) T protein:vir:99 1 MIDL-PDE-DLSSEGLAK--YLETKVFPKMNTECERLDDFEAWTKNGQEV--------PDLATRHKNKEREVLQQLSR-K 67 (479) T ss_pred CccC-Ccc-cCChhHHHH--HHHHHHHHHHHHHhHHHHHHHHHHhcCCcc--------cccccccCChhHHHHHHHhh-c Confidence 4332 222 333110000 000122357777889999999999887543 22222222 22322222223 5 Q ss_pred chHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCC Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGR 159 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr 159 (661) |+.+-+|+.+++++|-...+.. + .+ . ...++.+.+ -++++.....++..++.+|+ T Consensus 68 n~~~~iVd~~~~~l~~~gf~~~---d--------~~------~---~~~~~~i~~-----~N~~d~~~~~~~~~a~~~G~ 122 (479) T protein:vir:99 68 PWMGLMVNSFAQQLIVDGYRKT---G--------TN------E---NAKGWDTWR-----LNQMDKQQFWLNRAVLTFGY 122 (479) T ss_pred CcHHHHHHHHHhhcccccccCC---C--------ch------h---hHHHHHHHH-----hcChhHHHHHHHHHHhhcCc Confidence 9999999999999875443321 1 00 0 112333322 36888999999999999999 Q ss_pred EEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 160 FGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 160 ~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++++|- |..+..-....|-+..++|++++-.-.+...+. .. -|.+.+ T Consensus 123 af~~v~-~~~~~~d~~g~~~i~~~~p~~~~~iydd~~~~~-~~---------------------~~~~~~---------- 169 (479) T protein:vir:99 123 AFIKVT-SGISPLDGTTVARIKCIDPRDAFAIWEDPYWDE-WP---------------------KYLLER---------- 169 (479) T ss_pred eEEEEe-cCCCCcCCCCceEEEEechhheEEEecCCcccc-ee---------------------eEEEee---------- Confidence 999994 422211122346677788887653210100000 00 000000 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeE Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPF 319 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPf 319 (661) ...+ ...+| ... .+. .+..+..... .....-+.++.||| T Consensus 170 -----------------------~~~~--~~~~~------~~~----~~~--~~~~~~~~~~----~~~~~~h~~g~vPv 208 (479) T protein:vir:99 170 -----------------------QPNG--QYWWW------TEE----DYS--IFEFKQGKFI----YRETVSHDYGHIPF 208 (479) T ss_pred -----------------------cCce--eEEEE------ecc----eEE--EEEecCCcee----eccccccCCCCcce Confidence 0000 00000 000 011 1111111111 11112356899999 Q ss_pred EEEecC-CCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----ceeEecccceeecCCCCC Q lcl|NC_019406. 320 VFFGSM-SNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----SEYHIGPGRVWVVDKESG 393 (661) Q Consensus 320 v~~~~~-~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~~l~iGs~~~~~lp~~ga 393 (661) +.|-.. ..+ .-+.+=|.++-.|-=+.=...|+...++.+.++|++|+.|....+. ....+...++|.+++ . T Consensus 209 v~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~~--~ 285 (479) T protein:vir:99 209 VRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLISQN--E 285 (479) T ss_pred EEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceeecC--C Confidence 976432 221 1123334444444444445678899999999999999999754332 123445555666553 4 Q ss_pred cceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcc-cccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 394 IPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMP-GMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLM 472 (661) Q Consensus 394 ~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~-~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~ 472 (661) +++|.++++..++.+.+.|+.+..++..... +-. .-...++.||++.+.........-..+-..+..+|.+++++++. T Consensus 286 ~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~-~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~ 364 (479) T protein:vir:99 286 KASFGAIPAAPLDGLLNAYKESLLEFLALAQ-LPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNK 364 (479) T ss_pred CceEEEecccchHHHHHHHHHHHHHHhccCC-CCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5678888888888888888877777654321 100 00113456888877776666666666666677789999999999 Q ss_pred HcCCCCCCc-ceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC Q lcl|NC_019406. 473 FRDIPLTDT-ATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ 551 (661) Q Consensus 473 w~G~~~~~~-~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ 551 (661) +.|...... ..+.+.. ++-.+.. .++.++++.+++++|.||++|.+..| -|+-++. .|++.+..+.+ ...+. T Consensus 365 ~~~~~~~~~~~~i~~~w-~~~~~~s-~~~~ad~~~kl~~ag~is~et~l~~l--~gv~~~~--~e~~~~~~~~~-~~~~~ 437 (479) T protein:vir:99 365 IEGRTEEATDLDFTITW-QDVTIQS-LAQFADAWAKMVESLKIPAEGVWDMI--PNLDQST--VNGWKEIYDRE-GDFGK 437 (479) T ss_pred HcCCCccccceeeeEEe-cCCCCCC-HHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHH--HHHHHHHHHHH-HHHHH Confidence 998654211 1233332 2222333 36788999999999999999997655 3442221 12222221111 10000 Q ss_pred chhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhh Q lcl|NC_019406. 552 PDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSV 609 (661) Q Consensus 552 ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 609 (661) .++.+..+. .. -+|+.+...++ +.+ ++-.+++. ++.+..|+- T Consensus 438 -~~~~~~~~~-~~---~~~~~~~~~~~-~~~-------~~~~~~~~---~~~~~~~~~ 479 (479) T protein:vir:99 438 -YMRKLQNGP-DP---AEQRGGPNGAT-NMQ-------QANNKTGE---PASLNKSGA 479 (479) T ss_pred -HHHHHhccc-Cc---ccccCCCCCCC-CCC-------CCCCCCcc---hhccCCCCC Confidence 011111110 00 01110000000 000 11111111 122222222 No 58 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.84 E-value=1.1e-19 Score=124.48 Aligned_cols=461 Identities=10% Similarity=0.043 Sum_probs=238.1 Q ss_pred ccccccccccccccCCccc-----cCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccch Q lcl|NC_019406. 7 NSANIRRTKRGAQQFTHLV-----VHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNM 81 (661) Q Consensus 7 ~~~~~~~~~~~~~~~~V~~-----~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~ 81 (661) -.++|. +..--+++. ---.|....+++.++.+-|.|...++ |+++.. ...+++. +. ..|+ T Consensus 1 ~~~~i~----~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~-----~~~~~~---~~~~~~~--~~-~~n~ 65 (485) T protein:vir:24 1 MTAPLP----GQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPE-----AIGVTV---PVQMQSL--LA-HVGY 65 (485) T ss_pred CCCCCC----CCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchh-----hcCccc---chhhhhh--hh-ccch Confidence 334555 222222222 12445666788888888888876543 344332 2234332 22 3599 Q ss_pred HHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEE Q lcl|NC_019406. 82 TSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFG 161 (661) Q Consensus 82 ~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~g 161 (661) .+-+|+.++++++-...++.+-. ...+.+..++ +-++++.+.+.++..++.||+++ T Consensus 66 ~~~ivd~~~~~l~~~g~~~~~~~----------------~~~~~l~~i~--------~~N~~d~~~~~~~~~a~i~G~ay 121 (485) T protein:vir:24 66 PRLYVDSIAERQAVEGFRLGDAD----------------EADEELWQWW--------QANNLDIEAPLGYTDAYVHGRSY 121 (485) T ss_pred HHHHHHHHhhhhccCceecCCCc----------------hhHHHHHHHH--------HhcChhHHHHHHHHHHhhcCceE Confidence 99999999999987766553110 0111223333 24789999999999999999999 Q ss_pred EEEeccCCCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 162 ALVDVAPSSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 162 vLVD~P~a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++|........ -...+|-+..++|++++-. ++...++ +...+.+.. T Consensus 122 ~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i-~D~~~~~--~~~~~~~~~----------------------------- 169 (485) T protein:vir:24 122 ITISRPDPQIDLGWDPNVPLIRVEPPTRMYAE-IDPRIGR--PAKAIRVAY----------------------------- 169 (485) T ss_pred EEEecCCcccccccCCCcceEEEeccceeEEE-eeCCcCc--eeEEEEEEE----------------------------- Confidence 99976532111 1224566777888776422 1211111 111111000 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeE Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPF 319 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPf 319 (661) +. .+ ..++++.++..+ .+|.+. ..++ .+ ......-++++.||| T Consensus 170 ---------------------~~-~~----~~~~~~~~y~~~----~~~~~~--~~~~--~~---~~~~~~~h~~g~vPv 212 (485) T protein:vir:24 170 ---------------------DA-EG----NEIQAATLYTPN----ETFGWF--RAEG--EW---VEWFSDPHGLGAVPV 212 (485) T ss_pred ---------------------ee-cC----CeEEEEEEEcCC----cEEEEE--ecCC--ce---EeecccccCCCcccE Confidence 00 00 011222222221 112211 1111 11 111112267999999 Q ss_pred EEEecC-CCCCCccccchh-HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--------CceeEecccceeecC Q lcl|NC_019406. 320 VFFGSM-SNAADCEKPPLL-DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--------ASEYHIGPGRVWVVD 389 (661) Q Consensus 320 v~~~~~-~~~~~~~~pPLl-dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--------~~~l~iGs~~~~~lp 389 (661) |.|-.. ..+..-+.+-|. +|-.|.=+.=+..|+...++.+.++|++++.|.+... ...+..+.+..|.++ T Consensus 213 v~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 292 (485) T protein:vir:24 213 VPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFE 292 (485) T ss_pred EEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccC Confidence 977422 111111222222 2333322333456789999999999999999975432 112345666777777 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHHh---HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAIG---GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV 466 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lG---Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a 466 (661) + ++++|.|++..+++.+.+.|+.+..++.... ..-+ ..+...+-||++.+............+-..+..+|.+. T Consensus 293 ~--~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~ 369 (485) T protein:vir:24 293 D--AEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEA 369 (485) T ss_pred C--CCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHh-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 4567788889898888888887777775331 1112 11111235888888887777777788888888899999 Q ss_pred HHHHHHHcCCCC--CCcceEEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHHHHHH Q lcl|NC_019406. 467 VRYWLMFRDIPL--TDTATLRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEFTIKM 542 (661) Q Consensus 467 L~~~A~w~G~~~--~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee~~~l 542 (661) +++++.+.+... .+...+.|...... +.. .++.++++.+++++| .||++|++.. .|+.++.. ++.+++ T Consensus 370 ~~l~~~~~~~~~~~~d~~~i~v~f~~~~-~~s-~~~~ad~~~kl~~~g~~~~s~et~~~~---l~~~~d~~---~e~~~~ 441 (485) T protein:vir:24 370 MRLAYRLMKGGDVPPDMLRMETVWRDPS-TPT-YAAKADAATKLYGNGQGVIPRERARKD---MGYSIAER---EEMRRW 441 (485) T ss_pred HHHHHHHhcCCCCccccceeeEEecCCC-CCC-HHHHHHHHHHHHhcccccCCHHHHHhh---CCCCHhHH---HHHHHH Confidence 999999876432 12234444443222 222 356788889998866 7999999654 36643332 233333 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhh Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVG 605 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~ 605 (661) .++....+..-...+.+... .++.++.. .|.++.+.++..+-. | T Consensus 442 ~ee~~~~~~~~~~~~~~~~~-----~~~~~~~~--------~e~~~~~~~~~~~~~------a 485 (485) T protein:vir:24 442 DEEEAAMGLGLLGTMVDADP-----TVPGSPNP--------TPAPKPQPAIEGGDS------A 485 (485) T ss_pred HHHHhhhhhhHHHhhcccCC-----CCCCCCCC--------CCCCCCccCCCCCCC------C Confidence 32211111111111111100 01110000 000000111100000 0 No 59 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.83 E-value=4e-20 Score=126.96 Aligned_cols=462 Identities=10% Similarity=0.031 Sum_probs=234.0 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |+-+.|+.-|+. .. .-+..---.+....+++.++.+-|.|...++. |+.. -...+++++ .-.| T Consensus 1 ~~~~~~~~~~~~----~~--~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~-----~~~~---~~~~~~~~~---~~~n 63 (484) T protein:vir:77 1 MTSPLQKQENVD----PE--KAREEMLNLFTERTQDLGDNTAYYESERRPDA-----VGVT---VPQQMQKLL---AHVG 63 (484) T ss_pred CCCcccccCCCC----HH--HHHHHHHHHHHHHHHHHHHHHHHHhccccchh-----cccc---cchhHHhhh---hhcC Confidence 777777665554 00 00111222234456778888888999765543 3322 223333332 3459 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+-+|+.+++++|-...++.+-. .+- ..++.+. +-++++...+.+++.++.||++ T Consensus 64 ~~~~ivd~~~~~l~~~g~~~~~~~----------------~~~---~~l~~i~-----~~N~~d~~~~~~~~~a~~~G~a 119 (484) T protein:vir:77 64 YPRLYIDAIAARQELEGFRLGGAD----------------KAD---EQLWDWW-----QANDLDIESTLGHTDSLVHGRS 119 (484) T ss_pred cHHHHHHHHHhhhccCceecCCcc----------------hhH---HHHHHHH-----HhcCHhHHHHHHHHHHhhcCce Confidence 999999999998876554432100 011 1233332 2478999999999999999999 Q ss_pred EEEEeccCCCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 161 GALVDVAPSSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 161 gvLVD~P~a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) +++|-....+.. ....+|-+..++|++++-.. +...+ ..+.-|++ +. T Consensus 120 ~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~-D~~~~-~~~~a~~~--~~--------------------------- 168 (484) T protein:vir:77 120 YITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQI-DPRTR-QVMRAIRA--IE--------------------------- 168 (484) T ss_pred EEEEecCCCCcccccccccceEEEeccceeEEEe-cCCCC-ceEEEEEE--EE--------------------------- Confidence 999965433221 13345677778888765332 11111 11111100 00 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) +...+ .+.++.++..+. ++.+ +..++... ..... -++++.|| T Consensus 169 ----------------------~~~~~-----~~~~~~~y~~~~----~~~~--~~~~~~~~-~~~~~----~~~~g~vP 210 (484) T protein:vir:77 169 ----------------------DEEGN-----EVIGATLYLPNN----TVIW--NREDGQWV-QVANV----AHNLEMVP 210 (484) T ss_pred ----------------------eecCC-----cEEEEEEEecCe----EEEE--EecCCceE-eeccc----cCCCCCcc Confidence 00000 011111122211 1111 11111111 11111 26789999 Q ss_pred EEEEecC-CCCCCccc----cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----c---eeEecccce Q lcl|NC_019406. 319 FVFFGSM-SNAADCEK----PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----S---EYHIGPGRV 385 (661) Q Consensus 319 fv~~~~~-~~~~~~~~----pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~---~l~iGs~~~ 385 (661) ||.|-.. ..+..-+. +++.+|. =+.=+..|+...++++.++|++++.|.+..+. . .+..+.+.. T Consensus 211 vv~f~N~~~~~~~~G~s~i~~~v~~L~---Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 287 (484) T protein:vir:77 211 VIPIPNRTRLSDLYGTTEITPELRSVT---DAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARI 287 (484) T ss_pred eEEeccccccCccCCcccchHHHHHHH---HHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhh Confidence 9976421 11111122 2233331 12223456888899999999999999764321 1 134455667 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) |.+|++ +++|.+++..+++.+.+.|+.+..++....- .-+ ......+-||++.+............+-..+..+ T Consensus 288 ~~~~~~--~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 364 (484) T protein:vir:77 288 LAFEDH--ESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYL-SFSSENPASAEAIRSSESRLVKTVERKNKIFGGA 364 (484) T ss_pred cccCCC--CceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777753 5677888899888888888877777754321 111 1111112478887777666666666667778888 Q ss_pred HHHHHHHHHHHcCCCCCCc--ceEEEEeccccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 463 MTSVVRYWLMFRDIPLTDT--ATLRYEIDATFLTTALDARALRAIQQLYEGG--LLPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 463 l~~aL~~~A~w~G~~~~~~--~~~~v~ln~DF~~~~lda~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) +.+++++++...|...... ..+.+.... -.+.. .++.++++.++.++| .+|++|++..| |+.++.. ++ T Consensus 365 l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~-~~~~s-~~~~ad~~~kl~~~g~gi~s~et~~~~l---~~~~~~~---~e 436 (484) T protein:vir:77 365 WEQAMRVAYKVMNGGDIPPEYYRMESIWRD-PSTPT-YAAKADAATKLYNNGQGVIPKERARIDM---GYSITER---EE 436 (484) T ss_pred HHHHHHHHHHHhCCCCcccccccceEEecC-CCCCC-HHHHHHHHHHHHhccCCCCCHHHHHhcC---CCChhHH---HH Confidence 9999999998887533211 223333322 12222 356788999999976 89999997655 6655432 22 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHh Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRIS 601 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~ 601 (661) .+++.++.... ++...+.......+..++-...+. -+.++..+...-. T Consensus 437 ~~~~~~ee~~~----~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~ 484 (484) T protein:vir:77 437 MRKWDEEEQAQ----GLGLMGTMFGTDPSGGGNPDNPET-----------PEPQPNPAEEAAA 484 (484) T ss_pred HHHHHHHHHHH----HHHHHhhhccccccCCCCCCCCCc-----------ccccCCCccccCC Confidence 33333221110 111111111111111111000000 0011111111100 No 60 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.83 E-value=1.3e-19 Score=124.17 Aligned_cols=427 Identities=11% Similarity=0.064 Sum_probs=227.1 Q ss_pred CCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHH Q lcl|NC_019406. 4 LSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTS 83 (661) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~ 83 (661) +++. +..-|..---.|....++++.+.+-|.|...++ |+|+ .....|++.. ...|+.+ T Consensus 1 ~~~~-----------~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~-----~~~~---~~~~~~~~~k---~~~n~~~ 58 (441) T protein:vir:80 1 MNSD-----------ELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVR-----DLGV---AIPPELQRVQ---TVVSWPG 58 (441) T ss_pred CCcc-----------HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcch-----hcCc---ccchhhhhhh---hhcchHH Confidence 1111 111122223345666677888888888865442 2333 3333444332 3579999 Q ss_pred HHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEE Q lcl|NC_019406. 84 QTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGAL 163 (661) Q Consensus 84 ~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvL 163 (661) -+|+.++++++-. .+. .++ .+.++.++. -++++.+...++..++.+|+++++ T Consensus 59 ~ivd~~~~~l~~~--g~~-~~d-----------------~~~l~~i~~--------~n~~~~~~~~~~~~~~~~G~a~~~ 110 (441) T protein:vir:80 59 IAVDALEERLDWL--GWT-NGD-----------------GYGLDGVYA--------ANRLATASCDVHLDALIFGLSFVA 110 (441) T ss_pred HHHHHHHhhhccc--ccc-CCC-----------------hHHHHHHHH--------hcCHHHHHHHHHHHHhhcCeeEEE Confidence 9999999988532 221 111 112223332 368999999999999999999999 Q ss_pred EeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhc Q lcl|NC_019406. 164 VDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRA 243 (661) Q Consensus 164 VD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~ 243 (661) |= +..+ ..|.+..++|++++-. ++...++..+ .+++.... + T Consensus 111 v~-~d~~-----g~~~i~~~~p~~~~~i-~d~~~~~~~~-~~~~~~~~---~---------------------------- 151 (441) T protein:vir:80 111 II-PHGD-----GTVSVRPQSPKNCTGK-FSADGSRLDA-GLVVQQTC---D---------------------------- 151 (441) T ss_pred EE-eCCC-----CceEEEEEccceEEEE-EeCCCCceeE-EEEEEEEe---c---------------------------- Confidence 84 3222 3467888889886532 1222222111 11111000 0 Q ss_pred chhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEe Q lcl|NC_019406. 244 GLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFG 323 (661) Q Consensus 244 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~ 323 (661) .. ..+ +-++..+ .+++. +..+..+.... ....++++.||+|.+- T Consensus 152 ---------------------~~----~~~-~~vy~~~----~~~~~--~~~~~~~~~~~----~~~~~~~g~vPvv~~~ 195 (441) T protein:vir:80 152 ---------------------PE----VVE-AELLLPD----VIVQV--ERRGSREWVEV----DRIPNVLGAVPLVPIV 195 (441) T ss_pred ---------------------Cc----eEE-EEEEecC----eEEEE--EEcCCcceeec----cccccCCCceeEEEee Confidence 00 000 1111111 01111 11111111111 1123678999999764 Q ss_pred cC-CCCCCccccchh-HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--CceeEecccceeecCCC--CCcceE Q lcl|NC_019406. 324 SM-SNAADCEKPPLL-DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--ASEYHIGPGRVWVVDKE--SGIPGI 397 (661) Q Consensus 324 ~~-~~~~~~~~pPLl-dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--~~~l~iGs~~~~~lp~~--ga~~~y 397 (661) .. ..+..-+.+-|. ++-.|-=+.=...|+...++.+.++|+++++|.+.++ .+...+..+..|.+|.. +..+.+ T Consensus 196 n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 275 (441) T protein:vir:80 196 NRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNV 275 (441) T ss_pred ccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCCccee Confidence 22 222222333332 2322222444556788899999999999999976443 22345566677777643 335777 Q ss_pred eecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019406. 398 IEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFR 474 (661) Q Consensus 398 lE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~ 474 (661) .+++..+++...+.|+.+..++....- .-+ ..+...+.||++.+............+-..+..+|.+++++++.++ T Consensus 276 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 354 (441) T protein:vir:80 276 GSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYF-GFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKAL 354 (441) T ss_pred EecCccchHHHHHHHHHHHHHHhcccCCCHHHh-ccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 888888888888888888877754321 111 1111223589999888888888888888888889999999999999 Q ss_pred CCCCCCc---ceEEEEeccccccccCCHHHHHHHHHHHhcCCC--CHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCC Q lcl|NC_019406. 475 DIPLTDT---ATLRYEIDATFLTTALDARALRAIQQLYEGGLL--PIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFI 549 (661) Q Consensus 475 G~~~~~~---~~~~v~ln~DF~~~~lda~~l~all~~~~aG~I--s~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l 549 (661) |...... ..+.+..++ ..+.. ..+.++++.+++++|.+ |+++++..| |..+++ .++++.+.. T Consensus 355 ~~~~~~~~~~~~i~~~f~~-~~~~~-~~e~ad~~~kl~~~g~~~~s~~~~~~~l---~~~~~e------~~~~~~e~~-- 421 (441) T protein:vir:80 355 DSRVDEADFFGDVGLRWRD-ASTPT-RAATADAVTKLVGAGILPADSRTVLEML---GLDDVQ------VEAVMRHRA-- 421 (441) T ss_pred cCCCcccccceeeeEEeCC-CCCcC-HHHHHHHHHHHHhcCcccccHHHHHHhC---CCCHHH------HHHHHHHHH-- Confidence 8654332 233444433 22222 25678889999999986 566665333 543222 222221100 Q ss_pred CCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhcc Q lcl|NC_019406. 550 GQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHL 591 (661) Q Consensus 550 ~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~ 591 (661) -..+......|.. +.+ - .++ T Consensus 422 e~~~~~~~~~~~~-------------~~~----~-----~~~ 441 (441) T protein:vir:80 422 ESSDPLAVLAGAI-------------SRQ----T-----NEV 441 (441) T ss_pred HHHHHHHHHhhhh-------------hcc----c-----ccC Confidence 0000001111100 000 0 011 No 61 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.83 E-value=1.4e-19 Score=123.91 Aligned_cols=408 Identities=10% Similarity=0.013 Sum_probs=236.9 Q ss_pred cccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccC Q lcl|NC_019406. 18 AQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRP 97 (661) Q Consensus 18 ~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~ 97 (661) =..|.|+.--..+....+++.++.+-|.|...++ |||. .-.+.|+.+. ++ ..|+.+-.|++++++++-.- T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~-----~~~~---~~p~~~~~~~-~~-v~nw~~~~Vd~~a~rl~~~G 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDD-----TRSI---VMPNNVREMY-RS-VLEWTAKGVDSLADRIIFRE 70 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChh-----hcCc---cccHHHHHHH-Hh-hcchhHHHHHHHHhccccce Confidence 3344467777888889999999999999975543 4443 3445566543 33 45999999999999886653 Q ss_pred ccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhccc Q lcl|NC_019406. 98 PVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAK 177 (661) Q Consensus 98 p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~r 177 (661) .+. | |+ .+++.. +=|+++.....+.+.+|.||+|+++|= +..+ ..+ T Consensus 71 f~~---~----------d~-----------~l~~~w-----~~N~ld~~~~~~~~~al~~G~sf~~v~-~~~~----~~~ 116 (422) T protein:vir:97 71 FTN---D----------DF-----------NAWEIF-----KANNPDIFFDTAIQSALIASCCFVYIM-PGAE----DGL 116 (422) T ss_pred eeC---C----------ch-----------hHHHHH-----HhcChHHHHHHHHHHHHHhcceeEEEe-eCCC----CCe Confidence 322 1 11 122222 248899999999999999999999993 2111 135 Q ss_pred ceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhe Q lcl|NC_019406. 178 SYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADAL 257 (661) Q Consensus 178 PY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~ 257 (661) |.+..++|++++... +...++ +.....+ | T Consensus 117 p~i~~~sp~~~~~i~-D~~~~~--~~~a~~~----------------------------~-------------------- 145 (422) T protein:vir:97 117 PKMQVIEASKATGIL-DPTTFL--LTEGYAI----------------------------L-------------------- 145 (422) T ss_pred eEEEEechhhEEEEE-eCCCCc--ceeeEEE----------------------------E-------------------- Confidence 788889999876543 322221 1100000 0 Q ss_pred ecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCCCCccccch Q lcl|NC_019406. 258 ARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNAADCEKPPL 336 (661) Q Consensus 258 ~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~~~~~~pPL 336 (661) . ....+.. ..+..+..+ .+ +.+..+.... . -.++++.||+|.|... +.+-.-+.+-+ T Consensus 146 ~---~~~~~~~----~~~~~~~~~----~~---~~~~~~~~~~----~----~~~~~g~vPvv~~~n~~~~~~~~G~s~I 203 (422) T protein:vir:97 146 E---SDSNGNP----TLEAYFTDK----DI---WYYPKKGKPY----N----IKNPTGHPLLVPIIHRPDAVRPFGRSRI 203 (422) T ss_pred E---ecCCCcE----EEEEEEcCc----eE---EEEcCCCccc----c----ccCCCCCcceEEecccCCCccccCcccc Confidence 0 0000000 011111111 01 1111111110 1 1367889999977532 22211222222 Q ss_pred -hHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC--ceeEecccceeecCCC--CCcceEeecCchhHHHHHHH Q lcl|NC_019406. 337 -LDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA--SEYHIGPGRVWVVDKE--SGIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 337 -ldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~--~~l~iGs~~~~~lp~~--ga~~~ylE~~g~~i~a~~~~ 411 (661) .++-.|.=+.=+..++...+.++.++|++|+.|++.... +.+.....+.|.+|++ |..+++-|+++.++.-+.+. T Consensus 204 ~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~ 283 (422) T protein:vir:97 204 TKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEH 283 (422) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHH Confidence 123233334445667788899999999999999975331 2234455578888753 34677888999999999999 Q ss_pred HHHHHHHHHHHhHHhccc-ccCcc-chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc---ceEEE Q lcl|NC_019406. 412 LNEKEQQIAAIGGRLMPG-MSKSV-SESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDT---ATLRY 486 (661) Q Consensus 412 L~~le~qM~~lGArll~~-~~~~~-~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~---~~~~v 486 (661) |+.+..++.....=.... +..+. +.||++.+............+-..+..++.+++++++...|...... .++.+ T Consensus 284 l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~ 363 (422) T protein:vir:97 284 LKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVI 363 (422) T ss_pred HHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceE Confidence 988888887542111110 00111 35788888776666666667777778888888889888877432111 12333 Q ss_pred EeccccccccC-CHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcC Q lcl|NC_019406. 487 EIDATFLTTAL-DARALRAIQQLYEG--GLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRG 560 (661) Q Consensus 487 ~ln~DF~~~~l-da~~l~all~~~~a--G~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g 560 (661) ...+-+..... .++..+++.+++++ |.++.++.++.| |+ ++ .+++..++++... +| T Consensus 364 ~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~l---g~-~~---~~~~~~~~~~~~~-----------d~ 422 (422) T protein:vir:97 364 KWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLT---GV-KG---ADKPIPAITEVTT-----------DG 422 (422) T ss_pred EEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHc---CC-Cc---hhHHHHHHHhhhc-----------cC Confidence 33332222111 24567889999998 778899887665 77 21 2333344443311 11 No 62 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.82 E-value=1.2e-19 Score=124.29 Aligned_cols=441 Identities=11% Similarity=0.090 Sum_probs=240.3 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |-..||.-. +..---.+....++++++++-|.|...++ |+|+ .....|+..-.+++ .| T Consensus 1 ~~~~t~~~~-------------~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~-----~~~~---~~~~~~~~~~~~~~-~n 58 (456) T protein:vir:79 1 MTASTPAEW-------------LPVLTKRIDDGMSRVRLLARYSNGDAPLP-----ELTR---NTSAAWRSFQREAR-TN 58 (456) T ss_pred CCCCCHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhccCChh-----hcCc---ccChhhchhhhhhh-cc Confidence 333333211 11111235566777888888888864433 3433 22344555444444 78 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+.+|+.++|++|-++.++..-++ .+ .. ..++++.+ -++++.+.+.+++.++.+|+| T Consensus 59 ~~~~ivd~~~~~l~~~g~~~~~~~d--------~~------~~---~~~~~~~~-----~n~~d~~~~~~~~~a~~~G~a 116 (456) T protein:vir:79 59 WGLMVRDSVADRIIPNGITVGGSAD--------SD------LA---LRARRIWR-----DNRMDSVCKQWVKYGLDFGES 116 (456) T ss_pred hHHHHHHHHHhhhccCCeecCCCCC--------cc------HH---HHHHHHHH-----hcChhHHHHHHHHHHhhcCee Confidence 9999999999999999987632111 11 11 12333332 357889999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++|= +..+ ..|.+..++|.+++-. +++..++.....+++.. .. +. ..-+...|... T Consensus 117 ~~~~~-~~ed-----g~~~i~~~~p~~~~~i-~d~~~~~~~~~~~~~~~--~~--d~----~~~~~~~~~~~-------- 173 (456) T protein:vir:79 117 YLTCW-RRDD-----GTATITADSPETMVVS-VDPLQPWRIRSAMRWWR--DL--DA----ESDFAIVWSGD-------- 173 (456) T ss_pred EEEEe-eCCC-----CceEEEEeccceeEEE-EcCCCCCceEEEEEEEE--ec--CC----ceeEEEEEcCC-------- Confidence 99874 3222 2467788888886533 23333332222222211 00 00 00011111100 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeec-cCCcccceeeE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPM-VRGRTLPFIPF 319 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~-~~g~~L~~IPf 319 (661) ...+.++.-..... .+ . ...+... ..+.+. ...+.++.||+ T Consensus 174 ---------------------------~~~~~~~~~~~~~~--~~-~-~~~~~~~-------~~~~~~~~~~~~~~~~pv 215 (456) T protein:vir:79 174 ---------------------------GWQKFARPCFVQSS--SR-R-RLVTRIS-------DSWVPVGDAVVTGSPPPV 215 (456) T ss_pred ---------------------------ceEEEEEEEEeecc--cc-c-eeeeccC-------CceeecccccCCCCceeE Confidence 00000110000000 00 0 0000010 111111 12346889999 Q ss_pred EEEecCCCCCCccc-cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCC------CCc------eeEeccccee Q lcl|NC_019406. 320 VFFGSMSNAADCEK-PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDS------DAS------EYHIGPGRVW 386 (661) Q Consensus 320 v~~~~~~~~~~~~~-pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~------~~~------~l~iGs~~~~ 386 (661) +++. |.+..+. -|+.+|-+ +.=+..|+....+.+.++|++++.|.+.. +++ .+..+.+..| T Consensus 216 v~~~---N~~~~gd~e~v~~liD---~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~ 289 (456) T protein:vir:79 216 VVYQ---NPDGMGEVEPHIDIIN---RINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALW 289 (456) T ss_pred EEec---CCCCCchhhhhHHHHH---HHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccc Confidence 9873 3333322 13333322 22235567778889999999999997532 121 1334555666 Q ss_pred ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDGMTS 465 (661) Q Consensus 387 ~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~ 465 (661) .+| ++++++ +++...++.+.+.|+.+..++....--.... +...++-|+++.+............+-..+..+|.+ T Consensus 290 ~~~-~~~~~~--q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) T protein:vir:79 290 ELP-PGVDIW--ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) T ss_pred cCC-CCccee--eecccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666 355655 4557778888888888888886542210000 111235689988888888888888888889999999 Q ss_pred HHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhcc Q lcl|NC_019406. 466 VVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDP 545 (661) Q Consensus 466 aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~ 545 (661) ++++++.+.|..+.. .+.+... +-.+.. .++.++++.++.++|.+|+++++..| |+-+++. ...+.+++.++ T Consensus 367 ~~~l~~~~~g~~~~~--~i~v~w~-~~~~~s-~~~~ada~~kl~~~G~~~~~~~~~~l---g~~~~~i-~~~e~~r~~~e 438 (456) T protein:vir:79 367 ILVKALQIEGESVED--TVDVSFE-SPDRVT-LGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLDRAREQ 438 (456) T ss_pred HHHHHHHhcCCCccc--cceEEeC-CCCCcC-HHHHHHHHHHHHhcCCChHHHHHhcC---CCCHHHH-HHHHHHHHHHH Confidence 999999999865433 3444332 222232 36788999999999999999885443 6654433 24456666665 Q ss_pred CCCCCCchhhhhhcCCccccCCCcchhhhh Q lcl|NC_019406. 546 KSFIGQPDAIAMRRGYVSRQQELDQQRAAR 575 (661) Q Consensus 546 ~~~l~~ddae~~~~g~~~~~~~~~q~~~~~ 575 (661) ...+.-. +.+++ |.++++ T Consensus 439 ~~~~~~~---~~~~~---------~~~~~~ 456 (456) T protein:vir:79 439 ITLFAGN---PVQRP---------QEDGSR 456 (456) T ss_pred HHHHhhh---HhhcC---------CCCCCC Confidence 4443211 11222 111122 No 63 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.81 E-value=7.9e-19 Score=119.83 Aligned_cols=392 Identities=9% Similarity=-0.010 Sum_probs=219.0 Q ss_pred ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccc Q lcl|NC_019406. 23 HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRN 102 (661) Q Consensus 23 V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~ 102 (661) .+.. .++..+..+-|.|...++ ||+. +-.+.|+.++ ++ ..|+.+-+|++++++++-.--+ T Consensus 1 l~~~-------~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~-~~-v~nw~~~~Vds~a~rl~~~Gf~--- 60 (410) T protein:vir:95 1 MNLY-------QSRVNLRYKHYAMQHYEA-----PTGI---TIPAHIRAKY-QA-VLGWAAKGVDSLADRLIFRAFA--- 60 (410) T ss_pred CCcc-------hhhHHHHHHHhcCCCCcc-----ccch---hccHHHHhHH-Hh-hcchhHHHHHHhHhhhcccccc--- Confidence 3333 455556677788865442 3432 3344566554 34 4699999999998877653221 Q ss_pred cchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEe Q lcl|NC_019406. 103 LPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVG 182 (661) Q Consensus 103 ~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~ 182 (661) +.|+ .++++. +-|+++.....+.+.+|.||+|+++| ++..+ .+|.+.. T Consensus 61 ----------~~d~-----------~l~~i~-----~~N~ld~~~~~~~~~al~~G~sf~~v-~~~~d-----~~~~i~~ 108 (410) T protein:vir:95 61 ----------NDDF-----------NVTEIF-----DRNNPDIFFDSAILSALIGSCSFVYI-SKGED-----DEVRLQV 108 (410) T ss_pred ----------CCCc-----------hHHHHH-----hhcChHHHHHHHHHHHHHhCceeEEE-ecCCC-----CceEEEE Confidence 1121 123322 35899999999999999999999999 33222 3578888 Q ss_pred echhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccc Q lcl|NC_019406. 183 YAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSR 262 (661) Q Consensus 183 ~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~ 262 (661) ++|.++.-.- +...++ ...-+++ | . .. T Consensus 109 ~sP~~~~~i~-Dp~~~~-~~~al~~-----------------------------~--------------------~--~~ 135 (410) T protein:vir:95 109 IESSNATGVI-DPITGL-LVEGYAV-----------------------------L--------------------A--RD 135 (410) T ss_pred EcccceEEEE-eCCCCc-eEEEEEE-----------------------------E--------------------E--ec Confidence 9998765332 222111 0000000 0 0 00 Q ss_pred cCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCC--CCccc--cchh Q lcl|NC_019406. 263 FTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNA--ADCEK--PPLL 337 (661) Q Consensus 263 ~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~--~~~~~--pPLl 337 (661) .++ ...++..+.++. ++.+ .+.+.. + .+ .+++++||+|.|... +.+ +..+. .|+. T Consensus 136 -~~~----~~~~~~~~~~~~----~~~~--~~~~~~--~---~~----~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~ 195 (410) T protein:vir:95 136 -DYN----RPTLEAYFEPNA----THFI--PKDGEP--Y---SV----TNETGIPLLVPVIHRPDAVRPFGRSRITRAGM 195 (410) T ss_pred -CCC----eEEEEEEEeCCc----EEEE--eeCCcc--c---cc----cCCCCCcceEEecccccCCccCCccccchhHH Confidence 000 001111111211 1111 111111 1 11 256899999977522 211 11221 3555 Q ss_pred HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--CceeEecccceeecCCC--CCcceEeecCchhHHHHHHHHH Q lcl|NC_019406. 338 DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--ASEYHIGPGRVWVVDKE--SGIPGIIEFKGEGLKTLERALN 413 (661) Q Consensus 338 dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--~~~l~iGs~~~~~lp~~--ga~~~ylE~~g~~i~a~~~~L~ 413 (661) +|.+ +.=+..++...+.++.++|+.|+.|+++.. .+.+....++.|.+|++ |..+++-|+++.++.-+.+.|+ T Consensus 196 ~l~d---a~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~ 272 (410) T protein:vir:95 196 YYQK---YAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLR 272 (410) T ss_pred HHHH---HHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHH Confidence 5543 444566778889999999999999997532 22345566678888864 2367888999999999989888 Q ss_pred HHHHHHHHHhHHhccc-ccCcc-chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC---CcceEEEEe Q lcl|NC_019406. 414 EKEQQIAAIGGRLMPG-MSKSV-SESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLT---DTATLRYEI 488 (661) Q Consensus 414 ~le~qM~~lGArll~~-~~~~~-~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~---~~~~~~v~l 488 (661) .+..++.....=.... +..+. +-||++.+............+-..+..++.++++++....+.... ....+.+.. T Consensus 273 ~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W 352 (410) T protein:vir:95 273 TAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKW 352 (410) T ss_pred HHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEe Confidence 8888887552111111 00111 256777665544444444445555677788888887777653221 112233333 Q ss_pred cc--ccccccCCHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC Q lcl|NC_019406. 489 DA--TFLTTALDARALRAIQQLYEG--GLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ 551 (661) Q Consensus 489 n~--DF~~~~lda~~l~all~~~~a--G~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ 551 (661) -+ |=.... .++..+++.+++++ |.++++++++.| |+-+ +++.....++..+.|. T Consensus 353 ~p~~d~~~~s-~a~~aDa~~Kl~~a~~g~~~~~~~~~~l---g~~~-----~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 353 EPLFEADANT-MTMIGDGVVKLNQALPGYINAETIRDLT---GIAG-----DMSAKPVVSEGGSNGE 410 (410) T ss_pred eecCCcchhh-HHHHHHHHHHHHHhccCCccHHHHHHhc---CCCh-----HHHHHHHHHHHHhCCC Confidence 21 222223 36789999999998 788999986555 6632 3333333332222222 No 64 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.81 E-value=1e-18 Score=119.27 Aligned_cols=459 Identities=12% Similarity=0.039 Sum_probs=233.0 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |--..|.-.+..- ...-+..--..|....++.+++.+-|.|...++. ++.. .+..+++. ++ ..| T Consensus 1 ~~~~~~~~~e~~~-----~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~-----~~~~---~~~~~~~~--~~-v~n 64 (486) T protein:vir:42 1 MTAPLPGMEEIED-----PAVVREEMISAFEDASKDLASNTSYYDAERRPEA-----IGVT---VPREMQQL--LA-HVG 64 (486) T ss_pred CCCCCCCCCCccc-----HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchh-----cccc---cchhHhhh--hh-ccc Confidence 3222222111110 0000223335566777888899999999765543 3221 12223322 22 349 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) +.+-+|+.++++++-...++.+ .+ ...+.+..++ +.++++.....++..++.||++ T Consensus 65 ~~~~iVd~~~~~l~~~g~~~~~-----------~~-----~~~~~~~~i~--------~~N~~d~~~~~~~~~a~~~G~a 120 (486) T protein:vir:42 65 YPRLYVDSVAERQAVEGFRLGD-----------AD-----EADEELWQWW--------QANNLDIEAPLGYTDAYVHGRS 120 (486) T ss_pred hHHHHHHHHHhhhcccceecCC-----------Cc-----hhHHHHHHHH--------HhcChhHHHHHHHHHHhhcCce Confidence 9999999999988544333211 00 0111122232 3578999999999999999999 Q ss_pred EEEEeccCCCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 161 GALVDVAPSSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 161 gvLVD~P~a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) |++|........ ....+|-+..++|++++-+- +...++ .+..|++.. T Consensus 121 y~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~-d~~~~~-~~~~~~~~~----------------------------- 169 (486) T protein:vir:42 121 FITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEI-DPRINR-VSKAIRVAY----------------------------- 169 (486) T ss_pred EEEEecCCcccccccCCCeeEEEEecccceEEEE-eCCCCC-eEEEEEEEE----------------------------- Confidence 999975432211 12345777788888876553 222211 111111100 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) +.+ + +.+..+.++..+ .++.+. ...+. ....... -+.++.|| T Consensus 170 ----------------------~~~-~----~~~~~~~~y~~~----~~~~~~-~~~~~--~~~~~~~----~h~~g~vP 211 (486) T protein:vir:42 170 ----------------------DKE-G----NEIQAATLYTPM----ETIGWF-RADGE--WAEWFNV----PHGLGVVP 211 (486) T ss_pred ----------------------ecC-C----CeEEEEEEEcCC----cEEEEE-ecCCc--EEeecce----ecCCCCce Confidence 000 0 001111112111 111111 11111 1111111 25689999 Q ss_pred EEEEecC-CCCCCccc----cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----c---eeEecccce Q lcl|NC_019406. 319 FVFFGSM-SNAADCEK----PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----S---EYHIGPGRV 385 (661) Q Consensus 319 fv~~~~~-~~~~~~~~----pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~---~l~iGs~~~ 385 (661) ||.|-.. ..+...+. +++.+|- =+.=+..|+...+..+.++|++++.|.+.... . .+....+.. T Consensus 212 vv~~~n~~~~~~~~G~s~i~~~v~~li---Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 288 (486) T protein:vir:42 212 VVPLPNRTRLSDLYGTSEITPELRSMT---DAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARI 288 (486) T ss_pred EEEeccccccCCCCCcccchhhHHHHH---HHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchh Confidence 9976421 11111122 2233322 12223456888899999999999999764321 1 123344556 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) |.+++ ++++|.|+++.+++.+.+.|+.+..++....- .-+ ......+-||++.+............+-..+..+ T Consensus 289 ~~~~~--~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~ 365 (486) T protein:vir:42 289 LAFED--AEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADNPASAEAIRAAESRLIKKVERKNLMFGGA 365 (486) T ss_pred cccCC--CCceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHh-ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77664 46778899999999888888888777754321 111 1111112478888888777777777888888999 Q ss_pred HHHHHHHHHHHcCCCCC--CcceEEEEeccccccccCCHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 463 MTSVVRYWLMFRDIPLT--DTATLRYEIDATFLTTALDARALRAIQQLYEG--GLLPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 463 l~~aL~~~A~w~G~~~~--~~~~~~v~ln~DF~~~~lda~~l~all~~~~a--G~Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) |.+++++++.+.|.... +...+.+...... +.. .++.++++.+++++ |.+|++|+++.| |+.++. .++ T Consensus 366 l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~-~~s-~~~~ad~~~kl~~~~~g~~s~et~~~~l---g~~~d~---~~e 437 (486) T protein:vir:42 366 WEEAMRIAYRIMKGGDVPPDMLRMETVWRDPS-TPT-YAAKADAATKLYGNGQGVIPRERARIDM---GYSVKE---REE 437 (486) T ss_pred HHHHHHHHHHHhcCCCccccceeeeEEecCCC-CCC-HHHHHHHHHHHHhcccCCCCHHHHHhcC---CCChhH---HHH Confidence 99999999998875322 1223444443221 222 35678888898886 789999996543 665443 333 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccC-----CCcchhhhhcCChhhHHHHHHHhccCC Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQ-----ELDQQRAARDADFQQQELEQAERHLEI 593 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~-----~~~q~~~~~e~d~~q~~~~~~e~~~~~ 593 (661) .+++.++....+..-...+.+.....+. +-++.+++.++ .++.- T Consensus 438 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~ 486 (486) T protein:vir:42 438 MRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIES-----------SGGDA 486 (486) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCC-----------CCCCC Confidence 4444332211111100111111000000 00000011000 00000 No 65 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.79 E-value=1.3e-18 Score=118.59 Aligned_cols=392 Identities=10% Similarity=0.019 Sum_probs=220.5 Q ss_pred cccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccC Q lcl|NC_019406. 18 AQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRP 97 (661) Q Consensus 18 ~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~ 97 (661) =..+.|+.---.+....+++..+.+-|.|...++ ||+. .-.+.++.+++ . ..|+.+-+|++++++++-.- T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchh-----hcCh---hhhHHHHHHHh-h-hcchhHHHHHHhHhhcccCc Confidence 1122245555567777888889999999986554 3432 22223333332 2 45999999999998776432 Q ss_pred ccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhccc Q lcl|NC_019406. 98 PVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAK 177 (661) Q Consensus 98 p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~r 177 (661) .+ ..|+ .++++. +=|+++.....+.+.+|.||+|+++|- +..+ .+ T Consensus 71 f~-------------~~d~-----------~l~~i~-----~~N~ld~~~~~~~~~aliyG~sf~~v~-~~~d-----g~ 115 (409) T protein:vir:94 71 FE-------------NDDF-----------TVNEIF-----EENNPDIFFDSAVLSSLIASCSFTYIS-KGEN-----DA 115 (409) T ss_pred cc-------------CCch-----------HHHHHH-----HhcChhHHHHHHHHHHHHhcceeEEEe-cCCC-----Cc Confidence 11 1111 223322 348999999999999999999999994 3222 35 Q ss_pred ceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhe Q lcl|NC_019406. 178 SYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADAL 257 (661) Q Consensus 178 PY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~ 257 (661) |-+..++|.+++-+- +...++ ....+++ |. T Consensus 116 ~~i~~~sp~~~~~i~-D~~~~~-~~~a~~~-----------------------------~~------------------- 145 (409) T protein:vir:94 116 VRLQVIEAVNATGII-DPITGL-LTEGYAV-----------------------------LE------------------- 145 (409) T ss_pred eEEEEeccceEEEEE-ecCCCc-eeeeEEE-----------------------------EE------------------- Confidence 888888998765332 222111 1111110 00 Q ss_pred ecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCCCCccc--- Q lcl|NC_019406. 258 ARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNAADCEK--- 333 (661) Q Consensus 258 ~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~~~~~~--- 333 (661) ++ ..+.. +. ...+.++. ++. .++.++. + ... .++++.||+|.|... +.+-..+. T Consensus 146 ---~d-~~~~~--~~--~~~~~~~~----~~~--~~~~~~~--~--~~~----~n~~g~vPvV~f~n~~~~~~~~G~s~I 203 (409) T protein:vir:94 146 ---RD-ENNNV--VL--EAHFLPDR----TDY--YYRDSRN--N--ISI----ANPTGHPLLVPIIHRPDAVRPFGRSRI 203 (409) T ss_pred ---ec-CCCce--EE--EEEEecCc----EEE--EEecCce--e--Eee----eCCCCCcceEEeccccccccccCcccc Confidence 00 00000 00 01111111 111 1122111 1 111 256889999977532 11111222 Q ss_pred -cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--CceeEecccceeecCCC--CCcceEeecCchhHHHH Q lcl|NC_019406. 334 -PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--ASEYHIGPGRVWVVDKE--SGIPGIIEFKGEGLKTL 408 (661) Q Consensus 334 -pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--~~~l~iGs~~~~~lp~~--ga~~~ylE~~g~~i~a~ 408 (661) .|+.+|. =+.=+..++...+.++.++|+.|+.|+++.. .+.+..+.++.|.+|++ |.++++-|+++..+.-+ T Consensus 204 ~e~v~~l~---da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~ 280 (409) T protein:vir:94 204 TRSGMYWQ---SNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPF 280 (409) T ss_pred chhHHHHH---HHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHH Confidence 2444432 2334555778899999999999999996532 22456677788888753 45688889999999999 Q ss_pred HHHHHHHHHHHHHHhHHhccc-ccCcc-chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc---ce Q lcl|NC_019406. 409 ERALNEKEQQIAAIGGRLMPG-MSKSV-SESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDT---AT 483 (661) Q Consensus 409 ~~~L~~le~qM~~lGArll~~-~~~~~-~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~---~~ 483 (661) -+.|+.+..++.....=.... +..+. +.||++.+............+-..+..++.++++++....|...... .. T Consensus 281 ~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~ 360 (409) T protein:vir:94 281 TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRK 360 (409) T ss_pred HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccc Confidence 899999988888652211110 00111 25677776554443334444444566778888888877766422111 22 Q ss_pred EEEEeccccccccC-CHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCCCcc Q lcl|NC_019406. 484 LRYEIDATFLTTAL-DARALRAIQQLYEGG--LLPIDALYENFVKNGIIPST 532 (661) Q Consensus 484 ~~v~ln~DF~~~~l-da~~l~all~~~~aG--~Is~et~~~eL~r~gvl~~~ 532 (661) +.+...+-+.+... -++..+++.+++++| ..+.+++++ +.|+=.++ T Consensus 361 ~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~---~lG~~~~d 409 (409) T protein:vir:94 361 TKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRD---LTGIEGGE 409 (409) T ss_pred ceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHH---HcCCCCCC Confidence 34444322221111 145689999999999 456676644 45885555 No 66 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.75 E-value=6.2e-17 Score=109.45 Aligned_cols=492 Identities=12% Similarity=0.025 Sum_probs=255.8 Q ss_pred CC----------CCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHH Q lcl|NC_019406. 1 MA----------GLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYA 70 (661) Q Consensus 1 ~~----------~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~ 70 (661) |+ -+-+++.||- +.+.+.=.+.+..+++..|.|.|.. ++-=++...++ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p-----------~~v~~~d~~Rl~aY~l~~~~y~n~~-----~~~~~~lrg~~------ 58 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFP-----------NAVTDFDKARLASYRLYEDMYLTNT-----SDYQVILRGGD------ 58 (527) T ss_pred CCccccccCCCcCcCCccccCc-----------ccCCHHHHHHHHHHHHHHHHhcCch-----hheeeecCCcc------ Confidence 32 2245555552 1256666778889999999998841 11112222222 Q ss_pred HHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHH Q lcl|NC_019406. 71 NYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTV 150 (661) Q Consensus 71 ~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~ 150 (661) +|=+|-++.|.- +.++.++-.|. +|+ +++ ..++.-.. +- ..+..+ .+=++|+..+.+. T Consensus 59 ~~~~r~~~~ps~--------~~~~~~~~~~~-~~g-~~~---~~~~~~e~-v~---~~lr~~-----~~~e~l~~~~~~~ 116 (527) T protein:vir:10 59 EGDQRPIYVPNG--------EKLIEAKMRFL-GQG-LKW---EFSKKDAK-VD---DAIKVL-----FDRENWEQKFESL 116 (527) T ss_pred ccccceeeehhh--------HHhhCCcceee-ccC-ccc---cccchhHH-HH---HHHHHH-----HHHhhhHHHHHHH Confidence 111233333333 22333332231 111 111 11111000 00 112222 2347899999999 Q ss_pred HHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeee-eeeeeccccccccccceeeee Q lcl|NC_019406. 151 ALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLR-EFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 151 ~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ir-e~~~~~~~~~~~~~~~~i~~~ 229 (661) -+.++..|-...+|=.. +++..+.||-+..++|.++--|. +-+|.....-|-+. ++..-++.... T Consensus 117 ~r~~~vlGDg~f~l~wD--~~k~~~~R~~v~~~DP~~~f~~e--d~d~~~~v~~v~~~~~~~~P~d~~~~---------- 182 (527) T protein:vir:10 117 KRWTEIRGDYVLLLIGD--DEKDEGSRLSLHEVDPSTYFPYE--DPRYPGQVLGVYLVDEYPHPDSEKKN---------- 182 (527) T ss_pred HHhhhhhcceeEEEeec--cCCCcCCCceEeecCcceeeeee--cCCCCCceeeEEEeeeccCCcccccc---------- Confidence 99988888554444322 23345679999999999988773 22222222222221 12111111100 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec-ccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc-ccccceee Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR-PSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG-QARDVYTP 307 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~-~~~~~~~p 307 (661) .+++|+ +...+. .+...-.+.-++.|-+..+++|..-+. -.+-...... ......+- T Consensus 183 -------~~~ar~-----------~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~---~e~p~~~~~~~~~~~~~~l 241 (527) T protein:vir:10 183 -------EKCARV-----------QKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR---PESPLEPDDIKKLSTLTEE 241 (527) T ss_pred -------ceehhh-----------hhhhhhcCcccccccCcceeeeeceeeccccccc---cccccchhhhhhhcCceee Confidence 000000 000000 000000112223333333333221000 0000000000 00111222 Q ss_pred ccCCcccceeeEEEEe-cCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC----CceeEecc Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFG-SMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD----ASEYHIGP 382 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~-~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~----~~~l~iGs 382 (661) ...-++|++||+|.+. ........+.+-|.++-.+--+.=++.||++-|+-+++.|+.+++|+...+ ..++.||+ T Consensus 242 ~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP 321 (527) T protein:vir:10 242 EPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP 321 (527) T ss_pred ecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC Confidence 2345789999999663 333445567888999999999999999999999999999999999985332 35689999 Q ss_pred cceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccc----cCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 383 GRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGM----SKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 383 ~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~----~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) +..|.+|. ++++..|. ....++.+++.|+.+.++|... +++=... ..+.+.|+.+..+..+.-.+..+....- T Consensus 322 G~iweL~e-~ak~~~v~-~~~~la~~~~h~~~L~~~l~~v-A~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~ 398 (527) T protein:vir:10 322 LGMVEHGQ-NNKIYRVN-GVASLEPSQTHMTKAEEAMQQT-KGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELE 398 (527) T ss_pred ceeEecCC-Ccceeecc-chhhhHHHHHHHHHHHHHHHHh-hcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHH Confidence 99999995 78998877 3457888999999999988765 3331111 1134678888888876665555544444 Q ss_pred HHHHHHHHHH-HHHHH----cCCCCCCc---ceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhc-CCC Q lcl|NC_019406. 459 LEDGMTSVVR-YWLMF----RDIPLTDT---ATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKN-GII 529 (661) Q Consensus 459 le~Al~~aL~-~~A~w----~G~~~~~~---~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~-gvl 529 (661) +...+.+.+. |+-+| .|+...+. -.+++... +..+.+ ..+.+.++..++++|.||+++.+++|.+. |+- T Consensus 399 ~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~-p~lP~D-~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~e 476 (527) T protein:vir:10 399 LKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR-DPKPVN-SEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFE 476 (527) T ss_pred HHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec-ccCCCC-HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC Confidence 5555554443 33344 34332221 12233322 222222 34679999999999999999999999886 555 Q ss_pred CccCCHHHHHHHHhccCC----CCCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 530 PSTQTLEEFTIKMNDPKS----FIGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 530 ~~~~~~Eee~~~l~~~~~----~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) +++...+++.++...+.. +.+.-.|+++-+|.-...+. +-+.+.=+- T Consensus 477 D~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~----d~~~~~~~~ 527 (527) T protein:vir:10 477 LTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEED----DQALNGQPL 527 (527) T ss_pred ChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCc----ccccCCCCC Confidence 556666666666654432 23333444443332222110 001110000 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.75 E-value=6.6e-17 Score=109.30 Aligned_cols=492 Identities=12% Similarity=0.027 Sum_probs=256.0 Q ss_pred CC----------CCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHH Q lcl|NC_019406. 1 MA----------GLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYA 70 (661) Q Consensus 1 ~~----------~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~ 70 (661) |+ -+-+++.||- +.+.+.=.+.+..+++..|.|.|.. ++-=++...++ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p-----------~~v~~~d~~Rl~aY~l~~~~y~n~~-----~~~~~~lrg~~------ 58 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFP-----------NAVTDFDKARLASYRLYEDMYLTNT-----SDYQVILRGGD------ 58 (527) T ss_pred CCccccccCCCcCcCCccccCc-----------ccCCHHHHHHHHHHHHHHHHhcCch-----hheeeecCCcc------ Confidence 32 2245555552 1256666778889999999998841 11112222222 Q ss_pred HHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHH Q lcl|NC_019406. 71 NYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTV 150 (661) Q Consensus 71 ~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~ 150 (661) ++=+|-++.|.- +.++.++-.|. +|+ +++ ..++.-.. +- ..+..+ .+=++|+..+.+. T Consensus 59 ~~~~r~~~~ps~--------~~~~~~~~~~~-~~g-~~~---~~~~~~e~-v~---~~lr~~-----~~~e~l~~~~~~~ 116 (527) T protein:vir:10 59 EGDQRPIYVPNG--------EKLIEAKMRFL-GQG-LKW---EFSKKDAK-VD---DAIRVL-----FDRENWEQKFESL 116 (527) T ss_pred ccccceeeehhh--------HHhhCCcceee-ccC-ccc---cccchhHH-HH---HHHHHH-----HHHhhhHHHHHHH Confidence 111233333333 22333332231 111 111 11111000 01 112222 2347899999999 Q ss_pred HHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeee-eeeeeccccccccccceeeee Q lcl|NC_019406. 151 ALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLR-EFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 151 ~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ir-e~~~~~~~~~~~~~~~~i~~~ 229 (661) -+.++..|-...+|=.. +++..+.||-+..++|.++--|. +-+|.....-|-+. ++..-++.... T Consensus 117 ~r~~~vlGDg~f~l~wD--~~k~~~~R~~v~~~DP~~~f~~e--d~d~~~~v~~v~~~~~~~~P~d~~~~---------- 182 (527) T protein:vir:10 117 KRWTEIRGDYVLLLIGD--DEKDEGSRLSLHEVDPSTYFPYE--DPRYPGQVLGVYLVDEYPHPDSEKKN---------- 182 (527) T ss_pred HHhhhhhcceeEEEeec--cCCCcCCCceEeecCcceeeeee--cCCCCCceeeEEEeeeccCCcccccc---------- Confidence 99988888554444322 23345679999999999988773 22222222222221 12111111100 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec-ccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc-ccccceee Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR-PSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG-QARDVYTP 307 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~-~~~~~~~p 307 (661) .+++|+ +...+. .+...-.+.-++.|-+..+++|..-+. -.+-...... ......+- T Consensus 183 -------~~~ar~-----------~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~---~e~p~~~~~~~~~~~~~~l 241 (527) T protein:vir:10 183 -------EKCARV-----------QKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR---PESPLEPDDIKKLSTLTEE 241 (527) T ss_pred -------ceehhh-----------hhhhhhcCcccccccCcceeeeeceeeccccccc---cccccchhhhhhhcCceee Confidence 000000 000000 000000112223333333333221000 0000000000 00111222 Q ss_pred ccCCcccceeeEEEEe-cCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC----CceeEecc Q lcl|NC_019406. 308 MVRGRTLPFIPFVFFG-SMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD----ASEYHIGP 382 (661) Q Consensus 308 ~~~g~~L~~IPfv~~~-~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~----~~~l~iGs 382 (661) ...-++|++||+|.+. ........+.+-|.++-.+--+.=++.||++-|+-+++.|+.+++|+...+ ..++.||+ T Consensus 242 ~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP 321 (527) T protein:vir:10 242 EPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP 321 (527) T ss_pred ecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC Confidence 2345789999999663 333445567888999999999999999999999999999999999985332 35689999 Q ss_pred cceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhcccc----cCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 383 GRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPGM----SKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 383 ~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~~----~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) +..|.+|. ++++..|. ....++.+++.|+.+.++|... +++=... ..+.+.|+.+..+..+.-.+..+....- T Consensus 322 G~iweL~e-~ak~~~v~-~~~~la~~~~h~~~L~~~l~~v-A~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~ 398 (527) T protein:vir:10 322 LGMVEHGQ-NNKIYRVN-GVASLEPSQTHMNKAEEAMQQT-KGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELE 398 (527) T ss_pred ceeEecCC-Ccceeecc-chhhhHHHHHHHHHHHHHHHHh-hcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHH Confidence 99999995 78998877 3457888999999999988765 3331111 1134678888888876665555544444 Q ss_pred HHHHHHHHHH-HHHHH----cCCCCCCc---ceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhc-CCC Q lcl|NC_019406. 459 LEDGMTSVVR-YWLMF----RDIPLTDT---ATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKN-GII 529 (661) Q Consensus 459 le~Al~~aL~-~~A~w----~G~~~~~~---~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~-gvl 529 (661) +...+.+.+. |+-+| .|+...+. -.+++... +..+.+ ..+.+.++..++++|.||+++.+++|.+. |+- T Consensus 399 ~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~-p~lP~D-~~avie~v~tL~~aGiiS~etAv~~L~~~~g~e 476 (527) T protein:vir:10 399 LKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR-DPKPVN-NEKRFAQLLELWEAGLIPAKKLTEELSKIMGFE 476 (527) T ss_pred HHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec-ccCCCC-HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC Confidence 5555554443 33344 34332221 12233322 222222 34679999999999999999999999886 555 Q ss_pred CccCCHHHHHHHHhccCC----CCCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 530 PSTQTLEEFTIKMNDPKS----FIGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 530 ~~~~~~Eee~~~l~~~~~----~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) +++...+++.++...+.. +.+.-.|+++-+|.-...+. +-+.+.=+- T Consensus 477 D~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~----d~~~~~~~~ 527 (527) T protein:vir:10 477 LTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEED----DQALNGQPL 527 (527) T ss_pred chHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCc----ccccCCCCC Confidence 556666666666654432 23333444443332222110 001110000 No 68 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.75 E-value=2.7e-17 Score=111.40 Aligned_cols=391 Identities=9% Similarity=0.011 Sum_probs=220.3 Q ss_pred cccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccC Q lcl|NC_019406. 18 AQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRP 97 (661) Q Consensus 18 ~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~ 97 (661) =..+.|..--..+....+++.++.+-|.|...++ ||+. +-.+.++.+++ + ..|+.+-+|++++++++-.- T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchh-----hcch---hhhHHHHHHHh-h-hcChhHHHHHHhHhhccccc Confidence 2222355556677778889999999999975553 3432 22334444433 3 45999999999998776432 Q ss_pred ccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhccc Q lcl|NC_019406. 98 PVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAK 177 (661) Q Consensus 98 p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~r 177 (661) -+ ..|+ .++++. +=|+|+.....+.+.+|.||+|+++|- +.. ..+ T Consensus 71 f~-------------~~d~-----------~l~~i~-----~~N~ld~~~~~~~~~al~yG~sf~~v~-~~~-----dg~ 115 (409) T protein:vir:16 71 FE-------------NDDF-----------TVNEIF-----EENNPDIFFDSTVLSALIASCSFTYIS-KGE-----NDA 115 (409) T ss_pred cc-------------Ccch-----------HHHHHH-----HhcChhHHHHHHHHHHHHhCceeEEEe-cCC-----CCc Confidence 11 1111 123322 358999999999999999999999995 322 235 Q ss_pred ceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhe Q lcl|NC_019406. 178 SYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADAL 257 (661) Q Consensus 178 PY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~ 257 (661) |-+..++|.++.-.- +...++ ....+++ | T Consensus 116 ~~i~~~sP~~~~~i~-D~~~~~-~~~a~~~-----------------------------~-------------------- 144 (409) T protein:vir:16 116 VRLQVIEATNATGII-DPITGL-LTEGYAV-----------------------------L-------------------- 144 (409) T ss_pred eEEEEEcccceEEEe-eccccc-ceeeeEE-----------------------------E-------------------- Confidence 888888998765332 221111 0000000 0 Q ss_pred ecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCCCCccc--- Q lcl|NC_019406. 258 ARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNAADCEK--- 333 (661) Q Consensus 258 ~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~~~~~~--- 333 (661) ..+ ..+.. +.+ ..+.++. +++.++.+..+. .. -++++.||+|.|... +.+-..+. T Consensus 145 -~~d--~~~~~--~~~--~~~~~~~------~~~~~~~~~~~~----~~----~~~~g~vPvV~f~n~~~~~~~~G~seI 203 (409) T protein:vir:16 145 -ERD--ENNNV--VLE--AHFLPDR------TDYYYRDSRNNI----SI----ANPTGNPLLVPIIHRPDAVRPFGRSRI 203 (409) T ss_pred -Eec--CCCce--EEE--EEEecCc------EEEEEecCcccc----ce----ecCCCCcceEEecccccccccCCcccc Confidence 000 00100 111 1111111 111122221111 11 267899999977532 22211222 Q ss_pred -cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC--CceeEecccceeecCCC--CCcceEeecCchhHHHH Q lcl|NC_019406. 334 -PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD--ASEYHIGPGRVWVVDKE--SGIPGIIEFKGEGLKTL 408 (661) Q Consensus 334 -pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~--~~~l~iGs~~~~~lp~~--ga~~~ylE~~g~~i~a~ 408 (661) .|+.+|. =+.=+..++...+.++.++|+.|+.|+++.. .+.+..+.++.|.+|++ |..+++-|+++..+.-+ T Consensus 204 ~~~v~~l~---da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~ 280 (409) T protein:vir:16 204 TRSGMYWQ---SNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPF 280 (409) T ss_pred chhHHHHH---HHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHH Confidence 3455442 2334556677889999999999999996532 22356667788988853 45678889999999999 Q ss_pred HHHHHHHHHHHHHHhHHhccc-ccCccc-hhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc---ce Q lcl|NC_019406. 409 ERALNEKEQQIAAIGGRLMPG-MSKSVS-ESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDT---AT 483 (661) Q Consensus 409 ~~~L~~le~qM~~lGArll~~-~~~~~~-eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~---~~ 483 (661) .+.|+.+..++.....=.... +..+.| -||++.+............+-..+..++.++++++....|...... .. T Consensus 281 ~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~ 360 (409) T protein:vir:16 281 TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSK 360 (409) T ss_pred HHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhcc Confidence 999999998888653211110 001112 4666666543333333334444467777788888877776422111 12 Q ss_pred EEEEeccc--cccccCCHHHHHHHHHHHhcCC-CC-HHHHHHHHHhcCCCCcc Q lcl|NC_019406. 484 LRYEIDAT--FLTTALDARALRAIQQLYEGGL-LP-IDALYENFVKNGIIPST 532 (661) Q Consensus 484 ~~v~ln~D--F~~~~lda~~l~all~~~~aG~-Is-~et~~~eL~r~gvl~~~ 532 (661) +.+..-+- -....+ ++..+++.+++++|. +. .++.++ +.|+=.++ T Consensus 361 ~~v~W~~~~~~~~~s~-a~~aDa~~Kl~~a~~~~~~~~v~~~---~~g~~~~d 409 (409) T protein:vir:16 361 TKPKWEPLFEADASML-SLIGDGAIKLNQAIPEFINKDTIRD---LTGIKGAE 409 (409) T ss_pred ceEEecCCCCcchhhH-HHHHHHHHHHHhhcccccchhHHHH---hccCCCCC Confidence 33333221 122223 678999999999984 33 455533 34875555 No 69 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.73 E-value=2.3e-16 Score=106.34 Aligned_cols=464 Identities=9% Similarity=0.027 Sum_probs=224.5 Q ss_pred CCCccccccccccccccCC----------ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHH Q lcl|NC_019406. 4 LSPNSANIRRTKRGAQQFT----------HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYL 73 (661) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~----------V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl 73 (661) .|| |.+.+++--..- |..--..+....+++.++.+-|.|...++ |||. .-...|++.+ T Consensus 1 ~~~----~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~-----~~~~---~~p~~~~~~~ 68 (504) T protein:vir:99 1 MTE----ETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIR-----QIGN---LIPPEYLRTA 68 (504) T ss_pred CCc----cCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccch-----hccc---cccHHHHHHh Confidence 232 332222222211 11222335666777788888888876543 4443 2334455332 Q ss_pred hhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHH Q lcl|NC_019406. 74 DRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALE 153 (661) Q Consensus 74 ~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~ 153 (661) .-.|+.+-+|+.++.+++-.-..+. + + ....+.+..++ +=|+|+.....+... T Consensus 69 ---~v~n~~~~iVd~~a~rl~~~Gf~~~---d----------~---~~~~~~l~~i~--------~~N~ld~~~~~~~~~ 121 (504) T protein:vir:99 69 ---TVLGWSAKAVDTLARRCNLESFVWP---D----------G---DYGSIGGPDVW--------DENFFATKANNAMVS 121 (504) T ss_pred ---hccCcHHHHHHHHHhhhccceeeCC---C----------C---ChhhHHHHHHH--------HhcChhhHHHHHHHH Confidence 3469999999999887765433321 1 0 00111222333 347899999999999 Q ss_pred HHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhh Q lcl|NC_019406. 154 QVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSET 233 (661) Q Consensus 154 ~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~ 233 (661) ++.||+++++|- +..++ ..+|.+..++|+++.-- +++..++ +.. .+..+ T Consensus 122 a~iyG~af~~v~-~~~d~---~~~~~I~~~sP~~~~~i-yD~~~~~--~~~--------------------a~~~~---- 170 (504) T protein:vir:99 122 SLIHGPAFLINT-EGGAG---EPDSLIHVKSAMQATGE-WNSRRNA--MDS--------------------LLSIT---- 170 (504) T ss_pred HHhhCceeEEEe-cCCCC---CceeEEEEeccceeEEE-EeCCCCc--eeE--------------------EEEEE---- Confidence 999999999993 32221 23566777888875311 1111111 000 00000 Q ss_pred hhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcc Q lcl|NC_019406. 234 AQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRT 313 (661) Q Consensus 234 vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~ 313 (661) .. +. .+ ...++.++..+ .+|.+. .. ..+.+..+ ...++ T Consensus 171 ------------------------~~-d~--~g----~~~~~~~y~~~----~~~~~~--~~-~~~~~~~~----~~~~~ 208 (504) T protein:vir:99 171 ------------------------SR-DA--EG----HPTGIALYEDG----VTVTAD--MD-DDGDWHAD----VRTHK 208 (504) T ss_pred ------------------------Ee-cC--CC----eEEEEEEEcCC----cEEEEE--Ec-CCceeeec----cccCC Confidence 00 00 00 01111122221 112221 11 11122111 22344 Q ss_pred cceeeEEEEecC-CCCCCccc----cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC-----Cc---eeEe Q lcl|NC_019406. 314 LPFIPFVFFGSM-SNAADCEK----PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD-----AS---EYHI 380 (661) Q Consensus 314 L~~IPfv~~~~~-~~~~~~~~----pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~-----~~---~l~i 380 (661) ++ ||+|.|-.. +.+...+. .|+.+|.+ +.=+..++...+.++.++|++++.|++..+ .+ .+.. T Consensus 209 ~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~D---a~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~ 284 (504) T protein:vir:99 209 LG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQ---RALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQI 284 (504) T ss_pred CC-cceEEecccccCccccCcccchhhHHHHHH---HHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhh Confidence 55 677765311 11211122 24444332 233455677889999999999999986532 11 2334 Q ss_pred cccceeecCCC-------CCcceEeecCchhHHHHHHHHHHHHHHHHHHhHH---hcccccCccchhHHHHHHHHHHhhH Q lcl|NC_019406. 381 GPGRVWVVDKE-------SGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGR---LMPGMSKSVSESDNQSALREANEQS 450 (661) Q Consensus 381 Gs~~~~~lp~~-------ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGAr---ll~~~~~~~~eTataa~~d~~~~~S 450 (661) ..++.|.+|++ +.++++-++++..++.+.+.|+.+..++.....= -+-..+..++.||++.+........ T Consensus 285 ~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ 364 (504) T protein:vir:99 285 ALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIA 364 (504) T ss_pred hhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHH Confidence 44566777642 2457788999999999999999998888754321 1100111245688888887777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCCC---CcceEEEEeccccccccCCHHHHHHHHHHHhcCC--CC-HHHHHHHHH Q lcl|NC_019406. 451 LLLNVIMALEDGMTSVVRYWLMFRDIPLT---DTATLRYEIDATFLTTALDARALRAIQQLYEGGL--LP-IDALYENFV 524 (661) Q Consensus 451 ~L~~~A~~le~Al~~aL~~~A~w~G~~~~---~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~--Is-~et~~~eL~ 524 (661) ....+-..+..++.++++++....+.... +...+.+.. +|-.+..+ ++..+++.+++++|. ++ .+++++.| T Consensus 365 ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w-~d~~~~s~-a~~aDa~~Kl~~ag~~l~~~~~~l~~~l- 441 (504) T protein:vir:99 365 EAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKF-RSPLYLSK-AAQADAGAKMLGAGPEWLKETEVGLELL- 441 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEe-cCCCccCH-HHHHHHHHHHHhhccccccchHHHHhhc- Confidence 77777788888899999988887763221 112223322 23333332 568899999999986 33 35554333 Q ss_pred hcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHH-hccCCCchhHHHhhh Q lcl|NC_019406. 525 KNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAE-RHLEIDEEKLRISAK 603 (661) Q Consensus 525 r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e-~~~~~~~~~~~~~~~ 603 (661) |+-+++. +..+++.+.+. ..+.-++ +....... ..-++..+....+.- ..+ .+++.. T Consensus 442 --g~~~~ei--~r~~~e~~~~~-~~~~~~~--l~~~~~~~-~~~~~~~~~~~~e~a-----~~~~~~~~~~--------- 499 (504) T protein:vir:99 442 --GLTPQQA--KRALAERRRAS-SVSIIEA--LNRRQQEA-ATAGEDQDQGAGEPP-----ANEPPAALGR--------- 499 (504) T ss_pred --CCCHHHH--HHHHHHHHHHh-hHHHHHH--HhcccCCC-CCCCCCCCcCCCCCC-----CCCCCccCCC--------- Confidence 6633322 11111111110 0110000 11100000 000000000000000 000 111111 Q ss_pred hhhhhhhHHHhcC Q lcl|NC_019406. 604 VGSTSVAASRKLG 616 (661) Q Consensus 604 ~~~~~~~~~~~~~ 616 (661) =++. | T Consensus 500 ---p~~~-----~ 504 (504) T protein:vir:99 500 ---PTLV-----G 504 (504) T ss_pred ---cccC-----C Confidence 1111 1 No 70 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.68 E-value=2.3e-14 Score=95.35 Aligned_cols=444 Identities=12% Similarity=0.094 Sum_probs=212.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchH-HHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGER-EIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~-~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) +.|. ++.++......+|.. |+++......| +..|.|.. -++.....+-+++. +. |-.-. T Consensus 15 ~~~~------~~~~~~~~~~~~~~~-~~~~~~~i~~~---~~yy~g~~~~~~~~~~~~~~~~~-------~~---~~~~~ 74 (496) T protein:vir:38 15 RMGL------LKALKDVKDHKKVNA-NDEDYKYIDMW---KRLYQGHYAEWHNLNYEHNGNPV-------NR---RQLSM 74 (496) T ss_pred Hhcc------chhhHHHHhcCCCcC-CHHHHHHHHHH---HHHhcCCCchhhcchhccCCCcc-------cc---ceeec Confidence 2221 233333434444443 77777776777 46788853 33332221211111 11 11225 Q ss_pred chHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCC Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGR 159 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr 159 (661) |+.+.+++.+++++|.+||+|+ +.+ ....++++++ ++.++++.-++.++..++.+|. T Consensus 75 n~~k~i~~~~a~~l~~~p~~i~-~~d---------------------~~~~e~l~~~-~~~n~f~~~~~~~~~~a~~~G~ 131 (496) T protein:vir:38 75 NLPKVTAKYMSKLLFNEKVKIN-IDD---------------------KAAEEFVLNV-LKTNGFTKNMERYIEYGEAMGG 131 (496) T ss_pred chHHHHHHHHhhhhhCCcceEe-eCC---------------------hHHHHHHHHH-HhccCHHHHHHHHHHHHhhhCc Confidence 9999999999999999999985 211 0112222222 2346899999999999999999 Q ss_pred EEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 160 FGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 160 ~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) +++.|=+. . ..+|.+..++|++++-...+. + .++.+.+.+....++. .|. T Consensus 132 ~~~~~~~D-~-----~~~~~i~~v~~~~~~P~~~~~-~---~~~~~~f~~~~~~~~~-------~y~------------- 181 (496) T protein:vir:38 132 FVIKVYHD-G-----NKNVKVSFATADCMYPLSNDS-E---NVDECVIANSFHKNNK-------YYT------------- 181 (496) T ss_pred EEEEEEEc-C-----CCcEEEEEEcccceEEEEecC-C---cEEEEEEEEEEEeCCe-------EEE------------- Confidence 99887332 1 245788899999987543321 1 2444444332221110 000 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccc---------cceeeccC Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQAR---------DVYTPMVR 310 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~---------~~~~p~~~ 310 (661) ++ -.....++.++.+..+|+.......+ +...+... T Consensus 182 ----------------------------------~l-e~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~ 226 (496) T protein:vir:38 182 ----------------------------------LL-EWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVP 226 (496) T ss_pred ----------------------------------EE-EEEEEeCceEEEEEEEEecCCccccCcccccccccccccccee Confidence 00 00001112222222222221111111 11111100 Q ss_pred CcccceeeEEEEecC--CCC---CCcccc------chhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeE Q lcl|NC_019406. 311 GRTLPFIPFVFFGSM--SNA---ADCEKP------PLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYH 379 (661) Q Consensus 311 g~~L~~IPfv~~~~~--~~~---~~~~~p------PLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~ 379 (661) =+.++..||+++... ++. -..+.+ +|+|-.+..++.|. .+++.+-+..-+|.-++....+..+.... T Consensus 227 ~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~--~~~~~~~~~i~v~~~~l~~~~~~~g~~~~ 304 (496) T protein:vir:38 227 LPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYY--QEFKLGKKKVLVPSSFVKTAVNLDGSTTQ 304 (496) T ss_pred ecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHH--HHHhhcccceecchHHhhccCCCCCcccc Confidence 012455666665321 111 111233 33333333333332 23333333333333333222222211100 Q ss_pred --ecccc---eeecCCCCCcceEeecCch-hHHHHHHHHHHHHHHHHHH---hHHhcccccCccchhHHHHHHHHHHhhH Q lcl|NC_019406. 380 --IGPGR---VWVVDKESGIPGIIEFKGE-GLKTLERALNEKEQQIAAI---GGRLMPGMSKSVSESDNQSALREANEQS 450 (661) Q Consensus 380 --iGs~~---~~~lp~~ga~~~ylE~~g~-~i~a~~~~L~~le~qM~~l---GArll~~~~~~~~eTataa~~d~~~~~S 450 (661) ..... ++.....++..++-.+++. ..+.+...++.+.+++... +...+.. ..++..||++.........+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~-~~~g~~tAtei~~~~~~l~~ 383 (496) T protein:vir:38 305 YFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF-DENGLKTATEVVSEKSETYQ 383 (496) T ss_pred CCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCC-CccccchHHHHHHHHHHHHH Confidence 00001 1111111222334334433 1255566666666655432 2223321 24567789998888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHc-------CCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHH Q lcl|NC_019406. 451 LLLNVIMALEDGMTSVVRYWLMFR-------DIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENF 523 (661) Q Consensus 451 ~L~~~A~~le~Al~~aL~~~A~w~-------G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL 523 (661) ....+...++.+|.++++.+..+. |.. .+..++.|..+.. .+.. ..+.++.+.+++.+|.||++|++..+ T Consensus 384 ~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~-~~~~~i~v~f~d~-i~~d-~~~~~~~~~~~~~~GiiS~et~l~~~ 460 (496) T protein:vir:38 384 TKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEV-VELDTITVDFDDS-IAQD-EDTTINRYTNAKNQGMIPLKIALQRA 460 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-CCccceEEEeCCC-CCCC-HHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 888889999999999877765432 222 2334555555532 1221 24578999999999999999985422 Q ss_pred HhcCCCCccCCHHHHHHHHhccCCC-CCCchhhhhhcCCcc Q lcl|NC_019406. 524 VKNGIIPSTQTLEEFTIKMNDPKSF-IGQPDAIAMRRGYVS 563 (661) Q Consensus 524 ~r~gvl~~~~~~Eee~~~l~~~~~~-l~~ddae~~~~g~~~ 563 (661) -++ ++.+.+++.++|+++... .+.+|.. -..|.++ T Consensus 461 --~~~--~d~ea~~el~ri~~E~~~~~~~~d~~-~~~~~~e 496 (496) T protein:vir:38 461 --WNI--TEAEADEWAEMLAKEKQAEMPNNDMN-GIFGEEE 496 (496) T ss_pred --CCC--ChHHHHHHHHHHHHhhhccCcccccc-CCCCCCC Confidence 243 223345677788765431 1211111 1112111 No 71 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.63 E-value=1.8e-13 Score=90.50 Aligned_cols=449 Identities=13% Similarity=0.108 Sum_probs=210.3 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcch-HHHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGE-REIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~-~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) +.|. ++.++.....-+|. .|+++......|+ ..|.|. ..+......+-+++.. + +-.=. T Consensus 15 ~~~~------~~~~~~~~~~~~i~-~~~~~~~~i~~~~---~~Y~g~~~~~~~~~~~~~~~~~~------~----~~~s~ 74 (499) T protein:vir:80 15 RMGL------LKSLKDVTDHKKVN-ANDEDYKYIDMWK---RLYQGNYAEWHNLNYEHNGNPVN------R----RQLSM 74 (499) T ss_pred Hhcc------ccchhhhhcCCCCc-CCHHHHHHHHHHH---HHhcCCcchhhccccccCCCccc------c----ceeec Confidence 2222 23333344444555 4888888888886 567775 3443332211111111 1 11225 Q ss_pred chHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCC Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGR 159 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr 159 (661) |+.+.+++.+++++|.+||+|+ +.+. ...++++++ ++-+++...++..+..++.+|. T Consensus 75 n~~~~iv~~~a~~l~~ep~~i~-~~d~---------------------~~~e~l~~~-~~~n~f~~~~~~~~~~a~~~G~ 131 (499) T protein:vir:80 75 NLPKVTAKYMSKLLFNEKVKIN-IDDE---------------------TAEEFVLNV-LKTNGFTKNMERYIEYGEAMGG 131 (499) T ss_pred chHHHHHHHHHHhhhCCcceEe-eCCH---------------------HHHHHHHHH-HhhccHHHHHHHHHHHHhhcCc Confidence 9999999999999999999985 2221 111222222 2346789999999999999999 Q ss_pred EEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 160 FGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 160 ~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) +++.|=+.. ..+|-+..++|.+++--.++ . | .++.+.+-+.....+. .|. T Consensus 132 ~~~~~~~D~------~~~~~i~~v~a~~~~Pi~~d-~-~--~~~~~~f~~~~~~~~~-------~y~------------- 181 (499) T protein:vir:80 132 FVIKVYHDG------NKNVKVSFATADCMYPLSND-S-E--NVDECLIANSFHKNNK-------YYK------------- 181 (499) T ss_pred EEEEEEECC------CCcEEEEEEcCCceEEEEec-C-C--CeEEEEEEEEEeecCe-------EEE------------- Confidence 888775432 23577888899987642222 1 1 2444444332221100 000 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccc---------eeeccC Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDV---------YTPMVR 310 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~---------~~p~~~ 310 (661) +.-++ .+..+.++.++.+..+|+.......+.. ..|... T Consensus 182 ------------------------------~lE~h--~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~ 229 (499) T protein:vir:80 182 ------------------------------LLEWN--EWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVP 229 (499) T ss_pred ------------------------------EEEEE--EecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCcee Confidence 00000 0111222222222233332221111111 111110 Q ss_pred CcccceeeEEEEecC--CC---CCCccccchhHH----HHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCce--eE Q lcl|NC_019406. 311 GRTLPFIPFVFFGSM--SN---AADCEKPPLLDI----VELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASE--YH 379 (661) Q Consensus 311 g~~L~~IPfv~~~~~--~~---~~~~~~pPLldL----A~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~--l~ 379 (661) =..++..||+++... ++ +...+.+-|.++ -.+|...-+..-+++.+-...-+|.-++....+..+.. .- T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 309 (499) T protein:vir:80 230 LPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYF 309 (499) T ss_pred ecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCC Confidence 012566777766322 11 111133333322 23343333333344332221122211121111111110 00 Q ss_pred ecccc---eeecCCCCCcceEeecCchh-HHHHHHHHHHHHHHHHH---HhHHhcccccCccchhHHHHHHHHHHhhHHH Q lcl|NC_019406. 380 IGPGR---VWVVDKESGIPGIIEFKGEG-LKTLERALNEKEQQIAA---IGGRLMPGMSKSVSESDNQSALREANEQSLL 452 (661) Q Consensus 380 iGs~~---~~~lp~~ga~~~ylE~~g~~-i~a~~~~L~~le~qM~~---lGArll~~~~~~~~eTataa~~d~~~~~S~L 452 (661) -.... ++....+++..++-.+++.- .+.+.+.|+.+.+++.. ++...+.. ...+.+||++.....+...... T Consensus 310 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~-~~~g~~TAtei~s~~~~l~~~~ 388 (499) T protein:vir:80 310 DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF-DENGLKTATEVVSEKSETYQTK 388 (499) T ss_pred CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCC-CcccchhHHHHHHHHHHHHHHH Confidence 00111 11111122222333333322 14445555555444432 22222322 2346789999988888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc---CCC---CCCcceEEEEeccccccccCC-HHHHHHHHHHHhcCCCCHHHHHHHHHh Q lcl|NC_019406. 453 LNVIMALEDGMTSVVRYWLMFR---DIP---LTDTATLRYEIDATFLTTALD-ARALRAIQQLYEGGLLPIDALYENFVK 525 (661) Q Consensus 453 ~~~A~~le~Al~~aL~~~A~w~---G~~---~~~~~~~~v~ln~DF~~~~ld-a~~l~all~~~~aG~Is~et~~~eL~r 525 (661) ..+...++.+|.++++.+.+|. +.- ..+...+.|..+..- ..| .++++.+.+++.+|.||+++++.. . T Consensus 389 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i---~~d~~~~~~~~~~~~~~Gi~S~et~l~~--~ 463 (499) T protein:vir:80 389 NSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSI---AQDEDTTINRYTTAKNQGMIPLKIALQR--A 463 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCC---CCCHHHHHHHHHHHHHcCCCCHHHHHhh--c Confidence 8888999999999877776652 111 112334555443321 223 457888999999999999998532 2 Q ss_pred cCCCCccCCHHHHHHHHhccCC-CCCCchhhhhhcCCccccCCCc Q lcl|NC_019406. 526 NGIIPSTQTLEEFTIKMNDPKS-FIGQPDAIAMRRGYVSRQQELD 569 (661) Q Consensus 526 ~gvl~~~~~~Eee~~~l~~~~~-~l~~ddae~~~~g~~~~~~~~~ 569 (661) -|+ ++.+.+++.++|+++.. .++.+|-. | .+.+-| T Consensus 464 ~~~--~d~ea~~el~~i~~E~~~~~~~~d~~----g---~~ge~e 499 (499) T protein:vir:80 464 WNI--TEAEADEWAEMLAKEKQAEIPNNDMT----G---IFGEEE 499 (499) T ss_pred CCC--ChHHHHHHHHHHHHHhhcCCCCCCcc----c---cCCCCC Confidence 244 33334567777775533 22222211 1 011111 No 72 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.63 E-value=1.3e-14 Score=96.65 Aligned_cols=443 Identities=9% Similarity=0.008 Sum_probs=223.8 Q ss_pred CCCc-cccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchH Q lcl|NC_019406. 4 LSPN-SANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMT 82 (661) Q Consensus 4 ~~~~-~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~ 82 (661) .-|- -++|.-+... ..-.+..---.+....+++..+.+-|.|...+ .||| ..-...|+.+. ...|+. T Consensus 1 ~~~~~~~~~~gl~~~-~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~-----~~~~---~~~p~~~r~~~---~v~nw~ 68 (474) T protein:vir:81 1 MIQQQTVRIPSLSND-ENALINGLLAQIENLRWKNLLRTSYYENKRTI-----QYVG---TLIPPQYFNLG---LVLGWT 68 (474) T ss_pred CcCCCcCcCCCCChh-HHHHHHHHHHHHHHHhhHHHHHHHHhccCCCh-----hhcc---ccccHHHHHHH---hhcChH Confidence 1111 1122200000 01112223345666677788888888887544 3444 33345566442 257999 Q ss_pred HHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEE Q lcl|NC_019406. 83 SQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGA 162 (661) Q Consensus 83 ~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gv 162 (661) +-.|+.++.++.-.--.+. + +. .....+..+| +=|+|+.+...+.+.+|.||++++ T Consensus 69 ~~~Vd~~a~rl~~~Gf~~~---d----------~~---~~~~~l~~iw--------~~N~ld~~~~~~~~~al~~G~sf~ 124 (474) T protein:vir:81 69 GKAVDALARRCNLEGFVWP---D----------GD---LDSLGGTEVV--------DDNHLLSEIDSAIVAAMQHGPAFL 124 (474) T ss_pred HHHHHHHHhhhcccceECC---C----------CC---ccchHHHHHH--------HhcChhHHHHHHHHHHHhhCceeE Confidence 9999999877665433221 1 00 0111223333 357999999999999999999999 Q ss_pred EEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhh Q lcl|NC_019406. 163 LVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRR 242 (661) Q Consensus 163 LVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~ 242 (661) +|=.... ...+|-+..++|.+++-- ++...++-..-+.++ T Consensus 125 ~V~~~~d----~~~~~~i~~~sp~~~~~~-~D~~~~~~~~al~~~----------------------------------- 164 (474) T protein:vir:81 125 INTVGED----DEPEALIHVKDASEATGE-WNRRRRGLNNLLSII----------------------------------- 164 (474) T ss_pred EEecCCC----CCceeEEEEeccceEEEE-EeCCCCcceeeeEEE----------------------------------- Confidence 9964422 123567778888876421 122222100000000 Q ss_pred cchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEE Q lcl|NC_019406. 243 AGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFF 322 (661) Q Consensus 243 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~ 322 (661) . ....+.. .+..++.++. ++.+ .+.+..+.+. .....++++ ||+|.+ T Consensus 165 ---------------~--~~~~g~~-----~~~~ly~~~~----~~~~--~~~~~~~~w~----~~~~~~~~g-vPvV~~ 211 (474) T protein:vir:81 165 ---------------D--KDKEGKV-----LSLALYLDNE----TVTA--QRDKATLKWQ----VDRDEHVYG-VPAQVL 211 (474) T ss_pred ---------------E--EcCCCcE-----EEEEEEeCCc----EEEE--EEcCccceee----eccCCCCCC-cceEEe Confidence 0 0000000 0001111111 1111 1212111111 122235566 677765 Q ss_pred ecC-CCCCCccc----cchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC-----Cc---eeEecccceeecC Q lcl|NC_019406. 323 GSM-SNAADCEK----PPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD-----AS---EYHIGPGRVWVVD 389 (661) Q Consensus 323 ~~~-~~~~~~~~----pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~-----~~---~l~iGs~~~~~lp 389 (661) ... +.+-..+. .|+++|.+ +.=+..++...+.++.++|+.|+.|++..+ .+ .+.....+.|.+| T Consensus 212 ~n~~~~~~~~G~s~i~e~v~~l~d---a~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~ 288 (474) T protein:vir:81 212 PYKPAPKRPFGQSRITKPMMGLQD---AGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLP 288 (474) T ss_pred cccccccCcCCccccchhHHHHHH---HHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCC Confidence 322 22211222 35555432 333455677889999999999999987533 11 1222334566666 Q ss_pred CCC-------CcceEeecCchhHHHHHHHHHHHHHHHHHHhHHhccc---ccCccchhHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_019406. 390 KES-------GIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRLMPG---MSKSVSESDNQSALREANEQSLLLNVIMAL 459 (661) Q Consensus 390 ~~g-------a~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArll~~---~~~~~~eTataa~~d~~~~~S~L~~~A~~l 459 (661) ++. ..+++-|+++..++-+.+.|+.+..++.....=.... .+-..+.||++..............+-..+ T Consensus 289 ~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~f 368 (474) T protein:vir:81 289 DDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDF 368 (474) T ss_pred CcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 421 2356788999999999999998888887543211110 111223578888877666666667777778 Q ss_pred HHHHHHHHHHHHHHcCCCCCCc-----ceEEEEeccccccccCCHHHHHHHHHHHhcCC--CCHHHHHHHHHhcCCCCcc Q lcl|NC_019406. 460 EDGMTSVVRYWLMFRDIPLTDT-----ATLRYEIDATFLTTALDARALRAIQQLYEGGL--LPIDALYENFVKNGIIPST 532 (661) Q Consensus 460 e~Al~~aL~~~A~w~G~~~~~~-----~~~~v~ln~DF~~~~lda~~l~all~~~~aG~--Is~et~~~eL~r~gvl~~~ 532 (661) ..++.+++++++...|....+. ..+.+.. +|-.... .++..+++.++.++|. ++++++++. -|+ T Consensus 369 g~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W-~d~~~~s-~a~~aDa~~Kl~~a~~~~~~~~~~~~~---lg~---- 439 (474) T protein:vir:81 369 TPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKW-RDPRYLS-KSAQADAGMKQLAAVPWLAETEVGLEL---IGL---- 439 (474) T ss_pred HHHHHHHHHHHHHHhCCCCccccchhhccceeEe-cCCCccC-HHHHHHHHHHHHhcccCCCcHHHHHhh---cCC---- Confidence 8999999999999887533221 1223322 2333333 3778999999999873 445555433 254 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchh Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQR 572 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~ 572 (661) +.++++....+.. ...+....++...+..+-...| T Consensus 440 -t~~~i~~~~~~~~----~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 440 -TPQQARRAMADKR----RVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred -CHHHHHHHHHHHH----HHhHHHHHHHHHhcCCCCCCCC Confidence 3333322111100 0011111111111111000000 No 73 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.52 E-value=3.1e-12 Score=83.69 Aligned_cols=442 Identities=11% Similarity=0.100 Sum_probs=204.2 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) .-|..++--||. .-.+|. ..|+|......|..+ |.|...+. .|-+ . +...+.|+. .=.| T Consensus 17 ~~~~~~~~~~~~------~~~~i~-~~~~~~~ri~~~~~~---y~g~~~~~----~~~~---~--~~~~~~~~~--~sln 75 (508) T protein:vir:15 17 ATGVTGSLSKIT------DDPRIS-IDPDEYVRIQTDLDY---YSDKLQYI----HYQA---S--DGIKKKRLK--NTIN 75 (508) T ss_pred HhccccchHHhh------cccccc-cCHHHHHHHHHHHHH---hcCCCccc----cccc---C--CCCccccce--eecc Confidence 234444322332 233453 367777777777554 77753221 1111 1 111112221 1238 Q ss_pred hHHHHHHHHhchhhccCccccccch--hhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPN--TGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~--~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) +.+.+++.+++++|.++|+++ +++ ... ..+..++ +-+++...++..+..++..| T Consensus 76 ~~~~i~~~~A~lv~~e~~~i~-v~~~~~~~------------------e~l~~il-----~~n~f~~~~~~~~e~a~a~G 131 (508) T protein:vir:15 76 MAKTAARRIASVVFNEKAEIH-VKDNNEAD------------------KFLNDVL-----EDNDFKNKFEEALEKGVALG 131 (508) T ss_pred hHHHHHHHHHhhhhCCCceEE-eCCchHHH------------------HHHHHHH-----HhccHHHHHHHHHHHHhhcC Confidence 999999999999999999985 321 111 1222232 34678888999999999999 Q ss_pred CEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 159 RFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 159 r~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) .+++-+=+. ..++-+..++|.+++-..++. ++ ....+.+.+....+.+ +.. T Consensus 132 ~~~~k~~~d-------~~~~~i~~v~ad~~~P~~~d~-~~--~~~~af~~~~~~~~~~----~~~--------------- 182 (508) T protein:vir:15 132 GFAMRPYID-------GNHIKIAWVRADQFYPLQSNT-ND--ISEAAIASRTQRTESN----QTK--------------- 182 (508) T ss_pred ceEEEEEEe-------CCeeEEEEEcCCeeEEEEEcC-CC--eEEEEEEEEEEeecCC----Cce--------------- Confidence 777654332 123556677888875433321 21 1122222222221110 000 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEe-ecccccceEEEEEEEecCcccccccce-----------e Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELIL-ELQKDGSRVYKQFVYVEDPLGQARDVY-----------T 306 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l-~~g~~g~~~~~~~~~~~~~~~~~~~~~-----------~ 306 (661) +|+.+-. +.+.+|.++.+..+|+.......+..+ . T Consensus 183 ---------------------------------~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~ 229 (508) T protein:vir:15 183 ---------------------------------YYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELA 229 (508) T ss_pred ---------------------------------EEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCC Confidence 1111100 111222333334444432211111111 1 Q ss_pred eccCCcccceeeEEEEecC--CCC---CCccccchhH----HHHHHHHHHhhhhhHHHHHHHhcCceeEEecCC--CCCC Q lcl|NC_019406. 307 PMVRGRTLPFIPFVFFGSM--SNA---ADCEKPPLLD----IVELNLKHYRTYAELEHGRFFTALPTYYAPELD--DSDA 375 (661) Q Consensus 307 p~~~g~~L~~IPfv~~~~~--~~~---~~~~~pPLld----LA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~--~~~~ 375 (661) |.+.=+.++..||+.+-.. ++. -..+.+-|.+ |-.+|..+-+ +.+.+......+.+-..+- ++.. T Consensus 230 ~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~----~~~e~~~~~~~i~v~~~~l~~d~~~ 305 (508) T protein:vir:15 230 PQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQ----FIWEIRLGQKHIAVQPGMLRFDDEH 305 (508) T ss_pred cceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHH----HHHHHHhcccceeechHHhcCCCCC Confidence 1100012444566555321 111 1123333332 2334444444 4444433333333311111 1111 Q ss_pred ce-eEeccccee---ecCCC-CCcceEeecCchhHHHHHHHHHHHHHHHHH---HhHHhcccccCccchhHHHHHHHHHH Q lcl|NC_019406. 376 SE-YHIGPGRVW---VVDKE-SGIPGIIEFKGEGLKTLERALNEKEQQIAA---IGGRLMPGMSKSVSESDNQSALREAN 447 (661) Q Consensus 376 ~~-l~iGs~~~~---~lp~~-ga~~~ylE~~g~~i~a~~~~L~~le~qM~~---lGArll~~~~~~~~eTataa~~d~~~ 447 (661) .+ +-.+ ...+ ..+.. +..+..+.|.= ..+.+.+.++.+.+++.. ++...+.. ...+.+|||+...+.+. T Consensus 306 ~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~i-r~e~~~~~~~~~l~~~~~~~gls~~~f~~-~~~~~~TAtei~s~~~~ 382 (508) T protein:vir:15 306 KPTFDTE-QNVYVGVLSDDNNGLGVKDMTTPI-RTVQYKDAIDHFIKEFEVQIGLSTGTFSY-SNDGVKTATEVVSNNSM 382 (508) T ss_pred ccccCCC-CeeEEeccCCCCCCCceeEeeccc-ChHHHHHHHHHHHHHHHHHhCCCchhccc-ccCccccHHHHHHHHHH Confidence 11 1111 1111 11111 22343333220 223445555555544332 22222211 23456899999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHc---CCCCC-----------CcceEEEEecccccccc-CC-HHHHHHHHHHHhc Q lcl|NC_019406. 448 EQSLLLNVIMALEDGMTSVVRYWLMFR---DIPLT-----------DTATLRYEIDATFLTTA-LD-ARALRAIQQLYEG 511 (661) Q Consensus 448 ~~S~L~~~A~~le~Al~~aL~~~A~w~---G~~~~-----------~~~~~~v~ln~DF~~~~-ld-a~~l~all~~~~a 511 (661) .......+...++.||.++++.+.++. +.-.. ...++.|. |...- .| .++++.+.+++.+ T Consensus 383 ~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~----f~D~i~~d~~~~~~~~~~~v~a 458 (508) T protein:vir:15 383 TYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECH----FDDGVFVNKDKQLEEDAKVLAI 458 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEE----eCCCCCCCHHHHHHHHHHHHhc Confidence 999999999999999999877755543 22111 11233333 33322 23 3478899999999 Q ss_pred CCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchh-hhhhcCCccc Q lcl|NC_019406. 512 GLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDA-IAMRRGYVSR 564 (661) Q Consensus 512 G~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~dda-e~~~~g~~~~ 564 (661) |.+|+++++.. .-|+ ++.+.+++.++|+++.+.....+. .-..+|.-.+ T Consensus 459 Gi~s~e~~i~~--~~g~--~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 459 GALSKQTFLQR--NYGM--TDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred CCCCHHHHHHh--cCCC--ChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 99999988632 2343 222235677788777554332221 1111111111 No 74 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.50 E-value=1.1e-12 Score=86.08 Aligned_cols=468 Identities=10% Similarity=0.014 Sum_probs=213.9 Q ss_pred CCCCCCcccccccc---------ccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRT---------KRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~---------~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |.-.+--..=|++. +.....-.| ...+++......|.. +|.|...+ |.....+ ...+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~I~~w~~---~Y~g~~~~-------~~~~~~~--~~~~~ 67 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKI-NIDPNELARIERNLR---QYEGDYPQ-------VEYINSQ--GKIQE 67 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCce-ecCHHHHHHHHHHHH---HhcCCCcc-------ccccccc--ccccc Confidence 32221111111111 111111123 225666666677754 36664221 1100101 01111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) |. -.=.|+.+.+++.+.++||.++++|. +++.-.....+.. ...-.++++.+ ++-+++...++..+ T Consensus 68 ~~--~~sl~~~~~i~~~~A~Ll~~e~~~i~-v~d~~~~~~~~~~----------~~~~~e~l~~i-~~~n~f~~~~~~~~ 133 (517) T protein:vir:98 68 RD--YMTLNLRKLSADVLSGLVFNEQCEVY-VSDAKDEEKKDNS----------FKTAHEFIQHV-FQHNKFIKNLSDYL 133 (517) T ss_pred cc--eeecCcHHHHHHHhhhhhcCCcceEE-ecccccccccccc----------hhHHHHHHHHH-HHhccHHHHHHHHH Confidence 11 11248999999999999999999985 3331100000000 01112222222 34568899999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeee-eeeccccccccccceeeeec Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREF-ERVDEHATPSQQNPWIGREG 230 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~-~~~~~~~~~~~~~~~i~~~~ 230 (661) ..++..|-+++-+=+. +.++-+..++|.+++-++++. +| .+...|... .....+ +...|.. T Consensus 134 e~a~a~G~~a~k~~~d-------~~~~~I~~v~ad~~~Pl~~~~-~~---v~~~ai~~~~~~~~~~----~~~~Yt~--- 195 (517) T protein:vir:98 134 EPTFALGGLTVRPYVD-------NGEIEFSWALANAFYPLRSNS-NG---ISEGVMKSVTTKVIGN----KTVYYTL--- 195 (517) T ss_pred HHHhhhCCEEEEEEEe-------CCeeEEEEEcCCeeEEEEecC-CC---eEEEEEEEEEEEeecC----CceEEEE--- Confidence 9999999666643221 123557777888876555432 11 122222111 111000 0000100 Q ss_pred hhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccce----- Q lcl|NC_019406. 231 SETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVY----- 305 (661) Q Consensus 231 ~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~----- 305 (661) --+++.--..+.+|.++.+..+|+.+.....|..+ T Consensus 196 ----------------------------------------lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~ 235 (517) T protein:vir:98 196 ----------------------------------------LEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEEL 235 (517) T ss_pred ----------------------------------------EEEEecCceeccCCcEEEEEEEEecCCCcccccccccccc Confidence 00000000111234454455555543322222111 Q ss_pred ----eeccCCcccceeeEEEEec-C-CC---CCCccccch----hHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCC Q lcl|NC_019406. 306 ----TPMVRGRTLPFIPFVFFGS-M-SN---AADCEKPPL----LDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDD 372 (661) Q Consensus 306 ----~p~~~g~~L~~IPfv~~~~-~-~~---~~~~~~pPL----ldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~ 372 (661) .|.+--+.++.-+|+++-. . ++ ....+.+-+ .-|-.||..+-+-.-+++-+-+..-+|.-++.--.+ T Consensus 236 ~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~~ 315 (517) T protein:vir:98 236 YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVPD 315 (517) T ss_pred ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccccC Confidence 1111001122222333311 1 11 111233322 234466766666666666655555555555421111 Q ss_pred CCCceeEecc-----c---ceeecCCCCCcceEeecCchh-HHHHHHHHHHHHHHHH---HHhHHhcccccCccchhHHH Q lcl|NC_019406. 373 SDASEYHIGP-----G---RVWVVDKESGIPGIIEFKGEG-LKTLERALNEKEQQIA---AIGGRLMPGMSKSVSESDNQ 440 (661) Q Consensus 373 ~~~~~l~iGs-----~---~~~~lp~~ga~~~ylE~~g~~-i~a~~~~L~~le~qM~---~lGArll~~~~~~~~eTata 440 (661) .. +...|+ . ..+..+ .++.+|-++++.- .+.+...++.+-+++. -++...+... ..+.+|||+ T Consensus 316 ~~--g~~~~~~~d~~~~~y~~~~~~--~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~-~~~~kTATE 390 (517) T protein:vir:98 316 ES--GMPPPQVFDPDVNVYKSIRMG--TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFD-GRSMKTATE 390 (517) T ss_pred CC--CcccCCCCCcccceeeeccCC--CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccc-ccccccHHH Confidence 11 111111 1 112222 2334455555532 1344444444444332 1223334332 345789999 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc------CCCCCCcceEEEEecccccccc-CCH-HHHHHHHHHHhcC Q lcl|NC_019406. 441 SALREANEQSLLLNVIMALEDGMTSVVRYWLMFR------DIPLTDTATLRYEIDATFLTTA-LDA-RALRAIQQLYEGG 512 (661) Q Consensus 441 a~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~------G~~~~~~~~~~v~ln~DF~~~~-lda-~~l~all~~~~aG 512 (661) ...+.+...+....+...+++||.++++.+..|. +.......++.| +|...- .|. +++..+.+++.+| T Consensus 391 i~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v----~f~D~i~~D~~~~~~~~~~~v~aG 466 (517) T protein:vir:98 391 IVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGV----DFDDGVFQDRSALLRFYGQAKTFG 466 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEE----EcCCCCCCCHHHHHHHHHHHHhcC Confidence 9999999999999999999999999988876553 211111222333 444332 233 4788899999999 Q ss_pred CCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCC-CchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 513 LLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIG-QPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 513 ~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~-~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) .||+++++..+ -|+ + +.+.+++..+|+++...-+ .++.+.. .+ ....|-| T Consensus 467 ~ms~~~~i~~~--~g~-~-eeeA~~e~~~i~~E~~~~~~~~~~~~~-------~~-------~~~gd~e 517 (517) T protein:vir:98 467 FIPTVEAIQRI--FKV-P-KKTAEQWLEEIRKDQIELDPVTISQRA-------QK-------RMFGDEE 517 (517) T ss_pred CCCHHHHHHHh--CCC-C-hHHHHHHHHHHHHhccccCCCCccccc-------cC-------CCCCCCC Confidence 99999996443 364 2 3334567778876654321 1211111 11 1222211 No 75 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.46 E-value=1e-11 Score=80.84 Aligned_cols=444 Identities=11% Similarity=0.081 Sum_probs=206.9 Q ss_pred CC---CCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MA---GLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~ 77 (661) |. |+.++--+|. .--+|.. .|+|.+....|. .+|.|...+ |-..... ...+.|...+ T Consensus 14 ~~~~~~~~~~~~~i~------d~~~i~~-~~~~~~~i~~~~---~~Y~g~~~~-------l~~~~~~--~~~~~~~~~s- 73 (505) T protein:vir:79 14 GSAAVGMTKSLGQII------DDPRINL-PADEVERIARDK---RYYMDDFKQ-------VTHKNSY--GDTQKHELQS- 73 (505) T ss_pred hhhhhcchhhhhhhh------cccCCCC-CHHHHHHHHHHH---HHhcCCCcc-------ccccccC--CCccccceee- Confidence 32 1222222222 1223444 366666666674 567775322 1111000 1111122222 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhh Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAM 157 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~ 157 (661) .|+.+.+++.+++++|.++|+|+ +.+. +..+.+..+ ++.+++...++..+..++.. T Consensus 74 -lnl~~~i~~~~A~ll~~e~~~i~-~~d~-----------------~~~e~l~~i-----~~~n~f~~~~~~~~e~a~a~ 129 (505) T protein:vir:79 74 -VNVTKLASAKLASLIFNEQCQVT-VSDE-----------------TANDFLDDV-----FQQNDFYTTFEEKLEEWIAL 129 (505) T ss_pred -cchHHHHHHHHHhhhcCCCceee-cCCh-----------------HHHHHHHHH-----HHhccHHHHHHHHHHHHhhc Confidence 48999999999999999999985 2221 111222222 24567899999999999999 Q ss_pred CCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) |.+++.+=+. ..++-+..++|++++-..++. ++ ++.+.+.......+.. T Consensus 130 G~~~~k~~~D-------~~~~~i~~v~ad~~~P~~~d~-~~---~~~~a~~~~~~~~~~~-------------------- 178 (505) T protein:vir:79 130 GSGCVRPYVD-------SGKIKLAWATADQVYPLQADT-NQ---VNELAIASRTTEVENH-------------------- 178 (505) T ss_pred CCeEEEEEEe-------CCceEEEEEcCCeeEEEEEcC-CC---eEEEEEEEEEEEecCC-------------------- Confidence 9777654332 123556777888876533322 22 2222221111100000 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccc-----------ee Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDV-----------YT 306 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~-----------~~ 306 (661) ...+|+.+-.-...++.++.+..+|+.......+.. .. T Consensus 179 -------------------------------~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~ 227 (505) T protein:vir:79 179 -------------------------------RTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLE 227 (505) T ss_pred -------------------------------cceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccC Confidence 001121111111112233333444443322222111 01 Q ss_pred eccCCcccceeeEEEEec--CCCCC---Cccccchh----HHHHHHHHHHhhhhhHHHHHHHhcCceeEEe------cCC Q lcl|NC_019406. 307 PMVRGRTLPFIPFVFFGS--MSNAA---DCEKPPLL----DIVELNLKHYRTYAELEHGRFFTALPTYYAP------ELD 371 (661) Q Consensus 307 p~~~g~~L~~IPfv~~~~--~~~~~---~~~~pPLl----dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~------Gl~ 371 (661) +.+.=+.++..+|+.+-. .++.. ..+.+-|. -|-.||..+-+-.-+++.+-+..-+|.-++. |.. T Consensus 228 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~ 307 (505) T protein:vir:79 228 PQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQA 307 (505) T ss_pred cceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccc Confidence 111001244445554421 11111 12333333 2334554444444444433333223222221 111 Q ss_pred CCCCceeEeccccee-ecCCCCCcceEeecCchh-HHHHHHHHHHHHHHHHHH---hHHhcccccCccchhHHHHHHHHH Q lcl|NC_019406. 372 DSDASEYHIGPGRVW-VVDKESGIPGIIEFKGEG-LKTLERALNEKEQQIAAI---GGRLMPGMSKSVSESDNQSALREA 446 (661) Q Consensus 372 ~~~~~~l~iGs~~~~-~lp~~ga~~~ylE~~g~~-i~a~~~~L~~le~qM~~l---GArll~~~~~~~~eTataa~~d~~ 446 (661) .....++--+-..++ ....+++..++-.+++.- .+.+.+.|+.+.+++... +...+.. ...+.+|||+...+.+ T Consensus 308 ~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~-~~~~~~TAtei~s~~~ 386 (505) T protein:vir:79 308 SETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTT-SPSGIQTATEVVTNNS 386 (505) T ss_pred ccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCC-CccccchHHHHHHHHh Confidence 011101100111111 111123344455555542 245566666655554432 2222322 2345789999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC------------CCCcceEEEEecccccccc-CC-HHHHHHHHHHHhcC Q lcl|NC_019406. 447 NEQSLLLNVIMALEDGMTSVVRYWLMFRDIP------------LTDTATLRYEIDATFLTTA-LD-ARALRAIQQLYEGG 512 (661) Q Consensus 447 ~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~------------~~~~~~~~v~ln~DF~~~~-ld-a~~l~all~~~~aG 512 (661) ...+....+...++.||.++++.++.+...- .....++.| +|...- .| .++++.+.+++.+| T Consensus 387 ~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v----~f~d~i~~d~~~~~~~~~~~v~~G 462 (505) T protein:vir:79 387 QTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITI----NFNDGVFVDQESKRAADLQAVQAQ 462 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEE----EeCCCCCCCHHHHHHHHHHHHHcC Confidence 9999999999999999999988887654211 111223444 344322 23 34788899999999 Q ss_pred CCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCC Q lcl|NC_019406. 513 LLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQE 567 (661) Q Consensus 513 ~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~ 567 (661) .+|+++++.. .-|+ ++.+.+++.++|+++.... .|+. . ..+.+ T Consensus 463 i~s~e~~l~~--~~~~--~eeea~~el~ri~~E~~~~-~p~~--~-----~~gg~ 505 (505) T protein:vir:79 463 VMPKKQFLMR--NYGL--DEEEADEWLAQIDAENSTA-EPEF--N-----QFGGD 505 (505) T ss_pred CCCHHHHHHh--cCCC--ChHHHHHHHHHHHHhcccc-CCCc--h-----hccCC Confidence 9999988532 2343 2222356777887764321 1111 0 11111 No 76 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.41 E-value=1.8e-11 Score=79.54 Aligned_cols=516 Identities=11% Similarity=0.007 Sum_probs=241.5 Q ss_pred CC----CCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhh Q lcl|NC_019406. 1 MA----GLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRA 76 (661) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA 76 (661) |+ ---||+--.. ++..+- +.+.-.+.+..+++..|.|.|... + |=+-..+++ .|- T Consensus 1 m~~~~~q~~p~~~~fp----~~~a~w---V~~~D~~RlaaY~ly~d~y~n~~~-----e-l~~il~G~d--------r~~ 59 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLR----GGDDNI---VDENDKNRVRAYDLYENIYLNSAE-----T-LKLVLRGDD--------SVP 59 (563) T ss_pred CCccccccCCCccccc----cccccc---CCHHHHHHHHHHHHHHHhhcCchh-----h-hhhhcCCCc--------eee Confidence 32 2234443222 444443 455566688889999999988422 1 111122332 455 Q ss_pred cccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHh Q lcl|NC_019406. 77 AFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVA 156 (661) Q Consensus 77 ~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~ 156 (661) ++.|.-+-.|++ +..++.++..+. ||+..+ |. | .++. +..+|.+. -+-.+|...+.+.-+.++. T Consensus 60 ~~~ps~r~~V~~-~~~~Lg~~~~~~-Ve~~~~----de-~-----~~~a---vq~~Lr~~-~~~e~l~~~~~~~~r~a~v 123 (563) T protein:vir:74 60 ILMPSGRKIVEA-VHRFLGVGFDYL-VEPDMG----DE-G-----IRQS---LNAYFRTT-FKREAIKAKFTSNKRWGLI 123 (563) T ss_pred eccchHHHHHHH-HHHhcCCCcEEe-cCcccc----Cc-c-----hHHH---HHHHHHHH-HHHhhhHHHHHHHHHhhhh Confidence 555666677777 457777777773 555432 10 1 1111 22222221 2346788889999999888 Q ss_pred hCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhc Q lcl|NC_019406. 157 MGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQR 236 (661) Q Consensus 157 ~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~ 236 (661) .|-..++|=.. .++..+.|+-+..|+|.++.-|.-.+...-..+ |++..-..-.+++.. .+. T Consensus 124 lGDgvf~l~wD--p~K~~g~R~rv~~vDP~~~fp~~dpd~v~g~~~--v~v~~~~~~pdd~~~-----~~~--------- 185 (563) T protein:vir:74 124 RGDAHFYIHAD--PNKKAGERISVDEVDPRQIFLIEDGSTVVGFHM--VDIVQDFRSPDDPSK-----KLA--------- 185 (563) T ss_pred hcceeEEEeec--cccccCCCceEeecCCceeeeccCCCCccccee--eecccCCCCCcchhc-----cce--------- Confidence 88555444332 234567899999999998877753322110001 111100000001000 000 Q ss_pred chhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccc---c-ceeeccCCc Q lcl|NC_019406. 237 TSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQAR---D-VYTPMVRGR 312 (661) Q Consensus 237 w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~---~-~~~p~~~g~ 312 (661) ++ ..+.+.-.+.+.|....-+-.-.+++|+.-.+...-..+.....+... + +. ..--+ T Consensus 186 -------r~---------~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~--~~LP~ 247 (563) T protein:vir:74 186 -------RR---------RTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEE--EELPE 247 (563) T ss_pred -------ee---------eeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhhhhchh--hhccc Confidence 00 000000000011111111111111111100000000000000111100 0 01 01135 Q ss_pred ccceeeEEEEecCCCC-CCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCC-----ceeEeccccee Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNA-ADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDA-----SEYHIGPGRVW 386 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~-~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~-----~~l~iGs~~~~ 386 (661) ++++||+|.+.+.... ..=+.+-|.+|-.+--+.-++.+|.+-++-+++.|+.++.|....++ ....||++..| T Consensus 248 pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~ 327 (563) T protein:vir:74 248 PISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIV 327 (563) T ss_pred cccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCceeE Confidence 7899999876433222 22356779999888889999999999999999999999986543332 23569999999 Q ss_pred ecCCCC--CcceEeecCc-hhHHHHHHHHHHHHHH-HHHHhHHhccc--c--cCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 387 VVDKES--GIPGIIEFKG-EGLKTLERALNEKEQQ-IAAIGGRLMPG--M--SKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 387 ~lp~~g--a~~~ylE~~g-~~i~a~~~~L~~le~q-M~~lGArll~~--~--~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) .+++++ +.+..| +| ..+.-++.-|+++... |+. .+++=.. + .-+..+|+.+-.+.-+...|.+.-.... T Consensus 328 El~~~~~~g~l~~v--~g~~~l~~~q~Hm~~l~eral~~-~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~ 404 (563) T protein:vir:74 328 EIAGNRNDNYFERV--SGVQDVSPFQDHMKWIDEKGIAE-GSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELE 404 (563) T ss_pred eccCCccccceeee--cchhhhHHHHHHHHHHHHHHHHh-hccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHH Confidence 999753 344444 45 3344555556566553 332 1222100 0 1123678888887766555533333333 Q ss_pred HHHHHHH--------HHHHH---------HHHcCCCCCCcceEEEEe-ccccccccCCHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019406. 459 LEDGMTS--------VVRYW---------LMFRDIPLTDTATLRYEI-DATFLTTALDARALRAIQQLYEGGLLPIDALY 520 (661) Q Consensus 459 le~Al~~--------aL~~~---------A~w~G~~~~~~~~~~v~l-n~DF~~~~lda~~l~all~~~~aG~Is~et~~ 520 (661) +...+.+ .|..+ ++|.|..+.. ....|.+ -.++.+.+ ..+-+...+.++++|.||++|.. T Consensus 405 l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~-~~~~v~ivf~p~~P~d-~~~vv~~~~tl~~aGiiSretAv 482 (563) T protein:vir:74 405 MIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLL-NECSVVCIFADPMPVN-KTQVTQDTLLLQQAHLILRKMAV 482 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccC-CceEEEEEeCCCCCcc-HHHHHHHHHHHHHcCchhHHHHH Confidence 3333333 22222 2455654322 2222221 12333332 13468889999999999999999 Q ss_pred HHHHhcCCCCccC-------CHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCC Q lcl|NC_019406. 521 ENFVKNGIIPSTQ-------TLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEI 593 (661) Q Consensus 521 ~eL~r~gvl~~~~-------~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~ 593 (661) ++|.+.|..-++. ...++.+.+..+.-+=..=+++++-+|.-..++.-+|-++ +.+=---+.+ T Consensus 483 ~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p----------~~~~~~~~~~ 552 (563) T protein:vir:74 483 AKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNP----------IDQFGNPVEI 552 (563) T ss_pred HHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCc----------hhHcCCcccC Confidence 9999999755442 2233333221111111111233333332222111111000 0000001111 Q ss_pred CchhHHHhhhhhhhhhhH Q lcl|NC_019406. 594 DEEKLRISAKVGSTSVAA 611 (661) Q Consensus 594 ~~~~~~~~~~~~~~~~~~ 611 (661) -++-++++- +- T Consensus 553 ~~~~~~~~~-------~~ 563 (563) T protein:vir:74 553 PPDVTQVPL-------SP 563 (563) T ss_pred CccccccCC-------CC Confidence 111111111 00 No 77 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.39 E-value=3.9e-11 Score=77.63 Aligned_cols=453 Identities=10% Similarity=0.037 Sum_probs=201.7 Q ss_pred CCCCCCccccccccccc---------cccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRG---------AQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~---------~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |.-..----=|+++... +..-.|. ..|++.+....| +..|.|..- . +..+.... ..+. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~~~~i~~~---~~~Y~g~~~----~---~~~~~~~~--~~~~ 67 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIA-ISKLEYDRITTN---LKYYKSDWD----S---VLYLNTDG--ETKK 67 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhcccccc-CCHHHHHHHHHH---HHHhcCCCC----C---cccccCCC--Cccc Confidence 22111000001221111 1111233 244444444444 456666311 1 11111111 1111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +... -.|+.+.+++.+++++|.++|+|+ +++. ...++++++ ++-+++...++..+ T Consensus 68 ~~~~--slnl~~~i~~~~A~lv~~e~~~i~-~~d~---------------------~~~~~l~~i-l~~n~f~~~~~~~~ 122 (500) T protein:vir:98 68 RDLN--HLPIARTAAKKIASLVFNEQAEIK-VDDD---------------------AANEFISET-LKNDRFNKNFERYL 122 (500) T ss_pred Ccee--ecchHHHHHHHHhhhhcCCcceEe-cCCh---------------------HHHHHHHHH-HhhccHHHHHHHHH Confidence 1111 248999999999999999999985 3321 112222222 24578899999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) ..++..|.+++-+=+. +.+|.+..++|++++-++++..+ ....+.+.++.....+ +...|..+ T Consensus 123 e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~~d~~~---~~~~a~~~~~~~~~~~----~~~~yt~l--- 185 (500) T protein:vir:98 123 ESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQSNTQD---VSSAAVVIKSVKTING----KEVYYTLI--- 185 (500) T ss_pred HHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEEEcCCC---eEEEEEEEEEeeeecC----CceEEEEE--- Confidence 9999999776654322 23577888899998765543222 2222233222221111 00011000 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc-------- Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD-------- 303 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~-------- 303 (661) | +++ ++.+. .++.+..+|+.......+. T Consensus 186 -----------------------E-----------------~h~--~~~~~--~~~I~n~ly~~~~~~~lG~~v~l~~~~ 221 (500) T protein:vir:98 186 -----------------------E-----------------FHE--WQSSD--DYVISNELYRSDDKAKVGSRVPLSEVY 221 (500) T ss_pred -----------------------E-----------------EEE--EeCCc--eeEEEEEEEecccccccCccccccccc Confidence 0 000 11111 1111222233211111111 Q ss_pred -ceeeccCCcccceeeEEEEec-CCCCCC----ccccchh----HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCC- Q lcl|NC_019406. 304 -VYTPMVRGRTLPFIPFVFFGS-MSNAAD----CEKPPLL----DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDD- 372 (661) Q Consensus 304 -~~~p~~~g~~L~~IPfv~~~~-~~~~~~----~~~pPLl----dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~- 372 (661) ...|...=+.++..||+++-. ..|.-. .+.+-|. -|-.+|..+-+-.-+++.+-+..-+|.-++..-.+ T Consensus 222 ~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~ 301 (500) T protein:vir:98 222 KDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRT 301 (500) T ss_pred CCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCC Confidence 111111001234445554421 111111 1222222 22334444444444555433333333333321111 Q ss_pred CCCc---eeEeccc-cee-ecCC-CCCcceEeecCchh-HHHHHHHHHHHHHHHHH-H--hHHhcccccCccchhHHHHH Q lcl|NC_019406. 373 SDAS---EYHIGPG-RVW-VVDK-ESGIPGIIEFKGEG-LKTLERALNEKEQQIAA-I--GGRLMPGMSKSVSESDNQSA 442 (661) Q Consensus 373 ~~~~---~l~iGs~-~~~-~lp~-~ga~~~ylE~~g~~-i~a~~~~L~~le~qM~~-l--GArll~~~~~~~~eTataa~ 442 (661) ..+. +...... ..+ .++. +++.-++-++++.- .+.+...|+..-+++.. . +...+... ..+.+|||+.. T Consensus 302 ~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~-~~g~~TAtei~ 380 (500) T protein:vir:98 302 TDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFD-GKSMKTATEIV 380 (500) T ss_pred CCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccC-cCccccHHHHH Confidence 0000 0011111 111 1111 12222232333322 24455555555544332 1 22222222 24578999999 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc---CC---CCCCcceEEEEeccccccc-cCCH-HHHHHHHHHHhcCCC Q lcl|NC_019406. 443 LREANEQSLLLNVIMALEDGMTSVVRYWLMFR---DI---PLTDTATLRYEIDATFLTT-ALDA-RALRAIQQLYEGGLL 514 (661) Q Consensus 443 ~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~---G~---~~~~~~~~~v~ln~DF~~~-~lda-~~l~all~~~~aG~I 514 (661) .+.+........+...++.||.++++.+..+. +. ......++.|+. ... ..|. ++++.+++++.+|.| T Consensus 381 s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f----~d~i~~d~~~~~~~~~~~v~aGi~ 456 (500) T protein:vir:98 381 SENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISL----DDGVFTDRDAELDYWIKVVNAGFG 456 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEe----CCCCCCCHHHHHHHHHHHHHcCCC Confidence 99999999999999999999999988776542 11 111222344443 332 2233 478899999999999 Q ss_pred CHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC-CCCCchhhhhhcCC Q lcl|NC_019406. 515 PIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS-FIGQPDAIAMRRGY 561 (661) Q Consensus 515 s~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~-~l~~ddae~~~~g~ 561 (661) |+++++..+ -|+ ++...+++.++|+++.+ ..+..+.+.-.-|. T Consensus 457 s~~~~i~~~--~g~--~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 457 TREMAIQKV--LNV--TEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred CHHHHHHhc--CCC--CHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 999986332 354 22234556677765533 34444444444342 No 78 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.39 E-value=3.9e-11 Score=77.63 Aligned_cols=453 Identities=10% Similarity=0.037 Sum_probs=201.7 Q ss_pred CCCCCCccccccccccc---------cccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRG---------AQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~---------~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |.-..----=|+++... +..-.|. ..|++.+....| +..|.|..- . +..+.... ..+. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~~~~i~~~---~~~Y~g~~~----~---~~~~~~~~--~~~~ 67 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIA-ISKLEYDRITTN---LKYYKSDWD----S---VLYLNTDG--ETKK 67 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhcccccc-CCHHHHHHHHHH---HHHhcCCCC----C---cccccCCC--Cccc Confidence 22111000001221111 1111233 244444444444 456666311 1 11111111 1111 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +... -.|+.+.+++.+++++|.++|+|+ +++. ...++++++ ++-+++...++..+ T Consensus 68 ~~~~--slnl~~~i~~~~A~lv~~e~~~i~-~~d~---------------------~~~~~l~~i-l~~n~f~~~~~~~~ 122 (500) T protein:vir:30 68 RDLN--HLPIARTAAKKIASLVFNEQAEIK-VDDD---------------------AANEFISET-LKNDRFNKNFERYL 122 (500) T ss_pred Ccee--ecchHHHHHHHHhhhhcCCcceEe-cCCh---------------------HHHHHHHHH-HhhccHHHHHHHHH Confidence 1111 248999999999999999999985 3321 112222222 24578899999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) ..++..|.+++-+=+. +.+|.+..++|++++-++++..+ ....+.+.++.....+ +...|..+ T Consensus 123 e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~~d~~~---~~~~a~~~~~~~~~~~----~~~~yt~l--- 185 (500) T protein:vir:30 123 ESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQSNTQD---VSSAAVVIKSVKTING----KEVYYTLI--- 185 (500) T ss_pred HHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEEEcCCC---eEEEEEEEEEeeeecC----CceEEEEE--- Confidence 9999999776654322 23577888899998765543222 2222233222221111 00011000 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccc-------- Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARD-------- 303 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~-------- 303 (661) | +++ ++.+. .++.+..+|+.......+. T Consensus 186 -----------------------E-----------------~h~--~~~~~--~~~I~n~ly~~~~~~~lG~~v~l~~~~ 221 (500) T protein:vir:30 186 -----------------------E-----------------FHE--WQSSD--DYVISNELYRSDDKAKVGSRVPLSEVY 221 (500) T ss_pred -----------------------E-----------------EEE--EeCCc--eeEEEEEEEecccccccCccccccccc Confidence 0 000 11111 1111222233211111111 Q ss_pred -ceeeccCCcccceeeEEEEec-CCCCCC----ccccchh----HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCC- Q lcl|NC_019406. 304 -VYTPMVRGRTLPFIPFVFFGS-MSNAAD----CEKPPLL----DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDD- 372 (661) Q Consensus 304 -~~~p~~~g~~L~~IPfv~~~~-~~~~~~----~~~pPLl----dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~- 372 (661) ...|...=+.++..||+++-. ..|.-. .+.+-|. -|-.+|..+-+-.-+++.+-+..-+|.-++..-.+ T Consensus 222 ~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~ 301 (500) T protein:vir:30 222 KDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRT 301 (500) T ss_pred CCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCC Confidence 111111001234445554421 111111 1222222 22334444444444555433333333333321111 Q ss_pred CCCc---eeEeccc-cee-ecCC-CCCcceEeecCchh-HHHHHHHHHHHHHHHHH-H--hHHhcccccCccchhHHHHH Q lcl|NC_019406. 373 SDAS---EYHIGPG-RVW-VVDK-ESGIPGIIEFKGEG-LKTLERALNEKEQQIAA-I--GGRLMPGMSKSVSESDNQSA 442 (661) Q Consensus 373 ~~~~---~l~iGs~-~~~-~lp~-~ga~~~ylE~~g~~-i~a~~~~L~~le~qM~~-l--GArll~~~~~~~~eTataa~ 442 (661) ..+. +...... ..+ .++. +++.-++-++++.- .+.+...|+..-+++.. . +...+... ..+.+|||+.. T Consensus 302 ~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~-~~g~~TAtei~ 380 (500) T protein:vir:30 302 TDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFD-GKSMKTATEIV 380 (500) T ss_pred CCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccC-cCccccHHHHH Confidence 0000 0011111 111 1111 12222232333322 24455555555544332 1 22222222 24578999999 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc---CC---CCCCcceEEEEeccccccc-cCCH-HHHHHHHHHHhcCCC Q lcl|NC_019406. 443 LREANEQSLLLNVIMALEDGMTSVVRYWLMFR---DI---PLTDTATLRYEIDATFLTT-ALDA-RALRAIQQLYEGGLL 514 (661) Q Consensus 443 ~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~---G~---~~~~~~~~~v~ln~DF~~~-~lda-~~l~all~~~~aG~I 514 (661) .+.+........+...++.||.++++.+..+. +. ......++.|+. ... ..|. ++++.+++++.+|.| T Consensus 381 s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f----~d~i~~d~~~~~~~~~~~v~aGi~ 456 (500) T protein:vir:30 381 SENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISL----DDGVFTDRDAELDYWIKVVNAGFG 456 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEe----CCCCCCCHHHHHHHHHHHHHcCCC Confidence 99999999999999999999999988776542 11 111222344443 332 2233 478899999999999 Q ss_pred CHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC-CCCCchhhhhhcCC Q lcl|NC_019406. 515 PIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS-FIGQPDAIAMRRGY 561 (661) Q Consensus 515 s~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~-~l~~ddae~~~~g~ 561 (661) |+++++..+ -|+ ++...+++.++|+++.+ ..+..+.+.-.-|. T Consensus 457 s~~~~i~~~--~g~--~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 457 TREMAIQKV--LNV--TEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred CHHHHHHhc--CCC--CHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 999986332 354 22234556677765533 34444444444342 No 79 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.26 E-value=2.8e-10 Score=72.94 Aligned_cols=451 Identities=7% Similarity=-0.062 Sum_probs=192.4 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHH--hCCcccCCCCCCCCh----HHHHHHHh----------hhcccchHHH Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIK--AQGVKYLKAPKGFDD----EDYANYLD----------RAAFYNMTSQ 84 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr--~~g~~YLPk~~~E~~----~~Y~~rl~----------rA~~~n~~~~ 84 (661) |.|...--.+. ..-|.|+..=+ +.-..|++.+...-. ..|..|+. |-.=+|..+. T Consensus 1 ~~~~~~~~~~i---------~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~ 71 (518) T protein:vir:78 1 MGVWSVMTRFI---------KGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNE 71 (518) T ss_pred CcchhhHHHHH---------HHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHH Confidence 66655433332 22233331100 111223332221110 01111111 2234467899 Q ss_pred HHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEE-- Q lcl|NC_019406. 85 TQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGA-- 162 (661) Q Consensus 85 tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gv-- 162 (661) +++.++.+||.++|+|+ |++ .|.. ....+.++++++ ++.+.++..++..+..++..|..++ T Consensus 72 i~~~~A~ll~~e~~~i~-v~~--------~~~~-------d~e~~~~~l~~i-l~~n~f~~~~~~~~e~a~a~G~~~~k~ 134 (518) T protein:vir:78 72 IVVVAAEYISGKPLSID-VTG--------VNGS-------KDENLTKQLKEA-LRIDNFDSKSVKIVELAGGSGVSAVKI 134 (518) T ss_pred HHHHHHHhhcCCCceEE-ecC--------cccc-------CcHHHHHHHHHH-HHhccHHHHHHHHHHHhhccCceEEEE Confidence 99999999999999985 322 0000 001223333332 3567888999999999999997664 Q ss_pred EEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhh Q lcl|NC_019406. 163 LVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRR 242 (661) Q Consensus 163 LVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~ 242 (661) .+| ..+|-+..++|.+++=... +| .++-+.+-+..... .+...|..+. T Consensus 135 ~~d---------~~~~~i~~v~ad~~~P~~~---~g--~~~~~~f~~~~~~~-----~k~~~y~~lE------------- 182 (518) T protein:vir:78 135 NIL---------NGRPSISVHSSSQFWIDFK---NN--EPFRFNFFEEIPTS-----NKADIYYLVE------------- 182 (518) T ss_pred EEE---------CCeeEEEEEcCCeeEEEee---cC--cEEEEEEEEEeecC-----CcceeEEEEE------------- Confidence 333 1235556666666654221 12 13333332221110 0000111000 Q ss_pred cchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc-ccccc---------------ccee Q lcl|NC_019406. 243 AGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP-LGQAR---------------DVYT 306 (661) Q Consensus 243 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~-~~~~~---------------~~~~ 306 (661) ..+. .... + .....+.++.+..+|+... .+... +... T Consensus 183 -------------~he~--------~~~~-~-----~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~ 235 (518) T protein:vir:78 183 -------------SREI--------KQWD-K-----EGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQ 235 (518) T ss_pred -------------eecc--------cccc-c-----eeecccceeEEEEEeeecCcccccccccccccccccccccccCc Confidence 0000 0000 0 0000111122222332210 00000 0000 Q ss_pred eccCCcccceeeEEEE---ecCCCCCC---ccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCC----CCCC Q lcl|NC_019406. 307 PMVRGRTLPFIPFVFF---GSMSNAAD---CEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELD----DSDA 375 (661) Q Consensus 307 p~~~g~~L~~IPfv~~---~~~~~~~~---~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~----~~~~ 375 (661) +...-.+....||+++ ...++-.. .+.+-|.++..+=-+.=..-|.+.+.+.. +.+..+++ .+- +... T Consensus 236 e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~ 314 (518) T protein:vir:78 236 LNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKST 314 (518) T ss_pred cceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCC Confidence 0000001122444443 22222221 23333433332222222222344455544 44445553 110 0000 Q ss_pred ----ceeEecccceeec---CCCCCc----ceEeecCchhHHHHHHHHHHHHHHHH---HHhHHhcccccCccchhHHHH Q lcl|NC_019406. 376 ----SEYHIGPGRVWVV---DKESGI----PGIIEFKGEGLKTLERALNEKEQQIA---AIGGRLMPGMSKSVSESDNQS 441 (661) Q Consensus 376 ----~~l~iGs~~~~~l---p~~ga~----~~ylE~~g~~i~a~~~~L~~le~qM~---~lGArll~~~~~~~~eTataa 441 (661) ..+-.+-+....+ +..|.+ ..-++|.= ..+.+...|+.+-.++. -++...+.. .++.+|||+. T Consensus 315 ~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~I-r~e~~~~~~~~~l~~~~~~~G~s~~tfg~--~~~~~TATei 391 (518) T protein:vir:78 315 DKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDF-RDGSYRETMEYFAQKAVSKSGYNPATFNL--GNREVKATEI 391 (518) T ss_pred CccccccCCCCceEEEecCcCCCCCccccceeeeeccc-ChHHHHHHHHHHHHHHHHhhCCChhhcCc--ccccccHHHH Confidence 0011111111111 111111 22222210 12344444544444432 122333322 2457899999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC--------CCCcceEEEEeccccccccCCHHHHHHHHHHHhcCC Q lcl|NC_019406. 442 ALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIP--------LTDTATLRYEIDATFLTTALDARALRAIQQLYEGGL 513 (661) Q Consensus 442 ~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~--------~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~ 513 (661) ..+.+...+.+..+...++.++.++++.++..++.- ..+..++.|..+. -...+ ..+.++.+.+++.+|. T Consensus 392 ~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D-~i~~D-~~~~~~~~~~~v~aGi 469 (518) T protein:vir:78 392 WSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPD-PMSVN-LNELSSTLNNMNSALA 469 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCC-CCCCC-HHHHHHHHHHHHhcCC Confidence 999999999999999999999999988776654321 1112234444332 22222 1235667788999999 Q ss_pred CCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccC Q lcl|NC_019406. 514 LPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQ 566 (661) Q Consensus 514 Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~ 566 (661) +|+++.++++. .++ ++...++|.+||+++......++-+. ..|+..++. T Consensus 470 mS~e~~i~~~~-~~~--~deea~~e~~ri~~E~~~~~~~~p~~-~~g~~~~~g 518 (518) T protein:vir:78 470 MSVEEKVKLIH-PKW--EDEEIQAEVKRIYLENAIGEVPDPEA-IGGMETKGG 518 (518) T ss_pred CCHHHHHHHhC-CCC--CHHHHHHHHHHHHHHhcccCCCCCcc-ccCCCCCCC Confidence 99999866431 232 22334567778877755433222211 224333333 No 80 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.22 E-value=2.5e-10 Score=73.27 Aligned_cols=611 Identities=12% Similarity=0.025 Sum_probs=197.1 Q ss_pred CCCCCC-ccccccccccccccCCccccCHHHHH-------------HHHHHHHHHHHhcchHHHHh---CCcccC--CCC Q lcl|NC_019406. 1 MAGLSP-NSANIRRTKRGAQQFTHLVVHPEYEY-------------YRPDWAKIRDAIAGEREIKA---QGVKYL--KAP 61 (661) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~V~~~hPey~a-------------~~~~W~~irD~~~G~~~vr~---~g~~YL--Pk~ 61 (661) |.-|.- .+.|..++.+-.+.|--+..+++-.. ...-+..++..+.....+|+ ....|. =+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw 80 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW 80 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC Confidence 655532 44566666655555522222211111 11111112222233333332 111222 145 Q ss_pred CCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCC Q lcl|NC_019406. 62 KGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGT 141 (661) Q Consensus 62 ~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~ 141 (661) +.+.....+.+=.....+|.++.+|+.++|.-.+..+.+.-+|..- .|. +.-+.+..++..+ .+=+ T Consensus 81 ~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~----~d~---------~~Ae~l~~~~~~~-~~~~ 146 (776) T protein:vir:93 81 SQDEIDELKERGQAPTVYNVISQSVNWIIGSEKRGRSDFKVLPRRK----DGG---------KAAERKTALLKYL-SDVN 146 (776) T ss_pred CHHHHHHHHhcCCceEEecchHHHHHHHHHHHHhCCcceEEecCCh----hHH---------HHHHHHHHHHHHH-HHhh Confidence 5555555556656678899999999999999999888776555421 111 1122333333333 3556 Q ss_pred CHHHHHHHHHHHHHhhCCEEEE--EeccCCCchhhcccceeE-eechhhhc-cceeeccccccceeeeeeeeeeeecccc Q lcl|NC_019406. 142 SHQGFAKTVALEQVAMGRFGAL--VDVAPSSDPTAPAKSYTV-GYAAENIV-DWTVEDVDGFYVPTRILLREFERVDEHA 217 (661) Q Consensus 142 sL~~fa~~~~~~~L~~Gr~gvL--VD~P~a~~~~~g~rPY~~-~~~p~~Ii-nW~~~~~~g~~~Lt~v~ire~~~~~~~~ 217 (661) +.+.-+..+|..++.+|.+++= +||....+ |+.. .+.|.+|+ |+.....+. ..-.|+..+.+...++- T Consensus 147 ~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~------~~~~~~~~p~~i~~Dp~a~~~D~-sDar~~~~~~~~~~~~~- 218 (776) T protein:vir:93 147 HTPFERSMAFEETTKAGIGWLESQVQDENDGE------PIYAGAESWRNILWDSTYRRLDM-DDCRYIFRVKWVDLDVM- 218 (776) T ss_pred cHHHHHHHHHHHhhhcCcceEEEEeeccCCCC------ceEeeccChhheeeccccccCCH-HHHhhhhhhccCCHHHH- Confidence 7888899999999999977754 46653322 2222 23444432 122111110 01112222222111100 Q ss_pred ccccccceee-eech---hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeec------------- Q lcl|NC_019406. 218 TPSQQNPWIG-REGS---ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILEL------------- 280 (661) Q Consensus 218 ~~~~~~~~i~-~~~~---e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~------------- 280 (661) . ...|... .+.. .....|...-..+.. .....+...........+.+....||++.|- T Consensus 219 -~-~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~ 293 (776) T protein:vir:93 219 -L-AIFPERAAQLRAAAVDNFETWGTDDIDGDD---AMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKG 293 (776) T ss_pred -H-HhcCCchHHHHHhhhhcccccchhcccccc---cccccccccccccccccccccCCCeEEEEEEEEeeeeehhhccc Confidence 0 0000000 0000 000000000000000 0000000000000001111111111111110 Q ss_pred --c--------------------------cccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC---CCCC Q lcl|NC_019406. 281 --Q--------------------------KDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM---SNAA 329 (661) Q Consensus 281 --g--------------------------~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~---~~~~ 329 (661) + .......++.++..+.. ......| -+.+.||||++... ..++ T Consensus 294 ~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~--l~~~~~p----~~~~~~Pfv~~~~~~~~~~~~ 367 (776) T protein:vir:93 294 RNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDL--MWAGPSP----YRHNRYPFTPIWGFRRARDGM 367 (776) T ss_pred ccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchh--hhccCCC----CCCCccceEEecCceeccccc Confidence 0 00000011111111100 0000000 12356777765332 1222 Q ss_pred CccccchhHHHHHHH--HHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-Eec-ccceeecCCCCC--cceEeecCch Q lcl|NC_019406. 330 DCEKPPLLDIVELNL--KHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-HIG-PGRVWVVDKESG--IPGIIEFKGE 403 (661) Q Consensus 330 ~~~~pPLldLA~LNl--~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~iG-s~~~~~lp~~ga--~~~ylE~~g~ 403 (661) ..+ -+..|.+..- ..+.+. +.+++ ...++.+-.|..+....-. .++ ++.++.+ ..|+ .+.+..+.+ T Consensus 368 ~~G--~v~~~~d~Q~~~N~~~s~--~~~~l--~~~~~~~~~gav~~~d~~~~~~~rp~~vi~~-~~~~~~~~~~~~~~~- 439 (776) T protein:vir:93 368 PYG--VIRFMRGMQDDVNKRLSK--ALYIL--STNKVLMEEGAVDDIDEFRREAARPDAVMTV-KNGKLGAVKMDVDRD- 439 (776) T ss_pred ccc--hHHhhhHHHHHHHHHHHH--HHHhh--cCCceeeccccccchHHHHHhcccCCceeee-CCccccccccccCcC- Confidence 111 1222222221 122222 23333 2344444445322111100 111 2233332 2232 333433222 Q ss_pred hHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC Q lcl|NC_019406. 404 GLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL 478 (661) Q Consensus 404 ~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~ 478 (661) -...+.+-|+...+.|..+ |..-...+..+.+.|+++......+..-.|+.+..|+..++..+ |.++..|++..- T Consensus 440 ~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r 519 (776) T protein:vir:93 440 LAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEK 519 (776) T ss_pred ccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcce Confidence 2234445455555555433 32111112233457888888888888888888888888777655 556666664210 Q ss_pred ------C-CcceEEEEecc----------cccc--------ccCCHHHHHHHHHHHhcCC--CCHHHHHHHHHhcCCCCc Q lcl|NC_019406. 479 ------T-DTATLRYEIDA----------TFLT--------TALDARALRAIQQLYEGGL--LPIDALYENFVKNGIIPS 531 (661) Q Consensus 479 ------~-~~~~~~v~ln~----------DF~~--------~~lda~~l~all~~~~aG~--Is~et~~~eL~r~gvl~~ 531 (661) . +..+ .|.||. +|.. .....+.+.+|++++.... +.....-..+.-.++ T Consensus 520 ~~ri~~~~~~~~-~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~--- 595 (776) T protein:vir:93 520 QFRITNSRGNPE-YVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDI--- 595 (776) T ss_pred EEEEeecCCCcc-eEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCc--- Confidence 0 1111 123331 1111 1112223444555543210 111000001111111 Q ss_pred cCCHHHHHHHHhccCCCCC------CchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhh Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIG------QPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVG 605 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~------~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~ 605 (661) -..++...+|....+.-. .+..+..+ ......+++.+. ...+.++++..+.++.++..+..+++...-.. T Consensus 596 -p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~q-q~q~~~~q~q~~--~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~ 671 (776) T protein:vir:93 596 -PNRDELVKRIRAVNGQKDPDQDEPTPEEIARE-QAQQQQQQYNDA--LAIATLEEQQAKARKAAAEAQVAEAKAKHISR 671 (776) T ss_pred -cchHHHHHHHHHhhcccccchhhcchhHHHHH-HHhhHHHHHHHH--HhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhh Confidence 123455555543322110 00000000 000000000000 00111111111100000000000000000000 Q ss_pred hh---hhhHH-HhcCChhhhhhhhhhhhHHHHh--hcccccCCCCCCCcccccCCCC----------------------- Q lcl|NC_019406. 606 ST---SVAAS-RKLGDPEQAKPSKAEQAQIDAQ--QKQAAAKPVTPTPGTVQRGRPP----------------------- 656 (661) Q Consensus 606 ~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----------------------- 656 (661) ++ ...++ ..+- ..++.+...+.+++..+ +.+....|..|.+....-+.|| T Consensus 672 ~a~~~~~~a~q~a~q-a~~~~~~~~~~a~~a~~~~~~a~~~~p~~p~~~~~~~~~~~~~~~p~~p~~p~~p~~p~~~~~~ 750 (776) T protein:vir:93 672 MAIREGVGAVKDATD-AATAIAFMPELAGLSDGILRESGWDDPNTPQPASAASGMPPAPAQPAQPANPAQPPAPGQAASE 750 (776) T ss_pred cchhhhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhccccccccccccccccccCCCCCCCCCCCCCCcCCCCCCCCCCCC Confidence 00 00000 0000 00000000001111111 0000001111111000001111 Q ss_pred ----------ccCCC Q lcl|NC_019406. 657 ----------QNGAS 661 (661) Q Consensus 657 ----------~~~~~ 661 (661) +...+ T Consensus 751 ~~p~~p~~~p~~p~~ 765 (776) T protein:vir:93 751 AQPALPANPPQPPGV 765 (776) T ss_pred CCCcccCCCCCCCCC Confidence 00000 No 81 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=471 Identities=10% Similarity=-0.001 Sum_probs=198.9 Q ss_pred CCCCCCccccccccccc---cccCCccc-----cCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRG---AQQFTHLV-----VHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANY 72 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~V~~-----~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~r 72 (661) |.-.+---.=|++...- .+..+|.. .|+++......| +.+|.|... ...|. ... .....+ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~---~~~y~g~~~----~~~~~---~~~--~~~~~~ 68 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRN---LVYYQSKWD----DVQYK---NTD--GDIKSR 68 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHH---HHHhcCCcc----ccccc---ccC--cchhcc Confidence 21111000001222111 11222221 166665554444 556777311 11111 111 111111 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) . -.-.|+.+.+++.++++||.++|+|+ +++. ...+.+..++ +-+.+...++..+. T Consensus 69 ~--~~slnl~~~i~~~~A~lv~~e~~~i~-v~d~-----------------~~~~~l~~~l-----~~n~f~~~~~~~~e 123 (522) T protein:vir:47 69 P--MNHLPIARTASKKIASLVYNEQATIT-TKNE-----------------ILQKFLDDML-----TNDRFNKNFERYLE 123 (522) T ss_pred c--ceecchHHHHHHHHhhhhcCCcceee-cCCh-----------------HHHHHHHHHH-----hhcchHHHHHHHHH Confidence 1 11249999999999999999999985 3221 1111222222 34678889999999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++..|-.++.+=+. +.++.+..++|.+++=-.++. ..++.-+.+.+......... . |..+ T Consensus 124 ~a~a~G~~a~k~~~d-------~~~~~i~~v~ad~~~P~~~~~---~~~~e~a~~~~~~~~~~~~~----~-~yt~---- 184 (522) T protein:vir:47 124 SCLALGGLAMRPYID-------GDKVRVAFIQAPVFFPLESNT---QDVSSAAILTKTIKSEGRKN----V-YYTL---- 184 (522) T ss_pred HhhccCCEEEEEEEc-------CCceEEEEEcCCceEEEEEcC---CceEEEEEEEEEEeecccce----e-EEEE---- Confidence 999988555543221 123556667777765322211 11222222222221111100 0 0000 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccce------- Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVY------- 305 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~------- 305 (661) .+..+....... ....+. ..+.++.+...|+.......|..+ T Consensus 185 ---------------------lE~he~~~~~~~--------~~~~~~--~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e 233 (522) T protein:vir:47 185 ---------------------VEFHEWVTADGQ--------ETGSTN--DKKYYRITNELYRSDVNDVLGQRVNLSELDK 233 (522) T ss_pred ---------------------EEEeeecccccc--------cccccc--cCCceEEEEEEeecCCCcccCcccccccccc Confidence 000000000000 000001 111222233333332111111100 Q ss_pred ----eeccCCcccceeeEEEEecC-CCCCC----ccccchh----HHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCC Q lcl|NC_019406. 306 ----TPMVRGRTLPFIPFVFFGSM-SNAAD----CEKPPLL----DIVELNLKHYRTYAELEHGRFFTALPTYYAPELDD 372 (661) Q Consensus 306 ----~p~~~g~~L~~IPfv~~~~~-~~~~~----~~~pPLl----dLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~ 372 (661) .|...-+.+...+|+++-.. .|.-. .+.+-+. -|-.||..+-+-.-+++-+-+..-+|.-++.-..+ T Consensus 234 ~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~ 313 (522) T protein:vir:47 234 YKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQ 313 (522) T ss_pred ccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCC Confidence 11100011233334443211 11111 1222222 23355555555444444333332233322221000 Q ss_pred CC------------CceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH---HhHHhcccccCccchh Q lcl|NC_019406. 373 SD------------ASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA---IGGRLMPGMSKSVSES 437 (661) Q Consensus 373 ~~------------~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~---lGArll~~~~~~~~eT 437 (661) .. ...+.++-+. .+.++.++..+.|.= -.+.+..+++.+...+-. ++...+... ..+.+| T Consensus 314 ~~~g~~~~~~~fd~~~~~f~~~~~---~~~~~~~i~~~~~~i-r~e~~~~~~~~~l~~i~~~~gls~~tf~~~-~~~~kT 388 (522) T protein:vir:47 314 RPDGTIDFRPRFDVEQNVYMQIGG---SSMDAGGITDLTSPI-RANDYILAISEGLKLFEMQIGVSSGMFTFD-GQGMKT 388 (522) T ss_pred CCCcccccccccCcccceEeecCC---CCCCCCcceeecccc-ChHHHHHHHHHHHHHHHHHhCCCccccCcc-cccccc Confidence 00 1112222111 111223344333211 123344444444443322 222333222 345789 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC------CCCCcceEEEEeccccccc-cCC-HHHHHHHHHHH Q lcl|NC_019406. 438 DNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDI------PLTDTATLRYEIDATFLTT-ALD-ARALRAIQQLY 509 (661) Q Consensus 438 ataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~------~~~~~~~~~v~ln~DF~~~-~ld-a~~l~all~~~ 509 (661) ||+...+.+...+....+...++.||.++++.++.+... ......++.|. |... ..| ..++..+++++ T Consensus 389 AtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~----f~D~i~~D~~~~~~~~~~~v 464 (522) T protein:vir:47 389 ATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVN----LDDGVFTDRHAELDYWAKMV 464 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEE----cCCCCCCCHHHHHHHHHHHH Confidence 999999999999999999999999999998888866531 11123334444 3432 223 34688999999 Q ss_pred hcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhh Q lcl|NC_019406. 510 EGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAA 574 (661) Q Consensus 510 ~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~ 574 (661) .+|.||+++++. +.-|+ ++ .+.+++..+|+++...-..++. .+- | ++-+++.+-+-.. T Consensus 465 ~aG~~s~e~~i~--~~~g~-~e-eea~~el~ri~~E~~~~~~~~~-~~~-~-~~~~~~~~~d~~~ 522 (522) T protein:vir:47 465 AAGFSTKKRAIG--KTLNI-SG-VEAEKELNAINSELLPMNDAEL-AIY-G-MHDQNEEKADDKG 522 (522) T ss_pred hcCCCCHHHHHH--hcCCC-Ch-HHHHHHHHHHHHhhccCCCCCC-CCC-C-CCCcccccCCCCC Confidence 999999999863 33454 22 2235677788766432211111 111 1 1112211111111 No 82 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.04 E-value=4.1e-09 Score=66.57 Aligned_cols=605 Identities=15% Similarity=0.056 Sum_probs=214.1 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcc---cC--CCCCCCChHHHHHHHhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVK---YL--KAPKGFDDEDYANYLDR 75 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~---YL--Pk~~~E~~~~Y~~rl~r 75 (661) +.-+---|+|-. -+.+++-..--+ |..+.....+...+|+...+ |. =+|+.+.....+.+-.- T Consensus 3 ~~~~~~~~~~~~-----~~~~~~~~~~~~-------~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p 70 (772) T protein:vir:10 3 ITENDRQYLNGL-----PPAGDTPLTVDE-------YADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIP 70 (772) T ss_pred cchhhHHhhccC-----CcccccccCHHH-------HHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC Confidence 000000011100 111112111122 33333333444444421111 11 26777766777777777 Q ss_pred hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 76 AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 76 A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) .+.+|.++.+|+..+|.--+..+.+.-+|.. ...|. ..+---...|+.+++ -++.+.-+..+|..++ T Consensus 71 ~~~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~-----~~~d~---~~Ae~l~~~~~~~~~-----~~~~~~~~s~Af~~~i 137 (772) T protein:vir:10 71 PAVEDLIGPALLSLQGYEAVTRTDWRVTPNG-----DVGGQ---EVADALNYRLNTAER-----QSGADRACSEAFRPQI 137 (772) T ss_pred cEEEcchHHHHHHHHHHHHhcCcceEEecCC-----CchHH---HHHHHHHHHHHHHHH-----hcChHHHHHHHHHHhh Confidence 8899999999999999999999888766631 00010 011111223333333 4566778899999999 Q ss_pred hhCCEEEEEeccCCCchhhcccceeEeechhhhc-cceeeccccccceeeeeeeeeeeeccc--cccccccceeeeechh Q lcl|NC_019406. 156 AMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIV-DWTVEDVDGFYVPTRILLREFERVDEH--ATPSQQNPWIGREGSE 232 (661) Q Consensus 156 ~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~Ii-nW~~~~~~g~~~Lt~v~ire~~~~~~~--~~~~~~~~~i~~~~~e 232 (661) .+|+.|+=|++-... .+...++..+.|.+|+ ||.... +.. .-.|+.++.+...+.- .|.... ..+.. ... T Consensus 138 ~~G~Gw~e~~~~~d~---~~~~i~i~~v~p~~v~~Dp~a~~-D~s-Dar~~~~~~~~~~d~~~~~fp~~a-~~~~~-~~~ 210 (772) T protein:vir:10 138 ACGIGWVEVSRESDP---FKFPYRCRPIRRDEIHWDMKCGD-DWE-ACRFLRRQRWLSPDRIALVFPEHA-ELIGM-VGK 210 (772) T ss_pred hcCceeEEeccccCC---CCCCeEEEeeCcccceecCCCCC-CHH-HhhhhhhhccCCHHHHHHhCCCch-hHHHh-hhh Confidence 999998877764321 2223456666666642 222211 110 1112222211111000 000000 00000 000 Q ss_pred hhhcchhhh----hcchhhhhhh-hhhhheecccccCCCceeeE---------EEEE----EEeecccccce-------- Q lcl|NC_019406. 233 TAQRTSGGR----RAGLAERQGS-ARADALARPSRFTSSYTFRT---------IYRE----LILELQKDGSR-------- 286 (661) Q Consensus 233 ~vi~w~~~~----~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~---------~~rv----~~l~~g~~g~~-------- 286 (661) ..-.|+... ..+....... ..-+...... ..+.|.... +|+. .++.+ .+|.. T Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~-~~g~~~~~~~~~~ 288 (772) T protein:vir:10 211 YGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTV-QEDHWYNPTSKEICLVELWYRRWVQVHVLKS-PDGRVVEYDPNNL 288 (772) T ss_pred hcccccCcccccccccccccccccccchhhcccc-ccccccccCCceEEEEEEeeeeeeeeeeecc-CCCceEeeCcccH Confidence 000000000 0000000000 0000000000 000000000 1111 11111 11111 Q ss_pred -------------------EEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccc-cchhHHHHHHHHH Q lcl|NC_019406. 287 -------------------VYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEK-PPLLDIVELNLKH 346 (661) Q Consensus 287 -------------------~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~-pPLldLA~LNl~H 346 (661) +++|..+.....- .+...|- .+..+++|||+.+.-...+...+- -.+.|.= =.+.. T Consensus 289 ~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L--~~~~~p~-~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Q-r~~N~ 364 (772) T protein:vir:10 289 AHNIALASGRISPKKVTVSRVRRSYWLGPHCL--HDGPTPY-THRHFPYVPFFGFREDATGIPYGYVRGMKYAQ-DSLNS 364 (772) T ss_pred HHHHHHhhcccchheeeeeEEEEEEEecceee--ccCCCCC-CCCccceEEEeeeEeccCCcccchhhhhhhHH-HHHHH Confidence 1122222222111 1111111 122355555443322222211121 1122211 11112 Q ss_pred HhhhhhHHHHHHHhcCceeEEecCCCCCCcee--Eec-cccee-ecCC----CCCcceEeecCchhHHHHHHHHHHHHHH Q lcl|NC_019406. 347 YRTYAELEHGRFFTALPTYYAPELDDSDASEY--HIG-PGRVW-VVDK----ESGIPGIIEFKGEGLKTLERALNEKEQQ 418 (661) Q Consensus 347 Yq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l--~iG-s~~~~-~lp~----~ga~~~ylE~~g~~i~a~~~~L~~le~q 418 (661) |.+. +-++|.-.+ +..-.|.-+.....+ .+. +++.+ +-|. .++++.+.. ...-...+.+.|+...+. T Consensus 365 ~~S~--~~~~l~~~~--~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~ 439 (772) T protein:vir:10 365 GVSK--LRWGMSVAR--VERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKR-DYTLTDQHFQMLQDNRAT 439 (772) T ss_pred HHHH--HHHHHhccc--ccccCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccC-CccccHHHHHHHHHHHHH Confidence 2222 344554433 333344433211100 111 11122 2221 234455433 232334455555555555 Q ss_pred HHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CC--cceEE Q lcl|NC_019406. 419 IAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TD--TATLR 485 (661) Q Consensus 419 M~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~--~~~~~ 485 (661) |..+ |..--..+..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... .+ +..-. T Consensus 440 i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~ 519 (772) T protein:vir:10 440 IERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRV 519 (772) T ss_pred HHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCce Confidence 5544 42221112234456888888888887878888888888877654 777788885321 00 00111 Q ss_pred EEecc--------------ccccccC-------------CHHHHHHHHHHHhcCCCCHHHHHHHHHhcCC-CCccCCHHH Q lcl|NC_019406. 486 YEIDA--------------TFLTTAL-------------DARALRAIQQLYEGGLLPIDALYENFVKNGI-IPSTQTLEE 537 (661) Q Consensus 486 v~ln~--------------DF~~~~l-------------da~~l~all~~~~aG~Is~et~~~eL~r~gv-l~~~~~~Ee 537 (661) +.||. |.....+ ..+.+.++++++.. ++-+.. ..+...-+ +.+.-..++ T Consensus 520 v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~--~~P~~~-~~~~~~~le~~D~p~~~e 596 (772) T protein:vir:10 520 VVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKS--MPPQYQ-AAVLPFLVSLMDVPFKRD 596 (772) T ss_pred EEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhc--cChhHH-HHHHHHHHhhcCCCChHH Confidence 22331 1111111 13355666666543 222211 11100000 111223467 Q ss_pred HHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhh--hhhhhhhhHHHhc Q lcl|NC_019406. 538 FTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISA--KVGSTSVAASRKL 615 (661) Q Consensus 538 e~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 615 (661) +.++|....+....+..+. ..+++..|..+...+|++..+.+++++....+..+.+..+ ...++..++..- T Consensus 597 i~~~ir~~~~~~~peq~~~------~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~a- 669 (772) T protein:vir:10 597 VVEAIRAVDQQQTPEQIQQ------QIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQA- 669 (772) T ss_pred HHHHHHHHhccCChHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh- Confidence 7888876543322222111 1111112223334444544444444433332222221111 111111111100 Q ss_pred CChhhhhhhhhhhhHHHHh--hcccc----------cCCCCCC-----------CcccccCCCCccCCC Q lcl|NC_019406. 616 GDPEQAKPSKAEQAQIDAQ--QKQAA----------AKPVTPT-----------PGTVQRGRPPQNGAS 661 (661) Q Consensus 616 ~~~~~~~~~~~~~~~~~~~--~~~~~----------~~~~~~~-----------~~~~~~~~~~~~~~~ 661 (661) + ++.. ...+.+++..+ +++.+ +.|++.. |+.++.+.|...|.. T Consensus 670 a--~~~~-q~~q~a~~ad~~l~~~g~~~~~~~~~~~~~p~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 735 (772) T protein:vir:10 670 G--AQIA-QMPMIAPIADAVMQSAGYQRPNPAGDDPNYPIADQTAAMNIRSPYIQGQGPAAEAEAESVS 735 (772) T ss_pred h--hhHH-hhhhhhHHHHHHHHhcccccccccccCCCCCCCCCccCCCCCccCCCCCCCCCccccCCCC Confidence 1 1000 00011122211 11111 1111111 111111111111111 No 83 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.85 E-value=2.8e-08 Score=62.02 Aligned_cols=570 Identities=12% Similarity=0.049 Sum_probs=188.5 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchH--HHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGER--EIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~--~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) ||--.+++. ++-+. ...-....++....-+.|.. ..+++=..|+=...+... .=+..++ T Consensus 1 ~~k~~~~~~-----------~~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~-----~~~s~~~ 61 (705) T protein:vir:88 1 MAKRRKIKP-----------MDDEQ---VLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNER-----PGKSGIV 61 (705) T ss_pred CCccccccc-----------CCHHH---HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCccc-----CCCCccc Confidence 554433321 11111 11111122222222222321 112222234422111111 1145677 Q ss_pred cchHHHHHHHHhchhhc----cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFR----RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFr----k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) .|.+..+++.+.+.+.+ .++.+.-.|-. -.|++ .-+-...++..+-.+.+....++..+|+.+ T Consensus 62 ~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~----~~D~~---------~a~~~~~~~~~~~~~~~~~~~~~~~~~~da 128 (705) T protein:vir:88 62 SRDVQETVDWIMPSLMKVFTSGGQVVKYEPDT----AEDVE---------QAEQETEYVNYLFMRKNEGFKVMFDWFQDT 128 (705) T ss_pred cHHHHHHHHHHHHHHHHhhcCCCceEEEeeCC----hhHHH---------HHHHHHHHHhHHHhhccchhHHHHHHHHHH Confidence 88888888888876543 33333333311 01111 112223333333344556678889999999 Q ss_pred HhhCCEEEEEeccCCCch--------------------h----------------------hcccceeEeechhhhccce Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDP--------------------T----------------------APAKSYTVGYAAENIVDWT 192 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~--------------------~----------------------~g~rPY~~~~~p~~IinW~ 192 (661) |.+|.+.+=|-+-..... . ...++-+..+.|++++ T Consensus 129 l~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~--- 205 (705) T protein:vir:88 129 LMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFL--- 205 (705) T ss_pred hhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHce--- Confidence 999987665544210000 0 0011222222222221 Q ss_pred eeccccccceeeeeeeeeeeeccccccccccceeeeech---hhhhcc--hhhhhcchhhhhhh----hh----hhheec Q lcl|NC_019406. 193 VEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS---ETAQRT--SGGRRAGLAERQGS----AR----ADALAR 259 (661) Q Consensus 193 ~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~---e~vi~w--~~~~~~g~~~~~~~----~~----~~~~~~ 259 (661) . +.+...-...+|+.+... +.++.+ ....+......... .. ...... T Consensus 206 -------------------~-dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~ 265 (705) T protein:vir:88 206 -------------------V-DRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDM 265 (705) T ss_pred -------------------e-cCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhcccccccc Confidence 1 101101111122211111 111111 00000000000000 00 000000 Q ss_pred c--cccCCCceeeEEEEEEEee------cccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecC-CCCCC Q lcl|NC_019406. 260 P--SRFTSSYTFRTIYRELILE------LQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM-SNAAD 330 (661) Q Consensus 260 ~--~~~~~~~~~~~~~rv~~l~------~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~-~~~~~ 330 (661) . ............+.|.+++ ...||...+.+.++..+ ++. ...+++.+||+++... ..+.. T Consensus 266 ~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~--------~il--~~~~~~~~PF~~~~~~p~~~~~ 335 (705) T protein:vir:88 266 TGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGD--------YII--SNEPWDCRPFADLNAYRIAHKF 335 (705) T ss_pred ccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCc--------ccc--ccccCCCCCEEEecceeecCcc Confidence 0 0000000000001111111 11111111111111110 010 1135677888875422 11112 Q ss_pred ccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEecccceeecCCCCCcceEeecCchhHHHHH Q lcl|NC_019406. 331 CEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLE 409 (661) Q Consensus 331 ~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~ 409 (661) .+.++...++.+.-..=-.-+-+-++++.+..|...+. |..+. .+.+...++.++.+.. ++.+.++.++.-+ .... T Consensus 336 ~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~-~d~~~~~pg~vv~~~~-~~~i~~~~~~~~~-~~~~ 412 (705) T protein:vir:88 336 HGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNL-EDLLTNEAAGIVRVKS-MNSITPLETPQLS-GEVY 412 (705) T ss_pred ccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCc-ccccccCCCeeEEecC-CCccccccCCcCc-HHHH Confidence 34444444444443332233334577788888877763 43221 2234555566665543 3457776554332 2223 Q ss_pred HHHHHHHHHHHHH-hHHhcccc-c---CccchhHHHHHHHHHHhhHHHHHHHHHHHH-H----HHHHHHHHHHHcCCCCC Q lcl|NC_019406. 410 RALNEKEQQIAAI-GGRLMPGM-S---KSVSESDNQSALREANEQSLLLNVIMALED-G----MTSVVRYWLMFRDIPLT 479 (661) Q Consensus 410 ~~L~~le~qM~~l-GArll~~~-~---~~~~eTataa~~d~~~~~S~L~~~A~~le~-A----l~~aL~~~A~w~G~~~~ 479 (661) .-|+.+++.|..+ |..-+..+ + -.+++|+++.++-.......|..++.++.+ . +..++.++..|...+.. T Consensus 413 ~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~ 492 (705) T protein:vir:88 413 GMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEV 492 (705) T ss_pred HHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceE Confidence 3344555555533 43332221 1 123579999999999999999999998864 3 46777788888753210 Q ss_pred ----------------CcceEEEEeccccccccCCHHHHHHHHHHHhc----C----CCCHHHHHHH----HHhcCCCCc Q lcl|NC_019406. 480 ----------------DTATLRYEIDATFLTTALDARALRAIQQLYEG----G----LLPIDALYEN----FVKNGIIPS 531 (661) Q Consensus 480 ----------------~~~~~~v~ln~DF~~~~lda~~l~all~~~~a----G----~Is~et~~~e----L~r~gvl~~ 531 (661) +...+.+.+...+....-..+.+..++.+.+. + .++...++.. ++..|+-.. T Consensus 493 ~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~ 572 (705) T protein:vir:88 493 FQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDP 572 (705) T ss_pred EeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhH Confidence 00111111111111111111223344443322 1 1111111111 111111100 Q ss_pred ------cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCC-------hhhHHHHHHHhccCCCchhH Q lcl|NC_019406. 532 ------TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDAD-------FQQQELEQAERHLEIDEEKL 598 (661) Q Consensus 532 ------~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d-------~~q~~~~~~e~~~~~~~~~~ 598 (661) ....+..+.+++.+. .+.+..+.- .+ .+.++. ..+++ +++...|+|.+++..+..+. T Consensus 573 ~~~~~~~~~~e~~~~~~~~~q-----~e~~~~~~~--~~-~q~e~~--k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~ 642 (705) T protein:vir:88 573 DRFWTNPNSPEALQAKAIREQ-----KEAQPKPED--IK-AQADAQ--RAQSDALAKQAEAQMKQVEAQIRLAEIELKKQ 642 (705) T ss_pred HHHhhhhhhHHHHHHHHhhhh-----hhhhHHHHH--HH-HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000111111111000 000000000 00 000000 01111 11111122222222222211 Q ss_pred HHhhhhhhhh--hh-HHHhcC----------ChhhhhhhhhhhhHHHHhhcccccCCCCCCCcccccC Q lcl|NC_019406. 599 RISAKVGSTS--VA-ASRKLG----------DPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRG 653 (661) Q Consensus 599 ~~~~~~~~~~--~~-~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 653 (661) +....-.++. +. ..++.. ...+++..|+.+.+.+.++- |.+..|..-.|- T Consensus 643 ~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~-----~~~~k~~~~~rr 705 (705) T protein:vir:88 643 EAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKV-----PETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH-----HHHHHHHHHhcC Confidence 1111000000 00 000000 00000001111111111111 122233333333 No 84 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=98.83 E-value=3.2e-08 Score=61.64 Aligned_cols=549 Identities=10% Similarity=0.041 Sum_probs=207.1 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhc-chHHHHhCCcccCCCCCCCChHHHHH-------- Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIA-GEREIKAQGVKYLKAPKGFDDEDYAN-------- 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~-G~~~vr~~g~~YLPk~~~E~~~~Y~~-------- 71 (661) ||+- |---|++ +| ...|--=.-.+.+|+..++... -...|+.+.+.|..+ .+..+.|.. T Consensus 3 ~~~~-~~~~~~~-------~~--~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~--~~~~~y~~~~~~~~~~~ 70 (651) T protein:vir:80 3 LATT-TTDKNRQ-------TY--DETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLST--PEAQDYLRDQVLRSVGD 70 (651) T ss_pred cccc-ccchhhh-------hh--hhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhccc--HHHHHhhccccccccCC Confidence 5542 2222332 11 1122222223445555555432 123344433333332 111111111 Q ss_pred ---HHhhhcccchHHHHHHHHhchhhcc----CccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHH Q lcl|NC_019406. 72 ---YLDRAAFYNMTSQTQAGMVGQIFRR----PPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQ 144 (661) Q Consensus 72 ---rl~rA~~~n~~~~tv~~l~G~vFrk----~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~ 144 (661) .-+..++.|.+..+++.+...+++. +..++-.| ..|.|.+ ..+-+.+..+.. +- +.-.++. T Consensus 71 ~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p------~~~~d~a--~~~~~~~~~~~~---~~-l~~~~~~ 138 (651) T protein:vir:80 71 VNADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVP------AKPGQDN--LLVSRLIKRYVQ---DK-LTEGKFR 138 (651) T ss_pred CCCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEecc------CCchhHH--HHHHHHHHHHHH---HH-hhccCcH Confidence 1123578899999998877766553 33333222 1233321 112222333322 11 2345688 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCC----------Cchhhc---------------ccceeEeechhhhccceeeccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPS----------SDPTAP---------------AKSYTVGYAAENIVDWTVEDVDGF 199 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a----------~~~~~g---------------~rPY~~~~~p~~IinW~~~~~~g~ 199 (661) .....++..++.+|.+.+=|-+-.. +....+ ..|.+-.++|.+++ |... ..+. T Consensus 139 ~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~-~dp~-a~~~ 216 (651) T protein:vir:80 139 AAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCF-YDPN-VTDP 216 (651) T ss_pred HHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHee-ecCC-CcCc Confidence 8888999999999987765432110 000001 22333444444432 2110 0111 Q ss_pred cceeeeeeeeeeeeccccccccccceeeeec--hhhhhcch----h---hhhcchhhhhhhhh---hhheecccccC--- Q lcl|NC_019406. 200 YVPTRILLREFERVDEHATPSQQNPWIGREG--SETAQRTS----G---GRRAGLAERQGSAR---ADALARPSRFT--- 264 (661) Q Consensus 200 ~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~--~e~vi~w~----~---~~~~g~~~~~~~~~---~~~~~~~~~~~--- 264 (661) . ..+|+.+.. ...+.+-- . ....-+........ .+........+ T Consensus 217 ~---------------------d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 275 (651) T protein:vir:80 217 N---------------------RGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSL 275 (651) T ss_pred c---------------------ccceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccc Confidence 1 122222211 11111000 0 00000000000000 00000000000 Q ss_pred ----CCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc-ccceeeEEEEe-cCCCCCCccccchhH Q lcl|NC_019406. 265 ----SSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR-TLPFIPFVFFG-SMSNAADCEKPPLLD 338 (661) Q Consensus 265 ----~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~-~L~~IPfv~~~-~~~~~~~~~~pPLld 338 (661) .....-++|- .+.. +|...+.+++...+. .+.....+ .++..||+++. -...+...+..|... T Consensus 276 ~~~~~~v~v~E~~~--~~d~--e~~~~~~~~v~~~g~-------~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~ 344 (651) T protein:vir:80 276 WSPHQNVELLEYWG--DIHL--ENKTYHDVVVTIMGN-------EVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQP 344 (651) T ss_pred cccccceEEEEEEE--Eeec--cCCceEEEEEEEcCc-------EEecccccCCCCCCCeeeecceecCccccCCChHHH Confidence 0000111111 1111 122222222111111 11111122 23445665432 223334456777776 Q ss_pred HHHHHHHHHhhhhhHHHHHHHhcCceeEEe--cCCCCCCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHH Q lcl|NC_019406. 339 IVELNLKHYRTYAELEHGRFFTALPTYYAP--ELDDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKE 416 (661) Q Consensus 339 LA~LNl~HYq~sSDl~~il~~~~~P~l~i~--Gl~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le 416 (661) +....-..=......-+.++.+..|...+. |+... ..+..+++.++.... .+++..+.+....+......|+.++ T Consensus 345 ~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~--~~l~~~pg~vi~~~~-~~~~~~l~~~~~~~~~~~~~l~~l~ 421 (651) T protein:vir:80 345 NLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQP--EDVYTEPGKVFLVSD-HGDLQPLANQSSNFSITYQESSFLE 421 (651) T ss_pred HhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccH--HHhhcCCCceEEecC-CCCceeeccCcccchhHHHHHHHHH Confidence 666555555555567778888888887764 33332 235667888876654 4567777776556666677788888 Q ss_pred HHHHHH-hHHhcccc---cCccchhHHHHHHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCCCCC------C- Q lcl|NC_019406. 417 QQIAAI-GGRLMPGM---SKSVSESDNQSALREANEQSLLLNVIMALEDG-----MTSVVRYWLMFRDIPLT------D- 480 (661) Q Consensus 417 ~qM~~l-GArll~~~---~~~~~eTataa~~d~~~~~S~L~~~A~~le~A-----l~~aL~~~A~w~G~~~~------~- 480 (661) ..|..+ |...+..+ ....+.||++.+.........|..++.+++.. ++.+|.++.++.-.+.. . T Consensus 422 ~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~ 501 (651) T protein:vir:80 422 STIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEA 501 (651) T ss_pred HHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeeccccc Confidence 877643 44322111 11235699999999999999999999999874 35666777666532210 0 Q ss_pred cceEEE-----EeccccccccCCHH-------HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCC---HHHHHHHHhcc Q lcl|NC_019406. 481 TATLRY-----EIDATFLTTALDAR-------ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQT---LEEFTIKMNDP 545 (661) Q Consensus 481 ~~~~~v-----~ln~DF~~~~lda~-------~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~---~Eee~~~l~~~ 545 (661) .....+ .+.-+|....+.+. .+..++.++ +-.+-.+.... +......|.+. T Consensus 502 ~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~--------------q~~~~~p~~~~~~~~~~~~~~l~~~ 567 (651) T protein:vir:80 502 GAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFI--------------QAVAQVPEMGQLVDYKRILVDLLQH 567 (651) T ss_pred ccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHH--------------HhhccCCccchhhhHHHHHHHHHHH Confidence 000000 11222322222221 122222222 22222222111 12222222221 Q ss_pred CCCCCCchhhhhhcCCccccCCCcch-hhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH-HHhcC---Chhh Q lcl|NC_019406. 546 KSFIGQPDAIAMRRGYVSRQQELDQQ-RAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA-SRKLG---DPEQ 620 (661) Q Consensus 546 ~~~l~~ddae~~~~g~~~~~~~~~q~-~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~ 620 (661) -+++.++. .. ....+..++. +.+.. .+.|.+..+- +....+..+.+ .++++..+ +.|.- .-+| T Consensus 568 -~g~~~~~~--~l---~~~~q~~~~~~~~~~~--~q~~~~~~~a---~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 635 (651) T protein:vir:80 568 -WGFEEPEA--YL---KQQDQQAPANPQEALL--SQAKDVGGQA---MSNMLQNQLQA-DGGTQMMSEMYGTPNADQMQQ 635 (651) T ss_pred -cCCCCcHH--hc---CCCccchhhhhhHHHH--hhHHHHHHHH---HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Confidence 11222221 11 1111111111 11111 1111111110 00000000000 01111110 11100 0011 Q ss_pred hhhhhhhhhHHHHhhccccc Q lcl|NC_019406. 621 AKPSKAEQAQIDAQQKQAAA 640 (661) Q Consensus 621 ~~~~~~~~~~~~~~~~~~~~ 640 (661) + ..+..-+.++.+=+- T Consensus 636 ~----~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 636 E----LMATTPNVSEQQLTQ 651 (651) T ss_pred H----HHHHHHHHHHhhccC Confidence 1 111111111111111 No 85 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.60 E-value=2.1e-07 Score=57.17 Aligned_cols=570 Identities=12% Similarity=0.064 Sum_probs=188.0 Q ss_pred CCCCCCccccccccc-cccccCC----ccccCHHHHHHH-------HHHHHHHHHhcchHHHHhCCcccCCCCCC---CC Q lcl|NC_019406. 1 MAGLSPNSANIRRTK-RGAQQFT----HLVVHPEYEYYR-------PDWAKIRDAIAGEREIKAQGVKYLKAPKG---FD 65 (661) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~----V~~~hPey~a~~-------~~W~~irD~~~G~~~vr~~g~~YLPk~~~---E~ 65 (661) |.=-.|- -|-.-+ ++.-.+. .+..+-.|..+. ++|+-+.+.|.+....+. -++|+... .. T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~---~~~~~~~~~~~~~ 75 (641) T protein:vir:94 1 MTIEMPT--PIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQ---NTRARNFQTTGAD 75 (641) T ss_pred CccCCCc--ccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhh---hcccccccccccc Confidence 1111110 010000 0000010 111122222222 356544333333222211 12344322 11 Q ss_pred hHHHHHHHhhhcccchHHHHHHHHhchhhcc----CccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCC Q lcl|NC_019406. 66 DEDYANYLDRAAFYNMTSQTQAGMVGQIFRR----PPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGT 141 (661) Q Consensus 66 ~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk----~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~ 141 (661) ...++ ..+|-+-..++++.|+..+++- ++-++-.|.. -.|++.+ +-+..+..+. +.-+ T Consensus 76 ~~~~r----~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~----~ed~~~A---------~~~~~~~~~~-l~~~ 137 (641) T protein:vir:94 76 DADWR----HRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMV----PELADAA---------RVVKQLTKTK-LEAA 137 (641) T ss_pred hhccc----ccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCC----CChHHHH---------HHHHHHHHHH-Hhhc Confidence 12222 2466666666666666555442 1111211111 1122221 1111111111 1122 Q ss_pred CHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhh-ccceeeccccccceeeeeeee---eeeecccc Q lcl|NC_019406. 142 SHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENI-VDWTVEDVDGFYVPTRILLRE---FERVDEHA 217 (661) Q Consensus 142 sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~I-inW~~~~~~g~~~Lt~v~ire---~~~~~~~~ 217 (661) ++..-+..++++++.+|.+.+-|.+-..-.. ..... .+...++ -+|....+-. ....+++.- .....+.. T Consensus 138 ~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~-~~~~~---~~~~~~~~~~~~~~~v~~--~~~~~r~~~v~~~di~~dps 211 (641) T protein:vir:94 138 SIRDIFETYVRNLVLYGVSTYRLGWDTSMER-QFKRT---FVETGDIFGGWEDVAVNR--QRSELRIEPLSPYDVWLDTS 211 (641) T ss_pred chHHHHHHHHHHHhhcCceEEEeehhhHHHH-hhhhh---cccchhhcccccccceec--ccceeeEEecchhheeecCC Confidence 3344446899999999988887775321000 00000 0000010 1121111100 011111110 00000000 Q ss_pred ccccccceeeeechh-hhhcchhhhhcchhhhhhhhhhhheecc-----cccCCCceeeEEEEEEEe--ecccccceEEE Q lcl|NC_019406. 218 TPSQQNPWIGREGSE-TAQRTSGGRRAGLAERQGSARADALARP-----SRFTSSYTFRTIYRELIL--ELQKDGSRVYK 289 (661) Q Consensus 218 ~~~~~~~~i~~~~~e-~vi~w~~~~~~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~rv~~l--~~g~~g~~~~~ 289 (661) .......++++.... .+..-......++.... ......+. ...+..+.....|++... ....+|...|. T Consensus 212 ~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~---~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~ 288 (641) T protein:vir:94 212 GGKNTGTFVRLRHTREELHELVTSGYYDLDLTQ---VEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWC 288 (641) T ss_pred CCcccccceehhhhHHHHHHHHhcCCCChhhcc---hhhcccccccccccccccccccccccceeeeeeeeccCCCceee Confidence 000000011111000 01000000000000000 00000000 000000011111221111 11112222333 Q ss_pred EEEEecCcccccccceeeccCCc-ccceeeEEEEecC-CCCCCccccchhHHHH-HHHHHHhhhhhHHHHHHHhcCceeE Q lcl|NC_019406. 290 QFVYVEDPLGQARDVYTPMVRGR-TLPFIPFVFFGSM-SNAADCEKPPLLDIVE-LNLKHYRTYAELEHGRFFTALPTYY 366 (661) Q Consensus 290 ~~~~~~~~~~~~~~~~~p~~~g~-~L~~IPfv~~~~~-~~~~~~~~pPLldLA~-LNl~HYq~sSDl~~il~~~~~P~l~ 366 (661) +++...+ ..+....|+ .++..||+++.-. ..+-..+.+|..++.. +...---..+-+++ ++.+..|.+. T Consensus 289 ~~~~~~g-------~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~-~~~~~~p~~~ 360 (641) T protein:vir:94 289 VHAVFYG-------KQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDN-LVLHINKMWT 360 (641) T ss_pred EEEEEeC-------CEEeecccccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHH-HHHHhCCeee Confidence 3222211 122223333 3556788765422 3333445666432222 11111112222333 4445556554 Q ss_pred E-e-cCCCCCCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhc-ccc--cCccchhHHH Q lcl|NC_019406. 367 A-P-ELDDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLM-PGM--SKSVSESDNQ 440 (661) Q Consensus 367 i-~-Gl~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll-~~~--~~~~~eTata 440 (661) + . |... ...+..+|+..+.... .+..+++.+....+......++.++..+... +...+ ... ..+.+.||++ T Consensus 361 ~~~~~~~~--~~~l~~~PG~ii~~~~-~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtE 437 (641) T protein:vir:94 361 LVEDGILK--REDVKAKPGAVFKVAQ-HGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAE 437 (641) T ss_pred eccccccc--cceeeccCCcceeeCC-CCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHH Confidence 4 2 2222 2347889998876654 4457777654444555566666666666533 33222 111 1122469999 Q ss_pred HHHHHHHhhHHHHHHHHHHHH-----HHHHHHHHHHHHcCCC----------------CCCcceEEEEeccccccccC-- Q lcl|NC_019406. 441 SALREANEQSLLLNVIMALED-----GMTSVVRYWLMFRDIP----------------LTDTATLRYEIDATFLTTAL-- 497 (661) Q Consensus 441 a~~d~~~~~S~L~~~A~~le~-----Al~~aL~~~A~w~G~~----------------~~~~~~~~v~ln~DF~~~~l-- 497 (661) .+.........|..+++++++ .++.++.++.+..-.+ .....+++++ -++..... T Consensus 438 V~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~--~~iv~l~~~q 515 (641) T protein:vir:94 438 IQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYP--YKFLALGANY 515 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeee--eeEeecchhH Confidence 999999999999999999985 4444555554432111 1112233221 12221111 Q ss_pred ---CHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCC---ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcch Q lcl|NC_019406. 498 ---DARALRAIQQLYEGGLLPIDALYENFVKNGIIP---STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQ 571 (661) Q Consensus 498 ---da~~l~all~~~~aG~Is~et~~~eL~r~gvl~---~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~ 571 (661) .++.+..|+.+++ -.+..| +..++......+.+.. +++.+..- .+.++.++. T Consensus 516 ~~~~~~~i~~l~~~~~--------------~~a~~P~v~d~~d~~~~~~~~~~~~-g~~~p~~~-------ir~~~~~~~ 573 (641) T protein:vir:94 516 VVERERMVTDLLQLLD--------------ISGRVPQIGQSLDYALILEDLLRQM-RFTDPMRY-------IKKAEAPPA 573 (641) T ss_pred HHHHHHHHHHHHHHHH--------------HhhcChhhhhcCCHHHHHHHHHHHh-CCCCchhh-------ccCccCchh Confidence 1112333333332 222211 2233444334443321 23433331 112211111 Q ss_pred hhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhh--hhhhhhhhHHHHhhcccccCCCCCCCcc Q lcl|NC_019406. 572 RAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQA--KPSKAEQAQIDAQQKQAAAKPVTPTPGT 649 (661) Q Consensus 572 ~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 649 (661) ++..+.||-+.++ .+.++.+|...++-+.+-=.+++. -+.++.-+.++. -+|+.| .+||.. T Consensus 574 ---------~~~~~~~~~q~~~----~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~ 636 (641) T protein:vir:94 574 ---------APPIAPAEPGALP----PEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDV-APEAMA---AATQQI 636 (641) T ss_pred ---------HHHHHHHHHHHHH----HHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhh-hHHHHh---cccccc Confidence 1111112211111 122222222222222110112222 222222222211 223333 223321 Q ss_pred cccCCC Q lcl|NC_019406. 650 VQRGRP 655 (661) Q Consensus 650 ~~~~~~ 655 (661) . .|.- T Consensus 637 ~-~~~~ 641 (641) T protein:vir:94 637 T-SGAL 641 (641) T ss_pred c-ccCC Confidence 1 1111 No 86 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.33 E-value=1.3e-06 Score=52.88 Aligned_cols=570 Identities=13% Similarity=0.090 Sum_probs=246.3 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHH-HHHHhcchHHHHhCCc----ccCCCCCCCChHHHHHHHhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAK-IRDAIAGEREIKAQGV----KYLKAPKGFDDEDYANYLDR 75 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~-irD~~~G~~~vr~~g~----~YLPk~~~E~~~~Y~~rl~r 75 (661) |+ .++..=-+-||+- ...+|.. |...-.+-..|+.++. .|+=-....+... T Consensus 1 m~---------------~~~~~~~~~tpe~--la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~------- 56 (663) T protein:vir:34 1 MN---------------ESQPTDFADTPQG--WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAE------- 56 (663) T ss_pred CC---------------ccccccchhcchh--HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCccc------- Confidence 21 1122223345755 4666754 5555555444444332 2321111111111 Q ss_pred hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhh-----ccCCCCCHHHHHHHH Q lcl|NC_019406. 76 AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQR-----FAKDGTSHQGFAKTV 150 (661) Q Consensus 76 A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~-----~dl~G~sL~~fa~~~ 150 (661) .-||.+-.++.+|.=-|++++|.++ | ..-..|+|+-+- +.+.+.+++ +..+-..|+.-|+.+ T Consensus 57 -~r~nl~~sni~~i~P~iYar~P~p~-V----~~rf~d~d~~~~-------r~ase~leR~~~~~~~~D~~~l~~~~~~~ 123 (663) T protein:vir:34 57 -TRWNLFSTNIQTQMASLYGQTPKVS-V----SRRFADADDDVA-------RVASELLERLLNTDIEKDSDTFQQALEYA 123 (663) T ss_pred -cccchhhhhHHHHhhhhhcCCCcce-e----eecccCcccchh-------hhHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 1369999999999999999999884 3 334556553222 233333443 222345699999999 Q ss_pred HHHHHhhCCEEEEEeccCC-----------CchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccc Q lcl|NC_019406. 151 ALEQVAMGRFGALVDVAPS-----------SDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATP 219 (661) Q Consensus 151 ~~~~L~~Gr~gvLVD~P~a-----------~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~ 219 (661) .+..|.+||.-+=|=|-.. ++.+. .-| .....+.+++-|..-. +-+|..+-+.......+ T Consensus 124 v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~~-~~~-a~~~~~~e~~a~E~v~------id~v~~~dfl~~pAr~W- 194 (663) T protein:vir:34 124 LQDRLLPGFGLCRIRYEVEWEEVAGVDAILDEATG-AEL-AAAVPPTQRKAYECVE------TDYLHWQDVLWSPARVW- 194 (663) T ss_pred HHhhhccccceEEEEeecccchhccccccCCCccc-cch-hcccccchhhccccee------eeeechhhcccchhhcc- Confidence 9999999988888888220 00000 000 0001122222221110 11111111111100001 Q ss_pred ccccceeeeec--hhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc Q lcl|NC_019406. 220 SQQNPWIGREG--SETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP 297 (661) Q Consensus 220 ~~~~~~i~~~~--~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~ 297 (661) .-.+||+... ......-+.+.-...... ........+....++-...... ..-.|.+|.+.. T Consensus 195 -~ev~wva~r~~mtk~e~~~rf~~~~~~~~~---a~~~~~~~~~~~~~~~~~~~~~------------~a~VwEIWdK~~ 258 (663) T protein:vir:34 195 -HEVRWLAFRNLLDMREFNARFDADGSRNLW---ASVPKVGKPKDGKDGQSCHPWD------------RAEVWEIWDKGG 258 (663) T ss_pred -ccccceeeeccCCHHHHHHhhcCChhhhhh---hhccCcCCccccCCCCCcchhc------------CcceeEEEecCC Confidence 0112222111 111111111000000000 0000000000000000000000 111222222211 Q ss_pred --cccc--cc----ceeeccCCccc-ceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe Q lcl|NC_019406. 298 --LGQA--RD----VYTPMVRGRTL-PFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP 368 (661) Q Consensus 298 --~~~~--~~----~~~p~~~g~~L-~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~ 368 (661) .++. |. .+-|..-|..+ --.||..++...++-.+..|++. |..=-.+--+...+--+.|..+--|.-+++ T Consensus 259 ~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~~~ds~ipvpd~~-~y~~~~~E~n~~t~Rin~l~d~ikv~gvy~ 337 (663) T protein:vir:34 259 RKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANWTTDKVVPRPDFV-LAQDLYKEIDLVSTRITLLERAIRVVGVYD 337 (663) T ss_pred cEEEEEEcCcceecccCCCCCCCCCCCCCcccccceecCCCeecCCcHH-HHHHHHHHHHHHHHHHHHHHhhhhhceeec Confidence 1111 11 11222223221 22688888888888888777766 544444455666666666666666666664 Q ss_pred -cCCC--------CCCceeEecccceeecCC-CCC---cceEeec--CchhHHHHHHHHHHHHHHHHHH-hHHhcccccC Q lcl|NC_019406. 369 -ELDD--------SDASEYHIGPGRVWVVDK-ESG---IPGIIEF--KGEGLKTLERALNEKEQQIAAI-GGRLMPGMSK 432 (661) Q Consensus 369 -Gl~~--------~~~~~l~iGs~~~~~lp~-~ga---~~~ylE~--~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~ 432 (661) |... ...+.| .+=..|.... .|+ ...|+-. --..|..+..+=..+..-.+++ |..=+.++.- T Consensus 338 ~~~g~~i~~~l~~a~~n~l--vpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~ 415 (663) T protein:vir:34 338 KSSGLTIGRLLSEAAQNDL--IPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGAS 415 (663) T ss_pred cccchhHHHHHHHhhCCCc--eecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhccc Confidence 2211 111211 1111121111 122 1233322 2244555544444444444433 4332334455 Q ss_pred ccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc-------------CCCCC---------------CcceE Q lcl|NC_019406. 433 SVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFR-------------DIPLT---------------DTATL 484 (661) Q Consensus 433 ~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~-------------G~~~~---------------~~~~~ 484 (661) ..+||+++..+-...-.-.|+-+...|++....+.++.|+.+ |.... .-.-+ T Consensus 416 ~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~ 495 (663) T protein:vir:34 416 DPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMY 495 (663) T ss_pred CcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcce Confidence 678999999999988888899999999999999999998876 32221 11223 Q ss_pred EEEeccccccccCCH----HH-------HHHHH----HHHhcCCCCHHHHHHHHHhcCC--CCccCCHHHHHHHHhccCC Q lcl|NC_019406. 485 RYEIDATFLTTALDA----RA-------LRAIQ----QLYEGGLLPIDALYENFVKNGI--IPSTQTLEEFTIKMNDPKS 547 (661) Q Consensus 485 ~v~ln~DF~~~~lda----~~-------l~all----~~~~aG~Is~et~~~eL~r~gv--l~~~~~~Eee~~~l~~~~~ 547 (661) ++.|-.|=... .|. +. +.+++ -+.+.+-.... +..+|.+-.+ +....+.|..++++... T Consensus 496 ~ldIe~dsT~~-~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p-~l~Ellk~~~~~f~~~~qie~ai~~~~~~-- 571 (663) T protein:vir:34 496 RVEVKPEAVSL-QDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAP-FLLQMLKWSVSGLRGSSTIEGVLDKAIAA-- 571 (663) T ss_pred eeeeccCCCCc-CChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHH-HHHHHHHHHhhcCChhhhHHHHHHHHHhh-- Confidence 34443321111 111 11 11122 22244444444 4455555433 34456667777777652 Q ss_pred CCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhh Q lcl|NC_019406. 548 FIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAE 627 (661) Q Consensus 548 ~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 627 (661) ++.+.+.+ +..+..++.+|...+.++ -+..++.||.+++.|...+++..+.++..... .++++..|.+ T Consensus 572 ---~e~aa~~~-~~~~pa~~~~~~k~~~~q--~k~q~~~aeAq~e~q~~~~~~ql~~~~~~~k~------~~~a~~~~~~ 639 (663) T protein:vir:34 572 ---AEEAQKQA-AQQSPAPQQPDPKVVAQA--MKGQQEMAKVQAEVQGDLLRIQAETQANETKE------RQQAEWNVRE 639 (663) T ss_pred ---hHHHhhcc-CCCCcccchhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHH Confidence 23333322 333444444444332232 23344555655555554444333222222111 1233444445 Q ss_pred hhHHHHhhcccccCCCCCCCcccccCCC Q lcl|NC_019406. 628 QAQIDAQQKQAAAKPVTPTPGTVQRGRP 655 (661) Q Consensus 628 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 655 (661) .++....+.++-++-+..--| |-| T Consensus 640 a~q~~~~~~~~r~~~~~a~~~----~~~ 663 (663) T protein:vir:34 640 AAQKNLISQAARAMNPQARNG----GMP 663 (663) T ss_pred HHHhhHHHHHHHhhchhhhcC----CCC Confidence 555544444443333222222 222 No 87 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.20 E-value=2.7e-06 Score=51.10 Aligned_cols=582 Identities=10% Similarity=0.023 Sum_probs=180.9 Q ss_pred CCcc-ccCHHHHHHHHHHHHHHHHhcchHHHHhCC---cccC--CCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhh Q lcl|NC_019406. 21 FTHL-VVHPEYEYYRPDWAKIRDAIAGEREIKAQG---VKYL--KAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIF 94 (661) Q Consensus 21 ~~V~-~~hPey~a~~~~W~~irD~~~G~~~vr~~g---~~YL--Pk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vF 94 (661) |.=+ ..|- ..+.. ++.++.....+|+.. ..|. =+|+.+.....+. ..|=+ +|.++++|+.++|.-= T Consensus 1 m~d~~~~~~---~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~-q~rp~-~N~i~~~v~~v~g~e~ 72 (725) T protein:vir:10 1 MADNENRLE---SILSR---FDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMR 72 (725) T ss_pred CCchHHHHH---HHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh-cCCCc-ccchHHHHHHHHhhHH Confidence 2111 1121 11111 112222222222100 0111 1666555554433 33434 5999999999999988 Q ss_pred ccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEE--EeccCCCch Q lcl|NC_019406. 95 RRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGAL--VDVAPSSDP 172 (661) Q Consensus 95 rk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvL--VD~P~a~~~ 172 (661) +..+.+.-+|..- .|. ..+---...|+.+++ -++.+.-..++|..++.+|.+|+= .||...+.. T Consensus 73 ~nr~d~~v~p~~~----~d~-----~~Ae~l~~~~~~~~~-----~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~ 138 (725) T protein:vir:10 73 QNPIDVLYRPKDG----ASP-----DAADVLMGMYRTDMR-----HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) T ss_pred hCCcceEEecCCc----chH-----HHHHHHHHHHHHHHH-----hcCcchHHhHHHHHHhhcCcceeeeeccccCCCCC Confidence 8777776555421 111 111111223333333 345666688999999999999954 477432221 Q ss_pred h--hcccceeEeechhhh-ccceeeccccccceeeeeeeeeeeecc-ccccccccceeeeechhhhhcchhhhhcchh-- Q lcl|NC_019406. 173 T--APAKSYTVGYAAENI-VDWTVEDVDGFYVPTRILLREFERVDE-HATPSQQNPWIGREGSETAQRTSGGRRAGLA-- 246 (661) Q Consensus 173 ~--~g~rPY~~~~~p~~I-inW~~~~~~g~~~Lt~v~ire~~~~~~-~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~-- 246 (661) . ..++-+.+...+.+| +||.....+... -.|+.++.+..... +.+. .-|. ..+..+..|.......-. T Consensus 139 ~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sD-ar~~~~~~~~~~~~~~~~~---~~~~--~~a~~~~~~~~~~~~~~~~~ 212 (725) T protein:vir:10 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSD-ARHCTVIHSMSQNGWDDFA---EKYD--LDADNIPSFQNPNDWVFPWL 212 (725) T ss_pred CCceeeeeeecccCHhHcccCchhhccChhh-hhhhhhhccCCHHHHHHHH---HhCC--Cccccccccccccccccccc Confidence 1 112222222344554 444433332211 12333333332110 0000 0000 000011111100000000 Q ss_pred hhhhhhhhhheecccccCCCceeeEEE--EEEEeecccccceE-----------------------------EEEEEEec Q lcl|NC_019406. 247 ERQGSARADALARPSRFTSSYTFRTIY--RELILELQKDGSRV-----------------------------YKQFVYVE 295 (661) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--rv~~l~~g~~g~~~-----------------------------~~~~~~~~ 295 (661) .-..+++.++++ +... ++..+..+..|..+ ++++.+.- T Consensus 213 ~~~~vrv~E~~~-----------r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~ 281 (725) T protein:vir:10 213 TQDTIQIAEFYE-----------VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII 281 (725) T ss_pred CCCeEEEEEEEE-----------EEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEee Confidence 000011111111 1100 00111111111100 01110000 Q ss_pred CcccccccceeeccCCcccceeeEEEEecC----CCCC---CccccchhHHHHHHHHHHhhhhhHHHHHH-HhcCceeEE Q lcl|NC_019406. 296 DPLGQARDVYTPMVRGRTLPFIPFVFFGSM----SNAA---DCEKPPLLDIVELNLKHYRTYAELEHGRF-FTALPTYYA 367 (661) Q Consensus 296 ~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~----~~~~---~~~~pPLldLA~LNl~HYq~sSDl~~il~-~~~~P~l~i 367 (661) . |....+..+...| ++||||+|... ++.+ ..-. ++.|.=. .+. ++.|.-+ +++. ....+..+- T Consensus 282 ~--g~~~l~~~~~~~~---~~fP~vP~~g~r~~~~g~~~~~G~vr-~~kd~Q~-~~N-~~~s~~~-~~~~~~~~~~~~~~ 352 (725) T protein:vir:10 282 T--CTAVLKDKQLIAG---EHIPIVPVFGEWGFVEDKEVYEGVVR-LTKDGQR-LRN-MIMSFNA-DIVARTPKKKPFFW 352 (725) T ss_pred c--chhhhcCCCCCCC---CceeEEEEEeeeeccCCcceeeeeec-cchhHHH-HHH-HHHHHHH-HHHHhcCCcccccc Confidence 0 0000000001122 33555543222 1111 1101 1111111 111 2222222 3332 222222222 Q ss_pred ecCCCCCCceeEecccceee----cCC-C----CCcceEeecCchhHHHHHHHHHHHHHHHHH-HhH--HhcccccCccc Q lcl|NC_019406. 368 PELDDSDASEYHIGPGRVWV----VDK-E----SGIPGIIEFKGEGLKTLERALNEKEQQIAA-IGG--RLMPGMSKSVS 435 (661) Q Consensus 368 ~Gl~~~~~~~l~iGs~~~~~----lp~-~----ga~~~ylE~~g~~i~a~~~~L~~le~qM~~-lGA--rll~~~~~~~~ 435 (661) .|.-+.......-..+..+. .+. . ...+.+.++..- ...+.+-|+.....|.. .|. .++ +..+.+ T Consensus 353 ~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~-p~~~~~ll~~~~~~i~~~tGi~~~~l--G~~~n~ 429 (725) T protein:vir:10 353 PEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEV-PQANAYMLEAATAAVKEVATLGVDAE--AVNGGQ 429 (725) T ss_pred HhhhhHHHHHHhccCCceeeecccccccCcccccccCcccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHh--CcCchh Confidence 22111000000000000010 000 0 123444443221 23334444545554543 342 333 223445 Q ss_pred hhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEeccc-------------- Q lcl|NC_019406. 436 ESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDAT-------------- 491 (661) Q Consensus 436 eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~D-------------- 491 (661) .|+.+...+..+....|+.+-.|+..+...+ |.++..+++..- .++..-.|.||.. T Consensus 430 ~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Nd 509 (725) T protein:vir:10 430 VAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLND 509 (725) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhc Confidence 6888888888888888888888888887765 677777774321 1111112333321 Q ss_pred ----ccc--------ccCCHHHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCC--chh-h Q lcl|NC_019406. 492 ----FLT--------TALDARALRAIQQLYEGGL-LPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQ--PDA-I 555 (661) Q Consensus 492 ----F~~--------~~lda~~l~all~~~~aG~-Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~--dda-e 555 (661) |.. .....+.+.+|++++..-- +.-. .-..|-..--+++.-..+++.++|..+.+..+. ++. + T Consensus 510 i~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~-~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e 588 (725) T protein:vir:10 510 IRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPE-YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPE 588 (725) T ss_pred cccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchh-HHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccc Confidence 110 0011234555556654311 1100 000010000122333346677777655433221 110 0 Q ss_pred hhhcCCccccCCCcchh---h-------hhcCChhhHHHHHHHhc--cCCCchhHHHhh-hhhh-hhhh----------- Q lcl|NC_019406. 556 AMRRGYVSRQQELDQQR---A-------ARDADFQQQELEQAERH--LEIDEEKLRISA-KVGS-TSVA----------- 610 (661) Q Consensus 556 ~~~~g~~~~~~~~~q~~---~-------~~e~d~~q~~~~~~e~~--~~~~~~~~~~~~-~~~~-~~~~----------- 610 (661) .++.-....+.+..|.+ . ..++|+++.+-|..+.+ ++-.+.++++.+ +.++ ..+. T Consensus 589 ~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~ 668 (725) T protein:vir:10 589 EQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREF 668 (725) T ss_pred hhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHH Confidence 00000000000000000 0 11111111111111100 000111111111 0000 0000 Q ss_pred ------HHHhc-CChhhhhh-hhhhhhHHHHhhcccccCCC----CCCC-cccccCCCCc Q lcl|NC_019406. 611 ------ASRKL-GDPEQAKP-SKAEQAQIDAQQKQAAAKPV----TPTP-GTVQRGRPPQ 657 (661) Q Consensus 611 ------~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~ 657 (661) .+.+. ++....++ .+..+.+..+|+... .|.. ..+| +.| +-.|+ T Consensus 669 ~~~~~~~q~~~~~~~~~~ae~~~~~~~~~~~~~~~~-~~~~~~q~~~~~~~~~--~~~~~ 725 (725) T protein:vir:10 669 LKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDI-ANILQSQRQNQPSGSV--AETPQ 725 (725) T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHhhh-hhccccccccCCCccc--ccCCC Confidence 00000 00000000 000111111111100 1111 0111 111 11112 No 88 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.13 E-value=4e-06 Score=50.17 Aligned_cols=584 Identities=10% Similarity=-0.009 Sum_probs=207.4 Q ss_pred ccccCHHHHHHHHHHHHHHHHhcchHHHHhCC---cccC--C--CCCCCChHHHHHHHh----hhcccchHHHHHHHHhc Q lcl|NC_019406. 23 HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQG---VKYL--K--APKGFDDEDYANYLD----RAAFYNMTSQTQAGMVG 91 (661) Q Consensus 23 V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g---~~YL--P--k~~~E~~~~Y~~rl~----rA~~~n~~~~tv~~l~G 91 (661) -+.++-+... ..+..++.+......+|... ..|. . +|+.+.....+.+-+ -.+-+|.++.+|+..+| T Consensus 1 m~e~~~~~~~--~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g 78 (706) T protein:vir:10 1 MAESRQKQHE--RVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIIS 78 (706) T ss_pred CCcchHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhh Confidence 1222332221 22333444444433333221 1222 1 677776655554433 26788999999999999 Q ss_pred hhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEE--EEEeccCC Q lcl|NC_019406. 92 QIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFG--ALVDVAPS 169 (661) Q Consensus 92 ~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~g--vLVD~P~a 169 (661) .--+..+.+.-.|.. ...|. ..+--....|+.+++ =++.+.-..++|..++.+|++| +..||-.. T Consensus 79 ~~~~nr~~~~v~P~~-----~~~d~---~~Ae~l~~l~~~~~~-----~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~ 145 (706) T protein:vir:10 79 EYRNNRISVKFRPGD-----NAASE---ELANKLNGLFRADYE-----ETDGGEACDNAFDDAATGGFGCFRLTTSFVNE 145 (706) T ss_pred HHHhCCCceEEecCC-----CCchH---HHHHHHHHHHHHHHH-----hcCchHHHHHHHHHHhhcCcceEEeeeccccc Confidence 999988888655531 00010 011111223333333 3467777899999999999998 45566432 Q ss_pred Cchhh-ccc-ceeEeechh-hh-ccceeeccccccceeeeeeeeeeeeccc--cccccccceeeeechhhhhcchhhhhc Q lcl|NC_019406. 170 SDPTA-PAK-SYTVGYAAE-NI-VDWTVEDVDGFYVPTRILLREFERVDEH--ATPSQQNPWIGREGSETAQRTSGGRRA 243 (661) Q Consensus 170 ~~~~~-g~r-PY~~~~~p~-~I-inW~~~~~~g~~~Lt~v~ire~~~~~~~--~~~~~~~~~i~~~~~e~vi~w~~~~~~ 243 (661) .++.. ..+ .+-..+.|. +| +||...+.+... -.|+..+.+...+.- .|..................|.+.. T Consensus 146 ~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sD-ar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d-- 222 (706) T protein:vir:10 146 YDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSD-ALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPD-- 222 (706) T ss_pred cCCCCCCccceeeeeccchhceecCchhcccChhh-cceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCC-- Confidence 21111 111 122223443 34 455433333221 112222222111100 0111000000000001111111110 Q ss_pred chhhhhhhhhh----hheecccccCCCceeeEEEEEEEe-e-cccccc--------eEE--EEEEEecCcccccccceee Q lcl|NC_019406. 244 GLAERQGSARA----DALARPSRFTSSYTFRTIYRELIL-E-LQKDGS--------RVY--KQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 244 g~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~rv~~l-~-~g~~g~--------~~~--~~~~~~~~~~~~~~~~~~p 307 (661) ++....+.... ..+...+...++..+...++.... . ....|. .++ .|+.+. +..+. T Consensus 223 ~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~-------g~~~l- 294 (706) T protein:vir:10 223 VVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVD-------GDGFL- 294 (706) T ss_pred cceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeec-------ccccc- Confidence 00000000000 000000000000000000000000 0 000000 000 111111 00000 Q ss_pred ccCCcc--cceeeEEEEecCCC-----C--CCccccchhHH-HHHHHHHHhhhhhHHHHHHHhcCceeEE-----ecCCC Q lcl|NC_019406. 308 MVRGRT--LPFIPFVFFGSMSN-----A--ADCEKPPLLDI-VELNLKHYRTYAELEHGRFFTALPTYYA-----PELDD 372 (661) Q Consensus 308 ~~~g~~--L~~IPfv~~~~~~~-----~--~~~~~pPLldL-A~LNl~HYq~sSDl~~il~~~~~P~l~i-----~Gl~~ 372 (661) -..++ .+.||||++..... . +..-. .+.|. -.+|. + .|. +-+++-....-.... .|+.. T Consensus 295 -~~~~p~~~~~~P~vP~~g~r~~~d~~~~~~G~vr-~~~d~Q~~~N~--~-~s~-~~~~~~~~~~~~~~~~~~~i~~~~~ 368 (706) T protein:vir:10 295 -EKPRRIPGEHIPLIPVYGKRWFIDDVERVEGHIA-KAMDPQRLYNL--Q-VSM-LADAAAQDPGQTPIVDMEQIRGLEQ 368 (706) T ss_pred -ccCCCCCCCccceEEEeeccccccccCcccceec-cchhhHHHHHH--H-HHH-HHHHHHhcCCcccccchhHHHHHHH Confidence 01122 25566665543221 1 11101 11111 11222 1 111 122222111111111 11111 Q ss_pred CCCce----------eEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHH Q lcl|NC_019406. 373 SDASE----------YHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQS 441 (661) Q Consensus 373 ~~~~~----------l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa 441 (661) .|... ..+|...+-..+. ...+++++++- -..+..+-|+.....|..+ |..--.-+.. ++.|+.+. T Consensus 369 ~~~~~~~~~~~~l~~~~~~~~~g~i~~~-~~~~~~~~~~~-~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~-sn~SG~Ai 445 (706) T protein:vir:10 369 HWEGRNRKRPAFLPLRTVTDKTGNVVAP-ANVAGYTQAPV-LNQALAALLQQTSADIQEVTGSSQAMQQMP-SNVARETV 445 (706) T ss_pred HhhhcccccccchhcccccCCCCccccc-ccccccCCCcc-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCc-cchHHHHH Confidence 22110 1344443333222 23556665532 2233344444444444433 4222111222 24689999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEecc--------------cc----- Q lcl|NC_019406. 442 ALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDA--------------TF----- 492 (661) Q Consensus 442 ~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~--------------DF----- 492 (661) ..+..+..-.|+.+-.|+..+...+ |.++..|++.+- .++..-.+.||. |+ T Consensus 446 ~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~y 525 (706) T protein:vir:10 446 NSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRY 525 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeE Confidence 9999888888999999999888877 788888874321 011111122221 11 Q ss_pred --c------cccCCHHHHHHHHHHHhcCCCCH---HHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCC Q lcl|NC_019406. 493 --L------TTALDARALRAIQQLYEGGLLPI---DALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGY 561 (661) Q Consensus 493 --~------~~~lda~~l~all~~~~aG~Is~---et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~ 561 (661) . ......+.+.+|+++++++.--. ..+...+- -+.+---.+++.++|..+.+..+....+. T Consensus 526 Dv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~---~~~d~p~~~e~~e~irk~~~~q~~~~~~~----- 597 (706) T protein:vir:10 526 DVSVDVGPSYSARRDATVNALTQLLQGMLPQDPMRPALMGIII---DNMEGEGLDDFKAFNRRQLLTQGIVKPRN----- 597 (706) T ss_pred EEEEecccCcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHH---hhcCccchHHHHHHHHHhhcccCCccccc----- Confidence 1 11112345667777777543211 11111111 01222234677777766544333221110 Q ss_pred ccccCCCc---chhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhh----------------- Q lcl|NC_019406. 562 VSRQQELD---QQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQA----------------- 621 (661) Q Consensus 562 ~~~~~~~~---q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------- 621 (661) ..++++. |+....+.|.++.+.+.|.++.+.+..+++. .-.++...+...+.+-.++ T Consensus 598 -~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k~~a--~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~ 674 (706) T protein:vir:10 598 -QQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQKSQN--ETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDK 674 (706) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0001110 1111122223333333333222221111111 1112222222221111110 Q ss_pred hhhhhhhhHHHHhhcccccCCC-CCCCccccc Q lcl|NC_019406. 622 KPSKAEQAQIDAQQKQAAAKPV-TPTPGTVQR 652 (661) Q Consensus 622 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 652 (661) +-.|+-++.-+.+.+|+-+.|+ .++||.|.. T Consensus 675 ~~~q~~q~l~~~~a~q~~~~~~~~~~~~~~~~ 706 (706) T protein:vir:10 675 AVMETLRLLKEVAASQQQTIPSPPSPADIVPS 706 (706) T ss_pred HHHHHHHHHHHHHHhccCCCCCCCCCcccCCC Confidence 0011111111222232222222 244444433 No 89 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.05 E-value=6e-06 Score=49.23 Aligned_cols=588 Identities=10% Similarity=0.012 Sum_probs=170.2 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHH-hhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYL-DRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl-~rA~~~ 79 (661) |---+-|-+-.-.....+.+ .=...||....+..+....++.+ ..++.+...+|-.+..+.+..=..-. ..++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~ 76 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKL-TSWKNELSLQALKADLDAAKPSH---TAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQP 76 (763) T ss_pred CCcCccCcCCCccccchhcC-CCCCChHHHHHHHHHHHhhhcch---hHHHHHHHHHHHhhhccccCcccccCCCccccC Confidence 22211111111111122222 22223444444433333222222 22222211222222122111101111 447788 Q ss_pred chHHHHHHHHhchhhc---cCccc-cccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFR---RPPVI-RNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 80 n~~~~tv~~l~G~vFr---k~p~i-~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) +-++++++.|.+.+.+ -.+.+ +-.|-.= .|++-+ +.... +..++=+-..+| ..++..+|+.+| T Consensus 77 ~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~----~D~~~A--~q~t~----~~n~~~~~~~~~---~~~~~~~~~~~l 143 (763) T protein:vir:95 77 KLVRRQAEWRYSALTEPFLGSNKLFKVTPVTW----EDVQGA--RQNEL----VLNYQFRTKLNR---VSFIDNYVRSVV 143 (763) T ss_pred HHHHHHHHHHHHHHHHhhcCCCcEEEEecCCc----chHHHH--HHHHH----HHHHHHhhcCch---hhHHHHHHHHHh Confidence 8899999999887665 22211 2222110 111110 00000 000100111223 345567777777 Q ss_pred hhCCE--EEEEecc--C-------CC----chh----------------------------------------------- Q lcl|NC_019406. 156 AMGRF--GALVDVA--P-------SS----DPT----------------------------------------------- 173 (661) Q Consensus 156 ~~Gr~--gvLVD~P--~-------a~----~~~----------------------------------------------- 173 (661) ..|.. -+..|.- . .+ ... T Consensus 144 ~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (763) T protein:vir:95 144 DDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQT 223 (763) T ss_pred hcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecc Confidence 77755 2223311 0 00 000 Q ss_pred ----------hcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeee--ech-hhhhc--ch Q lcl|NC_019406. 174 ----------APAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGR--EGS-ETAQR--TS 238 (661) Q Consensus 174 ----------~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~--~~~-e~vi~--w~ 238 (661) ...+|.+-.+.|++++ |. ......-..-.|+.+ +.. ..++. +. T Consensus 224 ~~~~~~~~~~~k~~p~ie~V~p~d~~-iD---------------------p~a~sD~~Da~~~~~~~~~t~~dL~~~~~~ 281 (763) T protein:vir:95 224 GTTTTEVEVPLANHPTVEMLNPENII-ID---------------------PSCQGDINKAMFAIVSFETCKADLLKEKDR 281 (763) T ss_pred cceeEEEEEEecCceEEEeecHHHhe-ec---------------------CCCCCchhhCceEeeEEeccHHHHHhccCC Confidence 0012222222222221 10 000000000011111 111 11111 00 Q ss_pred hhhhcchhhhhhhhhh-hheecccccCCCceeeEEEEEEEeec----ccccceEEEEEEEecCcccccccceeeccCCcc Q lcl|NC_019406. 239 GGRRAGLAERQGSARA-DALARPSRFTSSYTFRTIYRELILEL----QKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRT 313 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~rv~~l~~----g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~ 313 (661) ...+..+......... ..........+.+.....++|+++|- +.+|..++.++...-.+..-......| .+ T Consensus 282 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p----~~ 357 (763) T protein:vir:95 282 YHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNP----YP 357 (763) T ss_pred ccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeeccccc----cc Confidence 0000000000000000 00000000011111112233333321 122333333322111111111111111 12 Q ss_pred cceeeEEEEecCCC-CCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEE-ecCCCCCCceeEecccceeecCCC Q lcl|NC_019406. 314 LPFIPFVFFGSMSN-AADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYA-PELDDSDASEYHIGPGRVWVVDKE 391 (661) Q Consensus 314 L~~IPfv~~~~~~~-~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i-~Gl~~~~~~~l~iGs~~~~~lp~~ 391 (661) .+.+||+++..... .-..+.+.+..+..+.-.+=-..+-.-+++..+..|...+ .|..+.. +.+...++..+.+- . T Consensus 358 ~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~-d~~~~~pg~v~~v~-~ 435 (763) T protein:vir:95 358 DGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDAL-NSRRYREGEDYEYN-P 435 (763) T ss_pred CCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccch-hhhcccCCceEEee-C Confidence 34577765432211 1122333333333333322223344567777778886655 3432211 12233333333221 1 Q ss_pred CCc----ceEeec--CchhHHHHHHHHHHHHHHHHHHhHHhcccc-c-CccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 392 SGI----PGIIEF--KGEGLKTLERALNEKEQQIAAIGGRLMPGM-S-KSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 392 ga~----~~ylE~--~g~~i~a~~~~L~~le~qM~~lGArll~~~-~-~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) |+. +.+..+ ...++......++...+++ .|..-+..+ . ...+.|+++.+.-.......|+.++.++.+++ T Consensus 436 g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~--TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~ 513 (763) T protein:vir:95 436 TQNPAQMIIEHKFPELPQSALTMATLQNQEAESL--TGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGM 513 (763) T ss_pred CCChhhhcccccCCCCcchHHHHHHHHHHHHHHh--hCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 221 222222 1222222222222221111 222211110 1 12345676665555556666777788887765 Q ss_pred ----HHHHHHHHHHcCCCCCCcceEE------EEecc-----cccc------ccCCHHHHHHHHHHHhc--CCCCHHHHH Q lcl|NC_019406. 464 ----TSVVRYWLMFRDIPLTDTATLR------YEIDA-----TFLT------TALDARALRAIQQLYEG--GLLPIDALY 520 (661) Q Consensus 464 ----~~aL~~~A~w~G~~~~~~~~~~------v~ln~-----DF~~------~~lda~~l~all~~~~a--G~Is~et~~ 520 (661) ..+|.++..|++.+. -++ |.+++ +|.. ...+.+.+..|..+.+. ..+...... T Consensus 514 k~l~~~~l~Li~q~~d~~r----viRI~g~e~v~v~~~~~~~~~DV~V~~~~as~~~q~~~~l~~ll~~l~~~~~~~~~~ 589 (763) T protein:vir:95 514 SEIGNKIIAMNAVFLAEHE----VVRITNEEFVTIKREDLKGNFDLEVDISTAEVDNQKSQDLGFMLQTIGPNVDQQITL 589 (763) T ss_pred HHHHHHHHHHHHhhCCCCc----EEEEeCCccccccHHHhcCCcceEEecccchHHHHHHHHHHHHHHHhccccChHHHH Confidence 455666777765321 111 11211 2221 11111222223332221 112211110 Q ss_pred HHHHh-cCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcC-----ChhhHHHHHHHhc---- Q lcl|NC_019406. 521 ENFVK-NGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDA-----DFQQQELEQAERH---- 590 (661) Q Consensus 521 ~eL~r-~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~-----d~~q~~~~~~e~~---- 590 (661) .-|.+ .++ ....+....++...+ .++++.|.++..+. ..++...+.|+-+ T Consensus 590 ~il~~~~d~----~~~~~~~~~lr~~q~----------------~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~ 649 (763) T protein:vir:95 590 NILAEIADL----KRMPKLAHDLRTWQP----------------QPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQ 649 (763) T ss_pred HHHHHHHhh----hchhhhHHHHHhcCC----------------CccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000 000 011111122222111 11112222221111 1111111111110 Q ss_pred ---cCCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHHHHh----------hccccc---CC-----CCCCCcc Q lcl|NC_019406. 591 ---LEIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQ----------QKQAAA---KP-----VTPTPGT 649 (661) Q Consensus 591 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~---~~-----~~~~~~~ 649 (661) +..+..++++.++-.++++++++. ... -+..+|.+......| .+|+++ .| ++.-+-. T Consensus 650 ~~~aq~e~~~~d~~~~e~~~Q~~~e~~-~~~-~~~eaq~~l~~~~a~~~~~~ea~~~~~~~~~~~~~~~~~~~~~~~~~~ 727 (763) T protein:vir:95 650 KAMAERDNKNLDYLEQESGTKHARDLE-KMK-AQSQGNQQLEITKALTKPRKEGELPPNLSAAIGYNALTNGEDTGIQSV 727 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHhccChhHHHhhhhcccccccCCCccch Confidence 000111112222222333332222 100 001111111111111 111111 12 1222223 Q ss_pred cccCCCCccC-CC Q lcl|NC_019406. 650 VQRGRPPQNG-AS 661 (661) Q Consensus 650 ~~~~~~~~~~-~~ 661 (661) -++.++|+|. +| T Consensus 728 ~~~~~~~~~~~~~ 740 (763) T protein:vir:95 728 SERDIAAEANPAY 740 (763) T ss_pred hhcccCccccccc Confidence 3455555521 12 No 90 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=97.94 E-value=9.8e-06 Score=48.04 Aligned_cols=569 Identities=11% Similarity=0.000 Sum_probs=204.4 Q ss_pred ccccCHHH-HHHHHHHHHHHHHhcchHHHHhCC---cccC----CCCCCCChHHHHHHHhh----hcccchHHHHHHHHh Q lcl|NC_019406. 23 HLVVHPEY-EYYRPDWAKIRDAIAGEREIKAQG---VKYL----KAPKGFDDEDYANYLDR----AAFYNMTSQTQAGMV 90 (661) Q Consensus 23 V~~~hPey-~a~~~~W~~irD~~~G~~~vr~~g---~~YL----Pk~~~E~~~~Y~~rl~r----A~~~n~~~~tv~~l~ 90 (661) -..+|-+. ..++..+ +.+......+|+.. ..|- =+|+.+.....+.+.+- ..-+|.++.+|+..+ T Consensus 1 m~~~~~~~~~~~~~~~---~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~ 77 (708) T protein:vir:10 1 MAETLEKKHERIMLRF---DRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRII 77 (708) T ss_pred CchhHHHHHHHHHHHH---HHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHH Confidence 12222221 2222222 22222222222211 1121 16777766666655542 567899999999999 Q ss_pred chhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEE--eccC Q lcl|NC_019406. 91 GQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALV--DVAP 168 (661) Q Consensus 91 G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLV--D~P~ 168 (661) |.-=+..+.+.-+|..- +.|. ..+---...|+.+++ -++.+.-+.++|..++.+|++|+=| ||-. T Consensus 78 g~~~~nr~d~~v~P~~~-----~~d~---~~Ae~l~~~~~~~~~-----~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~ 144 (708) T protein:vir:10 78 AEYRNNRITVKFRPGDR-----EASE---ELANKLNGLFRADYE-----ETDGGEACDNAFDDAATGGFGCFRLTSMLVN 144 (708) T ss_pred HHHHhCCcceEEEcCCC-----CchH---HHHHHHHHHHHHHHH-----hcCchHHHHHHHHhhhhcccceeeeeecccc Confidence 99988888876554421 1111 011111223333333 3456778899999999999999744 5422 Q ss_pred CCch---hhcccceeEeech-hhh-ccceeeccccccceeeeeeeeeeeeccc--cccccccceeeeechhhhhcchhhh Q lcl|NC_019406. 169 SSDP---TAPAKSYTVGYAA-ENI-VDWTVEDVDGFYVPTRILLREFERVDEH--ATPSQQNPWIGREGSETAQRTSGGR 241 (661) Q Consensus 169 a~~~---~~g~rPY~~~~~p-~~I-inW~~~~~~g~~~Lt~v~ire~~~~~~~--~~~~~~~~~i~~~~~e~vi~w~~~~ 241 (661) -.+. ..+. ++-..+.| .+| +||...+.+... -.|+.++.+...+.- .|..... ..+......+|...- T Consensus 145 e~d~~~~~~~i-~i~~~~~p~~~v~~Dp~a~~~D~sD-ar~~~~~~~~~~d~~~~~~p~~a~---~~~d~~~~~~~~~~~ 219 (708) T protein:vir:10 145 EYDPMDDRQRI-AIEPIYDPSRSVWFDPDAKKYDKSD-ALWAFCMYSLSPEKYEAEYGKKPP---TSLDVTSMTSWEYNW 219 (708) T ss_pred ccCCCCCcccc-ceEEeecchhhcccCccccccChhh-hhhhhhccCCCHHHHHHhCCCCcc---cccccccCCCccccc Confidence 1110 1122 33344444 445 555543333221 123333322221111 1110000 001111111221110 Q ss_pred hcchhhhhhhhhhhheecccccCCCceeeEEEE--EEEeecccccceEEEE----------------------------E Q lcl|NC_019406. 242 RAGLAERQGSARADALARPSRFTSSYTFRTIYR--ELILELQKDGSRVYKQ----------------------------F 291 (661) Q Consensus 242 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r--v~~l~~g~~g~~~~~~----------------------------~ 291 (661) .+ ...+++.++ -.+.+.. +..+.....|. +..+ + T Consensus 220 ~~----~d~v~v~ey-----------~~r~~~~~~~~~~~~~~tg~-~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~ 283 (708) T protein:vir:10 220 FG----ADVIYIAKY-----------YEVRKESVDVISYRHPITGE-IATYDSDQVEDIEDELAIAGFHEVARRSVKRRR 283 (708) T ss_pred cC----CCceEEEEe-----------eeEEEEEEEEEEEecCCCCc-eeeecchhhhhHHHHHHhcccchhheeeeeeEE Confidence 00 000111111 1111111 11111111111 1111 0 Q ss_pred EEecCcccccccceeeccCC-cccceeeEEEEecCCC-----C--CCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCc Q lcl|NC_019406. 292 VYVEDPLGQARDVYTPMVRG-RTLPFIPFVFFGSMSN-----A--ADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALP 363 (661) Q Consensus 292 ~~~~~~~~~~~~~~~p~~~g-~~L~~IPfv~~~~~~~-----~--~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P 363 (661) ++.-. ..+..+. ...+ -|.+.+|+|++..... . +..-. .+.|.=. .+..|++. +.+++-.+.-. T Consensus 284 v~~~~---~~g~~~l-e~~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr-~~kd~Q~-~~N~~~S~--~~~~~a~~~~~ 355 (708) T protein:vir:10 284 VYVSV---VDGDGFL-EKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIA-KAMDPQR-LYNLQVSM--LADTAAQDPGQ 355 (708) T ss_pred EEEEe---ecchhhh-ccCCCCCCCceeeEEEeeeeeccCCCcccceeec-ccchhHH-HHHHHHHH--HHHHHHhcCCc Confidence 00000 0010000 0001 2334566665532211 1 11110 1111111 12223322 23334333333 Q ss_pred eeEE-----ecCCCCCCce------e----EecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhc Q lcl|NC_019406. 364 TYYA-----PELDDSDASE------Y----HIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLM 427 (661) Q Consensus 364 ~l~i-----~Gl~~~~~~~------l----~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll 427 (661) ..++ .|+...|... + .++...+...+. +..++++++.-- ...+.+-|+.....|..+ |..-- T Consensus 356 ~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~-~~~~~~~q~~~~-~~~~~~l~q~~~~~i~~vsG~~~~ 433 (708) T protein:vir:10 356 IPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAG-ATPAGYTQPAVM-NQALAALLQQTSADIQEVTGGSQA 433 (708) T ss_pred ccccChhhhhhHHHHHhhccccchhhhccccccccccccccc-cCCccccCCccc-hHHHHHHHHHHHHHHHHHhCcChh Confidence 3332 2332222110 0 022222222221 124455554221 222333334444444333 32221 Q ss_pred ccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEecc------- Q lcl|NC_019406. 428 PGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDA------- 490 (661) Q Consensus 428 ~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~------- 490 (661) .-+. ..+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++..- .++..-.+.||. T Consensus 434 ~lG~-~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~ 512 (708) T protein:vir:10 434 MQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQT 512 (708) T ss_pred HccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCC Confidence 1122 2345898999998888888999888888777655 667777774211 000000111221 Q ss_pred --------------cc------ccccCCHHHHHHHHHHHhcCCCCHH---HHHHHHHhcCCCCccCCHHHHHHHHhccCC Q lcl|NC_019406. 491 --------------TF------LTTALDARALRAIQQLYEGGLLPID---ALYENFVKNGIIPSTQTLEEFTIKMNDPKS 547 (661) Q Consensus 491 --------------DF------~~~~lda~~l~all~~~~aG~Is~e---t~~~eL~r~gvl~~~~~~Eee~~~l~~~~~ 547 (661) |+ .......+.+.+|++++........ .+...+- -+.+---.+++.++|..+.+ T Consensus 513 g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l---~~~D~p~~~ei~erir~~~~ 589 (708) T protein:vir:10 513 GAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIIL---DNIDGEGLDDFKEYNRNQLL 589 (708) T ss_pred cceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHH---HhcCCcChHHHHHHHHHhhc Confidence 11 1112234567778888776543211 1111111 12233335777888876654 Q ss_pred CCCC--chhhhhhcCCccccCCCcchh--hhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhh--h Q lcl|NC_019406. 548 FIGQ--PDAIAMRRGYVSRQQELDQQR--AARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQ--A 621 (661) Q Consensus 548 ~l~~--ddae~~~~g~~~~~~~~~q~~--~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 621 (661) ..+. +.++.. ++...+.+ +....|+++.+.+++..+++.+..+++-.+ .++.-++...+-+-++ + T Consensus 590 ~~~~~~~~~~ee-------~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~a~a--~~~~~~a~q~~~~~~~a~~ 660 (708) T protein:vir:10 590 ISGIAKPRNEKE-------QQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNET--AQTQIKAFTAQQDAMESQA 660 (708) T ss_pred ccccccccchhh-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH Confidence 4332 111110 00001111 111112222222222222211111111111 1111111111000010 0 Q ss_pred ----hhhhhhhhH--------------HHHhhcccccCCCCCCCcccccCCCCc Q lcl|NC_019406. 622 ----KPSKAEQAQ--------------IDAQQKQAAAKPVTPTPGTVQRGRPPQ 657 (661) Q Consensus 622 ----~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 657 (661) .+++++..+ -..|+.|+++.|..|- --||. T Consensus 661 ~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~~~~~~~p~~~~------~~~p~ 708 (708) T protein:vir:10 661 NTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPA------DLMPS 708 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCch------hccCC Confidence 111111111 1112334444433331 00111 No 91 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=97.91 E-value=1.1e-05 Score=47.71 Aligned_cols=584 Identities=9% Similarity=-0.030 Sum_probs=199.8 Q ss_pred CCCC---CCcc-ccccccccccccCC--ccccCHHHH----HHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHH Q lcl|NC_019406. 1 MAGL---SPNS-ANIRRTKRGAQQFT--HLVVHPEYE----YYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYA 70 (661) Q Consensus 1 ~~~~---~~~~-~~~~~~~~~~~~~~--V~~~hPey~----a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~ 70 (661) ||-- ||-- +=|. -++.+.-++ -...|-.+. +...+|...|.-..=...+. .| =+|+.+....=+ T Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy-~G----~Qw~~~~~~~l~ 74 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAK-KAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFL-GG----EQWPSQVRTERE 74 (711) T ss_pred CCcccccccccchhHH-HHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHh-CC----CCCCHHHHHHHH Confidence 5432 2211 1111 111111111 111222211 11222222221111111111 12 166665555555 Q ss_pred HHHhhhcccchHHHHHHHHhchhhccCccccccchhhH---------------hhhhcccccccccchhhhh-hhHhhhh Q lcl|NC_019406. 71 NYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGA---------------ITGRDAEGGVQVVAPASIG-KLLTQLQ 134 (661) Q Consensus 71 ~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~---------------~l~~d~dG~~~~~~~~~~~-~~~~~~~ 134 (661) .+-.-.+.+|.++++|+..+|.-=+..|.+.-.|-... .+-.+.+. ..+=+.+. .|..++ T Consensus 75 ~~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d---~~~Ae~l~~~~~~~~- 150 (711) T protein:vir:10 75 LEQRPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKND---YELAEVFTGLIKNIE- 150 (711) T ss_pred hcCCCcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhH---HHHHHHHHHHHHHHH- Confidence 55566889999999999999999998888865553100 00000000 01111111 122222 Q ss_pred hccCCCCCHHHHHHHHHHHHHhhCCEEE--EEeccCCCchhhcccceeEee-chhhhc-cceeeccccccceeeeeeeee Q lcl|NC_019406. 135 RFAKDGTSHQGFAKTVALEQVAMGRFGA--LVDVAPSSDPTAPAKSYTVGY-AAENIV-DWTVEDVDGFYVPTRILLREF 210 (661) Q Consensus 135 ~~dl~G~sL~~fa~~~~~~~L~~Gr~gv--LVD~P~a~~~~~g~rPY~~~~-~p~~Ii-nW~~~~~~g~~~Lt~v~ire~ 210 (661) +-++.+.-+..+|..++..|++|+ .+||-..+.. . ..+-+..| +|.+|+ ||.....+.. .-.|+..+.+ T Consensus 151 ----~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~-~-~e~~i~~v~~p~~v~~Dp~a~~~D~s-Dar~~~~~~~ 223 (711) T protein:vir:10 151 ----YNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSF-E-QDLIIEAIQNQFSVTIDPDAKKRDRS-DMNWCLIDDT 223 (711) T ss_pred ----HhcChhHHHHHHHHHhhhcCcceEEEEecccCCCCC-C-CCeEEeeecChhheeeCccccccChh-hhcceeeeec Confidence 344677778899999999998874 4576322111 1 12223234 466643 3322222211 1123333322 Q ss_pred eeeccc--cccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeeccc------ Q lcl|NC_019406. 211 ERVDEH--ATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQK------ 282 (661) Q Consensus 211 ~~~~~~--~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~------ 282 (661) ...++- .|......-+.......-..|... ..+++.+++.. ....+++..+..+. T Consensus 224 ~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~--------~~vrv~E~~~r---------~~~~~~~~~~~~~~~~~~~~ 286 (711) T protein:vir:10 224 MSKEKFKALYPDATAEPVYEDSVADYDTWFTE--------KSVRVSEYFTR---------EPVIREIALLSDGRSFWLDA 286 (711) T ss_pred CCHHHHHHhCCchhhhhhhcccccccCcccCc--------ceeeEEEEEee---------eeeeeEEEeecCCceeccCc Confidence 221111 000000000000000000001110 00011111100 00112222221110 Q ss_pred ----------ccc--------eEEE--EEEEecCcccccccceeeccCCccc--ceeeEEEEecCC-----CCCCccccc Q lcl|NC_019406. 283 ----------DGS--------RVYK--QFVYVEDPLGQARDVYTPMVRGRTL--PFIPFVFFGSMS-----NAADCEKPP 335 (661) Q Consensus 283 ----------~g~--------~~~~--~~~~~~~~~~~~~~~~~p~~~g~~L--~~IPfv~~~~~~-----~~~~~~~pP 335 (661) .|. .+++ |+.+. +..+. ....++ ++||||+|.... .+...+. T Consensus 287 ~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~-------G~~~L--~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~-- 355 (711) T protein:vir:10 287 LEDIVDELLEAGISIVRTRKVKTFKTYWRKIT-------GANVL--EGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSI-- 355 (711) T ss_pred chhHHHHHHhcCchhhhhhhhceeeEEEEEEe-------cceee--cCCCCCCCCcccEEEEeeeeeccccccccchh-- Confidence 000 0000 11111 11111 122333 557777654321 1111111 Q ss_pred hhHHHHHH-HHHHhhhhhHHHHHHHhcCceeEE-ecCCCCCCcee--E-ecccceeec-CCC--CCcceEeecCchhHHH Q lcl|NC_019406. 336 LLDIVELN-LKHYRTYAELEHGRFFTALPTYYA-PELDDSDASEY--H-IGPGRVWVV-DKE--SGIPGIIEFKGEGLKT 407 (661) Q Consensus 336 LldLA~LN-l~HYq~sSDl~~il~~~~~P~l~i-~Gl~~~~~~~l--~-iGs~~~~~l-p~~--ga~~~ylE~~g~~i~a 407 (661) ..++-+.. +.-+.. |-+-+++..++-+.+++ .|.-+...+.+ . .-++..+.+ |.. ++.+.+..+..-+ .. T Consensus 356 vr~~~d~Qr~~N~~~-s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~-~~ 433 (711) T protein:vir:10 356 IRHSKDAQRMANYWD-SAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVP-AA 433 (711) T ss_pred hhhhhhhHHHHHHHH-HHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCC-HH Confidence 11222211 111222 22455555565555554 44422111101 1 112233322 221 1245555433322 33 Q ss_pred HHHHHHHHHHHHH-HHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH----HHHHHHHHcCCCC---- Q lcl|NC_019406. 408 LERALNEKEQQIA-AIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTS----VVRYWLMFRDIPL---- 478 (661) Q Consensus 408 ~~~~L~~le~qM~-~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~----aL~~~A~w~G~~~---- 478 (661) ...-|+.....|. ..|..-..-+..+.+.|+.+......+..-.|..+..|+..+... +|.++..|+.... T Consensus 434 ~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI 513 (711) T protein:vir:10 434 ELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRL 513 (711) T ss_pred HHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEE Confidence 4444444444443 334322212233446788888888888888888777777766554 4667777773211 Q ss_pred ---CCcceEEEEeccc-------------------c--------ccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcC- Q lcl|NC_019406. 479 ---TDTATLRYEIDAT-------------------F--------LTTALDARALRAIQQLYEGGLLPIDALYENFVKNG- 527 (661) Q Consensus 479 ---~~~~~~~v~ln~D-------------------F--------~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~g- 527 (661) .++.+ .|.||.. | .......+.+.+|++++. .++. +...+...- T Consensus 514 ~ged~~~~-~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~--~~p~--~~~~~~~~il 588 (711) T protein:vir:10 514 KFPDETED-FVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQ--AVPS--AAAVMADLIA 588 (711) T ss_pred ecCCCCcc-eEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHh--hcch--hhhHHHHHHH Confidence 01111 1223321 1 111111223444555443 2221 100010000 Q ss_pred CCCccCCHHHHHHHHhccCCCCCC--chhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhh--h Q lcl|NC_019406. 528 IIPSTQTLEEFTIKMNDPKSFIGQ--PDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISA--K 603 (661) Q Consensus 528 vl~~~~~~Eee~~~l~~~~~~l~~--ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~--~ 603 (661) -+.+-...+++.++|....+.-+. +..+..+ +...++++...+.-+++++.+....++.++..+++..+ + T Consensus 589 ~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~q------q~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa 662 (711) T protein:vir:10 589 QNMDWPGADVIAERLKKIVPPNVLSKDEREAIE------EDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKA 662 (711) T ss_pred HhcCCCCHHHHHHHHHhhcCcccCcchhhhHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 012223346666666544332111 1110000 11111122222211222222221122222222222111 1 Q ss_pred hhhhhhhHHHhcCC----------hhhhhhhhhhhhHHHHhhcccccCCC Q lcl|NC_019406. 604 VGSTSVAASRKLGD----------PEQAKPSKAEQAQIDAQQKQAAAKPV 643 (661) Q Consensus 604 ~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~ 643 (661) -+++.+.-....+. .+|.+ .+.++++.+.|..|+-..-- T Consensus 663 ~~e~~~~q~q~~~~~~~aq~~~~~~qq~~-~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 663 QLETEEAQKQLAMIEDMAQGGDVVYQQVR-ELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcC Confidence 11111100000000 11111 11112222222222111111 No 92 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=97.74 E-value=2.3e-05 Score=45.97 Aligned_cols=577 Identities=13% Similarity=0.076 Sum_probs=204.4 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHHHHhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYANYLDR 75 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~rl~r 75 (661) |.-..|-.++.- ...+....|. .. |..+...+.+...+|+. ...|. =+|+.+....-+.+-.- T Consensus 1 ~~~~~~~~~~~~------~~~~~~~~~~---~~---l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p 68 (714) T protein:vir:10 1 MKNEINTTAMKN------DHGSTPRFSQ---RQ---LLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQP 68 (714) T ss_pred CCcCcCcccCCC------cchhhhhhhH---HH---HHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCC Confidence 443333222221 0001111111 11 11112223333333310 01111 15665555555555556 Q ss_pred hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 76 AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 76 A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) .+.+|.++++|+..+|.-=+..|.+.-.|..-. +. + ...+-.-...|..+++ -++.+.-+..+|..++ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~----~~-~--~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af~~~~ 136 (714) T protein:vir:10 69 MTIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPN----DE-T--EKLAEAINAEFADACR-----LGNMNKARSDAYAEQI 136 (714) T ss_pred cEEeccHHHHHHHHHHHHHhCCcceEEecCCCC----hh-h--HHHHHHHHHHHHHHHH-----hhchhHHHHHHHHHhh Confidence 788999999999999999998888865552110 00 0 0011111223333332 3467778889999999 Q ss_pred hhCCEEE--EEeccCCCchhhcccceeEeechhhhc-cceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 156 AMGRFGA--LVDVAPSSDPTAPAKSYTVGYAAENIV-DWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 156 ~~Gr~gv--LVD~P~a~~~~~g~rPY~~~~~p~~Ii-nW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .+|..|+ .+||-. .+..+++..+.|.+|+ ||.....+. ..-.|+.++.+...++- ....|.-..+-.. T Consensus 137 ~~G~G~~~~~~d~d~-----~~~~i~i~~v~p~~v~~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fp~~a~~i~~ 207 (714) T protein:vir:10 137 KAGLSWVEVRRNSEP-----FGPEFKVSTVSRNEVFWDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQVIDY 207 (714) T ss_pred hcccceEEeeeccCC-----CCCCeEEEecChhheeeccccccCCh-hhhhhhhhhccCCHHHH---HHhcCCchhhhhc Confidence 9998877 567632 2345677777777753 222111110 01122222222211110 0000100000000 Q ss_pred hhhcchhhhhcchhh-hhhhhhhhhe---ecccccCCC-------------ceeeEEEEEEEeecccccceE-------- Q lcl|NC_019406. 233 TAQRTSGGRRAGLAE-RQGSARADAL---ARPSRFTSS-------------YTFRTIYRELILELQKDGSRV-------- 287 (661) Q Consensus 233 ~vi~w~~~~~~g~~~-~~~~~~~~~~---~~~~~~~~~-------------~~~~~~~rv~~l~~g~~g~~~-------- 287 (661) ....|..- .++... .....+.... ...+...+. +-++...+..++.+ .+|..+ T Consensus 208 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~~~ 285 (714) T protein:vir:10 208 AIDDWRGF-VDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNNLM 285 (714) T ss_pred cchhhcCc-ccchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecC-CCCCeeeeCccCHH Confidence 00111100 000000 0000000000 000000000 00011111112211 112211 Q ss_pred -------------------EEEEEEecCcccccccceeeccCCcc--cceeeEEEEecC---CCCCCcc-ccchhHHHHH Q lcl|NC_019406. 288 -------------------YKQFVYVEDPLGQARDVYTPMVRGRT--LPFIPFVFFGSM---SNAADCE-KPPLLDIVEL 342 (661) Q Consensus 288 -------------------~~~~~~~~~~~~~~~~~~~p~~~g~~--L~~IPfv~~~~~---~~~~~~~-~pPLldLA~L 342 (661) .++..+... ... ..+-.| -+.+|||++... ..+...+ .-.+.|.-. T Consensus 286 ~~~~~~~g~~~~~~~~~~rv~~~~~~g~-------~~L-~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr- 356 (714) T protein:vir:10 286 QAVAVASGRVQVKVGRVSRIREAWFVGP-------HFI-VDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQD- 356 (714) T ss_pred HHHHHHhccceecccceeeEEEEEEecc-------hhh-hcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHH- Confidence 001111100 000 001112 223555544322 2211111 112333321 Q ss_pred HHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-cccee-ecCC--C----CCcceEeecCchhHHHHHHHH Q lcl|NC_019406. 343 NLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVW-VVDK--E----SGIPGIIEFKGEGLKTLERAL 412 (661) Q Consensus 343 Nl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~-~lp~--~----ga~~~ylE~~g~~i~a~~~~L 412 (661) .+.++++.. .++|.-.+ .++..|..+.+...+ . +. ++..+ +-|. . +..+. +++...-...+.+-| T Consensus 357 ~~N~~~s~~--~~~l~~~~--~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ll 431 (714) T protein:vir:10 357 EVNFRRIKL--TWLLQAKR--VIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFR-VEQDFQVASQQFQVM 431 (714) T ss_pred HHHHHHHHH--HHHHhCCc--eeeccccccccHHHHHHhccCCCCeEEecccccccCCcccccc-ccCCCCCcHHHHHHH Confidence 233455543 44553222 233344433321111 0 10 11122 2121 0 11233 223233334445555 Q ss_pred HHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC--------- Q lcl|NC_019406. 413 NEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL--------- 478 (661) Q Consensus 413 ~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~--------- 478 (661) +.....|..+ |..-..-+..+.+.|+.+...+..+..-.|+.+..|+..+...+ |.++..|++... T Consensus 432 q~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~ 511 (714) T protein:vir:10 432 QESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRD 511 (714) T ss_pred HHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCC Confidence 5555555543 32211112234456888888777777778888888887777554 677777774321 Q ss_pred CCcceEEEEeccc--------------ccc--------ccCCHHHHHHHHHHHhcC-----CCCHHHHHHHHHhcCCCCc Q lcl|NC_019406. 479 TDTATLRYEIDAT--------------FLT--------TALDARALRAIQQLYEGG-----LLPIDALYENFVKNGIIPS 531 (661) Q Consensus 479 ~~~~~~~v~ln~D--------------F~~--------~~lda~~l~all~~~~aG-----~Is~et~~~eL~r~gvl~~ 531 (661) .....-.+.+|.+ |.. .....+.+.+|++++... .+....++ . . .+ T Consensus 512 ~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~l---e---~-~d 584 (714) T protein:vir:10 512 DRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWV---N---L-LD 584 (714) T ss_pred CcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHH---H---h-cC Confidence 1111112333321 111 111223456667776542 11111111 1 1 11 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhh--HHHHHHHhccCCCchhHHHhhhhhhh-- Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQ--QELEQAERHLEIDEEKLRISAKVGST-- 607 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q--~~~~~~e~~~~~~~~~~~~~~~~~~~-- 607 (661) .-..+++.++|....+.-+.++ . ..++ +|.+.....-+++ .+.+.+|.++.++..++++...-++. T Consensus 585 ~p~~~ei~~~ir~~~~~~~~~~--~------~~~e--~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~ 654 (714) T protein:vir:10 585 VPQKQEFVERIRAALGTPKSPD--E------MTPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQR 654 (714) T ss_pred CcCHHHHHHHHHHHcCCCCCcc--c------cCcc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1234677778866543211100 0 0111 1111111111122 22222233333333333333221211 Q ss_pred -hhhHHHhcCChh----hhhhhhhhhhHH-------------------HHhhcccccCCC Q lcl|NC_019406. 608 -SVAASRKLGDPE----QAKPSKAEQAQI-------------------DAQQKQAAAKPV 643 (661) Q Consensus 608 -~~~~~~~~~~~~----~~~~~~~~~~~~-------------------~~~~~~~~~~~~ 643 (661) ...|+..++.-+ .....|++.+.+ ..++.|-+|=|- T Consensus 655 ~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~~~~~q~~~q~~~~~~~~~~~ 714 (714) T protein:vir:10 655 DNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDVLQQQMLYTLQQRMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHhcCC Confidence 112222111111 111111111111 111222222222 No 93 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=97.66 E-value=3.2e-05 Score=45.24 Aligned_cols=535 Identities=13% Similarity=0.069 Sum_probs=202.4 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) |++-. .|+. . +..+|-.=...+..|+...+...= ...|++.- .|+-.+..+....-+...+..+|. T Consensus 1 ~~~~~---~~~~-------~--~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~-~y~~a~~~~~~~~~~~~~r~~~~~ 67 (584) T protein:vir:95 1 MSVKV---AELN-------S--LLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELR-NYVFATDTTTTSNQGLPWKNSTTL 67 (584) T ss_pred CCcch---hhhh-------h--hccccchHHHHHHHHHHHHhhhchhhccCHHHH-HHHHhhhhhhhhhcccccccccch Confidence 33321 1221 1 112344333445555555544321 12222211 111111111111112222346688 Q ss_pred chHHHHHHHHhchhhccCccccccchhhHhhhhcc---cccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHh Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDA---EGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVA 156 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~---dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~ 156 (661) |.+..+++.++-.+|.- . .|+ +.|.+=+ .|-...+.-+. +..+.+|= +.-.++..-++.++..++. T Consensus 68 ~k~~~~~~~i~~~l~~~--~---Fp~--~~w~~~v~~~~~~~~~~~~~a---i~~~i~dk-l~e~~~~~~~~~~i~d~~~ 136 (584) T protein:vir:95 68 PKLCQIRDNLHSNYFSS--L---FPN--DDWLRWVGYGKGDSTKTKAKA---IQAYMSNK-CRESHFRTEVSKLIYDYID 136 (584) T ss_pred hHHHHHHHHHHHHHHHh--h---cCc--cceeeeecCCCchhhHHHHHH---HHHHHhhh-hhhccHHHHHHHHHHhhcc Confidence 88888777766655542 1 111 1111100 01111111122 22222111 1222788889999999999 Q ss_pred hCCEEEEEeccCC----C---chhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccc---cccccccee Q lcl|NC_019406. 157 MGRFGALVDVAPS----S---DPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHA---TPSQQNPWI 226 (661) Q Consensus 157 ~Gr~gvLVD~P~a----~---~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~---~~~~~~~~i 226 (661) +|-|.+=|.+-.. . ....-.+|++..++|.+|. |.-.- +....-..++ |... +.++- ..++..|| T Consensus 137 ~G~~~~k~~~~~~~~e~~e~~~v~~~~~prieriSP~d~~-~Dpsa-~~i~d~~fiv-rs~~-T~~~L~~l~~~~~~~~- 211 (584) T protein:vir:95 137 YGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVRISPLDIV-FNPLA-TSISDTFKIV-RSVK-TKGELMRLAQDEPEQS- 211 (584) T ss_pred CCceEEEEeEeecceeeeccccccccccceEEeeChhhee-ecCCC-CCccchhhhh-hhhh-hHHHHHHHHhhcCccc- Confidence 9988887775431 1 1123457999999998876 54211 0000000111 1000 00000 00001111 Q ss_pred eeechhhhhcchhhhhcchhhhhhhhhhhhee--cccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc--- Q lcl|NC_019406. 227 GREGSETAQRTSGGRRAGLAERQGSARADALA--RPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA--- 301 (661) Q Consensus 227 ~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~--- 301 (661) |..+ .+++..........-....+ ...+ ..++....+++....+|.+|+-. | -++-..+++...+ T Consensus 212 --y~~d-~v~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~ey~~~~~V~vl~~~--g----~~~~~~~~e~~~~~iv 281 (584) T protein:vir:95 212 --YWLE-ALKRREEICRHLGGYSVEDF-DKAAGFDVDGFGNLYEYYMSDWVEILEFY--G----DYHDKETGELQTNRII 281 (584) T ss_pred --cchH-HHHHHHHhccCCCCCccccc-ccccccccccccccccccCCceeEEEeec--c----cccccccCCCcccceE Confidence 1111 11111111000000000000 0000 00111111222222234444310 0 0000011111111 Q ss_pred ----cccee-eccCCcccceeeEEEEec-CCCCCCccccchhHHHHHH---HHHHhhhhhHHHHHHHhcCceeEEecCCC Q lcl|NC_019406. 302 ----RDVYT-PMVRGRTLPFIPFVFFGS-MSNAADCEKPPLLDIVELN---LKHYRTYAELEHGRFFTALPTYYAPELDD 372 (661) Q Consensus 302 ----~~~~~-p~~~g~~L~~IPfv~~~~-~~~~~~~~~pPLldLA~LN---l~HYq~sSDl~~il~~~~~P~l~i~Gl~~ 372 (661) +...+ -..+-.+.+.+||+.+.- ...+-..+.+++.-|.++. =..+|..-|. +.....|++-..+-. T Consensus 282 ~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDn---l~l~~~pv~k~~~~~- 357 (584) T protein:vir:95 282 TVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADA---VDLIIQPPLKIIGEV- 357 (584) T ss_pred EEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHH---HHHhcCcceeecccc- Confidence 00111 012224567889987642 2333335566654333322 2234443333 344555765554322 Q ss_pred CCCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcccc-cCccchhHHHHHHHHHHhhH Q lcl|NC_019406. 373 SDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMPGM-SKSVSESDNQSALREANEQS 450 (661) Q Consensus 373 ~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~~~-~~~~~eTataa~~d~~~~~S 450 (661) ..+.-|++..+..... +...+++++...+......|+-++..|-.+ |+.....+ +..+++||+..+.=..+.+. T Consensus 358 ---~~~~~~pg~~~~~~~~-~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~ 433 (584) T protein:vir:95 358 ---EEFVWGPGAEIHLDQG-GDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGR 433 (584) T ss_pred ---chhcccCCceeecCCC-CCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHH Confidence 2256788888877654 457888888777666666677777777743 54433221 13457888888887778889 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHc--CCCCCCcceEEEEeccccccc---cCCHHHHHHHHHHHhcCC---CCHHH--- Q lcl|NC_019406. 451 LLLNVIMALEDGM-TSVVRYWLMFR--DIPLTDTATLRYEIDATFLTT---ALDARALRAIQQLYEGGL---LPIDA--- 518 (661) Q Consensus 451 ~L~~~A~~le~Al-~~aL~~~A~w~--G~~~~~~~~~~v~ln~DF~~~---~lda~~l~all~~~~aG~---Is~et--- 518 (661) .+..++...++.+ ++++.++=+|- .+. ..+.+++. |+++... .+++++|+.=.++.-.|. +.++. T Consensus 434 ~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd--~~~~vr~~-n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q 510 (584) T protein:vir:95 434 IFQEKVTTFEVELLEPVLNAMLETATRNMD--GSDVIRVM-DTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQ 510 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--ccCceeee-ccccccccccccChhhhccCeeEEeehhhHHHHHHHHHH Confidence 9999999988887 66444443331 111 12233322 2221000 001122211111111111 01111 Q ss_pred -HHHHHH--hcCCCCccCCHHHHHHHHhcc--CCC--CCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhcc Q lcl|NC_019406. 519 -LYENFV--KNGIIPSTQTLEEFTIKMNDP--KSF--IGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHL 591 (661) Q Consensus 519 -~~~eL~--r~gvl~~~~~~Eee~~~l~~~--~~~--l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~ 591 (661) +..-|+ -+-.|.+.+.--+...-+++. .|+ +-.+++.. +++++-++..+ ..|+.+.+|+.. T Consensus 511 ~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~--------~~Q~~~q~~~~---~~q~~~~~~~~~- 578 (584) T protein:vir:95 511 NLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAV--------AEQAETQSLVA---QAQEDLQLQAQM- 578 (584) T ss_pred HHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcccccCCCccc--------chhHHHHhhhH---HHHHHHHHHHhh- Confidence 111111 111233333222222222221 111 11111100 11111111111 112222332211 Q ss_pred CCCchhHHHhhhhhh Q lcl|NC_019406. 592 EIDEEKLRISAKVGS 606 (661) Q Consensus 592 ~~~~~~~~~~~~~~~ 606 (661) ++++|= T Consensus 579 ---------~~~~~~ 584 (584) T protein:vir:95 579 ---------PAEGAI 584 (584) T ss_pred ---------hhccCC Confidence 111111 No 94 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=97.53 E-value=4.9e-05 Score=44.20 Aligned_cols=571 Identities=10% Similarity=-0.018 Sum_probs=203.5 Q ss_pred ccccCHHHHHHHHHHHHHHHHhcchHHHHhC---Cc--ccC--CCCCCCChHHHHHHHhh----hcccchHHHHHHHHhc Q lcl|NC_019406. 23 HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GV--KYL--KAPKGFDDEDYANYLDR----AAFYNMTSQTQAGMVG 91 (661) Q Consensus 23 V~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~--~YL--Pk~~~E~~~~Y~~rl~r----A~~~n~~~~tv~~l~G 91 (661) -..+|-+ .....+.+++.++.....+++. .. .|. =+|+.+.....+.|-+- ..-+|.++++|+.++| T Consensus 1 ma~~~~~--~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g 78 (708) T protein:vir:17 1 MAETLEK--KHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIA 78 (708) T ss_pred CchhHHH--HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHh Confidence 1122221 1122233444444444333321 11 132 25666666655554432 5678999999999999 Q ss_pred hhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEE--EEeccCC Q lcl|NC_019406. 92 QIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGA--LVDVAPS 169 (661) Q Consensus 92 ~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gv--LVD~P~a 169 (661) .-=+..+.+.-+|.. .+.|. ..+---...|+.+++. ++.+.-...+|..++..|.+|+ ..||-.- T Consensus 79 ~e~~nr~d~~v~p~~-----~~~d~---~~Ae~l~~l~~~~~~~-----~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e 145 (708) T protein:vir:17 79 EYRNNRITVKFRPGD-----REASE---ELANKLNGLFRADYEE-----TDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) T ss_pred hHhhCCcceEEecCC-----CcchH---HHHHHHHHHHHHHHHh-----cCchhHHhHHHHHhhhcccceeeeeeccccc Confidence 977777766544441 01111 0111112233443333 3556668899999999999988 4455321 Q ss_pred C---chhhcccceeEeechhhh-ccceeeccccccceeeeeeeeeeeecccccccccccee--eeechhhhhcchhhhhc Q lcl|NC_019406. 170 S---DPTAPAKSYTVGYAAENI-VDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWI--GREGSETAQRTSGGRRA 243 (661) Q Consensus 170 ~---~~~~g~rPY~~~~~p~~I-inW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i--~~~~~e~vi~w~~~~~~ 243 (661) + +--.+..-+.+...+.+| +||...+.+.. .-.|+..+.+...+.- ....|-- .......+.+|...-.+ T Consensus 146 ~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~s-Dar~~~~~~~~~~d~~---~~~yp~~a~~~~~~~~~~~~~~~~~~ 221 (708) T protein:vir:17 146 YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKS-DALWAFCMYSLSPEKY---EAEYGKKPPASLDVTSMTSWEYDWFD 221 (708) T ss_pred CCCCCCccccceEeeccchhheecCccccccChh-hhhhhhhhccCCHHHH---HHhCccccchhhhhhhhccccccccC Confidence 1 111122212222233566 55554333321 0112222222211100 0000000 00001111122111000 Q ss_pred chhhhhhhhhhhheecccccCCCceeeEEEE--EEEeecccccce-----------------------------EEEEEE Q lcl|NC_019406. 244 GLAERQGSARADALARPSRFTSSYTFRTIYR--ELILELQKDGSR-----------------------------VYKQFV 292 (661) Q Consensus 244 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r--v~~l~~g~~g~~-----------------------------~~~~~~ 292 (661) . .-+++.+ |.++.+.+ ++++..+..|.. +++|+. T Consensus 222 ~----d~vrv~e-----------~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~ 286 (708) T protein:vir:17 222 A----DVIYIAK-----------YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYV 286 (708) T ss_pred C----CeEEEEE-----------EEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEE Confidence 0 0011111 11111111 111111111110 011110 Q ss_pred EecCcccccccceeeccCC--cccceeeEEEEecCCCCCCcccc-------chhHHHHHHHHHHhhhhhHHHHHHHhcCc Q lcl|NC_019406. 293 YVEDPLGQARDVYTPMVRG--RTLPFIPFVFFGSMSNAADCEKP-------PLLDIVELNLKHYRTYAELEHGRFFTALP 363 (661) Q Consensus 293 ~~~~~~~~~~~~~~p~~~g--~~L~~IPfv~~~~~~~~~~~~~p-------PLldLA~LNl~HYq~sSDl~~il~~~~~P 363 (661) +.-. +..+. -.. -|-+.+|+|+|+...... .+.| .+.|.=. .+ -++.|.-++++......+ T Consensus 287 ~~~~-----g~~~l--~~~~~~p~~~fP~vP~~g~r~~~-d~~~~~yG~vr~~kd~Q~-~~-N~~~S~~~~~~a~~~~~~ 356 (708) T protein:vir:17 287 SVVD-----GDGFL--EKPRRIPGEHIPLIPVYGKRWFI-DDIERVEGHIAKAMDPQR-LY-NLQVSMLADTAAQDPGQI 356 (708) T ss_pred Eeec-----ccccc--cCCCCCCCCccceEEEecccccc-cCCCcccchhhhchhHHH-HH-HHHHHHHHHHHHhcCCcc Confidence 0000 00000 011 123446666554322211 1112 1111111 11 222333344443333333 Q ss_pred eeE----EecCCCCCCce----------eEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcc Q lcl|NC_019406. 364 TYY----APELDDSDASE----------YHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMP 428 (661) Q Consensus 364 ~l~----i~Gl~~~~~~~----------l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~ 428 (661) ..+ +.|+...|... ..+.....+..+. ...++.+++.. -.....+-|+.....|..+ |..-.. T Consensus 357 ~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~-a~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGi~d~~ 434 (708) T protein:vir:17 357 PIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAG-ATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAM 434 (708) T ss_pred eeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccc-cCCcccCCCcc-ccHHHHHHHHHHHHHHHHhcCCChHH Confidence 222 23443322110 1122222222221 11334444332 2234444455555554433 422211 Q ss_pred cccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH----HHHHHHHHcCCCC------CCcceEEEEecc-------- Q lcl|NC_019406. 429 GMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTS----VVRYWLMFRDIPL------TDTATLRYEIDA-------- 490 (661) Q Consensus 429 ~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~----aL~~~A~w~G~~~------~~~~~~~v~ln~-------- 490 (661) .+. ..+.|+.+...+..+..-.|+.+-.|+..+... +|.++..+++.+- .+...-.|.||. T Consensus 435 ~G~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g 513 (708) T protein:vir:17 435 QQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTG 513 (708) T ss_pred ccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCc Confidence 122 234588888888888888888888887776654 4777778774221 011111122221 Q ss_pred -------------cc------ccccCCHHHHHHHHHHHhcCCCCH---HHHHHHHHhcCCCCccCCHHHHHHHHhccCCC Q lcl|NC_019406. 491 -------------TF------LTTALDARALRAIQQLYEGGLLPI---DALYENFVKNGIIPSTQTLEEFTIKMNDPKSF 548 (661) Q Consensus 491 -------------DF------~~~~lda~~l~all~~~~aG~Is~---et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~ 548 (661) |+ .......+.+.+|++++....... ..+...|-. ..+---.+++.++|..+.+. T Consensus 514 ~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~---~~D~p~~~ei~e~ir~~~~~ 590 (708) T protein:vir:17 514 AVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILD---NIDGEGLDDFKEYNRNQLLI 590 (708) T ss_pred cceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHH---hcCCCChHHHHHHHHHHhhc Confidence 11 111122345666777766543211 111111211 11222236677777655443 Q ss_pred CCC--chhhhhhcCCccccCCCcc-hhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhh---- Q lcl|NC_019406. 549 IGQ--PDAIAMRRGYVSRQQELDQ-QRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQA---- 621 (661) Q Consensus 549 l~~--ddae~~~~g~~~~~~~~~q-~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 621 (661) .+. +.++. +.++...+ .......++++++.+.+..+++.+..+++-.+. ++..++...+-+-+++ T Consensus 591 ~~~~~~~~~e------~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~--~~q~~a~q~~~~~~~a~~~a 662 (708) T protein:vir:17 591 SGIAKPRNEK------EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETA--QTQIKAFTAQQDAMESQANT 662 (708) T ss_pred cccccCcchh------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH Confidence 221 11110 00000000 011111223333333333222222222221111 1111111111111111 Q ss_pred --hhhhhhhhH--------------HHHhhcccccCCCCCCCcccccCCCCc Q lcl|NC_019406. 622 --KPSKAEQAQ--------------IDAQQKQAAAKPVTPTPGTVQRGRPPQ 657 (661) Q Consensus 622 --~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 657 (661) .++|++..+ -..|+-|++|.|..|- --||. T Consensus 663 ~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~q~~~a~p~~~~------~~~~~ 708 (708) T protein:vir:17 663 VYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPA------DLMPS 708 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHhccccCch------hccCC Confidence 122221111 1122344445554441 11221 No 95 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=568 Identities=10% Similarity=0.007 Sum_probs=180.9 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCC---cccC--CCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhc Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQG---VKYL--KAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFR 95 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g---~~YL--Pk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFr 95 (661) |.=+ +-.+...+..| +.++.....+|+.. ..|. =+|+.+.....+. ..|-+ +|.++++|++++|.-=+ T Consensus 1 m~d~--~~~~~~~~~~~---~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~-q~rp~-~N~i~~~i~~v~g~e~~ 73 (725) T protein:vir:92 1 MADN--ENRLESILSRF---DADWTASDEARREAKNDLFFSRISQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMRQ 73 (725) T ss_pred CCch--HHHHHHHHHHH---HHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh-cCCCc-ccchHHHHHHHHhhHHh Confidence 2211 11222222222 22222222222210 0111 2565554444433 34444 59999999999998777 Q ss_pred cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEE--EEeccCCCchh Q lcl|NC_019406. 96 RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGA--LVDVAPSSDPT 173 (661) Q Consensus 96 k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gv--LVD~P~a~~~~ 173 (661) ..+.+.-+|..- .|. ..+---...|+.+++ -++.+.-..++|..++.+|.+|+ ..||...+.-. T Consensus 74 nr~d~~v~P~~~----~d~-----~~Ae~l~~~~~~~~~-----~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~ 139 (725) T protein:vir:92 74 NPIDVLYRPKDG----ASP-----DAADVLMGMYRTDMR-----HNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTS 139 (725) T ss_pred CCcceEEecCCc----cHH-----HHHHHHHHHHHHHHH-----hhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCC Confidence 777775555421 111 111111223333333 45667778899999999999984 44774322111 Q ss_pred h--cccceeEeechh-hh-ccceeeccccccceeeeeeeeeeeeccc-----cccccccceeeeechhhhhcchhhhhcc Q lcl|NC_019406. 174 A--PAKSYTVGYAAE-NI-VDWTVEDVDGFYVPTRILLREFERVDEH-----ATPSQQNPWIGREGSETAQRTSGGRRAG 244 (661) Q Consensus 174 ~--g~rPY~~~~~p~-~I-inW~~~~~~g~~~Lt~v~ire~~~~~~~-----~~~~~~~~~i~~~~~e~vi~w~~~~~~g 244 (661) . .++-.. ++.|. +| +||...+.+... =.|+.++.+...+.. .++...... ......-+|...-... T Consensus 140 ~~~~i~~~~-i~~~~~~V~~Dp~a~~~D~sD-ar~~~~~~~~~~d~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 214 (725) T protein:vir:92 140 NNQVIRREP-IHSACSHVIWDSNSKLMDKSD-SRHCTVIHSMSQNGWEDFAEKYDLDADDI---PSFQNPNDWVFPWLTQ 214 (725) T ss_pred CceeeEEee-ccCChhhcccCchhhccChhh-HHHHHHHhcCCHHHHHHHHhhcCcchhhh---hhcccCCcccccccCC Confidence 1 111111 12333 33 555543332210 001212222211100 000000000 0000111111100000 Q ss_pred hhhhhhhhhhhheecccccCCCceeeEEEEEEE------eecccccceE---------------------------EEEE Q lcl|NC_019406. 245 LAERQGSARADALARPSRFTSSYTFRTIYRELI------LELQKDGSRV---------------------------YKQF 291 (661) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~------l~~g~~g~~~---------------------------~~~~ 291 (661) ..+++.++ +||+.+ +.....|..+ .+.+ T Consensus 215 ----d~vrv~e~---------------~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~k 275 (725) T protein:vir:92 215 ----DTIQIAEF---------------YEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRR 275 (725) T ss_pred ----CeEEEEEE---------------EEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeee Confidence 00111111 111111 1111111100 0000 Q ss_pred EEecCcccccccceeeccCCcccceeeEEEEecC----CCCC---CccccchhHHHHHHHHHHhhhhhHHHHHH-HhcCc Q lcl|NC_019406. 292 VYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSM----SNAA---DCEKPPLLDIVELNLKHYRTYAELEHGRF-FTALP 363 (661) Q Consensus 292 ~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~----~~~~---~~~~pPLldLA~LNl~HYq~sSDl~~il~-~~~~P 363 (661) ++.---.|....+..+...| ++||||+|... ++.+ ..-. ++.|.=. .+. ++.|.-+ +++. ....+ T Consensus 276 v~~~~~~g~~~l~~~~~~~~---~~~P~vP~~g~r~~~~g~~~~~G~vr-~~kd~Q~-~~N-~~~S~~~-~~~~~~~~~~ 348 (725) T protein:vir:92 276 VYKSIITCTAVLKDKQLIAG---EHIPIVPVFGEWGFVEDKEVYEGVVR-LTKDGQR-LRN-MIMSFNA-DIVARTPKKK 348 (725) T ss_pred EeeeeecchhhhcCCCCCCC---CceeeEEEEeeeeccCCcccccceec-cchhHHH-HHH-HHHHHHH-HHHHhccCcc Confidence 00000000000000001112 34555544322 1111 1111 1111111 111 2222222 3332 22222 Q ss_pred eeEEecCCCCCCceeEecccceee----cCCC-----CCcceEeecCchhHHHHHHHHHHHHHHHHHH-hH--Hhccccc Q lcl|NC_019406. 364 TYYAPELDDSDASEYHIGPGRVWV----VDKE-----SGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GG--RLMPGMS 431 (661) Q Consensus 364 ~l~i~Gl~~~~~~~l~iGs~~~~~----lp~~-----ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GA--rll~~~~ 431 (661) ..+-.|.-++.........+..+. .+.. ...+.+..+..- ...+.+-|+.....|..+ |. .++ +. T Consensus 349 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~-p~~~~~ll~~~~~~i~~~tGi~~~~l--G~ 425 (725) T protein:vir:92 349 PFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENPEV-PQANAYMLEAATAAVKEVATLGVDAE--AV 425 (725) T ss_pred cccchhhhhHHHHHHhccCccceeeccccccccccccccCCcccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHh--cc Confidence 222222211100000100000110 1111 113444443222 233344445555555433 32 233 22 Q ss_pred CccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEeccc---------- Q lcl|NC_019406. 432 KSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDAT---------- 491 (661) Q Consensus 432 ~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~D---------- 491 (661) .+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..+++..- .++..-.+.||.. T Consensus 426 ~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~ 505 (725) T protein:vir:92 426 NGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQ 505 (725) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchh Confidence 34457888888888888888888888888877664 677777774321 1111122333321 Q ss_pred --------cccc--------cCCHHHHHHHHHHHhcCC-CCHHHHHHH-HHhcCCCCccCCHHHHHHHHhccCCCCC--C Q lcl|NC_019406. 492 --------FLTT--------ALDARALRAIQQLYEGGL-LPIDALYEN-FVKNGIIPSTQTLEEFTIKMNDPKSFIG--Q 551 (661) Q Consensus 492 --------F~~~--------~lda~~l~all~~~~aG~-Is~et~~~e-L~r~gvl~~~~~~Eee~~~l~~~~~~l~--~ 551 (661) |... ....+.+.+|++++..-- +. ...-.. ++--. +++..-.+++.+++..+.+.-+ . T Consensus 506 ~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~-~~~~~~l~~~~~-~~d~~~~~e~~erirkq~~~~~~~~ 583 (725) T protein:vir:92 506 VLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGT-PEYQLLLLQYFT-LLDGKGVEMMRDYANKQLIQMGVKK 583 (725) T ss_pred hhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccch-hHHHHHHHHHhh-cccchHHHHHHHHHHhhhchhccCC Confidence 1110 001133444555544211 10 000000 10001 1122223555566654332211 1 Q ss_pred ch----hhhhhcCCccccCCCcchh----------hhhcCChhhHHHHHHHh---------ccCC--------------- Q lcl|NC_019406. 552 PD----AIAMRRGYVSRQQELDQQR----------AARDADFQQQELEQAER---------HLEI--------------- 593 (661) Q Consensus 552 dd----ae~~~~g~~~~~~~~~q~~----------~~~e~d~~q~~~~~~e~---------~~~~--------------- 593 (661) ++ .+.++.- .+.+..|.+ ...++|+++..-|..+- +++. T Consensus 584 ~~~~e~~q~~~~~---qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~ 660 (725) T protein:vir:92 584 PETPEEQQWLVEA---QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLS 660 (725) T ss_pred ccchhhhHHHHHH---HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH Confidence 10 0001000 000000000 01122222111111000 0000 Q ss_pred -CchhHHHhhhhhhhhhhHH--HhcCChhhhh-hhhhhhhHH---HHh---hcccccCCCCCCCc Q lcl|NC_019406. 594 -DEEKLRISAKVGSTSVAAS--RKLGDPEQAK-PSKAEQAQI---DAQ---QKQAAAKPVTPTPG 648 (661) Q Consensus 594 -~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~---~~~---~~~~~~~~~~~~~~ 648 (661) +..+++....++++.+..+ .|.+.+..+| ..|..+.+. +.- ++|...--++.||- T Consensus 661 q~~~~~~~~~~~~~~q~~~~~~a~~~ae~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 725 (725) T protein:vir:92 661 KQSEFREFLKTVASFQQDRSEDARANAELLLKGNEQTHKQRMDIANILQSQRQNQPSGSVAETPQ 725 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHhcchhccCCccccccCCC Confidence 0011111112222211100 1111111111 111111111 111 22222223445554 No 96 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.39 E-value=1.1e-06 Score=53.21 Aligned_cols=439 Identities=11% Similarity=0.022 Sum_probs=176.6 Q ss_pred HHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhh Q lcl|NC_019406. 49 EIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGK 128 (661) Q Consensus 49 ~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~ 128 (661) -+.+.|+. .--......|.+++.+ ...+++++.+ ....+.+..=+.-+|... + -.. T Consensus 1 ~~~~~~~~---~~V~~~hp~y~a~~~~---W~~ird~~~G-~~~~~~r~~yl~~~~~~~-----~------------e~~ 56 (489) T protein:vir:78 1 MLTENGQG---SGVKTKHREWLHYAPK---WQKVRHALAG-ELVSYLRNVGLNEPDKAY-----G------------EAR 56 (489) T ss_pred CccCCCcc---CCCCccCHHHHHHHHH---HHHHHHHhcC-cccccccCCCCCCCCCCC-----C------------hHH Confidence 01111110 0011122345444433 3334444443 113333322111111100 0 001 Q ss_pred hHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEE---EEEeccCC--------CchhhcccceeEeechhhhccceeeccc Q lcl|NC_019406. 129 LLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFG---ALVDVAPS--------SDPTAPAKSYTVGYAAENIVDWTVEDVD 197 (661) Q Consensus 129 ~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~g---vLVD~P~a--------~~~~~g~rPY~~~~~p~~IinW~~~~~~ 197 (661) ...++.+. .+.-+.++.++.. .|+++ +.+|.|.. +......--++ .++...- ..- T Consensus 57 Y~~rl~rA-----~~~n~~~~tl~~l--~G~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~-----~~~~~~~--l~~ 122 (489) T protein:vir:78 57 QAEYEAGG-----IVYNFTRRTLSGM--VGSVMRKEPEINIPKELEYLLKNADGSGVGLIQHA-----QDTLMEI--DSV 122 (489) T ss_pred HHHHHhcc-----ccCChHHHHHHHH--hchhhcCCcceeccHHHHHHHhccCCCCCCHHHHH-----HHHHHHH--Hhc Confidence 23332222 2223333333211 12111 22355531 11111100000 0000000 001 Q ss_pred cccceeeeeeeee---eeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEE Q lcl|NC_019406. 198 GFYVPTRILLREF---ERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYR 274 (661) Q Consensus 198 g~~~Lt~v~ire~---~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 274 (661) | ..+|++.-. .....+....+.+||+.+|.+++||||++.+++|+..+++++++|+ +. T Consensus 123 G---~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~----------------~~ 183 (489) T protein:vir:78 123 G---RGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRET----------------WE 183 (489) T ss_pred C---eEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEe----------------EE Confidence 1 112222111 1122233456779999999999999999999999999999887642 11 Q ss_pred EEEeecccccc-eEEEEEEEecCcccccccc-eeeccCCccc-ceeeEEEEecCCCCCCccccchh------------HH Q lcl|NC_019406. 275 ELILELQKDGS-RVYKQFVYVEDPLGQARDV-YTPMVRGRTL-PFIPFVFFGSMSNAADCEKPPLL------------DI 339 (661) Q Consensus 275 v~~l~~g~~g~-~~~~~~~~~~~~~~~~~~~-~~p~~~g~~L-~~IPfv~~~~~~~~~~~~~pPLl------------dL 339 (661) +.. ..+.++. .+.++++..-+.++..... +.....|... ..++.+. ..++...+..||- .. T Consensus 184 ~~d-~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~---~~g~~~l~~IPfv~~~~~~~~~~~~~p 259 (489) T protein:vir:78 184 YNE-PGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYP---DLGESLRGVIPFTFIGATNNDATIDDA 259 (489) T ss_pred eec-CCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEec---cCCCCccCeeeEEEEecCCCCCCCCcC Confidence 110 1122333 3445666666655544322 2223334433 2233322 2233334444433 22 Q ss_pred HHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCc---e-----eEecccceeecCCCCCcceEeecCchh---HHHH Q lcl|NC_019406. 340 VELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDAS---E-----YHIGPGRVWVVDKESGIPGIIEFKGEG---LKTL 408 (661) Q Consensus 340 A~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~---~-----l~iGs~~~~~lp~~ga~~~ylE~~g~~---i~a~ 408 (661) -.+.|+|-+..--+..+-... ++.+.++.--+.. . +..|....+.++. ...+.-|.+.. ++.. T Consensus 260 PLl~LA~lni~Hy~~ssd~~~---~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~---~~~~~lp~~~~~~~ie~~ 333 (489) T protein:vir:78 260 PLLPLAELNIGHYRNSADNEE---SSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGS---RRGHNLGYGGSAQLIQAG 333 (489) T ss_pred chHHHHHHHHHHhhhhhHHHH---HHHHcccceeeeecCccCCcccccccCccceeeCC---cccccCCCCCCcceeccC Confidence 245666654333222222211 2233333222211 1 1223333445542 23333333432 3333 Q ss_pred HHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEe Q lcl|NC_019406. 409 ERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEI 488 (661) Q Consensus 409 ~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~l 488 (661) -..+ ..+.|..+=.+|...+-+-..+++..+....+.+.+.-.+.=..+...++.+|..+-+|+..-. + ..... T Consensus 334 ~~~~--~r~~l~~le~qm~~lGa~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~-G-~~~~~-- 407 (489) T protein:vir:78 334 ENNL--ARQNMLDKEQQAIQIGAQLITPTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVML-G-KPEDT-- 407 (489) T ss_pred cchH--HHHHHHHHHHHHHHHhhhhccCCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc-C-CCCCC-- Confidence 3333 2444655555555433334444554556666666777777888899999999999999997542 1 11000 Q ss_pred cccc-ccccCCHHHHHH-HHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhc------- Q lcl|NC_019406. 489 DATF-LTTALDARALRA-IQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRR------- 559 (661) Q Consensus 489 n~DF-~~~~lda~~l~a-ll~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~------- 559 (661) ...| ....++.+.++. .++ .+ -.+...|.|+....+++.+.+ .-++.++ +...+ T Consensus 408 ~~~i~~n~dF~~~~~d~~~~~----------al-~~~~~~G~is~~t~~~~L~~~-----gv~d~~~-e~~~~ei~~~~~ 470 (489) T protein:vir:78 408 EVEFRLNMDFFLEPMTAQDRA----------AW-MADINAGLLPATAYYAALRKA-----GVTDWTD-ADIKDAVADQPL 470 (489) T ss_pred ceEEEeecccCcccCCHHHHH----------HH-HHHHhcCCCCHHHHHHHHHhC-----CCCCccH-HHHHHHHhhcCC Confidence 0122 233344333321 111 12 234457777766555544331 1222111 11111 Q ss_pred -CCccccCCCcchhhhhcCChhhHHH Q lcl|NC_019406. 560 -GYVSRQQELDQQRAARDADFQQQEL 584 (661) Q Consensus 560 -g~~~~~~~~~q~~~~~e~d~~q~~~ 584 (661) +......++|++ -||+++ T Consensus 471 ~~~~~~~g~~~~~-------~q~~~~ 489 (489) T protein:vir:78 471 PVATEVQGEIPQS-------AQQQEK 489 (489) T ss_pred CcccCCcccCCCC-------cccccC Confidence 111122222322 111111 No 97 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.36 E-value=8.5e-05 Score=42.89 Aligned_cols=484 Identities=13% Similarity=0.107 Sum_probs=189.9 Q ss_pred CC------CCCCccccccccccc--cccCCccccCHHHHHHHHHHH-HHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MA------GLSPNSANIRRTKRG--AQQFTHLVVHPEYEYYRPDWA-KIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~------~~~~~~~~~~~~~~~--~~~~~V~~~hPey~a~~~~W~-~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |+ |..+-.-.++.++.+ +....+..-||.-.=--.+|. .++..-.|.- .+. -+-|+. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~----------~~~----~~L~ed 66 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNL----------QAQ----AELFMD 66 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCH----------HHH----HHHHHH Confidence 43 333322222222111 111122222221100001232 3344434421 111 245666 Q ss_pred HHhh-hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHH Q lcl|NC_019406. 72 YLDR-AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTV 150 (661) Q Consensus 72 rl~r-A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~ 150 (661) -+.| +.+...+..-..++. ..+..|+ |+.= .+.+. -..-+.+.+++.++ .+++.++.++ T Consensus 67 m~e~D~~i~s~l~~Rk~av~----~~~w~I~--p~~~----~~~~~------~~~a~~v~~~l~~~----~~~~~~i~~~ 126 (526) T protein:vir:79 67 MEERDAHLFAEMSKRKRAIL----GLDWAVE--PPRN----ASAAE------KADADYLHELLLDL----EGLEDLLLDA 126 (526) T ss_pred HHhhChHHHHHHHHHHHHHh----CCCceEe--cCCC----CChHH------HHHHHHHHHHHhcc----cCHHHHHHHH Confidence 6644 444444444444444 4455553 1100 00000 00111233333222 2588888888 Q ss_pred HHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeec Q lcl|NC_019406. 151 ALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREG 230 (661) Q Consensus 151 ~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~ 230 (661) +. ++-||.+.+=+-| +.+ +|.-.+..+..| T Consensus 127 ld-A~~~G~s~~Ei~w-------------------------~~~--~g~~~~~~l~~r---------------------- 156 (526) T protein:vir:79 127 LD-GIGHGYSCIELEW-------------------------ALQ--GREWMPLAFHHR---------------------- 156 (526) T ss_pred Hh-hhhhcceeEEEEE-------------------------eec--CCceeEEEeeee---------------------- Confidence 75 7778866555433 221 121111000000 Q ss_pred hhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccC Q lcl|NC_019406. 231 SETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVR 310 (661) Q Consensus 231 ~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~ 310 (661) ...++.|.. ++..+.+. +.+ ... T Consensus 157 ------------------------------~~~~F~~~~-------------~~~~~l~~---~~~-----------~~~ 179 (526) T protein:vir:79 157 ------------------------------PQSWFQLNP-------------EDQNELRL---RDN-----------SPA 179 (526) T ss_pred ------------------------------cccceEecc-------------CCCcEEEe---cCC-----------CCC Confidence 000000000 00000000 000 011 Q ss_pred CcccceeeEEE-EecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce-----eEec Q lcl|NC_019406. 311 GRTLPFIPFVF-FGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE-----YHIG 381 (661) Q Consensus 311 g~~L~~IPfv~-~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~-----l~iG 381 (661) |.+|+.-=|++ .+....+...+...|..++..-+---....|.-.-+..-|.|+++.. |.++++++. ..|| T Consensus 180 g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~ 259 (526) T protein:vir:79 180 GEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLG 259 (526) T ss_pred ceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHh Confidence 22222111222 23333444455555665655544333366677777888899998875 444443332 3589 Q ss_pred ccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH--HhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA--IGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMAL 459 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~--lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~l 459 (661) ++++.++|. |..+.|++..+.+....+.-++...++|.. +|--|......+..-|--..........-++.+-+..+ T Consensus 260 ~da~~iiP~-~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i 338 (526) T protein:vir:79 260 HAAAGIIPE-TMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQL 338 (526) T ss_pred cCcEEEecC-CceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHH Confidence 999999995 789999998776666677777777777663 45433221111111122344556666777889999999 Q ss_pred HHHHH-HHHHHHHHHcCCCCCCc-ceEEEEeccccccccCCH-HHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCccCCH Q lcl|NC_019406. 460 EDGMT-SVVRYWLMFRDIPLTDT-ATLRYEIDATFLTTALDA-RALRAIQQLYEGGL-LPIDALYENFVKNGIIPSTQTL 535 (661) Q Consensus 460 e~Al~-~aL~~~A~w~G~~~~~~-~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~-Is~et~~~eL~r~gvl~~~~~~ 535 (661) +++++ +++++++.|-+-...+. --.+|.+.. . ...|- ..++.+..+...|. |+.+.+.+ +-|+ |.-... T Consensus 339 ~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~-e~eDl~~~a~~~~~L~~~G~~i~~~~i~e---~~gi-p~~~~~ 411 (526) T protein:vir:79 339 AATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--R-EQADITSMAQSIPALVNVGLEIPSAWVYD---KLGI-PQPAKN 411 (526) T ss_pred HHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--C-CcccHHHHHHHHHHHHhCCCcCCHHHHHH---HhCC-CCCCCc Confidence 99997 59999999986432221 112333321 1 11121 23555666777787 77665533 3465 332333 Q ss_pred HHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhh--------cCC-------hhhHHHHHHHhccCCCchhHHH Q lcl|NC_019406. 536 EEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAAR--------DAD-------FQQQELEQAERHLEIDEEKLRI 600 (661) Q Consensus 536 Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~--------e~d-------~~q~~~~~~e~~~~~~~~~~~~ 600 (661) |+.......+.+.-..+..... .+..........+.+- .+| +-.+..++-+...-+++-+..| T Consensus 412 e~~l~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~~~i~~~~~~~~s~ee~~~~L 489 (526) T protein:vir:79 412 EPVLRPAAQPAILSRQHGQRVA--ALATIVGPRYGDQQALDKALADLPAKDMQNQANDLLAPLLDAVNRGDSETELLGAL 489 (526) T ss_pred hhhccccCCccccccccccccc--cccccccccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHH Confidence 3333221111111000000000 0000000000000000 000 0011111111111122222222 Q ss_pred hhhhhhhhhhHHHhcCChhhhhhhhhhhhHHHHh--hcccccCCCC Q lcl|NC_019406. 601 SAKVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQ--QKQAAAKPVT 644 (661) Q Consensus 601 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 644 (661) .+..+++..+ ++...=+..--++.= +--+....+- T Consensus 490 ~~l~~~ld~~---------~l~~~l~~a~~~A~l~Gr~~~~~e~~~ 526 (526) T protein:vir:79 490 AEAFPDMDDS---------ALTDALHRLLFAADTWGRLHGNLDRID 526 (526) T ss_pred HHHhccCCHH---------HHHHHHHHHHHHHHHhhhhhhhhcccC Confidence 2221111111 111100000000000 0000000000 No 98 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=97.33 E-value=9.3e-05 Score=42.68 Aligned_cols=462 Identities=12% Similarity=0.044 Sum_probs=189.4 Q ss_pred ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC------CCChHHHHHHHhhhcccchHHHHHHHHhchhhccCc Q lcl|NC_019406. 25 VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK------GFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPP 98 (661) Q Consensus 25 ~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~------~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p 98 (661) .++|+...-..--..++|.+.+ +.. + .++|... +-+-..|+..+. -.++...++.....|...+. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~---~~~-~-l~~~~~~il~~a~~g~~~~y~~l~~----D~~i~s~l~~rk~av~~~~w 71 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRP---FIS-G-LQVPNDSILQRRGGNDLRVYEEILS----DAQVKTVWGQRQLAVVSREW 71 (488) T ss_pred CCccchhHHHHHHHhhhhhhcc---ccC-C-CCCCChHHHHhhccCCHHHHHHHhh----ChHHHHHHHHHHHHHhcCCc Confidence 3444433222211122333321 111 1 2334321 011245666554 45667777777777777777 Q ss_pred cccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccc Q lcl|NC_019406. 99 VIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKS 178 (661) Q Consensus 99 ~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rP 178 (661) .|+ |+. .+... -+..+.+.+.+ .+.+++.+++.++. ++-||.+.+=+-|-.. T Consensus 72 ~i~--p~~-----~~~~~------~~~ae~v~~~l-----~~~~~~~~l~~~ld-a~~~G~s~~Ei~w~~~--------- 123 (488) T protein:vir:99 72 KVE--AGG-----DRPID------QAAAEHLEQQL-----QRVGWDRVTSKMLF-GVFYGYAVSELIYGRD--------- 123 (488) T ss_pred eEE--cCC-----CChHH------HHHHHHHHHHH-----hCCCHHHHHHHHHh-hhhhcceeEEEEEeec--------- Confidence 773 221 01000 01112333333 45578999999884 7778877665544211 Q ss_pred eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhee Q lcl|NC_019406. 179 YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALA 258 (661) Q Consensus 179 Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~ 258 (661) +|.-.+..+..+- + T Consensus 124 ------------------~g~~~~~~l~~r~---------------------~--------------------------- 137 (488) T protein:vir:99 124 ------------------DRYITLEAIKVRN---------------------R--------------------------- 137 (488) T ss_pred ------------------CCeeeEeeeeeec---------------------c--------------------------- Confidence 2211111110000 0 Q ss_pred cccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEE---EecCCCCCCccccc Q lcl|NC_019406. 259 RPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVF---FGSMSNAADCEKPP 335 (661) Q Consensus 259 ~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~---~~~~~~~~~~~~pP 335 (661) ..+.|. .++..+++. +.+. ..|.+|+ .|+.| .+..+.+-..+.+. T Consensus 138 ----~~f~~d-------------~~~~l~~~~---~~~~-----------~~g~~lp-~~~~~i~~~~~~~~g~p~g~gL 185 (488) T protein:vir:99 138 ----RRFRYD-------------QDGGLRLLT---PNNM-----------FEGEPCP-APYFWHFSTGADNDDEPYGLGL 185 (488) T ss_pred ----cceeec-------------CCCceEEec---cCCC-----------CCccccc-cCceEEEEeecCCCCCcccchH Confidence 000000 000001000 0000 0122222 23222 22333334445555 Q ss_pred hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe----cCCCCCCce-----eEecccceeecCCCCCcceEeecCchhHH Q lcl|NC_019406. 336 LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP----ELDDSDASE-----YHIGPGRVWVVDKESGIPGIIEFKGEGLK 406 (661) Q Consensus 336 LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~----Gl~~~~~~~-----l~iGs~~~~~lp~~ga~~~ylE~~g~~i~ 406 (661) |..++..-+---....+...-+..-|.|+++.. |.+++++.. ..+|+.++..+|. |..+.|++.++.+.. T Consensus 186 l~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~viP~-~~~ie~~ea~~~~~~ 264 (488) T protein:vir:99 186 AHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIMPA-GMQAELLEAGRSGTA 264 (488) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEecC-CceeEEeecCCCChH Confidence 666655543333345667777778899988875 122222222 3588899999995 789999998877777 Q ss_pred HHHHHHHHHHHHHHH--HhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCcce Q lcl|NC_019406. 407 TLERALNEKEQQIAA--IGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT-SVVRYWLMFRDIPLTDTAT 483 (661) Q Consensus 407 a~~~~L~~le~qM~~--lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~-~aL~~~A~w~G~~~~~~~~ 483 (661) ..+.-++...++|.. +|- .+....+++ |--...........++.+.+..++++++ +++.+++.|-. +.... T Consensus 265 ~~~~li~~~d~~Isk~iLGq-tlts~~~~G--s~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~-~~~~~-- 338 (488) T protein:vir:99 265 DYKTLHDTMDATIAKVGLGQ-VASTQGTPG--RLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNF-PGAQP-- 338 (488) T ss_pred HHHHHHHHHHHHHHHHHhhh-hhccccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCc-CCcCC-- Confidence 777777777777764 344 443322221 2234455666677888999999999997 58999999875 22222 Q ss_pred EEEEeccccccccCCH-HHHHHHHHHHhc-CC-CCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhh-- Q lcl|NC_019406. 484 LRYEIDATFLTTALDA-RALRAIQQLYEG-GL-LPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMR-- 558 (661) Q Consensus 484 ~~v~ln~DF~~~~lda-~~l~all~~~~a-G~-Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~-- 558 (661) ..|.+.. ... -|. ...+.+.++.+. |. |+.+.+. ++-|+=.+... ++. ....+.. ..++... T Consensus 339 p~~~~~~--~e~-edl~~~a~~~~~l~~~~G~~i~~~~i~---e~~Gip~~~~~-~~~----~~~~~~~--~~~~~~~~~ 405 (488) T protein:vir:99 339 PRVYRVI--EEP-EDITAKAERDEKVFRMSGFRPTRGYVQ---ETYGVEVESTQ-AEA----TAPTPST--EFAEGDQPS 405 (488) T ss_pred ceeEecC--CCc-ccHHHHHHHHHHHHhhcCCCCCHHHHH---HHcCCCCcccc-ccc----ccCCCcc--cCCCCCCCC Confidence 2333321 211 122 235555666663 65 6655443 33455222211 111 0011100 0000000 Q ss_pred cCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHHHHhhccc Q lcl|NC_019406. 559 RGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQQKQA 638 (661) Q Consensus 559 ~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 638 (661) +........++........++-.+..++-++..-+++-+..|.+-.+++ |+.++..+=+..--+..=.-.. T Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~---------d~~~l~~~l~~a~~~a~l~G~~ 476 (488) T protein:vir:99 406 DPAAAMAPQLAEAMQPVVGNWTTQLRTLIEQASSLEDLRERLLDLAPQL---------SLDQYAQAMAEGLEAAHLAGRN 476 (488) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccC---------CHHHHHHHHHHHHHHHHHhhhh Confidence 0000000000000000000111122222222222222222222211111 1111111100000000000000 Q ss_pred ccCCCCCCCccccc Q lcl|NC_019406. 639 AAKPVTPTPGTVQR 652 (661) Q Consensus 639 ~~~~~~~~~~~~~~ 652 (661) .+ ..+--|--|- T Consensus 477 ~~--~~e~~~~~~~ 488 (488) T protein:vir:99 477 DV--QEELDGREQI 488 (488) T ss_pred hH--hhhhcccCCC Confidence 00 0000010000 No 99 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.19 E-value=0.00014 Score=41.80 Aligned_cols=498 Identities=13% Similarity=0.099 Sum_probs=196.2 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCCC---CCChHHHHHHHhhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAPK---GFDDEDYANYLDRA 76 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~~---~E~~~~Y~~rl~rA 76 (661) || =......-.....+|+.+++--.- ...|++...-.||..- +..... ++. - T Consensus 1 m~--------------------~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~---~~~-~ 56 (536) T protein:vir:21 1 MA--------------------EKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAST---DYQ-T 56 (536) T ss_pred Cc--------------------chhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccc---ccc-c Confidence 32 222222223333333332221100 2223333322334221 111111 111 1 Q ss_pred cccchHHHHHHHHhchhhcc--Ccc-c-c-ccch-hhHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHH Q lcl|NC_019406. 77 AFYNMTSQTQAGMVGQIFRR--PPV-I-R-NLPN-TGAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQ 144 (661) Q Consensus 77 ~~~n~~~~tv~~l~G~vFrk--~p~-i-~-~~p~-~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~ 144 (661) .|-+.-.+.++.+...++.- |+. + . .+++ .+..+.. .+.....++.+|+.| -+.-++.+ T Consensus 57 ~~dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~---------~~~~~~~v~~~L~~ve~~~~~~l~~snf~ 127 (536) T protein:vir:21 57 PWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLS---------DPDGLAKVDEGLSMVERIIMNYIESNSYR 127 (536) T ss_pred cccccHHHHHHHHHHHHHHhhcCCCcccccccChhhhhcccc---------chhhHHHHHHHHHHHHHHHHHHHHhcCcH Confidence 34444445555444333321 210 1 0 0000 0110000 011111122222221 12346677 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .-+-.++.+.+.+|-+.+++|-+... +.+ ++..|.-.+ +-+.. ++.+.+.-|..+++..... T Consensus 128 ~~~~~~~~~L~~~G~a~ly~~e~~~~----~~~-~f~~~pl~~---~~v~~-d~~G~vd~i~r~~~~t~~~--------- 189 (536) T protein:vir:21 128 VTLFEALKQLVVAGNVLLYLPEPEGS----NYN-PMKLYRLSS---YVVQR-DAFGNVLQMVTRDQIAFGA--------- 189 (536) T ss_pred HHHHHHHHHHHhHCcEeEEEeeCCCC----cee-eEEEEEcCe---EEEee-CCCCCeeEEeeeeeccHHH--------- Confidence 88888899999999999999855321 111 223333222 22221 1222233233333222110 Q ss_pred eeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccc Q lcl|NC_019406. 225 WIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDV 304 (661) Q Consensus 225 ~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~ 304 (661) -+-.|...... .... . ..+...++++...++.+ ++ .|.|+...++. T Consensus 190 --------l~~~fg~~~~~-----------~~~~--~---~~~~~v~v~~~v~~~~~-~~--~~~~~~e~~g~------- 235 (536) T protein:vir:21 190 --------LPEDIRKAVEG-----------QGGE--K---KADETIDVYTHIYLDED-SG--EYLRYEEVEGM------- 235 (536) T ss_pred --------HHHhhhhhhcc-----------cccc--c---ccccceeEEEEEEEecC-CC--cEEEEeccCCe------- Confidence 01111111000 0000 0 11122234444333322 12 23443322221 Q ss_pred eeeccCC----cccceeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCC Q lcl|NC_019406. 305 YTPMVRG----RTLPFIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDA 375 (661) Q Consensus 305 ~~p~~~g----~~L~~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~ 375 (661) .++...| ..+++||+.|.-..+..+.. .| |-|+..||.- +.+-+..+......|.++-+ |+.+. T Consensus 236 ~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l---~~~~l~~~~~a~~~~~lv~p~g~~~~-- 308 (536) T protein:vir:21 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGR--SYIEEYLGDLRSLENL---QEAIVKMSMISSKVIGLVNPAGITQP-- 308 (536) T ss_pred eeccccCccccccCCeeeeeeeecCCCcccc--chHHHHHHHHHHHHHH---HHHHHHHHHHHhcCCcccCcccccch-- Confidence 1222233 23577777777665555544 44 4477777754 34445666666677766654 33221 Q ss_pred ceeEecccceeecCCCCCcceEeec-CchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHH Q lcl|NC_019406. 376 SEYHIGPGRVWVVDKESGIPGIIEF-KGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLL 453 (661) Q Consensus 376 ~~l~iGs~~~~~lp~~ga~~~ylE~-~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~ 453 (661) ..+.=|....+. |...++.+.++. .+..+....+.|+++++.+... =+.++.. ..+...||++...+...-...|. T Consensus 309 ~~~~~~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~-~~~~r~TAtEV~~r~~E~~~~LG 386 (536) T protein:vir:21 309 RRLTKAQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQ-RTGERVTAEEIRYVASELEDTLG 386 (536) T ss_pred hhhccCCCccee-cCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhccc-CCCCCccHHHHHHHHHHHHHHhh Confidence 122223333333 222234444432 3566888899999999998742 1222211 22345799999999999999998 Q ss_pred HHHHHHHHHHHH-HHHHHHHHc---CC-CCCCcceEEEEeccccccccCCH----HHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_019406. 454 NVIMALEDGMTS-VVRYWLMFR---DI-PLTDTATLRYEIDATFLTTALDA----RALRAIQQLYEGGLLPIDALYENFV 524 (661) Q Consensus 454 ~~A~~le~Al~~-aL~~~A~w~---G~-~~~~~~~~~v~ln~DF~~~~lda----~~l~all~~~~aG~Is~et~~~eL~ 524 (661) .+-..+++=+-. ++.++-..+ |+ +....+-+++ +|. ..+.+ +++..++. |+..|. T Consensus 387 ~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~----~~v-s~l~~l~r~~~~~~l~~-----------~~~~la 450 (536) T protein:vir:21 387 GVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEP----TIS-TGLEAIGRGQDLDKLER-----------CVTAWA 450 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccc----eEE-ecHHHHHHHHHHHHHHH-----------HHHHHH Confidence 888776654433 443333333 21 1111222222 332 22221 12222222 333333 Q ss_pred hcC--CCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhh Q lcl|NC_019406. 525 KNG--IIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISA 602 (661) Q Consensus 525 r~g--vl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~ 602 (661) .-+ +++...++++..+.+.+. +|.+-+..+.. +++..|- .+|+..++|..+.|.+..+. T Consensus 451 ~~~Pe~ld~~id~d~~~~~~a~~---~Gv~p~~~irt-----~eev~~~-------r~q~~~~~~~~~~a~~~~~~---- 511 (536) T protein:vir:21 451 ALAPMRDDPDINLAMIKLRIANA---IGIDTSGILLT-----EEQKQQK-------MAQQSMQMGMDNGAAALAQG---- 511 (536) T ss_pred hhchhhhcccCCHHHHHHHHHHH---cCCChhhhcCC-----HHHHHHH-------HHHHHHHHHHHHHHHHHHHH---- Confidence 222 445567888888888764 33322222221 1111111 11222222222211111111 Q ss_pred hhhhhhhhHHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCcc Q lcl|NC_019406. 603 KVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGT 649 (661) Q Consensus 603 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 649 (661) ++-++.+.-|.-++++.-+---||. T Consensus 512 ----------------------~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 512 ----------------------MAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred ----------------------HHHHHhcChhhHHhhhhccccCCCC Confidence 0111111112222222222222333 No 100 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=97.15 E-value=0.00015 Score=41.55 Aligned_cols=526 Identities=11% Similarity=0.072 Sum_probs=193.7 Q ss_pred ccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHh Q lcl|NC_019406. 11 IRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMV 90 (661) Q Consensus 11 ~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~ 90 (661) ...++. .-.+.-.-+-....++|+-|.+.+ ||..-..+...=..+ ..-.|-+...+.++.+. T Consensus 1 m~~~~~----~r~~~l~~~R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~~~-~~~~~dst~~~a~~~La 62 (555) T protein:vir:17 1 MKHSAQ----AKYMMLRADREDYLDSGRQSARLT-------------LPYILTDEGHVQGGY-LPTPWQSVGSKGVNVLA 62 (555) T ss_pred ChhHHH----HHHHHHHHHhhHHHHHHHHHHHHh-------------cccccCCCCCccccc-ccccccccHHHHHHHHH Confidence 010000 000000011112233444444443 443211111100011 11234455555665555 Q ss_pred chhhcc-----Ccccc-ccch-hhHhhhhcccccccccchhhhhhhHhhhhhcc------CCCCCHHHHHHHHHHHHHhh Q lcl|NC_019406. 91 GQIFRR-----PPVIR-NLPN-TGAITGRDAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFAKTVALEQVAM 157 (661) Q Consensus 91 G~vFrk-----~p~i~-~~p~-~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa~~~~~~~L~~ 157 (661) ..++.- .|=+. .+.+ .+..+..+ ++....++..++.|. +.-++.+.-+-.++.+.+.+ T Consensus 63 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~---------~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~ 133 (555) T protein:vir:17 63 SKLMLSLFPVNTSFFKLQINDAEIDNLGMD---------EQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVT 133 (555) T ss_pred HHHHHhhcCCCCcccccccCHHHHhhccCC---------HHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhH Confidence 444331 11110 0111 01111111 111112222222211 23456888888889999999 Q ss_pred CCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecc--ccccccccceeeeechhhhh Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDE--HATPSQQNPWIGREGSETAQ 235 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~--~~~~~~~~~~i~~~~~e~vi 235 (661) |-+-+++|-.+ .| .|+-.+ +-+.. ++.+.+.-|..+++..... ..++.... .+.+- T Consensus 134 G~a~ly~~~~~-------~~----~~pl~~---y~v~~-d~~G~vd~v~rk~~~t~~ql~~~fg~~~l-------~~~~~ 191 (555) T protein:vir:17 134 GNALLYQGKKN-------LK----LYPLDR---FVVSR-DGEGNVMEIVTEEQIDRSLLPEEFQKVGG-------LEGAP 191 (555) T ss_pred CeEEEEecCCc-------ee----EEEcCe---EEEee-CCCcCeeEEEeeeeecHHHHHHHhhhccc-------cchhh Confidence 99888887431 11 121111 11111 2222333333333322110 01110000 00000 Q ss_pred cchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC-ccc Q lcl|NC_019406. 236 RTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG-RTL 314 (661) Q Consensus 236 ~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g-~~L 314 (661) +.....-++... .+...... .........+|....+ +++ .|.|+....+... .... ...| ..+ T Consensus 192 ~~~~~~~d~~~~----~~~~~~~~---~~~~~~~~~v~t~~~~---~~~--~~~~~~e~~~~~v---~~~l-~e~g~~e~ 255 (555) T protein:vir:17 192 DSNAVGEDGPKM----GVTAPGGR---DKGKSNDALVYTYVCR---KDG--QVKWHQECDGKVI---PGSN-SSAPYTHN 255 (555) T ss_pred hhhhccccchhh----hhhhhccc---ccCCCcceeEeecccc---cCC--eeEEEEecCceec---cccc-cccCcccC Confidence 000000000000 00000000 1111222333332222 122 2333322222110 0000 0111 246 Q ss_pred ceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEecccceeecCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVWVVDKE 391 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~~lp~~ 391 (661) ++||+.|.-..+..+..+ ..-|-|+..||.-+ .+-++.+......|.++-+ |.... ..+.-|+++.+..... T Consensus 256 P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~---~~~l~~~~~~~~pp~lv~~~g~~~~--~~l~~~~~g~v~~g~~ 330 (555) T protein:vir:17 256 PWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALS---QAMVEGSAASAKVVFMVSPSATTKP--QNLALAANGAIIQGRP 330 (555) T ss_pred CeeeeeeeecCCCccccchHHHHHHHHHHHHHHH---HHHHHHHHHHhCCceeeccccccCc--ceeecCCCceeecCCc Confidence 778888876655555443 11244777777653 3335555555555555543 33222 2356666666543222 Q ss_pred CCcceEeecC-chhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHH-HHHHH---- Q lcl|NC_019406. 392 SGIPGIIEFK-GEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALE-DGMTS---- 465 (661) Q Consensus 392 ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le-~Al~~---- 465 (661) ..+.-++.. +..+....+.|+++++.+..+=. ++ ...++...||++...+...-...|..+-.++. +-+.- T Consensus 331 -~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm-~~-~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R 407 (555) T protein:vir:17 331 -DDVSVVQANKAADFRTVLEMIQKLEQRISDAFL-ML-QVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLAR 407 (555) T ss_pred -ccceeeeccccchhhHHHHHHHHHHHHHHHHHh-hc-CCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 334444422 34578889999999998875411 22 23445678999999999999999999888886 44433 Q ss_pred HHHHHHHHcC-CCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcC---CCCccCCHHHHHHH Q lcl|NC_019406. 466 VVRYWLMFRD-IPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNG---IIPSTQTLEEFTIK 541 (661) Q Consensus 466 aL~~~A~w~G-~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~g---vl~~~~~~Eee~~~ 541 (661) ++.++.+ .| ++....+-+.++|... +.++...-+ .-+-..|...+-... -+-+..++++..++ T Consensus 408 ~~~il~r-~g~lP~~p~~~v~~~i~~~----------l~~l~r~~~--~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~ 474 (555) T protein:vir:17 408 KLHLLQK-QRKLPQLPKDLVQPTVVAG----------LWGVGRGQD--KQQLMEFITTLAQTMGPEIAMKYINPTEFIKR 474 (555) T ss_pred HHHHHHh-CCCCCCCCHhhhccceeeh----------HHHHHHHHH--HHHHHHHHHHHHhhcCchhHhhcCCHHHHHHH Confidence 3333333 23 2221111122222221 111111111 011112333332111 12235677777777 Q ss_pred HhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhh Q lcl|NC_019406. 542 MNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQA 621 (661) Q Consensus 542 l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 621 (661) |.+- +|.|-+.... .++++.+. -+||..+.|..+...++.+..-+. +...+- +++.++++ T Consensus 475 ~a~~---~Gv~p~~ivr-----s~eev~~~-------rq~~~~~~~q~~~~~qa~~~~~~~----~~~~~~-~~~~~~~~ 534 (555) T protein:vir:17 475 LAAA---QGIDTLQLIN-----SPETMKQL-------GDQQKQDMVQASLINQAGQLAKTP----MAEQAM-QLIQQQQE 534 (555) T ss_pred HHHH---cCCChhhhcC-----CHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhhh----hhhhHH-hccccchh Confidence 7664 3332221111 01111111 122222222222111211111000 000111 11222221 Q ss_pred hhhhhhhhHHHHhhcccccCCCCCCCc Q lcl|NC_019406. 622 KPSKAEQAQIDAQQKQAAAKPVTPTPG 648 (661) Q Consensus 622 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 648 (661) . |.+. -+--|+++||-.-.|. T Consensus 535 ~---a~~~---~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 535 G---AQDA---GAAESETSSAEAQAGA 555 (555) T ss_pred h---hhHH---HHHHhhcCCcccccCC Confidence 1 1111 1122334443322222 No 101 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=97.14 E-value=0.00016 Score=41.44 Aligned_cols=465 Identities=12% Similarity=0.015 Sum_probs=186.2 Q ss_pred CCCC--CCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGL--SPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) |+.. .|..-=|. .+. +.+.-......+...-+.+.+.. +....-..|.+. +-+-..|+..+.-+ T Consensus 1 ~~~~i~~~~g~~~~---~~~-------~~~~~~~~ia~~~~~~~~~~~~~-~~p~~~~il~~~-~~~~~~y~~m~~D~-- 66 (491) T protein:vir:79 1 MSKGLWVSPTEFVK---FGE-------PDKSLSSQIATRARSIDFFALGM-YLPNPDPVLKAL-GKDIRVYRELRADA-- 66 (491) T ss_pred CCCeeeCCCCCccc---ccc-------cchhHHHHHhhhccccccccccc-cCcchhHHHhhc-cCCHHHHHHHhhCh-- Confidence 3321 01100011 110 11111111111221112211100 000000011111 12345677765444 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) ++...++.....|...+..|+ |+.- | .+.-+.+.+.++ +.+++.++.+++ .++-|| T Consensus 67 --~i~s~l~~Rk~av~~~~w~i~--~~~~-------~-------~~~a~~i~e~l~-----~~~~~~~i~~~l-da~~~G 122 (491) T protein:vir:79 67 --HVGGCVRRRKAAVKALEWGLD--RGKA-------K-------SRVAKSIADVFA-----DLDLSRIATEML-DAVLYG 122 (491) T ss_pred --HHHHHHHHHHHHHhCCCcEEe--cCCC-------C-------HHHHHHHHHHHh-----cCCHHHHHHHHH-Hhhhhc Confidence 444444444445555555553 2110 0 111234444443 446888888886 477788 Q ss_pred CEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 159 RFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 159 r~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) .+.+=+-|- .+ +|.-.+.-+..+- + T Consensus 123 ~s~~Ei~w~-------------------------~~--~g~~~~~~l~~r~---------------------~------- 147 (491) T protein:vir:79 123 YQPMEITWG-------------------------KV--GNYIVPIDVVGKP---------------------A------- 147 (491) T ss_pred ceeEEEEEe-------------------------ec--CCeeeEEeeeeec---------------------c------- Confidence 776554432 11 2221111111100 0 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) .++.| ..++..+++.. . + . ..|.+|+.== T Consensus 148 ------------------------~~f~~-------------d~~~~l~l~~~--~-~--~---------~~g~~lp~~k 176 (491) T protein:vir:79 148 ------------------------DWFVY-------------DPENQLRFRSK--E-H--W---------VQGEELPARK 176 (491) T ss_pred ------------------------cceee-------------ccCCceEEeec--C-C--C---------CCceeecCCC Confidence 00000 00111111110 0 0 0 0111121100 Q ss_pred EE-EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce-----eEecccceeecC Q lcl|NC_019406. 319 FV-FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE-----YHIGPGRVWVVD 389 (661) Q Consensus 319 fv-~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~-----l~iGs~~~~~lp 389 (661) |+ +.+..+.+...+.+.|..++..-+--=....+...-+..-+.|+++.. |.+++++.. ..||++++..+| T Consensus 177 ~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP 256 (491) T protein:vir:79 177 FLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAVIP 256 (491) T ss_pred eEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEEec Confidence 22 223233344455666666665444333345677778888899988875 334443332 358989999999 Q ss_pred CCCCcceEeecCch--hHHHHHHHHHHHHHHHH--HHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH Q lcl|NC_019406. 390 KESGIPGIIEFKGE--GLKTLERALNEKEQQIA--AIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTS 465 (661) Q Consensus 390 ~~ga~~~ylE~~g~--~i~a~~~~L~~le~qM~--~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~ 465 (661) . |..+.|+|..+. +....++-++...++|. .+|- .+... .++ |--..........-++...+..+++++++ T Consensus 257 ~-~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGq-tlTt~-~~g--s~a~~~vh~~v~~~i~~~D~~~i~~tln~ 331 (491) T protein:vir:79 257 D-DSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQ-NQTTE-ATS--TRASAQAGLEVTDDIRDGDKAIVVEAMNM 331 (491) T ss_pred C-CceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhh-hhccC-ccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 789999998653 34456666666666665 3443 44332 222 33344556667788899999999999999 Q ss_pred HHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCccCCHHHHHHHHhc Q lcl|NC_019406. 466 VVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGL-LPIDALYENFVKNGIIPSTQTLEEFTIKMND 544 (661) Q Consensus 466 aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~-Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~ 544 (661) .+++++.|.+-+ ...++|.+.. ....+......+..+...|. |+.+.++ .+-|+=.+... ++. +.. T Consensus 332 li~~l~~~N~~~---~~~p~f~~~e---~ee~~~~~a~~~~~L~~~G~~i~~~~~~---e~~Gip~~~~~-e~~---~~~ 398 (491) T protein:vir:79 332 LIRWICDLNFDG---AARPVFDMWE---QEQVDEIQAGRDEKLTRAGARFTPAYFK---RAYNLQDGDLD-ERP---LPV 398 (491) T ss_pred HHHHHHHhcCCC---CCcceEeecC---cCchhHHHHHHHHHHHhCCCccCHHHHH---HHhCCCCCCCC-ccc---cCc Confidence 999999999743 2234444322 22222223455667777776 6655443 34465222221 111 111 Q ss_pred cCCCCCCchhhhhhcCCccccCCCcchhhhhcC-C-------hhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcC Q lcl|NC_019406. 545 PKSFIGQPDAIAMRRGYVSRQQELDQQRAARDA-D-------FQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLG 616 (661) Q Consensus 545 ~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~-d-------~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 616 (661) +.+.-+...+.....+ ..+..++.-..+..+ + +-.+..+.-++..-+++-+.+|.+..+++. T Consensus 399 ~~~~~~~~~~~~~~~~--~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d-------- 468 (491) T protein:vir:79 399 SAVDAVGAASFAEFEA--PDQDALDAALNALSARDLNADAQALVAPLLKRIANGASADELLGMLAELYPSLD-------- 468 (491) T ss_pred CcccccccccccccCC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCC-------- Confidence 1111110000000000 000100000000000 0 001111111111112222222222211111 Q ss_pred ChhhhhhhhhhhhHHHHh--hccc Q lcl|NC_019406. 617 DPEQAKPSKAEQAQIDAQ--QKQA 638 (661) Q Consensus 617 ~~~~~~~~~~~~~~~~~~--~~~~ 638 (661) +.++...=+..--++.= +-.| T Consensus 469 -~~~l~~~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 469 -TDALQERLARAIFVANLWGRLHA 491 (491) T ss_pred -HHHHHHHHHHHHHHHHHhhhccC Confidence 11111111110001110 1111 No 102 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=97.11 E-value=0.00017 Score=41.28 Aligned_cols=498 Identities=13% Similarity=0.095 Sum_probs=195.0 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCCC---CCChHHHHHHHhhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAPK---GFDDEDYANYLDRA 76 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~~---~E~~~~Y~~rl~rA 76 (661) ||- ......-.....+|+.+++--.= ...|++...-.||..- +........ - T Consensus 1 m~~--------------------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~----~ 56 (536) T protein:vir:10 1 MAE--------------------KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQ----T 56 (536) T ss_pred Ccc--------------------hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccc----c Confidence 322 22222222333333332221000 2222332222233221 111111111 1 Q ss_pred cccchHHHHHHHHhchhhcc--Ccc-c-c-ccch-hhHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHH Q lcl|NC_019406. 77 AFYNMTSQTQAGMVGQIFRR--PPV-I-R-NLPN-TGAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQ 144 (661) Q Consensus 77 ~~~n~~~~tv~~l~G~vFrk--~p~-i-~-~~p~-~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~ 144 (661) .|-+.-.+.++.+...++.- |+. + . .+++ .+..+.. .+.....++.+|+.| -+.-++.+ T Consensus 57 ~~dst~~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~---------~~~~~~~v~~~L~~ve~~~~~~l~~snf~ 127 (536) T protein:vir:10 57 PWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLS---------DPDGLAKVDEGLSMVERIIMNYIESNSYR 127 (536) T ss_pred cccccHHHHHHHHHHHHHhhhcCCCcccccccChhhhhcccc---------chhhHHHHHHHHHHHHHHHHHHHHhcCcH Confidence 23344444444443333321 210 1 0 0000 0110000 011111122222221 12346677 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .-+-.++.+.+.+|-+.+++|-+... +.+ ++..|.-.+ +-+.. ++.+.+.-|..+++..... T Consensus 128 ~~~~~~~~~L~~~G~a~ly~~e~~~~----~~~-~~~~~pl~~---~~v~~-d~~G~vd~i~r~~~~t~~~--------- 189 (536) T protein:vir:10 128 VTLFEALKQLVVAGNVLLYLPEPEGS----NYN-PMKLYRLSS---YVVQR-DAFGNVLQMVTRDQIAFGA--------- 189 (536) T ss_pred HHHHHHHHHHHhHCcEeEEEeeCCCC----cee-eEEEEEcCe---EEEee-CCCCCeeEEeeeeeccHHH--------- Confidence 88888899999999999999855321 111 223333222 22221 1222233333333222100 Q ss_pred eeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccc Q lcl|NC_019406. 225 WIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDV 304 (661) Q Consensus 225 ~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~ 304 (661) -+-.|..... ...... ..+..-++++....+. .++ .|.|+...++.. T Consensus 190 --------l~~~fg~~~~-----------~~~~~~-----~~~~~v~v~~~V~~~~-~~~--~~~~~~e~~g~~------ 236 (536) T protein:vir:10 190 --------LPEDIRKAVE-----------GQGGEK-----KADETIDVYTHIYLDE-ASG--EYLRYEEVEGME------ 236 (536) T ss_pred --------HHHhhhhhhc-----------cccccc-----CcccceEEEEEEEEec-CCC--cEEEEEeecCcc------ Confidence 0011111100 000000 1112223444333322 122 244443333221 Q ss_pred eeeccCC----cccceeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCC Q lcl|NC_019406. 305 YTPMVRG----RTLPFIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDA 375 (661) Q Consensus 305 ~~p~~~g----~~L~~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~ 375 (661) ++...| ..+++||+.|.-..+..+.. .| |-|+..||.- +.+-+..+......|.++-+ |+.+. T Consensus 237 -v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l---~~~~l~~~~~a~~~~~lv~p~g~~~~-- 308 (536) T protein:vir:10 237 -VQGSDGTYPKEACPYIPIRMVRLDGESYGR--SYIEEYLGDLRSLENL---QEAIVKMSMISSKVIGLVNPAGITQP-- 308 (536) T ss_pred -ccccccccccccCCceeeeeeecCCCcccc--chHHHHHHHHHHHHHH---HHHHHHHHHHHhcCCcccCcccccch-- Confidence 112222 24677777777665555544 34 4477777754 34445666666677766654 33221 Q ss_pred ceeEecccceeecCCCCCcceEeec-CchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHH Q lcl|NC_019406. 376 SEYHIGPGRVWVVDKESGIPGIIEF-KGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLL 453 (661) Q Consensus 376 ~~l~iGs~~~~~lp~~ga~~~ylE~-~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~ 453 (661) ..+.=|....+. |...++.+.++. .+..+....+.|+++++.+... =+.++.. ..+...||++...+...-...|. T Consensus 309 ~~~~~~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~-~~~~r~TAtEV~~r~~E~~~~LG 386 (536) T protein:vir:10 309 RRLTKAQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQ-RTGERVTAEEIRYVASELEDTLG 386 (536) T ss_pred hhhccCCCccee-cCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhccc-CCCCCccHHHHHHHHHHHHHHhh Confidence 122223333333 222234444432 3566888899999999998742 1222211 22345799999999999999998 Q ss_pred HHHHHHHHHHHH-HHHHHHHHc---CC-CCCCcceEEEEeccccccccCCH----HHHHHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_019406. 454 NVIMALEDGMTS-VVRYWLMFR---DI-PLTDTATLRYEIDATFLTTALDA----RALRAIQQLYEGGLLPIDALYENFV 524 (661) Q Consensus 454 ~~A~~le~Al~~-aL~~~A~w~---G~-~~~~~~~~~v~ln~DF~~~~lda----~~l~all~~~~aG~Is~et~~~eL~ 524 (661) .+-..+++=+-. ++.++-..+ |+ +....+-+++ +|. ..+.+ +++..++. |...|. T Consensus 387 ~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~----~~v-s~l~~l~r~~~~~~l~~-----------~~~~la 450 (536) T protein:vir:10 387 GVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEP----TIS-TGLEAIGRGQDLDKLER-----------CVTAWA 450 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccc----eEE-ecHHHHHHHHHHHHHHH-----------HHHHHH Confidence 888776654433 443333333 21 1111222222 332 22221 12222222 233332 Q ss_pred hc--CCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhh Q lcl|NC_019406. 525 KN--GIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISA 602 (661) Q Consensus 525 r~--gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~ 602 (661) .- .+++...++++..+.+.+. +|.+-+..+.. +++..|- .+|+..++|..+.|.+.++.- T Consensus 451 ~~~P~~ld~~id~d~~~~~~a~~---~Gv~p~~~irt-----~eev~~~-------r~q~~~~~~~~~~a~~~~~~~--- 512 (536) T protein:vir:10 451 ALAPMRDDPDINLAMIKLRIANA---IGIDTSGILLT-----EEQKQQK-------MAQQSMQMGMDNGAAALAQGM--- 512 (536) T ss_pred hhchhhhcccCCHHHHHHHHHHH---cCCCchhhcCC-----HHHHHHH-------HHHHHHHHHHHHHHHHHHHHH--- Confidence 22 2344557888888888764 33322222221 1111111 122222222222222211111 Q ss_pred hhhhhhhhHHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCcc Q lcl|NC_019406. 603 KVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGT 649 (661) Q Consensus 603 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 649 (661) +-++.+--|.-++++.-+---||. T Consensus 513 -----------------------~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 513 -----------------------AAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred -----------------------HHHHhcCchhHHhhhhccccCCCC Confidence 000111111111222122222332 No 103 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=590 Identities=13% Similarity=0.070 Sum_probs=202.9 Q ss_pred CCCCCCccccccccccccccCCc----cccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V----~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~ 71 (661) |. ..+..|.- ..+...+...+..|..- +.....+|+. ...|. =+|+.+....-+. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:99 1 MK-------------NETNTMATKNDNGATPRFSQRQLQALCSD---IDSQPKWRDAANKACAYYDGDQLPPEVLQVLKD 64 (714) T ss_pred CC-------------cccccccCCCCcchhHHHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 22 11111110 11112233332222222 2223333310 01111 1566555555555 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-.-.+.+|.++++|+..+|.-=+..+.+.-.|..- |+.-...+-.-...|..+++ -++.+.-...+| T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~-------~~~~~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af 132 (714) T protein:vir:99 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP-------DDETEKLAEAINAEFADACR-----LGNMNKARSDAY 132 (714) T ss_pred cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCC-------CchhHHHHHHHHHHHHHHHH-----hhchhHHHHHHH Confidence 555688899999999999999999888886555311 11000011111223333332 346777888999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceee--ccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVE--DVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~--~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ..++.+|.+|+=|-+- ++ ..+..+++..+.|.+|+ |... ..+. ..-.|+..+.+...++- ....|--..+ T Consensus 133 ~~~~~~G~G~~~~~~~--~d-~~~~~i~i~~v~p~~v~-~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fP~~a~~ 204 (714) T protein:vir:99 133 AEQIKAGLSWVEVRRN--SD-PFGPEFKVSTVSRNEVF-WDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQV 204 (714) T ss_pred HHhhhcCcceEEeccc--cC-CCCCCeEEEecchhhee-eccccccCCh-hhccceeeeecCCHHHH---HHhcCCchhh Confidence 9999999888443221 11 22345667777777753 3211 1110 11123333332221111 0011110000 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec---ccccCCCc-------------eeeEEEEEEEeecccccceEE----- Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR---PSRFTSSY-------------TFRTIYRELILELQKDGSRVY----- 288 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~-------------~~~~~~rv~~l~~g~~g~~~~----- 288 (661) ....+..|+.-.-..........+....+. .+...+.| -++...+..++.+ .+|..++ T Consensus 205 i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~ 283 (714) T protein:vir:99 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNN 283 (714) T ss_pred hhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeecc-CCCceEEeCccC Confidence 001111111100000000000000000000 00000000 0111111112221 1111110 Q ss_pred ----------------------EEEEEecCcccccccceeeccCCcccceeeEEEEecCC---CCCCcc-ccchhHH-HH Q lcl|NC_019406. 289 ----------------------KQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS---NAADCE-KPPLLDI-VE 341 (661) Q Consensus 289 ----------------------~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~---~~~~~~-~pPLldL-A~ 341 (661) ++..|.... -..+...| . +-+.+|||++.... .+...+ .-.+.|. -. T Consensus 284 ~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~--~L~~~~~p-~---p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~ 357 (714) T protein:vir:99 284 LMQAVAVASGRVQVKVGRVSRIREAWFVGPH--FIVDRPCS-A---PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDE 357 (714) T ss_pred HHHHHHHhhcchhhhccccceEEEEEEecCc--ccccCCCC-C---CCCceeEEEEeeeeeeccCceeehhhhchhHHHH Confidence 111111110 00000011 1 11345555443221 111111 0112222 22 Q ss_pred HHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-ccceeec-CC--CC----CcceEeecCchhHHHHHHH Q lcl|NC_019406. 342 LNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVWVV-DK--ES----GIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 342 LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~~l-p~--~g----a~~~ylE~~g~~i~a~~~~ 411 (661) +| .+++.. .++|. +.-+++..|..+.+.+.+ . +. +++.+.+ |. .| ..+... +...-.....+. T Consensus 358 ~N--~~~s~~--~~~l~--~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~-~~~~~~~~~~~l 430 (714) T protein:vir:99 358 VN--FRRIKL--TWLLQ--AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQV 430 (714) T ss_pred HH--HHHHHH--HHhhc--CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCcccccc-CCCCccHHHHHH Confidence 33 344443 34442 222233334322221111 0 00 1122221 21 11 123322 222223444555 Q ss_pred HHHHHHHHHHH-hH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------ Q lcl|NC_019406. 412 LNEKEQQIAAI-GG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------ 478 (661) Q Consensus 412 L~~le~qM~~l-GA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------ 478 (661) |+...+.|..+ |. .++ +..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... T Consensus 431 lq~~~~~i~~~tGv~~~~l--G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~ 508 (714) T protein:vir:99 431 MQESEKLIQDTMGVYSAFL--GQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVI 508 (714) T ss_pred HHHHHHHHHHhhCCChHHc--CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 55555555433 32 233 2233456888888888887788888888887777665 557777775321 Q ss_pred ---CCcceEEEEecc----------------ccccccC------CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCcc Q lcl|NC_019406. 479 ---TDTATLRYEIDA----------------TFLTTAL------DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPST 532 (661) Q Consensus 479 ---~~~~~~~v~ln~----------------DF~~~~l------da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~~ 532 (661) .....-.|.||+ |+..... ..+.+.+|+++++. ++.+....-+.- -. +.+- T Consensus 509 e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~-~~d~ 585 (714) T protein:vir:99 509 NRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVN-LLDV 585 (714) T ss_pred cCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHH-hcCC Confidence 111111233331 1111111 13456666777653 222211100000 00 1111 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHH--HHhccCCCchhHHHhhhhhh---h Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQ--AERHLEIDEEKLRISAKVGS---T 607 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~--~e~~~~~~~~~~~~~~~~~~---~ 607 (661) -..+++.++|.+..+.-+. .+.+ .++ +|.+.+...-++++..++ ++.++.++..++++...-++ . T Consensus 586 p~~~el~~~ir~~~~~~~~--~~~~------~~e--~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:99 586 PQKQEFVERIRAALGTPKS--PDEM------TPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred CCHHHHHHHHHHHcCCCCC--cccc------chh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346778888765432211 1000 011 111111111122222222 22333222233222221111 1 Q ss_pred hhhHHHhcCCh----hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC-----CCccCC-C Q lcl|NC_019406. 608 SVAASRKLGDP----EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR-----PPQNGA-S 661 (661) Q Consensus 608 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~ 661 (661) ...++..++.. ......+++.+++. +..+..+. .+-|+.-. +++.-+ | T Consensus 656 ~~~a~~~~~~~~~~~~~~~~~~a~~a~~~-~~~~~~~~-----~~~~~~~q~~q~~~~~~~~~~ 713 (714) T protein:vir:99 656 NASAQREVALTQGQRYVDALNQAHTAEII-TGVQNMEQ-----EQDVLQQQMLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhHhhhhh-----hhHHHHHHHHHHHHHHHHhcC Confidence 11221111111 11222222222221 00111111 11111100 001000 0 No 104 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=590 Identities=13% Similarity=0.070 Sum_probs=202.9 Q ss_pred CCCCCCccccccccccccccCCc----cccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V----~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~ 71 (661) |. ..+..|.- ..+...+...+..|..- +.....+|+. ...|. =+|+.+....-+. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:10 1 MK-------------NETNTMATKNDNGATPRFSQRQLQALCSD---IDSQPKWRDAANKACAYYDGDQLPPEVLQVLKD 64 (714) T ss_pred CC-------------cccccccCCCCcchhHHHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 22 11111110 11112233332222222 2223333310 01111 1566555555555 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-.-.+.+|.++++|+..+|.-=+..+.+.-.|..- |+.-...+-.-...|..+++ -++.+.-...+| T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~-------~~~~~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af 132 (714) T protein:vir:10 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP-------DDETEKLAEAINAEFADACR-----LGNMNKARSDAY 132 (714) T ss_pred cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCC-------CchhHHHHHHHHHHHHHHHH-----hhchhHHHHHHH Confidence 555688899999999999999999888886555311 11000011111223333332 346777888999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceee--ccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVE--DVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~--~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ..++.+|.+|+=|-+- ++ ..+..+++..+.|.+|+ |... ..+. ..-.|+..+.+...++- ....|--..+ T Consensus 133 ~~~~~~G~G~~~~~~~--~d-~~~~~i~i~~v~p~~v~-~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fP~~a~~ 204 (714) T protein:vir:10 133 AEQIKAGLSWVEVRRN--SD-PFGPEFKVSTVSRNEVF-WDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQV 204 (714) T ss_pred HHhhhcCcceEEeccc--cC-CCCCCeEEEecchhhee-eccccccCCh-hhccceeeeecCCHHHH---HHhcCCchhh Confidence 9999999888443221 11 22345667777777753 3211 1110 11123333332221111 0011110000 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec---ccccCCCc-------------eeeEEEEEEEeecccccceEE----- Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR---PSRFTSSY-------------TFRTIYRELILELQKDGSRVY----- 288 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~-------------~~~~~~rv~~l~~g~~g~~~~----- 288 (661) ....+..|+.-.-..........+....+. .+...+.| -++...+..++.+ .+|..++ T Consensus 205 i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~ 283 (714) T protein:vir:10 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNN 283 (714) T ss_pred hhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeecc-CCCceEEeCccC Confidence 001111111100000000000000000000 00000000 0111111112221 1111110 Q ss_pred ----------------------EEEEEecCcccccccceeeccCCcccceeeEEEEecCC---CCCCcc-ccchhHH-HH Q lcl|NC_019406. 289 ----------------------KQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS---NAADCE-KPPLLDI-VE 341 (661) Q Consensus 289 ----------------------~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~---~~~~~~-~pPLldL-A~ 341 (661) ++..|.... -..+...| . +-+.+|||++.... .+...+ .-.+.|. -. T Consensus 284 ~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~--~L~~~~~p-~---p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~ 357 (714) T protein:vir:10 284 LMQAVAVASGRVQVKVGRVSRIREAWFVGPH--FIVDRPCS-A---PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDE 357 (714) T ss_pred HHHHHHHhhcchhhhccccceEEEEEEecCc--ccccCCCC-C---CCCceeEEEEeeeeeeccCceeehhhhchhHHHH Confidence 111111110 00000011 1 11345555443221 111111 0112222 22 Q ss_pred HHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-ccceeec-CC--CC----CcceEeecCchhHHHHHHH Q lcl|NC_019406. 342 LNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVWVV-DK--ES----GIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 342 LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~~l-p~--~g----a~~~ylE~~g~~i~a~~~~ 411 (661) +| .+++.. .++|. +.-+++..|..+.+.+.+ . +. +++.+.+ |. .| ..+... +...-.....+. T Consensus 358 ~N--~~~s~~--~~~l~--~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~-~~~~~~~~~~~l 430 (714) T protein:vir:10 358 VN--FRRIKL--TWLLQ--AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQV 430 (714) T ss_pred HH--HHHHHH--HHhhc--CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCcccccc-CCCCccHHHHHH Confidence 33 344443 34442 222233334322221111 0 00 1122221 21 11 123322 222223444555 Q ss_pred HHHHHHHHHHH-hH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------ Q lcl|NC_019406. 412 LNEKEQQIAAI-GG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------ 478 (661) Q Consensus 412 L~~le~qM~~l-GA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------ 478 (661) |+...+.|..+ |. .++ +..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... T Consensus 431 lq~~~~~i~~~tGv~~~~l--G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~ 508 (714) T protein:vir:10 431 MQESEKLIQDTMGVYSAFL--GQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVI 508 (714) T ss_pred HHHHHHHHHHhhCCChHHc--CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 55555555433 32 233 2233456888888888887788888888887777665 557777775321 Q ss_pred ---CCcceEEEEecc----------------ccccccC------CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCcc Q lcl|NC_019406. 479 ---TDTATLRYEIDA----------------TFLTTAL------DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPST 532 (661) Q Consensus 479 ---~~~~~~~v~ln~----------------DF~~~~l------da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~~ 532 (661) .....-.|.||+ |+..... ..+.+.+|+++++. ++.+....-+.- -. +.+- T Consensus 509 e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~-~~d~ 585 (714) T protein:vir:10 509 NRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVN-LLDV 585 (714) T ss_pred cCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHH-hcCC Confidence 111111233331 1111111 13456666777653 222211100000 00 1111 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHH--HHhccCCCchhHHHhhhhhh---h Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQ--AERHLEIDEEKLRISAKVGS---T 607 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~--~e~~~~~~~~~~~~~~~~~~---~ 607 (661) -..+++.++|.+..+.-+. .+.+ .++ +|.+.+...-++++..++ ++.++.++..++++...-++ . T Consensus 586 p~~~el~~~ir~~~~~~~~--~~~~------~~e--~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:10 586 PQKQEFVERIRAALGTPKS--PDEM------TPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred CCHHHHHHHHHHHcCCCCC--cccc------chh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346778888765432211 1000 011 111111111122222222 22333222233222221111 1 Q ss_pred hhhHHHhcCCh----hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC-----CCccCC-C Q lcl|NC_019406. 608 SVAASRKLGDP----EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR-----PPQNGA-S 661 (661) Q Consensus 608 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~ 661 (661) ...++..++.. ......+++.+++. +..+..+. .+-|+.-. +++.-+ | T Consensus 656 ~~~a~~~~~~~~~~~~~~~~~~a~~a~~~-~~~~~~~~-----~~~~~~~q~~q~~~~~~~~~~ 713 (714) T protein:vir:10 656 NASAQREVALTQGQRYVDALNQAHTAEII-TGVQNMEQ-----EQDVLQQQMLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhHhhhhh-----hhHHHHHHHHHHHHHHHHhcC Confidence 11221111111 11222222222221 00111111 11111100 001000 0 No 105 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=590 Identities=13% Similarity=0.070 Sum_probs=202.9 Q ss_pred CCCCCCccccccccccccccCCc----cccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V----~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~ 71 (661) |. ..+..|.- ..+...+...+..|..- +.....+|+. ...|. =+|+.+....-+. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:81 1 MK-------------NETNTMATKNDNGATPRFSQRQLQALCSD---IDSQPKWRDAANKACAYYDGDQLPPEVLQVLKD 64 (714) T ss_pred CC-------------cccccccCCCCcchhHHHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 22 11111110 11112233332222222 2223333310 01111 1566555555555 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-.-.+.+|.++++|+..+|.-=+..+.+.-.|..- |+.-...+-.-...|..+++ -++.+.-...+| T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~-------~~~~~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af 132 (714) T protein:vir:81 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP-------DDETEKLAEAINAEFADACR-----LGNMNKARSDAY 132 (714) T ss_pred cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCC-------CchhHHHHHHHHHHHHHHHH-----hhchhHHHHHHH Confidence 555688899999999999999999888886555311 11000011111223333332 346777888999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceee--ccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVE--DVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~--~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ..++.+|.+|+=|-+- ++ ..+..+++..+.|.+|+ |... ..+. ..-.|+..+.+...++- ....|--..+ T Consensus 133 ~~~~~~G~G~~~~~~~--~d-~~~~~i~i~~v~p~~v~-~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fP~~a~~ 204 (714) T protein:vir:81 133 AEQIKAGLSWVEVRRN--SD-PFGPEFKVSTVSRNEVF-WDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQV 204 (714) T ss_pred HHhhhcCcceEEeccc--cC-CCCCCeEEEecchhhee-eccccccCCh-hhccceeeeecCCHHHH---HHhcCCchhh Confidence 9999999888443221 11 22345667777777753 3211 1110 11123333332221111 0011110000 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec---ccccCCCc-------------eeeEEEEEEEeecccccceEE----- Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR---PSRFTSSY-------------TFRTIYRELILELQKDGSRVY----- 288 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~-------------~~~~~~rv~~l~~g~~g~~~~----- 288 (661) ....+..|+.-.-..........+....+. .+...+.| -++...+..++.+ .+|..++ T Consensus 205 i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~ 283 (714) T protein:vir:81 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNN 283 (714) T ss_pred hhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeecc-CCCceEEeCccC Confidence 001111111100000000000000000000 00000000 0111111112221 1111110 Q ss_pred ----------------------EEEEEecCcccccccceeeccCCcccceeeEEEEecCC---CCCCcc-ccchhHH-HH Q lcl|NC_019406. 289 ----------------------KQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS---NAADCE-KPPLLDI-VE 341 (661) Q Consensus 289 ----------------------~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~---~~~~~~-~pPLldL-A~ 341 (661) ++..|.... -..+...| . +-+.+|||++.... .+...+ .-.+.|. -. T Consensus 284 ~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~--~L~~~~~p-~---p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~ 357 (714) T protein:vir:81 284 LMQAVAVASGRVQVKVGRVSRIREAWFVGPH--FIVDRPCS-A---PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDE 357 (714) T ss_pred HHHHHHHhhcchhhhccccceEEEEEEecCc--ccccCCCC-C---CCCceeEEEEeeeeeeccCceeehhhhchhHHHH Confidence 111111110 00000011 1 11345555443221 111111 0112222 22 Q ss_pred HHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-ccceeec-CC--CC----CcceEeecCchhHHHHHHH Q lcl|NC_019406. 342 LNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVWVV-DK--ES----GIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 342 LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~~l-p~--~g----a~~~ylE~~g~~i~a~~~~ 411 (661) +| .+++.. .++|. +.-+++..|..+.+.+.+ . +. +++.+.+ |. .| ..+... +...-.....+. T Consensus 358 ~N--~~~s~~--~~~l~--~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~-~~~~~~~~~~~l 430 (714) T protein:vir:81 358 VN--FRRIKL--TWLLQ--AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQV 430 (714) T ss_pred HH--HHHHHH--HHhhc--CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCcccccc-CCCCccHHHHHH Confidence 33 344443 34442 222233334322221111 0 00 1122221 21 11 123322 222223444555 Q ss_pred HHHHHHHHHHH-hH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------ Q lcl|NC_019406. 412 LNEKEQQIAAI-GG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------ 478 (661) Q Consensus 412 L~~le~qM~~l-GA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------ 478 (661) |+...+.|..+ |. .++ +..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... T Consensus 431 lq~~~~~i~~~tGv~~~~l--G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~ 508 (714) T protein:vir:81 431 MQESEKLIQDTMGVYSAFL--GQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVI 508 (714) T ss_pred HHHHHHHHHHhhCCChHHc--CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 55555555433 32 233 2233456888888888887788888888887777665 557777775321 Q ss_pred ---CCcceEEEEecc----------------ccccccC------CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCcc Q lcl|NC_019406. 479 ---TDTATLRYEIDA----------------TFLTTAL------DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPST 532 (661) Q Consensus 479 ---~~~~~~~v~ln~----------------DF~~~~l------da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~~ 532 (661) .....-.|.||+ |+..... ..+.+.+|+++++. ++.+....-+.- -. +.+- T Consensus 509 e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~-~~d~ 585 (714) T protein:vir:81 509 NRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVN-LLDV 585 (714) T ss_pred cCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHH-hcCC Confidence 111111233331 1111111 13456666777653 222211100000 00 1111 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHH--HHhccCCCchhHHHhhhhhh---h Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQ--AERHLEIDEEKLRISAKVGS---T 607 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~--~e~~~~~~~~~~~~~~~~~~---~ 607 (661) -..+++.++|.+..+.-+. .+.+ .++ +|.+.+...-++++..++ ++.++.++..++++...-++ . T Consensus 586 p~~~el~~~ir~~~~~~~~--~~~~------~~e--~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:81 586 PQKQEFVERIRAALGTPKS--PDEM------TPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred CCHHHHHHHHHHHcCCCCC--cccc------chh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346778888765432211 1000 011 111111111122222222 22333222233222221111 1 Q ss_pred hhhHHHhcCCh----hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC-----CCccCC-C Q lcl|NC_019406. 608 SVAASRKLGDP----EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR-----PPQNGA-S 661 (661) Q Consensus 608 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~ 661 (661) ...++..++.. ......+++.+++. +..+..+. .+-|+.-. +++.-+ | T Consensus 656 ~~~a~~~~~~~~~~~~~~~~~~a~~a~~~-~~~~~~~~-----~~~~~~~q~~q~~~~~~~~~~ 713 (714) T protein:vir:81 656 NASAQREVALTQGQRYVDALNQAHTAEII-TGVQNMEQ-----EQDVLQQQMLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhHhhhhh-----hhHHHHHHHHHHHHHHHHhcC Confidence 11221111111 11222222222221 00111111 11111100 001000 0 No 106 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=590 Identities=13% Similarity=0.070 Sum_probs=202.9 Q ss_pred CCCCCCccccccccccccccCCc----cccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V----~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~ 71 (661) |. ..+..|.- ..+...+...+..|..- +.....+|+. ...|. =+|+.+....-+. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:27 1 MK-------------NETNTMATKNDNGATPRFSQRQLQALCSD---IDSQPKWRDAANKACAYYDGDQLPPEVLQVLKD 64 (714) T ss_pred CC-------------cccccccCCCCcchhHHHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 22 11111110 11112233332222222 2223333310 01111 1566555555555 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-.-.+.+|.++++|+..+|.-=+..+.+.-.|..- |+.-...+-.-...|..+++ -++.+.-...+| T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~-------~~~~~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af 132 (714) T protein:vir:27 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP-------DDETEKLAEAINAEFADACR-----LGNMNKARSDAY 132 (714) T ss_pred cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCC-------CchhHHHHHHHHHHHHHHHH-----hhchhHHHHHHH Confidence 555688899999999999999999888886555311 11000011111223333332 346777888999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceee--ccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVE--DVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~--~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ..++.+|.+|+=|-+- ++ ..+..+++..+.|.+|+ |... ..+. ..-.|+..+.+...++- ....|--..+ T Consensus 133 ~~~~~~G~G~~~~~~~--~d-~~~~~i~i~~v~p~~v~-~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fP~~a~~ 204 (714) T protein:vir:27 133 AEQIKAGLSWVEVRRN--SD-PFGPEFKVSTVSRNEVF-WDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQV 204 (714) T ss_pred HHhhhcCcceEEeccc--cC-CCCCCeEEEecchhhee-eccccccCCh-hhccceeeeecCCHHHH---HHhcCCchhh Confidence 9999999888443221 11 22345667777777753 3211 1110 11123333332221111 0011110000 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec---ccccCCCc-------------eeeEEEEEEEeecccccceEE----- Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR---PSRFTSSY-------------TFRTIYRELILELQKDGSRVY----- 288 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~-------------~~~~~~rv~~l~~g~~g~~~~----- 288 (661) ....+..|+.-.-..........+....+. .+...+.| -++...+..++.+ .+|..++ T Consensus 205 i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~ 283 (714) T protein:vir:27 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNN 283 (714) T ss_pred hhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeecc-CCCceEEeCccC Confidence 001111111100000000000000000000 00000000 0111111112221 1111110 Q ss_pred ----------------------EEEEEecCcccccccceeeccCCcccceeeEEEEecCC---CCCCcc-ccchhHH-HH Q lcl|NC_019406. 289 ----------------------KQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS---NAADCE-KPPLLDI-VE 341 (661) Q Consensus 289 ----------------------~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~---~~~~~~-~pPLldL-A~ 341 (661) ++..|.... -..+...| . +-+.+|||++.... .+...+ .-.+.|. -. T Consensus 284 ~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~--~L~~~~~p-~---p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~ 357 (714) T protein:vir:27 284 LMQAVAVASGRVQVKVGRVSRIREAWFVGPH--FIVDRPCS-A---PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDE 357 (714) T ss_pred HHHHHHHhhcchhhhccccceEEEEEEecCc--ccccCCCC-C---CCCceeEEEEeeeeeeccCceeehhhhchhHHHH Confidence 111111110 00000011 1 11345555443221 111111 0112222 22 Q ss_pred HHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-ccceeec-CC--CC----CcceEeecCchhHHHHHHH Q lcl|NC_019406. 342 LNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVWVV-DK--ES----GIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 342 LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~~l-p~--~g----a~~~ylE~~g~~i~a~~~~ 411 (661) +| .+++.. .++|. +.-+++..|..+.+.+.+ . +. +++.+.+ |. .| ..+... +...-.....+. T Consensus 358 ~N--~~~s~~--~~~l~--~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~-~~~~~~~~~~~l 430 (714) T protein:vir:27 358 VN--FRRIKL--TWLLQ--AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQV 430 (714) T ss_pred HH--HHHHHH--HHhhc--CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCcccccc-CCCCccHHHHHH Confidence 33 344443 34442 222233334322221111 0 00 1122221 21 11 123322 222223444555 Q ss_pred HHHHHHHHHHH-hH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------ Q lcl|NC_019406. 412 LNEKEQQIAAI-GG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------ 478 (661) Q Consensus 412 L~~le~qM~~l-GA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------ 478 (661) |+...+.|..+ |. .++ +..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... T Consensus 431 lq~~~~~i~~~tGv~~~~l--G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~ 508 (714) T protein:vir:27 431 MQESEKLIQDTMGVYSAFL--GQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVI 508 (714) T ss_pred HHHHHHHHHHhhCCChHHc--CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 55555555433 32 233 2233456888888888887788888888887777665 557777775321 Q ss_pred ---CCcceEEEEecc----------------ccccccC------CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCcc Q lcl|NC_019406. 479 ---TDTATLRYEIDA----------------TFLTTAL------DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPST 532 (661) Q Consensus 479 ---~~~~~~~v~ln~----------------DF~~~~l------da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~~ 532 (661) .....-.|.||+ |+..... ..+.+.+|+++++. ++.+....-+.- -. +.+- T Consensus 509 e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~-~~d~ 585 (714) T protein:vir:27 509 NRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVN-LLDV 585 (714) T ss_pred cCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHH-hcCC Confidence 111111233331 1111111 13456666777653 222211100000 00 1111 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHH--HHhccCCCchhHHHhhhhhh---h Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQ--AERHLEIDEEKLRISAKVGS---T 607 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~--~e~~~~~~~~~~~~~~~~~~---~ 607 (661) -..+++.++|.+..+.-+. .+.+ .++ +|.+.+...-++++..++ ++.++.++..++++...-++ . T Consensus 586 p~~~el~~~ir~~~~~~~~--~~~~------~~e--~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:27 586 PQKQEFVERIRAALGTPKS--PDEM------TPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred CCHHHHHHHHHHHcCCCCC--cccc------chh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346778888765432211 1000 011 111111111122222222 22333222233222221111 1 Q ss_pred hhhHHHhcCCh----hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC-----CCccCC-C Q lcl|NC_019406. 608 SVAASRKLGDP----EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR-----PPQNGA-S 661 (661) Q Consensus 608 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~ 661 (661) ...++..++.. ......+++.+++. +..+..+. .+-|+.-. +++.-+ | T Consensus 656 ~~~a~~~~~~~~~~~~~~~~~~a~~a~~~-~~~~~~~~-----~~~~~~~q~~q~~~~~~~~~~ 713 (714) T protein:vir:27 656 NASAQREVALTQGQRYVDALNQAHTAEII-TGVQNMEQ-----EQDVLQQQMLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhHhhhhh-----hhHHHHHHHHHHHHHHHHhcC Confidence 11221111111 11222222222221 00111111 11111100 001000 0 No 107 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=590 Identities=13% Similarity=0.070 Sum_probs=202.9 Q ss_pred CCCCCCccccccccccccccCCc----cccCHHHHHHHHHHHHHHHHhcchHHHHhC---CcccC--CCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH----LVVHPEYEYYRPDWAKIRDAIAGEREIKAQ---GVKYL--KAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V----~~~hPey~a~~~~W~~irD~~~G~~~vr~~---g~~YL--Pk~~~E~~~~Y~~ 71 (661) |. ..+..|.- ..+...+...+..|..- +.....+|+. ...|. =+|+.+....-+. T Consensus 1 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~ 64 (714) T protein:vir:32 1 MK-------------NETNTMATKNDNGATPRFSQRQLQALCSD---IDSQPKWRDAANKACAYYDGDQLPPEVLQVLKD 64 (714) T ss_pred CC-------------cccccccCCCCcchhHHHHHHHHHHHHHH---HHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh Confidence 22 11111110 11112233332222222 2223333310 01111 1566555555555 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) +-.-.+.+|.++++|+..+|.-=+..+.+.-.|..- |+.-...+-.-...|..+++ -++.+.-...+| T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~-------~~~~~~~Ae~l~~~~~~~~~-----~~~~~~~~s~af 132 (714) T protein:vir:32 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEP-------DDETEKLAEAINAEFADACR-----LGNMNKARSDAY 132 (714) T ss_pred cCCCcEEeccHHHHHHHHHhHHHhCCcceEEecCCC-------CchhHHHHHHHHHHHHHHHH-----hhchhHHHHHHH Confidence 555688899999999999999999888886555311 11000011111223333332 346777888999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceee--ccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVE--DVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~--~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ..++.+|.+|+=|-+- ++ ..+..+++..+.|.+|+ |... ..+. ..-.|+..+.+...++- ....|--..+ T Consensus 133 ~~~~~~G~G~~~~~~~--~d-~~~~~i~i~~v~p~~v~-~Dp~a~~~D~-sDar~~~~~~~~~~~~~---~~~fP~~a~~ 204 (714) T protein:vir:32 133 AEQIKAGLSWVEVRRN--SD-PFGPEFKVSTVSRNEVF-WDWLSREADL-SDCRWLMRRRWMDTDEA---KATFPGMAQV 204 (714) T ss_pred HHhhhcCcceEEeccc--cC-CCCCCeEEEecchhhee-eccccccCCh-hhccceeeeecCCHHHH---HHhcCCchhh Confidence 9999999888443221 11 22345667777777753 3211 1110 11123333332221111 0011110000 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheec---ccccCCCc-------------eeeEEEEEEEeecccccceEE----- Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALAR---PSRFTSSY-------------TFRTIYRELILELQKDGSRVY----- 288 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~-------------~~~~~~rv~~l~~g~~g~~~~----- 288 (661) ....+..|+.-.-..........+....+. .+...+.| -++...+..++.+ .+|..++ T Consensus 205 i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~-~~g~~~~~d~~~ 283 (714) T protein:vir:32 205 IDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIEL-SNGRVVAFDKNN 283 (714) T ss_pred hhhhhhhhccccccccccccccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeecc-CCCceEEeCccC Confidence 001111111100000000000000000000 00000000 0111111112221 1111110 Q ss_pred ----------------------EEEEEecCcccccccceeeccCCcccceeeEEEEecCC---CCCCcc-ccchhHH-HH Q lcl|NC_019406. 289 ----------------------KQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMS---NAADCE-KPPLLDI-VE 341 (661) Q Consensus 289 ----------------------~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~---~~~~~~-~pPLldL-A~ 341 (661) ++..|.... -..+...| . +-+.+|||++.... .+...+ .-.+.|. -. T Consensus 284 ~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~--~L~~~~~p-~---p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~ 357 (714) T protein:vir:32 284 LMQAVAVASGRVQVKVGRVSRIREAWFVGPH--FIVDRPCS-A---PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDE 357 (714) T ss_pred HHHHHHHhhcchhhhccccceEEEEEEecCc--ccccCCCC-C---CCCceeEEEEeeeeeeccCceeehhhhchhHHHH Confidence 111111110 00000011 1 11345555443221 111111 0112222 22 Q ss_pred HHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCcee-E-ec-ccceeec-CC--CC----CcceEeecCchhHHHHHHH Q lcl|NC_019406. 342 LNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEY-H-IG-PGRVWVV-DK--ES----GIPGIIEFKGEGLKTLERA 411 (661) Q Consensus 342 LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l-~-iG-s~~~~~l-p~--~g----a~~~ylE~~g~~i~a~~~~ 411 (661) +| .+++.. .++|. +.-+++..|..+.+.+.+ . +. +++.+.+ |. .| ..+... +...-.....+. T Consensus 358 ~N--~~~s~~--~~~l~--~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~-~~~~~~~~~~~l 430 (714) T protein:vir:32 358 VN--FRRIKL--TWLLQ--AKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQV 430 (714) T ss_pred HH--HHHHHH--HHhhc--CCceeeecCcccccHHHHHHhccCCCCceeecccccccCCCCcccccc-CCCCccHHHHHH Confidence 33 344443 34442 222233334322221111 0 00 1122221 21 11 123322 222223444555 Q ss_pred HHHHHHHHHHH-hH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------ Q lcl|NC_019406. 412 LNEKEQQIAAI-GG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------ 478 (661) Q Consensus 412 L~~le~qM~~l-GA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------ 478 (661) |+...+.|..+ |. .++ +..+.+.|+.+...+..+..-.|+.+-.|+..+...+ |.++..|++... T Consensus 431 lq~~~~~i~~~tGv~~~~l--G~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~ 508 (714) T protein:vir:32 431 MQESEKLIQDTMGVYSAFL--GQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVI 508 (714) T ss_pred HHHHHHHHHHhhCCChHHc--CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEec Confidence 55555555433 32 233 2233456888888888887788888888887777665 557777775321 Q ss_pred ---CCcceEEEEecc----------------ccccccC------CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCcc Q lcl|NC_019406. 479 ---TDTATLRYEIDA----------------TFLTTAL------DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPST 532 (661) Q Consensus 479 ---~~~~~~~v~ln~----------------DF~~~~l------da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~~ 532 (661) .....-.|.||+ |+..... ..+.+.+|+++++. ++.+....-+.- -. +.+- T Consensus 509 e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~-~~d~ 585 (714) T protein:vir:32 509 NRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVN-LLDV 585 (714) T ss_pred cCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHH-hcCC Confidence 111111233331 1111111 13456666777653 222211100000 00 1111 Q ss_pred CCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHH--HHhccCCCchhHHHhhhhhh---h Q lcl|NC_019406. 533 QTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQ--AERHLEIDEEKLRISAKVGS---T 607 (661) Q Consensus 533 ~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~--~e~~~~~~~~~~~~~~~~~~---~ 607 (661) -..+++.++|.+..+.-+. .+.+ .++ +|.+.+...-++++..++ ++.++.++..++++...-++ . T Consensus 586 p~~~el~~~ir~~~~~~~~--~~~~------~~e--~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~ 655 (714) T protein:vir:32 586 PQKQEFVERIRAALGTPKS--PDEM------TPE--EQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRD 655 (714) T ss_pred CCHHHHHHHHHHHcCCCCC--cccc------chh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346778888765432211 1000 011 111111111122222222 22333222233222221111 1 Q ss_pred hhhHHHhcCCh----hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC-----CCccCC-C Q lcl|NC_019406. 608 SVAASRKLGDP----EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR-----PPQNGA-S 661 (661) Q Consensus 608 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~-~ 661 (661) ...++..++.. ......+++.+++. +..+..+. .+-|+.-. +++.-+ | T Consensus 656 ~~~a~~~~~~~~~~~~~~~~~~a~~a~~~-~~~~~~~~-----~~~~~~~q~~q~~~~~~~~~~ 713 (714) T protein:vir:32 656 NASAQREVALTQGQRYVDALNQAHTAEII-TGVQNMEQ-----EQDVLQQQMLYTLQQRMNEMS 713 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhHhhhhh-----hhHHHHHHHHHHHHHHHHhcC Confidence 11221111111 11222222222221 00111111 11111100 001000 0 No 108 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.00 E-value=0.00022 Score=40.68 Aligned_cols=486 Identities=13% Similarity=0.094 Sum_probs=190.5 Q ss_pred CCCCC-Cccccccccccc-------cccCCccccCHHHHHHHHHHHH-HHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLS-PNSANIRRTKRG-------AQQFTHLVVHPEYEYYRPDWAK-IRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~-~~~~~~~~~~~~-------~~~~~V~~~hPey~a~~~~W~~-irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |+-+- ++.-+|.+.+-. +.-..+..-||.-.=--.+|.. ++..-.|. ++. --+-|+. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd----------~~~----~~~L~e~ 66 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGN----------LQA----QAELFMD 66 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCC----------HHH----HHHHHHH Confidence 33221 122223322111 1111111112111100012322 33333332 111 1234666 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) .+.| -.++...++.....|...+..|+ |+.= .+.+. -..-+.+.+.+.++ .+++.++.+++ T Consensus 67 m~e~---D~~i~s~l~~Rk~av~~~~w~I~--p~~~----~~~~~------~~~a~~v~~~l~~~----~~~~~~i~~~l 127 (526) T protein:vir:99 67 MEER---DAHLFAEMSKRKRAILGLDWAVE--PPRN----ASAAE------KADADYLHELLLDL----EGLEDLLLDAL 127 (526) T ss_pred HHhh---ChHHHHHHHHHHHHHhCCCceEe--cCCC----CCHHH------HHHHHHHHHHHhcc----cCHHHHHHHHH Confidence 5544 33444444444455555565553 1100 00000 00011233333222 25888888887 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) . ++-||.+.+=+-|- .+ +|.-.+.-+..|- T Consensus 128 d-a~~~G~s~~Eivw~-------------------------~~--~g~~~~~~l~~r~---------------------- 157 (526) T protein:vir:99 128 D-GIGHGYSCIELEWA-------------------------LQ--GREWMPLAFHHRP---------------------- 157 (526) T ss_pred H-hhhhcceeEEEEEe-------------------------ec--CCceeEEEeeeec---------------------- Confidence 4 77788666555432 21 1111111000000 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG 311 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g 311 (661) ..++.|.. ++..+.+. +.+ ...| T Consensus 158 ------------------------------~~~f~~~~-------------~~~~~l~~---~~~-----------~~~g 180 (526) T protein:vir:99 158 ------------------------------QSWFQLNP-------------EDQNELRL---RDN-----------SPAG 180 (526) T ss_pred ------------------------------ccceeecc-------------CCCcEEEe---cCC-----------CCCc Confidence 00000000 00000010 000 0012 Q ss_pred cccceeeEEE-EecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce-----eEecc Q lcl|NC_019406. 312 RTLPFIPFVF-FGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE-----YHIGP 382 (661) Q Consensus 312 ~~L~~IPfv~-~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~-----l~iGs 382 (661) .+|+.-=|++ .+....+...+.+.|..++..-+---....+.-.-+..-|.|+++.. |.++++++. ..||+ T Consensus 181 ~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~ 260 (526) T protein:vir:99 181 EALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGH 260 (526) T ss_pred eeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhh Confidence 2222111222 23334455556666666666555444466677788888899998885 444443322 35899 Q ss_pred cceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH--HhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 383 GRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA--IGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 383 ~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~--lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) +++.++|. |..+.|++..+.+......-++...++|.. +|--|......+..-|--...........++.+-+..++ T Consensus 261 d~~~iiP~-~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~ 339 (526) T protein:vir:99 261 AAAGIIPE-TMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLA 339 (526) T ss_pred CcEEEecC-CceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999995 789999998777767777777777777763 444332111111111222345566667778889999999 Q ss_pred HHHH-HHHHHHHHHcCCCCCCc-ceEEEEeccccccccCCHHHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCccCCHHH Q lcl|NC_019406. 461 DGMT-SVVRYWLMFRDIPLTDT-ATLRYEIDATFLTTALDARALRAIQQLYEGGL-LPIDALYENFVKNGIIPSTQTLEE 537 (661) Q Consensus 461 ~Al~-~aL~~~A~w~G~~~~~~-~~~~v~ln~DF~~~~lda~~l~all~~~~aG~-Is~et~~~eL~r~gvl~~~~~~Ee 537 (661) ++++ +++.+++.|-+-...+. --.+|.+... ...++ ...++.+..+...|. |+.+.+.+.+ |+ |.....|+ T Consensus 340 ~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~-e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~---Gi-p~~~~~e~ 413 (526) T protein:vir:99 340 ATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLR-EQADI-TSMAQSIPALVNVGLEIPSAWVYDKL---GI-PQPAKNEP 413 (526) T ss_pred HHHHHHHHHHHHHhCCCCcCCccccceEEeCCC-CcccH-HHHHHHHHHHHhCCCccCHHHHHHHh---CC-CCCCCccc Confidence 9997 59999999976422111 1123433210 11111 124556666777786 8876664443 65 22222222 Q ss_pred HHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhh--------cCChh-------hHHHHHHHhccCCCchhHHHhh Q lcl|NC_019406. 538 FTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAAR--------DADFQ-------QQELEQAERHLEIDEEKLRISA 602 (661) Q Consensus 538 e~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~--------e~d~~-------q~~~~~~e~~~~~~~~~~~~~~ 602 (661) .......+ ..+..........+.....+...++.+- ..|++ .+..++-+...-+++-+.+|.+ T Consensus 414 ~l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~ 491 (526) T protein:vir:99 414 VLRSAAQP--AILSRQHGQRVAALATIVGPRYGDQQALDKALADLPAKDMQNQANDLLAPLLEAVNRGDSETELLGALAE 491 (526) T ss_pred ccCCCCCC--cccccccccccccccccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHH Confidence 22111111 1110000000000000000000000000 00000 1111111111112222222222 Q ss_pred hhhhhhhhHHHhcCChhhhhhhhhhhhHHHHh--hcccccCCCC Q lcl|NC_019406. 603 KVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQ--QKQAAAKPVT 644 (661) Q Consensus 603 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 644 (661) ..+++..+ ++...=+..--++.= +--+....+- T Consensus 492 l~~~ld~~---------~l~~~l~~a~~~A~l~Gr~~~~~e~~~ 526 (526) T protein:vir:99 492 AFPDMDDS---------ALTDALHRLLFAADTWGRLHGNLDRID 526 (526) T ss_pred HhccCCHH---------HHHHHHHHHHHHHHHhhhhhhhhcccC Confidence 21111111 111100000000000 0000000000 No 109 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=96.85 E-value=0.0003 Score=39.91 Aligned_cols=467 Identities=12% Similarity=0.073 Sum_probs=185.2 Q ss_pred CCCCC-Cccccccccccc-------cccCCccccCHHHHHHHHHHH-HHHHHhcchHHHHhCCcccCCCCCCCChHHHHH Q lcl|NC_019406. 1 MAGLS-PNSANIRRTKRG-------AQQFTHLVVHPEYEYYRPDWA-KIRDAIAGEREIKAQGVKYLKAPKGFDDEDYAN 71 (661) Q Consensus 1 ~~~~~-~~~~~~~~~~~~-------~~~~~V~~~hPey~a~~~~W~-~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~ 71 (661) |+.+- |..-+|++.+-. +..+.+..-||.-.=--.+|. .++..-.|... ...+ | .|+- T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~--~~~~--L---------~~dm 67 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLT--AQAD--L---------AFDM 67 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHH--HHHH--H---------HHHH Confidence 44432 222222211111 111111111221111112222 23333333211 1111 1 2232 Q ss_pred HHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHH Q lcl|NC_019406. 72 YLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVA 151 (661) Q Consensus 72 rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~ 151 (661) -..-+.+...+..-..++.|+-|+-.|.-+.-|. .+ +..+.+.+++.++ -+++.++..++ T Consensus 68 ~~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~-~~---------------~~a~~v~~~l~~~----~~f~~~~~~ll 127 (512) T protein:vir:19 68 EEKDTHLFSELSKRRLAIQALEWRIAPARDASAQ-EK---------------KDADMLNEYLHDA----AWFEDALFDAG 127 (512) T ss_pred HhhChHHHHHHHHHHHHHhCCCceEecCCCCCHH-HH---------------HHHHHHHHHHhcC----CCHHHHHHHHH Confidence 2334667777777788888887776553211010 00 0011222222222 14778888876 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) .++-||.+.+=+- |+.+ +|.-.+..+..+. T Consensus 128 -dA~~~G~s~~Ei~-------------------------w~~~--~g~~~~~~~~~r~---------------------- 157 (512) T protein:vir:19 128 -DAILKGYSMQEIE-------------------------WGWL--GKMRVPVALHHRD---------------------- 157 (512) T ss_pred -hhhhhcceeeeeE-------------------------eeee--CCceeeeeeeeec---------------------- Confidence 4777886655443 3221 2211111111100 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG 311 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g 311 (661) ..++.|.. ++....+. +.+ ...| T Consensus 158 ------------------------------~~~f~~~~-------------~~~~~lr~---~~~-----------~~~G 180 (512) T protein:vir:19 158 ------------------------------PALFCANP-------------DNLNELRL---RDA-----------SYHG 180 (512) T ss_pred ------------------------------cccceecc-------------CCCcEEEe---cCC-----------CCCc Confidence 00000000 00000000 000 0112 Q ss_pred cccceeeEEEE-ecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce-----eEecc Q lcl|NC_019406. 312 RTLPFIPFVFF-GSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE-----YHIGP 382 (661) Q Consensus 312 ~~L~~IPfv~~-~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~-----l~iGs 382 (661) .+|+.-=|+++ +....+...+...|..++..-+--=....+.-.-+..-|.|+++.. |.++.+++. ..||+ T Consensus 181 ~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~ 260 (512) T protein:vir:19 181 LELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGR 260 (512) T ss_pred eeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhh Confidence 22221002222 2233344455555656655555444555677788888899998874 333333332 35799 Q ss_pred cceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH--HHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_019406. 383 GRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA--AIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALE 460 (661) Q Consensus 383 ~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~--~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le 460 (661) +++.++|. |..+.|++.++.+......-++....+|. .+|--|. ... +..-|--..........-++.+.+..++ T Consensus 261 ~a~~iiP~-~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlT-s~~-g~~Gs~a~~~vh~ev~~di~~aDa~~i~ 337 (512) T protein:vir:19 261 RAGGIIPM-GMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLT-TEA-GDKGARSLGEVHDEVRREIRNADVGQLA 337 (512) T ss_pred CcEEEecC-CceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhc-ccc-cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999995 78999999877776667777777777776 3454432 221 1111223456666777888899999999 Q ss_pred HHHH-HHHHHHHHHcCCCCCCc-ceEEEEeccccccccCCHHHHHHHH---HHHhcCC-CCHHHHHHHHHhcCCCCccCC Q lcl|NC_019406. 461 DGMT-SVVRYWLMFRDIPLTDT-ATLRYEIDATFLTTALDARALRAIQ---QLYEGGL-LPIDALYENFVKNGIIPSTQT 534 (661) Q Consensus 461 ~Al~-~aL~~~A~w~G~~~~~~-~~~~v~ln~DF~~~~lda~~l~all---~~~~aG~-Is~et~~~eL~r~gvl~~~~~ 534 (661) ++++ +++++++.|-+....+. .-.+|.+.. . .+.++.++. .....|. |+.+.+.+. -|+ |.-.. T Consensus 338 ~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~----~--e~eDl~~~a~~~~~l~~G~~i~~~~i~e~---~Gi-p~~~~ 407 (512) T protein:vir:19 338 RSINRDLIYPLLALNSDSTIDINRLPGIVFDT----S--EAGDITALSDAIPKLAAGMRIPVSWIQEK---LHI-PQPVG 407 (512) T ss_pred HHHHHHHHHHHHHhCCCCCCCccccceEEecC----C--ChhhHHHHHHHHHHHhcCCCCCHHHHHHH---hCC-CCCCC Confidence 9997 58999999886433221 112333321 1 122332222 2222344 665544333 354 22222 Q ss_pred HHHHHHHHhccCCCCC-CchhhhhhcCCccccCCCcchhhhhcC----ChhhH-------HHHHHHhccCCCchhHHHhh Q lcl|NC_019406. 535 LEEFTIKMNDPKSFIG-QPDAIAMRRGYVSRQQELDQQRAARDA----DFQQQ-------ELEQAERHLEIDEEKLRISA 602 (661) Q Consensus 535 ~Eee~~~l~~~~~~l~-~ddae~~~~g~~~~~~~~~q~~~~~e~----d~~q~-------~~~~~e~~~~~~~~~~~~~~ 602 (661) .|+.... ....+.-+ ...+... ......|+.....+ |+++. ..++-+ ..-+++-+.+|.+ T Consensus 408 ~e~~~~~-~~~~~~~~~~~~~~~~------~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~i~~~~~-~~s~ee~~~~L~~ 479 (512) T protein:vir:19 408 DEAVFTI-QPVVPDNGSQKEAALS------AEDIPQEDDIDRMGVSPEDWQRSVDPLLKPVIFSVL-KDGPEAAMNKAAS 479 (512) T ss_pred ccccccC-CCcccccccccccccc------ccCCCchhhHhHHhhhHHHHHHHHHHHHHHHHHHHH-hCCHHHHHHHHHH Confidence 2222111 10100000 0000000 00000111100000 11111 000000 0111222222222 Q ss_pred hhhhhhhhHHHhcCChhhhhhhhhhhhHH-------HHhhcc Q lcl|NC_019406. 603 KVGSTSVAASRKLGDPEQAKPSKAEQAQI-------DAQQKQ 637 (661) Q Consensus 603 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~ 637 (661) ..+++. +.++...=+..--+ +.+..| T Consensus 480 l~~~ld---------~~~l~~~l~~a~~~A~l~G~~~~~~e~ 512 (512) T protein:vir:19 480 LYPQMD---------DAELIDMLTRAIFVADIWGRLDAAADH 512 (512) T ss_pred HhccCC---------HHHHHHHHHHHHHHHHHhhhhhhhccC Confidence 111111 11111100000000 001111 No 110 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=96.80 E-value=0.00028 Score=40.08 Aligned_cols=423 Identities=10% Similarity=0.015 Sum_probs=169.9 Q ss_pred HhchhhccCccccccchhhHhhhhcccccc------cccchhhh---------hhhHhhhhhccCCCCCHHHHHHHHHHH Q lcl|NC_019406. 89 MVGQIFRRPPVIRNLPNTGAITGRDAEGGV------QVVAPASI---------GKLLTQLQRFAKDGTSHQGFAKTVALE 153 (661) Q Consensus 89 l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~------~~~~~~~~---------~~~~~~~~~~dl~G~sL~~fa~~~~~~ 153 (661) |-- |-++-|.....-+..+...+=..|.. ..++|... ...+.++ +...+.-++++++.. T Consensus 1 m~~-V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl-----~rA~~~n~~~~t~~~ 74 (501) T protein:vir:95 1 MPN-VSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYL-----KRAVFYNVARRTLFG 74 (501) T ss_pred CCC-CCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHh-----hccccCchHHHHHHH Confidence 211 21222222111111222222223321 12333211 1122222 222333333333322 Q ss_pred HHhhCCE---EEEEeccCC--------CchhhcccceeEeechhhhccceeeccccccceeeeeee------eeeeeccc Q lcl|NC_019406. 154 QVAMGRF---GALVDVAPS--------SDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLR------EFERVDEH 216 (661) Q Consensus 154 ~L~~Gr~---gvLVD~P~a--------~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ir------e~~~~~~~ 216 (661) . .|++ -..+|.|+. +......--++ .++...- ..-| ..+|++- +......+ T Consensus 75 l--~G~vf~k~p~~~~p~~l~~l~~d~D~~G~~L~~f~-----~~~~~~~--l~~G---~~~ilVD~P~~~~~~~~t~a~ 142 (501) T protein:vir:95 75 L--VGQVFMRDPVVKVPALLNPLVANATGSGINLTQLA-----KRAVSLN--LAYS---RAGLLVDYPTTEAEGGASIAD 142 (501) T ss_pred H--hhhhhcCCcceeCcHHHHHHHhccCCCCCCHHHHH-----HHHHHHH--HhcC---eEEEEEeecCCCCcccccHHH Confidence 1 1211 123344431 11000000000 0000000 0011 1222221 11223344 Q ss_pred cccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceE-EEEEEEec Q lcl|NC_019406. 217 ATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRV-YKQFVYVE 295 (661) Q Consensus 217 ~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~-~~~~~~~~ 295 (661) ....+.+|||.+|.+++||||++.+++|+..+++++++|++.. ..+.++... -.+++..- T Consensus 143 ~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~-------------------~d~~f~~~~~~q~RvL~~ 203 (501) T protein:vir:95 143 LEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCA-------------------ADDGFEMKTSGQFRVLRL 203 (501) T ss_pred HHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEee-------------------cCCCcccceeEEEEEEee Confidence 5567779999999999999999999999999999887644221 123333332 22334443 Q ss_pred Ccccccccceeecc-----------CCcccceeeEEEEecCCCCCCccccchh------------HHHHHHHHHHhhhhh Q lcl|NC_019406. 296 DPLGQARDVYTPMV-----------RGRTLPFIPFVFFGSMSNAADCEKPPLL------------DIVELNLKHYRTYAE 352 (661) Q Consensus 296 ~~~~~~~~~~~p~~-----------~g~~L~~IPfv~~~~~~~~~~~~~pPLl------------dLA~LNl~HYq~sSD 352 (661) +..|.+.-.+.-.. .|.....+++++. ..++...+..||- ..-.++|+|- T Consensus 204 ~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~--~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~l----- 276 (501) T protein:vir:95 204 DEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPT--DAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASL----- 276 (501) T ss_pred CCCceEEEEEEEecCCcccCcceecCCcccccceeeee--ccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHH----- Confidence 33333221111110 1111112222222 2233333344433 2234455554 Q ss_pred HHHHHHHhcCc----eeEEecCCCCC-----CceeEecccceeecCCCCCcceEeecCchh---HHHHHHHHHHHHHHHH Q lcl|NC_019406. 353 LEHGRFFTALP----TYYAPELDDSD-----ASEYHIGPGRVWVVDKESGIPGIIEFKGEG---LKTLERALNEKEQQIA 420 (661) Q Consensus 353 l~~il~~~~~P----~l~i~Gl~~~~-----~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~---i~a~~~~L~~le~qM~ 420 (661) ++-|+.... ++.+.++.--+ ......++...+.+. +...+.-|.+.. ++..-..|. .+.|. T Consensus 277 --ni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G---~~~~~~lP~~~~~~~ie~~~~~i~--~~~l~ 349 (501) T protein:vir:95 277 --NMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFG---SRGGIPLPVGADAKLLQASENTML--KEAMD 349 (501) T ss_pred --HHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeec---ccccccCCCCCceeEEecChhhHH--HHHHH Confidence 344544443 34455443322 112233344444443 233344455543 233334553 34566 Q ss_pred HHhHHhcccccCcc-chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccc-cccCC Q lcl|NC_019406. 421 AIGGRLMPGMSKSV-SESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFL-TTALD 498 (661) Q Consensus 421 ~lGArll~~~~~~~-~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~-~~~ld 498 (661) .+=..|...+-+-. ..++.......+-+.+.-.+.-..+...++.+|+.+-+|+..-. +.-.-++ .|. ...+. T Consensus 350 ~l~~~m~~~Ga~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~---g~~~~~~--~v~i~~df~ 424 (501) T protein:vir:95 350 TKERQMVALGAKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWV---GQADSGV--KFELNTDFD 424 (501) T ss_pred HHHHHHHHHHHhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHc---CCCCCce--EEEEecccc Confidence 55555544322222 22345566677777777788888999999999999999997532 1101111 121 22233 Q ss_pred HHHHH-HHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCC--ccccCCC----cch Q lcl|NC_019406. 499 ARALR-AIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGY--VSRQQEL----DQQ 571 (661) Q Consensus 499 a~~l~-all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~--~~~~~~~----~q~ 571 (661) ...++ +.+ +.+..|...|.|+....+++.+. ...++.++......-. .+.+..+ .+. T Consensus 425 ~~~~~~~~~-----------~al~~~~~~G~is~~t~~~~L~~-----~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~ 488 (501) T protein:vir:95 425 IARMTPDER-----------RSLVEEWQKGAITFEEMRTGLRK-----AGVATEDDSKAKEKIAKDTAEAMALATPANVP 488 (501) T ss_pred cccCCHHHH-----------HHHHHHHhCCCCcHHHHHHHHHh-----CCCCChhHHHHHHHHHhhhcCcccccccCCCC Confidence 32221 111 12224556777776655555442 2333322221111000 0001000 111 Q ss_pred hhhhcCC-hhhHH Q lcl|NC_019406. 572 RAARDAD-FQQQE 583 (661) Q Consensus 572 ~~~~e~d-~~q~~ 583 (661) ....-+| +--++ T Consensus 489 ~~~~gg~~~~~~~ 501 (501) T protein:vir:95 489 GDGSGGDNVGNSE 501 (501) T ss_pred CCCcccccccCCC Confidence 1111111 11111 No 111 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.58 E-value=0.00049 Score=38.72 Aligned_cols=322 Identities=14% Similarity=0.042 Sum_probs=124.2 Q ss_pred hhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee---EEEE-ecCCCC Q lcl|NC_019406. 253 RADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP---FVFF-GSMSNA 328 (661) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP---fv~~-~~~~~~ 328 (661) +.|.+-..+ ++.+..+. +.+.+.. .+.+|.....+.. ..-...+ ..|..--.|| |+++ +....+ T Consensus 1 v~Eivw~~~--~g~~~~~~----l~~r~~~---~~~~f~~~~~~~l--~~~~~~~-~~g~~~~~lp~~kfi~~~~~~~~g 68 (355) T protein:vir:78 1 MFEQVYRIE--NGRARLGK----LAWRPPR---TISRFDVAPDGGL--VAIEQWG-VFGKATVRIPVDRLVVFVNEREGA 68 (355) T ss_pred CeEEEEEee--CCeEEEee----eeecCcc---ceeeeeeccCCce--eEEEecC-CCCCCcceeccCCEEEEEeCCCCC Confidence 111111111 11111111 1111110 1111111111110 0000000 0111111232 2322 222233 Q ss_pred CCccccchhHHHHHHHHHHhhhhhHHHHHHHh--cCceeEEecCCC-----CCC-----------c-------eeEeccc Q lcl|NC_019406. 329 ADCEKPPLLDIVELNLKHYRTYAELEHGRFFT--ALPTYYAPELDD-----SDA-----------S-------EYHIGPG 383 (661) Q Consensus 329 ~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~--~~P~l~i~Gl~~-----~~~-----------~-------~l~iGs~ 383 (661) -..+.+.|..++..-+ |.+.+--....|.- +.|+++..|-.. .++ + .+..|.. T Consensus 69 ~p~G~gLlr~~~w~~~--fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~ 146 (355) T protein:vir:78 69 NWLGQSLLRQAYKNWL--LKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEA 146 (355) T ss_pred CccchhhHHHHHHHHH--HHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcc Confidence 3344554555444333 12222223333333 558888765321 111 0 1346888 Q ss_pred ceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH-HhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 384 RVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA-IGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 384 ~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~-lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) ++.++|. |.++.|++..|.... ..+.++...++|.. +.+..|........-|--...........++.+.+..++++ T Consensus 147 a~~iip~-g~~ie~~ea~g~~~~-~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ 224 (355) T protein:vir:78 147 AGGYIPH-GANFTLTGVQGKLPE-MDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADV 224 (355) T ss_pred eeEeecC-CceEEEeecCCCccc-HHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888995 789999998776543 45566666666652 23444433111111122334455666678888999999999 Q ss_pred HH-HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCC-CHHHHHHHHHh-cCCCCccCCHHHHH Q lcl|NC_019406. 463 MT-SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLL-PIDALYENFVK-NGIIPSTQTLEEFT 539 (661) Q Consensus 463 l~-~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~I-s~et~~~eL~r-~gvl~~~~~~Eee~ 539 (661) ++ +++++++.|..-+. ..-.+|.+.+ .... +....+.+..+...|.+ +.+....++++ -|+ |.-...+++ T Consensus 225 ln~~li~~l~~lN~~~~--~~~P~~~~~~--~~~~-~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gi-p~p~~~~~~- 297 (355) T protein:vir:78 225 TQQHVVEDLVDQNWGPE--EPAPRLVPAQ--LGKE-QPVTAEAIRALVECGAFTADPELEKDLRARYGL-PAPAERDDG- 297 (355) T ss_pred HHHHHHHHHHHhcCCCC--CCCCEEEecC--cChh-HHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CCCCCCCcc- Confidence 97 58999999874221 1123444421 2221 12235566667777764 33333333443 443 322211211 Q ss_pred HHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCCh Q lcl|NC_019406. 540 IKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDP 618 (661) Q Consensus 540 ~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 618 (661) +....+..+.........|... +.+.... ..+....+.+.++|+..+-.+.-|--++- T Consensus 298 --~~~~~~~~~~~~~~~~~~~~~~-~~~~~a~------------------~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 298 --ADAAAAKAAGRRRAKRLPGQRQ-GAALPSR------------------SPRADPPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred --cCCccccccccccccccCCccc-ccccccc------------------CCCCCChhhhHHHHHHhhccccCCCCCCC Confidence 1111111111111111111110 1111111 11111111222222222222222211111 No 112 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=96.43 E-value=0.00063 Score=38.13 Aligned_cols=506 Identities=11% Similarity=0.096 Sum_probs=195.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC---CCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP---KGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~---~~E~~~~Y~~rl~rA~ 77 (661) ||-.+--- +|. ++.+ ..-.+.-.-+-....++|+-|.+.+ ||.. .+..... ++. -. T Consensus 1 ~~~~~~~~-~~~--~~~~-~~r~~~l~~~R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~~---~~~-~~ 59 (535) T protein:vir:94 1 MASSQKRE-GFA--ENGA-KAVYDALKNDRNSYETRAENCAKYT-------------IPSLFPKDSDNAST---DYT-TP 59 (535) T ss_pred CCchhhhh-hHH--HHHH-HHHHHHHHHHhhHHHHHHHHHHHHh-------------ccccCCCCCCcccc---ccC-Cc Confidence 22111000 000 0000 0000000011112344454444443 3321 1111111 111 13 Q ss_pred ccchHHHHHHHHhchhhcc--CccccccchhhHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHHHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRR--PPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFAKT 149 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk--~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~~ 149 (661) |-+.-.+.++.+...++.- |+. | ...--..|.+.......+.....++.+|+.| -+..++.+.-+-. T Consensus 60 ~dst~~~a~~~Laa~l~~~ltP~~----~-WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~ 134 (535) T protein:vir:94 60 WQAVGARGLNNLASKLMLALFPMQ----T-WMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFE 134 (535) T ss_pred ccccHHHHHHHHHHHHHhhhcCCC----C-ccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHH Confidence 4444444555444333331 211 0 1111011111111111122222344444432 1356778888888 Q ss_pred HHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeee Q lcl|NC_019406. 150 VALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGRE 229 (661) Q Consensus 150 ~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~ 229 (661) ++.+.+.+|-+-+++|-++... . .+..|+-.+ +-+. .++.+.+.-|+.++....+.- T Consensus 135 ~~~~L~~~G~a~l~~~~~~~~~----~--~f~~~pl~~---y~v~-~d~~G~vd~i~r~~~~~~~~l------------- 191 (535) T protein:vir:94 135 TLKQLVVAGNALLYIPEPEGTY----N--PMKLYRLSS---YVVQ-RDAFGTVLQIVTLDKTAYAAL------------- 191 (535) T ss_pred HHHHHHhhCcEeEeeccCcCcc----c--ceEEEEcCe---EEEe-eCCCCCeEEEEeeeeccHHHh------------- Confidence 8999999999999998653211 1 122232222 1111 122223333333332211100 Q ss_pred chhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeecc Q lcl|NC_019406. 230 GSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMV 309 (661) Q Consensus 230 ~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~ 309 (661) ...|+..... ...........++....++. .+..|.|+.+.++.... +. ... T Consensus 192 ----~~~~~~~~~~-----------------~~~~~~~~~v~v~~~v~~~~---~~~~~~~~~e~~g~~~~-~~---~~~ 243 (535) T protein:vir:94 192 ----PEDVRNSMDS-----------------SQEHKGDEMIDVYTHIYLDE---ESGEYLKYEEIDGVEVE-GT---DAS 243 (535) T ss_pred ----hHHHHHHHHh-----------------ccccCCCceeEEEEEEEeeC---CCCcEEEEEEecCeeec-cc---ccc Confidence 0111111100 00112223344444433322 12334554444332110 00 001 Q ss_pred CC-cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEecccce Q lcl|NC_019406. 310 RG-RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRV 385 (661) Q Consensus 310 ~g-~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~ 385 (661) .| ..+++||+.|.-..+..+..+ .--|-|+..||.-+ .+-++.+......|.++-+ |..+. ..+.-|.+.. T Consensus 244 ~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~---~~~l~~~~~a~~~~~lv~p~g~~~~--~~~~~~~~g~ 318 (535) T protein:vir:94 244 YPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQ---EAIVKMSMISAKVIGLVNPAGITQV--RRLTKAQTGD 318 (535) T ss_pred CccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHH---HHHHHHHHHhccCCcccccccccch--hhcccCCCce Confidence 12 246788888876666666443 11245777777543 3434555555566655543 33222 2233343333 Q ss_pred eecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHHh--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIG--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lG--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) +. |...+..+.++.. +..+....+.|++++..++..= ..+.. ..+...||++...+...-...|..+-..+++= T Consensus 319 ~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~--~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 395 (535) T protein:vir:94 319 FV-SGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQ--RTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (535) T ss_pred ee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhcc--CCCCCccHHHHHHHHHHHHHHhhhHHHHHHHH Confidence 33 3323345454332 4578888999999999887431 11221 23345799999999999999999888876544 Q ss_pred H-HHHHHHHHHHc---CC-CCCCcceEEEEeccccccccCCH----HHHHHHHHHHhcCCCCHHHHHHHHHhc--CCCCc Q lcl|NC_019406. 463 M-TSVVRYWLMFR---DI-PLTDTATLRYEIDATFLTTALDA----RALRAIQQLYEGGLLPIDALYENFVKN--GIIPS 531 (661) Q Consensus 463 l-~~aL~~~A~w~---G~-~~~~~~~~~v~ln~DF~~~~lda----~~l~all~~~~aG~Is~et~~~eL~r~--gvl~~ 531 (661) + .-++..+-..+ |+ +....+-+ +.+|. ..+.+ +++..++ .|+..|..- .+++. T Consensus 396 lL~Pli~r~~~il~r~g~lP~~p~~~v----~~~~v-s~la~l~r~~~~~~l~-----------~~~~~laq~~P~~ld~ 459 (535) T protein:vir:94 396 LQLPMVRVLLKQLQATNQIPELPKEAV----EPTIS-TGMEALGRGQDLDKLE-----------RCIAAWSALAPMQGDP 459 (535) T ss_pred HHHHHHHHHHHHHHhCCCCCCCChhhc----cceEe-ehHHHHHHHHHHHHHH-----------HHHHHHHhhChHHhhh Confidence 3 33333332222 22 11112222 33442 22211 1222222 233333322 23444 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA 611 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 611 (661) ..++++..+.+.+- +|.|-..... -++++.|- .+|+....|..+++.+++++--.. . T Consensus 460 ~id~d~~~~~~a~~---~Gvp~~~i~r-----s~eev~~~-------~~q~~~~~~~~~~~~~~g~~~~~~----~---- 516 (535) T protein:vir:94 460 DINIATIKLRIANA---IGIDTSGILK-----TPEEKQQE-------MAEAAQGTAMQNAAASAGAGAGTM----A---- 516 (535) T ss_pred cCCHHHHHHHHHHH---hCCChhhhcC-----CHHHHHHH-------HHHHHHHHHHHHHHHHHHHhhhcc----c---- Confidence 56788888888764 3322111111 01111110 111111112111111111111000 0 Q ss_pred HHhcCChhhhhhhhhhhhHHHHhhcccccCCC Q lcl|NC_019406. 612 SRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPV 643 (661) Q Consensus 612 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 643 (661) + .+|+. .+++. .|..-.|- T Consensus 517 --~-~~~~~-----~~~~~-----~~~g~~~~ 535 (535) T protein:vir:94 517 --T-ASPEN-----MKAAA-----AQAGMAPN 535 (535) T ss_pred --c-cChHH-----HHHHH-----HHhccCCC Confidence 0 11111 11100 01111111 No 113 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=396 Identities=11% Similarity=0.015 Sum_probs=165.0 Q ss_pred ccCCccc-cCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCC------CCCC---CChHHHHHHHhhhcccchHHHHHHH Q lcl|NC_019406. 19 QQFTHLV-VHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLK------APKG---FDDEDYANYLDRAAFYNMTSQTQAG 88 (661) Q Consensus 19 ~~~~V~~-~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLP------k~~~---E~~~~Y~~rl~rA~~~n~~~~tv~~ 88 (661) -||.|-. +.|+.......= ..| ..+ ..| ||- +.-+ +.-.-|+.-..+ -.++...++. T Consensus 1 ~~~~~~~~p~~~~~~~~~~~------~~~-~~~-~~g--~~~~D~~lr~~gg~~~~~~~l~~~m~e~---D~~v~s~l~~ 67 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYA------MEH-LGL-ATS--YLSEDGGYKRAGKPTYQQLSAWDEAAQT---EPIIAQGLDS 67 (446) T ss_pred CcccccCCCchhhhhhhhhc------ccc-chh-hcc--cCCcchHhhhcCCChHHHHHHHHHHHhc---chHHHHHHHH Confidence 6777777 777665543210 111 111 122 331 0001 111234333332 4566666666 Q ss_pred HhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccC Q lcl|NC_019406. 89 MVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAP 168 (661) Q Consensus 89 l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~ 168 (661) .-..|.+.+..|+ |.. -+.-+.+.+++++++ +...... +..++.||.+.+=+.|-. T Consensus 68 Rk~av~~~~w~V~--p~~----------------~~~a~~v~~~l~~~~-----~~~~~~~-~ldai~~G~s~~Eivw~~ 123 (446) T protein:vir:98 68 IALSVLNKVGPYQ--HGD----------------KRIKKFIDDQLRNRA-----KTWISHC-VKSIMTYGFSLSEQIYAH 123 (446) T ss_pred HHHHhhcCCceec--Ccc----------------HHHHHHHHHHHhhcC-----chhHHHH-HHHHHhhCceeeeEEEee Confidence 6667777777775 321 012234555565553 3333333 457788897777666543 Q ss_pred CCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhh Q lcl|NC_019406. 169 SSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAER 248 (661) Q Consensus 169 a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~ 248 (661) .++.. +|--++ ..++.|+ .++..+.+..+.. T Consensus 124 ~~g~~---~p~~~~---d~~~~~~-----------------------------~~~~r~~~~~~~~-------------- 154 (446) T protein:vir:98 124 GARDN---MPATVL---DDIVNYH-----------------------------PLQVMLIANDNGR-------------- 154 (446) T ss_pred ccccc---ccchhh---ccccccc-----------------------------cccceeeeccCCc-------------- Confidence 22210 010000 0001110 0000000000000 Q ss_pred hhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee-ccCC--cccceeeEE-EEec Q lcl|NC_019406. 249 QGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP-MVRG--RTLPFIPFV-FFGS 324 (661) Q Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p-~~~g--~~L~~IPfv-~~~~ 324 (661) .+ ++ ...+..+.... .+........++.... ...| .+++.=-|+ +.+. T Consensus 155 ---~~----------~~--~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~ 206 (446) T protein:vir:98 155 ---IV----------DG--DTVTASQYKSG-------------YWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYN 206 (446) T ss_pred ---cc----------cc--cccchhhcccc-------------cccCcccchhhhhhhhcccCcccccccccceEEEEec Confidence 00 00 00000000000 0000000000000000 0001 112222223 3343 Q ss_pred CCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce------------------eEeccc Q lcl|NC_019406. 325 MSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE------------------YHIGPG 383 (661) Q Consensus 325 ~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~------------------l~iGs~ 383 (661) ...+...+.+.|--++..-+--=...-+.-.-+-.=+.|+++.. |.++.+.+. ..+|+. T Consensus 207 ~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~d 286 (446) T protein:vir:98 207 TKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTD 286 (446) T ss_pred CCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhcccc Confidence 34444455555544444333222233345555666678888864 544433320 146666 Q ss_pred ceeec-----CCCCCcceEeecCchhHHHHHHHHHHHHHHHHH--HhHHhcccccCccchhHHHHHHHHHHhhHHHHHHH Q lcl|NC_019406. 384 RVWVV-----DKESGIPGIIEFKGEGLKTLERALNEKEQQIAA--IGGRLMPGMSKSVSESDNQSALREANEQSLLLNVI 456 (661) Q Consensus 384 ~~~~l-----p~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~--lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A 456 (661) ++.++ | +|..+.+++..+++-...+..++-...+|.. +|--|.-.++.+..-|--.+.....--.-.+.+-+ T Consensus 287 a~~ii~~~~~P-~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa 365 (446) T protein:vir:98 287 SGLVLTQLSKE-QPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIF 365 (446) T ss_pred ceeeeecccCC-CCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHH Confidence 66655 6 5789999998887655566667766667653 33323211111111122233344444556677889 Q ss_pred HHHHHHHH-HHHHHHHHHcCCCCCCcceEEEEecccccccc-CCHHHHHH----HHHHHhcCCCCH--HHHHHHHHhcCC Q lcl|NC_019406. 457 MALEDGMT-SVVRYWLMFRDIPLTDTATLRYEIDATFLTTA-LDARALRA----IQQLYEGGLLPI--DALYENFVKNGI 528 (661) Q Consensus 457 ~~le~Al~-~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~-lda~~l~a----ll~~~~aG~Is~--et~~~eL~r~gv 528 (661) ..+++.++ +++++++.|.+.+.. .-+.+..++.... -+++++.+ +..+...|.+.. +.++ .++-|+ T Consensus 366 ~~i~~tln~~Li~~l~~lNf~~~~----~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~i--re~~gi 439 (446) T protein:vir:98 366 DTVIHAFTEQVIGNLIRLNFDPAL----YPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHI--RSITGL 439 (446) T ss_pred HHHHHHHHHHHHHHHHHhCCCccc----cccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHH--HHHhCc Confidence 99999997 689999998874321 1111111111110 12344444 444455565422 2221 222343 Q ss_pred CCccCCHHH Q lcl|NC_019406. 529 IPSTQTLEE 537 (661) Q Consensus 529 l~~~~~~Ee 537 (661) |+... .. T Consensus 440 -P~~~~-~~ 446 (446) T protein:vir:98 440 -PDAIS-ST 446 (446) T ss_pred -CCCCC-CC Confidence 22110 11 No 114 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=96.30 E-value=0.00077 Score=37.65 Aligned_cols=571 Identities=10% Similarity=0.001 Sum_probs=181.8 Q ss_pred CCcc-ccCHHHHHHHHHHHHHHHHhcchHHHHhCC---cccC--CCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhh Q lcl|NC_019406. 21 FTHL-VVHPEYEYYRPDWAKIRDAIAGEREIKAQG---VKYL--KAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIF 94 (661) Q Consensus 21 ~~V~-~~hPey~a~~~~W~~irD~~~G~~~vr~~g---~~YL--Pk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vF 94 (661) |.=+ ..|-. .+. .++.++.....+|... ..|. =+|+.+.....+. ..|-+ +|.++++|+.++|.-= T Consensus 1 m~d~~~~~~~---~~~---~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~-q~rp~-~N~i~~~i~~v~g~~~ 72 (725) T protein:vir:77 1 MADNENRLES---ILS---RFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSEMR 72 (725) T ss_pred CCchHHHHHH---HHH---HHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHh-cCCCc-cccHHHHHHHHHhhHH Confidence 2111 11211 111 1112222222222100 0111 1565555544433 34444 5999999999999888 Q ss_pred ccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEE--EEeccCCCch Q lcl|NC_019406. 95 RRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGA--LVDVAPSSDP 172 (661) Q Consensus 95 rk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gv--LVD~P~a~~~ 172 (661) +..+.+.-+|..- .|. ..+---...|+.+++ -++.+.-..++|..++.+|.+|+ ..||...+.- T Consensus 73 ~nr~d~~v~P~~~----~d~-----~~Ae~l~~~~~~~~~-----~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~ 138 (725) T protein:vir:77 73 QNPIDVLYRPKDG----ARP-----DAADVLMGMYRTDMR-----HNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) T ss_pred hCCcceEEecCCc----cHH-----HHHHHHHHHHHHHHH-----hhCchhHHHHHHHHHhhcCcceeeeeecccCCCCC Confidence 8777776555421 111 111112223333333 34566678899999999999984 4577432221 Q ss_pred --hhcccceeEeechhh-hccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchh--h Q lcl|NC_019406. 173 --TAPAKSYTVGYAAEN-IVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLA--E 247 (661) Q Consensus 173 --~~g~rPY~~~~~p~~-IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~--~ 247 (661) ...++.+.+...+.+ ++||...+.+.. .=.|+.++.+...+. +.. ..|..- -.+..+..|......... . T Consensus 139 ~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~s-Dar~~~~~~~~~~d~--~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 213 (725) T protein:vir:77 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKS-DARHCTVIHSMSQNG--WED-FAEKYD-LDADDIPSFQNPNDWVFPWLT 213 (725) T ss_pred CCceeeEEeecccChhhceeCchhhccChh-hHHHHHHHhcCCHHH--HHH-HHhhCC-cchhhcccccccccccccccC Confidence 111222222223333 234443322211 000111122221110 000 000000 000001111000000000 0 Q ss_pred hhhhhhhhheecccccCCCceeeEEE--EEEEeecccccceE-----------------------------EEEEEEecC Q lcl|NC_019406. 248 RQGSARADALARPSRFTSSYTFRTIY--RELILELQKDGSRV-----------------------------YKQFVYVED 296 (661) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~--rv~~l~~g~~g~~~-----------------------------~~~~~~~~~ 296 (661) ...+++.++++ +... .+..+..+..|..+ ++++.+.- T Consensus 214 ~d~vrv~E~~~-----------r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~- 281 (725) T protein:vir:77 214 QDTIQIAEFYE-----------VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSII- 281 (725) T ss_pred CCeeEEEEEEE-----------EEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeee- Confidence 00011111111 0000 00111111111100 00000000 Q ss_pred cccccccceeeccCCcc--cceeeEEEEecCC---CCCCc--cc-cchhHHHHHHHHHHhhhhhHHHHHHH-hcCceeEE Q lcl|NC_019406. 297 PLGQARDVYTPMVRGRT--LPFIPFVFFGSMS---NAADC--EK-PPLLDIVELNLKHYRTYAELEHGRFF-TALPTYYA 367 (661) Q Consensus 297 ~~~~~~~~~~p~~~g~~--L~~IPfv~~~~~~---~~~~~--~~-pPLldLA~LNl~HYq~sSDl~~il~~-~~~P~l~i 367 (661) .+..+ +....+ -+.||||++.... .+-.. +- -.+.|.=. .+..| .|.-+ +++.. ...+..+- T Consensus 282 ----~g~~~--l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~-~~N~~-~S~~~-~~~~~~~~~~~~~~ 352 (725) T protein:vir:77 282 ----TCTAV--LKDKQLIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQR-LRNMI-MSFNA-DIVARTPKKKPFFW 352 (725) T ss_pred ----cCcee--eccCCcCCCCccceEEEeeeeeccCCcccccchhhhhhhHHH-HHHHH-HHHHH-HHHHhccccccccc Confidence 01110 001111 2345665443221 11111 10 01111110 11122 22222 33322 22222232 Q ss_pred ecCCCC----CC--ceeEecccceeecCCCC----CcceEeecCchhHHHHHHHHHHHHHHHHHH-hH--HhcccccCcc Q lcl|NC_019406. 368 PELDDS----DA--SEYHIGPGRVWVVDKES----GIPGIIEFKGEGLKTLERALNEKEQQIAAI-GG--RLMPGMSKSV 434 (661) Q Consensus 368 ~Gl~~~----~~--~~l~iGs~~~~~lp~~g----a~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GA--rll~~~~~~~ 434 (661) .|..+. +. +.......+.+... .| +.+.++++..- .....+-|+.....|..+ |. .++ +..+. T Consensus 353 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~i~~~~~~~l-p~~~~~ll~~~~~~i~~~tGi~~~~l--G~~~n 428 (725) T protein:vir:77 353 PEQIAGFEHMYDGNDDYPYYLLNRTDEN-SGDLPTQPLAYYENPEV-PQANAYMLEAATSAVKEVATLGVDTE--AVNGG 428 (725) T ss_pred hhhhhHHHHHHHhccCCceecccccccC-CCcccccCccccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHh--CCCch Confidence 322111 10 01111111111111 11 23344443221 233444555555555433 43 333 23344 Q ss_pred chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEeccc------------- Q lcl|NC_019406. 435 SESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDAT------------- 491 (661) Q Consensus 435 ~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~D------------- 491 (661) +.|+.+...+..+....|+.+-.|+..+...+ |.++..+++..- .+...-.+.||.. T Consensus 429 ~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~N 508 (725) T protein:vir:77 429 QVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLN 508 (725) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhh Confidence 57899999999888889898999988887776 777788873211 1111122333311 Q ss_pred -----ccc--------ccCCHHHHHHHHHHHhcCCCCHHHHHHHH-HhcCCCCccCCHHHHHHHHhccCCCCCC--c--- Q lcl|NC_019406. 492 -----FLT--------TALDARALRAIQQLYEGGLLPIDALYENF-VKNGIIPSTQTLEEFTIKMNDPKSFIGQ--P--- 552 (661) Q Consensus 492 -----F~~--------~~lda~~l~all~~~~aG~Is~et~~~eL-~r~gvl~~~~~~Eee~~~l~~~~~~l~~--d--- 552 (661) |.. .....+.+.+|++++...--.....-..| +--. +++.--.+++.+++..+.+.... + T Consensus 509 Di~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~-l~d~~~~~e~~erirkq~~~~~~~q~~~~ 587 (725) T protein:vir:77 509 DIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFT-LLDGKGVEMMRDYANKQLIQMGVKKPETP 587 (725) T ss_pred hhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhc-cccchHHHHHHHHHHhhhhhhhccCCCCh Confidence 211 00112344555555553210001110001 0001 11222235666666654332211 1 Q ss_pred -hhhhhhcCCccccCCCcchhh----------hhcCChhhHHHHHHHhc--cCCCchhHHHhh----------------- Q lcl|NC_019406. 553 -DAIAMRRGYVSRQQELDQQRA----------ARDADFQQQELEQAERH--LEIDEEKLRISA----------------- 602 (661) Q Consensus 553 -dae~~~~g~~~~~~~~~q~~~----------~~e~d~~q~~~~~~e~~--~~~~~~~~~~~~----------------- 602 (661) +++..+.- .+.+..|.++ ..++++++-..|+.+.+ ++-.+..+++.+ T Consensus 588 ~e~q~~~~~---qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~ 664 (725) T protein:vir:77 588 EEQQWLVEA---QQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSE 664 (725) T ss_pred hhHHHHHHH---HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH Confidence 11111100 0110111110 01111111011100000 000000111000 Q ss_pred ------hhhhhhhhHHHh--cCChhhhhhhhhh----hhHHHH----hhcccccCCCCCCCc Q lcl|NC_019406. 603 ------KVGSTSVAASRK--LGDPEQAKPSKAE----QAQIDA----QQKQAAAKPVTPTPG 648 (661) Q Consensus 603 ------~~~~~~~~~~~~--~~~~~~~~~~~~~----~~~~~~----~~~~~~~~~~~~~~~ 648 (661) .+.++.+..+++ .+.+.++| +|-+ +..+++ |+++-.+-.++.||- T Consensus 665 ~~~~~~~~~~~q~~~~~~~~~~ae~~~~-~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 725 (725) T protein:vir:77 665 FREFLKTVASFQQDRSEDARANAELLLK-GDEQTHKQRMDIANILQSQRQNQPSGSVAETPQ 725 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHH-hhhHHHhhHHHHHHHHHHHHhcCCCcCcccCCC Confidence 001111000000 00000000 0000 000111 122222223445554 No 115 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=96.26 E-value=0.00082 Score=37.51 Aligned_cols=461 Identities=11% Similarity=0.006 Sum_probs=187.9 Q ss_pred CCCC--CCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC------CCChHHHHHH Q lcl|NC_019406. 1 MAGL--SPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK------GFDDEDYANY 72 (661) Q Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~------~E~~~~Y~~r 72 (661) |+.. .|+--=|.... . +.|....... .-.|..-. ..| .++|+.. +-+-..|+.. T Consensus 1 m~~~i~~~~g~p~~~~~--------~-~~~~~~~ia~-------~~~~~~~~-~~~-~~~~~~~~iLr~~~~~~~~y~~m 62 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGE--------P-DKSLSSQIAT-------RARSIDFF-ALG-MYLPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCCceeCCCCCccCccc--------C-ChHHHHHHHh-------hhcccccc-ccc-CCccchHHHHHhcCCCHHHHHHH Confidence 3321 00000011000 0 0111111110 00000000 011 1122211 1234577776 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) +. -.++...++.....|...+..|+ |+. .| ....+.+.+.++ +..++.++.+++ T Consensus 63 ~~----D~~i~s~l~~Rk~av~~~~w~i~--~~~-------~~-------~~~~e~v~e~l~-----~~~~~~~l~~~l- 116 (491) T protein:vir:10 63 RA----DAHVGGCVRRRKAAVKALEWGLD--RGK-------AK-------SRVAKSIADVFA-----DLDLSRIVTEML- 116 (491) T ss_pred hh----ChHHHHHHHHHHHHHhCCCcEEe--cCC-------CC-------HHHHHHHHHHHh-----cCCHHHHHHHHH- Confidence 53 45666666666666666666663 211 01 111234444443 457899999987 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .++-||.+.+=+-| +.+ +|.-.+..+..+- T Consensus 117 da~~~G~s~~Ei~w-------------------------~~~--~g~~~~~~l~~r~----------------------- 146 (491) T protein:vir:10 117 DAVLYGYQPMEITW-------------------------GKV--GNYIVPIDVVGKP----------------------- 146 (491) T ss_pred HhhhhcceeEEEEE-------------------------eec--CCeeEEEEeeeec----------------------- Confidence 57778866654433 211 2211111111100 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) ..++.|. .++..+++. +.+ + ..|. T Consensus 147 -----------------------------~~~f~~d-------------~~~~l~~~~---~~~--~---------~~g~ 170 (491) T protein:vir:10 147 -----------------------------ADWFVYD-------------PENQLRFRS---KDH--W---------MQGE 170 (491) T ss_pred -----------------------------ccceeec-------------cCCceEEec---CCC--C---------CCcc Confidence 0000000 011111110 000 0 0111 Q ss_pred ccceeeEE-EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce-----eEeccc Q lcl|NC_019406. 313 TLPFIPFV-FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE-----YHIGPG 383 (661) Q Consensus 313 ~L~~IPfv-~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~-----l~iGs~ 383 (661) +|+.-=|+ +.+..+.+...+.+.|..++..-+---....+...-+..-+.|+++.. |.++++++. ..||++ T Consensus 171 ~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~ 250 (491) T protein:vir:10 171 ELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQD 250 (491) T ss_pred eecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcC Confidence 12111122 223233334445555666655554444455677788888899998875 334443332 358889 Q ss_pred ceeecCCCCCcceEeecCchh--HHHHHHHHHHHHHHHH--HHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_019406. 384 RVWVVDKESGIPGIIEFKGEG--LKTLERALNEKEQQIA--AIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMAL 459 (661) Q Consensus 384 ~~~~lp~~ga~~~ylE~~g~~--i~a~~~~L~~le~qM~--~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~l 459 (661) ++..+|. |..+.|++..+.+ ....++-++...++|. .+|- .+.. ..++ |--..........-++...+..+ T Consensus 251 a~~viP~-~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGq-tlTt-~~~g--s~a~~~vh~~v~~di~~~D~~~i 325 (491) T protein:vir:10 251 AVAVVPD-DSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQ-NQTT-EATS--TRASAQAGLEVTDDIRDGDKAVV 325 (491) T ss_pred cEEEecC-CceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhh-hccc-Cccc--chhHHHHHHHHHHHHHHHHHHHH Confidence 9999995 7899999987643 4456666666666665 4453 4433 2222 33344556666788888999999 Q ss_pred HHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCC-CCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 460 EDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGL-LPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 460 e~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~-Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) ++.+++.+.+++.|.+.+. .-.+|.+.. ....+....+.+..+...|. |+.+.++ ++-|+=.+... ++. T Consensus 326 ~~tln~li~~l~~~N~~~~---~~p~f~~~~---~~e~~~~~a~~~~~L~~~G~~i~~~~i~---e~~Gip~~~~~-~~~ 395 (491) T protein:vir:10 326 SEAMNMLIRWICDLNFDGA---DRPVFDMWE---QEQVDEIQAGRDQKLTQAGARFTPAYFK---RAYNLQDGDLD-ERP 395 (491) T ss_pred HHHHHHHHHHHHHhcCCCC---CcceEEecC---cCchhHHHHHHHHHHHhCCCcCCHHHHH---HHhCCCCCCcC-ccc Confidence 9999999999999997532 234555532 22222334556667777777 6655443 34465222221 111 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcC--------ChhhHHHHHHHhccCCCchhHHHhhhhhhhhhh Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDA--------DFQQQELEQAERHLEIDEEKLRISAKVGSTSVA 610 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~--------d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 610 (661) +..+.+...........++ ..+.+++.--....+ ++-.+..+.-++..-+++-+.+|.+..+++..+ T Consensus 396 ---~~~~~~~~~~~~~~~~~~~--~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d~~ 470 (491) T protein:vir:10 396 ---LPVSAVDTVGAASFAEFEA--PDQDALDAALNTLSARDLNADAQALVAPLLKRIANGASADELLGMLAELYPSLDAD 470 (491) T ss_pred ---cccCCCCCcccccccccCC--CCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHH Confidence 1111111111100000000 001100000000000 000111111111111122222222211111111 Q ss_pred HHHhcCChhhhhhhhhhhhHHHHhhccccc Q lcl|NC_019406. 611 ASRKLGDPEQAKPSKAEQAQIDAQQKQAAA 640 (661) Q Consensus 611 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (661) ++...=+..--++.=-=.+-| T Consensus 471 ---------~l~~~l~~a~~~A~l~G~~~a 491 (491) T protein:vir:10 471 ---------ALQERLARAIFVANLWGRLHA 491 (491) T ss_pred ---------HHHHHHHHHHHHHHHhhhccC Confidence 100000000001111111111 No 116 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=96.08 E-value=0.001 Score=36.95 Aligned_cols=506 Identities=12% Similarity=0.082 Sum_probs=196.7 Q ss_pred CCCCCCccccccccccccccC--CccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC-CCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQF--THLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP-KGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~-~~E~~~~Y~~rl~rA~ 77 (661) ||--.+| .-++... -.+.-.-+-....++|+-|.+.+- |.. +.++... ..++. -. T Consensus 1 m~~~~~~-------~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~l-------------P~~~~~~~~~~-~~~~~-~~ 58 (535) T protein:vir:15 1 MADSKRT-------GLGEDGAKATYDRLTNDRRAYETRAENCAQYTI-------------PSLFPKESDNE-STDYT-TP 58 (535) T ss_pred CCccchh-------ccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhc-------------ccccCCCCCcc-ccccc-cc Confidence 4432221 1111110 011111122234555665555543 321 1111110 00111 12 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhhhhc-ccccc---cccchhhhhhhHhhhhhcc------CCCCCHHHHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRD-AEGGV---QVVAPASIGKLLTQLQRFA------KDGTSHQGFA 147 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d-~dG~~---~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa 147 (661) |-+.-.+.++.+...++.- + .|+. .|+.= ..-.. ....+.....++.+|+.|. +.-++.+.-+ T Consensus 59 ~dst~~~a~~~Laa~l~~~---l--tP~~--~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~ 131 (535) T protein:vir:15 59 WQAVGARGLNNLASKLMLA---L--FPMQ--SWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTL 131 (535) T ss_pred ccccHHHHHHHHHHHHHHh---h--cCCC--cccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHH Confidence 3333444555444333321 1 0110 11110 00000 0000111112233333321 3456788888 Q ss_pred HHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee Q lcl|NC_019406. 148 KTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG 227 (661) Q Consensus 148 ~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~ 227 (661) -.++.+.+.+|-+-++||-+... . ..+..|+-. ++-+. .++.+.+.-|..++.... T Consensus 132 ~~~~~~L~~~G~a~l~~~~~~~~--~----~~f~~~pl~---~~~v~-~d~~G~vd~i~r~~~~t~-------------- 187 (535) T protein:vir:15 132 FECLKQLIVAGNALLYLPEPEGS--Y----NPMKLYRLS---SYVVQ-RDAYGNVLQIVTRDQIAF-------------- 187 (535) T ss_pred HHHHHHHHhhCceeEEeecCCCC--c----eeeEEEEcC---eeEEe-eCCCCCeeEEEEeEeecH-------------- Confidence 88999999999998998754211 0 111222211 12222 122223333333322211 Q ss_pred eechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 228 REGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 228 ~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) ..+ ...+... ..........+....+|....++.+ .+ .|.|+.+.++.... + T Consensus 188 ----~~l--------------~~~~~~~-~~~~~~~~~~~~~v~v~~~v~~~~~-~~--~~~~~~e~~g~~~~-~----- 239 (535) T protein:vir:15 188 ----GAL--------------PEDVRSA-VEKAGGEKKMDEMVDVYTHVYLDEE-SG--DYLKYEEVEDVEID-G----- 239 (535) T ss_pred ----HHH--------------HHHHhHh-hhccccccCCCCceeEEEEEEEecC-CC--cEEEEEEeeCcccc-c----- Confidence 111 0000000 0000001112333445554444322 22 34444443332210 0 Q ss_pred ccCC---cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEec Q lcl|NC_019406. 308 MVRG---RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIG 381 (661) Q Consensus 308 ~~~g---~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iG 381 (661) ...+ ..+++||+.|.-..+..+..+ ..-|-|+..||.-+-.. +..+......|.++-+ |.... ..+.-| T Consensus 240 ~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~---l~~~~~~~~p~~lv~~~g~~~~--~~l~~~ 314 (535) T protein:vir:15 240 SDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI---VKMSMISAKVIGLVNPAGITQP--RRLTKA 314 (535) T ss_pred cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH---HHHHHHHhcCceeecccccccc--hhcccC Confidence 0111 236778887876655555443 11244777777654332 3444444455545433 22221 123334 Q ss_pred ccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHH--hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAI--GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~l--GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) ..+.+. +...++...++.. +..+....+.|+++++.+... --.+.. ..+...||++...+...-...|..+-.+ T Consensus 315 ~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~--~~~~r~TAtEV~~r~~E~~~~LG~v~~r 391 (535) T protein:vir:15 315 QTGDFV-PGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQ--RTGERVTAEEIRYVASELEDTLGGVYSI 391 (535) T ss_pred Cceeee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhccc--CCCccccHHHHHHHHHHHHHHHhHHHHH Confidence 444443 3333455555432 456899999999999998763 112221 2234579999999999999999988888 Q ss_pred HHHHH-HHHHHHHHHHc---C-CCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhc--CCCCc Q lcl|NC_019406. 459 LEDGM-TSVVRYWLMFR---D-IPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKN--GIIPS 531 (661) Q Consensus 459 le~Al-~~aL~~~A~w~---G-~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~--gvl~~ 531 (661) +++=+ .-++.++-..+ | ++......++++| . ..+ .++..+.. .-+-..|...+..- .+++. T Consensus 392 l~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y----i-s~L-----a~aqr~~~--~~~l~~~~~~la~~~P~~ld~ 459 (535) T protein:vir:15 392 LSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI----S-TGL-----EAIGRGQD--LDKLERCISAWAALAPMQGDP 459 (535) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE----e-cHH-----HHHHHHHH--HHHHHHHHHHHHhcChhhhhc Confidence 65543 33333333333 2 2222233344443 2 222 11111111 11112233333322 34555 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA 611 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 611 (661) ..++++..+.+.+- +|.|-. .+.+. ++++.+. .+|+..+.+.-+.+.+.++.- .+ T Consensus 460 ~id~d~~~~~~a~~---~Gvp~~-~i~~~----~eev~~~-------~~q~~~~~~~~~~a~~~g~~~----------~~ 514 (535) T protein:vir:15 460 DINLAVIKLRIANA---IGIDTS-GILLT----DEQKQAL-------MMQDAAQTGIENAAATGGAGV----------GA 514 (535) T ss_pred cCCHHHHHHHHHHH---cCCChh-hhcCC----HHHHHHH-------HHHHHHHHHHHHHHHHHHhhc----------cc Confidence 57888888888764 333211 11111 1111111 111111111111111111110 00 Q ss_pred HHhcCChhhhhhhhhhhhHHHHhhcccccCCCC Q lcl|NC_019406. 612 SRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVT 644 (661) Q Consensus 612 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 644 (661) .-+ ..||... +..+ ++--++| T Consensus 515 ~~~-~~p~~~~------~~~~-----~~g~~~~ 535 (535) T protein:vir:15 515 LAT-SSPEAMQ------GAAA-----QAGLDAT 535 (535) T ss_pred hhc-cChHHHH------HHHh-----ccCCCCC Confidence 001 1222211 1111 1111222 No 117 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=95.84 E-value=0.0014 Score=36.27 Aligned_cols=483 Identities=9% Similarity=-0.008 Sum_probs=190.4 Q ss_pred ccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHh Q lcl|NC_019406. 11 IRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMV 90 (661) Q Consensus 11 ~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~ 90 (661) .+.|+.+-=+.- .+. .+..+|+-|.+.+ ||.....+...-..++.+ .|-+.-.+.++.++ T Consensus 1 mk~~~~~~~~~l--kr~----~~e~~w~e~a~~t-------------lP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LA 60 (510) T protein:vir:78 1 MKSTAAMLWEKL--RDG----SVEQRAIEFAKTT-------------LPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLA 60 (510) T ss_pred ChhHHHHHHHHH--hcc----chHHHHHHHHHhh-------------ccccccCCCCcccccccC-cccchHHHHHHHHH Confidence 222211110000 011 1334455555544 332211111111112222 24444445555544 Q ss_pred chhhcc--Ccc--c-c-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHHHHHHHHHHhh Q lcl|NC_019406. 91 GQIFRR--PPV--I-R-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFAKTVALEQVAM 157 (661) Q Consensus 91 G~vFrk--~p~--i-~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~~~~~~~L~~ 157 (661) ..+..- ||. + . .+.+. ++.+..+ +.....++.+|+.| -+.-++.+.-+-.++.+...+ T Consensus 61 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~---------~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~ 131 (510) T protein:vir:78 61 AKLARSLFPTGIPFFRSELTDAIRREADSR---------DTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVT 131 (510) T ss_pred HHHHHhhcCCCCcccccCCChHHhhhcccC---------cchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 333221 111 1 0 11111 1111000 01111223333322 134567888888888898999 Q ss_pred CCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) |-+.+++|-+.. .+..|+-.+ +-+. .++.+.+.-|..++...... -+-.| T Consensus 132 G~a~l~~~~~~~---------~~~~~pl~~---y~v~-~d~~G~vd~i~rr~~~t~~~-----------------l~~~~ 181 (510) T protein:vir:78 132 GNALLYRNSDEA---------TVVAWSLRS---YAVR-RDATGRWMDIVLKQRYKSKD-----------------LDDVY 181 (510) T ss_pred CeEEEEEeCCCC---------eEEEEEcce---eEEe-eCCCcCeeEEEeeeeccHHH-----------------HHHHh Confidence 998888874321 122232222 1111 12222232233333221100 00111 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC---ccc Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG---RTL 314 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~L 314 (661) ..... .. ......+...++++......+. +...|.++...++.. .+ ..++ ..+ T Consensus 182 ~~~~~-----------~~-----~~~~~~~~~v~v~~~V~~~~~~-~~~~~sv~~e~dg~~--i~-----~~~~~~~~e~ 237 (510) T protein:vir:78 182 KQDLM-----------RA-----GRNLSGSGSVDLYTHVQRRKGT-AMDYAEMYHEIDGVR--VG-----ETGRWPIHLC 237 (510) T ss_pred hHHhh-----------hh-----hhccCCCceEEEEEEEEeecCC-CCcEEEEEEEecCee--ec-----cccccccccC Confidence 11000 00 0001112223344433333222 222344443322221 11 1122 236 Q ss_pred ceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEecccceeecCCC Q lcl|NC_019406. 315 PFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVWVVDKE 391 (661) Q Consensus 315 ~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~~lp~~ 391 (661) ++||+.|.-..+..+..+ .--|-|+..||.-+ .+.+..+.+....|.++-+ |.... ..+.-|++..+. |+. T Consensus 238 P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~---~~~l~~a~~a~~~~~lv~p~g~~~~--~~l~~~~~g~~v-~g~ 311 (510) T protein:vir:78 238 PYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLS---EKLGLYELESLEVLNLVDEAKGAVV--DDYQDAEMGDYV-PGG 311 (510) T ss_pred CeeeeeeeecCCCccccchHHHHHHHHHHHHHHH---HHHHHHHHHhhcCCcccCCccccch--hhhccCCCceee-cCC Confidence 788888876666665554 11245666666433 3556667777777877764 33221 234555554443 432 Q ss_pred CCcceEeecC-chhHHHHHHHHHHHHHHHHHHhHHhccc-ccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH-----HH Q lcl|NC_019406. 392 SGIPGIIEFK-GEGLKTLERALNEKEQQIAAIGGRLMPG-MSKSVSESDNQSALREANEQSLLLNVIMALEDG-----MT 464 (661) Q Consensus 392 ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGArll~~-~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A-----l~ 464 (661) ....+.++.. +..+....+.|+++++.+... =|+.. +..+...||++...+...-...|..+-..+.+- +. T Consensus 312 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~a--F~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~ 389 (510) T protein:vir:78 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQA--FMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAY 389 (510) T ss_pred cccccccccCcccchHHHHHHHHHHHHHHHHH--HhhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 2334444432 345788899999999998863 22221 223345699999999999999999877774443 33 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhc-CCCCHHHHHHHHHhcCCCCccCCHHHHHHHHh Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEG-GLLPIDALYENFVKNGIIPSTQTLEEFTIKMN 543 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~a-G~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~ 543 (661) +++.++-+ .|+....+..++..+ ...+.+ |-.+... +..+...++..+-.-.-+.+..++++..+.+. T Consensus 390 r~~~il~r-~gl~p~p~~~~~~~~-----v~~is~-----Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a 458 (510) T protein:vir:78 390 VCLSEVDD-ALLQGLITKQHKPAI-----ETGLPA-----LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) T ss_pred HHHHHHHh-ccCCCCCccccccee-----eecccH-----HHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHH Confidence 33444332 243322222222211 111211 1111111 01111112222211122455678888888887 Q ss_pred ccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhhh Q lcl|NC_019406. 544 DPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAKP 623 (661) Q Consensus 544 ~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 623 (661) +.. |.|-+..+. . ++++.+- ++ ..+||..+.|.-+.|+.+.=.. T Consensus 459 ~~~---Gv~p~~ivr-s----~eev~a~---~~-~~~~q~~~~~~~~~a~~~~~~~------------------------ 502 (510) T protein:vir:78 459 AAF---SVDTSQFYK-S----ADELQAE---AE-EQRRQAAQAQAAQETLLEGASD------------------------ 502 (510) T ss_pred HHh---CCChhhhcC-C----HHHHHHH---HH-HHHHHHHHHHHHHHHHHHhhhh------------------------ Confidence 652 322221111 0 1111000 00 0001111111111111000011 Q ss_pred hhhhhhHHHHhhcccccCCCCC Q lcl|NC_019406. 624 SKAEQAQIDAQQKQAAAKPVTP 645 (661) Q Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~ 645 (661) -+++++.- T Consensus 503 --------------~~~~~~g~ 510 (510) T protein:vir:78 503 --------------MTNALAGV 510 (510) T ss_pred --------------hcccCCCC Confidence 11111111 No 118 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=482 Identities=11% Similarity=0.043 Sum_probs=185.7 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) --|...-++.=+ .+.-..+-.....+|+-|.+.+--. + .+.+.+.. +.+| .|-+ T Consensus 7 ~~~~~~~~l~~r----------~~~Lk~~R~~~e~~w~e~~~~tlP~----------~--~~~~~~~~---~~~~-~~ds 60 (515) T protein:vir:70 7 EYGGQRSKIPKL----------WEKFSKKRSPYLDRAKHFAKLTLPY----------L--MNNKGDNE---TSQN-GWQG 60 (515) T ss_pred hhcCCHHHHHHH----------HHHHHHhhhHHHHHHHHHHHHhccc----------c--cCCCCCcc---cccc-cccc Confidence 001111110000 0000111122334455555444331 1 11111111 1111 3444 Q ss_pred hHHHHHHHHhc----hhhccC-cccc-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhcc------CCCCCHHHHH Q lcl|NC_019406. 81 MTSQTQAGMVG----QIFRRP-PVIR-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFA 147 (661) Q Consensus 81 ~~~~tv~~l~G----~vFrk~-p~i~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa 147 (661) .-.+.++.++. .+|.-- |=+. .+.+. ++.+ |+ .+.....+.++|+.+. +.-++.+.-+ T Consensus 61 tg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l----~~-----~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~ 131 (515) T protein:vir:70 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVL----DD-----RGLKKTQLATIFARVETTAMKALEQRQFRPAI 131 (515) T ss_pred hHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhcc----cc-----chhHHHHHHHHHHHHHHHHHHHHHhcCchHHH Confidence 44455554443 333210 1110 01111 1111 00 0011112222222221 3456888888 Q ss_pred HHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee Q lcl|NC_019406. 148 KTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG 227 (661) Q Consensus 148 ~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~ 227 (661) -.++.+....|-+.+++|-+ . + +..|+-.+ +-+. .++.+.+.-|..++......- T Consensus 132 ~~~~~~L~~~G~a~l~~d~~-~-----~----~~~~pl~~---y~v~-~d~~G~v~~i~rr~~~t~~~l----------- 186 (515) T protein:vir:70 132 VEVFKHLIVAGNCLLYKPSK-G-----A----MSAVPMHH---YVVN-RDTNGDLMDVILLQEKALRTF----------- 186 (515) T ss_pred HHHHHHHHhHCeEEEEEeCC-C-----C----eEEEEcCe---EEEe-eCCCcCeeEEEeeeeccHHHH----------- Confidence 88999999999999999832 1 1 11222222 1111 122222333333332221100 Q ss_pred eechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 228 REGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 228 ~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) +-.|... .++..... . ... +...++|.+...... ..|.++...++. ... T Consensus 187 ------~~~f~~~------~~~~~~~~--~---~~~---~~~v~i~~~v~~~~~----~~~~~~~e~d~~--~~~----- 235 (515) T protein:vir:70 187 ------DPATRMA------IEVGMKGK--K---CKE---DDNVKLYTHAQYAGE----GFWKINQSADDI--PVG----- 235 (515) T ss_pred ------HHhhhhh------hhhhhhhh--h---cCC---CCceEEEEEEEecCC----CceEEEEecCce--eec----- Confidence 0001100 00000000 0 011 112234443333221 233333322221 111 Q ss_pred ccCC---cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEec Q lcl|NC_019406. 308 MVRG---RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIG 381 (661) Q Consensus 308 ~~~g---~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iG 381 (661) ..+| ..+++||+.|.-..+..+..+ .--|-|+..||.-+-. -+..+......|.++-+ |... ...+.-| T Consensus 236 ~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~---~l~~~~~a~~p~~lv~~~g~~~--~~~l~~~ 310 (515) T protein:vir:70 236 KESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEA---MARGAALMADIKYLIRPGSQTD--VDHFVNS 310 (515) T ss_pred cccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHH---HHHHHHHhcCCCeeeCcccccc--hhhcccc Confidence 1222 347888888876666665443 1114577777654432 24444444444444443 3322 1235556 Q ss_pred ccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHHhH--HhcccccCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIGG--RLMPGMSKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGA--rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) +++.+ .|......+-++.. +..+....+.|+++++.+.++=. .+... .+...|||+...+...-...|.-+-.+ T Consensus 311 ~~g~i-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r--d~~rvTAtEV~~r~~E~~~~LGpv~sr 387 (515) T protein:vir:70 311 GTGEV-ITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRR--DAERVTAVEIQRDALEIEQNMGGVYSL 387 (515) T ss_pred CCcee-ecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhcc--CCccccHHHHHHHHHHHHHHhhHHHHH Confidence 65554 34333455555532 34588889999999999876432 13222 233579999999999999999888888 Q ss_pred HHHHHHHHHHHHHH-HcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCC---CCccC Q lcl|NC_019406. 459 LEDGMTSVVRYWLM-FRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-NGI---IPSTQ 533 (661) Q Consensus 459 le~Al~~aL~~~A~-w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~gv---l~~~~ 533 (661) +.+=+-.=|-.+++ =++-.. -...+ +.+|. .. +.+|..+.....|. .+...+.- ..+ +.+-. T Consensus 388 L~~Ell~Pli~r~~~~~~p~~-P~~~v----~~~~v-s~-----l~~L~r~q~~~~i~--~~~q~i~~~~~~~p~~~~~i 454 (515) T protein:vir:70 388 FAMTMQTPIAMWGLQEAGDSF-TSELV----DPVIV-TG-----IEALGRMAELDKLA--NFAQYMSLPQTWPEPAQRAI 454 (515) T ss_pred HHHHHHHHHHHHHHHhhCCCC-Chhhc----cccee-hh-----HHHHHHHHHHHHHH--HHHHHHHHHhccChhHHhhC Confidence 76655443322221 112111 11112 22221 11 22222222211121 12222211 122 22346 Q ss_pred CHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHH Q lcl|NC_019406. 534 TLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASR 613 (661) Q Consensus 534 ~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 613 (661) ++++..+.+....+. + . .+-.+++.-++-.+|+.+..|...++.+.+++. -+ T Consensus 455 d~d~~~~~~a~~~g~-p--~-------------~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~---------~~--- 506 (515) T protein:vir:70 455 RWGDYMDWVRGQISA-E--L-------------PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAV---------PG--- 506 (515) T ss_pred CHHHHHHHHHHHhCC-C--c-------------cccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhc---------cc--- Confidence 777777777654321 1 0 011111111111122222222222222222221 11 Q ss_pred hcCChhhhhhh Q lcl|NC_019406. 614 KLGDPEQAKPS 624 (661) Q Consensus 614 ~~~~~~~~~~~ 624 (661) +--++.|.+ T Consensus 507 --~~~~~~~~~ 515 (515) T protein:vir:70 507 --VIQQEMKEG 515 (515) T ss_pred --chhhhhccC Confidence 111111111 No 119 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=95.72 E-value=0.0016 Score=35.95 Aligned_cols=486 Identities=12% Similarity=0.064 Sum_probs=185.6 Q ss_pred CCCCCCcccccccccccccc--CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQ--FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) |+- ||.-++.. .-.+.-.-+-....++|+-|.+.+ ||..-..+...=..++. -.| T Consensus 1 ~~~---------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~~~~~-~~~ 57 (522) T protein:vir:94 1 MAE---------REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVT-------------IPSLFPKESDNSSTEYT-TPW 57 (522) T ss_pred Ccc---------cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHh-------------cccccCCCCCccccccc-ccc Confidence 221 11100000 000000011112344455555444 33221101000001111 145 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhh-ccc--ccc-cccchhhhhhhHhhhhhc------cCCCCCHHHHHH Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGR-DAE--GGV-QVVAPASIGKLLTQLQRF------AKDGTSHQGFAK 148 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~-d~d--G~~-~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~ 148 (661) -+...+.++.+...++.- + .|.. +|+. ... +.. ....++.....+.+|+.+ -+.-++.+.-+- T Consensus 58 dst~~~a~~~Las~l~~~---l--tP~~--~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~ 130 (522) T protein:vir:94 58 QAVGARCLNNLAAKLMLA---L--FPQS--PWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLF 130 (522) T ss_pred cccHHHHHHHHHHHHHhh---c--CCCC--cccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHH Confidence 555556666655444431 1 0111 1111 000 000 000001111222222221 134566888888 Q ss_pred HHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeee Q lcl|NC_019406. 149 TVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGR 228 (661) Q Consensus 149 ~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~ 228 (661) .++.+.+.+|-+.++++-+..+. + . .+..|+-.+ +-+. .++.+++.-|..++....+. . T Consensus 131 ~~~~~L~~~G~a~l~~~~~~~~~---~-~-~~~~~pl~~---y~v~-~d~~G~vd~i~r~~~~~~~~--l---------- 189 (522) T protein:vir:94 131 EALKQLIVSGNCLLYIPEPEQGT---Y-S-PMRMYRLVS---YVVQ-RDAFGNILQIVTIDKVAFSA--L---------- 189 (522) T ss_pred HHHHHHHhhCcEeEeeeccCCCc---e-e-eEEEEEcce---EEEe-eCCCcCeEEEeeeeeccHHh--c---------- Confidence 88999999999888887443211 0 0 122222222 1111 12222232333332221100 0 Q ss_pred echhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeec Q lcl|NC_019406. 229 EGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPM 308 (661) Q Consensus 229 ~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~ 308 (661) ++.+ + ..+. .. ....+...++++....+.+ .|.|+...++. .++. T Consensus 190 --~~~~---~------------~~~~---~~---~~~p~~~v~v~~~v~~~~~-----~~~~~~~~~g~-------~~~~ 234 (522) T protein:vir:94 190 --PEDV---K------------SQLN---AD---DYEPDTELEVYTHIYRQDD-----EYLRYEEVEGI-------EVTG 234 (522) T ss_pred --chHH---H------------HHHh---cc---cCCccceEEEEEEEEeeCC-----ceeEEeeccCc-------eecc Confidence 0000 0 0000 00 0112233445554433332 23333222221 1111 Q ss_pred cCC----cccceeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeE Q lcl|NC_019406. 309 VRG----RTLPFIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYH 379 (661) Q Consensus 309 ~~g----~~L~~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~ 379 (661) ..| ..+++||+.|.-..+..+.. .| |-|+..||.-+- +-+..+......|.++-+ |.... ..+. T Consensus 235 ~~~~~~~~e~P~~~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l~~---~~l~~~~~~~~p~~~v~~~g~~~~--~~~~ 307 (522) T protein:vir:94 235 TDGSYPLTACPYIPVRMVRLDGEDYGR--SYCEEYLGDLNSLETITE---AITKMAKVASKVVGLVNPNGITQP--RRLN 307 (522) T ss_pred cCCCCccccCCceeeeeeecCCCcccc--chHHHHHHHHHHHHHHHH---HHHHHHHHHhCCceeecccccccc--hhee Confidence 122 23677777776655555544 34 446666665432 334444444455544433 33222 1244 Q ss_pred ecccceeecCCCCCcceEeec-CchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHH Q lcl|NC_019406. 380 IGPGRVWVVDKESGIPGIIEF-KGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIM 457 (661) Q Consensus 380 iGs~~~~~lp~~ga~~~ylE~-~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~ 457 (661) -|..+.+. +....+.+.++. ++..+....+.|+++++.+... -..++. ...+...||++...+...-...|..+-. T Consensus 308 ~~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~r~TAtEV~~r~~E~~~~LG~v~~ 385 (522) T protein:vir:94 308 KAATGEFV-AGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAV-QRNAERVTAEEIRYVAGELEATLGGVYS 385 (522) T ss_pred ccCCceee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhc-cCCCccccHHHHHHHHHHHHHHHhHHHH Confidence 44444443 333334554442 2456888999999999998753 111221 1234467999999999999999998888 Q ss_pred HHHHHH-HHHHHHHHHHc---C-CCCCCcceEEEEeccccccccCC----HHHHHHHHHHHhcCCCCHHHHHHHHHhc-- Q lcl|NC_019406. 458 ALEDGM-TSVVRYWLMFR---D-IPLTDTATLRYEIDATFLTTALD----ARALRAIQQLYEGGLLPIDALYENFVKN-- 526 (661) Q Consensus 458 ~le~Al-~~aL~~~A~w~---G-~~~~~~~~~~v~ln~DF~~~~ld----a~~l~all~~~~aG~Is~et~~~eL~r~-- 526 (661) .+++=+ .-++..+-..+ | ++......++++ |. ..+. .+++..++. |+..+..- T Consensus 386 rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~----~~-s~La~~qr~~~~~~l~~-----------~~~~ia~l~P 449 (522) T protein:vir:94 386 VQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPT----VS-TGLEALGRGQDLEKLTQ-----------AVNMMTGLQP 449 (522) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEee----Ee-cHHHHHHHHHHHHHHHH-----------HHHHHHhccc Confidence 765543 33333322222 2 222222333333 33 2221 112333333 22222221 Q ss_pred CCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhh Q lcl|NC_019406. 527 GIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGS 606 (661) Q Consensus 527 gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~ 606 (661) .++.+..++++..+.+.+. +|.|-+.... . ++ +.++. .+||-...+..+++..+. +.+.+ T Consensus 450 ~~~~~~id~d~~~~~~a~~---~Gv~~~~ivr-~----~e---e~~~~----~~q~~~~~~~~~~~~~~~-~~~~a---- 509 (522) T protein:vir:94 450 LSQDPDINLPTLKLRLLNA---LGIDTAGLLL-T----QD---EKIQR----MAEQSSQQAVVQGASAAG-ANMGA---- 509 (522) T ss_pred hhhhhcCCHHHHHHHHHHH---cCCChhhccC-C----HH---HHHHH----HHHHHHHHHHHHHHHHHH-HHhhh---- Confidence 1233446788888888765 3322111111 0 00 00000 111111111111111111 11111 Q ss_pred hhhhHHHhcCChhhhhh--hhh Q lcl|NC_019406. 607 TSVAASRKLGDPEQAKP--SKA 626 (661) Q Consensus 607 ~~~~~~~~~~~~~~~~~--~~~ 626 (661) +.+.+... ++| T Consensus 510 ---------~~~~~~~~~~~~~ 522 (522) T protein:vir:94 510 ---------AVGQGAGEDMAQA 522 (522) T ss_pred ---------hhhcccchhhhcC Confidence 00111111 111 No 120 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=95.70 E-value=0.0016 Score=35.91 Aligned_cols=478 Identities=12% Similarity=0.056 Sum_probs=190.7 Q ss_pred CCCCC-----Cc---------cccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCCh Q lcl|NC_019406. 1 MAGLS-----PN---------SANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDD 66 (661) Q Consensus 1 ~~~~~-----~~---------~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~ 66 (661) |+-+- |. .+.+..+.+-.+....+.-.|. +|.. .++++..-+++. -- T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~------~l~~---------il~~a~~gd~~~----~~ 61 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPA------KLAH---------ILIEAEQGHLQA----QA 61 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHH------HHHH---------HHHhhhCCCHHH----HH Confidence 32211 11 1111111111111111111111 1211 222222112221 11 Q ss_pred HHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHH Q lcl|NC_019406. 67 EDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGF 146 (661) Q Consensus 67 ~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~f 146 (661) +-|+..+.| -.++...++.....|...+..|+ |..= .+.+. -..-..+.+++.++ .+++.+ T Consensus 62 ~L~~~m~e~---D~~i~s~l~~Rk~av~~~~w~I~--p~~~----~~~~~------~~~a~~v~~~l~~~----~~f~~~ 122 (528) T protein:vir:10 62 ELFMDMEER---DAHLFAEMSKRKRAVLGLDWTIE--PPRN----ASAAE------KADAEYLHELLLDL----EGIEDL 122 (528) T ss_pred HHHHHHHhh---ChHHHHHHHHHHHHHhcCCceEe--cCCC----CCHHH------HHHHHHHHHHHhCC----ccHHHH Confidence 235555443 45666666666667777777664 1100 00000 00111233333332 247777 Q ss_pred HHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccccee Q lcl|NC_019406. 147 AKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWI 226 (661) Q Consensus 147 a~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i 226 (661) +..++. ++-||.+.+=+-| +.+ +|.-.+.-+..+- T Consensus 123 i~~~ld-a~~~G~s~~Ei~w-------------------------~~~--~g~~~~~~~~~r~----------------- 157 (528) T protein:vir:10 123 MLDCMD-GVGHGYSAIELDW-------------------------SLQ--GREWLPQAFDHRP----------------- 157 (528) T ss_pred HHHHHh-hhhhcceeEEEEE-------------------------eec--CCceeEEEeeeec----------------- Confidence 777663 6778866655433 221 1211110000000 Q ss_pred eeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccccee Q lcl|NC_019406. 227 GREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYT 306 (661) Q Consensus 227 ~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~ 306 (661) ..++.|.. ++..+++. +.+. T Consensus 158 -----------------------------------~~~f~~~~-------------~~~~~l~~---~~~~--------- 177 (528) T protein:vir:10 158 -----------------------------------QSWFQLNP-------------DDQDELRL---RDNS--------- 177 (528) T ss_pred -----------------------------------ccceeecc-------------CCCcEEec---cCCC--------- Confidence 00000000 00001110 0000 Q ss_pred eccCCcccceeeEE-EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCce----- Q lcl|NC_019406. 307 PMVRGRTLPFIPFV-FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDASE----- 377 (661) Q Consensus 307 p~~~g~~L~~IPfv-~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~~----- 377 (661) ..|.+|+.-=|+ +.+....+...+...|..++..-+---....++..-+..-|.|+++.. |.++++++. T Consensus 178 --~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al 255 (528) T protein:vir:10 178 --IAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRAV 255 (528) T ss_pred --CCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHH Confidence 011222211122 223334445556666777766665555566778888888899998875 434443332 Q ss_pred eEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH--HHhHHhcccccCccch--hHHHHHHHHHHhhHHHH Q lcl|NC_019406. 378 YHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA--AIGGRLMPGMSKSVSE--SDNQSALREANEQSLLL 453 (661) Q Consensus 378 l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~--~lGArll~~~~~~~~e--Tataa~~d~~~~~S~L~ 453 (661) ..||+.++.++|. |..+.|++.++.+......-++....+|. .+| ..+... .+..+ |--...........++. T Consensus 256 ~~i~~~~~~iiP~-~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLG-qtlTs~-~~~g~~gS~Alg~vh~~v~~di~~ 332 (528) T protein:vir:10 256 TGLGHAAAGIIPE-SMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILG-GTLTSQ-TSESGGGAYALGQVHNEVRHDLLA 332 (528) T ss_pred HHHhhCcEEEecC-CceeEEeecCCCChhHHHHHHHHHHHHHHHHHhh-hhhhcc-ccccccchhhhHHHHHHHHHHHHH Confidence 3588899999995 78999999887777777777777777766 345 344221 11122 22234456666777889 Q ss_pred HHHHHHHHHHH-HHHHHHHHHcCCCCCCc-ceEEEEeccccccccCCH-HHHHHHHHHHhcCC-CCHHHHHHHHHhcCCC Q lcl|NC_019406. 454 NVIMALEDGMT-SVVRYWLMFRDIPLTDT-ATLRYEIDATFLTTALDA-RALRAIQQLYEGGL-LPIDALYENFVKNGII 529 (661) Q Consensus 454 ~~A~~le~Al~-~aL~~~A~w~G~~~~~~-~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~-Is~et~~~eL~r~gvl 529 (661) +-+..++++++ +++.+++.|-..+..+. .-.+|.+.. . ...|- ...+.+..+...|. |+.+.+++. -|+ T Consensus 333 aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~--~-e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~---~gi- 405 (528) T protein:vir:10 333 ADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDL--K-DRADLAAMATSLPPLVKLGVQVPVNWVQEQ---LGI- 405 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecC--C-CcccHHHHHHHHHHHHhCCCCCCHHHHHHH---hCC- Confidence 99999999997 58999999986432221 112344321 1 11222 23556666777887 786655443 355 Q ss_pred CccCCHHHHHHHHhccCCCCCCch---hhhhhcCCccccCCCcchh--------hhhcCChhhH-------HHHHHHhcc Q lcl|NC_019406. 530 PSTQTLEEFTIKMNDPKSFIGQPD---AIAMRRGYVSRQQELDQQR--------AARDADFQQQ-------ELEQAERHL 591 (661) Q Consensus 530 ~~~~~~Eee~~~l~~~~~~l~~dd---ae~~~~g~~~~~~~~~q~~--------~~~e~d~~q~-------~~~~~e~~~ 591 (661) |.-...|++. ..+.+.-+... .......+.....+....+ .....|++.. ..+.-+... T Consensus 406 p~p~~~e~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~l~~i~~~l~~~~ 482 (528) T protein:vir:10 406 PLPANGEAVL---GDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQVLASLPAQDMQNQADSLVAPLLDVISRGG 482 (528) T ss_pred CCCCCCcccc---cCCCcccccccCcccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 2222223222 11111110000 0000000000000000000 0000011110 000011111 Q ss_pred CCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHHHHh-hcccccCCCCC Q lcl|NC_019406. 592 EIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQ-QKQAAAKPVTP 645 (661) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 645 (661) -+++-+.+|.+-.+++. +.++...=+..--++.= =+-.+...+-. T Consensus 483 s~ee~~~~L~~l~~~~d---------~~~l~~~l~~a~~~A~l~G~~~~~~e~~~ 528 (528) T protein:vir:10 483 SEAELLGALAEAFPDMD---------DSALADALHRLLFVADTWGRLNGTLDRID 528 (528) T ss_pred CHHHHHHHHHHHhhcCC---------HHHHHHHHHHHHHHHHHhhhhhccccccC Confidence 11222222222111111 11111100000000000 00000000000 No 121 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=95.62 E-value=0.0017 Score=35.72 Aligned_cols=450 Identities=12% Similarity=0.060 Sum_probs=183.9 Q ss_pred ccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC--CChHHHHHH-Hhhhc----ccch Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG--FDDEDYANY-LDRAA----FYNM 81 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~--E~~~~Y~~r-l~rA~----~~n~ 81 (661) .||= ..-|....|...+.+..+......|.|...-+..+ +.|..-. ..-..+..+ ..||- =.++ T Consensus 1 mn~~-------dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~--~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~ 71 (502) T protein:vir:79 1 MAIL-------DDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHK--ARRENRTADQLSQYGAVSLREQARYLDNNHDL 71 (502) T ss_pred CchH-------hhHHhhcChHHHHHHHhhHHHHhhccccCcccccC--CCCCCCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 1211 11145677877787777777777777664433222 2332211 111112111 12221 1233 Q ss_pred HHH----HHHHHhch-hhccCccccccchhh-HhhhhcccccccccchhhhhhhHhhhhhccCCCC-CHHHHHHHHHHHH Q lcl|NC_019406. 82 TSQ----TQAGMVGQ-IFRRPPVIRNLPNTG-AITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGT-SHQGFAKTVALEQ 154 (661) Q Consensus 82 ~~~----tv~~l~G~-vFrk~p~i~~~p~~l-~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~-sL~~fa~~~~~~~ 154 (661) .+. .++..+|- -++--|.+..-.... +.|.+ ..-..|..++++||..|. +++.+.+.+++.. T Consensus 72 a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~-----------~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~ 140 (502) T protein:vir:79 72 VIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAA-----------EIRTRWSEWSVSPEVTGQFTRPMLERLMLRTW 140 (502) T ss_pred HHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHH-----------HHHHHHHHhhcCcCccccCCHHHHHHHHHHHH Confidence 344 44445553 232222221000000 11111 111356788999999997 8999999999999 Q ss_pred HhhCCEEEEEeccCCCchhhcccce---eEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeech Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSY---TVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGS 231 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY---~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~ 231 (661) +..|=|++..-+.+... .....|| +-+|.|+.|-+-.- +|. .|+ T Consensus 141 ~~dGE~f~~~~~~~~~~-~~~g~~~~l~lq~iepd~l~~~~~---~~~------~i~----------------------- 187 (502) T protein:vir:79 141 LRDGEVFAQMVSGRINS-LTPSAGVHFWLEALEPDFIPMTSD---ESN------RLN----------------------- 187 (502) T ss_pred HhCCceEEEEeecccCc-cCCCcccceEEEEecchhcCCCCC---CCC------eeE----------------------- Confidence 99999999886643211 1111222 34455544421110 000 000 Q ss_pred hhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCc-ccccccceeeccC Q lcl|NC_019406. 232 ETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDP-LGQARDVYTPMVR 310 (661) Q Consensus 232 e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~-~~~~~~~~~p~~~ 310 (661) +|+ + .+.+|..+ .++++...+ .+. T Consensus 188 -----------~GV------------e---------------------~d~~Gr~~-aY~i~~~hPgd~~---------- 212 (502) T protein:vir:79 188 -----------QGV------------F---------------------VDDWGRPE-KYLVYKSRPVSGR---------- 212 (502) T ss_pred -----------eee------------E---------------------ECCCCceE-EEEEeecCCCCCc---------- Confidence 010 0 01111111 111111111 000 Q ss_pred Ccccceee---EEE-EecCCCCCCccccchhHHHH--HHHHHHhhhhhHHHHHHHhcCceeEEecCCC----------CC Q lcl|NC_019406. 311 GRTLPFIP---FVF-FGSMSNAADCEKPPLLDIVE--LNLKHYRTYAELEHGRFFTALPTYYAPELDD----------SD 374 (661) Q Consensus 311 g~~L~~IP---fv~-~~~~~~~~~~~~pPLldLA~--LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~----------~~ 374 (661) +..+..|| ++- +...+.+-.-+.|+|...-. -++..|. .|.+....--+.+...+-++..+ .. T Consensus 213 ~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~-dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 291 (502) T protein:vir:79 213 QMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYE-DSELTAARIAAALGMYIRKGDGQSYEPDGNGSKEN 291 (502) T ss_pred ccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHH-HHHHHHHHHhhhheeeeecCCCcccccccCCCCCc Confidence 01122233 221 12233334445565543211 1233444 23344444444444443332211 11 Q ss_pred CceeEeccccee-ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH-HHh--HHhcccccCccchhHHHHHHHHHHhhH Q lcl|NC_019406. 375 ASEYHIGPGRVW-VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA-AIG--GRLMPGMSKSVSESDNQSALREANEQS 450 (661) Q Consensus 375 ~~~l~iGs~~~~-~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~-~lG--Arll~~~~~~~~eTataa~~d~~~~~S 450 (661) ...+.+++++.+ +|+ +|-+++|+.++.. .......++.+..+|. .+| ..+|..--+..=.|+-+.-+++-.... T Consensus 292 ~~~~~l~pG~i~~~L~-pGe~i~~~~p~~p-~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~ 369 (502) T protein:vir:79 292 ERELTIQPGIIYDDLK-PGEEIGMVKSDRP-NPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYL 369 (502) T ss_pred cccccccCCccccccC-CCceeeeeCCCCC-CCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHH Confidence 123678888765 455 4789999987532 2233333333333332 112 122322111111234455555555555 Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHcCCC-CCCcceEEEEecccccccc---CCHH-HHHHHHHHHhcCCCCHHHHHHH Q lcl|NC_019406. 451 LLLNVIMALEDGMTSVVRYW---LMFRDIP-LTDTATLRYEIDATFLTTA---LDAR-ALRAIQQLYEGGLLPIDALYEN 522 (661) Q Consensus 451 ~L~~~A~~le~Al~~aL~~~---A~w~G~~-~~~~~~~~v~ln~DF~~~~---lda~-~l~all~~~~aG~Is~et~~~e 522 (661) .++.+. +....+-+.++| |...|.- .++...-.-.++-+|.... +|+. ++++.+.++.+|..|++.... T Consensus 370 ~~q~~~--~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a- 446 (502) T protein:vir:79 370 ILQDWF--IGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVR- 446 (502) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH- Confidence 555421 112222233322 2223321 1111011111122333333 4664 899999999999999988754 Q ss_pred HHhcCCCCccCCHHHHHHHHhccCC-----CCCCch-hhhhhcCCccccCCCcchhhhhcCChhhH Q lcl|NC_019406. 523 FVKNGIIPSTQTLEEFTIKMNDPKS-----FIGQPD-AIAMRRGYVSRQQELDQQRAARDADFQQQ 582 (661) Q Consensus 523 L~r~gvl~~~~~~Eee~~~l~~~~~-----~l~~dd-ae~~~~g~~~~~~~~~q~~~~~e~d~~q~ 582 (661) ++|. +|+++.+.++.+.. +|.++. ....+.+.....+ .+.+....|-.|| T Consensus 447 --~~G~-----D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~---~~e~~~~~~~~e~ 502 (502) T protein:vir:79 447 --AGGR-----NPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATK---RQEPQHTDDQSEE 502 (502) T ss_pred --HcCC-----CHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCC Confidence 4464 55555554443311 222221 1111111111111 1111111111111 No 122 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=95.59 E-value=0.0018 Score=35.65 Aligned_cols=518 Identities=11% Similarity=0.043 Sum_probs=186.9 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCCCC-CChHHHHHHHhh-hc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAPKG-FDDEDYANYLDR-AA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~~~-E~~~~Y~~rl~r-A~ 77 (661) |+- ++-.....+|+.+..--.= ...|++...-.||.... .++......... -. T Consensus 1 m~~------------------------~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~ 56 (556) T protein:vir:73 1 MAE------------------------TEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKI 56 (556) T ss_pred CCh------------------------hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCcc Confidence 221 1112222223222221100 23333333334553321 122222211111 23 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhh---hhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAIT---GRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQ 154 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l---~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~ 154 (661) |-+.-.+.++.+...++.- + .|+.-..+ ..|.+......+-.++......+-+ -+.-++++.-+..++.+. T Consensus 57 ~dst~~~a~~~Las~l~~~---l--tpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~L 130 (556) T protein:vir:73 57 VDPTGSMAQRILSSGMMSG---I--TSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNE-VFNKSNLYQSLPVMYASL 130 (556) T ss_pred ccchHHHHHHHHHHHHHHh---h--cCCCCcccccccCcccccchHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHH Confidence 4444445555544433321 0 01000000 0011110000011111111111111 123467888888899999 Q ss_pred HhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 155 VAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 155 L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) +.+|-+-++||..+.. +. .+..|+.. ++-+.....-.+.+ |..++. +++..+ T Consensus 131 ~~~G~a~l~~~~~~~~----~~--r~~~~~l~---~~~~~~d~~G~vd~-i~r~~~------------------~t~~ql 182 (556) T protein:vir:73 131 GTFGTGAMAVMEDDQD----VI--RTMPFPIG---SYYLANSPRGSVDT-CIRQFS------------------MTVRQM 182 (556) T ss_pred HhhCceeeeeeecCCc----eE--EEEEeecc---eeEEeeCCCCCeEE-EEEEEe------------------ccHHHH Confidence 9999999999854311 11 11111111 11121111111111 111111 111111 Q ss_pred h-cchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecc----cccc--eEEEEEEEecCcccccccceee Q lcl|NC_019406. 235 Q-RTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQ----KDGS--RVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 235 i-~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g----~~g~--~~~~~~~~~~~~~~~~~~~~~p 307 (661) . .|.. .++.......+ .. +..+.. -++++....... ..+. ..|....|..+..+. .+. T Consensus 183 ~~~fg~---~~l~~~v~~~~----~~-~~~~~~---~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~----~vl 247 (556) T protein:vir:73 183 VQEFGL---DNVSTSVKGMW----EN-GTYETW---VEVNHCITPNVNRDSGKMDSKNKPYRSVYFESGGDSD----KLL 247 (556) T ss_pred HHHcCc---ccCCHHHHHHH----hc-CCccce---EEEEEEEeccccccccccCcccceEEEEEEEecCCCc----eec Confidence 0 1100 00111110011 10 111111 112111111011 1111 122222233221111 111 Q ss_pred ccCC-cccceeeEEEEecCCCCCCccccc---hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEeccc Q lcl|NC_019406. 308 MVRG-RTLPFIPFVFFGSMSNAADCEKPP---LLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPG 383 (661) Q Consensus 308 ~~~g-~~L~~IPfv~~~~~~~~~~~~~pP---LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~ 383 (661) ..+| ..+++||+.|.-..+..+..+.|- |-|+..||.-+-.. -..++.+.-|.+.++ ++.....+.+.++ T Consensus 248 ~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~----l~~~~~~~~pp~~v~--~~~~~~~~~~~pg 321 (556) T protein:vir:73 248 RESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRK----AQLIDKATNPPMVAP--TSLKNQRVSLLPG 321 (556) T ss_pred ccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHH----HHHHHHHhcCceecc--ccccccceeeccC Confidence 2233 358889998887777776655432 44666666544332 334444555544443 2222235677777 Q ss_pred ceeecCCCC--CcceEeecCchhHHHHHHHHHHHHHHHHHH-hHH---hcccccCccchhHHHHHHHHHHhhHHHHHHHH Q lcl|NC_019406. 384 RVWVVDKES--GIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGR---LMPGMSKSVSESDNQSALREANEQSLLLNVIM 457 (661) Q Consensus 384 ~~~~lp~~g--a~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GAr---ll~~~~~~~~eTataa~~d~~~~~S~L~~~A~ 457 (661) .....+..+ ..+.-+......+..+.+.|+++++.+... -+. ++. ..++...||++...+...--..|..+-. T Consensus 322 g~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~-~~~~~r~TAtEv~~r~~E~~~~LG~v~~ 400 (556) T protein:vir:73 322 DVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQ-NINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (556) T ss_pred ccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhc-cCCCCCccHHHHHHHHHHHHHHhhHHHH Confidence 655443222 223333222234777788889998888743 111 221 1234467999999999999999999888 Q ss_pred HHHHH-HH----HHHHHHHHHcCC-CCC----CcceEEEEeccccccccCCHHHHHHHHHHHhcC-CCCHHHHHHHHHhc Q lcl|NC_019406. 458 ALEDG-MT----SVVRYWLMFRDI-PLT----DTATLRYEIDATFLTTALDARALRAIQQLYEGG-LLPIDALYENFVKN 526 (661) Q Consensus 458 ~le~A-l~----~aL~~~A~w~G~-~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG-~Is~et~~~eL~r~ 526 (661) ++.+= +. +++.++-+ .|. +.. ...++++ +|.. .+ .+..++.... .+..-.++..|... T Consensus 401 rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v----~yis-~L-----a~aqk~~~~~~i~~~~~~~~~laq~ 469 (556) T protein:vir:73 401 RLNDEALNPLIDRVFSIMAR-KNMLPEPPDVLQGMPLRI----EYIS-VM-----AQAQKSIGLTSLSQTVGFIGQLAQF 469 (556) T ss_pred HHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEE----Eeec-HH-----HHHHHHHHHHHHHHHHHHHHHHhcc Confidence 87443 33 33343333 121 111 1122222 2322 12 2222222111 11111222222211 Q ss_pred --CCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCch-h-HHHhh Q lcl|NC_019406. 527 --GIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEE-K-LRISA 602 (661) Q Consensus 527 --gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~-~-~~~~~ 602 (661) ++ -+..++++..+.+.+- +|.|. ..+. .. ++..+ +.||..+.|..+.+.+.. + +++.. T Consensus 470 ~Pe~-~d~id~d~~~~~~a~~---~Gvp~-~~ir-s~----eev~~--------~rq~r~~~qq~~~~~~~~~~a~~~~~ 531 (556) T protein:vir:73 470 KPEA-LDKLDVDQAIDAFSEM---SGVSP-TVIV-PQ----EQVQG--------IREERAKQAQAAQAMAMGQAAAQGAK 531 (556) T ss_pred Chhh-HhcCCHHHHHHHHHHH---cCCCh-hhcC-CH----HHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1347788888888764 44432 1111 10 10000 111111111111111000 0 11111 Q ss_pred hhhhhhhhHHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCCCCc Q lcl|NC_019406. 603 KVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGRPPQ 657 (661) Q Consensus 603 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 657 (661) ..+++ ...+|+.+.. ..+ . -|-|+| T Consensus 532 ~~~~~------~~~~~~~l~~--------~~~--~--------------~g~~~~ 556 (556) T protein:vir:73 532 TLSET------QTSDPSALTA--------IAN--A--------------AGAPQQ 556 (556) T ss_pred Hhhhc------cCCCHHHHHH--------HHH--h--------------hcCCCC Confidence 11111 1112211111 000 0 112222 No 123 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=95.49 E-value=0.002 Score=35.43 Aligned_cols=500 Identities=13% Similarity=0.113 Sum_probs=190.2 Q ss_pred CCccccCHHHHHH-------HHHHHHHHHHhcchHHHHhCCcccCCCCCC----CChHHHHHH-HhhhcccchHHHHHHH Q lcl|NC_019406. 21 FTHLVVHPEYEYY-------RPDWAKIRDAIAGEREIKAQGVKYLKAPKG----FDDEDYANY-LDRAAFYNMTSQTQAG 88 (661) Q Consensus 21 ~~V~~~hPey~a~-------~~~W~~irD~~~G~~~vr~~g~~YLPk~~~----E~~~~Y~~r-l~rA~~~n~~~~tv~~ 88 (661) |+-..-.-.|..+ ..+|+-|.+. .||.... .....+... ...-.|-+...+.++. T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~-------------~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~ 67 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKY-------------IMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLET 67 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHH-------------hcccccccccCCCCCcccccccccccccchHHHHHHH Confidence 5544433344333 3344444444 3444321 111112111 1122344555555555 Q ss_pred Hhchhhcc--CccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEec Q lcl|NC_019406. 89 MVGQIFRR--PPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDV 166 (661) Q Consensus 89 l~G~vFrk--~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~ 166 (661) ++..++.- ||.. +..+--..|.+......+-.++......+-+. +.-.+++.-+-.++.+.+.+|-+-++++- T Consensus 68 Las~L~~~ltPp~~----~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~-l~~snf~~~~~~~~~~L~~~G~a~l~~~~ 142 (547) T protein:vir:10 68 LSSSLHGSLTSPAT----KWFELAFRDKELNSDDECRKWLENATHDVYSA-LQDSNFNLEANETYIDLCGYGNAIMVEEE 142 (547) T ss_pred HHHHHHHhhcCCCC----cccccccCCccccchHHHHHHHHHHHHHHHHH-HHhcCcHHHHHHHHHHHHhHCcEeEEecc Confidence 54443331 0110 00000001111111011112222222221111 23456777788889999999999888875 Q ss_pred cCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchh Q lcl|NC_019406. 167 APSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLA 246 (661) Q Consensus 167 P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~ 246 (661) ++... . .+++..|+..++ -+.. ++.+.+.-|..++ .+++..+..- =|.. T Consensus 143 d~~~~--~--~~r~~~~pl~~~---~v~~-d~~G~v~~i~r~~------------------~~t~~qi~~~-----fg~~ 191 (547) T protein:vir:10 143 DEDEE--G--SVVFQSSPIQDS---YFEE-DSRGQVVNFYRVF------------------RWTPAQIYDR-----FGDE 191 (547) T ss_pred CCCCC--C--ceeEEEeecceE---EEee-CCCcCeeeeeeee------------------eccHHHHHHh-----cCcc Confidence 43211 1 122333333221 1111 1111111111110 1111111110 0111 Q ss_pred hhhhhhhhhheecccccCCCceeeEEEEEEEeecccccce----E-------E-EEEEEecCcccccccceeeccCC-cc Q lcl|NC_019406. 247 ERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSR----V-------Y-KQFVYVEDPLGQARDVYTPMVRG-RT 313 (661) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~----~-------~-~~~~~~~~~~~~~~~~~~p~~~g-~~ 313 (661) .++-.+-......++. .+...++++...-+...++.. . | .++...++. . .+-..+| .. T Consensus 192 ~l~~~v~~~~~~~~~~---~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~-----~-~~l~esg~~e 262 (547) T protein:vir:10 192 GTPEAIIKKAKEASNQ---AALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGA-----V-QLGEEGGYYE 262 (547) T ss_pred cCCHHHHHHHhcCCCc---ccceEEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCc-----e-eeeecCCccc Confidence 1111111111111111 111112222111111111111 0 1 111111110 0 1111122 34 Q ss_pred cceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCC Q lcl|NC_019406. 314 LPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKE 391 (661) Q Consensus 314 L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ 391 (661) +++||+.|.-..+..+..+ .--|-|+..||.-+-.. + ..++.+.-|.+.++ +++-..++.++++..+... . T Consensus 263 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~---l-~~~~~~~~pp~~v~--~~g~~~~~~~~pgg~~~~~-~ 335 (547) T protein:vir:10 263 MPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELV---L-RSSEKVIDPAIMVT--ERGLISDIDLGASGLTVVR-D 335 (547) T ss_pred CCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHH---H-HHHHHHhcCceecc--cccccccceecCCeeeecC-C Confidence 7888888876666665443 11244666666554332 3 34444555555443 1222345778888766543 3 Q ss_pred CCcceEeecCchhHHHHHHHHHHHHHHHHHHh-H-HhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHH-HHHHHHHH Q lcl|NC_019406. 392 SGIPGIIEFKGEGLKTLERALNEKEQQIAAIG-G-RLMPGMSKSVSESDNQSALREANEQSLLLNVIMALE-DGMTSVVR 468 (661) Q Consensus 392 ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lG-A-rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le-~Al~~aL~ 468 (661) ...+.=++. |..+......|+++++.|...= + .+.. -++...||++...+...-...|..+-..+. +-+.-++. T Consensus 336 ~~~v~pl~~-~~~~~~~~~~i~~~~~rI~~af~~d~~~~--~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~ 412 (547) T protein:vir:10 336 MESMKPFES-RARFDVSSIQLTDLRSAVRRIYYVDQLQM--KDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQ 412 (547) T ss_pred cccceeeec-ccchHHHHHHHHHHHHHHHHHhhhhhhhc--CCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 345554553 5678888999999999887531 1 1221 234568999999999999999998888775 34433333 Q ss_pred HHHHHc---CC-CCC-------CcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCC---ccC Q lcl|NC_019406. 469 YWLMFR---DI-PLT-------DTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-NGIIP---STQ 533 (661) Q Consensus 469 ~~A~w~---G~-~~~-------~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~gvl~---~~~ 533 (661) -+-.++ |+ +.. +...+.|+ |.. -|....++.....| .-|+..+.. .++-| +.. T Consensus 413 r~~~il~r~g~lP~~p~~l~~~~~~~~~v~----~is------~Laraq~~~~~~~i--~~~~~~v~~laq~~P~vld~i 480 (547) T protein:vir:10 413 RTFNIRFRAGKLGELPSKLLESGKAAMDIV----YTG------PLSRAQKIDQAASI--ERWAGSTAQLAEINPEVLDIP 480 (547) T ss_pred HHHHHHHhcCCCCCCchhhhccCcceEEEE----ecc------HHHHHHHHHHHHHH--HHHHHHHHHhhccChhhhhcC Confidence 222222 22 110 11112222 221 11111111111111 112222111 12211 347 Q ss_pred CHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHH Q lcl|NC_019406. 534 TLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASR 613 (661) Q Consensus 534 ~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 613 (661) ++++..+.+.+- +|.| +..+. . ++|..+-+..++ ..+++..|++ +.+ ...+.... T Consensus 481 d~d~~~~~~a~~---~Gvp-~~~ir-s----~eev~~~r~qr~------~~~q~~~qaa-------~~~---~~g~~m~~ 535 (547) T protein:vir:10 481 DWDEMVRMLGSL---LGAP-QTLMR-P----KAKVTSIRKNRS------QTQQKAEQAA-------IAE---AEGNAMEA 535 (547) T ss_pred CHHHHHHHHHHH---hCCC-hhccC-C----HHHHHHHHHHHH------HHHHHHHHHH-------HHH---HHHHHHHh Confidence 788888888764 3333 11111 1 111111110000 0000111111 111 11222111 Q ss_pred hcCChhhhhhhhhhhhHHHHh Q lcl|NC_019406. 614 KLGDPEQAKPSKAEQAQIDAQ 634 (661) Q Consensus 614 ~~~~~~~~~~~~~~~~~~~~~ 634 (661) + | .+.++-++|| T Consensus 536 ~-~--------~~~a~~~~~~ 547 (547) T protein:vir:10 536 Q-G--------KGQAALKENQ 547 (547) T ss_pred h-c--------CcccchhccC Confidence 1 2 1222333333 No 124 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=95.48 E-value=0.002 Score=35.40 Aligned_cols=443 Identities=8% Similarity=-0.032 Sum_probs=157.4 Q ss_pred HHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhccc-chHHHHHHHHhchhhccCcccc----ccchhhHhhhhc Q lcl|NC_019406. 39 KIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFY-NMTSQTQAGMVGQIFRRPPVIR----NLPNTGAITGRD 113 (661) Q Consensus 39 ~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~-n~~~~tv~~l~G~vFrk~p~i~----~~p~~l~~l~~d 113 (661) ++..+| +|-+| ...+-......|.+++.+=.+. .....+++.--..-..|++..+ +-+.-..+.... T Consensus 1 ~~~~~~-----~~~~~---~~m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~ 72 (488) T protein:vir:96 1 MLKCLY-----IKHRG---FFMLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKI 72 (488) T ss_pred CceeEE-----Eeecc---eeecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccc Confidence 111111 11111 0000122223333332221100 0011111110000011111100 000000000000 Q ss_pred ccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccC----------CCchhhcccceeEee Q lcl|NC_019406. 114 AEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAP----------SSDPTAPAKSYTVGY 183 (661) Q Consensus 114 ~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~----------a~~~~~g~rPY~~~~ 183 (661) -+ ..+....=.+.+-++. +..+..++-.+|+.--. ++.|. ++......--++ T Consensus 73 ~~------~y~~~~~~rA~~~n~~--~~tl~~l~G~vfrk~p~-------~~~~~~~~l~~l~~d~D~~G~~L~~f~--- 134 (488) T protein:vir:96 73 EK------DWEDLTWRLANYVNIV--NPTMNAITGAVMRREPE-------FDTMDNPVLIGLRDNIDGKGNGIDQEC--- 134 (488) T ss_pred hh------hhHhhhhhccccCchh--HHHHHHhcchhhccCce-------eccCCcHHHHHHHhccCCCCCCHHHHH--- Confidence 00 0000000000001111 22333333333322111 11110 111110000000 Q ss_pred chhhhccceeeccccccceeeeeee--eeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheeccc Q lcl|NC_019406. 184 AAENIVDWTVEDVDGFYVPTRILLR--EFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPS 261 (661) Q Consensus 184 ~p~~IinW~~~~~~g~~~Lt~v~ir--e~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~ 261 (661) .++...- ..-|+ .+|++. +......+....+.+||+.+|.+++||||++.+++|+..+++++++|++.. T Consensus 135 --~~~~~~~--l~~G~---~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~-- 205 (488) T protein:vir:96 135 --KQALNAL--QWGSR---CGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQE-- 205 (488) T ss_pred --HHHHHHH--HhcCe---EEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEe-- Confidence 0110000 00111 111111 111223344556789999999999999999999999999999988764321 Q ss_pred ccCCCceeeEEEEEEEeecccc--cceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccch--- Q lcl|NC_019406. 262 RFTSSYTFRTIYRELILELQKD--GSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPL--- 336 (661) Q Consensus 262 ~~~~~~~~~~~~rv~~l~~g~~--g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPL--- 336 (661) ..+.+ +..++++++...+. +..... ..++... .+++... ++...+.-|+ T Consensus 206 -----------------~D~~~~~~~~~~~~~~l~~g~---~~v~~~-~~~~~~~---e~~~~~~--g~~~l~~IP~v~~ 259 (488) T protein:vir:96 206 -----------------RDGGTYVSKQRLINHRLVDGL---CEFQEV-TDDEYSD---EWTPVLI--NSKQSDTIPFFLA 259 (488) T ss_pred -----------------ccCCCcccceEEEEEEEECcE---EEEEEE-ecCCccc---ceEeecC--CCcccCeeEEEEE Confidence 11211 23345555444332 111111 1122222 2233221 1112222222 Q ss_pred ---------hHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceee---------cCCCCCcceEe Q lcl|NC_019406. 337 ---------LDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWV---------VDKESGIPGII 398 (661) Q Consensus 337 ---------ldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~---------lp~~ga~~~yl 398 (661) ...=.+.|+|-+..--+..+-... ++.+.++. .+++|...... -..-++++.+. T Consensus 260 ~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~---il~~~~~p-----~lv~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 331 (488) T protein:vir:96 260 SSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNK---AMILANEA-----KWMVDMGDMNKTMASEMNPLGFTLAGRMPYY 331 (488) T ss_pred ecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHH---HHHhcCCc-----eeeeccCCCCcccccccccceeeeccccccc Confidence 122244666654443222222211 23344431 13343221100 00112333333 Q ss_pred ecCchh--HHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019406. 399 EFKGEG--LKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDI 476 (661) Q Consensus 399 E~~g~~--i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~ 476 (661) .+.|.- +++.-.+| .++.|..+=.+|...+-+-..+++.......+-+.+.-.+.=..+...++.+|+.+-+|+.. T Consensus 332 ~~~g~~~~~e~~~~~l--~~~~l~~l~~qm~~~Ga~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~ 409 (488) T protein:vir:96 332 VKNGDVKVIQAQFSPE--TENKVEKLFEQAVKVGASLFTQQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMR 409 (488) T ss_pred ccCCceeecCCchhHH--HHHHHHHHHHHHHHHhHhhccCCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 344321 22223333 24446655555544332333443334555555666666777788999999999999999986 Q ss_pred CCCCcceEEEEeccccccc-cCCHHHH-HHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCc-h Q lcl|NC_019406. 477 PLTDTATLRYEIDATFLTT-ALDARAL-RAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQP-D 553 (661) Q Consensus 477 ~~~~~~~~~v~ln~DF~~~-~lda~~l-~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~d-d 553 (661) -.+........-...|... ++..+.+ .+.+++ +-.+...|.|+....+++.+. +.-|+.| + T Consensus 410 w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~a-----------l~~~~~~G~Is~~t~~~~L~~-----~gvl~~d~~ 473 (488) T protein:vir:96 410 YFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQV-----------AYAAMMEGNLPQVSWFELLKR-----ARVVRGDMS 473 (488) T ss_pred HcCCCCCCcCccceEEEeccCCCCccCCHHHHHH-----------HHHHHhcCCCCHHHHHHHHHh-----CCcCCccCC Confidence 5543322211112234322 2222222 122222 223445677776655555443 1222211 1 Q ss_pred hhhhhcCCccccCCCcc Q lcl|NC_019406. 554 AIAMRRGYVSRQQELDQ 570 (661) Q Consensus 554 ae~~~~g~~~~~~~~~q 570 (661) -+..++- ...+.+.- T Consensus 474 ~e~~~~~--ie~~g~~~ 488 (488) T protein:vir:96 474 KEEFDEH--IAELGFGM 488 (488) T ss_pred HHHHHHH--HhhcCCCC Confidence 1111111 00111111 No 125 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=459 Identities=12% Similarity=0.016 Sum_probs=188.8 Q ss_pred CCCCCCcccccccccc-ccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCC-CCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKR-GAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLK-APKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLP-k~~~E~~~~Y~~rl~rA~~ 78 (661) +.-..|--.+|--++. +...|.-+..+..|..+. .+.|+..-.....-|.+ ...+ .+-+..|. . T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~--~~l~a~Y~----~ 112 (537) T protein:vir:10 47 MMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFS--------AYANPNLSEGLVLWYAQQAFIG--HQMCALIA----T 112 (537) T ss_pred cCCCCCccCcccccccccccchhccccccchhhhh--------hhccccccchhhhhccccCCcc--HHHHHHHH----h Confidence 1122222222222211 122233333332222211 12232222222222222 2222 22333333 3 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) ....+..|+..+...+|+...|+.. |.+ ...++..+.|...++.. .+..-++.+++.+..|| T Consensus 113 ~~l~r~iVd~~A~d~~r~~~~i~~~---------~~~----~~~~~~~~~l~~~~~~l-----~~~~~l~~a~~~~rlyG 174 (537) T protein:vir:10 113 HWLVNKACSQMPRDAMRKGYKIISD---------DGN----ELDPKDAKFIDRYDRAF-----NIKKHAIQFVRKGRIFG 174 (537) T ss_pred CchhhhhhhhhhHHhhcCCceeecC---------Ccc----cccHHHHHHHHHHHHHh-----hHHHHHHHHHHhccccc Confidence 4577888888888899999888521 110 11234445555555444 56788888888888899 Q ss_pred CEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 159 RFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 159 r~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) .++++|.-...+... -..|. .++.| ++..+..+++... ....|++..+ T Consensus 175 ~~~i~i~v~~~D~~~-~~~Pl----~~~~i---------~kg~~k~l~vidp---------~~~~~~~~~~--------- 222 (537) T protein:vir:10 175 IRIALFKVDSPDPYY-YEKPF----NIDGV---------MPGAYKGIVQIDP---------YWCAPLLDAQ--------- 222 (537) T ss_pred ceEEEEeecCcCCcc-ccccc----ccccc---------cccceeEEEEech---------hhcccccchh--------- Confidence 888887654332210 00010 11100 0001111100000 0000110000 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) ....+...+ |..-..|++. |..+ +.+.++ ...|.+++.+. T Consensus 223 -----------------~~~dp~sp~--fg~P~~y~v~-------g~~i-------------H~SRli-~f~g~~~p~~~ 262 (537) T protein:vir:10 223 -----------------ASSNPVSMH--FYEPTYWLIN-------GKKY-------------HRSHLA-IYINDEVVDFL 262 (537) T ss_pred -----------------hhccCCccc--cCCceeeeec-------CeEe-------------cceeEE-EecCCCCchhh Confidence 001111111 1111223221 1111 111111 12233333332 Q ss_pred EEEEecCCCCCCccccchhHHHHHHHHHHhhhh-hHHHHHHHhcCceeEEecCCCCC-Ccee---------Eecccceee Q lcl|NC_019406. 319 FVFFGSMSNAADCEKPPLLDIVELNLKHYRTYA-ELEHGRFFTALPTYYAPELDDSD-ASEY---------HIGPGRVWV 387 (661) Q Consensus 319 fv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sS-Dl~~il~~~~~P~l~i~Gl~~~~-~~~l---------~iGs~~~~~ 387 (661) -- ..+ ..+.|. +..+.=.|..|.... .--.+++...++++-+.|+..-. .+.+ .-+....+. T Consensus 263 ~~----~~~--~~G~Sv-lq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~ 335 (537) T protein:vir:10 263 KP----SYI--YGGVPL-PQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRV 335 (537) T ss_pred hc----ccC--cccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeE Confidence 11 011 123443 444555566665443 44567788888888777653211 1111 124345666 Q ss_pred cCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hH--Hhccccc-CccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GG--RLMPGMS-KSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GA--rll~~~~-~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) ++.++..+..+..+-+++. +.++...+++..+ |. -.|-.++ ++-+.|++ .|...=+..+.++-..+.-++ T Consensus 336 id~e~e~~e~~~~~lsgl~---~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe---~D~~~yyd~I~~~Qe~l~p~l 409 (537) T protein:vir:10 336 VDKDNEDVVQIDTTLNDLD---KVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGD---YEEASYHEECESTQDDMRPLI 409 (537) T ss_pred ecCCCceeEEEeccCCCHH---HHHHHHHHHHHhhhCCCceeeccCCccccccchh---HHHHHHHHHHHHHHHHHHHHH Confidence 7655556666555555543 4444555555433 21 1121221 22222333 344444555666656678888 Q ss_pred HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHH--------HHHHHHHHHhcCCCCHHHHHHHHHhc------CCC Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDAR--------ALRAIQQLYEGGLLPIDALYENFVKN------GII 529 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~--------~l~all~~~~aG~Is~et~~~eL~r~------gvl 529 (661) +.+++++.+-.+.+ ..++.|+.|. ...++.. ..++...++++|.|+..+.+..|+.- ++. T Consensus 410 ~~l~~ll~~~~~~~---~~~~~i~f~p---L~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~ 483 (537) T protein:vir:10 410 DRHHQLVCRSHLRK---RIRVKVEFPP---MDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSIT 483 (537) T ss_pred HHHHHHHHHhcCCC---CcceEEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCcccccccc Confidence 88888877544322 2346666553 1122222 23567888999999999999999873 333 Q ss_pred CccCCHHHHHH-HHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCc Q lcl|NC_019406. 530 PSTQTLEEFTI-KMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDE 595 (661) Q Consensus 530 ~~~~~~Eee~~-~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~ 595 (661) +..+.|++++ .+.++.+.-...+....+ .+-..+.++++ +..+.+..+++-.. T Consensus 484 -~~~~~ed~e~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~---~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 484 -PAMRPTDAEDIDVDDEGKPVRIIEDQPAP---------SEMFGATSSGE---SANDPRDSGAAFED 537 (537) T ss_pred -CCCChhhhhcccCCccCCcCCCCCCCCCc---------cccCCCCcccc---ccCCCccCccccCC Confidence 3333343332 222222211111111111 11111111111 00111111221111 No 126 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=95.17 E-value=0.0026 Score=34.75 Aligned_cols=484 Identities=9% Similarity=0.070 Sum_probs=191.4 Q ss_pred CCccccCH----HHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC-CChH-HHHHHHhhhcccchHHHHHHHHhchh- Q lcl|NC_019406. 21 FTHLVVHP----EYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG-FDDE-DYANYLDRAAFYNMTSQTQAGMVGQI- 93 (661) Q Consensus 21 ~~V~~~hP----ey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~-E~~~-~Y~~rl~rA~~~n~~~~tv~~l~G~v- 93 (661) |..-.+.- +......+|+-|.+.+ ||..-. .... ....++. -.|-+.-.+.++.++..+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~t-------------lP~~~~~~~~~~~~~~~~~-~~~dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELT-------------LPYLIDDDISSRPNHKSLT-VPWQSVGAKCCVTLAAKLM 66 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHh-------------hhcccCCCCCCCccccccc-ccccchHHHHHHHHHHHHH Confidence 44222111 1122244454444444 332211 1111 1111222 244444455555544333 Q ss_pred ---hcc-Ccccc-ccchhhHhhhhcccccccccchhhhhhhHhhhhhcc------CCCCCHHHHHHHHHHHHHhhCCEEE Q lcl|NC_019406. 94 ---FRR-PPVIR-NLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFAKTVALEQVAMGRFGA 162 (661) Q Consensus 94 ---Frk-~p~i~-~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa~~~~~~~L~~Gr~gv 162 (661) |.- .|=+. .+++. .+..+.+ ++.-..++.+|+.|+ +.-++.+.-+-.++.+.+.+|-+-+ T Consensus 67 ~~ltpp~~~WF~l~~~d~--~l~~~~~-------~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 137 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDD--KLGEELD-------PQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALI 137 (522) T ss_pred HhhcCCCCccccccCChH--HHhhhcC-------hhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeE Confidence 321 11110 01110 1111111 111123344444443 4577888889999999999999999 Q ss_pred EEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhh Q lcl|NC_019406. 163 LVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRR 242 (661) Q Consensus 163 LVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~ 242 (661) ++|-.+ . ..|+-.+ +-+. .++.+.+.-|..+++.... .-+..|..... T Consensus 138 y~~~~~-------~----~~~pl~~---y~v~-~d~~G~vd~i~r~~~~t~~-----------------ql~~~fg~~~~ 185 (522) T protein:vir:10 138 FMGKDG-------L----KTFPLTR---YVIN-RDGDGNVLEIVTKELISRK-----------------VLDIELPEPKP 185 (522) T ss_pred EEcCCC-------c----eEEEcce---EEEe-eCCCCCeeEEEeeeeccHH-----------------HHHHhcchhcc Confidence 998532 1 1222111 2111 2223333333333332210 11111211110 Q ss_pred cchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeec---cCC-cccceee Q lcl|NC_019406. 243 AGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPM---VRG-RTLPFIP 318 (661) Q Consensus 243 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~---~~g-~~L~~IP 318 (661) . ..+... ...+...+++.....+.+. ..|.|+....+.. .+. ..| ..+++|| T Consensus 186 ------~-----~~~~~~---~~~~~~v~v~~~v~p~~~~---~~~~~~~~~~~~~-------~~~~~s~~g~~~~P~~~ 241 (522) T protein:vir:10 186 ------N-----TGIDES---STTNDDVTIYTYVKLDKSS---GRWVWHQEAFDKI-------IPDSRSTAPKNASPWLP 241 (522) T ss_pred ------c-----hhhhcc---cCCCCceEEEEEEEeeccC---CceEEEEccCCcc-------ccccccccccccCCcee Confidence 0 000000 0112233444444333221 1233332222211 111 112 2467888 Q ss_pred EEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEE--ecCCCCCCceeEecccceeecCCCC Q lcl|NC_019406. 319 FVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYA--PELDDSDASEYHIGPGRVWVVDKES 392 (661) Q Consensus 319 fv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i--~Gl~~~~~~~l~iGs~~~~~lp~~g 392 (661) +.|.-..+..+.. .| |-|+..||.-+ .+-+. ..+.+.-|.+.+ .|.... ..+.-|.++.+... .. T Consensus 242 ~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l~---~~~~~-~~~~a~~p~~lv~~~~~~~~--~~l~~~~~~~~v~g-~~ 312 (522) T protein:vir:10 242 LRFNTVDGEDYGR--GRVEEFLGDLKSLDGLS---QSLIE-GAAAASKVVFLVSPSSTTKP--ATIAKAGNGAIVQG-RP 312 (522) T ss_pred eeeeecCCCcccc--chHHHHHHHHHHHHHHH---HHHHH-HHHHhcCCceeecccccccc--ccccCCCCcceecC-CC Confidence 8887665555544 44 44777776542 22233 444444444444 233222 12344555555433 33 Q ss_pred CcceEeecC-chhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH-----HH Q lcl|NC_019406. 393 GIPGIIEFK-GEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT-----SV 466 (661) Q Consensus 393 a~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~-----~a 466 (661) ++...++.. +..+....+.+++++..++.+= |+-...++...|||+...+...-...|..+-..+.+=+- ++ T Consensus 313 ~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aF--l~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 390 (522) T protein:vir:10 313 EDVAVIQVGKTADFSTAANMATAIEKRLLEAF--LVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRT 390 (522) T ss_pred ccceeecccccccchHHHHHHHHHHHHHHHHH--hhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 455555543 4567888999999999988742 222233456679999999999999999998888754333 33 Q ss_pred HHHHHHHcCC-CCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-c--CCCCccCCHHHHHHHH Q lcl|NC_019406. 467 VRYWLMFRDI-PLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-N--GIIPSTQTLEEFTIKM 542 (661) Q Consensus 467 L~~~A~w~G~-~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~--gvl~~~~~~Eee~~~l 542 (661) |.++-+ .|+ +....+-+...+ -.|...---++.+..++ .|...+.. . ..+.+..++++..+.+ T Consensus 391 ~~il~r-~g~lP~~p~~~~~~~~-v~~is~Laraq~~~~l~-----------~~~~~i~~~~~p~~~~~~id~d~~~~~~ 457 (522) T protein:vir:10 391 LLVLQR-SNQIPKLPKDIVRPTI-VAGVNALGRGQDRESLT-----------AFVGTIAQTLGPEALMQYLNPLEAIKRL 457 (522) T ss_pred HHHHHh-cCCCCCCCcccccccc-ccchhHHHHHHHHHHHH-----------HHHHHHHHhhCchhhhhcCCHHHHHHHH Confidence 333332 232 111111111111 11221110112222222 22222311 1 2233456788888888 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhh Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAK 622 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 622 (661) .+- +|.|-+..+. . +++ ..++.++.|+.+. ++.+.+..+++... ..++|.+.- T Consensus 458 a~~---~Gvp~~~ivr-t----~ee-----------v~~~~q~~q~~~~-----~~~~~~~a~~~~~~---~~~~~~~~~ 510 (522) T protein:vir:10 458 AAA---QGIDVLNLVK-T----EQQ-----------LAEEQQAAQQQAA-----QQSLVDQAGQMTGS---PLMDPTKNP 510 (522) T ss_pred HHH---hCCChhhhcC-C----HHH-----------HHHHHHHHHHHHH-----HHHHHHHHHHHhcc---cccCccccH Confidence 764 2322111111 0 000 0001111111000 00111100111111 112221100 Q ss_pred hhhhhhhHHHHhhcccccCC Q lcl|NC_019406. 623 PSKAEQAQIDAQQKQAAAKP 642 (661) Q Consensus 623 ~~~~~~~~~~~~~~~~~~~~ 642 (661) ++ .+|-|+.-+. T Consensus 511 --~~------~~~~~~~~~~ 522 (522) T protein:vir:10 511 --QL------MDEEQPPMEE 522 (522) T ss_pred --HH------HHHhCCCCCC Confidence 01 1111111111 No 127 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=445 Identities=13% Similarity=0.027 Sum_probs=170.8 Q ss_pred ccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC--CChHHHHHHHhhhc----ccchH Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG--FDDEDYANYLDRAA----FYNMT 82 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~--E~~~~Y~~rl~rA~----~~n~~ 82 (661) .|+-.+. ..-|.-....+.+. -.|.|....+ +-...|.++. +..........||- =.++. T Consensus 1 m~~~~~~---------~~a~~~~~~~~~~~---~~y~aa~~~~--~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a 66 (495) T protein:vir:10 1 MNMTPSG---------YQSLASGLLVPVGA---SAYEGASGGH--RWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWA 66 (495) T ss_pred CCccccc---------ccccchhhhhHHHh---hhhhccccCc--ccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHH Confidence 3332211 11122222222222 2233322221 1112232221 11111222222221 12333 Q ss_pred HH----HHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCC-CHHHHHHHHHHHHHhh Q lcl|NC_019406. 83 SQ----TQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGT-SHQGFAKTVALEQVAM 157 (661) Q Consensus 83 ~~----tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~-sL~~fa~~~~~~~L~~ 157 (661) +. .++..+|-=|+--+... + +.+.. ..-..|..++++||..|. +++.+.+.+++..+.. T Consensus 67 ~~av~~~~~~vVG~Gi~p~~~~~---~--~~~~~-----------~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~d 130 (495) T protein:vir:10 67 TNAVATWVAAAVGNGLTPRWRMK---E--QELRQ-----------ELQELWGDWVNEADFDEVQSFYGLQALVVRTVINS 130 (495) T ss_pred HHHHHHHHHhhcCCCcccccCCc---h--HHHHH-----------HHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhC Confidence 44 44444444343222211 1 11111 122457788999999995 9999999999999999 Q ss_pred CCEEEEEeccCCCchhhcccc-eeEeechhhhccceeec--cccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 158 GRFGALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVED--VDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 158 Gr~gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~--~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) |=|++.+-+.+.... +.-| -+-+|.|+.|-+-.-+. .+|. +|+.-+.. T Consensus 131 GE~f~~~~~~~~~~g--~~~~~~lqliepd~l~~~~~~~~~~~g~------~i~~GIe~--------------------- 181 (495) T protein:vir:10 131 GEAFVIKKPRPLSEG--LSVPLQLQIIEPDMLASDIPDETLPSGG------YVKGGIRF--------------------- 181 (495) T ss_pred CceEEEEeecccCCC--CccceEEEEechhhcCCCCCCCCCCCCC------EEEeceEE--------------------- Confidence 999998877542210 0111 24566666653221100 0000 01111100 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccc Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTL 314 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L 314 (661) +.+.-..-|.++.-.+| . +. ....+... T Consensus 182 ------------------------------d~~Gr~vaY~i~~~hpg---------------d-~~------~~~~~~~~ 209 (495) T protein:vir:10 182 ------------------------------SNGGKRKAYCFYRNHPA---------------E-SS------LIGDPVDT 209 (495) T ss_pred ------------------------------CCCCceEEEEEeecCCC---------------c-cc------ccccccce Confidence 01111111222211111 0 00 00001111 Q ss_pred ceeeE---EEEecCCCCCCccccchhHHHH-HHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCC--------------CC Q lcl|NC_019406. 315 PFIPF---VFFGSMSNAADCEKPPLLDIVE-LNLKHYRTYAELEHGRFFTALPTYYAP-ELDDS--------------DA 375 (661) Q Consensus 315 ~~IPf---v~~~~~~~~~~~~~pPLldLA~-LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~--------------~~ 375 (661) -.||. +-+...+-+-.-+.|-|..|-. -.+..|.. |.+....- .+....||+ ...++ .. T Consensus 210 ~rvpA~~vlH~f~~r~gQ~RGis~la~i~~l~~l~~y~d-ael~~a~i-~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~ 287 (495) T protein:vir:10 210 VWIKAEHVLHVTVLTVRSDAGAPWFQLLLRLNELDQYED-AELVRKKT-AALFAAFIQEATADSTGGPTIGQPKRSKGGK 287 (495) T ss_pred eeechhheEeccccCCCcccCcchhHHHHHHHHhhHHHH-HHHHHHHH-hhhheeeeecCCCccccccccCccccccCcc Confidence 22331 1111122222234442222111 12233332 22333333 334455553 21111 01 Q ss_pred ceeEecccceeecCCCCCcceEeecCc--hhHHHHHHHHHHHHHHHH-HHh--HHhcccccCccc-hhHHHHHHHHHHhh Q lcl|NC_019406. 376 SEYHIGPGRVWVVDKESGIPGIIEFKG--EGLKTLERALNEKEQQIA-AIG--GRLMPGMSKSVS-ESDNQSALREANEQ 449 (661) Q Consensus 376 ~~l~iGs~~~~~lp~~ga~~~ylE~~g--~~i~a~~~~L~~le~qM~-~lG--Arll~~~~~~~~-eTataa~~d~~~~~ 449 (661) ..+.++++.+..|+ +|-+++|+.++. .........+ ...++ .+| ..+|..--..++ .|+-+.-+++-... T Consensus 288 ~~~~l~pG~i~~L~-pGe~i~~~~p~~p~~~~~~f~~~~---lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~ 363 (495) T protein:vir:10 288 RITGLNPGTLQYLQ-PGQEVKFSNPADVGTTYEPWLRYQ---LLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLC 363 (495) T ss_pred cceecCCceeeecC-CCCeeeeeCCCCCCCCHHHHHHHH---HHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHH Confidence 13568888888888 478999999763 3334333332 22222 111 122321100111 23444455555555 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHcCC-CCCCcceEE-EEecccccccc---CCHH-HHHHHHHHHhcCCCCHHHHHH Q lcl|NC_019406. 450 SLLLN--VIMALEDGMTSVVRYWLMFRDI-PLTDTATLR-YEIDATFLTTA---LDAR-ALRAIQQLYEGGLLPIDALYE 521 (661) Q Consensus 450 S~L~~--~A~~le~Al~~aL~~~A~w~G~-~~~~~~~~~-v~ln~DF~~~~---lda~-~l~all~~~~aG~Is~et~~~ 521 (661) ..++. ++..+-.-+-..+--+|-..|. +.++.-+.. .-.+-+|.... +|+. ++.+.+.++.+|..|++.... T Consensus 364 ~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a 443 (495) T protein:vir:10 364 QQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQA 443 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 44432 3333333322222222333342 111100000 00122344333 4664 899999999999999988754 Q ss_pred HHHhcCCCCccCCHHHHHHHHhccCC---CCCCchhhhhhcCCccccCCCcchhhhhcCChhhHH Q lcl|NC_019406. 522 NFVKNGIIPSTQTLEEFTIKMNDPKS---FIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQE 583 (661) Q Consensus 522 eL~r~gvl~~~~~~Eee~~~l~~~~~---~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~ 583 (661) ++|. +++++.+.|+.+.. .+|+.- ++..-..+...+.+...+-+-.++| T Consensus 444 ---~~G~-----D~~~v~~q~a~e~~~~~~~Gl~~-----~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 444 ---ERGY-----DMEELFDMISDANQLIDEYDLRL-----DSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred ---HcCC-----CHHHHHHHHHHHHHHHHHcCCCC-----CCCCCcCCCccCCCCCCCCCCCCCC Confidence 4564 45555544443211 122210 0111111111222111111111112 No 128 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=94.93 E-value=0.0032 Score=34.30 Aligned_cols=512 Identities=12% Similarity=0.094 Sum_probs=188.0 Q ss_pred CCCCCCccccccccccccccC--CccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC-CCCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQF--THLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP-KGFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~-~~E~~~~Y~~rl~rA~ 77 (661) ||-.- +.+.+...- -.+.-.-+-....++|+-|.+.+ ||.. +.+++..- ..+. -. T Consensus 1 ~~~~~-------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~~-~~~~-~~ 58 (543) T protein:vir:88 1 MAETK-------REGLAEEGAKAVYERLKNDRVPYETRAENCAKVT-------------IPSLFPKDSDNSS-TDYT-TP 58 (543) T ss_pred Ccccc-------cCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHh-------------ccccCCCCCCccc-cccc-cc Confidence 32111 110000000 00000111122334454444444 3421 11111110 1111 13 Q ss_pred ccchHHHHHHHHhchhhccCccccccchhhHhhhh----cccccccccchhhhhhhHhhhhhcc------CCCCCHHHHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGR----DAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFA 147 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~----d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa 147 (661) |-+...+.++.+...++.- + .|+. .|+. |.+-......+.....++.+|+.+. +.-++.+.-+ T Consensus 59 ~dst~~~a~~~Laa~l~~~---l--tP~~--~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~ 131 (543) T protein:vir:88 59 WQAVGARGLNNLSAKVMLA---L--FPLQ--SWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTL 131 (543) T ss_pred ccchHHHHHHHHHHHHHHh---h--cCCC--cccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHH Confidence 4444445555554443331 1 1111 1111 0000000001111122333322211 2356678888 Q ss_pred HHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee Q lcl|NC_019406. 148 KTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG 227 (661) Q Consensus 148 ~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~ 227 (661) -.++.+.+.+|-+-+++|-+.... .+.+| +..|.-. .|-+. .++.+.+.-|..++..... T Consensus 132 ~~~~~~L~~~G~a~ly~~~~~~~~--~~~~~-~~~~pl~---~y~v~-~d~~G~v~~i~r~~~~~~~------------- 191 (543) T protein:vir:88 132 FELIRQLALAGTALIYLPPPDASS--NSYNP-MKLYTLH---NHVVQ-RDAFGNVLQIVTLDKVAYA------------- 191 (543) T ss_pred HHHHHHHHhhCceeeeeccCcccc--ceecc-eEEeEcc---eEEEe-eCCCCCeeeeeeeeeccHH------------- Confidence 888999999999999998553211 11111 1111111 11111 1112222222222222110 Q ss_pred eechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 228 REGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 228 ~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) .+...+ .+.+.. ......+...++|+....+.+. ..|.|+..-++ .+++ T Consensus 192 -------------------~l~~~~-~~~v~~-~~~~~p~~~~~v~~~V~pr~~~---~~~~~~~~~~~-------~~v~ 240 (543) T protein:vir:88 192 -------------------ALPEDV-RNSLSG-GQEYKPEQELEVYTHIYIDDES---GDFLSYQEIEG-------VEVD 240 (543) T ss_pred -------------------HHhHHh-hHHHHH-HhhcCCccceEEEEEEEeecCC---CcccccccccC-------eeee Confidence 000000 000000 0001122333444443332221 12233211111 1122 Q ss_pred ccCC----cccceeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCcee Q lcl|NC_019406. 308 MVRG----RTLPFIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEY 378 (661) Q Consensus 308 ~~~g----~~L~~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l 378 (661) ...| ...++|++.|.-..+..+.. .| |-|+..||.-+-. -+..+......|.++-+ |.... ..+ T Consensus 241 ~~~~~~~~~e~P~i~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l~~~---~l~~~~~~~~pp~~v~~~g~~~~--~~~ 313 (543) T protein:vir:88 241 GSDGQYPQDALPWIAVRWTKRDGEHYGR--SHVEEYLGDLNSLESLNEA---MIKFAMISSKVVGLVNPNGITQV--RRL 313 (543) T ss_pred cCCCccccccCCceeeeeeecCCCcccc--chHHHHHHHHHHHHHHHHH---HHHHHHHHhcCceeeccccccch--hhc Confidence 2222 23566777776555555544 44 4467777654432 24444445555544432 22221 224 Q ss_pred EecccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHH Q lcl|NC_019406. 379 HIGPGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVI 456 (661) Q Consensus 379 ~iGs~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A 456 (661) .-|..+.+ .+...++...++.. +..+....+.|+++++.+... =..++. ...+...||++...+...-...|..+- T Consensus 314 ~~~~~g~~-v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~r~TAtEV~~r~~E~~~~LG~v~ 391 (543) T protein:vir:88 314 VKAQTGDF-VAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAV-QRSGERVTAEEIRYVASELEDTLGGVY 391 (543) T ss_pred ccCCCcee-ecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhc-cCCCCcccHHHHHHHHHHHHHHHhHHH Confidence 44544443 34334456555433 456899999999999998742 122221 123445799999999999999999988 Q ss_pred HHHHHHH-HHHHHHHHHHc---C-CCCCCcceEEEEeccccccccC-CHHHHHHHHHHHhc-CCCCHHHHHHHHHhcCCC Q lcl|NC_019406. 457 MALEDGM-TSVVRYWLMFR---D-IPLTDTATLRYEIDATFLTTAL-DARALRAIQQLYEG-GLLPIDALYENFVKNGII 529 (661) Q Consensus 457 ~~le~Al-~~aL~~~A~w~---G-~~~~~~~~~~v~ln~DF~~~~l-da~~l~all~~~~a-G~Is~et~~~eL~r~gvl 529 (661) .++++=+ .-++..+-..+ | ++......++++|-. ....+ -.+++..|....+. |.|.. -++ T Consensus 392 ~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs--~l~~l~r~~~~~~l~~~~~~v~~~~~---------p~v- 459 (543) T protein:vir:88 392 SILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTT--GAEALGRGQDLDKLTQFLNAVATVSQ---------LNG- 459 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEe--cHHHHHHHHHHHHHHHHHHHHHhccc---------hhh- Confidence 8865543 33333322222 2 222122233333321 00111 11222222222221 12221 112 Q ss_pred CccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhh Q lcl|NC_019406. 530 PSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSV 609 (661) Q Consensus 530 ~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 609 (661) .+..++++..+.+.+- +|.|-+..+. .+ +|-+.. .+|+..+++..+.+.+++ .++ T Consensus 460 ld~id~d~~~~~~a~~---~Gv~~~~i~r------~~--~e~~~~----~~q~~~q~~~~~~~~~~~----------~~~ 514 (543) T protein:vir:88 460 DPDLNVNNIKLRLANA---IGIDTAGLLL------TE--AEKAQA----QSQEMLKQGGLNAAAGIG----------SGV 514 (543) T ss_pred hccCCHHHHHHHHHHH---hCCChhhhcC------CH--HHHHHH----HHHHHHHHHHHHHHHHHh----------hch Confidence 2457788888888764 2332221111 00 000000 011111111111111000 001 Q ss_pred hHHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCccccc Q lcl|NC_019406. 610 AASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQR 652 (661) Q Consensus 610 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 652 (661) ...-+ .- |...+++ .+....-|+|-..|- T Consensus 515 ~~~~~-~~-----~~~~~~~--------~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 515 AAQAT-AS-----PEAMESA--------MDTAGVQPGPIATQV 543 (543) T ss_pred hhhhc-cC-----hHHHHHH--------hhhcCCCCCCCCCCC Confidence 10000 01 1111111 111112222322222 No 129 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=94.89 E-value=0.0033 Score=34.22 Aligned_cols=427 Identities=10% Similarity=0.002 Sum_probs=179.4 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhc-ch-HHHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIA-GE-REIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~-G~-~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) |+- -|.| .+.-++.+.-.|..+. .-++ |+ +.....+ .|....----...+..|. . T Consensus 1 ~~~--~~~a---------~~~~~~~~a~~~~~~~-------~~~g~~~~~d~~~~~-~~~~~~~~~~~~l~~lY~----~ 57 (461) T protein:vir:80 1 MYS--IDKA---------KQAKIDSKIVNRNDFM-------VGHGKANSRDKLTRQ-TPGNGQKLDLKACENLYA----S 57 (461) T ss_pred Ccc--chhh---------hhhhhhhhhhhhhHHH-------hhcCCcchhhhhhcc-ccCcccccCHHHHHHHHH----h Confidence 431 1111 1222222222222221 1122 11 1111112 232211111122234443 3 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhC Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMG 158 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~G 158 (661) ..+.+..|+..++..+|+.+.|+... ++..+.|...++++ .+..-++++++.+..|| T Consensus 58 ~~l~r~iVd~~a~d~~r~g~~i~~~~------------------~~~~~~~~~~~~~l-----~~~~~l~~~~~~~rl~G 114 (461) T protein:vir:80 58 NSIAMNIVDIISEDMVRAGWSLKTDN------------------KEMKKNIESKWRKL-----KTKDRFQKLYADKRLYG 114 (461) T ss_pred CCccchhhccchHHhhcCCeeeecCC------------------HHHHHHHHHHHHHh-----hHHHHHHHHHHhhcccc Confidence 35567788888889999998885322 22334444444443 57888999999999999 Q ss_pred CEEEEEeccCCCc-hhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 159 RFGALVDVAPSSD-PTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 159 r~gvLVD~P~a~~-~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) .++|+|..-.... ...... .+.|+.+ .| | .||..++...|. T Consensus 115 ~a~i~i~v~d~~~~~~~~~~----pl~~~~~--------~~---~---------------------~~l~~~~~~~i~-- 156 (461) T protein:vir:80 115 DGFLSIGVVSSNREQADLST----AIDPKTI--------KS---I---------------------PYINTFNTQKVT-- 156 (461) T ss_pred cEEEEEEeecCCccccCccC----Ccccccc--------cc---e---------------------eEEEeccccccc-- Confidence 9999986421110 000000 1111111 11 1 111111111110 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccccee Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFI 317 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~I 317 (661) ..+....+..+ .|..-+.|++.-..... ... ..+..+ ..+..++.- T Consensus 157 ---------------~~~~~~dp~sp--~fg~P~~y~i~~~~~~~--~~~------~~~~~~---------~~~~~iH~S 202 (461) T protein:vir:80 157 ---------------QLYLNQDMFSE--HFGEVEFFEVNRVSQLG--EEI------LSGTTA---------STSEQIHRS 202 (461) T ss_pred ---------------hhhhcccCcCc--ccccceEEEEecccccc--ccc------cccccC---------ccceEEccc Confidence 00000111111 12222333332110000 000 000000 001123334 Q ss_pred eEEEEe-cCCCCCCccccchhHHHHHHHHHHhhhhh-HHHHHHHhcCceeEEecCCCCCCc---------eeEeccccee Q lcl|NC_019406. 318 PFVFFG-SMSNAADCEKPPLLDIVELNLKHYRTYAE-LEHGRFFTALPTYYAPELDDSDAS---------EYHIGPGRVW 386 (661) Q Consensus 318 Pfv~~~-~~~~~~~~~~pPLldLA~LNl~HYq~sSD-l~~il~~~~~P~l~i~Gl~~~~~~---------~l~iGs~~~~ 386 (661) +++.+. ..-.+..-+.| ++..+.=-|..|..... --++++...++++-+.|+...-.+ ....+..+.+ T Consensus 203 Rii~~~~~~~~~~~~G~S-~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~ 281 (461) T protein:vir:80 203 RIIHEQGLRFEGETKGRS-IFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALA 281 (461) T ss_pred cEEEecCCCCCccccCcc-hHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEE Confidence 444331 11111122433 44455555555555442 345778888998888876432111 1223334444 Q ss_pred ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHh---H-HhcccccCccchhHHHHHHHHHHhhHHHHHHH-HHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIG---G-RLMPGMSKSVSESDNQSALREANEQSLLLNVI-MALED 461 (661) Q Consensus 387 ~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lG---A-rll~~~~~~~~eTataa~~d~~~~~S~L~~~A-~~le~ 461 (661) .+.. +.++..+..+-+++ .+.++...++|...- . +|+ .++-+.+-|+ ..|...=+..+.++- ..+.. T Consensus 282 ~~d~-~e~~e~~~~~lsgl---~~~l~~~~~~iaa~s~iP~t~L~-G~s~g~~asg---e~D~~~yyd~i~~~qe~~l~p 353 (461) T protein:vir:80 282 IIKG-DEQLTKESTNVSGM---KDLLDYGWDYLAGAVRMPKTVLK-GQEAGTLTGA---QYDVMNYYARVSSIQENRLRP 353 (461) T ss_pred EEcC-CcceEEEecCcCCH---HHHHHHHHHHHhhhhcCCeeeee-cccCCccccc---hHHHHHHHHHHHHHHHHHHHH Confidence 5554 45666666554554 445555555554321 1 122 2221222222 223333445555555 34777 Q ss_pred HHHHHHHHHHHHcCC--C--CCCcceEEEEeccccccccCCHHH--------HHHHHHHHhcCCCCHHHHHHHHHhcCCC Q lcl|NC_019406. 462 GMTSVVRYWLMFRDI--P--LTDTATLRYEIDATFLTTALDARA--------LRAIQQLYEGGLLPIDALYENFVKNGII 529 (661) Q Consensus 462 Al~~aL~~~A~w~G~--~--~~~~~~~~v~ln~DF~~~~lda~~--------l~all~~~~aG~Is~et~~~eL~r~gvl 529 (661) .++++++++.+=++. + +.+..++.|+.|+ ...++..+ .+++..++++|.||.++.+++|+.+.-+ T Consensus 354 ~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~---L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~ 430 (461) T protein:vir:80 354 QLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNP---LWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGL 430 (461) T ss_pred HHHHHHHHHHHHhcccccccCccccceEEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCC Confidence 888888877642221 1 1223466666663 22223322 3677888999999999999989744333 Q ss_pred -CccCCH--HHHHHHHhccCCCCCCchhhhhhcC Q lcl|NC_019406. 530 -PSTQTL--EEFTIKMNDPKSFIGQPDAIAMRRG 560 (661) Q Consensus 530 -~~~~~~--Eee~~~l~~~~~~l~~ddae~~~~g 560 (661) ++.... +.+.++++++.. .+..+...+| T Consensus 431 ~~~~~~~~~~~~~~~~~~~~~---~~~~~e~~~g 461 (461) T protein:vir:80 431 ENSSKFSGDSAEIDKLAKLVY---DAYAKKNADG 461 (461) T ss_pred CCCccCCCCCchhhhhhhhcc---ccccccCCCC Confidence 222111 111112221110 0111111122 No 130 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=488 Identities=11% Similarity=0.076 Sum_probs=188.0 Q ss_pred CCCCCCcccccccc-ccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRT-KRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) |. +|.|-+-- .+.+...-.+.--.+......+|+-|.+.+--+. .+.+... .+.+| .|- T Consensus 1 ~~----~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~------------~~~~~~~---~~~~~-~~d 60 (516) T protein:vir:10 1 MK----QSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYL------------MNDKGDN---ETSQN-GWQ 60 (516) T ss_pred CC----chhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc------------cCCCCCc---ccccc-ccc Confidence 11 12221100 0000000011111222333555555555543311 1111111 11111 344 Q ss_pred chHHHHHHHHhch----hhccC-cccc-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHH Q lcl|NC_019406. 80 NMTSQTQAGMVGQ----IFRRP-PVIR-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGF 146 (661) Q Consensus 80 n~~~~tv~~l~G~----vFrk~-p~i~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~f 146 (661) +.-.+.++.++.. +|.-- |=+. .+++. ++.+..+ | .....+..+|+.| -+.-++.+.- T Consensus 61 stg~~a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~--~-------~~~~~v~~~L~~ve~~~~~~l~~snf~~~ 131 (516) T protein:vir:10 61 GVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQR--G-------LKKTELATIFAQVETRAMKELEQRQFRPA 131 (516) T ss_pred chHHHHHHHHHHHHHhhhcCCCCccccccCChhhHhhhhcc--C-------chhHHHHHHHHHHHHHHHHHHHhcCcHHH Confidence 4444555554433 33211 1110 11111 1111100 0 1111233333333 2456788888 Q ss_pred HHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccccee Q lcl|NC_019406. 147 AKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWI 226 (661) Q Consensus 147 a~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i 226 (661) +-.++.+.+.+|-+.+++|.+. +. ..|+-.+ +-+. .++.+.+.-|+.++......- T Consensus 132 ~~~~~~~L~~~G~a~l~~d~~~------~~----~~~pl~~---y~v~-~d~~G~v~~ivrr~~~~~~~l---------- 187 (516) T protein:vir:10 132 VVEAFKHLIVAGSCMLYKPSKG------AI----SAIPMHH---YVVN-RDTNGDLLDIILLQEKSLRTF---------- 187 (516) T ss_pred HHHHHHHHHhHCeEeEEecCCC------Ce----EEEEcCe---EEEe-eCCCCCeEEEeeeecccHHHH---------- Confidence 8889999999999988887431 11 2222221 1111 122222333333332221100 Q ss_pred eeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccccee Q lcl|NC_019406. 227 GREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYT 306 (661) Q Consensus 227 ~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~ 306 (661) ++. |. .... +..... .... .....+|.......+ ..|.++...++. ..+ T Consensus 188 ----~e~---~~-----~~~~-~~~~~~-----~~~~---~~~~~i~t~v~~~~~----~~~~~~~~~d~~--~~~---- 236 (516) T protein:vir:10 188 ----DPA---TR-----AVVE-VGLKGK-----KCKE---DDSIKLYTHAKYLGE----GFWELKQSADDI--PVG---- 236 (516) T ss_pred ----HHH---hh-----hhhh-hhhhhh-----ccCC---CCceEEEEEEEecCC----CceEEEEeeCce--eec---- Confidence 000 00 0000 000000 0001 112233433333222 123333322221 111 Q ss_pred eccCC---cccceeeEEEEecCCCCCCcccc---chhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeE Q lcl|NC_019406. 307 PMVRG---RTLPFIPFVFFGSMSNAADCEKP---PLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYH 379 (661) Q Consensus 307 p~~~g---~~L~~IPfv~~~~~~~~~~~~~p---PLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~ 379 (661) ..+| ..+++||+.|.-..+..+..+ | -|-|+..||.-+ .+-+..+.+-...|.++-+ |... ...+. T Consensus 237 -~~s~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~L~D~k~L~~l~---~~~l~~~~~a~~~~~lv~p~g~~~--~~~l~ 309 (516) T protein:vir:10 237 -KVSKIKSEKLPFIPLTWKRSYGEDWGRP-LAEDYSGDLFVIQFLS---EAVARGAALMADIKYLIRPGAQTD--VDHFV 309 (516) T ss_pred -cccccccccCCeeeeeeeecCCCCcccc-hHHHhhHHHHHHHHHH---HHHHHHHHHhcCCCcccCcccccc--hhhhc Confidence 1122 347888888876666666554 2 255777777432 2334555555556666643 3322 22355 Q ss_pred ecccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHHh-H-HhcccccCccchhHHHHHHHHHHhhHHHHHHH Q lcl|NC_019406. 380 IGPGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIG-G-RLMPGMSKSVSESDNQSALREANEQSLLLNVI 456 (661) Q Consensus 380 iGs~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lG-A-rll~~~~~~~~eTataa~~d~~~~~S~L~~~A 456 (661) -|+++.+. |+.....+-++.. +..+..+.+.|+++++.+..+= + .+... .+...|||+...+...-...|.-+- T Consensus 310 ~~~~g~~~-~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r--d~~rvTAtEV~~r~~E~~~~LGpv~ 386 (516) T protein:vir:10 310 NSGTGEVV-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRR--DAERVTAVEIQRDALEIEQNMGGVY 386 (516) T ss_pred cCCCceee-cCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhcc--CCccccHHHHHHHHHHHHHHhhhHH Confidence 56655554 4333345555543 3458888999999999987632 1 23322 2345799999999999999998888 Q ss_pred HHHHHHH-HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCC-HHHHHHHHHh-cCCCCccC Q lcl|NC_019406. 457 MALEDGM-TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLP-IDALYENFVK-NGIIPSTQ 533 (661) Q Consensus 457 ~~le~Al-~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is-~et~~~eL~r-~gvl~~~~ 533 (661) ..+..=+ .-++..+..-++ +.. +.++ ++.++. .. +.+|..+.....|. ...++..+.. ..-+-+-. T Consensus 387 ~rl~~Ell~Pli~r~~~~~~-p~~-P~~l---v~~~~v-~~-----i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~i 455 (516) T protein:vir:10 387 SLFATTMQSPVAMWGLLEAG-DSF-TSDL---VDPVII-TG-----IEALGRMAELDKLANFAQYMSLPLQWPEPVLAAV 455 (516) T ss_pred HHHHHHHHHHHHHHHHHhhC-CCC-Chhh---cCccee-hh-----HHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhc Confidence 7765544 333433322222 111 1111 223321 12 22222222211111 1112221211 01122345 Q ss_pred CHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCC-CchhHHHhhhhhhhhhh Q lcl|NC_019406. 534 TLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEI-DEEKLRISAKVGSTSVA 610 (661) Q Consensus 534 ~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~-~~~~~~~~~~~~~~~~~ 610 (661) ++++..+.+.+..+. | .. ..+. .+++.+-+..+. ..+|.....+..+.|. -.-+.|+++ + T Consensus 456 d~d~~~~~~a~~~gv---p-~~-~irs----~eev~~~r~~~~-~~q~~~~~~~~~~~~~~~~~~~~~~~-------~ 516 (516) T protein:vir:10 456 KWPDYMDWVRGQISA---E-LP-FLKS----AEEMEQEQEAQM-QAQQAQMLEEGVAKAVPGVIQQELKE-------A 516 (516) T ss_pred CHHHHHHHHHHHhCC---C-hh-ccCC----HHHHHHHHHHHH-HHHHHHHHHHHhhhcccchhhhhhhc-------C Confidence 677777767654221 1 10 1100 111111111110 0111111111111111 111122222 1 No 131 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=94.81 E-value=0.0034 Score=34.10 Aligned_cols=504 Identities=10% Similarity=0.083 Sum_probs=186.0 Q ss_pred ccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC-CChHHHHHHHhhhcccchHHHHHHHH Q lcl|NC_019406. 11 IRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG-FDDEDYANYLDRAAFYNMTSQTQAGM 89 (661) Q Consensus 11 ~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~-E~~~~Y~~rl~rA~~~n~~~~tv~~l 89 (661) ...++ ..-.+.-.-+-....++|+-|.+.+ ||..-. +.+.. ..++. -.|-+.-.+.++.+ T Consensus 1 mk~~a----~~r~~~l~~~R~~~e~~w~e~~~y~-------------lP~~~~~~~~~~-~~~~~-~~~dstg~~a~~~L 61 (542) T protein:vir:78 1 MKGLA----QARYSAMRADREDFLDMARRCAALT-------------LPYLLTEDGHAS-GGRLQ-QPYQSLGSKGVNAL 61 (542) T ss_pred ChhHH----HHHHHHHHHHhhHHHHHHHHHHHHh-------------ccccCCCCCCcc-ccccc-ccccchHHHHHHHH Confidence 11000 0001111111122344454444443 332211 11111 11111 23444445555555 Q ss_pred hchhhcc-----Ccccc-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHHHHHHHHHHh Q lcl|NC_019406. 90 VGQIFRR-----PPVIR-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFAKTVALEQVA 156 (661) Q Consensus 90 ~G~vFrk-----~p~i~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~~~~~~~L~ 156 (661) ...++.- .|=+. .+++. +..+ ...| ++....++.+|+.| -+.-++.+.-+-.++.+.+. T Consensus 62 aa~l~~~ltpp~~~WF~l~~~d~~l~~~-~~~~-------~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~ 133 (542) T protein:vir:78 62 SSKLMLSLFPIQTSFFKLQINDAEIASV-PELT-------PEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIV 133 (542) T ss_pred HHHHHHhhcCCCCccccccCCHHHHHhh-ccCC-------hhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 4444331 11110 01100 0000 0001 11111122222211 12355777778888999999 Q ss_pred hCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhc Q lcl|NC_019406. 157 MGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQR 236 (661) Q Consensus 157 ~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~ 236 (661) +|-+.+++|-++ +..|+-.+ +-+. .++.+.+.-|..++..... .-+-. T Consensus 134 ~G~a~l~~~~~~-----------~~~~pl~~---y~v~-~d~~G~vd~v~r~~~~t~~-----------------ql~~~ 181 (542) T protein:vir:78 134 TGNVLVFAGKKT-----------LKVYPLDR---YVIE-RDGDGNVIEIITRELVDRS-----------------LLPAE 181 (542) T ss_pred hCeEEEEecCCC-----------ceEEecce---eEEe-eCCCCCeEEEeeeeecCHH-----------------HHHHh Confidence 999999988542 11222111 1111 1122222222222222110 00001 Q ss_pred chhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccc-------cceEEEEEEEecCcccccccceeecc Q lcl|NC_019406. 237 TSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKD-------GSRVYKQFVYVEDPLGQARDVYTPMV 309 (661) Q Consensus 237 w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~-------g~~~~~~~~~~~~~~~~~~~~~~p~~ 309 (661) |.... ++..+.... .......| ++.+...-..+.+ ....|.|+...++..... . ... T Consensus 182 fg~~~------l~~~~~~~~---~~~~~~~~---~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v~~---~-~~e 245 (542) T protein:vir:78 182 FQKQS------LLEGKDSNA---VGEDGPKF---GVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKEIKG---S-RSS 245 (542) T ss_pred hcccc------CchHHHhhc---cccCCCeE---EEEEEeecccCCccccccccCCCeEEEEEEeccccccc---c-ccc Confidence 11000 000000000 00000000 0000000000000 001234433332221100 0 001 Q ss_pred CC-cccceeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEeccc Q lcl|NC_019406. 310 RG-RTLPFIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPG 383 (661) Q Consensus 310 ~g-~~L~~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~ 383 (661) .| ..+++||+.|.-..+..+.. .| |-|+..||.-+-. -+..+......|.++-+ |..+. ..+.-|.. T Consensus 246 ~g~~~~P~i~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l~~~---~l~~~~~a~~pp~lv~~~g~~~~--~~~~~~~~ 318 (542) T protein:vir:78 246 SPLKHSPWLPLRFNVVDGESYGR--GRVEEFFGDLSSLDALTRS---LIEGSAAAAKVVFMVSPSATTKP--QSLARAGT 318 (542) T ss_pred cccccCCceeeeeeecCCCcccc--chHHHHHHHHHHHHHHHHH---HHHHHHHHhcCceeeccccccch--hhcccCCC Confidence 12 24788888887766666644 34 4477777765433 24444444455544433 33221 22333444 Q ss_pred ceeecCCCCCcceEeec-CchhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019406. 384 RVWVVDKESGIPGIIEF-KGEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG 462 (661) Q Consensus 384 ~~~~lp~~ga~~~ylE~-~g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A 462 (661) +.++ +...++.+.++. ++..+....+.|+++++.+..+ =|+....++...||++...+...-...|..+-..+++= T Consensus 319 g~iv-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~a--Fl~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E 395 (542) T protein:vir:78 319 GAII-QGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDA--FLILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVE 395 (542) T ss_pred ceee-cCCccceeeeecccccchhHHHHHHHHHHHHHHHH--hcccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 4443 333345555543 2446888999999999999763 23323334556799999999999999999988888543 Q ss_pred -H----HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh---cCCCCccCC Q lcl|NC_019406. 463 -M----TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK---NGIIPSTQT 534 (661) Q Consensus 463 -l----~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r---~gvl~~~~~ 534 (661) + +++|.++.+---++....+-+++ +|.. . |.++.++.....| ..|...+-. -..+.+..+ T Consensus 396 ~L~Pli~R~~~il~r~g~lP~~p~~lv~~----~~~s-~-----La~~~r~~~~~~l--~~~~~~i~~~~~p~~l~~~id 463 (542) T protein:vir:78 396 LLTPYLNRKLHLMQRSKQLPSLPKGLVMP----TVVA-G-----LGGVGRGEDRAAL--IEFMQTVGQAMGPEALQQFID 463 (542) T ss_pred HHHHHHHHHHHHHHhcCCCCCCchhceee----eeec-h-----HHHHHHHHHHHHH--HHHHHHHHHhcCChhHHhcCC Confidence 3 33444443322122222222333 3332 2 2222222222111 223333311 122334567 Q ss_pred HHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHh Q lcl|NC_019406. 535 LEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRK 614 (661) Q Consensus 535 ~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 614 (661) +++..+.+.+- +|.|-+.... . +++ . ..+.|+.+. ++.++.+.+-.|. .|.-. T Consensus 464 ~d~~~~~~a~~---~Gvp~~~i~~-s----~e~-----------~---~~~~~q~q~--~~~~~al~~~a~~---~a~~~ 516 (542) T protein:vir:78 464 PTEFLKRLAAA---SGIDTLNLVK-S----PET-----------M---ANEAQQAQQ--QQMTASLMGQAGQ---LAKSP 516 (542) T ss_pred HHHHHHHHHHH---cCCCHhhccC-C----HHH-----------H---HHHHHHHHH--HHHHHHHHHhhhh---ccccc Confidence 88888888764 3333221111 0 000 0 001111000 0000110000000 00000 Q ss_pred cCChhhhhhhhhhhhHHHHhhccc--ccCCCCCCCcccc Q lcl|NC_019406. 615 LGDPEQAKPSKAEQAQIDAQQKQA--AAKPVTPTPGTVQ 651 (661) Q Consensus 615 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 651 (661) .|.+ ..++-+| -.+|+.|.-|.-- T Consensus 517 ~~~~-------------~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 517 IGEK-------------MMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred cccc-------------hhhhcCCCCcCCCCCCcccccC Confidence 0110 0001111 1122222222211 No 132 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=94.47 E-value=0.0043 Score=33.55 Aligned_cols=482 Identities=9% Similarity=0.016 Sum_probs=188.4 Q ss_pred ccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHH-HHHHHH Q lcl|NC_019406. 11 IRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTS-QTQAGM 89 (661) Q Consensus 11 ~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~-~tv~~l 89 (661) ++.++. +-.. .-+-.....+|+-|.+.+--. +-..+.++...-. ++...+..|+ +.++.+ T Consensus 1 m~~~~~----~l~~--k~~R~~~e~~w~e~a~~~lP~----------~~~~~~~~~~~~~---~~~~~~dstg~~a~~~L 61 (514) T protein:vir:80 1 MRQQAS----AMWA--EYRDSTAIRKAEDFAKFTIAS----------LMVDPLDKTHQAE---VVEYDFQSAGAFLVNNL 61 (514) T ss_pred CccchH----HHHH--HhhcchHHHHHHHHHHHhccc----------ccCCCCCCccccc---ccccccchhHHHHHHHH Confidence 221110 0000 011223566676666665331 1111222222111 1112223333 555544 Q ss_pred hch----hhccC-ccccc-cchh-hHhhhhcccccccccchhhhhhhHhhhhhcc------CCCCCHHHHHHHHHHHHHh Q lcl|NC_019406. 90 VGQ----IFRRP-PVIRN-LPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFAKTVALEQVA 156 (661) Q Consensus 90 ~G~----vFrk~-p~i~~-~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa~~~~~~~L~ 156 (661) +.. +|.-- |=+.- +.+. ++.+.. .+.....+..+|+.|+ +..++.+.-+-.++.+... T Consensus 62 Aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~---------~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~ 132 (514) T protein:vir:80 62 TAKLALTLFPPGRPSFQIELDDTLQELAAA---------NGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVV 132 (514) T ss_pred HHHHHhhhcCCCCcccccccCchhhhhccc---------cchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 433 33210 10110 0000 011000 0000112222222221 2457888888899999999 Q ss_pred hCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhc Q lcl|NC_019406. 157 MGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQR 236 (661) Q Consensus 157 ~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~ 236 (661) +|-+.+++|-.+ . + +..|+-.+ +-+. .++.+.+.-|..++....+.- . T Consensus 133 ~G~a~l~~~~~~-----~---~-~~~~pl~~---y~v~-~d~~G~v~~i~rr~~~~~~~l-------------------~ 180 (514) T protein:vir:80 133 TGNALFYREPGT-----G---K-MLVWTMQS---YTVR-RTSHGDPAVVVLRQQMPFREL-------------------T 180 (514) T ss_pred HCeEEEEEecCC-----C---c-EEEEEcCe---EEEe-eCCCcCeEEEEeeeeecHHHh-------------------h Confidence 999999987211 1 1 22232222 2111 122333333444443322110 0 Q ss_pred chhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC---cc Q lcl|NC_019406. 237 TSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG---RT 313 (661) Q Consensus 237 w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g---~~ 313 (661) +.... ........ ...+....+|......+++++ .+|.++...++.. . ...+| .- T Consensus 181 ~~~~~---------~~~~~~~~-----~~~~~~v~v~~~v~~~~~~~~-~~~sv~~e~~g~~--i-----~~es~y~~~e 238 (514) T protein:vir:80 181 PEIQA---------DAQAKQIA-----KRDSDKCDLYTVIEWQPTPNG-KRCAVWHELEGKR--V-----GPESSYPAHL 238 (514) T ss_pred hhhhh---------hhhhhhcc-----CCCCCceEEEEEEEeecCCCC-eEEEEEEecccee--e-----cccCcccccc Confidence 00000 00000000 011122234444334444433 2344433322211 1 11222 23 Q ss_pred cceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEecccceeecCC Q lcl|NC_019406. 314 LPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVWVVDK 390 (661) Q Consensus 314 L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~~lp~ 390 (661) +++||+.|.-..+..+..+ .--|-|+..||.-+ .+-+..+.-....|.++-+ |... ...+.-|+++.+. |+ T Consensus 239 ~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~---~~~l~~~~~a~~~~~~v~~~g~~~--~~~l~~~~~g~~v-~g 312 (514) T protein:vir:80 239 CPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILS---ERLGLYEFEALSLLNLVDEAKGGA--VDDYRDAETGDFV-PG 312 (514) T ss_pred CCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCceeCcccccc--hhhhcccCCceee-cC Confidence 6888888876666555443 11245777777332 2223333333344444433 3322 1235556555443 43 Q ss_pred CCCcceEeecC-chhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHH-HHH Q lcl|NC_019406. 391 ESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTS-VVR 468 (661) Q Consensus 391 ~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~-aL~ 468 (661) .....+.++.. +..+....+.|+++++.+..+=. |......+...|||+...+...-...|.-+-..+..=+-. ++. T Consensus 313 ~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFm-l~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~ 391 (514) T protein:vir:80 313 QVGSVASYERGDYNKIAQASASVESIVMRLNRAFM-YTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAY 391 (514) T ss_pred CCccceeeecCcccchHHHHHHHHHHHHHHHHHHh-hhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 33456666543 44688889999999999976311 1111112344699999999999999998877776544333 333 Q ss_pred HHHHHc-----CC-CCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCC---CCccCCHHHH Q lcl|NC_019406. 469 YWLMFR-----DI-PLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-NGI---IPSTQTLEEF 538 (661) Q Consensus 469 ~~A~w~-----G~-~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~gv---l~~~~~~Eee 538 (661) .+-..+ |. +....+. +..+|.. . +.++..+.....| ..|...+.. .++ +.+..++++. T Consensus 392 r~~~il~r~~~g~lP~~p~~l----~~~~~vs-~-----la~l~r~~~~~~l--~~~~~~i~~l~~~~p~v~d~id~d~~ 459 (514) T protein:vir:80 392 LTMYEASRGNGGMLLGIAQGV----YRPSIIT-G-----IPALTRNIETANI--LRATQEASAIVPALVQLSKRFDPEKL 459 (514) T ss_pred HHHHHHhhhccCCCCCCCchh----hcceeee-c-----HHHHHHHHHHHHH--HHHHHHHHHHhccchhhhhcCCHHHH Confidence 322222 22 1111112 2233332 1 2222222222222 223333322 122 2345778888 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCCh Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDP 618 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 618 (661) .+.+.+- +|.|-..... . + ++.+...+ ++.+.++.+++.++ .... + |.. T Consensus 460 ~~~~a~~---~Gvp~~~i~~-~----~---e~~~~~~~------------~~~~~~~~~~~~~~----~~~~-~---~~~ 508 (514) T protein:vir:80 460 VERIFAN---NSVDLSTLSK-D----P---DVVAAEAE------------QEAALAQQQLDVAS----GALA-A---ETS 508 (514) T ss_pred HHHHHHH---hCCCHhhccC-C----H---HHHHHHHH------------HHHHHHHHHHHHHH----HHHH-H---hhh Confidence 8888764 3333221111 0 0 01111111 11000111111100 0000 0 011 Q ss_pred hhhhhh Q lcl|NC_019406. 619 EQAKPS 624 (661) Q Consensus 619 ~~~~~~ 624 (661) .-.-|| T Consensus 509 ~~~~~~ 514 (514) T protein:vir:80 509 AGVLTS 514 (514) T ss_pred ccccCC Confidence 111122 No 133 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=94.45 E-value=0.0044 Score=33.51 Aligned_cols=519 Identities=11% Similarity=0.054 Sum_probs=185.6 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCC-CChHHHHHHHhh-hcccchHHHHHHHHhchh----h Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKG-FDDEDYANYLDR-AAFYNMTSQTQAGMVGQI----F 94 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~-E~~~~Y~~rl~r-A~~~n~~~~tv~~l~G~v----F 94 (661) |.-.... ........-+--|.-+ ...|++...-.||.... ..+..-...... -.|-+.-.+.++.+...+ | T Consensus 1 m~~~~~~-~l~~r~~~l~~~R~~~--e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (559) T protein:vir:95 1 MAETTKE-RLNKQFAQLESERQSF--EPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) T ss_pred CChhhHH-HHHHHHHHHHHHhhHH--HHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Confidence 3322211 1111111111111222 23333333333454322 111111111111 134444445555544333 3 Q ss_pred c-cCccccccchhhHhhhhcccccccccchhhhhhhHhhh-hhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCch Q lcl|NC_019406. 95 R-RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQL-QRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDP 172 (661) Q Consensus 95 r-k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~-~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~ 172 (661) - ..|=+. + -..|.+......+-.++......+ +.+ .-++++.-+-.++.+.+.+|-+-+++|..+. T Consensus 78 pp~~~WF~-l------~~~d~~~~e~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~--- 145 (559) T protein:vir:95 78 SPARPWFR-L------ATPDPEMMDYGPVKLWLEAVQNRMNDMF--NKSNLYQSLPQLYGSLGTYSTGAMAVLDDDE--- 145 (559) T ss_pred CCCCcccc-c------ccCCccccchHHHHHHHHHHHHHHHHHH--HhcCcHHHHHHHHHHHHhhCceeeEeecCCC--- Confidence 3 111110 0 000111100000111111111111 122 3456777788889999999999999885321 Q ss_pred hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh-hcchhhhhcchhhhhhh Q lcl|NC_019406. 173 TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA-QRTSGGRRAGLAERQGS 251 (661) Q Consensus 173 ~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v-i~w~~~~~~g~~~~~~~ 251 (661) .+ ..+..|+..+ +-+... +.+.+.-|..++... +..+ -.|... +....... T Consensus 146 -~~--~r~~~~~l~~---~~v~~d-~~G~vd~i~r~~~~t------------------~~ql~~~fg~~---~l~~~~~~ 197 (559) T protein:vir:95 146 -DI--IRTMPFPIGS---YYLANS-PRGSVDTCFRKFSMT------------------VRQLVQEFGLN---NVSESVKS 197 (559) T ss_pred -ce--eEEEEeecCe---EEEeeC-CCCCeEEEEEeEecC------------------HHHHHHHcCcc---cCCHHHHH Confidence 11 1222222222 222211 112222222111111 1111 011000 00000000 Q ss_pred hhhhheecccccCCCceeeEEEEEEEeecccc----cc--eEEEEEEEecCcccccccceeeccCC-cccceeeEEEEec Q lcl|NC_019406. 252 ARADALARPSRFTSSYTFRTIYRELILELQKD----GS--RVYKQFVYVEDPLGQARDVYTPMVRG-RTLPFIPFVFFGS 324 (661) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~----g~--~~~~~~~~~~~~~~~~~~~~~p~~~g-~~L~~IPfv~~~~ 324 (661) .+ .. +.. ..+.++++........+ +. ..|....|..+..+ .++...+| ..+++||+.|.-. T Consensus 198 ~~----~~-~~~---~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~----~~~l~esg~~e~P~~~~Rw~~~ 265 (559) T protein:vir:95 198 MW----ES-GTY---EKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDN----DKLLRESGFDEFPIMAPRWEVN 265 (559) T ss_pred HH----hc-CCC---CCeEEEEEEEeccccccccccccccceEEEEEEEecCCC----ceeeecCCcccCCccceeeeec Confidence 10 00 000 01112222111111111 11 11222223322111 11112223 3588898888877 Q ss_pred CCCCCCccccc---hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCCC--cceEee Q lcl|NC_019406. 325 MSNAADCEKPP---LLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKESG--IPGIIE 399 (661) Q Consensus 325 ~~~~~~~~~pP---LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ga--~~~ylE 399 (661) .+..+..+.|- |-|+..||.-+-.. -..++.+..|.+.++ ++....++.+.|+..+..+..++ .+..+. T Consensus 266 ~ge~YGrg~P~~~al~d~k~L~~l~~~~----l~~~~~~~~pp~~v~--~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~ 339 (559) T protein:vir:95 266 GEDVYGSSCPGMLALGPVKALQLLQKRK----SQLIDKATNPPMVAP--TSLKNQRASLLPGDITYIDQITGQDGFRPAY 339 (559) T ss_pred CCccccccchHHHhhHHHHHHHHHHHHH----HHHHHHHhcCceecc--ccccccceeeeccceeeeCCCCCcccceeec Confidence 66666654332 44666666544332 344555555655544 22333456677776665543222 122221 Q ss_pred cCchhHHHHHHHHHHHHHHHHHH-hHH---hcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH-HH----HHHHHH Q lcl|NC_019406. 400 FKGEGLKTLERALNEKEQQIAAI-GGR---LMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG-MT----SVVRYW 470 (661) Q Consensus 400 ~~g~~i~a~~~~L~~le~qM~~l-GAr---ll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A-l~----~aL~~~ 470 (661) .-...+......|+++++.+... -.. ++. ..++...||++...+...--..|..+-.++.+= +. +++.++ T Consensus 340 ~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~-~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il 418 (559) T protein:vir:95 340 LVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ-NINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMM 418 (559) T ss_pred ccccchHHHHHHHHHHHHHHHHHhhhhhHHHhh-cCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 11234565667788888887643 111 222 223456699999999999999999888887443 33 334444 Q ss_pred HHHcCC-CCC----CcceEEEEeccccccccCCHHHHHHHHHHHhc-CCCCHHHHHHHHHhc--CCCCccCCHHHHHHHH Q lcl|NC_019406. 471 LMFRDI-PLT----DTATLRYEIDATFLTTALDARALRAIQQLYEG-GLLPIDALYENFVKN--GIIPSTQTLEEFTIKM 542 (661) Q Consensus 471 A~w~G~-~~~----~~~~~~v~ln~DF~~~~lda~~l~all~~~~a-G~Is~et~~~eL~r~--gvl~~~~~~Eee~~~l 542 (661) -+- |+ +.. ...+++++ |. .. |....++... +.+..-.++..|... +++ +..++++..+++ T Consensus 419 ~r~-g~lP~~p~~l~~~~i~v~----~i-s~-----La~aqk~~~~~~i~~~~~~~~~laq~~Pevl-d~id~d~~~~~~ 486 (559) T protein:vir:95 419 VRK-NMLPPPPDVMEGMPLKVE----YI-SV-----MAQAQKSIGLSSLASTVNFIGQLAQVKPEAL-DKLNVDQAIDAF 486 (559) T ss_pred Hhc-CCCCCCcccccCcceEEE----ee-cH-----HHHHHHHHHHHHHHHHHHHHHHHhccChhhh-hcCCHHHHHHHH Confidence 332 21 111 11122222 22 11 2222222111 111222233333221 121 347788888888 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCch--hHHHhhhhhhhhhhHHHhcCChhh Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEE--KLRISAKVGSTSVAASRKLGDPEQ 620 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 620 (661) .+- +|.| +..+. .. ++..+ +.||..++|+.+.+.+.. -+++.. .+++..- T Consensus 487 a~~---~Gvp-~~~ir-s~----~ev~~--------~rqqr~~~qq~~q~~~~~~~aa~~~~-----------~~~~~~~ 538 (559) T protein:vir:95 487 ADM---SGVS-PTVIV-PQ----EQVEQ--------ARQQRAQQQQQQQMMAMGMAAAQGVK-----------TLSEAKT 538 (559) T ss_pred HHH---hCCc-hhhcC-CH----HHHHH--------HHHHHHHHHHHHHHHHHHHHHHHhhh-----------ccccccC Confidence 764 4444 21111 10 10010 112222222211110000 011111 1111111 Q ss_pred hhhhhhhhhHHHHhhcccccCCCCCCCcccc Q lcl|NC_019406. 621 AKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQ 651 (661) Q Consensus 621 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 651 (661) ..+..+++..-.. +.--|--| T Consensus 539 ~~~~~l~~~~~~~----------~~~~~~~~ 559 (559) T protein:vir:95 539 SDPSVLSAMANAV----------SGQGGQSQ 559 (559) T ss_pred CChhHHHHHHHhh----------cCccccCC Confidence 1111111111000 00000000 No 134 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=94.16 E-value=0.0052 Score=33.10 Aligned_cols=479 Identities=10% Similarity=0.017 Sum_probs=189.5 Q ss_pred ccccccccccCCccccCHHHH-----HHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHH Q lcl|NC_019406. 11 IRRTKRGAQQFTHLVVHPEYE-----YYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQT 85 (661) Q Consensus 11 ~~~~~~~~~~~~V~~~hPey~-----a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~t 85 (661) ...++ ..-|. .+..+|+-|.+.+--. +....+.+. ..++.+ .|-+.-.+. T Consensus 1 mk~~~-----------~~~~~~lkR~~~e~~w~e~a~~tlP~----------~~~~~~~~~---~~~~~~-~~dstg~~a 55 (510) T protein:vir:63 1 MKTTA-----------AMLWEKLRDGSVEQRAIEFAKTTLPY----------LMVDPMSGS---RGVVEH-DFQSAGALL 55 (510) T ss_pred ChhHH-----------HHHHHHHhccchHHHHHHHHHhhccc----------cCCCCCCcc---ccccCC-CccchHHHH Confidence 11110 11111 1334455444444221 111122111 112222 244444555 Q ss_pred HHHHhchhhcc--Ccc--c-c-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHHHHHHH Q lcl|NC_019406. 86 QAGMVGQIFRR--PPV--I-R-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFAKTVAL 152 (661) Q Consensus 86 v~~l~G~vFrk--~p~--i-~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~~~~~ 152 (661) ++.++..+..- ||. + . .+.+. ++.+..+ +.....++.+|+.+ -+.-++.+.-+-.++. T Consensus 56 ~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~---------~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~ 126 (510) T protein:vir:63 56 VNNLAAKLARSLFPTGIPFFRSELTDAIRREADSR---------DTDITEVTAALARVDRKATQRLFQNASLAVLTQVIK 126 (510) T ss_pred HHHHHHHHHhhhcCCCCcccccCCChHHhhccccc---------chhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHH Confidence 55544333221 111 1 0 11111 1111000 00011122211111 1245678888888999 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) +.+.+|-+.+++|-- +. .+..|+-.+ +-+. .++.+.+.-|+.++......- ++ T Consensus 127 ~Li~~G~a~l~~~~~-------~~--~~~~~pl~~---y~v~-~d~~G~vd~i~rr~~~t~~~l--------------~e 179 (510) T protein:vir:63 127 LLIVTGNALLYRDSD-------AA--TVVAWSLRS---YAVR-RDATGRWMDIVLKQRYKSKDL--------------DE 179 (510) T ss_pred HHHhhCeEEEEEcCC-------Cc--EEEEEEcce---eEEe-eCCCcCeeEEEeeeeccHHHH--------------hH Confidence 999999998998821 11 122332222 2111 122223333444433321110 00 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccC-C Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVR-G 311 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~-g 311 (661) . +....... ......+...++|+.....++. +...|.++...++.. .+ .... | T Consensus 180 ~------------------~~~~~~~~-~~~~~~~~~v~v~~~V~~~~~~-~~~~~sv~~e~dg~~--~~----~~~~~~ 233 (510) T protein:vir:63 180 E------------------YKQDLMRA-GRNLSGSGSVDLYTHVQRKKGT-AMEYAELYHEIDGVR--VG----KEGRWP 233 (510) T ss_pred H------------------hhhhhhcc-ccccCCCcceEEEEEEEeecCC-CceEEEEEEEecCce--ec----cccccc Confidence 0 00000000 0001111222344433333322 223344443333321 11 1111 1 Q ss_pred -cccceeeEEEEecCCCCCCccccc---hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEeccccee Q lcl|NC_019406. 312 -RTLPFIPFVFFGSMSNAADCEKPP---LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVW 386 (661) Q Consensus 312 -~~L~~IPfv~~~~~~~~~~~~~pP---LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~ 386 (661) ..+++||+.|.-..+..+..+ |- |-|+..||.-+ .+.+..+.+....|.++-+ |..+. ..+.-|.+..+ T Consensus 234 ~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~---~~~l~~a~~a~~~~~lv~p~g~~~~--~~~~~~~~g~~ 307 (510) T protein:vir:63 234 IHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLS---EKLGLYELESLEVLNLVDEAKGAVV--DDYQDAEMGDY 307 (510) T ss_pred cccCceeeeeeeecCCCccccc-hHHHHHHHHHHHHHHH---HHHHHHHHHhccCCcccCcccccch--hhhccCCCcee Confidence 346788888876666666554 22 44666666432 3556667777777877664 33221 23455554444 Q ss_pred ecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHHhHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHH--- Q lcl|NC_019406. 387 VVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIGGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDG--- 462 (661) Q Consensus 387 ~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lGArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~A--- 462 (661) . |+.....+.++.. +..+....+.|+++++.+...=.-.+. +..+...||++...+...-...|..+-..+.+= T Consensus 308 v-~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~l~-~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~ 385 (510) T protein:vir:63 308 V-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQS 385 (510) T ss_pred e-cCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHhhcc-cCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 3 3322334444432 345788899999999988864211121 223345699999999999999998877774443 Q ss_pred --HHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcC-CCCHHHHHHHHHhcCCCCccCCHHHHH Q lcl|NC_019406. 463 --MTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGG-LLPIDALYENFVKNGIIPSTQTLEEFT 539 (661) Q Consensus 463 --l~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG-~Is~et~~~eL~r~gvl~~~~~~Eee~ 539 (661) +.+++.++-+ .|+....+..++..+ ...+ .+|-.+.... ..+...++..+..-.-+.+..++++.. T Consensus 386 Pli~r~~~il~r-~gl~p~p~~~~~~~~-----v~~i-----s~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~ 454 (510) T protein:vir:63 386 PLAYVCLSEVDD-ALLQGLITKQHKPAI-----ETGL-----PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMM 454 (510) T ss_pred HHHHHHHHHHHh-ccCCCCCchhcccce-----ecch-----hHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHH Confidence 3333444432 243332233332211 1111 1111111110 111112222221112245678888888 Q ss_pred HHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCC Q lcl|NC_019406. 540 IKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGD 617 (661) Q Consensus 540 ~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 617 (661) +.+.+. +|.|-+..+. ...+ ..+...+ -.||..++++.++++.++-..+....+ |- T Consensus 455 ~~~a~~---~Gv~p~~ivr-s~ee------v~a~~~~--~~qq~~~~~~~~~~~~~~a~~~~~~~~----------g~ 510 (510) T protein:vir:63 455 DTIWAA---FSVDTSQFYK-SADE------LQAEAEQ--QRQQAAQAQAAQETLLEGASDMTNALA----------GV 510 (510) T ss_pred HHHHHH---hCCChhHhcC-CHHH------HHHHHHH--HHHHHHHHHHHHHHHHHHHHhhccccc----------CC Confidence 888765 3322221111 1000 0000000 001111111111111111111111000 11 No 135 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=93.87 E-value=0.0061 Score=32.73 Aligned_cols=525 Identities=12% Similarity=0.048 Sum_probs=190.0 Q ss_pred CCC------CCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCC-cccCC-CCCCCChHHHHHH Q lcl|NC_019406. 1 MAG------LSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQG-VKYLK-APKGFDDEDYANY 72 (661) Q Consensus 1 ~~~------~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g-~~YLP-k~~~E~~~~Y~~r 72 (661) |.+ ..|-.-++....--...+.-+..++ ....+.+. +-+.-.|..++...- .-|.+ ...+ .+-++.| T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~f~g--yql~alY 117 (765) T protein:vir:96 43 IRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYG--DGPTPAAK-AAAGGQNPYVVPTMLQDWYNSQGFIG--YQACAII 117 (765) T ss_pred HhhcccccccCCCCCCCCcccCcccceecccccc--ccccchHH-HhhhccCccchhhHHHhhhcccCCcc--HHHHHHH Confidence 111 1111111110000000000000000 00001110 001111112222211 11111 1111 1223333 Q ss_pred HhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHH Q lcl|NC_019406. 73 LDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVAL 152 (661) Q Consensus 73 l~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~ 152 (661) .. ...++..|+..+-...|+...|+...+ ...++.++.|...++.+ .+..-++++++ T Consensus 118 ~~----~~l~rkiVd~pAeDa~R~g~~I~~~~~--------------e~~~~~~~~l~~~~~rl-----~v~~~l~ea~~ 174 (765) T protein:vir:96 118 SQ----HWLVDKACSMSGEDAARNGWELKSDGR--------------KLSDEQSALIARRDMEF-----RVKDNLVELNR 174 (765) T ss_pred Hh----CchhhhhhhcchHHhhcCCceeecCcc--------------ccCHHHHHHHHHHHHHh-----hHHHHHHHHHH Confidence 33 334455666666666788777743211 12244455666655555 57788888888 Q ss_pred HHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechh Q lcl|NC_019406. 153 EQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSE 232 (661) Q Consensus 153 ~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e 232 (661) .+-.||.++++|+.-..+.... ..|. .++.| ++..|..++..... +..|++. T Consensus 175 ~~RlyGga~i~i~i~~~D~~~l-~~PL----~~~~I---------~kg~~kgl~vldp~---------~~~~~~v----- 226 (765) T protein:vir:96 175 FKNVFGVRIALFVVESDDPDYY-EKPF----NPDGI---------APGSYKGISQIDPY---------WAMPQLT----- 226 (765) T ss_pred HhhhceeeEEEEEecccCcchh-hccc----ccccc---------ccceeeEEEEechh---------hcccccc----- Confidence 8888998888876532221100 0111 11110 01111111110000 0000000 Q ss_pred hhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCc Q lcl|NC_019406. 233 TAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGR 312 (661) Q Consensus 233 ~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~ 312 (661) .+....+...+ |..-..|++. +..+. .+ .+....|. T Consensus 227 ---------------------~e~~~Dp~sp~--fg~P~~y~i~-------g~~IH-------------~S-Rli~~~g~ 262 (765) T protein:vir:96 227 ---------------------AESTADPSAEH--FYEPDFWIIS-------GKKYH-------------RS-HLVVVRGP 262 (765) T ss_pred ---------------------hhccccccccc--cCcceeeeec-------Cceec-------------cc-eEEEecCC Confidence 00011111111 1112223321 10111 11 11111222 Q ss_pred ccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhh-hHHHHHHHhcCceeEEecCCCC-CCce---------eEec Q lcl|NC_019406. 313 TLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYA-ELEHGRFFTALPTYYAPELDDS-DASE---------YHIG 381 (661) Q Consensus 313 ~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sS-Dl~~il~~~~~P~l~i~Gl~~~-~~~~---------l~iG 381 (661) +++.+ .....+.+ + -|++..++=-|..|...+ .--++++...+.++.+.++..- ..+. ...+ T Consensus 263 ~lpd~----lk~~~~~~--G-~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~ 335 (765) T protein:vir:96 263 QPPDI----LKPTYIFG--G-IPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRD 335 (765) T ss_pred Cchhh----hccccCcc--C-ccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcC Confidence 22221 11111111 2 236666777777776665 5567888888888888766321 1111 1223 Q ss_pred ccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hH---Hhccccc-CccchhHHHHHHHHHHhhHHHHHHH Q lcl|NC_019406. 382 PGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GG---RLMPGMS-KSVSESDNQSALREANEQSLLLNVI 456 (661) Q Consensus 382 s~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GA---rll~~~~-~~~~eTataa~~d~~~~~S~L~~~A 456 (661) ..+.+++.. +.++..+..+-+++ .+.|+...++|... |. +|+ .++ ++-+.|++. |...=+..+.++- T Consensus 336 n~g~~~id~-ee~~e~~s~~lsgl---~d~l~~~~~~iAaas~IP~t~Lf-Gqsp~GlnATGe~---D~~nYyD~I~s~Q 407 (765) T protein:vir:96 336 NHGVKVIGI-DETMEQFDTNLSDF---DSVIMNQYQLVAAIAKTPATKLL-GTSPKGFNATGEH---ETISYHEELESIQ 407 (765) T ss_pred CceeEEecC-CcceeEEecccCCH---HHHHHHHHHHHHhhhCCCeeeec-cCCcccccCcchH---HHHHHHHHHHHHH Confidence 444555554 45666666554554 44444555555533 21 223 222 233334443 3333344455444 Q ss_pred -HHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHH--------HHHHHHHHHhcCCCCHHHHHHHHHhcC Q lcl|NC_019406. 457 -MALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDAR--------ALRAIQQLYEGGLLPIDALYENFVKNG 527 (661) Q Consensus 457 -~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~--------~l~all~~~~aG~Is~et~~~eL~r~g 527 (661) ..+..+++++++++.+..+++ .++.|+.|. ...++.. ..++...++++|.|+-.+.+.+|+..+ T Consensus 408 e~~l~p~le~L~~li~~s~~i~----~d~~i~Fnp---L~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~ 480 (765) T protein:vir:96 408 EHIFDPLLERHYLLLAKSESID----VQLEIVWNP---VDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDP 480 (765) T ss_pred HHHHHHHHHHHHHHHHHhcCCC----CcceEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccc Confidence 346888999999998876553 256777664 2222332 235677889999999999999998765 Q ss_pred CCC-ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccc----c---CCCcchhhhhcCChhhHHHHHHHhccCCC----- Q lcl|NC_019406. 528 IIP-STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSR----Q---QELDQQRAARDADFQQQELEQAERHLEID----- 594 (661) Q Consensus 528 vl~-~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~----~---~~~~q~~~~~e~d~~q~~~~~~e~~~~~~----- 594 (661) ... ...+.+++.. .+..+.++.+..+.+.... . ++-.+-. ..+. .|+......+...+. T Consensus 481 ~~g~~~l~d~~~e~-----~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~-~~eg--~~~~~~~~p~~~~p~~~~~~ 552 (765) T protein:vir:96 481 RSGYNRLTDDQAET-----EPGMSPENLAELEKAGAQSAKAKGEAERAEAQAG-AVEG--AGDPVPAAPRGTKPLAKAAE 552 (765) T ss_pred cCCCCCCCcccccc-----ccCCCccccccccCCCcccccccCccccccCCCC-ccCC--CCcccccCCcccCCcccccc Confidence 432 1122222111 1111111111111110000 0 0000000 0000 011111111111110 Q ss_pred ch-------------hHHHhhhhhhhhhhHHHhcCChhhhhh---hhhhhhHHHHhhcccccCC--CCCCCcccccCC-- Q lcl|NC_019406. 595 EE-------------KLRISAKVGSTSVAASRKLGDPEQAKP---SKAEQAQIDAQQKQAAAKP--VTPTPGTVQRGR-- 654 (661) Q Consensus 595 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~-- 654 (661) ++ +++. +++...-.++.+++-+-..... .|......+ +|++| +.++|..+--++ T Consensus 553 ~~~g~~~~~p~~~~p~~~~-~~~~~~~~~~~~~~~~~a~~~g~~v~~~~~~a~~-----~a~~ps~a~~~~~~~~~~~~~ 626 (765) T protein:vir:96 553 EGAGEAATPPSRPNPRAEL-RNLLSDLLSKLEALDDAQAPDGVDIEQDDAPGLK-----RTSKPSVSGMEPSVFSSNRIV 626 (765) T ss_pred ccCccccCccccccccccc-hhcccchhhhhhccccccccCCCCCCCCccchhh-----hhhccccCCCCCcccCCCCCC Confidence 00 0000 0000011111222111111100 111111111 22222 122333322222 Q ss_pred -CCcc---------CCC Q lcl|NC_019406. 655 -PPQN---------GAS 661 (661) Q Consensus 655 -~~~~---------~~~ 661 (661) |-+| |-. T Consensus 627 ~P~~~~~~~~~~~~~~~ 643 (765) T protein:vir:96 627 GPRDHSELQRIKVNGIT 643 (765) T ss_pred CCcccccccceeecceE Confidence 1112 111 No 136 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=93.76 E-value=0.0065 Score=32.59 Aligned_cols=505 Identities=13% Similarity=0.110 Sum_probs=193.6 Q ss_pred CCCCCCccccccccccccccCC--ccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFT--HLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAF 78 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~ 78 (661) || +--++..+++... .+.-.-+-....++|+-|.+.+--.. .|.........+. -.| T Consensus 1 m~-------~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---------~~~~~~~~~~~~~-----~~~ 59 (535) T protein:vir:33 1 MA-------DSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL---------FPKESDNESTDYT-----TPW 59 (535) T ss_pred CC-------hhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---------cCCCCCccccccc-----ccc Confidence 32 1112222222110 11111122234556666665543311 0111000011110 123 Q ss_pred cchHHHHHHHHhchhhccCccccccchhhHhhhh----cccccccccchhhhhhhHhhhhhcc------CCCCCHHHHHH Q lcl|NC_019406. 79 YNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGR----DAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGFAK 148 (661) Q Consensus 79 ~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~----d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~fa~ 148 (661) -+.-.+.++.+...++.- + .|+. .|+. |.+=......+.....+..+|+.|. +.-++.+.-+- T Consensus 60 dst~~~a~~~Laa~l~~~---l--tP~~--~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~ 132 (535) T protein:vir:33 60 QAVGARGLNNLASKLMLA---L--FPMQ--SWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLF 132 (535) T ss_pred cccHHHHHHHHHHHHHHh---h--cCCC--cccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHH Confidence 344445555544443331 1 1110 1111 0000000000111112222222221 24566888888 Q ss_pred HHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeee Q lcl|NC_019406. 149 TVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGR 228 (661) Q Consensus 149 ~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~ 228 (661) .++.+.+.+|-+.+++|-+.. . + ..+..|+-. ++-+. .++.+.+.-|..++... T Consensus 133 ~~~~~L~~~G~a~l~~~~~~~--~--~--~~f~~~pl~---~~~v~-~d~~G~vd~i~r~~~~t---------------- 186 (535) T protein:vir:33 133 ECLKQLIVAGNALLYLPEPEG--S--Y--NPMKLYRLS---SYVVQ-RDAYGNVLQIVTRDQIA---------------- 186 (535) T ss_pred HHHHHHHhhCceeEEeecCCC--C--c--eeeEEEEcC---eeEEe-eCCCCCeeEEEeeEeec---------------- Confidence 889999999999999985421 1 1 112222221 12222 12222232233332221 Q ss_pred echhhh-hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 229 EGSETA-QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 229 ~~~e~v-i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) +..+ -.|... ......+ ...+....+|....+. .+++. |.|+.+.++... + T Consensus 187 --~~ql~~~~~~~-----------~~~~~~~-----k~~~~~~~v~~~v~~~-~~~~~--~~~~~~~~~~~~-------~ 238 (535) T protein:vir:33 187 --FGALPEDVRSA-----------VEKSGGE-----KKMDEMVDVYTHVYLD-EESGD--YLKYEEVEDVEI-------D 238 (535) T ss_pred --HHHHHHHhhhh-----------hcccccc-----cccccCCeEEEEEEee-CCCCc--EEEEEEEeCccc-------c Confidence 1111 001100 0000001 1112233444443332 22333 344433333211 1 Q ss_pred cc-CC---cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEe Q lcl|NC_019406. 308 MV-RG---RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHI 380 (661) Q Consensus 308 ~~-~g---~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~i 380 (661) .. ++ ..+++||+.|.-..+..+..+ ..-|-|+..||.-+-.. +..+......|.++-+ |.... ..+.- T Consensus 239 ~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~---l~~~~~~~~p~~lv~~~g~~~~--~~~~~ 313 (535) T protein:vir:33 239 GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI---VKMSMISAKVIGLVNPAGITQP--RRLTK 313 (535) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH---HHHHHHHhcCceeeccccccch--hhccc Confidence 11 11 236778887876655555443 11245777777654332 4444444455545433 22221 12334 Q ss_pred cccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHH--hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHH Q lcl|NC_019406. 381 GPGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAI--GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIM 457 (661) Q Consensus 381 Gs~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~l--GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~ 457 (661) |..+.+. +...++...++.. +..+....+.|+++++.+... --.+.. ..+...||++...+...-...|..+-. T Consensus 314 ~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~--~~~~r~TAtEV~~r~~E~~~~LG~v~~ 390 (535) T protein:vir:33 314 AQTGDFV-PGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQ--RTGERVTAEEIRYVASELEDTLGGVYS 390 (535) T ss_pred CCceeee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhccc--CCCccccHHHHHHHHHHHHHHHhHHHH Confidence 4444443 3333455555432 456899999999999998763 112221 223457999999999999999998888 Q ss_pred HHHHHH-HHHHHHHHHHc---C-CCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhc--CCCC Q lcl|NC_019406. 458 ALEDGM-TSVVRYWLMFR---D-IPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKN--GIIP 530 (661) Q Consensus 458 ~le~Al-~~aL~~~A~w~---G-~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~--gvl~ 530 (661) ++++=+ .-++..+-..+ | ++......++++| . ..+ .++..+.. .-+-..|...+..- .+++ T Consensus 391 rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y----i-s~L-----a~aqr~~~--~~~l~~~~~~la~~~P~~~d 458 (535) T protein:vir:33 391 ILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI----S-TGL-----EAIGRGQD--LDKLERCISAWAALAPMQGD 458 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE----e-cHH-----HHHHHHHH--HHHHHHHHHHHHhhChhhhh Confidence 865543 33333333333 2 2222233344443 2 222 11111111 11112233333322 3455 Q ss_pred ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhh Q lcl|NC_019406. 531 STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVA 610 (661) Q Consensus 531 ~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 610 (661) ...++++..+.+.+- +|.|-. .+.+. +++..+ ..+ |+-.+.|.-+.+.+ .|.+ +. T Consensus 459 ~~id~d~~~~~~a~~---~Gvp~~-~i~~~----~ee~~~---~~~----q~~~~~~~~~~~~~---------~g~~-~~ 513 (535) T protein:vir:33 459 PDINLAVIKLRIANA---IGIDTS-GILLT----DEQKQA---LMM----QDAAQTGVENAAAA---------GGAG-VG 513 (535) T ss_pred ccCCHHHHHHHHHHH---cCCCHh-HhcCC----HHHHHH---HHH----HHHHHHHHHHHHHh---------hhhh-hc Confidence 557888888888764 333211 11111 111111 111 11111110000000 0111 11 Q ss_pred HHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCC Q lcl|NC_019406. 611 ASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTP 647 (661) Q Consensus 611 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 647 (661) ++-+.| |+ +++ +++..---.|. T Consensus 514 ~~~~~~-~~-------------~~~-~~~~~~g~~~~ 535 (535) T protein:vir:33 514 ALATSS-PE-------------AMQ-GAAAKAGLNAT 535 (535) T ss_pred chhhcC-Ch-------------hHH-HHHHhccCCCC Confidence 111111 10 000 11111101111 No 137 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=93.73 E-value=0.0066 Score=32.55 Aligned_cols=511 Identities=14% Similarity=0.115 Sum_probs=188.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC---CCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK---GFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~---~E~~~~Y~~rl~rA~ 77 (661) |+-.++.+.=+.|-. .-..+-..+.++|+-|.+. .||... ..+...=..|. .-. T Consensus 1 M~~~~~~~~l~~r~~---------~l~~~R~~~e~~w~e~~~~-------------~lP~~~~~~~~~~~~~~~~~-~~~ 57 (555) T protein:vir:10 1 MAEQTERKLLLSRWG---------QLRTERESWMSHWKEISDY-------------LLPRAGRFFVQDRNRGEKRH-NNI 57 (555) T ss_pred CCCcccHHHHHHHHH---------HHHHHhhHHHHHHHHHHHH-------------hCcccccccCCCCCcchhcc-ccc Confidence 332222211111100 0011112223444444443 345411 11111001111 124 Q ss_pred ccchHHHHHHHHhchhhcc-----Ccccc-ccc-------hhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRR-----PPVIR-NLP-------NTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQ 144 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk-----~p~i~-~~p-------~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~ 144 (661) |-+.-.+.++.+...+..- .|=+. .+. ..++.+++.+. +.+... +..++++ T Consensus 58 ~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve-----------~~~~~~-----l~~snf~ 121 (555) T protein:vir:10 58 LDNTGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVT-----------RLMLMI-----FAKSNTY 121 (555) T ss_pred ccccHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHH-----------HHHHHH-----HHhcCcH Confidence 4444455555544333321 11010 000 01122222111 111111 3457788 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .-+-.++.+.+.+|-+-++++-... .+. .+..|...+ +-+. .++.+.+.-|..++. T Consensus 122 ~~~~~~~~~Lv~~G~a~l~~~~d~~----~~~--rf~~~pl~~---~~v~-~d~~G~vd~i~r~~~-------------- 177 (555) T protein:vir:10 122 RALHSMYEELGAFGTASSIVLPDFD----AVV--YHHSLTAGE---YAIA-ADNQGRVNTLYREFQ-------------- 177 (555) T ss_pred HHHHHHHHHHHhhCceEEEEecCCC----ceE--EEEEeecce---eEEe-eCCCCCEEEEEEEEe-------------- Confidence 8888889999999998888884321 111 122222222 1111 111222221111111 Q ss_pred eeeeechhhh-hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeec----ccccce--EEEEEEEecCc Q lcl|NC_019406. 225 WIGREGSETA-QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILEL----QKDGSR--VYKQFVYVEDP 297 (661) Q Consensus 225 ~i~~~~~e~v-i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~----g~~g~~--~~~~~~~~~~~ 297 (661) +++..+ -.|.... +....... ... ...+ ...++++...... +..+.. -|..+.+..+. T Consensus 178 ----~t~~ql~~~fg~~~---l~~~~~~~----~~~-~~~~---~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 178 ----ITVAQMVREFGKDK---CSTTVQSL----FDR-GALE---QWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred ----ccHHHHHHhcCccc---CCHHHHHH----Hhc-CCCC---ceEEEEEEEeeccCcCcCCCCccccceEEEEEEecc Confidence 111111 1111000 00000000 010 0100 1112222211111 111111 12222233222 Q ss_pred ccccccceeeccCC-cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC Q lcl|NC_019406. 298 LGQARDVYTPMVRG-RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD 374 (661) Q Consensus 298 ~~~~~~~~~p~~~g-~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~ 374 (661) .+. .+...+| ..+++||+.|.-..+..+..+ ..-|-|+..||.-+- +-++.+-.....|.++-.+. . T Consensus 243 d~~----~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~---~~l~~~~~~~~pp~~v~~~~---~ 312 (555) T protein:vir:10 243 DET----RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQL---RKAQAIDYKSNPPLQLPVSA---K 312 (555) T ss_pred CCc----cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHH---HHHHHHHHHhcCceeecccc---c Confidence 111 1111222 357888888876666666443 112557777776333 22444444444455444422 2 Q ss_pred CceeEecccceeecCC-CCCcce--EeecCchhHHHHHHHHHHHHHHHHHHhH----HhcccccCccchhHHHHHHHHHH Q lcl|NC_019406. 375 ASEYHIGPGRVWVVDK-ESGIPG--IIEFKGEGLKTLERALNEKEQQIAAIGG----RLMPGMSKSVSESDNQSALREAN 447 (661) Q Consensus 375 ~~~l~iGs~~~~~lp~-~ga~~~--ylE~~g~~i~a~~~~L~~le~qM~~lGA----rll~~~~~~~~eTataa~~d~~~ 447 (661) ...+.+-++....... .+++.- .++. +..+....+.|+++++.+...=. .++.. .++...||++...+... T Consensus 313 ~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~-~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:10 313 NQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERIKASFYADLFLMLAN-GTNPQMTATEVAERHEE 390 (555) T ss_pred cccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCCcccHHHHHHHHHH Confidence 2335555555433321 122211 2232 34578888999999999886432 12321 23456899999999999 Q ss_pred hhHHHHHHHHHHHH-HHHHHH----HHHHHHcCC-CCCCcceE-EEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019406. 448 EQSLLLNVIMALED-GMTSVV----RYWLMFRDI-PLTDTATL-RYEIDATFLTTALDARALRAIQQLYEGGLLPIDALY 520 (661) Q Consensus 448 ~~S~L~~~A~~le~-Al~~aL----~~~A~w~G~-~~~~~~~~-~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~ 520 (661) -...|..+-.++.+ .+.-++ .++-+- |+ +.. +.++ ...|+-+|.. -|.+..++.....| .-++ T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~-P~~l~~~~i~v~yis------~La~aq~~~~~~~i--~~~l 460 (555) T protein:vir:10 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPP-PQEMQGVDLNVEFVS------MLAQAQRAIATNSV--DRFV 460 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCC-chhhcCceeEEEecc------HHHHHHHHHHHHHH--HHHH Confidence 99999998888744 333333 333331 21 111 0011 0111112222 22222333222111 1122 Q ss_pred HHHHh-cCCCC---ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCch Q lcl|NC_019406. 521 ENFVK-NGIIP---STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEE 596 (661) Q Consensus 521 ~eL~r-~gvl~---~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~ 596 (661) ..+.. .++-| +-.++++..+.+.+- +|.|. ..+. . .++..+-+. +++..++++.++++++. T Consensus 461 ~~i~~laq~~P~vld~id~d~~~~~~a~~---~Gvp~-~~ir-s----~eev~~~r~------qr~~~~q~~~~a~~~~q 525 (555) T protein:vir:10 461 GNLGAVAGIKPEVLDKFDADRWADTYADM---LGIDP-ELIV-P----GNQVALIRK------QRADQQQAAQQAALLNQ 525 (555) T ss_pred HHHHHHhcCChhhhhcCCHHHHHHHHHHH---hCCCc-cccC-C----HHHHHHHHH------HHHHHHHHHHHHHHHHH Confidence 22211 12212 347788888888764 33331 1111 1 010010000 00111122222222222 Q ss_pred hHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHH Q lcl|NC_019406. 597 KLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQI 631 (661) Q Consensus 597 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (661) =.++.+.+|++.-+ +........+|.---- T Consensus 526 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 526 GADTAAKLGSVDTS-----KQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHhcccccC-----cchhHHHHHhhhccCC Confidence 22223333332222 1111111111111000 No 138 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=93.73 E-value=0.0066 Score=32.55 Aligned_cols=511 Identities=14% Similarity=0.115 Sum_probs=188.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC---CCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK---GFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~---~E~~~~Y~~rl~rA~ 77 (661) |+-.++.+.=+.|-. .-..+-..+.++|+-|.+. .||... ..+...=..|. .-. T Consensus 1 M~~~~~~~~l~~r~~---------~l~~~R~~~e~~w~e~~~~-------------~lP~~~~~~~~~~~~~~~~~-~~~ 57 (555) T protein:vir:10 1 MAEQTERKLLLSRWG---------QLRTERESWMSHWKEISDY-------------LLPRAGRFFVQDRNRGEKRH-NNI 57 (555) T ss_pred CCCcccHHHHHHHHH---------HHHHHhhHHHHHHHHHHHH-------------hCcccccccCCCCCcchhcc-ccc Confidence 332222211111100 0011112223444444443 345411 11111001111 124 Q ss_pred ccchHHHHHHHHhchhhcc-----Ccccc-ccc-------hhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRR-----PPVIR-NLP-------NTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQ 144 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk-----~p~i~-~~p-------~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~ 144 (661) |-+.-.+.++.+...+..- .|=+. .+. ..++.+++.+. +.+... +..++++ T Consensus 58 ~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve-----------~~~~~~-----l~~snf~ 121 (555) T protein:vir:10 58 LDNTGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVT-----------RLMLMI-----FAKSNTY 121 (555) T ss_pred ccccHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHH-----------HHHHHH-----HHhcCcH Confidence 4444455555544333321 11010 000 01122222111 111111 3457788 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .-+-.++.+.+.+|-+-++++-... .+. .+..|...+ +-+. .++.+.+.-|..++. T Consensus 122 ~~~~~~~~~Lv~~G~a~l~~~~d~~----~~~--rf~~~pl~~---~~v~-~d~~G~vd~i~r~~~-------------- 177 (555) T protein:vir:10 122 RALHSMYEELGAFGTASSIVLPDFD----AVV--YHHSLTAGE---YAIA-ADNQGRVNTLYREFQ-------------- 177 (555) T ss_pred HHHHHHHHHHHhhCceEEEEecCCC----ceE--EEEEeecce---eEEe-eCCCCCEEEEEEEEe-------------- Confidence 8888889999999998888884321 111 122222222 1111 111222221111111 Q ss_pred eeeeechhhh-hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeec----ccccce--EEEEEEEecCc Q lcl|NC_019406. 225 WIGREGSETA-QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILEL----QKDGSR--VYKQFVYVEDP 297 (661) Q Consensus 225 ~i~~~~~e~v-i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~----g~~g~~--~~~~~~~~~~~ 297 (661) +++..+ -.|.... +....... ... ...+ ...++++...... +..+.. -|..+.+..+. T Consensus 178 ----~t~~ql~~~fg~~~---l~~~~~~~----~~~-~~~~---~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:10 178 ----ITVAQMVREFGKDK---CSTTVQSL----FDR-GALE---QWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred ----ccHHHHHHhcCccc---CCHHHHHH----Hhc-CCCC---ceEEEEEEEeeccCcCcCCCCccccceEEEEEEecc Confidence 111111 1111000 00000000 010 0100 1112222211111 111111 12222233222 Q ss_pred ccccccceeeccCC-cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC Q lcl|NC_019406. 298 LGQARDVYTPMVRG-RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD 374 (661) Q Consensus 298 ~~~~~~~~~p~~~g-~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~ 374 (661) .+. .+...+| ..+++||+.|.-..+..+..+ ..-|-|+..||.-+- +-++.+-.....|.++-.+. . T Consensus 243 d~~----~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~---~~l~~~~~~~~pp~~v~~~~---~ 312 (555) T protein:vir:10 243 DET----RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQL---RKAQAIDYKSNPPLQLPVSA---K 312 (555) T ss_pred CCc----cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHH---HHHHHHHHHhcCceeecccc---c Confidence 111 1111222 357888888876666666443 112557777776333 22444444444455444422 2 Q ss_pred CceeEecccceeecCC-CCCcce--EeecCchhHHHHHHHHHHHHHHHHHHhH----HhcccccCccchhHHHHHHHHHH Q lcl|NC_019406. 375 ASEYHIGPGRVWVVDK-ESGIPG--IIEFKGEGLKTLERALNEKEQQIAAIGG----RLMPGMSKSVSESDNQSALREAN 447 (661) Q Consensus 375 ~~~l~iGs~~~~~lp~-~ga~~~--ylE~~g~~i~a~~~~L~~le~qM~~lGA----rll~~~~~~~~eTataa~~d~~~ 447 (661) ...+.+-++....... .+++.- .++. +..+....+.|+++++.+...=. .++.. .++...||++...+... T Consensus 313 ~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~-~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:10 313 NQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERIKASFYADLFLMLAN-GTNPQMTATEVAERHEE 390 (555) T ss_pred cccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCCcccHHHHHHHHHH Confidence 2335555555433321 122211 2232 34578888999999999886432 12321 23456899999999999 Q ss_pred hhHHHHHHHHHHHH-HHHHHH----HHHHHHcCC-CCCCcceE-EEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019406. 448 EQSLLLNVIMALED-GMTSVV----RYWLMFRDI-PLTDTATL-RYEIDATFLTTALDARALRAIQQLYEGGLLPIDALY 520 (661) Q Consensus 448 ~~S~L~~~A~~le~-Al~~aL----~~~A~w~G~-~~~~~~~~-~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~ 520 (661) -...|..+-.++.+ .+.-++ .++-+- |+ +.. +.++ ...|+-+|.. -|.+..++.....| .-++ T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~-P~~l~~~~i~v~yis------~La~aq~~~~~~~i--~~~l 460 (555) T protein:vir:10 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPP-PQEMQGVDLNVEFVS------MLAQAQRAIATNSV--DRFV 460 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCC-chhhcCceeEEEecc------HHHHHHHHHHHHHH--HHHH Confidence 99999998888744 333333 333331 21 111 0011 0111112222 22222333222111 1122 Q ss_pred HHHHh-cCCCC---ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCch Q lcl|NC_019406. 521 ENFVK-NGIIP---STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEE 596 (661) Q Consensus 521 ~eL~r-~gvl~---~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~ 596 (661) ..+.. .++-| +-.++++..+.+.+- +|.|. ..+. . .++..+-+. +++..++++.++++++. T Consensus 461 ~~i~~laq~~P~vld~id~d~~~~~~a~~---~Gvp~-~~ir-s----~eev~~~r~------qr~~~~q~~~~a~~~~q 525 (555) T protein:vir:10 461 GNLGAVAGIKPEVLDKFDADRWADTYADM---LGIDP-ELIV-P----GNQVALIRK------QRADQQQAAQQAALLNQ 525 (555) T ss_pred HHHHHHhcCChhhhhcCCHHHHHHHHHHH---hCCCc-cccC-C----HHHHHHHHH------HHHHHHHHHHHHHHHHH Confidence 22211 12212 347788888888764 33331 1111 1 010010000 00111122222222222 Q ss_pred hHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHH Q lcl|NC_019406. 597 KLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQI 631 (661) Q Consensus 597 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (661) =.++.+.+|++.-+ +........+|.---- T Consensus 526 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 526 GADTAAKLGSVDTS-----KQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHhcccccC-----cchhHHHHHhhhccCC Confidence 22223333332222 1111111111111000 No 139 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=93.73 E-value=0.0066 Score=32.55 Aligned_cols=511 Identities=14% Similarity=0.115 Sum_probs=188.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC---CCChHHHHHHHhhhc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK---GFDDEDYANYLDRAA 77 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~---~E~~~~Y~~rl~rA~ 77 (661) |+-.++.+.=+.|-. .-..+-..+.++|+-|.+. .||... ..+...=..|. .-. T Consensus 1 M~~~~~~~~l~~r~~---------~l~~~R~~~e~~w~e~~~~-------------~lP~~~~~~~~~~~~~~~~~-~~~ 57 (555) T protein:vir:98 1 MAEQTERKLLLSRWG---------QLRTERESWMSHWKEISDY-------------LLPRAGRFFVQDRNRGEKRH-NNI 57 (555) T ss_pred CCCcccHHHHHHHHH---------HHHHHhhHHHHHHHHHHHH-------------hCcccccccCCCCCcchhcc-ccc Confidence 332222211111100 0011112223444444443 345411 11111001111 124 Q ss_pred ccchHHHHHHHHhchhhcc-----Ccccc-ccc-------hhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHH Q lcl|NC_019406. 78 FYNMTSQTQAGMVGQIFRR-----PPVIR-NLP-------NTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQ 144 (661) Q Consensus 78 ~~n~~~~tv~~l~G~vFrk-----~p~i~-~~p-------~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~ 144 (661) |-+.-.+.++.+...+..- .|=+. .+. ..++.+++.+. +.+... +..++++ T Consensus 58 ~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve-----------~~~~~~-----l~~snf~ 121 (555) T protein:vir:98 58 LDNTGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVT-----------RLMLMI-----FAKSNTY 121 (555) T ss_pred ccccHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHH-----------HHHHHH-----HHhcCcH Confidence 4444455555544333321 11010 000 01122222111 111111 3457788 Q ss_pred HHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccc Q lcl|NC_019406. 145 GFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNP 224 (661) Q Consensus 145 ~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~ 224 (661) .-+-.++.+.+.+|-+-++++-... .+. .+..|...+ +-+. .++.+.+.-|..++. T Consensus 122 ~~~~~~~~~Lv~~G~a~l~~~~d~~----~~~--rf~~~pl~~---~~v~-~d~~G~vd~i~r~~~-------------- 177 (555) T protein:vir:98 122 RALHSMYEELGAFGTASSIVLPDFD----AVV--YHHSLTAGE---YAIA-ADNQGRVNTLYREFQ-------------- 177 (555) T ss_pred HHHHHHHHHHHhhCceEEEEecCCC----ceE--EEEEeecce---eEEe-eCCCCCEEEEEEEEe-------------- Confidence 8888889999999998888884321 111 122222222 1111 111222221111111 Q ss_pred eeeeechhhh-hcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeec----ccccce--EEEEEEEecCc Q lcl|NC_019406. 225 WIGREGSETA-QRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILEL----QKDGSR--VYKQFVYVEDP 297 (661) Q Consensus 225 ~i~~~~~e~v-i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~----g~~g~~--~~~~~~~~~~~ 297 (661) +++..+ -.|.... +....... ... ...+ ...++++...... +..+.. -|..+.+..+. T Consensus 178 ----~t~~ql~~~fg~~~---l~~~~~~~----~~~-~~~~---~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~~ 242 (555) T protein:vir:98 178 ----ITVAQMVREFGKDK---CSTTVQSL----FDR-GALE---QWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEPGA 242 (555) T ss_pred ----ccHHHHHHhcCccc---CCHHHHHH----Hhc-CCCC---ceEEEEEEEeeccCcCcCCCCccccceEEEEEEecc Confidence 111111 1111000 00000000 010 0100 1112222211111 111111 12222233222 Q ss_pred ccccccceeeccCC-cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCC Q lcl|NC_019406. 298 LGQARDVYTPMVRG-RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSD 374 (661) Q Consensus 298 ~~~~~~~~~p~~~g-~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~ 374 (661) .+. .+...+| ..+++||+.|.-..+..+..+ ..-|-|+..||.-+- +-++.+-.....|.++-.+. . T Consensus 243 d~~----~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~---~~l~~~~~~~~pp~~v~~~~---~ 312 (555) T protein:vir:98 243 DET----RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQL---RKAQAIDYKSNPPLQLPVSA---K 312 (555) T ss_pred CCc----cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHH---HHHHHHHHHhcCceeecccc---c Confidence 111 1111222 357888888876666666443 112557777776333 22444444444455444422 2 Q ss_pred CceeEecccceeecCC-CCCcce--EeecCchhHHHHHHHHHHHHHHHHHHhH----HhcccccCccchhHHHHHHHHHH Q lcl|NC_019406. 375 ASEYHIGPGRVWVVDK-ESGIPG--IIEFKGEGLKTLERALNEKEQQIAAIGG----RLMPGMSKSVSESDNQSALREAN 447 (661) Q Consensus 375 ~~~l~iGs~~~~~lp~-~ga~~~--ylE~~g~~i~a~~~~L~~le~qM~~lGA----rll~~~~~~~~eTataa~~d~~~ 447 (661) ...+.+-++....... .+++.- .++. +..+....+.|+++++.+...=. .++.. .++...||++...+... T Consensus 313 ~~~~~~~pgg~~~v~~g~~~d~~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~-~~~~~~TAtEV~~r~~E 390 (555) T protein:vir:98 313 NQDISTVPGGLSYVDAAAPNGGIRTAFEV-NLDLSHLLADIVDVRERIKASFYADLFLMLAN-GTNPQMTATEVAERHEE 390 (555) T ss_pred cccceeccccccccccCCCCcceeccccc-ccchHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCCcccHHHHHHHHHH Confidence 2335555555433321 122211 2232 34578888999999999886432 12321 23456899999999999 Q ss_pred hhHHHHHHHHHHHH-HHHHHH----HHHHHHcCC-CCCCcceE-EEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019406. 448 EQSLLLNVIMALED-GMTSVV----RYWLMFRDI-PLTDTATL-RYEIDATFLTTALDARALRAIQQLYEGGLLPIDALY 520 (661) Q Consensus 448 ~~S~L~~~A~~le~-Al~~aL----~~~A~w~G~-~~~~~~~~-~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~ 520 (661) -...|..+-.++.+ .+.-++ .++-+- |+ +.. +.++ ...|+-+|.. -|.+..++.....| .-++ T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~-P~~l~~~~i~v~yis------~La~aq~~~~~~~i--~~~l 460 (555) T protein:vir:98 391 KLLMLGPVLERMHNEILDPLIELTFQRMVEA-NILPPP-PQEMQGVDLNVEFVS------MLAQAQRAIATNSV--DRFV 460 (555) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCC-chhhcCceeEEEecc------HHHHHHHHHHHHHH--HHHH Confidence 99999998888744 333333 333331 21 111 0011 0111112222 22222333222111 1122 Q ss_pred HHHHh-cCCCC---ccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCch Q lcl|NC_019406. 521 ENFVK-NGIIP---STQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEE 596 (661) Q Consensus 521 ~eL~r-~gvl~---~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~ 596 (661) ..+.. .++-| +-.++++..+.+.+- +|.|. ..+. . .++..+-+. +++..++++.++++++. T Consensus 461 ~~i~~laq~~P~vld~id~d~~~~~~a~~---~Gvp~-~~ir-s----~eev~~~r~------qr~~~~q~~~~a~~~~q 525 (555) T protein:vir:98 461 GNLGAVAGIKPEVLDKFDADRWADTYADM---LGIDP-ELIV-P----GNQVALIRK------QRADQQQAAQQAALLNQ 525 (555) T ss_pred HHHHHHhcCChhhhhcCCHHHHHHHHHHH---hCCCc-cccC-C----HHHHHHHHH------HHHHHHHHHHHHHHHHH Confidence 22211 12212 347788888888764 33331 1111 1 010010000 00111122222222222 Q ss_pred hHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHH Q lcl|NC_019406. 597 KLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQI 631 (661) Q Consensus 597 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (661) =.++.+.+|++.-+ +........+|.---- T Consensus 526 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 526 GADTAAKLGSVDTS-----KQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHhcccccC-----cchhHHHHHhhhccCC Confidence 22223333332222 1111111111111000 No 140 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=92.70 E-value=0.01 Score=31.47 Aligned_cols=438 Identities=11% Similarity=0.018 Sum_probs=178.9 Q ss_pred ccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCc-ccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhc Q lcl|NC_019406. 17 GAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGV-KYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFR 95 (661) Q Consensus 17 ~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~-~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFr 95 (661) -++.....++.|+....... -+..|..-+.+... -.| ++ +..-+-|+....| -.++...++.....|.+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~-----~~~~~~~~~~~~e~~~~l-r~-~~~~~ly~~m~e~---D~~i~s~l~~rk~av~~ 70 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGS-----GVVDGWTVWDPFEQTPEL-QW-PQSVAVYSRMDNE---DSRVTSLLEAISLPIRS 70 (469) T ss_pred CCCcccCCCCccchhhhhhc-----ccccchhhcccccccccc-cc-ccchHHHHHHHhh---ChHHHHHHHHHHHHHhc Confidence 23333334444433331100 00000001110000 001 11 2333457766554 34555555555555666 Q ss_pred cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhc------------cCCCCCHHHHHHHHHHHHHhhCCEEEE Q lcl|NC_019406. 96 RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRF------------AKDGTSHQGFAKTVALEQVAMGRFGAL 163 (661) Q Consensus 96 k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------------dl~G~sL~~fa~~~~~~~L~~Gr~gvL 163 (661) .+..|+ |+.= + .+.-+.+.+.+.+. +....+...++.+.+..++.||.+.+= T Consensus 71 ~~w~v~--p~~~-------~-------~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~E 134 (469) T protein:vir:10 71 TPWRIR--ANGA-------S-------DEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFE 134 (469) T ss_pred CCceEe--cCCC-------C-------HHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeee Confidence 666553 2210 0 01111111111110 123556778888888888889988776 Q ss_pred EeccCCCchhhc-ccceeE-eechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhh Q lcl|NC_019406. 164 VDVAPSSDPTAP-AKSYTV-GYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGR 241 (661) Q Consensus 164 VD~P~a~~~~~g-~rPY~~-~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~ 241 (661) +.|-....-..| ..|--. ...++.|--|.+++.+| T Consensus 135 ivw~~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~~~~------------------------------------------- 171 (469) T protein:vir:10 135 QVYRPRNQSPDGRFWLRKLAPRPQWTISKFNVAPDGG------------------------------------------- 171 (469) T ss_pred eeeecccccCCCceeeeeeeecCcccceeeeeccCCc------------------------------------------- Confidence 655321100000 000000 00001111111111111 Q ss_pred hcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc-ccceeeccCCcccceeeEE Q lcl|NC_019406. 242 RAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA-RDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 242 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~-~~~~~p~~~g~~L~~IPfv 320 (661) .+.... ....+.. +..+.....|.+|+.==|+ T Consensus 172 --------------------------------------------l~~~~~---~~~~~~~~~~~~~~~~~~~~lp~~k~i 204 (469) T protein:vir:10 172 --------------------------------------------LESIEQ---IAPPARTRGSLYVANIAPPEIPVNRLV 204 (469) T ss_pred --------------------------------------------eeeeee---cCcccccccccccCCCCccccccCcEE Confidence 000000 0000000 0000000011112111122 Q ss_pred E-EecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCc-------eeEecccceeecC Q lcl|NC_019406. 321 F-FGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDAS-------EYHIGPGRVWVVD 389 (661) Q Consensus 321 ~-~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~-------~l~iGs~~~~~lp 389 (661) + .+....+-..+.+.|..++..-+--=....+.-.-+..-+.|+++.. |.+++++. .+..|+++++++| T Consensus 205 ~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip 284 (469) T protein:vir:10 205 VYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLA 284 (469) T ss_pred EEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEcc Confidence 2 23333344456666666665544333355667777777789988875 22333222 1345888888999 Q ss_pred CCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH-HHH Q lcl|NC_019406. 390 KESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT-SVV 467 (661) Q Consensus 390 ~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~-~aL 467 (661) . |.++.|++.+|+. ......++...++|..+ -+..|....+++ |--..........-++.+.+..++++++ +++ T Consensus 285 ~-~~~ie~~ea~g~~-~~~~~li~~~d~~Isk~iLG~tlTs~~~gG--S~a~~~vh~ev~~d~~~sDa~~i~~tln~~li 360 (469) T protein:vir:10 285 Q-GQILELLGVSGNL-PDIRRAIEGHDRSIALSGLAHFLNLDGKGG--SYALASVLEDPFTQAVHAYATSICRIANQHII 360 (469) T ss_pred C-CceEEEeecCCCc-hHHHHHHHHHHHHHHHHHhcccccccCccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 7899999988765 45777777777776632 233443322222 2224455556667789999999999997 588 Q ss_pred HHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCH-HHHHHHH-HhcCCCCccCCHHHHHHHHhcc Q lcl|NC_019406. 468 RYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPI-DALYENF-VKNGIIPSTQTLEEFTIKMNDP 545 (661) Q Consensus 468 ~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~-et~~~eL-~r~gvl~~~~~~Eee~~~l~~~ 545 (661) ++++.|..-.. ..-.+|.+.. ...-+....+++..+.+.|.+.. +....++ .+-|+ |.-...+......+.. T Consensus 361 ~~l~~lN~g~~--~~~P~~~~~~---~e~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gi-p~~~~~~~~~~~~~~~ 434 (469) T protein:vir:10 361 EDLVDINFGVD--TPAPVLTFDP---IGSRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNL-PSELNDTPSAEPEEPA 434 (469) T ss_pred HHHHHhcCCCC--CCccEEEecC---CCCcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCC-CCCCCCcccccchhcc Confidence 99988873221 2223555432 11112223455556666666321 1111122 22343 2222111111111100 Q ss_pred CCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhh Q lcl|NC_019406. 546 KSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAK 603 (661) Q Consensus 546 ~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~ 603 (661) . .+...+ .+ ..... ...+| .+++ .+..++++|-.+ T Consensus 435 ~--~~~~~~--~~----~~~~~------~~~~~-------~~~~--~~~~~~~~l~da 469 (469) T protein:vir:10 435 A--VPNQSA--AP----ARTRS------SGNAD-------ARAR--APKADQGVLFDA 469 (469) T ss_pred c--CCCCCc--cc----cccCC------CCCcc-------cccc--cCCChHHhhccC Confidence 0 000000 00 00000 00000 0010 111222222221 No 141 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=92.35 E-value=0.012 Score=31.16 Aligned_cols=393 Identities=11% Similarity=0.075 Sum_probs=171.3 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHH---HHHHhhhcccchHHHHHHHHhchhhccC Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDY---ANYLDRAAFYNMTSQTQAGMVGQIFRRP 97 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y---~~rl~rA~~~n~~~~tv~~l~G~vFrk~ 97 (661) |.+ +++.-| .++++|. ..|..|-+ ..+...| ..|... ...+..|+...--+.|+. T Consensus 1 ~~~-~~~d~~----------~~~~~~~----~~~~~~~~---~~~~~~~~l~a~Y~~~----~l~~~~Vd~~aed~~r~g 58 (427) T protein:vir:10 1 MKI-VKHDGY----------NDIFNGG----ADGSPKPF---FMSDASYHVGSFYNDN----ATAKRIVDVIPEEMVTAG 58 (427) T ss_pred CCc-cccchH----------HHHhhcC----CCCcccCc---cccCchHHHHHHHHcC----chhhhhhccchHHhhcCC Confidence 111 112222 1234442 22333322 2233334 333333 344556666666667887 Q ss_pred ccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhccc Q lcl|NC_019406. 98 PVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAK 177 (661) Q Consensus 98 p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~r 177 (661) ..|+...+ -++|...+++. .+..-++++++.+..||.++|+++..... T Consensus 59 ~~i~g~~~--------------------~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~a~i~i~v~d~~------- 106 (427) T protein:vir:10 59 FKMSGVKD--------------------EKEFKSLWDSY-----KLDSSLVDLLCWARLYGGAAMVAIIKDNR------- 106 (427) T ss_pred ccccCccH--------------------HHHHHHHHHHh-----hHHHHHHHHHHhccccceeEEEEEecCCC------- Confidence 77753211 12333333332 57788889999999999999998753211 Q ss_pred ceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhe Q lcl|NC_019406. 178 SYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADAL 257 (661) Q Consensus 178 PY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~ 257 (661) | |+..-+ +...|. ++..+.+..+ + + -... T Consensus 107 ~------------l~~p~~-~~g~l~---------------------~l~v~d~~~~--------------~--~-~~~~ 135 (427) T protein:vir:10 107 M------------LTSQAK-PGAKLE---------------------GVRVYDRFAI--------------T--V-EKRV 135 (427) T ss_pred c------------cccccC-CCccee---------------------EEEEechhcc--------------c--c-cccc Confidence 1 000000 011111 1111111100 0 0 0000 Q ss_pred ecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccchh Q lcl|NC_019406. 258 ARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLL 337 (661) Q Consensus 258 ~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLl 337 (661) ..+..+ .|..-+.|++. +..+...++++ ........|.+++++. ....-..+.+||. T Consensus 136 ~dp~s~--~fg~P~~y~v~----~~~~~~~~~iH-----------~SRli~~~g~~~p~~~------~~~~~~~G~S~l~ 192 (427) T protein:vir:10 136 TNARSP--RYGEPEIYKVS----PGDNMQPYLIH-----------HSRVFIADGERVAQQA------RKQNQGWGASVLN 192 (427) T ss_pred cCcccc--ccCcceEEEEe----cCCCCcceEEc-----------cccEEEecCCCchhhh------cccCCcccchhhh Confidence 111111 22233344442 11111111111 1111112233332221 1111223456776 Q ss_pred HHHHHHHHHHhhhhh-HHHHHHHhcCceeEEecCCC----CCCc-e---------eEecccceeecCCCCCcceEeecCc Q lcl|NC_019406. 338 DIVELNLKHYRTYAE-LEHGRFFTALPTYYAPELDD----SDAS-E---------YHIGPGRVWVVDKESGIPGIIEFKG 402 (661) Q Consensus 338 dLA~LNl~HYq~sSD-l~~il~~~~~P~l~i~Gl~~----~~~~-~---------l~iGs~~~~~lp~~ga~~~ylE~~g 402 (661) ...+=.|..|.+.+. -.+++|...+.++-+.|+.+ .+.. . ..-|.+..+.+..++.++..+..+= T Consensus 193 ~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~l 272 (427) T protein:vir:10 193 KSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDI 272 (427) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEeccc Confidence 655444777755554 46778888888888877631 2211 0 1234455556655555666655444 Q ss_pred hhHHHHHHHHHHHHHHHHHHhHHh----ccccc-CccchhHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCC Q lcl|NC_019406. 403 EGLKTLERALNEKEQQIAAIGGRL----MPGMS-KSVSESDNQSALREANEQSLLLNVI-MALEDGMTSVVRYWLMFRDI 476 (661) Q Consensus 403 ~~i~a~~~~L~~le~qM~~lGArl----l~~~~-~~~~eTataa~~d~~~~~S~L~~~A-~~le~Al~~aL~~~A~w~G~ 476 (661) +++. +.++...++|... +++ |-.++ ++-+-|+.. |...=+..+.++- ..+..+++++++++.+ T Consensus 273 sgl~---~~~~~~~~~iaaa-~~IP~t~L~G~sp~Glnstgd~---D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~---- 341 (427) T protein:vir:10 273 SGVP---EFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIVD---- 341 (427) T ss_pred CChH---HHHHHHHHHHHhh-hCCCeeeeccCCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---- Confidence 4443 3444455555432 221 11211 222223232 3333344444443 2356667777776552 Q ss_pred CCCCcceEEEEeccccccccCCHH-----HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCc-----cCCHHHHHHHHhccC Q lcl|NC_019406. 477 PLTDTATLRYEIDATFLTTALDAR-----ALRAIQQLYEGGLLPIDALYENFVKNGIIPS-----TQTLEEFTIKMNDPK 546 (661) Q Consensus 477 ~~~~~~~~~v~ln~DF~~~~lda~-----~l~all~~~~aG~Is~et~~~eL~r~gvl~~-----~~~~Eee~~~l~~~~ 546 (661) ..++.|+.|+-......+-. ..++...++++|.|+.++.+++|+..+.... +.++|+..+ -.+.. T Consensus 342 ----s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~-~~e~~ 416 (427) T protein:vir:10 342 ----EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEE-TTEPE 416 (427) T ss_pred ----CCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccch-hcCCC Confidence 23677887764444332222 2456778888999999999999986443222 122222221 11222 Q ss_pred CCCCCchhhhhhc Q lcl|NC_019406. 547 SFIGQPDAIAMRR 559 (661) Q Consensus 547 ~~l~~ddae~~~~ 559 (661) |..+ +.+..++ T Consensus 417 p~~~--e~~~d~~ 427 (427) T protein:vir:10 417 PGLG--EKLEDEN 427 (427) T ss_pred CCCC--CCCCCCC Confidence 2211 1111111 No 142 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=91.30 E-value=0.017 Score=30.35 Aligned_cols=487 Identities=11% Similarity=0.060 Sum_probs=185.4 Q ss_pred CCCCCCcccccccc-ccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRT-KRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) |. +|.|-..- .+..-..-.+.-..+......+|+-|.+.+--.. +|+. ... .+.+| .|- T Consensus 1 ~~----~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~---------~~~~---~~~---~~~~~-~~d 60 (516) T protein:vir:96 1 MK----QSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYL---------MNDK---GDN---ETSQN-GWQ 60 (516) T ss_pred Cc----chhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccc---------cCCC---CCc---cccCC-ccc Confidence 22 22222100 0000000011112222233445555555543310 1111 111 11121 344 Q ss_pred chHHHHHHHHhchhhcc--Ccc---cc-ccchh-hHhhhhcccccccccchhhhhhhHhhhhhcc------CCCCCHHHH Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRR--PPV---IR-NLPNT-GAITGRDAEGGVQVVAPASIGKLLTQLQRFA------KDGTSHQGF 146 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk--~p~---i~-~~p~~-l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~d------l~G~sL~~f 146 (661) +.-.+.++.++..+..- ||. +. .+.+. ++.+... | .....+..+|+.|. +.-++.+.- T Consensus 61 stg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~--~-------~~~~~v~~~L~~ve~~~~~~l~~snf~~~ 131 (516) T protein:vir:96 61 GVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQR--G-------LKKTELATIFAQVETRAMKELEQRQFRPA 131 (516) T ss_pred chHHHHHHHHHHHHHhhhcCCCCcccccccChhHHhhcccc--C-------chhHHHHHHHHHHHHHHHHHHHhcCcHHH Confidence 44455555554333221 011 10 01111 1111000 0 00111222222211 345678888 Q ss_pred HHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccccccccee Q lcl|NC_019406. 147 AKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWI 226 (661) Q Consensus 147 a~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i 226 (661) +-.++.+.+.+|-+.+++|.+. + +..|+-.+ +-+. .++.+.+.-|+.+++.....- T Consensus 132 ~~~~~~~L~~~G~a~l~~d~~~------~----~~~~pl~~---y~v~-~d~~G~v~~i~rr~~~~~~~l---------- 187 (516) T protein:vir:96 132 VVEAFKHLIVAGSCMLYKPSKG------A----ISAIPMHH---YVVN-RDTNGDLLDIILLQEKALRTF---------- 187 (516) T ss_pred HHHHHHHHHhHCeEeEEecCCC------C----EEEEEcCe---EEEe-eCCCCCeeeehhhhHhhHHHH---------- Confidence 8888999999999988887431 1 11221111 1111 111111111222211110000 Q ss_pred eeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccccccee Q lcl|NC_019406. 227 GREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYT 306 (661) Q Consensus 227 ~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~ 306 (661) . ...... ..+... ......+ ..-++|.....+.+ ..|.++...++. ..+ T Consensus 188 -----------~-~~~~~~-~~~~~~-----~~~~~~~---~~v~v~~~v~~~~~----~~~~~~~~~d~~--~~~---- 236 (516) T protein:vir:96 188 -----------D-PATRAV-VEVGLK-----GKKCKED---DSVKLYTHAKYLGD----GFWELKQSADDI--PVG---- 236 (516) T ss_pred -----------H-Hhhhhh-hhhhhh-----hhhcCCC---CceEEEEeeeeeCC----ceeEEEEEeCce--eec---- Confidence 0 000000 000000 0001111 12233433333322 123333222211 111 Q ss_pred eccCC---cccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCce-eEEe-cCCCCCCceeE Q lcl|NC_019406. 307 PMVRG---RTLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPT-YYAP-ELDDSDASEYH 379 (661) Q Consensus 307 p~~~g---~~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~-l~i~-Gl~~~~~~~l~ 379 (661) ..+| ..+++||+.|.-..+..+..+ .--|-|+..||.-+- +-+.. .+.+.-|. ++-+ |... ...+. T Consensus 237 -~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~---~~l~~-~~~a~~~~~lv~p~g~~~--~~~l~ 309 (516) T protein:vir:96 237 -KVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE---AVARG-AALMADIKYLIRPGAQTD--VDHFV 309 (516) T ss_pred -cccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHH---HHHHH-HHHhcCCccccCcccccc--hhhhc Confidence 1122 347888888876666666554 112557777774332 22333 34444444 4433 3322 22355 Q ss_pred ecccceeecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHH--hHHhcccccCccchhHHHHHHHHHHhhHHHHHHH Q lcl|NC_019406. 380 IGPGRVWVVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAI--GGRLMPGMSKSVSESDNQSALREANEQSLLLNVI 456 (661) Q Consensus 380 iGs~~~~~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~l--GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A 456 (661) -|+++.+. |+......-++.. +..+....+.|+++++.+... ...+... .+...|||+...+...-...|.-+- T Consensus 310 ~~~~g~i~-~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r--~~~rvTAtEV~~r~~E~~~~LGpv~ 386 (516) T protein:vir:96 310 NSGTGEVV-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRR--DAERVTAVEIQRDALEIEQNMGGVY 386 (516) T ss_pred cCCCceee-cCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccC--CCccccHHHHHHHHHHHHHHhhhHH Confidence 56665554 4333345555543 335788899999999998762 2122222 2345799999999999999988877 Q ss_pred HHHHH-HHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCC---CCc Q lcl|NC_019406. 457 MALED-GMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-NGI---IPS 531 (661) Q Consensus 457 ~~le~-Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~gv---l~~ 531 (661) ..+.. -+.-++.++..-++-... ...+ +..|. .. +.+|..+.....|. .+...+-. .++ +-+ T Consensus 387 ~rl~~Ell~Pli~r~l~~~~p~lp-~~~v----~~~~v-s~-----l~~l~r~~~~~~i~--~~~~~i~~~~~~~p~v~d 453 (516) T protein:vir:96 387 SLFATTMQSPVAMWGLLEAGESFT-SDLV----DPVII-TG-----IEALGRMAELDKLA--NFAQYMSLPLQWPEPVLA 453 (516) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCc-cccc----cceee-ch-----HHHHHHHHHHHHHH--HHHHHHHHHhcCChhHHh Confidence 77554 444444554444442222 2222 22232 12 22233333222221 22222211 112 225 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA 611 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 611 (661) ..++++..+.+.+. +|.|. ..+. . .++ ..|...+.|+.|.+. ..+ +-.|+.--.. T Consensus 454 ~id~d~~~~~~a~~---~Gvp~-~~ir-s----~ee-----------v~~~~~~~~~~q~~~--~~a---~~~~~~~~~~ 508 (516) T protein:vir:96 454 AVKWPDYMDWVRGQ---ISAEL-PFLK-S----AEE-----------MAQEQEAQMQAQQAQ--MLE---EGVAKAVPGV 508 (516) T ss_pred cCCHHHHHHHHHHH---hCCCc-cccC-C----HHH-----------HHHHHHHHHHHHHHH--HHH---HHhhhhhhHH Confidence 67788888888765 33331 1111 1 011 111111111111000 000 0000000000 Q ss_pred HHhcCChhhhhhh Q lcl|NC_019406. 612 SRKLGDPEQAKPS 624 (661) Q Consensus 612 ~~~~~~~~~~~~~ 624 (661) . |+ |.|+| T Consensus 509 ~---~~--~~~~~ 516 (516) T protein:vir:96 509 I---QQ--ELKEA 516 (516) T ss_pred h---hc--ccccC Confidence 0 00 11111 No 143 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=91.02 E-value=0.018 Score=30.17 Aligned_cols=503 Identities=12% Similarity=0.091 Sum_probs=189.9 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) ||----++.=.. .. ..-.+.-.-+-....++|+-|.+.+--.+. +. .+..... ++.+ .|-+ T Consensus 1 m~~~~~~~~~~~----~~-~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~---------~~-~~~~~~~---~~~~-~~ds 61 (532) T protein:vir:99 1 MAEVEKTGFAAD----GA-AAAYNRLKNDRGAYETRAEDCATYTIPSVF---------PS-ATADGST---SYTT-PWQS 61 (532) T ss_pred CcchhhccccHH----HH-HHHHHHHHHHhhHHHHHHHHHHHHhhhccc---------CC-CCCcchh---hccc-cccc Confidence 332111100000 00 000000011112334556555555533210 11 1111111 1111 3444 Q ss_pred hHHHHHHHHhc----hhhcc-Ccccc-ccch-hhHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHH Q lcl|NC_019406. 81 MTSQTQAGMVG----QIFRR-PPVIR-NLPN-TGAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFA 147 (661) Q Consensus 81 ~~~~tv~~l~G----~vFrk-~p~i~-~~p~-~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa 147 (661) .-.+.++.+.. .+|.- .|=+. .+++ .+..+ . ..++.....+.+|+.+ -+.-++.+.-+ T Consensus 62 t~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~----~-----~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~ 132 (532) T protein:vir:99 62 IGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQS----I-----TSPEELTEIATGLAMVERICMNYMESNSFRPTL 132 (532) T ss_pred hHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhcc----C-----CChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHH Confidence 44455554443 33331 11110 0110 01000 0 0011112223332221 13456788888 Q ss_pred HHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee Q lcl|NC_019406. 148 KTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG 227 (661) Q Consensus 148 ~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~ 227 (661) -.++.+.+.+|-+-+++|-+..+... ...+..|+-.+ +-+. .++.+.+.-|+.++.+..+.- T Consensus 133 ~~~~~~L~~~G~a~l~~~~~~~~~~~---~~~f~~~pl~~---y~v~-~d~~G~v~~ivrr~~~~~~~l----------- 194 (532) T protein:vir:99 133 HAAIKQLLVAGNVLLYIPSTEQVEGQ---SNAPKLYKLHN---FVVE-RDAYDNVLQIVTEDKIARAAL----------- 194 (532) T ss_pred HHHHHHHHhHCcEeEEecccccccCc---ccceEEEEcCe---EEEe-eCCCCCeeeEeeeeeecHHhc----------- Confidence 88999999999999999865322211 11122222222 1111 122223333333332211100 Q ss_pred eechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceee Q lcl|NC_019406. 228 REGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTP 307 (661) Q Consensus 228 ~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p 307 (661) ++.+ +....++ . .....+..-++|.......+ ...|.|+.+.++.. ++ T Consensus 195 ---~e~~---~~~~~~~------------~----~~~~p~~~v~v~~~v~~~~~---~~~~~~~~~~~g~~-------~~ 242 (532) T protein:vir:99 195 ---PEDV---RKSLEDA------------Q----GDQNPSEEVTIYTHVYRDPE---AMVFRSYQEIDGEI-------VA 242 (532) T ss_pred ---ChHH---HHHhhcc------------c----cccCCCcceEEEEEEEecCC---CCeeEEEEeecCce-------ec Confidence 0111 0000000 0 00111222344444433322 23345554443321 11 Q ss_pred c-cCCc---ccceeeEEEEecCCCCCCcc--ccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEe Q lcl|NC_019406. 308 M-VRGR---TLPFIPFVFFGSMSNAADCE--KPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHI 380 (661) Q Consensus 308 ~-~~g~---~L~~IPfv~~~~~~~~~~~~--~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~i 380 (661) . .++- .+++||+.|.-..+..+..+ .--|-|+..||.- +.+-+..+......|.++-+ |..+. ..+.- T Consensus 243 ~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l---~~~~l~~~~~a~~~~~lv~p~g~~~~--~~~~~ 317 (532) T protein:vir:99 243 GTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENL---YEAIVKMSMISSKVLFFVNPNGVTQI--RRVAK 317 (532) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHH---HHHHHHHHHHHcCCCceeccccccch--hhhcc Confidence 1 1221 35777777766555555443 1124577777754 33445555555666656653 33221 12333 Q ss_pred cccceeecCCCCCcceEeec-CchhHHHHHHHHHHHHHHHHHHh-HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_019406. 381 GPGRVWVVDKESGIPGIIEF-KGEGLKTLERALNEKEQQIAAIG-GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMA 458 (661) Q Consensus 381 Gs~~~~~lp~~ga~~~ylE~-~g~~i~a~~~~L~~le~qM~~lG-Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~ 458 (661) |.+..+. |......+.++. ++..+....+.|+++++.+..+= ..++. ...+...||++...+...-...|..+-.. T Consensus 318 ~~~g~~v-~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~d~~r~TAtEV~~r~~E~~~~LGpv~~r 395 (532) T protein:vir:99 318 ANTGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRYVAGELEDTLGGVYSL 395 (532) T ss_pred CCCccee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCcccHHHHHHHHHHHHHHhhHHHHH Confidence 4343433 322234444442 34568889999999999987521 11121 12234579999999999999999988887 Q ss_pred HHHH-HHHHHHHHHHHc---CC-CCCCcceEEEEeccccccccC-CHHHHHHHHHHHhcCCCCHHHHHHHHHh-cCCCCc Q lcl|NC_019406. 459 LEDG-MTSVVRYWLMFR---DI-PLTDTATLRYEIDATFLTTAL-DARALRAIQQLYEGGLLPIDALYENFVK-NGIIPS 531 (661) Q Consensus 459 le~A-l~~aL~~~A~w~---G~-~~~~~~~~~v~ln~DF~~~~l-da~~l~all~~~~aG~Is~et~~~eL~r-~gvl~~ 531 (661) +.+= +.-++..+-..+ |+ +....+-+...+. .|. ..| -++.+..++. +...|.. .+-+.+ T Consensus 396 l~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv-~~i-s~Laraq~~~~l~~-----------~~~~laq~~p~~~d 462 (532) T protein:vir:99 396 LSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGL-EALGRGHDLNKLNV-----------FIDYMIKLAGLQDD 462 (532) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCChhhccccee-ecc-hHHHHHHHHHHHHH-----------HHHHHHhhcchhhh Confidence 6543 333333333322 21 1111111111111 111 111 1122222222 2222221 244456 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA 611 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 611 (661) ..++++..+.+.+- +|.|-..... . +++..+ .+ +|+..+++..++..+++. T Consensus 463 ~id~d~~~~~~a~~---~GV~~~~i~r-~----~ee~~~---~~----~q~~~~~~~~~a~~~~~~-------------- 513 (532) T protein:vir:99 463 DINLLDVKMRLANS---LGMDTTGLIL-T----QQDKQA---KM----AEASTAAGMVTAGQQMGA-------------- 513 (532) T ss_pred hCCHHHHHHHHHHH---hCCChhhccC-C----HHHHHH---HH----HHHHHHHHHHHHHHHHHH-------------- Confidence 67888888888664 2221111111 0 000000 00 000000000000000000 Q ss_pred HHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCccccc Q lcl|NC_019406. 612 SRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQR 652 (661) Q Consensus 612 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 652 (661) . ..|+++++.+.--|+-+. T Consensus 514 -------------~---------~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 514 -------------A---------GGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred -------------H---------HHHhcchhHHhhcCCCCC Confidence 0 011111111111111111 No 144 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=90.03 E-value=0.023 Score=29.56 Aligned_cols=391 Identities=12% Similarity=0.074 Sum_probs=171.9 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCC-hHHHHHHHhhhcccchHHHHHHHHhchhhccCcc Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFD-DEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPV 99 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~-~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~ 99 (661) |+-. .| ..-++++|.. ++ .|-..+..-+ ..-+..|. .....+..|+...--++|+... T Consensus 1 ~~~~-----------D~-~~n~~~gg~~----~~-~~~~~~~~~~~~~l~a~Y~----~~~l~~~~Vd~~aed~~r~g~~ 59 (422) T protein:vir:10 1 MVKT-----------DS-YANIFLGGSD----GS-EIYGSLQNQAPTILASLYA----DNALVRRIIDTIPETALAAGFH 59 (422) T ss_pred Cccc-----------hh-hHHHHcCCCC----Cc-cccCcccccCHHHHHHHHH----hChhhHHHHhhhhHHHhcCCcc Confidence 1100 00 1112334432 22 2222221111 11222332 3344566777777777888888 Q ss_pred ccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccce Q lcl|NC_019406. 100 IRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSY 179 (661) Q Consensus 100 i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY 179 (661) |+..++. .++..-+++ ..+..-++.+++.+..||.++|+++...... -.. T Consensus 60 i~~~~~~--------------------~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~---~~~-- 109 (422) T protein:vir:10 60 IDGIDDE--------------------PAFWSRWDD-----LEMTQNINDAWSWARLFGGAAIVAIVKDNRA---LTS-- 109 (422) T ss_pred ccCCCHH--------------------HHHHHHHHH-----hhHHHHHHHHHHhhccccceEEEEEecCCCC---ccc-- Confidence 7543221 122222333 2577888999999999999999998631110 001 Q ss_pred eEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheec Q lcl|NC_019406. 180 TVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALAR 259 (661) Q Consensus 180 ~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~ 259 (661) |.+. +| .+. ++..+.+..| + . ...... T Consensus 110 -----Pl~~--------~g--~~~---------------------~l~v~d~~~i--------------~--~-~~~~~d 136 (422) T protein:vir:10 110 -----PVRE--------GA--ELE---------------------TVRVYDRTQV--------------K--V-QTREEN 136 (422) T ss_pred -----cccc--------cC--cee---------------------eEEeeccccc--------------c--c-hhcccC Confidence 1110 01 011 1111111100 0 0 000011 Q ss_pred ccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHH Q lcl|NC_019406. 260 PSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDI 339 (661) Q Consensus 260 ~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldL 339 (661) +..+ .|..-..|++.- +. +...+++ +........|.+++.+ .....+. .+.+||..+ T Consensus 137 p~s~--~fg~P~~y~v~~---~~-~~~~~~i-----------H~SRli~~~g~~~p~~----~~~~~~~--~G~S~l~~~ 193 (422) T protein:vir:10 137 PRNA--RFGEPLTYRITT---NE-SDMFYDV-----------HYSRIHIIDGERIPNV----MRRQNDG--WGRSVLSSD 193 (422) T ss_pred cccc--ccCcceEEEEec---CC-CCcceee-----------ccceeEEeCCCCchhh----hcccCCc--ccchhHHHH Confidence 1111 122223343321 10 0000111 1111111123333221 1111222 356788877 Q ss_pred HHHHHHHHhhhhhH-HHHHHHhcCceeEEecCCC----CCCc----------eeEecccceeecCCCCCcceEeecCchh Q lcl|NC_019406. 340 VELNLKHYRTYAEL-EHGRFFTALPTYYAPELDD----SDAS----------EYHIGPGRVWVVDKESGIPGIIEFKGEG 404 (661) Q Consensus 340 A~LNl~HYq~sSDl-~~il~~~~~P~l~i~Gl~~----~~~~----------~l~iGs~~~~~lp~~ga~~~ylE~~g~~ 404 (661) ++=-|..|.+.+.- ..++|...+.++-+.|+.. +... ...-|.+..+.+..++.++..+..+-++ T Consensus 194 ~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg 273 (422) T protein:vir:10 194 ILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG 273 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCC Confidence 76667777766654 6678888888888887532 2111 0123445555555445567776655555 Q ss_pred HHHHHHHHHHHHHHHHHH-hH---HhcccccCccchhHHHHHHHHHHhhHHHHHHHH-HHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_019406. 405 LKTLERALNEKEQQIAAI-GG---RLMPGMSKSVSESDNQSALREANEQSLLLNVIM-ALEDGMTSVVRYWLMFRDIPLT 479 (661) Q Consensus 405 i~a~~~~L~~le~qM~~l-GA---rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~-~le~Al~~aL~~~A~w~G~~~~ 479 (661) +. +.++...++|... |. +|+-.+.++-+.|+. .+...=+..+.++-. .+..+++++++++.+ T Consensus 274 l~---~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd---~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~------- 340 (422) T protein:vir:10 274 ID---AFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQN---TALETFHKLVDRKRNAELLPILEFLIPFIVN------- 340 (422) T ss_pred hH---HHHHHHHHHHHhhhCCCeeeeccCCcccccccch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------- Confidence 54 3344445555432 11 222111122122222 223333444554443 467777777777653 Q ss_pred CcceEEEEeccccccccCCHH-----HHHHHHHHHhcCCCCHHHHHHHHHhc----CCCCccCCHHHHH-HHHhccCCCC Q lcl|NC_019406. 480 DTATLRYEIDATFLTTALDAR-----ALRAIQQLYEGGLLPIDALYENFVKN----GIIPSTQTLEEFT-IKMNDPKSFI 549 (661) Q Consensus 480 ~~~~~~v~ln~DF~~~~lda~-----~l~all~~~~aG~Is~et~~~eL~r~----gvl~~~~~~Eee~-~~l~~~~~~l 549 (661) ..++.|+.|+-......+-. ..++...++++|.|+.+..++.|+.. |+.+. ..++++. .+..+....- T Consensus 341 -s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 418 (422) T protein:vir:10 341 -AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDG-SVETEVTISETSNDPLEV 418 (422) T ss_pred -cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCC-CCccccchhhcCCCCCCC Confidence 23577777754433322111 24567788889999999999999763 33332 2222222 1111111111 Q ss_pred CCch Q lcl|NC_019406. 550 GQPD 553 (661) Q Consensus 550 ~~dd 553 (661) +.+| T Consensus 419 ~~~d 422 (422) T protein:vir:10 419 PTDD 422 (422) T ss_pred CCCC Confidence 1111 No 145 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=89.00 E-value=0.029 Score=29.02 Aligned_cols=413 Identities=12% Similarity=-0.002 Sum_probs=173.1 Q ss_pred HHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhc Q lcl|NC_019406. 34 RPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRD 113 (661) Q Consensus 34 ~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d 113 (661) .-.-.-......|... +.....|-|..+ .... |......=-....++..|+...--++|+...|+. +| T Consensus 1 ~~~~D~~~~~~~~~g~-~~~~~~~~~~~~-~~~~-~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~---------~d 68 (437) T protein:vir:52 1 MKFFDGIKSLALKLGS-KQEQTYYSPSLS-LTDD-LVQLEALWRDNWIANKVCIKRPEDMVRNWREIYS---------ND 68 (437) T ss_pred CchhhhhHhHHhcCCC-ccccceeecCcc-cccc-HHHHHHHHHhCchhhHHhhcchHHhhcCCceEec---------CC Confidence 0000001111222111 111223333332 2222 2222222123455667777777888899888742 11 Q ss_pred ccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhcccee Q lcl|NC_019406. 114 AEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTV 193 (661) Q Consensus 114 ~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~ 193 (661) . -++.+++|+..+++. .+..-++++++.+-.||.++|||..-..+. ..|. +. T Consensus 69 ~-------~~~~~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~----~~pl-------~~----- 120 (437) T protein:vir:52 69 L-------NSKQLDLFTKFERSL-----KLRETLTKALQWSSLYGSVGLLVVTDSQNT----SAPL-------KP----- 120 (437) T ss_pred C-------CHHHHHHHHHHHHhh-----cHHHHHHHHHHhcccccceEEEEEecCCCc----cccc-------cc----- Confidence 1 123445566666555 467777777777778999999987532110 0111 00 Q ss_pred eccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEE Q lcl|NC_019406. 194 EDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIY 273 (661) Q Consensus 194 ~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (661) . +.+..+ ..+.+ |...-. ..+...+..+ .|..-..| T Consensus 121 ---~--~~~~~~---------------------~v~~~-----~~v~~~-----------~~~~~dp~s~--~fg~p~~y 156 (437) T protein:vir:52 121 ---T--ERLKRL---------------------IILPK-----WKISPT-----------GTKDDDVLSP--NFGRYSEY 156 (437) T ss_pred ---C--CceeEE---------------------EEech-----hhcccc-----------cccccccccc--ccCcceEE Confidence 0 011111 11111 000000 0000111111 12222334 Q ss_pred EEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhH Q lcl|NC_019406. 274 RELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAEL 353 (661) Q Consensus 274 rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl 353 (661) ++. .+ ...+ ..+...+....|.+ +| ....+. -+.|+|. .+.=-|..|...+.. T Consensus 157 ~v~---~~---~~~~-----------~iH~SRii~~~~~~---~~----~~~~~~--~G~s~le-~~~~~i~~~~~~~~~ 209 (437) T protein:vir:52 157 SIL---GG---SQSI-----------TVHHSRLIILNAND---AP----LSDNDI--WGVSDLE-KIIDVLKRFDSASVN 209 (437) T ss_pred EEe---cC---Ccce-----------eEccceeEEecCcc---CC----Cccccc--cCCchHH-HHHHHHHHHHHHHHH Confidence 432 01 0000 00000110111111 11 122222 2555554 345555555555433 Q ss_pred -HHHHHHhcCceeEEecCC----CCCCce-------e--EecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHH Q lcl|NC_019406. 354 -EHGRFFTALPTYYAPELD----DSDASE-------Y--HIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQI 419 (661) Q Consensus 354 -~~il~~~~~P~l~i~Gl~----~~~~~~-------l--~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM 419 (661) ..++|...++++-+.|+. .+.... + .-+..+.+.++. +.++..+..+-+++. +.++...++| T Consensus 210 ~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~~~e~~~~~~sgl~---~~l~~~~~~i 285 (437) T protein:vir:52 210 VGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDA-ENEYDRKELTFTGLK---DLLTEFRNAV 285 (437) T ss_pred HHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcC-CcceEEEecCcCCHH---HHHHHHHHHH Confidence 566888888888787752 111110 1 123345556654 456766665544544 4444555555 Q ss_pred HHH-h--H-HhcccccCccchhHHHHHHHHHHhhHHHHHHHH-HHHHHHHHHHHHHHHHcCCCCCCcceEEEEecccccc Q lcl|NC_019406. 420 AAI-G--G-RLMPGMSKSVSESDNQSALREANEQSLLLNVIM-ALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLT 494 (661) Q Consensus 420 ~~l-G--A-rll~~~~~~~~eTataa~~d~~~~~S~L~~~A~-~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~ 494 (661) ... | + +|+ .++ .+.- |+ ...|...-+..+.++-. .+...++.+++++..=.+.+. ..++.|+.|+=... T Consensus 286 aaa~~iP~t~L~-G~s-~~Gl-as-ge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~--~~~~~~~f~pL~~~ 359 (437) T protein:vir:52 286 AGAADMPVTILF-GQS-VSGL-AS-GDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGL--PADWWFEFVPLTTV 359 (437) T ss_pred HHHhcCchhhhc-CcC-cccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcceEEeCCcCCc Confidence 532 1 2 222 222 1111 11 22344444455555543 467778887777665433222 23577776642222 Q ss_pred ccCCH-----HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCc Q lcl|NC_019406. 495 TALDA-----RALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELD 569 (661) Q Consensus 495 ~~lda-----~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~ 569 (661) ..-+- ...++...++++|.|+-+..+.+|+..|+++.- +.+++.+....+.+.-+.+..+... .+..+..-| T Consensus 360 s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 436 (437) T protein:vir:52 360 KQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANI-SAEHIEELKNADEFAGNFEEPEKME--GAQVQNSED 436 (437) T ss_pred CHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCC-CccccccccCCCCCCCccCCCCCCC--CCCCCCCCC Confidence 21111 123456777889999999999999999887643 2232221111110000000000000 011111111 Q ss_pred c Q lcl|NC_019406. 570 Q 570 (661) Q Consensus 570 q 570 (661) | T Consensus 437 ~ 437 (437) T protein:vir:52 437 Q 437 (437) T ss_pred C Confidence 1 No 146 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=87.35 E-value=0.039 Score=28.29 Aligned_cols=530 Identities=12% Similarity=0.042 Sum_probs=196.6 Q ss_pred cccccccccCC---ccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCC----CCCChHHHHHHHhhhcccchHH Q lcl|NC_019406. 12 RRTKRGAQQFT---HLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAP----KGFDDEDYANYLDRAAFYNMTS 83 (661) Q Consensus 12 ~~~~~~~~~~~---V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~----~~E~~~~Y~~rl~rA~~~n~~~ 83 (661) -+|+-.+-+.. .+..|---......|+...+.-.= .+.|++.- .|+-.- -+-++..|+ .++..|.+- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~-~yi~~~~tr~t~~~~~~w~----~s~t~~k~~ 75 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELM-DYIDATDTRKTSNSKLPFK----NSTTINKLA 75 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHH-HHHhhhcccccccCCCCcc----cccchHHHH Confidence 12222222222 234455545566677776665321 22232211 122211 111111222 234555555 Q ss_pred HHHHHHhchhhccCccccccch--hhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEE Q lcl|NC_019406. 84 QTQAGMVGQIFRRPPVIRNLPN--TGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFG 161 (661) Q Consensus 84 ~tv~~l~G~vFrk~p~i~~~p~--~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~g 161 (661) ..+..+.-..|.-- .|+ .+.....+++ ......-+.++++ .++= +.-.++..-...++...+.||-|+ T Consensus 76 ~~~~~l~a~~~~~~-----fp~~~w~d~~~~~~~-~~~~~~~~~i~~y---i~~K-l~e~~~~~~~~~~v~d~i~~G~~v 145 (599) T protein:vir:31 76 HLHLMITTSYMEHL-----LPNRNWVDFVGFDND-SVNAEKREIARSY---VRGK-VEASNLEGVIERMVDDFAVRGFCV 145 (599) T ss_pred HHHHHHHHHHHhhh-----cCCccceEeeecCCc-hhHHHHHHHHHHH---hhhh-hhhcchHHHHHHHHhhhcccCcee Confidence 55555554444321 111 1111112221 1112222333333 2221 233355666788889999999999 Q ss_pred EEEeccC-----CCch--hhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhh Q lcl|NC_019406. 162 ALVDVAP-----SSDP--TAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETA 234 (661) Q Consensus 162 vLVD~P~-----a~~~--~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~v 234 (661) .-|++-+ .|+. ..-..|-+..++|.+|. |...-.+-.....-||.+.. ..+-...-..++...|. ... T Consensus 146 at~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~-~Dp~A~si~d~~fivRs~~T---k~~L~~l~~~~~~~~y~-~d~ 220 (599) T protein:vir:31 146 AHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVF-WDVTADSLPKAAKCIRQLYT---LGSLKREIEEGTFPLMS-MED 220 (599) T ss_pred EeeeEEEcceeecccccccccccceEEeeccccee-eCCCCCCCCcceeeeehhhh---HHHHHHHhccCCccccc-hHH Confidence 9999542 2222 22345778888886653 21110000001111111110 00000000000001111 112 Q ss_pred hcchhhhhcchhhhhhhhhhhheecccccCCCc------e---eeEEEE---------EEEeecccccceEEEEEEEecC Q lcl|NC_019406. 235 QRTSGGRRAGLAERQGSARADALARPSRFTSSY------T---FRTIYR---------ELILELQKDGSRVYKQFVYVED 296 (661) Q Consensus 235 i~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~------~---~~~~~r---------v~~l~~g~~g~~~~~~~~~~~~ 296 (661) ++|. ..+|....+-..+++ . +...+. |-+|+ .|- -.|.+. T Consensus 221 ~~~~--------------~~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLe-------ywG-d~ydee 278 (599) T protein:vir:31 221 FQKL--------------REERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLT-------FMG-DFYDEE 278 (599) T ss_pred HHHH--------------HhhccCCCccccchhhhhhhccccccccccchhhhcccchhhhhh-------hhh-hhhccc Confidence 2221 111111000000000 0 010011 11111 000 012221 Q ss_pred cccccccceeeccCC----------cccceeeEEEEe---cCCCCCCcccc-chhHHH-HHHHHHHhhhhhHHHHHHHhc Q lcl|NC_019406. 297 PLGQARDVYTPMVRG----------RTLPFIPFVFFG---SMSNAADCEKP-PLLDIV-ELNLKHYRTYAELEHGRFFTA 361 (661) Q Consensus 297 ~~~~~~~~~~p~~~g----------~~L~~IPfv~~~---~~~~~~~~~~p-PLldLA-~LNl~HYq~sSDl~~il~~~~ 361 (661) ..+.....+....++ .+.+.+||++.+ -.+.-+..+-| +++++- .||+. +|..-|. +-... T Consensus 279 ~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~-~Ng~iD~---~~~~l 354 (599) T protein:vir:31 279 NDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKR-ENFREDL---HDRFL 354 (599) T ss_pred CCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHH-HHHhhhh---hhhhh Confidence 122222222222221 335668887654 22333333322 123332 25555 6666654 22233 Q ss_pred CceeEEecC-CCCCCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcccc-cCccchhH Q lcl|NC_019406. 362 LPTYYAPEL-DDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMPGM-SKSVSESD 438 (661) Q Consensus 362 ~P~l~i~Gl-~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~~~-~~~~~eTa 438 (661) .|++...|- ... .+.-+|+.+|+..+ .+.+.++.++..... ..-.+...+++|-.+ |+.....+ +..+.+|| T Consensus 355 ~p~l~~~~dl~~e---D~~~~P~~v~~~~d-~~~vq~~~p~s~~~~-a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA 429 (599) T protein:vir:31 355 HPSLKKVGDVREK---GMRGGPNHVFEVEE-TGDVQYMTPPAEVLQ-PDNQLSITLQLMEDLSGAPKESIGQRTAGEKTK 429 (599) T ss_pred ccccccccccccc---CccCCCCcceeecC-CCccccccCchhhhh-HHHHHHHHHHHHHHhhccchhhcCCcccchhhH Confidence 566666543 333 24557888988875 455666666554332 233456667767643 66654332 23457788 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHH-----HHHHHHHHcCCCCCCcceEEEEeccc-----cccccCCHHHHHHHHHH Q lcl|NC_019406. 439 NQSALREANEQSLLLNVIMALEDGMTS-----VVRYWLMFRDIPLTDTATLRYEIDAT-----FLTTALDARALRAIQQL 508 (661) Q Consensus 439 taa~~d~~~~~S~L~~~A~~le~Al~~-----aL~~~A~w~G~~~~~~~~~~v~ln~D-----F~~~~lda~~l~all~~ 508 (661) ...+.=....+-+.+.+....++.+-. .+.+...++. +.+.+++ +|++ |... +.+++..=.++ T Consensus 430 ~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D----~~~tiri-~~~e~~~~~f~~i--~redl~~~~~~ 502 (599) T protein:vir:31 430 FEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLD----ASDTIKT-FNSELGTATFLDI--TADDLNLNGQM 502 (599) T ss_pred HHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccceee-ecccccceeeEEe--ehhhhhCCeee Confidence 888888888888888888888877655 3444444443 2333433 3333 3322 22233222222 Q ss_pred HhcCC---CCHHHHHHHHHhc--CCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHH Q lcl|NC_019406. 509 YEGGL---LPIDALYENFVKN--GIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQE 583 (661) Q Consensus 509 ~~aG~---Is~et~~~eL~r~--gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~ 583 (661) +.-|. +.++.+.+.|..- .=+.....++.-+.++... +++ .+.+..+..... T Consensus 503 v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~-----l~~-~~~l~~~~~~~~----------------- 559 (599) T protein:vir:31 503 VAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNA-----VEY-LGDLDAYGIFTF----------------- 559 (599) T ss_pred eechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHH-----HHH-HHhccccccCCC----------------- Confidence 22221 1222222222110 0001111111111111110 000 111111111111 Q ss_pred HHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhhhhHHHHhhcccccC-CCCCCCcccc Q lcl|NC_019406. 584 LEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAEQAQIDAQQKQAAAK-PVTPTPGTVQ 651 (661) Q Consensus 584 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 651 (661) -+|+++.++++...=+++.++ +++|-+.. --+||..|.| T Consensus 560 ------~va~~eqq~~~~m~Q~~lq~~-----------------------~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 560 ------GIGVQEDQQLARMAQKSTQQT-----------------------EETALTQEEVGGPTTDTGQ 599 (599) T ss_pred ------chhHHHHHHHHHHHHHHHHHh-----------------------HhhhhhhhhcCCCCcccCC Confidence 111111111110000000000 00111100 1123333333 No 147 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=406 Identities=11% Similarity=0.062 Sum_probs=165.6 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |+-.. =.-+|+.+ | .+.+.|.......+.-|.+... -..-+..|. ... T Consensus 5 m~~~~----~~~~~~D~------------~----------~~~~~~~~g~~~~~~~~~~~~~--~~~l~~~Y~----~~~ 52 (435) T protein:vir:79 5 MSDKV----KAITKEDG------------Y----------NEIFGSKDGTFRPNAFYMQRAA--FKALSQFYE----EDG 52 (435) T ss_pred ccccc----ccchhhcc------------h----------hhhhcccccccccCcccCCcCC--HHHHHHHHh----cCc Confidence 32210 00001111 1 1112221111111111111111 011222233 334 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .++..|+.....++|+...|+...+ -++|...+++. .+..-++++++.+..||.+ T Consensus 53 l~~~~Vd~~aed~~r~g~~i~g~~~--------------------~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~~ 107 (435) T protein:vir:79 53 MARRIVDVIPEEMVTPGFKVDGVKN--------------------EKSFKSRWDEL-----RLNAKIIDALSWSRLFGGS 107 (435) T ss_pred hhhhhhccchHHhhcCCceecCCCh--------------------HHHHHHHHHHh-----hHHHHHHHHHHhhhccccE Confidence 5567777777888888877753111 12344444443 5778889999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +|+|........ . .|.++ +| .|. ++..+.+..|-- T Consensus 108 ~i~i~~~d~~~~---~-------~Pl~~--------~g--~i~---------------------~i~v~d~~~i~~---- 142 (435) T protein:vir:79 108 AILAVVADNKML---K-------SPVKP--------GA--QLE---------------------DIRVYDRYQITI---- 142 (435) T ss_pred EEEEEecCCCCc---c-------ccccc--------CC--cee---------------------eEEeechhhccc---- Confidence 999986421110 0 11110 01 111 111121111100 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv 320 (661) ......+..+ .|..-..|++. +.++...++ ++........|.+++..+- T Consensus 143 -------------~~~~~dp~sp--~fg~P~~y~v~----~~~~~~~~~-----------iH~SRli~~~g~~~p~~~~- 191 (435) T protein:vir:79 143 -------------HERETNARSV--RYGEPKLYKIS----PGGDIPEFF-----------VHYSRICIIDGERVSNEKR- 191 (435) T ss_pred -------------hhhccCCccc--ccCcceEEEEe----cCCCCCceE-----------EcceeEEEecCCcchhhhc- Confidence 0000111111 12222334432 111100011 1111111122333332211 Q ss_pred EEecCCCCCCccccchhHHHHHHHHHHhhhhhH-HHHHHHhcCceeEEecCCC----CCCc-e---------eEecccce Q lcl|NC_019406. 321 FFGSMSNAADCEKPPLLDIVELNLKHYRTYAEL-EHGRFFTALPTYYAPELDD----SDAS-E---------YHIGPGRV 385 (661) Q Consensus 321 ~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl-~~il~~~~~P~l~i~Gl~~----~~~~-~---------l~iGs~~~ 385 (661) ...+ ..+.+||+..++=.|..|...+.- .+++|...+.++-+.|+.. .... . ..-+.++. T Consensus 192 ---~~~~--~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~ 266 (435) T protein:vir:79 192 ---RQND--GWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKA 266 (435) T ss_pred ---cccC--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCc Confidence 1112 235567776666557777666644 6678888888888877521 1111 0 12344555 Q ss_pred eecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-h--H-HhcccccCccchhHHHHHHHHHHhhHHHHHHH-HHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-G--G-RLMPGMSKSVSESDNQSALREANEQSLLLNVI-MALE 460 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-G--A-rll~~~~~~~~eTataa~~d~~~~~S~L~~~A-~~le 460 (661) +.+..++.++..+..+-+++ .+.++...+++... | + +|+-.+.++-+-|+.. +...=+..+.++- ..+. T Consensus 267 ~~i~~~~e~~e~~~~~lsgl---~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~---d~~~yyd~i~~~Qe~~l~ 340 (435) T protein:vir:79 267 IGIDATDEEYEVLNSDVSGV---PEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNT---ALETFYKLIDRKRVEDYK 340 (435) T ss_pred eeEecCCcceEEEecccCCH---HHHHHHHHHHHHhhhCCCeeeeccCCccccccchhH---HHHHHHHHHHHHHHHHHH Confidence 55554445666666444444 44454555555532 1 1 2221111222223322 2223333333332 2245 Q ss_pred HHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCC-----HHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCH Q lcl|NC_019406. 461 DGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALD-----ARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTL 535 (661) Q Consensus 461 ~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~ld-----a~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~ 535 (661) ..+++++++++. ..++.|+.|+=......+ ....++...++++|.|+.++.+++|+.... ..-.. T Consensus 341 p~l~~l~~li~~--------s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~--~~~~~ 410 (435) T protein:vir:79 341 PILEFLLPFMIS--------ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICP--DLKIM 410 (435) T ss_pred HHHHHHHHHhhc--------CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcc--ccCCC Confidence 555665555442 135677766522221111 113456677888999999999888864221 11111 Q ss_pred HHHHHHHhccCCCCCCchhhhhhcCCccc Q lcl|NC_019406. 536 EEFTIKMNDPKSFIGQPDAIAMRRGYVSR 564 (661) Q Consensus 536 Eee~~~l~~~~~~l~~ddae~~~~g~~~~ 564 (661) +++...|.+. ..++.+..+.|..++ T Consensus 411 ~~~~~~~~~~----~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 411 DNDNIELPEP----EDLDPEPGQEGGLNK 435 (435) T ss_pred CcccccCCcc----ccCCCCCCCCCCCCC Confidence 1111112110 001111111122222 No 148 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=81.22 E-value=0.088 Score=26.38 Aligned_cols=417 Identities=13% Similarity=0.069 Sum_probs=160.7 Q ss_pred CCCCC-------CccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCc------ccCCCCCCCChH Q lcl|NC_019406. 1 MAGLS-------PNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGV------KYLKAPKGFDDE 67 (661) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~------~YLPk~~~E~~~ 67 (661) ||--- |-.+-++ .+ ..+.|..++.-..+... .|- .-| +. ..+-+ T Consensus 1 m~k~~~k~~~~~~~~~~~~----~~--------------~~~~~~~~~~~~~~~~~---~g~~~~~~~~iL-r~-~~~~~ 57 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSID----PS--------------DVPKLEGASVPVMSTSY---DVVVDREFDELL-QG-KDGLL 57 (448) T ss_pred CCCCCCCCccccCcccccc----cc--------------cchhhhhhhhhhccccc---ccccccchhHhh-cc-ccchH Confidence 43211 1111111 00 12223333333222110 110 001 11 12235 Q ss_pred HHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccC--CCCCHHH Q lcl|NC_019406. 68 DYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAK--DGTSHQG 145 (661) Q Consensus 68 ~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl--~G~sL~~ 145 (661) -|+..+.- .++...++.....|...+..|+ |.. .+.+ ..+.-+.+.+.+...++ .-.+++. T Consensus 58 ly~~m~~D----~hi~s~l~~Rk~av~~~~w~v~--p~~-----~~~~------~~~~ae~v~~~l~~~~~~~~~~~f~~ 120 (448) T protein:vir:79 58 VYHKMLSD----GTVKNALNYIFGRIRSAKWYVE--PAS-----TDPE------DIAIAAFIHAQLGIDDASVGKYPFGR 120 (448) T ss_pred HHHHHhhC----hHHHHHHHHHHHHHhcCCceEe--cCC-----CCHH------HHHHHHHHHHHhhhhhhhhccCCHHH Confidence 67776544 4555555555556666666663 210 0000 00111223333332221 1235667 Q ss_pred HHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccce Q lcl|NC_019406. 146 FAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPW 225 (661) Q Consensus 146 fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~ 225 (661) ++.+++ .++-||.+.+=+.|-. .. +|+-.+..+..+ T Consensus 121 ~~~~~l-da~~~G~s~~Eivw~~-------------------------~~-~g~~~~~~l~~r----------------- 156 (448) T protein:vir:79 121 LFAIYE-NAYIYGMAAGEIVLTL-------------------------GA-DGKLILDKIVPI----------------- 156 (448) T ss_pred HHHHHH-HhhhhcceeEEEEeee-------------------------cC-CCceeccccccc----------------- Confidence 777665 4777886665444321 10 111000000000 Q ss_pred eeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccce Q lcl|NC_019406. 226 IGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVY 305 (661) Q Consensus 226 i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~ 305 (661) +.+.+- ++.|. .++..+.+. +.+... .. T Consensus 157 ----~~~~~~----------------------------~f~~~-------------~d~~l~~~~---~~~~~~----~~ 184 (448) T protein:vir:79 157 ----HPFNID----------------------------EVLYD-------------EEGGPKALK---LSGEVK----GG 184 (448) T ss_pred ----CCcccc----------------------------ceeee-------------cCCceEEee---cCCccc----cc Confidence 000000 00000 001111110 000000 00 Q ss_pred eeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCc------ Q lcl|NC_019406. 306 TPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDAS------ 376 (661) Q Consensus 306 ~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~------ 376 (661) .+...|..+++=-|+.+.-...+...+.+.|..++..-+--=....+...-+..-+.|+++.. |.++.+.+ T Consensus 185 ~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~ 264 (448) T protein:vir:79 185 SQFVSGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKE 264 (448) T ss_pred ccCCCccccccceEEEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHH Confidence 011122222222233322223333445555555555444433455567778888899999876 44432221 Q ss_pred ---eeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH-HhHHhcccccCccchhHHHHH-HHHHHhhHH Q lcl|NC_019406. 377 ---EYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA-IGGRLMPGMSKSVSESDNQSA-LREANEQSL 451 (661) Q Consensus 377 ---~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~-lGArll~~~~~~~~eTataa~-~d~~~~~S~ 451 (661) .|..|+.++.++|. |.++.|++..|.+.. ..+.++...++|.. +.+..+....++ .+...+. .......-. T Consensus 265 av~~i~~g~~a~~iiP~-~~~ie~~ea~~~~~~-~~~~i~~~d~~Isk~iLGqtlTs~~~~--g~~~~~~~~~~~v~~~~ 340 (448) T protein:vir:79 265 IVKNFVQKPRHGIILPD-DWKFDTVDLKSAMPD-AIPYLTYHDAGIARALGIDFNTVQLNM--GVQAINIGEFVSLTQQT 340 (448) T ss_pred HHHHHhcCCceEEEecC-CceEEEEecCCCccc-HHHHHHHHHHHHHHHHhhhhhcccccc--chhhhhhhhHHHHHHHH Confidence 13468888889995 789999998876643 45566666666552 234455332211 2222221 122233455 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCC Q lcl|NC_019406. 452 LLNVIMALEDGMTS-VVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIP 530 (661) Q Consensus 452 L~~~A~~le~Al~~-aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~ 530 (661) +.+-+..+++++++ ++.+++.|---+. ..-.+|.+. ..+++++.++.+.... +. ++ T Consensus 341 ~~aDa~~i~~tln~~li~~l~~lNfg~~--~~~P~~~f~------~~e~~Dl~~~a~~~~~--l~-----------~~-- 397 (448) T protein:vir:79 341 IISLQREFASAVNLYLIPKLVLPNWPSA--TRFPRLTFE------MEERNDFSAAANLMGM--LI-----------NA-- 397 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCc--CCCcEEEec------CCChHHHHHHHHHhhh--hh-----------cc-- Confidence 67888999999985 8899888873221 111234432 1134555554443321 10 00 Q ss_pred ccCCHHHH-HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhc Q lcl|NC_019406. 531 STQTLEEF-TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERH 590 (661) Q Consensus 531 ~~~~~Eee-~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~ 590 (661) ....++. .+.+.-..+. +.++...-+....+ +......+| +.-+=..-|. T Consensus 398 -~~~~~~~~~~~~~~p~~~---~~~~~~a~~~~~~~----~~~~~~~~~--~~~~~~~~~~ 448 (448) T protein:vir:79 398 -VKDSEDIPTELKALIDAL---PSKMRRALGVVDEV----REAVRQPAD--SRYLYTRRRR 448 (448) T ss_pred -chhhHHHHHHhhcCCCCC---CCccccccCCCCcc----cccccCCcc--ccchhhcccC Confidence 0111111 1111111111 11111011111111 111112222 0000000011 No 149 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=80.80 E-value=0.092 Score=26.28 Aligned_cols=500 Identities=14% Similarity=0.062 Sum_probs=181.2 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcc-hHHHHhCCcccCCCCC---------CCChHHHHHHHhhhcccchHHHHHHHHh Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAG-EREIKAQGVKYLKAPK---------GFDDEDYANYLDRAAFYNMTSQTQAGMV 90 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G-~~~vr~~g~~YLPk~~---------~E~~~~Y~~rl~rA~~~n~~~~tv~~l~ 90 (661) |.-+. ..-......+|..++.--.= ...|++...--||... ......+..+ .|-+.-.+.++.++ T Consensus 1 m~~d~-~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~----~~dstg~~a~~~LA 75 (549) T protein:vir:10 1 MTNDD-AKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQK----MFDSTAPLALRNFV 75 (549) T ss_pred CCcch-HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccc----cccchHHHHHHHHH Confidence 33322 11111222222221111000 2222333222244321 1111111111 23333334444433 Q ss_pred ----chhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhc-cCCCCCHHHHHHHHHHHHHhhCCEEEEEe Q lcl|NC_019406. 91 ----GQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRF-AKDGTSHQGFAKTVALEQVAMGRFGALVD 165 (661) Q Consensus 91 ----G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~-dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD 165 (661) +.+|..--.+=.+. ..|.+....-.+-+++......+-.+ -...++++.-+-.++.+.+.+|-+-+++| T Consensus 76 s~l~~~ltpp~~~wF~l~------~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~ 149 (549) T protein:vir:10 76 AAMDSMITPATQLWHRLK------TGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIE 149 (549) T ss_pred HHHHhhccCCCCcccccc------CCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEe Confidence 33332110010000 00000000000111222222222111 23466788888888999999999999998 Q ss_pred ccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceee--eechhhh-hcchhhhh Q lcl|NC_019406. 166 VAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIG--REGSETA-QRTSGGRR 242 (661) Q Consensus 166 ~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~--~~~~e~v-i~w~~~~~ 242 (661) -.. + .+. . ++-+.|.+++...+.. + ...-.++ .+++..+ -.|.... T Consensus 150 ~~~--~--~~~--~----------------------f~~~pl~~~~v~~d~~-G-~vd~i~r~~~~t~~ql~~~fg~~~- 198 (549) T protein:vir:10 150 HDV--G--KGI--V----------------------YRNVPMQRLWFAENNS-G-LIDKTHVQWELTLRQAAQRFGREN- 198 (549) T ss_pred ecC--C--Cee--E----------------------EEEEEcCeEEEeeCCC-C-CeEEEEEEeecCHHHHHHhcCccc- Confidence 321 1 111 1 1122222222222211 0 0000111 1111111 1111110 Q ss_pred cchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccc----cce--EEEEEEEecCcccccccceeeccCC-cccc Q lcl|NC_019406. 243 AGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKD----GSR--VYKQFVYVEDPLGQARDVYTPMVRG-RTLP 315 (661) Q Consensus 243 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~----g~~--~~~~~~~~~~~~~~~~~~~~p~~~g-~~L~ 315 (661) +.......+. ...+..-++++......+.+ ... -|.+..+.+++. .+...+| ..++ T Consensus 199 --l~~~v~~~~~---------~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~------~il~esg~~e~P 261 (549) T protein:vir:10 199 --LSPSMQSTLE---------KDPEKSAIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRD------RIVQNSGFRTFP 261 (549) T ss_pred --CCHHHHHHhh---------cCCCceEEEEEEeecCCCCCccccccccCceEEEEEEecCC------EeeccCCcccCC Confidence 1111111110 11223334444432222111 111 122222333221 1111223 3578 Q ss_pred eeeEEEEecCCCCCCccccc----hhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe--cCCCCCCceeEecccceeecC Q lcl|NC_019406. 316 FIPFVFFGSMSNAADCEKPP----LLDIVELNLKHYRTYAELEHGRFFTALPTYYAP--ELDDSDASEYHIGPGRVWVVD 389 (661) Q Consensus 316 ~IPfv~~~~~~~~~~~~~pP----LldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~--Gl~~~~~~~l~iGs~~~~~lp 389 (661) +||+.|.-..+..+.. .| |-|+..||.-+-.. + ..++.+.-|.+.++ |.... ..+.-|+.+.... T Consensus 262 ~~~~Rw~~~~ge~YGr--gp~~~~l~D~k~L~~l~~~~---l-~~~~~~~~p~~~v~~~g~~~~--~~l~pgg~~~~~~- 332 (549) T protein:vir:10 262 FAIGRFYVGTDDVYGG--SPAYDAMPDVRMANDMAKTN---I-RGAQKLVDPPLLANEDGVLDG--FDLRSGALNWGGL- 332 (549) T ss_pred cceeeeeecCCCcccc--chHHHHHHHHHHHHHHHHHH---H-HHHHHHhcCceeecccccccc--ceeccCCcccccc- Confidence 8888887666665544 34 45777777654332 3 34444445555443 32221 1233333332221 Q ss_pred CCCCc--ceEeecCchhHHHHHHHHHHHHHHHHHH-hHHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHH-HHHHH Q lcl|NC_019406. 390 KESGI--PGIIEFKGEGLKTLERALNEKEQQIAAI-GGRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALE-DGMTS 465 (661) Q Consensus 390 ~~ga~--~~ylE~~g~~i~a~~~~L~~le~qM~~l-GArll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le-~Al~~ 465 (661) ..+++ +.-+. .+..+..+...|+++++.+... -+.++....++...||++...+...-...|..+-.++. +-+.- T Consensus 333 ~~~~~~~~~pl~-~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~P 411 (549) T protein:vir:10 333 NDKGEEMVKPLL-TGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGP 411 (549) T ss_pred CCCCccceeeec-cccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 11222 23233 3567888889899999988753 11111111134568999999999999999999888875 43333 Q ss_pred ----HHHHHHHHcCC-CCCCcceE---EEEeccccccccCC----HHHHHHHHHHHhcCCCCHHHHHHHHHhc--CCCCc Q lcl|NC_019406. 466 ----VVRYWLMFRDI-PLTDTATL---RYEIDATFLTTALD----ARALRAIQQLYEGGLLPIDALYENFVKN--GIIPS 531 (661) Q Consensus 466 ----aL~~~A~w~G~-~~~~~~~~---~v~ln~DF~~~~ld----a~~l~all~~~~aG~Is~et~~~eL~r~--gvl~~ 531 (661) ++.++-+ .|+ +. -+.++ .+.++-.|.. .|. ...+.++....+ ++..|... +++ + T Consensus 412 li~R~~~il~r-~g~lP~-~p~~l~~~~~~~~i~yis-~La~aq~~~~~~~i~~~~~--------~~~~laq~~Pe~l-d 479 (549) T protein:vir:10 412 MIAREVDILAE-AGQLPD-MPQELIDAGADVDVEYDS-PLNKAMRAGEGAAILQWLQ--------QLGIVSQFDPAAA-K 479 (549) T ss_pred HHHHHHHHHHh-cCCCCC-CChhhhcCCceeEEEeec-HHHHHHHHHHHHHHHHHHH--------HHHHHhccChhHH-h Confidence 3333333 232 11 01111 1122223322 111 112222222221 11111111 111 3 Q ss_pred cCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhH Q lcl|NC_019406. 532 TQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAA 611 (661) Q Consensus 532 ~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 611 (661) ..++++..+.+.+- +|.|. ..+. . .++..+.+.+++ ||.+..|-.++ +...-.+ T Consensus 480 ~id~d~~~~~~a~~---~Gvp~-~~ir-s----~eev~~~r~~~~----~qqq~~~~~~~-------------a~~a~~~ 533 (549) T protein:vir:10 480 VPNGARIARLLADY---GGVPV-EAMS-T----DEELQAQQAAEA----QAAQMQQMLAA-------------APVAAGA 533 (549) T ss_pred cCCHHHHHHHHHHh---cCCCc-cccC-C----HHHHHHHHHHHH----HHHHHHHHHHH-------------HHHHHHH Confidence 46778888888764 34332 1111 1 011111110100 00000010111 1122222 Q ss_pred HHhcCChhhhhhhhhhhhHH Q lcl|NC_019406. 612 SRKLGDPEQAKPSKAEQAQI 631 (661) Q Consensus 612 ~~~~~~~~~~~~~~~~~~~~ 631 (661) ...+.+-.++ ++.|.+ T Consensus 534 a~~~~~~~ta----~~~~~~ 549 (549) T protein:vir:10 534 IKDLSDAQTA----AQTARV 549 (549) T ss_pred HHhhhhhcCC----CcccCC Confidence 2232222222 122222 No 150 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=79.43 E-value=0.1 Score=25.96 Aligned_cols=442 Identities=11% Similarity=0.021 Sum_probs=157.1 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCC-----CCChHHHHHHHhh Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPK-----GFDDEDYANYLDR 75 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~-----~E~~~~Y~~rl~r 75 (661) ||- +. -..+--.|...+- +.++... ..+..|+..+. +..-+-|+.-+. T Consensus 1 ~~~-------~~--------~~~~gl~p~rl~~-----i~~~~~~------~~~~~~~~~~~~~Lr~~~~~~ly~~m~~- 53 (488) T protein:vir:95 1 MAD-------IT--------ETQESLPPFRMGE-----VGSLGLK------VKNGRIYEEPRQALRFPESIKTFQLMMR- 53 (488) T ss_pred CCC-------cc--------ccCCCCCHHHHHH-----HHHHhhc------cccchhhccchhhhcccchHHHHHHHhh- Confidence 321 11 1111122322111 1111111 11112222111 223345666543 Q ss_pred hcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHH Q lcl|NC_019406. 76 AAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQV 155 (661) Q Consensus 76 A~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L 155 (661) -.++.-.++.....|...+..|+ |+.-. +.|- ...+.-+.+...++++. .++..++.+++ .++ T Consensus 54 ---D~hi~s~l~~Rk~av~~~~w~v~--p~~~~----~~d~----~~~~~a~~v~~~l~~~~---~~~~~~i~~~l-da~ 116 (488) T protein:vir:95 54 ---DPAVAASVNIIKMFVRKVNWRFV--PPKGK----EQDP----KMLERADFFNSLMDDME---HDWADFINSVM-SFC 116 (488) T ss_pred ---ChHHHHHHHHHHHHHhcCCceEe--cCCCC----chhH----HHHHHHHHHHHHHhccC---ccHHHHHHHHH-Hhh Confidence 36666666666677777776663 21000 0000 00001122333333331 35778888887 588 Q ss_pred hhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhh Q lcl|NC_019406. 156 AMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQ 235 (661) Q Consensus 156 ~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi 235 (661) -||.+.+=+-|-...+. .++ +.|.. .+|+-.+.-|..| | .+.+ T Consensus 117 ~~G~s~~Eivw~~~~~~---~~~----------~~~~~--~dg~~~~~~i~~R---------------p------q~~~- 159 (488) T protein:vir:95 117 TYGFCVNEKVYKKRQGK---KGK----------YQSKF--DDGLIGWAKLPIR---------------N------QSTL- 159 (488) T ss_pred cccceeeeeeeeccccc---ccc----------ccccc--cCCeeeeeeeeec---------------C------cccc- Confidence 89977765555321110 000 01111 1111000000000 0 0000 Q ss_pred cchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeec-cCCcc- Q lcl|NC_019406. 236 RTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPM-VRGRT- 313 (661) Q Consensus 236 ~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~-~~g~~- 313 (661) .| |.+ ..++..+.+.. +.-..+... .+. ..|.. T Consensus 160 ~~-----------------------------f~~-----------d~d~~l~~~~~--~~~~~~~~~---~~~~~~~~~~ 194 (488) T protein:vir:95 160 DK-----------------------------WYF-----------DEDFRRVTGVR--QNLRNVSHI---AGAINLGERP 194 (488) T ss_pred cc-----------------------------eee-----------ccCCCceeecc--ccccccccc---cccccccccc Confidence 00 000 00000000000 000000000 000 00000 Q ss_pred -cceee---EEE-EecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHh--cCceeEEecCC---CC----CCc--- Q lcl|NC_019406. 314 -LPFIP---FVF-FGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFT--ALPTYYAPELD---DS----DAS--- 376 (661) Q Consensus 314 -L~~IP---fv~-~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~--~~P~l~i~Gl~---~~----~~~--- 376 (661) =..|| |++ .+....+-..+...|..++..-+ |.+.+--.+..|.- +.|+|++.|.. .. ++. T Consensus 195 ~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~--fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~ 272 (488) T protein:vir:95 195 LTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWK--YKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFV 272 (488) T ss_pred ccccccccceEEEeecCCCCccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHH Confidence 01244 222 22222333334444444433332 12222223333333 57888776631 11 111 Q ss_pred --------eeEecccceeecCCCCCcce---------EeecCchhHHHHHHHHHHHHHHHH--HHhHHhcccccCccchh Q lcl|NC_019406. 377 --------EYHIGPGRVWVVDKESGIPG---------IIEFKGEGLKTLERALNEKEQQIA--AIGGRLMPGMSKSVSES 437 (661) Q Consensus 377 --------~l~iGs~~~~~lp~~ga~~~---------ylE~~g~~i~a~~~~L~~le~qM~--~lGArll~~~~~~~~eT 437 (661) .+..|+.++.++|. |.... .++-.|......+..++...++|. .+|- .|....+.+ -| T Consensus 273 ~a~~~i~~~~~~~~~ag~iiP~-g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGq-tLT~~~~~~-Gs 349 (488) T protein:vir:95 273 QYCKTVVNDMIANDRAGLIWPR-YIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSD-VLAMGQSKY-GS 349 (488) T ss_pred HHHHHHHHHhhccchhheeecc-ccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhcc-ccccccCcc-hh Confidence 12234556667774 33332 244444444445555555555554 4443 332222111 13 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCcceEEEEeccccccccCCH-HHHHHHHHHHhcCC-C Q lcl|NC_019406. 438 DNQSALREANEQSLLLNVIMALEDGMT-SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDA-RALRAIQQLYEGGL-L 514 (661) Q Consensus 438 ataa~~d~~~~~S~L~~~A~~le~Al~-~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~-I 514 (661) --..........-++.+-+..++++++ +++.+++.|-.-.. ..-.+|.+.. . ..-|. ....++-.+...|. + T Consensus 350 ~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~--~~~P~~~~~~--~-e~~Dl~~~ae~~~~L~~~G~~i 424 (488) T protein:vir:95 350 FSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNMWDD--EEHVQITYDD--I-ETPDLEAIGSYIQKTVAVGALE 424 (488) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCccEEEecC--c-ChhhHHHHHHHHHHHHhCCCcc Confidence 334566667778889999999999997 58999999874322 1123344321 1 11121 12334455666666 3 Q ss_pred CHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCC Q lcl|NC_019406. 515 PIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEID 594 (661) Q Consensus 515 s~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~ 594 (661) +......+++++-=||.....|.+.... .+..... .+.+.... ...++...+.+. T Consensus 425 ~~~~~~~~i~e~~gip~~~~~e~~~~~~------~~~~~~~-~~~~~~~~-----~~~~~~~~~~~~------------- 479 (488) T protein:vir:95 425 VDKELSNKLREHIGLPPADESQPVSEKL------SPNSQSR-SGDGYKTA-----GEGTAKTPSAKD------------- 479 (488) T ss_pred ccHHHHHHHHHHhCCCCCCCCccccccC------CCCCCCC-CCcccCCC-----cccCCccccccc------------- Confidence 4333333344322233222112111110 0000000 00000000 000111111000 Q ss_pred chhHHHhhhhhhhhhhHHH Q lcl|NC_019406. 595 EEKLRISAKVGSTSVAASR 613 (661) Q Consensus 595 ~~~~~~~~~~~~~~~~~~~ 613 (661) .. +...+.+ T Consensus 480 ~~----------~a~~~~~ 488 (488) T protein:vir:95 480 PS----------TANKANK 488 (488) T ss_pred ch----------hhhhccC Confidence 00 0000000 No 151 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=78.26 E-value=0.12 Score=25.71 Aligned_cols=414 Identities=12% Similarity=0.045 Sum_probs=162.2 Q ss_pred CCCCC-------CccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC------CCCChH Q lcl|NC_019406. 1 MAGLS-------PNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP------KGFDDE 67 (661) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~------~~E~~~ 67 (661) ||--. |-++-+.. ...+.|..++.-..+-.. .|- +|.. ....-+ T Consensus 1 m~kk~~k~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~---~g~--~~~~~~~iLr~~~~~~ 57 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDP------------------SDVPKLEGASVPVMSTSY---DVV--VDREFDELLQGKDGLL 57 (448) T ss_pred CCCCCCCCcccCCcccccch------------------hhhhhhccchhhhccccc---ccc--cccchhHhhccccchH Confidence 54221 11111111 112223333332221100 110 1100 011234 Q ss_pred HHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccC--CCCCHHH Q lcl|NC_019406. 68 DYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAK--DGTSHQG 145 (661) Q Consensus 68 ~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl--~G~sL~~ 145 (661) -|+..+.-+.+...+ +.....|...+..|+ |.. .+. ...+.-+.+.+++.+.++ ...+++. T Consensus 58 ly~~m~~D~hi~s~l----~~Rk~av~~~~w~v~--p~~-----~~~------~d~~~ae~v~~~l~~~~~~~~~~~f~~ 120 (448) T protein:vir:77 58 VYHKMLSDGTVKNAL----NYIFGRIRSAKWYVE--PAS-----TDP------EDIAIAAFIHAQLGIDDASVGKYPFGR 120 (448) T ss_pred HHHHHhhChHHHHHH----HHHHHHHhcCCceEe--cCC-----CCH------HHHHHHHHHHHHhhchhhhhccCCHHH Confidence 677776544444444 444444555555442 110 000 000111233333333222 2457888 Q ss_pred HHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccce Q lcl|NC_019406. 146 FAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPW 225 (661) Q Consensus 146 fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~ 225 (661) ++.+++ .++-||.+.+=+.|- ... +|+-.+..+.. T Consensus 121 ~i~~~l-da~~~G~s~~Eivw~-------------------------~~~-dg~~~~~~l~~------------------ 155 (448) T protein:vir:77 121 LFAIYE-NAYIYGMAAGEIVLT-------------------------LGA-DGKLILDKIVP------------------ 155 (448) T ss_pred HHHHHH-HhhhhcceeEEEEEe-------------------------ecC-CCceeeccccc------------------ Confidence 888885 688888777655442 110 11100000000 Q ss_pred eeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccce Q lcl|NC_019406. 226 IGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVY 305 (661) Q Consensus 226 i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~ 305 (661) .+.+.+- ++.|. .++..+++.+ .+.. .+ . T Consensus 156 ---r~~~~~~----------------------------~f~~~-------------~~~~l~~~~~---~~~~--~~--~ 184 (448) T protein:vir:77 156 ---IHPFNID----------------------------EVLYD-------------EEGGPKALKL---SGEV--KG--G 184 (448) T ss_pred ---cCCCccc----------------------------eeeee-------------cCCceEEEec---CCcc--cc--c Confidence 0000000 00000 0011111110 0000 00 0 Q ss_pred eeccCCcccceeeEEEE--ec-CCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe---cCCCCCCc--- Q lcl|NC_019406. 306 TPMVRGRTLPFIPFVFF--GS-MSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP---ELDDSDAS--- 376 (661) Q Consensus 306 ~p~~~g~~L~~IPfv~~--~~-~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~---Gl~~~~~~--- 376 (661) .+...|. .||+-++ .. ...+...+...|..++..-+---....|.-.-+..-+.|+++.. |.++...+ T Consensus 185 ~~~~~~~---~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~ 261 (448) T protein:vir:77 185 SQFVNGL---EIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEA 261 (448) T ss_pred ccCCCcc---ccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHH Confidence 0001111 2454222 11 12233334444555555444434444667777888889999876 44332211 Q ss_pred ------eeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH-HhHHhcccccCccchhHHHHH-HHHHHh Q lcl|NC_019406. 377 ------EYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA-IGGRLMPGMSKSVSESDNQSA-LREANE 448 (661) Q Consensus 377 ------~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~-lGArll~~~~~~~~eTataa~-~d~~~~ 448 (661) .|..|+.++.++|. |..+.|+|..|.+- ...+.++...++|.. +.+..+....++ .++..+. ....-. T Consensus 262 l~~av~~i~~g~~a~~iiP~-g~~ie~~ea~~~~~-~~~~~i~~~d~~Isk~iLGqtlTs~~~~--g~~~~~~~~~~~v~ 337 (448) T protein:vir:77 262 AKEIVKNFVQKPRHGIILPD-DWKFDTVDLKSAMP-DAIPYLTYHDAGIARALGIDFNTVQLNM--GVQAVNIGEFVSLT 337 (448) T ss_pred HHHHHHHHhcCCceEEEecC-CceEEEEecCCCcc-CHHHHHHHHHHHHHHHHhcccccccccc--chhhhhhhhHHHHH Confidence 13568888889995 78999999877654 455666655666552 234455432222 2222222 122233 Q ss_pred hHHHHHHHHHHHHHHHH-HHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHh-c Q lcl|NC_019406. 449 QSLLLNVIMALEDGMTS-VVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVK-N 526 (661) Q Consensus 449 ~S~L~~~A~~le~Al~~-aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r-~ 526 (661) .-.+.+-+..+++.+++ ++++++.|-.... ..-.+|.+. . .+++++.++.+.... +..+.+. . T Consensus 338 ~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~--~~~P~~~f~----~--~e~eDl~~~a~~~~~-------l~~~~~~~~ 402 (448) T protein:vir:77 338 QQTIISLQREFASAVNLYLIPKLVLPNWPGA--TRFPRLTFE----M--EERNDFSAAANLMGM-------LINAVKDSE 402 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCEEEec----C--CChhhHHHHHHHhHH-------HHHHHHHHh Confidence 44557788889999984 7888888763221 111244432 1 134566665554431 2222221 2 Q ss_pred CCCCccCCHHHHHHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhc Q lcl|NC_019406. 527 GIIPSTQTLEEFTIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERH 590 (661) Q Consensus 527 gvl~~~~~~Eee~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~ 590 (661) |+ |...+ .....+..+..+-. ...+ -+.......+| +.-....-|. T Consensus 403 ~i-p~~~~------------~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~--~~~~~~r~~~ 448 (448) T protein:vir:77 403 DI-PTELK------------ALIDALPSKMRRAL--GVVD-EVREAVRQPAD--SRYLYTRRRR 448 (448) T ss_pred cC-CccCC------------cCCCCCchhccccc--CCCC-CCCchhhcchh--hHHHHhhhcC Confidence 22 22111 00000000000000 0000 00011112222 1111111111 No 152 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=77.70 E-value=0.12 Score=25.59 Aligned_cols=380 Identities=12% Similarity=0.043 Sum_probs=153.4 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.=.. +-++. ...++..-|+|. ....+|. .+ .|+ ..+..|..+..+. T Consensus 1 M~~f~-------~~~~~--~~~~~~~~~~~~---------~~~~~~~-----~~-~~v---------~~~~al~~~~V~~ 47 (397) T protein:vir:38 1 MPLLK-------LNKSH--SQGFSLNDPDWV---------NFLTGGE-----AQ-KYV---------SADTALKNSDIFS 47 (397) T ss_pred Ccchh-------hhhcc--cCcccCCchhhh---------hhhcCCc-----CC-cee---------chHHhhccHHHHH Confidence 32211 00111 111222222221 1111121 11 121 1123455555555 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) ......+.+.++.|+.. .+.+..|....+ ...+-.+|.+.++...+.+|-| T Consensus 48 ~v~~ia~~ia~~p~~~~------~~~~~~l~~~PN-----------------------~~~s~~~f~~~~~~~lll~Gna 98 (397) T protein:vir:38 48 LIMQLSGDLAMVRYTSE------SDRSQSIISNPS-----------------------VTANGYSFWQGMFAQLLLDGNC 98 (397) T ss_pred HHHHHHHHHhhCccccc------ccHHHHHHhcCC-----------------------CCCCHHHHHHHHHHHhhhcCCE Confidence 55666666666656531 122333333332 3678899999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) ++++.+..... ---+..+.|..|.=+. T Consensus 99 ~~~i~r~~~g~-----~~~l~~l~~~~v~i~~------------------------------------------------ 125 (397) T protein:vir:38 99 YAYRHKNTNGV-----DLSWEYLRPSQVQPML------------------------------------------------ 125 (397) T ss_pred EEEEEECCCCc-----EEEEEEEcCceeEEEE------------------------------------------------ Confidence 99987653211 0122222332210000 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv 320 (661) ..++ ....|++..- ....+... ..| .-..|=| T Consensus 126 ---------------------~~~~---~~~~y~~~~~----------------~~~~~~~~--~~~-----~~eiih~- 157 (397) T protein:vir:38 126 ---------------------LQDG---SGLIYNINFD----------------EPAIGYME--NVP-----AADVIHI- 157 (397) T ss_pred ---------------------cCCC---ceEEEEEEec----------------ccccccee--Eec-----CccEEEe- Confidence 0000 0011111110 00000000 000 0011111 Q ss_pred EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHH-hcCceeEEec---CCCCCCc-------eeEecc--cceee Q lcl|NC_019406. 321 FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFF-TALPTYYAPE---LDDSDAS-------EYHIGP--GRVWV 387 (661) Q Consensus 321 ~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~-~~~P~l~i~G---l~~~~~~-------~l~iGs--~~~~~ 387 (661) . +....+...+.||+..+ ...|.......++...++. .+.|..++.- ++++... ...-|. +..+. T Consensus 158 ~-~~~~~~~~~G~s~i~~~-~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~v 235 (397) T protein:vir:38 158 R-LLSKNGGKTGISPLSAL-INEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVV 235 (397) T ss_pred c-CCCCCCccccccHHHHH-HHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCcee Confidence 1 11222333567777644 4466666666666666555 4677777752 2221110 112222 23445 Q ss_pred cCCCCCcceEeecCchhHHHH-HHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTL-ERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~-~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) ++ .| +.|.+.+-.+.... .+..+-..++++++ | ..+|-. ......+.++... --...|.-++..+++++ T Consensus 236 l~-~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~-~~~~~~~~e~~~~---~~~~~l~P~~~~ie~~l 308 (397) T protein:vir:38 236 ID-AL--EDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNG-QGDQQSSITQISG---QYAKSLNRYVQAIVGEL 308 (397) T ss_pred cC-CC--ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCcccHHHHHHH---HHHHHHHHHHHHHHHHH Confidence 54 33 56666655444433 44555555555543 4 233321 1111212222222 12346677777777777 Q ss_pred HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHh Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMN 543 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~ 543 (661) +.-|- . . ++++-+|...........++-.+++.|.|+..+.++.|-.-++.+.+.-.-+.. T Consensus 309 n~~l~------~-----~----~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~---- 369 (397) T protein:vir:38 309 NDKLH------A-----N----ISANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKE---- 369 (397) T ss_pred HHhcc------C-----h----hcccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccc---- Confidence 65431 1 0 111223332222223566777888999999998887765444433321110000 Q ss_pred ccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 544 DPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 544 ~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) ....... ..+.|......+..+. ..|.+ T Consensus 370 ----~~~~~~~-~~~~~g~~~~~~~~e~----~~~~~ 397 (397) T protein:vir:38 370 ----PQQAIQL-IQQEGGENDGNNSDER----GSDPE 397 (397) T ss_pred ----ccccccc-cccccCCCCCCCCCCC----CCCCC Confidence 0000000 0000111111111111 11111 No 153 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=77.44 E-value=0.12 Score=25.54 Aligned_cols=475 Identities=11% Similarity=0.069 Sum_probs=183.8 Q ss_pred CCccc----c--CHHHHH-------HHHHHHHHHHHhcchHHHHhCCcccCCCC-CCCChHHHHHHHhhhcccchHHHHH Q lcl|NC_019406. 21 FTHLV----V--HPEYEY-------YRPDWAKIRDAIAGEREIKAQGVKYLKAP-KGFDDEDYANYLDRAAFYNMTSQTQ 86 (661) Q Consensus 21 ~~V~~----~--hPey~a-------~~~~W~~irD~~~G~~~vr~~g~~YLPk~-~~E~~~~Y~~rl~rA~~~n~~~~tv 86 (661) ||-.+ . .--|.. ...+|+-|.+.+- |-. +.+++. .+.+ -.|-+.-.+.+ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~l-------------P~~~~~~~~~---~~~~-~~~dstg~~a~ 63 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTL-------------PYLMADVNDD---LSSQ-NAWQDDGASAT 63 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhc-------------cccccCCCCC---cccc-ccccchHHHHH Confidence 55331 0 011222 2335555555443 321 111111 1111 13444445555 Q ss_pred HHHhchhhcc--Ccc---cc-ccch-hhHhhhhcccccccccchhhhhhhHhhhhhc------cCCCCCHHHHHHHHHHH Q lcl|NC_019406. 87 AGMVGQIFRR--PPV---IR-NLPN-TGAITGRDAEGGVQVVAPASIGKLLTQLQRF------AKDGTSHQGFAKTVALE 153 (661) Q Consensus 87 ~~l~G~vFrk--~p~---i~-~~p~-~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~------dl~G~sL~~fa~~~~~~ 153 (661) +.++..+..- ||. +. .+.+ .+..+ ++ -++....++.+|+.| -+.-++.+.-+-.++.+ T Consensus 64 ~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~----~~-----~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~ 134 (517) T protein:vir:10 64 NFLSNKLSQVLFPAQRSFFRIDLTPEGIKQL----DN-----EAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKH 134 (517) T ss_pred HHHHHHHHHhhcCCCCccccccCCHHHHHhh----cc-----CcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHH Confidence 5554433321 111 10 0111 11111 00 011112223333222 23567888888899999 Q ss_pred HHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhh Q lcl|NC_019406. 154 QVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSET 233 (661) Q Consensus 154 ~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~ 233 (661) ...+|-+.+++|-. ..+ +..|+-.+ +-+. .++.+.+.-|+.|+......- T Consensus 135 L~~~G~a~ly~~~~-----~~~----~~~~pl~~---y~v~-~d~~G~v~~ivrr~~~~~~~l----------------- 184 (517) T protein:vir:10 135 LIVTGNVMMYHPDK-----TSP----IQAVPLHH---YCVR-RDNNGTVLDIVFLQEKALETF----------------- 184 (517) T ss_pred HHhHCeEEEEEeCC-----CCc----EEEEEcCe---EEEe-eCCCcCeEEEEeeeeccHHHH----------------- Confidence 99999877776511 111 12222111 2111 122222323333332221100 Q ss_pred hhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCC-- Q lcl|NC_019406. 234 AQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRG-- 311 (661) Q Consensus 234 vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g-- 311 (661) +-.|..... ...+.. .. . .....++|...... .++ .|.|+...++.. .+ ..+| T Consensus 185 ~~~~~~~~~------~~~~~~----~~-~---~~~~v~v~~~v~~~--~~~--~~~~~~~~d~~~--~~-----~~s~y~ 239 (517) T protein:vir:10 185 EPSIRMAIQ------ASRKGK----QY-K---DKDNVKLYTHAKRT--KDG--KYLIRQSADDVP--VG-----KESTVT 239 (517) T ss_pred HHHhhhhcc------hhhhhh----cc-C---CcCceEEEEEEEEe--CCC--ceEEEEEeCcee--ec-----cccccc Confidence 001111000 000000 00 0 11122333332222 222 233333322211 11 1122 Q ss_pred -cccceeeEEEEecCCCCCCcccc---chhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCCCCceeEeccccee Q lcl|NC_019406. 312 -RTLPFIPFVFFGSMSNAADCEKP---PLLDIVELNLKHYRTYAELEHGRFFTALPTYYAP-ELDDSDASEYHIGPGRVW 386 (661) Q Consensus 312 -~~L~~IPfv~~~~~~~~~~~~~p---PLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~~~~~l~iGs~~~~ 386 (661) ..+++||+.|.-..+..+..+ | -|-|+..||.-+ .+-+..+..-...|.++-+ |.... ..+.-|.++.+ T Consensus 240 ~~e~P~~~~Rw~~~~ge~YGrg-p~~~~L~D~k~L~~l~---~~~~~~~~~a~~~~~lv~~~~~~~~--~~l~~~~~g~~ 313 (517) T protein:vir:10 240 EDKSPFLILTWKRSYGEDYGRG-MAEDHAGAFFVIQFLS---EALARGMALMADVKYLVKPGSYTDI--NQFVEGGSGAV 313 (517) T ss_pred cccCCeeeeeeeecCCCCcccc-hHHHhHHHHHHHHHHH---HHHHHHHHHhccCCcccCcccccch--hhccCCCcccc Confidence 346788888876666665544 2 245666666433 3334555555666666543 33222 22444444333 Q ss_pred ecCCCCCcceEeecC-chhHHHHHHHHHHHHHHHHHHh--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFK-GEGLKTLERALNEKEQQIAAIG--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 387 ~lp~~ga~~~ylE~~-g~~i~a~~~~L~~le~qM~~lG--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) .|..-....-++.. +..+....+.|+++++.+...= -.+... .+...|||+...+...-...|.-+-..+.+=+ T Consensus 314 -~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~--~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~El 390 (517) T protein:vir:10 314 -LHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRR--DAERVTAYEIQRDAMLVEQSLGGVYSLFATTF 390 (517) T ss_pred -ccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhcc--CCccccHHHHHHHHHHHHHHhhhHHHHHHHHH Confidence 34322344444432 4457888999999999887531 112211 23457999999999999999988777755542 Q ss_pred -HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhc----CCCCccCCHHHH Q lcl|NC_019406. 464 -TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGGLLPIDALYENFVKN----GIIPSTQTLEEF 538 (661) Q Consensus 464 -~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~----gvl~~~~~~Eee 538 (661) .-++.++-.-++... -...++++ |. .. +.+|..+.....| ..|...+-.- ..+.+..++++. T Consensus 391 l~Pli~r~~~~l~~~l-~~~~v~~~----~~-s~-----la~l~r~~~~~~i--~~~~~~i~~~a~~~~~~~~~id~d~~ 457 (517) T protein:vir:10 391 QGPLARWFMNGISSIL-TSKNVSPT----IL-TG-----IEALGRMAELDKL--GTFNGYVSMTAQWPEPLQQAIKWPDF 457 (517) T ss_pred HHHHHHHHHHHhhhhc-CCCCccce----ee-cc-----HHHHHHHHHHHHH--HHHHHHHHHhhcCChHHHhcCCHHHH Confidence 222222222222111 11122222 22 12 2223333222212 2333333222 223345778888 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCchhHHHhhhhhhhhhhHHHhcCCh Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDEEKLRISAKVGSTSVAASRKLGDP 618 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 618 (661) .+.+.+. +|.|. ..+. . .+|..|. ++ |+..+.|..+++.++++ .. ..-.+++ T Consensus 458 ~~~~a~~---~Gvp~-~~ir-s----~~ev~~~---~~----~~~~~~~~~~~~~~ag~-----------~~-~~~~~~~ 509 (517) T protein:vir:10 458 TDWVQGQ---ISANF-PFFK-T----QDELNAE---AQ----AQQEQEATKYAAEQAGK-----------AI-PDMVKNG 509 (517) T ss_pred HHHHHHH---hCCCh-hhcC-C----HHHHHHH---HH----HHHHHHHHHHHHHHHHH-----------HH-HHHHhCC Confidence 8888775 33331 1111 1 1111111 01 11111111010000000 00 0000111 Q ss_pred hhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCC Q lcl|NC_019406. 619 EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGR 654 (661) Q Consensus 619 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 654 (661) + +-|+- |. T Consensus 510 ~---------------~~~~~-------------~~ 517 (517) T protein:vir:10 510 Q---------------INPQG-------------GQ 517 (517) T ss_pred C---------------CCCCC-------------CC Confidence 0 00000 00 No 154 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=76.12 E-value=0.14 Score=25.28 Aligned_cols=572 Identities=11% Similarity=0.013 Sum_probs=175.5 Q ss_pred CCccccC-HHHHHHHHHHHHHHHHhcchHHHHhCC---cccC----CCCCCCChHHHHHHH---hh-hcccchHHHHHHH Q lcl|NC_019406. 21 FTHLVVH-PEYEYYRPDWAKIRDAIAGEREIKAQG---VKYL----KAPKGFDDEDYANYL---DR-AAFYNMTSQTQAG 88 (661) Q Consensus 21 ~~V~~~h-Pey~a~~~~W~~irD~~~G~~~vr~~g---~~YL----Pk~~~E~~~~Y~~rl---~r-A~~~n~~~~tv~~ 88 (661) |. ..+ -.+..+ +..++.+......+|+.. ..|- =+|+.+..+..+.-| .| .+-+|.++++|+. T Consensus 1 ma--~~~~~~l~~~---~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~ 75 (720) T protein:vir:35 1 MA--ETLQKRHEQI---MRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNR 75 (720) T ss_pred Cc--hHHHHHHHHH---HHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHH Confidence 21 111 011111 112222223333333211 1121 266666555333222 22 3567999999999 Q ss_pred HhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEE--ec Q lcl|NC_019406. 89 MVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALV--DV 166 (661) Q Consensus 89 l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLV--D~ 166 (661) .+|.-=+..|.+.-+|..= +.|. ..+---...|+.+++ =++.+.-..++|..++.+|.+|+=| || T Consensus 76 v~g~~~~nr~d~~v~P~~~-----~~d~---~~Ae~l~~~~~~~~~-----~~~~~~~~s~Af~~~i~~G~G~~~v~~d~ 142 (720) T protein:vir:35 76 IISEYRHNRITVKFRPGDK-----TASE---ALANKLNGLFRADYE-----ETDGGEACDNAFDDGSTGGFGCFRLTTNL 142 (720) T ss_pred HHhHHHhCCCceEEEcCCC-----cchH---HHHHHHHHHHHHHHH-----hcCchHHHhHHHHHhhhccceeEEeeecc Confidence 9999977777775455411 1110 011111223333333 3356666789999999999888755 55 Q ss_pred cCCCchhhcccceeEe----ech-hhh-ccceeeccccccceeeeeeeeeeeeccc--cccccccceeeeechhhhhcch Q lcl|NC_019406. 167 APSSDPTAPAKSYTVG----YAA-ENI-VDWTVEDVDGFYVPTRILLREFERVDEH--ATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 167 P~a~~~~~g~rPY~~~----~~p-~~I-inW~~~~~~g~~~Lt~v~ire~~~~~~~--~~~~~~~~~i~~~~~e~vi~w~ 238 (661) ....+.... ++-+. +.| .+| +||...+.+... =.|+.++.+...++. .|..........+.......|. T Consensus 143 ~~~~d~~~~--~~~i~i~~v~~~~~~v~~Dp~a~~~D~sD-ar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~ 219 (720) T protein:vir:35 143 VNALDPMDE--RQRICLEPIYDPARSVWFDPDAKKYDKSD-AEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWY 219 (720) T ss_pred cccCCCCcc--cceeeEecccCchhheeecccccccChhh-hhhhhhhcCCCHHHHHHhCCCcccccccccccccccccc Confidence 432221111 11111 111 222 333332222110 012222222111000 0110000000000000001111 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccce-----------------------------EEE Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSR-----------------------------VYK 289 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~-----------------------------~~~ 289 (661) .. ..+++.+++... .+. .++.++.....|.. .++ T Consensus 220 ~~--------~~v~i~E~~~~~--------~~~-~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~ 282 (720) T protein:vir:35 220 DV--------DVVYIAKYYEVK--------KES-VDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRR 282 (720) T ss_pred CC--------CceEEEEeeEEE--------EEE-EEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEE Confidence 10 011111111100 000 00111111110110 011 Q ss_pred EEEEecCccc-ccccceeeccCCcccceeeEEEEecCCC----CC---CccccchhHHHHHHHHHHhhhhhHHHHHHHhc Q lcl|NC_019406. 290 QFVYVEDPLG-QARDVYTPMVRGRTLPFIPFVFFGSMSN----AA---DCEKPPLLDIVELNLKHYRTYAELEHGRFFTA 361 (661) Q Consensus 290 ~~~~~~~~~~-~~~~~~~p~~~g~~L~~IPfv~~~~~~~----~~---~~~~pPLldLA~LNl~HYq~sSDl~~il~~~~ 361 (661) ++.+.-++.. -..... -|.+.||+|+|...+. .+ ..-. ++.|.=. .+.++.+. +-+++ +. T Consensus 283 v~~~~~~g~~~l~~~~~------~p~~~fP~vP~~g~r~~~d~~~~~~G~vr-~~kd~Q~-~~N~~~s~--~~~~~--~~ 350 (720) T protein:vir:35 283 VYVSVVDGEGFLEKAQR------IPGEHIPLIPVYGKRWFIDDIERVEGHIA-KAMDAQR-LYNLQVSM--LADSA--TQ 350 (720) T ss_pred EEEEeeccchhcccCCC------CCCCccceEEEEeeeeccCCCcccceeee-cchhHHH-HHHHHHHH--HHHHH--Hc Confidence 1111100000 000000 1234455554432211 11 1111 1111111 11112221 22222 22 Q ss_pred CceeEEecCC-------CCCCcee----------EecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_019406. 362 LPTYYAPELD-------DSDASEY----------HIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAA-IG 423 (661) Q Consensus 362 ~P~l~i~Gl~-------~~~~~~l----------~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~-lG 423 (661) .|...-.|.. ..|...- .++...+.+.+.. .++++.++..-+ ....+-|+.-...|.. .| T Consensus 351 ~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~-~~~~~~~~~~~~-~~~~~llq~~~~~i~~vsG 428 (720) T protein:vir:35 351 DTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPP-TPVGYTQPQPLN-QAMAALLQQTGADIQEVTG 428 (720) T ss_pred CCccccccCcchHHHHHHHhhccccccccccccccccccCcccccCC-CcccccCCCCCc-hHHHHHHHHHHHHHHHHhC Confidence 3322212211 2221110 1222222221111 245565543222 2223333333333332 23 Q ss_pred HH--hcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHHHcCCCC------CCcceEEEEecc- Q lcl|NC_019406. 424 GR--LMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSV----VRYWLMFRDIPL------TDTATLRYEIDA- 490 (661) Q Consensus 424 Ar--ll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~a----L~~~A~w~G~~~------~~~~~~~v~ln~- 490 (661) .. ++-. . ++.|+.+...+..+..-.|..+-.|+..+...+ |.++..|++..- .++..-.+.+|. T Consensus 429 i~~~~lG~--~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~ 505 (720) T protein:vir:35 429 SSQAMQPM--P-SNIAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVV 505 (720) T ss_pred CChHHcCc--c-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechh Confidence 22 3321 1 246888888888888888888888887776554 677777775211 111111111211 Q ss_pred -------------c-------c------ccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCC---ccCCHHHHHHH Q lcl|NC_019406. 491 -------------T-------F------LTTALDARALRAIQQLYEGGLLPIDALYENFVKNGIIP---STQTLEEFTIK 541 (661) Q Consensus 491 -------------D-------F------~~~~lda~~l~all~~~~aG~Is~et~~~eL~r~gvl~---~~~~~Eee~~~ 541 (661) | + .......+.+.++++++.+ ..+...+...+ ..++- +--..+++.++ T Consensus 506 ~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~-~~p~~~~~~~~--~~~ile~~d~p~~~e~~er 582 (720) T protein:vir:35 506 INDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAG-MLPQDPMRQVL--QGIILDNMEGEGLDEFKEY 582 (720) T ss_pred hhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHh-cCCCchhHHHH--HHHHHHhcCchhHHHHHHH Confidence 1 1 1111123345566666542 11111111000 00001 11113455566 Q ss_pred HhccCCCCCCchhhhhhcCCccccCCCcchhhhhc--CChhhHHHHHHHhccCCCchhHHHhh------------hhhhh Q lcl|NC_019406. 542 MNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARD--ADFQQQELEQAERHLEIDEEKLRISA------------KVGST 607 (661) Q Consensus 542 l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e--~d~~q~~~~~~e~~~~~~~~~~~~~~------------~~~~~ 607 (661) +....+.-+.-+ +.+.++.+...+ .-.+|+..+.++.++.+++.+++... .-+++ T Consensus 583 irk~~~~~~~~~-----------~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa 651 (720) T protein:vir:35 583 NRKQLLTQGVVK-----------PRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQT 651 (720) T ss_pred HHhhcchhcccC-----------ccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654332211000 000000000000 00111111111111111111111100 00000 Q ss_pred hh--h-HHHhcCChhhh---hh--hhhhhhHHHHh------hcccccCCCCCCCcccccCCCCccCCC Q lcl|NC_019406. 608 SV--A-ASRKLGDPEQA---KP--SKAEQAQIDAQ------QKQAAAKPVTPTPGTVQRGRPPQNGAS 661 (661) Q Consensus 608 ~~--~-~~~~~~~~~~~---~~--~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 661 (661) .. . +..++-..++. ++ .|+.+.--..| +...++-|..++.++--.-+--+...| T Consensus 652 ~a~~~~a~~~~~~aq~~~~~q~~i~qalq~~~~~q~~q~~~eqa~~el~~~~~~~~~~~~~~~~~~~~ 719 (720) T protein:vir:35 652 EARVAEAKMVQILASADSAKRAEIREALKMLHQFQKEQGDASRADAELILKATDTQHKQNRDAAKNHS 719 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhhcccchhhhhhHHHhhccC Confidence 00 0 00000000000 00 00010000001 011111222333322111111111111 No 155 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=74.89 E-value=0.15 Score=25.05 Aligned_cols=369 Identities=9% Similarity=0.015 Sum_probs=145.9 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |-=+.. ...++.+... ...++ |......+.+. + ..|. |+ + ....++.++.+. T Consensus 1 M~~f~~-----~~~~~~~~~~----~~~~~------~~~~~~~~~~~--~-~~~~-~v------~---~~~~~~~~~v~~ 52 (386) T protein:vir:48 1 MPIFNI-----TNLATESPPI----SQGGF------FDITDPDFLST--L-NGSE-WV------S---AESALRNSDLFS 52 (386) T ss_pred Cccccc-----cccccccccc----ccccc------cccccchhccc--c-cCCc-ee------c---hhhhhcchHHHH Confidence 221111 1000110000 00000 11111100000 0 1111 11 1 112244444444 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .+..+.+.+.++.| .+.. .....|....+ ...+-.+|.+.++...+.+|-+ T Consensus 53 ~i~~ia~~ia~~p~----~~~~--~~~~~l~~~pN-----------------------~~~t~~~f~~~~~~~lll~Gna 103 (386) T protein:vir:48 53 IINQLSNDLATVKL----TASR--KQLQGIIDNPS-----------------------NNANRFNFYQSIFAQMLLGGEA 103 (386) T ss_pred HHHHHHHhhccCce----eecc--chhHHHhhcCC-----------------------CCCCHHHHHHHHHHHhhhcCcE Confidence 44444444444433 3311 11222332222 3678889999999999999999 Q ss_pred EEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++++...... +| -+..+.|..|. +. T Consensus 104 ~~~i~r~~~g------~~~~L~~l~~~~v~-----------------v~------------------------------- 129 (386) T protein:vir:48 104 FAYRWRNENG------RDMKWEYLRPSQVS-----------------FN------------------------------- 129 (386) T ss_pred EEEEEECCCC------cEEEEEEecCceeE-----------------EE------------------------------- Confidence 9998754211 11 11122222110 00 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeE Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPF 319 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPf 319 (661) . ..+ ....+|++..-.. .....+.+. .+ ..|-| T Consensus 130 -------------------~--~~~---~~~~~y~~~~~~~-----~~~~~~~~~-------~~-----------evih~ 162 (386) T protein:vir:48 130 -------------------R--LDN---KDGIYYNITFDDP-----RIPPKQHVP-------QG-----------DVLHF 162 (386) T ss_pred -------------------E--cCC---CceEEEEEEecCc-----cccceeEec-------Cc-----------cEEEe Confidence 0 000 0011122111000 000000000 00 11111 Q ss_pred EEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHH-HHHhcCceeEEec---CCCCCC-------ceeEecccceeec Q lcl|NC_019406. 320 VFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHG-RFFTALPTYYAPE---LDDSDA-------SEYHIGPGRVWVV 388 (661) Q Consensus 320 v~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~i-l~~~~~P~l~i~G---l~~~~~-------~~l~iGs~~~~~l 388 (661) - +...++...+.+|+.-++. .+.......++... +...+.|..++.- ++++.. ....-+++.++.+ T Consensus 163 ~--~~~~~~~~~G~s~i~~~~~-~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl 239 (386) T protein:vir:48 163 K--LLSVDGGLTSVSPLMALSR-ELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVL 239 (386) T ss_pred c--CCCCCCceeeccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceec Confidence 1 1122233356788765443 44444444444444 3445678877752 111110 0112233445555 Q ss_pred CCCCCcceEeecCchhHHH-HHHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_019406. 389 DKESGIPGIIEFKGEGLKT-LERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMT 464 (661) Q Consensus 389 p~~ga~~~ylE~~g~~i~a-~~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~ 464 (661) + +|. +|...+-.+-.+ ..+..+-..++++.+ | ..++- .+ +...+.++....+ -...|.-++..++++++ T Consensus 240 ~-~g~--~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~-~~~~~~e~~~~~~--~~~~l~P~~~~ie~~l~ 312 (386) T protein:vir:48 240 D-DLE--EFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVG-GQ-GDQQSSLEMSLDL--YNKAVSRYLRPFLSELS 312 (386) T ss_pred C-CCc--eEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CC-CCcccHHHHHHHH--HHHHHHHHHHHHHHHHH Confidence 5 344 454444333332 244445555555543 4 33331 11 1122344444443 34567778888888887 Q ss_pred HHHHHHHHHcCCCCCCcceEEEEeccccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHh Q lcl|NC_019406. 465 SVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMN 543 (661) Q Consensus 465 ~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~ 543 (661) +.|-- . +.+.+...|. .+.. ....+-+++.+|.++.-+.++.|-+.++.+.+...-+ .+. T Consensus 313 ~~l~~---~----------~~~~~~~~~~---~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~---~~~ 373 (386) T protein:vir:48 313 QKLSC---D----------VDADILPAVD---PTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGE---NPN 373 (386) T ss_pred Hhhcc---h----------hhcchhhhhc---cChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhc---CCC Confidence 76521 1 1111111111 1222 3345567788999999999998888888765432110 000 Q ss_pred ccCCCCCCchhhh Q lcl|NC_019406. 544 DPKSFIGQPDAIA 556 (661) Q Consensus 544 ~~~~~l~~ddae~ 556 (661) .....-|.++++. T Consensus 374 ~~~~~gGd~~~~~ 386 (386) T protein:vir:48 374 KTTLKGGEINGED 386 (386) T ss_pred CCccCCCCCCCCC Confidence 0000011111111 No 156 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=73.64 E-value=0.17 Score=24.83 Aligned_cols=484 Identities=12% Similarity=0.022 Sum_probs=171.7 Q ss_pred CCCCCC------------ccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHH Q lcl|NC_019406. 1 MAGLSP------------NSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDED 68 (661) Q Consensus 1 ~~~~~~------------~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~ 68 (661) -++-.+ .|.+|.-...++-+..-+-+.++|..- .++... .+.-|-+++...+-.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~---------~~~~~~----~~~g~~~~~~~~~~~~ 75 (547) T protein:vir:63 9 LAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQP---------VIGSMS----ANPGFKTKPSIRNNQD 75 (547) T ss_pred hhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhch---------hhheee----cccccccCCccCChhH Confidence 111000 011221111222222223344444221 111111 1223556665555555 Q ss_pred HHHHHhhhcccchHHHHHHHHhchhhc--cCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCC----CC Q lcl|NC_019406. 69 YANYLDRAAFYNMTSQTQAGMVGQIFR--RPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDG----TS 142 (661) Q Consensus 69 Y~~rl~rA~~~n~~~~tv~~l~G~vFr--k~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G----~s 142 (661) .++.++...-.++.+.+|+.++-.|.. .+.....--..++.-..+.+...+..-..-++.+..++.+..... .+ T Consensus 76 l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s 155 (547) T protein:vir:63 76 LHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDS 155 (547) T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccch Confidence 555554444456777777666544432 111000000000111112222222222233345566666655443 26 Q ss_pred HHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccce-eEeechhhhccceeeccccccceeeeeeeeeeeecccccccc Q lcl|NC_019406. 143 HQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSY-TVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQ 221 (661) Q Consensus 143 L~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY-~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~ 221 (661) ...|++.++...|.+|-+++.+-..... +|. +..+.|..|. +. T Consensus 156 ~~~f~~~lv~d~ll~Gn~~~~i~rd~~G------~~~~L~~l~p~~V~-----------------~~------------- 199 (547) T protein:vir:63 156 FSSFVKKIVRDTYMYDQVNFEKVFNRNQ------SMVRFVAKDPTTIF-----------------FA------------- 199 (547) T ss_pred HHHHHHHHHHHHHhhCCEEEEEEECCCC------cEEEEEEecCceeE-----------------EE------------- Confidence 7789999999999999999988754321 111 1112221110 00 Q ss_pred ccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc Q lcl|NC_019406. 222 QNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA 301 (661) Q Consensus 222 ~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~ 301 (661) ...++......++-+.. .++..++. + . T Consensus 200 ---------------------------------------~~~~g~~~~~~~~y~~~----~~~~~~~~---~-------~ 226 (547) T protein:vir:63 200 ---------------------------------------TTADGKIPDNGNRFVQV----IDQKIVAT---F-------N 226 (547) T ss_pred ---------------------------------------ECCccccccCceEEEEE----cCCcEEEE---e-------c Confidence 00000000000000000 00000000 0 0 Q ss_pred ccceeeccCCcccceeeEEEEe-cCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHHHh-cCceeEE--ec---CCCCC Q lcl|NC_019406. 302 RDVYTPMVRGRTLPFIPFVFFG-SMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRFFT-ALPTYYA--PE---LDDSD 374 (661) Q Consensus 302 ~~~~~p~~~g~~L~~IPfv~~~-~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~~~-~~P~l~i--~G---l~~~~ 374 (661) .+++ |=|.... ........+.|||..++ ..|........+...++.- ++|-.++ .| ++++. T Consensus 227 ~~ei-----------ih~r~n~~~~~~~~~~G~Spi~~~~-~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~ 294 (547) T protein:vir:63 227 AREM-----------AFAVRNPRSDIYATGYGYPELEIAL-KQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHA 294 (547) T ss_pred cccE-----------EEecccCCCCcccccccccHHHHHH-HHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHH Confidence 0001 1000000 01111224778876544 4555555556666666654 4677554 44 22221 Q ss_pred Cc-------eeEecccce---eecCCCCCcceEeecCchhHHHH-HHHHHHHHHHHHHH-h--HHhcccccCc-----cc Q lcl|NC_019406. 375 AS-------EYHIGPGRV---WVVDKESGIPGIIEFKGEGLKTL-ERALNEKEQQIAAI-G--GRLMPGMSKS-----VS 435 (661) Q Consensus 375 ~~-------~l~iGs~~~---~~lp~~ga~~~ylE~~g~~i~a~-~~~L~~le~qM~~l-G--Arll~~~~~~-----~~ 435 (661) .+ ...-|++++ .+++. ..+.|...+..+-.+. .+..+-..++++++ | ..+|--..++ .. T Consensus 295 ~~~lk~~~~~~~~G~~nagk~~vl~~--~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~ 372 (547) T protein:vir:63 295 LEIFKREWKNSLSGINGSWQIPVVSA--EDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKG 372 (547) T ss_pred HHHHHHHHHHHhcCcccccccccccC--CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccc Confidence 11 122354443 23432 3467777665554433 23333333344322 2 1111000000 01 Q ss_pred hhHHHHHHHH---HHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHHHHHHHHHHhcC Q lcl|NC_019406. 436 ESDNQSALRE---ANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARALRAIQQLYEGG 512 (661) Q Consensus 436 eTataa~~d~---~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~l~all~~~~aG 512 (661) .+.+.+.++. .-....|.-++..++++|+..|- -. .| ..+.|+++. . ...+..+...+..++.+| T Consensus 373 ~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~--~~-~~------~~~~~~f~~--~-~~~~~~~~~~~~~~~~~g 440 (547) T protein:vir:63 373 GSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIV--AE-FG------DKYTFQFVG--G-DIKSELESVKILAEKAKV 440 (547) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc-cC------CceEEEeec--c-ccccHHHHHHHHHHHhCC Confidence 1222222332 23456788888888888887652 11 12 235555542 1 122344555677888899 Q ss_pred CCCHHHHHHHHHhcCCCCccCCHHH-HHHHHhccCCCCC------CchhhhhhcCCccccCCCcchhhhhcCChhhHHHH Q lcl|NC_019406. 513 LLPIDALYENFVKNGIIPSTQTLEE-FTIKMNDPKSFIG------QPDAIAMRRGYVSRQQELDQQRAARDADFQQQELE 585 (661) Q Consensus 513 ~Is~et~~~eL~r~gvl~~~~~~Ee-e~~~l~~~~~~l~------~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~ 585 (661) .|+.-+.++.+ |+-| ....-+ ...-+ -...++ .++.+..+.......+ +.......|.+++- - T Consensus 441 ~lT~NE~R~~~---gl~P-~~egGD~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-~ 510 (547) T protein:vir:63 441 AMTVNEVRKEL---NLPG-DVIGGDIPLNGV--IVQRIGQLMQQEQFEHEKQQSNLQMLQE---QTGNRVSTDVEDIP-D 510 (547) T ss_pred CcCHHHHHHHh---CCCC-CCCCCceeeccc--ccccccccccccCCccccchhhcccccc---ccCCCCCCCCCCCC-C Confidence 99988887644 4322 111111 00000 000000 0000000000000000 00000000100000 0 Q ss_pred HHHhccCCCchhHHHhhhhhhhhhhHHHhcCChhhhhhhhhhh Q lcl|NC_019406. 586 QAERHLEIDEEKLRISAKVGSTSVAASRKLGDPEQAKPSKAEQ 628 (661) Q Consensus 586 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (661) .++.+.....+... +.. +-+++ ..-.++-. |++--+- T Consensus 511 ~~~~~~~~~~d~~~-~~~--~~~~~-~~~~~~~~--~~~~~~~ 547 (547) T protein:vir:63 511 GKDTTGDIGKDGQR-KDK--DNANA-GKQGMKGD--KPNDWQT 547 (547) T ss_pred CcccCCCcCccccc-cCc--cccch-hhhhcCCC--CccccCC Confidence 00011111111110 000 00111 11111111 1111100 No 157 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=51.91 E-value=0.57 Score=21.93 Aligned_cols=485 Identities=15% Similarity=0.080 Sum_probs=178.5 Q ss_pred CCcc-----ccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChH---HHHHHH-hhh--------cccchHH Q lcl|NC_019406. 21 FTHL-----VVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDE---DYANYL-DRA--------AFYNMTS 83 (661) Q Consensus 21 ~~V~-----~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~---~Y~~rl-~rA--------~~~n~~~ 83 (661) |++- ...|.....+.........|.|...-+ .....|.. .-.+. .+..+| .|| .--+++. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r--~~~~~~~~-~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~ 77 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGR--THKAKRQP-LGADTSLQKSAVSMREQCRKLDEDHDLVTGLLD 77 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccc--cccccCCC-CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 5533 234555555555555556676654322 22223322 11111 112111 122 1123344 Q ss_pred HHHHHHhchhhc-cCccccccchh-hHhhhhcccccccccchhhh-hhhHhhhhhccCCCC-CHHHHHHHHHHHHHhhCC Q lcl|NC_019406. 84 QTQAGMVGQIFR-RPPVIRNLPNT-GAITGRDAEGGVQVVAPASI-GKLLTQLQRFAKDGT-SHQGFAKTVALEQVAMGR 159 (661) Q Consensus 84 ~tv~~l~G~vFr-k~p~i~~~p~~-l~~l~~d~dG~~~~~~~~~~-~~~~~~~~~~dl~G~-sL~~fa~~~~~~~L~~Gr 159 (661) ..++..||-.+- -.|+.-...+. -+.|. ..+ ..+..++++||-+|. +++++.+.+++..+..|= T Consensus 78 ~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~------------~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE 145 (548) T protein:vir:95 78 RLEERVVGGSGIGVEPLPLRLDGSVHAELA------------MEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGE 145 (548) T ss_pred HHHHhccCccccceeeeecCCCHHHHHHHH------------HHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCc Confidence 444556663221 11111000000 01111 112 235678999999986 699999999999999999 Q ss_pred EEEEEeccCCCchhh-cccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcc Q lcl|NC_019406. 160 FGALVDVAPSSDPTA-PAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRT 237 (661) Q Consensus 160 ~gvLVD~P~a~~~~~-g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w 237 (661) |++..-+-+...... ..-| -+-+|.|+.|= ...... ..+| T Consensus 146 ~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~-------------------------~~~~~~--~~~i----------- 187 (548) T protein:vir:95 146 GLAQKLMGRVPNYTFATSVPFALELLEPDYLP-------------------------FSYNNL--SKGI----------- 187 (548) T ss_pred eEEEeeecccccccCCcccceEEEEechhhcC-------------------------CCCCCC--CCce----------- Confidence 988776543322111 1112 12334444321 000000 0000 Q ss_pred hhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccccee Q lcl|NC_019406. 238 SGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFI 317 (661) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~I 317 (661) +.|+ +. +.+.-..-|+++.-.+ +..... .....+..| T Consensus 188 ----~~GI------------E~-----D~~Grp~aY~i~~~hP---------------gd~~~~-------~~~~~~~rv 224 (548) T protein:vir:95 188 ----VQGI------------ER-----DTWRRKRAYHLLKDHP---------------GNLQTL-------GGSLAVKRV 224 (548) T ss_pred ----eeee------------EE-----CCCCceEEEEEeecCC---------------Cccccc-------ccccceeee Confidence 0111 10 1111112222221111 110000 001112233 Q ss_pred eE--EE--EecCCCCCCccccchhHHHH--HHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCC---------CCceeEec Q lcl|NC_019406. 318 PF--VF--FGSMSNAADCEKPPLLDIVE--LNLKHYRTYAELEHGRFFTALPTYYAP-ELDDS---------DASEYHIG 381 (661) Q Consensus 318 Pf--v~--~~~~~~~~~~~~pPLldLA~--LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~---------~~~~l~iG 381 (661) |- |. +...+.+-.-+.|.|...-. ..+..|... .+..+.--+++ ..||. +..+. ....+.++ T Consensus 225 pA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~da-el~~aki~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 302 (548) T protein:vir:95 225 EAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEES-ERVAARISAAL-AMYIKKGNPDSYTVEPGKDRKNRTIPIA 302 (548) T ss_pred chhHheecccccCCccccCcchHHHHHHHHHHHhHHHHH-HHHHHHHhhhh-eeeeecCCCccccCCCCccccccccccc Confidence 33 11 12233333345565543211 122233322 23333333334 44443 22111 12236788 Q ss_pred cccee-ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH-HHh--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHH Q lcl|NC_019406. 382 PGRVW-VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA-AIG--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIM 457 (661) Q Consensus 382 s~~~~-~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~-~lG--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~ 457 (661) ++..+ .|+ +|-+++|+.++- +.......++.+...|. .+| ..+|..--...=.|+-+.-+++-.....++.+. T Consensus 303 pG~iv~~L~-pGe~i~~~~p~~-p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~- 379 (548) T protein:vir:95 303 PGMVFDDLE-PGEDVGMIESNR-PNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEF- 379 (548) T ss_pred CCccccccC-CCceeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHH- Confidence 88765 455 477899988652 22223333333333222 112 122321111111233344444444444444321 Q ss_pred HHHHHHHHHHHHH---HHHcCC-CCCCcceEEEEecccccccc---CCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCC Q lcl|NC_019406. 458 ALEDGMTSVVRYW---LMFRDI-PLTDTATLRYEIDATFLTTA---LDAR-ALRAIQQLYEGGLLPIDALYENFVKNGII 529 (661) Q Consensus 458 ~le~Al~~aL~~~---A~w~G~-~~~~~~~~~v~ln~DF~~~~---lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl 529 (661) +....+-+.++| |...|. +.++.......++-+|.... +|+. ++++.+.++.+|..|.+.... ++|. T Consensus 380 -i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a---~~G~- 454 (548) T protein:vir:95 380 -IDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVAR---ARGR- 454 (548) T ss_pred -HHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH---HhCC- Confidence 122222233322 122231 11111111112233444333 4664 899999999999999987754 4564 Q ss_pred CccCCHHHHHHHHhccCC---CCCCchhhhhhcCCccccCCCcchhhhhc-CChhhHHHHHHHhccCCCchhHHHhhhhh Q lcl|NC_019406. 530 PSTQTLEEFTIKMNDPKS---FIGQPDAIAMRRGYVSRQQELDQQRAARD-ADFQQQELEQAERHLEIDEEKLRISAKVG 605 (661) Q Consensus 530 ~~~~~~Eee~~~l~~~~~---~l~~ddae~~~~g~~~~~~~~~q~~~~~e-~d~~q~~~~~~e~~~~~~~~~~~~~~~~~ 605 (661) +++++.+.++.+.. .+|+.- -..+..++..+ +|. . ++.+.+ T Consensus 455 ----D~~ev~~q~a~E~~~~~~~GL~~------------~~~~~~~~~~~~~~~---~----------~~~~~~------ 499 (548) T protein:vir:95 455 ----DPRELKKSRETEIKANRAAGLVF------------SSDAYHQLVKSGMDP---V----------EAVQKV------ 499 (548) T ss_pred ----CHHHHHHHHHHHHHHHHHcCCCC------------CCcccccccccccCC---C----------Cchhhh------ Confidence 56665555544311 112110 00011111111 110 0 000000 Q ss_pred hhhhhHHHhcCChhhhhhhhhhhhHHHHhhcccccCCCCCCCcccccCCCCccCCC Q lcl|NC_019406. 606 STSVAASRKLGDPEQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTVQRGRPPQNGAS 661 (661) Q Consensus 606 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 661 (661) -.-.+..+-||. +.-+--+- -|- -|.||.--....-..|+. T Consensus 500 -~~~~~~~~~~~~----------~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~ 540 (548) T protein:vir:95 500 -YLGVGKMLTADE----------ARELVNRY-GAG---LPVPGPDFPNESNNGGAD 540 (548) T ss_pred -ccccccccccch----------hHHhhccC-CCC---CcCCCCCCCcccccCCCC Confidence 000001111222 00000011 111 122222111111111111 No 158 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=42.95 E-value=0.86 Score=20.93 Aligned_cols=373 Identities=11% Similarity=-0.023 Sum_probs=152.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.|+- | .|++...+.+...+....+. ..+.-+...+.|. .|..|-+ +..++ .+ T Consensus 2 ~m~~f-~--~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~-----~~~~v~~----------~~al~----~~ 54 (392) T protein:vir:39 2 ILPIL-N--FINQTNDPPEVGSVQSYFPD-----GNDAQIMESLLGD-----NNEWVSA----------RAALR----NS 54 (392) T ss_pred cchhh-h--hhhccccccccccccccccc-----CchhhhhhhhcCC-----CCceech----------HHhhc----cH Confidence 33443 1 24443333333322222111 0011111111111 1111100 11122 23 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .....|+.+++.|-.-|..+..- ....|.+..+ ...+-.+|.+.++...+.+|-+ T Consensus 55 ~v~~~i~~ia~~ia~lp~~~~~~--~~~~l~~~PN-----------------------~~~t~~~f~~~~~~~lll~Gna 109 (392) T protein:vir:39 55 DLFSIILQLSSDLAIVKINAEKK--KNQGIIDNPS-----------------------TNANKHGFWQSMFAQLLLGGEA 109 (392) T ss_pred HHHHHHHHHHHhhccCceeeccc--hhhhHhhcCC-----------------------CCCCHHHHHHHHHHHhhhcCcE Confidence 44445555555554444444211 1122222222 3678889999999999999999 Q ss_pred EEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++++...... +| -+..+.|..|. + T Consensus 110 ~~~i~r~~~g------~~~~L~~l~~~~v~-----------------~-------------------------------- 134 (392) T protein:vir:39 110 FAYRWRNANG------ADMKWEYLRPSQVN-----------------T-------------------------------- 134 (392) T ss_pred EEEEEECCCC------cEEEEEEEcCceeE-----------------E-------------------------------- Confidence 9998754321 11 11111121110 0 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeE Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPF 319 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPf 319 (661) . ...+ ....+|++.. ..... +.. ...+ -+.|= T Consensus 135 ------------------~--~~~~---~~~~~y~~~~---------------~~~~~-~~~--~~~~------~~eii- 166 (392) T protein:vir:39 135 ------------------Y--YFEY---ENGMYYNITF---------------DDPKI-EPI--LQAP------QSDLI- 166 (392) T ss_pred ------------------E--EcCC---CceEEEEEEe---------------cCccc-cee--EEEc------cccEE- Confidence 0 0000 0011122111 00000 000 0000 01110 Q ss_pred EEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHH-HHHhcCceeEEe--c-CCCCCC------cee--Eecccceee Q lcl|NC_019406. 320 VFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHG-RFFTALPTYYAP--E-LDDSDA------SEY--HIGPGRVWV 387 (661) Q Consensus 320 v~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~i-l~~~~~P~l~i~--G-l~~~~~------~~l--~iGs~~~~~ 387 (661) .+-+....+...+.+|+..+... +..-....++... ....+.|-.+++ | ..+.+. ..+ .-.++.++. T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~v 245 (392) T protein:vir:39 167 HMKLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVV 245 (392) T ss_pred EecCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeee Confidence 01122223444678887654442 3333333333333 344567776654 2 111110 001 123334556 Q ss_pred cCCCCCcceEeecCchhHHHH-HHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTL-ERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~-~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) ++ +| .+|..++-.+-.+. .+..+-..++++++ | ..++ ... ....+..+... +--...|.-++..+++++ T Consensus 246 l~-~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~-~~~~~~~~~~~--~f~~~~l~P~~~~ie~~l 318 (392) T protein:vir:39 246 LD-DL--EEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQ-GDQQSSIQQIS--GMYASALNRYLRPAISEL 318 (392) T ss_pred cC-CC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC-CCcccHHHHHH--HHHHHHHHHHHHHHHHHH Confidence 65 34 45555544333222 34445455555543 4 3333 111 11112222211 123456777777888877 Q ss_pred HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHH Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDA-RALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKM 542 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l 542 (661) +..|- . .+.+.+..-|. .|. .....+-.++.+|.+++...++.|.+.|+.+++.- ++ T Consensus 319 ~~~L~---~----------~~~~d~~~~~~---~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r------~~ 376 (392) T protein:vir:39 319 EYKLS---D----------HISVNMRPAID---PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP------AP 376 (392) T ss_pred HHhcc---c----------cccccchhhhc---cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc------hh Confidence 76551 1 12222222111 122 23556778889999999999999999999875542 12 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCc Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELD 569 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~ 569 (661) ++ .|.+ +.|.. .++.| T Consensus 377 e~-l~~~--------~~Gd~--~~p~p 392 (392) T protein:vir:39 377 EN-TNKK--------TTGQS--NEPVP 392 (392) T ss_pred cC-CCCC--------CCCCC--CCCCC Confidence 22 2221 22311 22223 No 159 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=42.95 E-value=0.86 Score=20.93 Aligned_cols=373 Identities=11% Similarity=-0.023 Sum_probs=152.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.|+- | .|++...+.+...+....+. ..+.-+...+.|. .|..|-+ +..++ .+ T Consensus 2 ~m~~f-~--~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~-----~~~~v~~----------~~al~----~~ 54 (392) T protein:vir:10 2 ILPIL-N--FINQTNDPPEVGSVQSYFPD-----GNDAQIMESLLGD-----NNEWVSA----------RAALR----NS 54 (392) T ss_pred cchhh-h--hhhccccccccccccccccc-----CchhhhhhhhcCC-----CCceech----------HHhhc----cH Confidence 33443 1 24443333333322222111 0011111111111 1111100 11122 23 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .....|+.+++.|-.-|..+..- ....|.+..+ ...+-.+|.+.++...+.+|-+ T Consensus 55 ~v~~~i~~ia~~ia~lp~~~~~~--~~~~l~~~PN-----------------------~~~t~~~f~~~~~~~lll~Gna 109 (392) T protein:vir:10 55 DLFSIILQLSSDLAIVKINAEKK--KNQGIIDNPS-----------------------TNANKHGFWQSMFAQLLLGGEA 109 (392) T ss_pred HHHHHHHHHHHhhccCceeeccc--hhhhHhhcCC-----------------------CCCCHHHHHHHHHHHhhhcCcE Confidence 44445555555554444444211 1122222222 3678889999999999999999 Q ss_pred EEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++++...... +| -+..+.|..|. + T Consensus 110 ~~~i~r~~~g------~~~~L~~l~~~~v~-----------------~-------------------------------- 134 (392) T protein:vir:10 110 FAYRWRNANG------ADMKWEYLRPSQVN-----------------T-------------------------------- 134 (392) T ss_pred EEEEEECCCC------cEEEEEEEcCceeE-----------------E-------------------------------- Confidence 9998754321 11 11111121110 0 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeE Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPF 319 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPf 319 (661) . ...+ ....+|++.. ..... +.. ...+ -+.|= T Consensus 135 ------------------~--~~~~---~~~~~y~~~~---------------~~~~~-~~~--~~~~------~~eii- 166 (392) T protein:vir:10 135 ------------------Y--YFEY---ENGMYYNITF---------------DDPKI-EPI--LQAP------QSDLI- 166 (392) T ss_pred ------------------E--EcCC---CceEEEEEEe---------------cCccc-cee--EEEc------cccEE- Confidence 0 0000 0011122111 00000 000 0000 01110 Q ss_pred EEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHH-HHHhcCceeEEe--c-CCCCCC------cee--Eecccceee Q lcl|NC_019406. 320 VFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHG-RFFTALPTYYAP--E-LDDSDA------SEY--HIGPGRVWV 387 (661) Q Consensus 320 v~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~i-l~~~~~P~l~i~--G-l~~~~~------~~l--~iGs~~~~~ 387 (661) .+-+....+...+.+|+..+... +..-....++... ....+.|-.+++ | ..+.+. ..+ .-.++.++. T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~v 245 (392) T protein:vir:10 167 HMKLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVV 245 (392) T ss_pred EecCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeee Confidence 01122223444678887654442 3333333333333 344567776654 2 111110 001 123334556 Q ss_pred cCCCCCcceEeecCchhHHHH-HHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 388 VDKESGIPGIIEFKGEGLKTL-ERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 388 lp~~ga~~~ylE~~g~~i~a~-~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) ++ +| .+|..++-.+-.+. .+..+-..++++++ | ..++ ... ....+..+... +--...|.-++..+++++ T Consensus 246 l~-~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~-~~~~~~~~~~~--~f~~~~l~P~~~~ie~~l 318 (392) T protein:vir:10 246 LD-DL--EEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQ-GDQQSSIQQIS--GMYASALNRYLRPAISEL 318 (392) T ss_pred cC-CC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC-CCcccHHHHHH--HHHHHHHHHHHHHHHHHH Confidence 65 34 45555544333222 34445455555543 4 3333 111 11112222211 123456777777888877 Q ss_pred HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHH Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDA-RALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKM 542 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l 542 (661) +..|- . .+.+.+..-|. .|. .....+-.++.+|.+++...++.|.+.|+.+++.- ++ T Consensus 319 ~~~L~---~----------~~~~d~~~~~~---~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r------~~ 376 (392) T protein:vir:10 319 EYKLS---D----------HISVNMRPAID---PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP------AP 376 (392) T ss_pred HHhcc---c----------cccccchhhhc---cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccc------hh Confidence 76551 1 12222222111 122 23556778889999999999999999999875542 12 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCc Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELD 569 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~ 569 (661) ++ .|.+ +.|.. .++.| T Consensus 377 e~-l~~~--------~~Gd~--~~p~p 392 (392) T protein:vir:10 377 EN-TNKK--------TTGQS--NEPVP 392 (392) T ss_pred cC-CCCC--------CCCCC--CCCCC Confidence 22 2221 22311 22223 No 160 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=42.87 E-value=0.87 Score=20.92 Aligned_cols=455 Identities=10% Similarity=0.034 Sum_probs=166.8 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChH--HHHHH-Hhhh- Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDE--DYANY-LDRA- 76 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~--~Y~~r-l~rA- 76 (661) .+..+|..+ ..+. .+.. --|.|..........|-|..-.-+.+ .+..+ ..|| T Consensus 9 ~~~~a~~~~-~~~~---------------------~~~~--~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaR 64 (553) T protein:vir:63 9 LSEVTSGRP-EQSA---------------------SLGG--GGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGR 64 (553) T ss_pred hcccccccc-hhhh---------------------hhhc--ccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHH Confidence 222222211 0000 0000 01222111111111122221110000 01111 1111 Q ss_pred -------cccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhh-hhhHhhhhh----ccCCCC-CH Q lcl|NC_019406. 77 -------AFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASI-GKLLTQLQR----FAKDGT-SH 143 (661) Q Consensus 77 -------~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~-~~~~~~~~~----~dl~G~-sL 143 (661) +--.++...++..||-=|+--+++ +.- .+..+++..-...-..+ +.|..++++ ||-+|. ++ T Consensus 65 dL~rNn~~a~~av~~~~~nvVG~Gi~~~~~~----~~~--~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f 138 (553) T protein:vir:63 65 DMADNDGFTNGAVGYQRDSIVGAQYRLNSMP----DIN--VIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTF 138 (553) T ss_pred HHHhcChHHHHHHHHHHHhhccCCceeeecc----chh--hhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCH Confidence 111233444444555434422221 100 00011111001111122 234566664 566676 89 Q ss_pred HHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccc Q lcl|NC_019406. 144 QGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQ 222 (661) Q Consensus 144 ~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~ 222 (661) +.+.+.+++..+..|=|++..-+-+..+. .-| -+-+|.|+.|-+..-...++ .|+.-+.. T Consensus 139 ~~~q~l~~r~~~~dGE~~~~~~~~~~~~~---~~~~~lq~ie~drl~~~~~~~~~~-------~i~~GVE~--------- 199 (553) T protein:vir:63 139 TGLIRLGVVGYVKTGEVLATAEWDRAANR---PYATCFQMVSTDRLSNPYQQLDTP-------TLRRGVQY--------- 199 (553) T ss_pred HHHHHHHHHHHHhCCceEEEeeeccCCCC---cccceEEEechhhcCCCCCCCCCC-------eeEeeeEE--------- Confidence 99999999999999999998776543211 111 23556666554332111110 01111110 Q ss_pred cceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccc Q lcl|NC_019406. 223 NPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQAR 302 (661) Q Consensus 223 ~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~ 302 (661) +.+.-..-|+++.-.+|. ... T Consensus 200 ------------------------------------------d~~Gr~vaY~i~~~hPgd---------------~~~-- 220 (553) T protein:vir:63 200 ------------------------------------------DKRGRPQGYWIQVAHPGD---------------LYQ-- 220 (553) T ss_pred ------------------------------------------CCCCceEEEEeeccCCCc---------------ccc-- Confidence 011111112222111110 000 Q ss_pred cceeeccCCcccceee------E--E--EEecCCCCCCccccchhHHHHHHHHHHhhh--hhHHHHHHHhcCceeEEecC Q lcl|NC_019406. 303 DVYTPMVRGRTLPFIP------F--V--FFGSMSNAADCEKPPLLDIVELNLKHYRTY--AELEHGRFFTALPTYYAPEL 370 (661) Q Consensus 303 ~~~~p~~~g~~L~~IP------f--v--~~~~~~~~~~~~~pPLldLA~LNl~HYq~s--SDl~~il~~~~~P~l~i~Gl 370 (661) ....+..+..|| - | .+-..+.+-.-+.|.|...- ..++++..- |.+....--+++...+-++. T Consensus 221 ----~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl-~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~ 295 (553) T protein:vir:63 221 ----MAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGL-KDMRMAKRFKEMSLQNAVINASYAAAIESEL 295 (553) T ss_pred ----ccccccceeeeccccccChhHheecccccCCCcccCCchHHHHH-HHHHHHhHHHHHHHHHHHHhhhheeeeecCC Confidence 000001111111 0 1 11223333344555554321 222222222 23444444444433333322 Q ss_pred CCC-------------------------------CCceeEecccceeecCCCCCcceEeecC--chhHHHHHHHHHHHHH Q lcl|NC_019406. 371 DDS-------------------------------DASEYHIGPGRVWVVDKESGIPGIIEFK--GEGLKTLERALNEKEQ 417 (661) Q Consensus 371 ~~~-------------------------------~~~~l~iGs~~~~~lp~~ga~~~ylE~~--g~~i~a~~~~L~~le~ 417 (661) +.+ ....+.++++.+..|++ |-+++|+.++ +..+..... .+.. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p-Ge~i~~~~p~~p~~~~~~F~~---~~lr 371 (553) T protein:vir:63 296 PPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFP-GTKLNLKPMGTPGGVGSEFEA---SLNR 371 (553) T ss_pred ChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCC-CCeeeecCCCCCCCCHHHHHH---HHHH Confidence 110 01126788998888884 7899999876 233343333 3333 Q ss_pred HHH-HHh--HHhcccc-cCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH---HHHcCC-CCCCcceEEEE-- Q lcl|NC_019406. 418 QIA-AIG--GRLMPGM-SKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYW---LMFRDI-PLTDTATLRYE-- 487 (661) Q Consensus 418 qM~-~lG--Arll~~~-~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~---A~w~G~-~~~~~~~~~v~-- 487 (661) .|. .+| ..+|..- ++..=.|+-+.-+++-.....++.+. +....+-+.++| |-..|. +..+.. ..+. T Consensus 372 ~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~--~~~~~~pi~~~wl~~a~l~G~i~~p~~~-~~~~~~ 448 (553) T protein:vir:63 372 HLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMC--ADRLATEFFTLWLEEAIAAGEVPMPPGQ-TRDLFY 448 (553) T ss_pred HHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHcCCccCCCcc-cchhhc Confidence 332 112 1223221 11111245555566666666555421 122222233322 223332 111110 0000 Q ss_pred --------eccccccc---cCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC-----CCC Q lcl|NC_019406. 488 --------IDATFLTT---ALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS-----FIG 550 (661) Q Consensus 488 --------ln~DF~~~---~lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~-----~l~ 550 (661) ++-+|... -+||. ++++.+.++.+|..|++...+ ++|. +|+++.+.++.+.. +|. T Consensus 449 ~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a---~~G~-----D~~~v~~q~a~e~~~~~~~Gl~ 520 (553) T protein:vir:63 449 QPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIA---RLGG-----DFRKSFAQRAREDALLKKYGLT 520 (553) T ss_pred chhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH---HhCC-----CHHHHHHHHHHHHHHHHHcCCC Confidence 00122222 23664 899999999999999988744 4464 55555554443311 222 Q ss_pred CchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHH Q lcl|NC_019406. 551 QPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAE 588 (661) Q Consensus 551 ~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e 588 (661) .|..-....+....++..+.... -..+..|.+| T Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~e 553 (553) T protein:vir:63 521 FNLSAKRSLGDGRDAATGIAEDP-----AAAQTSQQGE 553 (553) T ss_pred CCCCCccccCCCcccCCCCCCCC-----CCCCcccccC Confidence 22111111121122211111111 1111222222 No 161 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=39.83 E-value=1 Score=20.59 Aligned_cols=542 Identities=12% Similarity=0.008 Sum_probs=187.9 Q ss_pred CCCCCC------------ccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcc----hHHHHhCCcccCCCCC-- Q lcl|NC_019406. 1 MAGLSP------------NSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAG----EREIKAQGVKYLKAPK-- 62 (661) Q Consensus 1 ~~~~~~------------~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G----~~~vr~~g~~YLPk~~-- 62 (661) -+.-+| .++|-.+.-.+.-.|+..+ .|.- ...=.++.|-+++ ...-+....-++++.- T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~-~~~~---~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~ 127 (862) T protein:vir:99 52 KPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAV-RSAI---KAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQD 127 (862) T ss_pred cCCCCCCcccccccccccccccchhhhhhhhcchhhc-chhh---hhhhhhhhhcchhhhhhccccccccccccchhccc Confidence 111111 1122111111111222221 1111 1111223332221 1010111111122110 Q ss_pred ---CCChHHHHHHHhhhcccchHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCC Q lcl|NC_019406. 63 ---GFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKD 139 (661) Q Consensus 63 ---~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~ 139 (661) .-.-..|. ++..=-....++..|+..+.-..|+.+.|.... || ...-++.++.|+..+++. T Consensus 128 ~~~~~~f~gyq-l~alY~~~~larkiVd~pAeDatR~g~~I~~~~----------d~--~e~~~e~~~~ie~~~~rL--- 191 (862) T protein:vir:99 128 WYLSQGFIGHQ-ACALIAQHWLVDKACSLAGEDAIRNGWHLKSLG----------EG--EEIDEESLEKFKAIDVEF--- 191 (862) T ss_pred cccccCcccHH-HHHHHHhCchhhhhhhhhhHHHhhCCceEeecC----------cc--cccCHHHHHHHHHHHHHh--- Confidence 00112232 122111245566777778888889988885211 11 012244556666665554 Q ss_pred CCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeecccccc Q lcl|NC_019406. 140 GTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATP 219 (661) Q Consensus 140 G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~ 219 (661) .|..-++++++..-.||+.++|+.-...+... -..|. .++. + ++..|.-++ T Consensus 192 --~v~~~l~eair~~RLyGga~ililv~~~D~~~-LsqPL----n~e~--------I-~kG~lkgl~------------- 242 (862) T protein:vir:99 192 --KVKENLIEFNRFKNVFGIRVAIFVVDSEDPDY-YEKPF----NPDG--------I-TPGSYRGIS------------- 242 (862) T ss_pred --hHHHHHHHHHHhcccccceEEEEEecCcCchh-hhcCc----Cccc--------c-cccceeEEE------------- Confidence 45666666666666688777765432222110 00111 1110 1 000111000 Q ss_pred ccccceeeeechhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccc Q lcl|NC_019406. 220 SQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLG 299 (661) Q Consensus 220 ~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~ 299 (661) .+.+..+ ... ...+....+..+ .|..-..|++. |..+..-+ T Consensus 243 --------vlDp~w~----~p~----------~v~~~~~Dp~sp--~yGkP~~y~I~-------g~~IH~SR-------- 283 (862) T protein:vir:99 243 --------QIDPYWM----MPM----------LTAESTADPSSQ--FFYEPEFWIIS-------GQKYHRSH-------- 283 (862) T ss_pred --------Eechhhh----ccc----------cccccccccccc--ccCCceeeeec-------Ceeeccce-------- Confidence 0000000 000 000000111111 11112223221 11111111 Q ss_pred ccccceeeccCCcccceeeEEEEecCCCCCCccccchhHHHHHHHHHHhhhhh-HHHHHHHhcCceeEEecCCCCCC-ce Q lcl|NC_019406. 300 QARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAE-LEHGRFFTALPTYYAPELDDSDA-SE 377 (661) Q Consensus 300 ~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSD-l~~il~~~~~P~l~i~Gl~~~~~-~~ 377 (661) ++ ...|.+++.+ . ....+. .+.| ++..++=-|..|...+. -..+++...+.++-+.|+..--. .. T Consensus 284 -----li-if~g~~vpd~---l-k~ay~f--~G~S-vLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~ 350 (862) T protein:vir:99 284 -----LI-IARGPQPADI---L-KPTYIF--GGIP-LVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDK 350 (862) T ss_pred -----eE-EecCCCchhh---h-hccCCc--cCcc-HHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccHHH Confidence 11 1122222221 0 011111 2333 44445555666655543 35567777777777777642111 00 Q ss_pred ---------eEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHHhHHh----ccccc-CccchhHHHHHH Q lcl|NC_019406. 378 ---------YHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAIGGRL----MPGMS-KSVSESDNQSAL 443 (661) Q Consensus 378 ---------l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~lGArl----l~~~~-~~~~eTataa~~ 443 (661) ..-+..+.+.+.. +.++..+..+-+++.. .|....++|... +++ |-.++ ++-+.|++. T Consensus 351 l~~r~~~~~~~rdN~Gi~liD~-eEe~e~ls~slSGL~d---ll~~~~q~IAaa-s~IP~tiLfGqspaGlnATGE~--- 422 (862) T protein:vir:99 351 FIQRLMFWVRYRDNHAVKVLGT-DETMEQFDTSLADFDA---VIMGQYQLVASI-AKTPATKLLGTAPKGFNSTGEF--- 422 (862) T ss_pred HHHHHHHHHhccCcceeEEecC-CCceeEEecccCChHH---HHHHHHHHHHhh-hCCCceeecccCcccccCchHH--- Confidence 1123233445553 4577777765556544 333444444432 221 21222 232334443 Q ss_pred HHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHHH-----HHHHHHHHhcCCCCHH Q lcl|NC_019406. 444 REANEQSLLLNVI-MALEDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDARA-----LRAIQQLYEGGLLPID 517 (661) Q Consensus 444 d~~~~~S~L~~~A-~~le~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~~-----l~all~~~~aG~Is~e 517 (661) |...=+..+.++- ..+...+++++.++..-+|.+ .++.|+.|+=+.....+-.+ .+++..++++|.|+-+ T Consensus 423 D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~----~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispd 498 (862) T protein:vir:99 423 ETISYHEELESIQEHVYMPFLQRHYLISRLSLGIQ----HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPD 498 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH Confidence 2233344444443 457778888887776556642 35777776422221111112 2456678889999999 Q ss_pred HHHHHHHhcCCCC-ccCCHHHHHHHHhccCCCCC-CchhhhhhcCCccccCCCcchhhhhcCCh-h--hHHHHHHHhccC Q lcl|NC_019406. 518 ALYENFVKNGIIP-STQTLEEFTIKMNDPKSFIG-QPDAIAMRRGYVSRQQELDQQRAARDADF-Q--QQELEQAERHLE 592 (661) Q Consensus 518 t~~~eL~r~gvl~-~~~~~Eee~~~l~~~~~~l~-~ddae~~~~g~~~~~~~~~q~~~~~e~d~-~--q~~~~~~e~~~~ 592 (661) +.+.+|+..+... +..+.++.. ..+... -+.++.+.-|....+..-+.++++..+.. + |......+ ..+ T Consensus 499 EvR~~L~~~~~~g~~~l~ded~E-----~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~-~~~ 572 (862) T protein:vir:99 499 EERNRIRDDKRSGYNRLTKEDAE-----ETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVP-SMK 572 (862) T ss_pred HHHHHHHhcCCcCCCCCCccccc-----ccCCCCcccccccccCCcccccccccccccccCCccccCCcccccccC-CCC Confidence 9999998765522 223222221 111111 11111122222222221111111111110 0 00111111 001 Q ss_pred CCchhHHHhhhh--------------hhhhhhHHHhc-------CCh-hhhhhhhhhhhHHHHhhcccccCCCCCCCccc Q lcl|NC_019406. 593 IDEEKLRISAKV--------------GSTSVAASRKL-------GDP-EQAKPSKAEQAQIDAQQKQAAAKPVTPTPGTV 650 (661) Q Consensus 593 ~~~~~~~~~~~~--------------~~~~~~~~~~~-------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 650 (661) +=++-+....+. ++|-+..-.+. +.+ ++..-+-..++.......+-.|+-.+..|++. T Consensus 573 ~g~~~~~t~~~~a~~p~~~~~~~~~~~~~~e~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 652 (862) T protein:vir:99 573 PGQMVGPEVGITAPMPEDDAPVAGVVAKLAELQQAQMGAVTGVLARLVEQLDRMHDRTIAEGADIGQYDASGRTVKPGTI 652 (862) T ss_pred CCCccccccccccCCCccccccCcccccchhhhcCcchhhcchhhhhHHHHHhhhhhhhhhhcchhhhcccccccccccc Confidence 100001111111 11111110000 000 11111111111111111111111123333332 Q ss_pred ccCCCCc------------------cCCC Q lcl|NC_019406. 651 QRGRPPQ------------------NGAS 661 (661) Q Consensus 651 ~~~~~~~------------------~~~~ 661 (661) -.-+|-+ ||-. T Consensus 653 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 681 (862) T protein:vir:99 653 ATIRPSVSGNHVGEQPTYQLPKMKMNGRI 681 (862) T ss_pred CCCCCcccccccccCCccccceeEeccee Confidence 2222222 1111 No 162 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=33.33 E-value=1.4 Score=19.84 Aligned_cols=449 Identities=11% Similarity=0.052 Sum_probs=167.7 Q ss_pred ccCHHHHH------HHHHHHHHHHHhcchHHHHhCCcccCCCCCCCCh--HHHHHH-Hhhh----cccchHHHHHHHHhc Q lcl|NC_019406. 25 VVHPEYEY------YRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDD--EDYANY-LDRA----AFYNMTSQTQAGMVG 91 (661) Q Consensus 25 ~~hPey~a------~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~--~~Y~~r-l~rA----~~~n~~~~tv~~l~G 91 (661) .+.|...+ +.+. ..+...++|.......-..+.|..-.-+. ..+..+ ..|| .=.++.+..|+.++. T Consensus 1 ~~~p~~~~~~~~~~~~~~-~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 79 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSL-REYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQD 79 (533) T ss_pred CCCchhhhhhcccccchH-HHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 22222221 1111 11222233321111112224443321111 112222 1222 123445555554444 Q ss_pred hhhccCccccccchhhHhhhhcccccccccchhhh-hhhHhhhhh----ccCCCC-CHHHHHHHHHHHHHhhCCEEEEEe Q lcl|NC_019406. 92 QIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASI-GKLLTQLQR----FAKDGT-SHQGFAKTVALEQVAMGRFGALVD 165 (661) Q Consensus 92 ~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~-~~~~~~~~~----~dl~G~-sL~~fa~~~~~~~L~~Gr~gvLVD 165 (661) .|=-.-+++...|+. ..| +.|+......-+.+ ..+..++++ ||..|. +++++.+.+++..+..|=|++..- T Consensus 80 nvVG~Gi~~~~~p~~-~~l--g~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~ 156 (533) T protein:vir:34 80 HIVGSFFRLSHRPSW-RYL--GIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQAT 156 (533) T ss_pred HhhCCCceeeeccch-hhc--CCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEee Confidence 443222222212211 001 11111001111122 235566665 677777 999999999999999999999876 Q ss_pred ccCCCchhhcccce---eEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhh Q lcl|NC_019406. 166 VAPSSDPTAPAKSY---TVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRR 242 (661) Q Consensus 166 ~P~a~~~~~g~rPY---~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~ 242 (661) +-+..+ +|| +-+|.|+.|-+..-...++ +|+ T Consensus 157 ~~~~~g-----~~~~~~lq~ie~d~l~~~~~~~~~~--------------------------~i~--------------- 190 (533) T protein:vir:34 157 WDTSSS-----RLFRTQFRMVSPKRISNPNNTGDSR--------------------------NCR--------------- 190 (533) T ss_pred eccCCC-----CccceEEEEechhhcCCCCCCCCCC--------------------------ceE--------------- Confidence 644321 122 3556665543322111000 000 Q ss_pred cchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc--ccceeeccCCcccceeeE- Q lcl|NC_019406. 243 AGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA--RDVYTPMVRGRTLPFIPF- 319 (661) Q Consensus 243 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~--~~~~~p~~~g~~L~~IPf- 319 (661) +|+ + .+.+|. ...++++.....+.. ....+|. ...||- T Consensus 191 ~GI------------e---------------------~d~~Gr-~~aY~i~~~~~~~~~~~~~~~~~~-----~~~v~a~ 231 (533) T protein:vir:34 191 AGV------------Q---------------------INDSGA-ALGYYVSEDGYPGWMPQKWTWIPR-----ELPGGRA 231 (533) T ss_pred eee------------E---------------------ECCCCC-eEEEEEeecCCCCccccccceeee-----eeccChh Confidence 010 0 011111 111111111110000 0000110 000110 Q ss_pred -EE--EecCCCCCCccccchhHHHH--HHHHHHhhhhhHHHHHHHhcCceeEEe-cCCCC-------------------- Q lcl|NC_019406. 320 -VF--FGSMSNAADCEKPPLLDIVE--LNLKHYRTYAELEHGRFFTALPTYYAP-ELDDS-------------------- 373 (661) Q Consensus 320 -v~--~~~~~~~~~~~~pPLldLA~--LNl~HYq~sSDl~~il~~~~~P~l~i~-Gl~~~-------------------- 373 (661) |. +-..+.+-.-+.|.|...-. ..+..|. .|.+..+.--+.+ ..||+ ..... T Consensus 232 ~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~-dael~~a~i~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (533) T protein:vir:34 232 SFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQ-NTQLQSAIVKAMY-AATIESELDTQSAMDFILGANSQEQRERLTG 309 (533) T ss_pred HeeeeccccCCCcccCCchHHHHHHHHHHHHHHH-HHHHHHHHHhhhh-eeeeecCCCcccccccccCCCcccccccccc Confidence 11 11223333345555543211 1222331 2233333333333 44443 21100 Q ss_pred ---------CCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH-HHh--HHhcccc-cCccchhHHH Q lcl|NC_019406. 374 ---------DASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA-AIG--GRLMPGM-SKSVSESDNQ 440 (661) Q Consensus 374 ---------~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~-~lG--Arll~~~-~~~~~eTata 440 (661) ....+.++++++..|+ +|.+++|++++.. .......++.+...+. .+| ..+|..- ++..=.|+-+ T Consensus 310 ~~~~~~~~~~~~~~~l~pG~i~~L~-pGe~i~~~~~~~p-~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~ 387 (533) T protein:vir:34 310 WIGEIAAYYAAAPVRLGGAKVPHLM-PGDSLNLQTAQDT-DNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARA 387 (533) T ss_pred cchhhhhccCcceeeccCceeeecC-CCCeeeecCCCCC-CCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHH Confidence 0113568999988888 4889999997632 2222333333333222 222 1223221 1111123445 Q ss_pred HHHHHHHhhHHHHHHHHHHHH-HHHHHHHHH---HHHcCCC-CCCcceEEEE------eccccccc---cCCHH-HHHHH Q lcl|NC_019406. 441 SALREANEQSLLLNVIMALED-GMTSVVRYW---LMFRDIP-LTDTATLRYE------IDATFLTT---ALDAR-ALRAI 505 (661) Q Consensus 441 a~~d~~~~~S~L~~~A~~le~-Al~~aL~~~---A~w~G~~-~~~~~~~~v~------ln~DF~~~---~lda~-~l~al 505 (661) ..+++-.....++.. +.. -.+-++++| |...|.- .+....+.|. ++-+|... -+|+. ++++. T Consensus 388 ~~~e~~r~~~~~q~~---~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~ 464 (533) T protein:vir:34 388 SANESWAYFMGRRKF---VASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEA 464 (533) T ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHH Confidence 555555555555542 222 222233222 2223321 1110000010 01122222 23564 89999 Q ss_pred HHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC-----CCCCchhhhhhcCCccccCCCcchhhhhcCChh Q lcl|NC_019406. 506 QQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS-----FIGQPDAIAMRRGYVSRQQELDQQRAARDADFQ 580 (661) Q Consensus 506 l~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~-----~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~ 580 (661) +.++.+|..|++.... ++|. +|+++.+.++.+.. +|..+..-..... .+.+|......+| T Consensus 465 ~~~i~~G~~s~~~~~a---~~G~-----D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~-----s~~~~~~~~~~~~-- 529 (533) T protein:vir:34 465 VMLIEAGLSTYEKECA---KRGD-----DYQEIFAQQVRETMERRAAGLKPPAWAAAAFE-----SGLRQSTEEEKSD-- 529 (533) T ss_pred HHHHHcCCCCHHHHHH---HcCC-----CHHHHHHHHHHHHHHHHhcCCCCCCCCCcCcc-----CCCCCCCCCCccc-- Confidence 9999999999987754 4464 55665555543311 2222211111111 1111111111111 Q ss_pred hHHHHHHHhccC Q lcl|NC_019406. 581 QQELEQAERHLE 592 (661) Q Consensus 581 q~~~~~~e~~~~ 592 (661) ..+| T Consensus 530 --------~~~~ 533 (533) T protein:vir:34 530 --------SRAA 533 (533) T ss_pred --------CCCC Confidence 1111 No 163 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=32.33 E-value=1.4 Score=19.73 Aligned_cols=369 Identities=8% Similarity=-0.011 Sum_probs=144.2 Q ss_pred CCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCC-CCCChHHHHHHHhhhcccchHHHHHHHHhchhhccCcc Q lcl|NC_019406. 21 FTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAP-KGFDDEDYANYLDRAAFYNMTSQTQAGMVGQIFRRPPV 99 (661) Q Consensus 21 ~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~-~~E~~~~Y~~rl~rA~~~n~~~~tv~~l~G~vFrk~p~ 99 (661) |.+..+...-.+ .....++-+.+. -+..+++.+ .++.-.. ...++. +.+-..++.+++.|-.-|+. T Consensus 1 M~~f~~~~~~~~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~v~~-~~al~~----~~v~~~i~~ia~~ia~~p~~ 67 (386) T protein:vir:49 1 MPIFNITNLATE---SPPINQESFFDI-----ADSDFLASLNSSEWVSA-ENALKN----SDLFSIISQLSNDLATAKIT 67 (386) T ss_pred CchhhhhccCCC---Ccccchhhhhhh-----hhccccccccCCceech-hhhhcc----HHHHHHHHHHHHHhhhCcee Confidence 443322110000 000000000000 000011111 0110000 112222 23334555555555555554 Q ss_pred ccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhhcccc- Q lcl|NC_019406. 100 IRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTAPAKS- 178 (661) Q Consensus 100 i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~g~rP- 178 (661) +.. .....|....+ ...+-.+|.+.++...+.+|-+++++...... +| T Consensus 68 ~~~--~~~~~l~~~PN-----------------------~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g------~~~ 116 (386) T protein:vir:49 68 TSR--KQLQGIVDNPS-----------------------NNANRFNFYQSIFAQMLLGGEAFAYRWRNDNG------RDM 116 (386) T ss_pred ecc--chhhhhhhccC-----------------------CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC------cEE Confidence 421 11122322222 26688899999999999999999998764321 11 Q ss_pred eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhhhhhee Q lcl|NC_019406. 179 YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSARADALA 258 (661) Q Consensus 179 Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~ 258 (661) -+..+.|.+|. +. . T Consensus 117 ~l~~i~~~~v~-----------------v~-------------------------------------------------~ 130 (386) T protein:vir:49 117 KWEYLRPSQVS-----------------FN-------------------------------------------------R 130 (386) T ss_pred EEEEecCceeE-----------------EE-------------------------------------------------E Confidence 12222222110 00 0 Q ss_pred cccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEEEEecCCCCCCccccchhH Q lcl|NC_019406. 259 RPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFVFFGSMSNAADCEKPPLLD 338 (661) Q Consensus 259 ~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~~pPLld 338 (661) +. ++ ....|++..- .. ..+.. ...|. =..|-|- +....+...+.||+.. T Consensus 131 --~~-~~---~~~~y~~~~~---------------~~-~~~~~--~~~~~-----~evih~~--~~~~~~~~~G~s~l~~ 179 (386) T protein:vir:49 131 --LD-NQ---NGLYYNITFD---------------DP-HIAPK--QHVPQ-----NDILHFR--LLSVDGGLTSVSPLMA 179 (386) T ss_pred --cC-CC---ceEEEEEEEc---------------Cc-cccce--eEEcc-----ccEEEec--CCCCCCccccccHHHH Confidence 00 00 0011111110 00 00000 00000 0112111 1122233456777655 Q ss_pred HHHHHHHHHhhhhhHHHHHHH-hcCceeEEe--cCCCCC-C-------ceeEecccceeecCCCCCcceEeecCchhHHH Q lcl|NC_019406. 339 IVELNLKHYRTYAELEHGRFF-TALPTYYAP--ELDDSD-A-------SEYHIGPGRVWVVDKESGIPGIIEFKGEGLKT 407 (661) Q Consensus 339 LA~LNl~HYq~sSDl~~il~~-~~~P~l~i~--Gl~~~~-~-------~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a 407 (661) +.. .+.......++...++. .+.|-.++. |-...+ . ....=.++.++.++ .|.++.-+..+..-.. T Consensus 180 ~~~-~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~~- 256 (386) T protein:vir:49 180 LGR-EFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQ- 256 (386) T ss_pred HHH-HHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecC-CCceEEEccCChhHHH- Confidence 443 44555555555555544 456877764 221111 0 01222334455555 3444444433333222 Q ss_pred HHHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcceE Q lcl|NC_019406. 408 LERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGMTSVVRYWLMFRDIPLTDTATL 484 (661) Q Consensus 408 ~~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al~~aL~~~A~w~G~~~~~~~~~ 484 (661) ..+..+-..++++++ | ..+|- .+.. .++.....+. .-...|.-+...+++.+++.| +- .+ T Consensus 257 ~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~--~~~~~~~~~~-~~~~~i~~~l~~i~~~~~~~l-------~~------~~ 319 (386) T protein:vir:49 257 LLSQADWTTGQFAKVYGIPESIVG-GDGD--QQSSLEMIYN-IYFKSVSRYLRPFVSEMSKKL-------SC------EV 319 (386) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhC-CCCC--ccchHHHHHH-HHHHHHHHHHHHHHHHHHHHh-------cc------hh Confidence 234444455555543 3 33331 1111 1222222222 223456666666666666654 11 12 Q ss_pred EEEeccccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCCCCCCchhhh Q lcl|NC_019406. 485 RYEIDATFLTTALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKSFIGQPDAIA 556 (661) Q Consensus 485 ~v~ln~DF~~~~lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~~l~~ddae~ 556 (661) .+.++. .. ..|.. ....+-.++.+|.++.-+.++.|.+.|+++++....+.......++ |.++++. T Consensus 320 ~~~~~~--~~-~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~~g---Gd~~~~~ 386 (386) T protein:vir:49 320 DVDISP--AV-DPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNRTSLKG---GEINEQD 386 (386) T ss_pred cccchh--hh-ccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCCCC---CCCCCCC Confidence 222221 11 12333 3455667888999999999999999998876543211100000000 1111111 No 164 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=31.12 E-value=1.5 Score=19.58 Aligned_cols=188 Identities=16% Similarity=0.123 Sum_probs=78.6 Q ss_pred ccccchhHHHHHHHHHHhhhhhHHHHHHHhcCceeEEecCCCCCCceeEecccceeecCCCCCcceEeecCchhHHHHHH Q lcl|NC_019406. 331 CEKPPLLDIVELNLKHYRTYAELEHGRFFTALPTYYAPELDDSDASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLER 410 (661) Q Consensus 331 ~~~pPLldLA~LNl~HYq~sSDl~~il~~~~~P~l~i~Gl~~~~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~ 410 (661) +=+ ..+|+.+.- .+...... ++-+ + .. .=|-+.++.+.+++-++..+..+-+++.. T Consensus 1 V~k--~~~l~~~~~------~~~~~~~~--r~~~-~----~~------~~~~~~~~~ld~~~e~~e~~~~~lsGl~d--- 56 (201) T protein:vir:10 1 MWK--AKGLADLCD------DSDGAARL--RLAQ-V----DN------NSGVGQAIGIDADSEEYNVLNSDIGGIDT--- 56 (201) T ss_pred Ccc--chHHHHHhc------CChHHHHH--HHHH-H----HH------hhhhhhhheeecCCcceeeeecCcCChHH--- Confidence 101 011111110 00000000 0000 0 00 00112233344333456666655555554 Q ss_pred HHHHHHHHHHHHh----HHhcccccCccchhHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCCCCCCcceEE Q lcl|NC_019406. 411 ALNEKEQQIAAIG----GRLMPGMSKSVSESDNQSALREANEQSLLLNVI-MALEDGMTSVVRYWLMFRDIPLTDTATLR 485 (661) Q Consensus 411 ~L~~le~qM~~lG----Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A-~~le~Al~~aL~~~A~w~G~~~~~~~~~~ 485 (661) .|....++|...- -+|+-.+.++-+-|+..- ...=+..++++- ..+.-++++.++ +...+ .++. T Consensus 57 ~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d---~~nyyd~i~~~Qe~~l~p~le~l~~----~~~~~----~~~~ 125 (201) T protein:vir:10 57 FLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTA---LETFYGYVDRKRKAELLPLLEFLLP----FIVTE----QEWS 125 (201) T ss_pred HHHHHHHHHHhHhcCchhhhcCCCCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHH----hhcCC----CCce Confidence 3444555554332 233322222223333322 222233333333 334445555554 44332 3677 Q ss_pred EEeccccccccCCHH-----HHHHHHHHHhcCCCCHHHHHHHHHhcCC---CCccCCHHHHHHHHhccCCCCCCch Q lcl|NC_019406. 486 YEIDATFLTTALDAR-----ALRAIQQLYEGGLLPIDALYENFVKNGI---IPSTQTLEEFTIKMNDPKSFIGQPD 553 (661) Q Consensus 486 v~ln~DF~~~~lda~-----~l~all~~~~aG~Is~et~~~eL~r~gv---l~~~~~~Eee~~~l~~~~~~l~~dd 553 (661) |+.|+=+.....+-+ ..++...++++|.|+....+.+|+..+. ++++...+++.....+..+..+.++ T Consensus 126 ~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 126 VEFNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred EeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCccccccccCCCCCCCCCC Confidence 887763333332222 2456777889999999999999998654 5444444444333322222223222 No 165 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=29.01 E-value=1.7 Score=19.33 Aligned_cols=412 Identities=12% Similarity=0.042 Sum_probs=139.2 Q ss_pred CCCCCCccccccccccccccCCc-cccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTH-LVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFY 79 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V-~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 79 (661) |+--- .+|-.-++++..+.+ ..--+.+. +. ........++|. . ..|. +. ..+..|+.+..| T Consensus 1 ~~~~l---~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~g~~-~--~~g~-~v---------~~~~al~~~~V~ 62 (434) T protein:vir:43 1 MSKSL---GKVLSSATSAPRSSLFGWGGKTIR-LT-DGAFWSQFLGRE-S--SSGK-KV---------TVDKAMKLSAVW 62 (434) T ss_pred Cccch---hhhhhhcccccchhhhcccccccc-cC-chHHHHHHhcCC-c--cCCc-ee---------chhhhhccHHHH Confidence 33211 111111111111100 00000000 00 000111122110 0 0111 00 112223333333 Q ss_pred chHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCC Q lcl|NC_019406. 80 NMTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGR 159 (661) Q Consensus 80 n~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr 159 (661) -...-+-+.+..+ |..+ ...+.||.......+ .+...|-.==-...+-.+|.+.++...+.+|- T Consensus 63 ~~i~~ia~~ia~l----p~~~---------~~~~~~g~~~~~~~~---~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 126 (434) T protein:vir:43 63 ACVRLISTSVAGL----PLGV---------YERKADGSRVDARSF---PLYDVVHNSPNDDMTAFQFWQAMVASMLLWGN 126 (434) T ss_pred HHHHHHHHhhhhC----ceEE---------EEEcCCCcccccccc---HHHHHHhccCCCCCCHHHHHHHHHHHHhhcCC Confidence 3333333333333 2222 011222221111111 11111100001367888999999999999999 Q ss_pred EEEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcch Q lcl|NC_019406. 160 FGALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTS 238 (661) Q Consensus 160 ~gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~ 238 (661) ++++|... . .+| .+..+.|..|.= T Consensus 127 ay~~i~~~--~-----G~~~~L~~l~p~~v~~------------------------------------------------ 151 (434) T protein:vir:43 127 AYAEIRRA--A-----GRPAALDFLLPSRVDL------------------------------------------------ 151 (434) T ss_pred eEEEEEeC--C-----CcEEEEEEEcCcceEE------------------------------------------------ Confidence 99998643 1 111 122222222100 Q ss_pred hhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceee Q lcl|NC_019406. 239 GGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIP 318 (661) Q Consensus 239 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IP 318 (661) ..+.+|...|++. ..+ |.. ...+ -..| T Consensus 152 ----------------------------------------~~~~~g~~~y~~~--~~~--g~~--~~~~------~~eV- 178 (434) T protein:vir:43 152 ----------------------------------------ECDENGRLKYFYT--TKK--GAR--REIE------RTNM- 178 (434) T ss_pred ----------------------------------------EEcCCCeEEEEEE--ecC--ceE--EEEc------cccE- Confidence 0001111122111 000 000 0000 0001 Q ss_pred EEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHH-HhcCceeEEec---CCCCCCce----e--Eecc---cce Q lcl|NC_019406. 319 FVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRF-FTALPTYYAPE---LDDSDASE----Y--HIGP---GRV 385 (661) Q Consensus 319 fv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~-~~~~P~l~i~G---l~~~~~~~----l--~iGs---~~~ 385 (661) +.+-....+...+.+|+.-++. .|.......++...++ ..+.|-.++.- ++++..+. + ..|. +.. T Consensus 179 -ih~~~~~~dg~~G~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~ 256 (434) T protein:vir:43 179 -LHIPAFTLDGRIGLSAIRYGVD-VFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRS 256 (434) T ss_pred -EEecCcCCCCccccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCc Confidence 1111111223356788765544 3444444445554444 44678777642 11111000 0 1232 334 Q ss_pred eecCCCCCcceEeecCchhHHHH-HHHHHHHHHHHHH-Hh--HHhcccccCccc-hhH-HHHHHHHHHhhHHHHHHHHHH Q lcl|NC_019406. 386 WVVDKESGIPGIIEFKGEGLKTL-ERALNEKEQQIAA-IG--GRLMPGMSKSVS-ESD-NQSALREANEQSLLLNVIMAL 459 (661) Q Consensus 386 ~~lp~~ga~~~ylE~~g~~i~a~-~~~L~~le~qM~~-lG--Arll~~~~~~~~-eTa-taa~~d~~~~~S~L~~~A~~l 459 (661) +.++ .| ++|.+.+-.+-.+. .+..+-...++++ .| ..++-...++.. -|. ++..+. =-...|.-++.++ T Consensus 257 ~vl~-~g--~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~--f~~~~L~P~~~~i 331 (434) T protein:vir:43 257 PVLE-QG--ITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLA--FLTFSISSITNQI 331 (434) T ss_pred cccC-CC--ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHH--HHHHHHHHHHHHH Confidence 4555 34 45555544433322 2222222333332 22 222311111100 011 122222 2244588888888 Q ss_pred HHHHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHH Q lcl|NC_019406. 460 EDGMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEF 538 (661) Q Consensus 460 e~Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee 538 (661) |++++.-|---..+. .+.|+++.+=..+ .|.. -..++-.++..|.++.-+.++.+ |+ ++... T Consensus 332 e~~ln~kL~~~~~~~--------~~~~~fd~~~llr-~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl-~p~~g---- 394 (434) T protein:vir:43 332 QQCVNKRLLTAPERI--------RYYAEFSLEGFLK-ADSAGRAAWYSTMAQNGFMTRNEGRRKE---NL-PELPG---- 394 (434) T ss_pred HHHHHhhcCChhhhc--------CceEEEechhhhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHh---CC-CCCCC---- Confidence 888887542112222 2334444321222 2444 46677788899999988887654 33 11100 Q ss_pred HHHHhccCCCCCCchhhhhhcCCccccCCCcchhhhhcCChhhHHHHHHHhccCCCc Q lcl|NC_019406. 539 TIKMNDPKSFIGQPDAIAMRRGYVSRQQELDQQRAARDADFQQQELEQAERHLEIDE 595 (661) Q Consensus 539 ~~~l~~~~~~l~~ddae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e~~~~~~~ 595 (661) -+++--+......+.+...+ ..+.... -.+.++-+-+|+| T Consensus 395 gD~~~~~~n~~~~~~~~~~~--------~~~~~~~---------~~~~~~~~~~~~~ 434 (434) T protein:vir:43 395 GDILTVQSNLVPIDQLGQSN--------KSQAVRA---------ALMNWFSQPEPQE 434 (434) T ss_pred CCeEeeccCccchhhhhccC--------CCcchhh---------hhhccCCCCCCCC Confidence 01111111111111110000 0000000 0000111111111 No 166 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=27.80 E-value=1.8 Score=19.17 Aligned_cols=373 Identities=12% Similarity=-0.004 Sum_probs=153.2 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.|+- | .+++...+.+...++...+. ..|.-+...+.|. .|..+-+ ...++ .+ T Consensus 2 ~m~~~-~--~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~-----~g~~v~~----------~~al~----~~ 54 (392) T protein:vir:74 2 ILPIL-N--FINQTNDPPEAGSVQSYFPD-----GNDAQIMESLLGD-----NNEWVSA----------RAALR----NS 54 (392) T ss_pred cchhh-h--hhhcccCccccccccccccc-----CchhhhhhhccCC-----CCcccch----------hhhhc----ch Confidence 55552 1 23332222222222211111 1122222333221 1221100 11122 23 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .+...|+.++..|-.-|..+..- ....|.+..+ ...+-.+|.+.++...+.+|-+ T Consensus 55 ~v~~~v~~ia~~ia~lp~~~~~~--~~~~l~~~PN-----------------------~~~t~~~f~~~~~~~lll~Gna 109 (392) T protein:vir:74 55 DLFSIILQLSSDLAIVKINAEKK--KNQGIIDNPS-----------------------TNANKHGFWQSMFAQLLLGGEA 109 (392) T ss_pred HHHHHHHHHHHhhccCceeeccc--hhhhhhhhcC-----------------------CCCCHHHHHHHHHHHhhhcCCE Confidence 34444444444444434433211 1112222221 3678889999999999999999 Q ss_pred EEEEeccCCCchhhcccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSG 239 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~ 239 (661) ++++...... +| -+..+.|..|. + T Consensus 110 ~~~i~r~~~G------~~~~L~~i~~~~v~-----------------v-------------------------------- 134 (392) T protein:vir:74 110 FAYRWRNANG------ADMKWEYLRPSQVN-----------------T-------------------------------- 134 (392) T ss_pred EEEEEECCCC------cEEEEEEEcCceeE-----------------E-------------------------------- Confidence 9998754211 11 11111221110 0 Q ss_pred hhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCccccee-e Q lcl|NC_019406. 240 GRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFI-P 318 (661) Q Consensus 240 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~I-P 318 (661) .. +. +....+|++.. ..... +.. ...+ -+.| - T Consensus 135 ------------------~~-~~----~~~~~~y~~~~---------------~~~~~-~~~--~~~~------~~evih 167 (392) T protein:vir:74 135 ------------------YY-FE----YENGMYYNITF---------------DDPKI-EPI--LQAP------QSDLIH 167 (392) T ss_pred ------------------EE-cC----CCceEEEEEEe---------------cCCcc-cee--EEEc------CccEEE Confidence 00 00 00011122211 00000 000 0000 0111 1 Q ss_pred EEEEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHH-HHhcCceeEEe--c-CCCCCC------cee--Eeccccee Q lcl|NC_019406. 319 FVFFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGR-FFTALPTYYAP--E-LDDSDA------SEY--HIGPGRVW 386 (661) Q Consensus 319 fv~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il-~~~~~P~l~i~--G-l~~~~~------~~l--~iGs~~~~ 386 (661) | -+...++...+.+|+..+... |..-.....+.... ...+.|-.+++ + ....+. ..+ .-.++..+ T Consensus 168 ~--~~~~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~ 244 (392) T protein:vir:74 168 M--KLLSIDGGKTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPV 244 (392) T ss_pred e--cCCCCCCccccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCee Confidence 1 111223344577887655542 33333334444433 44566776664 2 111110 001 11233445 Q ss_pred ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREANEQSLLLNVIMALEDGM 463 (661) Q Consensus 387 ~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~~~S~L~~~A~~le~Al 463 (661) .++ .|.++.=+..+..-.+ ..+..+-...+++++ | ..++- .. ..+.+..+.... --...|.-++..+++++ T Consensus 245 vl~-~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg-~~-~~~~~~~e~~~~--~~~~~l~p~~~~ie~~l 318 (392) T protein:vir:74 245 VLD-DLEEFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIG-GQ-GDQQSSIQQISG--MYASALNRYLRPAISEL 318 (392) T ss_pred ecC-CCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhC-CC-CCcccHHHHHHH--HHHHHHHHHHHHHHHHH Confidence 665 3544444443333222 234444455555543 3 22331 11 111122222222 23456777888888888 Q ss_pred HHHHHHHHHHcCCCCCCcceEEEEeccccccccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHH Q lcl|NC_019406. 464 TSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDA-RALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKM 542 (661) Q Consensus 464 ~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l 542 (661) +..|- . .+.+.+..-|. .|. .....+-.++.+|.+++...++.|.+.|+.+.+.. +. T Consensus 319 ~~~l~---~----------~~~~~~~~~~~---~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r------~~ 376 (392) T protein:vir:74 319 EYKLS---D----------HISVNMRPAID---PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLP------AP 376 (392) T ss_pred HHhcc---c----------hhcccchhhhc---CCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccc------hh Confidence 77651 1 12222222221 122 34567788899999999999999999999875553 12 Q ss_pred hccCCCCCCchhhhhhcCCccccCCCc Q lcl|NC_019406. 543 NDPKSFIGQPDAIAMRRGYVSRQQELD 569 (661) Q Consensus 543 ~~~~~~l~~ddae~~~~g~~~~~~~~~ 569 (661) ++ .+.+ +.|.. .++-| T Consensus 377 en-l~~~--------~~Gd~--~~p~p 392 (392) T protein:vir:74 377 EN-TNKK--------TTGQS--NEPVP 392 (392) T ss_pred cC-CCCC--------CCCCC--CCCCC Confidence 11 2221 22322 22222 No 167 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=25.45 E-value=2.1 Score=18.87 Aligned_cols=363 Identities=13% Similarity=0.127 Sum_probs=144.2 Q ss_pred CCCCCCccccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChHHHHHHHhhhcccc Q lcl|NC_019406. 1 MAGLSPNSANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDEDYANYLDRAAFYN 80 (661) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n 80 (661) |.=+++.+..-+ ... .......+.+. ....+|. .+ .|+- . +.+++- + T Consensus 1 Mg~~~~~~~~k~----~~~-~~~~~~~~~~~---------~~~~~~~-----~~-~~v~------~---~~~l~~----~ 47 (383) T protein:vir:10 1 MGLLTPKNFSKR----NAK-NMVYPSNPAFF---------TTTVGGM-----QL-SYVS------A---LSALQN----T 47 (383) T ss_pred CCcccccccccc----ccc-ccccccchhhh---------hhhccCc-----cc-cccc------h---hHhhcc----h Confidence 333332211111 100 01111111111 1111111 11 1110 0 122322 3 Q ss_pred hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCE Q lcl|NC_019406. 81 MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRF 160 (661) Q Consensus 81 ~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~ 160 (661) .+...|+.+++.+-.-|..+.. ..+.+|+...+ ...+-.+|.+.++...+.+|-+ T Consensus 48 ~v~~~i~~ia~~ia~~~~~~~~--~~~~~ll~~PN-----------------------~~~t~~~f~~~~~~~l~l~Gn~ 102 (383) T protein:vir:10 48 NVYSVINRIASDVSSAHFKTEN--TATLNRLESPS-----------------------SLIGRFSFWQGALMQLCLSGND 102 (383) T ss_pred HHHHHHHHHHHhhccCceeecc--cchhhhhhCCC-----------------------CCCCHHHHHHHHHHHhhhcCCe Confidence 3444555555555444544432 12223333222 3678899999999999999999 Q ss_pred EEEEeccCCCchhhcccceeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhh Q lcl|NC_019406. 161 GALVDVAPSSDPTAPAKSYTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGG 240 (661) Q Consensus 161 gvLVD~P~a~~~~~g~rPY~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~ 240 (661) +++++-- +. .+|.+.. +.+. T Consensus 103 ~~~i~~~----------~~-~~~p~~~-----------------~~v~-------------------------------- 122 (383) T protein:vir:10 103 YIPLVGQ----------NL-EHIPNSD-----------------VQIN-------------------------------- 122 (383) T ss_pred EEEEEcC----------ce-eEeecCc-----------------ceEE-------------------------------- Confidence 9998621 00 0010000 0000 Q ss_pred hhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccccccceeeccCCcccceeeEE Q lcl|NC_019406. 241 RRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQARDVYTPMVRGRTLPFIPFV 320 (661) Q Consensus 241 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~~~~~~p~~~g~~L~~IPfv 320 (661) ..... ...+|++.....+ .. ..+. .+ ..|=|- T Consensus 123 ------------------~~~~~-----~~~~~~~~~~~~~----~~---~~~~-------~~-----------evih~r 154 (383) T protein:vir:10 123 ------------------YLPGN-----MGIVYTVLESNDR----PK---MVLR-------QD-----------QMLHFR 154 (383) T ss_pred ------------------EEEcC-----CceEEEEEEcCCc----eE---EEEc-------cc-----------ceEEec Confidence 00000 0011111110000 00 0000 00 111111 Q ss_pred EEecCCCCCCccccchhHHHHHHHHHHhhhhhHHHHHH-HhcCceeEEe--c-CCCCCCc--------eeEeccc--cee Q lcl|NC_019406. 321 FFGSMSNAADCEKPPLLDIVELNLKHYRTYAELEHGRF-FTALPTYYAP--E-LDDSDAS--------EYHIGPG--RVW 386 (661) Q Consensus 321 ~~~~~~~~~~~~~pPLldLA~LNl~HYq~sSDl~~il~-~~~~P~l~i~--G-l~~~~~~--------~l~iGs~--~~~ 386 (661) .+.....+...+.||+.-+.. .+........+...++ ..+.|-.++. | +.+++.. ...-|.+ ..+ T Consensus 155 ~~~~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~ 233 (383) T protein:vir:10 155 LMPDPQYRYLIGRSPLESLQN-ALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLM 233 (383) T ss_pred cCCCCcccccccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCcc Confidence 111122233457788875444 3444444444444444 4567776664 2 3222210 1222322 345 Q ss_pred ecCCCCCcceEeecCchhHHHHHHHHHHHHHHHHHH-h--HHhccccc--CccchhHHHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_019406. 387 VVDKESGIPGIIEFKGEGLKTLERALNEKEQQIAAI-G--GRLMPGMS--KSVSESDNQSALREANEQSLLLNVIMALED 461 (661) Q Consensus 387 ~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~~l-G--Arll~~~~--~~~~eTataa~~d~~~~~S~L~~~A~~le~ 461 (661) .++ +|.++.-+..+..-...+.+.++...++++.+ | ..+|-... +....+.++....+ ...|.-++..+++ T Consensus 234 vl~-~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~---~~~l~P~~~~ie~ 309 (383) T protein:vir:10 234 VLP-DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY---LANLNSYVNPIVD 309 (383) T ss_pred ccC-CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHH---HHHHHHHHHHHHH Confidence 555 45555555544444333344445545555432 3 22332111 11111223333222 2357777777887 Q ss_pred HHHHHHHHHHHHcCCCCCCcceEEEEeccccccccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHH Q lcl|NC_019406. 462 GMTSVVRYWLMFRDIPLTDTATLRYEIDATFLTTALDA-RALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTI 540 (661) Q Consensus 462 Al~~aL~~~A~w~G~~~~~~~~~~v~ln~DF~~~~lda-~~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~ 540 (661) +++..| ++ ..+.|.++. ... .|. ....++-.+++.|.++..+.++.+-.-++.+.+... T Consensus 310 ~l~~~l------~~------~~~~f~~~~--l~~-~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~----- 369 (383) T protein:vir:10 310 ELRLKM------NA------PDLELDIKD--MLD-VDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPE----- 369 (383) T ss_pred HHHHhh------CC------ceEEeechh--hhc-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccc----- Confidence 777544 22 234444432 222 233 356778899999999999998877554443333210 Q ss_pred HHhccCCCCCCchhhhhhcCCccc Q lcl|NC_019406. 541 KMNDPKSFIGQPDAIAMRRGYVSR 564 (661) Q Consensus 541 ~l~~~~~~l~~ddae~~~~g~~~~ 564 (661) .....+ .+. |..++ T Consensus 370 -~~~~~~--------~~~-gGd~e 383 (383) T protein:vir:10 370 -FKPLTN--------ETK-GGDDK 383 (383) T ss_pred -cCCCcc--------cCC-CCCCC Confidence 000000 011 11111 No 168 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=23.87 E-value=2.2 Score=18.65 Aligned_cols=446 Identities=14% Similarity=0.108 Sum_probs=164.7 Q ss_pred ccccccccccccCCccccCHHHHHHHHHHHHHHHHhcchHHHHhCCcccCCCCCCCChH---HHHHHHhhh--ccc--c- Q lcl|NC_019406. 9 ANIRRTKRGAQQFTHLVVHPEYEYYRPDWAKIRDAIAGEREIKAQGVKYLKAPKGFDDE---DYANYLDRA--AFY--N- 80 (661) Q Consensus 9 ~~~~~~~~~~~~~~V~~~hPey~a~~~~W~~irD~~~G~~~vr~~g~~YLPk~~~E~~~---~Y~~rl~rA--~~~--n- 80 (661) .|+.++.. ++- .|...+ ++.-+.|..........|.|..-.-+.+ .......|| .+- + T Consensus 1 ~~~~~~~~------~~~-~~~~~~-------~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 66 (530) T protein:vir:38 1 MKIPSLVG------PDG-KTSLRE-------YAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGY 66 (530) T ss_pred Cccceeec------Ccc-ccchHH-------HhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChH Confidence 33332211 110 011111 1111222211112222344433211111 222222222 221 2 Q ss_pred ---hHHHHHHHHhchhhccCccccccchhhHhhhhcccccccccchhhh-hhhHhhhhh----ccCCCC-CHHHHHHHHH Q lcl|NC_019406. 81 ---MTSQTQAGMVGQIFRRPPVIRNLPNTGAITGRDAEGGVQVVAPASI-GKLLTQLQR----FAKDGT-SHQGFAKTVA 151 (661) Q Consensus 81 ---~~~~tv~~l~G~vFrk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~-~~~~~~~~~----~dl~G~-sL~~fa~~~~ 151 (661) ++...++..||-=|+--++ |+. ..| ..||......-+.+ ..|..++++ ||.+|. +++++-+.++ T Consensus 67 a~~av~~~~~nvVG~Gi~~~~~----p~~-~~l--~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~ 139 (530) T protein:vir:38 67 AANAVQLHQDHIVGSFFRLSYR----PSW-RYL--GINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGV 139 (530) T ss_pred HHHHHHHHHHHhhCCCceeeec----cch-hhc--CCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHH Confidence 3444445555554432222 110 000 11111001111111 245566664 677776 9999999999 Q ss_pred HHHHhhCCEEEEEeccCCCchhhcccce---eEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeee Q lcl|NC_019406. 152 LEQVAMGRFGALVDVAPSSDPTAPAKSY---TVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGR 228 (661) Q Consensus 152 ~~~L~~Gr~gvLVD~P~a~~~~~g~rPY---~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~ 228 (661) +..+..|=|++..-+-+.++ +|| +-+|.|+.|-+..-...++ .|+ T Consensus 140 r~~~~dGE~~~~~~~~~~~g-----~~~~~~lq~ie~d~l~~~~~~~~~~-------~i~-------------------- 187 (530) T protein:vir:38 140 AMHAFNGELCVQATWDSDST-----RLFRTQFKMVSPKRVSNPNNIGDTR-------NCR-------------------- 187 (530) T ss_pred HHHhhCCceEEEeeeccCCC-----CccceEEEEechhhcCCCCCCCCCC-------eeE-------------------- Confidence 99999999998877654321 233 3555565543332111000 000 Q ss_pred echhhhhcchhhhhcchhhhhhhhhhhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCccccc--cccee Q lcl|NC_019406. 229 EGSETAQRTSGGRRAGLAERQGSARADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQA--RDVYT 306 (661) Q Consensus 229 ~~~e~vi~w~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~~--~~~~~ 306 (661) .|+ + .+.+|..+ .++++.....+.. ....+ T Consensus 188 --------------~GI------------e---------------------~d~~Gr~~-aY~i~~~~~~~~~~~~~~~~ 219 (530) T protein:vir:38 188 --------------AGV------------K---------------------INDSGAAL-GYYVSDDGYPGWMAQNWTYI 219 (530) T ss_pred --------------eee------------E---------------------ECCCCceE-EEEEeeccCCCcccccccee Confidence 000 0 01111111 1111111100000 00011 Q ss_pred eccCCcccceee---EEE-EecCCCCCCccccchhHHHHHHHHHHhhh--hhHHHHHHHhcCceeEEec-CC-------- Q lcl|NC_019406. 307 PMVRGRTLPFIP---FVF-FGSMSNAADCEKPPLLDIVELNLKHYRTY--AELEHGRFFTALPTYYAPE-LD-------- 371 (661) Q Consensus 307 p~~~g~~L~~IP---fv~-~~~~~~~~~~~~pPLldLA~LNl~HYq~s--SDl~~il~~~~~P~l~i~G-l~-------- 371 (661) |.. ..|| ++- +...+.+-.-+.|.|... -..++++..- |.+..+. +.+.-..||+. .. T Consensus 220 ~~~-----~~v~a~~vlH~f~~~r~gQ~RGis~lapv-l~~l~~l~~y~dael~~a~-i~A~~a~fi~~~~~~~~~~~~~ 292 (530) T protein:vir:38 220 PRE-----LPGGRPSFIHVFEPMEDGQTRGANAFYSV-MEQMKMLDTLQNTQLQSAI-VKAMYAATIESELDTQSAMDFI 292 (530) T ss_pred eee-----eccChhHeEeeccccCCCcccCCchHHHH-HHHHHHHhHHHHHHHHHHH-HhhhheeeeeccCCcccccccc Confidence 110 0011 111 112233333455555432 2223333322 2232333 33333444431 10 Q ss_pred ----CC-----------------CCceeEecccceeecCCCCCcceEeecCchhHHHHHHHHHHHHHHHH-HHh--HHhc Q lcl|NC_019406. 372 ----DS-----------------DASEYHIGPGRVWVVDKESGIPGIIEFKGEGLKTLERALNEKEQQIA-AIG--GRLM 427 (661) Q Consensus 372 ----~~-----------------~~~~l~iGs~~~~~lp~~ga~~~ylE~~g~~i~a~~~~L~~le~qM~-~lG--Arll 427 (661) .. ....+.++++.+..++ +|.+++|+.++.. .......++.+...+. .+| ..+| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~-pGe~i~~~~p~~p-~~~~~~f~~~~lr~iaaglGi~ye~l 370 (530) T protein:vir:38 293 LGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLL-PGDSLNLQSAQDT-DNGYSTFEQSLLRYIAAGLGVSYEQL 370 (530) T ss_pred ccCCcccccccccccchhhhhcccccceeccCceeeecC-CCCeeeeeCCCCC-CCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 00 0123578999988887 4889999997632 2223333333333322 122 1223 Q ss_pred ccccCccc-hhHHHHHHHHHHhhHHHHHHHHHHHHHH-HHHHHHH---HHHcCCCC-CCcceEEEE------eccccccc Q lcl|NC_019406. 428 PGMSKSVS-ESDNQSALREANEQSLLLNVIMALEDGM-TSVVRYW---LMFRDIPL-TDTATLRYE------IDATFLTT 495 (661) Q Consensus 428 ~~~~~~~~-eTataa~~d~~~~~S~L~~~A~~le~Al-~~aL~~~---A~w~G~~~-~~~~~~~v~------ln~DF~~~ 495 (661) ..--..++ .|+-+.-+++......++. .+...+ ..+++.| |-..|.-. +....+.|. ++-.|... T Consensus 371 t~D~s~~nYSS~R~~~~e~~r~~~~~q~---~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p 447 (530) T protein:vir:38 371 SRNYSQMSYSTARASANESWAYFMGRRK---FVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGS 447 (530) T ss_pred hcccccccHHHHHHHHHHHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecC Confidence 22101111 2344555555555544443 222221 2222211 22223111 111110000 01122222 Q ss_pred ---cCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCCCCccCCHHHHHHHHhccCC---CCCCch----hhhhhcCCccc Q lcl|NC_019406. 496 ---ALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGIIPSTQTLEEFTIKMNDPKS---FIGQPD----AIAMRRGYVSR 564 (661) Q Consensus 496 ---~lda~-~l~all~~~~aG~Is~et~~~eL~r~gvl~~~~~~Eee~~~l~~~~~---~l~~dd----ae~~~~g~~~~ 564 (661) -+||- ++++...++.+|..|++.... ++|. +|+++.+.|+.+.. .+|++. ......+ ... T Consensus 448 ~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a---~~G~-----D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~-~~~ 518 (530) T protein:vir:38 448 GRMAIDGLKEVQEAVMLIEAGLSTYEKECA---KRGD-----DYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAG-VKK 518 (530) T ss_pred CccccChHHHHHHHHHHHHcCCCCHHHHHH---HcCC-----CHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCC-CCC Confidence 23665 899999999999999988754 4464 55555554443311 122211 1111001 011 Q ss_pred cCCCcchhhhhc Q lcl|NC_019406. 565 QQELDQQRAARD 576 (661) Q Consensus 565 ~~~~~q~~~~~e 576 (661) .++-+++.+... T Consensus 519 ~~~~~~d~~~~a 530 (530) T protein:vir:38 519 SNEEEQDGARAA 530 (530) T ss_pred CCCCCCCCCCCC Confidence 111111111111 No 169 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=23.80 E-value=2.3 Score=18.64 Aligned_cols=399 Identities=11% Similarity=0.039 Sum_probs=149.2 Q ss_pred CHHHHHHHHHHHHHHHHhcchHHHHhCCc------cc----CCCCCCCChH--HHHHHHhhhcccchHHHHHHHHhchhh Q lcl|NC_019406. 27 HPEYEYYRPDWAKIRDAIAGEREIKAQGV------KY----LKAPKGFDDE--DYANYLDRAAFYNMTSQTQAGMVGQIF 94 (661) Q Consensus 27 hPey~a~~~~W~~irD~~~G~~~vr~~g~------~Y----LPk~~~E~~~--~Y~~rl~rA~~~n~~~~tv~~l~G~vF 94 (661) .|+-..| .-|.++++.+.....+...+. .+ +.-++..+.. .....|+.+..+-....+-+.+..+.| T Consensus 1 ~~~~~~m-g~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~ 79 (432) T protein:vir:81 1 MPDEKKL-GLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPL 79 (432) T ss_pred CCchhhc-chhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCce Confidence 2222222 334444443333222211100 00 0000000000 123344444444444444444444443 Q ss_pred ccCccccccchhhHhhhhcccccccccchhhhhhhHhhhhhccCCCCCHHHHHHHHHHHHHhhCCEEEEEeccCCCchhh Q lcl|NC_019406. 95 RRPPVIRNLPNTGAITGRDAEGGVQVVAPASIGKLLTQLQRFAKDGTSHQGFAKTVALEQVAMGRFGALVDVAPSSDPTA 174 (661) Q Consensus 95 rk~p~i~~~p~~l~~l~~d~dG~~~~~~~~~~~~~~~~~~~~dl~G~sL~~fa~~~~~~~L~~Gr~gvLVD~P~a~~~~~ 174 (661) . + ..++.||+... .-+.+-.+...-.| ...+-.+|.+.+....+.+|-+++++.... + T Consensus 80 ~----~---------y~~~~~g~~~~-~~~~l~~lL~~~PN---~~~t~~~f~~~l~~~lll~Gnayv~i~~~~--g--- 137 (432) T protein:vir:81 80 T----M---------YMRTPDGRKEA-VNHPLYTLLLDGPN---STQTAFDFWQVVVTRLLLDGTAYVRKVVTD--G--- 137 (432) T ss_pred e----e---------EEecCCcceec-ccchHHHHHHhccc---ccCCHHHHHHHHHHHHhhcCCeEEEEEecC--C--- Confidence 2 2 01122332110 00111111111122 256778999999999999999999886431 1 Q ss_pred cccc-eeEeechhhhccceeeccccccceeeeeeeeeeeeccccccccccceeeeechhhhhcchhhhhcchhhhhhhhh Q lcl|NC_019406. 175 PAKS-YTVGYAAENIVDWTVEDVDGFYVPTRILLREFERVDEHATPSQQNPWIGREGSETAQRTSGGRRAGLAERQGSAR 253 (661) Q Consensus 175 g~rP-Y~~~~~p~~IinW~~~~~~g~~~Lt~v~ire~~~~~~~~~~~~~~~~i~~~~~e~vi~w~~~~~~g~~~~~~~~~ 253 (661) +| -+..+.|..|.=+ T Consensus 138 --~~~~L~~l~~~~v~v~-------------------------------------------------------------- 153 (432) T protein:vir:81 138 --RIESLQYLANDRLTIT-------------------------------------------------------------- 153 (432) T ss_pred --cEEEEEEEcCCceEEE-------------------------------------------------------------- Confidence 11 1122222211000 Q ss_pred hhheecccccCCCceeeEEEEEEEeecccccceEEEEEEEecCcccc-cccceeeccCCcccceeeEEEEecCCCCCCcc Q lcl|NC_019406. 254 ADALARPSRFTSSYTFRTIYRELILELQKDGSRVYKQFVYVEDPLGQ-ARDVYTPMVRGRTLPFIPFVFFGSMSNAADCE 332 (661) Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~rv~~l~~g~~g~~~~~~~~~~~~~~~~-~~~~~~p~~~g~~L~~IPfv~~~~~~~~~~~~ 332 (661) .+.+|...|++.... +.... ..+++ |-|- + .+.+...+ T Consensus 154 --------------------------~~~~g~~~y~~~~~~-g~~~~~~~~~i-----------ih~r--~-~~~dg~~G 192 (432) T protein:vir:81 154 --------------------------TDPKGNTAYRYRRTD-GQMIDIPKQQI-----------WKIM--G-YSLDGENG 192 (432) T ss_pred --------------------------ECCCCcEEEEEEecC-ceEEEEccccE-----------EEec--C-CCCCCccc Confidence 001111122211100 00000 00011 1111 1 11122356 Q ss_pred ccchhHHHHHHHHHHhhhhhHHHHHHH-hcCceeEEe--c-CCCCCCc----eeE--ecccceeecCCCCCcceEeecCc Q lcl|NC_019406. 333 KPPLLDIVELNLKHYRTYAELEHGRFF-TALPTYYAP--E-LDDSDAS----EYH--IGPGRVWVVDKESGIPGIIEFKG 402 (661) Q Consensus 333 ~pPLldLA~LNl~HYq~sSDl~~il~~-~~~P~l~i~--G-l~~~~~~----~l~--iGs~~~~~lp~~ga~~~ylE~~g 402 (661) .+||.-++ --|.......++...++. .+.|-.++. + ++++..+ .+. ..++.++.++ .|.++.-+..+. T Consensus 193 ~spi~~~~-~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~-~g~~~~~l~~~~ 270 (432) T protein:vir:81 193 LSAIRYGA-QIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLE-GGMDVKSLGLNP 270 (432) T ss_pred ccHHHHHH-HHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhcCCCceecC-CCceEEEccCCH Confidence 78876544 346665555555555544 356655554 2 1111100 111 2334466666 355555444443 Q ss_pred hhHHHHHHHHHHHHHHHHHH-h--HHhcccccCccchhHHHHHHHHHH---hhHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019406. 403 EGLKTLERALNEKEQQIAAI-G--GRLMPGMSKSVSESDNQSALREAN---EQSLLLNVIMALEDGMTSVVRYWLMFRDI 476 (661) Q Consensus 403 ~~i~a~~~~L~~le~qM~~l-G--Arll~~~~~~~~eTataa~~d~~~---~~S~L~~~A~~le~Al~~aL~~~A~w~G~ 476 (661) ...+. .+..+-..++++++ | ..+|-...+ +.+++.+.+++.. -...|.-++..+|++++..|---.. T Consensus 271 ~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~---- 343 (432) T protein:vir:81 271 VDAQL-LQSRQYSVESICRFFGVPPSMIGHSSA--GTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAE---- 343 (432) T ss_pred HHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCC--ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc---- Confidence 33222 22222333333321 2 223311111 1122223333222 2357777888888888864411111 Q ss_pred CCCCcceEEEEeccccccccCCHH-HHHHHHHHHhcCCCCHHHHHHHHHhcCC--CCccCCHHHHHHHHhccCCCCCCch Q lcl|NC_019406. 477 PLTDTATLRYEIDATFLTTALDAR-ALRAIQQLYEGGLLPIDALYENFVKNGI--IPSTQTLEEFTIKMNDPKSFIGQPD 553 (661) Q Consensus 477 ~~~~~~~~~v~ln~DF~~~~lda~-~l~all~~~~aG~Is~et~~~eL~r~gv--l~~~~~~Eee~~~l~~~~~~l~~dd 553 (661) ...+.|++|.+=... .|.. -..++-.++++|.++.-+.++.+ |+ ++++. +.+.-+....++++ T Consensus 344 ----~~~~~~~fd~~~llr-~d~~~r~~~~~~~~~~G~~t~NE~R~~~---glpp~~g~~------~~~~~~~~~~pl~~ 409 (432) T protein:vir:81 344 ----RRRYFADFDTSALLR-ADSAARSSYYSQLVNNGLMTRDEAREIE---GLPKLGGNA------AVLTVQSAMVPLDS 409 (432) T ss_pred ----cCceEEEeechhhhc-cCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCCCCc------ceEeecCcccchhh Confidence 112445554322222 2444 45667778889999988887654 33 22111 11111111111111 Q ss_pred hhhhhcCCccccCCCcchhhhhcCChhhHHHHHHH Q lcl|NC_019406. 554 AIAMRRGYVSRQQELDQQRAARDADFQQQELEQAE 588 (661) Q Consensus 554 ae~~~~g~~~~~~~~~q~~~~~e~d~~q~~~~~~e 588 (661) .. ++..++. .+++-.+|+.+.++ T Consensus 410 ~~---------~~~~~~~---~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 410 IG---------LQASPEP---ASGLGNQQQDKVSK 432 (432) T ss_pred hc---------cCCCCCC---CCCCCCcccccccC Confidence 10 0000111 11111122222222 Done!