Query lcl|NC_019445.1_cdsid_YP_007001977.1 [gene=F365_gp26] [protein=hypothetical protein] [protein_id=YP_007001977.1] [location=15145..16824]
Match_columns 559
No_of_seqs 134 out of 174
Neff 7.8
Searched_HMMs 1612
Date Thu Nov 7 16:06:16 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_26 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_26_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:95315 Length: 559 100.0 6E-200 4E-203 1112.8 65.0 559 1-559 1-559 (559)
2 protein:vir:7321 Length: 556 # 100.0 9E-197 6E-200 1095.4 65.1 556 1-556 1-556 (556)
3 protein:vir:98506 Length: 555 100.0 4E-192 2E-195 1070.2 64.2 554 1-555 1-555 (555)
4 protein:vir:107822 Length: 555 100.0 4E-192 2E-195 1070.2 64.2 554 1-555 1-555 (555)
5 protein:vir:107404 Length: 555 100.0 4E-192 2E-195 1070.2 64.2 554 1-555 1-555 (555)
6 protein:vir:103765 Length: 549 100.0 3E-182 2E-185 1015.9 61.5 538 1-545 1-549 (549)
7 protein:vir:102668 Length: 547 100.0 1E-175 8E-179 979.5 60.9 531 4-543 1-547 (547)
8 protein:vir:78696 Length: 542 100.0 7E-159 5E-162 887.5 55.3 524 5-559 1-535 (542)
9 protein:vir:1785 Length: 555 # 100.0 1E-155 7E-159 869.9 54.2 534 5-559 1-554 (555)
10 protein:vir:1538 Length: 535 # 100.0 2E-154 1E-157 862.8 57.2 519 1-556 1-535 (535)
11 protein:vir:94572 Length: 535 100.0 6E-154 4E-157 860.7 57.7 517 1-556 1-535 (535)
12 protein:vir:3361 Length: 535 # 100.0 6E-154 4E-157 860.6 56.8 520 1-556 1-535 (535)
13 protein:vir:2198 Length: 536 # 100.0 9E-154 6E-157 859.6 55.7 516 1-557 1-536 (536)
14 protein:vir:8883 Length: 543 # 100.0 2E-153 1E-156 857.9 56.7 523 1-559 1-540 (543)
15 protein:vir:10447 Length: 536 100.0 7E-153 4E-156 854.9 55.6 516 1-557 1-536 (536)
16 protein:vir:100039 Length: 522 100.0 7E-152 4E-155 849.4 54.7 512 1-557 1-522 (522)
17 protein:vir:99672 Length: 532 100.0 3E-151 2E-154 845.7 53.9 509 1-557 1-532 (532)
18 protein:vir:78942 Length: 510 100.0 2E-150 1E-153 841.7 54.6 497 5-540 1-510 (510)
19 protein:vir:6322 Length: 510 # 100.0 7E-150 4E-153 838.5 52.6 501 5-540 1-510 (510)
20 protein:vir:80211 Length: 514 100.0 5E-149 3E-152 833.8 53.7 498 1-535 1-514 (514)
21 protein:vir:7017 Length: 515 # 100.0 1E-148 7E-152 831.8 54.3 498 1-537 1-515 (515)
22 protein:vir:94709 Length: 522 100.0 5E-148 3E-151 828.0 56.2 504 1-550 1-522 (522)
23 protein:vir:103330 Length: 517 100.0 3E-148 2E-151 829.0 55.2 503 1-541 1-517 (517)
24 protein:vir:96988 Length: 516 100.0 2E-148 1E-151 830.7 50.9 497 1-536 1-516 (516)
25 protein:vir:105641 Length: 516 100.0 2E-147 1E-150 824.9 52.2 497 1-536 5-516 (516)
26 protein:vir:94599 Length: 641 100.0 2.2E-95 1.4E-98 539.5 46.4 536 1-559 20-640 (641)
27 protein:vir:80165 Length: 651 100.0 9.1E-72 5.7E-75 410.0 47.7 535 1-559 15-651 (651)
28 protein:vir:95315 Length: 559 100.0 1.8E-65 1.1E-68 375.5 64.8 485 4-559 1-556 (559)
29 protein:vir:95449 Length: 584 100.0 4.6E-49 2.9E-52 285.6 39.7 504 1-531 11-584 (584)
30 protein:vir:3139 Length: 599 # 100.0 4.6E-47 2.9E-50 274.6 42.5 513 1-559 15-597 (599)
31 protein:vir:8846 Length: 705 # 100.0 5.8E-37 3.6E-40 219.2 46.3 524 1-559 1-648 (705)
32 protein:vir:95821 Length: 763 100.0 4.2E-28 2.6E-31 170.7 48.6 522 1-559 24-709 (763)
33 protein:vir:93630 Length: 776 99.9 2.1E-19 1.3E-22 123.0 40.4 530 1-559 38-712 (776)
34 protein:vir:108295 Length: 711 99.8 6.3E-17 3.9E-20 109.4 42.8 542 1-559 1-710 (711)
35 protein:vir:172 Length: 708 # 99.7 1.1E-16 7.1E-20 108.0 33.5 536 1-559 1-692 (708)
36 protein:vir:817 Length: 714 # 99.7 1.1E-14 6.6E-18 97.2 42.6 535 1-559 8-692 (714)
37 protein:vir:9950 Length: 714 # 99.7 1.1E-14 6.6E-18 97.2 42.6 535 1-559 8-692 (714)
38 protein:vir:2764 Length: 714 # 99.7 1.1E-14 6.6E-18 97.2 42.6 535 1-559 8-692 (714)
39 protein:vir:10117 Length: 714 99.7 1.1E-14 6.6E-18 97.2 42.6 535 1-559 8-692 (714)
40 protein:vir:3296 Length: 714 # 99.7 1.1E-14 6.6E-18 97.2 42.6 535 1-559 8-692 (714)
41 protein:vir:105619 Length: 772 99.7 1.2E-14 7.4E-18 96.9 43.6 521 1-559 18-702 (772)
42 protein:vir:105429 Length: 708 99.7 1.1E-15 6.9E-19 102.6 35.8 542 1-559 1-692 (708)
43 protein:vir:105520 Length: 706 99.7 9.6E-16 6E-19 102.9 34.5 531 1-559 1-695 (706)
44 protein:vir:77597 Length: 725 99.7 4.5E-15 2.8E-18 99.2 36.8 536 1-559 1-680 (725)
45 protein:vir:9263 Length: 725 # 99.7 4.6E-15 2.9E-18 99.2 36.6 535 1-559 1-680 (725)
46 protein:vir:100920 Length: 725 99.7 8.2E-15 5.1E-18 97.8 36.2 539 1-559 1-661 (725)
47 protein:vir:104437 Length: 714 99.6 7.2E-14 4.5E-17 92.6 38.3 534 1-559 1-692 (714)
48 protein:vir:3520 Length: 720 # 99.5 1.8E-13 1.1E-16 90.4 31.0 534 1-559 1-672 (720)
49 protein:vir:80680 Length: 441 99.5 2.2E-12 1.4E-15 84.5 41.0 430 1-543 1-441 (441)
50 protein:vir:99916 Length: 504 99.5 1.1E-11 6.7E-15 80.7 37.3 460 1-559 18-504 (504)
51 protein:vir:9751 Length: 422 # 99.4 1.3E-11 8.3E-15 80.2 34.2 413 1-513 1-422 (422)
52 protein:vir:2341 Length: 488 # 99.4 5.8E-11 3.6E-14 76.7 34.3 457 1-558 1-488 (488)
53 protein:vir:94742 Length: 409 99.4 6.5E-11 4E-14 76.4 35.8 402 1-496 1-409 (409)
54 protein:vir:4223 Length: 486 # 99.3 7.7E-11 4.8E-14 76.0 35.4 447 1-559 8-486 (486)
55 protein:vir:1634 Length: 409 # 99.3 9.8E-11 6.1E-14 75.5 34.8 402 1-496 1-409 (409)
56 protein:vir:9568 Length: 410 # 99.3 1.1E-10 6.7E-14 75.2 33.4 402 20-517 1-410 (410)
57 protein:vir:80959 Length: 499 99.3 2.7E-10 1.7E-13 73.1 33.5 463 1-559 1-499 (499)
58 protein:vir:38 Length: 496 # N 99.2 3.9E-10 2.4E-13 72.2 34.9 460 1-559 1-496 (496)
59 protein:vir:2427 Length: 485 # 99.2 5.2E-10 3.3E-13 71.5 39.3 447 1-558 13-485 (485)
60 protein:vir:104082 Length: 485 99.2 5.7E-10 3.6E-13 71.2 37.7 450 1-558 8-485 (485)
61 protein:vir:9871 Length: 429 # 99.2 8.9E-10 5.5E-13 70.2 39.1 418 1-541 1-429 (429)
62 protein:vir:3609 Length: 452 # 99.2 9.7E-10 6E-13 70.0 40.0 428 1-542 17-452 (452)
63 protein:vir:94805 Length: 492 99.2 1E-09 6.4E-13 69.9 35.3 430 1-559 37-490 (492)
64 protein:vir:3964 Length: 453 # 99.2 1E-09 6.5E-13 69.8 39.2 431 1-542 11-453 (453)
65 protein:vir:93747 Length: 472 99.2 1.2E-09 7.3E-13 69.5 34.9 432 1-559 18-472 (472)
66 protein:vir:78227 Length: 480 99.1 1.2E-09 7.6E-13 69.4 34.5 451 1-559 1-478 (480)
67 protein:vir:99522 Length: 470 99.1 1.4E-09 8.5E-13 69.2 39.1 433 1-559 19-470 (470)
68 protein:vir:96494 Length: 501 99.1 1.4E-09 8.8E-13 69.1 40.2 451 1-545 37-501 (501)
69 protein:vir:7768 Length: 484 # 99.1 1.6E-09 9.7E-13 68.9 35.3 449 1-556 1-484 (484)
70 protein:vir:79043 Length: 479 99.1 1.6E-09 9.9E-13 68.8 39.4 440 1-542 18-479 (479)
71 protein:vir:94498 Length: 474 99.1 2.2E-09 1.4E-12 68.0 35.6 438 1-545 24-474 (474)
72 protein:vir:97447 Length: 474 99.1 2.2E-09 1.4E-12 68.0 35.6 438 1-545 24-474 (474)
73 protein:vir:5961 Length: 503 # 99.1 2.4E-09 1.5E-12 67.8 35.7 463 1-559 1-503 (503)
74 protein:vir:78537 Length: 480 99.1 2.6E-09 1.6E-12 67.7 32.3 447 1-559 1-475 (480)
75 protein:vir:7430 Length: 563 # 99.1 1.2E-09 7.5E-13 69.5 26.1 498 1-559 1-558 (563)
76 protein:vir:733 Length: 453 # 99.1 3E-09 1.8E-12 67.3 40.3 426 1-533 11-453 (453)
77 protein:vir:97336 Length: 492 99.1 3.2E-09 2E-12 67.2 35.8 432 1-559 38-492 (492)
78 protein:vir:1236 Length: 483 # 99.1 3.2E-09 2E-12 67.2 35.3 432 1-542 29-483 (483)
79 protein:vir:1587 Length: 508 # 99.1 3.6E-09 2.2E-12 66.9 32.3 470 1-557 3-508 (508)
80 protein:vir:8184 Length: 474 # 99.0 3.8E-09 2.4E-12 66.7 34.9 437 1-559 12-474 (474)
81 protein:vir:2732 Length: 501 # 99.0 4.3E-09 2.7E-12 66.5 40.2 450 1-546 37-501 (501)
82 protein:vir:4898 Length: 502 # 99.0 4.7E-09 2.9E-12 66.2 39.5 447 1-559 37-498 (502)
83 protein:vir:105461 Length: 470 99.0 4.9E-09 3E-12 66.1 38.6 446 1-559 1-470 (470)
84 protein:vir:98444 Length: 434 99.0 5.1E-09 3.1E-12 66.1 30.3 413 34-554 1-434 (434)
85 protein:vir:99781 Length: 511 99.0 5.1E-09 3.2E-12 66.0 37.4 446 1-559 31-509 (511)
86 protein:vir:95113 Length: 474 99.0 5.3E-09 3.3E-12 66.0 34.6 436 1-542 24-474 (474)
87 protein:vir:106639 Length: 481 99.0 5.8E-09 3.6E-12 65.7 40.5 435 1-543 30-481 (481)
88 protein:vir:105292 Length: 478 99.0 7.3E-09 4.5E-12 65.2 35.7 438 1-559 1-478 (478)
89 protein:vir:99072 Length: 479 99.0 7.8E-09 4.8E-12 65.0 37.3 449 1-557 1-479 (479)
90 protein:vir:102950 Length: 471 99.0 1E-08 6.5E-12 64.3 36.7 441 1-545 1-471 (471)
91 protein:vir:2500 Length: 501 # 98.9 1.3E-08 8.1E-12 63.8 37.5 466 1-558 23-501 (501)
92 protein:vir:345 Length: 663 # 98.9 1.9E-08 1.2E-11 62.9 32.0 508 1-559 1-653 (663)
93 protein:vir:95899 Length: 474 98.9 2.1E-08 1.3E-11 62.7 36.8 438 1-559 24-474 (474)
94 protein:vir:96266 Length: 474 98.9 2.1E-08 1.3E-11 62.7 36.8 438 1-559 24-474 (474)
95 protein:vir:105889 Length: 474 98.9 2.6E-08 1.6E-11 62.1 38.0 443 1-543 10-474 (474)
96 protein:vir:94101 Length: 474 98.9 2.6E-08 1.6E-11 62.1 38.0 443 1-543 10-474 (474)
97 protein:vir:97171 Length: 512 98.9 2.7E-08 1.7E-11 62.0 39.0 450 1-547 31-512 (512)
98 protein:vir:96240 Length: 511 98.8 3.1E-08 1.9E-11 61.8 38.6 451 1-559 31-511 (511)
99 protein:vir:4782 Length: 522 # 98.8 3.2E-08 2E-11 61.6 35.6 478 1-559 3-522 (522)
100 protein:vir:96179 Length: 468 98.8 3.4E-08 2.1E-11 61.5 37.4 428 1-538 23-468 (468)
101 protein:vir:9306 Length: 511 # 98.8 4.1E-08 2.6E-11 61.1 39.4 448 1-559 31-509 (511)
102 protein:vir:107112 Length: 478 98.8 4.9E-08 3E-11 60.7 37.1 438 1-559 23-478 (478)
103 protein:vir:101494 Length: 527 98.8 5.6E-08 3.5E-11 60.3 27.0 465 1-559 1-526 (527)
104 protein:vir:98883 Length: 517 98.8 5.7E-08 3.5E-11 60.3 30.6 475 1-559 3-517 (517)
105 protein:vir:102239 Length: 527 98.8 6.3E-08 3.9E-11 60.1 27.0 465 1-559 1-526 (527)
106 protein:vir:78805 Length: 511 98.7 7.4E-08 4.6E-11 59.7 38.6 452 1-559 31-511 (511)
107 protein:vir:96366 Length: 511 98.7 7.4E-08 4.6E-11 59.7 38.6 452 1-559 31-511 (511)
108 protein:vir:78907 Length: 518 98.7 8.5E-08 5.3E-11 59.3 29.7 479 1-555 1-518 (518)
109 protein:vir:103951 Length: 511 98.7 1.1E-07 6.7E-11 58.8 38.8 451 1-559 31-509 (511)
110 protein:vir:106571 Length: 499 98.7 1.4E-07 9E-11 58.1 37.5 456 1-559 13-499 (499)
111 protein:vir:9815 Length: 500 # 98.6 1.6E-07 9.7E-11 57.9 33.4 459 1-550 3-500 (500)
112 protein:vir:3028 Length: 500 # 98.6 1.6E-07 9.7E-11 57.9 33.4 459 1-550 3-500 (500)
113 protein:vir:79703 Length: 505 98.6 2.3E-07 1.4E-10 57.0 37.7 460 1-532 3-505 (505)
114 protein:vir:95806 Length: 440 98.5 3.5E-07 2.2E-10 56.0 38.8 420 12-542 1-440 (440)
115 protein:vir:7987 Length: 456 # 98.4 7.3E-07 4.6E-10 54.2 37.1 436 1-543 1-456 (456)
116 protein:vir:96839 Length: 474 98.3 2E-06 1.2E-09 51.9 35.7 432 1-540 1-474 (474)
117 protein:vir:9922 Length: 489 # 98.2 2.2E-06 1.4E-09 51.6 37.4 440 1-542 13-489 (489)
118 protein:vir:94546 Length: 506 98.2 2.2E-06 1.4E-09 51.6 36.5 445 1-544 16-506 (506)
119 protein:vir:80453 Length: 535 98.1 4.8E-06 3E-09 49.7 30.9 458 1-559 32-534 (535)
120 protein:vir:105819 Length: 456 98.0 6.3E-06 3.9E-09 49.1 37.9 434 1-543 1-456 (456)
121 protein:vir:102602 Length: 456 98.0 6.3E-06 3.9E-09 49.1 37.9 434 1-543 1-456 (456)
122 protein:vir:102330 Length: 451 97.8 1.6E-05 9.8E-09 46.9 38.4 426 4-535 1-451 (451)
123 protein:vir:95149 Length: 501 97.8 1.7E-05 1E-08 46.8 30.6 448 1-542 1-501 (501)
124 protein:vir:97265 Length: 513 97.5 5.7E-05 3.5E-08 43.9 26.9 459 1-559 1-513 (513)
125 protein:vir:78083 Length: 537 97.4 7E-05 4.3E-08 43.4 40.6 475 1-559 8-513 (537)
126 protein:vir:94956 Length: 452 97.3 0.0001 6.5E-08 42.4 30.6 435 1-543 1-452 (452)
127 protein:vir:96783 Length: 488 96.1 0.001 6.4E-07 36.9 29.0 415 1-494 14-488 (488)
128 protein:vir:80040 Length: 461 96.0 0.0011 6.7E-07 36.8 21.7 427 1-529 1-461 (461)
129 protein:vir:96738 Length: 505 95.9 0.0012 7.7E-07 36.5 21.8 447 1-559 10-503 (505)
130 protein:vir:78393 Length: 489 95.7 0.0016 1E-06 35.9 27.4 437 1-545 1-489 (489)
131 protein:vir:96068 Length: 765 95.2 0.00089 5.5E-07 37.3 10.2 461 1-559 37-559 (765)
132 protein:vir:101647 Length: 460 95.1 0.0029 1.8E-06 34.5 22.5 401 1-541 1-460 (460)
133 protein:vir:5249 Length: 437 # 94.8 0.0035 2.2E-06 34.0 18.9 417 1-559 1-437 (437)
134 protein:vir:4952 Length: 386 # 94.5 0.0042 2.6E-06 33.6 25.2 355 10-478 1-386 (386)
135 protein:vir:81072 Length: 432 94.5 0.0043 2.7E-06 33.6 21.2 369 1-484 1-432 (432)
136 protein:vir:95014 Length: 491 94.4 0.0045 2.8E-06 33.5 29.9 441 1-542 1-491 (491)
137 protein:vir:99452 Length: 651 93.4 0.0047 2.9E-06 33.3 10.3 493 1-559 1-573 (651)
138 protein:vir:7407 Length: 392 # 93.2 0.0085 5.3E-06 31.9 25.9 319 33-475 1-392 (392)
139 protein:vir:6382 Length: 553 # 92.9 0.0097 6E-06 31.6 20.4 451 1-559 1-552 (553)
140 protein:vir:4698 Length: 251 # 92.5 0.011 6.8E-06 31.3 12.5 238 1-351 1-251 (251)
141 protein:vir:1023 Length: 392 # 92.2 0.013 7.8E-06 31.0 25.9 330 1-440 1-392 (392)
142 protein:vir:3989 Length: 392 # 92.2 0.013 7.8E-06 31.0 25.9 330 1-440 1-392 (392)
143 protein:vir:4337 Length: 434 # 91.2 0.017 1.1E-05 30.3 22.4 386 1-502 1-434 (434)
144 protein:vir:3420 Length: 533 # 90.8 0.019 1.2E-05 30.0 26.4 445 1-558 1-533 (533)
145 protein:vir:78641 Length: 278 89.3 0.027 1.7E-05 29.2 22.5 258 78-444 1-278 (278)
146 protein:vir:96579 Length: 576 88.2 0.034 2.1E-05 28.7 28.7 426 1-559 1-516 (576)
147 protein:vir:79538 Length: 502 88.2 0.034 2.1E-05 28.7 26.7 449 1-559 1-501 (502)
148 protein:vir:10321 Length: 495 87.7 0.037 2.3E-05 28.4 17.5 432 1-559 1-495 (495)
149 protein:vir:5737 Length: 419 # 87.1 0.041 2.5E-05 28.2 21.5 353 1-485 1-419 (419)
150 protein:vir:4156 Length: 542 # 86.6 0.044 2.8E-05 28.0 22.5 403 30-559 1-499 (542)
151 protein:vir:81152 Length: 411 86.4 0.046 2.8E-05 27.9 24.9 375 1-493 1-411 (411)
152 protein:vir:4995 Length: 384 # 86.2 0.047 2.9E-05 27.8 23.1 355 10-473 1-384 (384)
153 protein:vir:1326 Length: 457 # 85.8 0.05 3.1E-05 27.7 21.4 416 1-559 1-446 (457)
154 protein:vir:78161 Length: 355 84.5 0.06 3.7E-05 27.3 16.3 310 210-559 1-355 (355)
155 protein:vir:4828 Length: 382 # 84.4 0.061 3.8E-05 27.3 27.6 323 38-475 1-382 (382)
156 protein:vir:8317 Length: 409 # 82.2 0.079 4.9E-05 26.6 23.8 342 10-473 1-409 (409)
157 protein:vir:4194 Length: 540 # 82.0 0.081 5E-05 26.6 19.5 419 30-559 1-494 (540)
158 protein:vir:4854 Length: 386 # 76.4 0.14 8.4E-05 25.3 23.0 343 10-475 1-386 (386)
159 protein:vir:104338 Length: 422 76.4 0.14 8.4E-05 25.3 18.4 394 1-540 1-422 (422)
160 protein:vir:3153 Length: 467 # 75.9 0.14 8.8E-05 25.2 23.0 417 51-559 1-466 (467)
161 protein:vir:100150 Length: 437 75.5 0.15 9.1E-05 25.2 23.8 403 1-543 1-437 (437)
162 protein:vir:107742 Length: 537 75.5 0.15 9.1E-05 25.2 20.3 444 1-559 68-534 (537)
163 protein:vir:10362 Length: 432 75.0 0.15 9.4E-05 25.1 21.1 369 1-484 1-432 (432)
164 protein:vir:389 Length: 530 # 72.7 0.18 0.00011 24.7 26.6 448 1-558 1-530 (530)
165 protein:vir:63755 Length: 547 71.9 0.19 0.00012 24.5 22.1 427 1-559 31-522 (547)
166 protein:vir:101648 Length: 518 71.7 0.19 0.00012 24.5 19.4 398 1-559 9-442 (518)
167 protein:vir:7853 Length: 518 # 71.1 0.2 0.00012 24.4 19.4 401 1-559 9-467 (518)
168 protein:vir:102118 Length: 409 69.7 0.22 0.00014 24.2 20.5 369 1-493 1-409 (409)
169 protein:vir:78749 Length: 337 69.5 0.22 0.00014 24.2 19.7 314 1-444 1-337 (337)
170 protein:vir:107662 Length: 427 68.6 0.24 0.00015 24.0 18.9 393 1-542 1-427 (427)
171 protein:vir:99312 Length: 563 64.9 0.29 0.00018 23.5 25.8 444 1-559 42-551 (563)
172 protein:vir:95599 Length: 563 64.9 0.29 0.00018 23.5 25.8 444 1-559 42-551 (563)
173 protein:vir:79984 Length: 441 64.8 0.29 0.00018 23.5 23.2 393 1-542 11-441 (441)
174 protein:vir:9408 Length: 441 # 64.8 0.29 0.00018 23.5 23.2 393 1-542 11-441 (441)
175 protein:vir:99563 Length: 862 60.9 0.36 0.00023 23.0 13.8 465 1-559 68-584 (862)
176 protein:vir:4454 Length: 414 # 60.3 0.37 0.00023 22.9 19.5 378 21-543 1-414 (414)
177 protein:vir:1266 Length: 416 # 60.2 0.38 0.00023 22.9 22.6 392 4-540 1-416 (416)
178 protein:vir:102080 Length: 429 55.7 0.47 0.00029 22.4 22.8 398 1-541 1-429 (429)
179 protein:vir:4598 Length: 416 # 53.4 0.53 0.00033 22.1 22.2 380 1-542 1-416 (416)
180 protein:vir:81095 Length: 416 53.4 0.53 0.00033 22.1 22.2 380 1-542 1-416 (416)
181 protein:vir:6240 Length: 457 # 48.9 0.66 0.00041 21.6 22.7 391 38-559 1-448 (457)
182 protein:vir:97060 Length: 432 48.0 0.68 0.00042 21.5 21.0 360 1-484 1-432 (432)
183 protein:vir:80644 Length: 551 45.7 0.76 0.00047 21.2 23.4 439 1-559 39-526 (551)
184 protein:vir:9359 Length: 348 # 45.4 0.77 0.00048 21.2 19.6 289 78-477 1-348 (348)
185 protein:vir:1431 Length: 419 # 39.7 1 0.00062 20.6 19.8 382 15-534 1-419 (419)
186 protein:vir:98853 Length: 219 35.4 1.2 0.00077 20.1 10.2 200 199-454 1-219 (219)
187 protein:vir:94049 Length: 532 33.7 1.3 0.00083 19.9 15.2 469 1-558 1-532 (532)
188 protein:vir:100249 Length: 431 33.5 1.3 0.00084 19.9 25.3 387 1-530 1-431 (431)
189 protein:vir:80796 Length: 574 32.8 1.4 0.00087 19.8 22.2 450 1-559 1-539 (574)
190 protein:vir:80333 Length: 419 32.5 1.4 0.00088 19.8 22.7 358 22-491 1-419 (419)
191 protein:vir:3743 Length: 345 # 28.6 1.7 0.0011 19.3 21.3 311 37-446 1-345 (345)
192 protein:vir:100187 Length: 385 25.5 2 0.0013 18.9 19.8 330 38-481 1-385 (385)
193 protein:vir:8418 Length: 409 # 23.6 2.3 0.0014 18.6 21.7 356 1-481 1-409 (409)
194 protein:vir:100882 Length: 383 21.3 2.6 0.0016 18.3 19.4 337 38-502 1-383 (383)
195 protein:vir:1082 Length: 359 # 21.2 2.6 0.0016 18.3 19.4 333 1-470 1-359 (359)
196 protein:vir:3843 Length: 397 # 20.9 2.7 0.0017 18.2 24.3 370 1-542 1-397 (397)

No 1
>protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984
Probab=100.00 E-value=5.9e-200 Score=1112.81 Aligned_cols=559 Identities=99% Similarity=1.418 Sum_probs=549.8

Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC
Q lcl|NC_019445.
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |+++++++|++||+.|+++|++||++|+||++||+|++++|.+++.+++.++.+++|||||++|+++||||||++||||+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 99999999999999999999999999999999999999999999988889999999999999999999999999999999 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYL 160 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v 160 (559) +|||||++.|+++.+.+++++||++|+++|+++|++||||.++|++|+||++||||++|+++|+++++||++|||++||| T Consensus 81 ~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~~~v 160 (559) T protein:vir:95 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGSYYL 160 (559) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEec Q lcl|NC_019445. 
161 ANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVG 240 (559) Q Consensus 161 ~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~ 240 (559) ++|++|+||+|||+|+||++||+++||+++||+++++++++++++++|+|+|+|+||.++++++++.++|||.||||+++ T Consensus 161 ~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~ 240 (559) T protein:vir:95 161 ANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVG 240 (559) T ss_pred eeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEEEEEEEec Confidence 99999999999999999999999999999999999999999988889999999999999999999999999999999999 Q ss_pred CCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceecC Q lcl|NC_019445. 241 GDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLP 320 (559) Q Consensus 241 ~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~p 320 (559) ++++++++||||++|||+++||++.+|++||||+|+++||||+|+||.++++++++++++++|||++|++++.+++++.| T Consensus 241 ~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~l~p 320 (559) T protein:vir:95 241 GDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLP 320 (559) T ss_pred CCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccceeeec Confidence 98899999999999999999999999999999988999999999999999999999999999999999999888999999 Q ss_pred CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHH Q lcl|NC_019445. 
321 GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) Q Consensus 321 g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~ 400 (559) ||+++++..++.+.++|+++++++++.+.+.|++++++|+++||+|+|.++.++++++||||||++|++|++++|||||+ T Consensus 321 gg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~ 400 (559) T protein:vir:95 321 GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) T ss_pred cceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHH Confidence 99999988887888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHH Q lcl|NC_019445. 401 RLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVD 480 (559) Q Consensus 401 ~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d 480 (559) ||++|||.|+|+|+|++|+|+|+||++|++|.|.+|+|+|+|||+++||+.++++|.++++++++|+|++|+++|+||+| T Consensus 401 rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d 480 (559) T protein:vir:95 401 RLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVD 480 (559) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
481 QAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 481 ~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ++++++++++|||++++||++||+++||||+||||++|+++++.++++++|+++++++.+++++++|++++.|+++++| T Consensus 481 ~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 481 QAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=9e-197 Score=1095.37 Aligned_cols=556 Identities=80% Similarity=1.240 Sum_probs=542.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |+++++++|++||+.|+++|++||++|+||++||+|++++|.+++++++.++.+++|||||++|+++||||||++||||+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPA 80 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCCC Confidence 99999999999999999999999999999999999999999999999988889999999999999999999999999999 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEE Q lcl|NC_019445. 
81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYL 160 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v 160 (559) +|||+|++.|+++.+..++++||++|+++|+++|++||||.++|++|+||++||||++|+++|+++++||++||+++||| T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~~~~ 160 (556) T protein:vir:73 81 RPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGSYYL 160 (556) T ss_pred CcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEec Q lcl|NC_019445. 161 ANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVG 240 (559) Q Consensus 161 ~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~ 240 (559) ++|++|+||+|||+|+||++||+++||+++||+.+++++++++++.+|+|+|+|+||.+++++++++++|||.|+||+.+ T Consensus 161 ~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~~ 240 (556) T protein:vir:73 161 ANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFESG 240 (556) T ss_pred eeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEEEEEec Confidence 99999999999999999999999999999999999999999988889999999999999999999999999999999999 Q ss_pred CCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceecC Q lcl|NC_019445. 
241 GDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLP 320 (559) Q Consensus 241 ~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~p 320 (559) ++++++++||||++|||+++||++.+|++||||+|++++|||+|+||.++++++++++++++|||++|++++++++++.| T Consensus 241 ~~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~p 320 (556) T protein:vir:73 241 GDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLP 320 (556) T ss_pred CCCceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccceeecc Confidence 88899999999999999999999999999999988999999999999999999999999999999999999888899999 Q ss_pred CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHH Q lcl|NC_019445. 321 GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) Q Consensus 321 g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~ 400 (559) ||+|+++..++.+.++|++.++++++.+.+.|++++++|+++||+|+|.++.++++++||||||++|++|++++|||||+ T Consensus 321 gg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~ 400 (556) T protein:vir:73 321 GDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (556) T ss_pred CccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHH Confidence 99999988888889999999998999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHH Q lcl|NC_019445. 
401 RLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVD 480 (559) Q Consensus 401 ~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d 480 (559) ||++|||.|+|+|+|+||+|+|+||++|++|.|.+|+|+|+|||+++||..++++|.++++++++|+|++|+++|+||+| T Consensus 401 rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d 480 (556) T protein:vir:73 401 RLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVD 480 (556) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCC Q lcl|NC_019445. 481 QAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGG 556 (559) Q Consensus 481 ~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 556 (559) ++++++++++|||++++||++||+++||+|+++||++++++++++++++||+++++++.+++++++|+++++.+.. T Consensus 481 ~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 481 QAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999886664443 No 3 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=3.6e-192 Score=1070.16 Aligned_cols=554 Identities=44% Similarity=0.755 Sum_probs=540.2 Q ss_pred CChh-hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAET-TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~-~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |++. .+++|++||+.|+++|++||++|+||++||+|++++|+..+++.++++.+++|||||++|+++||||||++|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 9885 577899999999999999999999999999999999988888888888999999999999999999999999999 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) ++|||||.+.|+++++..++++||++|+++|+++|++||||.++|++|+||++||||++|+++|.++++||++||+++|| T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:98 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe Q lcl|NC_019445. 160 LANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV 239 (559) Q Consensus 160 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~ 239 (559) |++|++|+||+|||+|+||++||+++||+++||+.+++++++++++.+|+|+|+|+||.++++++.++++|||+||||+. 
T Consensus 161 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~ 240 (555) T protein:vir:98 161 IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEP 240 (555) T ss_pred EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEe Confidence 99999999999999999999999999999999999999999988888999999999999999999999999999999999 Q ss_pred cCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceec Q lcl|NC_019445. 240 GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLL 319 (559) Q Consensus 240 ~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~ 319 (559) +.+++++++||||++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++|||++|++++...+++. T Consensus 241 ~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrg-p~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~ 319 (555) T protein:vir:98 241 GADETRTLRESGYRSFRALCPRWALVGGDIYGNS-PAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTV 319 (555) T ss_pred ccCCccccccCCcccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceec Confidence 8888899999999999999999999999999999 899999999999999999999999999999999999888889999 Q ss_pred CCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHH Q lcl|NC_019445. 
320 PGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVL 399 (559) Q Consensus 320 pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~ 399 (559) |||++++....+.+.++|++.+.++++.+.+.|++++++|+++||+|||.++.++++++||||||++|++|++++||||| T Consensus 320 pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~ 399 (555) T protein:vir:98 320 PGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVL 399 (555) T ss_pred cccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHH Confidence 99999997766677889998888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCH Q lcl|NC_019445. 400 ERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNV 479 (559) Q Consensus 400 ~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~ 479 (559) +||++|||.|+|+|+|+||+|+|+||++|+++.|.+|+|+|+|||+++||..++++|.++++++++++|++|+++|+||+ T Consensus 400 ~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~ 479 (555) T protein:vir:98 400 ERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDA 479 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCC Q lcl|NC_019445. 
480 DQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQG 555 (559) Q Consensus 480 d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (559) |++++++++++|||++++||++||+++|+||+||||++++++++.++++.+++++++.+.+++++++++++.+|.+ T Consensus 480 d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 480 DRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=3.6e-192 Score=1070.16 Aligned_cols=554 Identities=44% Similarity=0.755 Sum_probs=540.2 Q ss_pred CChh-hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAET-TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~-~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |++. .+++|++||+.|+++|++||++|+||++||+|++++|+..+++.++++.+++|||||++|+++||||||++|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 9885 577899999999999999999999999999999999988888888888999999999999999999999999999 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 
80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) ++|||||.+.|+++++..++++||++|+++|+++|++||||.++|++|+||++||||++|+++|.++++||++||+++|| T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe Q lcl|NC_019445. 160 LANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV 239 (559) Q Consensus 160 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~ 239 (559) |++|++|+||+|||+|+||++||+++||+++||+.+++++++++++.+|+|+|+|+||.++++++.++++|||+||||+. T Consensus 161 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~ 240 (555) T protein:vir:10 161 IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEP 240 (555) T ss_pred EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEe Confidence 99999999999999999999999999999999999999999988888999999999999999999999999999999999 Q ss_pred cCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceec Q lcl|NC_019445. 240 GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLL 319 (559) Q Consensus 240 ~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~ 319 (559) +.+++++++||||++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++|||++|++++...+++. 
T Consensus 241 ~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrg-p~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~ 319 (555) T protein:vir:10 241 GADETRTLRESGYRSFRALCPRWALVGGDIYGNS-PAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTV 319 (555) T ss_pred ccCCccccccCCcccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceec Confidence 8888899999999999999999999999999999 899999999999999999999999999999999999888889999 Q ss_pred CCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHH Q lcl|NC_019445. 320 PGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVL 399 (559) Q Consensus 320 pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~ 399 (559) |||++++....+.+.++|++.+.++++.+.+.|++++++|+++||+|||.++.++++++||||||++|++|++++||||| T Consensus 320 pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~ 399 (555) T protein:vir:10 320 PGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVL 399 (555) T ss_pred cccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHH Confidence 99999997766677889998888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCH Q lcl|NC_019445. 
400 ERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNV 479 (559) Q Consensus 400 ~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~ 479 (559) +||++|||.|+|+|+|+||+|+|+||++|+++.|.+|+|+|+|||+++||..++++|.++++++++++|++|+++|+||+ T Consensus 400 ~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~ 479 (555) T protein:vir:10 400 ERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDA 479 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCC Q lcl|NC_019445. 480 DQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQG 555 (559) Q Consensus 480 d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (559) |++++++++++|||++++||++||+++|+||+||||++++++++.++++.+++++++.+.+++++++++++.+|.+ T Consensus 480 d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 480 DRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=3.6e-192 Score=1070.16 Aligned_cols=554 Identities=44% Similarity=0.755 Sum_probs=540.2 Q ss_pred CChh-hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAET-TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~-~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |++. .+++|++||+.|+++|++||++|+||++||+|++++|+..+++.++++.+++|||||++|+++||||||++|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 9885 577899999999999999999999999999999999988888888888999999999999999999999999999 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) ++|||||.+.|+++++..++++||++|+++|+++|++||||.++|++|+||++||||++|+++|.++++||++||+++|| T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe Q lcl|NC_019445. 160 LANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV 239 (559) Q Consensus 160 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~ 239 (559) |++|++|+||+|||+|+||++||+++||+++||+.+++++++++++.+|+|+|+|+||.++++++.++++|||+||||+. 
T Consensus 161 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~ 240 (555) T protein:vir:10 161 IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYFEP 240 (555) T ss_pred EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEEEe Confidence 99999999999999999999999999999999999999999988888999999999999999999999999999999999 Q ss_pred cCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceec Q lcl|NC_019445. 240 GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLL 319 (559) Q Consensus 240 ~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~ 319 (559) +.+++++++||||++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++|||++|++++...+++. T Consensus 241 ~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrg-p~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~~~~ 319 (555) T protein:vir:10 241 GADETRTLRESGYRSFRALCPRWALVGGDIYGNS-PAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTV 319 (555) T ss_pred ccCCccccccCCcccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccceec Confidence 8888899999999999999999999999999999 899999999999999999999999999999999999888889999 Q ss_pred CCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHH Q lcl|NC_019445. 
320 PGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVL 399 (559) Q Consensus 320 pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~ 399 (559) |||++++....+.+.++|++.+.++++.+.+.|++++++|+++||+|||.++.++++++||||||++|++|++++||||| T Consensus 320 pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~ 399 (555) T protein:vir:10 320 PGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVL 399 (555) T ss_pred cccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHH Confidence 99999997766677889998888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCH Q lcl|NC_019445. 400 ERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNV 479 (559) Q Consensus 400 ~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~ 479 (559) +||++|||.|+|+|+|+||+|+|+||++|+++.|.+|+|+|+|||+++||..++++|.++++++++++|++|+++|+||+ T Consensus 400 ~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~ 479 (555) T protein:vir:10 400 ERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDA 479 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCC Q lcl|NC_019445. 
480 DQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQG 555 (559) Q Consensus 480 d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (559) |++++++++++|||++++||++||+++|+||+||||++++++++.++++.+++++++.+.+++++++++++.+|.+ T Consensus 480 d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 480 DRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhccCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=2.8e-182 Score=1015.91 Aligned_cols=538 Identities=28% Similarity=0.437 Sum_probs=509.0 Q ss_pred CChh---hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC---CCCCCcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAET---TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT---SEVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~---~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) |+.. .+++|++||+.|+++|++||++|+||++||+|++++|.. .+++++.++.+++|||||++|+++||||||+ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 8764 467889999999999999999999999999999988754 3456677888999999999999999999999 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH--hccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEE Q lcl|NC_019445. 
75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN--KSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMP 152 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~--~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~ 152 (559) +||||++|||||.+.|+++.+.+++++||++|+++|+.+++ +||||.++|++|+||++||||++|+++|.+++++|++ T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~ 160 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRN 160 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEE Confidence 99999999999999999999999999999999999999764 8999999999999999999999999999999999999 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) |||++|||.+|++|+||+|||+|+||++||+++||+++||+.+++.+++++ +++|+|||+|+||.++++++.++++||| T Consensus 161 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~-~~~~~v~~~V~pr~~~~~~~~~~~~~pf 239 (549) T protein:vir:10 161 VPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDP-EKSAIFYHAVEPRADRDPRKLDGRNMQF 239 (549) T ss_pred EEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCC-CceEEEEEEeecCCCCCccccccccCce Confidence 999999999999999999999999999999999999999999999999876 5789999999999999999999999999 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~ 312 (559) .||||++++ +++++||||++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++|||++|+++. 
T Consensus 240 ~sv~~e~~~--~~il~esg~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~ 316 (549) T protein:vir:10 240 ASYWLDEGR--DRIVQNSGFRTFPFAIGRFYVGTDDVYGGS-PAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGV 316 (549) T ss_pred EEEEEEecC--CEeeccCCcccCCcceeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc Confidence 999999854 589999999999999999999999999999 89999999999999999999999999999999999887 Q ss_pred cccceecCCceeecCC-cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHH Q lcl|NC_019445. 313 NQRASLLPGDITYIDQ-ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (559) Q Consensus 313 ~~~~~~~pg~~~~~~~-~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~ 391 (559) .+.+++.||+++++.. .++...++|++.+ .+++.+++.|++++++|+++||+|+|.+. +++++||||||++|++|+ T Consensus 317 ~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~~--~~~~~~TAtEV~~r~~E~ 393 (549) T protein:vir:10 317 LDGFDLRSGALNWGGLNDKGEEMVKPLLTG-KQAQIGIEFAQDTRQTINQWFYVTLFQIL--VDSGDMTATEVLQRAQEK 393 (549) T ss_pred cccceeccCCccccccCCCCccceeeeccc-cchhHHHHHHHHHHHHHHHHHhhhhhhhh--cCCCCccHHHHHHHHHHH Confidence 7889999999998753 4455679998876 47888889999999999999999998764 689999999999999999 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh--CCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019445. 
392 LLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM--EGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQA 469 (559) Q Consensus 392 ~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l--~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~ 469 (559) +++|||||+||++|||.|+|+|+|+||+|+|+||++|++| .|.+++|+|||+|+++||..++++|.++++++++|+|+ T Consensus 394 ~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~ 473 (549) T protein:vir:10 394 GVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQF 473 (549) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 9999999999999999999999999999999999999998 67899999999999999999999999999999999999 Q ss_pred ChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHH Q lcl|NC_019445. 470 KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLS 545 (559) Q Consensus 470 ~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 545 (559) +|+++|+||+|++++++++++|||++++||++||+++|++|+||||++++++++.+++++||+++++.+..|++.- T Consensus 474 ~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 474 DPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred ChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCcccCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999887655 No 7 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=1.2e-175 Score=979.51 Aligned_cols=531 Identities=24% Similarity=0.369 Sum_probs=487.2 Q ss_pred hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCc---ccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 
4 TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRN---DRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 4 ~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~---~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) .++++|++||+.|+++|++||++|+||++||+|++++|.++....+ .++++++|||||++|+++||||||++||||+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 5678899999999999999999999999999999999887766544 4678899999999999999999999999999 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC--CceEEEEEeeccEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD--EDIIRTMPFPIGSY 158 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~--~~~~~~~~~~l~~~ 158 (559) +|||||++.|+++.+.+++++||++|+++|+++|++||||.++|++|+||++||||++|+++|+ ++++||++||+++| T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~ 160 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDS 160 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceE Confidence 9999999999999999999999999999999999999999999999999999999999998764 56899999999999 Q ss_pred EEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCC--ceEEEEEEEeecCccccccc-----cccccc Q lcl|NC_019445. 159 YLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYE--KWIEVMHSVYPNIDRDTSKL-----DSKNKP 231 (559) Q Consensus 159 ~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~--~~v~v~~~v~p~~~~~~~~~-----~~~~~~ 231 (559) ||++|++|+||+|||+|+||++||+++||.++||+++++++++++++ .+++|+|+|+|+.++++++. 
+.++|| T Consensus 161 ~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p 240 (547) T protein:vir:10 161 YFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERP 240 (547) T ss_pred EEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccc Confidence 99999999999999999999999999999999999999999876543 37999999999999988764 457999 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL 311 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~ 311 (559) |+|+||+.++ .+++++||||++|||+++||++.+|++|||| |++++|||+|+||.++++++++++++++|||++|+++ T Consensus 241 ~~s~~~e~~~-~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g 318 (547) T protein:vir:10 241 FGKKWILKEG-AVQLGEEGGYYEMPAYAIRWRKSAGSQWGFG-PSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERG 318 (547) T ss_pred eeEEEEEecC-ceeeeecCCcccCCeeeeeeeecCCcccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccc Confidence 9999999764 4689999999999999999999999999999 8999999999999999999999999999999999887 Q ss_pred ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHH Q lcl|NC_019445. 312 KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (559) Q Consensus 312 ~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~ 391 (559) ...++++.|||+++++. 
.+.++|+..+ .+++.+.+.|++++++|+++||+|+|++ +++++||||||++|++|+ T Consensus 319 ~~~~~~~~pgg~~~~~~---~~~v~pl~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~---~~~~~~TAtEV~~r~~E~ 391 (547) T protein:vir:10 319 LISDIDLGASGLTVVRD---MESMKPFESR-ARFDVSSIQLTDLRSAVRRIYYVDQLQM---KDSPAMTATEVQVRYELM 391 (547) T ss_pred ccccceecCCeeeecCC---cccceeeecc-cchHHHHHHHHHHHHHHHHHhhhhhhhc---CCCccccHHHHHHHHHHH Confidence 67789999999998754 4567887544 5788888999999999999999998764 688999999999999999 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh---CCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 392 LLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM---EGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQ 468 (559) Q Consensus 392 ~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l---~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~ 468 (559) +++|||||+||++|||.|+|+|+|++|+|.|+||++|+++ .+.+++|+|+|+|+++||..++++|.++++++++++| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq 471 (547) T protein:vir:10 392 QRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAE 471 (547) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 9999999999999999999999999999999999999998 5789999999999999999999999999999999999 Q ss_pred cChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC-ChhH Q lcl|NC_019445. 469 AKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTS-DPSV 543 (559) Q Consensus 469 ~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~-~~~~ 543 (559) ++|+++|+||+|++++++++++|||++++||++||+++|+||+++||+++++++++++.++++.++...+. -++. 
T Consensus 472 ~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 472 INPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKENQ 547 (547) T ss_pred cChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhccC Confidence 99999999999999999999999999999999999999999999999888877777666666655432222 1111 No 8 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=7.4e-159 Score=887.50 Aligned_cols=524 Identities=16% Similarity=0.159 Sum_probs=445.2 Q ss_pred hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcce Q lcl|NC_019445. 5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) Q Consensus 5 ~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf 84 (559) -++.+++||+.|+++|++||++|+||++||+|+++++.++. +..+..++|||||++|+++|||||||+||||++||| T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~---~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 77 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHA---SGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFF 77 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCc---ccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 34557799999999999999999999999999997665433 344567899999999999999999999999999999 Q ss_pred eccCCccchhh--------HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeecc Q lcl|NC_019445. 
85 RLATPDPEMMD--------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIG 156 (559) Q Consensus 85 ~l~~~d~~~~~--------~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~ 156 (559) ||.++|.++.+ ..+++.||++||++|+++|++||||.++|++|+||++||||++|+++++ |++|||+ T Consensus 78 ~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-----~~~~pl~ 152 (542) T protein:vir:78 78 KLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKKT-----LKVYPLD 152 (542) T ss_pred cccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCCC-----ceEEecc Confidence 99999876554 4679999999999999999999999999999999999999999998873 6789999 Q ss_pred EEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 157 SYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 157 ~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) +|||++|++|+||+|||+|+||++||+++||+++||+.++++.++++ +.+|+|+|+|+||.++++.+...++++|.|+| T Consensus 153 ~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~-~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~ 231 (542) T protein:vir:78 153 RYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGED-GPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWH 231 (542) T ss_pred eeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccC-CCeEEEEEEeecccCCccccccccCCCeEEEE Confidence 99999999999999999999999999999999999999999877655 57899999999999999999999999999999 Q ss_pred EEecCCC-ceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc Q lcl|NC_019445. 237 YEVGGDN-DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR 315 (559) Q Consensus 237 ~~~~~~~-~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~ 315 (559) ++.++.. ...++|+||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++|||++++++.... 
T Consensus 232 ~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~ 310 (542) T protein:vir:78 232 QECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRG-RVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKP 310 (542) T ss_pred EEeccccccccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccch Confidence 9986653 234899999999999999999999999999 89999999999999999999999999999999988876666 Q ss_pred ceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHH Q lcl|NC_019445. 316 ASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) Q Consensus 316 ~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~ 394 (559) .++.||+..++- .+..+.++|+..++ .+++.+.+.|++++++|+++||.+ ..+++++||||||++|++|++++ T Consensus 311 ~~~~~~~~g~iv-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~-----~~~d~~rvTAtEV~~r~~E~~~~ 384 (542) T protein:vir:78 311 QSLARAGTGAII-QGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLIL-----NVRQSERTTATEVREVQMELDRQ 384 (542) T ss_pred hhcccCCCceee-cCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccc-----ccCCcccccHHHHHHHHHHHHHH Confidence 666665543331 12234466665443 478888999999999999999743 45899999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhH Q lcl|NC_019445. 395 LGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEAL 474 (559) Q Consensus 395 LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~ 474 (559) |||||+||++|||.|+|+|+|++|+|+|+||++|+++ ++|+|+|||+++||++++++|.+|++.++++.. 
+|.++ T Consensus 385 LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~l----v~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~-p~~l~ 459 (542) T protein:vir:78 385 LSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGL----VMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMG-PEALQ 459 (542) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhc----eeeeeechHHHHHHHHHHHHHHHHHHHHHHhcC-ChhHH Confidence 9999999999999999999999999999999999987 899999999999999999999999999988633 45567 Q ss_pred hcCCHHHHHHHHHHHcCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhc Q lcl|NC_019445. 475 DKLNVDQAIDAFADMSGVSP-TVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSG 553 (559) Q Consensus 475 ~~id~d~~~~~~a~~~Gvp~-~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 553 (559) ++||+|++++++++++|||+ .+++|+||+++++++++|+++++..+++ ++. ++.+++.++...+.+++ + T Consensus 460 ~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~---a~~----~a~~~~~~~~~~~~~a~---~ 529 (542) T protein:vir:78 460 QFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQ---AGQ----LAKSPIGEKMMQQINAP---G 529 (542) T ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHh---hhh----ccccccccchhhhcCCC---C Confidence 89999999999999999995 6999999998876654443333333222 222 23334433332222322 2 Q ss_pred CCCCCC Q lcl|NC_019445. 554 QGGQSQ 559 (559) Q Consensus 554 ~~~~~~ 559 (559) ++.+++ T Consensus 530 ~~~~~~ 535 (542) T protein:vir:78 530 QEAPAG 535 (542) T ss_pred cCCCCC Confidence 222222 No 9 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=1.2e-155 Score=869.94 Aligned_cols=534 Identities=15% Similarity=0.143 Sum_probs=453.8 Q ss_pred hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcce Q lcl|NC_019445. 
5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) Q Consensus 5 ~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf 84 (559) -+++|++||+.|+++|++|+++|+||++||+|+++++.++ .+..+..++|||||++|+++|||||||+||||++||| T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF 77 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGH---VQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFF 77 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC---cccccccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 3455789999999999999999999999999999766443 3455678899999999999999999999999999999 Q ss_pred eccCCccchhh-------HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 85 RLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 85 ~l~~~d~~~~~-------~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) ||.+.|+++.+ ...++.||++|+++|+.+|++||||.++|++|+||++||||++|+++++ ++.|||++ T Consensus 78 ~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~-----~~~~pl~~ 152 (555) T protein:vir:17 78 KLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKKN-----LKLYPLDR 152 (555) T ss_pred ccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCCc-----eeEEEcCe Confidence 99999876544 4568999999999999999999999999999999999999999998774 45699999 Q ss_pred EEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccc---cccccEEE Q lcl|NC_019445. 
158 YYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLD---SKNKPFKS 234 (559) Q Consensus 158 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~---~~~~~~~s 234 (559) |||.+|++|+||+|||||+||++||+++||++.+++.+++.+++.+ +..++++|++.|+..+.....+ ...+++.+ T Consensus 153 y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~~ 231 (555) T protein:vir:17 153 FVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGE-DGPKMGVTAPGGRDKGKSNDALVYTYVCRKDGQ 231 (555) T ss_pred EEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccc-cchhhhhhhhcccccCCCcceeEeecccccCCe Confidence 9999999999999999999999999999999999999999998765 5668899999998876654432 23445555 Q ss_pred EEEEecCCCcee---eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC Q lcl|NC_019445. 235 VYYEVGGDNDKL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL 311 (559) Q Consensus 235 v~~~~~~~~~~i---l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~ 311 (559) ++|+.+.++.++ ++|+||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++|||++++++ T Consensus 232 ~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g 310 (555) T protein:vir:17 232 VKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRG-RVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSA 310 (555) T ss_pred eEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Confidence 555555555554 789999999999999999999999999 8999999999999999999999999999999999888 Q ss_pred ccccceecCCceeecCCcCCchhhhhhhhccc-cHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 312 KNQRASLLPGDITYIDQITGQDGFRPAYLVNP-STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 312 ~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~-~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) .....++.||+.+++. .+..+.++|+..+++ +++.+++.|++++++|+++||. 
+..+++++||||||++|++| T Consensus 311 ~~~~~~l~~~~~g~v~-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~-----~~~~d~~r~TAtEV~~r~~E 384 (555) T protein:vir:17 311 TTKPQNLALAANGAII-QGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM-----LQVRQSERTTATEVQATVQE 384 (555) T ss_pred ccCcceeecCCCceee-cCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh-----cCCCCcccchHHHHHHHHHH Confidence 7788899998876663 344566888887764 6888899999999999999963 35689999999999999999 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019445. 391 KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAK 470 (559) Q Consensus 391 ~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~ 470 (559) ++++|||||+||++|||.|+|+|+|++|+|+|+||++|+++ +++++++++.+++|+++++++.+|++.++++.+ + T Consensus 385 ~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~----v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~-~ 459 (555) T protein:vir:17 385 LNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDL----VQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMG-P 459 (555) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhh----hccceeehHHHHHHHHHHHHHHHHHHHHHhhcC-c Confidence 99999999999999999999999999999999999999997 778999999999999999999999888877755 6 Q ss_pred hhhHhcCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---hhhhcCCChhHHHH Q lcl|NC_019445. 471 PEALDKLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKT---LSEAKTSDPSVLSA 546 (559) Q Consensus 471 P~~~~~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~---~~~~~~~~~~~~~~ 546 (559) |+++|+||+|++++.+++++||| ..++||+||++++||++++++|+++++++++++++.+.. 
+.......+.+.++ T Consensus 460 p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~ 539 (555) T protein:vir:17 460 EIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDA 539 (555) T ss_pred hhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHH Confidence 89999999999999999999995 679999999999999998888888887777766554322 22222223333333 Q ss_pred HH--HHhhcCCCCCC Q lcl|NC_019445. 547 MA--NAVSGQGGQSQ 559 (559) Q Consensus 547 ~~--~~~~~~~~~~~ 559 (559) .+ .+...+-|.++ T Consensus 540 ~~a~~~~~~~~~~~~ 554 (555) T protein:vir:17 540 GAAESETSSAEAQAG 554 (555) T ss_pred HHHHhhcCCcccccC Confidence 21 11111222222 No 10 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=2.4e-154 Score=862.80 Aligned_cols=519 Identities=15% Similarity=0.154 Sum_probs=439.3 Q ss_pred CChhh-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |++.. ++.+++||+.|+++|++|+++|+||++||+|++++.. ...+..+..++|||||++|+++|||||||+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKE---SDNESTDYTTPWQAVGARGLNNLASKLMLA 77 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCC---CCcccccccccccccHHHHHHHHHHHHHHh Confidence 99864 4567889999999999999999999999999985432 233445667899999999999999999999 Q ss_pred hcCCCCcceeccCCccc-------hhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceE Q lcl|NC_019445. 
76 ITSPARPWFRLATPDPE-------MMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDII 148 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~-------~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~ 148 (559) |||+ +|||||.+.|.. ..+.++++.||++|+++|+++|++||||.++|++|+||++||||++|+++++++++ T Consensus 78 ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~ 156 (535) T protein:vir:15 78 LFPM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYN 156 (535) T ss_pred hcCC-CcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCce Confidence 9986 799999998854 34567899999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) +|++|||++|||.+|++|+||+|||+|+||+++|.++|+.+..+ ... +++++++|+|||+|+|+.+ T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~----~~~-~~~~~~~v~v~~~v~~~~~--------- 222 (535) T protein:vir:15 157 PMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEK----AGG-EKKMDEMVDVYTHVYLDEE--------- 222 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhc----ccc-ccCCCCceeEEEEEEEecC--------- Confidence 99999999999999999999999999999999999999865422 223 3345678999999998643 Q ss_pred cccEEEEEEEecCCC-ceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_019445. 229 NKPFKSVYYEVGGDN-DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA 307 (559) Q Consensus 229 ~~~~~sv~~~~~~~~-~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~ 307 (559) +++|.+ |.++++.. 
....+++||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++|||++ T Consensus 223 ~~~~~~-~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv 300 (535) T protein:vir:15 223 SGDYLK-YEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRS-YCEEYLGDLRSLENLQEAIVKMSMISAKVIGLV 300 (535) T ss_pred CCcEEE-EEEeeCccccccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 346654 44544332 122456678999999999999999999999 899999999999999999999999999999999 Q ss_pred cCCCccccceecCCceeecCCcCCchhhhhhh-hccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHH Q lcl|NC_019445. 308 PTSLKNQRASLLPGDITYIDQITGQDGFRPAY-LVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIE 386 (559) Q Consensus 308 p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~-~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~ 386 (559) ++++..+..++.||+..++-. +..+.+.|+. ...++++.+.+.|++++++|+++||.| ++..+++++||||||++ T Consensus 301 ~~~g~~~~~~l~~~~~g~~v~-g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---~~~~~~~~r~TAtEV~~ 376 (535) T protein:vir:15 301 NPAGITQPRRLTKAQTGDFVP-GRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN---SAVQRTGERVTAEEIRY 376 (535) T ss_pred cccccccchhcccCCceeeec-CCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCccccHHHHHH Confidence 988877778888877654421 2233445553 223578888999999999999999876 66778999999999999 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 387 MKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 387 r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) |++|++++|||||+||++|||.|||+|+|++|+|.|+||++|+++ ++|+|+|||+++||++++++|.+|++. 
+ T Consensus 377 r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~----v~~~yis~La~aqr~~~~~~l~~~~~~---l 449 (535) T protein:vir:15 377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCISA---W 449 (535) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc----eeEEEecHHHHHHHHHHHHHHHHHHHH---H Confidence 999999999999999999999999999999999999999999875 999999999999999999999998765 4 Q ss_pred hccChhhHh-cCCHHHHHHHHHHHcCCCcc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHH Q lcl|NC_019445. 467 AQAKPEALD-KLNVDQAIDAFADMSGVSPT-VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVL 544 (559) Q Consensus 467 a~~~P~~~~-~id~d~~~~~~a~~~Gvp~~-~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 544 (559) ++++|++++ +||+|++++++++++|||++ +++|+||++++++++++++++++++++ ++ +.++..++.+|+.+ T Consensus 450 a~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~---~g---~~~~~~~~~~p~~~ 523 (535) T protein:vir:15 450 AALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAAT---GG---AGVGALATSSPEAM 523 (535) T ss_pred HhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH---HH---hhccchhccChHHH Confidence 568999998 59999999999999999975 999999999888765444444433322 22 22333456678999 Q ss_pred HHHHHHhhcCCC Q lcl|NC_019445. 545 SAMANAVSGQGG 556 (559) Q Consensus 545 ~~~~~~~~~~~~ 556 (559) +++++.++.++. T Consensus 524 ~~~~~~~g~~~~ 535 (535) T protein:vir:15 524 QGAAAQAGLDAT 535 (535) T ss_pred HHHHhccCCCCC Confidence 999999888888 No 11 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=5.7e-154 Score=860.72 Aligned_cols=517 Identities=16% Similarity=0.136 Sum_probs=437.3 Q ss_pred CChhh------HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAETT------KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~~------~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) |+... ++++++||+.|+++|++||++|+||++||+|++.++.++ .+..+..++|||||++|+++||||||| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~ 77 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSD---NASTDYTTPWQAVGARGLNNLASKLML 77 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCC---ccccccCCcccccHHHHHHHHHHHHHh Confidence 88875 344899999999999999999999999999998765443 344566789999999999999999999 Q ss_pred hhcCCCCcceeccCCccc-------hhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce Q lcl|NC_019445. 75 GITSPARPWFRLATPDPE-------MMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI 147 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~-------~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~ 147 (559) +|||+ +|||||.+.|.. ..+.+++++||+.|+++|+.+|++||||.++|++|+||++||||++|++++.+++ T Consensus 78 ~ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 156 (535) T protein:vir:94 78 ALFPM-QTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTY 156 (535) T ss_pred hhcCC-CCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcc Confidence 99976 689999998754 3566789999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) ++|++|||++|||.+|++|+||+|||||+++++++.++|+. .+++..++ +++.+|+|||||+|+. 
T Consensus 157 ~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~-----~~~~~~~~-~~~~~v~v~~~v~~~~--------- 221 (535) T protein:vir:94 157 NPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRN-----SMDSSQEH-KGDEMIDVYTHIYLDE--------- 221 (535) T ss_pred cceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHH-----HHHhcccc-CCCceeEEEEEEEeeC--------- Confidence 99999999999999999999999999999999999888764 33333333 4467899999999864 Q ss_pred ccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_019445. 228 KNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA 307 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~ 307 (559) .+|||.++|+..+......++++||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++|||++ T Consensus 222 ~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv 300 (535) T protein:vir:94 222 ESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRS-YCEEYLGDLRSLENLQEAIVKMSMISAKVIGLV 300 (535) T ss_pred CCCcEEEEEEecCeeeccccccCccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc Confidence 4688988664432222234678899999999999999999999999 999999999999999999999999999999998 Q ss_pred cCCCccccceecCC--ceeecCCcCCchhhhhh-hhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 308 PTSLKNQRASLLPG--DITYIDQITGQDGFRPA-YLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 308 p~~~~~~~~~~~pg--~~~~~~~~~~~~~~~p~-~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) ++++.....++.++ |.+..+..+ .+.|+ .....+++.+.+.|++++++|+++||. 
.++.++++++|||||| T Consensus 301 ~p~g~~~~~~~~~~~~g~~v~g~~~---~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~---~~~~~~d~~rvTAtEV 374 (535) T protein:vir:94 301 NPAGITQVRRLTKAQTGDFVSGRPE---DISFLQLEKAADFSVARAVSEQIEGRLSYAFML---NSAVQRTGERVTAEEI 374 (535) T ss_pred ccccccchhhcccCCCceeecCCcc---cceeeecccccchhHHHHHHHHHHHHHHHHHhH---hhhccCCCCCccHHHH Confidence 77654444444432 222223333 33444 233357888899999999999999965 4677899999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) ++|++|++++|||||+||++|||.|+|+|+|++|+|.|+||++|+++ ++++|+|||++++|++++++|.+|++ T Consensus 375 ~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~----v~~~~vs~la~l~r~~~~~~l~~~~~--- 447 (535) T protein:vir:94 375 RYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEA----VEPTISTGMEALGRGQDLDKLERCIA--- 447 (535) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhh----ccceEeehHHHHHHHHHHHHHHHHHH--- Confidence 99999999999999999999999999999999999999999999997 89999999999999999888887766 Q ss_pred HHhccChhhHh-cCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 465 QLAQAKPEALD-KLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 465 ~la~~~P~~~~-~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) .+++++|+++| +||+|++++++++++||| ..++||++|++++|+++++++|++++++++++++++ .+..++. 
T Consensus 448 ~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~------~~~~~~~ 521 (535) T protein:vir:94 448 AWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGT------MATASPE 521 (535) T ss_pred HHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc------ccccChH Confidence 45678899998 599999999999999999 579999999999998887777766665555544433 2344566 Q ss_pred HHHHHHHHhhcCCC Q lcl|NC_019445. 543 VLSAMANAVSGQGG 556 (559) Q Consensus 543 ~~~~~~~~~~~~~~ 556 (559) ..++++++++..++ T Consensus 522 ~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 522 NMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHhccCCC Confidence 66777777777788 No 12 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=6e-154 Score=860.61 Aligned_cols=520 Identities=15% Similarity=0.152 Sum_probs=441.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |++.. ++.+++||+.|+++|++|+++|+||++||+|++.+. ++..+..+..++|||||++|+++|||||||+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPK---ESDNESTDYTTPWQAVGARGLNNLASKLMLA 77 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCC---CCCcccccccccccccHHHHHHHHHHHHHHh Confidence 99864 567889999999999999999999999999998543 2233445567899999999999999999999 Q ss_pred hcCCCCcceeccCCccchh-------hHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceE Q lcl|NC_019445. 
76 ITSPARPWFRLATPDPEMM-------DYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDII 148 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~-------~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~ 148 (559) |||+ +|||||.+.|.++. +.++++.||+.|+++|+.+|++||||.++|++|+||++||||++|+++++++++ T Consensus 78 ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~ 156 (535) T protein:vir:33 78 LFPM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYN 156 (535) T ss_pred hcCC-CcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCce Confidence 9986 79999999986554 456899999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) +|++|||++|||.+|++|+||+|||+|+||+++|+++||.+.+++.++ ++++++++|||||+++. . T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~-----k~~~~~~~v~~~v~~~~---------~ 222 (535) T protein:vir:33 157 PMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGE-----KKMDEMVDVYTHVYLDE---------E 222 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccc-----cccccCCeEEEEEEeeC---------C Confidence 999999999999999999999999999999999999999887765442 34456799999998743 4 Q ss_pred cccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_019445. 
229 NKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAP 308 (559) Q Consensus 229 ~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p 308 (559) +++|.++++..+.......+.+||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++|||+++ T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~ 301 (535) T protein:vir:33 223 SGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRS-YCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN 301 (535) T ss_pred CCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Confidence 567876654332221122334468999999999999999999999 8999999999999999999999999999999999 Q ss_pred CCCccccceecCCceeecCCcCCchhhhhhh-hccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 309 TSLKNQRASLLPGDITYIDQITGQDGFRPAY-LVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 309 ~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~-~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) +++..+..++.||+.+++-. +..+.+.|+. ...++++.+.+.|++++++|+++||.| ++..+++++||||||++| T Consensus 302 ~~g~~~~~~~~~~~~g~~v~-g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~---~~~~~~~~r~TAtEV~~r 377 (535) T protein:vir:33 302 PAGITQPRRLTKAQTGDFVP-GRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN---SAVQRTGERVTAEEIRYV 377 (535) T ss_pred cccccchhhcccCCceeeec-CCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCccccHHHHHHH Confidence 88877788888887654421 2233445553 223578888999999999999999876 667789999999999999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) ++|++++|||||+||++|||.|||+|+|++|+|.|+||++|+++ ++|+|+|||+++||++++++|.+|++. 
++ T Consensus 378 ~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~----v~~~yis~La~aqr~~~~~~l~~~~~~---la 450 (535) T protein:vir:33 378 ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCISA---WA 450 (535) T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc----eeEEEecHHHHHHHHHHHHHHHHHHHH---HH Confidence 99999999999999999999999999999999999999999874 999999999999999999999998765 45 Q ss_pred ccChhhHh-cCCHHHHHHHHHHHcCCCcc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHH Q lcl|NC_019445. 468 QAKPEALD-KLNVDQAIDAFADMSGVSPT-VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLS 545 (559) Q Consensus 468 ~~~P~~~~-~id~d~~~~~~a~~~Gvp~~-~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 545 (559) +++|++++ +||+|++++++++++|||++ +++|++|++++++++++++|++++++++.+. ++.....++++.+ T Consensus 451 ~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~------~~~~~~~~~~~~~ 524 (535) T protein:vir:33 451 ALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAG------VGALATSSPEAMQ 524 (535) T ss_pred hhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhh------hcchhhcCChhHH Confidence 68999998 59999999999999999975 9999999999988766655555544442221 2223444677788 Q ss_pred HHHHHhhcCCC Q lcl|NC_019445. 546 AMANAVSGQGG 556 (559) Q Consensus 546 ~~~~~~~~~~~ 556 (559) ++++.++--+. T Consensus 525 ~~~~~~g~~~~ 535 (535) T protein:vir:33 525 GAAAKAGLNAT 535 (535) T ss_pred HHHHhccCCCC Confidence 88888877777 No 13 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=9e-154 Score=859.62 Aligned_cols=516 Identities=17% Similarity=0.175 Sum_probs=435.9 Q ss_pred CChhh----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhh Q lcl|NC_019445. 
1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGI 76 (559) Q Consensus 1 M~~~~----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l 76 (559) |++.. +++|++||+.|+++|++|+++|+||++||+|+++++.++ .+..+..++|||||++|+++|||||||+| T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD---NASTDYQTPWQAVGARGLNNLASKLMLAL 77 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC---cccccccccccccHHHHHHHHHHHHHHhh Confidence 99943 779999999999999999999999999999998766543 34456678999999999999999999999 Q ss_pred cCCCCcceeccCCccchh-------hHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce-E Q lcl|NC_019445. 77 TSPARPWFRLATPDPEMM-------DYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI-I 148 (559) Q Consensus 77 ~pp~~~Wf~l~~~d~~~~-------~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~-~ 148 (559) ||+ +|||||.+.|+++. +.+++++||+.|+++|+++|++||||.++|++|+||++||||++|++++.+++ . T Consensus 78 tP~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~ 156 (536) T protein:vir:21 78 FPM-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) T ss_pred cCC-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCcee Confidence 986 69999999987654 45689999999999999999999999999999999999999999999988775 4 Q ss_pred EEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 
149 RTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) +|++|||++|||.+|++|+||+|||+|+||+++|+++||.+.+++..+ ++++++|+|||+|+|+.+ T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~-----~~~~~~v~v~~~v~~~~~--------- 222 (536) T protein:vir:21 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGE-----KKADETIDVYTHIYLDED--------- 222 (536) T ss_pred eEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccc-----cccccceeEEEEEEEecC--------- Confidence 589999999999999999999999999999999999999988876644 355678999999999754 Q ss_pred cccEEEEEEEecCCCceeeeecC---cccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 229 NKPFKSVYYEVGGDNDKLLRESG---FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 229 ~~~~~sv~~~~~~~~~~il~esg---~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) +++|. +|++. ++.+++.++| |++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++||+ T Consensus 223 ~~~~~-~~~e~--~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~ 298 (536) T protein:vir:21 223 SGEYL-RYEEV--EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRS-YIEEYLGDLRSLENLQEAIVKMSMISSKVIG 298 (536) T ss_pred CCcEE-EEecc--CCeeeccccCccccccCCeeeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 45664 46665 3456767666 5899999999999999999999 8999999999999999999999999999999 Q ss_pred eecCCCccccceecC---CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH Q lcl|NC_019445. 306 VAPTSLKNQRASLLP---GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) Q Consensus 306 ~~p~~~~~~~~~~~p---g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~ 382 (559) ++++++.....++.| |.++. 
+..++ ..+.|+ ...++++.+.+.|++++++|+++||.| ++.++++++|||| T Consensus 299 lv~p~g~~~~~~~~~~~~g~~v~-g~~~~-v~~~~~-~~~~~~~~~~~~i~~~~~rI~~af~~~---~l~~~~~~r~TAt 372 (536) T protein:vir:21 299 LVNPAGITQPRRLTKAQTGDFVT-GRPED-ISFLQL-EKQADFTVAKAVSDAIEARLSFAFMLN---SAVQRTGERVTAE 372 (536) T ss_pred ccCcccccchhhhccCCCcceec-CCccc-ceeeec-cccccchHHHHHHHHHHHHHHHHHhhh---hcccCCCCCccHH Confidence 887665544444444 43332 23332 223333 334578888999999999999999765 6777999999999 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 383 Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~ 462 (559) ||++|++|++++|||||+||++|||.|+|+|+|++|+|+|+||++|+++ ++++|+|+|++++|+++++++.+|++. T Consensus 373 EV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~----v~~~~vs~l~~l~r~~~~~~l~~~~~~ 448 (536) T protein:vir:21 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCVTA 448 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhh----ccceEEecHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999986 899999999999999999888888765 Q ss_pred HHHHhccChhhHh-cCCHHHHHHHHHHHcCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 463 IGQLAQAKPEALD-KLNVDQAIDAFADMSGV-SPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 463 ~~~la~~~P~~~~-~id~d~~~~~~a~~~Gv-p~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) + ++++|+++| +||+|++++++++++|| |.+++||++||+++|+||++++|++++++++.+.. 
+...+.+ T Consensus 449 l---a~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~------~~~~~~~ 519 (536) T protein:vir:21 449 W---AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM------AAQATAS 519 (536) T ss_pred H---HhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHhcC Confidence 5 567899998 59999999999999999 67999999999999998877777666555443322 1122234 Q ss_pred hhHHHHHHHHhhcCCCC Q lcl|NC_019445. 541 PSVLSAMANAVSGQGGQ 557 (559) Q Consensus 541 ~~~~~~~~~~~~~~~~~ 557 (559) +..++++++.++.++|- T Consensus 520 ~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 520 PEAMAAAADSVGLQPGI 536 (536) T ss_pred hhhHHhhhhccccCCCC Confidence 45555665555555555 No 14 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=1.9e-153 Score=857.91 Aligned_cols=523 Identities=15% Similarity=0.147 Sum_probs=447.6 Q ss_pred CChh-----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAET-----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~-----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |++. .+++|++||++|+++|++|+++|+||++||+|+++++.++. ...+..++|||||++|+++|||||||+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~---~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDN---SSTDYTTPWQAVGARGLNNLSAKVMLA 77 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCc---ccccccccccchHHHHHHHHHHHHHHh Confidence 8883 46789999999999999999999999999999986543322 234456799999999999999999999 Q ss_pred hcCCCCcceeccCCccch-------hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceE Q lcl|NC_019445. 
76 ITSPARPWFRLATPDPEM-------MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDII 148 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~-------~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~ 148 (559) |||+ +|||||.+.|..+ .+.++++.||++|+++|+++|++||||.++|++|+||++||||++|++++.++++ T Consensus 78 ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~ 156 (543) T protein:vir:88 78 LFPL-QSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSN 156 (543) T ss_pred hcCC-CcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccc Confidence 9986 6999999988543 4578999999999999999999999999999999999999999999999998876 Q ss_pred EEE---EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccc Q lcl|NC_019445. 149 RTM---PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) Q Consensus 149 ~~~---~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~ 225 (559) +|. .|||++|+|.+|++|+||+||||+++|+++|.++| ++.+++.+++++ +++|+|||+|+||++++.. T Consensus 157 ~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~-----~~~v~~~~~~~p-~~~~~v~~~V~pr~~~~~~-- 228 (543) T protein:vir:88 157 SYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDV-----RNSLSGGQEYKP-EQELEVYTHIYIDDESGDF-- 228 (543) T ss_pred eecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHh-----hHHHHHHhhcCC-ccceEEEEEEEeecCCCcc-- Confidence 665 48999999999999999999999999999998776 467888887766 4689999999999876643 Q ss_pred ccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 226 ~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) ..++++.|+|++... 
++..|++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++||| T Consensus 229 -~~~~~~~~~~v~~~~------~~~~~~e~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 300 (543) T protein:vir:88 229 -LSYQEIEGVEVDGSD------GQYPQDALPWIAVRWTKRDGEHYGRS-HVEEYLGDLNSLESLNEAMIKFAMISSKVVG 300 (543) T ss_pred -cccccccCeeeecCC------CccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 345677777665321 12236899999999999999999999 8999999999999999999999999999999 Q ss_pred eecCCCccccceecCCceeecCCcCCchhhhhhh-hccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAY-LVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 306 ~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~-~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) ++++++..+..++.||+.+++- ++..+.+.|+. ...++++.+.+.|++++++|+++||.| ++..+++++|||||| T Consensus 301 ~v~~~g~~~~~~~~~~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~r~TAtEV 376 (543) T protein:vir:88 301 LVNPNGITQVRRLVKAQTGDFV-AGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLN---SAVQRSGERVTAEEI 376 (543) T ss_pred eeccccccchhhcccCCCceee-cCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hhccCCCCcccHHHH Confidence 9998887778888898776652 22334455553 234578889999999999999999876 556789999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) ++|++|++++|||||+||++|||.|+|+|+|++|+|.|+||++|+++ ++++|+|+|++++|++++++|.+++++++ T Consensus 377 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~----v~~~~vs~l~~l~r~~~~~~l~~~~~~v~ 452 (543) T protein:vir:88 377 RYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEA----VEPTVTTGAEALGRGQDLDKLTQFLNAVA 452 (543) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhc----eeeeEEecHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999874 89999999999999999999999999999 Q ss_pred HHhccChhhHhcCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhH Q lcl|NC_019445. 465 QLAQAKPEALDKLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSV 543 (559) Q Consensus 465 ~la~~~P~~~~~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 543 (559) .+++ |+++|+||+|++++++++++||| .+++|+++|++++|+||+++|++++++++ ..+..|+. ...++.+ T Consensus 453 ~~~~--p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~--~~~~~~~~----~~~~~~~ 524 (543) T protein:vir:88 453 TVSQ--LNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAG--IGSGVAAQ----ATASPEA 524 (543) T ss_pred hccc--hhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH--Hhhchhhh----hccChHH Confidence 9876 89999999999999999999995 57999999999998877655544433332 22222333 2234578 Q ss_pred HHHHHHHhhcCCCCCC Q lcl|NC_019445. 544 LSAMANAVSGQGGQSQ 559 (559) Q Consensus 544 ~~~~~~~~~~~~~~~~ 559 (559) +++|+..+++++|++. 
T Consensus 525 ~~~~~~~~~~~~~p~~ 540 (543) T protein:vir:88 525 MESAMDTAGVQPGPIA 540 (543) T ss_pred HHHHhhhcCCCCCCCC Confidence 8888888888888887 No 15 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=6.7e-153 Score=854.87 Aligned_cols=516 Identities=16% Similarity=0.170 Sum_probs=434.4 Q ss_pred CChhh----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhh Q lcl|NC_019445. 1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGI 76 (559) Q Consensus 1 M~~~~----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l 76 (559) |++.. +++|++||+.|+++|++||++|+||++||+|+++++.++ .++.+..++|||||++|+++|||||||+| T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD---NASTDYQTPWQAVGARGLNNLASKLMLAL 77 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCC---cccccccccccccHHHHHHHHHHHHHhhh Confidence 99943 789999999999999999999999999999998766543 34456778999999999999999999999 Q ss_pred cCCCCcceeccCCccchh-------hHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce-E Q lcl|NC_019445. 77 TSPARPWFRLATPDPEMM-------DYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI-I 148 (559) Q Consensus 77 ~pp~~~Wf~l~~~d~~~~-------~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~-~ 148 (559) ||+ +|||||.+.|+++. +.+++++||+.|+++|+++|++||||.++|++|+||++||||++|++++.+++ . 
T Consensus 78 tP~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~ 156 (536) T protein:vir:10 78 FPM-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) T ss_pred cCC-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCcee Confidence 976 69999999987654 45689999999999999999999999999999999999999999999988775 4 Q ss_pred EEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) +|++|||++|||.+|++|+||+|||+|+||+++|+++||++.+++..+ ++++++|+|||||+|+.+ T Consensus 157 ~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~-----~~~~~~v~v~~~V~~~~~--------- 222 (536) T protein:vir:10 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGE-----KKADETIDVYTHIYLDEA--------- 222 (536) T ss_pred eEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccc-----cCcccceEEEEEEEEecC--------- Confidence 589999999999999999999999999999999999999988876643 345678999999999854 Q ss_pred cccEEEEEEEecCCCceeeeecC---cccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 229 NKPFKSVYYEVGGDNDKLLRESG---FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 229 ~~~~~sv~~~~~~~~~~il~esg---~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) +++|. 
+|.+++ +.+++.++| |++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++||+ T Consensus 223 ~~~~~-~~~e~~--g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~ 298 (536) T protein:vir:10 223 SGEYL-RYEEVE--GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRS-YIEEYLGDLRSLENLQEAIVKMSMISSKVIG 298 (536) T ss_pred CCcEE-EEEeec--CccccccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 34564 455554 345555444 6899999999999999999999 8999999999999999999999999999999 Q ss_pred eecCCCccccceecC---CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH Q lcl|NC_019445. 306 VAPTSLKNQRASLLP---GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) Q Consensus 306 ~~p~~~~~~~~~~~p---g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~ 382 (559) ++++++.....++.| |.++. +..++ ..+.|+ ...++++.+.+.|++++++|+++||.| ++..+++++|||| T Consensus 299 lv~p~g~~~~~~~~~~~~g~~v~-g~~~~-v~~~~~-~~~~~~~~~~~~i~~~~~rI~~af~~~---~l~~~~~~r~TAt 372 (536) T protein:vir:10 299 LVNPAGITQPRRLTKAQTGDFVT-GRPED-ISFLQL-EKQADFTVAKAVSDAIEARLSFAFMLN---SAVQRTGERVTAE 372 (536) T ss_pred ccCcccccchhhhccCCCcceec-CCccc-ceeeec-cccccchHHHHHHHHHHHHHHHHHhhh---hcccCCCCCccHH Confidence 987665544445444 43332 23332 223333 334578888999999999999999765 6777999999999 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 383 Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~ 462 (559) ||++|++|++++|||||+||++|||.|+|+|+|++|+|+|+||++|+++ ++++|+|+|++++|+++++++.+|++. 
T Consensus 373 EV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~----v~~~~vs~l~~l~r~~~~~~l~~~~~~ 448 (536) T protein:vir:10 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA----VEPTISTGLEAIGRGQDLDKLERCVTA 448 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhh----ccceEEecHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999986 899999999999999998888887765 Q ss_pred HHHHhccChhhHh-cCCHHHHHHHHHHHcCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 463 IGQLAQAKPEALD-KLNVDQAIDAFADMSGV-SPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 463 ~~~la~~~P~~~~-~id~d~~~~~~a~~~Gv-p~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) + ++++|+++| .||+|++++++++++|| |.+++||++||+++|+||++++|++++++++.+.. +...+.+ T Consensus 449 l---a~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~------~~~~~~~ 519 (536) T protein:vir:10 449 W---AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM------AAQATAS 519 (536) T ss_pred H---HhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHhcC Confidence 5 567899998 49999999999999999 67999999999999998877777666555443322 1122234 Q ss_pred hhHHHHHHHHhhcCCCC Q lcl|NC_019445. 541 PSVLSAMANAVSGQGGQ 557 (559) Q Consensus 541 ~~~~~~~~~~~~~~~~~ 557 (559) +.+++++++.++.++|- T Consensus 520 ~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 520 PEAMAAAADSVGLQPGI 536 (536) T ss_pred chhHHhhhhccccCCCC Confidence 55566666555555555 No 16 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=6.6e-152 Score=849.40 Aligned_cols=512 Identities=18% Similarity=0.177 Sum_probs=428.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |. +++||+.|+++|++|+++|+||++||+|+++++.+.. ..+..+..++|||||++|+++||||||++||||+ T Consensus 1 m~------~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~-~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 73 (522) T protein:vir:10 1 MK------ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISS-RPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQ 73 (522) T ss_pred Cc------hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCC-CcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 32 6689999999999999999999999999998875533 3455667789999999999999999999999999 Q ss_pred CcceeccCCccchhhH------HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEee Q lcl|NC_019445. 81 RPWFRLATPDPEMMDY------GPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFP 154 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~------~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~ 154 (559) +|||||.+.|+++.+. +++++||+.|+++++++|++||||.++|++|+||++||||++|+++++ |++|| T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~-----~~~~p 148 (522) T protein:vir:10 74 TSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDG-----LKTFP 148 (522) T ss_pred CccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCC-----ceEEE Confidence 9999999998766553 579999999999999999999999999999999999999999999874 56799 Q ss_pred ccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 
155 IGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 155 l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) |++|||++|++|+||+|||||+||+++|+++||.+++|+.++.+ ++++++|+|||||+|+.+++ .| + T Consensus 149 l~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~---~~~~~~v~v~~~v~p~~~~~---------~~-~ 215 (522) T protein:vir:10 149 LTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDES---STTNDDVTIYTYVKLDKSSG---------RW-V 215 (522) T ss_pred cceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcc---cCCCCceEEEEEEEeeccCC---------ce-E Confidence 99999999999999999999999999999999999999988754 34567899999999986532 12 2 Q ss_pred EEEEecCCCcee--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc Q lcl|NC_019445. 235 VYYEVGGDNDKL--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) Q Consensus 235 v~~~~~~~~~~i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~ 312 (559) +|.+.. +...+ .+++||++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++|||++++++. T Consensus 216 ~~~~~~-~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~ 293 (522) T protein:vir:10 216 WHQEAF-DKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRG-RVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSST 293 (522) T ss_pred EEEccC-CccccccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccc Confidence 333332 22223 357799999999999999999999999 89999999999999999999999999999999988887 Q ss_pred cccceecCCceeecCCcCCchhhhhhhhc-cccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHH Q lcl|NC_019445. 313 NQRASLLPGDITYIDQITGQDGFRPAYLV-NPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (559) Q Consensus 313 ~~~~~~~pg~~~~~~~~~~~~~~~p~~~~-~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~ 391 (559) .+..++.||+.+++. 
.+..+.+.|...+ ..+++.+.+.|++++++|+++|| ++..+++++||||||++|++|+ T Consensus 294 ~~~~~l~~~~~~~~v-~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvTAtEV~~r~~E~ 367 (522) T protein:vir:10 294 TKPATIAKAGNGAIV-QGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVTAEEVRLTQLEL 367 (522) T ss_pred cccccccCCCCccee-cCCCccceeecccccccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCCHHHHHHHHHHH Confidence 778888888777663 2334445555433 45788889999999999999995 3468899999999999999999 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccCh Q lcl|NC_019445. 392 LLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKP 471 (559) Q Consensus 392 ~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P 471 (559) +++|||||+||++|||.|+|+|+|++|+|.|+||++|+++. ....++|+|+|+++|+ ++++.+|++.+++++. +| T Consensus 368 ~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~-~~~~v~~is~Laraq~---~~~l~~~~~~i~~~~~-p~ 442 (522) T protein:vir:10 368 EQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIV-RPTIVAGVNALGRGQD---RESLTAFVGTIAQTLG-PE 442 (522) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccc-ccccccchhHHHHHHH---HHHHHHHHHHHHHhhC-ch Confidence 99999999999999999999999999999999999999984 4567999999998775 5677788887776643 45 Q ss_pred hhHhcCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHH Q lcl|NC_019445. 
472 EALDKLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANA 550 (559) Q Consensus 472 ~~~~~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 550 (559) .++|+||+|++++++++++||| +.++||++||+++||+++|+++++++++++ ++.++....-++.+++++ ++ T Consensus 443 ~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a---~~~~~~~~~~~~~~~~~~----~~ 515 (522) T protein:vir:10 443 ALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQA---GQMTGSPLMDPTKNPQLM----DE 515 (522) T ss_pred hhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHH---HHHhcccccCccccHHHH----HH Confidence 5678999999999999999999 689999999999998887777776655554 344444333333334333 33 Q ss_pred hhcCCCC Q lcl|NC_019445. 551 VSGQGGQ 557 (559) Q Consensus 551 ~~~~~~~ 557 (559) .+..+-. T Consensus 516 ~~~~~~~ 522 (522) T protein:vir:10 516 EQPPMEE 522 (522) T ss_pred hCCCCCC Confidence 3322222 No 17 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=3.1e-151 Score=845.69 Aligned_cols=509 Identities=15% Similarity=0.134 Sum_probs=420.8 Q ss_pred CChhh-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |++.. +++|++||+.|+++|++|+++|+||++||+|+++++. 
++.++++..++|||||++|+++||||||++ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 77 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCC---CCcchhhccccccchHHHHHHHHHHHHHHh Confidence 99964 7889999999999999999999999999999986543 345667788999999999999999999999 Q ss_pred hcCCCCcceeccCCccchhh-------HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC---C Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD---E 145 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~-------~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~---~ 145 (559) ||||++|||||.+.|+++.+ .++|+.||++||++|+++|++||||.++|++|+||++||||++|++++. + T Consensus 78 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~ 157 (532) T protein:vir:99 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) T ss_pred hcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccC Confidence 99999999999999876544 4789999999999999999999999999999999999999999998643 4 Q ss_pred ceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhc----CCCCceEEEEEEEeecCccc Q lcl|NC_019445. 
146 DIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWES----GTYEKWIEVMHSVYPNIDRD 221 (559) Q Consensus 146 ~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~----~~~~~~v~v~~~v~p~~~~~ 221 (559) +.++|++|||++|||++|++|+||+||||++++++++ |+++++.+++ ++++.+|+|||+|+|+++ T Consensus 158 ~~~~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l---------~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~-- 226 (532) T protein:vir:99 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAAL---------PEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPE-- 226 (532) T ss_pred cccceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhc---------ChHHHHHhhccccccCCCcceEEEEEEEecCC-- Confidence 6789999999999999999999999999987776665 5555555443 355678999999999764 Q ss_pred ccccccccccEEEEEEEecCCCceeeeec--CcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 222 TSKLDSKNKPFKSVYYEVGGDNDKLLRES--GFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDK 299 (559) Q Consensus 222 ~~~~~~~~~~~~sv~~~~~~~~~~il~es--g~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~ 299 (559) +++|.++|+.. + ..+++++| +|++|||+++||++.+|++|||| |++++|||+|+||.|++++++++++ T Consensus 227 -------~~~~~~~~~~~-g-~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~ 296 (532) T protein:vir:99 227 -------AMVFRSYQEID-G-EIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRS-FVEEYLGDLKSLENLYEAIVKMSMI 296 (532) T ss_pred -------CCeeEEEEeec-C-ceecccccccccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHH Confidence 47888876553 2 23455555 47899999999999999999999 8999999999999999999999999 Q ss_pred HhcCceeecCCCccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
300 ATNPPMVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 300 ~~~p~~~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +++|||++++++.....++.||+..++- ++..+.+.|+..++ .+++.+.+.|++++++|+++||.| ++..+++++ T Consensus 297 a~~~~~lv~p~g~~~~~~~~~~~~g~~v-~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~d~~r 372 (532) T protein:vir:99 297 SSKVLFFVNPNGVTQIRRVAKANTGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDR 372 (532) T ss_pred HcCCCceeccccccchhhhccCCCccee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCc Confidence 9999999987766555666554432221 12234466665443 478889999999999999999766 677799999 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) ||||||++|++|++++|||||+||++|||.|+|+|+|++|+|.|+||++|+++.+..+ ++|+|+|+++|+ ++++ T Consensus 373 ~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~i-v~~is~Laraq~---~~~l-- 446 (532) T protein:vir:99 373 VTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHD---LNKL-- 446 (532) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccce-eecchHHHHHHH---HHHH-- Confidence 9999999999999999999999999999999999999999999999999999987766 789999998876 3344 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAK 537 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~ 537 (559) +++++.|+++.|+++|+||+|++++++++++||| +.++||+||++++|+++++||+++++.+++.++++.+... 
T Consensus 447 -~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~---- 521 (532) T protein:vir:99 447 -NVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA---- 521 (532) T ss_pred -HHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcch---- Confidence 4455566778899999999999999999999995 6899999999999988777776666555555444322110 Q ss_pred CCChhHHHHHHHHhhcCCCC Q lcl|NC_019445. 538 TSDPSVLSAMANAVSGQGGQ 557 (559) Q Consensus 538 ~~~~~~~~~~~~~~~~~~~~ 557 (559) +++.-.|.+.+ T Consensus 522 ---------~~~~~~~~~~~ 532 (532) T protein:vir:99 522 ---------MMQQQAGMPTQ 532 (532) T ss_pred ---------hHHhhcCCCCC Confidence 11111111111 No 18 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=1.7e-150 Score=841.65 Aligned_cols=497 Identities=13% Similarity=0.127 Sum_probs=422.8 Q ss_pred hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcce Q lcl|NC_019445. 5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) Q Consensus 5 ~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf 84 (559) .++++++||++|| |++||++|+||++||+|+++++.+++ .+.+..++|||||++|+++||||||++||||++||| T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSG---SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCc---ccccccCcccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 5677889999996 99999999999999999987655443 233456799999999999999999999999999999 Q ss_pred eccCCccchh-------hHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 
85 RLATPDPEMM-------DYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 85 ~l~~~d~~~~-------~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) ||++.|..+. +.+++++||++|+++|+.+|++||||.++|++|+||++|||+++|++++.. +|+.|||++ T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~---~~~~~pl~~ 152 (510) T protein:vir:78 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) T ss_pred ccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCC---eEEEEEcce Confidence 9999886553 356799999999999999999999999999999999999999999987754 578899999 Q ss_pred EEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 158 YYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 158 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) |||.+|++|+||+|||||+||+++|.++||.+.++ ...+++++++|+|||+|+|++ .++|||.|+|+ T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~-----~~~~~~~~~~v~v~~~V~~~~--------~~~~~~~sv~~ 219 (510) T protein:vir:78 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMR-----AGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhh-----hhhccCCCceEEEEEEEEeec--------CCCCcEEEEEE Confidence 99999999999999999999999999999976553 223445677899999999864 46899999999 Q ss_pred EecCCCceeeeecC--cccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc Q lcl|NC_019445. 238 EVGGDNDKLLRESG--FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR 315 (559) Q Consensus 238 ~~~~~~~~il~esg--~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~ 315 (559) |+++. +++++|+ |++|||+++||++.+||+|||| |++++|||+|+||.|+++.+++++++++|||++++++.... 
T Consensus 220 e~dg~--~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~ 296 (510) T protein:vir:78 220 EIDGV--RVGETGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) T ss_pred EecCe--eeccccccccccCCeeeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccch Confidence 98654 5667766 5899999999999999999999 99999999999999999999999999999999988776666 Q ss_pred ceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHH Q lcl|NC_019445. 316 ASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) Q Consensus 316 ~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~ 394 (559) .++.+|+..++ .+++.+.+.|+..+. .+++.+.+.|++++++|+++||.| +.++++++||||||++|++|++++ T Consensus 297 ~~l~~~~~g~~-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~----l~~~~~~rvTAtEV~~r~~E~~~~ 371 (510) T protein:vir:78 297 DDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENT 371 (510) T ss_pred hhhccCCCcee-ecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhhc----cccCCCCCcCHHHHHHHHHHHHHH Confidence 66776654333 133455678876543 578888999999999999999754 567899999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhH Q lcl|NC_019445. 395 LGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEAL 474 (559) Q Consensus 395 LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~ 474 (559) |||||+||++|||.|||+|+|++|++.|++|++|+. 
+++..++|+|+|+++|+..++.++.++++.+++++++.| T Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~--~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~--- 446 (510) T protein:vir:78 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP--- 446 (510) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCccc--ccceeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhh--- Confidence 999999999999999999999999999988887765 446679999999999999998888888888887766655 Q ss_pred hcCCHHHHHHHHHHHcCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 475 DKLNVDQAIDAFADMSGV-SPTVIVPQEQVDQARQQRAQQQQQQQMMAM--GMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 475 ~~id~d~~~~~~a~~~Gv-p~~~~rs~~ev~~~rq~r~q~~q~~~~~~~--~~~~~~~a~~~~~~~~~~ 540 (559) .||+|++++++++++|| |..++||+|||+++|++|+||+++++++++ ++++++.|++ ..+= T Consensus 447 -~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~----~~g~ 510 (510) T protein:vir:78 447 -RISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA----LAGV 510 (510) T ss_pred -cCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc----CCCC Confidence 68999999999999999 568999999999999988766665554443 2333333333 2111 No 19 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=6.5e-150 Score=838.47 Aligned_cols=501 Identities=14% Similarity=0.136 Sum_probs=419.7 Q ss_pred hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcce Q lcl|NC_019445. 
5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) Q Consensus 5 ~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf 84 (559) .++++++||++|| |++||++|+||++||+|+++++.+++ .+.+..++|||||++|+++||||||++||||++||| T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSG---SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCc---cccccCCCccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 6778999999996 99999999999999999987665443 234557799999999999999999999999999999 Q ss_pred eccCCccch-------hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 85 RLATPDPEM-------MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 85 ~l~~~d~~~-------~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) ||++.|..+ .+.+++++||+.||++|+.+|++||||.++|++|+||++|||+++|+++|. .+|++|||++ T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~~---~~~~~~pl~~ 152 (510) T protein:vir:63 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSDA---ATVVAWSLRS 152 (510) T ss_pred ccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCCC---cEEEEEEcce Confidence 999988654 345679999999999999999999999999999999999999999998764 4688999999 Q ss_pred EEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 
158 YYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 158 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) |||.+|++|+||+||||++||+++|.++|+.+.++ +..+ ++++++|+|||+|+|++ .++|||.|||+ T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~----~~~~-~~~~~~v~v~~~V~~~~--------~~~~~~~sv~~ 219 (510) T protein:vir:63 153 YAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMR----AGRN-LSGSGSVDLYTHVQRKK--------GTAMEYAELYH 219 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhc----cccc-cCCCcceEEEEEEEeec--------CCCceEEEEEE Confidence 99999999999999999999999999999866543 2233 45567899999999764 46899999999 Q ss_pred EecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccce Q lcl|NC_019445. 238 EVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRAS 317 (559) Q Consensus 238 ~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~ 317 (559) +.++......+++||++|||+++||++.+||+|||| |++++|||+|+||.|+++.+++++++++|||++++++.....+ T Consensus 220 e~dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~ 298 (510) T protein:vir:63 220 EIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD 298 (510) T ss_pred EecCceeccccccccccCceeeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhh Confidence 987654333445567999999999999999999999 9999999999999999999999999999999998877666667 Q ss_pred ecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhh Q lcl|NC_019445. 318 LLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLG 396 (559) Q Consensus 318 ~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG 396 (559) +.||+..++ .++..+.+.|+..+. 
.+++.+.+.|++++++|+++||.| +.++++++||||||++|++|++++|| T Consensus 299 ~~~~~~g~~-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~----l~~~~~~rvTAtEV~~r~~E~~~~LG 373 (510) T protein:vir:63 299 YQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) T ss_pred hccCCCcee-ecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh----cccCCCCCcCHHHHHHHHHHHHHHhh Confidence 777664433 233445677766443 578888999999999999999753 56799999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhc Q lcl|NC_019445. 397 PVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDK 476 (559) Q Consensus 397 ~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~ 476 (559) |||+||++|||.|+|+|+|++|++.|++|+||+.+. ...++|+|+|+++|+..++.++.++++.+++++++.| + T Consensus 374 pv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~--~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~----~ 447 (510) T protein:vir:63 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHK--PAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP----R 447 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcc--cceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhc----c Confidence 999999999999999999999999998887777653 4558899999999988888888888888777666555 7 Q ss_pred CCHHHHHHHHHHHcCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 
477 LNVDQAIDAFADMSGV-SPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 477 id~d~~~~~~a~~~Gv-p~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) ||+|++++++++++|| |..++||++||++++++++||+++++++++ .+.++|+.++.+..+= T Consensus 448 id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~--~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQE--TLLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhcccccCC Confidence 9999999999999999 568999999999998775554444433332 2233444433333321 No 20 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=4.7e-149 Score=833.76 Aligned_cols=498 Identities=14% Similarity=0.133 Sum_probs=419.9 Q ss_pred CChhhHHHHHHHHHHH--HHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQL--ESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l--~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |. +++..| |.+|++|+++|+||++||+|++.+++.++.+.+ .+..++|||||++|+++||||||++||| T Consensus 1 m~--------~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~-~~~~~~~dstg~~a~~~LAa~l~~~ltp 71 (514) T protein:vir:80 1 MR--------QQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQA-EVVEYDFQSAGAFLVNNLTAKLALTLFP 71 (514) T ss_pred Cc--------cchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccc-cccccccchhHHHHHHHHHHHHHhhhcC Confidence 33 333334 667999999999999999999987766654443 3457789999999999999999999999 Q ss_pred CCCcceeccCCcc-------chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 
79 PARPWFRLATPDP-------EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 79 p~~~Wf~l~~~d~-------~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) |++|||||++.|+ +..+..++++||++|+++|+++|++||||.++|++|+||++||||++|++++.. +|+ T Consensus 72 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~---~~~ 148 (514) T protein:vir:80 72 PGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTG---KML 148 (514) T ss_pred CCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCC---cEE Confidence 9999999998763 334567899999999999999999999999999999999999999999988754 477 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +|||++|||.+|++|+||+||||++||++++.++|+.+. .++..++++ +.+|+|||||+++++ +++| T Consensus 149 ~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~----~~~~~~~~~-~~~v~v~~~v~~~~~--------~~~~ 215 (514) T protein:vir:80 149 VWTMQSYTVRRTSHGDPAVVVLRQQMPFRELTPEIQADA----QAKQIAKRD-SDKCDLYTVIEWQPT--------PNGK 215 (514) T ss_pred EEEcCeEEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhh----hhhhccCCC-CCceEEEEEEEeecC--------CCCe Confidence 899999999999999999999999999999998887543 333444444 567999999998753 5789 Q ss_pred EEEEEEEecCCCceeeeecCc--ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_019445. 
232 FKSVYYEVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPT 309 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~--~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~ 309 (559) |.|+|.+.+ +.+++++||| ++|||+++||++.+||+|||| |++++|||+|+||.|+++.+++++++++|||++++ T Consensus 216 ~~sv~~e~~--g~~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrg-p~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~ 292 (514) T protein:vir:80 216 RCAVWHELE--GKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRG-YVEEYSGDFARLSILSERLGLYEFEALSLLNLVDE 292 (514) T ss_pred EEEEEEecc--ceeecccCccccccCCeeeeeeEecCCCCcccc-hHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCc Confidence 999998874 4568999998 789999999999999999999 99999999999999999999999999999999988 Q ss_pred CCccccceecCCceeecCCcCCchhhhhhhhc-cccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 310 SLKNQRASLLPGDITYIDQITGQDGFRPAYLV-NPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 310 ~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~-~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) ++.....++.||+..++- ++..+.+.|+..+ ..+++.+++.|++++++|+++||. +. ..+++++||||||++|+ T Consensus 293 ~g~~~~~~l~~~~~g~~v-~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml--~~--~~rd~~rvTAtEV~~r~ 367 (514) T protein:vir:80 293 AKGGAVDDYRDAETGDFV-PGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMY--TG--QVRDAERVTVEEIRTVA 367 (514) T ss_pred ccccchhhhcccCCceee-cCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhh--hc--cCCCCCCCCHHHHHHHH Confidence 877667777776644331 3345567776644 357888899999999999999973 22 33789999999999999 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVR--KNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r--~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) +|++++|||||+||++|||.|+|+|+|++|+| .|.||++|+++ ++++|+|+|++++|++++++|.+|+++++++ T Consensus 368 ~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l----~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l 443 (514) T protein:vir:80 368 EEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGV----YRPSIITGIPALTRNIETANILRATQEASAI 443 (514) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchh----hcceeeecHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999987 48999999987 7899999999999999999999999999999 Q ss_pred hccChhhHhcCCHHHHHHHHHHHcCCCcc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhhh Q lcl|NC_019445. 467 AQAKPEALDKLNVDQAIDAFADMSGVSPT-VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-GAKTLSE 535 (559) Q Consensus 467 a~~~P~~~~~id~d~~~~~~a~~~Gvp~~-~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~a~~~~~ 535 (559) ++++|+++|+||+|++++++++++|||++ +++++|++++.++++++++|+||++++++++++ .++.+-. T Consensus 444 ~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 444 VPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred hccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 99999999999999999999999999986 555555555444444455555555555555543 2233222 No 21 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=1.1e-148 Score=831.81 Aligned_cols=498 Identities=9% Similarity=0.093 Sum_probs=430.8 Q ss_pred CChh------hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAET------TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~------~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) |-+. .+++|++||+.|+++|++|+++|+||++||+|++++..+ +.++.+++|||||++|+++||||||+ T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~-----~~~~~~~~~dstg~~a~~~LAa~l~~ 75 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKG-----DNETSQNGWQGVGAQATNHLANKLAQ 75 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCC-----CcccccccccchHHHHHHHHHHHHHH Confidence 5554 578999999999999999999999999999998753222 22345679999999999999999999 Q ss_pred hhcCCCCcceeccCCccch-------hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEM-------MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI 147 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~-------~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~ 147 (559) +||||++|||||+++|... .+..++++||+.|++.++.+|++||||.++|++|+||++||||++|++++. + T Consensus 76 ~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~--~ 153 (515) T protein:vir:70 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG--A 153 (515) T ss_pred hhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCC--C Confidence 9999999999999887543 456789999999999999999999999999999999999999999998654 3 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) |++|||++|||.+|++|+||+|||||+||+++|+++||.+.++..+.. +. +++.+|+|||+|+|+. 
T Consensus 154 --~~~~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~--~~-~~~~~v~i~~~v~~~~--------- 219 (515) T protein:vir:70 154 --MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGK--KC-KEDDNVKLYTHAQYAG--------- 219 (515) T ss_pred --eEEEEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhh--hc-CCCCceEEEEEEEecC--------- Confidence 667999999999999999999999999999999999998776655433 33 3356799999999864 Q ss_pred ccccEEEEEEEecCCCceeeeecCc--ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 228 KNKPFKSVYYEVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~--~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) ++|.++|.+.++ .++++|||| ++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++||| T Consensus 220 --~~~~~~~~e~d~--~~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~ 294 (515) T protein:vir:70 220 --EGFWKINQSADD--IPVGKESRIKSEKLPFIPLTWKRSYGEDWGRP-LAEDYSGDLFVIQFLSEAMARGAALMADIKY 294 (515) T ss_pred --CCceEEEEecCc--eeeccccccccccCCceeeeeeecCCCCcccc-hHHHhhHHHHHHHHHHHHHHHHHHHhcCCCe Confidence 356678877744 578899985 899999999999999999999 9999999999999999999999999999999 Q ss_pred eecCCCccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 
306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 306 ~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) ++++++..+..++.||+..++ .++..+.+.|+..++ .+++.+++.|++++++|+++||.| .+..+++++|||||| T Consensus 295 lv~~~g~~~~~~l~~~~~g~i-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~---~l~~rd~~rvTAtEV 370 (515) T protein:vir:70 295 LIRPGSQTDVDHFVNSGTGEV-ITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEI 370 (515) T ss_pred eeCcccccchhhccccCCcee-ecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh---hhhccCCccccHHHH Confidence 999888777777888775444 234456677776553 478889999999999999999987 456789999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) ++|++|++++|||||+||+.|||.||+.|+ +.+.+|++|+++ ++++|+|+|++++|++++++|.+|+++++ T Consensus 371 ~~r~~E~~~~LGpv~srL~~Ell~Pli~r~-----~~~~~p~~P~~~----v~~~~vs~l~~L~r~q~~~~i~~~~q~i~ 441 (515) T protein:vir:70 371 QRDALEIEQNMGGVYSLFAMTMQTPIAMWG-----LQEAGDSFTSEL----VDPVIVTGIEALGRMAELDKLANFAQYMS 441 (515) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHH-----HHhhCCCCChhh----cccceehhHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999875 367889999886 88999999999999999999999999999 Q ss_pred HHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhhhhc Q lcl|NC_019445. 465 QLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-GAKTLSEAK 537 (559) Q Consensus 465 ~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~a~~~~~~~ 537 (559) .+++++|+++++||+|++++++++.+|+|.+++||++||+++|++|+|+||+++.++++++++. 
.++..+..+ T Consensus 442 ~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 9999999999999999999999999999999999999999999999888887776666554432 222211111 No 22 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=5.3e-148 Score=828.02 Aligned_cols=504 Identities=17% Similarity=0.172 Sum_probs=416.8 Q ss_pred CChhh---HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETT---KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~---~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |++++ ++++++||+.|+++|++|+++|+||++||+|+++++.++. +..+..++|||||++|+++||||||++|| T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~---~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDN---SSTEYTTPWQAVGARCLNNLAAKLMLALF 77 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCc---ccccccccccccHHHHHHHHHHHHHhhcC Confidence 99974 6789999999999999999999999999999987654432 34456789999999999999999999999 Q ss_pred CCCCcceeccCCcc-------chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC-ceEE Q lcl|NC_019445. 78 SPARPWFRLATPDP-------EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE-DIIR 149 (559) Q Consensus 78 pp~~~Wf~l~~~d~-------~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~-~~~~ 149 (559) | ++|||||.+.|. +..+.+++++||++|+++|+++|++||||.++|++|+||++||||++|++++.. 
++.+ T Consensus 78 P-~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~ 156 (522) T protein:vir:94 78 P-QSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSP 156 (522) T ss_pred C-CCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceee Confidence 6 679999988753 456678899999999999999999999999999999999999999999988764 4678 Q ss_pred EEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcC--CCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 150 TMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESG--TYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 150 ~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~--~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) |++|||++|||++|++|+||+||||+++++++| |+++++.++++ +++++|+|||+|+|+.++. T Consensus 157 ~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l---------~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~------ 221 (522) T protein:vir:94 157 MRMYRLVSYVVQRDAFGNILQIVTIDKVAFSAL---------PEDVKSQLNADDYEPDTELEVYTHIYRQDDEY------ 221 (522) T ss_pred EEEEEcceEEEeeCCCcCeEEEeeeeeccHHhc---------chHHHHHHhcccCCccceEEEEEEEEeeCCce------ Confidence 999999999999999999999999999998775 55666666543 3467999999999976531 Q ss_pred ccccEEEEEEEecCCCceeeeec--CcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 
228 KNKPFKSVYYEVGGDNDKLLRES--GFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~es--g~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) ++|.+..+ ..+.+++| ||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++||| T Consensus 222 ------~~~~~~~g-~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~ 293 (522) T protein:vir:94 222 ------LRYEEVEG-IEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRS-YCEEYLGDLNSLETITEAITKMAKVASKVVG 293 (522) T ss_pred ------eEEeeccC-ceecccCCCCccccCCceeeeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Confidence 23344433 23345555 68999999999999999999999 8999999999999999999999999999999 Q ss_pred eecCCCccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 306 ~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) ++++++.....++.||+..++- ++..+.++|+..++ .+++.+.+.|++++++|+++||.| ++..+++++|||||| T Consensus 294 ~v~~~g~~~~~~~~~~~~g~~v-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~r~TAtEV 369 (522) T protein:vir:94 294 LVNPNGITQPRRLNKAATGEFV-AGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN---SAVQRNAERVTAEEI 369 (522) T ss_pred eecccccccchheeccCCceee-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hhccCCCccccHHHH Confidence 9998887777777776654442 12234455555333 478888999999999999999876 566789999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) ++|++|++++|||||+||++|||.|||+|+|++|+|+|+||++|+++ ++|+|+|||+++||++++++|.+|++.++ T Consensus 370 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~----v~v~~~s~La~~qr~~~~~~l~~~~~~ia 445 (522) T protein:vir:94 370 RYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEA----VEPTVSTGLEALGRGQDLEKLTQAVNMMT 445 (522) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc----EEeeEecHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999885 99999999999999999999999988664 Q ss_pred HHhccChhhHh-cCCHHHHHHHHHHHcCCC-ccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 465 QLAQAKPEALD-KLNVDQAIDAFADMSGVS-PTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 465 ~la~~~P~~~~-~id~d~~~~~~a~~~Gvp-~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) +++|++++ +||+|++++++++++||| +.++||++|++++++|+++++++++++.+. +...+|+....++ T Consensus 446 ---~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~-~~~~~a~~~~~~~----- 516 (522) T protein:vir:94 446 ---GLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAA-GANMGAAVGQGAG----- 516 (522) T ss_pred ---hccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhhhhhcccc----- Confidence 56889875 899999999999999995 679999999999988765554443333222 1122222211111 Q ss_pred HHHHHHHH Q lcl|NC_019445. 
543 VLSAMANA 550 (559) Q Consensus 543 ~~~~~~~~ 550 (559) +.|.++ T Consensus 517 --~~~~~~ 522 (522) T protein:vir:94 517 --EDMAQA 522 (522) T ss_pred --hhhhcC Confidence 222211 No 23 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=3.4e-148 Score=829.05 Aligned_cols=503 Identities=11% Similarity=0.090 Sum_probs=428.1 Q ss_pred CChh---hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAET---TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~---~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |+=. .+++|++||+.|+++|++|+++|+||++||+|++++..+++ .+..++|||||++|+++||||||++|| T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~-----~~~~~~~dstg~~a~~~LAa~l~~~lt 75 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD-----LSSQNAWQDDGASATNFLSNKLSQVLF 75 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC-----ccccccccchHHHHHHHHHHHHHHhhc Confidence 4433 57899999999999999999999999999999987654432 234679999999999999999999999 Q ss_pred CCCCcceeccCCccchhh-------HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~-------~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) ||++|||||++.|+.+.+ .++++.||+.||++|+++|++||||.++|++|+||++||||++|+++.. 
.+| T Consensus 76 pp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~---~~~ 152 (517) T protein:vir:10 76 PAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDKT---SPI 152 (517) T ss_pred CCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCCC---CcE Confidence 999999999999876554 4679999999999999999999999999999999999999999986543 357 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) ++|||++|||.+|++|+||+||||+++|+++|+++||.+..+...+. +. +++.+|+|||+|+|+.+. T Consensus 153 ~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~--~~-~~~~~v~v~~~v~~~~~~---------- 219 (517) T protein:vir:10 153 QAVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGK--QY-KDKDNVKLYTHAKRTKDG---------- 219 (517) T ss_pred EEEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhh--cc-CCcCceEEEEEEEEeCCC---------- Confidence 78999999999999999999999999999999999997765544332 22 345689999999997542 Q ss_pred cEEEEEEEecCCCceeeeecCc--ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_019445. 231 PFKSVYYEVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAP 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~--~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p 308 (559) +.++|.+. 
++.+++++|+| ++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++|||+++ T Consensus 220 -~~~~~~~~--d~~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~ 295 (517) T protein:vir:10 220 -KYLIRQSA--DDVPVGKESTVTEDKSPFLILTWKRSYGEDYGRG-MAEDHAGAFFVIQFLSEALARGMALMADVKYLVK 295 (517) T ss_pred -ceEEEEEe--CceeeccccccccccCCeeeeeeeecCCCCcccc-hHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccC Confidence 12456665 34568888886 799999999999999999999 9999999999999999999999999999999999 Q ss_pred CCCccccceecCCceeecCCcCCchhhhhhhhc-cccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 309 TSLKNQRASLLPGDITYIDQITGQDGFRPAYLV-NPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 309 ~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~-~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) +++..+..++.||+..++. ++..+.+.|+..+ ..+++.+.+.|++++++|+++||.|+ +..+++++||||||++| T Consensus 296 ~~~~~~~~~l~~~~~g~~~-~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~---l~~~~~~rvTAtEV~~r 371 (517) T protein:vir:10 296 PGSYTDINQFVEGGSGAVL-HGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA---MTRRDAERVTAYEIQRD 371 (517) T ss_pred cccccchhhccCCCccccc-cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh---hhccCCccccHHHHHHH Confidence 9888777888888865553 3444567776544 35788899999999999999999874 45789999999999999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) ++|++++|||||+||++|||.|+|+|+|++|.+.+ |. 
.+++|+|+|+|++++|++++++|.+|++++++++ T Consensus 372 ~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l--~~-------~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a 442 (517) T protein:vir:10 372 AMLVEQSLGGVYSLFATTFQGPLARWFMNGISSIL--TS-------KNVSPTILTGIEALGRMAELDKLGTFNGYVSMTA 442 (517) T ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhc--CC-------CCccceeeccHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999997543 22 2478999999999999999999999999999999 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhhhhhcCCCh Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMA-AAQGAKTLSEAKTSDP 541 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~-~~~~a~~~~~~~~~~~ 541 (559) +++|.++++||+|++++++++++|||+++|||++||+++|++++++||++++++++.. +++.++..++++.+++ T Consensus 443 ~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 443 QWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred cCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 9988888999999999999999999999999999999998887777666655544332 3333444344444333 No 24 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=1.7e-148 Score=830.70 Aligned_cols=497 Identities=11% Similarity=0.099 Sum_probs=428.1 Q ss_pred CC----hh---hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MA----ET---TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~----~~---~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |. .. .+++|++||+.|+++|++||++|+||++||+|+..+. +. 
+..+.+++|||||++|+++|||||| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~--~~---~~~~~~~~~dstg~~a~~~LAa~l~ 75 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMND--KG---DNETSQNGWQGVGAQATNHLANKLA 75 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCC--CC---CccccCCcccchHHHHHHHHHHHHH Confidence 43 33 4688999999999999999999999999999986432 22 2334568999999999999999999 Q ss_pred HhhcCCCCcceeccCCccch-------hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEM-------MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED 146 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~-------~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~ 146 (559) ++||||++|||+|++.|..+ .+.+++++||++|+++|+.+|++||||.++|++|+||++||||++|++++. T Consensus 76 ~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-- 153 (516) T protein:vir:96 76 QVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG-- 153 (516) T ss_pred hhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC-- Confidence 99999999999999887543 356689999999999999999999999999999999999999999997664 Q ss_pred eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccc Q lcl|NC_019445. 
147 IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLD 226 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~ 226 (559) +|++|||++|||.+|++|+|++||||++++++++.++|+ ++++.+++..++++ +..|+|||+|+++.+ T Consensus 154 --~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~--~~~~~~~~~~~~~~-~~~v~v~~~v~~~~~------- 221 (516) T protein:vir:96 154 --AISAIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATR--AVVEVGLKGKKCKE-DDSVKLYTHAKYLGD------- 221 (516) T ss_pred --CEEEEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhh--hhhhhhhhhhhcCC-CCceEEEEeeeeeCC------- Confidence 267899999999999999999999999999999999995 45666776666554 568999999998653 Q ss_pred cccccEEEEEEEecCCCceeeeecC--cccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 227 SKNKPFKSVYYEVGGDNDKLLRESG--FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 227 ~~~~~~~sv~~~~~~~~~~il~esg--~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .|.++|++.+ +.+++++|+ |++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++|| T Consensus 222 ----~~~~~~~~~d--~~~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~L~D~k~L~~l~~~~l~~~~~a~~~~ 294 (516) T protein:vir:96 222 ----GFWELKQSAD--DIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRP-LAEDYSGDLFVIQFLSEAVARGAALMADIK 294 (516) T ss_pred ----ceeEEEEEeC--ceeeccccccccccCCeeeeeeeecCCCCcccc-hHHHhhHHHHHHHHHHHHHHHHHHHhcCCc Confidence 2456777764 446788877 5899999999999999999999 999999999999999999999999999999 Q ss_pred eeecCCCccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHH Q lcl|NC_019445. 
305 MVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEA 383 (559) Q Consensus 305 ~~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~E 383 (559) |++++++..+..++.||+..++- ++..+.+.|+..++ .+++.+++.|++++++|+++||.| .+..+++++||||| T Consensus 295 ~lv~p~g~~~~~~l~~~~~g~i~-~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~---~l~~r~~~rvTAtE 370 (516) T protein:vir:96 295 YLIRPGAQTDVDHFVNSGTGEVV-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME---TMTRRDAERVTAVE 370 (516) T ss_pred cccCcccccchhhhccCCCceee-cCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh---hhccCCCccccHHH Confidence 99988877777777777755442 34456678865543 478888999999999999999876 45678999999999 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 384 VIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFI 463 (559) Q Consensus 384 i~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~ 463 (559) |++|++|++++|||||+||++|||.|+|.|++.++ .|++|+.+ ++++|+|+|++++|++++++|.+|++++ T Consensus 371 V~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~-----~p~lp~~~----v~~~~vs~l~~l~r~~~~~~i~~~~~~i 441 (516) T protein:vir:96 371 IQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA-----GESFTSDL----VDPVIITGIEALGRMAELDKLANFAQYM 441 (516) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhc-----CCCCcccc----ccceeechHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999997665 37777654 8999999999999999999999999999 Q ss_pred HHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhh-hhhh Q lcl|NC_019445. 464 GQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAA-QGAKT-LSEA 536 (559) Q Consensus 464 ~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~-~~a~~-~~~~ 536 (559) +++++++|+++|+||+|++++++++++|||++++||++||+++|++++++||+++++++++++. 
..+|+ +.|+ T Consensus 442 ~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 442 SLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 9999999999999999999999999999999999999999999999988888877666554432 22222 2222 No 25 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=1.9e-147 Score=824.92 Aligned_cols=497 Identities=11% Similarity=0.098 Sum_probs=428.3 Q ss_pred CChhh---HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETT---KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~---~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.+.+ +++|++||+.|+++|++||++|+||++||+|++++..+ +..+.+++|||||++|+++||||||++|| T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~-----~~~~~~~~~dstg~~a~~~LAa~l~~~lt 79 (516) T protein:vir:10 5 TDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKG-----DNETSQNGWQGVGAQATNHLANKLAQVLF 79 (516) T ss_pred hhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCC-----CcccccccccchHHHHHHHHHHHHHhhhc Confidence 66654 58899999999999999999999999999998754322 12345689999999999999999999999 Q ss_pred CCCCcceeccCCccch-------hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 78 SPARPWFRLATPDPEM-------MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~-------~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) ||++|||||+++|..+ .+.+++++||+.||++++.+|++||||.++|++|+||++||||++|++++. 
+ | T Consensus 80 pp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~--~--~ 155 (516) T protein:vir:10 80 PAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG--A--I 155 (516) T ss_pred CCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC--C--e Confidence 9999999999987644 345679999999999999999999999999999999999999999997654 2 6 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) ++|||++|||.+|++|+||+||||+++|++++.++|+ ++++..++..+.++ +.+++|||+|+++.+ T Consensus 156 ~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~--~~~~~~~~~~~~~~-~~~~~i~t~v~~~~~----------- 221 (516) T protein:vir:10 156 SAIPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATR--AVVEVGLKGKKCKE-DDSIKLYTHAKYLGE----------- 221 (516) T ss_pred EEEEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhh--hhhhhhhhhhccCC-CCceEEEEEEEecCC----------- Confidence 6799999999999999999999999999999999995 45677777776655 567999999997542 Q ss_pred cEEEEEEEecCCCceeeeecC--cccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_019445. 
231 PFKSVYYEVGGDNDKLLRESG--FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAP 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg--~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p 308 (559) .|.++|.+.+ +.++.++|+ |++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++|||+++ T Consensus 222 ~~~~~~~~~d--~~~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrg-p~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~ 298 (516) T protein:vir:10 222 GFWELKQSAD--DIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRP-LAEDYSGDLFVIQFLSEAVARGAALMADIKYLIR 298 (516) T ss_pred CceEEEEeeC--ceeeccccccccccCCeeeeeeeecCCCCcccc-hHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccC Confidence 2456777764 446777776 6899999999999999999999 9999999999999999999999999999999998 Q ss_pred CCCccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 309 TSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 309 ~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) +++.....++.||+..++ .++..+.+.|+..++ .+++.+++.|++++++|+++||.| .+..+++++||||||++| T Consensus 299 p~g~~~~~~l~~~~~g~~-~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~---~l~~rd~~rvTAtEV~~r 374 (516) T protein:vir:10 299 PGAQTDVDHFVNSGTGEV-VTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME---TMTRRDAERVTAVEIQRD 374 (516) T ss_pred cccccchhhhccCCCcee-ecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhh---hhhccCCccccHHHHHHH Confidence 887777788888876555 234556678865553 478888999999999999999987 456689999999999999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) ++|++++|||||+||++|||.|+|.|++. 
+++|++|+++ ++++|++++++++|++++++|.+|++++++++ T Consensus 375 ~~E~~~~LGpv~~rl~~Ell~Pli~r~~~-----~~~p~~P~~l----v~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~ 445 (516) T protein:vir:10 375 ALEIEQNMGGVYSLFATTMQSPVAMWGLL-----EAGDSFTSDL----VDPVIITGIEALGRMAELDKLANFAQYMSLPL 445 (516) T ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHH-----hhCCCCChhh----cCcceehhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999963 5579999988 67888999999999999999999999999999 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhh-hhhh Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-GAKT-LSEA 536 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~a~~-~~~~ 536 (559) +++|+++|+||+|++++.+++++|||++++||++||+++|+||+|+||.++++++++++.+ ++|. +.++ T Consensus 446 q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 446 QWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred cCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchhhhhhhcC Confidence 9999999999999999999999999999999999999999999888887665554443322 1121 2222 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=2.2e-95 Score=539.55 Aligned_cols=536 Identities=14% Similarity=0.129 Sum_probs=396.9 Q ss_pred CCh-hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh----------ccccCCCCCCCCCCcccccCCCCcchHHHHHHHHH Q lcl|NC_019445. 
1 MAE-TTKERLNKQFAQLESERQSFEPHWRELSDYI----------NPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLA 69 (559) Q Consensus 1 M~~-~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~----------~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~La 69 (559) |.+ +.+..|++||+.+++.|++||++|+||++|+ .|++.++.++...++ ..++++++..+++++|+ T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---r~ki~~~~~~~~~~~l~ 96 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADW---RHRINTGHTFEVVETLV 96 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcc---cccccchhHHHHHHHHh Confidence 654 5678899999999999999999999999665 445544444443332 34799999999999999 Q ss_pred HHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeec------ Q lcl|NC_019445. 70 SGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLED------ 143 (559) Q Consensus 70 s~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~------ 143 (559) ++||+++|| +++||++.+.+++..+..++ +++.+...+++++|+..+++.+.+++.+|||++.+..+ T Consensus 97 s~Lm~~~~p-~~~wf~~~p~~~ed~~~A~~------~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~ 169 (641) T protein:vir:94 97 AYFKGATFP-SDDWFDLKGMVPELADAARV------VKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQ 169 (641) T ss_pred hHHhhhhcC-CCceEEEecCCCChHHHHHH------HHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHh Confidence 999999997 88999999877765554332 44556678889999999999999999999998876532 Q ss_pred ----------------------CCceEEEEEeeccEEEEeeCCCC--CEEEEEEEEeecHHHHHHh--cCcccCCHHHHH Q lcl|NC_019445. 144 ----------------------DEDIIRTMPFPIGSYYLANSPRG--SVDICFRKFSMTVRQLVQE--FGLNNVSESVKS 197 (559) Q Consensus 144 ----------------------~~~~~~~~~~~l~~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~--fg~~~l~~~v~~ 197 (559) ....+++++++..+|+++.++.. .++++||++++|+.+++.+ ||.+++....+. 
T Consensus 170 ~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~ 249 (641) T protein:vir:94 170 FKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYV 249 (641) T ss_pred hhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhcc Confidence 12334666667777777654421 1235788899999999887 888888766655 Q ss_pred HHhcCCCCceEEEEEEEeecCccc--ccccccccccEEEEEEEecCCCceeeeecCc---ccCCeEEEEeeecCCCcccc Q lcl|NC_019445. 198 MWESGTYEKWIEVMHSVYPNIDRD--TSKLDSKNKPFKSVYYEVGGDNDKLLRESGF---DEFPIMAPRWEVNGEDVYGS 272 (559) Q Consensus 198 ~~~~~~~~~~v~v~~~v~p~~~~~--~~~~~~~~~~~~sv~~~~~~~~~~il~esg~---~~~P~~~~rw~~~~g~~YGr 272 (559) .++...++...++-....++++.. .+.++..+++|.|+|++.++ +++++++|+ +++||+++||.+.++++||+ T Consensus 250 ~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g--~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~ 327 (641) T protein:vir:94 250 DYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYG--KQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGM 327 (641) T ss_pred cccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeC--CEEeecccccccCcCCeEEecceecCCcccCC Confidence 444333333222221111111110 12456678999999988754 579988874 57799999999999999999 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC--ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHH Q lcl|NC_019445. 273 SCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL--KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVA 350 (559) Q Consensus 273 G~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~--~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 350 (559) | |+++++||+++||.+++.++++++++++|||++++++ ++.++.+.||++++++..+ .++|+..+.+++..... 
T Consensus 328 g-p~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~---~v~pl~~~~~~~~~~~~ 403 (641) T protein:vir:94 328 S-VLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHG---SLQPIDMGRQDFVVTYQ 403 (641) T ss_pred C-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCC---cceeecCCccccchhHH Confidence 9 8999999999999999999999999999999988775 4455788899998876544 46777655555556667 Q ss_pred HHHHHHHHHHHHhhcchhhhccC-CCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-------- Q lcl|NC_019445. 351 DIQDTRQIINSAYFVDLFMMLQN-INTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-------- 421 (559) Q Consensus 351 ~i~~~~~rI~~af~~dl~~~~~~-~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-------- 421 (559) .++.++.+|+++|+.|.+.++.+ ++++++|||||+++.+|+...||+++++|+.||+.||+.|+++++++. T Consensus 404 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R 483 (641) T protein:vir:94 404 EAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIR 483 (641) T ss_pred HHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhh Confidence 88999999999999998766554 677889999999999999999999999999999999999999999884 Q ss_pred ---------CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC- Q lcl|NC_019445. 
422 ---------NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG- 491 (559) Q Consensus 422 ---------g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G- 491 (559) |.+|++|++|.| ++.| ++|+++++..+.+++++++++++.+++ .|++++++|++.+++.+++..| T Consensus 484 ~~~~~~~~~~~~~~~p~~L~~---~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~-~P~v~d~~d~~~~~~~~~~~~g~ 558 (641) T protein:vir:94 484 MYVPEEQMDGFFEVSPEYLHY---PYKF-LALGANYVVERERMVTDLLQLLDISGR-VPQIGQSLDYALILEDLLRQMRF 558 (641) T ss_pred hhchhhhcccCCCCCccceee---eeeE-eecchhHHHHHHHHHHHHHHHHHHhhc-ChhhhhcCCHHHHHHHHHHHhCC Confidence 677778888753 4444 489999999999999999999988777 6999999999999999998866 Q ss_pred -CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCC------------- Q lcl|NC_019445. 492 -VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQ------------- 557 (559) Q Consensus 492 -vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 557 (559) +|..++|+++...+-+++ ++++++++++++++.+++.+..-+.+. .-++.+++|.++++-+.+. T Consensus 559 ~~p~~~ir~~~~~~~~~~~-~~~~~q~~~~~~a~~~~~~~~~~a~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 636 (641) T protein:vir:94 559 TDPMRYIKKAEAPPAAPPI-APAEPGALPPEMMNSVGGGLNDQAIAG-MTPEDVSDLASRIGIDTSDVAPEAMAAATQQI 636 (641) T ss_pred CCchhhccCccCchhHHHH-HHHHHHHHHHHHHHHHHhhhHHHHHHH-hhHHHHHHHHHhhcCCchhhhHHHHhcccccc Confidence 567899998753322222 222222333334443333222211111 1244555555554433331 Q ss_pred --CC Q lcl|NC_019445. 
558 --SQ 559 (559) Q Consensus 558 --~~ 559 (559) .| T Consensus 637 ~~~~ 640 (641) T protein:vir:94 637 TSGA 640 (641) T ss_pred cccC Confidence 11 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=9.1e-72 Score=410.05 Aligned_cols=535 Identities=13% Similarity=0.139 Sum_probs=362.5 Q ss_pred CChh--hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-------cCCCCCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAET--TKERLNKQFAQLESERQSFEPHWRELSDYINPR-------GSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~--~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-------~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) |.|. .+..+.++|+++++.|+.|+++|++++++..++ .+..+.....+......+++.++...+++++.+. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~ 94 (651) T protein:vir:80 15 YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHAY 94 (651) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHHH Confidence 5553 345689999999999999999999999877763 1111211122222233568999999999999999 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC------ Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE------ 145 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~------ 145 (559) |+..+|| +.+||++.+.++. +..+++-+-|+..+...+++++|+..++.+++|.+++|||++.+.++.. 
T Consensus 95 l~~~~~~-~~~~~~~~p~~~~----d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~ 169 (651) T protein:vir:80 95 LMSATFP-NKNWFDVVPAKPG----QDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKK 169 (651) T ss_pred HHHhhcC-CCceeEeccCCch----hHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeeh Confidence 9999997 6889999885432 2235555668888888899999999999999999999999997654321 Q ss_pred -------------------------ceEEEEEeeccEEEEeeCCCCCEEEEE-EEEeecHHHHHHhc----CcccCCHHH Q lcl|NC_019445. 146 -------------------------DIIRTMPFPIGSYYLANSPRGSVDICF-RKFSMTVRQLVQEF----GLNNVSESV 195 (559) Q Consensus 146 -------------------------~~~~~~~~~l~~~~v~~d~~G~vd~i~-r~~~~t~~ql~~~f----g~~~l~~~v 195 (559) ..++++.+|+.+|+++.++.+.-|+-| .+..+|.+++.+.. ..+...... T Consensus 170 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~ 249 (651) T protein:vir:80 170 KVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDV 249 (651) T ss_pred heeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHH Confidence 235789999999999999977665533 33456777765532 111111111 Q ss_pred H-HHH--------------hc-----CCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeee--ecCc- Q lcl|NC_019445. 196 K-SMW--------------ES-----GTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLR--ESGF- 252 (559) Q Consensus 196 ~-~~~--------------~~-----~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~--esg~- 252 (559) . +.. +. .++..+|+||+|..+ ++..++.+.++|+..++ +++++ +.+| T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~--------~d~e~~~~~~~~v~~~g--~~il~~~~~~~~ 319 (651) T protein:vir:80 250 VEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD--------IHLENKTYHDVVVTIMG--NEVLRFEQNPYW 319 (651) T ss_pred HhhhccccccCCccccccccCCCccccccccceEEEEEEEE--------eeccCCceEEEEEEEcC--cEEecccccCCC Confidence 1 111 00 012356788877443 23456678888777644 35664 5565 Q ss_pred ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC--ccccceecCCceeecCCcC Q lcl|NC_019445. 
253 DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL--KNQRASLLPGDITYIDQIT 330 (559) Q Consensus 253 ~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~--~~~~~~~~pg~~~~~~~~~ 330 (559) +++||+++||.+.+|+.||+| |++.++|+++.||.+++++++++.++++|+|+++++. ++.++...||++++++..+ T Consensus 320 ~~~Pf~~~~~~~~~~~~yG~g-~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~ 398 (651) T protein:vir:80 320 CGRPFVIGTYIPTARQPYAMG-ALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHG 398 (651) T ss_pred CCCCeeeecceecCccccCCC-hHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCC Confidence 579999999999999999999 8999999999999999999999999999999998764 4445678899999887765 Q ss_pred CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_019445. 331 GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM-LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNP 409 (559) Q Consensus 331 ~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~-~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~P 409 (559) + ++|+..+.+++......|+.++++|++.|..+.+.+ +..++.+++||+||+.+++++..+||++|++|+.||+.| T Consensus 399 ~---~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~p 475 (651) T protein:vir:80 399 D---LQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLV 475 (651) T ss_pred C---ceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 455544444566667789999999999998876665 344567899999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCCCchhhC------------CcceE----EEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh Q lcl|NC_019445. 410 LIDRAFSMMVRKNMLPPPPDAME------------GMPLK----VEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA 473 (559) Q Consensus 410 li~r~~~il~r~g~lp~~p~~l~------------g~~v~----~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~ 473 (559) |++|++++|++.+..|++|.... 
..+++ +..+++.+.+.|...++.+.++++.+++ .|++ T Consensus 476 l~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~----~p~~ 551 (651) T protein:vir:80 476 LLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQ----VPEM 551 (651) T ss_pred HHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhcc----CCcc Confidence 99999999999998887654321 12333 3445666666676666666666654433 5777 Q ss_pred HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHHHHHHHH-------HHHHHH-HHHHHHhhhhhhc-----C Q lcl|NC_019445. 474 LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQQQQQQQ-------MMAMGM-AAAQGAKTLSEAK-----T 538 (559) Q Consensus 474 ~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q~~q~~~-------~~~~~~-~~~~~a~~~~~~~-----~ 538 (559) .+.+|...+++.+++.+|++. .++..+++.+....+.+.++|.++ ++++++ ++.+..+...+.. + T Consensus 552 ~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (651) T protein:vir:80 552 GQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTPNAD 631 (651) T ss_pred chhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 778999999999999999963 467666554332222211111110 000000 0000000000000 0 Q ss_pred CChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 539 SDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 539 ~~~~~~~~~~~~~~~~~~~~~ 559 (559) +....+ .+-.+..-+..-.| T Consensus 632 ~~~~~~-~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 632 QMQQEL-MATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHH-HHHHHHHHHhhccC Confidence 000000 00011111111111 No 28 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=1.8e-65 Score=375.50 Aligned_cols=485 Identities=11% Similarity=0.010 Sum_probs=172.5 Q ss_pred hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHH-HHHHHHHHHHhhcCCCCc Q lcl|NC_019445. 
4 TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMA-ARTLASGMMSGITSPARP 82 (559) Q Consensus 4 ~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a-~~~Las~l~~~l~pp~~~ 82 (559) +..+ .++.+....+.=..-...|+..++=|.=+-.+............. ....++..++ .-..+..|.++|...=.| T Consensus 1 m~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~-~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (559) T protein:vir:95 1 MAET-TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRN-DRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) T ss_pred CChh-hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcc-cccccccccchHHHHHHHHHHHHHHhhcC Confidence 5433 333333333333445566777777665554433222222221111 1223333333 344566677777763222 Q ss_pred -ceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee---c---CCceEEEEEeec Q lcl|NC_019445. 83 -WFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE---D---DEDIIRTMPFPI 155 (559) Q Consensus 83 -Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~---~---~~~~~~~~~~~l 155 (559) +-.+=-- ...+....+ ...+.+.|+ .+...+.+.+-.+|--.-+.+ | .++++ T Consensus 79 p~~~WF~l--~~~d~~~~e------~~~v~~~L~------~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~------- 137 (559) T protein:vir:95 79 PARPWFRL--ATPDPEMMD------YGPVKLWLE------AVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGA------- 137 (559) T ss_pred CCCccccc--ccCCccccc------hHHHHHHHH------HHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcee------- Confidence 2111100 111111111 112333333 222222232333331100101 1 12221 Q ss_pred cEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCc---------------c Q lcl|NC_019445. 156 GSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNID---------------R 220 (559) Q Consensus 156 ~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~---------------~ 220 (559) .|+..|+.+. 
+|-..+++++.+ + .++....-.+|+.......+ + T Consensus 138 --l~~~~d~~~~----~r~~~~~l~~~~-------v--------~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~ 196 (559) T protein:vir:95 138 --MAVLDDDEDI----IRTMPFPIGSYY-------L--------ANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVK 196 (559) T ss_pred --eEeecCCCce----eEEEEeecCeEE-------E--------eeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHH Confidence 3455555432 222222222211 1 11111111222322111110 0 Q ss_pred cccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCC-Cccc----------------ccchHHHHH--- Q lcl|NC_019445. 221 DTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGE-DVYG----------------SSCPGMLAL--- 280 (559) Q Consensus 221 ~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g-~~YG----------------rG~P~~~~l--- 280 (559) ...+.+.. ..+..+++.+.-..+.-....+-..+||-..-|....+ .+-+ +. +++.|= T Consensus 197 ~~~~~~~~-~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~-~ge~YGrg~ 274 (559) T protein:vir:95 197 SMWESGTY-EKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN-GEDVYGSSC 274 (559) T ss_pred HHHhcCCC-CCeEEEEEEEeccccccccccccccceEEEEEEEecCCCceeeecCCcccCCccceeeeec-CCccccccc Confidence 00011111 12234444432211111111223467888777775332 1111 11 233331 Q ss_pred HHHHHHH--HHHHHHHHHHHHHhcCceeecCCCccccceecCCcee-ecC-CcCCchhhhhhhhccccHHH------HHH Q lcl|NC_019445. 281 GPVKALQ--LLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDIT-YID-QITGQDGFRPAYLVNPSTAD------LVA 350 (559) Q Consensus 281 ~d~~~L~--~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~pg~~~-~~~-~~~~~~~~~p~~~~~~~~~~------~~~ 350 (559) |-...|- +.-+.+.+....++.-... | .+...+.+.. ..+ .+++...+... .+...+.. -.. 
T Consensus 275 P~~~al~d~k~L~~l~~~~l~~~~~~~~-p------p~~v~~~~~~~~~~l~pgg~~~~~~~-~~~~~i~p~~~~~~~~~ 346 (559) T protein:vir:95 275 PGMLALGPVKALQLLQKRKSQLIDKATN-P------PMVAPTSLKNQRASLLPGDITYIDQI-TGQDGFRPAYLVNPSTA 346 (559) T ss_pred hHHHhhHHHHHHHHHHHHHHHHHHHHhc-C------ceeccccccccceeeeccceeeeCCC-CCcccceeecccccchH Confidence 1111111 1111111221111111111 1 1111122211 111 12222222111 11111111 111 Q ss_pred HHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh-----c-CC- Q lcl|NC_019445. 351 DIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVR-----K-NM- 423 (559) Q Consensus 351 ~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r-----~-g~- 423 (559) .++...+.++...=.-+|..+-.. ....+++ +.-+.|....-. -. ...|.|++.|.-..+.. . +. T Consensus 347 ~~~~~i~~~~~rI~~af~~d~~~~-l~~r~~~--rvTAtEV~~r~~----E~-~~~LG~v~~rl~~E~l~Pli~r~~~il 418 (559) T protein:vir:95 347 DLVADIQDTRQIINSAYFVDLFMM-LQNINTR--SMPVEAVIEMKE----EK-LLMLGPVLERLNDECLNPLIDRSFSMM 418 (559) T ss_pred HHHHHHHHHHHHHHHHhhhhhHHH-hhcCCCC--CCCHHHHHHHHH----HH-HHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 222222334433322111111111 1122332 222333222110 01 13477877776444331 1 11 Q ss_pred --CCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHH-----HHHHcCCCccc Q lcl|NC_019445. 424 --LPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDA-----FADMSGVSPTV 496 (559) Q Consensus 424 --lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~-----~a~~~Gvp~~~ 496 (559) --.+|+--++-.....-+..+..+.++++...+....++++.++++. .++++ +++. +.+.+.- . 
T Consensus 419 ~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~la-----q~~Pe-vld~id~d~~~~~~a~---~ 489 (559) T protein:vir:95 419 VRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA-----QVKPE-ALDKLNVDQAIDAFAD---M 489 (559) T ss_pred HhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh-----ccChh-hhhcCCHHHHHHHHHH---H Confidence 11233321111112222678888999999999988888888776553 25654 3332 1111111 1 Q ss_pred cCCHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 497 IVPQEQV----DQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 497 ~rs~~ev----~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) +--+..+ ++..+.|+|++|+||++++++++.++|+.+........+.-++|.+++...+|+.. T Consensus 490 ~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 556 (559) T protein:vir:95 490 SGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGG 556 (559) T ss_pred hCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccc Confidence 1112222 44455566666666666666666666555443333333344555555555555555 No 29 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=4.6e-49 Score=285.59 Aligned_cols=504 Identities=13% Similarity=0.095 Sum_probs=335.8 Q ss_pred CC--hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 
1 MA--ETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~--~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |- ++++..++++|.++.+.|++|+..|.|+++|..-+..++......++ ..++|-++....++++++.||+.+|| T Consensus 11 ~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~---r~~~~~~k~~~~~~~i~~~l~~~~Fp 87 (584) T protein:vir:95 11 LLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPW---KNSTTLPKLCQIRDNLHSNYFSSLFP 87 (584) T ss_pred hccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhccccc---ccccchhHHHHHHHHHHHHHHHhhcC Confidence 44 34567789999999999999999999999999998887766665555 34689999999999999999999998 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc------------ Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED------------ 146 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~------------ 146 (559) ++.||++....++..+... -+.+++.+...|+.|||+.++.+.+++++++|+|.+-+.+..+. T Consensus 88 -~~~w~~~v~~~~~~~~~~~----~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~ 162 (584) T protein:vir:95 88 -NDDWLRWVGYGKGDSTKTK----AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDY 162 (584) T ss_pred -ccceeeeecCCCchhhHHH----HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecccccccc Confidence 7899999987765443322 34578888889999999999999999999999999987765442 Q ss_pred -eEEEEEeeccEEEEeeCCCCCEEEE--EEEEeecHHHHHHhcCcc---c-CCHHHHHHHhcCCCCceEEEEEEEeecCc Q lcl|NC_019445. 147 -IIRTMPFPIGSYYLANSPRGSVDIC--FRKFSMTVRQLVQEFGLN---N-VSESVKSMWESGTYEKWIEVMHSVYPNID 219 (559) Q Consensus 147 -~~~~~~~~l~~~~v~~d~~G~vd~i--~r~~~~t~~ql~~~fg~~---~-l~~~v~~~~~~~~~~~~v~v~~~v~p~~~ 219 (559) ..+++.+++.+++++.++ +.++.. +++..+|..+|.++.-.. . ..+.++....+........ .+-+.+... 
T Consensus 163 ~~prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~-~~~~~~~~~ 240 (584) T protein:vir:95 163 IGPRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYS-VEDFDKAAG 240 (584) T ss_pred ccceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCc-ccccccccc Confidence 467888999999999999 555442 235668999998875211 1 1133443332211000000 000000000 Q ss_pred cccccc----------------------ccccccEEEEEEEecCCCceeee--ecCc--ccCCeEEEEeeecCCCccccc Q lcl|NC_019445. 220 RDTSKL----------------------DSKNKPFKSVYYEVGGDNDKLLR--ESGF--DEFPIMAPRWEVNGEDVYGSS 273 (559) Q Consensus 220 ~~~~~~----------------------~~~~~~~~sv~~~~~~~~~~il~--esg~--~~~P~~~~rw~~~~g~~YGrG 273 (559) .+.+++ +..+.+-..+|+.......++++ +..| +.+||++..|.+.....||.| T Consensus 241 ~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~g 320 (584) T protein:vir:95 241 FDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMG 320 (584) T ss_pred cccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCC Confidence 000000 00010111111111112345555 4444 789999999999999999999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHH Q lcl|NC_019445. 
274 CPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQ 353 (559) Q Consensus 274 ~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~ 353 (559) |.+.++|.++.||.+.|.+++++.++++|++..-.+ ..+..+.||+.++.+.+++...+.| ....+......|+ T Consensus 321 -i~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~--~~~~~~~pg~~~~~~~~~~~q~~~p---~a~~~~s~~~~lq 394 (584) T protein:vir:95 321 -PLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGE--VEEFVWGPGAEIHLDQGGDVQEIAK---NVNYIINADNQIQ 394 (584) T ss_pred -chhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccc--cchhcccCCceeecCCCCCcceecC---chhhhhHHHHHHH Confidence 799999999999999999999999999996654433 3567889999999987766444443 2223333344456 Q ss_pred HHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcC----CCCC--- Q lcl|NC_019445. 354 DTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKN----MLPP--- 426 (559) Q Consensus 354 ~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g----~lp~--- 426 (559) .++....+.--+.-+.++.. .+..-||+++.+..+ ..+.++.++..+|-.|+++|++.+|.+.| ..++ T Consensus 395 ~~e~~me~~sGvp~~~~G~~-~~~~~TAtg~s~l~n----aa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr 469 (584) T protein:vir:95 395 MLEDRMELYAGAPREAMGIR-TPGEKTAFEVQQLGN----AAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIR 469 (584) T ss_pred HHHHHHHhhhCCChhhcccc-cchhhhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcee Confidence 65555555443333334333 444567887755555 66678888888888888899888888754 1221 Q ss_pred --------------CchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCC Q lcl|NC_019445. 427 --------------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGV 492 (559) Q Consensus 427 --------------~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gv 492 (559) .+++|.|. +++......+-+.|.+..+++.++++. 
++++.++++++-..+.+.+++..++ T Consensus 470 ~~n~e~~~~~f~~i~r~Dl~g~-~~~va~Ga~~~~~keq~~q~l~~ilq~-----~~~~~i~p~~~~~~l~~~ladl~~~ 543 (584) T protein:vir:95 470 VMDTDLGVKEFMSVTREDITAN-GKIRPIGARHFGKQAQDLQNLVGIFNS-----QIGQMILPHTSGKALATFVDDVTGL 543 (584) T ss_pred eeccccccccccccChhhhccC-eeEEeehhhHHHHHHHHHHHHHHHHHh-----hhhhhccccchHHHHHHHHHHHhCC Confidence 23455443 666666665556777888888888764 5667777889999999999999999 Q ss_pred Cc-cccCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 493 SP-TVIVPQEQVDQ-ARQQRAQQQQQQQMMAMGMAAAQGAK 531 (559) Q Consensus 493 p~-~~~rs~~ev~~-~rq~r~q~~q~~~~~~~~~~~~~~a~ 531 (559) |. .+.+++-.+++ +..|+...++++..+++++..+++|- T Consensus 544 p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 544 QGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred CcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 97 45555444432 12222222223333334444444333 No 30 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=4.6e-47 Score=274.60 Aligned_cols=513 Identities=11% Similarity=0.072 Sum_probs=340.0 Q ss_pred CChh--hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 
1 MAET--TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~--~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |.++ ....++.+|.++.+.|+..+..|.|+++|+..+..+.++++..+++ .+++.++.++++.+|++.+|+++|| T Consensus 15 ~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~---~s~t~~k~~~~~~~l~a~~~~~~fp 91 (599) T protein:vir:31 15 RDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFK---NSTTINKLAHLHLMITTSYMEHLLP 91 (599) T ss_pred cCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcc---cccchHHHHHHHHHHHHHHHhhhcC Confidence 6554 3556899999999999999999999999999988887777777775 4467889999999999999999998 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEe-e-------cC-----C Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVL-E-------DD-----E 145 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~-~-------~~-----~ 145 (559) +..||++..-+++... +.--+.+++.|...|++|+|+.++...+.|++++|||+.-++ + |. . T Consensus 92 -~~~w~d~~~~~~~~~~----~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~ 166 (599) T protein:vir:31 92 -NRNWVDFVGFDNDSVN----AEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNY 166 (599) T ss_pred -CccceEeeecCCchhH----HHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeeccccccccc Confidence 8899999987665222 222445778888999999999999999999999999987665 1 11 1 Q ss_pred ceEEEEEeeccEEEEeeCCCCCEEEEE--EEEeecHHHHHHhcCcccC---CHHHHHHHh-c-CC-CCceEEEEEEEeec Q lcl|NC_019445. 146 DIIRTMPFPIGSYYLANSPRGSVDICF--RKFSMTVRQLVQEFGLNNV---SESVKSMWE-S-GT-YEKWIEVMHSVYPN 217 (559) Q Consensus 146 ~~~~~~~~~l~~~~v~~d~~G~vd~i~--r~~~~t~~ql~~~fg~~~l---~~~v~~~~~-~-~~-~~~~v~v~~~v~p~ 217 (559) .+-+++.+.+..++++.++ +.++.++ ++...|..+|.+..+.... +.+.....+ . .+ .+.+.+-+.-+. 
T Consensus 167 ~~P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~-- 243 (599) T protein:vir:31 167 SGTVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRR-- 243 (599) T ss_pred ccceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhh-- Confidence 1347788999999999999 5554433 5778889999887653321 112111111 0 00 000011000000 Q ss_pred Ccccc---cccccccccEEE----------EEEEec------------CCCceeeee----cCcccCCeEEEEeeecCCC Q lcl|NC_019445. 218 IDRDT---SKLDSKNKPFKS----------VYYEVG------------GDNDKLLRE----SGFDEFPIMAPRWEVNGED 268 (559) Q Consensus 218 ~~~~~---~~~~~~~~~~~s----------v~~~~~------------~~~~~il~e----sg~~~~P~~~~rw~~~~g~ 268 (559) ..++ +++.....++.+ .|++.. .+..++++- .+..+.||++..|.+..++ T Consensus 244 -g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~ 322 (599) T protein:vir:31 244 -KFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDT 322 (599) T ss_pred -hccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccc Confidence 0000 000000001000 111111 112233222 2334589999999999999 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccccceecCCceeecCCcCCchhhhhhhhccccHHHH Q lcl|NC_019445. 269 VYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADL 348 (559) Q Consensus 269 ~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~ 348 (559) .||.| |-+.++|.+..||.+.+.++++...+++|.+....++.+.++.+.||++++++..++...+.|-. ++... 
T Consensus 323 ~yG~G-~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~eD~~~~P~~v~~~~d~~~vq~~~p~s----~~~~a 397 (599) T protein:vir:31 323 LCPIG-PLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMRGGPNHVFEVEETGDVQYMTPPA----EVLQP 397 (599) T ss_pred cCCCC-CchhcchHHHHHHHHHHHhhhhhhhhhcccccccccccccCccCCCCcceeecCCCccccccCch----hhhhH Confidence 99999 78999999999999999999999999999888888888778889999999987655544443321 22333 Q ss_pred HHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC- Q lcl|NC_019445. 349 VADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPP- 427 (559) Q Consensus 349 ~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~- 427 (559) ...|+..+.+..+.--+..+.++. +.+..-||+||++..++....+...+..+..+++.||+++++.+.++.-.-|+. T Consensus 398 ~~~is~~e~~mee~sGvp~~~~G~-~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~ti 476 (599) T protein:vir:31 398 DNQLSITLQLMEDLSGAPKESIGQ-RTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTI 476 (599) T ss_pred HHHHHHHHHHHHHhhccchhhcCC-cccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccce Confidence 334555555555544444444443 334456999999999999999999999999999999999999999874322221 Q ss_pred ----------------chhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC Q lcl|NC_019445. 428 ----------------PDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG 491 (559) Q Consensus 428 ----------------p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G 491 (559) ++.|. .++++..++.-.-+.|..-.+++.++++ +|+++..++++.-.++...++.... 
T Consensus 477 ri~~~e~~~~~f~~i~redl~-~~~~~v~~Ga~~v~ere~~~q~l~~il~-----~~~~q~~~P~~~~k~l~~~l~~~~~ 550 (599) T protein:vir:31 477 KTFNSELGTATFLDITADDLN-LNGQMVAQGATLFAEKANTLQNLNAILG-----GPLGAALAPHMSRTKLFNAVEYLGD 550 (599) T ss_pred eeecccccceeeEEeehhhhh-CCeeeeechhhHHHHHHHHHHHHHHHhc-----ccCCCccchhhHHHHHHHHHHHHHh Confidence 11222 2356655555555777777788887776 4666666667777777777777666 Q ss_pred CCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 492 VSP-TVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 492 vp~-~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) +.. .+.+..--| ++| |.+..+++++.++. ..++|++...|+++..| T Consensus 551 l~~~~~~~~~va~---~eq---q~~~~m~Q~~lq~~----------------~~~~~~~~~~~~~~~~~ 597 (599) T protein:vir:31 551 LDAYGIFTFGIGV---QED---QQLARMAQKSTQQT----------------EETALTQEEVGGPTTDT 597 (599) T ss_pred ccccccCCCchhH---HHH---HHHHHHHHHHHHHh----------------HhhhhhhhhcCCCCccc Confidence 654 344433322 121 11111112222211 12356666677766666 No 31 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=5.8e-37 Score=219.24 Aligned_cols=524 Identities=10% Similarity=0.042 Sum_probs=304.5 Q ss_pred CChh------h----HHHHHHHHHHHHHHhh-hHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHH Q lcl|NC_019445. 1 MAET------T----KERLNKQFAQLESERQ-SFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLA 69 (559) Q Consensus 1 M~~~------~----~~~l~~r~~~l~~~R~-~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~La 69 (559) |++. + ..-+..+++.+++-+. .+...+.+..+|.+ ++..+. ..+| ..+++.+.-...++.+. 
T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~---g~~~~~-~~~~---~s~~~~~~v~~~v~~~~ 73 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYF---GEPFGN-ERPG---KSGIVSRDVQETVDWIM 73 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHh---CCCCCc-ccCC---CCccccHHHHHHHHHHH Confidence 4433 2 2234455555555444 22234455555543 322221 1222 35567778888999999 Q ss_pred HHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHH-HhccchHHHHHHHHHHHhhCcEEEEEeecCC--- Q lcl|NC_019445. 70 SGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMF-NKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE--- 145 (559) Q Consensus 70 s~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l-~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~--- 145 (559) +.|+..+|+ +.+||++.+-.+...+.. +.++..+.-++ ..++.+..++.++++.++.|+|++-+.++.. T Consensus 74 ~~l~~~~~~-~~~~~~~~p~~~~D~~~a------~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~ 146 (705) T protein:vir:88 74 PSLMKVFTS-GGQVVKYEPDTAEDVEQA------EQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKP 146 (705) T ss_pred HHHHHhhcC-CCceEEEeeCChhHHHHH------HHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccch Confidence 999999886 889999998655433322 22344444443 4556678899999999999999884433110 Q ss_pred ---------------------------------------------ceEEEEEeeccEEEEeeCCCCCEEE--EEEEEeec Q lcl|NC_019445. 146 ---------------------------------------------DIIRTMPFPIGSYYLANSPRGSVDI--CFRKFSMT 178 (559) Q Consensus 146 ---------------------------------------------~~~~~~~~~l~~~~v~~d~~G~vd~--i~r~~~~t 178 (559) ..+++++||+.+|+++.++.+.-|. +++++.+| T Consensus 147 ~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t 226 (705) T protein:vir:88 147 TFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYT 226 (705) T ss_pred hhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEecc Confidence 2368899999999999998775554 66788999 Q ss_pred HHHHHHhcCcccCCHHHHHHH--------------hcC----------------CCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 
179 VRQLVQEFGLNNVSESVKSMW--------------ESG----------------TYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 179 ~~ql~~~fg~~~l~~~v~~~~--------------~~~----------------~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) .++|... |+++ +.+.+.. ..+ .....|+|++|.. +.+.+.+ .. T Consensus 227 ~~dl~~~-g~~~--~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~-~~d~~~d---~~ 299 (705) T protein:vir:88 227 VSDLRLL-GVPE--DVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYT-LLDVDGD---GI 299 (705) T ss_pred HHHHHhh-cCCh--hHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeee-EecccCC---cc Confidence 9999654 3221 0111110 000 0012355555433 3333221 22 Q ss_pred cccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_019445. 229 NKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAP 308 (559) Q Consensus 229 ~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p 308 (559) ..++..+| .++++++...++.+||++.++.+.++..||+| +++.+.+-++.+|.+.+.+++++..+++|+++++ T Consensus 300 ~~~~~~~~-----~g~~il~~~~~~~~PF~~~~~~p~~~~~~G~g-~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~ 373 (705) T protein:vir:88 300 SELRRILY-----VGDYIISNEPWDCRPFADLNAYRIAHKFHGMS-VYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVL 373 (705) T ss_pred eeeEEEEE-----eCccccccccCCCCCEEEecceeecCccccCC-hHHHHhHHHHHHHHHHHHHHHHHHhccCCceecc Confidence 23333332 13467888788899999999999999999999 6999999999999999999999999999999998 Q ss_pred CCCcc--ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC--CCCcCHHHH Q lcl|NC_019445. 309 TSLKN--QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN--TRSMPVEAV 384 (559) Q Consensus 309 ~~~~~--~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~--~~~~TA~Ei 384 (559) ++..+ ...+..||+++.+...+ .++|+.... 
-.+.+...++.+.+.|++..-..-+.++...+ ..+.||+.| T Consensus 374 ~g~v~~~d~~~~~pg~vv~~~~~~---~i~~~~~~~-~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i 449 (705) T protein:vir:88 374 DGQVNLEDLLTNEAAGIVRVKSMN---SITPLETPQ-LSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSV 449 (705) T ss_pred ccccCcccccccCCCeeEEecCCC---ccccccCCc-CcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHH Confidence 77432 23557899998876433 344443221 12233445677777887766444344433322 346899999 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCC-----------CCchhhCCcceEEEeecHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLP-----------PPPDAMEGMPLKVEYISVMAQAQKSIGL 453 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp-----------~~p~~l~g~~v~~~~is~La~a~r~~~~ 453 (559) ..+.+.....+..+...|...++.+++.++++++....--| ..|..+.+ .+++.+.+++..+.+.+.. T Consensus 450 ~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~-~~~v~v~v~~~~~~~eq~~ 528 (705) T protein:vir:88 450 NQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRE-RSDLTVTVGIGNMNKDQQM 528 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhcc-CCceEEeeccccchHHHHH Confidence 99999999999999999988899999999999999864322 22444443 4566666666666666666 Q ss_pred HHHHHHHHHHHHHhccChhhHhcCCH---HHHHHHHHHHcCCC--ccccCCHHHHHHHHHHHHHH----HH------HHH Q lcl|NC_019445. 454 SSLASTVNFIGQLAQAKPEALDKLNV---DQAIDAFADMSGVS--PTVIVPQEQVDQARQQRAQQ----QQ------QQQ 518 (559) Q Consensus 454 ~~l~~~~~~~~~la~~~P~~~~~id~---d~~~~~~a~~~Gvp--~~~~rs~~ev~~~rq~r~q~----~q------~~~ 518 (559) ..+...++....+.+. |+..+.++. 
.+++..++...|+- ..++..+...++++.+.+++ ++ +++ T Consensus 529 a~l~~ll~~~q~l~~~-~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~ 607 (705) T protein:vir:88 529 LHLMRIWEMAQAVVGG-GGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQA 607 (705) T ss_pred HHHHHHHHHHHHhhcc-cchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHH Confidence 6777777666665543 444444443 35566666666652 22332222222111110000 00 000 Q ss_pred HHHHHHHHHHHHhhhhhhcCCCh--hHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 519 MMAMGMAAAQGAKTLSEAKTSDP--SVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 519 ~~~~~~~~~~~a~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 559 (559) .++.++...+.. .+++...-. .+.++.+..........| T Consensus 608 e~~k~q~e~~~~--q~e~q~~q~E~q~~q~e~e~~~~~~~~~~ 648 (705) T protein:vir:88 608 DAQRAQSDALAK--QAEAQMKQVEAQIRLAEIELKKQEAVLQQ 648 (705) T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000 000000000 000000000000000000 No 32 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=100.00 E-value=4.2e-28 Score=170.67 Aligned_cols=522 Identities=12% Similarity=0.083 Sum_probs=268.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHH---HHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFE---PHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~---~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) =++++...|++-++..++...+.. ..|-+++-|.- .. 
......| ..+++...-...++-+-+.|+..++ T Consensus 24 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~~~~~g---rs~vv~~~v~~~ve~~~~~l~~~f~ 95 (763) T protein:vir:95 24 KNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEG----KA-KPPKVKG---RSQVQPKLVRRQAEWRYSALTEPFL 95 (763) T ss_pred CChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccc----cC-cccccCC---CccccCHHHHHHHHHHHHHHHHhhc Confidence 233344444444443333333322 33444433321 10 1111222 3456777888899999999999999 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHH-HHHhccchHHHHHHHHHHHhhCcEEEEEeecC------------ Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMND-MFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD------------ 144 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~-~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~------------ 144 (559) + +..||++.+-.++..+.++. .+..+.- ....++=+..++.+++++++.|||++-+.++. T Consensus 96 ~-~~~~~~~~P~~~~D~~~A~q------~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~ 168 (763) T protein:vir:95 96 G-SNKLFKVTPVTWEDVQGARQ------NELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVF 168 (763) T ss_pred C-CCcEEEEecCCcchHHHHHH------HHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhh Confidence 7 77899999987765554443 3333333 33455666788899999999999987553210 Q ss_pred ------------------------------------------------------------------CceEEEEEeeccEE Q lcl|NC_019445. 145 ------------------------------------------------------------------EDIIRTMPFPIGSY 158 (559) Q Consensus 145 ------------------------------------------------------------------~~~~~~~~~~l~~~ 158 (559) .+..+++.+|+.+| T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~ 248 (763) T protein:vir:95 169 SLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENI 248 (763) T ss_pred hhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHh Confidence 01126677999999 Q ss_pred EEeeCCCCCEE---EEEEEEeecHHHHHHh-cCcccCCH---HHHHHHh--------------cCCCCceEEEEEEEeec Q lcl|NC_019445. 
159 YLANSPRGSVD---ICFRKFSMTVRQLVQE-FGLNNVSE---SVKSMWE--------------SGTYEKWIEVMHSVYPN 217 (559) Q Consensus 159 ~v~~d~~G~vd---~i~r~~~~t~~ql~~~-fg~~~l~~---~v~~~~~--------------~~~~~~~v~v~~~v~p~ 217 (559) +|+.++.+.++ -|++++.+|..+|..+ ++++++.+ +..+... .+.....|.|+.+ |-+ T Consensus 249 ~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~-y~~ 327 (763) T protein:vir:95 249 IIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEY-WGF 327 (763) T ss_pred eecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEe-eee Confidence 99999876543 3688999999999775 44443321 1111100 0011245666554 333 Q ss_pred CcccccccccccccEEEEEE-EecCCCceeeee--cCc--ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 218 IDRDTSKLDSKNKPFKSVYY-EVGGDNDKLLRE--SGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKR 292 (559) Q Consensus 218 ~~~~~~~~~~~~~~~~sv~~-~~~~~~~~il~e--sg~--~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~ 292 (559) .+.+.++ +...|. -..+ +++++. +.| +.+||+++.+.+.++..||+| .++.+.+.++.+|.+.+. T Consensus 328 ~d~~gdg-------~~~~~~v~~~g--~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~g-i~~~~~d~Qr~~N~~~~~ 397 (763) T protein:vir:95 328 WDIEGNG-------VLEPIVATWIG--STLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEP-DAELLGDNQAVLGAVMRG 397 (763) T ss_pred eccCCcc-------eeEEEEEEEEc--CeeeecccccccCCCcCEEEecceeecCcccCCc-hHHHhhHHHHHHHHHHHH Confidence 3222211 111221 1112 345543 444 679999999999999999999 599999999999999999 Q ss_pred HHHHHHHHhcCceeecCCCcc--ccceecCCceeecCCcC-Cchhhhhhhhc--cccHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_019445. 293 KSQLIDKATNPPMVAPTSLKN--QRASLLPGDITYIDQIT-GQDGFRPAYLV--NPSTADLVADIQDTRQIINSAYFVDL 367 (559) Q Consensus 293 ~~~~~~~~~~p~~~~p~~~~~--~~~~~~pg~~~~~~~~~-~~~~~~p~~~~--~~~~~~~~~~i~~~~~rI~~af~~dl 367 (559) +++.+.++++|.|.++.+... ......||+++.+.... ....+++.... 
......+ ++.+.+.+.+.--..- T Consensus 398 ~~d~l~~~~~~~~~v~~gav~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~---l~~~~~~~e~~TGv~~ 474 (763) T protein:vir:95 398 MIDLLGRSANGQRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTM---ATLQNQEAESLTGVKA 474 (763) T ss_pred HHHHHHhhcCCcEEeecccccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHH---HHHHHHHHHHhhCcch Confidence 999999999999998877532 22456899988875321 11223332211 1122222 2222222221111111 Q ss_pred hhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------C--CCCCCchhhCCcc Q lcl|NC_019445. 368 FMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------N--MLPPPPDAMEGMP 435 (559) Q Consensus 368 ~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g--~lp~~p~~l~g~~ 435 (559) +.++...+....||++|..+.+.....+..++.+|.. .+.+++++++.++.+. | ..|-.|+.+.| . T Consensus 475 ~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~-~ 552 (763) T protein:vir:95 475 FAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKG-N 552 (763) T ss_pred hhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcC-C Confidence 1222233344579999999999999999999888865 7899999999999984 2 34445555655 3 Q ss_pred eEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh--------HhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHH Q lcl|NC_019445. 436 LKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA--------LDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQAR 507 (559) Q Consensus 436 v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~--------~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~r 507 (559) ++|++..+.+ ..+.+..+.+..+++.++. .++|.+ ++..+...+++.+.....-|...-....+ +. 
T Consensus 553 ~DV~V~~~~a-s~~~q~~~~l~~ll~~l~~--~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaq---le 626 (763) T protein:vir:95 553 FDLEVDISTA-EVDNQKSQDLGFMLQTIGP--NVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQ---LA 626 (763) T ss_pred cceEEecccc-hHHHHHHHHHHHHHHHhcc--ccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHH---HH Confidence 4444433322 1222233333333333322 223322 12233333333333222211111111111 10 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC-----------------------Ch--hHHHHHHHH------hhcCCC Q lcl|NC_019445. 508 QQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTS-----------------------DP--SVLSAMANA------VSGQGG 556 (559) Q Consensus 508 q~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~-----------------------~~--~~~~~~~~~------~~~~~~ 556 (559) +++++++++..++++....+++.+...++... .. -.++.+.+. +..+.+ T Consensus 627 ~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~~~ 706 (763) T protein:vir:95 627 VEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELPPN 706 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccChh Confidence 11110000000000000001111111111100 00 000000000 000001 Q ss_pred CCC Q lcl|NC_019445. 557 QSQ 559 (559) Q Consensus 557 ~~~ 559 (559) ++. T Consensus 707 ~~~ 709 (763) T protein:vir:95 707 LSA 709 (763) T ss_pred HHH Confidence 110 No 33 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.85 E-value=2.1e-19 Score=123.04 Aligned_cols=530 Identities=12% Similarity=0.096 Sum_probs=258.5 Q ss_pred CChh----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCC-CCCCCCCCcc-cccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAET----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSR-FLTSEVNRND-RRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~-~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) +++. ..++|..+|..-...-..|...+.+-.+|.. +. ...+....-. +....+.-+.....++.+.+..-. 
T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~---G~Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~ 114 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYD---NIQWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKR 114 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC---CCCCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHh Confidence 3222 2334555566555555677777777777753 22 1111111000 111223333444444444432222 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE--eecC-CceEEEE Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV--LEDD-EDIIRTM 151 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v--~~~~-~~~~~~~ 151 (559) +++=+++.+.+++..+. .+.++..+......+++..+...++.+.++.|.|++-+ +.+. +..+... T Consensus 115 -----nr~~~~~~p~~~~d~~~------Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~ 183 (776) T protein:vir:93 115 -----GRSDFKVLPRRKDGGKA------AERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAG 183 (776) T ss_pred -----CCcceEEecCChhHHHH------HHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEee Confidence 56667777654332222 22345555556678899999999999999999998754 3333 3456667 Q ss_pred EeeccEEEEeeCCCC-C---EEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcC------------------------- Q lcl|NC_019445. 152 PFPIGSYYLANSPRG-S---VDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESG------------------------- 202 (559) Q Consensus 152 ~~~l~~~~v~~d~~G-~---vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~------------------------- 202 (559) .++..+++++.++.- . ..-+||+..+|.+++...|++. .+.+.+..... 
T Consensus 184 ~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (776) T protein:vir:93 184 AESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPER--AAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMN 261 (776) T ss_pred ccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCc--hHHHHHhhhhcccccchhcccccccccccccccccc Confidence 788899999887642 2 2337889999999999999863 23333221100 Q ss_pred --------CCCceEEEEEEEeecCcc----------------cc-----------cccccccccEEEEEEEecCCCceee Q lcl|NC_019445. 203 --------TYEKWIEVMHSVYPNIDR----------------DT-----------SKLDSKNKPFKSVYYEVGGDNDKLL 247 (559) Q Consensus 203 --------~~~~~v~v~~~v~p~~~~----------------~~-----------~~~~~~~~~~~sv~~~~~~~~~~il 247 (559) ....+|.|+.+-+.++.. ++ +...-.......+||..-.+ .+++ T Consensus 262 ~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g-~~~l 340 (776) T protein:vir:93 262 SVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTT-RDLM 340 (776) T ss_pred cccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEec-chhh Confidence 011356666664433210 00 00011112223334333222 2455 Q ss_pred ee--cCc--ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc-c-c---cee Q lcl|NC_019445. 248 RE--SGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN-Q-R---ASL 318 (559) Q Consensus 248 ~e--sg~--~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~-~-~---~~~ 318 (559) +. +.| +.|||+++...+.+.+.||.|+ +..+.+-++.+|.....++..+ ...++++..+... . . ... T Consensus 341 ~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~-v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~~~~gav~~~d~~~~~~~ 416 (776) T protein:vir:93 341 WAGPSPYRHNRYPFTPIWGFRRARDGMPYGV-IRFMRGMQDDVNKRLSKALYIL---STNKVLMEEGAVDDIDEFRREAA 416 (776) T ss_pred hccCCCCCCCccceEEecCceecccccccch-HHhhhHHHHHHHHHHHHHHHhh---cCCceeeccccccchHHHHHhcc Confidence 44 444 6899999999999999999996 9999999999999988887654 3456777655321 1 1 125 Q ss_pred cCCceeecCCcC-CchhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHHHHHHHHHhh Q lcl|NC_019445. 
319 LPGDITYIDQIT-GQDGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLG 396 (559) Q Consensus 319 ~pg~~~~~~~~~-~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG 396 (559) .||+++.+.... +...+++..... ..+.+.++...+.|+..- .+|. +++. .+..++..-|..+.+.....+. T Consensus 417 rp~~vi~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~~tGi~~~--~~G~-~~n~~Sg~ai~~~~~~~~~~~~ 490 (776) T protein:vir:93 417 RPDAVMTVKNGKLGAVKMDVDRDLA---PAHLELASRSIQMIQQVGGVTDE--MLGR-TTNAVSGVAIQARQEQGSVATN 490 (776) T ss_pred cCCceeeeCCccccccccccCcCcc---HHHHHHHHHHHHHHHHhhCcChH--HhCC-CcchhhHHHHHHHHHHHHHHHH Confidence 699988774321 111223322111 122233444444444332 1111 1122 2234677789999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCC------------C---C-----chhhCCcceEEEeecHH-HHHHHHHHHHH Q lcl|NC_019445. 397 PVLERLNDECLNPLIDRAFSMMVRKNMLP------------P---P-----PDAMEGMPLKVEYISVM-AQAQKSIGLSS 455 (559) Q Consensus 397 ~v~~~l~~E~l~Pli~r~~~il~r~g~lp------------~---~-----p~~l~g~~v~~~~is~L-a~a~r~~~~~~ 455 (559) .++.+|.. .+.=+....+.++.+..-=+ . + ...+....+.|.+..+. ...+|.+..+. T Consensus 491 ~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~ 569 (776) T protein:vir:93 491 KLFDNLRL-AFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAE 569 (776) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHH Confidence 99999966 55557777777776631000 0 0 01111122345443333 33446666666 Q ss_pred HHHHHHHHHH-Hh-ccCh---hhHhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|NC_019445. 456 LASTVNFIGQ-LA-QAKP---EALDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQQQQQQQMMAM------ 522 (559) Q Consensus 456 l~~~~~~~~~-la-~~~P---~~~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q~~q~~~~~~~------ 522 (559) |.+.++.+.. +. .+.+ +.++.-+.+++...+-...+-+. ..-..+++.+.. +.++++++.++..+. 
T Consensus 570 l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~q-q~q~~~~q~q~~~~~a~~~~~ 648 (776) T protein:vir:93 570 LMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIARE-QAQQQQQQYNDALAIATLEEQ 648 (776) T ss_pred HHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHH-HHhhHHHHHHHHHhhhhhhHh Confidence 6555542211 00 0111 12233456667777766665431 122222221111 111111111100000 Q ss_pred -H---HHHHHHHhhhhhhcCC----ChhHHHH---HHHHhh----------------cCCCCCC Q lcl|NC_019445. 523 -G---MAAAQGAKTLSEAKTS----DPSVLSA---MANAVS----------------GQGGQSQ 559 (559) Q Consensus 523 -~---~~~~~~a~~~~~~~~~----~~~~~~~---~~~~~~----------------~~~~~~~ 559 (559) + ...++..+...++... .....++ .+.... ...++.+ T Consensus 649 qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~ 712 (776) T protein:vir:93 649 QAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDD 712 (776) T ss_pred hHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhccccccc Confidence 0 0001111111111100 0000000 000000 0000000 No 34 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.80 E-value=6.3e-17 Score=109.40 Aligned_cols=542 Identities=10% Similarity=0.034 Sum_probs=246.2 Q ss_pred CCh----------------------hhHHH----HHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcc-ccc Q lcl|NC_019445. 1 MAE----------------------TTKER----LNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRND-RRN 53 (559) Q Consensus 1 M~~----------------------~~~~~----l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~ 53 (559) |++ ...+. |..||......=+.|....++-.+|.. ....+.+....-+ +.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~ 78 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG--GEQWPSQVRTERELEQR 78 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhC--CCCCCHHHHHHHHhcCC Confidence 221 11122 222222222222233333334445542 1111111111000 011 Q ss_pred CCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchh---------------h-HHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 54 TRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMM---------------D-YGPVKLWLEAVQNRMNDMFNKS 117 (559) Q Consensus 54 ~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~---------------~-~~~v~~~l~~ve~~~~~~l~~s 117 (559) ..+.-+.....++...+..- .+++=+++.+.+.... . ..+-.+-.+..+..+......+ T Consensus 79 p~~~~N~i~~~v~~v~g~~~-----~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~ 153 (711) T protein:vir:10 79 PCLVNNVLPTFVDQVLGDQR-----QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNC 153 (711) T ss_pred CcEEEcchHHHHHHHhhhHh-----hCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhc Confidence 12233333444444433222 2334444444221000 0 0001111222344445555678 Q ss_pred cchHHHHHHHHHHHhhCcEEEEE--e---ecC-CceEEEEEee-ccEEEEeeCC---CCC-EEEEEEEEeecHHHHHHhc Q lcl|NC_019445. 118 NLYQSLPQLYGSLGTYSTGAMAV--L---EDD-EDIIRTMPFP-IGSYYLANSP---RGS-VDICFRKFSMTVRQLVQEF 186 (559) Q Consensus 118 nf~~~~~~~~~dl~~~G~~~l~v--~---~~~-~~~~~~~~~~-l~~~~v~~d~---~G~-vd~i~r~~~~t~~ql~~~f 186 (559) +...+...++.+.++.|.|.+=+ + ++. ++-+++..++ ..+++++.++ ++. 
..-+|+...||..++..+| T Consensus 154 ~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~y 233 (711) T protein:vir:10 154 DAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALY 233 (711) T ss_pred ChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhC Confidence 88888999999999999997633 2 222 3446777774 7778887654 322 2347899999999999999 Q ss_pred CcccCCHHHHHHH-hcCC---CCceEEEEEEEeecCccc--c-------ccccc------------------ccccEEEE Q lcl|NC_019445. 187 GLNNVSESVKSMW-ESGT---YEKWIEVMHSVYPNIDRD--T-------SKLDS------------------KNKPFKSV 235 (559) Q Consensus 187 g~~~l~~~v~~~~-~~~~---~~~~v~v~~~v~p~~~~~--~-------~~~~~------------------~~~~~~sv 235 (559) |+.+. +.+.... ...+ ....|.|.+.-+.++... . ..++. +...-..+ T Consensus 234 p~~a~-~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v 312 (711) T protein:vir:10 234 PDATA-EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKT 312 (711) T ss_pred Cchhh-hhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeE Confidence 86542 2221111 1101 113455544333221100 0 00000 00112234 Q ss_pred EEEecCCCceeeee-cCc--ccCCeEEEE--eeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCC Q lcl|NC_019445. 
236 YYEVGGDNDKLLRE-SGF--DEFPIMAPR--WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTS 310 (559) Q Consensus 236 ~~~~~~~~~~il~e-sg~--~~~P~~~~r--w~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~ 310 (559) ||..-.+ .+++.+ +.| ..+||+++- +...++..++.|+ +..+.+-++.+|.+....++.+.+..+++++++.+ T Consensus 313 ~~~~~~G-~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~-vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~g 390 (711) T protein:vir:10 313 YWRKITG-ANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSI-IRHSKDAQRMANYWDSAATETVALAPKAPFIGSEG 390 (711) T ss_pred EEEEEec-ceeecCCCCCCCCcccEEEEeeeeeccccccccchh-hhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCc Confidence 5543322 345532 334 569999763 4556777777775 88999999999999999999999999999998766 Q ss_pred Ccc--c----cceecCCceeecCCcC-CchhhhhhhhccccH-HHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCH Q lcl|NC_019445. 311 LKN--Q----RASLLPGDITYIDQIT-GQDGFRPAYLVNPST-ADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPV 381 (559) Q Consensus 311 ~~~--~----~~~~~pg~~~~~~~~~-~~~~~~p~~~~~~~~-~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA 381 (559) ... . .....||+++.+.... +...+++... +.+ ..+...++...+.|.+.- .+|. +++.. +..++. T Consensus 391 ai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~--~~~~~~~~~ll~~~~~~i~~~tGi~~~--~~G~~-~n~~Sg 465 (711) T protein:vir:10 391 NVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPP--AAVPAAELTLGQNSVEKIKSTMGMYDA--SLGAM-GNETSG 465 (711) T ss_pred ccCChHHHHHhccccCCCeeEecccccCcCCccccCC--CCCCHHHHHHHHHHHHHHHHHhCCChH--HcCCC-ccchHH Confidence 432 1 1236799988775322 2223443321 222 222334444455554433 1221 12222 235788 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCC-----------C-C-------Cch----------hhC Q lcl|NC_019445. 382 EAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNML-----------P-P-------PPD----------AME 432 (559) Q Consensus 382 ~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~l-----------p-~-------~p~----------~l~ 432 (559) .-|..+++.....|...+.+|.. ...=+....+.++.+..-- + + .+. .+. 
T Consensus 466 ~ai~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~ 544 (711) T protein:vir:10 466 RAIIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN 544 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccc Confidence 99999999999999999999875 5555556566655542100 0 0 000 011 Q ss_pred CcceEEEe-ecHHHHHHHHHHHHHHHHHHHHHHHHhcc-ChhhH---hcCCHHHHHHHHHHHcCCCccccCCHHHH-HHH Q lcl|NC_019445. 433 GMPLKVEY-ISVMAQAQKSIGLSSLASTVNFIGQLAQA-KPEAL---DKLNVDQAIDAFADMSGVSPTVIVPQEQV-DQA 506 (559) Q Consensus 433 g~~v~~~~-is~La~a~r~~~~~~l~~~~~~~~~la~~-~P~~~---~~id~d~~~~~~a~~~Gvp~~~~rs~~ev-~~~ 506 (559) ...+.|.+ ++|-...+|.+.+..+.++++.+-++..+ .+.++ |.-+.++++..+....+-+. ......+. ++. T Consensus 545 ~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~-~~~~~~~~~qq~ 623 (711) T protein:vir:10 545 VQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV-LSKDEREAIEED 623 (711) T ss_pred eeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCccc-CcchhhhHHHHH Confidence 11223343 33444566666666666665543222221 22233 34567777877777765432 11111111 111 Q ss_pred HHHHHHH---HHHH-HHHHHH--HHHHH-------HHhhhhhhcCC-------------ChhHHHHHH-----HHhhcCC Q lcl|NC_019445. 507 RQQRAQQ---QQQQ-QMMAMG--MAAAQ-------GAKTLSEAKTS-------------DPSVLSAMA-----NAVSGQG 555 (559) Q Consensus 507 rq~r~q~---~q~~-~~~~~~--~~~~~-------~a~~~~~~~~~-------------~~~~~~~~~-----~~~~~~~ 555 (559) .++++++ ++.+ +.++.. ...++ .++.-.++... ++..++.+. ..+.-++ T Consensus 624 ~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qaelq~ 703 (711) T protein:vir:10 624 MPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITA 703 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 0000 000000 00000 00110000000 000000000 0000000 Q ss_pred CCC---C Q lcl|NC_019445. 
556 GQS---Q 559 (559) Q Consensus 556 ~~~---~ 559 (559) .++ | T Consensus 704 ~q~~~~q 710 (711) T protein:vir:10 704 SQANVTE 710 (711) T ss_pred HHHHhhc Confidence 000 0 No 35 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.73 E-value=1.1e-16 Score=107.97 Aligned_cols=536 Identities=14% Similarity=0.110 Sum_probs=238.2 Q ss_pred CChhhHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcccc-----cCCCCcchHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRNDRR-----NTRIIDSTGTMAARTLASGM 72 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~-----~~~~~~s~~~~a~~~Las~l 72 (559) |++.+.+.+.+.+..+....+ .|...|++-.+|..=.....+......-+.+ ...+.-+.....++.+.+.- T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 999988777766666666544 4444444443331100001111110000000 01122223333333333211 Q ss_pred HHhhcCCCCcceeccCCccc-hhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE-----ee---- Q lcl|NC_019445. 73 MSGITSPARPWFRLATPDPE-MMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV-----LE---- 142 (559) Q Consensus 73 ~~~l~pp~~~Wf~l~~~d~~-~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v-----~~---- 142 (559) - .+++=+++.+.++. 
..+..++ ++..+......++...+...++.+.++.|.|.+-+ .+ T Consensus 81 ~-----~nr~d~~v~p~~~~~d~~~Ae~------l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~ 149 (708) T protein:vir:17 81 R-----NNRITVKFRPGDREASEELANK------LNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) T ss_pred h-----hCCcceEEecCCCcchHHHHHH------HHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCC Confidence 1 35555666665432 2222222 44445556667888889999999999999997733 22 Q ss_pred cCCceEEEE--EeeccEEEEeeCCCC--CEEE--EEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCC----CceEEEEE Q lcl|NC_019445. 143 DDEDIIRTM--PFPIGSYYLANSPRG--SVDI--CFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTY----EKWIEVMH 212 (559) Q Consensus 143 ~~~~~~~~~--~~~l~~~~v~~d~~G--~vd~--i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~----~~~v~v~~ 212 (559) +...++.++ ..|..+++++.++.- .-|. +||...|+..++..+||+.+....--..+.+..+ ..+|.|+. T Consensus 150 ~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e 229 (708) T protein:vir:17 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAK 229 (708) T ss_pred CCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEE Confidence 122333333 346778888877522 1233 6899999999999999976543222122111111 12343332 Q ss_pred EEeecCcc-------cc--c---cccc------------------ccccEEE--EEEEecCCCceeeee---cCcccCCe Q lcl|NC_019445. 213 SVYPNIDR-------DT--S---KLDS------------------KNKPFKS--VYYEVGGDNDKLLRE---SGFDEFPI 257 (559) Q Consensus 213 ~v~p~~~~-------~~--~---~~~~------------------~~~~~~s--v~~~~~~~~~~il~e---sg~~~~P~ 257 (559) +-+.+.+. ++ + .++. ...+... |||.... 
+.+++.+ ++|..||| T Consensus 230 ~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~-g~~~l~~~~~~p~~~fP~ 308 (708) T protein:vir:17 230 YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVD-GDGFLEKPRRIPGEHIPL 308 (708) T ss_pred EEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeec-ccccccCCCCCCCCccce Confidence 22211110 00 0 0000 0111111 3444322 2345544 56788999 Q ss_pred EEE---EeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc-c-----cc----------ee Q lcl|NC_019445. 258 MAP---RWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN-Q-----RA----------SL 318 (559) Q Consensus 258 ~~~---rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~-~-----~~----------~~ 318 (559) +++ ||.. +|...-.|+ +..+.+-++.+|......++.+.++.+-+++++.+.+. . .. .- T Consensus 309 vP~~g~r~~~-d~~~~~yG~-vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~ 386 (708) T protein:vir:17 309 IPVYGKRWFI-DDIERVEGH-IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) T ss_pred EEEecccccc-cCCCcccch-hhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhc Confidence 876 4443 555533454 88999999999999999999999988877776654211 0 00 01 Q ss_pred cCCceeec-CCcCCchhhhhhhhccccHH-HHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhh Q lcl|NC_019445. 319 LPGDITYI-DQITGQDGFRPAYLVNPSTA-DLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLG 396 (559) Q Consensus 319 ~pg~~~~~-~~~~~~~~~~p~~~~~~~~~-~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG 396 (559) .++.+..+ ........++| +.+. .....++...+.|.+.--..-. +++.. 
..++.--|..|++.....++ T Consensus 387 ~~~~~g~v~~~a~~~~~~~~-----~~~~~~~~~llq~~~~~i~~~tGi~d~-~~G~~--sn~SG~Ai~~rq~qg~~~~~ 458 (708) T protein:vir:17 387 VRDKYGNIIAGATPAGYTQP-----AVMNQALAALLQQTSADIQEVTGGSQA-MQQMP--SNIAQETVNNLMNRADMASF 458 (708) T ss_pred cCCcccccccccCCcccCCC-----ccccHHHHHHHHHHHHHHHHhcCCChH-HccCc--cchHHHHHHHHHHHHHHHHH Confidence 12221111 11111111222 1211 2223344555555544321111 22322 23566678888888888999 Q ss_pred hHHHHHH------HHHHHHHHHHHHH------HHHhcCC----------CCCCch------hhCCcceEEEee-cHHHHH Q lcl|NC_019445. 397 PVLERLN------DECLNPLIDRAFS------MMVRKNM----------LPPPPD------AMEGMPLKVEYI-SVMAQA 447 (559) Q Consensus 397 ~v~~~l~------~E~l~Pli~r~~~------il~r~g~----------lp~~p~------~l~g~~v~~~~i-s~La~a 447 (559) ..+.+|. -+.+.-||...|. |+-..|. .++.+. .|....+.|.+. .|-... T Consensus 459 ~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t 538 (708) T protein:vir:17 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchh Confidence 9888876 3455555554442 1111111 011110 011112334443 234456 Q ss_pred HHHHHHHHHHHHHHHHHHHhccC----hhhHhc---CCHHHHHHHHHHHcCCCcccc-CCHHHHHHHHHHHHHHHHHH-- Q lcl|NC_019445. 448 QKSIGLSSLASTVNFIGQLAQAK----PEALDK---LNVDQAIDAFADMSGVSPTVI-VPQEQVDQARQQRAQQQQQQ-- 517 (559) Q Consensus 448 ~r~~~~~~l~~~~~~~~~la~~~----P~~~~~---id~d~~~~~~a~~~Gvp~~~~-rs~~ev~~~rq~r~q~~q~~-- 517 (559) +|.+..+.+.++++.+....+.- +-+++. -+.++++..+...+......- ..+++. 
++.++.++++|++ T Consensus 539 ~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~-q~~~q~qq~~q~q~~ 617 (708) T protein:vir:17 539 RRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQ-QIVQQAQMAAQSQPN 617 (708) T ss_pred HHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhH-HHHHHHHHHHHHHHH Confidence 67777777766665543321111 112222 345677777777665432211 112221 1111111111111 Q ss_pred HHHHHHH-----HHHHHHhhhhh--------------hcCCChhHH--------------HHHHHHhhcCCCCCC Q lcl|NC_019445. 518 QMMAMGM-----AAAQGAKTLSE--------------AKTSDPSVL--------------SAMANAVSGQGGQSQ 559 (559) Q Consensus 518 ~~~~~~~-----~~~~~a~~~~~--------------~~~~~~~~~--------------~~~~~~~~~~~~~~~ 559 (559) +++.+++ ..++..|.-++ +......+. ...+.++..+.+.++ T Consensus 618 ~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~~~~~~l~~~q~~q~ 692 (708) T protein:vir:17 618 PEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQ 692 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHH Confidence 0000000 00111111000 000000000 001111111111111 No 36 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.72 E-value=1.1e-14 Score=97.21 Aligned_cols=535 Identities=12% Similarity=0.089 Sum_probs=247.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~ 71 (559) |++.. .+...+.+.....+.. .|.....+-.+|.. ....+.+....-. +....+.-+.....++...+. 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD--GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 55542 2223334444444432 34444446666653 1111111111000 111222233334444443332 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEeecCCc-eE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVLEDDED-II 148 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~~~~~~-~~ 148 (559) .- .+++=+++.+.+.+.... +-.+.++..+......+++..+...++.+.+..|-|.+ ++++|+.+ .+ T Consensus 86 ~~-----~nr~~~~v~p~~~~~~~~----~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:81 86 EA-----KTRTDLVVMSDEPDDETE----KLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HH-----hCCcceEEecCCCCchhH----HHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 22 255556666643322211 11222444555566678888899999999999888864 45555544 48 Q ss_pred EEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHHH---h--------------------- Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSMW---E--------------------- 200 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---~--------------------- 200 (559) +++++|..+++++.++.. .+ .-+||...+|..++...||+.+ +.+.... . 
T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:81 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhhhhhccccccccccccccccccchhh Confidence 999999999999886532 22 2378999999999999998632 1111110 0 Q ss_pred -----cC------CCCceEEEEEEEeecCcccc---------ccccc----------------ccccEEEEEEEecCCCc Q lcl|NC_019445. 201 -----SG------TYEKWIEVMHSVYPNIDRDT---------SKLDS----------------KNKPFKSVYYEVGGDND 244 (559) Q Consensus 201 -----~~------~~~~~v~v~~~v~p~~~~~~---------~~~~~----------------~~~~~~sv~~~~~~~~~ 244 (559) .. ++..+|.|+.+-+......+ -.++. ...+...+|+..- .+. T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~ 313 (714) T protein:vir:81 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGP 313 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecC Confidence 00 01134555555333221000 00000 0011111111110 123 Q ss_pred eeeee--cCc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc--c Q lcl|NC_019445. 245 KLLRE--SGF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR--A 316 (559) Q Consensus 245 ~il~e--sg~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~--~ 316 (559) ++|.+ ++| ..|||+++-... ..|..|| + +..+.+-++.+|+.....+.+ +..+-++..++.+.... + T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:81 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--h-hhhchhHHHHHHHHHHHHHHh--hcCCceeeecCcccccHHHH Confidence 46654 344 469998765443 5678886 4 778899999999876665554 45666676655543211 1 Q ss_pred ---eecCCceeecCCcC--Cc---hhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 
317 ---SLLPGDITYIDQIT--GQ---DGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 317 ---~~~pg~~~~~~~~~--~~---~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) -..||+++...... +. ..+++..... --....+.++...+.|++.- .+|. +++.. +..++-.-|..| T Consensus 389 ~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGv~~~--~lG~~-~na~SGvAi~~r 464 (714) T protein:vir:81 389 MEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQ-VASQQFQVMQESEKLIQDTMGVYSA--FLGQD-SGATSGVAISNL 464 (714) T ss_pred HHhccCCCCceeecccccccCCCCccccccCCCC-ccHHHHHHHHHHHHHHHHhhCCChH--HcCCC-ccchhHHHHHHH Confidence 26788888774221 11 1233322111 11222334444455554443 1111 12222 223555669999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------CCCCC------C---chh--------hCCcceEEEe Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------NMLPP------P---PDA--------MEGMPLKVEY 440 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g~lp~------~---p~~--------l~g~~v~~~~ 440 (559) ++.....|+..+.+|..- ..=+-+..++++.+. |.-.+ + |.. +....+.|.+ T Consensus 465 q~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i 543 (714) T protein:vir:81 465 VEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEE Confidence 999999999999888653 333344444444331 10000 0 000 0011233444 Q ss_pred -ecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChh-h---HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHH Q lcl|NC_019445. 441 -ISVMAQAQKSIGLSSLASTVNFIGQLAQ-AKPE-A---LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQ 512 (559) Q Consensus 441 -is~La~a~r~~~~~~l~~~~~~~~~la~-~~P~-~---~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q 512 (559) .+|-...+|.+..+.+.++++.+....+ +.++ + +|.-+.+++++.+-..+|.+. 
.....+++.++..++..+ T Consensus 544 ~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~ 623 (714) T protein:vir:81 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQ 623 (714) T ss_pred eeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHH Confidence 2344456677777777777765432211 1222 2 233467899999999998753 333333332222222222 Q ss_pred HHHHHHHHHH-------HHH-----HH-------HHHhhhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAM-------GMA-----AA-------QGAKTLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~-------~~~-----~~-------~~a~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) ++|.+.+++. +.+ .+ ++++.++.+... .......++..+.+..+.+| T Consensus 624 ~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~ 692 (714) T protein:vir:81 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhh Confidence 1111100000 000 00 011111110000 00000111111111122222 No 37 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.72 E-value=1.1e-14 Score=97.21 Aligned_cols=535 Identities=12% Similarity=0.089 Sum_probs=247.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~ 71 (559) |++.. .+...+.+.....+.. .|.....+-.+|.. ....+.+....-. +....+.-+.....++...+. 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD--GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 55542 2223334444444432 34444446666653 1111111111000 111222233334444443332 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEeecCCc-eE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVLEDDED-II 148 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~~~~~~-~~ 148 (559) .- .+++=+++.+.+.+.... +-.+.++..+......+++..+...++.+.+..|-|.+ ++++|+.+ .+ T Consensus 86 ~~-----~nr~~~~v~p~~~~~~~~----~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:99 86 EA-----KTRTDLVVMSDEPDDETE----KLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HH-----hCCcceEEecCCCCchhH----HHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 22 255556666643322211 11222444555566678888899999999999888864 45555544 48 Q ss_pred EEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHHH---h--------------------- Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSMW---E--------------------- 200 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---~--------------------- 200 (559) +++++|..+++++.++.. .+ .-+||...+|..++...||+.+ +.+.... . 
T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:99 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhhhhhccccccccccccccccccchhh Confidence 999999999999886532 22 2378999999999999998632 1111110 0 Q ss_pred -----cC------CCCceEEEEEEEeecCcccc---------ccccc----------------ccccEEEEEEEecCCCc Q lcl|NC_019445. 201 -----SG------TYEKWIEVMHSVYPNIDRDT---------SKLDS----------------KNKPFKSVYYEVGGDND 244 (559) Q Consensus 201 -----~~------~~~~~v~v~~~v~p~~~~~~---------~~~~~----------------~~~~~~sv~~~~~~~~~ 244 (559) .. ++..+|.|+.+-+......+ -.++. ...+...+|+..- .+. T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~ 313 (714) T protein:vir:99 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGP 313 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecC Confidence 00 01134555555333221000 00000 0011111111110 123 Q ss_pred eeeee--cCc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc--c Q lcl|NC_019445. 245 KLLRE--SGF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR--A 316 (559) Q Consensus 245 ~il~e--sg~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~--~ 316 (559) ++|.+ ++| ..|||+++-... ..|..|| + +..+.+-++.+|+.....+.+ +..+-++..++.+.... + T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:99 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--h-hhhchhHHHHHHHHHHHHHHh--hcCCceeeecCcccccHHHH Confidence 46654 344 469998765443 5678886 4 778899999999876665554 45666676655543211 1 Q ss_pred ---eecCCceeecCCcC--Cc---hhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 
317 ---SLLPGDITYIDQIT--GQ---DGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 317 ---~~~pg~~~~~~~~~--~~---~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) -..||+++...... +. ..+++..... --....+.++...+.|++.- .+|. +++.. +..++-.-|..| T Consensus 389 ~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGv~~~--~lG~~-~na~SGvAi~~r 464 (714) T protein:vir:99 389 MEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQ-VASQQFQVMQESEKLIQDTMGVYSA--FLGQD-SGATSGVAISNL 464 (714) T ss_pred HHhccCCCCceeecccccccCCCCccccccCCCC-ccHHHHHHHHHHHHHHHHhhCCChH--HcCCC-ccchhHHHHHHH Confidence 26788888774221 11 1233322111 11222334444455554443 1111 12222 223555669999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------CCCCC------C---chh--------hCCcceEEEe Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------NMLPP------P---PDA--------MEGMPLKVEY 440 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g~lp~------~---p~~--------l~g~~v~~~~ 440 (559) ++.....|+..+.+|..- ..=+-+..++++.+. |.-.+ + |.. +....+.|.+ T Consensus 465 q~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i 543 (714) T protein:vir:99 465 VEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEE Confidence 999999999999888653 333344444444331 10000 0 000 0011233444 Q ss_pred -ecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChh-h---HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHH Q lcl|NC_019445. 441 -ISVMAQAQKSIGLSSLASTVNFIGQLAQ-AKPE-A---LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQ 512 (559) Q Consensus 441 -is~La~a~r~~~~~~l~~~~~~~~~la~-~~P~-~---~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q 512 (559) .+|-...+|.+..+.+.++++.+....+ +.++ + +|.-+.+++++.+-..+|.+. 
.....+++.++..++..+ T Consensus 544 ~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~ 623 (714) T protein:vir:99 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQ 623 (714) T ss_pred eeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHH Confidence 2344456677777777777765432211 1222 2 233467899999999998753 333333332222222222 Q ss_pred HHHHHHHHHH-------HHH-----HH-------HHHhhhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAM-------GMA-----AA-------QGAKTLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~-------~~~-----~~-------~~a~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) ++|.+.+++. +.+ .+ ++++.++.+... .......++..+.+..+.+| T Consensus 624 ~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~ 692 (714) T protein:vir:99 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhh Confidence 1111100000 000 00 011111110000 00000111111111122222 No 38 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.72 E-value=1.1e-14 Score=97.21 Aligned_cols=535 Identities=12% Similarity=0.089 Sum_probs=247.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~ 71 (559) |++.. .+...+.+.....+.. .|.....+-.+|.. ....+.+....-. +....+.-+.....++...+. 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD--GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 55542 2223334444444432 34444446666653 1111111111000 111222233334444443332 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEeecCCc-eE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVLEDDED-II 148 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~~~~~~-~~ 148 (559) .- .+++=+++.+.+.+.... +-.+.++..+......+++..+...++.+.+..|-|.+ ++++|+.+ .+ T Consensus 86 ~~-----~nr~~~~v~p~~~~~~~~----~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:27 86 EA-----KTRTDLVVMSDEPDDETE----KLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HH-----hCCcceEEecCCCCchhH----HHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 22 255556666643322211 11222444555566678888899999999999888864 45555544 48 Q ss_pred EEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHHH---h--------------------- Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSMW---E--------------------- 200 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---~--------------------- 200 (559) +++++|..+++++.++.. .+ .-+||...+|..++...||+.+ +.+.... . 
T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:27 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhhhhhccccccccccccccccccchhh Confidence 999999999999886532 22 2378999999999999998632 1111110 0 Q ss_pred -----cC------CCCceEEEEEEEeecCcccc---------ccccc----------------ccccEEEEEEEecCCCc Q lcl|NC_019445. 201 -----SG------TYEKWIEVMHSVYPNIDRDT---------SKLDS----------------KNKPFKSVYYEVGGDND 244 (559) Q Consensus 201 -----~~------~~~~~v~v~~~v~p~~~~~~---------~~~~~----------------~~~~~~sv~~~~~~~~~ 244 (559) .. ++..+|.|+.+-+......+ -.++. ...+...+|+..- .+. T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~ 313 (714) T protein:vir:27 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGP 313 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecC Confidence 00 01134555555333221000 00000 0011111111110 123 Q ss_pred eeeee--cCc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc--c Q lcl|NC_019445. 245 KLLRE--SGF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR--A 316 (559) Q Consensus 245 ~il~e--sg~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~--~ 316 (559) ++|.+ ++| ..|||+++-... ..|..|| + +..+.+-++.+|+.....+.+ +..+-++..++.+.... + T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:27 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--h-hhhchhHHHHHHHHHHHHHHh--hcCCceeeecCcccccHHHH Confidence 46654 344 469998765443 5678886 4 778899999999876665554 45666676655543211 1 Q ss_pred ---eecCCceeecCCcC--Cc---hhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 
317 ---SLLPGDITYIDQIT--GQ---DGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 317 ---~~~pg~~~~~~~~~--~~---~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) -..||+++...... +. ..+++..... --....+.++...+.|++.- .+|. +++.. +..++-.-|..| T Consensus 389 ~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGv~~~--~lG~~-~na~SGvAi~~r 464 (714) T protein:vir:27 389 MEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQ-VASQQFQVMQESEKLIQDTMGVYSA--FLGQD-SGATSGVAISNL 464 (714) T ss_pred HHhccCCCCceeecccccccCCCCccccccCCCC-ccHHHHHHHHHHHHHHHHhhCCChH--HcCCC-ccchhHHHHHHH Confidence 26788888774221 11 1233322111 11222334444455554443 1111 12222 223555669999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------CCCCC------C---chh--------hCCcceEEEe Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------NMLPP------P---PDA--------MEGMPLKVEY 440 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g~lp~------~---p~~--------l~g~~v~~~~ 440 (559) ++.....|+..+.+|..- ..=+-+..++++.+. |.-.+ + |.. +....+.|.+ T Consensus 465 q~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i 543 (714) T protein:vir:27 465 VEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEE Confidence 999999999999888653 333344444444331 10000 0 000 0011233444 Q ss_pred -ecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChh-h---HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHH Q lcl|NC_019445. 441 -ISVMAQAQKSIGLSSLASTVNFIGQLAQ-AKPE-A---LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQ 512 (559) Q Consensus 441 -is~La~a~r~~~~~~l~~~~~~~~~la~-~~P~-~---~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q 512 (559) .+|-...+|.+..+.+.++++.+....+ +.++ + +|.-+.+++++.+-..+|.+. 
.....+++.++..++..+ T Consensus 544 ~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~ 623 (714) T protein:vir:27 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQ 623 (714) T ss_pred eeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHH Confidence 2344456677777777777765432211 1222 2 233467899999999998753 333333332222222222 Q ss_pred HHHHHHHHHH-------HHH-----HH-------HHHhhhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAM-------GMA-----AA-------QGAKTLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~-------~~~-----~~-------~~a~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) ++|.+.+++. +.+ .+ ++++.++.+... .......++..+.+..+.+| T Consensus 624 ~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~ 692 (714) T protein:vir:27 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhh Confidence 1111100000 000 00 011111110000 00000111111111122222 No 39 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.72 E-value=1.1e-14 Score=97.21 Aligned_cols=535 Identities=12% Similarity=0.089 Sum_probs=247.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~ 71 (559) |++.. .+...+.+.....+.. .|.....+-.+|.. ....+.+....-. +....+.-+.....++...+. 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD--GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 55542 2223334444444432 34444446666653 1111111111000 111222233334444443332 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEeecCCc-eE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVLEDDED-II 148 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~~~~~~-~~ 148 (559) .- .+++=+++.+.+.+.... +-.+.++..+......+++..+...++.+.+..|-|.+ ++++|+.+ .+ T Consensus 86 ~~-----~nr~~~~v~p~~~~~~~~----~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:10 86 EA-----KTRTDLVVMSDEPDDETE----KLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HH-----hCCcceEEecCCCCchhH----HHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 22 255556666643322211 11222444555566678888899999999999888864 45555544 48 Q ss_pred EEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHHH---h--------------------- Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSMW---E--------------------- 200 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---~--------------------- 200 (559) +++++|..+++++.++.. .+ .-+||...+|..++...||+.+ +.+.... . 
T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:10 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhhhhhccccccccccccccccccchhh Confidence 999999999999886532 22 2378999999999999998632 1111110 0 Q ss_pred -----cC------CCCceEEEEEEEeecCcccc---------ccccc----------------ccccEEEEEEEecCCCc Q lcl|NC_019445. 201 -----SG------TYEKWIEVMHSVYPNIDRDT---------SKLDS----------------KNKPFKSVYYEVGGDND 244 (559) Q Consensus 201 -----~~------~~~~~v~v~~~v~p~~~~~~---------~~~~~----------------~~~~~~sv~~~~~~~~~ 244 (559) .. ++..+|.|+.+-+......+ -.++. ...+...+|+..- .+. T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~ 313 (714) T protein:vir:10 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGP 313 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecC Confidence 00 01134555555333221000 00000 0011111111110 123 Q ss_pred eeeee--cCc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc--c Q lcl|NC_019445. 245 KLLRE--SGF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR--A 316 (559) Q Consensus 245 ~il~e--sg~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~--~ 316 (559) ++|.+ ++| ..|||+++-... ..|..|| + +..+.+-++.+|+.....+.+ +..+-++..++.+.... + T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--h-hhhchhHHHHHHHHHHHHHHh--hcCCceeeecCcccccHHHH Confidence 46654 344 469998765443 5678886 4 778899999999876665554 45666676655543211 1 Q ss_pred ---eecCCceeecCCcC--Cc---hhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 
317 ---SLLPGDITYIDQIT--GQ---DGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 317 ---~~~pg~~~~~~~~~--~~---~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) -..||+++...... +. ..+++..... --....+.++...+.|++.- .+|. +++.. +..++-.-|..| T Consensus 389 ~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGv~~~--~lG~~-~na~SGvAi~~r 464 (714) T protein:vir:10 389 MEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQ-VASQQFQVMQESEKLIQDTMGVYSA--FLGQD-SGATSGVAISNL 464 (714) T ss_pred HHhccCCCCceeecccccccCCCCccccccCCCC-ccHHHHHHHHHHHHHHHHhhCCChH--HcCCC-ccchhHHHHHHH Confidence 26788888774221 11 1233322111 11222334444455554443 1111 12222 223555669999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------CCCCC------C---chh--------hCCcceEEEe Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------NMLPP------P---PDA--------MEGMPLKVEY 440 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g~lp~------~---p~~--------l~g~~v~~~~ 440 (559) ++.....|+..+.+|..- ..=+-+..++++.+. |.-.+ + |.. +....+.|.+ T Consensus 465 q~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i 543 (714) T protein:vir:10 465 VEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEE Confidence 999999999999888653 333344444444331 10000 0 000 0011233444 Q ss_pred -ecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChh-h---HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHH Q lcl|NC_019445. 441 -ISVMAQAQKSIGLSSLASTVNFIGQLAQ-AKPE-A---LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQ 512 (559) Q Consensus 441 -is~La~a~r~~~~~~l~~~~~~~~~la~-~~P~-~---~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q 512 (559) .+|-...+|.+..+.+.++++.+....+ +.++ + +|.-+.+++++.+-..+|.+. 
.....+++.++..++..+ T Consensus 544 ~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~ 623 (714) T protein:vir:10 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQ 623 (714) T ss_pred eeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHH Confidence 2344456677777777777765432211 1222 2 233467899999999998753 333333332222222222 Q ss_pred HHHHHHHHHH-------HHH-----HH-------HHHhhhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAM-------GMA-----AA-------QGAKTLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~-------~~~-----~~-------~~a~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) ++|.+.+++. +.+ .+ ++++.++.+... .......++..+.+..+.+| T Consensus 624 ~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~ 692 (714) T protein:vir:10 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhh Confidence 1111100000 000 00 011111110000 00000111111111122222 No 40 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.72 E-value=1.1e-14 Score=97.21 Aligned_cols=535 Identities=12% Similarity=0.089 Sum_probs=247.1 Q ss_pred CChhh-----HHHHHHHHHHHHHHhh---hHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT-----KERLNKQFAQLESERQ---SFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~-----~~~l~~r~~~l~~~R~---~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~ 71 (559) |++.. .+...+.+.....+.. .|.....+-.+|.. ....+.+....-. +....+.-+.....++...+. 
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYD--GDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 55542 2223334444444432 34444446666653 1111111111000 111222233334444443332 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEeecCCc-eE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVLEDDED-II 148 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~~~~~~-~~ 148 (559) .- .+++=+++.+.+.+.... +-.+.++..+......+++..+...++.+.+..|-|.+ ++++|+.+ .+ T Consensus 86 ~~-----~nr~~~~v~p~~~~~~~~----~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i 156 (714) T protein:vir:32 86 EA-----KTRTDLVVMSDEPDDETE----KLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEF 156 (714) T ss_pred HH-----hCCcceEEecCCCCchhH----HHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCe Confidence 22 255556666643322211 11222444555566678888899999999999888864 45555544 48 Q ss_pred EEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHHH---h--------------------- Q lcl|NC_019445. 149 RTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSMW---E--------------------- 200 (559) Q Consensus 149 ~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---~--------------------- 200 (559) +++++|..+++++.++.. .+ .-+||...+|..++...||+.+ +.+.... . 
T Consensus 157 ~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a--~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:32 157 KVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred EEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCch--hhhhhhhhhhccccccccccccccccccchhh Confidence 999999999999886532 22 2378999999999999998632 1111110 0 Q ss_pred -----cC------CCCceEEEEEEEeecCcccc---------ccccc----------------ccccEEEEEEEecCCCc Q lcl|NC_019445. 201 -----SG------TYEKWIEVMHSVYPNIDRDT---------SKLDS----------------KNKPFKSVYYEVGGDND 244 (559) Q Consensus 201 -----~~------~~~~~v~v~~~v~p~~~~~~---------~~~~~----------------~~~~~~sv~~~~~~~~~ 244 (559) .. ++..+|.|+.+-+......+ -.++. ...+...+|+..- .+. T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~-~g~ 313 (714) T protein:vir:32 235 YQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF-VGP 313 (714) T ss_pred hccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE-ecC Confidence 00 01134555555333221000 00000 0011111111110 123 Q ss_pred eeeee--cCc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcccc--c Q lcl|NC_019445. 245 KLLRE--SGF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR--A 316 (559) Q Consensus 245 ~il~e--sg~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~~--~ 316 (559) ++|.+ ++| ..|||+++-... ..|..|| + +..+.+-++.+|+.....+.+ +..+-++..++.+.... + T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:32 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--h-hhhchhHHHHHHHHHHHHHHh--hcCCceeeecCcccccHHHH Confidence 46654 344 469998765443 5678886 4 778899999999876665554 45666676655543211 1 Q ss_pred ---eecCCceeecCCcC--Cc---hhhhhhhhccccHHHHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 
317 ---SLLPGDITYIDQIT--GQ---DGFRPAYLVNPSTADLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 317 ---~~~pg~~~~~~~~~--~~---~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) -..||+++...... +. ..+++..... --....+.++...+.|++.- .+|. +++.. +..++-.-|..| T Consensus 389 ~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGv~~~--~lG~~-~na~SGvAi~~r 464 (714) T protein:vir:32 389 MEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQ-VASQQFQVMQESEKLIQDTMGVYSA--FLGQD-SGATSGVAISNL 464 (714) T ss_pred HHhccCCCCceeecccccccCCCCccccccCCCC-ccHHHHHHHHHHHHHHHHhhCCChH--HcCCC-ccchhHHHHHHH Confidence 26788888774221 11 1233322111 11222334444455554443 1111 12222 223555669999 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc----------CCCCC------C---chh--------hCCcceEEEe Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK----------NMLPP------P---PDA--------MEGMPLKVEY 440 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~----------g~lp~------~---p~~--------l~g~~v~~~~ 440 (559) ++.....|+..+.+|..- ..=+-+..++++.+. |.-.+ + |.. +....+.|.+ T Consensus 465 q~qg~~~l~~~~Dnl~~~-~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i 543 (714) T protein:vir:32 465 VEQGATTLAEINDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEE Confidence 999999999999888653 333344444444331 10000 0 000 0011233444 Q ss_pred -ecHHHHHHHHHHHHHHHHHHHHHHHHhc-cChh-h---HhcCCHHHHHHHHHHHcCCCc--cccCCHHHHHHHHHHHHH Q lcl|NC_019445. 441 -ISVMAQAQKSIGLSSLASTVNFIGQLAQ-AKPE-A---LDKLNVDQAIDAFADMSGVSP--TVIVPQEQVDQARQQRAQ 512 (559) Q Consensus 441 -is~La~a~r~~~~~~l~~~~~~~~~la~-~~P~-~---~~~id~d~~~~~~a~~~Gvp~--~~~rs~~ev~~~rq~r~q 512 (559) .+|-...+|.+..+.+.++++.+....+ +.++ + +|.-+.+++++.+-..+|.+. 
.....+++.++..++..+ T Consensus 544 ~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~ 623 (714) T protein:vir:32 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQ 623 (714) T ss_pred eeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHH Confidence 2344456677777777777765432211 1222 2 233467899999999998753 333333332222222222 Q ss_pred HHHHHHHHHH-------HHH-----HH-------HHHhhhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAM-------GMA-----AA-------QGAKTLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~-------~~~-----~~-------~~a~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) ++|.+.+++. +.+ .+ ++++.++.+... .......++..+.+..+.+| T Consensus 624 ~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~ 692 (714) T protein:vir:32 624 QQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhh Confidence 1111100000 000 00 011111110000 00000111111111122222 No 41 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.72 E-value=1.2e-14 Score=96.91 Aligned_cols=521 Identities=12% Similarity=0.052 Sum_probs=243.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCc-ccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRN-DRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) ++.+.-..+..+|..-...-..|.....+-.+|.. ....+.+....- .+....+.-+.....++...+..- . 
T Consensus 18 ~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~--G~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~-----~ 90 (772) T protein:vir:10 18 GDTPLTVDEYADINYEIEDQPAWRAVADKEMDYAD--GNQLDTELLRRQQALGIPPAVEDLIGPALLSLQGYEA-----V 90 (772) T ss_pred cccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHHHH-----h Confidence 22222122333444444444466666667666753 111111111000 011122333334444444433222 2 Q ss_pred CCcceeccCCcc-chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE--eecCCc-eEEEEEeec Q lcl|NC_019445. 80 ARPWFRLATPDP-EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV--LEDDED-IIRTMPFPI 155 (559) Q Consensus 80 ~~~Wf~l~~~d~-~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v--~~~~~~-~~~~~~~~l 155 (559) +++=+++.+.++ ...+..++ ++..+......+++..+...++.+.+..|-|.+-+ ++|+.+ .+++..++. T Consensus 91 nr~d~~v~Pr~~~~d~~~Ae~------l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~~v~p 164 (772) T protein:vir:10 91 TRTDWRVTPNGDVGGQEVADA------LNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRCRPIRR 164 (772) T ss_pred cCcceEEecCCCchHHHHHHH------HHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEEEeeCc Confidence 555566666432 22222222 44445555667888999999999999988876543 344433 488999999 Q ss_pred cEEEEeeCCCCCEEE---EEEEEeecHHHHHHhcCcccCCHHHHHHHhc------------------------------- Q lcl|NC_019445. 156 GSYYLANSPRGSVDI---CFRKFSMTVRQLVQEFGLNNVSESVKSMWES------------------------------- 201 (559) Q Consensus 156 ~~~~v~~d~~G~vd~---i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~------------------------------- 201 (559) .+++++.++...... +||...|+.+++..+||+.+ +.+....+. 
T Consensus 165 ~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (772) T protein:vir:10 165 DEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHA--ELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAW 242 (772) T ss_pred ccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCch--hHHHhhhhhcccccCcccccccccccccccccccchhhcc Confidence 999999987554433 78999999999999999643 111110000 Q ss_pred --------CCCCceEEEEEEEeecCcccc------cc---cc----------------cccccEEEEEEEecCCCceeee Q lcl|NC_019445. 202 --------GTYEKWIEVMHSVYPNIDRDT------SK---LD----------------SKNKPFKSVYYEVGGDNDKLLR 248 (559) Q Consensus 202 --------~~~~~~v~v~~~v~p~~~~~~------~~---~~----------------~~~~~~~sv~~~~~~~~~~il~ 248 (559) ..+.++|.|+++-|.++.... ++ ++ .+...-.-|||..-. +.++|+ T Consensus 243 ~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~-g~~~L~ 321 (772) T protein:vir:10 243 TVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWL-GPHCLH 321 (772) T ss_pred ccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEe-cceeec Confidence 011256888877554432111 00 00 000011112332211 235675 Q ss_pred e--cCc--ccCCeEEEEee--ecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccc------cc Q lcl|NC_019445. 249 E--SGF--DEFPIMAPRWE--VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQ------RA 316 (559) Q Consensus 249 e--sg~--~~~P~~~~rw~--~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~------~~ 316 (559) . ++| ..|||++.-.. ...|..|| + +..+.+-++.+|+.....+..+.... ++...+.... .- T Consensus 322 ~~~~p~~~~~fP~vP~~g~r~~~~g~~~G--~-vr~~kd~Qr~~N~~~S~~~~~l~~~~---~~~~~gav~~~d~~~~e~ 395 (772) T protein:vir:10 322 DGPTPYTHRHFPYVPFFGFREDATGIPYG--Y-VRGMKYAQDSLNSGVSKLRWGMSVAR---VERTKGAVAMTDAQFRRQ 395 (772) T ss_pred cCCCCCCCCccceEEEeeeEeccCCcccc--h-hhhhhhHHHHHHHHHHHHHHHHhccc---ccccCCCccchhHHHHHh Confidence 4 445 46999876433 35667774 4 78888999999998777776544332 2332222111 12 Q ss_pred eecCCceeecCCcCCc---hhhhhhhhccccH-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHH Q lcl|NC_019445. 
317 SLLPGDITYIDQITGQ---DGFRPAYLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL 392 (559) Q Consensus 317 ~~~pg~~~~~~~~~~~---~~~~p~~~~~~~~-~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~ 392 (559) -..|++++.++..... ..+++ ..++.+ ....+.++...+.|.+.--..- .+++.. +..++..-|..|++... T Consensus 396 ~arp~~vi~~~~~~~~~~~~~~~~--~~~~~~~~~~~~llq~~~~~i~~vsGv~~-~~lG~~-~na~SGvAi~~rq~qg~ 471 (772) T protein:vir:10 396 IARPDADIVLDENHMAKPGARFDV--KRDYTLTDQHFQMLQDNRATIERVSNITA-GFQGRK-GTATSGIQEQQQIEQSN 471 (772) T ss_pred ccCCCCeEEeCCccccCCCCCccc--cCCccccHHHHHHHHHHHHHHHHHhCCCH-HHcCCC-cchhhHHHHHHHHHHHH Confidence 2468888777532111 11221 112222 2233444555555655431111 122333 33467777999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHhc-------------CCCCC----C--c--h----------hhCCcceEEEe- Q lcl|NC_019445. 393 LMLGPVLERLNDECLNPLIDRAFSMMVRK-------------NMLPP----P--P--D----------AMEGMPLKVEY- 440 (559) Q Consensus 393 ~~LG~v~~~l~~E~l~Pli~r~~~il~r~-------------g~lp~----~--p--~----------~l~g~~v~~~~- 440 (559) ..++..+.+|.. ...=+-+.+++++.+. +.-++ + + + .|....+.|.+ T Consensus 472 ~~l~~~~Dnl~~-~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~ 550 (772) T protein:vir:10 472 QSIGRIMDNFRA-GRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALE 550 (772) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEee Confidence 999999988865 3333444455554441 11000 0 0 0 01111123333 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh-----------HhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHH-H Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA-----------LDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQAR-Q 508 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~-----------~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~r-q 508 (559) ..|....+|.+..+.+.+++.. ++|++ +|.-+.+++++.+-...+-+ ++++.++.. 
+ T Consensus 551 ~~p~~~t~r~~~~~~m~ql~~~------~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~-----~peq~~~~~~q 619 (772) T protein:vir:10 551 DVPSTNSYRGQQLNAMSEAVKS------MPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQ-----TPEQIQQQIDQ 619 (772) T ss_pred ccccchHHHHHHHHHHHHHHhc------cChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccC-----ChHHHHHHHHH Confidence 3355556777777666666532 23332 22234557777777766542 222222221 1 Q ss_pred HHHHHHHHHHH-HHHHHH-------HHHHHhhhhhhcCC-----------------Ch---hHHHHHHHHhhcC----CC Q lcl|NC_019445. 509 QRAQQQQQQQM-MAMGMA-------AAQGAKTLSEAKTS-----------------DP---SVLSAMANAVSGQ----GG 556 (559) Q Consensus 509 ~r~q~~q~~~~-~~~~~~-------~~~~a~~~~~~~~~-----------------~~---~~~~~~~~~~~~~----~~ 556 (559) +.+++++++++ ++..+. .+++.+..+.+... .+ .....++..++-. ++ T Consensus 620 ~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~g~~~~~~~~ 699 (772) T protein:vir:10 620 AVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSAGYQRPNPAG 699 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhcccccccccc Confidence 11111111110 000000 01111111111000 00 1111122111110 00 Q ss_pred CCC Q lcl|NC_019445. 557 QSQ 559 (559) Q Consensus 557 ~~~ 559 (559) .+. T Consensus 700 ~~~ 702 (772) T protein:vir:10 700 DDP 702 (772) T ss_pred cCC Confidence 000 No 42 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.71 E-value=1.1e-15 Score=102.58 Aligned_cols=542 Identities=14% Similarity=0.099 Sum_probs=238.1 Q ss_pred CChhhHHHH---HHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccc-----cCCCCcchHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAETTKERL---NKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRR-----NTRIIDSTGTMAARTLASGM 72 (559) Q Consensus 1 M~~~~~~~l---~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~-----~~~~~~s~~~~a~~~Las~l 72 (559) |++.+.+.+ ..+|.......+.|...+.+=.+|..=.....+.++...-+.+ ...+.-+.....++...+.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 999865544 4555555555555555555555554210111111111000000 11122233444444444322 Q ss_pred HHhhcCCCCcceeccCCccc-hhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee--------- Q lcl|NC_019445. 73 MSGITSPARPWFRLATPDPE-MMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE--------- 142 (559) Q Consensus 73 ~~~l~pp~~~Wf~l~~~d~~-~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~--------- 142 (559) . .+++=+++.+.+++ ..+.. +.++..+......++...+...++.+.++.|-|.+-+.. T Consensus 81 ~-----~nr~d~~v~P~~~~~d~~~A------e~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~ 149 (708) T protein:vir:10 81 R-----NNRITVKFRPGDREASEELA------NKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPM 149 (708) T ss_pred H-----hCCcceEEEcCCCCchHHHH------HHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCC Confidence 2 25666677765443 12222 224444555556788888999999999999999775522 Q ss_pred cCCceEEEEE--eeccEEEEeeCCC---CC-EEEEEEEEeecHHHHHHhcCcccCCH-HHHHHHhcCC---CCceEEEEE Q lcl|NC_019445. 143 DDEDIIRTMP--FPIGSYYLANSPR---GS-VDICFRKFSMTVRQLVQEFGLNNVSE-SVKSMWESGT---YEKWIEVMH 212 (559) Q Consensus 143 ~~~~~~~~~~--~~l~~~~v~~d~~---G~-vd~i~r~~~~t~~ql~~~fg~~~l~~-~v~~~~~~~~---~~~~v~v~~ 212 (559) +...++.+++ .|..+++++.++. .. -.-+||...|+..++..+||+.+-.. ++........ 
....+.|.+ T Consensus 150 ~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~e 229 (708) T protein:vir:10 150 DDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAK 229 (708) T ss_pred CCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEE Confidence 1223344433 3556777776542 21 12367889999999999999653211 1110000000 001122111 Q ss_pred E-----------EeecCc-cccccccc------------------ccccEEE--EEEEecCCCceeeee---cCcccCCe Q lcl|NC_019445. 213 S-----------VYPNID-RDTSKLDS------------------KNKPFKS--VYYEVGGDNDKLLRE---SGFDEFPI 257 (559) Q Consensus 213 ~-----------v~p~~~-~~~~~~~~------------------~~~~~~s--v~~~~~~~~~~il~e---sg~~~~P~ 257 (559) . +++++. .....++. ...+..+ |||..-. +.+++.. ++|..||| T Consensus 230 y~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~-g~~~le~~~~~p~~~fP~ 308 (708) T protein:vir:10 230 YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVD-GDGFLEKPRRIPGEHIPL 308 (708) T ss_pred eeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeec-chhhhccCCCCCCCceee Confidence 1 111110 00000000 0011111 3333322 2345533 56788999 Q ss_pred EEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccc------cceecCCce---eec Q lcl|NC_019445. 258 MAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQ------RASLLPGDI---TYI 326 (559) Q Consensus 258 ~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~------~~~~~pg~~---~~~ 326 (559) +++-+.. .+|..++.|+ +..+.+-++.+|+.....+..+.++-+.++++..+.+.. ..+...... +.. 
T Consensus 309 vP~~g~r~~~d~~~~~yG~-vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 387 (708) T protein:vir:10 309 IPVYGKRWFIDDIERVEGH-IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREV 387 (708) T ss_pred EEEeeeeeccCCCccccee-ecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccc Confidence 9875433 3666755675 889999999999999999999998888776664432210 011110000 000 Q ss_pred CCcCCc---hhhhhhhhccccHH-HHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_019445. 327 DQITGQ---DGFRPAYLVNPSTA-DLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERL 402 (559) Q Consensus 327 ~~~~~~---~~~~p~~~~~~~~~-~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l 402 (559) .+..|. ....+.....+.+. ...+.++...+.|.+..-... .++++. ..++..-|..|++.....++..+.+| T Consensus 388 ~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~-~~lG~~--sn~SG~aI~~rq~qg~~~l~~~~Dnl 464 (708) T protein:vir:10 388 RDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQ-AMQQMP--SNIAQETVNNLMNRADMASFIYLDNM 464 (708) T ss_pred cccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcCh-hHccCc--cchHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 00111111111221 122334455555555532111 123332 23577789999999999999999988 Q ss_pred HHHHHHHHHHHHHHHHHh-------------cCC-----C-----CCCch------hhCCcceEEEe-ecHHHHHHHHHH Q lcl|NC_019445. 403 NDECLNPLIDRAFSMMVR-------------KNM-----L-----PPPPD------AMEGMPLKVEY-ISVMAQAQKSIG 452 (559) Q Consensus 403 ~~E~l~Pli~r~~~il~r-------------~g~-----l-----p~~p~------~l~g~~v~~~~-is~La~a~r~~~ 452 (559) .. ...=+-+..+++..+ .|. + ++-.. .|....+.|.+ .+|-...+|.+. 
T Consensus 465 ~~-~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~ 543 (708) T protein:vir:10 465 AK-SLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDAT 543 (708) T ss_pred HH-HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHH Confidence 64 222233333444333 111 0 00000 00011223433 335555677777 Q ss_pred HHHHHHHHHHHHHHh----ccChhhHhc---CCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHH--HHHHH Q lcl|NC_019445. 453 LSSLASTVNFIGQLA----QAKPEALDK---LNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQ--MMAMG 523 (559) Q Consensus 453 ~~~l~~~~~~~~~la----~~~P~~~~~---id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~--~~~~~ 523 (559) .+.+.++++.+.... .+.+-+++. -+.++++..+-..++.+...--..+|.+++.++.++++++++ ++.++ T Consensus 544 ~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~ 623 (708) T protein:vir:10 544 VSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA 623 (708) T ss_pred HHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 777777776554311 112223333 455677777777665542221112222222222221111111 11000 Q ss_pred HH-----HHHHHhhhhhh--------------cCC----------ChhHH----HHHHHHhhcCCCCCC Q lcl|NC_019445. 524 MA-----AAQGAKTLSEA--------------KTS----------DPSVL----SAMANAVSGQGGQSQ 559 (559) Q Consensus 524 ~~-----~~~~a~~~~~~--------------~~~----------~~~~~----~~~~~~~~~~~~~~~ 559 (559) ++ .++..+.-+++ ... .+++. 
...+.++....+-+| T Consensus 624 qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l~~~q~~q~ 692 (708) T protein:vir:10 624 QAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQ 692 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHH Confidence 00 01110110000 000 00000 000111111111111 No 43 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.70 E-value=9.6e-16 Score=102.91 Aligned_cols=531 Identities=14% Similarity=0.125 Sum_probs=226.7 Q ss_pred CChhhHHHHHHHHHHHHHH---hhhHHHHHHHHHHHhccccCCCCCCCCCCccc-----ccCCCCcchHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESE---RQSFEPHWRELSDYINPRGSRFLTSEVNRNDR-----RNTRIIDSTGTMAARTLASGM 72 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~---R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~-----~~~~~~~s~~~~a~~~Las~l 72 (559) |+|...+.+.+....++.. .+.|...+++-.+|..-.....+......-+. ....+.-+.....++...+.. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 9997665555444444444 44555555555555431111111111110000 011222333344444433322 Q ss_pred HHhhcCCCCcceeccCCcc-chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEe------ec-- Q lcl|NC_019445. 73 MSGITSPARPWFRLATPDP-EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVL------ED-- 143 (559) Q Consensus 73 ~~~l~pp~~~Wf~l~~~d~-~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~------~~-- 143 (559) - .+++=+++.+.++ ...+..+ .++..+......++...+...++.+.+..|.|.+=+. 
.+ T Consensus 81 ----~-~nr~~~~v~P~~~~~d~~~Ae------~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~ 149 (706) T protein:vir:10 81 ----R-NNRISVKFRPGDNAASEELAN------KLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPM 149 (706) T ss_pred ----H-hCCCceEEecCCCCchHHHHH------HHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCC Confidence 2 2444466665432 2222222 2455555566688999999999999999999976431 11 Q ss_pred -CCceEEEEE--eeccEEEEeeCC---CCC-EEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcC--------C----- Q lcl|NC_019445. 144 -DEDIIRTMP--FPIGSYYLANSP---RGS-VDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESG--------T----- 203 (559) Q Consensus 144 -~~~~~~~~~--~~l~~~~v~~d~---~G~-vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~--------~----- 203 (559) ....+.++. .|+++++++.++ ++. ---+||...|+.+++..+||+.+. ++-+....+ + T Consensus 150 ~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~--~~~~~~~~~~~~d~~~~d~~~~~ 227 (706) T protein:vir:10 150 DERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPT--SLDRVGSVSWQYDWFTPDVVYIA 227 (706) T ss_pred CCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChh--hhhhhccccccccccCCCcceec Confidence 112334433 366788887764 332 123789999999999999997532 111110000 0 Q ss_pred --CCceEE-EEEEEeecCcc------ccc------------c---cccccccEEEEEEEecCCCceeeee-cCc--ccCC Q lcl|NC_019445. 204 --YEKWIE-VMHSVYPNIDR------DTS------------K---LDSKNKPFKSVYYEVGGDNDKLLRE-SGF--DEFP 256 (559) Q Consensus 204 --~~~~v~-v~~~v~p~~~~------~~~------------~---~~~~~~~~~sv~~~~~~~~~~il~e-sg~--~~~P 256 (559) +..+.. +..+.+.+.-. ..+ + +..+.++-..|||..-.+ ..++.. 
+.| +.|| T Consensus 228 eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g-~~~l~~~~p~~~~~~P 306 (706) T protein:vir:10 228 KYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDG-DGFLEKPRRIPGEHIP 306 (706) T ss_pred ccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecc-ccccccCCCCCCCccc Confidence 000001 11111222100 000 0 001122333456654433 345533 444 7789 Q ss_pred eEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee-ecCCCcc----------c--------c Q lcl|NC_019445. 257 IMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV-APTSLKN----------Q--------R 315 (559) Q Consensus 257 ~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~-~p~~~~~----------~--------~ 315 (559) |+++-..+ .++.....|+ +..+.+-++.+|+....++..+.+.-+-+.. .+++... . . T Consensus 307 ~vP~~g~r~~~d~~~~~~G~-vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~ 385 (706) T protein:vir:10 307 LIPVYGKRWFIDDVERVEGH-IAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRT 385 (706) T ss_pred eEEEeeccccccccCcccce-eccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhccc Confidence 98753322 2444444454 7789999999999988888877665553333 2232110 0 0 Q ss_pred ceecCCceeecCCcCCchhhhhhhhccccHH-HHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHHHHHHHH Q lcl|NC_019445. 316 ASLLPGDITYIDQITGQDGFRPAYLVNPSTA-DLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEMKEEKLL 393 (559) Q Consensus 316 ~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~-~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~ 393 (559) ....+|.++.... ++.+...+.+. ...+.++.....|.+.- .+| .+++..+ .++.--|..|++.... 
T Consensus 386 ~~~~~g~i~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~--~~lG~~s--n~SG~Ai~~rq~qg~~ 454 (706) T protein:vir:10 386 VTDKTGNVVAPAN-------VAGYTQAPVLNQALAALLQQTSADIQEVTGSSQ--AMQQMPS--NVARETVNSLLNRSDM 454 (706) T ss_pred ccCCCCccccccc-------ccccCCCcchHHHHHHHHHHHHHHHHHHhCCCH--HHcCCcc--chHHHHHHHHHHHHHH Confidence 1111222221111 00111111222 12233444455555543 121 1223222 2577778999999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHh-------------cCCC--CCC----chhhCC----------cceEEEee-cH Q lcl|NC_019445. 394 MLGPVLERLNDECLNPLIDRAFSMMVR-------------KNML--PPP----PDAMEG----------MPLKVEYI-SV 443 (559) Q Consensus 394 ~LG~v~~~l~~E~l~Pli~r~~~il~r-------------~g~l--p~~----p~~l~g----------~~v~~~~i-s~ 443 (559) .+...+.+|..- ..=+-+.+++++.+ .|.- +.+ .+...| ..+.|.+. +| T Consensus 455 ~~~~~~Dnl~~~-~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p 533 (706) T protein:vir:10 455 ASFIYLDNMAKS-LKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGP 533 (706) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEeccc Confidence 999988777542 22222333333332 2210 000 001111 12344443 34 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccC----hhhHhcCCH---HHHHHHHHHHcCCCccccCCH-HHHHHHHHHHHHHHH Q lcl|NC_019445. 444 MAQAQKSIGLSSLASTVNFIGQLAQAK----PEALDKLNV---DQAIDAFADMSGVSPTVIVPQ-EQVDQARQQRAQQQQ 515 (559) Q Consensus 444 La~a~r~~~~~~l~~~~~~~~~la~~~----P~~~~~id~---d~~~~~~a~~~Gvp~~~~rs~-~ev~~~rq~r~q~~q 515 (559) -...+|.+..+.+.++++.+....++- +-+++..|+ ++++..+-..++.. ...... 
++.+++.++.+|+++ T Consensus 534 ~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q-~~~~~~~~~eq~~~~q~qq~q~ 612 (706) T protein:vir:10 534 SYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQ-GIVKPRNQQEQAIVQQAQQAQA 612 (706) T ss_pred CcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhccc-CCccccchhHHHHHHHHHHHHH Confidence 555668888777777776443221112 223444443 44555554444432 222222 111222221111111 Q ss_pred HHHHHHHHHHH-------HHHHhh--------------hhhhcCC-----------Ch-------hHHHHHHHHhhcCCC Q lcl|NC_019445. 516 QQQMMAMGMAA-------AQGAKT--------------LSEAKTS-----------DP-------SVLSAMANAVSGQGG 556 (559) Q Consensus 516 ~~~~~~~~~~~-------~~~a~~--------------~~~~~~~-----------~~-------~~~~~~~~~~~~~~~ 556 (559) +++.+++.... ++..|. ..++..+ .+ .+++.+......++. T Consensus 613 ~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~~~~a~~~~~~~~~q~~q~l~~~~a~q~~ 692 (706) T protein:vir:10 613 TQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMETLRLLKEVAASQQQ 692 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 11111110000 000000 0000000 00 000000111111111 Q ss_pred CCC Q lcl|NC_019445. 557 QSQ 559 (559) Q Consensus 557 ~~~ 559 (559) .+- T Consensus 693 ~~~ 695 (706) T protein:vir:10 693 TIP 695 (706) T ss_pred CCC Confidence 111 No 44 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.69 E-value=4.5e-15 Score=99.23 Aligned_cols=536 Identities=11% Similarity=-0.009 Sum_probs=238.2 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |+|.. .+++..+|.........|.....+=.+|.. 
....+......- +...++.-+.....++.+.+..- T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~--G~Qw~~~~~~~l-~~q~rp~~N~i~~~i~~v~g~~~----- 72 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR--VSQWDDWLSQYT-TLQYRGQFDVVRPVVRKLVSEMR----- 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhC--CCCCCHHHHHHH-HhcCCCccccHHHHHHHHHhhHH----- Confidence 99952 345666666666666666666666666653 111111111100 01112211222222333222111 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE-----eecCC-ceEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV-----LEDDE-DIIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v-----~~~~~-~~~~~~~ 152 (559) .+++=+++.+.++...+..++ ++..+......++..-+...++.+.++.|.|.+=+ .+|+. ..+.+.. T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~------l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:77 73 QNPIDVLYRPKDGARPDAADV------LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred hCCcceEEecCCccHHHHHHH------HHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEE Confidence 266666777755433332222 34444445567888999999999999999997633 23322 2233333 Q ss_pred e----eccEEEEeeCCCCC-E-EE--EEEEEeecHH---HHHHhcCcccCCHHHHHHHh---cC-CCCceEEEEEEEeec Q lcl|NC_019445. 153 F----PIGSYYLANSPRGS-V-DI--CFRKFSMTVR---QLVQEFGLNNVSESVKSMWE---SG-TYEKWIEVMHSVYPN 217 (559) Q Consensus 153 ~----~l~~~~v~~d~~G~-v-d~--i~r~~~~t~~---ql~~~fg~~~l~~~v~~~~~---~~-~~~~~v~v~~~v~p~ 217 (559) + |..+++++.++.-. . |. +||...|+.. ++..+||.+.-...-..... 
.+ -....|.|+.+-+.+ T Consensus 147 ~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~ 226 (725) T protein:vir:77 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) T ss_pred eecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEE Confidence 3 45567777665321 1 22 5677788876 45555654321100000000 00 011234444443322 Q ss_pred Ccc-------c--ccc---cc--------------------cccccEEEEEEEecCCCceeeee---cCcccCCeEEEEe Q lcl|NC_019445. 218 IDR-------D--TSK---LD--------------------SKNKPFKSVYYEVGGDNDKLLRE---SGFDEFPIMAPRW 262 (559) Q Consensus 218 ~~~-------~--~~~---~~--------------------~~~~~~~sv~~~~~~~~~~il~e---sg~~~~P~~~~rw 262 (559) ... + .+. ++ .+......|||....+ .+++.. ..++.|||+++-. T Consensus 227 ~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g-~~~l~~~~~~~~~~~P~vP~~g 305 (725) T protein:vir:77 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITC-TAVLKDKQLIAGEHIPIVPVFG 305 (725) T ss_pred EEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecC-ceeeccCCcCCCCccceEEEee Confidence 110 0 000 00 0111223566665433 356644 2336799996532 Q ss_pred --eecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--ccceecCCceeec-----CCcCCch Q lcl|NC_019445. 263 --EVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--QRASLLPGDITYI-----DQITGQD 333 (559) Q Consensus 263 --~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--~~~~~~pg~~~~~-----~~~~~~~ 333 (559) ...+|..|+.|+ +....+-++.+|+.....+..+..+.+.++.+..+... ...-..|+++++. ...+|.. 
T Consensus 306 ~r~~~~g~~~~~G~-vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) T protein:vir:77 306 EWGFVEDKEVYEGV-VRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGDL 384 (725) T ss_pred eeeccCCcccccch-hhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCcc Confidence 357899999995 99999999999999999999999988888777655322 1122234443322 1111211 Q ss_pred hhhhhh-hccccHH-HHHHHHHHHHHHHHHHh-hcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_019445. 334 GFRPAY-LVNPSTA-DLVADIQDTRQIINSAY-FVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPL 410 (559) Q Consensus 334 ~~~p~~-~~~~~~~-~~~~~i~~~~~rI~~af-~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pl 410 (559) ...++. ...+.+. .....++...+.|.+.- .+| .+++..++ .++.--|..|++.....+...+.+|..- ..=+ T Consensus 385 ~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~--~~lG~~~n-~~SG~ai~~rq~qg~~~~~~~~Dnl~~~-~~~~ 460 (725) T protein:vir:77 385 PTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGV--DTEAVNGG-QVAFDTVNQLNMRADLETYVFQDNLATA-MRRD 460 (725) T ss_pred cccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCH--HHhCCCch-hhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 112211 1112222 22334555555555443 111 12223322 3556668888888888888888876542 2222 Q ss_pred HHHHHHHHHh-------------cCCCC-----C----Cc-------hhhCCcceEEEe-ecHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 411 IDRAFSMMVR-------------KNMLP-----P----PP-------DAMEGMPLKVEY-ISVMAQAQKSIGLSSLASTV 460 (559) Q Consensus 411 i~r~~~il~r-------------~g~lp-----~----~p-------~~l~g~~v~~~~-is~La~a~r~~~~~~l~~~~ 460 (559) -+..+++..+ .|... . +. 
-.|.| .+.|.+ ++|-...+|.+.+..+.+++ T Consensus 461 g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g-~~Dv~v~~~p~~~s~r~~~~~~l~qll 539 (725) T protein:vir:77 461 GEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRG-RYECYTDVGPSFQSMKQQNRAEILELL 539 (725) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhcc-ceeeEEeeccchHHHHHHHHHHHHHHH Confidence 2333333322 11100 0 00 01122 345555 34555667877777777777 Q ss_pred HHHHHHhccChhhH----hcCCH---HHHHHHHHHHcCCCccccC--CHHHHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|NC_019445. 461 NFIGQLAQAKPEAL----DKLNV---DQAIDAFADMSGVSPTVIV--PQEQVDQARQQRAQQQQQQQMMAMGMA------ 525 (559) Q Consensus 461 ~~~~~la~~~P~~~----~~id~---d~~~~~~a~~~Gvp~~~~r--s~~ev~~~rq~r~q~~q~~~~~~~~~~------ 525 (559) +.+..++++...++ +..|. +++++.+...... ..... ++++. +.+++.+|++++++..++..+ T Consensus 540 ~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~-~~~~q~~~~~e~-q~~~~~qq~~~~q~~~e~~q~q~~~~~ 617 (725) T protein:vir:77 540 GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ-MGVKKPETPEEQ-QWLVEAQQAKQGQQDPAMVQAQGVLLQ 617 (725) T ss_pred HhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhh-hhccCCCChhhH-HHHHHHHHHHHHhHHHHHHHHHHHHHH Confidence 65544332222222 23343 3444444333221 11111 12221 111111111111111111100 Q ss_pred -HHHHHhhhhhh--------------cCCChhHHHHHHHHhhcCC--------------CCCC Q lcl|NC_019445. 526 -AAQGAKTLSEA--------------KTSDPSVLSAMANAVSGQG--------------GQSQ 559 (559) Q Consensus 526 -~~~~a~~~~~~--------------~~~~~~~~~~~~~~~~~~~--------------~~~~ 559 (559) -++.+++-.++ ........+.+.++...+. 
...+ T Consensus 618 ~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~~~~~~~q~~~~ 680 (725) T protein:vir:77 618 GQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRS 680 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 01111111110 0000000000000000000 0000 No 45 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.69 E-value=4.6e-15 Score=99.17 Aligned_cols=535 Identities=11% Similarity=-0.010 Sum_probs=241.0 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |+|.. .+++..+|.........|.....+-.+|.. ....+......-+ ...++--+.....++.+.+ .-- T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~--G~Qw~~~~~~~l~-~q~rp~~N~i~~~i~~v~g----~e~- 72 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR--ISQWDDWLSQYTT-LQYRGQFDVVRPVVRKLVS----EMR- 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHH-hcCCCcccchHHHHHHHHh----hHH- Confidence 99962 345566666666666666666666666753 1111111111000 1111211111222222222 111 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE-----eecCC-ceEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV-----LEDDE-DIIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v-----~~~~~-~~~~~~~ 152 (559) .+++=+++.+.++...+..++ ++..+......++..-+...++.+.++.|.|.+=| .+|+. ..+.+.. 
T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~------l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:92 73 QNPIDVLYRPKDGASPDAADV------LMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred hCCcceEEecCCccHHHHHHH------HHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEE Confidence 256666777755433333332 34444445567889999999999999999997533 23332 2233333 Q ss_pred e----eccEEEEeeCCCCC-E-EE--EEEEEeecHH---HHHHhcCcccCCHHHHHHHhcCC------CCceEEEEEEEe Q lcl|NC_019445. 153 F----PIGSYYLANSPRGS-V-DI--CFRKFSMTVR---QLVQEFGLNNVSESVKSMWESGT------YEKWIEVMHSVY 215 (559) Q Consensus 153 ~----~l~~~~v~~d~~G~-v-d~--i~r~~~~t~~---ql~~~fg~~~l~~~v~~~~~~~~------~~~~v~v~~~v~ 215 (559) . |+.+++++.++.-. . |. +||...|+.. ++..+||.+.- ++.....-.+ ....|.|+.+.+ T Consensus 147 ~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~d~vrv~e~~~ 224 (725) T protein:vir:92 147 EPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDAD--DIPSFQNPNDWVFPWLTQDTIQIAEFYE 224 (725) T ss_pred eeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchh--hhhhcccCCcccccccCCCeEEEEEEEE Confidence 3 45567776665321 1 22 5677778865 55567775321 1111101011 113455544333 Q ss_pred ecCc---------ccccc---ccc--------------------ccccEEEEEEEecCCCceeeeec---CcccCCeEEE Q lcl|NC_019445. 216 PNID---------RDTSK---LDS--------------------KNKPFKSVYYEVGGDNDKLLRES---GFDEFPIMAP 260 (559) Q Consensus 216 p~~~---------~~~~~---~~~--------------------~~~~~~sv~~~~~~~~~~il~es---g~~~~P~~~~ 260 (559) .+.. ...+. ++. +...-..|||....+ .++|... .++.|||+++ T Consensus 225 r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g-~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:92 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITC-TAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecc-hhhhcCCCCCCCCceeeEEE Confidence 2211 00000 010 001112456654333 3465432 3356999975 Q ss_pred Eee--ecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--ccceecCCceeec-----CCcCC Q lcl|NC_019445. 
261 RWE--VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--QRASLLPGDITYI-----DQITG 331 (559) Q Consensus 261 rw~--~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--~~~~~~pg~~~~~-----~~~~~ 331 (559) -.. ..+|..|+.|+ +....+-++.+|+.....+..+..+.+.++.+..+... ...-..|.+.++. ...+| T Consensus 304 ~g~r~~~~g~~~~~G~-vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGV-VRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) T ss_pred EeeeeccCCcccccce-eccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccc Confidence 423 46889998895 99999999999999999999999998888887654322 1111234443332 11111 Q ss_pred chhhhhhh-hccccHH-HHHHHHHHHHHHHHHHhh-cchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_019445. 332 QDGFRPAY-LVNPSTA-DLVADIQDTRQIINSAYF-VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLN 408 (559) Q Consensus 332 ~~~~~p~~-~~~~~~~-~~~~~i~~~~~rI~~af~-~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~ 408 (559) .....++. ...+.+. .+...++...+.|++..- +| .+++..++ .++.--|..|++.....|+..+.+|..-. . T Consensus 383 ~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~--~~lG~~~n-~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~-~ 458 (725) T protein:vir:92 383 EMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGV--DAEAVNGG-QVAYDTVNQLNMRADLETYVFQDNLATAM-R 458 (725) T ss_pred cccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCH--HHhccCch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 11111111 1112222 333455666666655542 22 12333332 35666788999999999999988776522 2 Q ss_pred HHHHHHHHHHHh-------------cCCC------CCCch----------hhCCcceEEEe-ecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 409 PLIDRAFSMMVR-------------KNML------PPPPD----------AMEGMPLKVEY-ISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 409 Pli~r~~~il~r-------------~g~l------p~~p~----------~l~g~~v~~~~-is~La~a~r~~~~~~l~~ 458 (559) =+-+..++++.+ .|.. 
...++ .|.| .+.+.+ ++|-...+|.+....+.+ T Consensus 459 ~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g-~~Dv~v~~~p~~~s~r~~~~~~l~q 537 (725) T protein:vir:92 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRG-RYECYTDVGPSFQSMKQQNRAEILE 537 (725) T ss_pred HHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhcccc-ceeeEEeeccChHHHHHHHHHHHHH Confidence 233333333332 1110 00111 1112 345555 345566778877777777 Q ss_pred HHHHHHHHhccChhh----HhcCCH---HHHHHHHHHHcCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019445. 459 TVNFIGQLAQAKPEA----LDKLNV---DQAIDAFADMSGVSP-TVIVPQEQVDQARQQRAQQQQQQQMMAMGMA----- 525 (559) Q Consensus 459 ~~~~~~~la~~~P~~----~~~id~---d~~~~~~a~~~Gvp~-~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~----- 525 (559) ++..+.+++++...+ ++..|. +++++.+....+... .--.++++.. ..++++|++++++..+++.. T Consensus 538 l~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q-~~~~~qqa~~~q~~~e~~~~qa~~~ 616 (725) T protein:vir:92 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQ-WLVEAQQAKQGQQDPAMVQAQGVLL 616 (725) T ss_pred HHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhH-HHHHHHHHHHhhhHHHHHHHHHHHH Confidence 776554433222222 233333 444444444332210 0111222221 11111111111111111111 Q ss_pred --HHHHHhhhhh--------------hcCCChhHHHHHHHHhhcC--------------CCCCC Q lcl|NC_019445. 
526 --AAQGAKTLSE--------------AKTSDPSVLSAMANAVSGQ--------------GGQSQ 559 (559) Q Consensus 526 --~~~~a~~~~~--------------~~~~~~~~~~~~~~~~~~~--------------~~~~~ 559 (559) .++.+|.-.+ +........+...++...+ ..+.+ T Consensus 617 ~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~~~~~~~q~~~~ 680 (725) T protein:vir:92 617 QGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFLKTVASFQQDRS 680 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111 0000000000000000000 00000 No 46 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.67 E-value=8.2e-15 Score=97.80 Aligned_cols=539 Identities=12% Similarity=-0.010 Sum_probs=240.0 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |+|.. .+++..+|......-..|.....+-.+|.. ....+......-+ ...++--+.....++.+.+.- - T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~--G~QW~~~~~~~l~-~q~rp~~N~i~~~v~~v~g~e----~- 72 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSR--VSQWDDWLSQYTT-LQYRGQFDVVRPVVRKLVSEM----R- 72 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc--CCCCCHHHHHHHH-hcCCCcccchHHHHHHHHhhH----H- Confidence 99962 334555555555544555555555566653 1111111111000 111221122223333322211 1 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE-----eecCCc-eEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV-----LEDDED-IIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v-----~~~~~~-~~~~~~ 152 (559) .+++=+++.+.++...+..++ ++..+......++..-+-..++.+.++.|-|.+=+ ++|+.+ .+.+.. 
T Consensus 73 ~nr~d~~v~p~~~~d~~~Ae~------l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:10 73 QNPIDVLYRPKDGASPDAADV------LMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred hCCcceEEecCCcchHHHHHH------HHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeee Confidence 255666677655433333332 33344445557888888999999999999997643 233322 233332 Q ss_pred e----eccEEEEeeCCC-CCE-EE--EEEEEeecH---HHHHHhcCcccCC--HH--HHHHHhcCCCCceEEEEEEEeec Q lcl|NC_019445. 153 F----PIGSYYLANSPR-GSV-DI--CFRKFSMTV---RQLVQEFGLNNVS--ES--VKSMWESGTYEKWIEVMHSVYPN 217 (559) Q Consensus 153 ~----~l~~~~v~~d~~-G~v-d~--i~r~~~~t~---~ql~~~fg~~~l~--~~--v~~~~~~~~~~~~v~v~~~v~p~ 217 (559) + |..+++++.++. ... |. +||...|+. .++..+||.+.-. .. +......--....+.|+.+.+.+ T Consensus 147 ~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~ 226 (725) T protein:vir:10 147 EPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) T ss_pred eecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEE Confidence 3 455677776652 111 22 567778885 4566678765321 10 11110000011234444433322 Q ss_pred Ccc-------c--ccc---cccc--------------------cccEEEEEEEecCCCceeeeec---CcccCCeEEEEe Q lcl|NC_019445. 218 IDR-------D--TSK---LDSK--------------------NKPFKSVYYEVGGDNDKLLRES---GFDEFPIMAPRW 262 (559) Q Consensus 218 ~~~-------~--~~~---~~~~--------------------~~~~~sv~~~~~~~~~~il~es---g~~~~P~~~~rw 262 (559) +.. + .+. ++.. ...-..|||..-.+ .++|+.. .++.|||+++-. 
T Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g-~~~l~~~~~~~~~~fP~vP~~g 305 (725) T protein:vir:10 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITC-TAVLKDKQLIAGEHIPIVPVFG 305 (725) T ss_pred EEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecc-hhhhcCCCCCCCCceeEEEEEe Confidence 110 0 000 0100 01112456654332 3466432 235689997543 Q ss_pred e--ecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--ccceecCCceeecC-----CcCCch Q lcl|NC_019445. 263 E--VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--QRASLLPGDITYID-----QITGQD 333 (559) Q Consensus 263 ~--~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--~~~~~~pg~~~~~~-----~~~~~~ 333 (559) . ..+|..|+.|+ +....+-++.+|......+..+..+.+.++.+..+... ...-..|.+.+++. ..++.- T Consensus 306 ~r~~~~g~~~~~G~-vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~ 384 (725) T protein:vir:10 306 EWGFVEDKEVYEGV-VRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM 384 (725) T ss_pred eeeccCCcceeeee-eccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCccc Confidence 3 36889998895 99999999999999999999999988888877654321 11223455544431 111111 Q ss_pred hhhhh-hhccccH-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_019445. 334 GFRPA-YLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLI 411 (559) Q Consensus 334 ~~~p~-~~~~~~~-~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli 411 (559) ...++ ....+.+ ..+...++...+.|.+..-..-. +++..++ .++.--|..|++.....+...+.+|.. 
...=+- T Consensus 385 ~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~-~lG~~~n-~~SG~ai~~rq~qg~~~l~~~~Dnl~~-~~~~~g 461 (725) T protein:vir:10 385 PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVD-AEAVNGG-QVAYDTVNQLNMRADLETYVFQDNLAT-AMRRDG 461 (725) T ss_pred ccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHH-HhCcCch-hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 11111 1112222 23344566666666655422111 2233322 356666889999999999998888865 222233 Q ss_pred HHHHHHHHh-------------cCCC------CCCchhhCC---------cceEEEe-ecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 412 DRAFSMMVR-------------KNML------PPPPDAMEG---------MPLKVEY-ISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 412 ~r~~~il~r-------------~g~l------p~~p~~l~g---------~~v~~~~-is~La~a~r~~~~~~l~~~~~~ 462 (559) +..+++..+ .|.. .+.++...| +.+.+.+ ++|-...+|.+.+..+.+++.. T Consensus 462 ~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~ 541 (725) T protein:vir:10 462 EIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGK 541 (725) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHh Confidence 333333332 1210 001110111 1345555 3455566787777777777765 Q ss_pred HHHHhccChhhH----hcC---CHHHHHHHHHHHcCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHH-------HHHH Q lcl|NC_019445. 463 IGQLAQAKPEAL----DKL---NVDQAIDAFADMSGVSP-TVIVPQEQVDQARQQRAQQQQQQQMMAMG-------MAAA 527 (559) Q Consensus 463 ~~~la~~~P~~~----~~i---d~d~~~~~~a~~~Gvp~-~~~rs~~ev~~~rq~r~q~~q~~~~~~~~-------~~~~ 527 (559) +..++.+.+.++ +.. ..+++++.+....+... .=-.++++.+++.+ ++|++++++..+.. ...+ T Consensus 542 ~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e-~qq~~~~q~~~e~~q~~~~~~~~qa 620 (725) T protein:vir:10 542 TPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVE-AQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred ccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHH-HHHHHHhhhHHHHHHHHHHHHHHHH Confidence 554333222222 222 23455555544433211 01112222211111 11111111111110 1111 Q ss_pred HHHhhhhhhcCCChh----HHHHHHHHh-----hcCCCCCC Q lcl|NC_019445. 
528 QGAKTLSEAKTSDPS----VLSAMANAV-----SGQGGQSQ 559 (559) Q Consensus 528 ~~a~~~~~~~~~~~~----~~~~~~~~~-----~~~~~~~~ 559 (559) +..+.-+++...... ..++...++ ..+....| T Consensus 621 e~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q 661 (725) T protein:vir:10 621 ELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSK 661 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 111111110000000 000000000 00111111 No 47 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.65 E-value=7.2e-14 Score=92.63 Aligned_cols=534 Identities=13% Similarity=0.110 Sum_probs=246.1 Q ss_pred CChh------------hHHHHHHHHHHHHHHhhhHHHHHH----HHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHH Q lcl|NC_019445. 1 MAET------------TKERLNKQFAQLESERQSFEPHWR----ELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~------------~~~~l~~r~~~l~~~R~~~~~~w~----e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~ 63 (559) |++. .++...+.|..+..++. +.+.|+ +-.+|.. ....+.+....-. +....+.-+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~r~~a~~d~~fy~--G~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDID-SQPLWRDAANKACAYYD--GDQLAPEVIQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHh-hhHHHHHHHHHHHHhhc--CCCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3221 11112233444444433 234454 5555542 1111111100000 0112222333333 Q ss_pred HHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEE--EEe Q lcl|NC_019445. 64 AARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAM--AVL 141 (559) Q Consensus 64 a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~v~ 141 (559) .++...+..- .+++=+++.+.+.+... 
.+-.+.++..+......++...+...++.+.+..|-|.+ +++ T Consensus 78 ~v~~v~g~~~-----~nr~~~~v~pr~~~~~~----~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d 148 (714) T protein:vir:10 78 TVDGVLGMEA-----KTRTDLIVMSDDPNDET----EKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred HHHHHHHHHH-----hCCcceEEecCCCChhh----HHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeec Confidence 3443333222 24555566664433221 111233455556666778888899999999999898877 455 Q ss_pred ecCC-ceEEEEEeeccEEEEeeCCCC-CE---EEEEEEEeecHHHHHHhcCcccCCHHHHHH------------------ Q lcl|NC_019445. 142 EDDE-DIIRTMPFPIGSYYLANSPRG-SV---DICFRKFSMTVRQLVQEFGLNNVSESVKSM------------------ 198 (559) Q Consensus 142 ~~~~-~~~~~~~~~l~~~~v~~d~~G-~v---d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~------------------ 198 (559) .|.. ..+++++++..+++++.++.- .. .-+||...||.+++..+||+.+ +.+... T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a--~~i~~~~~~~~~~~~~~~~~~~~~ 226 (714) T protein:vir:10 149 SEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMA--QVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCch--hhhhccchhhcCcccchhhhhhcc Confidence 5544 458999999999999887532 22 2368899999999999998632 111100 Q ss_pred -----------Hhc------CCCCceEEEEEEEeecCccccc---------cccc----------------ccccEEEEE Q lcl|NC_019445. 199 -----------WES------GTYEKWIEVMHSVYPNIDRDTS---------KLDS----------------KNKPFKSVY 236 (559) Q Consensus 199 -----------~~~------~~~~~~v~v~~~v~p~~~~~~~---------~~~~----------------~~~~~~sv~ 236 (559) ++. .++..+|.|+.+-+......+. .++. 
.......|| T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:10 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred cccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEE Confidence 000 0112356666664432211100 0110 001111232 Q ss_pred EEecCCCceeeeec--Cc--ccCCeEEEEeee--cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCC Q lcl|NC_019445. 237 YEVGGDNDKLLRES--GF--DEFPIMAPRWEV--NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTS 310 (559) Q Consensus 237 ~~~~~~~~~il~es--g~--~~~P~~~~rw~~--~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~ 310 (559) +..- .+.++|.++ .| ..|||+++-... ..|..|| + +..+.+-++.+|......+.+ +..+-++..++. T Consensus 307 ~~~~-~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G--~-vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~~~ga 380 (714) T protein:vir:10 307 EAWF-VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--L-ISRAIPAQDEVNFRRIKLTWL--LQAKRVIMDEDA 380 (714) T ss_pred EEEE-ecchhhhcCCCCCCCCceeeEEecceeeeccCccce--e-hhhhhhHHHHHHHHHHHHHHH--HhCCceeecccc Confidence 2221 123455443 33 468887654333 5666775 4 778889999999877776654 345555555444 Q ss_pred Ccccc--c---eecCCceeecCCcC-----CchhhhhhhhccccH-HHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCc Q lcl|NC_019445. 311 LKNQR--A---SLLPGDITYIDQIT-----GQDGFRPAYLVNPST-ADLVADIQDTRQIINSAYFVDLFMMLQNINTRSM 379 (559) Q Consensus 311 ~~~~~--~---~~~pg~~~~~~~~~-----~~~~~~p~~~~~~~~-~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~ 379 (559) +...+ + -..||+++.+.... ....+++... +.+ ......++...+.|++.--..-..+ +.. 
+..+ T Consensus 381 v~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~llq~~~~~i~~~tGv~~~~l-G~~-~na~ 456 (714) T protein:vir:10 381 TQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQD--FQVASQQFQVMQESEKLIQDTMGVYSAFL-GQD-SGAT 456 (714) T ss_pred ccccHHHHHHhccCCCCeEEecccccccCCccccccccCC--CCCcHHHHHHHHHHHHHHHHhhCCCHHHc-CCC-cchh Confidence 42211 1 14788888763211 1122333321 111 1223344555555555431111112 222 2335 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcC-------CC----CC-Cchhh-------CC------- Q lcl|NC_019445. 380 PVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKN-------ML----PP-PPDAM-------EG------- 433 (559) Q Consensus 380 TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g-------~l----p~-~p~~l-------~g------- 433 (559) +..-|..|++.....|+..+.+|.. ...=+.+..++++.+.- +. +. .+..+ .| T Consensus 457 SGvAI~~r~~qg~~~l~~~~dnl~~-~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~ 535 (714) T protein:vir:10 457 SGVAISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDIS 535 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccce Confidence 6666999999999999999999866 44444555555554411 00 00 00000 00 Q ss_pred -cceEEEee-cHHHHHHHHHHHHHHHHHHHHHHHH-hccCh-hhHhc---CCHHHHHHHHHHHcCCCc--cccCCHHHHH Q lcl|NC_019445. 434 -MPLKVEYI-SVMAQAQKSIGLSSLASTVNFIGQL-AQAKP-EALDK---LNVDQAIDAFADMSGVSP--TVIVPQEQVD 504 (559) Q Consensus 434 -~~v~~~~i-s~La~a~r~~~~~~l~~~~~~~~~l-a~~~P-~~~~~---id~d~~~~~~a~~~Gvp~--~~~rs~~ev~ 504 (559) ..+.|.+. .|-...+|.+..+.+.++++.+... .++.+ -++.. -+.+++++.+-..+|.+. +-+..+++.. 
T Consensus 536 ~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~ 615 (714) T protein:vir:10 536 RLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEV 615 (714) T ss_pred eeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHH Confidence 12344442 3445566777777777777644221 11112 22333 456789999999999863 3333333322 Q ss_pred HHHHHHHHHHHHHHHHHHHH-----HHHHHHh--------------hhhhhcCC---ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 505 QARQQRAQQQQQQQMMAMGM-----AAAQGAK--------------TLSEAKTS---DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 505 ~~rq~r~q~~q~~~~~~~~~-----~~~~~a~--------------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 559 (559) +..++.++++|.+.++++++ ..+++++ .++.+..+ .......++..+.+..+-+| T Consensus 616 q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q 692 (714) T protein:vir:10 616 AAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQ 692 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 22222212111110000000 0011111 11111100 00000001111111112222 No 48 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.54 E-value=1.8e-13 Score=90.42 Aligned_cols=534 Identities=13% Similarity=0.095 Sum_probs=221.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHH----HHHhccccCCCCCCCCCC---cccccCCC-Cc-chHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWREL----SDYINPRGSRFLTSEVNR---NDRRNTRI-ID-STGTMAARTLASG 71 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~----~~~~~P~~~~~~~~~~~~---~~~~~~~~-~~-s~~~~a~~~Las~ 71 (559) |+|.+.+.|.+.++.++.... |.+.|+.- .+|..-.....+...... ...+..++ +. 
+.....++...+ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~-~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g- 78 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHS-PQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIIS- 78 (720) T ss_pred CchHHHHHHHHHHHHHHHHHh-hhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHh- Confidence 999987777766666666543 23344432 233320011111111000 00011122 11 233333333333 Q ss_pred HHHhhcCCCCcceeccCCccc-hhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee------c- Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPE-MMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE------D- 143 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~-~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~------~- 143 (559) .-- .+++=+++.+.+++ ..+..++ ++..+......++...+...++.+.++.|-|++-+.- + T Consensus 79 ---~~~-~nr~d~~v~P~~~~~d~~~Ae~------l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~ 148 (720) T protein:vir:35 79 ---EYR-HNRITVKFRPGDKTASEALANK------LNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDP 148 (720) T ss_pred ---HHH-hCCCceEEEcCCCcchHHHHHH------HHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCC Confidence 222 25566677765443 2222222 3444455556788888889999999999999885521 1 Q ss_pred --CCceEEEEEe--eccEEEEeeCCCC-CE-EE--EEEEEeecHHHHHHhcCcccCCHHHHHHHhcCC-----CCceEEE Q lcl|NC_019445. 144 --DEDIIRTMPF--PIGSYYLANSPRG-SV-DI--CFRKFSMTVRQLVQEFGLNNVSESVKSMWESGT-----YEKWIEV 210 (559) Q Consensus 144 --~~~~~~~~~~--~l~~~~v~~d~~G-~v-d~--i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~-----~~~~v~v 210 (559) ..+.+.+.++ |..+++++.++.- .. |. +||...|+.+++..+||+++-. +........ 
....|.+ T Consensus 149 ~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~--~~~~~~~~~~~d~~~~~~v~i 226 (720) T protein:vir:35 149 MDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPAT--LMSGIERSWDYDWYDVDVVYI 226 (720) T ss_pred CcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccc--ccccccccccccccCCCceEE Confidence 1233444443 4567788766532 11 22 6788889999999999976421 111111111 1123444 Q ss_pred EEEEeecCc-------cccc-----cccc---------------------ccccEEEEEEEecCCCceeee---ecCccc Q lcl|NC_019445. 211 MHSVYPNID-------RDTS-----KLDS---------------------KNKPFKSVYYEVGGDNDKLLR---ESGFDE 254 (559) Q Consensus 211 ~~~v~p~~~-------~~~~-----~~~~---------------------~~~~~~sv~~~~~~~~~~il~---esg~~~ 254 (559) +++-+.+.. .++. .++. ..+.+. +||..-++. .++. .++|+. T Consensus 227 ~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~-v~~~~~~g~-~~l~~~~~~p~~~ 304 (720) T protein:vir:35 227 AKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRR-VYVSVVDGE-GFLEKAQRIPGEH 304 (720) T ss_pred EEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEE-EEEEeeccc-hhcccCCCCCCCc Confidence 333222110 0000 0000 011222 344443332 3442 356788 Q ss_pred CCeEEEEee--ecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee-ecCCCcccc-ceecCCceee----- Q lcl|NC_019445. 255 FPIMAPRWE--VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV-APTSLKNQR-ASLLPGDITY----- 325 (559) Q Consensus 255 ~P~~~~rw~--~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~-~p~~~~~~~-~~~~pg~~~~----- 325 (559) |||+++-.. ..+|..+..|+ +..+.+-++.+|+.....+..+.+.-.-+.. .+++..... .-..|++.+. 
T Consensus 305 fP~vP~~g~r~~~d~~~~~~G~-vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~ 383 (720) T protein:vir:35 305 IPLIPVYGKRWFIDDIERVEGH-IAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPL 383 (720) T ss_pred cceEEEEeeeeccCCCccccee-eecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhcccccccccccc Confidence 999876422 23666755664 7889999999999888888877554433332 222211000 0012222211 Q ss_pred --cCCcCCc--------hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHh Q lcl|NC_019445. 326 --IDQITGQ--------DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLML 395 (559) Q Consensus 326 --~~~~~~~--------~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~L 395 (559) +....|. ..++| ....+.. ...++.-...|.++--... .+++...+ ++.--|..|++.....+ T Consensus 384 ~~~~~~~G~~~~~~~~~~~~~~-~~~~~~~---~~llq~~~~~i~~vsGi~~-~~lG~~sn--~SG~Ai~~rq~qg~~~~ 456 (720) T protein:vir:35 384 NEIVDKQGNIIAPPTPVGYTQP-QPLNQAM---AALLQQTGADIQEVTGSSQ-AMQPMPSN--IAKETVNHLMHRSDMSS 456 (720) T ss_pred ccccccCcccccCCCcccccCC-CCCchHH---HHHHHHHHHHHHHHhCCCh-HHcCcccc--hHHHHHHHHHHHHHHHH Confidence 1111111 11111 1111111 1223333344444421110 12222222 45667889999999999 Q ss_pred hhHHHHHHH------HHHHHHHHHHHH------HHHhcCC--CCCCc----hhhCC----------cceEEEee-cHHHH Q lcl|NC_019445. 396 GPVLERLND------ECLNPLIDRAFS------MMVRKNM--LPPPP----DAMEG----------MPLKVEYI-SVMAQ 446 (559) Q Consensus 396 G~v~~~l~~------E~l~Pli~r~~~------il~r~g~--lp~~p----~~l~g----------~~v~~~~i-s~La~ 446 (559) ...+.+|.. +.+.-||...|. |+-..|. ...+. +...| ..+.|.+. +|-.. 
T Consensus 457 ~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~ 536 (720) T protein:vir:35 457 FIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYT 536 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcc Confidence 999888765 333333333331 1111121 00000 01112 12344443 34445 Q ss_pred HHHHHHHHHHHHHHHHHHHH----hccChhhHhcCCHH---HHHHHHHHHcCCCccccCC--HHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 447 AQKSIGLSSLASTVNFIGQL----AQAKPEALDKLNVD---QAIDAFADMSGVSPTVIVP--QEQVDQARQQRAQQQQQQ 517 (559) Q Consensus 447 a~r~~~~~~l~~~~~~~~~l----a~~~P~~~~~id~d---~~~~~~a~~~Gvp~~~~rs--~~ev~~~rq~r~q~~q~~ 517 (559) .+|.+....+.+++..+..- +.+.+.++.+.|+. +++..+...+. +.....+ .++.+++.++++++||.+ T Consensus 537 s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~-~~~~~~~~~~e~qq~~a~~qq~~qq~~ 615 (720) T protein:vir:35 537 ARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL-TQGVVKPRNTEEEQMVAQMIQQAQQPN 615 (720) T ss_pred cHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc-hhcccCccChhHHHHHHHHHHHHHhHh Confidence 56776666666655433210 11222344455544 44444443332 1111111 121111111111111111 Q ss_pred HHHHHHHH-----HHHHHhh--------hhhhcCCCh-hHHH-HHHHHhhcCCCCCC Q lcl|NC_019445. 518 QMMAMGMA-----AAQGAKT--------LSEAKTSDP-SVLS-AMANAVSGQGGQSQ 559 (559) Q Consensus 518 ~~~~~~~~-----~~~~a~~--------~~~~~~~~~-~~~~-~~~~~~~~~~~~~~ 559 (559) .++++++. -++..+. 
+.......+ ...+ .+.....++....| T Consensus 616 ~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q 672 (720) T protein:vir:35 616 AELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKR 672 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000 0000000 000000000 0000 01111111111111 No 49 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.54 E-value=2.2e-12 Score=84.52 Aligned_cols=430 Identities=10% Similarity=0.008 Sum_probs=205.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |.+.....+....+..... .++....+.+|+-- +|... . ..+...+..++..+-+..+++.+++.| +| T Consensus 1 ~~~~~~~~i~~l~~~~~~~-~~r~~~l~~Yy~G~~~i~~~~----~-~~~~~~~~~k~~~n~~~~ivd~~~~~l----~~ 70 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRL-SSWHCCIEGYYEGSNRVRDLG----V-AIPPELQRVQTVVSWPGIAVDALEERL----DW 70 (441) T ss_pred CCccHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCcchhcC----c-ccchhhhhhhhhcchHHHHHHHHHhhh----cc Confidence 8887766555544444332 23333333443221 21111 1 111222345567777888888777765 34 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSY 158 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~ 158 (559) .+ | ..++. .. 
+.+...+.+|....+++.++..+||.|.+++..|...-+++..++..++ T Consensus 71 ~g---~--~~~d~-----~~-----------l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~ 129 (441) T protein:vir:80 71 LG---W--TNGDG-----YG-----------LDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNC 129 (441) T ss_pred cc---c--cCCCh-----HH-----------HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceE Confidence 33 2 22221 11 2334456899999999999999999999988887665578888999988 Q ss_pred EEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 159 YLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 159 ~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) ++-.|+ .+++...++.+... .+....+++| ++... + .|+ T Consensus 130 ~~i~d~~~~~~~~~~~~~~~~-----------------------~~~~~~~~vy---~~~~~------------~--~~~ 169 (441) T protein:vir:80 130 TGKFSADGSRLDAGLVVQQTC-----------------------DPEVVEAELL---LPDVI------------V--QVE 169 (441) T ss_pred EEEEeCCCCceeEEEEEEEEe-----------------------cCceEEEEEE---ecCeE------------E--EEE Confidence 877774 45666555543310 0111123332 11110 0 011 Q ss_pred EecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeecC----C Q lcl|NC_019445. 238 EVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGM-LALGPVKALQLLQKRKSQLIDKATNPPMVAPT----S 310 (559) Q Consensus 238 ~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~-~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~----~ 310 (559) ..+...-... .+-+|..+|++++.-+...++.||+|- .. 
+..+-+..++...-.+...++..+.|.+.+.+ + T Consensus 170 ~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~-l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~ 248 (441) T protein:vir:80 170 RRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSE-ITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADE 248 (441) T ss_pred EcCCcceeeccccccCCCceeEEEeeccccCCccCCccc-chhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccc Confidence 1111110011 123578899999888888889999984 43 45677778888888888899999999766532 1 Q ss_pred CccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 311 LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 311 ~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) .........+|+++..+...+...+........++....+.+..+...|...--.. ...++..+.-.-++.-++..... T Consensus 249 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p-~~~~g~~~~~~~Sg~Al~~~~~~ 327 (441) T protein:vir:80 249 FSQPGWVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVP-ERYFGFITSNPPSGEALAAEESR 327 (441) T ss_pred cccchhhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCC-HHHhccCCCcchHHHHHHHHHHH Confidence 11223456678877664433322222211111233333333333333332111110 01121111111133333322222 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019445. 391 KLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQA 469 (559) Q Consensus 391 ~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~ 469 (559) |=-...+.+. .+.+-+.+.+.++.+. |.....+.. ...+++++.-++.. ++...++.+..+.+. 
T Consensus 328 ----l~~k~~~~~~-~f~~~l~~~~~l~~~~~~~~~~~~~~--~~~i~~~f~~~~~~--------~~~e~ad~~~kl~~~ 392 (441) T protein:vir:80 328 ----LVKRAERRQT-SFGQGWLSVGFLAAKALDSRVDEADF--FGDVGLRWRDASTP--------TRAATADAVTKLVGA 392 (441) T ss_pred ----HHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccccc--ceeeeEEeCCCCCc--------CHHHHHHHHHHHHhc Confidence 2222333333 3444555655665543 332333222 23566777655431 122223334444443 Q ss_pred ChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhH Q lcl|NC_019445. 470 KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSV 543 (559) Q Consensus 470 ~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 543 (559) ++-. ++. ..+...+|. +++|++++.+++++++++. .+.....+ .+...+ T Consensus 393 g~~~---~s~----~~~~~~l~~------~~~e~~~~~~e~~e~~~~~--~~~~~~~~----------~~~~~~ 441 (441) T protein:vir:80 393 GILP---ADS----RTVLEMLGL------DDVQVEAVMRHRAESSDPL--AVLAGAIS----------RQTNEV 441 (441) T ss_pred Cccc---ccH----HHHHHhCCC------CHHHHHHHHHHHHHHHHHH--HHHhhhhh----------cccccC Confidence 3211 121 122344454 3567776655544433222 11111111 111111 No 50 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.46 E-value=1.1e-11 Score=80.70 Aligned_cols=460 Identities=13% Similarity=0.070 Sum_probs=198.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.+.....+...+..+...+ ++.+++.+|..-... +..+.. .+...+.-+...+-+..+|++|+..|. 
+- T Consensus 18 l~~~e~~~i~~L~~~~~~~~----~r~~~l~~YY~G~~~i~~~~~~-~p~~~~~~~~v~n~~~~iVd~~a~rl~----~~ 88 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRT----PRNLLRASFYDGKYAIRQIGNL-IPPEYLRTATVLGWSAKAVDTLARRCN----LE 88 (504) T ss_pred CCHHHHHHHHHHHHHHHHHh----HHHHHHHHHHhccccchhcccc-ccHHHHHHhhccCcHHHHHHHHHhhhc----cc Confidence 88877666666566555443 455555566432110 111111 122222223456677788888877542 22 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC--ceEEEEEeeccE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE--DIIRTMPFPIGS 157 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~--~~~~~~~~~l~~ 157 (559) + |+ .++.+..+ ..+++...+++|.....++.++..+||.+.++|..+.. ...+++.++..+ T Consensus 89 G---f~--~~d~~~~~------------~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~ 151 (504) T protein:vir:99 89 S---FV--WPDGDYGS------------IGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQ 151 (504) T ss_pred e---ee--CCCCChhh------------HHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccce Confidence 2 22 22322111 11333456688999999999999999999998876542 235667788888 Q ss_pred EEEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 158 YYLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 158 ~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) ..+..|+ .+++...++.+. .+ .+.....+++| .+.. .+| T Consensus 152 ~~~iyD~~~~~~~~a~~~~~-----------~d-----------~~g~~~~~~~y---~~~~---------------~~~ 191 (504) T protein:vir:99 152 ATGEWNSRRNAMDSLLSITS-----------RD-----------AEGHPTGIALY---EDGV---------------TVT 191 (504) T ss_pred eEEEEeCCCCceeEEEEEEE-----------ec-----------CCCeEEEEEEE---cCCc---------------EEE Confidence 7766664 444443333211 00 01111122222 1111 112 Q ss_pred EEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee----cCC Q lcl|NC_019445. 
237 YEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA----PTS 310 (559) Q Consensus 237 ~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~----p~~ 310 (559) +...++..... .+-++. +|++++..+...++.||+|-=....++-+..++...-.++..+++.+.|...+ +++ T Consensus 192 ~~~~~~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~ 270 (504) T protein:vir:99 192 ADMDDDGDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKN 270 (504) T ss_pred EEEcCCceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccc Confidence 22111111111 112333 89999988888899999994123567889999999999999999999997554 221 Q ss_pred -----Ccc-ccceecCCceeecCCcCCchh---hhh-h-hhccccHHHHHHHHHHHHHHHHHHhhcchhhhccC-CCCCC Q lcl|NC_019445. 311 -----LKN-QRASLLPGDITYIDQITGQDG---FRP-A-YLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQN-INTRS 378 (559) Q Consensus 311 -----~~~-~~~~~~pg~~~~~~~~~~~~~---~~p-~-~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~-~~~~~ 378 (559) .+. ..+....+.++..+....... -.| + .....++....+.+..+...|...--.. ...++. .+... T Consensus 271 ~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P-~~~lG~~~~~n~ 349 (504) T protein:vir:99 271 FRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIP-VESLGFSNRANP 349 (504) T ss_pred cccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCC-HHHhcccccccc Confidence 111 123344466665543221110 111 1 1111234433333333333332111110 011111 11111 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecH--HHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISV--MAQAQKSIGLSS 455 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~--La~a~r~~~~~~ 455 (559) -+|.-|+....- |--...+.+. .+..-+.+++.++... +..+..+.+.. .+++.+.-+ ...++. 
T Consensus 350 sSa~Ai~~~~~~----L~~ka~~k~~-~f~~~l~~~~rla~~~~~~~~~~~~~~~--~~~v~w~d~~~~s~a~~------ 416 (504) T protein:vir:99 350 TSADAYIASRED----LIAEAEGATD-DWSPAFRRSMIRALAIKNGLDRIPPEWK--TIDSKFRSPLYLSKAAQ------ 416 (504) T ss_pred cHHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccccccc--cceeEecCCCccCHHHH------ Confidence 244333322221 1122233322 2333334444443321 22333444433 344444322 222222 Q ss_pred HHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhhh Q lcl|NC_019445. 456 LASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAA--QGAKTL 533 (559) Q Consensus 456 l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~--~~a~~~ 533 (559) +..++.|.+.++..+ ... +.+.+.+|+ +++|++++.+++++++......+.+.+.. +.++.. T Consensus 417 ----aDa~~Kl~~ag~~l~--~~~----~~l~~~lg~------~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~ 480 (504) T protein:vir:99 417 ----ADAGAKMLGAGPEWL--KET----EVGLELLGL------TPQQAKRALAERRRASSVSIIEALNRRQQEAATAGED 480 (504) T ss_pred ----HHHHHHHHhhccccc--cch----HHHHhhcCC------CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCC Confidence 222233333332110 111 223345576 45666665555433322221111111111 111111 Q ss_pred hhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 534 SEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .+.+.. +....-.+++.+++.+.. T Consensus 481 ~~~~~~--e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 481 QDQGAG--EPPANEPPAALGRPTLVG 504 (504) T ss_pred CCcCCC--CCCCCCCCccCCCcccCC Confidence 111110 000111112222222222 No 51 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.44 E-value=1.3e-11 Score=80.20 Aligned_cols=413 Identities=12% Similarity=0.052 Sum_probs=196.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.....+.|.+.|..- .++.+.+.+|..-... +..+....+.-+...+...+-+..+|+.||..+. .. T Consensus 1 m~~~~i~~L~~~~~~~-------~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~----~~ 69 (422) T protein:vir:97 1 MNYMGMGYLRRKLALF-------KTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRII----FR 69 (422) T ss_pred CChHHHHHHHHHHHHH-------HHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccc----cc Confidence 8888877777665542 2244445555432110 0111111111111122344566666666666331 11 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc-eEEEEEeeccEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED-IIRTMPFPIGSY 158 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~-~~~~~~~~l~~~ 158 (559) + | ..+|.+ +++.+.++++.....++.++..+||.+.++|..+.+. ..++..++..+. T Consensus 70 G---f--~~~d~~-----------------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~ 127 (422) T protein:vir:97 70 E---F--TNDDFN-----------------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKA 127 (422) T ss_pred e---e--eCCchh-----------------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhE Confidence 1 2 222211 2345667899999999999999999999999765432 346777888888 Q ss_pred EEeeCCC-CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 159 YLANSPR-GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 159 ~v~~d~~-G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) .+..|+. +++...++.+. .+... ..+. ..+++... 
.+|+ T Consensus 128 ~~i~D~~~~~~~~a~~~~~------------------------~~~~~-~~~~-~~~~~~~~--------------~~~~ 167 (422) T protein:vir:97 128 TGILDPTTFLLTEGYAILE------------------------SDSNG-NPTL-EAYFTDKD--------------IWYY 167 (422) T ss_pred EEEEeCCCCcceeeEEEEE------------------------ecCCC-cEEE-EEEEcCce--------------EEEE Confidence 8777753 33333332211 11111 1111 12222211 0111 Q ss_pred EecCCCceeeeecCcccCCeEEEEeeecCCCcccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeec---CCCcc Q lcl|NC_019445. 238 EVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPG-MLALGPVKALQLLQKRKSQLIDKATNPPMVAP---TSLKN 313 (559) Q Consensus 238 ~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~-~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p---~~~~~ 313 (559) ..++... -.+-++..+|++++..+...++.||+|- . +..++-+..++...-.++..+++.+.|...+- .+..+ T Consensus 168 ~~~~~~~--~~~~~~g~vPvv~~~n~~~~~~~~G~s~-I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~ 244 (422) T protein:vir:97 168 PKKGKPY--NIKNPTGHPLLVPIIHRPDAVRPFGRSR-ITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKP 244 (422) T ss_pred cCCCccc--cccCCCCCcceEEecccCCCccccCccc-cchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCccccc Confidence 1111111 1234567899999999999999999994 4 56888999999999999999999999976542 12222 Q ss_pred -ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHH Q lcl|NC_019445. 314 -QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL 392 (559) Q Consensus 314 -~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~ 392 (559) ..+....|.++..+...+...++.-.....++....+.+..+...|...--.. ...++....-+.+|.-|..... 
T Consensus 245 ~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP-~~~lg~~~~NpsSa~Ai~a~~~--- 320 (422) T protein:vir:97 245 MEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGGSGLT-LDDLGFPSDNPSSVESIKAAHE--- 320 (422) T ss_pred CchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcccCCC-HHHhccccCchhHHHHHHHHHH--- Confidence 23556677777775544333333222222345544444444444433221111 1112111110123433332211 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccCh Q lcl|NC_019445. 393 LMLGPVLERLNDECLNPLIDRAFSMMVR-KNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKP 471 (559) Q Consensus 393 ~~LG~v~~~l~~E~l~Pli~r~~~il~r-~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P 471 (559) .|--...+.+. ...+-+.+++.++.. .|..+..+.++..-.++.....+......++.+.+ +..|+++.| T Consensus 321 -~L~~ka~~k~~-~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa-------~~Kl~~a~~ 391 (422) T protein:vir:97 321 -NLRAAGRKAQR-SFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDG-------AIKLNQAIP 391 (422) T ss_pred -HHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHH-------HHHHHhhcc Confidence 12222233322 223333344444332 24445556555443444432222222222222222 223333334 Q ss_pred hhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHH Q lcl|NC_019445. 472 EALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQ 513 (559) Q Consensus 472 ~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~ 513 (559) .+ .+.+ .+.+.+|++. ++++...+.+++++. 
T Consensus 392 ~~---~~~~----~~~~~lg~~~----~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 392 GF---MDAD----VIRDLTGVKG----ADKPIPAITEVTTDG 422 (422) T ss_pred cc---ccHH----HHHHHcCCCc----hhHHHHHHHhhhccC Confidence 22 2322 3344457732 123333332222222 No 52 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.36 E-value=5.8e-11 Score=76.72 Aligned_cols=457 Identities=13% Similarity=0.059 Sum_probs=189.0 Q ss_pred CChhh----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHH-H Q lcl|NC_019445. 1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRG-SRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMM-S 74 (559) Q Consensus 1 M~~~~----~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~-~ 74 (559) |++.. .+-+.+.+..+... .++++.+.+|..-.. -...+ ...+...+..++..+-+..+++++++.|. . T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~----~~r~~~~~~Yy~g~~~i~~~~-~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~ 75 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENK----QNELKSSKAYYDAERRPDAIG-LAVPLDMRKYLAHVGYPRTYVDAIAERQELE 75 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHH----HHHHHHHHHHHhcccchhhcC-cccchhhhhhhhhcchHHHHHHHHHHhhhcc Confidence 55531 22223333333333 345555556642110 00001 11122223345667778888888887663 2 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC--------Cc Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD--------ED 146 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~--------~~ 146 (559) |++-+.. .. .+.......++... +...+.+++|.....++.++..+||.|.+++..+. .. 
T Consensus 76 Gf~~~~~----~~-~~~~~~~d~~~~~~-------l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~ 143 (488) T protein:vir:23 76 GFRIPSA----NG-EEPESGGENDPASE-------LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPE 143 (488) T ss_pred ceeccCC----cc-cccccccchhHHHH-------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCC Confidence 2221111 00 00011111112222 33456778999999999999999999998876532 22 Q ss_pred eEEEEEeeccEEEEeeC-CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccc Q lcl|NC_019445. 147 IIRTMPFPIGSYYLANS-PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d-~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~ 225 (559) ..++..++..+.++-.| ..+++...+|.+. . . + ...+..++...+.. T Consensus 144 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~----------~-------------~-~-~~~~~~~~~y~~~~------- 191 (488) T protein:vir:23 144 VPLIRVEPPTALYAEVDPRTRKVLYAIRAIY----------G-------------A-D-GNEIVSATLYLPDT------- 191 (488) T ss_pred cceEEEeccceeEEEEecCCCceEEEEEEEE----------e-------------c-C-CCcEEEEEEEecCc------- Confidence 24566778887666666 4566665555432 0 0 0 11222222222211 Q ss_pred ccccccEEEEEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 226 DSKNKPFKSVYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 226 ~~~~~~~~sv~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) .+++...+..-.+. .+-+|..+|+++++.+...+..+|+|- ..+ .++-+..++...-.++..++..+. 
T Consensus 192 --------~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~-i~~~v~~l~Da~~~~~s~~~~~~~~~a~ 262 (488) T protein:vir:23 192 --------TMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSE-ISPELRSVTDAAAQILMNMQGTANLMAI 262 (488) T ss_pred --------EEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccc-hhhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 11121111111111 124678899999999988899999985 443 456677888888888888888888 Q ss_pred Cceeec----CCC-----cc-ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-chh-hh Q lcl|NC_019445. 303 PPMVAP----TSL-----KN-QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DLF-MM 370 (559) Q Consensus 303 p~~~~p----~~~-----~~-~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl~-~~ 370 (559) |.+.+- ++. .. ..+...+|.++.... ++...+.-. ....+ ...+..++.-|...+.. ++. .. T Consensus 263 p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~-g~~~~~~q~--~~~~~---~~~~~~l~~~i~~~~~~~~~p~~~ 336 (488) T protein:vir:23 263 PQRLIFGAKPEELGINAETGQRMFDAYMARILAFEG-GEGAHAEQF--SAAEL---RNFVDALDALDRKAASYSGLPPQY 336 (488) T ss_pred HHHHHhCCCcccccccccccchhhhhhhhhhccCCC-CCCceeEec--CCCCh---HHHHHHHHHHHHHHhcccCCCHHH Confidence 876542 111 11 113344555554422 111112111 11122 23344444444433311 000 11 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKS 450 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~ 450 (559) ++....-.-++.-++....-+. . ..++.+. .+.+-+.+++.++...--....+.++ ..+++++..+... ... 
T Consensus 337 ~g~~~~n~~Sg~Al~~~~~~l~-~---k~~~~~~-~f~~~l~~~~~l~~~~~~~~~~~~~~--~~i~v~f~~~~~~-s~~ 408 (488) T protein:vir:23 337 LSSSSDNPASAEAIKAAESRLV-K---KVERKNK-IFGGAWEQAMRLAYKMVKGGDIPTEY--YRMETVWRDPSTP-TYA 408 (488) T ss_pred hccccCcchHHHHHHHHHHHHH-H---HHHHHHH-HHHHHHHHHHHHHHHHhcCCCcchhh--ccceEEecCCCCC-CHH Confidence 1111000113333322222211 1 1233333 34445566666655421112223332 3466676544321 111 Q ss_pred HHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_019445. 451 IGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQ-QQMMAMGMAAAQG 529 (559) Q Consensus 451 ~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~-~~~~~~~~~~~~~ 529 (559) +.+ ..+..|++.+.. .+..+. +...+|.. +++++++++..+++.++ ..++.++-..++. T Consensus 409 ~~a-------da~~kl~~~g~~---~~s~et----~~~~l~~~------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (488) T protein:vir:23 409 AKA-------DAAAKLFANGAG---LIPRER----GWVDMGYT------IVEREQMRQWLEQDQKQGLGLIGSLYGASTP 468 (488) T ss_pred HHH-------HHHHHHHhcccc---cCCHHH----HHHhCCCC------chHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 122 222233332221 122222 33334531 23334443322221111 1111111100000 Q ss_pred HhhhhhhcCCChhHHHHHHHHhhcCCCCC Q lcl|NC_019445. 530 AKTLSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) Q Consensus 530 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (559) ....+.++++++..=+ +.-| T Consensus 469 ~~~~~~~~~~~~~~~e---------~~~a 488 (488) T protein:vir:23 469 EGKPGEAPVGEPPAPE---------PDAA 488 (488) T ss_pred cccCCCCCCCCCCCCC---------CCCC Confidence 0001111111100000 0000 No 53 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.36 E-value=6.5e-11 Score=76.45 Aligned_cols=402 Identities=12% Similarity=0.040 Sum_probs=195.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |...+...|.++|.. |.+...+..++|+---|-. ..+....+.-+..-+...+-+..+|++||..|. ..+ T Consensus 1 ~~~~~i~~L~~~~~~----~~~r~~~~~~yY~g~~~~~--~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSV----HKRRAEMRYDQYAMKYVDR--FKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE 70 (409) T ss_pred CCHHHHHHHHHHHHH----HhHHHHHHHHHhcccCchh--hcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc----cCc Confidence 888887777766544 2222333333333221110 111111111111223455677777877777542 222 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYL 160 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v 160 (559) |+ ..|. . +++...+++|.....++.++..+||.+.++|.++...-.++..++..+.++ T Consensus 71 ---f~--~~d~------~-----------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~ 128 (409) T protein:vir:94 71 ---FE--NDDF------T-----------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATG 128 (409) T ss_pred ---cc--CCch------H-----------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEE Confidence 21 1221 1 334556788999999999999999999999987655445677788888888 Q ss_pred eeCCC-CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe Q lcl|NC_019445. 161 ANSPR-GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV 239 (559) Q Consensus 161 ~~d~~-G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~ 239 (559) ..|+. +++...++... .+ +.+ ..... .+|-.. ..+++.. 
T Consensus 129 i~D~~~~~~~~a~~~~~-----------~d-----------~~~---~~~~~-~~~~~~--------------~~~~~~~ 168 (409) T protein:vir:94 129 IIDPITGLLTEGYAVLE-----------RD-----------ENN---NVVLE-AHFLPD--------------RTDYYYR 168 (409) T ss_pred EEecCCCceeeeEEEEE-----------ec-----------CCC---ceEEE-EEEecC--------------cEEEEEe Confidence 88863 44444443211 00 011 11111 111100 0111111 Q ss_pred cCCCceeeeecCcccCCeEEEEeeecCCCcccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCCCcc-c Q lcl|NC_019445. 240 GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPG-MLALGPVKALQLLQKRKSQLIDKATNPPMVA---PTSLKN-Q 314 (559) Q Consensus 240 ~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~-~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~---p~~~~~-~ 314 (559) .++. ..-...++..+|++++..+...++.||+|- . +..++-+..++...-.++..+++.+.|...+ ..+..+ . T Consensus 169 ~~~~-~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~-I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~ 246 (409) T protein:vir:94 169 DSRN-NISIANPTGHPLLVPIIHRPDAVRPFGRSR-ITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPME 246 (409) T ss_pred cCce-eEeeeCCCCCcceEEeccccccccccCccc-cchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccc Confidence 1111 122345778999999999999999999994 4 5677889999999999999999999996544 233322 2 Q ss_pred cceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHH Q lcl|NC_019445. 315 RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) Q Consensus 315 ~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~ 394 (559) .+...+|.++..+...+....+.......+++...+.+..+...+...--.. ...++....-+-+|.-|.+. 
++.+ T Consensus 247 ~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP-~~~lg~~~~NpsSa~Al~a~-~~~L-- 322 (409) T protein:vir:94 247 TWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLT-LDDLGFVSDNPSSVEAIKAS-HENL-- 322 (409) T ss_pred hhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCC-HHHhccccCchhHHHHHHHH-HHHH-- Confidence 3566678888776543333333222223345554444444443332221111 11121111001233333321 1111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh Q lcl|NC_019445. 395 LGPVLERLNDECLNPLIDRAFSMMVR-KNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA 473 (559) Q Consensus 395 LG~v~~~l~~E~l~Pli~r~~~il~r-~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~ 473 (559) --..++.+. ...+-+.+++.++.. .+-.+..|.++....++-..+.+-.....++. +..+.-|++++|-. T Consensus 323 -~~~a~~k~~-~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~-------aDa~~Kl~~ag~~~ 393 (409) T protein:vir:94 323 -RLAGRKAQR-SLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLI-------GDGAIKLNQAIPEF 393 (409) T ss_pred -HHHHHHHHH-HHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHH-------HHHHHHHHHhcccc Confidence 112222222 222222333333222 12234455554433444442222222222222 23333344444422 Q ss_pred HhcCCHHHHHHHHHHHcCCCccc Q lcl|NC_019445. 474 LDKLNVDQAIDAFADMSGVSPTV 496 (559) Q Consensus 474 ~~~id~d~~~~~~a~~~Gvp~~~ 496 (559) + + -+.+.+.+|.+..= T Consensus 394 ~---~----~~~~~~~lG~~~~d 409 (409) T protein:vir:94 394 I---N----KDTIRDLTGIEGGE 409 (409) T ss_pred c---c----hhHHHHHcCCCCCC Confidence 2 1 13455666765321 No 54 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.35 E-value=7.7e-11 Score=76.04 Aligned_cols=447 Identities=12% Similarity=0.061 Sum_probs=187.0 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCC---CCCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 
1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTS---EVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~---~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |.+.. .+-+......+.. ..++.+.+.+|.. +...-. ...+.+.++.+...+-+..+|++++..| T Consensus 8 ~~e~~~~~~~~~~l~~~~~~----~~~r~~~l~~YY~---G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l--- 77 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAFED----ASKDLASNTSYYD---AERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ--- 77 (486) T ss_pred CCCcccHHHHHHHHHHHHHH----HHHHHHHHHHHhc---ccCcchhcccccchhHhhhhhccchHHHHHHHHHhhh--- Confidence 54442 1112222233322 2344445555532 221111 0112222233445667777777777654 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC--------Cce Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD--------EDI 147 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~--------~~~ 147 (559) +|.+ |++ ++.+... ..+.+.+.+.+|.....++.++..+||.+.++|..+. ... T Consensus 78 -~~~g---~~~--~~~~~~~------------~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~ 139 (486) T protein:vir:42 78 -AVEG---FRL--GDADEAD------------EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNV 139 (486) T ss_pred -cccc---eec--CCCchhH------------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCe Confidence 3433 222 2221111 1123345568899999999999999999998886542 223 Q ss_pred EEEEEeeccEEEEeeC-CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANS-PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLD 226 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d-~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~ 226 (559) .++..++..+.++-.| ..+++...+|.+.- .. ...++.+....++. 
T Consensus 140 ~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~------------------------~~-~~~~~~~~~y~~~~-------- 186 (486) T protein:vir:42 140 PIIRVEPPTRMHAEIDPRINRVSKAIRVAYD------------------------KE-GNEIQAATLYTPME-------- 186 (486) T ss_pred eEEEEecccceEEEEeCCCCCeEEEEEEEEe------------------------cC-CCeEEEEEEEcCCc-------- Confidence 4677788887777666 45666665554320 00 11222222111110 Q ss_pred cccccEEEEEE-EecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 227 SKNKPFKSVYY-EVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 227 ~~~~~~~sv~~-~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) .+|+ ..+++ -... .+-+|..+|++.++.+...+..||+|- ... ..+-+..++...-.+...++..+. T Consensus 187 -------~~~~~~~~~~-~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~-i~~~v~~liDa~~~~~s~~~~~~e~~a~ 257 (486) T protein:vir:42 187 -------TIGWFRADGE-WAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSE-ITPELRSMTDAAARILMLMQATAELMGV 257 (486) T ss_pred -------EEEEEecCCc-EEeecceecCCCCceEEEeccccccCCCCCccc-chhhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0111 11111 0111 124678899999999888899999995 543 446677888887788888888888 Q ss_pred CceeecC----CC-----cc-ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-chh-hh Q lcl|NC_019445. 303 PPMVAPT----SL-----KN-QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DLF-MM 370 (559) Q Consensus 303 p~~~~p~----~~-----~~-~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl~-~~ 370 (559) |.+.+-+ +. +. ..+...+|.++..+..+ ..+. ..+-..+...++.++.-|...... ++. .. 
T Consensus 258 p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~ 330 (486) T protein:vir:42 258 PQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFEDAE--GKIQ-----QFSAAELANFTNALDQIAKQVAAYTGLPPQY 330 (486) T ss_pred hHHHhhcCCccccccccccccchhhhhhchhcccCCCC--ceEE-----eecccCHHHHHHHHHHHHHHHhcccCCCHHH Confidence 8765432 11 11 11223345544332211 1111 111112223344555555433321 110 01 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKS 450 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~ 450 (559) +.....-..++.-++....- +... .++.+ ..+.+-+.+++.++.+..-....+.++ ..+++++.-++.. ... T Consensus 331 fg~~~~n~~Sg~Al~~~~~~-l~~k---a~~~~-~~f~~~l~~~~~l~~~~~~~~~~~~d~--~~i~v~w~~~~~~-s~~ 402 (486) T protein:vir:42 331 LSTAADNPASAEAIRAAESR-LIKK---VERKN-LMFGGAWEEAMRIAYRIMKGGDVPPDM--LRMETVWRDPSTP-TYA 402 (486) T ss_pred hccccCchhHHHHHHHHHHH-HHHH---HHHHH-HHHHHHHHHHHHHHHHHhcCCCccccc--eeeeEEecCCCCC-CHH Confidence 11110001133333322221 1111 12332 234445555566554432112222232 2466666544321 011 Q ss_pred HHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 451 IGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA 530 (559) Q Consensus 451 ~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a 530 (559) +.++++ ..|++..+. .+.-+. + ...+|+. +++++++++.++++.+..+.. ........ T Consensus 403 ~~ad~~-------~kl~~~~~g---~~s~et-~---~~~lg~~------~d~~~e~~~~~~e~~~~~~~~--~~~~~~~~ 460 (486) T protein:vir:42 403 AKADAA-------TKLYGNGQG---VIPRER-A---RIDMGYS------VKEREEMRRWDEEEAAMGLGL--LGTMVDAD 460 (486) T ss_pred HHHHHH-------HHHHhcccC---CCCHHH-H---HhcCCCC------hhHHHHHHHHHHHHHHHHHHH--HHHhhcCC Confidence 122222 222222111 122111 1 2345542 334444443322222211111 11111111 Q ss_pred --hhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
531 --KTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 531 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ....++++ .++.-+..+... ||+. T Consensus 461 ~~~~~~~~~~-~~~~~~~~~~~~----~~~~ 486 (486) T protein:vir:42 461 PTVPGSPSPT-APPKPQPAIESS----GGDA 486 (486) T ss_pred CCCCCCCCCC-CCCCCCcccCCC----CCCC Confidence 11111111 112112121111 1111 No 55 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.33 E-value=9.8e-11 Score=75.46 Aligned_cols=402 Identities=12% Similarity=0.045 Sum_probs=194.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |...+...|.++|.. +.+...+..++|+--.|-. ..+....+.-+..-+...+-+..+|+++|..|. ..+ T Consensus 1 ~~~~~i~~L~~~~~~----~~~r~~~~~~yY~g~~~~~--~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSV----HKRRAEMRYEQYAMKHVDR--FKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE 70 (409) T ss_pred CCHHHHHHHHHHHHH----HhHHHHHHHHHHhccCchh--hcchhhhHHHHHHHhhhcChhHHHHHHhHhhcc----ccc Confidence 888877777766644 2233333333443221110 011111111111123455677777887776543 122 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYL 160 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v 160 (559) |+ ..|. . 
+++...+++|.....++.++..+||.+.++|.++.....++..++..+.++ T Consensus 71 ---f~--~~d~------~-----------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~ 128 (409) T protein:vir:16 71 ---FE--NDDF------T-----------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATG 128 (409) T ss_pred ---cc--Ccch------H-----------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEE Confidence 21 1221 1 234556799999999999999999999999987655445677788888877 Q ss_pred eeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe Q lcl|NC_019445. 161 ANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV 239 (559) Q Consensus 161 ~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~ 239 (559) ..|+ .+++...++... .+ ..+......+ ..|. ..+++.. T Consensus 129 i~D~~~~~~~~a~~~~~-----------~d-----------~~~~~~~~~~---~~~~---------------~~~~~~~ 168 (409) T protein:vir:16 129 IIDPITGLLTEGYAVLE-----------RD-----------ENNNVVLEAH---FLPD---------------RTDYYYR 168 (409) T ss_pred EeecccccceeeeEEEE-----------ec-----------CCCceEEEEE---EecC---------------cEEEEEe Confidence 7775 344443333211 00 1111111111 1111 1111111 Q ss_pred cCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCCCcc-cc Q lcl|NC_019445. 240 GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA---PTSLKN-QR 315 (559) Q Consensus 240 ~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~---p~~~~~-~~ 315 (559) .+. ...-.+-++..||++++..+...++.||+|-=.+..++-+..++...-.++..+++.+.|...+ ..+..+ .. 
T Consensus 169 ~~~-~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~ 247 (409) T protein:vir:16 169 DSR-NNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMET 247 (409) T ss_pred cCc-cccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccch Confidence 111 1112235678899999999999999999984124577889999999999999999999997554 222322 23 Q ss_pred ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC-cCHHHHHHHHHHHHHH Q lcl|NC_019445. 316 ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS-MPVEAVIEMKEEKLLM 394 (559) Q Consensus 316 ~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~-~TA~Ei~~r~~e~~~~ 394 (559) +...+|.++..+...+....+--.....+++...+.+..+...+...--.. ...++.. ... -+|.-|.+. + .. T Consensus 248 ~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP-~~~lg~~-~~NpsSa~Ai~a~-~---~~ 321 (409) T protein:vir:16 248 WKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLT-LDDLGFV-SDNPSSVEAIKAS-H---EN 321 (409) T ss_pred hhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCC-HHHcccc-cCchhHHHHHHHH-H---HH Confidence 566678888775443333322212222345554444444443333211111 1112111 111 233333221 1 11 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh Q lcl|NC_019445. 395 LGPVLERLNDECLNPLIDRAFSMMVR-KNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA 473 (559) Q Consensus 395 LG~v~~~l~~E~l~Pli~r~~~il~r-~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~ 473 (559) |--...+-+. ...+-+.+++.++.. .|-.+..|++..+-.++-..+.+-.....++.++ .+.-|++++|-. 
T Consensus 322 L~~ka~~k~~-~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aD-------a~~Kl~~a~~~~ 393 (409) T protein:vir:16 322 LRLAGRKAQR-SLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGD-------GAIKLNQAIPEF 393 (409) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHHH-------HHHHHHhhcccc Confidence 1112222222 222223333333332 2334555666544444444222211112222222 233333333322 Q ss_pred HhcCCHHHHHHHHHHHcCCCccc Q lcl|NC_019445. 474 LDKLNVDQAIDAFADMSGVSPTV 496 (559) Q Consensus 474 ~~~id~d~~~~~~a~~~Gvp~~~ 496 (559) + + -+.+.+.+|++..= T Consensus 394 ~---~----~~v~~~~~g~~~~d 409 (409) T protein:vir:16 394 I---N----KDTIRDLTGIKGAE 409 (409) T ss_pred c---c----hhHHHHhccCCCCC Confidence 1 1 12334555664321 No 56 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.32 E-value=1.1e-10 Score=75.24 Aligned_cols=402 Identities=10% Similarity=0.015 Sum_probs=186.4 Q ss_pred hhhHHHHHHHHHHHhccccC-CCCCCCCCCcccc-cCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHH Q lcl|NC_019445. 20 RQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRR-NTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYG 97 (559) Q Consensus 20 R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~-~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~ 97 (559) =+-+.++-+.+.+|..-... +..+.. .+...+ .-+...+-+..+|+.||..|. ..+ | ...|. T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~~Vds~a~rl~----~~G---f--~~~d~------ 64 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGIT-IPAHIRAKYQAVLGWAAKGVDSLADRLI----FRA---F--ANDDF------ 64 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchh-ccHHHHhHHHhhcchhHHHHHHhHhhhc----ccc---c--cCCCc------ Confidence 12233344444455332110 001111 111111 123455677777777776544 112 2 22221 Q ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCCC-CCEEEEEEEEe Q lcl|NC_019445. 
98 PVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSPR-GSVDICFRKFS 176 (559) Q Consensus 98 ~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~-G~vd~i~r~~~ 176 (559) . +++...+++|.....++.++..+||.+.++|.++.....++..++..+.++..|+. +++..-++... T Consensus 65 ~-----------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~~~ 133 (410) T protein:vir:95 65 N-----------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYAVLA 133 (410) T ss_pred h-----------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEEEEEE Confidence 1 33445679999999999999999999999998765554677788888888888763 34433332210 Q ss_pred ecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCC Q lcl|NC_019445. 177 MTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFP 256 (559) Q Consensus 177 ~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P 256 (559) +.. +...+..+...|. ..+++...+... -.+-++..+| T Consensus 134 ------------------------~~~-~~~~~~~~~~~~~---------------~~~~~~~~~~~~--~~~~~~g~vP 171 (410) T protein:vir:95 134 ------------------------RDD-YNRPTLEAYFEPN---------------ATHFIPKDGEPY--SVTNETGIPL 171 (410) T ss_pred ------------------------ecC-CCeEEEEEEEeCC---------------cEEEEeeCCccc--cccCCCCCcc Confidence 001 1112222221111 112222222111 1234677899 Q ss_pred eEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCCCc-cccceecCCceeecCCcCCc Q lcl|NC_019445. 257 IMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA---PTSLK-NQRASLLPGDITYIDQITGQ 332 (559) Q Consensus 257 ~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~---p~~~~-~~~~~~~pg~~~~~~~~~~~ 332 (559) ++++..+...++.||+|--.+..++-+..++...-.++..+++.+.|...+ ..+.. ...+...+|.++..+..++. 
T Consensus 172 vV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~ 251 (410) T protein:vir:95 172 LVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKG 251 (410) T ss_pred eEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCC Confidence 999999999999999994125677889999999999999999999997554 12222 23456677888888654443 Q ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_019445. 333 DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLID 412 (559) Q Consensus 333 ~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~ 412 (559) ...+-......+++...+.+..+...|...--.. ...++....-.-+|.-|.... +. |--...+.+. ...+-+. T Consensus 252 ~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP-~~~lg~~~~NpsSa~Al~a~~-~~---L~~ka~~k~~-~fg~~l~ 325 (410) T protein:vir:95 252 VKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLT-LDDLGFVSDNPSSVEAIKASH-EN---LRLAGRKAQR-SLGAGLL 325 (410) T ss_pred CcceEEecCCCChHHHHHHHHHHHHHHhhhcCCC-HHHhccccCchhHHHHHHHHH-HH---HHHHHHHHHH-HHHHHHH Confidence 3222222223355555444444444443221111 111211110011232222211 11 1112222222 2222223 Q ss_pred HHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC Q lcl|NC_019445. 413 RAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG 491 (559) Q Consensus 413 r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G 491 (559) +++.++... 
+-.+..|.+.....++-..+-..+.-..++.+ ..+.-|+++.|-+ ++ -+.+.+.+| T Consensus 326 ~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~a-------Da~~Kl~~a~~g~---~~----~~~~~~~lg 391 (410) T protein:vir:95 326 NVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIG-------DGVVKLNQALPGY---IN----AETIRDLTG 391 (410) T ss_pred HHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHH-------HHHHHHHHhccCC---cc----HHHHHHhcC Confidence 333333221 23344444433333333211111111112222 2222333333322 12 233455667 Q ss_pred CCccccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 492 VSPTVIVPQEQVDQARQQRAQQQQQQ 517 (559) Q Consensus 492 vp~~~~rs~~ev~~~rq~r~q~~q~~ 517 (559) .. ++++..++.+.+ +++.+ T Consensus 392 ~~------~~~~~~~~~~e~-~~~g~ 410 (410) T protein:vir:95 392 IA------GDMSAKPVVSEG-GSNGE 410 (410) T ss_pred CC------hHHHHHHHHHHH-HhCCC Confidence 73 333322221111 11111 No 57 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.26 E-value=2.7e-10 Score=73.07 Aligned_cols=463 Identities=11% Similarity=0.097 Sum_probs=209.1 Q ss_pred CChhhHHHHHHHHHHHHH-----------------HhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLES-----------------ERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~-----------------~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~ 63 (559) |=+.....++.-+.++-- ........|+.+|.=--|-........ ........++--+.+.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~-~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEH-NGNPVNRRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhcccccc-CCCccccceeecchHHH Confidence 777666555555544221 122445667776542111111111111 11111223344567777 Q ss_pred HHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeec Q lcl|NC_019445. 
64 AARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLED 143 (559) Q Consensus 64 a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~ 143 (559) .++.+|+-|++- |+ .++.+|. + ..+.+.+.+...+|...+.++..+...+|.+++.+..| T Consensus 80 iv~~~a~~l~~e--p~-----~i~~~d~------~-------~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D 139 (499) T protein:vir:80 80 TAKYMSKLLFNE--KV-----KINIDDE------T-------AEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHD 139 (499) T ss_pred HHHHHHHhhhCC--cc-----eEeeCCH------H-------HHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEEC Confidence 787777643322 22 2333332 2 22234446667889999999999999999999988887 Q ss_pred CCceEEEEEeeccEEEE-eeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccc Q lcl|NC_019445. 144 DEDIIRTMPFPIGSYYL-ANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDT 222 (559) Q Consensus 144 ~~~~~~~~~~~l~~~~v-~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~ 222 (559) ....+++..++...++- ..| .|++..+......+.+. ++. -.| ....-.+..+..+.|-+.++...+... T Consensus 140 ~~~~~~i~~v~a~~~~Pi~~d-~~~~~~~~f~~~~~~~~---~~y-~~l----E~h~~~~~~~~~y~I~n~~~~~~~~~~ 210 (499) T protein:vir:80 140 GNKNVKVSFATADCMYPLSND-SENVDECLIANSFHKNN---KYY-KLL----EWNEWKGEKEEVYTVTTELYQSDDPNE 210 (499) T ss_pred CCCcEEEEEEcCCceEEEEec-CCCeEEEEEEEEEeecC---eEE-EEE----EEEEecccceeeEEEEEEEEeccCccc Confidence 66668889999999874 455 57665443222222110 000 000 000000000112333333332221110 Q ss_pred cccccccccEEEEEEEecCCCceeeeecCcccCCeEEEE----eeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 223 SKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR----WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLID 298 (559) Q Consensus 223 ~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~r----w~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~ 298 (559) -+...|..++|-+. ... 
....|....||+.++ .+...++++|+| -..++.+.+..|+..--......+ T Consensus 211 ---lG~~v~l~~~~~~~--~~~--~~~~~~~~p~f~~~~~~~~N~~~~~splG~S-~~~~~~~lid~lD~~~s~~~~e~~ 282 (499) T protein:vir:80 211 ---LGGKVSLKLLFNDI--EPV--VPLPSLTRPTFIYIKPNIANNKNLTSPLGIS-VYANALDTLKTLDLMFDSYYQEFK 282 (499) T ss_pred ---cCcccchhhhccCc--CCc--eeecCCCccceEeecCCccccccCCCccCCc-hHhhHHHHHHHHHHHHHHHHHHHH Confidence 01122222222111 111 111234455565554 344568899999 489999999999999999888877 Q ss_pred HHhcCceeecCCCcccc--ce------ecCCc----eeecCCcCCchhhhhhhhccccH--HHHHHHHHHHHHHHHHHhh Q lcl|NC_019445. 299 KATNPPMVAPTSLKNQR--AS------LLPGD----ITYIDQITGQDGFRPAYLVNPST--ADLVADIQDTRQIINSAYF 364 (559) Q Consensus 299 ~~~~p~~~~p~~~~~~~--~~------~~pg~----~~~~~~~~~~~~~~p~~~~~~~~--~~~~~~i~~~~~rI~~af~ 364 (559) . .+..+.+|.++.... .+ ..+.- .+.+...++...++.. ++++ ....+.++.+...|....= T Consensus 283 ~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~---~~~ir~e~~~~~l~~~l~~i~~~~g 358 (499) T protein:vir:80 283 L-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDI---SVEIRSTEFIESINAMLRIYAMQVG 358 (499) T ss_pred h-cccceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEe---cCcCChHHHHHHHHHHHHHHHHhcC Confidence 6 455566655432110 00 00110 1111112222223221 2222 2223334444444432220 Q ss_pred cchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHH Q lcl|NC_019445. 365 VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVM 444 (559) Q Consensus 365 ~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~L 444 (559) . -..+++.......|||||....+...+...-.-..+ ...|..++..++++..-.+.+...+. 
...++.+.+--++ T Consensus 359 ~-s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~v~v~f~d~i 434 (499) T protein:vir:80 359 L-SAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLI-EQGIKEMIVSILEVGKLIKAYDGDTV--ELDTITVDFDDSI 434 (499) T ss_pred C-ChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccccCCCC--CccceEEEeCCCC Confidence 0 011233334456799999998888877766655554 34566777777766554443322111 1235677775443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 445 AQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 445 a~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ..- +...++ .+.++.+.+ .+... ..++...|++ ++|++++.++.+ +++..+ T Consensus 435 ~~d-~~~~~~-------~~~~~~~~G-----i~S~e---t~l~~~~~~~------d~ea~~el~~i~----~E~~~~--- 485 (499) T protein:vir:80 435 AQD-EDTTIN-------RYTTAKNQG-----MIPLK---IALQRAWNIT------EAEADEWAEMLA----KEKQAE--- 485 (499) T ss_pred CCC-HHHHHH-------HHHHHHHcC-----CCCHH---HHHhhcCCCC------hHHHHHHHHHHH----HHhhcC--- Confidence 211 111111 111211111 12222 2245566763 344332222111 010000 Q ss_pred HHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 525 AAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 525 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) . +..+. 
.|..|..+ T Consensus 486 -~----------~~~d~----------~g~~ge~e 499 (499) T protein:vir:80 486 -I----------PNNDM----------TGIFGEEE 499 (499) T ss_pred -C----------CCCCc----------cccCCCCC Confidence 0 10000 11122222 No 58 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.24 E-value=3.9e-10 Score=72.20 Aligned_cols=460 Identities=11% Similarity=0.106 Sum_probs=203.9 Q ss_pred CChhhHHHHHHHHHHH------HH-----------HhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQL------ES-----------ERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~~~~~l~~r~~~l------~~-----------~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~ 63 (559) |-+...+.++.-+.++ +. .+-.....|+++++=--+-........ ........++--+.+.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~-~~~~~~~~~~~~n~~k~ 79 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEH-NGNPVNRRQLSMNLPKV 79 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhcc-CCCccccceeecchHHH Confidence 6666655555444443 11 122345566665532112111111111 11111223343456777 Q ss_pred HHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeec Q lcl|NC_019445. 
64 AARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLED 143 (559) Q Consensus 64 a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~ 143 (559) .++.+|+-|.+- |+ .++..|+ +..+ .+.+.+...+|...+.++..+...+|.+.+++..| T Consensus 80 i~~~~a~~l~~~--p~-----~i~~~d~------~~~e-------~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D 139 (496) T protein:vir:38 80 TAKYMSKLLFNE--KV-----KINIDDK------AAEE-------FVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHD 139 (496) T ss_pred HHHHHhhhhhCC--cc-----eEeeCCh------HHHH-------HHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEc Confidence 777777633211 11 1333332 2222 34446667889999999999999999999998888 Q ss_pred CCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhc-CCCCceEEEEEEEeecCcccc Q lcl|NC_019445. 144 DEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWES-GTYEKWIEVMHSVYPNIDRDT 222 (559) Q Consensus 144 ~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~-~~~~~~v~v~~~v~p~~~~~~ 222 (559) ....+++..++...++--.+..|.+..+......+.+.. .| - .++. ...+..+.|.+.+|...+.+ T Consensus 140 ~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~--~y--~--------~le~h~~~~~~~~I~~~~y~~~~~~- 206 (496) T protein:vir:38 140 GNKNVKVSFATADCMYPLSNDSENVDECVIANSFHKNNK--YY--T--------LLEWNEWQGDVYTVTTELYQSDDPN- 206 (496) T ss_pred CCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeCCe--EE--E--------EEEEEEEeCceEEEEEEEEecCCcc- Confidence 766788889999998854455677654332212211000 00 0 0000 00011122333333222110 Q ss_pred cccccccccEEEEEEEecCCCceeeeecCcccCCeEEEE----eeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 223 SKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR----WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLID 298 (559) Q Consensus 223 ~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~r----w~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~ 298 (559) .+ +...|+..+|-.. ... ..-.|+...||..++ .+...++.||+|. 
..++++-+..|+..--......+ T Consensus 207 -~~-g~~v~~~~~~~~~--~~~--~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd-~~~~~~lid~ld~~~s~~~~~~~ 279 (496) T protein:vir:38 207 -EL-GTKVSLTLLFDDI--EPV--VPLPDFTRPTFIYIKPNIANNKNLTSPLGISV-YANALDTLKTLDLMFDSYYQEFK 279 (496) T ss_pred -cc-Ccccccccccccc--ccc--eeecCCCcceEEEecCCcccccccCCcCCCch-HhhHHHHHHHHHHHHHHHHHHHh Confidence 00 1122222222111 001 111233444554433 3446678999995 99999999999998888888777 Q ss_pred HHhcCceeecCCCcccccee--------cC-Ccee---ecCCcCCchhhhhhhhccccH--HHHHHHHHHHHHHHHHHhh Q lcl|NC_019445. 299 KATNPPMVAPTSLKNQRASL--------LP-GDIT---YIDQITGQDGFRPAYLVNPST--ADLVADIQDTRQIINSAYF 364 (559) Q Consensus 299 ~~~~p~~~~p~~~~~~~~~~--------~p-g~~~---~~~~~~~~~~~~p~~~~~~~~--~~~~~~i~~~~~rI~~af~ 364 (559) + .++.+.+|.++.....+. .+ -... .....++...++ ..++++ ....+.++.+.+.|....= T Consensus 280 ~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~---~~~~~i~~e~~~~~l~~~l~~i~~~~g 355 (496) T protein:vir:38 280 L-GKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIK---DISVEIRSTEFIESINAMLRIYAMQVG 355 (496) T ss_pred h-cccceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccce---eeccccCHHHHHHHHHHHHHHHHHhhC Confidence 6 566667765542111100 00 0111 111111112222 222222 2233344444444433220 Q ss_pred cchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHH Q lcl|NC_019445. 365 VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVM 444 (559) Q Consensus 365 ~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~L 444 (559) . 
...+++...+...||+||..+.+...+...- ..+.....|..++.-++++....+.+-..+ ..+.++++.+.-++ T Consensus 356 ~-~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~--~~~~~i~v~f~d~i 431 (496) T protein:vir:38 356 L-SAGTFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEV--VELDTITVDFDDSI 431 (496) T ss_pred C-ChhhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCccceEEEeCCCC Confidence 0 0112233344567999999888877776554 344445677777777776654333221111 11234666664432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 445 AQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 445 a~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ..- ....++ .+.++.+++ .+....+ +....|++ ++|++++.++ .+..+++ + ++ T Consensus 432 ~~d-~~~~~~-------~~~~~~~~G-----iiS~et~---l~~~~~~~------d~ea~~el~r-i~~E~~~---~-~~ 484 (496) T protein:vir:38 432 AQD-EDTTIN-------RYTNAKNQG-----MIPLKIA---LQRAWNIT------EAEADEWAEM-LAKEKQA---E-MP 484 (496) T ss_pred CCC-HHHHHH-------HHHHHHhcC-----CCCHHHH---HHhcCCCC------hHHHHHHHHH-HHHhhhc---c-Cc Confidence 210 011111 111221111 1332222 33445653 3443322211 1111100 0 00 Q ss_pred HHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
525 AAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 525 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ..+ ..+.+|..| T Consensus 485 -~~d----------------------~~~~~~~~e 496 (496) T protein:vir:38 485 -NND----------------------MNGIFGEEE 496 (496) T ss_pred -ccc----------------------ccCCCCCCC Confidence 000 001111111 No 59 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.21 E-value=5.2e-10 Score=71.46 Aligned_cols=447 Identities=13% Similarity=0.077 Sum_probs=189.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRG-SRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.......|.+.| ... .++.+.+.+|..-.. .+..+.. .+.+.+..++..+-+..+++++++.| ++. T Consensus 13 ~~~~~~~~L~~~~---~~~----~~r~~~~~~YY~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~ 80 (485) T protein:vir:24 13 DPAIARDEMVSAF---EDQ----NQNLRSNTSYYEAERRPEAIGVT-VPVQMQSLLAHVGYPRLYVDSIAERQ----AVE 80 (485) T ss_pred chHHHHHHHHHHH---HHH----HHHHHHHHHHHhccCchhhcCcc-cchhhhhhhhccchHHHHHHHHhhhh----ccC Confidence 5554455555444 222 233334444432110 0001111 11222334455677788888877755 333 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC--------ceEEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE--------DIIRTM 151 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~--------~~~~~~ 151 (559) + |+ .++.+... ..+.+.+...+|.....++.++..+||.|.++|..+.. ...++. 
T Consensus 81 g--~~---~~~~~~~~------------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~ 143 (485) T protein:vir:24 81 G--FR---LGDADEAD------------EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIR 143 (485) T ss_pred c--ee---cCCCchhH------------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEE Confidence 2 32 22221111 11233455678889999999999999999998876532 234677 Q ss_pred EeeccEEEEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 152 ~~~l~~~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) .++..+.++..|+ .+++...++.+.- + .+ ..+..++...++. T Consensus 144 ~~~p~~~~~i~D~~~~~~~~~~~~~~~-----------~-----------~~---~~~~~~~~y~~~~------------ 186 (485) T protein:vir:24 144 VEPPTRMYAEIDPRIGRPAKAIRVAYD-----------A-----------EG---NEIQAATLYTPNE------------ 186 (485) T ss_pred EeccceeEEEeeCCcCceeEEEEEEEe-----------e-----------cC---CeEEEEEEEcCCc------------ Confidence 7888888877774 4555555544320 0 01 1122222211111 Q ss_pred cEEEEEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_019445. 231 PFKSVYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPPMVA 307 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~p~~~~ 307 (559) .+++...+..-... .+-+|..+|++.++.+...+..||+|- ..+ ..+-+..++...-.+...++..+.|.+.+ T Consensus 187 ---~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~-i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i 262 (485) T protein:vir:24 187 ---TFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSE-ITPELRSMTDAAARILMLMQATAELMGVPQRLI 262 (485) T ss_pred ---EEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCccc-chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 01111111111011 124577899999998888888999984 543 34556777777777888888888887654 Q ss_pred cC----C-----Cc-cccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-chh-hhccCCC Q lcl|NC_019445. 
308 PT----S-----LK-NQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DLF-MMLQNIN 375 (559) Q Consensus 308 p~----~-----~~-~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl~-~~~~~~~ 375 (559) .+ + .+ ...+...+|.++..+..+ .. +. + .+...+...++.++.-|...... ++. ..++... T Consensus 263 ~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~-~~---q--~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~ 335 (485) T protein:vir:24 263 FGIKPEEIGVDPETGQTLFDAYLARILAFEDAE-GK-IQ---Q--FSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA 335 (485) T ss_pred ccCCccccccccccccchhhhcccceeccCCCC-ce-EE---e--ecccchHHHHHHHHHHHHHHhcccCCCHHHhcccc Confidence 31 1 11 112344566655443211 11 11 1 11112223334444444433311 110 1111110 Q ss_pred CCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHH Q lcl|NC_019445. 376 TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSS 455 (559) Q Consensus 376 ~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~ 455 (559) .-..++.-+.. ....+.. ..++.+ ..+.+-+.+++.++.+.......+.. ...+++++..++..- ..+.+ T Consensus 336 ~n~~Sg~Al~~-~~~~l~~---ka~~~~-~~f~~~l~~~~~l~~~~~~~~~~~~d--~~~i~v~f~~~~~~s-~~~~a-- 405 (485) T protein:vir:24 336 DNPASAEAIRA-AESRLIK---KVERKN-AIFGGAWEEAMRLAYRLMKGGDVPPD--MLRMETVWRDPSTPT-YAAKA-- 405 (485) T ss_pred CcchHHHHHHH-HHHHHHH---HHHHHH-HHHHHHHHHHHHHHHHHhcCCCCccc--cceeeEEecCCCCCC-HHHHH-- Confidence 00112322222 2222222 223333 23445555666665442111112222 235677775443210 11122 Q ss_pred HHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhh Q lcl|NC_019445. 456 LASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQ-QQMMAMGMAAAQGAKTLS 534 (559) Q Consensus 456 l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~-~~~~~~~~~~~~~a~~~~ 534 (559) ..+..|++.+.. .+.-+. +...+|+. +++++++++.+.++..+ ++...++... 
+-....+ T Consensus 406 -----d~~~kl~~~g~~---~~s~et----~~~~l~~~------~d~~~e~~~~~ee~~~~~~~~~~~~~~~-~~~~~~~ 466 (485) T protein:vir:24 406 -----DAATKLYGNGQG---VIPRER----ARKDMGYS------IAEREEMRRWDEEEAAMGLGLLGTMVDA-DPTVPGS 466 (485) T ss_pred -----HHHHHHHhcccc---cCCHHH----HHhhCCCC------HhHHHHHHHHHHHHhhhhhhHHHhhccc-CCCCCCC Confidence 222222222211 122222 23445653 45555555443333221 1111111110 0000011 Q ss_pred hhcCCChhHHHHHHHHhhcCCCCC Q lcl|NC_019445. 535 EAKTSDPSVLSAMANAVSGQGGQS 558 (559) Q Consensus 535 ~~~~~~~~~~~~~~~~~~~~~~~~ 558 (559) +..+..++. +. +..++-+| T Consensus 467 ~~~~e~~~~-~~----~~~~~~~a 485 (485) T protein:vir:24 467 PNPTPAPKP-QP----AIEGGDSA 485 (485) T ss_pred CCCCCCCCC-cc----CCCCCCCC Confidence 111111110 00 00111111 No 60 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.21 E-value=5.7e-10 Score=71.25 Aligned_cols=450 Identities=12% Similarity=0.082 Sum_probs=190.8 Q ss_pred CChh--hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAET--TKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~--~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |-+. ....+...+..+...+ ++.+++.+|..-..- +..+.. .+...+..+...+-+..++++++..| + T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~----~r~~~~~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l----~ 78 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDST----QNLKTNTSYYEAERRPEAIGVT-VPIQMQSLLAHVGYPRLYVDSIAERQ----A 78 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCcchhcCCC-CChhhhhhhhhcCcHHHHHHHHHhhh----c Confidence 4332 2222333344443333 445556666432110 001111 11122233455677888888888765 3 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC--------ceEE Q lcl|NC_019445. 
78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE--------DIIR 149 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~--------~~~~ 149 (559) |.+ |+. ++.+.. ...+.+.+.+++|.....++.++..+||.|.+++..+.. ...+ T Consensus 79 ~~g---~~~--~~~~~~------------~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~ 141 (485) T protein:vir:10 79 VEG---FRF--GDADEA------------DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPI 141 (485) T ss_pred ccc---eec--CCCchh------------HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeE Confidence 333 322 222111 112333456788999999999999999999998876532 2346 Q ss_pred EEEeeccEEEEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccc Q lcl|NC_019445. 150 TMPFPIGSYYLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) Q Consensus 150 ~~~~~l~~~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~ 228 (559) +..++..+.++..|+ .+++...++.+. .+ . +..++.++...+.. T Consensus 142 i~~~~p~~~~~~~D~~~~~~~~~~~~~~-----------~~-----------~---~~~~~~~~~y~~~~---------- 186 (485) T protein:vir:10 142 IRVEPPTRMYAEIDPRIGRVSKAIRVAY-----------DA-----------E---GNEIQAATLYTPND---------- 186 (485) T ss_pred EEEEccceeEEEEcCCCCceeEEEEEEE-----------ee-----------C---CCeEEEEEEEeCCe---------- Confidence 777888887777764 455555454321 00 0 11222222111110 Q ss_pred cccEEEEEEEecCCCcee--eeecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 229 NKPFKSVYYEVGGDNDKL--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 229 ~~~~~sv~~~~~~~~~~i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) .+++.-.+..-.. 
..+-+|..+|++.+..+...+..||+|- ..+ .++-+..++...-.+...++..+.|.+ T Consensus 187 -----~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~-i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~ 260 (485) T protein:vir:10 187 -----IFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSE-ITPELRSMTDAAARILMLMQATAELMGVPQR 260 (485) T ss_pred -----EEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccc-hhHHHHHHHHHHHHHHHHHHHHHHhhcchHH Confidence 0111111111011 1234678899999999999999999984 443 456677888888888888999988876 Q ss_pred eecC----CC-----c-cccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-chh-hhccC Q lcl|NC_019445. 306 VAPT----SL-----K-NQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DLF-MMLQN 373 (559) Q Consensus 306 ~~p~----~~-----~-~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl~-~~~~~ 373 (559) .+.+ +. + ...+...+|.++..+..+. . +. ......+. ..++.++.-|...+.. ++. ..++. T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~-k-~~--q~~~~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~ 333 (485) T protein:vir:10 261 LIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEG-K-IQ--QFSAAELA---NFTNALDQIAKQVAAYTGLPPQYLST 333 (485) T ss_pred HHhcCCcccccccccccchhhhhcccceeccCCCCc-e-EE--eecccchH---HHHHHHHHHHHHHhcccCCCHHHhcc Confidence 5422 11 1 1113344566555432221 1 11 11111222 2334444444433321 000 01111 Q ss_pred CCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHH Q lcl|NC_019445. 374 INTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGL 453 (559) Q Consensus 374 ~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~ 453 (559) ...-.-++.-+......+.. ..++. .+.+.+-+.+.+.++...-.....+.+ ...+++++..++.. 
T Consensus 334 ~~~n~~Sg~Al~~~~~~l~~----k~~~k-~~~f~~~l~~~~~l~~~~~~~~~~~~~--~~~i~v~w~~~~~~------- 399 (485) T protein:vir:10 334 AADNPASAEAIRAAESRLIK----KVERK-NSIFGGAWEEAMRLAYRMMKGGDVPPD--MLRMETVWRDPSTP------- 399 (485) T ss_pred ccCchhHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHHhCCCCCccc--ceeeeEEecCCCCC------- Confidence 00001133333322222211 12332 223444455555554432111222222 23466776555421 Q ss_pred HHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhh Q lcl|NC_019445. 454 SSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQ-QQQMMAMGMAAAQGAKT 532 (559) Q Consensus 454 ~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q-~~~~~~~~~~~~~~a~~ 532 (559) ++.+.++.+..+.+.++. .+.-+.++ ..+|+. ++++++++..++++++ ...++.. +...... T Consensus 400 -~~~~~ada~~kl~~ag~~---~~s~et~~----~~lg~~------~~~~~~~~~~~ee~~~~~~~~~~~---~~~~~~~ 462 (485) T protein:vir:10 400 -TYAAKADAASKLYNGGTG---VIPRERAR----KDMGYS------IAEREEMRRWDEEEAAMGLGLIGT---MVDPNPT 462 (485) T ss_pred -CHHHHHHHHHHHHhcccc---CCCHHHHH----HhCCCC------HhHHHHHHHHHHHHHHHHHHHHHH---hhccCCC Confidence 111122223333332211 12322222 346653 3445555433322221 1111111 1110000 Q ss_pred hhhhcCCChhHHHHHHHHhhcCCCCC Q lcl|NC_019445. 533 LSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (559) ...+...++. .-...-.-+|.+| T Consensus 463 ~~~~~~~~~~---~~~~~~~~~~~~~ 485 (485) T protein:vir:10 463 VPGSPSPAPA---PKPAALESGGDAA 485 (485) T ss_pred CCCCCCcccc---ccCcCCCCCCCCC Confidence 0000000000 0000011112222 No 61 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.17 E-value=8.9e-10 Score=70.20 Aligned_cols=418 Identities=11% Similarity=0.105 Sum_probs=197.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc---ccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINP---RGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P---~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |......++.. +++. |. .+++.+.+|..- -..+.. . ...+.+.++..+.+...++..++-|++ T Consensus 1 l~~~~l~~~i~---~~~~-~~---~r~~~l~~yy~g~~~il~~~~---~-~~~~~~~ki~~n~~~~ivd~~~~~l~g--- 66 (429) T protein:vir:98 1 MTKDLLSELIQ---KHRS-FN---LSYSAYKQLYEGDHAILQQKQ---K-EQYKPDNRLVVNFAKYIVDTFNGYFIG--- 66 (429) T ss_pred CCHHHHHHHHH---HHHH-HH---HHHHHHHHHhccccccccccc---c-ccCCCcceeecchHHHHHHHHhhhhcc--- Confidence 66555444444 4433 22 334444444321 111111 1 112234567778888888888876643 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) .| +.++..++ .+...+...+...+|.....++.++..+||.|.+++..+...-+++..++..+ T Consensus 67 ---~~-~~~~~~~~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~ 129 (429) T protein:vir:98 67 ---VP-VQTSHENK-------------QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLE 129 (429) T ss_pred ---cC-ceeecCCh-------------HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccc Confidence 11 22333322 12334455566788999999999999999999998887765557788888777 Q ss_pred EEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 158 YYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 158 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) .++-.|. .+++...+|.+.- .+ .+ .+.++-..+. .. 
T Consensus 130 ~~~v~dd~~~~~~~~~i~~~~~------------------------~~---~~--~~~~~~~~~~-------------~~ 167 (429) T protein:vir:98 130 AFIVYDDSIRQKPLFAVRYFYN------------------------KG---GV--LEGSYSDASN-------------IT 167 (429) T ss_pred eEEEEeCCCCCceEEEEEEEEe------------------------cC---ce--EEEEEEeCce-------------EE Confidence 7666554 3334444443310 01 01 1111111110 01 Q ss_pred EEEecCCCceeeee--cCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc Q lcl|NC_019445. 236 YYEVGGDNDKLLRE--SGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN 313 (559) Q Consensus 236 ~~~~~~~~~~il~e--sg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~ 313 (559) |+..++.+..+... -++..+|++.++ ++.+|+|. .+...+-+..++.+.-......+....|.+++.+.... T Consensus 168 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd-~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~ 241 (429) T protein:vir:98 168 YFKDGEKGIEIGESEPHPFDGVPMIEYV-----ENEERQSL-LASVVTLINAFNKAISEKANDVEYFADAYLKILGAELD 241 (429) T ss_pred EEEecCCceEecccccccCCccceEEec-----CCCCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCC Confidence 11111111122211 256678887643 35689996 88899999999999999999999999998876543211 Q ss_pred c--cceecCCceeecCCcCCch-hhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 314 Q--RASLLPGDITYIDQITGQD-GFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 314 ~--~~~~~pg~~~~~~~~~~~~-~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) . ..+...++++.++..++.+ .+..+. -..+...+...++.+.+.|-...+.. . 
+........|+..+..+..- T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p--~-~~~~~~gn~Sg~Al~~~~~~ 317 (429) T protein:vir:98 242 DETLKSLRDTRIINLKDTDAQQLTVEFLQ-KPDADATQEHLLDRLENLIFRTAMVA--N-ISDESFGTASGIALRYRLQA 317 (429) T ss_pred cchhhhHhhCceeeccCCCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCcc--c-cCccccccchHHHHHHHHHH Confidence 1 1234455666665443322 222221 12244555556677777665444331 1 11111123455444332221 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019445. 391 KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAK 470 (559) Q Consensus 391 ~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~ 470 (559) |--..++.+. .+..-+.+++.++.+.-....-+. .-..+++.+.-++.+- +...++.++.++++ T Consensus 318 ----l~~k~~~~~~-~~~~~l~~~~~li~~~~~~~~~~~--d~~~i~v~f~~~~p~~--------~~~~a~~~~kl~g~- 381 (429) T protein:vir:98 318 ----MDNLAKTKER-KFMSGMNRRYKLIASYPTSKIGPK--DWIGIKYKFTRNLPAN--------LLEESQIAGNLAGI- 381 (429) T ss_pred ----HHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcc--ccccceEEeCCCCCcC--------HHHHHHHHHHHhcc- Confidence 1122222222 233334444444444211111111 1223556554333211 11112222333322 Q ss_pred hhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCCh Q lcl|NC_019445. 471 PEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDP 541 (559) Q Consensus 471 P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~ 541 (559) +..+.+ ...+| ++ -.++|++++++++.+..+.++ .+-...+..+... 
T Consensus 382 ------is~et~----~~~l~~v~----d~~~E~~ri~~E~~~~~~~~~----------~~~~~~~~~~~~~ 429 (429) T protein:vir:98 382 ------VSEETQ----VGVLSIVE----NPQKEIERKNSDKSTLISRQA----------GGLNGQNTTTILE 429 (429) T ss_pred ------CchHHH----HHhCCCCC----CHHHHHHHHHHHHHHHHHHHH----------hhhcCCCCCCCCC Confidence 222222 23343 32 124666666555443222111 1111112222211 No 62 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.17 E-value=9.7e-10 Score=69.99 Aligned_cols=428 Identities=9% Similarity=0.026 Sum_probs=203.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |.. +.|.+..+..+.. .++.+.+.+|..-.- .-.........+...++..+.+...++..++-|++ T Consensus 17 ~~~---~~i~~~i~~~~~~----~~r~~~~~~Yy~g~~-~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g------ 82 (452) T protein:vir:36 17 ITV---EVVTKFMEKHKLE----VARYEYLKNMYLGIM-AIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNG------ 82 (452) T ss_pred CCH---HHHHHHHHHHHHH----HHHHHHHHHHhcccc-ccccCccccccCccceeecchHHHHHHHHhhhhcc------ Confidence 433 3344434433332 344556666654321 00111111112234567778888888888876543 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYL 160 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v 160 (559) .| +.+...|+. ....+...+...+|-...+++.++..+||.|++++..+....+++..++..+.+. 
T Consensus 83 ~~-~~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (452) T protein:vir:36 83 IP-VKKSHSDKE-------------ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFM 148 (452) T ss_pred cC-ceeecCChh-------------HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEE Confidence 11 223333322 2233556667789999999999999999999998888766667888888888877 Q ss_pred eeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEE Q lcl|NC_019445. 161 ANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYE 238 (559) Q Consensus 161 ~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~ 238 (559) -.|.. +.+...+|.+.-. .....++||+. . ..+++. T Consensus 149 v~d~~~~~~~~~~i~~~~~~------------------------~~~~~~~vyt~----~--------------~i~~~~ 186 (452) T protein:vir:36 149 VYDDTVKQEPLFAVRYGVDE------------------------DKKLQGEVYTL----L--------------ETIKIS 186 (452) T ss_pred EEcCCCCCceEEEEEEEEec------------------------CceEEEEEEec----C--------------eEEEEE Confidence 66653 3444444443210 11123343321 0 011222 Q ss_pred ecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--c Q lcl|NC_019445. 239 VGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--Q 314 (559) Q Consensus 239 ~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--~ 314 (559) ..+.+-.+. ..-+|..+|++.++. +..|+|. .+...+-+..++.+.-.....++...+|.+++.+.... . T Consensus 187 ~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~sd-~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~ 260 (452) T protein:vir:36 187 GENDEISFGEGTYNPYPDLPVVEFYF-----NEERMSI-FESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEED 260 (452) T ss_pred EcCCceEEecceeccCCcccEEEecC-----CCCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchh Confidence 222111111 123566788776644 3468886 77888889999999999999999999998887653211 1 Q ss_pred cceecCCceeecCCcCCch--hhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHH Q lcl|NC_019445. 
315 RASLLPGDITYIDQITGQD--GFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL 392 (559) Q Consensus 315 ~~~~~pg~~~~~~~~~~~~--~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~ 392 (559) ..++.+++.+.+...+... .+.-+ +.+.+...+...+..+++.|-..-...-+ ....-...|+..+..+-.-+. T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p~~---~~~~~gn~Sg~Al~~~~~~l~ 336 (452) T protein:vir:36 261 LKNIRSNRVINYYADGEGKNVDVKFL-EKPDSDSQTENLLDRLTKLIFQTTMVANI---SDESFGSSSGVSLAYKLQAMS 336 (452) T ss_pred hhhhhhcceEEecCCCCccCCcceeE-eecCCHHHHHHHHHHHHHHHHHHhCcccc---CcccccCCcHHHHHHHHHHHH Confidence 2335556655554322211 22211 11234555556667777766444432111 111223456665544332222 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChh Q lcl|NC_019445. 393 LMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPE 472 (559) Q Consensus 393 ~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~ 472 (559) . ......+.....+..+++-++.++...|. + ....+|+|++.-++.. + +...++.++.++++ T Consensus 337 ~-k~~~~~~~~~~~l~~~~~li~~~~~~~~~----~--~~~~~i~i~f~~~~p~-----d---~~~~a~~~~k~~g~--- 398 (452) T protein:vir:36 337 N-LALSFQRKFQSSLNSRYKLFCELSTNVSN----K--DSWKDIEYTFTRNEPK-----D---IKEQAETANILMGI--- 398 (452) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHhccCC----c--cccccceEEeCCCCCc-----C---HHHHHHHHHHHhcc--- Confidence 2 12222222233344444444444444342 1 1223456666444321 0 11112222233222 Q ss_pred hHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 473 ALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 473 ~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) +....++ ..+|.-. -.++|++++.++++++.+..+.. 
....-+.-.+....+++ T Consensus 399 ----iS~et~~----~~~~~~~---d~~~E~~ri~~E~~~~~~~~~~~-----~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 399 ----TSQETAL----SVISVIP---DVQAEMEKIKKEEASTAIFDKDK-----QPSEKGTDTVVSETNEE 452 (452) T ss_pred ----CChHHHH----HhCCCCC---CHHHHHHHHHHHHHHHHHHHhhc-----cCCCCcccccCccccCC Confidence 3323333 3333211 13567766665543322211110 00000000111111111 No 63 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.16 E-value=1e-09 Score=69.86 Aligned_cols=430 Identities=11% Similarity=0.063 Sum_probs=195.7 Q ss_pred CC----hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----cCC-CCCCCCCCcccccCCCCcchHHHHHHHHHH Q lcl|NC_019445. 1 MA----ETTKERLNKQFAQLESERQSFEPHWRELSDYINPR-----GSR-FLTSEVNRNDRRNTRIIDSTGTMAARTLAS 70 (559) Q Consensus 1 M~----~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~~~-~~~~~~~~~~~~~~~~~~s~~~~a~~~Las 70 (559) +- +.+.+.|.+..+..+. ..++++.+.+|..-. +.+ ..........+.+.++..+.+...++..++ T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~----~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~ 112 (492) T protein:vir:94 37 RTNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS 112 (492) T ss_pred ccCCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHh Confidence 11 1223333333344333 234555666664311 100 111111111223456778888888998887 Q ss_pred HHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 71 GMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 71 ~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) -|++ .| +.++..|.. +.+. 
+...+ ..+|-....++.++..+||.|.+++..|....+++ T Consensus 113 yl~G------~p-~~~~~~d~~------~~~~-------l~~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~ 171 (492) T protein:vir:94 113 YIVG------KP-IAFKHTDDE------VVKR-------IDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKL 171 (492) T ss_pred hhcc------cC-ceeccCchH------HHHH-------HHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEE Confidence 5532 12 123333321 1111 22222 35677888899999999999999888776666788 Q ss_pred EEeeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE------EEeecCcccc Q lcl|NC_019445. 151 MPFPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH------SVYPNIDRDT 222 (559) Q Consensus 151 ~~~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~------~v~p~~~~~~ 222 (559) ..++..+.++-.| ..+++...+|.+... ....+++++ .+....... T Consensus 172 ~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~-------------------------~~~~~~~y~~~~v~~~~~~~~~~~- 225 (492) T protein:vir:94 172 FRVPAEQGIPIWTDKEHEELEAFIRMYKLE-------------------------NETKVEYWDKVTVNYYVYENGSLI- 225 (492) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEeec-------------------------cceeEEEEecCeEEEEEEecCeee- Confidence 8888888766655 457776666654421 011233322 111110000 Q ss_pred cccccccccEEEEEEEecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 223 SKLDSKNKPFKSVYYEVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKAT 301 (559) Q Consensus 223 ~~~~~~~~~~~sv~~~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~ 301 (559) . .+........+ ...-+|..+|++.++- +.+|.|. .+..++.+..++.+.-.+...++... 
T Consensus 226 -----------~-~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~sd-~e~v~~liDa~d~~~S~~~~~~~~~~ 287 (492) T protein:vir:94 226 -----------P-DYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSN 287 (492) T ss_pred -----------e-ccccccccccccccccCCCccceEEecC-----CCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 0 00000000001 1123567788876643 4578896 88899999999999999999999999 Q ss_pred cCceeecCCC----ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 302 NPPMVAPTSL----KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 302 ~p~~~~p~~~----~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) +|.+++.+-. ......+..++++-++..++ +..+ +.+.+...+...++.++..|...-+..-+. ...-+. T Consensus 288 ~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~--~~~~~~ 361 (492) T protein:vir:94 288 ELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGG---VDTI-QVEVPVENSKKYLDELYQKIMLFGQAVDFS--SDKFGS 361 (492) T ss_pred CceeeeecCCcccchhhHHHHhhccceecCCCCc---ceeE-eccCCHHHHHHHHHHHHHHHHHHhCCcCCC--cccccc Confidence 9988765421 11011223334444433222 2222 122344555566777777665544322111 111122 Q ss_pred CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHH Q lcl|NC_019445. 378 SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 (559) Q Consensus 378 ~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~ 457 (559) ..|+.-+...-.- |--..++.+. .+..-+.+++.++.+..-.+. + ..++++++.-.+.. ++. 
T Consensus 362 n~Sg~Al~~~~~~----l~~k~~~k~~-~f~~~l~~~~~li~~~~~~~~---~--~~~i~v~f~~~~p~--------~~~ 423 (492) T protein:vir:94 362 APSGVALEFLYTN----LNLKADKLAR-KAKVAIQELLWFVFEHFDIKG---E--HKDVDISFNYNKVA--------NTE 423 (492) T ss_pred CchHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCc---c--cceeeEEecCCCCC--------CHH Confidence 3344333222211 1112233333 344455556665555322221 1 23456665443331 011 Q ss_pred HHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019445. 458 STVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEA 536 (559) Q Consensus 458 ~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~ 536 (559) ..++.+..++++ +.-..++ ..++ ++ -.++|++++.+++++++++.+... +..... T Consensus 424 e~~~~~~kl~gi-------iS~et~~----~~l~~v~----d~~~E~eri~~E~~~~~~~~~~~~---------~~~~~~ 479 (492) T protein:vir:94 424 LQVQTAQQSMGI-------VSHETVL----ENHPFVE----DLQAELERIEQEQMEYNKQLPNLD---------DGGADS 479 (492) T ss_pred HHHHHHHHHhcc-------CchHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHhhccccc---------cccCCC Confidence 112223233222 2222222 2333 22 235677777666544443321110 000000 Q ss_pred cCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 537 KTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 537 ~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .... .+.+.++ T Consensus 480 ~~~~------------~~~~~~e 490 (492) T protein:vir:94 480 AQQQ------------ERSNNKE 490 (492) T ss_pred Cccc------------cCCcccc Confidence 0000 0001111 No 64 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.16 E-value=1e-09 Score=69.82 Aligned_cols=431 Identities=8% Similarity=0.035 Sum_probs=201.1 Q ss_pred CChh---hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 
1 MAET---TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~---~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |... ..+.|.+..+.... ...+++++.+|..-.- .-.........+.+.++..+.+..+++.+++-|++ - T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~----~~~r~~~~~~yy~g~~-~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~ 83 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRL----EVARYEYLKNMYRGIM-AIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNG--I 83 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHH----HHHHHHHHHHHhhccC-chhcCCCccccCccceeecchHHHHHHHHhhhhcc--c Confidence 3331 22333333333322 2344555555543210 00000001112234567778888888888886532 1 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) | ++++..+++ ....+.+.+...+|.....++.++..++|.|.+++..+....+++..++..+ T Consensus 84 ~-----~~~~~~d~~-------------~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 145 (453) T protein:vir:39 84 P-----VKKSHSDKE-------------TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPEN 145 (453) T ss_pred C-----ceeccCChH-------------HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccc Confidence 1 223333321 2234566677889999999999999999999999988876667888888888 Q ss_pred EEEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 158 YYLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 158 ~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) .++-.|. .++....+.++.. ..+....+++|+ +. 
+ .++ T Consensus 146 ~~~v~d~~~~~~~~~~ir~~~-----------------------~~~~~~~~~~yt---~~------~---------i~~ 184 (453) T protein:vir:39 146 MFMVYDDTIKQEPLFAVRYGY-----------------------DDDYKLYGEVYT---KE------T---------TYA 184 (453) T ss_pred eEEEecCCCCCeEEEEEEEEE-----------------------eCCeEEEEEEEe---CC------e---------EEE Confidence 7766653 3333333333221 011112233321 11 0 012 Q ss_pred EEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc- Q lcl|NC_019445. 237 YEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN- 313 (559) Q Consensus 237 ~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~- 313 (559) +...++.-.+. .+-+|..+|++.++. +.+|+|. .+...+-+..++.+.-..+..++...+|-+++.+.... T Consensus 185 ~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd-~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~ 258 (453) T protein:vir:39 185 LNGTMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSI-FESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE 258 (453) T ss_pred EEecCCceeeecccccCCCceeEEEecC-----CCCCCcc-hhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCc Confidence 22222211121 124667889887653 4578996 78888889999999999999999999998877542111 Q ss_pred c-cceecCCceeecCCcC---CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHH Q lcl|NC_019445. 314 Q-RASLLPGDITYIDQIT---GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKE 389 (559) Q Consensus 314 ~-~~~~~pg~~~~~~~~~---~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~ 389 (559) . ..+...++++...... ....+..+ +.+.+...+...+..++..|-..-...-+ ....-...|+..+..... 
T Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-t~~~~~~~~~~~~~~l~~~I~~~s~~p~~---~~~~~gn~Sg~Al~~~~~ 334 (453) T protein:vir:39 259 EDLKNIRSNRVINYYGESSEAKNVDVKFL-EKPDSDSQTENLLDRLTKLIFQTTMVANI---SDESFGSSSGVSLAYKLQ 334 (453) T ss_pred hhhhhhhhcceeeecCCCCCCCCCceeEE-eecCCHHHHHHHHHHHHHHHHHHhCCccc---ccccccCChHHHHHHHHH Confidence 1 1223444444332211 11122222 12234556666677777766443322111 111112345554443332 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019445. 390 EKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQA 469 (559) Q Consensus 390 e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~ 469 (559) -+.. ......+.-.+.+..++.-+..++...|. + ....+|+|.+.-++..- +...++.++.++++ T Consensus 335 ~l~~-ka~~~~~~~~~~l~~~~~li~~~~~~~~~----~--~~~~~i~v~f~~~~p~~--------~~~~a~~~~kl~g~ 399 (453) T protein:vir:39 335 AMSN-LALSFQRKFQSSLNSRYKLYCELSTNVSN----K--EAWKDIEYTFTRNEPKD--------IKEQAETANILMGI 399 (453) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCC----c--cccccceEEeCCCCCcC--------HHHHHHHHHHHhcc Confidence 2222 11222222233333333333444433332 1 22334666664443211 11112223333332 Q ss_pred ChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 470 KPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 470 ~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) +....++ ..++ ++ -.++|++++.+++++..+..+.. 
....-+.-.+.+..+++ T Consensus 400 -------is~et~l----~~l~~v~----D~~~E~~ri~~E~~~~~~~~~~~-----~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 400 -------TSQETAL----SVISVIP----DVQAEMEKIKKEEASTAIFDKDK-----QPSEKGTDTVVPETNEE 453 (453) T ss_pred -------CChHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhc-----cCCCCCCCCCCCCcCCC Confidence 2223333 2333 22 12566666655544333221111 01011111222222222 No 65 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.15 E-value=1.2e-09 Score=69.54 Aligned_cols=432 Identities=12% Similarity=0.076 Sum_probs=197.6 Q ss_pred CCh---hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----cCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAE---TTKERLNKQFAQLESERQSFEPHWRELSDYINPR-----GSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~---~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) |.. .+.+.|.+..+.... ...+++.+.+|..-. +.+. .........+.+.++..+.+..+++.+++- T Consensus 18 ~~~~~~~~~~~i~~~i~~~~~----~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~ 93 (472) T protein:vir:93 18 TNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 93 (472) T ss_pred ecCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhh Confidence 221 223333333333333 234555666664321 1110 111111112234467788899999998876 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) |++ .| +.+...|++ +.. .+...+ ..+|-..++++.++..+||.|.+++..+....+++. 
T Consensus 94 l~g------~~-~~~~~~d~~------~~~-------~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~ 152 (472) T protein:vir:93 94 IVG------KP-IAFKHTDDE------VVK-------RIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLF 152 (472) T ss_pred hcc------cC-eeeccCChH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEE Confidence 642 11 223333321 111 122223 357888999999999999999999988776668888 Q ss_pred EeeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE------EEeecCccccc Q lcl|NC_019445. 152 PFPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH------SVYPNIDRDTS 223 (559) Q Consensus 152 ~~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~------~v~p~~~~~~~ 223 (559) .++..+.++-.| ..+++...+|.+...- ...+++++ .++.... T Consensus 153 ~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~---- 203 (472) T protein:vir:93 153 RVPAEQGIPIWTDKEHEELEAFIRMYKLEN-------------------------ETKVEYWDKVTVNYYVYENGS---- 203 (472) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEeec-------------------------ceeEEEEecCeEEEEEEecCe---- Confidence 899988887765 3677766666544210 01222221 1111000 Q ss_pred ccccccccEEEEEEEecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 224 KLDSKNKPFKSVYYEVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 224 ~~~~~~~~~~sv~~~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) +.. .+........+ ...-+|..+|++.++. +.+|+|. .+...+.+..++.+.-.+...++...+ T Consensus 204 --------~~~-~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~s~-~e~v~~liDa~~~~~s~~~~~~~~~~~ 268 (472) T protein:vir:93 204 --------LIP-DYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSNE 268 (472) T ss_pred --------eee-cccccccccccccccCCCCCcceEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhcC Confidence 000 00000010111 1224577888887764 4589996 888999999999999999999999999 Q ss_pred CceeecCCCcc----ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
303 PPMVAPTSLKN----QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 303 p~~~~p~~~~~----~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) |.+++.+.... ..-.+..++++.++..++ ++.+. ...+...+...+..++..|....+..-+. ....+.. T Consensus 269 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~~~n 342 (472) T protein:vir:93 269 LTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGG---VDTIQ-VEVPVENSKKYLDELYQKIMLFGQAVDFS--SDKFGSA 342 (472) T ss_pred ceeEeecCCcccchhhHHHHhhccccccCCCCc---ceeEe-ecCCHHHHHHHHHHHHHHHHHHhCCCCCC--ccccccC Confidence 98877542111 111233444444433222 22221 12344555566677776665544321111 1111233 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .++.-+.....-+.. ...+.+. .+...+++++.++.+..-.+ .+ ..++++++.-.+.. ...+. T Consensus 343 ~Sg~Al~~~~~~l~~----ka~~~~~-~~~~~l~~~~~li~~~~~~~---~~--~~~i~v~f~~~~p~-~~~~~------ 405 (472) T protein:vir:93 343 PSGVALEFLYTNLNL----KADKLAR-KAKVAIQELLWFVFEHFDIK---GE--HKDVDISFNYNKVA-NTELQ------ 405 (472) T ss_pred chHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHhCCC---cc--cceeeEEeCCCCCC-CHHHH------ Confidence 444433322111111 1122222 23344455555554432111 11 22455555333221 01111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAK 537 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~ 537 (559) ++.+..++++ +....+ ...++ ++ -.++|+++++++++++++.++.... ..+....+.. 
T Consensus 406 -~~~~~k~~gi-------is~et~----l~~l~~~~----d~~~E~~ri~~E~~~~~~~~~~~~~-----~~~d~~~~~~ 464 (472) T protein:vir:93 406 -VQTAQQSMGI-------VSHETV----LENHPFVE----DLQAELERIEQEQMEYNKQLPNLDD-----GGADGAQQQE 464 (472) T ss_pred -HHHHHHHhcc-------CchHHH----HHhCCCCC----CHHHHHHHHHHHHHHHHHhccCcCc-----ccCCCCCCCC Confidence 2222233332 222222 22332 22 1356676666655443333211100 0000000000 Q ss_pred CCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 538 TSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 538 ~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ..++. ..| T Consensus 465 ~~~~~--------------~~e 472 (472) T protein:vir:93 465 RSNNK--------------ESE 472 (472) T ss_pred CCCcc--------------cCC Confidence 00000 000 No 66 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.15 E-value=1.2e-09 Score=69.45 Aligned_cols=451 Identities=13% Similarity=0.088 Sum_probs=196.6 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.-.. +.|....+.+..++ ++...+.+|..-..- +..+. ..+...+..++..+-+..+|+.+++.| ++. T Consensus 1 ~~t~~-~~i~~L~~~~~~~~----~r~~~l~~Yy~G~~~i~~~~~-~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~ 70 (480) T protein:vir:78 1 MTTYH-EHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGI-GAPPELAYLDVQPGWVATYLRTLSDRL----DIE 70 (480) T ss_pred CCCHH-HHHHHHHHHHHHHH----HHHHHHHHHHhcccccccccc-ccchhHhhhhhhcchHHHHHHHHHhhh----ccC Confidence 76643 44555555554433 334444455322110 01111 112222334566677778888777765 332 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee------cCCceEEEEEe Q lcl|NC_019445. 
80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE------DDEDIIRTMPF 153 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~------~~~~~~~~~~~ 153 (559) + |. .++.+ +. .+.+...+++++|.....++.++..+||.|.++|.. |.....++..+ T Consensus 71 g---~~--~~~d~-----~~-------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~ 133 (480) T protein:vir:78 71 G---FR--ISEDS-----EG-------LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred c---ee--cCCCc-----hh-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEE Confidence 2 22 22211 11 122344566789999999999999999999888764 33344778889 Q ss_pred eccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|+ .+++...+|.+.-. + .......+++|+ +.. T Consensus 134 ~p~~~~~~~D~~~~~~~~~~i~~~~~~----------~-----------~~~~~~~~~~y~---~~~------------- 176 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR----------D-----------DVAVPDRATLYL---PDE------------- 176 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee----------c-----------CCCceEEEEEEe---CCe------------- Confidence 99998888885 46676666554210 0 001112233321 100 Q ss_pred EEEEEEEe-cCCC--cee---eeecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 232 FKSVYYEV-GGDN--DKL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 232 ~~sv~~~~-~~~~--~~i---l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .+|+.. ++.. ... ..+-+|..+|+++++.+...+..||+|- ..+ ..+-+-.++...-.+...++..+.|. 
T Consensus 177 --~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~-i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~ 253 (480) T protein:vir:78 177 --TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTPL 253 (480) T ss_pred --EEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCccc-chhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 011111 0000 000 1134678899999999988899999995 554 45778888888888888888888887 Q ss_pred eeecCCC--------ccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhc-chh-hhccC Q lcl|NC_019445. 305 MVAPTSL--------KNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFV-DLF-MMLQN 373 (559) Q Consensus 305 ~~~p~~~--------~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~-dl~-~~~~~ 373 (559) +.+.+.. ....+...+|.+..... ++.. + .+.+ .++.... +.++.-|...+.. ++. ..++. T Consensus 254 ~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~---~~~~~~~~~~~~---~~l~~~i~~~~~~~~~p~~~~g~ 325 (480) T protein:vir:78 254 RVISGVTTDELTNDGENTTLDIYYGRILTLAS-EAAK-I---SEFKAAELRNFA---EEMEVFRKEAASITGLPPQYLSS 325 (480) T ss_pred hhhhcCCccccccccccchhhhhhhhhccCCC-CCce-E---EecCccCHHHHH---HHHHHHHHHHhcccCCChHHhcc Confidence 6653211 01112233343332221 1111 1 1111 1233333 3344333332211 000 11111 Q ss_pred CCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHH Q lcl|NC_019445. 374 INTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGL 453 (559) Q Consensus 374 ~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~ 453 (559) .....-++.-++..... |=-...+... .+.+-+.+++.++.+..-. ..+.+. ..+++++.-+... 
...+.+ T Consensus 326 ~~~n~~Sg~Alk~~~~~----l~~ka~~~~~-~f~~~l~~~~~l~~~~~g~-~~~~~~--~~i~v~f~~~~~~-s~~~~a 396 (480) T protein:vir:78 326 SSENPASAEAIIATDSR----IVKMAERKGR-IFGGAWERAMRIAMQIMGR-EVTEEY--TRLETVWRDPSTP-TVAAKA 396 (480) T ss_pred ccCcchHHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHHHcCC-Cccccc--eeeeEEecCCCCC-CHHHHH Confidence 11111133223222211 1112233333 3444556666665542110 112222 2355666433211 111222 Q ss_pred HHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019445. 454 SSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTL 533 (559) Q Consensus 454 ~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~ 533 (559) +++.+ |.+.+.. .++.+. +...+|.. +++++++++.+.++.+. ...+.....+...+. T Consensus 397 d~~~k-------l~~~g~~---~~s~et----~~~~lg~~------~d~~~~~~~~~~e~~~~--~~~~~~~~~~~~~~~ 454 (480) T protein:vir:78 397 DAVSK-------LYANGQG---PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETED--MIDTLYSTTKAQADA 454 (480) T ss_pred HHHHH-------HHHhccc---cCCHHH----HHhcCCCC------HhHHHHHHHHHHHHHHH--HHHHhhccccccCCC Confidence 22222 2222211 122222 23335653 45555555432222211 111111111111111 Q ss_pred hhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 534 SEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 534 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .+.+..+.. ..-...+.+|.|.++ T Consensus 455 ~~~~~~~~~--~~~~~~~~~~~~~~~ 478 (480) T protein:vir:78 455 TPKPTVTET--KTETQTSPSGFNRTK 478 (480) T ss_pred CCCCCCCCC--CCccccccCCCCccc Confidence 111111111 011223334444444 No 67 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.14 E-value=1.4e-09 Score=69.18 Aligned_cols=433 Identities=10% Similarity=0.042 Sum_probs=198.8 Q ss_pred CC---hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 
1 MA---ETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~---~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |. +.+.+.+.+..+..+. ...++++.+.+|..-.- ..... .....+.+.++..+.+..+++..++-|++- T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~---~~~~~~~~l~~Yy~g~~-~i~~~-~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~-- 91 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNET---VLKPRYRENMKLYLGKH-KILTA-PEKETGADNRIVVNSAKYVVDVYNGYFCGI-- 91 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHH---hhHHHHHHHHHHhcccc-ccccC-cccccCCcceeecchHHHHHHHHhhhhccC-- Confidence 33 2233445554444433 33445556666654211 00111 111122345676777888888777754321 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) | +++...++. .. ...+.+.+.+.+|.....++.++..+||.+.+++..+...-+++..++..+ T Consensus 92 p-----~~~~~~~d~-~~-----------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~ 154 (470) T protein:vir:99 92 E-----PKLALLNDS-SK-----------IDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNH 154 (470) T ss_pred C-----eeEeeCCch-hH-----------HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccce Confidence 1 122332221 01 112344566789999999999999999999998877766557888899999 Q ss_pred EEEeeCCCCC--EEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 158 YYLANSPRGS--VDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 158 ~~v~~d~~G~--vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) +++-.|..+. +...+|.+... .+ +... .+-.+|-... 
.+ T Consensus 155 ~~~i~d~~~~~~~~~~vr~~~~~-----------------------~~-~~~~-~~~~~~~~~~--------------~~ 195 (470) T protein:vir:99 155 AFIIYDDTVQRQPLAFVHYQIDN-----------------------SN-NWTD-AYGVIQYADK--------------FY 195 (470) T ss_pred eEEEEcCCCCcceEEEEEEEEEe-----------------------cC-CeeE-EEEEEEecCe--------------EE Confidence 8888876543 34444433311 01 1111 1111221110 01 Q ss_pred EEEecC-CC-ceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC Q lcl|NC_019445. 236 YYEVGG-DN-DKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL 311 (559) Q Consensus 236 ~~~~~~-~~-~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~ 311 (559) ++...+ .. .... .+-+|..+|++.++ ++.+|+|. .+..++.+..++.+.-.++..++...+|.+++.+.. T Consensus 196 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd-~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~ 269 (470) T protein:vir:99 196 KFKGYDIEEDTNAAGYAINPYGLVPAVEFF-----ENEERQGI-FDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFK 269 (470) T ss_pred EEEecccccccccccccccCCCccceEeec-----CCCCCCcc-hHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 111111 11 0011 12356678877654 34689996 888999999999999999999999999998876532 Q ss_pred ccc------cceecCCceeecCCcC--CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHH Q lcl|NC_019445. 312 KNQ------RASLLPGDITYIDQIT--GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEA 383 (559) Q Consensus 312 ~~~------~~~~~pg~~~~~~~~~--~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~E 383 (559) ... ...+..++++..+... ....+..+ +.+.+...+...++.+.+.|-..-... .......+...|+.. 
T Consensus 270 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~~~~n~Sg~A 346 (470) T protein:vir:99 270 LPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFI-AKPDADQMQENLIQHLTDFIFMMAMVP--NIQDKNFAGNSSGVA 346 (470) T ss_pred cccccccchhhhhhhcceeeecCCCCCCCCcceEE-eecCChHHHHHHHHHHHHHHHHHhCCc--cccccccccCchHHH Confidence 111 1123344444443322 11222222 122344555556666666664443321 111111123456665 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019445. 384 VIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QAQKSIGLSSLASTVN 461 (559) Q Consensus 384 i~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a~r~~~~~~l~~~~~ 461 (559) +..+..-+.. ......+.-.+.+.-+++-++.++...+..+ ....++++.+.-++. .+..+ + T Consensus 347 i~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-----~~~~~i~v~f~~~~p~~~~e~a----------~ 410 (470) T protein:vir:99 347 LQYKLFAMKN-KADSKERKFDKSLMQLYRIVLATLFNNKQDQ-----ELWSELDFKFTRNLPEDMASAI----------D 410 (470) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcc-----cccccceEEeCCCCCcCHHHHH----------H Confidence 5543332222 1111122222223333333333333333221 112346666644332 12111 1 Q ss_pred HHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCCh Q lcl|NC_019445. 462 FIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDP 541 (559) Q Consensus 462 ~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~ 541 (559) .+..++++ +....++..+ -++ -.++|++++.++++.+++.++.+.. .. ...+.... T Consensus 411 ~~~kl~gi-------is~et~l~~l---~~v-----d~~~E~eri~~E~~~~~~~~~~~~~---~~----d~~~~d~~-- 466 (470) T protein:vir:99 411 NAKNAEGI-------VSKKTQLGMI---PDI-----EPDAEMKQIAKEKADAIKQTQQLSM---PI----DILKRDNN-- 466 (470) T ss_pred HHHHHhcc-------CCHHHHHHhC---CCC-----CHHHHHHHHHHHHHHHHHHHHhhcC---CC----CcCCCCCC-- Confidence 12222221 2222333321 123 2346677666554433322211100 00 00011111 Q ss_pred hHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
542 SVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 542 ~~~~~~~~~~~~~~~~~~ 559 (559) ++.| T Consensus 467 --------------~ee~ 470 (470) T protein:vir:99 467 --------------AEEE 470 (470) T ss_pred --------------ccCC Confidence 1111 No 68 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.13 E-value=1.4e-09 Score=69.08 Aligned_cols=451 Identities=12% Similarity=0.085 Sum_probs=207.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCC-CCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSR-FLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~-~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.+. .+.+.+..+..+..| .++++++.+|....... ..........+...++..+.+..+++..++-|++- | T Consensus 37 ~~~~-~~~i~~~i~~~~~~~---~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~--p- 109 (501) T protein:vir:96 37 MVNN-WELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGN--P- 109 (501) T ss_pred cCCh-HHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhccc--C- Confidence 3332 222333333333332 34556666665432110 01111112223345677888888888887755421 1 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) +++...+.. +... 
+.+.+...+...+|.....++.++..+||.|.+++..+....+++..++..+.+ T Consensus 110 ----~~~~~~~~~--~~~~-------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~ 176 (501) T protein:vir:96 110 ----IRVEYDDND--DNSQ-------NDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETF 176 (501) T ss_pred ----eeEeeCCcc--chhH-------HHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeE Confidence 123333322 1112 334455566778999999999999999999999988876666888889998888 Q ss_pred EeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 160 LANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 160 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) +-.|. .+++...+|.+..... ......++||+ +.. .+++ T Consensus 177 ~v~d~~~~~~~~~~v~~~~~~~~---------------------~~~~~~~~vyt---~~~---------------i~~~ 217 (501) T protein:vir:96 177 VIYDNSLEDNSIAAVRYYNRGTL---------------------QSAKDVVEIYT---DEH---------------IYTL 217 (501) T ss_pred EEEcCCCCCceEEEEEEEEeecC---------------------CCcEEEEEEEc---CCc---------------EEEE Confidence 77775 3666666655432100 01112233221 110 1122 Q ss_pred EecCCCceee-eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--- Q lcl|NC_019445. 238 EVGGDNDKLL-RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--- 313 (559) Q Consensus 238 ~~~~~~~~il-~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--- 313 (559) ..+++...+. ..-+|..+|++.++ ++..|+|. .+..++.+..++.+.-.+...++...+|.+++.+.... 
T Consensus 218 ~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd-~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~ 291 (501) T protein:vir:96 218 DASDDFNEISVTTHAFGTVPITEYL-----NNIDGIGD-YETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKG 291 (501) T ss_pred eeCCCceeccccccCCCccceEEec-----CCccCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcc Confidence 2222111111 12246678877653 45679995 88899999999999999999999999998877543211 Q ss_pred -ccceecCCceeecCCcCCc----hhhhhhh-hccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 314 -QRASLLPGDITYIDQITGQ----DGFRPAY-LVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 314 -~~~~~~pg~~~~~~~~~~~----~~~~p~~-~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) ...+....+.+.+...+.. ....+-+ +...+...+...+..+++.|...-... .......+...|+..+... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~Al~~~ 369 (501) T protein:vir:96 292 MQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTP--DMSDTNFSGNTSGEALKYK 369 (501) T ss_pred cchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCc--ccCcccccccchHHHHHHH Confidence 1122334444444322111 1111111 112234444555666666654433221 1111111234466555433 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) ..-+. .......+.-.+.+.-+++-++.++...+.... 
.....+++++.-++..- +...++.+..++ T Consensus 370 ~~~l~-~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~d~~~i~i~f~~~~p~n--------~~e~ad~~~kl~ 436 (501) T protein:vir:96 370 LFGLD-QDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKD----FDESLLKITFTPNLPKS--------LNEQVSILTGLG 436 (501) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----cccccceEEeCCCCCcC--------HHHHHHHHHHHh Confidence 22222 222222233233333333334444444332221 22234677765544311 111122333333 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh-HHH Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS-VLS 545 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~-~~~ 545 (559) ++ |..+.++..+ -+++ -.++|++++.+++++........+........-+...+..++..+ .-+ T Consensus 437 g~-------iS~et~~~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 437 GQ-------VSQETALSLS---GLVE----SPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred cc-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 22 3323333322 1222 135666666555443222111111111111111111111111110 000 No 69 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.13 E-value=1.6e-09 Score=68.85 Aligned_cols=449 Identities=11% Similarity=0.039 Sum_probs=185.4 Q ss_pred CChh-------hHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAET-------TKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~-------~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) |+.+ +.+++.++...+...+.+...+++++|+=- +|.. +.. .+...+..+...+-+..+++.++.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~----~~~-~~~~~~~~~~~~n~~~~ivd~~~~~ 75 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAV----GVT-VPQQMQKLLAHVGYPRLYIDAIAAR 75 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc----ccc-cchhHHhhhhhcCcHHHHHHHHHhh Confidence 5443 244455545454444444444444444221 1111 111 1111222234456666777777665 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce---- Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI---- 147 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~---- 147 (559) | ++.+ |+ .++.+. . .+.+.+...+.+|.....++.++..+||.|.++|..+.... T Consensus 76 l----~~~g---~~--~~~~~~-----~-------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~ 134 (484) T protein:vir:77 76 Q----ELEG---FR--LGGADK-----A-------DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGV 134 (484) T ss_pred h----ccCc---ee--cCCcch-----h-------HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCccccc Confidence 4 2322 22 222211 1 12234455678999999999999999999999887654321 Q ss_pred ----EEEEEeeccEEEEeeCC-CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccc Q lcl|NC_019445. 148 ----IRTMPFPIGSYYLANSP-RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDT 222 (559) Q Consensus 148 ----~~~~~~~l~~~~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~ 222 (559) .++++++..+.++..|+ .+++...+|.+.-. .......+++| .+.. T Consensus 135 ~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~----------------------~~~~~~~~~~y---~~~~---- 185 (484) T protein:vir:77 135 DPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDE----------------------EGNEVIGATLY---LPNN---- 185 (484) T ss_pred ccccceEEEeccceeEEEecCCCCceEEEEEEEEee----------------------cCCcEEEEEEE---ecCe---- Confidence 24566777777766664 45665555543210 01111122222 1111 Q ss_pred cccccccccEEEEEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
223 SKLDSKNKPFKSVYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDK 299 (559) Q Consensus 223 ~~~~~~~~~~~sv~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~ 299 (559) .+++......-... .+-+|..+|++.+..+...++.+|+|- ..+ ..+-+..++...-.++..++. T Consensus 186 -----------~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~-i~~~v~~L~Da~~~~~s~~~~~~~~ 253 (484) T protein:vir:77 186 -----------TVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTE-ITPELRSVTDAAARTLMLMQATAEL 253 (484) T ss_pred -----------EEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCccc-chHHHHHHHHHHHHHHHHHHHHHHh Confidence 01111111111111 124678899999998888899999994 543 446677888888888888998 Q ss_pred HhcCceeecC----CCcc------ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-ch- Q lcl|NC_019445. 300 ATNPPMVAPT----SLKN------QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DL- 367 (559) Q Consensus 300 ~~~p~~~~p~----~~~~------~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl- 367 (559) .+.|.+.+-+ +... ..+...+|.++..+..+ ..+..+. ...+. ..+..++.-|...... ++ T Consensus 254 ~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~q~~--~~~~e---~~~~~l~~~i~~~s~~~~~p 326 (484) T protein:vir:77 254 MGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFEDHE--SKAQQFS--AAELR---NFVDALDALDRKAAAYTGLP 326 (484) T ss_pred hhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCCCC--ceeEeec--CCChH---HHHHHHHHHHHHHhcccCCC Confidence 8888755421 1100 01223344443332211 1111110 11222 2334444444333210 10 Q ss_pred hhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHH Q lcl|NC_019445. 368 FMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQA 447 (559) Q Consensus 368 ~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a 447 (559) ...+.....-.-++.-+.....-+.. ...+.+. .+.+-+.+++.++....-....+.+. ..+++++.-++. 
T Consensus 327 ~~~fg~~~~n~~Sg~Al~~~~~~l~~----ka~~k~~-~f~~~l~~~~~l~~~~~~~~~~~~~~--~~i~v~w~~~~~-- 397 (484) T protein:vir:77 327 PYYLSFSSENPASAEAIRSSESRLVK----TVERKNK-IFGGAWEQAMRVAYKVMNGGDIPPEY--YRMESIWRDPST-- 397 (484) T ss_pred HHHhccccCcchHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHhCCCCccccc--ccceEEecCCCC-- Confidence 01111110001133333322211111 1122222 23333444555544422112222222 245666644332 Q ss_pred HHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHH-HHHHHHHH-H Q lcl|NC_019445. 448 QKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQ-QQMMAMGM-A 525 (559) Q Consensus 448 ~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~-~~~~~~~~-~ 525 (559) .++.+.+..+..|++.+..+ ++ -+.+...+|+ ++++++++++.++++... ++.+..+. . T Consensus 398 ------~s~~~~ad~~~kl~~~g~gi---~s----~et~~~~l~~------~~~~~~e~~~~~~ee~~~~~~~~~~~~~~ 458 (484) T protein:vir:77 398 ------PTYAAKADAATKLYNNGQGV---IP----KERARIDMGY------SITEREEMRKWDEEEQAQGLGLMGTMFGT 458 (484) T ss_pred ------CCHHHHHHHHHHHHhccCCC---CC----HHHHHhcCCC------ChhHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 11122223344444443222 22 1223344454 233444444333322211 11111111 1 Q ss_pred HHHHHhhhhhhcCCChhHHHHHHHHhhcCCC Q lcl|NC_019445. 526 AAQGAKTLSEAKTSDPSVLSAMANAVSGQGG 556 (559) Q Consensus 526 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 556 (559) ..+.++.. . .++.-++......+..| T Consensus 459 ~~~~~~~~---~--~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 459 DPSGGGNP---D--NPETPEPQPNPAEEAAA 484 (484) T ss_pred cccCCCCC---C--CCCcccccCCCccccCC Confidence 11111110 0 01100111111111111 No 70 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.12 E-value=1.6e-09 Score=68.81 Aligned_cols=440 Identities=9% Similarity=0.014 Sum_probs=204.4 Q ss_pred CChhhHHHHHHHHHHHHHH-hhhHHHHHHHHHHHh--ccccCCCC-CC--CCCCcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESE-RQSFEPHWRELSDYI--NPRGSRFL-TS--EVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~-R~~~~~~w~e~~~~~--~P~~~~~~-~~--~~~~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) +.. ...+.+..+.+... |-+...+++++++-- ++.+-... .+ ...+..+.+.++..+.+...++..++-+++ T Consensus 18 ~~~--~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g 95 (479) T protein:vir:79 18 KES--TINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVG 95 (479) T ss_pred cCC--hhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhc Confidence 221 23344444454433 444444444444321 22221110 10 111122334567777888888888776553 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEee Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFP 154 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~ 154 (559) - |+ +++..++. ++.. ...+...+|.....++.++..+||.+++++..+...-+++..++ T Consensus 96 ~--p~-----~~~~~~~~------~~~~--------~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 154 (479) T protein:vir:79 96 N--PI-----VFNADDDN------LTKL--------LNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIP 154 (479) T ss_pred C--Cc-----eeccCCHH------HHHH--------HHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEc Confidence 1 22 22333321 2222 22333478999999999999999999998887776668888899 Q ss_pred ccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeecCcccccccc Q lcl|NC_019445. 155 IGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM------HSVYPNIDRDTSKLD 226 (559) Q Consensus 155 l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~------~~v~p~~~~~~~~~~ 226 (559) ..+++.-.|. .+++...+|.|...-. ++.....+++| +.+... . ... 
T Consensus 155 p~~~~~v~d~~~~~~~~~~ir~y~~~~~--------------------~~~~~~~~e~y~~~~i~~~~~~~---~--~~~ 209 (479) T protein:vir:79 155 AEEAIPIWDSKRQRELVAFIRFYYIEDI--------------------DGNKIKRVEYYTENDITYFIERG---N--SFI 209 (479) T ss_pred cceeEEEEeCCCCCceEEEEEEEEEeec--------------------CCceEEEEEEEeCCcEEEEEecC---C--ccc Confidence 8888777664 4556665555443210 00111122222 111100 0 000 Q ss_pred cccccEEEEE--EEe-cCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019445. 227 SKNKPFKSVY--YEV-GGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) Q Consensus 227 ~~~~~~~sv~--~~~-~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p 303 (559) ....+.... ... ......-...-+|..+|++.++- +.+|+|. .+...+-+..++.+.-......+...+| T Consensus 210 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd-~~~v~~liDa~d~~~S~~~~~~~~~~~~ 282 (479) T protein:vir:79 210 -QEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSD-LTFYKSLIDIYDNNISTLADNLDEIQEV 282 (479) T ss_pred -ccccccccccccccccccccccccccCCCcccEEEecC-----CCCCCcc-hhhhHHHHHHHHHHHHHHHHHHHHhhCc Confidence 000000000 000 00000011223567888887654 4679996 7888888999999998999999999999 Q ss_pred ceeecCCC-cc-cc--ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCc Q lcl|NC_019445. 304 PMVAPTSL-KN-QR--ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSM 379 (559) Q Consensus 304 ~~~~p~~~-~~-~~--~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~ 379 (559) -+++.+.. .. .. -.+..++++.++..++...+. .+.+...+...++.++..|....+..-+. ....... 
T Consensus 283 ~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~----~~~~~~~~~~~~~~l~~~i~~~s~~p~~~---~~~~gn~ 355 (479) T protein:vir:79 283 IYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLE----INIPVEAKKELLDRLEKNIIIFGQGVNPE---SQNTGDK 355 (479) T ss_pred eeeeecCCccccccchhhhhhccceecCCCCcceEEe----ccCCHHHHHHHHHHHHHHHHHHhCccccc---cccccch Confidence 88776421 11 11 223455555554333222222 22345666677777777776555432121 1122344 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 380 PVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAST 459 (559) Q Consensus 380 TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~ 459 (559) |++.+..+..-+. .......+.-.+.+.-+++-+..++...+.. .....+++|.+.-.+..- ... . T Consensus 356 Sg~Ai~~~~~~l~-~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-----~~~~~~i~i~f~~~~p~~-~~~-------~ 421 (479) T protein:vir:79 356 SGVALKFLYSLLD-LKCSKTEKKFKKAIRELLWFVCEYLKISGNK-----SYDYKTVQITFNHSMIIN-EAE-------K 421 (479) T ss_pred hHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-----ccccccceEEeCCCCCcC-HHH-------H Confidence 6655544322222 1222222333333333333333333333321 223345667665544311 111 1 Q ss_pred HHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 460 VNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 460 ~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) ++.++.+++ .+....++. .++ ++ -.++|++++.+++.++.+..+.. ....+... T Consensus 422 a~~~~kl~g-------~iS~et~l~----~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~----------~~~~~~~~ 476 (479) T protein:vir:79 422 IDMAAKSTG-------IVSDETIVS----NHPWVE----DVNDELERLKKQEDTQKEYDDLI----------PNNQDGVI 476 (479) T ss_pred HHHHHHHhc-------cCcHHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHhcc----------CcccCCCc Confidence 222223322 133333333 232 21 13566666655544333222111 11000010 Q ss_pred CChh Q lcl|NC_019445. 
539 SDPS 542 (559) Q Consensus 539 ~~~~ 542 (559) +.+ T Consensus 477 -~e~ 479 (479) T protein:vir:79 477 -DET 479 (479) T ss_pred -CcC Confidence 001 No 71 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.10 E-value=2.2e-09 Score=68.00 Aligned_cols=438 Identities=12% Similarity=0.060 Sum_probs=201.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCC-CCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFL-TSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~-~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) ..+...+.|.+.++..+. |-+...++.++|+-- +..+.... ......-...+.++..+.+...++..++-|++ T Consensus 24 ~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--- 99 (474) T protein:vir:94 24 QFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--- 99 (474) T ss_pred cccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--- Confidence 333444555555555543 445556666665421 11111111 11111112334567788888888888876643 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) .| +.+...|++ ... 
.+...+ ..||...+.++.++..+||.|.+++..+....+++..++..+ T Consensus 100 ---~p-~~~~~~d~~------~~~-------~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~ 161 (474) T protein:vir:94 100 ---KP-VTYSCEDEN------VLK-------VIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQ 161 (474) T ss_pred ---CC-ceeccCcHH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccc Confidence 22 223443322 111 122222 468889999999999999999998887766668888899888 Q ss_pred EEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEE---eecCcccccccccccccE Q lcl|NC_019445. 158 YYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSV---YPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 158 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v---~p~~~~~~~~~~~~~~~~ 232 (559) .++-.|. .+++..++|.++.. + ...+++|+-- +.+.+. +. + T Consensus 162 ~~~v~d~~~~~~~~~~ir~~~~~------------------------~-~~~~~~yt~~~~~~y~~~~--~~-------~ 207 (474) T protein:vir:94 162 AIPIWVDKEREELKSFIRYYKFN------------------------N-EEKVEFWTDTTVTYYVLEN--GG-------L 207 (474) T ss_pred eEEEEcCCCCCceEEEEEEEEec------------------------C-eEEEEEEeCCeEEEEEEcC--Cc-------c Confidence 8887764 57777777765421 0 1123332110 000000 00 0 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~ 312 (559) ...+.........-...-+|..+|++.++. +.+|+|. .....+.+..+|.+.-..+..++...+|.+++.+... 
T Consensus 208 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~ 281 (474) T protein:vir:94 208 IPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSD-IWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEG 281 (474) T ss_pred ccccccCcCcccccccccCCCccceEEecC-----CcCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 000000000000000123567788887654 4689996 8889999999999999999999999999888764211 Q ss_pred c--cc--ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 313 N--QR--ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 313 ~--~~--~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) . .. -.+..++++.++..++ +..+. ...+...+...++.++..|-..-+.. .......+...|+..+..+- T Consensus 282 ~~~~~~~~~~~~~~~i~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~Al~~~~ 355 (474) T protein:vir:94 282 EDLEEFMRGLKYYKAINVDGDGG---VETIQ-VEVPVSSTKEYIDLMRVYIMEFGQGV--DFQTDKFGSAPSGIALKFLY 355 (474) T ss_pred ccchhhhhhhhccceeeccCCCc---eeEEe-ecCCHHHHHHHHHHHHHHHHHHhCcc--ccCccccccccHHHHHHHHH Confidence 1 01 1223344444433222 22221 12345556666777776665544321 11111112334554433222 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQ 468 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~ 468 (559) ..+.. ...+. ...+.+.+.+++.++.+..-+. .+ ...|++.+.-.+.. . .. ..++.+... 
+ T Consensus 356 ~~l~~----k~~~k-~~~~~~~l~~~~~li~~~~~~~---~d--~~~i~v~f~~~~p~-~---~~----e~a~~~~~~-g 416 (474) T protein:vir:94 356 GNLDL----KANKL-KNKATVAIQELISFIIDFNNLK---TD--VKDIEISFNFNRMM-N---DA----EQSQIIAQS-Q 416 (474) T ss_pred HHHHH----HHHHH-HHHHHHHHHHHHHHHHHHhCCC---cc--cceeeEEeccCccc-C---HH----HHHHHHHHc-C Confidence 21111 11222 2234455555555554422111 11 22355555322211 0 11 111222221 1 Q ss_pred cChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhhhhcCCChhHHH Q lcl|NC_019445. 469 AKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-GAKTLSEAKTSDPSVLS 545 (559) Q Consensus 469 ~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~a~~~~~~~~~~~~~~~ 545 (559) .+....++..+ -+++ -.++|++++.++++++++.. ..... .+....+....++.-.+ T Consensus 417 -------~iS~et~l~~l---~~v~----D~~~E~eri~~E~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 417 -------YLSRETLVKSS---PLVD----DYKAELERIEQEQMEYNKQL------PNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred -------CCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhc------cccCCCCCCCcccCCCCcccccC Confidence 13333333322 1232 12456665555443222211 00000 00000000000000000 No 72 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.10 E-value=2.2e-09 Score=68.00 Aligned_cols=438 Identities=12% Similarity=0.060 Sum_probs=201.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCC-CCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFL-TSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~-~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) ..+...+.|.+.++..+. |-+...++.++|+-- +..+.... 
......-...+.++..+.+...++..++-|++ T Consensus 24 ~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--- 99 (474) T protein:vir:97 24 QFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--- 99 (474) T ss_pred cccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--- Confidence 333444555555555543 445556666665421 11111111 11111112334567788888888888876643 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) .| +.+...|++ ... .+...+ ..||...+.++.++..+||.|.+++..+....+++..++..+ T Consensus 100 ---~p-~~~~~~d~~------~~~-------~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~ 161 (474) T protein:vir:97 100 ---KP-VTYSCEDEN------VLK-------VIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQ 161 (474) T ss_pred ---CC-ceeccCcHH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccc Confidence 22 223443322 111 122222 468889999999999999999998887766668888899888 Q ss_pred EEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEE---eecCcccccccccccccE Q lcl|NC_019445. 158 YYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSV---YPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 158 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v---~p~~~~~~~~~~~~~~~~ 232 (559) .++-.|. .+++..++|.++.. + ...+++|+-- +.+.+. +. 
+ T Consensus 162 ~~~v~d~~~~~~~~~~ir~~~~~------------------------~-~~~~~~yt~~~~~~y~~~~--~~-------~ 207 (474) T protein:vir:97 162 AIPIWVDKEREELKSFIRYYKFN------------------------N-EEKVEFWTDTTVTYYVLEN--GG-------L 207 (474) T ss_pred eEEEEcCCCCCceEEEEEEEEec------------------------C-eEEEEEEeCCeEEEEEEcC--Cc-------c Confidence 8887764 57777777765421 0 1123332110 000000 00 0 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~ 312 (559) ...+.........-...-+|..+|++.++. +.+|+|. .....+.+..+|.+.-..+..++...+|.+++.+... T Consensus 208 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~ 281 (474) T protein:vir:97 208 IPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSD-IWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEG 281 (474) T ss_pred ccccccCcCcccccccccCCCccceEEecC-----CcCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 000000000000000123567788887654 4689996 8889999999999999999999999999888764211 Q ss_pred c--cc--ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 313 N--QR--ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 313 ~--~~--~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) . .. -.+..++++.++..++ +..+. ...+...+...++.++..|-..-+.. 
.......+...|+..+..+- T Consensus 282 ~~~~~~~~~~~~~~~i~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~Al~~~~ 355 (474) T protein:vir:97 282 EDLEEFMRGLKYYKAINVDGDGG---VETIQ-VEVPVSSTKEYIDLMRVYIMEFGQGV--DFQTDKFGSAPSGIALKFLY 355 (474) T ss_pred ccchhhhhhhhccceeeccCCCc---eeEEe-ecCCHHHHHHHHHHHHHHHHHHhCcc--ccCccccccccHHHHHHHHH Confidence 1 01 1223344444433222 22221 12345556666777776665544321 11111112334554433222 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQ 468 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~ 468 (559) ..+.. ...+. ...+.+.+.+++.++.+..-+. .+ ...|++.+.-.+.. . .. ..++.+... + T Consensus 356 ~~l~~----k~~~k-~~~~~~~l~~~~~li~~~~~~~---~d--~~~i~v~f~~~~p~-~---~~----e~a~~~~~~-g 416 (474) T protein:vir:97 356 GNLDL----KANKL-KNKATVAIQELISFIIDFNNLK---TD--VKDIEISFNFNRMM-N---DA----EQSQIIAQS-Q 416 (474) T ss_pred HHHHH----HHHHH-HHHHHHHHHHHHHHHHHHhCCC---cc--cceeeEEeccCccc-C---HH----HHHHHHHHc-C Confidence 21111 11222 2234455555555554422111 11 22355555322211 0 11 111222221 1 Q ss_pred cChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhhhhcCCChhHHH Q lcl|NC_019445. 469 AKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-GAKTLSEAKTSDPSVLS 545 (559) Q Consensus 469 ~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~a~~~~~~~~~~~~~~~ 545 (559) .+....++..+ -+++ -.++|++++.++++++++.. ..... 
.+....+....++.-.+ T Consensus 417 -------~iS~et~l~~l---~~v~----D~~~E~eri~~E~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 417 -------YLSRETLVKSS---PLVD----DYKAELERIEQEQMEYNKQL------PNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred -------CCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhc------cccCCCCCCCcccCCCCcccccC Confidence 13333333322 1232 12456665555443222211 00000 00000000000000000 No 73 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.09 E-value=2.4e-09 Score=67.79 Aligned_cols=463 Identities=11% Similarity=0.029 Sum_probs=202.9 Q ss_pred CC---------------------hhhHHHHHHHHHHHH-HHhhhHHHHHHHHHHHhccc-----cCCCCCC----CCCCc Q lcl|NC_019445. 1 MA---------------------ETTKERLNKQFAQLE-SERQSFEPHWRELSDYINPR-----GSRFLTS----EVNRN 49 (559) Q Consensus 1 M~---------------------~~~~~~l~~r~~~l~-~~R~~~~~~w~e~~~~~~P~-----~~~~~~~----~~~~~ 49 (559) |+ +............+- ..| ..+++.+.+|..-. +.+.... ..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 77 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHN---PEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDD 77 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhc---HHHHHHHHHHhccccchhhccchhccccccccccc Confidence 11 111111111222221 122 24556666665321 1110000 00111 Q ss_pred ccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHH Q lcl|NC_019445. 50 DRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGS 129 (559) Q Consensus 50 ~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~d 129 (559) .+.+.++..+-+..+++.+++-| |. 
.| ++++..|+ ++.+.++ .+...+|-....++.++ T Consensus 78 ~~~~~ri~~n~~~~ivd~~~~yl----~g--~~-~~~~~~d~------~~~~~l~--------~~~~n~~~~~~~~~~~~ 136 (503) T protein:vir:59 78 TKTNNRTSHAWHKLFVDQKTQYL----VG--EP-VTFTSDNK------TLLEYVN--------ELADDDFDDILNETVKN 136 (503) T ss_pred ccccceeecchHHHHHHHHHhhh----hc--CC-eeeccCcH------HHHHHHH--------HHHhcCHHHHHHHHHHH Confidence 12234566667777777777744 32 22 22343332 1222221 22246888899999999 Q ss_pred HHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCce Q lcl|NC_019445. 130 LGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKW 207 (559) Q Consensus 130 l~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~ 207 (559) ..+||.+++++..|...-+++..++..+++.-.|. .+++..++|.++.. +. .+..... T Consensus 137 ~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~----------~~----------~~~~~~~ 196 (503) T protein:vir:59 137 MSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYK----------GI----------MGEETQK 196 (503) T ss_pred HhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEe----------cC----------CCceEEE Confidence 99999999998887666688889999888877765 46676666655421 00 0011112 Q ss_pred EEEEEEEee---cCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHH Q lcl|NC_019445. 208 IEVMHSVYP---NIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVK 284 (559) Q Consensus 208 v~v~~~v~p---~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~ 284 (559) +++|+.-.- .........+..........+.. .-...-+|..+|++.++- +.+|.|. ...+.+.+. 
T Consensus 197 ~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~vPiv~~~n-----n~~~~sd-~~~~~~liD 265 (503) T protein:vir:59 197 AELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMT-----KGGQAIGWGRVPIIPFKN-----NEEMVSD-LKFYKDLID 265 (503) T ss_pred EEEEeCCcEEEEEEcCCccccccccccccccccee-----ecceeccCCccceEEecC-----CCCCCcc-hhhhHHHHH Confidence 222211000 00000000000000000000000 001123566788877653 4578996 788999999 Q ss_pred HHHHHHHHHHHHHHHHhcCceeecCC-Ccc---ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHH Q lcl|NC_019445. 285 ALQLLQKRKSQLIDKATNPPMVAPTS-LKN---QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIIN 360 (559) Q Consensus 285 ~L~~l~~~~~~~~~~~~~p~~~~p~~-~~~---~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~ 360 (559) .++.+.-..+..++...+|.+++.+. +.. ...++..++++..+..++ ++.+. .+.+...+...++.++..|. T Consensus 266 a~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~i~ 341 (503) T protein:vir:59 266 NYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGG---VDTLR-AEIPVDSAAKELERIQDELY 341 (503) T ss_pred HHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCc---ceeEe-ccCCHHHHHHHHHHHHHHHH Confidence 99999999999999999998877542 111 112344555554443222 33322 23345556666777777775 Q ss_pred HHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe Q lcl|NC_019445. 361 SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY 440 (559) Q Consensus 361 ~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~ 440 (559) ..-+.. .......+...|+..+..+..-.... .....+.-.+.|.-+++.++.++...+.... 
....+|++++ T Consensus 342 ~~s~~p--~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~----~~~~~i~i~f 414 (503) T protein:vir:59 342 KSAQAV--DNSPETIGGGATGPALENLYALLDLK-ANMAERKIRAGLRLFFWFFAEYLRNTGKGDF----NPDKELTMTF 414 (503) T ss_pred HHhccc--CCCcccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCccc----ccccceeEEe Confidence 554321 11111223456777665543333322 2223333344444444444444444332111 1123577777 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMM 520 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~ 520 (559) .-++..- .... ++.+..+.+.+ .+.-..++..+ -+++ -.++|++++.+++++.+++.+.. T Consensus 415 ~~~~p~d-~~~~-------~~~~~kl~~~G-----iiS~et~l~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~ 474 (503) T protein:vir:59 415 TRTRIQN-DSEI-------VQSLVQGVTGG-----IMSKETAVARN---PFVQ----DPEEELARIEEEMNQYAEMQGNL 474 (503) T ss_pred CCCCCCC-HHHH-------HHHHHHHHhCC-----CCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhhccc Confidence 6655321 1112 22222222111 12222233221 1221 12466666655433222211110 Q ss_pred HHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 521 AMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 521 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) . ....+...+.... 
+...+ ...++.|++- T Consensus 475 --~---~~~~~~~~~~~~~-~~~~~----~~~~~~g~~~ 503 (503) T protein:vir:59 475 --L---DDEGGDDDLEEDD-PNAGA----AESGGAGQVS 503 (503) T ss_pred --c---CccCCCCCCCcCC-CCCCc----ccCCCCCCcC Confidence 0 0000111111100 00000 0111111111 No 74 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.08 E-value=2.6e-09 Score=67.65 Aligned_cols=447 Identities=13% Similarity=0.088 Sum_probs=193.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.... +.|...++.+... .++...+.+|..-..- +..+. ..+.+.+..++..+-+..+|+.+++.| ++. T Consensus 1 ~~t~~-d~i~~L~~~~~~~----~~r~~~~~~Yy~G~~~i~~~~~-~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~ 70 (480) T protein:vir:78 1 MTTYH-EHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGI-GAPPELAYLDVQPGWVATYLRTLSDRL----DIE 70 (480) T ss_pred CCCHH-HHHHHHHHHHHHH----HHHHHHHHHHHhccccchhccc-ccchhhhhhhhhcchHHHHHHHHHhhh----ccC Confidence 77643 4455555555443 3444555555432110 00111 112222334456777888888888765 333 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee------cCCceEEEEEe Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE------DDEDIIRTMPF 153 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~------~~~~~~~~~~~ 153 (559) + |... .|.+. .+.+...+.+++|.....++.++..+||.|.++|.. 
|.....++..+ T Consensus 71 g---~~~~-~d~~~-------------~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~ 133 (480) T protein:vir:78 71 G---FRIS-EDSEG-------------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred c---eecC-CCchh-------------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEE Confidence 2 2222 12111 122344556788999999999999999999888763 33344678889 Q ss_pred eccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|+ .+++...+|.+.-. + .......+++|+ +... T Consensus 134 ~p~~~~~i~D~~~~~~~~~~i~~~~~~----------d-----------~~~~~~~~~~y~---~~~~------------ 177 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR----------D-----------DVAVPDRATLYL---PDET------------ 177 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee----------c-----------CCcceEEEEEEe---CCeE------------ Confidence 88888888886 45565555543210 0 001112233321 1100 Q ss_pred EEEEEEE-ecCCC--cee---eeecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 232 FKSVYYE-VGGDN--DKL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 232 ~~sv~~~-~~~~~--~~i---l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) +|+. .++.. ... ..+-+|..+|++++..+...+..||+|- ..+ ..+-+..++...-.+...++..+.|. 
T Consensus 178 ---~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sd-i~~~i~~l~Da~~~~~s~~~~~~~~~a~p~ 253 (480) T protein:vir:78 178 ---VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTPL 253 (480) T ss_pred ---EEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccc-hhHHHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 1111 11000 000 1134678899999999988899999995 554 45778888888888888899888987 Q ss_pred eeecCCC--------ccccceecCCceeecCCcCCchhhhhhhhcc-ccHHHHHHHHHHHHHHHHHHhhc-chh-hhccC Q lcl|NC_019445. 305 MVAPTSL--------KNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFV-DLF-MMLQN 373 (559) Q Consensus 305 ~~~p~~~--------~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~-dl~-~~~~~ 373 (559) +.+.+.. ....+...+|.+..... ++ ..+ .+.+ .++... ++.++.-|...+.. ++. ..++. T Consensus 254 ~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~---~~~~~~~~~~~---~~~l~~~i~~~~~~~~~p~~~fg~ 325 (480) T protein:vir:78 254 RVISGVTTDELTNDGENTTLDIYYGRILTLAS-EA-AKI---SEFKAAELRNF---AEEMEVFRKEAASITGLPPQYLSS 325 (480) T ss_pred hhhhCCCccccccccccchhhhhhhhhccCCC-CC-ceE---EecCccCHHHH---HHHHHHHHHHHhcccCCCHHHhcc Confidence 6653211 11112233343332221 11 111 1111 123333 33334333333211 000 01111 Q ss_pred CCCCC-cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHH Q lcl|NC_019445. 374 INTRS-MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIG 452 (559) Q Consensus 374 ~~~~~-~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~ 452 (559) .+.. -++.-+...... |=-..++.+..| .+-+.+++.++.+..--. .+. ....+++++.-+...- ..+. 
T Consensus 326 -~~~n~~Sg~Al~~~~~~----l~~k~~~~~~~f-~~~l~~~~rl~~~~~~~~-~~~--~~~~i~v~w~~~~~~s-~~~~ 395 (480) T protein:vir:78 326 -SSENPASAEAIIATDSR----IVKMAERKGRIF-GGAWERAMRIAMQIMGRE-VTE--EYTRLETVWRDPSTPT-VAAK 395 (480) T ss_pred -ccCchhHHHHHHHHHHH----HHHHHHHHHHHH-HHHHHHHHHHHHHHcCCC-ccc--cceeeeEEecCCCCCC-HHHH Confidence 1111 123222222222 222233444433 444555666655431111 111 2234667664432110 1112 Q ss_pred HHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019445. 453 LSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKT 532 (559) Q Consensus 453 ~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~ 532 (559) ++++. .|.+.+.. .++- +.+...+|+ ++++++++.+.+.++. +....++....+...+ T Consensus 396 ad~~~-------kl~~~g~~---~~s~----et~~~~lg~------~~d~~~e~~~~~~~~~--~~~~~~~~~~~~~~~~ 453 (480) T protein:vir:78 396 ADAVS-------KLYANGQG---PIPK----EQARIDLGY------TATQREQMRDWDKQET--EDMIDTLYSTTKAQAD 453 (480) T ss_pred HHHHH-------HHHHhccc---CCCH----HHHHhcCCC------CHhHHHHHHHHHHHHH--HHHHHHhhccccCCCc Confidence 22222 22222211 1222 122334565 3555555543222221 1111111111110000 Q ss_pred hhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 533 LSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ..+.+..+. . .......+++.. T Consensus 454 ~~~~~~~~~----~-~~~~~~~~~~~~ 475 (480) T protein:vir:78 454 ATPKPTVTE----T-KTETQTSPSGFN 475 (480) T ss_pred cccCCCCCC----C-CCccCCCcccCC Confidence 011111110 0 011111111111 No 75 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.07 E-value=1.2e-09 Score=69.46 Aligned_cols=498 Identities=10% Similarity=0.010 Sum_probs=217.0 Q ss_pred CChhh---------HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCC-CCCcccccCCCCcchHHHHHHHHHH Q lcl|NC_019445. 
1 MAETT---------KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSE-VNRNDRRNTRIIDSTGTMAARTLAS 70 (559) Q Consensus 1 M~~~~---------~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~-~~~~~~~~~~~~~s~~~~a~~~Las 70 (559) |.-.- -.....-|=....+| ....++.+.+|.. ++..... ..+|..+ ..++++.|...++++++ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~--RlaaY~ly~d~y~---n~~~el~~il~G~dr-~~~~~ps~r~~V~~~~~ 74 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKN--RVRAYDLYENIYL---NSAETLKLVLRGDDS-VPILMPSGRKIVEAVHR 74 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHH--HHHHHHHHHHhhc---CchhhhhhhcCCCce-eeeccchHHHHHHHHHH Confidence 22110 000111111111111 2233334444432 2111101 1233332 23778888888888654 Q ss_pred HHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc---- Q lcl|NC_019445. 71 GMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED---- 146 (559) Q Consensus 71 ~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~---- 146 (559) -| +....|+ ..+.+.+... +. .+++.+....++-|+.....++-.+.++.|-|++++-.|+.+ T Consensus 75 ~L-----g~~~~~~-Ve~~~~de~~----~~---avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~ 141 (563) T protein:vir:74 75 FL-----GVGFDYL-VEPDMGDEGI----RQ---SLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGE 141 (563) T ss_pred hc-----CCCcEEe-cCccccCcch----HH---HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCC Confidence 33 4344443 2322222111 11 257777778888999999999999999999999999887644 Q ss_pred eEEEEEeeccEEEEeeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHH-HHhc------CCCCceEEEEEEEeecC Q lcl|NC_019445. 147 IIRTMPFPIGSYYLANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKS-MWES------GTYEKWIEVMHSVYPNI 218 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~-~~~~------~~~~~~v~v~~~v~p~~ 218 (559) .++.+.|-++.|+-..|++. |-.+| ++ ++..|. +|++-++ .+.. -|++.. +..|+.... 
T Consensus 142 R~rv~~vDP~~~fp~~dpd~-v~g~~~v~-------v~~~~~---~pdd~~~~~~r~~~~~~~lndeg~--~~~~~~~da 208 (563) T protein:vir:74 142 RISVDEVDPRQIFLIEDGST-VVGFHMVD-------IVQDFR---SPDDPSKKLARRRTFRRVRNDEGM--FTGRISSEL 208 (563) T ss_pred CceEeecCCceeeeccCCCC-cccceeee-------cccCCC---CCcchhccceeeeeeeeeeCCCCC--ccceeeecc Confidence 58888888888887666644 44444 21 333443 2222222 1111 111111 112222221 Q ss_pred c-ccccccccccccEEEEEEEecCCCc----eeeee----cCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHH Q lcl|NC_019445. 219 D-RDTSKLDSKNKPFKSVYYEVGGDND----KLLRE----SGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLL 289 (559) Q Consensus 219 ~-~~~~~~~~~~~~~~sv~~~~~~~~~----~il~e----sg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l 289 (559) + ...+++|.+ +..++-+....++- +-+++ -.|.-.||++++=.+.++++||+|. ..+.+.-++.||.- T Consensus 209 e~w~lg~wd~r--~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~-La~ll~~~~eLn~~ 285 (563) T protein:vir:74 209 THWTLGNWDDR--GAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQ-LEGMETLAYALNQS 285 (563) T ss_pred chhcccccccc--CccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhh-HHHHHHHHHHHhhh Confidence 1 122222222 21222221111110 01111 1345678888777888999999995 88999999999876 Q ss_pred HHHHHHHHHHHhcCceeec----CCC---ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 290 QKRKSQLIDKATNPPMVAP----TSL---KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSA 362 (559) Q Consensus 290 ~~~~~~~~~~~~~p~~~~p----~~~---~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~a 362 (559) -...-..+...=+|..... .|. .....++.||.++..+.+.....+--+. 
+.++++.++.-|.++..| + T Consensus 286 ~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~-g~~~l~~~q~Hm~~l~er---a 361 (563) T protein:vir:74 286 LTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVS-GVQDVSPFQDHMKWIDEK---G 361 (563) T ss_pred hhHHHHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCCccccceeeec-chhhhHHHHHHHHHHHHH---H Confidence 6666555555555654432 221 2223457899998887654334343332 234667777767666542 2 Q ss_pred hhcc--h----hhhccCCCCCCcCHHH-----HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCC----- Q lcl|NC_019445. 363 YFVD--L----FMMLQNINTRSMPVEA-----VIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPP----- 426 (559) Q Consensus 363 f~~d--l----~~~~~~~~~~~~TA~E-----i~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~----- 426 (559) .+.- + |+++.....+.=.|=| +-.+.+|++..|=.++-++-.+...=++ .++.-++..|..|. T Consensus 362 l~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL-~~~erl~~~g~~~~~~g~~ 440 (563) T protein:vir:74 362 IAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWL-PAYESDFQEQDGSRPFASA 440 (563) T ss_pred HHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHH-HHHHhHhhhhccccccccc Confidence 2110 0 0111111111111211 2233444444333333333222111111 12222334454443 Q ss_pred -CchhhCCcceEEEe--ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHH Q lcl|NC_019445. 427 -PPDAMEGMPLKVEY--ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQV 503 (559) Q Consensus 427 -~p~~l~g~~v~~~~--is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev 503 (559) +|..+ .|.+.+ ..|....+.- +.+..+.+.+ .|....+++.+.++ |.|-. 
--++|+ T Consensus 441 ~~~~~~---~v~ivf~p~~P~d~~~vv----------~~~~tl~~aG-----iiSretAv~~L~~~-g~~~p--dae~e~ 499 (563) T protein:vir:74 441 DLLNEC---SVVCIFADPMPVNKTQVT----------QDTLLLQQAH-----LILRKMAVAKLRSI-GWEYP--EVDDQG 499 (563) T ss_pred ccCCce---EEEEEeCCCCCccHHHHH----------HHHHHHHHcC-----chhHHHHHHHHHhC-CCCCC--cHHHHH Confidence 22111 234334 4565533322 2222222322 47777888888777 75411 113344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--hhcCCCh-hHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 504 DQARQQRAQQQQQQQMMAMGMAAAQGAKTLS--EAKTSDP-SVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 504 ~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~ 559 (559) +++...+-+.++.+++.+.+.-.++++++.+ +...+++ +-+...-+-..-++.--| T Consensus 500 ~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~ 558 (563) T protein:vir:74 500 NALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQ 558 (563) T ss_pred hhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCchhHcCCcccCCccccc Confidence 4444433333333332222211111111110 0000000 000111000000000000 No 76 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.07 E-value=3e-09 Score=67.32 Aligned_cols=426 Identities=8% Similarity=0.004 Sum_probs=198.2 Q ss_pred CCh---hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAE---TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~---~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.+ .+.+.|.+..+... .|.+....++++|+-.-+-+... ... 
..+.+.++..+.+...++..++-|++- T Consensus 11 ~~~~~~~~~~~i~~~i~~~~-~~~~r~~~~~~yy~g~~~i~~~~---~~~-~~~~~~ki~~n~~~~ivd~~~~~l~g~-- 83 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQ-EEVERYEYLGNMYKGIMEISSQK---AKD-SWKPDNRLTNNFAKYIVDTFVGYFNGI-- 83 (453) T ss_pred ccccccCCHHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcCC---CCC-ccCccceeecchHHHHHHHhhhhhccc-- Confidence 221 12334444444444 34455555666665432211111 111 122345677788888888888755331 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) | +++...++. ....+...+...+|.....++.++..+||.|.+++..+..+.+++..++..+ T Consensus 84 ~-----~~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 145 (453) T protein:vir:73 84 P-----IKKTHDDKS-------------VLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLN 145 (453) T ss_pred C-----ceeecCChH-------------HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccc Confidence 1 223332221 2223444566688999999999999999999998888766667777777766 Q ss_pred EEEeeC-CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 158 YYLANS-PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 158 ~~v~~d-~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) .++-.| ..++....+.++... . +....++||+. .. 
.++ T Consensus 146 ~~~v~dd~~~~~~~~~i~~~~~----------------------~-~~~~~~~vyt~----~~--------------i~~ 184 (453) T protein:vir:73 146 VFMVYDDSIKQKPLFAVYYGFD----------------------E-EGNLSGTVYTL----LE--------------TIS 184 (453) T ss_pred eEEEEeCCCCceeEEEEEEEEe----------------------c-CceEEEEEEeC----Ce--------------EEE Confidence 655554 445544444333321 0 11123333321 00 011 Q ss_pred EEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC-cc Q lcl|NC_019445. 237 YEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL-KN 313 (559) Q Consensus 237 ~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~-~~ 313 (559) +...+..-.+. .+-+|..+|++.++ ++.+|+|. .+...+-+-.++.+.-..+..++...+|.+++.+-. -. T Consensus 185 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~s~-~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~ 258 (453) T protein:vir:73 185 ITGKAGEVKFGESTYNVYSDLPIVEYN-----FNEERQSI-FEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE 258 (453) T ss_pred EEecCCceEEccceeccCCceeEEEec-----CCCCCCcc-hhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc Confidence 22111111111 12356778887653 34678885 778888888999999999999999999987764321 10 Q ss_pred cc-ceecCCceeecCC--cC------CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 314 QR-ASLLPGDITYIDQ--IT------GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 314 ~~-~~~~pg~~~~~~~--~~------~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) .. -+..++..+.... .+ ....++.+ +.+.+...+...++.++..|-...... . 
+....-...|+..+ T Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p--~-~~~~~~gn~Sg~Al 334 (453) T protein:vir:73 259 EDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFL-DKPDSDVQTENLLNRLERSIFQFTMAA--N-ISDENFGNSSGVAL 334 (453) T ss_pred hhhhcccccccccccccccccccccccCceeEEe-eecCCHHHHHHHHHHHHHHHHHHhCCc--c-cCcccccCccHHHH Confidence 00 1112222211110 00 01112211 112234455556677777665443321 1 11111133455544 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) ..+..-+. ....-..+...+.+..+++.+..++...|. +. ....+++.+.-++.. ++...++.++ T Consensus 335 ~~~~~~l~-~ka~~~~~~~~~~l~~~~~li~~~~~~~~~----~~--~~~~i~v~f~~~~p~--------~~~~~a~~~~ 399 (453) T protein:vir:73 335 AYKLQAMS-NLALSFQRKFQSALNRRYSLWSSLSTNASN----KD--AWKDIEYTFTRNEPK--------DIKEQAETAN 399 (453) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccCC----cc--ccccceEEeCCCCCC--------CHHHHHHHHH Confidence 33322111 112222222223333333333333333332 11 223466666444321 0111122223 Q ss_pred HHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019445. 465 QLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTL 533 (559) Q Consensus 465 ~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~ 533 (559) .++++ +..+.+ ...++ ++ -.++|++++++++++++.+++.. 
.+.+.-+.-++| T Consensus 400 k~~gi-------is~et~----~~~~~~~~----d~~~E~~ri~~E~~~~~~~~~~~-~~~~~~~~~~~~ 453 (453) T protein:vir:73 400 ILKGI-------TSEETA----LSVISVIP----DVQAEMEKIKKKKLLQLSLTRTS-NLVRMKQMRGNL 453 (453) T ss_pred HHhcc-------CcHHHH----HHhCCCCC----CHHHHHHHHHHHHHHHHHHHHhc-cCCcchhhhcCC Confidence 33222 222222 23333 22 24677787777665554433322 112222222333 No 77 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.07 E-value=3.2e-09 Score=67.19 Aligned_cols=432 Identities=12% Similarity=0.076 Sum_probs=198.0 Q ss_pred CC---hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----cCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MA---ETTKERLNKQFAQLESERQSFEPHWRELSDYINPR-----GSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~---~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) +. +.+.+.|.+..+..+. ...+++.+.+|..-. +.+. .........+.+.++..+.+..+++..++- T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~----~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y 113 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 113 (492) T ss_pred CCCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhh Confidence 11 1122333333344333 234555666664321 1000 011111112234467788888888888875 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) |++ .| +++...|+. +.. .+...+ ..+|-....++.++..+||.|.+++..+....+++. 
T Consensus 114 l~g------~p-~~~~~~d~~------~~~-------~l~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~ 172 (492) T protein:vir:97 114 IVG------KP-IAFKHTDDE------VVK-------RIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLF 172 (492) T ss_pred hcc------cC-ceeccCchH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEE Confidence 532 12 123433321 111 122222 357888899999999999999998887766668888 Q ss_pred EeeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE------EEeecCccccc Q lcl|NC_019445. 152 PFPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH------SVYPNIDRDTS 223 (559) Q Consensus 152 ~~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~------~v~p~~~~~~~ 223 (559) .++..+.++-.| ..+++...+|.+... ....+++++ .++.... T Consensus 173 ~~~p~~~~~i~d~~~~~~~~~~vr~~~~~-------------------------~~~~~~~y~~~~v~~~~~~~~~---- 223 (492) T protein:vir:97 173 RVPAEQGIPIWTDKEHEELEAFIRMYKLE-------------------------NETKVEYWDKVTVNYYVYENGS---- 223 (492) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEeec-------------------------cceeEEEEecCeEEEEEEecCe---- Confidence 899888877765 457777776665421 011233221 1110000 Q ss_pred ccccccccEEEEEEEecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 224 KLDSKNKPFKSVYYEVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 224 ~~~~~~~~~~sv~~~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) +... +........+ ...-+|..+|++.++. +.+|+|. .+..++.+..++.+.-.+....+...+ T Consensus 224 --------~~~~-~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~ 288 (492) T protein:vir:97 224 --------LIPD-YSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSNE 288 (492) T ss_pred --------eeec-ccccccccccccccCCCCCcceEEecC-----CCCCCCc-hHhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 0000 0000000000 1123567788876654 3578896 788889999999988888999999999 Q ss_pred CceeecCCC----ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
303 PPMVAPTSL----KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 303 p~~~~p~~~----~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) |.+++.+.. ....-.+...+++.++..++ ++.+ +...+...+...+..+++.|...-+.. ......-+.. T Consensus 289 ~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l-~~~~~~~~~~~~~~~L~~~I~~~s~~p--~~~~~~~~~n 362 (492) T protein:vir:97 289 LTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGG---VDTI-QVEVPVENSKKYLDELYQKIMLFGQAV--DFSSDKFGSA 362 (492) T ss_pred ceeeeecCCcccchhHHHHHhhccceecCCCCc---ceeE-eccCCHHHHHHHHHHHHHHHHHHhCCC--CCCccccccC Confidence 987765421 11111234444454443332 2322 122345555666777777665544321 1111111233 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .|+.-+.....- |--..++.+. .+...+.+++.++.+..-++. + ..++++++.-.+.+ ++.. T Consensus 363 ~Sg~Al~~~~~~----l~~ka~~~~~-~f~~~l~~~~~li~~~~~~~~---~--~~~i~v~f~~~~p~--------~~~e 424 (492) T protein:vir:97 363 PSGVALEFLYTN----LNLKADKLAR-KAKVAIQELLWFVFEHFDIKG---E--HKDVDISFNYNKVA--------NTEL 424 (492) T ss_pred cHHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCc---c--cceeeEEecCCCCC--------CHHH Confidence 444333322221 1112233322 344455556665555322221 1 23455655433221 0111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAK 537 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~ 537 (559) .++.+..++++ +.-..++ ..++ ++ -.++|++++.++++++++..+.... .......+.. 
T Consensus 425 ~a~~~~kl~G~-------iS~et~l----~~l~~v~----d~~~Eleri~~E~~~~~~~~~~~~~-----~~~~~~~~~~ 484 (492) T protein:vir:97 425 QVQTAQQSMGI-------VSHETVL----ENHPFVE----DLQAELERIEQEQTEYNKQLPNLDD-----GGADSAQQQE 484 (492) T ss_pred HHHHHHHHhcc-------CchHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHhhhcccc-----CCCCCCcccc Confidence 12223333322 2222222 2332 22 1356777776665543332221100 0000000000 Q ss_pred CCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 538 TSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 538 ~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ..++ ...| T Consensus 485 ~~~~--------------~~~e 492 (492) T protein:vir:97 485 RSNN--------------KESE 492 (492) T ss_pred cccc--------------cccC Confidence 0000 0000 No 78 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.07 E-value=3.2e-09 Score=67.17 Aligned_cols=432 Identities=11% Similarity=0.065 Sum_probs=197.3 Q ss_pred CCh---hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----cCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAE---TTKERLNKQFAQLESERQSFEPHWRELSDYINPR-----GSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~---~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) +.. .+.+.|.+..+.... ...++..+.+|..-. +.+. .........+.+.++..+.+...++..++- T Consensus 29 ~~~~~e~~~~~i~~~i~~~~~----~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~ 104 (483) T protein:vir:12 29 TNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 104 (483) T ss_pred cCCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhh Confidence 111 122333333333333 334555666664321 1000 011111112233467788888888888875 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 
72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) |++ .| +++...|+. ..+ .+...+ ..+|.....++.++..+||.|.+++..|....+++. T Consensus 105 l~G--~p-----~~~~~~d~~------~~~-------~l~~~~-~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~ 163 (483) T protein:vir:12 105 IVG--KP-----IAFKHTDDE------VVK-------RIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLF 163 (483) T ss_pred hcc--cC-----ceeccCChH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEE Confidence 542 11 223333321 111 122222 357788889999999999999998887766667888 Q ss_pred EeeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeecCccccc Q lcl|NC_019445. 152 PFPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM------HSVYPNIDRDTS 223 (559) Q Consensus 152 ~~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~------~~v~p~~~~~~~ 223 (559) .++..+.++..| ..+++...+|.+... + ...++++ +.++.... T Consensus 164 ~~~p~~~~~v~d~~~~~~~~~~ir~~~~~------------------------~-~~~~~~y~~~~v~~~~~~~~~---- 214 (483) T protein:vir:12 164 RVPAEQGIPIWTDKEHEELEAFIRMYKLE------------------------N-ETKVEYWDKVTVNYYVYENGS---- 214 (483) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEee------------------------c-ceEEEEEecCeEEEEEEeCCe---- Confidence 899999877665 457777666654421 0 1123332 11111000 Q ss_pred ccccccccEEEEEEEecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 224 KLDSKNKPFKSVYYEVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 224 ~~~~~~~~~~sv~~~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) +.. .+........+ ...-+|..+|++.++- +.+|+|. 
.+...+.+..++.+.-.+...++...+ T Consensus 215 --------~~~-~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~ 279 (483) T protein:vir:12 215 --------LIP-DYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSNE 279 (483) T ss_pred --------eee-cccccccccccccccCCCCccceEEecC-----CCCCCCc-hhhHHHHHHHHHHHHHHHHHHHHHhcC Confidence 000 00000000001 1223566788876653 4578996 788889999999999999999999999 Q ss_pred CceeecCCCccc----cceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 303 PPMVAPTSLKNQ----RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 303 p~~~~p~~~~~~----~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) |.+++.+..... .-.+..++++.++..++ +..+. ...+...+...++.+++.|...-... ......-+.. T Consensus 280 ~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n 353 (483) T protein:vir:12 280 LTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGG---VDTIQ-VEVPVENSKKYLDELYQKIMLFGQAV--DFSSDKFGSA 353 (483) T ss_pred ceeeeecCCcccchhHHHhhhhccccccCCCCc---ceEEe-ecCCHHHHHHHHHHHHHHHHHHhCCC--CCCccccccC Confidence 988775422111 11233444443433222 22221 12344555566677776665444321 1111111233 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .|+..+..+..-+. -...+.+ ..+...+.+++.++.+..-.+. ...++++++.-.+.. ++.. 
T Consensus 354 ~Sg~Al~~~~~~l~----~k~~~~~-~~f~~~l~~~~~li~~~~~~~~-----~~~~i~v~f~~~~p~--------~~~~ 415 (483) T protein:vir:12 354 PSGVALEFLYTNLN----LKADKLA-RKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVA--------NTEL 415 (483) T ss_pred cHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHHhcCCC-----ccceeeEEeCCCCCC--------CHHH Confidence 45544332221111 1122222 2344455555555554322221 123466666443321 1111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAK 537 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~ 537 (559) .++.+..++++ +.-..++. .++ ++ -.++|+++++++++++++.++... ....+..+.-.+.. T Consensus 416 ~a~~~~kl~Gi-------iS~et~~~----~~~~v~----d~~~E~~ri~~E~~~~~~~~~~~~--~~~~d~~~~~~~~~ 478 (483) T protein:vir:12 416 QVQTAQQSMGI-------VSHETVLE----NHPFVE----DLQAELERIEQEQMEYNKQLPNLD--DGGADGAQQQERSN 478 (483) T ss_pred HHHHHHHHhcc-------CchHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHhhccccc--ccccCCcccCCCCC Confidence 12223333332 23222222 222 22 135677776666544333221110 00000000000000 Q ss_pred CCChh Q lcl|NC_019445. 538 TSDPS 542 (559) Q Consensus 538 ~~~~~ 542 (559) ...++ T Consensus 479 ~~e~e 483 (483) T protein:vir:12 479 NKESE 483 (483) T ss_pred cccCC Confidence 00001 No 79 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.05 E-value=3.6e-09 Score=66.90 Aligned_cols=470 Identities=10% Similarity=0.054 Sum_probs=204.5 Q ss_pred CChhhHHHHHHHH------H-----------HHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHH Q lcl|NC_019445. 
1 MAETTKERLNKQF------A-----------QLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~~~~~l~~r~------~-----------~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~ 63 (559) |-+..+.-+.+-- . .+..++......|+.+|+=-.|..- ............++--+.+.. T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~---~~~~~~~~~~~~~~sln~~~~ 79 (508) T protein:vir:15 3 LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIH---YQASDGIKKKRLKNTINMAKT 79 (508) T ss_pred hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccc---cccCCCCccccceeecchHHH Confidence 3333322221101 1 1222333446667777655433221 111111111112223356667 Q ss_pred HHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeec Q lcl|NC_019445. 64 AARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLED 143 (559) Q Consensus 64 a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~ 143 (559) +++.+|+-+++-.. .+++++++ ...+| +.+.+...+|+..+.+++.+..++|.+++.+..| T Consensus 80 i~~~~A~lv~~e~~-------~i~v~~~~-----~~~e~-------l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d 140 (508) T protein:vir:15 80 AARRIASVVFNEKA-------EIHVKDNN-----EADKF-------LNDVLEDNDFKNKFEEALEKGVALGGFAMRPYID 140 (508) T ss_pred HHHHHHhhhhCCCc-------eEEeCCch-----HHHHH-------HHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEe Confidence 77777764432211 12233322 22233 3446777999999999999999999999977766 Q ss_pred CCceEEEEEeeccEEE-EeeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccc Q lcl|NC_019445. 144 DEDIIRTMPFPIGSYY-LANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRD 221 (559) Q Consensus 144 ~~~~~~~~~~~l~~~~-v~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~ 221 (559) .+ .+++..++...++ +..|..+....+| +++..+-.. 
..++- -.| ...+..+ +..+.|.+.+|...+.+ T Consensus 141 ~~-~~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~-~~~~y-t~l-----E~h~~~~-~~~~~I~n~ly~~~~~~ 211 (508) T protein:vir:15 141 GN-HIKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESN-QTKYY-TLL-----EFHQWQD-NGSYQITNELYKSDSPD 211 (508) T ss_pred CC-eeEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCC-CceEE-EEE-----EEEEEec-CcceEEEEEEEecCCch Confidence 54 4567778988877 4566544433333 333211000 00000 000 0000000 01122222233221100 Q ss_pred ccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEee----ecCCCcccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 222 TSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWE----VNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLI 297 (559) Q Consensus 222 ~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~----~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~ 297 (559) . -+...|...+.--.+-.+. ..-.|...-||..++.. ...+++||+|. -.++.+.+..||..--+..... T Consensus 212 --~-lG~~v~l~~~~e~~~l~~~--~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~-~~~~~~lid~lD~~~s~~~~e~ 285 (508) T protein:vir:15 212 --I-VGNQVPLSTLPVYKELAPQ--VTISGLQRPLFAYFKTPGANNINIESPLGLGV-VDNAKHVLDDINDTHDQFIWEI 285 (508) T ss_pred --h-cCcccchhhcccccCCCcc--eEecCCCcceeEEecCCccccccCCCCcCCch-HhhhHHHHHHHHHHHHHHHHHH Confidence 0 0111222221000000000 11123444445544432 23367899995 8999999999999998888888 Q ss_pred HHHhcCceeecCCCccccce----ecCCceeecCC---cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 298 DKATNPPMVAPTSLKNQRAS----LLPGDITYIDQ---ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 298 ~~~~~p~~~~p~~~~~~~~~----~~pg~~~~~~~---~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) ...++.+.+|.++...+-+ ..++...|... .++...++.+ +..-......+.++.+.+.|....-.. 
..+ T Consensus 286 -~~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~~~~~l~~~~~~~gls-~~~ 362 (508) T protein:vir:15 286 -RLGQKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDM-TTPIRTVQYKDAIDHFIKEFEVQIGLS-TGT 362 (508) T ss_pred -HhcccceeechHHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEe-ecccChHHHHHHHHHHHHHHHHHhCCC-chh Confidence 5777788887764322111 12333223211 1111223221 111122333444555555554433111 123 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-ch---h--hCCcceEEEeecHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPP-PD---A--MEGMPLKVEYISVM 444 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~-p~---~--l~g~~v~~~~is~L 444 (559) ++.......|||||....+...+...-.-..+ ...|..++..+++++.-.+....- |. . ....+++|.+--++ T Consensus 363 f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~-~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i 441 (508) T protein:vir:15 363 FSYSNDGVKTATEVVSNNSMTYQTRSSYLTMV-EKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGV 441 (508) T ss_pred cccccCccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCC Confidence 33333445799999999998888877765555 457888888888877654433211 00 0 01123566654443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 445 AQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 445 a~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ..- +....+.. .+.+ +++ .+....+ +....|++ ++|++++.++.++.+ .+... 
T Consensus 442 ~~d-~~~~~~~~---~~~v----~aG-----i~s~e~~---i~~~~g~~------deea~~el~ri~~E~-----~~~~~ 494 (508) T protein:vir:15 442 FVN-KDKQLEED---AKVL----AIG-----ALSKQTF---LQRNYGMT------DEQAAEELAKIQSEA-----PTDTF 494 (508) T ss_pred CCC-HHHHHHHH---HHHH----hcC-----CCCHHHH---HHhcCCCC------hHHHHHHHHHHHHhc-----cccCc Confidence 211 11111111 1111 111 1222222 23445663 444433322211111 00000 Q ss_pred HHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCC Q lcl|NC_019445. 525 AAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQ 557 (559) Q Consensus 525 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 557 (559) . .....+.. |+-|. T Consensus 495 ~----~~~~~~~~---------------g~~ge 508 (508) T protein:vir:15 495 E----GGRSAILN---------------GGDGE 508 (508) T ss_pred c----ccccccCC---------------CCCCC Confidence 0 00000111 11111 No 80 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.05 E-value=3.8e-09 Score=66.72 Aligned_cols=437 Identities=13% Similarity=0.068 Sum_probs=187.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.+.....+...+..+...+ ++.+.+.+|..-... +..+.. .+...+.-+...+-+..+|+.||..|. +- T Consensus 12 l~~~~~~~~~~L~~~~~~~~----~~~~~~~~Yy~G~~~~~~~~~~-~p~~~r~~~~v~nw~~~~Vd~~a~rl~----~~ 82 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLR----WKNLLRTSYYENKRTIQYVGTL-IPPQYFNLGLVLGWTGKAVDALARRCN----LE 82 (474) T ss_pred CChhHHHHHHHHHHHHHHHh----hHHHHHHHHhccCCChhhcccc-ccHHHHHHHhhcChHHHHHHHHHhhhc----cc Confidence 88877655555454444443 344455555332110 011111 111111113456667777777777554 11 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC--CceEEEEEeeccE Q lcl|NC_019445. 
80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD--EDIIRTMPFPIGS 157 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~--~~~~~~~~~~l~~ 157 (559) + |++ +|.+..+ .. +++...+++|.....+++++..+||.+.++|..+. ....+++.++..+ T Consensus 83 G---f~~--~d~~~~~-~~-----------l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~ 145 (474) T protein:vir:81 83 G---FVW--PDGDLDS-LG-----------GTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASE 145 (474) T ss_pred c---eEC--CCCCccc-hH-----------HHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccce Confidence 2 332 3322111 11 23455678999999999999999999999987532 2236677888888 Q ss_pred EEEeeCCC-CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 158 YYLANSPR-GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 158 ~~v~~d~~-G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) .++..|+. +++..-++.. +.+. +...+...+..|...... ..+..+ .. T Consensus 146 ~~~~~D~~~~~~~~al~~~------------------------~~~~-~g~~~~~~ly~~~~~~~~-~~~~~~-----~~ 194 (474) T protein:vir:81 146 ATGEWNRRRRGLNNLLSII------------------------DKDK-EGKVLSLALYLDNETVTA-QRDKAT-----LK 194 (474) T ss_pred EEEEEeCCCCcceeeeEEE------------------------EEcC-CCcEEEEEEEeCCcEEEE-EEcCcc-----ce Confidence 88777763 2222222111 0011 111111111112111000 000000 01 Q ss_pred EEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeec----CC- Q lcl|NC_019445. 237 YEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPG-MLALGPVKALQLLQKRKSQLIDKATNPPMVAP----TS- 310 (559) Q Consensus 237 ~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~-~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p----~~- 310 (559) |..+.. +-++. +|.+++..+..-++.+|+|- . 
+..++-+..++...-.++..+++.+.|...+- ++ T Consensus 195 w~~~~~------~~~~g-vPvV~~~n~~~~~~~~G~s~-i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~ 266 (474) T protein:vir:81 195 WQVDRD------EHVYG-VPAQVLPYKPAPKRPFGQSR-ITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESAL 266 (474) T ss_pred eeeccC------CCCCC-cceEEecccccccCcCCccc-cchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhc Confidence 111111 12333 79999999988999999994 4 46778899999999999999999999975442 11 Q ss_pred ----Ccc-ccceecCCceeecCCcCCchhh-----hhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC-Cc Q lcl|NC_019445. 311 ----LKN-QRASLLPGDITYIDQITGQDGF-----RPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR-SM 379 (559) Q Consensus 311 ----~~~-~~~~~~pg~~~~~~~~~~~~~~-----~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~-~~ 379 (559) .+. ..++...|.++..+..++.... +-......++....+.+..+...|...--. ....++..... .- T Consensus 267 ~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~i-P~~~lG~~~~~np~ 345 (474) T protein:vir:81 267 KNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASL-PDTAVAISGLSNPT 345 (474) T ss_pred ccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCC-CHHHhccccccccc Confidence 111 1233445555555432221110 001111123443333333333333211111 11111111101 12 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCC--CCCchhhCCcceEEEeecH--HHHHHHHHHHH Q lcl|NC_019445. 380 PVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NML--PPPPDAMEGMPLKVEYISV--MAQAQKSIGLS 454 (559) Q Consensus 380 TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~l--p~~p~~l~g~~v~~~~is~--La~a~r~~~~~ 454 (559) +|.-|......+.. ..++.+. .+.+-+.+++.++... |-. ...|.+... +++..-.+ -..++++. 
T Consensus 346 SaeAi~a~~~~l~~----kae~k~~-~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~--~~v~W~d~~~~s~a~~aD--- 415 (474) T protein:vir:81 346 SAESYDASQYELIA----EAEGAVD-DFTPALRKAFIRALAMKNKVAIDEIPDEWKS--IDAKWRDPRYLSKSAQAD--- 415 (474) T ss_pred HHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHhCCCCccccchhhcc--ceeEecCCCccCHHHHHH--- Confidence 34333322221111 1122222 2222333344333321 221 233444332 33333222 11122222 Q ss_pred HHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019445. 455 SLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLS 534 (559) Q Consensus 455 ~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~ 534 (559) .+..++++++-+. + .+ .+...+|+ +++|+++++..+.+++.+....+... ... T Consensus 416 -------a~~Kl~~a~~~~~---~-~~---~~~~~lg~------t~~~i~~~~~~~~~~~~~~~~~~l~~-~~~------ 468 (474) T protein:vir:81 416 -------AGMKQLAAVPWLA---E-TE---VGLELIGL------TPQQARRAMADKRRVQGRGTLQALID-RSN------ 468 (474) T ss_pred -------HHHHHHhcccCCC---c-HH---HHHhhcCC------CHHHHHHHHHHHHHHhHHHHHHHHHh-cCC------ Confidence 2222223222110 0 11 12233565 46677666554333322221111110 000 Q ss_pred hhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 535 EAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 535 ~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) + ++-+| T Consensus 469 ~-------------------~~~aq 474 (474) T protein:vir:81 469 N-------------------GATAQ 474 (474) T ss_pred C-------------------CCCCC Confidence 1 11111 No 81 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.04 E-value=4.3e-09 Score=66.45 Aligned_cols=450 Identities=12% Similarity=0.079 Sum_probs=205.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) +.+. .+.+.+..+.-+.. ..++|+++.+|.....-.... .......+.+.++..+.+..+++..++-|++ T Consensus 37 ~~~~-~~~l~~~i~~~~~~---~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g----- 107 (501) T protein:vir:27 37 MVNN-WELLKNFINHHKLR---QAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAG----- 107 (501) T ss_pred cccc-HHHHHHHHHHHHHH---HHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcc----- Confidence 3332 23344433333332 234556666665432111100 1111112234567778888888887775543 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) .| ++++..|.... +.+.+.+.+.+...+|....+++.++..+||.+.+++..+....+++..++..+.+ T Consensus 108 -~p-~~~~~~d~~~~---------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~ 176 (501) T protein:vir:27 108 -NP-IRVEYDDNDNN---------SQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETF 176 (501) T ss_pred -cC-eeEecCCccch---------HHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeE Confidence 11 23333333211 12233445566678999999999999999999999998876666788888888877 Q ss_pred EeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 160 LANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 160 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) +-.|. .+++...+|.+.... 
.++....++||+ + + + .+++ T Consensus 177 ~v~d~~~~~~~~~~ir~~~~~~---------------------~~~~~~~~~vyt---~--~----~---------v~~~ 217 (501) T protein:vir:27 177 VIYDNSLEDNSIAAVRYYNRGT---------------------LQNAKDVVEIYT---N--E----H---------IYTL 217 (501) T ss_pred EEecCCCCCceEEEEEEEEeee---------------------cCCcEEEEEEEe---C--C----e---------EEEE Confidence 66654 456655555543210 001112233321 1 0 0 1122 Q ss_pred EecCCCceee-eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc--- Q lcl|NC_019445. 238 EVGGDNDKLL-RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN--- 313 (559) Q Consensus 238 ~~~~~~~~il-~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~--- 313 (559) ...++...+. ..-+|..+|++.++ ++..|+|. .+..++.+..++.+.-.+...++...+|.+++.+.... T Consensus 218 ~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~ 291 (501) T protein:vir:27 218 DASDDFNEISVTTHAFGTVPITEFL-----NNVDGIGD-YETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKG 291 (501) T ss_pred EeCCceeeccccccCCCcccEEEec-----CCCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcc Confidence 2222211111 22356788887764 34679995 88899999999999999999999999998876543211 Q ss_pred -ccceecCCceeecCCcC------CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHH Q lcl|NC_019445. 314 -QRASLLPGDITYIDQIT------GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIE 386 (559) Q Consensus 314 -~~~~~~pg~~~~~~~~~------~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~ 386 (559) ........+.+.+...+ +...++.+ +...+...+...+..+++.|...-+..-+... .-+...|+..+.. 
T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~--~~~~n~Sg~Al~~ 368 (501) T protein:vir:27 292 MQASDMKRTRLMQLKPPKSADGKEGTVKAEYL-TKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDT--NFSGNTSGEALKY 368 (501) T ss_pred cchhhhhhcCceeecccccccCCCCCcceeee-eccCCHHHHHHHHHHHHHHHHHHhCCcccCcc--ccccCchHHHHHH Confidence 11122223333332211 11112211 11223444555567776666544432111111 1123345544443 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 387 MKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 387 r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) ...- +........+.-.+.+.-+++.++.++...+... ......|++.+.-.+..- +...++.+..+ T Consensus 369 ~~~~-l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~----~~d~~~i~v~f~~~~p~n--------~~e~ad~~~kl 435 (501) T protein:vir:27 369 KLFG-LDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK----DFDESLLKITFTPNLPKS--------LNEQVSILTGL 435 (501) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccccceEEeCCCCCcC--------HHHHHHHHHHH Confidence 3222 2222333333333444444444444444333322 122334667765443311 11111223233 Q ss_pred hccChhhHhcCCHHHHHHHHHHHc-CCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHH Q lcl|NC_019445. 467 AQAKPEALDKLNVDQAIDAFADMS-GVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLS 545 (559) Q Consensus 467 a~~~P~~~~~id~d~~~~~~a~~~-Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 545 (559) +++ +....++ ..+ +++ -+++|++++++++++.....++-..-....+..+...+..++..+-.. T Consensus 436 ~g~-------iS~et~l----~~l~~v~----D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~ 500 (501) T protein:vir:27 436 GGQ-------VSQETAL----SLSGLVE----SPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAY 500 (501) T ss_pred hcc-------CcHHHHH----HhCCCCC----CHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccC Confidence 332 2222222 223 232 135677777665443222211110000011111111111111110000 Q ss_pred H Q lcl|NC_019445. 
546 A 546 (559) Q Consensus 546 ~ 546 (559) . T Consensus 501 ~ 501 (501) T protein:vir:27 501 E 501 (501) T ss_pred C Confidence 0 No 82 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.03 E-value=4.7e-09 Score=66.23 Aligned_cols=447 Identities=12% Similarity=0.083 Sum_probs=199.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCC-CCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFL-TSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~-~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) ......+.+.+..+.-+. .-.++++++.+|......... ........+...++..+.+...++..++-|++ T Consensus 37 ~~~~~~~~i~~~i~~h~~---~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g----- 108 (502) T protein:vir:48 37 LMVNNWELLKNFINHHKL---RQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAG----- 108 (502) T ss_pred hccccHHHHHHHHHHHHH---HHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcc----- Confidence 111122323333333222 223455666666443211000 01111122234466677777777777764332 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) .| ++++..+.... .. 
+.+.+...+...+|....+++.+++.+||.|.+++..+....+++..++..+.+ T Consensus 109 -~p-~~~~~~d~~~~--~~-------~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~ 177 (502) T protein:vir:48 109 -NP-IRVEYDDNEDN--SQ-------NDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETF 177 (502) T ss_pred -cC-eeEecCCccch--hH-------HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceE Confidence 11 13343333211 12 333445566778999999999999999999999888776666788888888877 Q ss_pred EeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEE Q lcl|NC_019445. 160 LANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) Q Consensus 160 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~ 237 (559) +-.|. .+++...+|.+.... .++....+++|+ +. + .+++ T Consensus 178 ~vydd~~~~~~~~~ir~~~~~~---------------------~~~~~~~~~iyt---~~------~---------i~~~ 218 (502) T protein:vir:48 178 VIYDNSLEDNSIAAVRYYNRGT---------------------LQNAKDVVEIYT---NQ------H---------IYTL 218 (502) T ss_pred EEEcCCCCCceEEEEEEEEEee---------------------cCCcEEEEEEEe---CC------e---------EEEE Confidence 66653 466666666543211 011112233321 10 0 1222 Q ss_pred EecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc-cc Q lcl|NC_019445. 238 EVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN-QR 315 (559) Q Consensus 238 ~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~-~~ 315 (559) ..++....+ ...-+|..+|++.++ ++..|+|. .+.+++-+..++.+.-.+...++...+|.+++.+.... .. 
T Consensus 219 ~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 292 (502) T protein:vir:48 219 DASDSFNEISVTPHAFGTVPITEFL-----NNADGIGD-YETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQG 292 (502) T ss_pred EeCCceeeccceecCCCccceEEec-----CCCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc Confidence 222211111 122356778887654 34578995 88899999999999999999999999998887653211 11 Q ss_pred ---ceecCCceeecCCcC------CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHH Q lcl|NC_019445. 316 ---ASLLPGDITYIDQIT------GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIE 386 (559) Q Consensus 316 ---~~~~pg~~~~~~~~~------~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~ 386 (559) ..+...+.+.+..+. +...++.+.. ..+...+...+..+.+.|...-+.. ......-+...|+..+.. T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~-~~~~~~~~~~~~~L~~~I~~~s~~p--~~~~~~~~~n~Sg~Alk~ 369 (502) T protein:vir:48 293 MQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTK-SYDVSGAEAYKTRLNKDIHVFTNTP--DMSDNHFSGNASGEALKY 369 (502) T ss_pred cchhhhhhcceeeccccccccccccCcceeEeee-cCCHHHHHHHHHHHHHHHHHHhCCC--CcCccccccCchHHHHHH Confidence 112222333322111 1112222211 1234455556677777665433221 111111123446655554 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 387 MKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 387 r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) ...- +........+.-.+.+.-+++-++.++...+... 
......+++++.-.+..- +...++.+..+ T Consensus 370 ~~~~-l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~----~~d~~~i~i~f~~~~p~d--------~~e~a~~~~kl 436 (502) T protein:vir:48 370 KLFG-LDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK----DFDESRLKITFTPNLPKS--------LYEQVSILNDL 436 (502) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccccceEEeCCCCCcC--------HHHHHHHHHHH Confidence 3322 2222222233333333333333334443333221 222334677775443311 11112223333 Q ss_pred hccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHH Q lcl|NC_019445. 467 AQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLS 545 (559) Q Consensus 467 a~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 545 (559) +++ |.-+.+++ .+| ++ -.++|++.+.+++.+...... ....+........+.. T Consensus 437 ~g~-------iS~et~l~----~l~~v~----D~~~E~~ri~~E~~~~~~~~~--------~~~~~~~~~~~~d~~~--- 490 (502) T protein:vir:48 437 GGQ-------VSQETALS----LSGLVE----NPTEELDKINEESSKIDFKGY--------PSYFYDNVGKYTDEVK--- 490 (502) T ss_pred hcc-------CcHHHHHH----hCCCCC----CHHHHHHHHHHHHHhhhhhcc--------cccccccccccCCCcc--- Confidence 332 22222332 233 22 124566655554332111110 0000000000000000 Q ss_pred HHHHHhhcCCCCCC Q lcl|NC_019445. 546 AMANAVSGQGGQSQ 559 (559) Q Consensus 546 ~~~~~~~~~~~~~~ 559 (559) .+.+...+ T Consensus 491 ------e~~~~~~~ 498 (502) T protein:vir:48 491 ------ETHTDDFE 498 (502) T ss_pred ------CCCCcCcC Confidence 00011111 No 83 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.03 E-value=4.9e-09 Score=66.14 Aligned_cols=446 Identities=11% Similarity=0.040 Sum_probs=207.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC---CCC-----C-C-CCCCcccccCCCCcchHHHHHHHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS---RFL-----T-S-EVNRNDRRNTRIIDSTGTMAARTLAS 70 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~---~~~-----~-~-~~~~~~~~~~~~~~s~~~~a~~~Las 70 (559) |.-...+++.. .+...++....+++.+.+|..-.-. +.. . . ....-.+.+.|+..+.+...++..++ T Consensus 1 ~~~~~~~~~i~---~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~ 77 (470) T protein:vir:10 1 MELDALKKLIQ---NTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (470) T ss_pred CchHHHHHHHH---HHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhh Confidence 66555555554 4455555566777788887543210 000 0 0 00111123456767777777776666 Q ss_pred HHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 71 GMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 71 ~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) -|++ .| ..+...+... ...++. .+. .+|...+.++.++...+|.+.+++..|...-+++ T Consensus 78 yl~G------~p-~~~~~~d~~~--~~~l~~-----------~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~ 136 (470) T protein:vir:10 78 YVAS------VF-PDIDVGKDAD--NKKIID-----------VLG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRY 136 (470) T ss_pred heec------cc-eeeecCchHH--HHHHHH-----------HHh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEE Confidence 4333 11 1234333221 112222 233 3667778888899999999999888877666888 Q ss_pred EEeeccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE---EEeecCccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH---SVYPNIDRDTSKL 225 (559) Q Consensus 151 ~~~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~---~v~p~~~~~~~~~ 225 (559) ..++..+.++-.|. .+++..++|.|...-. + .......+++|+ ..+.+........ 
T Consensus 137 ~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~--------~-----------~~~~~~~~e~yt~~~~~~~~~~~~~~~~ 197 (470) T protein:vir:10 137 GIIQPDQITPIYATTLDNKLLGILRSYKQLDP--------D-----------SGKYFTVHEYWTDKEAQFFRTNATDSTV 197 (470) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeec--------C-----------CceEEEEEEEEcCCcEEEEEeecCccee Confidence 88998888877764 4677666665543200 0 001111222221 0000000000000 Q ss_pred ccccccEEEEEEEecCCCcee-eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 226 DSKNKPFKSVYYEVGGDNDKL-LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 226 ~~~~~~~~sv~~~~~~~~~~i-l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) ......+.+.......+...+ ...-+|..+|++.++= +.+|.|. .+...+.+..++.+.-.....++...+|. T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 271 (470) T protein:vir:10 198 IEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK-----NKYRLPE-LNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (470) T ss_pred ccccccccccccccccccccccccccCCCeeeEEEeec-----CCCCCCc-hhHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 000011111111111111101 1123556677776553 4689996 88899999999999999999999999999 Q ss_pred eeecCCCc----cccceecCCceeecCCcCC--chhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 305 MVAPTSLK----NQRASLLPGDITYIDQITG--QDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 305 ~~~p~~~~----~~~~~~~pg~~~~~~~~~~--~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +++.+... ....++..++.+.++..+. ...++-+ +...+.......++.+++.|-+.-+..-+ ..-.... 
T Consensus 272 lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l-t~~~~~~~~~~~~~~L~~~I~~~s~~p~~---~~~~~gn 347 (470) T protein:vir:10 272 LVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKL-QIDIPVEARDDALKITRKNIFLFGQGIDP---ANFESSN 347 (470) T ss_pred eeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEE-eecCChHHHHHHHHHHHHHHHHHhCCCCC---Ccccccc Confidence 98875321 1122344556666644321 2223322 22234555666677777777654433211 1112234 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~ 457 (559) .|+..+..+-.-+.. ...+... .+.+.+.+.++++.+. +. -. .....++++|.-++-. +. . T Consensus 348 ~Sg~Alk~~~~~l~~----k~~~~~~-~~~~~l~~~~~~i~~~l~~-~~----~d~~~i~i~f~~~~p~-----d~---~ 409 (470) T protein:vir:10 348 ASGVAIKMLYSHLEL----KAAKTQT-YFEHAINELVRAIMRYLNF-SD----ADKRHISQHWTRTKVE-----DS---L 409 (470) T ss_pred chHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHhcc-cC----cccceeeEEeccCCCC-----CH---H Confidence 455554433222221 2223322 3334445555554431 11 11 1123456666544431 11 1 Q ss_pred HHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_019445. 458 STVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEA 536 (559) Q Consensus 458 ~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~ 536 (559) ..++.+..+++ .++-..++.. ++ ++ -.++|++++.++++++++..+. .+...+. T Consensus 410 e~~~~~~~~~g-------~iS~et~l~~----~p~v~----D~~~E~eri~~E~~e~~~~~~~----------~~~~~~~ 464 (470) T protein:vir:10 410 TKAQIVSTVAN-------YSSKEAVAKA----NPIVD----DWQQELKDLAKDKEENDPYSNQ----------ADELNGK 464 (470) T ss_pred HHHHHHHHHhc-------cCcHHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHHhhcc----------ccccCCC Confidence 11222222222 2333333332 23 21 1356666666554333221110 1111111 Q ss_pred cCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
537 KTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 537 ~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) +. +..| T Consensus 465 ~~-----------------dde~ 470 (470) T protein:vir:10 465 GV-----------------NDEQ 470 (470) T ss_pred CC-----------------CCCC Confidence 11 1111 No 84 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.02 E-value=5.1e-09 Score=66.06 Aligned_cols=413 Identities=8% Similarity=0.007 Sum_probs=178.6 Q ss_pred hccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 34 INPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDM 113 (559) Q Consensus 34 ~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~ 113 (559) ++|...+.. -+....+...+.+..+++.++..|. +.+ |+ .+|.+.. .. +.+. T Consensus 1 ~l~~~~~~~------~~~~~~~~v~n~~~~ivd~~~~~l~----~~g---f~--~~d~~~~--~~-----------~~~i 52 (434) T protein:vir:98 1 MLPKNAEQA------FLDFQRKARTNFCGLIANASVHRLL----ALG---VT--GPDGEPD--TR-----------ASRW 52 (434) T ss_pred CCCCCccHH------HHHhhhhhhccchHHHHHHHHhhhc----cCc---ee--cCCCchH--HH-----------HHHH Confidence 444432210 0111223345678888888888653 333 43 3333211 11 2334 Q ss_pred HHhccchHHHHHHHHHHHhhCcEEEEEeecCCc-------eEEEEEeeccEEEEeeCCC-CCEEEEEEEEeecHHHHHHh Q lcl|NC_019445. 114 FNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED-------IIRTMPFPIGSYYLANSPR-GSVDICFRKFSMTVRQLVQE 185 (559) Q Consensus 114 l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~-------~~~~~~~~l~~~~v~~d~~-G~vd~i~r~~~~t~~ql~~~ 185 (559) +.+++|.....+++++..+||.|.+++..+... -.++++++..+..+..|.. +++...++.+.... 
T Consensus 53 ~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~------ 126 (434) T protein:vir:98 53 WQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDI------ 126 (434) T ss_pred HHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEecc------ Confidence 566889999999999999999999988654321 2346678888887777753 45544444332110 Q ss_pred cCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeec Q lcl|NC_019445. 186 FGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN 265 (559) Q Consensus 186 fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~ 265 (559) ++.....+.+++.++....+...... ..+.+.-|..... ..-..+-+|..+|++.+.-+.. T Consensus 127 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~h~~g~vPvv~f~N~~~ 187 (434) T protein:vir:98 127 ---------------DGFGYARVFFDDTSFPYRTRERTGAR---LPWGPDSWVYTGT-ADSGDVHDLGGMQLVEFARMPD 187 (434) T ss_pred ---------------CCceEEEEEEeCcEEEEEEeeccccc---cccccccceeccc-ccccccCCCCccceEEeccCCC Confidence 01111122222222211111110000 0011100111000 0011223678899998876666 Q ss_pred CCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCccc-------------cceecCCceeecCCcCCc Q lcl|NC_019445. 266 GEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQ-------------RASLLPGDITYIDQITGQ 332 (559) Q Consensus 266 ~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~~-------------~~~~~pg~~~~~~~~~~~ 332 (559) .++ +|+|- .+..++.+..++...-.++..++..+.|.+.+.+..... .+...+|+++..+.. . 
T Consensus 188 ~~~-~g~sd-~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~ 263 (434) T protein:vir:98 188 LGE-DPEPE-FAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGE--N 263 (434) T ss_pred cCc-CCcch-hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCC--C Confidence 554 79996 789999999999999999999999999976654211000 011233333322211 1 Q ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_019445. 333 DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLID 412 (559) Q Consensus 333 ~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~ 412 (559) ..+..+ ....+..+...+..+.+.|...--.. ...+.. +..+.++..++.....+.... .+.+. .+.+-+. T Consensus 264 ~~~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p-~~~~~~-~~~n~Sg~Al~~~~~~l~~k~----~~k~~-~f~~~l~ 334 (434) T protein:vir:98 264 TQFGQL--DATDLSGFLKEHASDVRDMLTISQTP-TYLYAT-DLVNISADTIGALDILHVAKV----REHIA-SFSEGLE 334 (434) T ss_pred ceEEEe--cCcchHHHHHHHHHHHHHHhcccCCC-HHHhcc-ccCChHHHHHHHHHHHHHHHH----HHHHH-HHHHHHH Confidence 111111 11123333333333333332111000 011111 112345554444333322222 22222 2333334 Q ss_pred HHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCC Q lcl|NC_019445. 413 RAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGV 492 (559) Q Consensus 413 r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gv 492 (559) +.+.++.+..-. +.+ ...+++.+..++.. ++...++.+..+.+++ +.. +.+...+|. 
T Consensus 335 ~~~rl~~~~~g~---~~~--~~~~~v~w~~~~~~--------s~~~~ada~~kl~~~g------~~~----e~~~~~lg~ 391 (434) T protein:vir:98 335 SVLALAAAQAGV---PED--YTEAEVRWANPAHV--------TMAVKADAATKLKSIG------YPL----DVIAEELDE 391 (434) T ss_pred HHHHHHHHhcCC---Chh--heeeeEEecCCCCC--------CHHHHHHHHHHHHhcC------CcH----HHHHHhCCC Confidence 445554443211 112 23466766555432 1112222333333332 111 123345665 Q ss_pred CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcC Q lcl|NC_019445. 493 SPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQ 554 (559) Q Consensus 493 p~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 554 (559) +++|++++.+++.++....+..+. +. +. .+++..+.+ +.+..| T Consensus 392 ------~~~e~~r~~~e~~~~~~~~~~~~~--~~----~~-~~~g~~~~~------~~~~dg 434 (434) T protein:vir:98 392 ------SPARVRRIVAGAASQALLAASLLP--AP----GA-PSAGNVPDS------GGAVDG 434 (434) T ss_pred ------CHHHHHHHHHHHHHHHHHHHhhhc--cC----CC-CCCCCCCcc------cCCCCC Confidence 356777665554332222111111 00 00 001111100 111111 No 85 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.02 E-value=5.1e-09 Score=66.04 Aligned_cols=446 Identities=9% Similarity=0.075 Sum_probs=202.8 Q ss_pred CCh--h----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCC-CCCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAE--T----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFL-TSEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~--~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~-~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.. . ..+.+.+..+.....|. ++++++.+|..-.-.... 
........+...++..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhc Confidence 221 1 12234333333333333 455555555432100000 0000111123346777888888888776554 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -|+. ++..|+. +.+.+...+...+|.....++.++..+||.+.+++..+....+++..+ T Consensus 108 g--~p~~-----~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~ 167 (511) T protein:vir:99 108 G--NPIQ-----YQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (511) T ss_pred c--cCce-----eecCchH-------------HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEE Confidence 2 1222 2333321 234455666778899999999999999999999888776666888889 Q ss_pred eccEEEEeeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCC--ceEEEEEEEeecCccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYE--KWIEVMHSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 154 ~l~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~--~~v~v~~~v~p~~~~~~~~~~~~~ 229 (559) +..+.++-.|.. +++...+|.+..... +..+.. ..+++|+ +.. . 
T Consensus 168 ~p~~~~~vyd~~~~~~~~~~vr~~~~~~~-------------------~~~~~~~~~~~~vyt---~~~--i-------- 215 (511) T protein:vir:99 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPI-------------------DKTDEDEVFTVDLFT---SHG--V-------- 215 (511) T ss_pred ccceeEEEEcCCCCCceEEEEEEEEeeec-------------------ccCccceEEEEEEEe---CCc--E-------- Confidence 988888777653 566666665543210 011111 1122221 100 0 Q ss_pred ccEEEEEEEecCC-------CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 230 KPFKSVYYEVGGD-------NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 230 ~~~~sv~~~~~~~-------~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) ++|...+. ...-...-+|..+|++.++- +..|+|. .+..++.+..++.+.-.+...++...+ T Consensus 216 -----~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~ 284 (511) T protein:vir:99 216 -----YRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLND 284 (511) T ss_pred -----EEEEecCCccccccccccccccCCCCccceEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 01111110 00111223577888887754 3578995 888999999999999999999999999 Q ss_pred CceeecCCCccc--cc-eecCCceeecCC----------cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_019445. 303 PPMVAPTSLKNQ--RA-SLLPGDITYIDQ----------ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFM 369 (559) Q Consensus 303 p~~~~p~~~~~~--~~-~~~pg~~~~~~~----------~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~ 369 (559) |.+++.+..... .. ....++.+.... .++...++.+. ...+...+...+..+++.|-..-+.. . 
T Consensus 285 ~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~P--~ 361 (511) T protein:vir:99 285 AMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTP--N 361 (511) T ss_pred hhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCc--c Confidence 987765422111 01 111122221110 11112233221 12345555566777777665444321 1 Q ss_pred hccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HH Q lcl|NC_019445. 370 MLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QA 447 (559) Q Consensus 370 ~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a 447 (559) .....-+...|+..+..+-.- +........+.-.+.+.-+++-++.++...+.... +.+. ..+++.+.-.+. .+ T Consensus 362 ~~~~~~~gn~Sg~Alk~~~~~-l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~-~~~~--~~i~i~f~~~~p~n~~ 437 (511) T protein:vir:99 362 MKDDNFSGTQSGEAMKYKLFG-LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDV-SKDF--NTVRYVYNRNLPKSLI 437 (511) T ss_pred cccccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc-cccc--ccceEEeCCCCCcCHH Confidence 111111234455555444332 22233333333334444444444444444443222 1122 235555543322 12 Q ss_pred HHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 448 QKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAA 527 (559) Q Consensus 448 ~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~ 527 (559) .. ++.+..+++ .+....++..+ -+++ -.++|+++|.++++.+.+.++.... T Consensus 438 e~----------~~~~~kl~G-------iiS~et~l~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~~~~~----- 488 (511) T protein:vir:99 438 EE----------LKAYIDSGG-------KISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKNMY----- 488 (511) T ss_pred HH----------HHHHHHHhc-------cCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhhccc----- Confidence 11 122222222 13333333332 1232 1356777776665443322211110 Q ss_pred HHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
528 QGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 528 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ...+.+.+....+.+ .....+ T Consensus 489 ~~~~~~~~~~~~~~~-----------~~~~d~ 509 (511) T protein:vir:99 489 QDPRNINDDEQDDST-----------KDSIDK 509 (511) T ss_pred ccCCCCCCCCCCCCC-----------cCcccc Confidence 000111111110000 000001 No 86 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.02 E-value=5.3e-09 Score=65.97 Aligned_cols=436 Identities=13% Similarity=0.089 Sum_probs=201.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCCCCC-CCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFLTSE-VNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~~~~-~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) +.+.+.+.|.+..+..+. |-+....+++++.-- ++.+....... .....+.+.++..+.+...++..++-|++ T Consensus 24 ~~~~~~~~i~~~i~~~~~-~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g--- 99 (474) T protein:vir:95 24 QFETQEEMIIRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--- 99 (474) T ss_pred ccCChHHHHHHHHHHHHH-HHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhcc--- Confidence 555555555555555543 333444555555421 12221111111 11112234567778888888888775543 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) .| +.+...|+. +. 
+.+...+ ..+|...+.++.++..++|.|.+++..+...-+++..++..+ T Consensus 100 ---~p-~~~~~~d~~------~~-------~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 161 (474) T protein:vir:95 100 ---KP-VTYSCEDES------VL-------KIIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQ 161 (474) T ss_pred ---CC-ceeccCchH------HH-------HHHHHHH-hccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccc Confidence 22 123443322 11 1222233 367899999999999999999998887766557888888888 Q ss_pred EEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEE---eecCcccccccccccccE Q lcl|NC_019445. 158 YYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSV---YPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 158 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v---~p~~~~~~~~~~~~~~~~ 232 (559) ++.-.|. .|.+..++|.+... ....+++|+.- +.+.+ .+++ T Consensus 162 ~~~v~d~~~~~~~~~~i~~~~~~-------------------------~~~~~~~y~~~~~~~~~~~--~~~~------- 207 (474) T protein:vir:95 162 AIPIWVDKEREELKSFIRYYKFN-------------------------NEEKVEFWTDTTVTYYVLE--NGGL------- 207 (474) T ss_pred eEEEEcCCCCCceEEEEEEEEEc-------------------------CeeEEEEEeCCeEEEEEEc--CCcc------- Confidence 7766554 56777677665421 01123333210 00000 0000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~ 312 (559) ...+.........-...-+|..+|++.++. +.+|.|. .+...+.+..++.+.-.....++...+|.+++.+... 
T Consensus 208 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 281 (474) T protein:vir:95 208 IPDYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSD-IWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEG 281 (474) T ss_pred ccccccCcccccccccccCCCccceEeecC-----CCCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 000000000000001123567889888754 4678996 8889999999999999999999999999888765321 Q ss_pred c--c--cceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 313 N--Q--RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 313 ~--~--~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) . . .-....++++..+..++ ++.+. .+.+...+...+..+...|...-+.. .......+...|+..+..+. T Consensus 282 ~~~~~~~~~~~~~~~i~~~~~~~---~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~~~~n~Sg~Alk~~~ 355 (474) T protein:vir:95 282 QDLEEFMRGLKYYKAINVDGDGG---VETIQ-VEVPVSSTKEYIDLMRAYIMEFGQGV--DFQTDKFGSAPSGIALKFLY 355 (474) T ss_pred ccchhhhhhhhccceeeccCCCc---eeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCc--ccccccccccchHHHHHHHH Confidence 1 1 11233344444433222 33221 22345566666777777776544321 11111112334555543332 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe--ecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY--ISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~--is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) ..+... ..+.. ..+...+.+++.++.+.--.. ....++++++ ..|-..+. 
.++.+..+ T Consensus 356 ~~l~~k----~~~k~-~~~~~~l~~~~~li~~~~g~~-----~d~~~i~v~f~~~~p~d~~e----------~a~~~~~~ 415 (474) T protein:vir:95 356 GNLDLK----ANKLK-NKATVAIQELIGFIIDFNNLK-----MDVKDIEISFNFNRMMNDAE----------QSQIIAQS 415 (474) T ss_pred HHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCCC-----cccceeeEEeccCCCcCHHH----------HHHHHHhc Confidence 222211 12222 234444455555554431111 1122344544 33322111 11112221 Q ss_pred hccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 467 AQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 467 a~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) + .+.-..++. .++ ++ -.++|++++.++++++.++++.... ..++....-.+.....+. T Consensus 416 -g-------~iS~et~i~----~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~--~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 416 -Q-------YLSRETLVK----SSPLVD----DYKAELERIEQEQMEYNKQLPNLDD--GGADGAQQQERSNDKESE 474 (474) T ss_pred -C-------CCchHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHhccccccc--ccCCCCcCCCCCccCCCC Confidence 1 233333332 222 32 1346666666554433332211100 000000000000000011 No 87 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.01 E-value=5.8e-09 Score=65.75 Aligned_cols=435 Identities=11% Similarity=0.124 Sum_probs=199.6 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC---CCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT---SEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.....+++.. ..+ ....++|+.+.+|....-..... .......+...++..+.+...++..++-|. 
T Consensus 30 ~~~~~i~~~i~---~~~---~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~---- 99 (481) T protein:vir:10 30 LKEENLRNFIS---RHQ---TEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLT---- 99 (481) T ss_pred cCHHHHHHHHH---HHH---HHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhc---- Confidence 33333333333 322 34455677777775432111000 000111122345667777777777776443 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) +.|. .+...|.. ..+.+.+.+...+|.....++.++..++|.|.+++..+....+++..++..+ T Consensus 100 --g~~~-~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~ 163 (481) T protein:vir:10 100 --GNPI-TITHQDNQ-------------TNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKS 163 (481) T ss_pred --cCCc-eEecCChh-------------HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccc Confidence 2222 23333322 1223455667788999999999999999999988877766667888899998 Q ss_pred EEEeeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 158 YYLANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 158 ~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) .++-.|.. +++...+|.++.. ..+ ...+..++...+.. .+ T Consensus 164 ~~~v~d~~~~~~~~~~i~~~~~~----------------------~~~-~~~~~~~~~y~~~~---------------i~ 205 (481) T protein:vir:10 164 TFVVYDQTLDKKVVAGVRYFEKQ----------------------DKD-KVPVQHVEVYTTDK---------------IY 205 (481) T ss_pred eEEEEcCCCCCceEEEEEEEEEe----------------------eCC-CceEEEEEEEecCe---------------EE Confidence 88777654 4566555554321 001 11222221111100 12 Q ss_pred EEEecCCCceeee--ecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc Q lcl|NC_019445. 
236 YYEVGGDNDKLLR--ESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN 313 (559) Q Consensus 236 ~~~~~~~~~~il~--esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~ 313 (559) +|...+..-..+. +-+|..+|++.++- +.+|+|. .+...+.+..++.+...+...++...+|.+++++.... T Consensus 206 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~-~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~ 279 (481) T protein:vir:10 206 YIEIKGGTYHRVEEVEHYYNDVPIIEYLN-----DQFKQGD-FENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDL 279 (481) T ss_pred EEEecCCceeecccccccCCceeEEEeec-----CCCCCCc-hhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCC Confidence 2333222211111 22456788776542 4678996 77888889999998888888999999998887653211 Q ss_pred cc---ceecCCceeecCC------cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 314 QR---ASLLPGDITYIDQ------ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 314 ~~---~~~~pg~~~~~~~------~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) .. .....++.+.... .++...++-+ +...+...+...++.++..|...-... .......+...|+..+ T Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~~~~n~Sg~Al 356 (481) T protein:vir:10 280 DSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYV-YKQYDVAGVEAYKKRLQNDIHKYTNTP--DLNDEQFSGVQSGESM 356 (481) T ss_pred CccchhhhhhccceeccccccccCCCCCcceeEE-eecCCHHHHHHHHHHHHHHHHHHhCCc--cccccccccccHHHHH Confidence 11 1222333332211 1111222211 112234455555666666554333221 1111111223355443 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) .....- |--..++.+ ..+...+.+++.++.+.--+.... ......+++.+.-++..- .... ++.+. 
T Consensus 357 ~~~~~~----l~~k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~i~v~f~~~~~~~-~~~~-------a~~~~ 422 (481) T protein:vir:10 357 KYKLFG----LEQVRAIKE-RLFKKGLMKRYKLLLNNVNLTGLK-QHNYAELTITFTPNLPKS-MMES-------INAFN 422 (481) T ss_pred HHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCC-ccccceeeEEeCCCCCcC-HHHH-------HHHHH Confidence 332222 222233332 345555566666655531111111 112235667665443210 1111 12222 Q ss_pred HHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhH Q lcl|NC_019445. 465 QLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSV 543 (559) Q Consensus 465 ~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 543 (559) .++++ +....+++ .++ ++ -.++|++.++++++++++..+.. ...+.... ....++++. T Consensus 423 kl~g~-------is~et~~~----~l~~i~----d~~~E~~ri~~E~~~~~~~~~~~----~~~~~~~~--~~~~dd~~g 481 (481) T protein:vir:10 423 ALSGG-------VSESTRLS----LLDFID----NPKEELEKMQEEEAQREKQADKR----GYGEAFEN--HLNVDDSNG 481 (481) T ss_pred HHhcc-------CChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHhhhhhc----cCCccCCC--CCCCCCCCC Confidence 22222 22233333 232 21 13566666666554433221111 11111111 011111111 No 88 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.99 E-value=7.3e-09 Score=65.18 Aligned_cols=438 Identities=10% Similarity=0.048 Sum_probs=198.3 Q ss_pred CChh----------------------hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----c-CCCCCCCCCCcccc Q lcl|NC_019445. 1 MAET----------------------TKERLNKQFAQLESERQSFEPHWRELSDYINPR-----G-SRFLTSEVNRNDRR 52 (559) Q Consensus 1 M~~~----------------------~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~-~~~~~~~~~~~~~~ 52 (559) |++- ..+.|.+.....+ ...++++.+.+|.-.. + ............+. 
T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~----~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK----ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKP 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhccccccccccccccccc Confidence 4432 2222222223332 2345566666665431 0 00001111111123 Q ss_pred cCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHh Q lcl|NC_019445. 53 NTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGT 132 (559) Q Consensus 53 ~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 132 (559) +.++..+.+..+++..++-|++ -| ++++..++. .. +.+...+. .+|.....++.++..+ T Consensus 77 ~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~~~~~d~------~~-------~~l~~~~~-n~~~~~~~~~~~~~~~ 135 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVA--NP-----VTFGVDNDK------AL-------KQIQHTLN-HKWDDKLVDILTAASN 135 (478) T ss_pred cceeccchHHHHHHHHHhhhcc--CC-----eeeecCChH------HH-------HHHHHHHh-cCHHHHHHHHHHHHHh Confidence 3456777888888888875543 11 123333322 11 11222333 5788899999999999 Q ss_pred hCcEEEEEeecCCceEEEEEeeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEE Q lcl|NC_019445. 133 YSTGAMAVLEDDEDIIRTMPFPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEV 210 (559) Q Consensus 133 ~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v 210 (559) +|.|.+++..+....+++..++..+.+.-.| ..+.+...+|.++..- .+.+++ T Consensus 136 ~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~-------------------------~~~~~~ 190 (478) T protein:vir:10 136 KGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG-------------------------AERVEY 190 (478) T ss_pred cCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC-------------------------ceEEEE Confidence 9999998887776667888888888777655 3567777777654211 112333 Q ss_pred EEE--E-eecCcccccccccccccEEEEEEEecCCCce---eeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHH Q lcl|NC_019445. 
211 MHS--V-YPNIDRDTSKLDSKNKPFKSVYYEVGGDNDK---LLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVK 284 (559) Q Consensus 211 ~~~--v-~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~---il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~ 284 (559) ++. | +.+.+ ....+...+....+.... ....-+|..+|++.++. +.+|+|. .....+.+. T Consensus 191 y~~~~i~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd-~~~v~~liD 256 (478) T protein:vir:10 191 WTKDDVTYYELK--------EGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSD-LFMYKTIID 256 (478) T ss_pred EeCCeEEEEEEc--------CCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCc-HHHHHHHHH Confidence 211 0 00000 000000000000000000 01123567788877754 4689996 778889999 Q ss_pred HHHHHHHHHHHHHHHHhcCceeecCCC----ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHH Q lcl|NC_019445. 285 ALQLLQKRKSQLIDKATNPPMVAPTSL----KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIIN 360 (559) Q Consensus 285 ~L~~l~~~~~~~~~~~~~p~~~~p~~~----~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~ 360 (559) .++.+.-.+...++...+|.+++.+.. .....+...++++.++..++ ..+..+ +...+...+...++.++..|- T Consensus 257 a~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l-~~~~~~~~~~~~~~~l~~~i~ 334 (478) T protein:vir:10 257 ALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESG-SGVDTI-KVEVPIDSVKEYTKMLRDYII 334 (478) T ss_pred HHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecCCCC-CcceEE-eecCChHHHHHHHHHHHHHHH Confidence 999999999999999999987765421 11122344555555543322 222222 222345555666777776665 Q ss_pred HHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe Q lcl|NC_019445. 361 SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY 440 (559) Q Consensus 361 ~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~ 440 (559) ..-... .......+...|+..+..+...+.. . ..+. ...+...+.+++.++.+..-. 
...-.++++++ T Consensus 335 ~~s~~p--~~~~~~~~~n~Sg~Al~~~~~~l~~-k---~~~~-~~~~~~~l~~~~~li~~~~g~-----~~~~~~i~i~f 402 (478) T protein:vir:10 335 EFGQGV--DFQQDKFGNSPSGIALKFMYSNLDL-K---ANKL-KNKTLTALQELLQYIIDFYRL-----DVKVQDIEITF 402 (478) T ss_pred HHhCcc--ccCccccccccHHHHHHHHHHHHHH-H---HHHH-HHHHHHHHHHHHHHHHHHhCC-----CcccccceEEe Confidence 444321 1111111234566554433222211 1 1222 223444444555554442111 11122456655 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMM 520 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~ 520 (559) .-.+.. + ....++.++.++++ +....+++ .+|. +--.++|++.++++++.+++..... T Consensus 403 ~~~~p~-----d---~~e~a~~~~kl~g~-------iS~et~~~----~l~~---v~D~~~E~~ri~~E~~~~~~~~~~~ 460 (478) T protein:vir:10 403 NFNVMV-----N---ELENSQIAMNSTGL-------LSKETILS----NHAW---VEDPVAEMERIEQENIELNQQLPDI 460 (478) T ss_pred cCCCCC-----C---HHHHHHHHHHHhCC-------CChHHHHH----hCCC---CCCHHHHHHHHHHHHHHHHhhcccc Confidence 433321 0 11112223333332 33333333 3332 1123466666665543322221111 Q ss_pred HHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 521 AMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 521 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ......-.+... ..+++. 
T Consensus 461 ------~~~~~~~~~~~~---------------~~~~~~ 478 (478) T protein:vir:10 461 ------EEGLNGEQQRQS---------------ENNQPE 478 (478) T ss_pred ------ccccCCCCCCCC---------------CCCCCC Confidence 110000000000 001111 No 89 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.98 E-value=7.8e-09 Score=65.03 Aligned_cols=449 Identities=8% Similarity=0.009 Sum_probs=180.9 Q ss_pred CCh-----hhHHHHHHHHH-HHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcc---cccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAE-----TTKERLNKQFA-QLESERQSFEPHWRELSDYINPRGSRFLTSEVNRND---RRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~-----~~~~~l~~r~~-~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~---~~~~~~~~s~~~~a~~~Las~ 71 (559) |.+ .+.+++.+... +|-.......++++.+.+|..-...-.......+.+ +-..++..+.+..+|+.+++. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 80 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQ 80 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhh Confidence 332 12222322211 233333334455666666643211000000011111 111122345666667766664 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee-----cCCc Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE-----DDED 146 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~-----~~~~ 146 (559) | +|.+ |+ .+|.+.. ..+ .+.+.+++|....+++.++..+||.+.+++.+ |... 
T Consensus 81 l----~~~g---f~--~~d~~~~--~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g 138 (479) T protein:vir:99 81 L----IVDG---YR--KTGTNEN--AKG-----------WDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTT 138 (479) T ss_pred c----cccc---cc--CCCchhh--HHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCC Confidence 3 4444 33 3333322 122 23445678888999999999999999998864 2333 Q ss_pred eEEEEEeeccEEEEeeC-CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccc Q lcl|NC_019445. 147 IIRTMPFPIGSYYLANS-PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d-~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~ 225 (559) ..++..++..+.++-.| +....-.+|. ++. .....+.++. .. T Consensus 139 ~~~i~~~~p~~~~~iydd~~~~~~~~~~---~~~-----------------------~~~~~~~~~~----~~------- 181 (479) T protein:vir:99 139 VARIKCIDPRDAFAIWEDPYWDEWPKYL---LER-----------------------QPNGQYWWWT----EE------- 181 (479) T ss_pred ceEEEEechhheEEEecCCcccceeeEE---Eee-----------------------cCceeEEEEe----cc------- Confidence 35666677766655443 2221111221 110 0011121111 00 Q ss_pred ccccccEEEEEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019445. 226 DSKNKPFKSVYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) Q Consensus 226 ~~~~~~~~sv~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p 303 (559) ...+++.+.+.-.+. .+-+|..+|++++.-+...+ .+|+|. .+..++.+-.++.....+...++..+.| T Consensus 182 -------~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~-~~g~sd-~e~v~~liDa~~~~~s~~~~~~~~~a~p 252 (479) T protein:vir:99 182 -------DYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLR-GVCYGD-VEPLVTVAKAIDKTGLDILLVQHHQSFQ 252 (479) T ss_pred -------eEEEEEecCCceeeccccccCCCCcceEEeecCCCcC-cCCcch-hHHHHHHHHHHHHHHHHHHHHHHHhhch Confidence 001122211111111 12356789999988777664 589996 7889899999999988889999999999 Q ss_pred ceeecCCCc-------cccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc-chh-hhccCC Q lcl|NC_019445. 
304 PMVAPTSLK-------NQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV-DLF-MMLQNI 374 (559) Q Consensus 304 ~~~~p~~~~-------~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~-dl~-~~~~~~ 374 (559) .+.+.+... ........++++..... +.. +. ......+... +..++.-|...+.. ++. ..++. T Consensus 253 ~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~-~~--q~~~~~~~~~---~~~l~~~i~~i~~~t~~p~~~~g~- 324 (479) T protein:vir:99 253 IRWATGLMLPEGANADQEKMRFAQESMLISQNE-KAS-FG--AIPAAPLDGL---LNAYKESLLEFLALAQLPPHIAGQ- 324 (479) T ss_pred hhhhcCCCcccccccchhccccccccceeecCC-Cce-EE--EecccchHHH---HHHHHHHHHHHhccCCCCHHHccc- Confidence 766543211 11122233344433221 111 11 1111123333 33333333222211 000 11111 Q ss_pred CCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHH Q lcl|NC_019445. 375 NTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLS 454 (559) Q Consensus 375 ~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~ 454 (559) ....++.-+.....-+.. ...+.+ ..+.+-+.+++.++.+..-....... ..+++.+-.+... T Consensus 325 -~~n~Sg~Al~~~~~~l~~----ka~~~~-~~f~~al~~~~~l~~~~~~~~~~~~~---~~i~~~w~~~~~~-------- 387 (479) T protein:vir:99 325 -IVNVAADALAAGTRQTMQ----KLFEKQ-ATWKASHNQTMRLVNKIEGRTEEATD---LDFTITWQDVTIQ-------- 387 (479) T ss_pred -ccchHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHHcCCCccccc---eeeeEEecCCCCC-------- Confidence 122344444433222221 122222 23444555556655442111111111 2345544222110 Q ss_pred HHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh- Q lcl|NC_019445. 
455 SLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTL- 533 (559) Q Consensus 455 ~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~- 533 (559) ++.+.++.+..|.+.+ .+..+.++ ....|++ +++++.+++.+.++.+..+..+++.....-++.- T Consensus 388 s~~~~ad~~~kl~~ag-----~is~et~l---~~l~gv~------~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (479) T protein:vir:99 388 SLAQFADAWAKMVESL-----KIPAEGVW---DMIPNLD------QSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRG 453 (479) T ss_pred CHHHHHHHHHHHHhcC-----CCCHHHHH---HhcCCCC------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccC Confidence 1111222233332221 13323222 2223664 4566666555544444333333322111111100 Q ss_pred hhhc-CCChhHH--HHHHHHhhcCCCC Q lcl|NC_019445. 534 SEAK-TSDPSVL--SAMANAVSGQGGQ 557 (559) Q Consensus 534 ~~~~-~~~~~~~--~~~~~~~~~~~~~ 557 (559) +..+ +..+++. .+-.. ..+-+|+ T Consensus 454 ~~~~~~~~~~~~~~~~~~~-~~~~~~~ 479 (479) T protein:vir:99 454 GPNGATNMQQANNKTGEPA-SLNKSGA 479 (479) T ss_pred CCCCCCCCCCCCCCCcchh-ccCCCCC Confidence 0000 0000000 00001 1111111 No 90 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.95 E-value=1e-08 Score=64.33 Aligned_cols=441 Identities=10% Similarity=0.018 Sum_probs=194.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----cCCCC---------CCCCCCcccccCCCCcchHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPR-----GSRFL---------TSEVNRNDRRNTRIIDSTGTMAAR 66 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~-----~~~~~---------~~~~~~~~~~~~~~~~s~~~~a~~ 66 (559) |.-...+++. ..+....+...+++.++.+|..-. +.... 
..........+.|+..+.+...++ T Consensus 1 ~~~e~~~~~i---~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd 77 (471) T protein:vir:10 1 MEIEVIKKII---SSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLD 77 (471) T ss_pred CCHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHH Confidence 6555544444 444444334445566666665311 00000 000011112334566677777777 Q ss_pred HHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC- Q lcl|NC_019445. 67 TLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE- 145 (559) Q Consensus 67 ~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~- 145 (559) ..++-+++ -|+. ++..++. ..+. +...+ ..+|.....++.++..++|.|.+++..+.. T Consensus 78 ~~~~yl~G--~p~~-----~~~~~~~------~~~~-------l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 136 (471) T protein:vir:10 78 QKKAYALT--YPPT-----FDVDDKK------VNDM-------IVDVL-GDDYERISKQLCVNAGNAGIAWLHVWKDASD 136 (471) T ss_pred hhhhhhcc--cCce-----eccCChH------HHHH-------HHHHH-hcCHHHHHHHHHHHHhhCCeEEEEEEeeCCC Confidence 66654433 2222 3333321 1121 22223 367888899999999999999888776643 Q ss_pred ceEEEEEeeccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE------EEeec Q lcl|NC_019445. 146 DIIRTMPFPIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH------SVYPN 217 (559) Q Consensus 146 ~~~~~~~~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~------~v~p~ 217 (559) ..+++..++..+.++-.|. .+++...+|.|...... .+.....+++|+ .+... 
T Consensus 137 g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~-------------------~~~~~~~~~vy~~~~~~~y~~~~ 197 (471) T protein:vir:10 137 NSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDET-------------------DGKNYTVYEYWNDKECSFYRHEK 197 (471) T ss_pred CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccC-------------------CCceeEEEEEEeCCcEEEEEecC Confidence 3478888998888776664 44566666655432110 011111223221 11100 Q ss_pred CcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 218 IDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLI 297 (559) Q Consensus 218 ~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~ 297 (559) ..... ......++.......+.....-...-+|..+|++.++. +.+|.|. .+...+.+-.++.+.-.....+ T Consensus 198 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd-~e~v~~liDa~d~~~S~~~~~~ 269 (471) T protein:vir:10 198 EKPLE--ELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETND-LKPIKDLVDVYDKVFSGFVNDT 269 (471) T ss_pred Ccccc--cccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCc-hHHHHHHHHHHHHHHHHHHHHH Confidence 00000 00000000000000000000001123667788877655 4568885 7888899999999999999999 Q ss_pred HHHhcCceeecCCCc---cc-cceecCCceeecCCcC--CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 298 DKATNPPMVAPTSLK---NQ-RASLLPGDITYIDQIT--GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 298 ~~~~~p~~~~p~~~~---~~-~~~~~pg~~~~~~~~~--~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +...+|.+++.+... .. .-+...++.+.++..+ ....+..+ +.+.+...+...++.+++.|-...... .. 
T Consensus 270 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~tp--~~- 345 (471) T protein:vir:10 270 DDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTI-AIDIPTEARNLILERTKKQIFISGQGV--NP- 345 (471) T ss_pred HHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEE-eecCChHHHHHHHHHHHHHHHHHhCCc--CC- Confidence 999999887765211 11 1233445555443322 11222222 222345566666777777765544321 11 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHH Q lcl|NC_019445. 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKS 450 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~ 450 (559) .....+..|+.-+..+-. .|--..++... .+...+.+.+.++... |.. ...+++++|.-.+.. T Consensus 346 ~~~~~gn~Sg~Alk~~~~----~l~~k~~~~~~-~~~~~l~~~~~li~~~~~~~-------d~~~i~i~f~~~~p~---- 409 (471) T protein:vir:10 346 ETDKLGNSSGVALKFLYS----LLELKAGNMET-QFRSGYATLVKMILKHLGLS-------DKLKIKQTWTRNSIN---- 409 (471) T ss_pred CcccccCccHHHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHHHHHhccC-------CCceeEEEeCCCCCC---- Confidence 111112334433322211 11112233322 2333334555444432 221 123466666544321 Q ss_pred HHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 451 IGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA 530 (559) Q Consensus 451 ~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a 530 (559) + ....++.++.+++ .+.-..++..+ -+++ -.++|++++.+++++++++..-. . T Consensus 410 -n---~~e~~~~~~kl~g-------~iS~et~~~~~---p~v~----D~~~E~eri~~E~~~~~~~~~~~---------~ 462 (471) T protein:vir:10 410 -N---DTEMAQVVSTLAT-------ITSRENVAKSN---PIVE----DWQDELRLQKAEQEGRSEKLYDM---------E 462 (471) T ss_pred -C---HHHHHHHHHHHhc-------cCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhccccc---------C Confidence 1 1111222222222 13333333221 1222 12466666655443332211100 0 Q ss_pred hhhhhhcCCChhHHH Q lcl|NC_019445. 
531 KTLSEAKTSDPSVLS 545 (559) Q Consensus 531 ~~~~~~~~~~~~~~~ 545 (559) +....+.. + T Consensus 463 ----~~~~~~e~--~ 471 (471) T protein:vir:10 463 ----EVEHESEV--E 471 (471) T ss_pred ----CCCCcccc--C Confidence 00000000 0 No 91 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.93 E-value=1.3e-08 Score=63.79 Aligned_cols=466 Identities=9% Similarity=0.031 Sum_probs=189.1 Q ss_pred CChhhHHHHHHH-HHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCCccc-ccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQ-FAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNRNDR-RNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r-~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~~~~-~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.......+... +..+.. |. ++.+.+.+|..-... ........+..+ .+.+...+-+..+++.++..| + T Consensus 23 ~~~~~~~~l~~~l~~~~~~-~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l----~ 94 (501) T protein:vir:25 23 MSREQLGALVADMWRLHIS-ER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL----S 94 (501) T ss_pred CChHHHHHHHHHHHHHHHH-HH---HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh----c Confidence 555444444433 333332 33 455555555431100 000111111001 112334466777777766644 3 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) |.+ |++ +|.+.. +. +.....+.+|....+++.++..+||.|.+++..+.... 
++..++..+ T Consensus 95 ~~g---f~~--~d~~~~--~~-----------l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~-~i~~~sp~~ 155 (501) T protein:vir:25 95 VVG---YRN--ALAKEN--DP-----------AWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGP-VFRTRSPRQ 155 (501) T ss_pred ccc---eec--CCccch--HH-----------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCC-eEEEecccc Confidence 433 443 332211 11 23345678899999999999999999999887765442 344466666 Q ss_pred EEEe-eCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE--EEEeecCcccccccccccccE Q lcl|NC_019445. 158 YYLA-NSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM--HSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 158 ~~v~-~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~--~~v~p~~~~~~~~~~~~~~~~ 232 (559) ..+- .|+. .++...+|.+...- .......+++| ++++.-..............+ T Consensus 156 ~~~iy~D~~~~~~~~~ai~~~~~~~---------------------~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 214 (501) T protein:vir:25 156 ILAVYADPSVDAWPQYALETWVAQK---------------------DAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQA 214 (501) T ss_pred EEEEEecCCCCcceeEEEEEEeecc---------------------ccCcceeEEEecCeeEEEEecCceeeeecccccc Confidence 5433 3433 23433333322110 00001112222 112111110000000000011 Q ss_pred EEEE---EEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_019445. 233 KSVY---YEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPT 309 (559) Q Consensus 233 ~sv~---~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~ 309 (559) .... +.......+...+-+|..+|++.+.=+.. 
.+.+|+|- .+..++-+..++...-.++..++..+.|.+.+.+ T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sd-ie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G 292 (501) T protein:vir:25 215 TQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGE-VAPLILLQQAINSVNFDRLIVSRFGANPQRVISG 292 (501) T ss_pred ccccccccccccccccccccCCccceeeEeccCccc-cCccccch-hhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhC Confidence 1000 11111111222334677889887654443 35688885 6778888899999999999999999998655432 Q ss_pred C-Cc-cccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH Q lcl|NC_019445. 310 S-LK-NQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM 387 (559) Q Consensus 310 ~-~~-~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r 387 (559) - .. ...+...+|.++..+..+ ..+..+ ...+++...+.+..+...|...-... ...+.. .....++.-+... T Consensus 293 ~~~~~~~~~~~~~~~i~~~~~~~--~~~~q~--~~~~~~~~~~~l~~~i~~i~~~s~~P-~~~~~~-~~~N~Sg~Al~~~ 366 (501) T protein:vir:25 293 WTGSKAEVLKASALRVWTFEDPE--VKAQAF--PPASVEPYNLILEEMLQHVAMVAQIS-PAQVTG-KMINVSAEALAAA 366 (501) T ss_pred CCCCccchhhhcccceeccCCCC--ceEEEe--cccChHHHHHHHHHHHHHHHhhcCCC-hhhhcc-ccCChHHHHHHHH Confidence 1 11 122455677766553222 112111 11234444444444444442211110 111111 1112345444333 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 388 KEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 388 ~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) ..-+.. ...+.+.. +.+-+.+.+.++....--... .....+++.+.-++.. 
++.+.++.+..++ T Consensus 367 ~~~l~~----ka~~k~~~-f~~~l~~~~rl~~~~~~~~~~---~~~~~i~v~w~~~~~~--------s~~~~ada~~kl~ 430 (501) T protein:vir:25 367 EANQQR----KLAAKRES-FGESWEQLLRLAAEMDDDPDT---AADSGAEVLWRDTEAR--------SFGAVVDGITKLA 430 (501) T ss_pred HHHHHH----HHHHHHHH-HHHHHHHHHHHHHHHhCCCcc---ccceeeeEEecCCCCC--------CHHHHHHHHHHHH Confidence 222222 11222222 222223333333221111111 1123466665444321 1122233333443 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHH Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAM 547 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 547 (559) +++ |.... .+....|++ +++++++++++++++......+..... .. .....++.++-+.- T Consensus 431 ~~g------is~et---~~~~~~g~~------~~~ie~~~~~~~e~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~ 490 (501) T protein:vir:25 431 SAG------IPIEH---LLSMVPGMT------QQTIQAIKDSLRGGEVKSLVDKLLSNE----PA-PVPPPPPQAAAQAL 490 (501) T ss_pred hcC------CCHHH---HHHHcCCCC------HHHHHHHHHHHHHHhHHHHHHHhhccC----cC-CCCCCCCCCCcccc Confidence 331 21111 233445764 456666665554443322111111100 00 00111111111111 Q ss_pred HHHhhcCCCCC Q lcl|NC_019445. 548 ANAVSGQGGQS 558 (559) Q Consensus 548 ~~~~~~~~~~~ 558 (559) -.+...+.||+ T Consensus 491 ~~~~~~~~~g~ 501 (501) T protein:vir:25 491 NEGGVNGNGGA 501 (501) T ss_pred ccccCCCCCCC Confidence 11122223333 No 92 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.89 E-value=1.9e-08 Score=62.94 Aligned_cols=508 Identities=15% Similarity=0.082 Sum_probs=218.6 Q ss_pred CChh-------hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAET-------TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~~-------~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |++. +.+.+.++|...-+.-..+...|++-.+.+-=.-.+. ..+.. +.... | |-|+|.++ T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~-~~~~~-~~~~r---~--------nl~~sni~ 67 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDE-RDSAH-DAETR---W--------NLFSTNIQ 67 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhcc-ccCCC-ccccc---c--------chhhhhHH Confidence 8882 3466777886654443334445555444443221111 11111 11111 1 45555443 Q ss_pred HhhcCC---CC--cceeccCCccchhhHHHHHHHHHHHHHHHHHHHH--hccchHHHHHHHHHHHhhCcEEEEEee---- Q lcl|NC_019445. 74 SGITSP---AR--PWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN--KSNLYQSLPQLYGSLGTYSTGAMAVLE---- 142 (559) Q Consensus 74 ~~l~pp---~~--~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~--~snf~~~~~~~~~dl~~~G~~~l~v~~---- 142 (559) +|.|. .. |=++=.+.|.+ ..-.+..-+.++|.+...|+ ..+|...+..+..+.++.|-|++++.- T Consensus 68 -~i~P~iYar~P~p~V~~rf~d~d---~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~ 143 (663) T protein:vir:34 68 -TQMASLYGQTPKVSVSRRFADAD---DDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEW 143 (663) T ss_pred -HHhhhhhcCCCcceeeecccCcc---cchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeeccc Confidence 33331 11 11221222222 11234555567777766774 467999999999999998888887532 Q ss_pred ------------cCC---------------ceEEEEEeeccEEEEeeC-CCCCEEEEEEEEeecHHHHHHhcCcccCCHH Q lcl|NC_019445. 143 ------------DDE---------------DIIRTMPFPIGSYYLANS-PRGSVDICFRKFSMTVRQLVQEFGLNNVSES 194 (559) Q Consensus 143 ------------~~~---------------~~~~~~~~~l~~~~v~~d-~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~ 194 (559) ..+ ..+.+..+.-.+|.++.- .--.|+=|.++..||-+++..+||. ++... 
T Consensus 144 ~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~~-~~~~~ 222 (663) T protein:vir:34 144 EEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFDA-DGSRN 222 (663) T ss_pred chhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhcC-Chhhh Confidence 001 145566666666655442 1235788889999999999999963 33222 Q ss_pred HHHHHh------cCC------CCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeee-------cCcccC Q lcl|NC_019445. 195 VKSMWE------SGT------YEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRE-------SGFDEF 255 (559) Q Consensus 195 v~~~~~------~~~------~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~e-------sg~~~~ 255 (559) ...... +++ +..+..| |-|| +... . .|||..+|.. .+|++ .||.-| T Consensus 223 ~~a~~~~~~~~~~~~~~~~~~~~~~a~V-wEIW---dK~~----~------~V~w~~eg~~-~~L~~~~p~lgl~~ffPc 287 (663) T protein:vir:34 223 LWASVPKVGKPKDGKDGQSCHPWDRAEV-WEIW---DKGG----R------KVDWYVEGYS-AVLDTQPDPLGLESFFPC 287 (663) T ss_pred hhhhccCcCCccccCCCCCcchhcCcce-eEEE---ecCC----c------EEEEEEcCcc-eecccCCCCCCCCCCCCC Confidence 211110 000 1112322 2233 2111 1 3677766553 55655 468888 Q ss_pred CeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCcc---ccc-eecCCceeec----- Q lcl|NC_019445. 256 PIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKN---QRA-SLLPGDITYI----- 326 (559) Q Consensus 256 P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~~---~~~-~~~pg~~~~~----- 326 (559) |+...-....++-+=+-.+ + -|=.-++.||.++..+- ...-+++|.+++|.+... 
..+ ...-|..+.+ T Consensus 288 Prpl~~~~~~ds~ipvpd~-~-~y~~~~~E~n~~t~Rin-~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~ 364 (663) T protein:vir:34 288 PKPLLANWTTDKVVPRPDF-V-LAQDLYKEIDLVSTRIT-LLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLT 364 (663) T ss_pred cccccceecCCCeecCCcH-H-HHHHHHHHHHHHHHHHH-HHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhh Confidence 9998877777764444332 2 45556678888777654 455678999998854322 111 1111222222 Q ss_pred --CCcCCchhh--hhhhhccccHHHHHHHHHHHHHHHHHHhh-----cchhhhccCCCCCCcCHHHHHHHHHHHHHHhhh Q lcl|NC_019445. 327 --DQITGQDGF--RPAYLVNPSTADLVADIQDTRQIINSAYF-----VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGP 397 (559) Q Consensus 327 --~~~~~~~~~--~p~~~~~~~~~~~~~~i~~~~~rI~~af~-----~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~ 397 (559) +..+-.+.| -|+-.+.+.+. .+-..+..|+...| .|.+... .-.+-||||-.... +.++. T Consensus 365 ~~~~gg~~k~I~~~pi~~~~~aI~----~l~~~r~qir~d~~qITGiaDi~Rga---~~a~ETatAQ~IKs----q~gS~ 433 (663) T protein:vir:34 365 FADKGGLRGVVDWFPLEPVVAALT----SLRDYRRELVDALHQVTGMADIMRGA---SDPRETAMAQGVKA----KFGSI 433 (663) T ss_pred hhhhcCccchhhcccchhHHHHHH----HHHHHHHHHHHHHHHHHhHHHHhhcc---cCcchhhHHHHHHH----HHHhH Confidence 111110111 11211111122 12223344444433 3322211 12345666644433 33333 Q ss_pred HHHHHHHH---HHHHHHHHHHHHHHh-----------cCCCCC------CchhhCCc---ceEEEee--cH-----HH-H Q lcl|NC_019445. 398 VLERLNDE---CLNPLIDRAFSMMVR-----------KNMLPP------PPDAMEGM---PLKVEYI--SV-----MA-Q 446 (559) Q Consensus 398 v~~~l~~E---~l~Pli~r~~~il~r-----------~g~lp~------~p~~l~g~---~v~~~~i--s~-----La-~ 446 (559) -+..+++| |..-++.-.-.+|-. ...+|. .-..|... .+++.+- |. 
++ + T Consensus 434 RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK 513 (663) T protein:vir:34 434 RLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALR 513 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHH Confidence 34444433 222222222222221 234442 11123222 2344443 32 21 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc-ccCC-HHHHHHHHHH-------------HH Q lcl|NC_019445. 447 AQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT-VIVP-QEQVDQARQQ-------------RA 511 (559) Q Consensus 447 a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-~~rs-~~ev~~~rq~-------------r~ 511 (559) ..+..-+..|..|++.++.+++..|+..+. .-++++.... |++.+ -+.+ .+++.+..++ ++ T Consensus 514 ~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~--l~Ellk~~~~--~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~ 589 (663) T protein:vir:34 514 NEKMEVLSGIASFMQGVAPLAQQVPGSAPF--LLQMLKWSVS--GLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQP 589 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHH--HHHHHHHHhh--cCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchh Confidence 334445567777777777777777764331 1223332222 22211 1100 1111111110 00 Q ss_pred HHHH--HHHHHHHHHHHHHHHhhhhhhcCC--------------ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 512 QQQQ--QQQMMAMGMAAAQGAKTLSEAKTS--------------DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 512 q~~q--~~~~~~~~~~~~~~a~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~ 559 (559) +... 
+|..++.+-+.+|+.-.+..+.++ ..+..++.-.+.-.+.|-++ T Consensus 590 ~~k~~~~q~k~q~~~aeAq~e~q~~~~~~ql~~~~~~~k~~~~a~~~~~~a~q~~~~~~~~r~~ 653 (663) T protein:vir:34 590 DPKVVAQAMKGQQEMAKVQAEVQGDLLRIQAETQANETKERQQAEWNVREAAQKNLISQAARAM 653 (663) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhh Confidence 0000 000000000111111111111111 11222333233333333333 No 93 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.88 E-value=2.1e-08 Score=62.72 Aligned_cols=438 Identities=11% Similarity=0.050 Sum_probs=196.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-----cccCCCCCCCCC-CcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYIN-----PRGSRFLTSEVN-RNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~-----P~~~~~~~~~~~-~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) .++...+.|.+..+.... | ..+.+++.+|.. +.+......... ...+.+.++..+-+...++..++-|++ T Consensus 24 ~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g 99 (474) T protein:vir:95 24 KVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG 99 (474) T ss_pred cccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc Confidence 333333434444444433 2 233444444432 111111100001 111233467777778888877775543 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEee Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFP 154 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~ 154 (559) -| .+++..+.. 
+.+ .+...+ ..+|.....++.++..+||.|.+++..+...-+++..++ T Consensus 100 --~p-----~~~~~~~~~------~~~-------~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~ 158 (474) T protein:vir:95 100 --KP-----VTYAHDDDK------VLD-------VIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVP 158 (474) T ss_pred --cC-----ceeccCChH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEc Confidence 12 223333321 111 122222 367888999999999999999998887766567888899 Q ss_pred ccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 155 IGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 155 l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ..++++-.|. .+++...+|.++.. ....+++|+.-.-. .+...+..+ T Consensus 159 p~~~~~v~d~~~~~~~~a~ir~~~~~-------------------------~~~~~~vy~~~~i~------~~~~~~~~~ 207 (474) T protein:vir:95 159 AEQAIPIWTDKEREQLNAFIRIFTFN-------------------------GETKVEYWTAETVT------YYVYENGGL 207 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEeec-------------------------CeeEEEEEeCCeEE------EEEEcCCce Confidence 8888877664 46776666665421 11234443210000 000000000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC- Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL- 311 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~- 311 (559) .............-...-+|..+|++.++. +.+|.|. .+..++.+-.++.+.-.++..++...+|.+++.+-. 
T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d-~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 281 (474) T protein:vir:95 208 IPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSD-IWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG 281 (474) T ss_pred eeccccccccccCcccccCCCccceEEecC-----CCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc Confidence 000000000011111234677888887754 4678896 888999999999999999999999999987765421 Q ss_pred ---ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 312 ---KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 312 ---~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) ......+..++++.++..++...+.+ +.+.......+..++..|-..-... .......+...|+..+..+- T Consensus 282 ~~~~~~~~~~~~~~~i~~~~~~~~~~l~~----~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~Alk~~~ 355 (474) T protein:vir:95 282 EDLSEFMEGLKYYKAINVSSDGGVETIQV----EVPVASTKEYLDMMRAYIVEFGQGV--DFQTDKFGSATSGIALKFLY 355 (474) T ss_pred ccccchhhhhhccceeeccCCCceeEEec----cCCHHHHHHHHHHHHHHHHHHhCCc--CccccccccccHHHHHHHHH Confidence 11112334445555543333222221 2345555666777777765554321 11111112333443333221 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) . .|--...+. ...+...+.+.+.++.+. 
|.-+ ...+|+++|.-.+..- .++ .++.+.+ + T Consensus 356 ~----~l~~k~~~~-~~~~~~~l~~~~~~i~~~~g~~~------d~~~i~i~f~~~~p~~----~~e----~a~~~~~-~ 415 (474) T protein:vir:95 356 T----NLNLKANKL-KNKANVALQELMQFILDFNKIKL------DAKEIEITFNFNVMVN----DLE----QSQIGAQ-S 415 (474) T ss_pred H----HHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCc------ccceeeEEecCCCccC----HHH----HHHHHHH-c Confidence 1 111111222 223344445555555443 3211 1224556554433210 011 1111111 1 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHH Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAM 547 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 547 (559) + .+.-..++.. ++.- --.++|++++.+++.+++++++.... ..+....+....... T Consensus 416 g-------iiS~et~~~~----lp~v---~D~~~E~eri~~E~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~----- 471 (474) T protein:vir:95 416 Q-------YLSKETLVRH----HPWV---DDPKAELERLDEEQLELNKQLPNLDD-----GGADGAQQQQQSENN----- 471 (474) T ss_pred C-------CCChHHHHHh----CCCC---CCHHHHHHHHHHHHHHHHhhcccccc-----ccCCCCCCcCCCCcc----- Confidence 1 2333333322 2221 11356666666554433222111000 001111110100000 Q ss_pred HHHhhcCCCCCC Q lcl|NC_019445. 548 ANAVSGQGGQSQ 559 (559) Q Consensus 548 ~~~~~~~~~~~~ 559 (559) ++. T Consensus 472 ---------e~~ 474 (474) T protein:vir:95 472 ---------QSK 474 (474) T ss_pred ---------ccC Confidence 000 No 94 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.88 E-value=2.1e-08 Score=62.72 Aligned_cols=438 Identities=11% Similarity=0.050 Sum_probs=196.1 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-----cccCCCCCCCCC-CcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYIN-----PRGSRFLTSEVN-RNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~-----P~~~~~~~~~~~-~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) .++...+.|.+..+.... | ..+.+++.+|.. +.+......... ...+.+.++..+-+...++..++-|++ T Consensus 24 ~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g 99 (474) T protein:vir:96 24 KVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG 99 (474) T ss_pred cccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc Confidence 333333434444444433 2 233444444432 111111100001 111233467777778888877775543 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEee Q lcl|NC_019445. 75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFP 154 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~ 154 (559) -| .+++..+.. +.+ .+...+ ..+|.....++.++..+||.|.+++..+...-+++..++ T Consensus 100 --~p-----~~~~~~~~~------~~~-------~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~ 158 (474) T protein:vir:96 100 --KP-----VTYAHDDDK------VLD-------VIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVP 158 (474) T ss_pred --cC-----ceeccCChH------HHH-------HHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEc Confidence 12 223333321 111 122222 367888999999999999999998887766567888899 Q ss_pred ccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 155 IGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 155 l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ..++++-.|. .+++...+|.++.. ....+++|+.-.-. 
.+...+..+ T Consensus 159 p~~~~~v~d~~~~~~~~a~ir~~~~~-------------------------~~~~~~vy~~~~i~------~~~~~~~~~ 207 (474) T protein:vir:96 159 AEQAIPIWTDKEREQLNAFIRIFTFN-------------------------GETKVEYWTAETVT------YYVYENGGL 207 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEeec-------------------------CeeEEEEEeCCeEE------EEEEcCCce Confidence 8888877664 46776666665421 11234443210000 000000000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC- Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL- 311 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~- 311 (559) .............-...-+|..+|++.++. +.+|.|. .+..++.+-.++.+.-.++..++...+|.+++.+-. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d-~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 281 (474) T protein:vir:96 208 IPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSD-IWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG 281 (474) T ss_pred eeccccccccccCcccccCCCccceEEecC-----CCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc Confidence 000000000011111234677888887754 4678896 888999999999999999999999999987765421 Q ss_pred ---ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 312 ---KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 312 ---~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) ......+..++++.++..++...+.+ +.+.......+..++..|-..-... 
.......+...|+..+..+- T Consensus 282 ~~~~~~~~~~~~~~~i~~~~~~~~~~l~~----~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~Alk~~~ 355 (474) T protein:vir:96 282 EDLSEFMEGLKYYKAINVSSDGGVETIQV----EVPVASTKEYLDMMRAYIVEFGQGV--DFQTDKFGSATSGIALKFLY 355 (474) T ss_pred ccccchhhhhhccceeeccCCCceeEEec----cCCHHHHHHHHHHHHHHHHHHhCCc--CccccccccccHHHHHHHHH Confidence 11112334445555543333222221 2345555666777777765554321 11111112333443333221 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLA 467 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la 467 (559) . .|--...+. ...+...+.+.+.++.+. |.-+ ...+|+++|.-.+..- .++ .++.+.+ + T Consensus 356 ~----~l~~k~~~~-~~~~~~~l~~~~~~i~~~~g~~~------d~~~i~i~f~~~~p~~----~~e----~a~~~~~-~ 415 (474) T protein:vir:96 356 T----NLNLKANKL-KNKANVALQELMQFILDFNKIKL------DAKEIEITFNFNVMVN----DLE----QSQIGAQ-S 415 (474) T ss_pred H----HHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCc------ccceeeEEecCCCccC----HHH----HHHHHHH-c Confidence 1 111111222 223344445555555443 3211 1224556554433210 011 1111111 1 Q ss_pred ccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHH Q lcl|NC_019445. 468 QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAM 547 (559) Q Consensus 468 ~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 547 (559) + .+.-..++.. ++.- --.++|++++.+++.+++++++.... ..+....+....... T Consensus 416 g-------iiS~et~~~~----lp~v---~D~~~E~eri~~E~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~----- 471 (474) T protein:vir:96 416 Q-------YLSKETLVRH----HPWV---DDPKAELERLDEEQLELNKQLPNLDD-----GGADGAQQQQQSENN----- 471 (474) T ss_pred C-------CCChHHHHHh----CCCC---CCHHHHHHHHHHHHHHHHhhcccccc-----ccCCCCCCcCCCCcc----- Confidence 1 2333333322 2221 11356666666554433222111000 001111110100000 Q ss_pred HHHhhcCCCCCC Q lcl|NC_019445. 
548 ANAVSGQGGQSQ 559 (559) Q Consensus 548 ~~~~~~~~~~~~ 559 (559) ++. T Consensus 472 ---------e~~ 474 (474) T protein:vir:96 472 ---------QSK 474 (474) T ss_pred ---------ccC Confidence 000 No 95 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.86 E-value=2.6e-08 Score=62.14 Aligned_cols=443 Identities=9% Similarity=0.011 Sum_probs=203.7 Q ss_pred CCh--hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-------c---ccCCCCCCC--CCCcccccCCCCcchHHHHHH Q lcl|NC_019445. 1 MAE--TTKERLNKQFAQLESERQSFEPHWRELSDYIN-------P---RGSRFLTSE--VNRNDRRNTRIIDSTGTMAAR 66 (559) Q Consensus 1 M~~--~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~-------P---~~~~~~~~~--~~~~~~~~~~~~~s~~~~a~~ 66 (559) +.+ .+.+.|.+..+..+..|..+...++.+..+.. | ....+.... .....+.+.|+..+.+...++ T Consensus 10 ~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd 89 (474) T protein:vir:10 10 IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVD 89 (474) T ss_pred ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHH Confidence 111 13444555555555556555555544433321 1 111110000 011112334666777777777 Q ss_pred HHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc Q lcl|NC_019445. 67 TLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED 146 (559) Q Consensus 67 ~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~ 146 (559) ..++-|++- |+ ++...+.. ....++++ .+.+.+..++|-....++.++..+||.|.+++..+... 
T Consensus 90 ~~~~yl~g~--pv-----~~~~~~~~-~~~e~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~ 154 (474) T protein:vir:10 90 TRVGYLHGV--PV-----TYDLDENA-EKNEKLKK-------FITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNG 154 (474) T ss_pred hHhhheecc--ce-----eEeeCCCC-cchHHHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCC Confidence 766644321 32 23332211 11223333 33445666889999999999999999999998877666 Q ss_pred eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccc Q lcl|NC_019445. 147 IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLD 226 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~ 226 (559) .+++..++..+.++-.|..+.....+|.+.... ..+ ...+..++ +| ..+ T Consensus 155 ~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~---------------------~~~-~~~~~~~~-~y-~~~------- 203 (474) T protein:vir:10 155 DIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD---------------------DDN-GTDYVYAE-FY-DNA------- 203 (474) T ss_pred eeEEEEEcccceEEEEcCCCceEEEEEEEEEee---------------------CCC-ceEEEEEE-EE-cCc------- Confidence 688888888887777777777655454433110 001 11111111 11 100 Q ss_pred cccccEEEEEEEecC-CCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019445. 227 SKNKPFKSVYYEVGG-DNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) Q Consensus 227 ~~~~~~~sv~~~~~~-~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p 303 (559) . .+++...+ +..... .+-+|..+|++.++ ++.+|.|. 
.+...+-+..++.+........+...+| T Consensus 204 ---~---~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:10 204 ---Y---YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGD-AEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ---e---EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0 11122111 111111 12356678877653 45689996 8889999999999999999999999999 Q ss_pred ceeecCCC-cccc-ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCH Q lcl|NC_019445. 304 PMVAPTSL-KNQR-ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPV 381 (559) Q Consensus 304 ~~~~p~~~-~~~~-~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA 381 (559) .+++.+.. .... .....++.+.+...+ ..++.+. ...+...+...+..+++.|...-+.. ......-+...|+ T Consensus 272 ~l~i~g~~~~~~~~~~~~~~~~i~~~~~~--~~~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg 346 (474) T protein:vir:10 272 YLVLRGMGMSEEMIQETQKSGAFELFDKD--MDVKYLT-KDVNDTMIENHLDRIEKNIMRFAKSV--NFNSDEFNGNVPI 346 (474) T ss_pred hhhhccCCCCchhhhhhhhcceeEecCCC--CceeEEe-ccCCHHHHHHHHHHHHHHHHHHhCCc--ccccccccccchH Confidence 88775421 1111 224445555443222 2223221 12244555666777777775544321 1111111234566 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HHHHHHHHHHHHHH Q lcl|NC_019445. 382 EAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QAQKSIGLSSLAST 459 (559) Q Consensus 382 ~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a~r~~~~~~l~~~ 459 (559) ..+..+-.-+.. ......+.-.+.+.-+++-++.++...+.-.. + ..-.++++.|.-++. 
.+..+ T Consensus 347 ~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~-~--~~~~~i~~~f~~~~p~d~~e~a--------- 413 (474) T protein:vir:10 347 IGMKLKLMALEN-KCMTFERKMTAMLRYQFKVILSALKRKGYNLD-D--DSYLNLIFKFTRNIPVNKLEES--------- 413 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC-c--cccccceEEeCCCCCCCHHHHH--------- Confidence 555443322221 22222223233333333334444433332111 1 112246666654332 12111 Q ss_pred HHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 460 VNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 460 ~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) +.+..+++ .+....+++. ++ ++ -.++|++.+.++.++.++...... . +...+... T Consensus 414 -~~~~kl~g-------~iS~et~~~~----l~~v~----d~~~E~eri~~E~~e~~~~~~~~~------~--~~~~~~~~ 469 (474) T protein:vir:10 414 -QVLINLKG-------QVSERTRLGQ----SQLVD----DVDYELDEMEKESLEFNDKLPDID------E--GDANDKSQ 469 (474) T ss_pred -HHHHHHhc-------cCchHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHhhccccc------C--CCcCCCCc Confidence 22222222 1222233332 22 22 134666666555443332211100 0 01111111 Q ss_pred CChhH Q lcl|NC_019445. 539 SDPSV 543 (559) Q Consensus 539 ~~~~~ 543 (559) ++.+- T Consensus 470 ~~~s~ 474 (474) T protein:vir:10 470 NNQSE 474 (474) T ss_pred cccCC Confidence 11111 No 96 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.86 E-value=2.6e-08 Score=62.14 Aligned_cols=443 Identities=9% Similarity=0.011 Sum_probs=203.7 Q ss_pred CCh--hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-------c---ccCCCCCCC--CCCcccccCCCCcchHHHHHH Q lcl|NC_019445. 1 MAE--TTKERLNKQFAQLESERQSFEPHWRELSDYIN-------P---RGSRFLTSE--VNRNDRRNTRIIDSTGTMAAR 66 (559) Q Consensus 1 M~~--~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~-------P---~~~~~~~~~--~~~~~~~~~~~~~s~~~~a~~ 66 (559) +.+ .+.+.|.+..+..+..|..+...++.+..+.. 
| ....+.... .....+.+.|+..+.+...++ T Consensus 10 ~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd 89 (474) T protein:vir:94 10 IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVD 89 (474) T ss_pred ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHH Confidence 111 13444555555555556555555544433321 1 111110000 011112334666777777777 Q ss_pred HHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc Q lcl|NC_019445. 67 TLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED 146 (559) Q Consensus 67 ~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~ 146 (559) ..++-|++- |+ ++...+.. ....++++ .+.+.+..++|-....++.++..+||.|.+++..+... T Consensus 90 ~~~~yl~g~--pv-----~~~~~~~~-~~~e~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~ 154 (474) T protein:vir:94 90 TRVGYLHGV--PV-----TYDLDENA-EKNEKLKK-------FITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNG 154 (474) T ss_pred hHhhheecc--ce-----eEeeCCCC-cchHHHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCC Confidence 766644321 32 23332211 11223333 33445666889999999999999999999998877666 Q ss_pred eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccc Q lcl|NC_019445. 147 IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLD 226 (559) Q Consensus 147 ~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~ 226 (559) .+++..++..+.++-.|..+.....+|.+.... 
..+ ...+..++ +| ..+ T Consensus 155 ~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~---------------------~~~-~~~~~~~~-~y-~~~------- 203 (474) T protein:vir:94 155 DIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD---------------------DDN-GTDYVYAE-FY-DNA------- 203 (474) T ss_pred eeEEEEEcccceEEEEcCCCceEEEEEEEEEee---------------------CCC-ceEEEEEE-EE-cCc------- Confidence 688888888887777777777655454433110 001 11111111 11 100 Q ss_pred cccccEEEEEEEecC-CCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019445. 227 SKNKPFKSVYYEVGG-DNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) Q Consensus 227 ~~~~~~~sv~~~~~~-~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p 303 (559) . .+++...+ +..... .+-+|..+|++.++ ++.+|.|. .+...+-+..++.+........+...+| T Consensus 204 ---~---~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:94 204 ---Y---YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGD-AEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ---e---EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 0 11122111 111111 12356678877653 45689996 8889999999999999999999999999 Q ss_pred ceeecCCC-cccc-ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCH Q lcl|NC_019445. 304 PMVAPTSL-KNQR-ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPV 381 (559) Q Consensus 304 ~~~~p~~~-~~~~-~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA 381 (559) .+++.+.. .... .....++.+.+...+ ..++.+. ...+...+...+..+++.|...-+.. 
......-+...|+ T Consensus 272 ~l~i~g~~~~~~~~~~~~~~~~i~~~~~~--~~~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg 346 (474) T protein:vir:94 272 YLVLRGMGMSEEMIQETQKSGAFELFDKD--MDVKYLT-KDVNDTMIENHLDRIEKNIMRFAKSV--NFNSDEFNGNVPI 346 (474) T ss_pred hhhhccCCCCchhhhhhhhcceeEecCCC--CceeEEe-ccCCHHHHHHHHHHHHHHHHHHhCCc--ccccccccccchH Confidence 88775421 1111 224445555443222 2223221 12244555666777777775544321 1111111234566 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HHHHHHHHHHHHHH Q lcl|NC_019445. 382 EAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QAQKSIGLSSLAST 459 (559) Q Consensus 382 ~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a~r~~~~~~l~~~ 459 (559) ..+..+-.-+.. ......+.-.+.+.-+++-++.++...+.-.. + ..-.++++.|.-++. .+..+ T Consensus 347 ~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~-~--~~~~~i~~~f~~~~p~d~~e~a--------- 413 (474) T protein:vir:94 347 IGMKLKLMALEN-KCMTFERKMTAMLRYQFKVILSALKRKGYNLD-D--DSYLNLIFKFTRNIPVNKLEES--------- 413 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC-c--cccccceEEeCCCCCCCHHHHH--------- Confidence 555443322221 22222223233333333334444433332111 1 112246666654332 12111 Q ss_pred HHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 460 VNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 460 ~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) +.+..+++ .+....+++. ++ ++ -.++|++.+.++.++.++...... . +...+... T Consensus 414 -~~~~kl~g-------~iS~et~~~~----l~~v~----d~~~E~eri~~E~~e~~~~~~~~~------~--~~~~~~~~ 469 (474) T protein:vir:94 414 -QVLINLKG-------QVSERTRLGQ----SQLVD----DVDYELDEMEKESLEFNDKLPDID------E--GDANDKSQ 469 (474) T ss_pred -HHHHHHhc-------cCchHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHhhccccc------C--CCcCCCCc Confidence 22222222 1222233332 22 22 134666666555443332211100 0 01111111 Q ss_pred CChhH Q lcl|NC_019445. 
539 SDPSV 543 (559) Q Consensus 539 ~~~~~ 543 (559) ++.+- T Consensus 470 ~~~s~ 474 (474) T protein:vir:94 470 NNQSE 474 (474) T ss_pred cccCC Confidence 11111 No 97 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.85 E-value=2.7e-08 Score=62.05 Aligned_cols=450 Identities=9% Similarity=0.054 Sum_probs=197.9 Q ss_pred CChhh------HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT------KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~~~------~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.... .+.+.+.-+.....| .++++++.+|..-.-..... .......+...++..+.+...++.+++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 107 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 33211 122333333333333 34555666665421100000 000111123356777788888888887554 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -| ++++..|+. 
..+.+...+..++|.....++.++..+||.+.+++..+....+++..+ T Consensus 108 g--~p-----~~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~ 167 (512) T protein:vir:97 108 G--NP-----IQCQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (512) T ss_pred c--cC-----ceeccCChH-------------HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEE Confidence 3 11 123333321 223455566678899999999999999999999888876666888889 Q ss_pred eccEEEEeeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|.. +++...+|.+.+...+ .. ....-..++||+. .. T Consensus 168 ~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~-------~~----------~~~~~~~~~vyt~----~~------------ 214 (512) T protein:vir:97 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPID-------KT----------DEDEVFTVDLFTS----HG------------ 214 (512) T ss_pred cccceEEEEcCCCCCceEEEEEEEEeeecc-------cc----------ccceEEEEEEEeC----Cc------------ Confidence 988888777753 5666666665432100 00 0000011222211 00 Q ss_pred EEEEEEE-ecCCCc----ee--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 232 FKSVYYE-VGGDND----KL--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 232 ~~sv~~~-~~~~~~----~i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .+++. .++... .. ...-+|..+|++.++ .+..|+|. .+..++.+..++.+.-.+...++...+|. 
T Consensus 215 --i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 286 (512) T protein:vir:97 215 --VYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGD-YEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) T ss_pred --EEEEEecCCCcccccccccccccccCcccceEeec-----CCCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 01111 111100 01 112356778887654 34678996 88899999999999888999999999998 Q ss_pred eeecCCC--ccccce-ecCCceeecC-----------CcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 305 MVAPTSL--KNQRAS-LLPGDITYID-----------QITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 305 ~~~p~~~--~~~~~~-~~pg~~~~~~-----------~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) +++.+.. ....+. ...++.+... ..++...++.+. ...+.......+..++..|-..-+.. .. T Consensus 287 lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~p--~~ 363 (512) T protein:vir:97 287 LLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTP--NM 363 (512) T ss_pred eeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEe-ecCCHHHHHHHHHHHHHHHHHHhCCc--cc Confidence 8765422 111111 1112221110 011112222221 12244555566677776664433321 11 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QAQ 448 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a~ 448 (559) -...-+...|+..+...-. .+........+.-.+.+.-+++-++.++...+..... .+ -.++++.|.-++. 
.++ T Consensus 364 ~~~~~~gn~Sg~Al~~~~~-~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~-~d--~~~i~~~f~~~~p~~~~e 439 (512) T protein:vir:97 364 KDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN-KD--FNTVRYVYNRNLPKSLIE 439 (512) T ss_pred CcccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc-cc--cccceEEeCCCCCcCHHH Confidence 1111123345555443322 2222233333333333333333333333333332211 11 2246666654332 121 Q ss_pred HHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 449 KSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ 528 (559) Q Consensus 449 r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~ 528 (559) . ++.+..++++ +....++..+ -+++ -+++|++++.++++++.++.+... ..... T Consensus 440 ~----------~~~~~kl~gi-------iS~et~~~~l---~~v~----d~~~E~eri~~E~~~~~~~~~~~~--~~~~~ 493 (512) T protein:vir:97 440 E----------LKAYIDSGGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGI--YKDPR 493 (512) T ss_pred H----------HHHHHHHhcc-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhhcc--cCCCC Confidence 1 1222222222 2222223222 1222 235677666665443322221100 00000 Q ss_pred HHhhhhhhcCCChhHHHHH Q lcl|NC_019445. 529 GAKTLSEAKTSDPSVLSAM 547 (559) Q Consensus 529 ~a~~~~~~~~~~~~~~~~~ 547 (559) ....-.+.........+.- T Consensus 494 ~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 494 DINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCCCCCccccccccC Confidence 0000000000000000000 No 98 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.84 E-value=3.1e-08 Score=61.78 Aligned_cols=451 Identities=8% Similarity=0.049 Sum_probs=198.9 Q ss_pred CCh--h----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAE--T----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~--~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.. . ..+.+.+..+.....|. ++++++.+|..-.-..... .......+...++..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhc Confidence 221 1 12233333333333333 4555666665321100000 000111122345667788888887776443 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -|+ +++..++. ..+.+...+..++|.....++.++..+||.+.+++..+....+++..+ T Consensus 108 g--~p~-----~~~~~~~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~ 167 (511) T protein:vir:96 108 G--NPI-----QYQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (511) T ss_pred c--CCc-----eeecCchH-------------HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEE Confidence 2 111 12333321 223456677778999999999999999999999888776666788888 Q ss_pred eccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE-EeecCcccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS-VYPNIDRDTSKLDSKNK 230 (559) Q Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~-v~p~~~~~~~~~~~~~~ 230 (559) +..+.++-.|. .+++...+|.+.....+ ... ..+++|+ ||- .+.. 
T Consensus 168 ~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-------------------~~~---~~~~~~~~iyt-~~~i--------- 215 (511) T protein:vir:96 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTD---EDEVFTVDLFT-SHGV--------- 215 (511) T ss_pred ccceeEEEEcCCCCCceEEEEEEEEeeecc-------------------ccc---cceEEEEEEEe-CCcE--------- Confidence 88887776654 35566666655432111 011 1112222 121 1100 Q ss_pred cEEEEEEEecCCC------ceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 231 PFKSVYYEVGGDN------DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 231 ~~~sv~~~~~~~~------~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) + .|...++.. ..-...-+|..+|++.++- +.+|+|. .+..++.+..++.+.-.+...++...+|. T Consensus 216 -~--~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 286 (511) T protein:vir:96 216 -Y--RYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLNDAM 286 (511) T ss_pred -E--EEEecCCCcccccccccccccccCCceeeEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhCce Confidence 0 011111110 0011223567788877653 4578996 88999999999999999999999999998 Q ss_pred eeecCCCccc--cc-eecCCceeec--------CC--cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 305 MVAPTSLKNQ--RA-SLLPGDITYI--------DQ--ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 305 ~~~p~~~~~~--~~-~~~pg~~~~~--------~~--~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +++.+..... .+ ....+..+.. .. .++...++.+. ...+...+...+..+.+.|...-+..-+. 
T Consensus 287 lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~p~~~-- 363 (511) T protein:vir:96 287 LLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMK-- 363 (511) T ss_pred eeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCcccc-- Confidence 8765532111 11 1112222211 11 11111222221 12245555666777777775444321111 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHH Q lcl|NC_019445. 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSI 451 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~ 451 (559) ...-+...|+..+...-.-+ ........+.-.+.+.-+++-++.++...+.... +.+ -..+++.|.-++..- ... T Consensus 364 ~~~~~~n~Sg~Al~~~~~~l-~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~-~~d--~~~i~~~f~~~~p~n-~~e 438 (511) T protein:vir:96 364 DDNFSGTQSGEAMKYKLFGL-EQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA-NKD--FNTVRYVYNRNLPKS-LIE 438 (511) T ss_pred cccccccchHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc-ccc--cccceEEeCCCCCCC-HHH Confidence 11112345665554443322 2222222333333333333333333333332111 111 124566664433210 111 Q ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 452 GLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSG-VSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA 530 (559) Q Consensus 452 ~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~G-vp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a 530 (559) . ++.+..++++ +....++. .++ ++ -.++|++++.++++.+...++. ......... T Consensus 439 ~-------~~~~~kl~G~-------iS~et~l~----~l~~v~----D~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~ 494 (511) T protein:vir:96 439 E-------LKAYIDSGGK-------ISQTTLMS----LFSFFQ----DPELEVKKIEEDEKESIKKAQK--GIYKDPRDI 494 (511) T ss_pred H-------HHHHHHHhcc-------CChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHhh--ccccCCCCC Confidence 1 1222222222 33333333 232 32 1356777776665433222211 100000000 Q ss_pred hhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
531 KTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ....+.....++. .+.. T Consensus 495 ~~~~~~~~~~~~~------------~~~~ 511 (511) T protein:vir:96 495 NDDEQDDDTKDTV------------DKKE 511 (511) T ss_pred CCCCCCCcccccc------------cccC Confidence 0000000000000 0000 No 99 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.83 E-value=3.2e-08 Score=61.64 Aligned_cols=478 Identities=9% Similarity=0.035 Sum_probs=198.5 Q ss_pred CChhhHHHHHHHHHHHH----------------HHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLE----------------SERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~----------------~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a 64 (559) |-+..+.-+++-...+. .++......|+.+|+==.+ ...+.... ....+..++--+.+..+ T Consensus 3 ~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~-~~~~~~~~--~~~~~~~~~slnl~~~i 79 (522) T protein:vir:47 3 LFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWD-DVQYKNTD--GDIKSRPMNHLPIARTA 79 (522) T ss_pred hHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcc-cccccccC--cchhcccceecchHHHH Confidence 33333222222221111 1111222333333221001 00011111 11111122323556666 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 65 ARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) ++.+|+-+..-.. .++++|+ .+.+.+.+.|...+|+..+.+++....+.|++++-+..|. 
T Consensus 80 ~~~~A~lv~~e~~-------~i~v~d~-------------~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~ 139 (522) T protein:vir:47 80 SKKIASLVYNEQA-------TITTKNE-------------ILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG 139 (522) T ss_pred HHHHhhhhcCCcc-------eeecCCh-------------HHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC Confidence 6666663332111 2233332 2333445577789999999999999999999998776664 Q ss_pred CceEEEEEeeccEEEE-eeCCCCCEEE-EEEEEeecHHHHHH-----hcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Q lcl|NC_019445. 145 EDIIRTMPFPIGSYYL-ANSPRGSVDI-CFRKFSMTVRQLVQ-----EFGLNNVSESVKSMWESGTYEKWIEVMHSVYPN 217 (559) Q Consensus 145 ~~~~~~~~~~l~~~~v-~~d~~G~vd~-i~r~~~~t~~ql~~-----~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~ 217 (559) + .+++..++...++- ..|..|.+.. +|.+...+-.+-.. +|-+-.-.+. ...+. ......+.|-+..|.. T Consensus 140 ~-~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~-~~~~~-~~~~~~~~I~n~ly~~ 216 (522) T protein:vir:47 140 D-KVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADG-QETGS-TNDKKYYRITNELYRS 216 (522) T ss_pred C-ceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeeccccc-ccccc-cccCCceEEEEEEeec Confidence 3 46788899998884 6777776643 34332211111000 0000000000 00000 0011123333333322 Q ss_pred CcccccccccccccEEEEEEEecCCCceeeeecCcccCCeE-EE---Eeee-cCCCcccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 218 IDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIM-AP---RWEV-NGEDVYGSSCPGMLALGPVKALQLLQKR 292 (559) Q Consensus 218 ~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~-~~---rw~~-~~g~~YGrG~P~~~~l~d~~~L~~l~~~ 292 (559) .+.+. -++..|..++.--.+..+..++ .|.. -|.+ .+ .++. ..+++||+|. 
-.++.+.++.||..--+ T Consensus 217 ~~~~~---lG~~v~l~~~~e~~~l~~~~~~--~~~~-~Plf~y~~~~~~N~~~~~splG~S~-~~~~~~~id~lD~~~s~ 289 (522) T protein:vir:47 217 DVNDV---LGQRVNLSELDKYKNLEPVTVF--ENLS-RPLFTYLKTPGMNNKDINSPLGLSI-FDNAKTTIDFINRSYDE 289 (522) T ss_pred CCCcc---cCccccccccccccCCCCceEe--CCCC-cceEEEecCCcccccccCCCcCCch-hhhhHHHHHHHHHHHHH Confidence 21110 0111222222100000111111 1222 2332 22 2333 3478999995 89999999999998888 Q ss_pred HHHHHHHHhcCceeecCCCcccccee-----------cCCceeec--CC-cCCchhhhhhhhccccHHHHHHHHHHHHHH Q lcl|NC_019445. 293 KSQLIDKATNPPMVAPTSLKNQRASL-----------LPGDITYI--DQ-ITGQDGFRPAYLVNPSTADLVADIQDTRQI 358 (559) Q Consensus 293 ~~~~~~~~~~p~~~~p~~~~~~~~~~-----------~pg~~~~~--~~-~~~~~~~~p~~~~~~~~~~~~~~i~~~~~r 358 (559) .....++.-. .+.+|.++.....+. .++...|. +. .++...++.+. ..-......+.++.+-+. T Consensus 290 ~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~ 367 (522) T protein:vir:47 290 FMWEVRMGQR-RVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLT-SPIRANDYILAISEGLKL 367 (522) T ss_pred HHHHHHhccc-eeecchHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeec-cccChHHHHHHHHHHHHH Confidence 8877764433 344444332111111 11111122 11 12222343321 122333444455555555 Q ss_pred HHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEE Q lcl|NC_019445. 359 INSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKV 438 (559) Q Consensus 359 I~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~ 438 (559) |....-. --.++........|||||....+...+...-.-..+. ..|..|+..++.++.-.+++-..+.. 
..++.+ T Consensus 368 i~~~~gl-s~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~-~al~~lv~~i~~l~~~~~~~~~~~~~--~~~i~v 443 (522) T protein:vir:47 368 FEMQIGV-SSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVE-QSIKELCVSMCELGKAVGVYSGEIPE--LDDISV 443 (522) T ss_pred HHHHhCC-CccccCccccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhhccCCCCC--cceeEE Confidence 5432200 0123333344568999999999999988887766664 46677777777776544432221111 224667 Q ss_pred EeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 439 EYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQ 518 (559) Q Consensus 439 ~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~ 518 (559) .+--++..-. ...++.. . ++.+.+ .+....++ ....|++ ++|++++.++- ++.+ T Consensus 444 ~f~D~i~~D~-~~~~~~~---~----~~v~aG-----~~s~e~~i---~~~~g~~------eeea~~el~ri-~~E~--- 497 (522) T protein:vir:47 444 NLDDGVFTDR-HAELDYW---A----KMVAAG-----FSTKKRAI---GKTLNIS------GVEAEKELNAI-NSEL--- 497 (522) T ss_pred EcCCCCCCCH-HHHHHHH---H----HHHhcC-----CCCHHHHH---HhcCCCC------hHHHHHHHHHH-HHhh--- Confidence 7765543111 1111111 1 111112 13333333 3455653 34432221110 0000 Q ss_pred HHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 519 MMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 519 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .++........+ ++.+ -...++... 
T Consensus 498 -~~~~~~~~~~~~--~~~~-------------~~~~~d~~~ 522 (522) T protein:vir:47 498 -LPMNDAELAIYG--MHDQ-------------NEEKADDKG 522 (522) T ss_pred -ccCCCCCCCCCC--CCCc-------------ccccCCCCC Confidence 000000000000 0000 001111111 No 100 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.83 E-value=3.4e-08 Score=61.54 Aligned_cols=428 Identities=9% Similarity=-0.003 Sum_probs=198.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc--ccCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINP--RGSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P--~~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) ....+.+.|.+..+.... |.+....++++|+=.-+ .+.+. .........+...++..+.+...++..++-|++ - T Consensus 23 ~~~~~~~~i~~~i~~~~~-~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g--~ 99 (468) T protein:vir:96 23 QYETQEEMILRLITKHKE-NVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVA--N 99 (468) T ss_pred cccCcHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhcc--C Confidence 333344444444444443 44555556666543321 11100 000011111223467677777777777765542 2 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) |+. ++..|+. 
..+.| ...+ ..||...+.++.++..+||.+++++..+....+++..++..+ T Consensus 100 p~~-----~~~~d~~------~~~~l-------~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 160 (468) T protein:vir:96 100 PVT-----YGTEDEK------SLKTI-------QEVL-NHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQ 160 (468) T ss_pred Cce-----eccCChH------HHHHH-------HHHH-hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccc Confidence 222 2333322 22222 2233 257888889999999999999988877766668888888888 Q ss_pred EEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeecCccccccccccc Q lcl|NC_019445. 158 YYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM------HSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 158 ~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~------~~v~p~~~~~~~~~~~~~ 229 (559) .+.-.| ..+++..++|.+...- ...++++ +......... T Consensus 161 ~~~v~~~~~~~~~~~~ir~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~-------- 207 (468) T protein:vir:96 161 AIPIWTNKERDELKAFIRLYELDG-------------------------GERVEYWTANDVTFYELKDGQLI-------- 207 (468) T ss_pred eEEEEcCCCCCceEEEEEEEEecC-------------------------ceEEEEEeCCeEEEEEEcCCcee-------- Confidence 776554 3577776666654321 0112221 1111110000 Q ss_pred ccEEEEEEEecCCCce-e--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_019445. 230 KPFKSVYYEVGGDNDK-L--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV 306 (559) Q Consensus 230 ~~~~sv~~~~~~~~~~-i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~ 306 (559) ........+.... . ...-+|..+|++.++ ++.+|.|. 
.....+.+..++.+.-..+..++..++|.++ T Consensus 208 ---~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd-~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv 278 (468) T protein:vir:96 208 ---PDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSD-LFMYKTIIDAMDKRLSDTQNTFDEATELIYV 278 (468) T ss_pred ---ecccccccccccceeeccccccCCcccEEEec-----CCCCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 0000000000000 1 112356678877663 35679996 7889999999999999999999999999888 Q ss_pred ecCCC----ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH Q lcl|NC_019445. 307 APTSL----KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) Q Consensus 307 ~p~~~----~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~ 382 (559) +.+.. .....+...++++.++..++ ..++.+. .+.+...+...++.++..|-..-+.. .......+...|+. T Consensus 279 ~~g~~~~~~~~~~~~~~~~~~i~~~~d~~-~~~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~~~n~Sg~ 354 (468) T protein:vir:96 279 LKGYEGEDLEEFMYNLKYYKAINVDGDGS-GGVDTIQ-IDVPVQSAKEYLDMLRDYVIEFGQGV--DFQQDKFGNSPSGI 354 (468) T ss_pred eecCCccccchhhhhhhcCceEEecCCCC-CcceEEe-ecCChHHHHHHHHHHHHHHHHHhCcc--cccccccccchHHH Confidence 76522 11122334455555543222 2222221 22345555666777777765554321 11111223345665 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 383 Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~ 462 (559) .+..+..-.... ..+. ...+...+.+++.++.+..-.. ..-.++.+++.-.+..- ..+ .++. 
T Consensus 355 Alk~~~~~l~~k----~~~k-~~~~~~~l~~~~~li~~~~g~~-----~d~~~i~i~f~~~~p~d----~~e----~a~~ 416 (468) T protein:vir:96 355 ALKFMYSNLDLK----ANKL-KNKTLTALQELLQYIIDFYKLS-----IKVQDVEITFNFNVMVN----ELE----QSQI 416 (468) T ss_pred HHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCCC-----cccceeeEEecCCCCcC----HHH----HHHH Confidence 544332221111 1222 2234444455555554432111 11224556554443311 111 1111 Q ss_pred HHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 463 IGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 463 ~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) +... + .+.-..+++.+ -+++ -.++|++++.+++++..+.+. +.-+.....++ T Consensus 417 ~~~~-g-------~iS~et~i~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~---------~~~~~~~~~~~ 468 (468) T protein:vir:96 417 GVNS-Q-------YLSKETVVTNH---PWVD----DPVAEMERIDQEELALPSIEE---------GLNGKENNEPT 468 (468) T ss_pred HHhc-C-------CCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHhh---------ccCCCCCCCCC Confidence 1111 1 23333333322 1232 124666666554432222111 11111112222 No 101 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.80 E-value=4.1e-08 Score=61.07 Aligned_cols=448 Identities=9% Similarity=0.055 Sum_probs=198.2 Q ss_pred CC--hh----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---cCCCCCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MA--ET----TKERLNKQFAQLESERQSFEPHWRELSDYINPR---GSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~--~~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) |. +. ..+.+.+..+.....|.+ +++++.+|..-. ...... . 
....+...++..+.+...++..++- T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~-~-~~~~~~~~ki~~n~~k~Iv~~~~~y 105 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTR-R-KEEYMADNRVAHDYASYISDFINGY 105 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCc-C-cccccCcceeecchHHHHHHHHhhh Confidence 22 11 123344433343444433 445555554321 100000 0 1111233467778888888887765 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) |++ -| ++++..++. ..+.+...+..++|.....++.++..+||.|.+++..+....+++. T Consensus 106 l~g--~p-----~~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~ 165 (511) T protein:vir:93 106 FLG--NP-----IQYQDDDKD-------------VLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLY 165 (511) T ss_pred hcc--cC-----eeeccCChH-------------HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEE Confidence 533 11 123333321 2234555666788999999999999999999998888766667888 Q ss_pred EeeccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 152 ~~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~ 229 (559) .++..+.++-.|. .+++...+|.+..... +... ...+..++...+.. 
T Consensus 166 ~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~-------------------~~~~-~~~~~~~~iyt~~~----------- 214 (511) T protein:vir:93 166 KSDAMSTFVIYDNTIERNSIAGVRYLRTKPI-------------------DKTD-EDEVFTVDLFTSHG----------- 214 (511) T ss_pred EEccceeEEEEcCCCCCceEEEEEEEEeeec-------------------cccc-cceEEEEEEEeCCc----------- Confidence 8888888776664 3666665565543210 0111 11121221111110 Q ss_pred ccEEEEEEEecCCC-----cee--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019445. 230 KPFKSVYYEVGGDN-----DKL--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN 302 (559) Q Consensus 230 ~~~~sv~~~~~~~~-----~~i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~ 302 (559) .++|..++.. ... ...-+|..+|++.++- +..|+|. .+..++.+..++.+.-.+...++...+ T Consensus 215 ----i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~gd-~e~v~~liDa~d~~~S~~~~~~~~~~~ 284 (511) T protein:vir:93 215 ----VYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLND 284 (511) T ss_pred ----EEEEEecCCCccccccccccccccCCCccceEEecC-----CCCCCCc-hhhHHHHHHHHHHHHHHHHHHHHHhhC Confidence 0111111110 000 1223567888877653 4578896 888999999999998899999999999 Q ss_pred CceeecCCCcc--ccc-eecCCceeec--------CC--cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_019445. 303 PPMVAPTSLKN--QRA-SLLPGDITYI--------DQ--ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFM 369 (559) Q Consensus 303 p~~~~p~~~~~--~~~-~~~pg~~~~~--------~~--~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~ 369 (559) |.+++.+.... ... ....+++... .. .++...++.+. .+.+...+...+..++..|...-+..-+. 
T Consensus 285 ~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~L~~~I~~~s~~P~~~ 363 (511) T protein:vir:93 285 AMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMK 363 (511) T ss_pred cceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCcccc Confidence 98876542211 111 1111221111 10 11112222221 12345555666777777775444322111 Q ss_pred hccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHH Q lcl|NC_019445. 370 MLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQK 449 (559) Q Consensus 370 ~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r 449 (559) ...-+...|+..+...-.- +........+.-.+.+.-+++-++.++...+.... +.+ -..+++.|.-.+.. T Consensus 364 --~~~~~~n~Sg~Al~~~~~~-l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~-~~d--~~~i~~~f~~~~p~--- 434 (511) T protein:vir:93 364 --DDNFSGTQSGEAMKYKLFG-LEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA-NKD--FNTVRYVYNRNLPK--- 434 (511) T ss_pred --cccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-ccc--cccceEEeCCCCCC--- Confidence 1111233455544433222 22222223333333333333333333333332211 111 22456666433221 Q ss_pred HHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 450 SIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQG 529 (559) Q Consensus 450 ~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~ 529 (559) ++...++.+..++++ +....++..+ -+++ -+++|++++.++++.+..+++. .... . T Consensus 435 -----n~~e~~~~~~kl~g~-------iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~--~~~~---~ 490 (511) T protein:vir:93 435 -----SLIEELKAYIDSGGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQK--GIYK---D 490 (511) T ss_pred -----CHHHHHHHHHHHhcc-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhh--hccc---C Confidence 011112223333332 3323333322 1232 2356677666655433222111 0000 0 Q ss_pred HhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
530 AKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 530 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) -+.+.+....+++ .-...| T Consensus 491 ~~~~~~~~~~~~~-----------~~~~~~ 509 (511) T protein:vir:93 491 PRDINDDEQDDDT-----------KDTVDK 509 (511) T ss_pred CCCCCCCCCCCcc-----------cccccc Confidence 0111000000000 000000 No 102 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.79 E-value=4.9e-08 Score=60.67 Aligned_cols=438 Identities=11% Similarity=0.042 Sum_probs=197.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc-----ccCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINP-----RGSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P-----~~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~ 74 (559) ..+...+.|.+..+..+. ...+++.+.+|..- .+... .........+.+.++..+.+...++..++-|++ T Consensus 23 ~~~~~~~~i~~~i~~~~~----~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g 98 (478) T protein:vir:10 23 KYETQEEMILRLVREHKE----NIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVA 98 (478) T ss_pred ccCChHHHHHHHHHHHHH----HHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcc Confidence 222233333343443332 23455555555421 01000 000001111223456677777788877775554 Q ss_pred hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEee Q lcl|NC_019445. 
75 GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFP 154 (559) Q Consensus 75 ~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~ 154 (559) -| +.+...+++ ..+.+...+ ..+|.....++.++..+||.+.+++..|....+++..++ T Consensus 99 --~p-----~~~~~~~~~-------------~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~ 157 (478) T protein:vir:10 99 --NP-----VTFGVDNDK-------------ALKQIQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVP 157 (478) T ss_pred --cC-----ceeecCChH-------------HHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEc Confidence 12 223333322 111222233 257888999999999999999998887776678888888 Q ss_pred ccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE--Ee-ecCccccccccccc Q lcl|NC_019445. 155 IGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS--VY-PNIDRDTSKLDSKN 229 (559) Q Consensus 155 l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~--v~-p~~~~~~~~~~~~~ 229 (559) ..+.+.-.| ..|++...+|.+...- ...+++++. |+ -+.+. + T Consensus 158 p~~~~~v~d~~~~~~~~~~ir~~~~~~-------------------------~~~~~~y~~~~i~~~~~~~--------~ 204 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELDG-------------------------AERVEYWTKDDVTFYELKE--------G 204 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEeeeC-------------------------ceEEEEEeCCcEEEEEecC--------C Confidence 888776554 3677777666654220 112222210 00 00000 0 Q ss_pred ccEEEEEEEecCCCceee---eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_019445. 230 KPFKSVYYEVGGDNDKLL---RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV 306 (559) Q Consensus 230 ~~~~sv~~~~~~~~~~il---~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~ 306 (559) ..+........+...... ..-+|..+|++.++. +.+|+|. 
.+...+.+..++.+.-.....++...+|.++ T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd-~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~ 278 (478) T protein:vir:10 205 QLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSD-LFMYKTIIDALDKRLSDTQNTFDESVELIYI 278 (478) T ss_pred eeeccccccccccccceecccccccCCcceEEEecc-----CCCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhhCccee Confidence 000000011111111111 123567888887765 3578995 8888899999999999999999999999877 Q ss_pred ecCC-C---ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH Q lcl|NC_019445. 307 APTS-L---KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) Q Consensus 307 ~p~~-~---~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~ 382 (559) +.+- + .....++..++++.++...+ ..++.+ +.+.+...+...++.+++.|-..-+..-+. ....+...|+. T Consensus 279 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~--~~~~~~n~Sg~ 354 (478) T protein:vir:10 279 LKGYEGEDMKDFMHNLKYYKAISVAGESG-SGVDTI-KVEVPIDSVKEYTKMLRDYIIEFGQGVDFQ--QDKFGNSPSGI 354 (478) T ss_pred eecCCcccccchhhhhhhCceeEecCCCC-CcceEE-eecCCHHHHHHHHHHHHHHHHHHhCCcCcC--ccccccchHHH Confidence 6431 1 11112233444444433222 222322 223355666667777777765544321111 11112334554 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 383 Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~ 462 (559) .+..+-.-+... ..+. ...+.+.+++++.++.+.--. .....+|++++.-.+.. + ....++. 
T Consensus 355 Ai~~~~~~l~~k----~~~~-~~~~~~~l~~~~~li~~~~~~-----~~d~~~i~i~f~~~~p~-----~---~~e~~~~ 416 (478) T protein:vir:10 355 ALKFMYSNLDLK----ANKL-KNKTLTALQELLQYIIDFYRL-----DVRVQDIEITFNFNVMV-----N---ELENSQI 416 (478) T ss_pred HHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCC-----CcccccceEEeCCCCCC-----C---HHHHHHH Confidence 443322221111 1222 223444555555555442111 11223456666444321 0 1111222 Q ss_pred HHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 463 IGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 463 ~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) ++.++++ +.-+.++..+ -+++ -.++|++++.++..+.+++. ...-+.. +..+ T Consensus 417 ~~~~~g~-------iS~et~i~~~---~~v~----d~~~E~~ri~~E~~~~~~~~---------~~~~~~~----~d~~- 468 (478) T protein:vir:10 417 AMNSTGL-------LSKETILGNH---SWVQ----DPVAEMERIEQENIELNQQL---------PDIEEGL----NDEQ- 468 (478) T ss_pred HHHHhCC-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhc---------cccCCCC----cccc- Confidence 2233222 3322222221 1232 13466666655544332211 1111110 0000 Q ss_pred HHHHHHHHhhcCCCCCC Q lcl|NC_019445. 543 VLSAMANAVSGQGGQSQ 559 (559) Q Consensus 543 ~~~~~~~~~~~~~~~~~ 559 (559) ...+..++++ T Consensus 469 -------~~~~~d~~~e 478 (478) T protein:vir:10 469 -------QRQSEDNQSE 478 (478) T ss_pred -------cccCcCCCCC Confidence 0111112222 No 103 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.77 E-value=5.6e-08 Score=60.32 Aligned_cols=465 Identities=13% Similarity=0.079 Sum_probs=211.2 Q ss_pred CChhhHHHHHHHHHHHHHHh--------------hhHHHHHHHHHHHhccccCCCCCCCCCCc-c-cccCCCCcchHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESER--------------QSFEPHWRELSDYINPRGSRFLTSEVNRN-D-RRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R--------------~~~~~~w~e~~~~~~P~~~~~~~~~~~~~-~-~~~~~~~~s~~~~a 64 (559) |.-.. + .|+.-+.-| ...+..++.+.+|..=.-+... ...++ . +-...++++.+... T Consensus 1 ~~~~~-~----~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~--~~lrg~~~~~~r~~~~ps~~~~ 73 (527) T protein:vir:10 1 MGQDK-R----QYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQ--VILRGGDEGDQRPIYVPNGEKL 73 (527) T ss_pred CCccc-c----ccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhhee--eecCCccccccceeeehhhHHh Confidence 21110 0 000000000 0112223334444321100000 00001 1 11233667777443 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 65 ARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) ++. .+-.+.| +..|+- + +. -++|+..+...+++.|++....++-.+.++-|-|++.+-+|+ T Consensus 74 ~~~----~~~~~~~-g~~~~~----~----~~------~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~ 134 (527) T protein:vir:10 74 IEA----KMRFLGQ-GLKWEF----S----KK------DAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDD 134 (527) T ss_pred hCC----cceeecc-Cccccc----c----ch------hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecc Confidence 332 2222332 333421 1 11 113566667788889999999999999999999999999887 Q ss_pred Cc----eEEEEEeeccEEEEeeCCCC--CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHH---------hcCCCCce-- Q lcl|NC_019445. 145 ED----IIRTMPFPIGSYYLANSPRG--SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMW---------ESGTYEKW-- 207 (559) Q Consensus 145 ~~----~~~~~~~~l~~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~---------~~~~~~~~-- 207 (559) ++ .++.+.+-.+.|+.-+|++| .|-.+|- +..|. +|++-++.+ ...+++.. 
T Consensus 135 ~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~---------~~~~~---~P~d~~~~~~~ar~~~~~~~l~~~g~~~ 202 (527) T protein:vir:10 135 EKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYL---------VDEYP---HPDSEKKNEKCARVQKYMKTLDDDGKPV 202 (527) T ss_pred CCCcCCCceEeecCcceeeeeecCCCCCceeeEEE---------eeecc---CCccccccceehhhhhhhhhcCcccccc Confidence 65 48888999999998888765 3444432 12232 222221111 11111111 Q ss_pred ------EEEEEEEeecCc-ccccccccccccEEEEEEEecCCCceeeee--cCcccCCeEEEEeeecCCCcccccchHHH Q lcl|NC_019445. 208 ------IEVMHSVYPNID-RDTSKLDSKNKPFKSVYYEVGGDNDKLLRE--SGFDEFPIMAPRWEVNGEDVYGSSCPGML 278 (559) Q Consensus 208 ------v~v~~~v~p~~~-~~~~~~~~~~~~~~sv~~~~~~~~~~il~e--sg~~~~P~~~~rw~~~~g~~YGrG~P~~~ 278 (559) ++..|+-..+++ .+.-.+++.. | ... .+...+.+ -++.-.|+++++=...++++||+|. ..+ T Consensus 203 ~~G~~~yt~~~w~lg~w~d~~e~p~~~~~--~-----~~~-~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~-La~ 273 (527) T protein:vir:10 203 PGGAIKYTEELYEPGKWDDRPESPLEPDD--I-----KKL-STLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSG-LAG 273 (527) T ss_pred cCcceeeeeceeeccccccccccccchhh--h-----hhh-cCceeeecccCCCCccceEeecCCCccccccChhh-HhH Confidence 111111111110 0000000000 0 000 01122222 2466788888888888999999995 889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeecC----C--CccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 279 ALGPVKALQLLQKRKSQLIDKATNPPMVAPT----S--LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 279 ~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~----~--~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) .+.-+..||.........+...-.|...... + ++..++.+.||+++-.+..+. +.-+. 
..+.++.+.+.+ T Consensus 274 ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak---~~~v~-~~~~la~~~~h~ 349 (527) T protein:vir:10 274 LESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNK---IYRVN-GVASLEPSQTHM 349 (527) T ss_pred HHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCcc---eeecc-chhhhHHHHHHH Confidence 9999999998888888888888777655421 2 344456788998886644321 22111 113555566666 Q ss_pred HHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHH-HHHHHHH-HHH--------HHHhcC Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEC-LNPLIDR-AFS--------MMVRKN 422 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~-l~Pli~r-~~~--------il~r~g 422 (559) ..+..+|...--..-. .++..+..+ .-+.+. +...|+|++.+.+..- +.-.+.| +.. ...+-+ T Consensus 350 ~~L~~~l~~vA~~Pav-A~G~vD~s~-~~SG~A-----LeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~ 422 (527) T protein:vir:10 350 TKAEEAMQQTKGIPDI-AVGVVDAAV-AESGIA-----LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVG 422 (527) T ss_pred HHHHHHHHHhhcCCee-eeccccCCc-CcHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcc Confidence 6766666554422110 111112221 112222 2344666666665542 2222222 111 111111 Q ss_pred CCCCCchhhCCcceEEEe--ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCH Q lcl|NC_019445. 423 MLPPPPDAMEGMPLKVEY--ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQ 500 (559) Q Consensus 423 ~lp~~p~~l~g~~v~~~~--is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~ 500 (559) .-+ ......+++.+ ..|....+- ++.+..+.+.+ -+....+++.+.++.|+. 
-.+ T Consensus 423 ~~d----~~~~~~v~ivf~p~lP~D~~av----------ie~v~tL~~aG-----i~S~~tAv~~L~~~~g~e----D~E 479 (527) T protein:vir:10 423 IDD----ADKKLTVTITFRDPKPVNSEKR----------FNQLLQLWEAG-----LIPAKKLTEELSKIMGFE----LTE 479 (527) T ss_pred cCC----CccccceEEEecccCCCCHHHH----------HHHHHHHHHcC-----chhHHHHHHHHHhccCCC----ChH Confidence 100 01111345544 445443221 22222332322 377888899999888852 334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 501 EQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLS-EAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 501 ~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .|++++.+.++++.-++. .+.+.-.++++..++ +....++ ++|+.- T Consensus 480 ~E~~~I~~era~~a~a~a-~A~~~~~a~~~~~~g~~~~~~d~------------~~~~~~ 526 (527) T protein:vir:10 480 EDFKQATEDKKTQGIAQA-EAADPFGAQMAAEQGIPDEEDDQ------------ALNGQP 526 (527) T ss_pred HHHHHHHHHHHHHhHHhh-hhcCchhhhhccccCCCCCCccc------------ccCCCC Confidence 566677655544332221 111111122111110 0000000 011111 No 104 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.77 E-value=5.7e-08 Score=60.30 Aligned_cols=475 Identities=9% Similarity=0.063 Sum_probs=198.0 Q ss_pred CChhhHHHHHHHHH-----HHH-----------HHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFA-----QLE-----------SERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~-----~l~-----------~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a 64 (559) |-+..+.-+.+--. .++ ..-.....+|+.+|+=-.|.......+. 
...+..+.-=+.+..+ T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~---~~~~~~~~sl~~~~~i 79 (517) T protein:vir:98 3 VIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQG---KIQERDYMTLNLRKLS 79 (517) T ss_pred hHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccc---cccccceeecCcHHHH Confidence 33322211111100 011 1112244556666543333321111111 1111122222345555 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccch-----hhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEE Q lcl|NC_019445. 65 ARTLASGMMSGITSPARPWFRLATPDPEM-----MDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMA 139 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~-----~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 139 (559) ++.+|+ .+|.- .+= +.++|.+. ......+++|+ +.+..++|+..+.+++.+..+.|++++- T Consensus 80 ~~~~A~----Ll~~e-~~~--i~v~d~~~~~~~~~~~~~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a~G~~a~k 145 (517) T protein:vir:98 80 ADVLSG----LVFNE-QCE--VYVSDAKDEEKKDNSFKTAHEFIQ-------HVFQHNKFIKNLSDYLEPTFALGGLTVR 145 (517) T ss_pred HHHhhh----hhcCC-cce--EEecccccccccccchhHHHHHHH-------HHHHhccHHHHHHHHHHHHhhhCCEEEE Confidence 555555 44531 111 33333221 11123344444 4666789999999999999999999997 Q ss_pred EeecCCceEEEEEeeccEEEE-eeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHHHHhc-CCCCceEEEEEEEee Q lcl|NC_019445. 140 VLEDDEDIIRTMPFPIGSYYL-ANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKSMWES-GTYEKWIEVMHSVYP 216 (559) Q Consensus 140 v~~~~~~~~~~~~~~l~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~-~~~~~~v~v~~~v~p 216 (559) +..|.++ +++..++...|+- ..|.+|.+..+| .++..+.++-...|= .|--. .+.. ...+.++.|.+.+|. 
T Consensus 146 ~~~d~~~-~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt--~lE~H---~~~~~~~~~~~y~I~n~ly~ 219 (517) T protein:vir:98 146 PYVDNGE-IEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYT--LLEFH---EWEKTEEGESLYVITNELYK 219 (517) T ss_pred EEEeCCe-eEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEE--EEEEE---ecCceeccCCcEEEEEEEEe Confidence 7776544 5577788888775 667777654433 333322111000000 00000 0000 000112344444443 Q ss_pred cCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEE----EEee-ecCCCcccccchHHHHHHHHHHHHHHHH Q lcl|NC_019445. 217 NIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMA----PRWE-VNGEDVYGSSCPGMLALGPVKALQLLQK 291 (559) Q Consensus 217 ~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~----~rw~-~~~g~~YGrG~P~~~~l~d~~~L~~l~~ 291 (559) ..+.. .-++..|..++|-+. .+..+ ..|.. .|.++ +..+ ...+++||+|. -.++++.++.||..-- T Consensus 220 s~~~~---~lG~~v~L~~~~e~l--~~~~~--~~g~~-~Plf~y~~~p~~N~~~~~splG~S~-~~~a~~~~d~lD~~~s 290 (517) T protein:vir:98 220 SDNEG---EIGKRIPLEELYEGM--QEKTY--IQGLS-RPLFNYLKPSGFNNINPHSPLGLGI-TDNSVSTLKKINDTYD 290 (517) T ss_pred cCCCc---cccccccccccccCC--Cccee--ECCCC-cceEEEecCCcccccccCCCCCCch-hhhhHHHHHHHHHHHH Confidence 21110 011122333332111 11111 12221 24222 1223 33368899995 8899999999999999 Q ss_pred HHHHHHHHHhcCceeecCCCccccce---ecCC------ceeec--CCcCCchhhhhhhhccccHHHHHHHHHHHHHHHH Q lcl|NC_019445. 292 RKSQLIDKATNPPMVAPTSLKNQRAS---LLPG------DITYI--DQITGQDGFRPAYLVNPSTADLVADIQDTRQIIN 360 (559) Q Consensus 292 ~~~~~~~~~~~p~~~~p~~~~~~~~~---~~pg------~~~~~--~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~ 360 (559) +.+.-.++ .+.++.+|.++.....+ ..++ ...|. ....+...++. ++..-......+.++.+-+.|. 
T Consensus 291 ~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~-~~~~iR~e~~~~~~~~~L~~i~ 368 (517) T protein:vir:98 291 QFWWEIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKD-VTHDIRTEQYKEAINQALRTLE 368 (517) T ss_pred HHHHHHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceee-eccccchHHHHHHHHHHHHHHH Confidence 98887777 56677777665321111 1111 11111 11122222221 1111122334445555555443 Q ss_pred HHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe Q lcl|NC_019445. 361 SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY 440 (559) Q Consensus 361 ~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~ 440 (559) ...-. -..++........|||||..+.+...+...-+-..+ ...|.-++.-++.+..-.+++..... ...++.|.+ T Consensus 369 ~~~Gl-s~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~-~~aL~~lv~~i~~l~~~~~~~~~~~~--~~~~v~v~f 444 (517) T protein:vir:98 369 MELKL-SVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEV-EQFIKGLVISVLELAKTYKLFGGEIP--SAEHIGVDF 444 (517) T ss_pred HHhCC-CcccccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCC--CCcceEEEc Confidence 22200 012333334455799999999998887777654444 34556665555544433333322110 122477777 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMM 520 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~ 520 (559) --++..- +...++... ++.+.+ .+....+ +...+|+ |++|.+++.++- +. 
+ T Consensus 445 ~D~i~~D-~~~~~~~~~-------~~v~aG-----~ms~~~~---i~~~~g~------~eeeA~~e~~~i-~~---E--- 495 (517) T protein:vir:98 445 DDGVFQD-RSALLRFYG-------QAKTFG-----FIPTVEA---IQRIFKV------PKKTAEQWLEEI-RK---D--- 495 (517) T ss_pred CCCCCCC-HHHHHHHHH-------HHHhcC-----CCCHHHH---HHHhCCC------ChHHHHHHHHHH-HH---h--- Confidence 5554311 111111111 111111 1333333 3445575 344432221110 00 0 Q ss_pred HHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 521 AMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 521 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .+ .+ .+... .+ ....+-.|..+ T Consensus 496 -~~--~~------~~~~~-----~~---~~~~~~~gd~e 517 (517) T protein:vir:98 496 -QI--EL------DPVTI-----SQ---RAQKRMFGDEE 517 (517) T ss_pred -cc--cc------CCCCc-----cc---cccCCCCCCCC Confidence 00 00 00000 00 00000011111 No 105 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.75 E-value=6.3e-08 Score=60.06 Aligned_cols=465 Identities=12% Similarity=0.076 Sum_probs=211.3 Q ss_pred CChhhHHHHHHHHHHHHHHh--------------hhHHHHHHHHHHHhccccCCCCCCCCCCc-c-cccCCCCcchHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESER--------------QSFEPHWRELSDYINPRGSRFLTSEVNRN-D-RRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R--------------~~~~~~w~e~~~~~~P~~~~~~~~~~~~~-~-~~~~~~~~s~~~~a 64 (559) |.-.. + .|+.-+.-| ...+..++.+.+|..=.-+... ...++ . +-...++++.+... T Consensus 1 ~~~~~-~----~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~--~~lrg~~~~~~r~~~~ps~~~~ 73 (527) T protein:vir:10 1 MGQDK-R----QYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQ--VILRGGDEGDQRPIYVPNGEKL 73 (527) T ss_pred CCccc-c----ccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhhee--eecCCccccccceeeehhhHHh Confidence 21110 0 000000000 0112223334444321100000 00001 1 11233667777443 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 
65 ARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) ++. .+-.+.| +..|+- + +. -++|+..+...+++.|++....++-.+.++-|-|++.+-+|+ T Consensus 74 ~~~----~~~~~~~-g~~~~~----~----~~------~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~ 134 (527) T protein:vir:10 74 IEA----KMRFLGQ-GLKWEF----S----KK------DAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDD 134 (527) T ss_pred hCC----cceeecc-Cccccc----c----ch------hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecc Confidence 332 2222332 333421 1 11 113666677788889999999999999999999999999887 Q ss_pred Cc----eEEEEEeeccEEEEeeCCCC--CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHh---------cCCCCce-- Q lcl|NC_019445. 145 ED----IIRTMPFPIGSYYLANSPRG--SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWE---------SGTYEKW-- 207 (559) Q Consensus 145 ~~----~~~~~~~~l~~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~---------~~~~~~~-- 207 (559) ++ .++.+.+-.+.|+.-+|++| .|-.+|- +..|. +|++-++.++ ..+++.. T Consensus 135 ~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~---------~~~~~---~P~d~~~~~~~ar~~~~~~~l~~~g~~~ 202 (527) T protein:vir:10 135 EKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYL---------VDEYP---HPDSEKKNEKCARVQKYMKTLDDDGKPV 202 (527) T ss_pred CCCcCCCceEeecCcceeeeeecCCCCCceeeEEE---------eeecc---CCccccccceehhhhhhhhhcCcccccc Confidence 65 48888999999998888765 3444432 12232 2222211111 1111111 Q ss_pred ------EEEEEEEeecCc-ccccccccccccEEEEEEEecCCCceeeee--cCcccCCeEEEEeeecCCCcccccchHHH Q lcl|NC_019445. 208 ------IEVMHSVYPNID-RDTSKLDSKNKPFKSVYYEVGGDNDKLLRE--SGFDEFPIMAPRWEVNGEDVYGSSCPGML 278 (559) Q Consensus 208 ------v~v~~~v~p~~~-~~~~~~~~~~~~~~sv~~~~~~~~~~il~e--sg~~~~P~~~~rw~~~~g~~YGrG~P~~~ 278 (559) ++..|+-..+++ .+.-.+++.. | ... .+...+.+ -++.-.|+++++=...++++||+|. 
..+ T Consensus 203 ~~G~~~yt~~~w~lg~w~d~~e~p~~~~~--~-----~~~-~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~-La~ 273 (527) T protein:vir:10 203 PGGAIKYTEELYEPGKWDDRPESPLEPDD--I-----KKL-STLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSG-LAG 273 (527) T ss_pred cCcceeeeeceeeccccccccccccchhh--h-----hhh-cCceeeecccCCCCccceEeecCCCccccccChhh-HhH Confidence 111111111110 0000000000 0 000 01122222 2466788888888888999999995 889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeecC----C--CccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 279 ALGPVKALQLLQKRKSQLIDKATNPPMVAPT----S--LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 279 ~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~----~--~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) .+.-+..||.........+...-.|...... + ++..++.+.||+++-.+..+. +.-+. ..+.++.+.+.+ T Consensus 274 ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak---~~~v~-~~~~la~~~~h~ 349 (527) T protein:vir:10 274 LESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNNK---IYRVN-GVASLEPSQTHM 349 (527) T ss_pred HHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCcc---eeecc-chhhhHHHHHHH Confidence 9999999998888888888888777655421 2 344456788998886644321 22111 113555566666 Q ss_pred HHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHH-HHHHHHH-HHH--------HHHhcC Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEC-LNPLIDR-AFS--------MMVRKN 422 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~-l~Pli~r-~~~--------il~r~g 422 (559) ..+..+|...--..-. .++..+..+ .-+.+. +...|+|++.+.+..- +.-.+.| +.. 
...+-+ T Consensus 350 ~~L~~~l~~vA~~Pav-A~G~vD~s~-~~SG~A-----LeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~ 422 (527) T protein:vir:10 350 NKAEEAMQQTKGIPDI-AVGVVDAAV-AESGIA-----LDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVG 422 (527) T ss_pred HHHHHHHHHhhcCCee-eeccccCCc-CcHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcc Confidence 7777666554422111 111112221 112222 2344666666665542 2222222 111 111111 Q ss_pred CCCCCchhhCCcceEEEe--ecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCH Q lcl|NC_019445. 423 MLPPPPDAMEGMPLKVEY--ISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQ 500 (559) Q Consensus 423 ~lp~~p~~l~g~~v~~~~--is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~ 500 (559) .-+ ......+++.+ ..|....+. ++.+..+.+.+ -+....+++.+.++.|+. -.+ T Consensus 423 ~~d----~~~~~~v~ivf~p~lP~D~~av----------ie~v~tL~~aG-----iiS~etAv~~L~~~~g~e----D~E 479 (527) T protein:vir:10 423 IDD----ADKKLTVTITFRDPKPVNNEKR----------FAQLLELWEAG-----LIPAKKLTEELSKIMGFE----LTE 479 (527) T ss_pred cCC----CccccceEEEecccCCCCHHHH----------HHHHHHHHHcC-----chhHHHHHHHHHhccCCC----chH Confidence 100 01111345544 455443322 22222222222 377888899999888852 234 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 501 EQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLS-EAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 501 ~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .|++++.+.++++.-++. 
.+.+.-.++++..++ +....++ ++|+.- T Consensus 480 ~E~~~I~~era~~a~a~a-~a~~~~~a~~~~~~g~~~~~~d~------------~~~~~~ 526 (527) T protein:vir:10 480 EDFRQATEDKKTQGIAQA-EAADPFGAQMAAEQGIPDEEDDQ------------ALNGQP 526 (527) T ss_pred HHHHHHHHHHHHHhHHhh-hhcCchhhhhccccCCCCCCccc------------ccCCCC Confidence 566666665554432221 111111122111110 0000000 011111 No 106 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.73 E-value=7.4e-08 Score=59.67 Aligned_cols=452 Identities=9% Similarity=0.060 Sum_probs=198.6 Q ss_pred CC--hh----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MA--ET----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~--~~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |. +. +.+.+.+..++....+. ++++++.+|..-.-..... .......+.+.++..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhh---HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 11 11 12334444444444444 3445555554321100000 001111123356777888888888887544 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -|+. ++..++. 
....+...+...+|.....++.++..+||.+.+++..+....+++..+ T Consensus 108 g--~p~~-----~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~ 167 (511) T protein:vir:78 108 G--NPIQ-----YQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (511) T ss_pred c--cCce-----eecCchH-------------HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEE Confidence 2 1211 2333321 223455566678899999999999999999999888776666788888 Q ss_pred eccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|. .+++...+|.+..... +... ...+..++ ||- .+. T Consensus 168 ~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-------------------~~~~-~~~~~~~~-vyt-~~~----------- 214 (511) T protein:vir:78 168 DAMSTFIIYDNTVERNSIAGVRYLRTKPI-------------------DKTD-EDEVFTVD-LFT-SHG----------- 214 (511) T ss_pred cccceEEEEcCCCCCceEEEEEEEEeeec-------------------cccc-cceEEEEE-EEe-CCc----------- Confidence 88888776664 3555555555433210 0111 11111111 111 100 Q ss_pred EEEEEEEec-CCCce----e--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 232 FKSVYYEVG-GDNDK----L--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 232 ~~sv~~~~~-~~~~~----i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .+++..+ +.... . ...-+|..+|++.++- +.+|+|. .+..++.+..++.+.-.+...++...+|. 
T Consensus 215 --i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd-~e~v~~liDa~~~~~S~~~~~~~~~~~~~ 286 (511) T protein:vir:78 215 --VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLNDAM 286 (511) T ss_pred --EEEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhcch Confidence 0111111 11000 0 1122466788776543 4578996 88899999999999888888999889998 Q ss_pred eeecCCC--ccccce-ecCCceeec--------CC--cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 305 MVAPTSL--KNQRAS-LLPGDITYI--------DQ--ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 305 ~~~p~~~--~~~~~~-~~pg~~~~~--------~~--~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +++.+.. ....+. ...++.+.. .. .++...++.+. ...+...+...+..+++.|-..-+..-+. . T Consensus 287 lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~P~~~-~ 364 (511) T protein:vir:78 287 LLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMK-D 364 (511) T ss_pred hheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCcccc-c Confidence 7765422 111111 111111111 11 11112222221 12244555566677777665443321111 1 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHH Q lcl|NC_019445. 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSI 451 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~ 451 (559) ..-+...|+..+..... .+........+.-.+.+.-+++-++.++...+.... +.+ -.++++.|.-++..- ... 
T Consensus 365 -~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~--~~~i~~~f~~~~p~n-~~e 438 (511) T protein:vir:78 365 -DNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKD--FNTVRYVYNRNLPKS-LIE 438 (511) T ss_pred -cccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccc--cccceEEeCCCCCcC-HHH Confidence 11123345555444332 222233333444444444444444444444332211 111 224566665433211 111 Q ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 452 GLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAK 531 (559) Q Consensus 452 ~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~ 531 (559) . ++.+..++++ +..+.++..+ -+++ -.++|++++.++++.+...++... .... + T Consensus 439 ~-------~d~~~kl~G~-------iS~et~l~~l---~~v~----d~~~El~ri~~E~~~~~~~~~~~~--~~~~---~ 492 (511) T protein:vir:78 439 E-------LKAYIDSGGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGI--YKDP---R 492 (511) T ss_pred H-------HHHHHHHhcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhhcc--ccCC---C Confidence 1 1222222222 2223333221 1232 135666666665433222221110 0000 0 Q ss_pred hhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 532 TLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ...+....+++- ....+.. T Consensus 493 ~~~~~~~~~~~~---------~~~~e~~ 511 (511) T protein:vir:78 493 DINDDEQDDDTK---------DTVDKKE 511 (511) T ss_pred CCCCCCCCCCcc---------CcccccC Confidence 110100000000 0000000 No 107 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.73 E-value=7.4e-08 Score=59.67 Aligned_cols=452 Identities=9% Similarity=0.060 Sum_probs=198.6 Q ss_pred CC--hh----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MA--ET----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~--~~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |. +. +.+.+.+..++....+. ++++++.+|..-.-..... .......+.+.++..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhh---HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 11 11 12334444444444444 3445555554321100000 001111123356777888888888887544 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -|+. ++..++. ....+...+...+|.....++.++..+||.+.+++..+....+++..+ T Consensus 108 g--~p~~-----~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~ 167 (511) T protein:vir:96 108 G--NPIQ-----YQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (511) T ss_pred c--cCce-----eecCchH-------------HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEE Confidence 2 1211 2333321 223455566678899999999999999999999888776666788888 Q ss_pred eccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|. .+++...+|.+..... +... ...+..++ ||- .+. 
T Consensus 168 ~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-------------------~~~~-~~~~~~~~-vyt-~~~----------- 214 (511) T protein:vir:96 168 DAMSTFIIYDNTVERNSIAGVRYLRTKPI-------------------DKTD-EDEVFTVD-LFT-SHG----------- 214 (511) T ss_pred cccceEEEEcCCCCCceEEEEEEEEeeec-------------------cccc-cceEEEEE-EEe-CCc----------- Confidence 88888776664 3555555555433210 0111 11111111 111 100 Q ss_pred EEEEEEEec-CCCce----e--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 232 FKSVYYEVG-GDNDK----L--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 232 ~~sv~~~~~-~~~~~----i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .+++..+ +.... . ...-+|..+|++.++- +.+|+|. .+..++.+..++.+.-.+...++...+|. T Consensus 215 --i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd-~e~v~~liDa~~~~~S~~~~~~~~~~~~~ 286 (511) T protein:vir:96 215 --VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLNDAM 286 (511) T ss_pred --EEEEEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhcch Confidence 0111111 11000 0 1122466788776543 4578996 88899999999999888888999889998 Q ss_pred eeecCCC--ccccce-ecCCceeec--------CC--cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 305 MVAPTSL--KNQRAS-LLPGDITYI--------DQ--ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 305 ~~~p~~~--~~~~~~-~~pg~~~~~--------~~--~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +++.+.. ....+. ...++.+.. .. .++...++.+. ...+...+...+..+++.|-..-+..-+. . 
T Consensus 287 lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~P~~~-~ 364 (511) T protein:vir:96 287 LLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMK-D 364 (511) T ss_pred hheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCcccc-c Confidence 7765422 111111 111111111 11 11112222221 12244555566677777665443321111 1 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHH Q lcl|NC_019445. 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSI 451 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~ 451 (559) ..-+...|+..+..... .+........+.-.+.+.-+++-++.++...+.... +.+ -.++++.|.-++..- ... T Consensus 365 -~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~--~~~i~~~f~~~~p~n-~~e 438 (511) T protein:vir:96 365 -DNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKD--FNTVRYVYNRNLPKS-LIE 438 (511) T ss_pred -cccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccc--cccceEEeCCCCCcC-HHH Confidence 11123345555444332 222233333444444444444444444444332211 111 224566665433211 111 Q ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 452 GLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAK 531 (559) Q Consensus 452 ~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~ 531 (559) . ++.+..++++ +..+.++..+ -+++ -.++|++++.++++.+...++... .... + T Consensus 439 ~-------~d~~~kl~G~-------iS~et~l~~l---~~v~----d~~~El~ri~~E~~~~~~~~~~~~--~~~~---~ 492 (511) T protein:vir:96 439 E-------LKAYIDSGGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGI--YKDP---R 492 (511) T ss_pred H-------HHHHHHHhcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhhcc--ccCC---C Confidence 1 1222222222 2223333221 1232 135666666665433222221110 0000 0 Q ss_pred hhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
532 TLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ...+....+++- ....+.. T Consensus 493 ~~~~~~~~~~~~---------~~~~e~~ 511 (511) T protein:vir:96 493 DINDDEQDDDTK---------DTVDKKE 511 (511) T ss_pred CCCCCCCCCCcc---------CcccccC Confidence 110100000000 0000000 No 108 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.72 E-value=8.5e-08 Score=59.34 Aligned_cols=479 Identities=9% Similarity=0.015 Sum_probs=191.2 Q ss_pred CCh-hhHHHHHHHHHHHHHH--hhhHHHHHHHHHHHhccc------cCCCCCCCCCCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAE-TTKERLNKQFAQLESE--RQSFEPHWRELSDYINPR------GSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~-~~~~~l~~r~~~l~~~--R~~~~~~w~e~~~~~~P~------~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) |.- +..|+.++.|=+-+.. .-.....+..++.=-.+. ...|-+....++. ...++--+.+..+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~-~~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTV-HDKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcc-ccccccCChHHHHHHHHHHh Confidence 433 2334444444322210 001111111111100000 0011111111111 12234344577777777774 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) | |..- + .+++++.+..+...+++ .+...|..++|+..+.+.+.+..+.|.+++-+..+.++ +++. 
T Consensus 80 l----~~e~-~--~i~v~~~~~~d~e~~~~-------~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-~~i~ 144 (518) T protein:vir:78 80 I----SGKP-L--SIDVTGVNGSKDENLTK-------QLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGR-PSIS 144 (518) T ss_pred h----cCCC-c--eEEecCccccCcHHHHH-------HHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCe-eEEE Confidence 4 4311 1 23333333222223333 34557778999999999999999999999865555444 5677 Q ss_pred EeeccEEEEeeCCCCCEEEE-EEEEeecHHHH-------HHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccc- Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDIC-FRKFSMTVRQL-------VQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDT- 222 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i-~r~~~~t~~ql-------~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~- 222 (559) .++...|+...+ +|++..| |.....+-+.- ..+|+.. . .....+ ....|.+.+|....... T Consensus 145 ~v~ad~~~P~~~-~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~------~--~~~~~~-~~~~I~n~ly~~~~~~~v 214 (518) T protein:vir:78 145 VHSSSQFWIDFK-NNEPFRFNFFEEIPTSNKADIYYLVESREIKQW------D--KEGKKL-SGGFVTYSVIKIDGDKTT 214 (518) T ss_pred EEcCCeeEEEee-cCcEEEEEEEEEeecCCcceeEEEEEeeccccc------c--ceeecc-cceeEEEEEeeecCcccc Confidence 889988887654 4665443 32211110000 0000000 0 000000 01122222222110000 Q ss_pred -cccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeec-----CCCcccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 223 -SKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN-----GEDVYGSSCPGMLALGPVKALQLLQKRKSQL 296 (559) Q Consensus 223 -~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~-----~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~ 296 (559) ...........++....+......+. .....||+++..+.. .+++||+|. ...+.+.++.||..--+.... 
T Consensus 215 ~~~~~~~~~~l~~~~~~~~~~e~~~~~--tg~~~~~~~~~~n~~~N~~~~~splG~S~-~~~~~~~id~lD~~~s~~~~e 291 (518) T protein:vir:78 215 PISAERLPEQITSYLHTNDIQLNHSVS--IGLKSMGAYLINNSPSNTRYPHLNLGESD-LSQCTNYLFAVDYFFTVYMRE 291 (518) T ss_pred cccccccccccccccccccCccceeec--cCCccceEEeeccccccccccCCCcCcch-HhhhhHHHHHHHHHHHHHHHH Confidence 00000001111111001111111111 113467777665543 367789994 899999999999999999999 Q ss_pred HHHHhcCceeecCCCcccccee---------cCCceeec--C-Cc-CCchhhhhhhhcccc--HHHHHHHHHHHHHHHHH Q lcl|NC_019445. 297 IDKATNPPMVAPTSLKNQRASL---------LPGDITYI--D-QI-TGQDGFRPAYLVNPS--TADLVADIQDTRQIINS 361 (559) Q Consensus 297 ~~~~~~p~~~~p~~~~~~~~~~---------~pg~~~~~--~-~~-~~~~~~~p~~~~~~~--~~~~~~~i~~~~~rI~~ 361 (559) .++ .++.+.+|.++.....+. ..+.-.|. + .. .+.+...-+...+++ .....+.++.+-..|.. T Consensus 292 ~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~ 370 (518) T protein:vir:78 292 GEK-TKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVS 370 (518) T ss_pred HHh-CCceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHH Confidence 876 788888887653211111 11111111 1 11 111111111111222 23333444444444433 Q ss_pred HhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEee Q lcl|NC_019445. 362 AYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYI 441 (559) Q Consensus 362 af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~i 441 (559) ..=.. ..++.. ++..+|||||..+.+...+.+--.-..+.. 
.|.-++..+++++.-......-.......+++|.+- T Consensus 371 ~~G~s-~~tfg~-~~~~~TATei~s~~~~~~~t~~~~~~~~e~-al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~ 447 (518) T protein:vir:78 371 KSGYN-PATFNL-GNREVKATEIWSLQDATVRKIEKKKRLIQN-VYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFP 447 (518) T ss_pred hhCCC-hhhcCc-ccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeC Confidence 22000 112332 344689999999998876666544444433 344455555555443221100000001124666665 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 442 SVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMA 521 (559) Q Consensus 442 s~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~ 521 (559) -++.. +.+...+.++ ++.+.+ .+..+.+++.+ ..|+ +++|+++.-++ -++++.+ . T Consensus 448 D~i~~-----D~~~~~~~~~---~~v~aG-----imS~e~~i~~~--~~~~------~deea~~e~~r-i~~E~~~---~ 502 (518) T protein:vir:78 448 DPMSV-----NLNELSSTLN---NMNSAL-----AMSVEEKVKLI--HPKW------EDEEIQAEVKR-IYLENAI---G 502 (518) T ss_pred CCCCC-----CHHHHHHHHH---HHHhcC-----CCCHHHHHHHh--CCCC------CHHHHHHHHHH-HHHHhcc---c Confidence 44321 1111111111 111111 23344444432 1133 34443221110 0000000 0 Q ss_pred HHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCC Q lcl|NC_019445. 522 MGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQG 555 (559) Q Consensus 522 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 555 (559) ..+ .|. 
.+.++-..|| T Consensus 503 -~~~--------------~p~---~~~g~~~~~g 518 (518) T protein:vir:78 503 -EVP--------------DPE---AIGGMETKGG 518 (518) T ss_pred -CCC--------------CCc---cccCCCCCCC Confidence 000 000 1111111111 No 109 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.69 E-value=1.1e-07 Score=58.79 Aligned_cols=451 Identities=9% Similarity=0.070 Sum_probs=194.1 Q ss_pred CC--hh----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCC-CCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MA--ET----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLT-SEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~--~~----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |. +. +.+.+.+..+.....+. ++++++.+|..-.-..... .......+...++..+.+...++..++-|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 21 11 12334444444333333 4555555554321100000 000111123345667778888887776443 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 
74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) + -| .+++..+.+ ....+...+..++|.....++.+++.+||.|..++..+....+++..+ T Consensus 108 g--~p-----~~~~~~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~ 167 (511) T protein:vir:10 108 G--NP-----IQYQDDDKD-------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKS 167 (511) T ss_pred c--cC-----ceeecCchH-------------HHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEE Confidence 2 11 123333321 223455566778899999999999999999999888776556788888 Q ss_pred eccEEEEeeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 154 PIGSYYLANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 154 ~l~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +..+.++-.|.. +++...+|.+.....+ .. ....-..+++|+ ... . T Consensus 168 ~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-------~~----------~~~~~~~~~iyt----~~~-i---------- 215 (511) T protein:vir:10 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPID-------KT----------DEDEVFTVDLFT----SHG-V---------- 215 (511) T ss_pred ccceeEEEEcCCCCCceEEEEEEEEeeecc-------cC----------ccceEEEEEEEe----CCc-E---------- Confidence 888887766643 4565555655432100 00 000001122221 110 0 Q ss_pred EEEEEEEecCCC----cee--eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_019445. 232 FKSVYYEVGGDN----DKL--LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) Q Consensus 232 ~~sv~~~~~~~~----~~i--l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~ 305 (559) + .|...++.. ... ...-+|..+|++.++- +.+|+|. 
.+..++.+..++.+.-.....++...+|.+ T Consensus 216 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd-~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:10 216 Y--RYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGD-YEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred E--EEEecCCCcccccccccccccccCcceeEEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhCcee Confidence 0 011111110 001 1123567788877653 4578996 888999999999988888888999899988 Q ss_pred eecCCCcc--ccc-eecCCceeec--------CC--cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhcc Q lcl|NC_019445. 306 VAPTSLKN--QRA-SLLPGDITYI--------DQ--ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ 372 (559) Q Consensus 306 ~~p~~~~~--~~~-~~~pg~~~~~--------~~--~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~ 372 (559) ++.+.... ..+ ....++++.. .. .++...++.+. ...+...+...+..++..|...-+.. .... T Consensus 288 v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~-~~~~~~~~e~~~~~L~~~I~~~s~~P--~~~~ 364 (511) T protein:vir:10 288 LIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTP--NMKD 364 (511) T ss_pred eeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCc--cccc Confidence 76542211 111 1112222211 11 11112222221 12245555566777777665443321 1111 Q ss_pred CCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHH Q lcl|NC_019445. 373 NINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIG 452 (559) Q Consensus 373 ~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~ 452 (559) ..-+...|+..+...-.-+. .......++-.+.+.-+++-++.++...+.... +.+. .++++.+.-++..- .... 
T Consensus 365 ~~~~~n~Sg~Al~~~~~~l~-~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~-~~d~--~~i~i~f~~~~p~d-~~~~ 439 (511) T protein:vir:10 365 DNFSGTQSGEAMKYKLFGLE-QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDF--NTVRYVYNRNLPKS-LIEE 439 (511) T ss_pred ccccccchHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccc-cccc--ceeeEEeCCCCCcC-HHHH Confidence 11123456655544422221 122222222222332223323333333332221 1121 24566665433211 1111 Q ss_pred HHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019445. 453 LSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKT 532 (559) Q Consensus 453 ~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~ 532 (559) + +.+..+.++ +....++..+ -+++ -.++|++++.++++.+.+.++... ... -+. T Consensus 440 ~-------~~~~kl~G~-------iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~--~~~---~~~ 493 (511) T protein:vir:10 440 L-------KAYIDSGGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGI--YKD---PRD 493 (511) T ss_pred H-------HHHHHHhcc-------CcHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhhhc--ccC---CCC Confidence 1 222222221 2223333222 1232 135667766665443322221100 000 000 Q ss_pred hhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 533 LSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ..+..+.+.+ .....| T Consensus 494 ~~~~~~~~~~-----------~~~~~~ 509 (511) T protein:vir:10 494 INDDEQDDDT-----------KDTVDK 509 (511) T ss_pred CCCCCCCCcc-----------cCcccc Confidence 0000000000 000000 No 110 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.65 E-value=1.4e-07 Score=58.08 Aligned_cols=456 Identities=14% Similarity=0.080 Sum_probs=195.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) +.+.+.+.|.+..+.++. ..++++++.+|....-.-..... ......+.++..+.+...++..++-|++- |+ T Consensus 13 ~~~~~~~~i~~~i~~~~~----~~~~~~~l~~Yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~--p~- 84 (499) T protein:vir:10 13 VNEPNIEAINYAIRELQN----RKKRLDKLSDYYNGKQEIEKHEF-DNATVEAANVMVNHAKYITDMNVGFMTGN--PV- 84 (499) T ss_pred hhcCCHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcCCc-CcCCCCcceeecchHHHHHHHHhhhhccc--Cc- Confidence 333333434444444433 24455556666443210000011 11123345666777778888777644331 22 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc-------------- Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED-------------- 146 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~-------------- 146 (559) ++...+.. ....+.+.+...+|.....++.++..+||.+.+++..+... T Consensus 85 ----~~~~~~~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~ 147 (499) T protein:vir:10 85 ----KYVAEKGK-------------NIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLT 147 (499) T ss_pred ----eeecCChh-------------HHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccc Confidence 23333321 11124445666788889999999999999998877654332 Q ss_pred ---eEEEEEeeccEEEEeeC-CCCC-EEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccc Q lcl|NC_019445. 147 ---IIRTMPFPIGSYYLANS-PRGS-VDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRD 221 (559) Q Consensus 147 ---~~~~~~~~l~~~~v~~d-~~G~-vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~ 221 (559) .+++..++..+.++-.+ ..++ +...+|.+... . .+.+.....++||+ |.. 
T Consensus 148 ~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~-----------~--------~~~~~~~~~~~iyt---~~~--- 202 (499) T protein:vir:10 148 PNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKK-----------D--------LEGNTNGYSITVYM---PQR--- 202 (499) T ss_pred cccceEEEEEcccceEEEecCCCCcceEEEEEEEEEe-----------e--------cCCCceEEEEEEEe---CCe--- Confidence 23455555554444333 3333 23333332211 0 00001011222221 100 Q ss_pred ccccccccccEEEEEEEecC-----CCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 222 TSKLDSKNKPFKSVYYEVGG-----DNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKS 294 (559) Q Consensus 222 ~~~~~~~~~~~~sv~~~~~~-----~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~ 294 (559) .++|...+ .+.... ..-+|..+|++.++- +.+|.|. .+...+.+..++.+.-... T Consensus 203 ------------i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~~d-~e~v~~liD~~~~~~S~~~ 264 (499) T protein:vir:10 203 ------------IVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRN-----NEERQGD-FEQLISLIDAYNLLQTDRI 264 (499) T ss_pred ------------EEEEEecCCccccCcceecccccCCCCccceEEecC-----CCCCCCc-hHhHHHHHHHHHHHHHHHH Confidence 00111100 011111 123577889887654 4678895 7889999999999999999 Q ss_pred HHHHHHhcCceeecCCCccc----cceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 295 QLIDKATNPPMVAPTSLKNQ----RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 295 ~~~~~~~~p~~~~p~~~~~~----~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) ..++...+|.+++.+..... ......|++...+..++ ..++.+. .+.+...+...+..+...|.+.-+.. .. 
T Consensus 265 ~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~l~-~~~~~~~~~~~~~~l~~~I~~~s~~p--~~ 340 (499) T protein:vir:10 265 SDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEG-ADIEWLT-KSFDETQVNLLSQSIENDIHKISYVP--NM 340 (499) T ss_pred HHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCC-CcceEEe-ccCCHHHHHHHHHHHHHHHHHHhCcc--cC Confidence 99999999988876532111 12234555544432222 2233221 12345556667777777775543321 11 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKS 450 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~ 450 (559) ....-+...|+..+..+..-+.... ++. ...+.+.+.+++.++.+.--+... ......+++.+.-++.. T Consensus 341 ~~~~~~gn~Sg~Al~~~~~~l~~k~----~~k-~~~~~~~l~~~~~li~~~~~~~~~--~~d~~~i~i~f~~~~p~---- 409 (499) T protein:vir:10 341 NDEKFMGNVSGEAMKFKLFGLENLL----SIK-QRYFFDGLRRRLKLIQTIVNIKGA--NDDASGCKISLVANIPS---- 409 (499) T ss_pred CchhhcccchHHHHHHHHHHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCCCC---- Confidence 0111123346655544333222211 222 223344445555554442111111 11122455655444321 Q ss_pred HHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 451 IGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA 530 (559) Q Consensus 451 ~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a 530 (559) +....++.++.+++ .+....+++.+ -+++ -+++|++++.++++.....++... ....... T Consensus 410 ----n~~e~~~~~~kl~g-------~iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~--~~~~~~~ 469 (499) T protein:vir:10 410 ----NLSDVVNNVKNADG-------IIPRKYTYSWL---PDVD----NPQDVIDEMNQQDAETIKKNQEAL--RGQDPDR 469 (499) T ss_pred ----CHHHHHHHHHHHhc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhhh--ccCCCCC Confidence 11112222222222 13333333322 1222 235677777665544322222111 1111111 Q ss_pred hhhhhhcCCChh-HHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
531 KTLSEAKTSDPS-VLSAMANAVSGQGGQSQ 559 (559) Q Consensus 531 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 559 (559) ....+.....++ ..+.-..-...|.+-+- T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 470 LELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 111111111100 00000000011111111 No 111 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.64 E-value=1.6e-07 Score=57.89 Aligned_cols=459 Identities=8% Similarity=0.054 Sum_probs=199.8 Q ss_pred CChhhHHHHHHHHH-----H-----------HHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFA-----Q-----------LESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~-----~-----------l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a 64 (559) |-+..+.-+.+-.. . +..++-.....|+.+|.=-.|.. .+.+.. +......+.--+.+... T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~-~~~~~~--~~~~~~~~~slnl~~~i 79 (500) T protein:vir:98 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSV-LYLNTD--GETKKRDLNHLPIARTA 79 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCc-ccccCC--CCcccCceeecchHHHH Confidence 33333222211111 1 11223334555656554322221 111111 11111122222456666 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 65 ARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) ++.+|+ .+|.- .+ .+.++|+ . ..+.+.+.+...+|+..+.+++.+..+.|.+++.+..|. 
T Consensus 80 ~~~~A~----lv~~e-~~--~i~~~d~------~-------~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~ 139 (500) T protein:vir:98 80 AKKIAS----LVFNE-QA--EIKVDDD------A-------ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG 139 (500) T ss_pred HHHHhh----hhcCC-cc--eEecCCh------H-------HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Confidence 666666 33321 11 1333332 2 222345567779999999999999999999998776665 Q ss_pred CceEEEEEeeccEEEE-eeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHHHHhc--CCCCceEEEEEEEeecCcc Q lcl|NC_019445. 145 EDIIRTMPFPIGSYYL-ANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKSMWES--GTYEKWIEVMHSVYPNIDR 220 (559) Q Consensus 145 ~~~~~~~~~~l~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~--~~~~~~v~v~~~v~p~~~~ 220 (559) +. +++..++...++- ..|..|.+..+| ++...+... .+. .-..++. -.....+.|.+.+|..... T Consensus 140 ~~-~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~------~~~----~yt~lE~h~~~~~~~~~I~n~ly~~~~~ 208 (500) T protein:vir:98 140 DK-VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING------KEV----YYTLIEFHEWQSSDDYVISNELYRSDDK 208 (500) T ss_pred Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC------Cce----EEEEEEEEEEeCCceeEEEEEEEecccc Confidence 44 5567788888775 556666654433 332211100 000 0000000 0000112333333332211 Q ss_pred cccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEE----eeecCCCcccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 221 DTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR----WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQL 296 (559) Q Consensus 221 ~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~r----w~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~ 296 (559) . ..+...|..++|-.. .+.+ ...|+..-||..++ =+...+++||.|. -.++.+.+..|+..--+.... 
T Consensus 209 ~---~lG~~v~l~~~~~~l--~~~~--~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~-~~~~~~lid~lD~~~s~~~~e 280 (500) T protein:vir:98 209 A---KVGSRVPLSEVYKDL--KDEA--KVTDVTRPIFTYLKTPGMNNKDINSPLGLSI-FDNAKTTIDFINTTYDEFMWE 280 (500) T ss_pred c---ccCcccccccccCCc--Ccce--EeccCCCccEEEecCCccccccCCCccCCch-hhhhHHHHHHHHHHHHHHHHH Confidence 1 112233444443111 1111 11233322333322 2334478899994 999999999999999999988 Q ss_pred HHHHhcCceeecCCCccccce---e--------cCCceeec--CC-cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 297 IDKATNPPMVAPTSLKNQRAS---L--------LPGDITYI--DQ-ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSA 362 (559) Q Consensus 297 ~~~~~~p~~~~p~~~~~~~~~---~--------~pg~~~~~--~~-~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~a 362 (559) .++ .+..+.+|.++.....+ . .+....|. +. .++...++.+ ...-......+.++.+-+.|... T Consensus 281 ~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~~ 358 (500) T protein:vir:98 281 VKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDL-TTPIRADDYIKAINEGLSLFEMQ 358 (500) T ss_pred HHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEe-ccccChHHHHHHHHHHHHHHHHH Confidence 876 66667776654221111 1 11111121 11 1222223322 11112233344444444444322 Q ss_pred h-hcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEee Q lcl|NC_019445. 363 Y-FVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYI 441 (559) Q Consensus 363 f-~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~i 441 (559) . +. -.++........|||||....+...+...-.-..+ ...|.-|+..++++..-.+.....+.. 
..+|.+.+- T Consensus 359 ~gls--~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~-~~al~~lv~~il~~~~~~~~~~~~~~~--~~~v~v~f~ 433 (500) T protein:vir:98 359 IGVS--AGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALV-EQSLKELVISIFEIAKAYDLYQSEVPS--MDNISISLD 433 (500) T ss_pred hCCC--ccccccCcCccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCCC--CcceEEEeC Confidence 1 10 01222233455799999999998888777765555 446666777776665432222111111 124677775 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 442 SVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMA 521 (559) Q Consensus 442 s~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~ 521 (559) -++..- +...++... +.+ +.+ .+....+ +....|+ +++|++++.++-+. + T Consensus 434 d~i~~d-~~~~~~~~~---~~v----~aG-----i~s~~~~---i~~~~g~------~eeea~~~l~~i~~--------E 483 (500) T protein:vir:98 434 DGVFTD-RDAELDYWI---KVV----NAG-----FGTREMA---IQKVLNV------TEEKAQEIAAEINT--------G 483 (500) T ss_pred CCCCCC-HHHHHHHHH---HHH----HcC-----CCCHHHH---HHhcCCC------CHHHHHHHHHHHHH--------h Confidence 443211 111111111 111 111 1333332 3445676 34554433222111 0 Q ss_pred HHHHHHHHHhhhhhhcCCChhHHHHHHHH Q lcl|NC_019445. 522 MGMAAAQGAKTLSEAKTSDPSVLSAMANA 550 (559) Q Consensus 522 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 550 (559) + +. ..+.....+++.|. T Consensus 484 ~---~~---------~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 484 I---VD---------EINQQRTDTHLYGE 500 (500) T ss_pred c---cc---------cCCCCCccccccCC Confidence 0 00 00011111222222 No 112 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.64 E-value=1.6e-07 Score=57.89 Aligned_cols=459 Identities=8% Similarity=0.054 Sum_probs=199.8 Q ss_pred CChhhHHHHHHHHH-----H-----------HHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFA-----Q-----------LESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~-----~-----------l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a 64 (559) |-+..+.-+.+-.. . +..++-.....|+.+|.=-.|.. .+.+.. +......+.--+.+... T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~-~~~~~~--~~~~~~~~~slnl~~~i 79 (500) T protein:vir:30 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSV-LYLNTD--GETKKRDLNHLPIARTA 79 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCc-ccccCC--CCcccCceeecchHHHH Confidence 33333222211111 1 11223334555656554322221 111111 11111122222456666 Q ss_pred HHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 65 ARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 65 ~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) ++.+|+ .+|.- .+ .+.++|+ . ..+.+.+.+...+|+..+.+++.+..+.|.+++.+..|. T Consensus 80 ~~~~A~----lv~~e-~~--~i~~~d~------~-------~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~ 139 (500) T protein:vir:30 80 AKKIAS----LVFNE-QA--EIKVDDD------A-------ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG 139 (500) T ss_pred HHHHhh----hhcCC-cc--eEecCCh------H-------HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Confidence 666666 33321 11 1333332 2 222345567779999999999999999999998776665 Q ss_pred CceEEEEEeeccEEEE-eeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHHHHhc--CCCCceEEEEEEEeecCcc Q lcl|NC_019445. 145 EDIIRTMPFPIGSYYL-ANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKSMWES--GTYEKWIEVMHSVYPNIDR 220 (559) Q Consensus 145 ~~~~~~~~~~l~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~--~~~~~~v~v~~~v~p~~~~ 220 (559) +. +++..++...++- ..|..|.+..+| ++...+... .+. .-..++. -.....+.|.+.+|..... 
T Consensus 140 ~~-~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~------~~~----~yt~lE~h~~~~~~~~~I~n~ly~~~~~ 208 (500) T protein:vir:30 140 DK-VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTING------KEV----YYTLIEFHEWQSSDDYVISNELYRSDDK 208 (500) T ss_pred Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecC------Cce----EEEEEEEEEEeCCceeEEEEEEEecccc Confidence 44 5567788888775 556666654433 332211100 000 0000000 0000112333333332211 Q ss_pred cccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEE----eeecCCCcccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 221 DTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR----WEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQL 296 (559) Q Consensus 221 ~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~r----w~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~ 296 (559) . ..+...|..++|-.. .+.+ ...|+..-||..++ =+...+++||.|. -.++.+.+..|+..--+.... T Consensus 209 ~---~lG~~v~l~~~~~~l--~~~~--~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~-~~~~~~lid~lD~~~s~~~~e 280 (500) T protein:vir:30 209 A---KVGSRVPLSEVYKDL--KDEA--KVTDVTRPIFTYLKTPGMNNKDINSPLGLSI-FDNAKTTIDFINTTYDEFMWE 280 (500) T ss_pred c---ccCcccccccccCCc--Ccce--EeccCCCccEEEecCCccccccCCCccCCch-hhhhHHHHHHHHHHHHHHHHH Confidence 1 112233444443111 1111 11233322333322 2334478899994 999999999999999999988 Q ss_pred HHHHhcCceeecCCCccccce---e--------cCCceeec--CC-cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 297 IDKATNPPMVAPTSLKNQRAS---L--------LPGDITYI--DQ-ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSA 362 (559) Q Consensus 297 ~~~~~~p~~~~p~~~~~~~~~---~--------~pg~~~~~--~~-~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~a 362 (559) .++ .+..+.+|.++.....+ . .+....|. +. .++...++.+ ...-......+.++.+-+.|... 
T Consensus 281 ~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~~ 358 (500) T protein:vir:30 281 VKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDL-TTPIRADDYIKAINEGLSLFEMQ 358 (500) T ss_pred HHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEe-ccccChHHHHHHHHHHHHHHHHH Confidence 876 66667776654221111 1 11111121 11 1222223322 11112233344444444444322 Q ss_pred h-hcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEee Q lcl|NC_019445. 363 Y-FVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYI 441 (559) Q Consensus 363 f-~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~i 441 (559) . +. -.++........|||||....+...+...-.-..+ ...|.-|+..++++..-.+.....+.. ..+|.+.+- T Consensus 359 ~gls--~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~-~~al~~lv~~il~~~~~~~~~~~~~~~--~~~v~v~f~ 433 (500) T protein:vir:30 359 IGVS--AGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALV-EQSLKELVISIFEIAKAYDLYQSEVPS--MDNISISLD 433 (500) T ss_pred hCCC--ccccccCcCccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCCC--CcceEEEeC Confidence 1 10 01222233455799999999998888777765555 446666777776665432222111111 124677775 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 442 SVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMA 521 (559) Q Consensus 442 s~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~ 521 (559) -++..- +...++... +.+ +.+ .+....+ +....|+ +++|++++.++-+. + T Consensus 434 d~i~~d-~~~~~~~~~---~~v----~aG-----i~s~~~~---i~~~~g~------~eeea~~~l~~i~~--------E 483 (500) T protein:vir:30 434 DGVFTD-RDAELDYWI---KVV----NAG-----FGTREMA---IQKVLNV------TEEKAQEIAAEINT--------G 483 (500) T ss_pred CCCCCC-HHHHHHHHH---HHH----HcC-----CCCHHHH---HHhcCCC------CHHHHHHHHHHHHH--------h Confidence 443211 111111111 111 111 1333332 3445676 34554433222111 0 Q ss_pred HHHHHHHHHhhhhhhcCCChhHHHHHHHH Q lcl|NC_019445. 
522 MGMAAAQGAKTLSEAKTSDPSVLSAMANA 550 (559) Q Consensus 522 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 550 (559) + +. ..+.....+++.|. T Consensus 484 ~---~~---------~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 484 I---VD---------EINQQRTDTHLYGE 500 (500) T ss_pred c---cc---------cCCCCCccccccCC Confidence 0 00 00011111222222 No 113 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.59 E-value=2.3e-07 Score=57.00 Aligned_cols=460 Identities=11% Similarity=0.086 Sum_probs=196.5 Q ss_pred CChhhHHHHHHHH-H------HHH-----------HHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHH Q lcl|NC_019445. 1 MAETTKERLNKQF-A------QLE-----------SERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGT 62 (559) Q Consensus 1 M~~~~~~~l~~r~-~------~l~-----------~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~ 62 (559) |-+.. |.+.+++ . .++ .++......|+.+|.=--|....... ........++--+.+. T Consensus 3 ~~~~i-k~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~---~~~~~~~~~~slnl~~ 78 (505) T protein:vir:79 3 FWDTL-KNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS---YGDTQKHELQSVNVTK 78 (505) T ss_pred hHHHH-HHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc---CCCccccceeecchHH Confidence 33332 2222221 1 111 11222334566554322221111011 1111111122224566 Q ss_pred HHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee Q lcl|NC_019445. 63 MAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE 142 (559) Q Consensus 63 ~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~ 142 (559) .+++.+|+ .+|.- .+ +++++|+ ..+++ +.+.+...+|+..+.+++.+..++|.+++.+.. 
T Consensus 79 ~i~~~~A~----ll~~e-~~--~i~~~d~------~~~e~-------l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~ 138 (505) T protein:vir:79 79 LASAKLAS----LIFNE-QC--QVTVSDE------TANDF-------LDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYV 138 (505) T ss_pred HHHHHHHh----hhcCC-Cc--eeecCCh------HHHHH-------HHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEE Confidence 66666666 33321 11 2333332 22333 445667789999999999999999999997766 Q ss_pred cCCceEEEEEeeccEEE-EeeCCCCCEEEEE-EEEeecHHHHHHhcCcccCCHHHHHHHhcC-CCCceEEEEEEEeecCc Q lcl|NC_019445. 143 DDEDIIRTMPFPIGSYY-LANSPRGSVDICF-RKFSMTVRQLVQEFGLNNVSESVKSMWESG-TYEKWIEVMHSVYPNID 219 (559) Q Consensus 143 ~~~~~~~~~~~~l~~~~-v~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~-~~~~~v~v~~~v~p~~~ 219 (559) |.++ +++..++...++ +..|..+....+| ++++.+-++ +. ..-..++.- ..+..+.|.|.+|...+ T Consensus 139 D~~~-~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~-------~~---~~yt~lE~h~~~~~~~~I~n~ly~~~~ 207 (505) T protein:vir:79 139 DSGK-IKLAWATADQVYPLQADTNQVNELAIASRTTEVENH-------RT---IYYTLLEFHQWDHGDYVITNELYRSEA 207 (505) T ss_pred eCCc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCC-------cc---eEEEEEEEEEecCceEEEEEEEEecCC Confidence 6543 567778888876 4566655444433 332211000 00 000000000 00012333333332221 Q ss_pred ccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEE---e-eecCCCcccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 220 RDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR---W-EVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQ 295 (559) Q Consensus 220 ~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~r---w-~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~ 295 (559) .+. + +...|...+..-.+-.+.+ .-.|...-+|..++ + +...++.+|+|. -.++.+.+..|+..--+... 
T Consensus 208 ~~~--l-G~~v~l~~~~~~~~l~~~~--~~~g~~~p~f~~~~~~~~N~~~~~splG~S~-~~~~~~~id~lD~~~s~~~~ 281 (505) T protein:vir:79 208 AET--V-GINVPLNSLEQYEGLEPQV--KITGLKHPLFAFYRNKGANNKNFTSPMGMSL-IDNSYTVIDAINRTHDQFVD 281 (505) T ss_pred CCc--c-CcccchhhcccccccCcce--eecCCCcceEEEecCCcccccccCCccCCch-hhhhHHHHHHHHHHHHHHHH Confidence 110 0 1111222210000000001 01233333333332 2 234467899994 89999999999998888888 Q ss_pred HHHHHhcCceeecCCCccc-------ccee-----cCCceeecC--CcCCchhhhhhhhccccHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 296 LIDKATNPPMVAPTSLKNQ-------RASL-----LPGDITYID--QITGQDGFRPAYLVNPSTADLVADIQDTRQIINS 361 (559) Q Consensus 296 ~~~~~~~p~~~~p~~~~~~-------~~~~-----~pg~~~~~~--~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~ 361 (559) ..++ .+..+.+|.++... .... .+....|.. ..++...++.+. ++-......+.++.+-++|.. T Consensus 282 e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~ 359 (505) T protein:vir:79 282 EVKK-GQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDAT-SPIRVADYQATMDFFLREFEN 359 (505) T ss_pred HHHh-cccceeechHHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEec-ccCCHHHHHHHHHHHHHHHHH Confidence 7775 34444454432110 0111 112222221 112222222221 111223334445554444432 Q ss_pred HhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC----chhhCCcceE Q lcl|NC_019445. 362 AYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPP----PDAMEGMPLK 437 (559) Q Consensus 362 af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~----p~~l~g~~v~ 437 (559) ..=. --.++........|||||..+.+...+...-.-..+ ...|..|+..++++..-.+..+.- .......++. 
T Consensus 360 ~~g~-s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~-~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~ 437 (505) T protein:vir:79 360 QTGL-SQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQV-EKTIKALTYAILELASVPSFYADGQARWTGDVDSLDIT 437 (505) T ss_pred HhCC-ChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEE Confidence 2200 012233333455799999999999988888776666 557888988888887665443210 0001112466 Q ss_pred EEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 438 VEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQ 517 (559) Q Consensus 438 ~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~ 517 (559) +.+--++..- +....+.. . ++.+.+ .+... ..+....|++ ++|++++.++ .++.+. T Consensus 438 v~f~d~i~~d-~~~~~~~~---~----~~v~~G-----i~s~e---~~l~~~~~~~------eeea~~el~r-i~~E~~- 493 (505) T protein:vir:79 438 INFNDGVFVD-QESKRAAD---L----QAVQAQ-----VMPKK---QFLMRNYGLD------EEEADEWLAQ-IDAENS- 493 (505) T ss_pred EEeCCCCCCC-HHHHHHHH---H----HHHHcC-----CCCHH---HHHHhcCCCC------hHHHHHHHHH-HHHhcc- Confidence 6665554311 11111111 1 111111 12222 2234456663 3444322221 111110 Q ss_pred HHHHHHHHHHHHHhh Q lcl|NC_019445. 518 QMMAMGMAAAQGAKT 532 (559) Q Consensus 518 ~~~~~~~~~~~~a~~ 532 (559) .+++...+.++. T Consensus 494 ---~~~p~~~~~gg~ 505 (505) T protein:vir:79 494 ---TAEPEFNQFGGD 505 (505) T ss_pred ---ccCCCchhccCC Confidence 011111122222 No 114 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.53 E-value=3.5e-07 Score=56.00 Aligned_cols=420 Identities=7% Similarity=0.050 Sum_probs=190.1 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHhccccCCC-CCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCc Q lcl|NC_019445. 
12 QFAQLESERQSFEPHWRELSDYINPRGSRF-LTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPD 90 (559) Q Consensus 12 r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~-~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d 90 (559) ........| .++++.+.+|..-.-... .........+.+.++..+.+...+++.++-|++- |+. +...+ T Consensus 1 ~~~~~~~~~---~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~~-----~~~~~ 70 (440) T protein:vir:95 1 MLAAFLGSQ---KQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PVS-----IGVME 70 (440) T ss_pred ChhhHHHHH---HHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cce-----EeeCC Confidence 122222333 334555555543110000 0011111122345677778888888777654321 111 23333 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCCC--CCE Q lcl|NC_019445. 91 PEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSPR--GSV 168 (559) Q Consensus 91 ~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~--G~v 168 (559) +...+ ....+...+...+|.....++.++..+||.+.+++..+...-+++..++..+.++..|+. +++ T Consensus 71 ~~~~~----------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~ 140 (440) T protein:vir:95 71 GGSAD----------QLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNI 140 (440) T ss_pred CccHH----------HHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCce Confidence 22111 112244566778999999999999999999999988876655788889999998888765 445 Q ss_pred EEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEe-cC-CCcee Q lcl|NC_019445. 169 DICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEV-GG-DNDKL 246 (559) Q Consensus 169 d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~-~~-~~~~i 246 (559) ...+|.+... ....++||+... .++|.. .. ..... 
T Consensus 141 ~~~i~~~~~~-------------------------~~~~~~vyt~~~------------------~~~~~~~~~~~~~~~ 177 (440) T protein:vir:95 141 IAAVHLPIYA-------------------------DKVNMTVYTKDK------------------VITYKPYSNNSVRLV 177 (440) T ss_pred EEEEEEEEec-------------------------CceEEEEEeCCe------------------EEEEEEecCCcccee Confidence 5555543210 112233332100 001110 00 00000 Q ss_pred e---eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC-----cccc-ce Q lcl|NC_019445. 247 L---RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL-----KNQR-AS 317 (559) Q Consensus 247 l---~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~-----~~~~-~~ 317 (559) + ..-+|..+|++.++- +.+|+|. .+...+.+..++.+....+..++...+|.+++.+.. .... .. T Consensus 178 ~~~~~~~~~g~vPvv~~~n-----~~~g~sd-~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~ 251 (440) T protein:vir:95 178 VDDVKKHSYNDVPVVEWWN-----NRFRMGD-YESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAK 251 (440) T ss_pred ecceeeccCceeeEEEeeC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhh Confidence 1 123567889887654 4578996 888999999999999999999999999977654321 1111 11 Q ss_pred ecCCceeecCC------cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHH Q lcl|NC_019445. 318 LLPGDITYIDQ------ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (559) Q Consensus 318 ~~pg~~~~~~~------~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~ 391 (559) ....+..+... .++...++.+. .+.+...+...++.++..|-..-... 
......-+...|+..+..+..-+ T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~lt-~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~~~~n~Sg~Al~~~~~~l 328 (440) T protein:vir:95 252 MKDANMLFLKTGISTTGQQTTADASYIY-KQYDVNGTEAYKNRLANDIHRFSRIP--NLDDDRFNSTSSGIALLYKMIGL 328 (440) T ss_pred hhhccceecccccccccCCCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCc--ccccccccccchHHHHHHHHHHH Confidence 22233232211 11112232221 12345555666777777665443221 10001112345665544332211 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccCh Q lcl|NC_019445. 392 LLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKP 471 (559) Q Consensus 392 ~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P 471 (559) .. ..++.+. .+..-+.+++.++.+.--... ...+....+++.+.-++..- ....++.+..++++ T Consensus 329 ~~----k~~~k~~-~~~~~l~~~~~li~~~~~~~~-~~~~~~~~v~i~f~~~~p~~--------~~~~ad~~~kl~g~-- 392 (440) T protein:vir:95 329 EQ----VRKDKET-YFTKALRRRYELISNIHKAIN-GPVIEANKLTFTFHPNIPQD--------VWTEIKAYIEAGGE-- 392 (440) T ss_pred HH----HHHHHHH-HHHHHHHHHHHHHHHHHhhcC-CcccccccceEEeCCCCCCC--------HHHHHHHHHHHhcc-- Confidence 11 1222222 222233444444333100000 11233345666665544211 11112222222222 Q ss_pred hhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 472 EALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 472 ~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) +....++..+ -+++ .++|++++.+++.+.+. +. .+..+.....+. 
+++ T Consensus 393 -----iS~et~~~~l---~~~d-----~~~E~~ri~~E~~~~~~-----~~----~~~~~~~~~~~~-~~e 440 (440) T protein:vir:95 393 -----ISQETLMENA---SFTD-----YKTEHSRILKQGGSSDL-----EI----GQIVGDADVGQA-DTE 440 (440) T ss_pred -----CcHHHHHHhC---CCCC-----cHHHHHHHHHHHHHhhh-----hH----HhhccCCCCCCc-CCC Confidence 3333333322 1232 34566555443321111 11 111111111111 111 No 115 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.42 E-value=7.3e-07 Score=54.22 Aligned_cols=436 Identities=10% Similarity=0.016 Sum_probs=176.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCCC-cccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVNR-NDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~~-~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |...+..++.+++-+-.. ...++.+.+.+|..-... ...+....+ .+....++..+-+..+++.+++.++.- T Consensus 1 ~~~~t~~~~~~~l~~~~~---~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~--- 74 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRID---DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN--- 74 (456) T ss_pred CCCCCHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccC--- Confidence 777666555443333322 333344555555421110 001111111 111123344566777777777655322 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSY 158 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~ 158 (559) + |++...++. ..... +.+.+.+++|.....++.++..+||.|.+++..+...-.++..++..+. 
T Consensus 75 ---g-~~~~~~~d~-~~~~~-----------~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~ 138 (456) T protein:vir:79 75 ---G-ITVGGSADS-DLALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETM 138 (456) T ss_pred ---C-eecCCCCCc-cHHHH-----------HHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEecccee Confidence 2 222222211 11112 2334556788888999999999999998888776555567888888888 Q ss_pred EEeeCC-CC-CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 159 YLANSP-RG-SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 159 ~v~~d~-~G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) ++..|+ .+ ++...+|.+.-. ..... .+.--..+..+.++...+...+.. ... T Consensus 139 ~~i~d~~~~~~~~~~~~~~~~~-----d~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~------------~~~ 192 (456) T protein:vir:79 139 VVSVDPLQPWRIRSAMRWWRDL-----DAESD---------FAIVWSGDGWQKFARPCFVQSSSR------------RRL 192 (456) T ss_pred EEEEcCCCCCceEEEEEEEEec-----CCcee---------EEEEEcCCceEEEEEEEEeecccc------------cee Confidence 777774 22 344455543210 00000 000000011111111111100000 000 Q ss_pred EEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC--- Q lcl|NC_019445. 237 YEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL--- 311 (559) Q Consensus 237 ~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~--- 311 (559) +....+.-... .+.++..+|++.++ +..|.|. .+..++-+-.++...-.++..++..+.|.+.+.+.. 
T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~pvv~~~------N~~~~gd-~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~ 265 (456) T protein:vir:79 193 VTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGE-VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRL 265 (456) T ss_pred eeccCCceeecccccCCCCceeEEEec------CCCCCch-hhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccc Confidence 11111110111 12244556665542 4678885 777777777888777777777888888765553211 Q ss_pred -----cc------ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcC Q lcl|NC_019445. 312 -----KN------QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMP 380 (559) Q Consensus 312 -----~~------~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~T 380 (559) .. ..+...+|.++..+ ++ ..+..+ ...++....+.+..+...|...--.+ ...+.. +....+ T Consensus 266 ~~~d~~g~~i~~~~~~~~~~~~~~~~~--~~-~~~~q~--~~~~~~~~~~~l~~~i~~i~~~t~~p-~~~~~~-~~~N~S 338 (456) T protein:vir:79 266 PKVDENGNAIDYASIFEAAPGALWELP--PG-VDIWES--QTNDFTPMLSAIKEHIRQLSSATKTP-LPMLMP-DSANQS 338 (456) T ss_pred ccccccccccchhhhhhhhccccccCC--CC-cceeee--cccChHHHHHHHHHHHHHHHhhcCCC-hhHhcc-cccCcH Confidence 00 01223344443322 11 111111 11233333333333333332111000 011111 112345 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 381 VEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTV 460 (559) Q Consensus 381 A~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~ 460 (559) +.-+......+.. ...+.+ ..+.+-+.+.+.++.+..-.+ + ...+++.+.-++.. 
...+.++.+ T Consensus 339 g~Al~~~~~~l~~----k~~~~~-~~f~~~l~~~~~l~~~~~g~~---~---~~~i~v~w~~~~~~-s~~~~ada~---- 402 (456) T protein:vir:79 339 AEGAHNIEKGFLF----KCEDRL-SIAKIGLEAILVKALQIEGES---V---EDTVDVSFESPDRV-TLGEKYSAA---- 402 (456) T ss_pred HHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhcCCC---c---cccceEEeCCCCCc-CHHHHHHHH---- Confidence 5444433333222 223332 345556666666665532222 1 12366666554321 111222222 Q ss_pred HHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 461 NFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 461 ~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) ..+.+++ +-..+. ....+|++ ++++++...+|..+++.+ ++...+ ..+..+ T Consensus 403 ---~kl~~~G------~~~~~~---~~~~lg~~------~~~i~~~e~~r~~~e~~~----~~~~~~-------~~~~~~ 453 (456) T protein:vir:79 403 ---SLAKAAG------ESWASI---RRNILNYN------ADQIKQDDLDRAREQITL----FAGNPV-------QRPQED 453 (456) T ss_pred ---HHHHhcC------CChHHH---HHhcCCCC------HHHHHHHHHHHHHHHHHH----HhhhHh-------hcCCCC Confidence 2222221 111111 22345664 333332222222222111 111111 111111 Q ss_pred hhH Q lcl|NC_019445. 541 PSV 543 (559) Q Consensus 541 ~~~ 543 (559) .+- T Consensus 454 ~~~ 456 (456) T protein:vir:79 454 GSR 456 (456) T ss_pred CCC Confidence 110 No 116 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.26 E-value=2e-06 Score=51.86 Aligned_cols=432 Identities=10% Similarity=0.020 Sum_probs=191.6 Q ss_pred CC----------------------hhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-----cccCCCCCC-CCCCcccc Q lcl|NC_019445. 1 MA----------------------ETTKERLNKQFAQLESERQSFEPHWRELSDYIN-----PRGSRFLTS-EVNRNDRR 52 (559) Q Consensus 1 M~----------------------~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~-----P~~~~~~~~-~~~~~~~~ 52 (559) |+ +...+.+.+..+..+. |. .+.+.+.+|.. +.+-+.... ......+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~---~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKP-KI---DDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHH-HH---HHHHHHHHHhccCCcchhccchhccccccccccc Confidence 22 2223333444444432 22 23334444432 111111000 00011122 Q ss_pred cCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHh Q lcl|NC_019445. 53 NTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGT 132 (559) Q Consensus 53 ~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 132 (559) +.|+..+.+...++..++-|++ -| ..++..|++. ...++. .+ ..||.....++.++..+ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g--~p-----~~~~~~d~~~--~~~l~~-----------~~-~n~~~~~~~~~~~~~~~ 135 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVA--NP-----VTFSSDDDKS--LKTIQE-----------VL-NHKWDDKLVDILTAASN 135 (474) T ss_pred chhcccchHHHHHHhhhhhhcc--cC-----ceeecCchHH--HHHHHH-----------HH-hcCHHHHHHHHHHHHHh Confidence 3456677777777777764433 12 1233333221 112222 22 25777788899999999 Q ss_pred hCcEEEEEeecCCceEEEEEeeccEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEE Q lcl|NC_019445. 133 YSTGAMAVLEDDEDIIRTMPFPIGSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEV 210 (559) Q Consensus 133 ~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v 210 (559) +|.+.+++..+...-+++..++..++++-.|. .+++...+|.++.. + ...+++ T Consensus 136 ~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~------------------------~-~~~~~~ 190 (474) T protein:vir:96 136 KGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLD------------------------G-AERVEY 190 (474) T ss_pred cCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec------------------------C-ceEEEE Confidence 99999888777666678888999988877764 56766666655321 0 112333 Q ss_pred EE------EEeecCcccccccccccccEEEEEEEecCC--CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHH Q lcl|NC_019445. 
211 MH------SVYPNIDRDTSKLDSKNKPFKSVYYEVGGD--NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGP 282 (559) Q Consensus 211 ~~------~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~--~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d 282 (559) ++ .++....... . .++-..... .......-+|..+|++.++. +.+|+|. .+...+. T Consensus 191 yt~~~v~~~~~~~~~~~~------~----~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd-~e~v~~l 254 (474) T protein:vir:96 191 WTDSDVTYYEYQDGILIP------D----YYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN-----NPQEMSD-LFMYKTI 254 (474) T ss_pred EeCCeEEEEEecCCceee------c----cccccccccccccccccccCCCceeEEEecc-----CCCCCCc-HHHHHHH Confidence 21 1111100000 0 000000000 00011234677899988765 4579996 7889999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeecCCC----ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHH Q lcl|NC_019445. 283 VKALQLLQKRKSQLIDKATNPPMVAPTSL----KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQI 358 (559) Q Consensus 283 ~~~L~~l~~~~~~~~~~~~~p~~~~p~~~----~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~r 358 (559) +..++.+.-..+...+...+|.+++.+.. .....++..++++..+.. +..++.+. .+.+.......++.++.. T Consensus 255 iDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~l~-~~~~~~~~~~~~~~l~~~ 331 (474) T protein:vir:96 255 IDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGD--GSGVDTIQ-IEVPVQSSKEYLDMLRDY 331 (474) T ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCC--CCceeEEe-ecCChHHHHHHHHHHHHH Confidence 99999999999999999999988765421 111223455566655422 22233221 123455555666776666 Q ss_pred HHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEE Q lcl|NC_019445. 359 INSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKV 438 (559) Q Consensus 359 I~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~ 438 (559) |-..-... .......+...|+..+..+-.-. .+-.....+ .+..-+.+.+.++....-.. 
..-.+++| T Consensus 332 i~~~s~~p--~~~~~~~~~n~Sg~Al~~~~~~l-~~k~~~k~~----~~~~~l~~~~~~i~~~~~~~-----~~~~~i~i 399 (474) T protein:vir:96 332 VIEFGQGV--DFQQDKFGNSPSGIALKFMYSNL-DLKANKLKN----KTLTALQELLQYIIDFYKLN-----IKVQDVEI 399 (474) T ss_pred HHHHhCCc--cccccccccccHHHHHHHHHHHH-HHHHHHHHH----HHHHHHHHHHHHHHHHhCCC-----cccceeeE Confidence 65444321 11111112333444333221111 111222222 23333344444443322111 11123455 Q ss_pred EeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 439 EYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQ 518 (559) Q Consensus 439 ~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~ 518 (559) ++.-.+..- .. ..++.+.+ ++ .+.-..++..+ -+++ -.++|++++.+++++..+... T Consensus 400 ~f~~~~p~~----~~----e~~~~~~~-ag-------~iS~et~~~~~---~~v~----d~~~E~~ri~~E~~e~~~~~~ 456 (474) T protein:vir:96 400 TFNFNVMVN----EL----EQSQIGVQ-SQ-------YLSKETVVTNH---PWVD----DPVAELERIEQDNIDFNKQLP 456 (474) T ss_pred EeccCCCcC----HH----HHHHHHHh-cC-------CCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhccc Confidence 553333210 11 11111111 11 24444444332 1232 134666666555433322211 Q ss_pred HHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 519 MMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 519 ~~~~~~~~~~~a~~~~~~~~~~ 540 (559) .+ .....+.... +...++ T Consensus 457 ~~--~~~~~~~~~d--~~~e~~ 474 (474) T protein:vir:96 457 PL--EGDANGRAQD--NESETN 474 (474) T ss_pred cc--ccccccccCC--CcccCC Confidence 10 0000000000 111111 No 117 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.24 E-value=2.2e-06 Score=51.62 Aligned_cols=440 Identities=11% Similarity=0.060 Sum_probs=183.5 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) +.- +.+++.+..++.+..|.+ +++++.+|..=.-.-..........+...++..+.+...++..++-|++ -|+ T Consensus 13 ~~~-~~~~~~~~i~~~~~~~~~---r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g--~~~- 85 (489) T protein:vir:99 13 SKL-WIDQLKNYISRFKAEQLE---RLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLG--VPV- 85 (489) T ss_pred CCC-CHHHHHHHHHHHHHHHHH---HHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhcc--CCc- Confidence 221 234455555555555444 4455555532110000000011111223467778888888888765543 122 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee----cCCceEEEEEeecc Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE----DDEDIIRTMPFPIG 156 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~----~~~~~~~~~~~~l~ 156 (559) +++..|+ .++.+ +...+...+|.....++.++..++|.+..++.. |...-+++..++.. T Consensus 86 ----~~~~~d~------~~~~~-------l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~ 148 (489) T protein:vir:99 86 ----EYKNENK------DLQAA-------IDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAE 148 (489) T ss_pred ----eeecCCh------hHHHH-------HHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEccc Confidence 2333332 22333 334555678888899999999999999866542 33334788889999 Q ss_pred EEEEeeCCC--CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 157 SYYLANSPR--GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 157 ~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) +++...|.. +.+...+|.+... 
.+ .+.....+++| .+..- T Consensus 149 ~~~~v~dd~~~~~~~~~i~~~~~~-------~~-------------~~~~~~~~~~y---~~~~i--------------- 190 (489) T protein:vir:99 149 QTFVIYDDTYQRNSLMAVHFYDID-------YG-------------SGKRKQIIKAY---TSDTI--------------- 190 (489) T ss_pred ceEEEEcCCCCCceEEEEEEEEEe-------cC-------------CCceEEEEEEE---eCCcE--------------- Confidence 987777643 3455444444311 00 00111122222 11100 Q ss_pred EEEEecC---CCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_019445. 235 VYYEVGG---DNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPT 309 (559) Q Consensus 235 v~~~~~~---~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~ 309 (559) ++|.... ....+. .+-+|..+|++.++. +..|+|. .+...+-+..++.+...++..++...+|.+++.+ T Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~s~-~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g 264 (489) T protein:vir:99 191 YTYEDYNLETKGMRLKDYEGHFFKGVPVNEYAN-----NEERTGA-YESVLDNIDAYDLSQSELANFQQDSVNALLVIAG 264 (489) T ss_pred EEEEecCCCcccceecccccccCCceeEEEeec-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcc Confidence 0111100 000111 224577899988764 3568884 7788888999999999999999988888776643 Q ss_pred CCcc--------ccceecCCc------------eeecCCcCCch----hhhhhhhccccHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 310 SLKN--------QRASLLPGD------------ITYIDQITGQD----GFRPAYLVNPSTADLVADIQDTRQIINSAYFV 365 (559) Q Consensus 310 ~~~~--------~~~~~~pg~------------~~~~~~~~~~~----~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~ 365 (559) .... ......+++ +.......... .++.+ +...+.......++.+...|-..-+. 
T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~i~~~s~~ 343 (489) T protein:vir:99 265 NAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFL-KKEYDTAGSEAYKNRLVADILRFTFT 343 (489) T ss_pred CCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeee-eecCChHHHHHHHHHHHHHHHHHhCC Confidence 2100 011112221 12211111101 11111 11123344444556665555332221 Q ss_pred chhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH Q lcl|NC_019445. 366 DLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA 445 (559) Q Consensus 366 dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La 445 (559) .-+... ..+...|+..+..+...+... .....+...+.+.-+++-++.++...+. + ......-.+++|.+.-++. T Consensus 344 p~~~~~--~~~~n~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~-~-~~~~~~~~~i~v~f~~~~p 418 (489) T protein:vir:99 344 PDTQDM--KFSGVQSGESMKYKLMASDNY-REKQERLFKKGLMRRLRLAANIWAIKGN-E-ATTYSLVNDTSIVFTPNLP 418 (489) T ss_pred cccccc--cccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCC-c-cccccccccceEEeCCCCC Confidence 101111 111234554443322211111 1111222222333333333333322221 0 1111222345555543332 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 446 --QAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMG 523 (559) Q Consensus 446 --~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~ 523 (559) .+.. + +.+..++++ +....++..+ -+++.. ..++|++++++++.+.++. .++ T Consensus 419 ~d~~~~---~-------~~~~kl~gi-------is~et~~~~l---~~v~~~--d~~~E~~ri~~E~~~~~~~----~~~ 472 (489) T protein:vir:99 419 QNDNEI---V-------TAAQNLYGI-------VSDQTIFEIL---NTVTGV--DAEAELKRLKEEADKKQSL----PEP 472 (489) T ss_pred cCHHHH---H-------HHHHHHhcc-------CCHHHHHHhc---CCCCch--hHHHHHHHHHHHHHHHhcc----ccc Confidence 2221 1 122222221 3333333322 233211 1234454444432221111 110 Q ss_pred HHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 
524 MAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 524 ~~~~~~a~~~~~~~~~~~~ 542 (559) ... +..+-.+.++...+ T Consensus 473 ~~~--~~~~~~~~~~~~~p 489 (489) T protein:vir:99 473 RLV--GDASGQEEPTAEKP 489 (489) T ss_pred ccc--CCCCCCcCCCCCCC Confidence 000 00011111111112 No 118 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.24 E-value=2.2e-06 Score=51.57 Aligned_cols=445 Identities=11% Similarity=0.057 Sum_probs=194.5 Q ss_pred CCh---hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCC--CCCcccccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAE---TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSE--VNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~---~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~--~~~~~~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |.+ .+.+++.+..++.+..| .++|+++.+|....-....... .....+.+.++..+.+...++..++-|++ T Consensus 16 ~~~~~~l~~~~i~~li~~~~~~~---~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G- 91 (506) T protein:vir:94 16 QESLENLTPNKIMKFITHHFNYQ---RPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVG- 91 (506) T ss_pred ccchhcCCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcc- Confidence 322 23344444444444433 4467777777543311100000 01111234567777888888888875543 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeec Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPI 155 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l 155 (559) .| +.++..++. ....+...+..++|.....++.++..++|.+.+++..+...-+++..++. 
T Consensus 92 -----~p-~~~~~~d~~-------------~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p 152 (506) T protein:vir:94 92 -----NP-INVKLPDDG-------------SNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDP 152 (506) T ss_pred -----cC-ceeecCcch-------------HHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcc Confidence 12 223333322 12335556667899999999999999999999988887666678888888 Q ss_pred cEEEEeeCC--CCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEE Q lcl|NC_019445. 156 GSYYLANSP--RGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFK 233 (559) Q Consensus 156 ~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~ 233 (559) .+.++-.|. .+.+...+|.+...-. .++....+..++.+|-... T Consensus 153 ~~~~~v~dd~~~~~~~~~v~~~~~~~~--------------------~~~~~~~~~~~~~~yt~~~-------------- 198 (506) T protein:vir:94 153 LDTFVIYSTDVDPKPIMAVRYHQIELV--------------------DDNQVSTINYVPETWTADT-------------- 198 (506) T ss_pred cceEEEecCCCCCceEEEEEEEeeeec--------------------cCCceeEEEEEEEEEeCce-------------- Confidence 888776664 4555555554432200 0111111222222221110 Q ss_pred EEEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCC Q lcl|NC_019445. 234 SVYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSL 311 (559) Q Consensus 234 sv~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~ 311 (559) .+++........+. ..-+|..+|++.++= +..|.|. .+...+.+-.++.+.-..+...+..++|.+++.+.. T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~sd-~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 272 (506) T protein:vir:94 199 YTLYNPTPIMGKMQVDTTKPITTFPVVEFKN-----SNFRLGD-FENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDI 272 (506) T ss_pred EEEeccccCccceeccccccCCccceEEecC-----CCCCCCc-hhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCc Confidence 01111111111111 123566788876532 3457785 778888888999988888888888888876654321 Q ss_pred cccc--cee--------------------------cCCceeecCCcC------CchhhhhhhhccccHHHHHHHHHHHHH Q lcl|NC_019445. 
312 KNQR--ASL--------------------------LPGDITYIDQIT------GQDGFRPAYLVNPSTADLVADIQDTRQ 357 (559) Q Consensus 312 ~~~~--~~~--------------------------~pg~~~~~~~~~------~~~~~~p~~~~~~~~~~~~~~i~~~~~ 357 (559) .... ... .-+++..++..+ ....++-+ +-+.+...+...+..+.. T Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l-~~~~~~~~~~~~~~~l~~ 351 (506) T protein:vir:94 273 DTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYI-NKTYDVVGSEAYKKRVAG 351 (506) T ss_pred cccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceee-eecCCHHHHHHHHHHHHH Confidence 0000 000 000111111100 00111111 112244555566666666 Q ss_pred HHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceE Q lcl|NC_019445. 358 IINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLK 437 (559) Q Consensus 358 rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~ 437 (559) .|-..-+.. .......+...|+..+..+..-+.. -.....+...+.+..+++-++.++...+... .+....++ T Consensus 352 ~I~~~s~~p--~~~~~~~~~n~Sg~Aik~~~~~l~~-k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~----~~d~~~i~ 424 (506) T protein:vir:94 352 DIHKFSHTP--DLTDENFASNSSGVAMQYKVLGTVE-LASTKRRMFERGLYARYQIISDIENSIHGDW----TFDPQELT 424 (506) T ss_pred HHHHHhCcc--ccccccccccchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc----ccccccce Confidence 664433221 1111111234566555544332222 1222233333444444444444443322111 12233456 Q ss_pred EEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 438 VEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQ 517 (559) Q Consensus 438 ~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~ 517 (559) |.+.-++.. +. ...++.+..+++ .+....++..+ -+++ -.++|++++.++++++.... 
T Consensus 425 i~f~~~~p~-----d~---~e~a~~~~kl~g-------~iS~et~~~~l---p~v~----d~~~E~~ri~~E~~~~~~~~ 482 (506) T protein:vir:94 425 FTFRDNLPA-----DN---ISQIKALVQAGA-------TLPQKYLYQQL---PGVT----NPQDIVDMMKEQSANGDYSF 482 (506) T ss_pred EEeCCCCCc-----CH---HHHHHHHHHHhc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHhhcc Confidence 666444321 11 111122222222 13333333321 1332 12456665555432211110 Q ss_pred HHHHHHHHHHHHHhhhhhhcCC---ChhHH Q lcl|NC_019445. 518 QMMAMGMAAAQGAKTLSEAKTS---DPSVL 544 (559) Q Consensus 518 ~~~~~~~~~~~~a~~~~~~~~~---~~~~~ 544 (559) .........-...... +.+.. T Consensus 483 ------~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 483 ------DQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred ------hhhcCCCcccCccccccccccCCC Confidence 0000000000000000 00000 No 119 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.09 E-value=4.8e-06 Score=49.75 Aligned_cols=458 Identities=10% Similarity=0.059 Sum_probs=206.0 Q ss_pred CChh-----hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCC--cc-cccCCCCcchHHHHHHHHHHHH Q lcl|NC_019445. 1 MAET-----TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNR--ND-RRNTRIIDSTGTMAARTLASGM 72 (559) Q Consensus 1 M~~~-----~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~--~~-~~~~~~~~s~~~~a~~~Las~l 72 (559) |.+- .-..+..+|+..+.--.. ...|++...-.||.-.....+..+. -. +...-.|-+...+.++.++..+ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 110 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSG-QEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQV 110 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchh Confidence 8762 123344445444443332 4667777777777743221111110 11 1112356666666666665544 Q ss_pred HHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee-cCCce---- Q lcl|NC_019445. 
73 MSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE-DDEDI---- 147 (559) Q Consensus 73 ~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~-~~~~~---- 147 (559) + - ..|.+.+ + ..++.+++.|+. .-.+++.-+..++.+...+|-+.++|+- ..+.. T Consensus 111 f----r-k~p~~~~--p-------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~a 170 (535) T protein:vir:80 111 F----S-RDPIRQL--P-------PALEAIVEDIDG------EGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVL 170 (535) T ss_pred h----c-CCcceec--c-------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHH Confidence 3 2 2233422 1 223444444332 2457788888899999999999999983 22211 Q ss_pred --------EEEEEeeccEEE-Eee---CCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEe Q lcl|NC_019445. 148 --------IRTMPFPIGSYY-LAN---SPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVY 215 (559) Q Consensus 148 --------~~~~~~~l~~~~-v~~---d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~ 215 (559) -.+..|+..+.. ++. ++.+++.-+..+++.+.++ ..|| .+.++.|-++. T Consensus 171 de~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f~-----------------~~~~~q~RvL~ 231 (535) T protein:vir:80 171 EQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGFE-----------------TTYVQQWRVLQ 231 (535) T ss_pred HHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCcc-----------------cceeEEEEEEE Confidence 123344433322 121 2233344333333322211 2332 12344444454 Q ss_pred ecCcccccccccccccEE-EEEEEecC------CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHH Q lcl|NC_019445. 216 PNIDRDTSKLDSKNKPFK-SVYYEVGG------DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQL 288 (559) Q Consensus 216 p~~~~~~~~~~~~~~~~~-sv~~~~~~------~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~ 288 (559) |..++. |. .+|-.... ..+.+.-.+|-+.+++|++.|.-..++.+..|.|- .+ |+..||. 
T Consensus 232 ~~~~G~----------y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pP--Ll-~LA~lni 298 (535) T protein:vir:80 232 LNAEGN----------YQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPP--LL-DLCEVNI 298 (535) T ss_pred ecCCce----------EEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccc--hH-HHHHHHH Confidence 432211 11 11111110 11223333455678889998887777777666542 22 4444443 Q ss_pred HH---HH-HHHHHHHHhcCceeecC-------CC-ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHH Q lcl|NC_019445. 289 LQ---KR-KSQLIDKATNPPMVAPT-------SL-KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTR 356 (559) Q Consensus 289 l~---~~-~~~~~~~~~~p~~~~p~-------~~-~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~ 356 (559) -. .+ .-+.+..+..|.+.+.+ +. ....+.+.|++.+..+..++-..+++ +++ .+. .+.+.+++ T Consensus 299 ~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~--~~~-~~a--~~~l~~~e 373 (535) T protein:vir:80 299 GHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQI--TPN-SVP--FEAMTHKE 373 (535) T ss_pred HHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeee--ccc-hhH--HHHHHHHH Confidence 22 22 23346667777655432 11 22335677777766554322222322 111 222 34567777 Q ss_pred HHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcc Q lcl|NC_019445. 357 QIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMP 435 (559) Q Consensus 357 ~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~ 435 (559) +++.+.-.. ++ .......||+|.....+.....|..+..++++- ++++|.++-+. |..+ ++.. 
T Consensus 374 ~qM~~lGa~----ll-~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~a-----l~~aL~~~A~w~G~~~------~~~~ 437 (535) T protein:vir:80 374 SQMIAMGAN----LL-VKSGGNRTFGEAQQEEASEQSILSACTKNVSMA-----FRKALRWANQFQTGIV------NDET 437 (535) T ss_pred HHHHHHHHH----hh-ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHH-----HHHHHHHHHHHcCCcc------CCCc Confidence 777654321 22 234556899999999998888898888887663 34455555443 3211 1223 Q ss_pred eEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHH Q lcl|NC_019445. 436 LKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQ 515 (559) Q Consensus 436 v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q 515 (559) +++++-.-... ...+.+.+..++..+. .+ .|..+.+...+ ...||...-+..++|...+..+. T Consensus 438 ~~i~~n~dF~~--~~ld~~~~~all~~~~----~G-----~Is~et~~~~L-~r~gvl~~~~~~eee~~ri~~E~----- 500 (535) T protein:vir:80 438 VEYNLNTDFPA--ARLTPNERAELILEWQ----QG-----AITFKEMRAGL-RRAGVASEDDAKAETEGKATVEF----- 500 (535) T ss_pred eEEEecccccc--ccCCHHHHHHHHHHHh----cC-----CCCHHHHHHHH-HhCCCCCcccchHHHHHHHHhhh----- Confidence 34433221110 0012222333332221 11 36666666666 44466432222233322221111 Q ss_pred HHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 516 QQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 516 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) . .....++.-....+++++ ++-+.-..++++++. 
T Consensus 501 ~--------~~~~~~g~~~d~~~~g~~--~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 501 I--------AKTAAAGKVGDAASGGTN--KAKLNNGNGGGNQAG 534 (535) T ss_pred h--------hccccCCCCCCCCCCCCC--cCcccCCccccccCC Confidence 0 001011111111222211 111112445555555 No 120 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.04 E-value=6.3e-06 Score=49.11 Aligned_cols=434 Identities=10% Similarity=0.045 Sum_probs=174.7 Q ss_pred CChhhHHHHHHH-HHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCC-CcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQ-FAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVN-RNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r-~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~-~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |...+..++.++ ...+.. ..++.+.+.+|..-.-. ........ ..+....++..+-+..+++.+++.|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~----~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~--- 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) T ss_pred CCCCCHHHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc--- Confidence 888776555443 333332 33444455555421100 00011111 111123445666777777777775432 Q ss_pred CCCCcceeccCC-ccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeecc Q lcl|NC_019445. 78 SPARPWFRLATP-DPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIG 156 (559) Q Consensus 78 pp~~~Wf~l~~~-d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~ 156 (559) .++. +... |.+. ... +++.+.++++-....++.++..+||.+.+++..+...-.++..++.. 
T Consensus 74 ---~~~~-~~~~~d~~~--~~~-----------~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~ 136 (456) T protein:vir:10 74 ---NGIT-VGGSADSDL--ALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPE 136 (456) T ss_pred ---CCee-cCCCCCcch--HHH-----------HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccc Confidence 2332 2222 2211 111 33345567888888999999999999999888776655677788888 Q ss_pred EEEEeeCCCC--CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 157 SYYLANSPRG--SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 157 ~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) +.++..|+.- ++...+|.++.. ...+.-. ..+ ..+..++.+-.++...+.. . T Consensus 137 ~~~~i~d~~~~~~~~~~i~~~~~~-----d~~~~~~------~~~---~~~~~~~~~~~~~~~~~~~-----~------- 190 (456) T protein:vir:10 137 TMVVSVDPLQPWRIRAAMRWWRDL-----DAESDFA------IVW---SGDGWQKFARPCFVQSSSR-----R------- 190 (456) T ss_pred eeEEEEcCCCCcceEEEEEEEEec-----CCceeEE------EEE---eccceeEEEEEEEEeeccc-----c------- Confidence 8887777543 334444443210 0000000 000 0000011110000000000 0 Q ss_pred EEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCC-- Q lcl|NC_019445. 235 VYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTS-- 310 (559) Q Consensus 235 v~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~-- 310 (559) ......++..... .+.++..+|++.. .+..|.|. .+..++.+..++...-..+..++..+.|.+.+.+. 
T Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd-~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~ 263 (456) T protein:vir:10 191 RLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGE-VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEH 263 (456) T ss_pred eeeeecCCceeeccccCCCCCceeEEEe------cCCCCCch-hhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCc Confidence 0001111111111 1122344554432 23578895 88888888888888887777788877776544321 Q ss_pred --------Ccc----ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 311 --------LKN----QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 311 --------~~~----~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +.. ..+...+|.++..+ .+.+ +.-+ ...++....+.++.+...|...--.. ...+.. +... T Consensus 264 ~~~~~d~~g~~~~~~~~~~~~~~~~~~~~--~~~~-~~q~--~~~~~~~~~~~l~~~i~~~~~~s~~p-~~~~~~-~~~N 336 (456) T protein:vir:10 264 GLPNVDENGNAIDYASIFEAAPGALWELP--PGVD-IWES--QANDFTPMLSAIKEHIRQLSSATKTP-LPMLMP-DSAN 336 (456) T ss_pred ccccccccccccchhhhhhhhccccccCC--CCcc-eEEe--cccChhHHHHHHHHHHHHHHhccCCC-hHHhcc-cccC Confidence 100 01223344433322 1111 1111 11233444444444444432111110 111111 1123 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .++.-+.....-+.. ..++.+ ..+.+-+.+.+.++.+..-- ++ ...+++.+.-++..- .++. 
T Consensus 337 ~Sg~Ai~~~~~~l~~----k~~~~~-~~f~~~l~~~~rl~~~~~g~---~~---~~~~~v~w~~~~~~~-~~~~------ 398 (456) T protein:vir:10 337 QSAEGAHNIEKGFLF----KCEDRL-SIAKIGLEAILVKALQIEGE---SV---EDTVDVSFESPDRVT-LGEK------ 398 (456) T ss_pred hHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhcCC---Cc---ccceeEEecCCCCcC-HHHH------ Confidence 345433332222211 222222 34445556666665543211 11 123666664443210 0111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) ++.+..+.+++ +=..+ .....+|++ +++++++..+|...++.+ ++ ++.++. +. T Consensus 399 -ada~~kl~~~g------i~~~~---~~~~~lg~~------~~~i~~~e~er~~~e~~~----~~---~~~~~~----~~ 451 (456) T protein:vir:10 399 -YSAASLAKAAG------ESWAS---IRRNILNYN------ADQIKQDDLDRAREQITL----FA---GNPVQR----PQ 451 (456) T ss_pred -HHHHHHHHHcC------CChHH---HHHhhCCCC------HHHHHHHHHHHHHHHHHH----Hh---hhhhhc----CC Confidence 12222222211 10111 112345664 344433323322222211 11 111111 11 Q ss_pred CChhH Q lcl|NC_019445. 539 SDPSV 543 (559) Q Consensus 539 ~~~~~ 543 (559) .+.+- T Consensus 452 ~~~~~ 456 (456) T protein:vir:10 452 EDGSR 456 (456) T ss_pred CCCCC Confidence 11111 No 121 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.04 E-value=6.3e-06 Score=49.11 Aligned_cols=434 Identities=10% Similarity=0.045 Sum_probs=174.7 Q ss_pred CChhhHHHHHHH-HHHHHHHhhhHHHHHHHHHHHhccccC-CCCCCCCC-CcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQ-FAQLESERQSFEPHWRELSDYINPRGS-RFLTSEVN-RNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r-~~~l~~~R~~~~~~w~e~~~~~~P~~~-~~~~~~~~-~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |...+..++.++ ...+.. ..++.+.+.+|..-.-. ........ 
..+....++..+-+..+++.+++.|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~----~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~--- 73 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) T ss_pred CCCCCHHHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc--- Confidence 888776555443 333332 33444455555421100 00011111 111123445666777777777775432 Q ss_pred CCCCcceeccCC-ccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeecc Q lcl|NC_019445. 78 SPARPWFRLATP-DPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIG 156 (559) Q Consensus 78 pp~~~Wf~l~~~-d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~ 156 (559) .++. +... |.+. ... +++.+.++++-....++.++..+||.+.+++..+...-.++..++.. T Consensus 74 ---~~~~-~~~~~d~~~--~~~-----------~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~ 136 (456) T protein:vir:10 74 ---NGIT-VGGSADSDL--ALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPE 136 (456) T ss_pred ---CCee-cCCCCCcch--HHH-----------HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccc Confidence 2332 2222 2211 111 33345567888888999999999999999888776655677788888 Q ss_pred EEEEeeCCCC--CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 157 SYYLANSPRG--SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 157 ~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) +.++..|+.- ++...+|.++.. ...+.-. ..+ ..+..++.+-.++...+.. . 
T Consensus 137 ~~~~i~d~~~~~~~~~~i~~~~~~-----d~~~~~~------~~~---~~~~~~~~~~~~~~~~~~~-----~------- 190 (456) T protein:vir:10 137 TMVVSVDPLQPWRIRAAMRWWRDL-----DAESDFA------IVW---SGDGWQKFARPCFVQSSSR-----R------- 190 (456) T ss_pred eeEEEEcCCCCcceEEEEEEEEec-----CCceeEE------EEE---eccceeEEEEEEEEeeccc-----c------- Confidence 8887777543 334444443210 0000000 000 0000011110000000000 0 Q ss_pred EEEEecCCCceee--eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCC-- Q lcl|NC_019445. 235 VYYEVGGDNDKLL--RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTS-- 310 (559) Q Consensus 235 v~~~~~~~~~~il--~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~-- 310 (559) ......++..... .+.++..+|++.. .+..|.|. .+..++.+..++...-..+..++..+.|.+.+.+. T Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd-~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~ 263 (456) T protein:vir:10 191 RLVTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGE-VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEH 263 (456) T ss_pred eeeeecCCceeeccccCCCCCceeEEEe------cCCCCCch-hhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCc Confidence 0001111111111 1122344554432 23578895 88888888888888887777788877776544321 Q ss_pred --------Ccc----ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 311 --------LKN----QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 311 --------~~~----~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +.. ..+...+|.++..+ .+.+ +.-+ ...++....+.++.+...|...--.. ...+.. +... 
T Consensus 264 ~~~~~d~~g~~~~~~~~~~~~~~~~~~~~--~~~~-~~q~--~~~~~~~~~~~l~~~i~~~~~~s~~p-~~~~~~-~~~N 336 (456) T protein:vir:10 264 GLPNVDENGNAIDYASIFEAAPGALWELP--PGVD-IWES--QANDFTPMLSAIKEHIRQLSSATKTP-LPMLMP-DSAN 336 (456) T ss_pred ccccccccccccchhhhhhhhccccccCC--CCcc-eEEe--cccChhHHHHHHHHHHHHHHhccCCC-hHHhcc-cccC Confidence 100 01223344433322 1111 1111 11233444444444444432111110 111111 1123 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .++.-+.....-+.. ..++.+ ..+.+-+.+.+.++.+..-- ++ ...+++.+.-++..- .++. T Consensus 337 ~Sg~Ai~~~~~~l~~----k~~~~~-~~f~~~l~~~~rl~~~~~g~---~~---~~~~~v~w~~~~~~~-~~~~------ 398 (456) T protein:vir:10 337 QSAEGAHNIEKGFLF----KCEDRL-SIAKIGLEAILVKALQIEGE---SV---EDTVDVSFESPDRVT-LGEK------ 398 (456) T ss_pred hHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhcCC---Cc---ccceeEEecCCCCcC-HHHH------ Confidence 345433332222211 222222 34445556666665543211 11 123666664443210 0111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) ++.+..+.+++ +=..+ .....+|++ +++++++..+|...++.+ ++ ++.++. +. T Consensus 399 -ada~~kl~~~g------i~~~~---~~~~~lg~~------~~~i~~~e~er~~~e~~~----~~---~~~~~~----~~ 451 (456) T protein:vir:10 399 -YSAASLAKAAG------ESWAS---IRRNILNYN------ADQIKQDDLDRAREQITL----FA---GNPVQR----PQ 451 (456) T ss_pred -HHHHHHHHHcC------CChHH---HHHhhCCCC------HHHHHHHHHHHHHHHHHH----Hh---hhhhhc----CC Confidence 12222222211 10111 112345664 344433323322222211 11 111111 11 Q ss_pred CChhH Q lcl|NC_019445. 
539 SDPSV 543 (559) Q Consensus 539 ~~~~~ 543 (559) .+.+- T Consensus 452 ~~~~~ 456 (456) T protein:vir:10 452 EDGSR 456 (456) T ss_pred CCCCC Confidence 11111 No 122 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=97.83 E-value=1.6e-05 Score=46.90 Aligned_cols=426 Identities=8% Similarity=-0.006 Sum_probs=186.5 Q ss_pred hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccCCCC-CCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 4 TTKERLNKQFAQLESERQSFEPHWRELSDYI--NPRGSRFL-TSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 4 ~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~--~P~~~~~~-~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) .+.+.|.+..+..+..+ +.....+++|.=- ++.+.... ........+.+.++..+.+...++..++-|+ + T Consensus 1 l~~~~i~~~i~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~------G 73 (451) T protein:vir:10 1 MELEKIRAIISADAARR-QEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMF------T 73 (451) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhhee------c Confidence 44455555555555433 3333333333211 11111100 0011111122335666677777777666332 2 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCC--------ceEEEEE Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDE--------DIIRTMP 152 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~--------~~~~~~~ 152 (559) .| ..+...+.. +..+ .+. .+...+|-....++.++...+|.|.+++..+.. ..+++.. 
T Consensus 74 ~p-~~~~~~~~~-----~~~~-------~~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~ 139 (451) T protein:vir:10 74 YP-VLFDIDNNK-----ELNE-------KVT-DVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGV 139 (451) T ss_pred cc-ceeecCCcH-----HHHH-------HHH-HHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEE Confidence 22 122222221 1111 111 222468888999999999999999887655432 3467777 Q ss_pred eeccEEEEeeC--CCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCC-CceEEEEEEEeecCccccccccccc Q lcl|NC_019445. 153 FPIGSYYLANS--PRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTY-EKWIEVMHSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 153 ~~l~~~~v~~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~-~~~v~v~~~v~p~~~~~~~~~~~~~ 229 (559) ++..+.++-.| -.+++...+|.+...-.. .+.. ...+..++ +|-... - T Consensus 140 i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~-------------------~~~~~~~~~~~~e-~yt~~~---------~ 190 (451) T protein:vir:10 140 VNTEEIIPIYRNGIERELEAVIRYYIQLEDV-------------------KGQIQKQAYTYVE-FWTDKI---------L 190 (451) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeeecc-------------------cccccceEEEEEE-EEeCCe---------E Confidence 87777765543 357777777766433110 0010 01111111 111100 0 Q ss_pred ccEEEEEEEecCCCceee---eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_019445. 230 KPFKSVYYEVGGDNDKLL---RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV 306 (559) Q Consensus 230 ~~~~sv~~~~~~~~~~il---~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~ 306 (559) ..|.+ .........+. .+-+|..+|++.+.. +.+|.|. 
.+...+.+..++.+.-..+...+...+|.++ T Consensus 191 ~~~~~--~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~~d-~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~ 262 (451) T protein:vir:10 191 DKYKF--FGVSCCGSQIEHITVQHRFNSVPFVEFSN-----NIKKQSD-LSKYKKILDLYDRVMSGFANDLEDIQQIIYI 262 (451) T ss_pred EEEEe--cccCccccccccccccCCCCeeeEEEecc-----CCCCCCc-hhhHHHHHHHHHHHHHHHHHHHHHhccceee Confidence 01110 00111111111 123677888876653 4567885 7888899999999999999999999999888 Q ss_pred ecCCCcc--c--cceecCCceeecCCcC--CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcC Q lcl|NC_019445. 307 APTSLKN--Q--RASLLPGDITYIDQIT--GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMP 380 (559) Q Consensus 307 ~p~~~~~--~--~~~~~pg~~~~~~~~~--~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~T 380 (559) +.+-... . .-.+..++++.+...+ ....+..+ +.+.+...+...+..++..|-..-+..-+ .....+..| T Consensus 263 ~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~l~~~I~~~s~~p~~---~~~~~gn~S 338 (451) T protein:vir:10 263 LENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTM-QIEIPTEARKIILEILKKQIYESGQGLQQ---DTENFGNAS 338 (451) T ss_pred eecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEE-eecCCHHHHHHHHHHHHHHHHHHhCcccc---ccccccccc Confidence 7642111 1 1233444444443211 11223322 12234566666777777777655543211 111112334 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 381 VEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAST 459 (559) Q Consensus 381 A~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~ 459 (559) +.-+..+-.-+.. ...+.+. .+.+.+.+.+.++.+. |.. .-.++++.|.-.+.+ ++... 
T Consensus 339 g~Alk~~~~~l~~----k~~~k~~-~f~~~l~~~~~li~~~~~~~-------d~~~i~i~f~~~~p~--------n~~e~ 398 (451) T protein:vir:10 339 GVALKFFYRKLEL----KSGLLET-EFRTSFDKLIKAILYFLGVT-------DYKKIQQTYTRNMMS--------NDLED 398 (451) T ss_pred HHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHhCCC-------CccceeEEecCCCCC--------CHHHH Confidence 4333332221111 1223322 3344445555555542 221 123466766555432 11112 Q ss_pred HHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019445. 460 VNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQ-VDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSE 535 (559) Q Consensus 460 ~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~e-v~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~ 535 (559) ++.+..+++ .+.-..++.. ++ ++-+.++ .+.+.++++ .+. .+..+..+.+.+ T Consensus 399 ~~~~~kl~g-------~iS~et~~~~----~p----~v~d~~~e~~~~~ee~~--~~~-------~~~~~~~~~~~~ 451 (451) T protein:vir:10 399 ADIATKSVG-------IIPTKIILRH----HP----WVDDVEEAEKLYLEEKK--IQA-------SKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHhc-------cCchHHHHHh----CC----CCCCHHHHHHHHHHHHH--HHH-------HHHHhhcCCCCC Confidence 223333322 1332333222 22 1222222 211111111 111 111222222222 No 123 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.82 E-value=1.7e-05 Score=46.76 Aligned_cols=448 Identities=9% Similarity=0.034 Sum_probs=194.8 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhH--HHHHHHHHHHhccccCCCCCCCCC--Cccc-ccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSF--EPHWRELSDYINPRGSRFLTSEVN--RNDR-RNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~--~~~w~e~~~~~~P~~~~~~~~~~~--~~~~-~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.+=+ ...+...-.+++--|.-+ ...|++...-.||..........+ .-.. 
...-.|-+...+.++.+.+.++ T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 88621 112333333333444444 356677777777764322111111 1111 1122455555555555554433 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC--Cce---- Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD--EDI---- 147 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~--~~~---- 147 (559) . -|| .++.+ ..++.+++.|+. .-.+++.-+..++.+...+|-+.++|+-.. ..+ T Consensus 81 ~--k~p-----~~~~p-------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~ 140 (501) T protein:vir:95 81 M--RDP-----VVKVP-------ALLNPLVANATG------SGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASI 140 (501) T ss_pred c--CCc-----ceeCc-------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccH Confidence 2 111 12211 223334444332 245778888889999999999999998432 111 Q ss_pred ---------EEEEEeeccEEE-Ee---eCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEE Q lcl|NC_019445. 148 ---------IRTMPFPIGSYY-LA---NSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSV 214 (559) Q Consensus 148 ---------~~~~~~~l~~~~-v~---~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v 214 (559) -.+..|+..+.. ++ .+...++.-+..+++.+.++ ..|+. 
..++.|.++ T Consensus 141 a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~~-----------------~~~~q~RvL 201 (501) T protein:vir:95 141 ADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFEM-----------------KTSGQFRVL 201 (501) T ss_pred HHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCccc-----------------ceeEEEEEE Confidence 123444443321 12 12222333332222222111 22321 234444444 Q ss_pred eecCcccccccccccccEEEEEEEecC---------------CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHH Q lcl|NC_019445. 215 YPNIDRDTSKLDSKNKPFKSVYYEVGG---------------DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLA 279 (559) Q Consensus 215 ~p~~~~~~~~~~~~~~~~~sv~~~~~~---------------~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~ 279 (559) .+..+. .....+|..... .....+..+|-+.+++|++.|.-..+..++.|.|- . T Consensus 202 ~~~~~g---------~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pP--L 270 (501) T protein:vir:95 202 RLDEEG---------YYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPN--F 270 (501) T ss_pred eeCCCc---------eEEEEEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccc--h Confidence 432110 011122222110 01123344566789999999987777777666442 2 Q ss_pred HHHHHHHHHH--H--HHHHHHHHHHhcCceeecC--C-----CccccceecCCceeecCCcCCchhhhhhhhccccHHHH Q lcl|NC_019445. 280 LGPVKALQLL--Q--KRKSQLIDKATNPPMVAPT--S-----LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADL 348 (559) Q Consensus 280 l~d~~~L~~l--~--~~~~~~~~~~~~p~~~~p~--~-----~~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~ 348 (559) + ++..||.- . ...-..+..+..|.+.+.+ + +....+.+.|+..+..+..++-..+++ +++ .+ . 
T Consensus 271 l-~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~--~~~-~i--~ 344 (501) T protein:vir:95 271 Y-DLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQA--SEN-TM--L 344 (501) T ss_pred H-HHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccccccCCCCCceeEEec--Chh-hH--H Confidence 2 44444432 1 2233456667777655432 1 122234556665555543333333443 111 22 2 Q ss_pred HHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCC Q lcl|NC_019445. 349 VADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPP 427 (559) Q Consensus 349 ~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~ 427 (559) .+.|.+++++++++= .. ++ ..+....||++.+.+.......|..+..+++.- + +++|.++-+. |+- T Consensus 345 ~~~l~~l~~~m~~~G-a~---ll-~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~a-l----~~~l~~~a~w~g~~--- 411 (501) T protein:vir:95 345 KEAMDTKERQMVALG-AK---LV-EQKEVQRTATEAELEAASEGSTLSSATKNVSAA-F----EWALKWAARWVGQA--- 411 (501) T ss_pred HHHHHHHHHHHHHHH-Hh---hc-cCCccchhHHHHHHHHHHHhHHHHHHHHHHHHH-H----HHHHHHHHHHcCCC--- Confidence 455777777776654 11 22 344456899999999999999999998888653 3 3444444442 321 Q ss_pred chhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHH Q lcl|NC_019445. 428 PDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQAR 507 (559) Q Consensus 428 p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~r 507 (559) +. .++|++..=+... ..+.+.+..++... +.+ .|..+.+.+.+.+ .||+..- .++|.++++ T Consensus 412 ~~-----~~~v~i~~df~~~--~~~~~~~~al~~~~----~~G-----~is~~t~~~~L~~-~~v~~~~--~~~e~e~i~ 472 (501) T protein:vir:95 412 DS-----GVKFELNTDFDIA--RMTPDERRSLVEEW----QKG-----AITFEEMRTGLRK-AGVATED--DSKAKEKIA 472 (501) T ss_pred CC-----ceEEEEecccccc--cCCHHHHHHHHHHH----hCC-----CCcHHHHHHHHHh-CCCCChh--HHHHHHHHH Confidence 11 1233322111100 01122222222221 111 3666666666654 4665321 122222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 
508 QQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 508 q~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) .+.+++....+.... ..-..++++++.. + T Consensus 473 ~~~~~~~~~~~~~~~-~~~~~gg~~~~~~-----~ 501 (501) T protein:vir:95 473 KDTAEAMALATPANV-PGDGSGGDNVGNS-----E 501 (501) T ss_pred hhhcCcccccccCCC-CCCCcccccccCC-----C Confidence 221111000000000 0000111111111 1 No 124 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.49 E-value=5.7e-05 Score=43.87 Aligned_cols=459 Identities=10% Similarity=0.027 Sum_probs=193.4 Q ss_pred CChhhHHH----------HHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCc-ccccCCCCcchHHHHHHHHH Q lcl|NC_019445. 1 MAETTKER----------LNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRN-DRRNTRIIDSTGTMAARTLA 69 (559) Q Consensus 1 M~~~~~~~----------l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~~~~~s~~~~a~~~La 69 (559) |.|...+. ...+|+.++..-.. ....++...-.||... ..+...- .+...-.|-+...+.++.++ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLPk~~---~E~~~~Y~~rl~rA~~~n~~~~tl~~l~ 76 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGG-TEAMREAGETYLPRHQ---EETDKGYQERLASAVLLNMVEQTLDTLS 76 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhhcccCCCCCC---CCCHHHHHHHHhcccCCChHHHHHHHHh Confidence 88865332 23333333332211 3444555555566542 1111111 11223367777777777777 Q ss_pred HHHHHhhcCCCCcceeccCCccchhhHHHHHH-HHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce- Q lcl|NC_019445. 70 SGMMSGITSPARPWFRLATPDPEMMDYGPVKL-WLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI- 147 (559) Q Consensus 70 s~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~-~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~- 147 (559) ..++.- ||.. |.. . -..... +++.|.. 
.-.+++.-+..++.+.+.+|-+.++|+-....+ T Consensus 77 G~vf~k--~p~~-~~~------~---p~~~~~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~ 138 (513) T protein:vir:97 77 GKPFSE--PIKL-NED------V---PKAIEETILPDVDL------QGNNLDVFARQWFREGMAKALCHVLIDMPRPAPR 138 (513) T ss_pred hhhhhc--Cccc-CcC------c---hHHHHHHHhhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCc Confidence 544331 2211 111 0 112222 2233322 245778888889999999999999997432110 Q ss_pred -----------------EEEEEeeccEEE---Eee-CCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCc Q lcl|NC_019445. 148 -----------------IRTMPFPIGSYY---LAN-SPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEK 206 (559) Q Consensus 148 -----------------~~~~~~~l~~~~---v~~-d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~ 206 (559) -.+..|+..+.. .++ +..+.+.-+.-+.+...+ +.|+ .. T Consensus 139 ~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~-----------------~~ 198 (513) T protein:vir:97 139 EDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFA-----------------EV 198 (513) T ss_pred cchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCc-----------------ce Confidence 123344444321 111 222233323222222210 1121 12 Q ss_pred eEEEEEEEeecCcccccccccccccEEEEEEEecCC----CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHH Q lcl|NC_019445. 207 WIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGD----NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGP 282 (559) Q Consensus 207 ~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~----~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d 282 (559) .++.|.+..+. . | .+|...++. .+..+..+|-..+++|++.|....++.+..|.|- . 
=+ T Consensus 199 ~~~q~rvL~~g------~-------~-~v~r~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pP--L-l~ 261 (513) T protein:vir:97 199 CKRRIRVLEPG------L-------V-QLWEPVKKSNAQKEEWALADEWATGLNYVPLVTFYADRQGFMMGKPP--L-LD 261 (513) T ss_pred EEEEEEEEeCc------e-------E-EEEEeecCCCccccceEEecCCCCcCCceeEEEEecCCCCCCCCccc--h-HH Confidence 22222222110 0 1 122111111 1123333444568889988888777777666542 2 24 Q ss_pred HHHHHHHHH---H-HHHHHHHHhcCceeecCC--CccccceecCCceeecCCcCC-chhhhhhhhccccHHHHHHHHHHH Q lcl|NC_019445. 283 VKALQLLQK---R-KSQLIDKATNPPMVAPTS--LKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPSTADLVADIQDT 355 (559) Q Consensus 283 ~~~L~~l~~---~-~~~~~~~~~~p~~~~p~~--~~~~~~~~~pg~~~~~~~~~~-~~~~~p~~~~~~~~~~~~~~i~~~ 355 (559) +..||.-.- + .-..+..+..|.+.+... .....+.+.|++++..+..++ -..++|- + ..+....+.|.++ T Consensus 262 LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~~--g-~~i~~~~~~l~~l 338 (513) T protein:vir:97 262 LAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDPVVVGPNKVLYNPDPAGRFYYVEHT--G-QAIAAGRTDLKDL 338 (513) T ss_pred HHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCceEeeccccccCCCCCCcceeeccC--c-hhHHHHHHHHHHH Confidence 555543322 2 223455566665555421 111346678888776653222 2233332 1 2456667788888 Q ss_pred HHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCc Q lcl|NC_019445. 356 RQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGM 434 (559) Q Consensus 356 ~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~ 434 (559) ++.++++=. . ++. ..+...||++.+.+....-..|+.+...+++ .+++++.++-+. |. + + . 
T Consensus 339 e~qm~~~Ga-~---ll~-~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~-----al~~~l~~~a~wlg~--~-~-----~ 400 (513) T protein:vir:97 339 EEQMAGYGA-E---FLK-RKTGGQTATARALDSAEATSDLSAMTGLFED-----ALAQALDITADWLRL--G-P-----N 400 (513) T ss_pred HHHHHHHHH-H---hhc-cCCccccHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhCC--C-C-----C Confidence 888866652 1 222 2344689999999999999999998887755 334455555442 21 1 1 1 Q ss_pred ceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCC-CccccCCHHH-HHHHHHHHHH Q lcl|NC_019445. 435 PLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGV-SPTVIVPQEQ-VDQARQQRAQ 512 (559) Q Consensus 435 ~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gv-p~~~~rs~~e-v~~~rq~r~q 512 (559) .++++|-.-..... .+.+.+..+++.. +.+ .|....+.+.+-+ .|| ++.+ +.++ .++++. +-+ T Consensus 401 ~~~v~in~dF~~~~--~~~~~~~al~~a~----~~G-----~is~~t~~~~L~r-~gvl~~d~--d~~~~~e~~~~-~~~ 465 (513) T protein:vir:97 401 GGTVELVKDYDLEE--MDAPGLQALQVAR----EKR-----DISRKTYLNGLRL-RGVLPEDF--DEDEDWEELME-EIS 465 (513) T ss_pred ccEEEeccccCccc--CCHHHHHHHHHHH----hCC-----CCCHHHHHHHHHh-ccCCCccC--CHHHHHHHHHH-hhh Confidence 23333322221110 1112222222211 111 2444445555544 233 2221 2222 111111 100 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhcCC-----ChhHHH-HHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAMGMAAAQGAKTLSEAKTS-----DPSVLS-AMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~~~~~~~~a~~~~~~~~~-----~~~~~~-~~~~~~~~~~~~~~ 559 (559) .+... ...-...+....+.... 
+....+ +-.|.+++-+|+.- T Consensus 466 ~~~~~-----~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 466 EAMGR-----AGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred hccCC-----CCccccccCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 00000 00000000111000000 001001 11111111111111 No 125 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=97.43 E-value=7e-05 Score=43.38 Aligned_cols=475 Identities=11% Similarity=0.043 Sum_probs=198.4 Q ss_pred CChhhHHHHH-HHHHHHHHHhhhHHHHHHHHHHHhcccc---CC---CCCCC---CCCcccccCCCCcchHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLN-KQFAQLESERQSFEPHWRELSDYINPRG---SR---FLTSE---VNRNDRRNTRIIDSTGTMAARTLAS 70 (559) Q Consensus 1 M~~~~~~~l~-~r~~~l~~~R~~~~~~w~e~~~~~~P~~---~~---~~~~~---~~~~~~~~~~~~~s~~~~a~~~Las 70 (559) |.-...+++. +.+..... +...++.+.+.+|..-.- .+ ..+.. .....+.+.|+..+.+...++..++ T Consensus 8 ~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~ 85 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMA--SNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQ 85 (537) T ss_pred ccHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhh Confidence 4333333222 22222221 122344555666643210 00 00000 0111123456777888888888887 Q ss_pred HHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 71 GMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 71 ~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) -|++- |+. ++..+... 
++ ..+.+...+ ..+|.....++.+++.++|.|.+++..+....+++ T Consensus 86 yl~G~--Pv~-----~~~~d~~~------~e----~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~ 147 (537) T protein:vir:78 86 YLLSN--GVE-----VKVKDEDN------TQ----LDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKF 147 (537) T ss_pred hhccc--Cce-----eecCcchh------HH----HHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEE Confidence 66543 322 23333221 11 122233333 36777788899999999999988887776667889 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEE---eecCcccc----- Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSV---YPNIDRDT----- 222 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v---~p~~~~~~----- 222 (559) ..++..+.+.-.|..+....++|.|.....+-.+ . ....-..+++|+.- +-+.+... T Consensus 148 ~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~-----~----------~~~~~~~~evyt~~~i~~y~~~~~~~~~~~ 212 (537) T protein:vir:78 148 QTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQ-----Q----------STETIWHADVWNEEAVCYYIQDDEGVSTTY 212 (537) T ss_pred EEEccceeEEEEcCCCCceeEEEEEeeeeccccc-----c----------CcceEEEEEEEcCCcEEEEEecCCcccccc Confidence 9999999888888888888888876654221100 0 00001122222100 00000000 Q ss_pred -cccccccccEEEEEEEecCC-------CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 223 -SKLDSKNKPFKSVYYEVGGD-------NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKS 294 (559) Q Consensus 223 -~~~~~~~~~~~sv~~~~~~~-------~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~ 294 (559) ........|...++...... .......-+|..+|++.++= +.+|.|. 
.+...+-+-.++.+.-..+ T Consensus 213 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~~sd-~e~v~~LiDayd~~~S~~a 286 (537) T protein:vir:78 213 KLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDGMSD-VKRVKSIIDDYDVMNCFLS 286 (537) T ss_pred cccccccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccCCCc-hhhhHHHHHHHHHHHHhhh Confidence 00000011111111111000 00011123567788776654 4578896 8899999999999999999 Q ss_pred HHHHHHhcCceeecCC-Ccc-cc--ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 295 QLIDKATNPPMVAPTS-LKN-QR--ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 295 ~~~~~~~~p~~~~p~~-~~~-~~--~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) ...+...+|.+++.+. +.. .. .++.-.+++-++. +...+..+ +...+...+...+..+++.|-+..+.. . T Consensus 287 n~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~--d~~~v~~l-~~~~~~~~~e~~ld~L~~~I~~~s~~~--~- 360 (537) T protein:vir:78 287 NNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNG--DNAGMEIQ-TVSIPYEARKAKMDIDVENIYRSGMGF--N- 360 (537) T ss_pred hHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecC--CCCceeEE-EecCCHHHHHHHHHHHHHHHHHhcCCC--C- Confidence 9999999998887653 111 11 1222233343432 12223322 122345555566777777665443322 1 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHH Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKS 450 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~ 450 (559) ......+..|..-+..+-.-+ .+-.....+...+.+.-+++-++.++...|. ..+....|+++|.-.+-. 
T Consensus 361 ~~~~~~gn~SGvAlk~~~~~l-~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~-----~~~d~~~i~i~f~~~~P~---- 430 (537) T protein:vir:78 361 STAVGDGNVTNVVIKSRYTLL-AMKARKMETSLRKVLRWCADMVVSDIALRGL-----GEYDSNDICFEIEPHVLA---- 430 (537) T ss_pred CccccccCCcHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-----cccccceeeEEeccCCCC---- Confidence 122233444543333322211 1122222333333333333333333333331 122334567777655431 Q ss_pred HHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_019445. 451 IGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ-G 529 (559) Q Consensus 451 ~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~-~ 529 (559) +...+. +.+..+.+.+ .+.-..++ ..++. +-+. |.+++.++ +..+...... ....+ . T Consensus 431 -n~~e~a---~~~~~l~~~g-----iiS~eT~l----~~~p~----vdd~-e~ek~~~e--e~~~~~~~~~--~~~~~~~ 488 (537) T protein:vir:78 431 -NELDIA---TTRKTEAETE-----ALKIGNIM----TVAPR----IGDD-ETLKLIAE--ELDLDYNELK--DALAEQD 488 (537) T ss_pred -CHHHHH---HHHHHHHhcC-----cchHHHHH----HhCCC----CCCH-HHHHHHHH--HHHhhhhhhh--hhhhhhc Confidence 111111 1111211111 12222222 12221 1122 22222111 1111100000 00000 0 Q ss_pred HhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 530 AKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 530 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ++.....+ -.+.+..+....+.+.. T Consensus 489 ~~~~~~~~-----~~~~~~~~~~~~~~~~~ 513 (537) T protein:vir:78 489 AQSLDVSP-----DVQAMLDGLPVNANQPP 513 (537) T ss_pred ccccCcCc-----chhhhcCCCCCCCCCCC Confidence 00000000 00111111111111100 No 126 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=97.29 E-value=0.0001 Score=42.41 Aligned_cols=435 Identities=9% Similarity=0.024 Sum_probs=192.0 Q ss_pred CCh----hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 
1 MAE----TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~----~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |.= +.-.....+|+.++.--.. ...+++...-.||... +.+...-+ +...-.|-+...+.++.++. . T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G-~~~~r~~g~~YLpk~~---~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G----~ 72 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLG-QREVKKKGVRFLPKLS---GQTDDMYNAYKQRALFYSITSKTLSALSG----M 72 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcC-hHHHHcCCcccCCCCC---CCCHHHHHHHHhhccCCchHHHHHHHHhc----h Confidence 762 2223344444444433222 2455555555566542 22111111 11222455555555555544 4 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce-EEEEEee Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI-IRTMPFP 154 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~-~~~~~~~ 154 (559) +|. .++ .++.++ ....+. .-..-.+++.-+...+.+...+|-+.++|+-+..+. -.+..|+ T Consensus 73 vf~--k~p-~~~~p~-------~l~~~~--------~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~ 134 (452) T protein:vir:94 73 VLD--QPP-VITHPD-------AMSKYF--------EDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYT 134 (452) T ss_pred hhc--CCc-eecccH-------HHHHHH--------hcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEec Confidence 442 222 123221 111111 124467788888889999999999999998654332 3344555 Q ss_pred ccEEE-EeeCCCCCE-EEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecC-cccccccc-cccc Q lcl|NC_019445. 155 IGSYY-LANSPRGSV-DICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNI-DRDTSKLD-SKNK 230 (559) Q Consensus 155 l~~~~-v~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~-~~~~~~~~-~~~~ 230 (559) ..+.. ++.+..|+. ..++|+ +...++-..+||. +.++.|.+..-.+ .+....+. ..+. 
T Consensus 135 ~~~Ii~W~~~~~g~l~~v~lre-~~~~~d~~d~f~~-----------------~~~~~yRvL~l~~g~~~v~~~~~~~~~ 196 (452) T protein:vir:94 135 TENILNWEEDEDGRLLMVVLRE-FYTVRDTADRYVQ-----------------NIRVRYRCLELVDGLLQITVHETQDGK 196 (452) T ss_pred hhhhcCccccccCCeeEEEEEE-EEEEecCCCcccc-----------------eeEEEEEEEEEeCCeEEEEEEEccCCc Confidence 44433 344445554 334443 3333333333432 1222222211000 00000000 0000 Q ss_pred cEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHH----HHHHHHHHHHHHhcCcee Q lcl|NC_019445. 231 PFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQL----LQKRKSQLIDKATNPPMV 306 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~----l~~~~~~~~~~~~~p~~~ 306 (559) +|+.. .. ..-.+|=..+++|++.|....++....|.|- |=|+..||. .+-..-+.+..+..|.+. T Consensus 197 -----~~~~~--~~-~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pP---Ll~LA~ln~~hy~~~sd~~~~l~~~~~P~l~ 265 (452) T protein:vir:94 197 -----VWELA--KT-STIQNVGVTMDYIPFFCITPSGLSMTPAKPP---MIDIVDINYSHYRTSADLEHGRHFTGLPTPW 265 (452) T ss_pred -----eeeec--cc-eeecCCCcccceeEEEEEcCCCCCCCCCccc---hHHHHHHHHHHhcchhHHHHHHHHcccceeE Confidence 11110 01 1122333457788888887777666555432 224444433 223344556677777666 Q ss_pred ecCCCccccceecCCceeecCCcCCc-hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH- Q lcl|NC_019445. 307 APTSLKNQRASLLPGDITYIDQITGQ-DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV- 384 (559) Q Consensus 307 ~p~~~~~~~~~~~pg~~~~~~~~~~~-~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei- 384 (559) +.+.-....+.+.|++.+..+..++. 
..++|- ++ .+....+.|.++++.+.++=- .+...++.+.|++|- T Consensus 266 ~~g~~~~~~i~iG~~~~~~lpe~~~~~~yie~~--g~-~i~~~~~~l~~le~~m~~~Ga-----~ll~~~~~~~~s~ea~ 337 (452) T protein:vir:94 266 ITGAESQSTMHIGSTKAWVIPEVAAKVGFLEFT--GQ-GLQSLEKALSEKQAQLASLSA-----RLIDNSTRGSEATETV 337 (452) T ss_pred eecCcCCCceEecccccccCCCCCCcceEEccC--ch-hHHHHHHHHHHHHHHHHHHHH-----HhhccCCCcchHHHHH Confidence 55443334567888887766643332 234431 22 355566778888877755431 122233333455554 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFI 463 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~ 463 (559) ..+.......|..+..++++- +++++.++.+. |. . .++++++..-.... ..+.+.+..+++.. T Consensus 338 ~~~~~~~~s~L~~~a~~~e~a-----l~~~l~~~a~w~g~-~--------~~~~v~~n~dF~~~--~~~~~~~~al~~~~ 401 (452) T protein:vir:94 338 KLRYMSETASLKSVTRAVEAL-----LNKAYSCIMDMESM-G--------GTLNIKLNSAFLDS--KLTAAELKAWVEAY 401 (452) T ss_pred HHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-C--------CceEEEeccccccc--cCCHHHHHHHHHHH Confidence 455554568888888887553 35666666663 32 1 12334333221100 00112222223211 Q ss_pred HHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhH Q lcl|NC_019445. 464 GQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSV 543 (559) Q Consensus 464 ~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 543 (559) +.+ .|....+...+-+ .||. ..++|.+.+..++. 
++++...++.+.++..+ T Consensus 402 ----~~G-----~is~~t~~~~L~~-~gvl----~~~~e~~~i~~E~~---------------~~~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 402 ----LSG-----GISKEIYIHALKV-GKVL----PPPGESMGVIPDPP---------------APEPSPSNTPPNPSSKA 452 (452) T ss_pred ----hcC-----CCcHHHHHHHHHh-CCCC----CCccCHHHHHHHhh---------------ccCcccCCCCCCCccCC Confidence 111 3555555555544 4663 23333332222210 01111111222222222 No 127 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=96.08 E-value=0.001 Score=36.95 Aligned_cols=415 Identities=8% Similarity=0.006 Sum_probs=179.9 Q ss_pred CC-h---hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCC----ccc-cc----------CCCCcchH Q lcl|NC_019445. 1 MA-E---TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNR----NDR-RN----------TRIIDSTG 61 (559) Q Consensus 1 M~-~---~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~----~~~-~~----------~~~~~s~~ 61 (559) |. + +.-.....+|+. .|......-+....-.||....-.....++ ... +. .-.|-+.- T Consensus 14 m~V~~~hp~y~a~~~~W~~---~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~ 90 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLR---NLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIV 90 (488) T ss_pred ecccccCHHHHHHhhhhhH---hhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchh Confidence 54 2 223334444543 344455555566666778653211111111 000 00 11234444 Q ss_pred HHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_019445. 62 TMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVL 141 (559) Q Consensus 62 ~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~ 141 (559) .+.++.+ ++.+|- .++ .++.++ ...++.+++.|+. 
.-.+++.-+..++.+...+|-+.++|+ T Consensus 91 ~~tl~~l----~G~vfr--k~p-~~~~~~-----~~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD 152 (488) T protein:vir:96 91 NPTMNAI----TGAVMR--REP-EFDTMD-----NPVLIGLRDNIDG------KGNGIDQECKQALNALQWGSRCGWLVR 152 (488) T ss_pred HHHHHHh----cchhhc--cCc-eeccCC-----cHHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEe Confidence 4444443 333331 111 112211 1234455555443 246778888889999999999999998 Q ss_pred ecCCc---------e--EEEEEeeccEEE-EeeCC-CC--CEEEE-EEEEeecHHHHHHhcCcccCCHHHHHHHhcCCC- Q lcl|NC_019445. 142 EDDED---------I--IRTMPFPIGSYY-LANSP-RG--SVDIC-FRKFSMTVRQLVQEFGLNNVSESVKSMWESGTY- 204 (559) Q Consensus 142 ~~~~~---------~--~~~~~~~l~~~~-v~~d~-~G--~vd~i-~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~- 204 (559) -.... + -.+..|+..+.. ++.+. +| .+.-+ +|+ +.+..+ .++. T Consensus 153 ~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE-~~~~~D-------------------~~~~~ 212 (488) T protein:vir:96 153 SHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLE-DYQERD-------------------GGTYV 212 (488) T ss_pred cCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEE-EEEecc-------------------CCCcc Confidence 64321 1 123344443322 11111 22 23322 333 222110 0011 Q ss_pred -CceEEEEEEEeecCcccccccccccccEEEEEEEec-C-CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHH Q lcl|NC_019445. 205 -EKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVG-G-DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALG 281 (559) Q Consensus 205 -~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~-~-~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~ 281 (559) ...+.++ .+.+. .| ++|.... 
+ ..+.+...+|-..+++|++.|....++.+..|.|- .+ T Consensus 213 ~~~~~~~~-~l~~g-------------~~-~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pP--Ll- 274 (488) T protein:vir:96 213 SKQRLINH-RLVDG-------------LC-EFQEVTDDEYSDEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTP--LT- 274 (488) T ss_pred cceEEEEE-EEECc-------------EE-EEEEEecCCcccceEeecCCCcccCeeEEEEEecCCCCCCCCCCc--hH- Confidence 1122222 22211 01 1222111 1 12223333455578888888888888777776542 22 Q ss_pred HHHHHHHH---HHHHHHH-HHHHhcCceeec-CCCcc-ccceecCCceeecCCcC---CchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 282 PVKALQLL---QKRKSQL-IDKATNPPMVAP-TSLKN-QRASLLPGDITYIDQIT---GQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 282 d~~~L~~l---~~~~~~~-~~~~~~p~~~~p-~~~~~-~~~~~~pg~~~~~~~~~---~~~~~~p~~~~~~~~~~~~~~i 352 (559) |+..||.- ..+-++. +..+.-|+|+.. +++.. ......++|+....... ....++ ..+..++ ....+.| T Consensus 275 dLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~g~~~-~~e~~~~-~l~~~~l 352 (488) T protein:vir:96 275 SLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTLAGRMPYYVKNGDVK-VIQAQFS-PETENKV 352 (488) T ss_pred HHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeeecccccccccCCcee-ecCCchh-HHHHHHH Confidence 44444432 1222333 344445556553 22221 22334455554443211 111111 1111111 1134567 Q ss_pred HHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhh Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVR-KNMLPPPPDAM 431 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r-~g~lp~~p~~l 431 (559) .+++.++.++=.. ++. .+ .+.||++...+.......|+.+...+++- ++++|.++.+ .|.-.. .. 
T Consensus 353 ~~l~~qm~~~Ga~----l~~-~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~a-----l~~~l~~~A~w~g~~~~---~~ 418 (488) T protein:vir:96 353 EKLFEQAVKVGAS----LFT-QQ-SNETATGAAIRSGSSTASMATLGNNVEDT-----VRNMLRFIMRYFEGTNL---YV 418 (488) T ss_pred HHHHHHHHHHhHh----hcc-CC-CcchHHHHHHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCCCCC---Cc Confidence 7777777554421 222 22 35799999999999999999998887653 3445555554 232110 00 Q ss_pred CCcceEEEee----c-H-----HHHHHHHHHHHHHHHHHHHHHHHhccChhhH-hcCCHHHHHHHHHHHcCCCc Q lcl|NC_019445. 432 EGMPLKVEYI----S-V-----MAQAQKSIGLSSLASTVNFIGQLAQAKPEAL-DKLNVDQAIDAFADMSGVSP 494 (559) Q Consensus 432 ~g~~v~~~~i----s-~-----La~a~r~~~~~~l~~~~~~~~~la~~~P~~~-~~id~d~~~~~~a~~~Gvp~ 494 (559) ...++++++. . . ++++.++.....| ..-.+...|... .++ +.+++++..+++++ -|++- T Consensus 419 ~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~I-s~~t~~~~L~~~--gvl~~d~~~e~~~~~ie~-~g~~~ 488 (488) T protein:vir:96 419 NPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNL-PQVSWFELLKRA--RVVRGDMSKEEFDEHIAE-LGFGM 488 (488) T ss_pred CccceEEEeccCCCCccCCHHHHHHHHHHHhcCCC-CHHHHHHHHHhC--CcCCccCCHHHHHHHHhh-cCCCC Confidence 1111222222 1 1 2222222222211 111122222221 222 35677777777764 23322 No 128 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=96.04 E-value=0.0011 Score=36.85 Aligned_cols=427 Identities=9% Similarity=0.058 Sum_probs=174.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCC-cccccCC---------CC--cchHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNR-NDRRNTR---------II--DSTGTMAARTL 68 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~-~~~~~~~---------~~--~s~~~~a~~~L 68 (559) |....-.+.. +. ...+.+ ...|.. ..+.....+... +.-..++ +| ...+..+|++. 
T Consensus 1 ~~~~~~a~~~-~~---~~~a~~-------~~~~~~-~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~ 68 (461) T protein:vir:80 1 MYSIDKAKQA-KI---DSKIVN-------RNDFMV-GHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDII 68 (461) T ss_pred Cccchhhhhh-hh---hhhhhh-------hhHHHh-hcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccc Confidence 7764321111 11 111111 222221 111100011000 0000111 11 11222233333 Q ss_pred HHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc-- Q lcl|NC_019445. 69 ASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED-- 146 (559) Q Consensus 69 as~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~-- 146 (559) |. -+| +.|+.+...+++ ....++. .+.+-++...+.++++.--+||.|.+++.-+..+ T Consensus 69 a~----d~~---r~g~~i~~~~~~--~~~~~~~-----------~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~ 128 (461) T protein:vir:80 69 SE----DMV---RAGWSLKTDNKE--MKKNIES-----------KWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNRE 128 (461) T ss_pred hH----Hhh---cCCeeeecCCHH--HHHHHHH-----------HHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCcc Confidence 33 333 468888654321 1112222 2333467888999999999999998887542221 Q ss_pred -eEEEEEeeccEEEEeeCCCCCE--EEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccc Q lcl|NC_019445. 147 -IIRTMPFPIGSYYLANSPRGSV--DICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTS 223 (559) Q Consensus 147 -~~~~~~~~l~~~~v~~d~~G~v--d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~ 223 (559) .....++..+. -+.+ ..+|.+..++..... .+-+++ ++. .-+.|+. ......... T Consensus 129 ~~~~~~pl~~~~-------~~~~~~l~~~~~~~i~~~~~~----~dp~sp---------~fg-~P~~y~i-~~~~~~~~~ 186 (461) T protein:vir:80 129 QADLSTAIDPKT-------IKSIPYINTFNTQKVTQLYLN----QDMFSE---------HFG-EVEFFEV-NRVSQLGEE 186 (461) T ss_pred ccCccCCccccc-------ccceeEEEeccccccchhhhc----ccCcCc---------ccc-cceEEEE-ecccccccc Confidence 11111111111 1111 122322222221111 111110 111 1111111 110000000 Q ss_pred ccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019445. 
224 KLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) Q Consensus 224 ~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p 303 (559) .+....+..... +|..+++++.-...++..||+|. .+.+++.++..+.......+.+..+..+ T Consensus 187 -----------~~~~~~~~~~~~-----iH~SRii~~~~~~~~~~~~G~S~-le~~~~~l~~~~~~~~~~~~l~~~~~~~ 249 (461) T protein:vir:80 187 -----------ILSGTTASTSEQ-----IHRSRIIHEQGLRFEGETKGRSI-FESLYDIITVMDTSLWSVGQILYDFAFK 249 (461) T ss_pred -----------ccccccCccceE-----EccccEEEecCCCCCccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 000000000111 23446666666667788899995 8889999998888888877766666555 Q ss_pred ceeecCC--Ccc-------ccce-ecC-CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhcc Q lcl|NC_019445. 304 PMVAPTS--LKN-------QRAS-LLP-GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ 372 (559) Q Consensus 304 ~~~~p~~--~~~-------~~~~-~~p-g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~ 372 (559) .+..+.- +.. ..++ ... .++..++.. +.+..+. .++..+-..+..+.+.|.-+.=-.+.-.++ T Consensus 250 v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~---e~~e~~~---~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G 323 (461) T protein:vir:80 250 VYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGD---EQLTKES---TNVSGMKDLLDYGWDYLAGAVRMPKTVLKG 323 (461) T ss_pred ceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCC---cceEEEe---cCcCCHHHHHHHHHHHHhhhhcCCeeeeec Confidence 5544321 100 0111 111 133333221 2233222 234444455666667776655333222233 Q ss_pred CCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCchhhCCcceEEEeec--HHHHHH Q lcl|NC_019445. 373 NINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK--NMLPPPPDAMEGMPLKVEYIS--VMAQAQ 448 (559) Q Consensus 373 ~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~--g~lp~~p~~l~g~~v~~~~is--~La~a~ 448 (559) +..+..=|.+ +-...+---+..++...+.|.+++.+.+|++. |..|.+... ..++++++-. .+.... 
T Consensus 324 ~s~g~~asge-------~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~--~~~~~i~f~~L~~~s~ke 394 (461) T protein:vir:80 324 QEAGTLTGAQ-------YDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPD--SFEWAIEFNPLWNLDSKT 394 (461) T ss_pred ccCCccccch-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCcc--ccceEEEeCCCCCCCHHH Confidence 3323322222 12233444455566678899999999998863 222221111 1245666643 223333 Q ss_pred HHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc-ccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 449 KSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT-VIV-PQEQVDQARQQRAQQQQQQQMMAMGMAA 526 (559) Q Consensus 449 r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-~~r-s~~ev~~~rq~r~q~~q~~~~~~~~~~~ 526 (559) |+.-..+..... ..+.+.+ .|+.+++.+.+....|++.. .+- ...|.+.+..+.-+..+.+ . T Consensus 395 kAe~~~~~a~a~---~~~~~~g-----~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e----~---- 458 (461) T protein:vir:80 395 DAEVRKLTAEAD---QIYIVNG-----VLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKK----N---- 458 (461) T ss_pred HHHHHHHHHHHH---HHHHhcC-----CCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccccccc----C---- Confidence 333333332322 2332322 48888988888877777532 221 2222222211111000000 0 Q ss_pred HHH Q lcl|NC_019445. 527 AQG 529 (559) Q Consensus 527 ~~~ 529 (559) +++ T Consensus 459 ~~g 461 (461) T protein:vir:80 459 ADG 461 (461) T ss_pred CCC Confidence 000 No 129 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=95.93 E-value=0.0012 Score=36.52 Aligned_cols=447 Identities=9% Similarity=0.025 Sum_probs=190.0 Q ss_pred CChhhHH-HHHHHHHHHHHHhhhHHHH--HHHHHHHh-ccccCCCCCCCC-----CCcccccCCCCcchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKE-RLNKQFAQLESERQSFEPH--WRELSDYI-NPRGSRFLTSEV-----NRNDRRNTRIIDSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~~~-~l~~r~~~l~~~R~~~~~~--w~e~~~~~-~P~~~~~~~~~~-----~~~~~~~~~~~~s~~~~a~~~Las~ 71 (559) ..|.... 
....+....+..++.|+.- -+....|. .|........-. ...+.+.--.-++.+..+++.+++. T Consensus 10 ~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~n 89 (505) T protein:vir:96 10 LAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNN 89 (505) T ss_pred hhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 1111100 1111122222322222210 00000010 111100000000 0000111112467899999999999 Q ss_pred HHH--hhcCCCCcceeccCCccchhhHH--HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCc- Q lcl|NC_019445. 72 MMS--GITSPARPWFRLATPDPEMMDYG--PVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDED- 146 (559) Q Consensus 72 l~~--~l~pp~~~Wf~l~~~d~~~~~~~--~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~- 146 (559) +++ +|+|..++.......++++.+.- .-+.|.+. .-.++=-+.+||.....++...++-|-+++......+. T Consensus 90 vVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~---~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~ 166 (505) T protein:vir:96 90 VIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKK---GNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNK 166 (505) T ss_pred hcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCC---cCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCC Confidence 995 89998877655544444433321 11222211 00112224579999999999999999987654433222 Q ss_pred -eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccc Q lcl|NC_019445. 147 -IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) Q Consensus 147 -~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~ 225 (559) ++.++.+....+--.. +.+....-.|...|+-+.. 
T Consensus 167 ~~~~lqliepd~l~~~~--------------------------------------n~~~~~~~~i~~GIe~d~~------ 202 (505) T protein:vir:96 167 WGYALQILECDRLDLNY--------------------------------------NADLQNGNRIRMSIELDAW------ 202 (505) T ss_pred cceEEEEechhhcCCCC--------------------------------------CcccCCcCeEEeceEECCC------ Confidence 1222222222211000 0000001124555553322 Q ss_pred ccccccEEEEEEEecCCCceee-ee---cCcccCC--eEEEEe-eecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 226 DSKNKPFKSVYYEVGGDNDKLL-RE---SGFDEFP--IMAPRW-EVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLID 298 (559) Q Consensus 226 ~~~~~~~~sv~~~~~~~~~~il-~e---sg~~~~P--~~~~rw-~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~ 298 (559) +.|.. ||+.....++... .. ..|...| -+..-| ...+|..=|.+ ..-.+|..++.|+....+.+.++. T Consensus 203 ---Gr~~a-Y~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis-~lapvl~~l~~l~~y~dael~~a~ 277 (505) T protein:vir:96 203 ---ERPVA-YHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIP-WTHASMVELHHIGEYRKSEMIAAE 277 (505) T ss_pred ---CceEE-EEEeecCCCccccccccccccccccCHhHhhhhhcccCCccccCcc-hHHHHHHHHHHHhHHHHHHHHHHH Confidence 22221 3332211111111 00 1122233 122223 34577778888 577889999999999999999999 Q ss_pred HHhcCceeecCC---C--------ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_019445. 299 KATNPPMVAPTS---L--------KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDL 367 (559) Q Consensus 299 ~~~~p~~~~p~~---~--------~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl 367 (559) .++.....+..+ + ......+.||.+.+.........+.|-. .+.++..+. 
..+...|..++=..+ T Consensus 278 i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~~~~f~---~~~lr~iaaglgi~y 353 (505) T protein:vir:96 278 LGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDH-PHTNFGAFV---KSSLRGVAAGMGPAY 353 (505) T ss_pred HhhhheeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeeeCCCC-CCCCHHHHH---HHHHHHHHhhcCCCH Confidence 988877555322 1 1123457788877664333333333322 122333222 222333433332111 Q ss_pred hhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHH Q lcl|NC_019445. 368 FMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQA 447 (559) Q Consensus 368 ~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a 447 (559) .++ ..|-..++-.-+++-..|.-+.+-..=..+..-|+.|+..+++..+...|.+|-+. .-...-++++++.|- T Consensus 354 -e~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~-~~~~~~~~~~w~~p~--- 427 (505) T protein:vir:96 354 -NRL-AHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNM-VDIDRLSQYAFQPRG--- 427 (505) T ss_pred -HHH-hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCC-ccchhhceeeeccCC--- Confidence 122 13444566666666666666666666666777899999999999999999987442 110111344444431 Q ss_pred HHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHH--------------HHHHHcCCCccccCCHHHHHHHHHHHHHH Q lcl|NC_019445. 448 QKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAID--------------AFADMSGVSPTVIVPQEQVDQARQQRAQQ 513 (559) Q Consensus 448 ~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~--------------~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~ 513 (559) ...||+-+-++ .+....|.+. +||. .|++.. T Consensus 428 --------------------------~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~------~~v~---~q~a~e 472 (505) T protein:vir:96 428 --------------------------WDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDP------EDVF---DEIAWE 472 (505) T ss_pred --------------------------ccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCH------HHHH---HHHHHH Confidence 11122211111 1222234332 2221 111111 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
514 QQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 514 ~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .+.. .. .+ +.... ......++ ........++ T Consensus 473 ~~~~---~~-------~G-l~~~~-~~~~~~~~---~~~~~~~~~~ 503 (505) T protein:vir:96 473 EQLM---RD-------KG-VNPTP-PEQESKDA---TTDEEDDSAS 503 (505) T ss_pred HHHH---HH-------cC-CCCCC-CCCCCCCC---CCCCCCCCCC Confidence 1111 00 00 00000 00000000 0000000000 No 130 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=95.70 E-value=0.0016 Score=35.91 Aligned_cols=437 Identities=10% Similarity=0.105 Sum_probs=175.7 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCC------CCCcc-----cccCCCCcchHHHHHHH Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSE------VNRND-----RRNTRIIDSTGTMAART 67 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~------~~~~~-----~~~~~~~~s~~~~a~~~ 67 (559) |-... ...+..+--.. ....++|+-+++.+--.+..+.... ...++ +...-.|-+.-.+.++. T Consensus 1 ~~~~~~~~~~V~~~hp~y----~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~ 76 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREW----LHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSG 76 (489) T ss_pred CccCCCccCCCCccCHHH----HHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHH Confidence 32110 00000000000 1223445445444322110000000 00000 01111233333444443 Q ss_pred HHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce Q lcl|NC_019445. 68 LASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI 147 (559) Q Consensus 68 Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~ 147 (559) |++.+|- ..|++.+ + +.++.+++.|+. 
.-.+++.-+..++.+...+|-+.++|+-....+ T Consensus 77 ----l~G~vfr-k~p~~~~--p-------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~ 136 (489) T protein:vir:78 77 ----MVGSVMR-KEPEINI--P-------KELEYLLKNADG------SGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGA 136 (489) T ss_pred ----Hhchhhc-CCcceec--c-------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCC Confidence 3444442 3445531 1 123444444433 245778888889999999999999998543221 Q ss_pred ------------EEEEEeeccEE---EEee-CCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Q lcl|NC_019445. 148 ------------IRTMPFPIGSY---YLAN-SPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM 211 (559) Q Consensus 148 ------------~~~~~~~l~~~---~v~~-d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~ 211 (559) -.+..|+..+. -..+ ++.+++.-+.-+++...++=...|+ .+.++.| T Consensus 137 ~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~-----------------~~~~~q~ 199 (489) T protein:vir:78 137 ATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFE-----------------TKYGEQY 199 (489) T ss_pred cCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCcc-----------------ceeEEEE Confidence 12444554443 1222 2333444333232222211111122 2345555 Q ss_pred EEEeecCcccccccccccccEE-EEEEEec-CCC----ceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHH Q lcl|NC_019445. 212 HSVYPNIDRDTSKLDSKNKPFK-SVYYEVG-GDN----DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 (559) Q Consensus 212 ~~v~p~~~~~~~~~~~~~~~~~-sv~~~~~-~~~----~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~ 285 (559) .++.+..++. |. .+|.... +.. ..+....|-..+++|++.|.-..++.+..|.|-.. |+.. 
T Consensus 200 RvL~~~~~g~----------~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl---~LA~ 266 (489) T protein:vir:78 200 RVLDIDSDGN----------YRQRLFRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLL---PLAE 266 (489) T ss_pred EEEecCCCcc----------eEEEEEEeecCCcccceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchH---HHHH Confidence 5554433221 11 1221111 110 01222344456889999998888887777654322 4444 Q ss_pred HHHH----HHHHHHHHHHHhcCceeec-CC-Cc--------cccceecCCceeecCCcCCchhhhhhhhccccHHHHHHH Q lcl|NC_019445. 286 LQLL----QKRKSQLIDKATNPPMVAP-TS-LK--------NQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVAD 351 (559) Q Consensus 286 L~~l----~~~~~~~~~~~~~p~~~~p-~~-~~--------~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 351 (559) ||.- +-..-+.+..+..|.+.+. .+ .. ...+.+.++..++.+..+....+++- + .....+. T Consensus 267 lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~----~-~~~~r~~ 341 (489) T protein:vir:78 267 LNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAG----E-NNLARQN 341 (489) T ss_pred HHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeCCcccccCCCCCCcceeccC----c-chHHHHH Confidence 4432 2222334556666755442 22 11 11123334444433322222333331 1 1223456 Q ss_pred HHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchh Q lcl|NC_019445. 352 IQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDA 430 (559) Q Consensus 352 i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~ 430 (559) |.+++.++..+=.. ++. . +.+.||++.+.+....-..|+.+...+++- ++++|.++-+- |. +. 
+.+ T Consensus 342 l~~le~qm~~lGa~----l~~-~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~a-----l~~~l~~~a~w~G~-~~-~~~ 408 (489) T protein:vir:78 342 MLDKEQQAIQIGAQ----LIT-P-TQQITAQSARIQRGADTSVMATIARNVSQA-----YTDALRWVAVMLGK-PE-DTE 408 (489) T ss_pred HHHHHHHHHHHhhh----hcc-C-CcchhHHHHHHHHHHhhHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-CC-CCc Confidence 77777776654321 222 2 346899999999999999999998887663 34455555443 32 11 111 Q ss_pred hCCcceEEEeec-HHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHH-HHHH Q lcl|NC_019445. 431 MEGMPLKVEYIS-VMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVD-QARQ 508 (559) Q Consensus 431 l~g~~v~~~~is-~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~-~~rq 508 (559) . .-.++.+|.. ++ +.+.+..++.... .+ .|..+.+.+.+-+ .||.. .+.++++ ++.. T Consensus 409 ~-~i~~n~dF~~~~~-------d~~~~~al~~~~~----~G-----~is~~t~~~~L~~-~gv~d---~~~e~~~~ei~~ 467 (489) T protein:vir:78 409 V-EFRLNMDFFLEPM-------TAQDRAAWMADIN----AG-----LLPATAYYAALRK-AGVTD---WTDADIKDAVAD 467 (489) T ss_pred e-EEEeecccCcccC-------CHHHHHHHHHHHh----cC-----CCCHHHHHHHHHh-CCCCC---ccHHHHHHHHhh Confidence 0 0012333322 22 1111222222111 11 2444444444433 34431 1222221 1110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHH Q lcl|NC_019445. 
509 QRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLS 545 (559) Q Consensus 509 ~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 545 (559) + +.+-++..-.+.+..+|..=+ T Consensus 468 -------------~--~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 468 -------------Q--PLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred -------------c--CCCcccCCcccCCCCcccccC Confidence 0 000011111111111111111 No 131 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=95.16 E-value=0.00089 Score=37.31 Aligned_cols=461 Identities=13% Similarity=0.100 Sum_probs=162.8 Q ss_pred CChhh----------------------------------HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCC Q lcl|NC_019445. 1 MAETT----------------------------------KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEV 46 (559) Q Consensus 1 M~~~~----------------------------------~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~ 46 (559) |...- ..-....+..+..-..+.... .-.+.|+.|.. |.+..- T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--f~gyql 113 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVP-TMLQDWYNSQG--FIGYQA 113 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchh-hHHHhhhcccC--CccHHH Confidence 11110 000000011111110000000 00111222111 111100 Q ss_pred CCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHH Q lcl|NC_019445. 47 NRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQL 126 (559) Q Consensus 47 ~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~ 126 (559) .. .. 
-....+..+|++.|-.+ -+.|+.+...+.+..+ +...+ +.+.+++-++...+.++ T Consensus 114 ~a--lY---~~~~l~rkiVd~pAeDa-------~R~g~~I~~~~~e~~~--~~~~~-------l~~~~~rl~v~~~l~ea 172 (765) T protein:vir:96 114 CA--II---SQHWLVDKACSMSGEDA-------ARNGWELKSDGRKLSD--EQSAL-------IARRDMEFRVKDNLVEL 172 (765) T ss_pred HH--HH---HhCchhhhhhhcchHHh-------hcCCceeecCccccCH--HHHHH-------HHHHHHHhhHHHHHHHH Confidence 00 00 01223333444444333 3579988764433222 12222 33344445778899999 Q ss_pred HHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCCCCCEEEE--EEEEeecHHHHHHhcCcccCCHHHHHHHhcCCC Q lcl|NC_019445. 127 YGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSPRGSVDIC--FRKFSMTVRQLVQEFGLNNVSESVKSMWESGTY 204 (559) Q Consensus 127 ~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~G~vd~i--~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~ 204 (559) ++..-.||.+++++.-+...+-.. .-||..-.|.. |.+..| +..+..+. .++.++..|-+++ ++ T Consensus 173 ~~~~RlyGga~i~i~i~~~D~~~l-~~PL~~~~I~k---g~~kgl~vldp~~~~~-~~v~e~~~Dp~sp---------~f 238 (765) T protein:vir:96 173 NRFKNVFGVRIALFVVESDDPDYY-EKPFNPDGIAP---GSYKGISQIDPYWAMP-QLTAESTADPSAE---------HF 238 (765) T ss_pred HHHhhhceeeEEEEEecccCcchh-hcccccccccc---ceeeEEEEechhhccc-ccchhcccccccc---------cc Confidence 999999999988765432111000 11221111111 112111 11111110 0011111111111 11 Q ss_pred CceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHH Q lcl|NC_019445. 205 EKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVK 284 (559) Q Consensus 205 ~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~ 284 (559) .+ .+.+. |. + ..+|. .+++.-.|+. .|+ +.+....-||+|. 
.+.++..++ T Consensus 239 g~-P~~y~-i~-------g-----------~~IH~----SRli~~~g~~-lpd----~lk~~~~~~G~Sv-lq~~yd~I~ 288 (765) T protein:vir:96 239 YE-PDFWI-IS-------G-----------KKYHR----SHLVVVRGPQ-PPD----ILKPTYIFGGIPL-TQRIYERVY 288 (765) T ss_pred Cc-ceeee-ec-------C-----------ceecc----ceEEEecCCC-chh----hhccccCccCccH-HHHHHHHHH Confidence 11 11111 10 0 01111 1233322222 233 4455555679995 788888888 Q ss_pred HHHHHHHHHHHHHHHHhcCceeecCC--Ccc-----ccce----ec-CCceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 285 ALQLLQKRKSQLIDKATNPPMVAPTS--LKN-----QRAS----LL-PGDITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 285 ~L~~l~~~~~~~~~~~~~p~~~~p~~--~~~-----~~~~----~~-pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) ..+.......+.+..+.-..+.+... +.. .++. .. -.++..++. .+.+..+. .++..+-..+ T Consensus 289 ~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~---ee~~e~~s---~~lsgl~d~l 362 (765) T protein:vir:96 289 AAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGI---DETMEQFD---TNLSDFDSVI 362 (765) T ss_pred HHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecC---CcceeEEe---cccCCHHHHH Confidence 88887777777666655555544211 100 0111 11 113333332 12233332 2344444555 Q ss_pred HHHHHHHHHHhhcchhhhccCC-CCCCcCHH-HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNI-NTRSMPVE-AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDA 430 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~-~~~~~TA~-Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~ 430 (559) ....+.|.-+.=-.+--.+++. .+..=|.. +++--. 
--+..++...+.|+|++.+.++.+.+.+|+ T Consensus 363 ~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYy--------D~I~s~Qe~~l~p~le~L~~li~~s~~i~~---- 430 (765) T protein:vir:96 363 MNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYH--------EELESIQEHIFDPLLERHYLLLAKSESIDV---- 430 (765) T ss_pred HHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---- Confidence 5666666655422111112222 22222333 332222 233445666799999999999999876542 Q ss_pred hCCcceEEEeecH--HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc-ccc---------C Q lcl|NC_019445. 431 MEGMPLKVEYISV--MAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP-TVI---------V 498 (559) Q Consensus 431 l~g~~v~~~~is~--La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~-~~~---------r 498 (559) ++.+++-.- +....|+.-..+..+.. ..+.+.+ .|+.+++.+.+...-..+- .+- - T Consensus 431 ----d~~i~FnpL~~~sekEkAei~~k~Aea~---~~~~~~G-----vis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~ 498 (765) T protein:vir:96 431 ----QLEIVWNPVDSTTSQQQAELNNKKAATD---EIYINSG-----VVSPDEVRERLRDDPRSGYNRLTDDQAETEPGM 498 (765) T ss_pred ----cceEEeCCCCCCCHHHHHHHHHHHHHHH---HHHHhcC-----CCCHHHHHHHHhccccCCCCCCCccccccccCC Confidence 356665432 33344433333332333 2332222 4777777776653211110 010 1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 499 PQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 499 s~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) ++++.+++...-......-....+..+.++.+...++.....+....-....+..++|-+. 
T Consensus 499 ~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~ 559 (765) T protein:vir:96 499 SPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAA 559 (765) T ss_pred CccccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCcccc Confidence 1111111110000000000000000000000000000000000000000111111111111 No 132 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=95.06 E-value=0.0029 Score=34.54 Aligned_cols=401 Identities=12% Similarity=0.079 Sum_probs=164.6 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |+...-+-+ ..+...-.++.+.|- +++-|........... .....-.-.++--.|++.+|+.+. + T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~~~~---~~~g~~~~~~~~~~~~--~~~~~a~~~~~v~~~v~~ia~~iA------~ 65 (460) T protein:vir:10 1 MANRIIRAL----RELTGLDNKFNDAFI---KYIGQTFTKYDNNGKT--YLEQGYNINPDVYSCISQMAAKTV------A 65 (460) T ss_pred CchhHHHHH----hhhhccCCCchHHHH---HhhccccCCCccchhh--hhHHHHhcchHHHHHHHHHHHhhh------h Confidence 887653322 222333334445564 4554443221111100 011112334556677777777653 4 Q ss_pred CcceeccCCcc-chhhH-------HHHH-----------HHHHHHHHHHHHHHHhc----cchHHHHHHHHHHHhhCcEE Q lcl|NC_019445. 81 RPWFRLATPDP-EMMDY-------GPVK-----------LWLEAVQNRMNDMFNKS----NLYQSLPQLYGSLGTYSTGA 137 (559) Q Consensus 81 ~~Wf~l~~~d~-~~~~~-------~~v~-----------~~l~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~ 137 (559) -||.-...... ...+. .... ..+...+......+++= +.+.-+..++.++.++|||. 
T Consensus 66 lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay 145 (460) T protein:vir:10 66 VPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCY 145 (460) T ss_pred CceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeE Confidence 45543322111 00000 0000 11112222233333332 33444566778999999999 Q ss_pred EEEeecC-----CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE Q lcl|NC_019445. 138 MAVLEDD-----EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH 212 (559) Q Consensus 138 l~v~~~~-----~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~ 212 (559) +++..+. +.+..+.+++.+.+-+..+.+|.+-.. + +.+.. T Consensus 146 ~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~--~---------------------------------~~~~~ 190 (460) T protein:vir:10 146 FYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLST--D---------------------------------SPIKS 190 (460) T ss_pred EEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeee--e---------------------------------eeeeE Confidence 9987642 334456667777777776665533110 0 00000 Q ss_pred EEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeee-----cCCCcccccchHHHHHHHHHHHH Q lcl|NC_019445. 213 SVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEV-----NGEDVYGSSCPGMLALGPVKALQ 287 (559) Q Consensus 213 ~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~-----~~g~~YGrG~P~~~~l~d~~~L~ 287 (559) ..+ ..++. .+. |...=.+++|+.. ..+..||.| |...+...+.... T Consensus 191 ~~~----------------------~~~g~-~~~-----~~~~evih~r~~~~~~~~~~~~~~G~s-p~~~~~~~i~~~~ 241 (460) T protein:vir:10 191 YML----------------------IQGDQ-FIE-----FNEDEVIHTKYANPNFDLQGSHLYGMS-PIRAILRNINSQN 241 (460) T ss_pred EEE----------------------ecCce-eEE-----ecccceEEEecCCCCcccccCcccccc-HHHHHHHHHHHHH Confidence 000 00000 000 0001134444332 234579999 8988877777777 Q ss_pred HHHHHHHHHHHHHhcCceeecCC--Ccccc-------ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 
288 LLQKRKSQLIDKATNPPMVAPTS--LKNQR-------AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 288 ~l~~~~~~~~~~~~~p~~~~p~~--~~~~~-------~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) ...+.......-...|.+++..+ +.... +. . ..|++...+ ++-.++|+.. ++....+.+.. T Consensus 242 ~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~---~g~~~~~l~~-~~~d~q~~e~~ 317 (460) T protein:vir:10 242 STIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGAS---GEIAFTKISL-NTDELKPFDYL 317 (460) T ss_pred HHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecC---CCceEEEccC-ChhHHHHHHHH Confidence 77777777666666665554322 21110 10 0 123333332 2234555532 33344455666 Q ss_pred HHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM 431 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l 431 (559) +..+..|-++|-.... +++..++...|-.-+.+.... ....|.|...+++.||-.-| +|+. +. T Consensus 318 ~~~~~~Ia~~fgVPp~-~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl-------------~~~~--~~ 381 (460) T protein:vir:10 318 KYDQKAICNALGWSDK-LLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKF-------------IKRF--KG 381 (460) T ss_pred HHHHHHHHHHhCCCHH-HhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------------cCcc--cc Confidence 7788899999976433 343333322222222222222 22345555555555543222 3321 11 Q ss_pred -CCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc------c-ccCCH--H Q lcl|NC_019445. 432 -EGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP------T-VIVPQ--E 501 (559) Q Consensus 432 -~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~------~-~~rs~--~ 501 (559) .+.-|++.+ +.+..++ .+...+..+++ . + -+.+++ +-+.+|.|+ + ++.+- . 
T Consensus 382 ~~~~~i~~d~-~~l~~l~--~d~~~~~~~~~---~--g-------~~T~NE----~R~~~g~~pi~~~~gD~~~~~~n~~ 442 (460) T protein:vir:10 382 YENAVIEWDI-SELPEMQ--TDMVAMASWLN---T--I-------PVTPNE----IRIAMKYETLNQDGMDIVFMPSNKV 442 (460) T ss_pred cCCceEEeec-chhhhHH--HHHHHHHHHHh---C--C-------CCCHHH----HHHHhCCCCCCCCCCCeeeeccccc Confidence 122233433 3443222 12222222221 0 1 123222 333445542 1 11110 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCCh Q lcl|NC_019445. 502 QVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDP 541 (559) Q Consensus 502 ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~ 541 (559) .++..-++.... . ..+++ T Consensus 443 ~~~~~~~~~~~~--------------------~--~nq~~ 460 (460) T protein:vir:10 443 RIDDVSNNLIDS--------------------A--FNQNQ 460 (460) T ss_pred chhhcccccCCC--------------------c--ccCCC Confidence 001000000000 0 00000 No 133 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=94.78 E-value=0.0035 Score=34.04 Aligned_cols=417 Identities=12% Similarity=0.167 Sum_probs=165.5 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC- Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP- 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp- 79 (559) |.-. +.+..-...+-..|. .+-|. .+.......++=......+-|+.++... 
|+ T Consensus 1 ~~~~--D~~~~~~~~~g~~~~--------~~~~~-------------~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~ 55 (437) T protein:vir:52 1 MKFF--DGIKSLALKLGSKQE--------QTYYS-------------PSLSLTDDLVQLEALWRDNWIANKVCIK--RPE 55 (437) T ss_pred Cchh--hhhHhHHhcCCCccc--------cceee-------------cCccccccHHHHHHHHHhCchhhHHhhc--chH Confidence 1110 001110000111000 00000 0000000000101122222333333332 22 Q ss_pred --CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE Q lcl|NC_019445. 80 --ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS 157 (559) Q Consensus 80 --~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~ 157 (559) -+.|+.+...|.+...... +.+.+++-++...+.++++.--.||.|++++..|.... .-|+. T Consensus 56 d~~r~~~~i~~~d~~~~~~~~-----------~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~----~~pl~- 119 (437) T protein:vir:52 56 DMVRNWREIYSNDLNSKQLDL-----------FTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNT----SAPLK- 119 (437) T ss_pred HhhcCCceEecCCCCHHHHHH-----------HHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCc----ccccc- Confidence 3689998764433222222 33344455788999999998889999999987765431 12221 Q ss_pred EEEeeCCCCCEEEE--EEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 158 YYLANSPRGSVDIC--FRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 158 ~~v~~d~~G~vd~i--~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) ..|.+..+ +-++.+++... .-.+- .+.++. ..+.|.. ... .. .+ T Consensus 120 ------~~~~~~~~~v~~~~~v~~~~~---~~~dp---------~s~~fg-~p~~y~v-~~~-----------~~---~~ 165 (437) T protein:vir:52 120 ------PTERLKRLIILPKWKISPTGT---KDDDV---------LSPNFG-RYSEYSI-LGG-----------SQ---SI 165 (437) T ss_pred ------cCCceeEEEEechhhcccccc---ccccc---------cccccC-cceEEEE-ecC-----------Cc---ce Confidence 12333221 11111111000 00000 000111 1122211 100 00 01 Q ss_pred EEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC--C-Cc Q lcl|NC_019445. 
236 YYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPT--S-LK 312 (559) Q Consensus 236 ~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~--~-~~ 312 (559) .+|. .+++.-.|. ..| ......||+| +.+.++..++..+.......+.+..+..+.+..++ + +. T Consensus 166 ~iH~----SRii~~~~~-~~~-------~~~~~~~G~s-~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~ 232 (437) T protein:vir:52 166 TVHH----SRLIILNAN-DAP-------LSDNDIWGVS-DLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIA 232 (437) T ss_pred eEcc----ceeEEecCc-cCC-------CccccccCCc-hHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhc Confidence 1111 122221111 122 2336678999 68889999999888888887777766655555432 1 10 Q ss_pred c-------ccce----ecC-CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcC Q lcl|NC_019445. 313 N-------QRAS----LLP-GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMP 380 (559) Q Consensus 313 ~-------~~~~----~~p-g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~T 380 (559) . .... ... +++..++. .+.+..+. .++..+-+.+....+.|..++=-.+-...++. +... T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~---~~~~e~~~---~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s-~~Gl- 304 (437) T protein:vir:52 233 AGMENEVASVISAVQEIKSATNSLLLDA---ENEYDRKE---LTFTGLKDLLTEFRNAVAGAADMPVTILFGQS-VSGL- 304 (437) T ss_pred CCcHHHHHHHHHHHHHhcCCCceEEEcC---CcceEEEe---cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcC-cccc- Confidence 0 0111 111 23333332 12233332 13344445566777788777744332333333 2223 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 381 VEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTV 460 (559) Q Consensus 381 A~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~ 460 (559) |+--.. .+.+---+..++...+.|++++.+.++.+....+ +|++ +++++. ||.+......++...... 
T Consensus 305 asge~D-----~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~-~~~~-----~~~~f~-pL~~~s~kekae~~~~~a 372 (437) T protein:vir:52 305 ASGDED-----IQNYHEAIRRLQETRLRPIFEIIDPLICNELFGG-LPAD-----WWFEFV-PLTTVKQEQQINMLNTFA 372 (437) T ss_pred cccHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCc-----ceEEeC-CcCCcCHHHHHHHHHHHH Confidence 321111 1222223444566679999999999988764333 3333 666665 444333333333222333 Q ss_pred HHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 461 NFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 461 ~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) +.+..+.+.+ .++++++.+.+.+. |+=..+ ++++++...- .....++..+... T Consensus 373 ~a~~~~~~~g-----~i~~~e~r~~L~~~-g~~~~i--~~~~~~~~~~-----------------~~~~~~~~~~~~~-- 425 (437) T protein:vir:52 373 TAANTLIQNG-----VLNEYQIANELRES-GLFANI--SAEHIEELKN-----------------ADEFAGNFEEPEK-- 425 (437) T ss_pred HHHHHHHhcC-----CCCHHHHHHHHHhc-CCCCCC--CccccccccC-----------------CCCCCCccCCCCC-- Confidence 3333333332 47777777776553 321111 1111111100 0000000000000 Q ss_pred hhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 541 PSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 541 ~~~~~~~~~~~~~~~~~~~ 559 (559) .. +...+.+..| T Consensus 426 ------~~-~~~~~~~~~~ 437 (437) T protein:vir:52 426 ------ME-GAQVQNSEDQ 437 (437) T ss_pred ------CC-CCCCCCCCCC Confidence 00 0011111111 No 134 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=94.51 E-value=0.0042 Score=33.61 Aligned_cols=355 Identities=10% Similarity=0.020 Sum_probs=144.4 Q ss_pred HHHHHHHHHHhhhH---HHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceec Q lcl|NC_019445. 
10 NKQFAQLESERQSF---EPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRL 86 (559) Q Consensus 10 ~~r~~~l~~~R~~~---~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l 86 (559) -+.|+.++..+.+. ...|-+......-.... .+...+. ...+=.++--.|++.+|+.+. . +| +. + T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~----~~al~~~~v~~~i~~ia~~ia-~-~p----~~-~ 68 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLN-SSEWVSA----ENALKNSDLFSIISQLSNDLA-T-AK----IT-T 68 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhcccccccc-CCceech----hhhhccHHHHHHHHHHHHHhh-h-Cc----ee-e Confidence 23344554444432 22232222211100000 0000010 011112333345555555333 2 22 21 1 Q ss_pred cCCccchhhHHHHHHHHHHHHHHHHHHHHhc----cchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEe Q lcl|NC_019445. 87 ATPDPEMMDYGPVKLWLEAVQNRMNDMFNKS----NLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLA 161 (559) Q Consensus 87 ~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~ 161 (559) .+... + ..+.+- +.+.-+..++.++..+|||.+++..+. +.++.+.+++.+.+-+. T Consensus 69 --~~~~~------~-----------~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~ 129 (386) T protein:vir:49 69 --SRKQL------Q-----------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFN 129 (386) T ss_pred --ccchh------h-----------hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEE Confidence 11110 0 112222 234445667788999999999987643 45566777777776666 Q ss_pred eCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecC Q lcl|NC_019445. 162 NSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGG 241 (559) Q Consensus 162 ~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~ 241 (559) .+.+|... +.+++.+ ....... 
T Consensus 130 ~~~~~~~~--~y~~~~~----------------------~~~~~~~---------------------------------- 151 (386) T protein:vir:49 130 RLDNQNGL--YYNITFD----------------------DPHIAPK---------------------------------- 151 (386) T ss_pred EcCCCceE--EEEEEEc----------------------Cccccce---------------------------------- Confidence 65544321 1111100 0000000 Q ss_pred CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCccc----- Q lcl|NC_019445. 242 DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQ----- 314 (559) Q Consensus 242 ~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~----- 314 (559) +.+ ...=+++.|+...++..||.| |...+...+.......+.......-...|..++ +..+... T Consensus 152 ---~~~-----~~~evih~~~~~~~~~~~G~s-~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~ 222 (386) T protein:vir:49 152 ---QHV-----PQNDILHFRLLSVDGGLTSVS-PLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKV 222 (386) T ss_pred ---eEE-----ccccEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHH Confidence 000 011145556656667899999 898888888888888888888888888887544 4443221 Q ss_pred -----cceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHH Q lcl|NC_019445. 315 -----RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKE 389 (559) Q Consensus 315 -----~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~ 389 (559) ...-..|++.+.+ ++..++|+.. ++....+.+..+..+..|-++|-....... ......-+++.+.. T Consensus 223 ~~~~~~~~~n~g~~~vl~---~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~~~~--- 294 (386) T protein:vir:49 223 SRSRQAMKQMQGGPLVLD---DLEDFTPLEI-KSNVAQLLSQADWTTGQFAKVYGIPESIVG-GDGDQQSSLEMIYN--- 294 (386) T ss_pred HHHHHHhccCCCCceecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCccchHHHHHH--- Confidence 1122345544442 2234566542 334444556667788999999987644432 22222223322221 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHH-HHH--HHhcCCCCCCchhh---CCcceEEEeecHHHHHHHHHHH----HHHHHH Q lcl|NC_019445. 
390 EKLLMLGPVLERLNDECLNPLIDRA-FSM--MVRKNMLPPPPDAM---EGMPLKVEYISVMAQAQKSIGL----SSLAST 459 (559) Q Consensus 390 e~~~~LG~v~~~l~~E~l~Pli~r~-~~i--l~r~g~lp~~p~~l---~g~~v~~~~is~La~a~r~~~~----~~l~~~ 459 (559) -....+-|.+..+..++-.-|..++ |.+ +.+ ++ +... ...-++--+.|+-......... ..+... T Consensus 295 ~~~~~i~~~l~~i~~~~~~~l~~~~~~~~~~~~~----~d-~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~ 369 (386) T protein:vir:49 295 IYFKSVSRYLRPFVSEMSKKLSCEVDVDISPAVD----PT-GSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDG 369 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhcccchhhhc----cC-HHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcch Confidence 2223344444444443322111110 000 000 00 0000 0000111122222211110000 001111 Q ss_pred HHHH-HHHhccChhhHhcCC Q lcl|NC_019445. 460 VNFI-GQLAQAKPEALDKLN 478 (559) Q Consensus 460 ~~~~-~~la~~~P~~~~~id 478 (559) .... ..+..-+.+ .=| T Consensus 370 ~~~~~~~~~gGd~~---~~~ 386 (386) T protein:vir:49 370 KNPNRTSLKGGEIN---EQD 386 (386) T ss_pred hccCCCCCCCCCCC---CCC Confidence 1100 011111111 111 No 135 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=369 Identities=13% Similarity=0.119 Sum_probs=154.6 Q ss_pred CChhhHHHHHHHHHHHHH---HhhhHHHHHHHHHHHhccccCCCCCCCCCCccc--ccCCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLES---ERQSFEPHWRELSDYINPRGSRFLTSEVNRNDR--RNTRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~---~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~~~~~s~~~~a~~~Las~l~~~ 75 (559) |.|.. .-..|+++++ .++++..-+..-..-+-+....+....+..+.. ..+-+=.++--.|++.+|+.+.+. 
T Consensus 1 ~~~~~---~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~l 77 (432) T protein:vir:81 1 MPDEK---KLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAM 77 (432) T ss_pred CCchh---hcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhC Confidence 87744 3344444432 222211111000000000000000000000100 001112244445666666655433 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) ||.-..-.+....+ ..++-+...|+ +-| -+.-...++.++..+|||.+++..+.+++..+ T Consensus 78 ------p~~~y~~~~~g~~~---------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~~~~L 142 (432) T protein:vir:81 78 ------PLTMYMRTPDGRKE---------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESL 142 (432) T ss_pred ------ceeeEEecCCccee---------cccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEE Confidence 44212111111000 01222333443 222 23345567788899999998887776666667 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) .+++...+-+..|.+|++- |+... .+ ...+++ + T Consensus 143 ~~l~~~~v~v~~~~~g~~~--y~~~~-------------------------~~-g~~~~~-----~-------------- 175 (432) T protein:vir:81 143 QYLANDRLTITTDPKGNTA--YRYRR-------------------------TD-GQMIDI-----P-------------- 175 (432) T ss_pred EEEcCCceEEEECCCCcEE--EEEEe-------------------------cC-ceEEEE-----c-------------- Confidence 6777777777777666432 22100 00 000100 0 Q ss_pred cEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ec Q lcl|NC_019445. 
231 PFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--AP 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p 308 (559) ..+ ++++|....+| .||.| |...+...+......++.......-...|..+ ++ T Consensus 176 -----------~~~------------iih~r~~~~dg-~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~ 230 (432) T protein:vir:81 176 -----------KQQ------------IWKIMGYSLDG-ENGLS-AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID 230 (432) T ss_pred -----------ccc------------EEEecCCCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC Confidence 000 23334444455 79999 89877777776666666666666556667544 34 Q ss_pred CCCcccc-------ce--ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC-CC Q lcl|NC_019445. 309 TSLKNQR-------AS--LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINT-RS 378 (559) Q Consensus 309 ~~~~~~~-------~~--~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~-~~ 378 (559) ..+.... +. ...|++...+ ++..++|+.. ++.-..+.+..+..+..|-++|-.... +++..+. .. T Consensus 231 ~~l~~e~~~~~~~~~~~~~nag~~~vl~---~g~~~~~l~~-~~~d~q~le~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~ 305 (432) T protein:vir:81 231 RFLTDDQYDSFAKKVSGSVEAGRAPLLE---GGMDVKSLGL-NPVDAQLLQSRQYSVESICRFFGVPPS-MIGHSSAGTT 305 (432) T ss_pred CCCCHHHHHHHHHHHhhhhcCCCceecC---CCceEEEccC-CHHHHHHHHHHHHHHHHHHHHhCCCHH-HcCCcCCccc Confidence 4332211 11 1234444442 2223555532 333344456667778889999977543 3333222 22 Q ss_pred cCHHHHHHHHHHH-HHHhhhHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCCC-------- Q lcl|NC_019445. 379 MPVEAVIEMKEEK-LLMLGPVLERLNDECLNPLIDR-------------------------AFSMMVRKNML-------- 424 (559) Q Consensus 379 ~TA~Ei~~r~~e~-~~~LG~v~~~l~~E~l~Pli~r-------------------------~~~il~r~g~l-------- 424 (559) -|..-+.+..... 
...|.|.+.+++.|+-.-|+.+ .+..+.+.|.+ T Consensus 306 ~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~ 385 (432) T protein:vir:81 306 SWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREI 385 (432) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 2333343333332 3468888888877775544321 22233444432 Q ss_pred ---CCCchhhCCcceEEEeec---HHHHHHHHHHHHHHHHHHHHHHHHhccChh-hHhcCCHHHHHH Q lcl|NC_019445. 425 ---PPPPDAMEGMPLKVEYIS---VMAQAQKSIGLSSLASTVNFIGQLAQAKPE-ALDKLNVDQAID 484 (559) Q Consensus 425 ---p~~p~~l~g~~v~~~~is---~La~a~r~~~~~~l~~~~~~~~~la~~~P~-~~~~id~d~~~~ 484 (559) ||+| |.+..+...+ |+..+ +.-..-.|. -..+=+-++.-+ T Consensus 386 ~glpp~~----g~~~~~~~~~~~~pl~~~----------------~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 386 EGLPKLG----GNAAVLTVQSAMVPLDSI----------------GLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred hCCCCCC----CCcceEeecCcccchhhh----------------ccCCCCCCCCCCCCcccccccC Confidence 2222 2111111111 12111 110000000 010111111111 No 136 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=94.42 E-value=0.0045 Score=33.46 Aligned_cols=441 Identities=10% Similarity=0.105 Sum_probs=175.0 Q ss_pred CChhh--HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCC------CCcc-----cccCCCCcchHHHHHHH Q lcl|NC_019445. 1 MAETT--KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEV------NRND-----RRNTRIIDSTGTMAART 67 (559) Q Consensus 1 M~~~~--~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~------~~~~-----~~~~~~~~s~~~~a~~~ 67 (559) |-... ...+..+--.. ....++|+-+++.+--.+..+.+... ..++ +...-.|-+.-.+.++. 
T Consensus 1 ~~~~~~~~~~V~~~hp~y----~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~ 76 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREW----LHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSG 76 (491) T ss_pred CcccCCccCCCCccCHHH----HHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHH Confidence 32110 00000000000 12234444444443211000000000 0000 11112344444444444 Q ss_pred HHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCce Q lcl|NC_019445. 68 LASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDI 147 (559) Q Consensus 68 Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~ 147 (559) ++ +.+|- ..|.+. .+ +.++.+++.|+. .-.+++.-+..++.+...+|-+.++|+-....+ T Consensus 77 l~----G~vfr-k~p~~~--~p-------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~ 136 (491) T protein:vir:95 77 MV----GSVMR-KEPEIN--IP-------KELEYLLKNADG------SGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAA 136 (491) T ss_pred Hh----chhhc-CCceee--cc-------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcc Confidence 43 33332 223332 11 123444444432 245778888889999999999999998532211 Q ss_pred ------------EEEEEeeccEEE---E-eeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Q lcl|NC_019445. 148 ------------IRTMPFPIGSYY---L-ANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM 211 (559) Q Consensus 148 ------------~~~~~~~l~~~~---v-~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~ 211 (559) -.+..|+..+.. . ..++.+++.-+.-+++...++-...|+ .+.++.| T Consensus 137 ~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~-----------------~~~~~qy 199 (491) T protein:vir:95 137 ATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFE-----------------TKYGEQY 199 (491) T ss_pred cCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcc-----------------cceEEEE Confidence 124445544431 1 123444454443333332222112222 2345555 Q ss_pred EEEeecCcccccccccccccEEEEEEEe-cCCC----ceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHH Q lcl|NC_019445. 
212 HSVYPNIDRDTSKLDSKNKPFKSVYYEV-GGDN----DKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKAL 286 (559) Q Consensus 212 ~~v~p~~~~~~~~~~~~~~~~~sv~~~~-~~~~----~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L 286 (559) .++.+..++. +-..+|-.. .+.. ..+....|-..+++|++.|.-..++.+..|.|- .+ |+..| T Consensus 200 RvL~l~~~g~---------~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pP--Ll-~LA~l 267 (491) T protein:vir:95 200 RVLDIDTDGN---------YRQRLFRFDAEGGAQEEVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAP--LL-PLAEL 267 (491) T ss_pred EEEeecCCCc---------eEEEEEEEcCCCcceeeeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCc--hH-HHHHH Confidence 5555432221 001122111 1110 112223444568889988887777777766543 22 44444 Q ss_pred HHH---HHH-HHHHHHHHhcCceee-cCC-Ccc--------ccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 287 QLL---QKR-KSQLIDKATNPPMVA-PTS-LKN--------QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 287 ~~l---~~~-~~~~~~~~~~p~~~~-p~~-~~~--------~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) |.- ..+ .-+.+..+..|.+.+ +.+ ... ..+.+.++.....+..+....+++- ++ ....+.| T Consensus 268 ni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~--~~---~~~~~~l 342 (491) T protein:vir:95 268 NIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAG--EN---NLARQNM 342 (491) T ss_pred HHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCcCCCCCCccceeecC--cc---hHHHHHH Confidence 432 122 233455666665544 322 211 1123333333333322222333331 11 1234567 Q ss_pred HHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhh Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRK-NMLPPPPDAM 431 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~-g~lp~~p~~l 431 (559) .+++.+...+=.. ++ ..+ .+.||++...+.......|+.+...+++- + +++|.++-+. |. + .+.+. 
T Consensus 343 ~~~e~qm~~~Ga~----l~-~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~a-l----~~~l~~~a~w~G~-~-~~~~v 409 (491) T protein:vir:95 343 LDKEQQAIQIGAQ----LI-TPS-QQITAESARIQRGADTSVMATIARNVSQA-Y----TDALRWVAMMLGK-P-EDSEV 409 (491) T ss_pred HHHHHHHHHHHHH----hc-cCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHH-H----HHHHHHHHHHcCC-C-CCCce Confidence 7777666554321 22 222 36899999999999999999998888663 3 3344444442 32 1 11110 Q ss_pred CCcceEEEee-cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHH Q lcl|NC_019445. 432 EGMPLKVEYI-SVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQR 510 (559) Q Consensus 432 ~g~~v~~~~i-s~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r 510 (559) .-.++.+|. .+++ .++ +..++.... . ..|....+...+ ...||+.. ..+++.+++..+- T Consensus 410 -~i~~n~dF~~~~~~----~~~---~~all~~~~----~-----G~is~~t~~~~L-~~~~vl~~--~~e~~~~~ie~~~ 469 (491) T protein:vir:95 410 -EFQLNMDFFLQPMT----AQD---RAAWMADIN----A-----GLLPATAYYAAL-RKAGVTDW--TDEDILNAIEDAP 469 (491) T ss_pred -EEEeecccccccCC----HHH---HHHHHHHHh----c-----CCCCHHHHHHHH-HhCCCCCc--cHHHHHHHHHhcC Confidence 001233332 2221 112 222222111 1 123333333333 33455421 1122222221110 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 511 AQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 511 ~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~ 542 (559) .. 
.+..+..|+.+..+..+..+ T Consensus 470 ---------~~-~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 470 ---------LP-SGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred ---------CC-CCccccccccchhhhhhccC Confidence 00 00011122222111111111 No 137 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=93.40 E-value=0.0047 Score=33.34 Aligned_cols=493 Identities=10% Similarity=0.014 Sum_probs=171.4 Q ss_pred CChhhHHHHHHHHHHH----------HH--HhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQL----------ES--ERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTL 68 (559) Q Consensus 1 M~~~~~~~l~~r~~~l----------~~--~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~L 68 (559) |.|.. ++.+.+-=.. ++ -++--+.+|..-...+-|-. ++...+.=.-..++...|++.+ T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~--------~~~~L~~~~e~~~~~~~~i~~~ 71 (651) T protein:vir:99 1 MTDTT-GETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPY--------NPDRLAAFLELNETLATGIRKK 71 (651) T ss_pred CCCcc-ceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCC--------CHHHHHHHHhcChHHHHHHHHH Confidence 77653 2222110000 00 01111222221111111110 0111111011245667778888 Q ss_pred HHHHHHhhcCCCCcc-----eeccCCccchhhHHHHHHHHHHHHHHHHHHHHhcc----chHHHHHHHHHHHhhCcEEEE Q lcl|NC_019445. 
69 ASGMMSGITSPARPW-----FRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSN----LYQSLPQLYGSLGTYSTGAMA 139 (559) Q Consensus 69 as~l~~~l~pp~~~W-----f~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~ 139 (559) +..+.+ -.| +.+...+....+...+..++..+....+......| +..-+...+.|+.++||+|+= T Consensus 72 ~~~iag------~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ie 145 (651) T protein:vir:99 72 SRYEVG------FGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALE 145 (651) T ss_pred hhhhhc------cCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhh Confidence 876633 223 11222233333444454544444444443333333 334556677899999999884 Q ss_pred Ee-ecCCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHH---HHhcC--------CCCce Q lcl|NC_019445. 140 VL-EDDEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKS---MWESG--------TYEKW 207 (559) Q Consensus 140 v~-~~~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~---~~~~~--------~~~~~ 207 (559) +- ...++++....+|+..+.+..+........++-+...+. ....+...+. .+..+ +.... T Consensus 146 iIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn-------~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~ 218 (651) T protein:vir:99 146 MLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVD-------GDVADIASRGYVQIRNGNRRYFGEAGDRYRG 218 (651) T ss_pred hhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccc-------cccchhHHHHHHHHHhcCcceEEEeeccccc Confidence 43 334455666667777766655433221111111000000 0000000000 00000 00000 Q ss_pred EEEEE--------EEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCC---eEEEEeeecCCCcccccchH Q lcl|NC_019445. 208 IEVMH--------SVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFP---IMAPRWEVNGEDVYGSSCPG 276 (559) Q Consensus 208 v~v~~--------~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P---~~~~rw~~~~g~~YGrG~P~ 276 (559) +..++ .+++.+..........+.-+. .|.... ..+...+| .|++|.....+..||.| |. 
T Consensus 219 ~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g--~~~~~~-------~~~~~~~~~~eViHir~~~~~~g~~G~s-pl 288 (651) T protein:vir:99 219 QEVVIDESGDEPTIRYREDEESEREPIFVDRETG--DVTTGD-------ANGLENRPANELIFIPNPSILEDDYGVP-DW 288 (651) T ss_pred eeeeeccCCcceeEEeccCcceeeeeecccceee--eEEEcC-------CCceeEecccceEEecCCCCCCCccccc-HH Confidence 00000 000000000000000000000 000000 01111222 56667665556689999 89 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCC-Ccccc----------ceecCCceeecCCcC--------Cchhh Q lcl|NC_019445. 277 MLALGPVKALQLLQKRKSQLIDKATNPPMV--APTS-LKNQR----------ASLLPGDITYIDQIT--------GQDGF 335 (559) Q Consensus 277 ~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~-~~~~~----------~~~~pg~~~~~~~~~--------~~~~~ 335 (559) +.++..+.....+++.......-...|..+ +|+. +.... ..-.+|++.+....+ .+-.+ T Consensus 289 ~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~ 368 (651) T protein:vir:99 289 VSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIEL 368 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceE Confidence 988888888888888777777767777655 4543 22111 111234333332211 12235 Q ss_pred hhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 336 RPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAF 415 (559) Q Consensus 336 ~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~ 415 (559) +|+......-..+.+..+.....|-++|-..........++..-|+++... 
.+....|.|++.+.- T Consensus 369 ~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~--------------~f~~~tL~P~~~~ie 434 (651) T protein:vir:99 369 EPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDK--------------DFALEVIQPEQHTFA 434 (651) T ss_pred EEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHH Confidence 555432222234455667778889999988644332222222334433322 123334555555544 Q ss_pred HHHHhcCCCCCCchhhCCcceEEEee-cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc Q lcl|NC_019445. 416 SMMVRKNMLPPPPDAMEGMPLKVEYI-SVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP 494 (559) Q Consensus 416 ~il~r~g~lp~~p~~l~g~~v~~~~i-s~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~ 494 (559) ..|.+. ++++- ....+-.+.+++. +.+-+. +......+++.+-+..-+-| ++ +-+.+|.|+ T Consensus 435 ~eln~k-Ll~~~-e~~~~~~i~~ef~~~~llr~----D~~~~~e~~~~~i~~G~~T~--------NE----~R~~lglpp 496 (651) T protein:vir:99 435 EWLYQI-IHQQA-LGVTDWTIEYELRGADQPKQ----EAQLAEQRVRAMRLAGVGLV--------DE----AREELGLDP 496 (651) T ss_pred HHHHHh-hcCcc-ccccCceEEEEeccchhhhc----cHHHHHHHHHHHHhCCCcCH--------HH----HHHHhCCCC Confidence 444332 22221 1122334555553 233222 22222222221111111111 11 111223221 Q ss_pred c------ccCCHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHhhhhhh--cCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 495 T------VIVPQEQVDQARQ------QRAQQQQQQQMMAMGMAAAQGAKTLSEA--KTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 495 ~------~~rs~~ev~~~rq------~r~q~~q~~~~~~~~~~~~~~a~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) - .....-.....-+ ....+.. ...-......++..+..... 
...-...-.+.+..++ --...+ T Consensus 497 i~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~-~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~g-yd~~~~ 573 (651) T protein:vir:99 497 LGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEP-PEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEGL-YDFGEN 573 (651) T ss_pred CCCccccccccccccccccccccCCCCcccccC-ccccccccchhhhhhhhhcccchhhhhhHHHHHHHhhc-CCCccc Confidence 0 0000000000000 0000000 00000000000000000000 0000011111111111 111111 No 138 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=93.18 E-value=0.0085 Score=31.94 Aligned_cols=319 Identities=8% Similarity=-0.012 Sum_probs=135.0 Q ss_pred HhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC-CCcc----eec----cCCccchhhHHHHHHHH Q lcl|NC_019445. 33 YINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP-ARPW----FRL----ATPDPEMMDYGPVKLWL 103 (559) Q Consensus 33 ~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp-~~~W----f~l----~~~d~~~~~~~~v~~~l 103 (559) ..++-..-|........ .+.. +..++. ..+| |.- .+......+...|..-. T Consensus 1 m~m~~~~~~~~~~~~~~-~~~~-------------------~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v 60 (392) T protein:vir:74 1 MILPILNFINQTNDPPE-AGSV-------------------QSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSII 60 (392) T ss_pred CcchhhhhhhcccCccc-cccc-------------------ccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHH Confidence 32222211111100000 0000 000000 0000 000 00000001111121111 Q ss_pred HHHHHH------------HHHHHHhccc----hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCC Q lcl|NC_019445. 104 EAVQNR------------MNDMFNKSNL----YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRG 166 (559) Q Consensus 104 ~~ve~~------------~~~~l~~snf----~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G 166 (559) +.+... ....+.+-|- +.=+...+.++.++|||.+++..+. 
+.++.+.+++...+-+..+.+| T Consensus 61 ~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~ 140 (392) T protein:vir:74 61 LQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE 140 (392) T ss_pred HHHHHhhccCceeeccchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC Confidence 111111 1112222222 3345566789999999999887654 3455666666666666555544 Q ss_pred CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCcee Q lcl|NC_019445. 167 SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKL 246 (559) Q Consensus 167 ~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~i 246 (559) .. ++.++... .......+ . T Consensus 141 ~~--~~y~~~~~----------------------~~~~~~~~-----------------------------~-------- 159 (392) T protein:vir:74 141 NG--MYYNITFD----------------------DPKIEPIL-----------------------------Q-------- 159 (392) T ss_pred ce--EEEEEEec----------------------CCccceeE-----------------------------E-------- Confidence 21 11111000 00000000 0 Q ss_pred eeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCCCcccc--------- Q lcl|NC_019445. 247 LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTSLKNQR--------- 315 (559) Q Consensus 247 l~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~~~~~~--------- 315 (559) |..--+++.++...+|..||.| |...+...+.......+.......-...|..+ ++++..... T Consensus 160 -----~~~~evih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 233 (392) T protein:vir:74 160 -----APQSDLIHMKLLSIDGGKTGIS-PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRS 233 (392) T ss_pred -----EcCccEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHH Confidence 0001144555555678789999 89988888988888888888888888888754 454432110 Q ss_pred ce--ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHH Q lcl|NC_019445. 
316 AS--LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLL 393 (559) Q Consensus 316 ~~--~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~ 393 (559) +. ...|++.+.+ ++..++|+.. ++....+.+..+..+..|-++|-...... +.. ..-|.. +.+..+-... T Consensus 234 ~~~~~n~g~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~--~~~~~~-~e~~~~~~~~ 305 (392) T protein:vir:74 234 FMKRSRSGGPVVLD---DLEEFTALEI-KSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQ--GDQQSS-IQQISGMYAS 305 (392) T ss_pred HhccccCCCeeecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC--CCcccH-HHHHHHHHHH Confidence 11 1234444332 2234566543 23444455666778889999998764443 222 222221 1222333456 Q ss_pred HhhhHHHHHHHHHHHHHHHHH------------------H---------------HHHHhcCCCCCCchhhCCcceEEEe Q lcl|NC_019445. 394 MLGPVLERLNDECLNPLIDRA------------------F---------------SMMVRKNMLPPPPDAMEGMPLKVEY 440 (559) Q Consensus 394 ~LG~v~~~l~~E~l~Pli~r~------------------~---------------~il~r~g~lp~~p~~l~g~~v~~~~ 440 (559) .|.|.+..+++|+-.-|+..+ + .++.+.|..|. + T Consensus 306 ~l~p~~~~ie~~l~~~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn---e---------- 372 (392) T protein:vir:74 306 ALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---D---------- 372 (392) T ss_pred HHHHHHHHHHHHHHHhccchhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc---c---------- Confidence 678888888777644332111 0 11122222220 0 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhccCh-hhHh Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQAKP-EALD 475 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~~~P-~~~~ 475 (559) +. -...+..+..-+. 
+-.+ T Consensus 373 ---------------~r-~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 373 ---------------LP-APENTNKKTTGQSNEPVP 392 (392) T ss_pred ---------------cc-hhcCCCCCCCCCCCCCCC Confidence 00 0011111111000 0011 No 139 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=92.85 E-value=0.0097 Score=31.61 Aligned_cols=451 Identities=9% Similarity=-0.018 Sum_probs=191.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CC-CCCCCC-CC--------------cccccCCCCcchHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRG-SR-FLTSEV-NR--------------NDRRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~-~~~~~~-~~--------------~~~~~~~~~~s~~~~ 63 (559) |.+...+.+...-...-..|. .+ ..+.|--... .+ +.+... +. .+.+.--.-++.+.. T Consensus 1 m~~~~~r~~~~~a~~~~~~~~----~~-~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~ 75 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSA----SL-GGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNG 75 (553) T ss_pred Ccchhhhhhcccccccchhhh----hh-hcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 877654433211111111111 11 1112211110 00 000000 00 000111134677888 Q ss_pred HHHHHHHHHHHh-hcCCCCccee-ccCCccchhhHHHHHHHHHHHHHHHHHHH----------HhccchHHHHHHHHHHH Q lcl|NC_019445. 64 AARTLASGMMSG-ITSPARPWFR-LATPDPEMMDYGPVKLWLEAVQNRMNDMF----------NKSNLYQSLPQLYGSLG 131 (559) Q Consensus 64 a~~~Las~l~~~-l~pp~~~Wf~-l~~~d~~~~~~~~v~~~l~~ve~~~~~~l----------~~snf~~~~~~~~~dl~ 131 (559) +++.+++.+++. |+|..+|=.+ |.-.+. 
...+.|-+.|++.-...- -..+||.....++..++ T Consensus 76 av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~-----~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~ 150 (553) T protein:vir:63 76 AVGYQRDSIVGAQYRLNSMPDINVIPGATE-----EWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYV 150 (553) T ss_pred HHHHHHHhhccCCceeeeccchhhhcCCCH-----HHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHH Confidence 888888877765 7765444332 211111 223344444444332221 24579999999999999 Q ss_pred hhCcEEEEEeecCCce----EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCce Q lcl|NC_019445. 132 TYSTGAMAVLEDDEDI----IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKW 207 (559) Q Consensus 132 ~~G~~~l~v~~~~~~~----~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~ 207 (559) +-|-+++....++..+ +.++.+....+.-..+ .+.. T Consensus 151 ~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~--------------------------------------~~~~-- 190 (553) T protein:vir:63 151 KTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQ--------------------------------------QLDT-- 190 (553) T ss_pred hCCceEEEeeeccCCCCcccceEEEechhhcCCCCC--------------------------------------CCCC-- Confidence 9999887655443322 1222222211111000 0101 Q ss_pred EEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeec---------Cc--ccCCeEEEEee-ecCCCcccccch Q lcl|NC_019445. 208 IEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRES---------GF--DEFPIMAPRWE-VNGEDVYGSSCP 275 (559) Q Consensus 208 v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~es---------g~--~~~P~~~~rw~-~~~g~~YGrG~P 275 (559) -.|+..|+-+.. ++|.+ ||+.....++.+.... .+ ---|-+..-|. ..+|..=|.+ . T Consensus 191 ~~i~~GVE~d~~---------Gr~va-Y~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis-~ 259 (553) T protein:vir:63 191 PTLRRGVQYDKR---------GRPQG-YWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIA-D 259 (553) T ss_pred CeeEeeeEECCC---------CceEE-EEeeccCCCccccccccccceeeeccccccChhHheecccccCCCcccCCc-h Confidence 124555543322 12221 2222111111110000 00 01122222333 3577888888 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCCCc----------------------------------cccceecCC Q lcl|NC_019445. 
276 GMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK----------------------------------NQRASLLPG 321 (559) Q Consensus 276 ~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~----------------------------------~~~~~~~pg 321 (559) .-.+|..++.|+....+.+.++..++...+.+..+.. .....+.|| T Consensus 260 lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG 339 (553) T protein:vir:63 260 IVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGA 339 (553) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCc Confidence 7889999999999999999999998888765532110 012346788 Q ss_pred ceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHH Q lcl|NC_019445. 322 DITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLER 401 (559) Q Consensus 322 ~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~ 401 (559) .+......+....+.|-. .+.++..+. ..+...|..++=..+ .++ ..|-..++-.-+++-..|.-+.+-..=.. T Consensus 340 ~i~~L~pGe~i~~~~p~~-p~~~~~~F~---~~~lr~iaaglGi~Y-e~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~ 413 (553) T protein:vir:63 340 KIPHLFPGTKLNLKPMGT-PGGVGSEFE---ASLNRHLASAFGMSY-EEF-TRDFSKANYSSIQAGIAMTRRFLEGRKKM 413 (553) T ss_pred eeeecCCCCeeeecCCCC-CCCCHHHHH---HHHHHHHHhhcCCCH-HHH-hhhcccccHHHHHHHHHHHHHHHHHHHHH Confidence 777664333333333322 122333322 333344544442211 122 13445566666777777777777666667 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCchh---h------CCcceEEEeecH-------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 402 LNDECLNPLIDRAFSMMVRKNMLPPPPDA---M------EGMPLKVEYISV-------MAQAQKSIGLSSLASTVNFIGQ 465 (559) Q Consensus 402 l~~E~l~Pli~r~~~il~r~g~lp~~p~~---l------~g~~v~~~~is~-------La~a~r~~~~~~l~~~~~~~~~ 465 (559) |...|..|+.++++....-.|.+|-|+-. + ...-++++++.| +--++ +....|..-+..... 
T Consensus 414 ~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~--A~~~~i~~G~~t~~~ 491 (553) T protein:vir:63 414 CADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQ--AAVMRIDAGLSTYER 491 (553) T ss_pred HHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHH--HHHHHHHcCCCCHHH Confidence 77889999999999999999988743211 0 001133444443 21110 111111111111111 Q ss_pred HhccChhhHhcCCHHHHHHHHH------HHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC Q lcl|NC_019445. 466 LAQAKPEALDKLNVDQAIDAFA------DMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTS 539 (559) Q Consensus 466 la~~~P~~~~~id~d~~~~~~a------~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~ 539 (559) +.. ..-.|+++.++.++ +.+|++...- .....++..++.+. T Consensus 492 ~~a-----~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~----------------------------~~~~~~~~~~~~~~ 538 (553) T protein:vir:63 492 EIA-----RLGGDFRKSFAQRAREDALLKKYGLTFNLS----------------------------AKRSLGDGRDAATG 538 (553) T ss_pred HHH-----HhCCCHHHHHHHHHHHHHHHHHcCCCCCCC----------------------------CccccCCCcccCCC Confidence 100 00022222222221 2223321100 00000000001110 Q ss_pred ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 540 DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 540 ~~~~~~~~~~~~~~~~~~~~ 559 (559) ..+ ......+++++ T Consensus 539 ~~~------~~~~~~~~~~~ 552 (553) T protein:vir:63 539 IAE------DPAAAQTSQQG 552 (553) T ss_pred CCC------CCCCCCccccc Confidence 000 00111111111 No 140 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=92.54 E-value=0.011 Score=31.33 Aligned_cols=238 Identities=12% Similarity=0.053 Sum_probs=104.2 Q ss_pred CChhhHHHHHHHHHHHHHHhh-hHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQ-SFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~-~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |-= |... ..|+ .....|..-.--++|..........+. ..-+-.++--.|++.+|+.+.+. T Consensus 1 Mgl---------F~~~-~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~iA~l---- 62 (251) T protein:vir:46 1 MGI---------FYKN-EKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLARM---- 62 (251) T ss_pred CCc---------cccc-cccccCCCccchhhhhhhhccccCcCcceech----hhhhccHHHHHHHHHHHHhHhhC---- Confidence 321 2111 1121 111111111111122221111111110 11122344455666666655443 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hccc----hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEe Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSNL----YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPF 153 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~snf----~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~ 153 (559) ||.-.. .... .. ++-+...|+ +-|- +.-+.....++..+|||.+++..+. +.++.+.++ T Consensus 63 --p~~~~~-~~~~-~~-----------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i 127 (251) T protein:vir:46 63 --PIRVTV-NGQI-NY-----------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFR 127 (251) T ss_pred --ceEEee-Cccc-cc-----------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 343222 1111 11 122233342 3333 3344556788899999999998764 456778888 Q ss_pred eccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEE Q lcl|NC_019445. 154 PIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFK 233 (559) Q Consensus 154 ~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~ 233 (559) +...+-+..|.+|++--.|.. .. . 
..+ ...+ .++ T Consensus 128 ~~~~v~v~~~~~g~~~~~~~~--~~---------~------------~~~-g~~~-----~~~----------------- 161 (251) T protein:vir:46 128 KTSEIELKSDARGRLYYFHQR--ID---------S------------NGN-NIER-----NVK----------------- 161 (251) T ss_pred CCceEEEEECCCCcEEEEEEE--ec---------c------------CCc-ceeE-----EEC----------------- Confidence 888888888777754211110 00 0 000 0000 000 Q ss_pred EEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCCC Q lcl|NC_019445. 234 SVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTSL 311 (559) Q Consensus 234 sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~~ 311 (559) .++ ++++|....+| .||.| |...+...+......++.......-...|..+ +++.+ T Consensus 162 --------~~d------------iiH~r~~~~dg-~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l 219 (251) T protein:vir:46 162 --------FED------------MLDIKFYSLDG-INGLS-LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL 219 (251) T ss_pred --------Ccc------------EEEecCcCCCC-eeecC-HHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCC Confidence 001 34445444444 79999 89988888888888888888888888888754 44444 Q ss_pred cccc-ce-ecCC-ceeecC-CcCCchhhhhhhhccccHHHHHHH Q lcl|NC_019445. 312 KNQR-AS-LLPG-DITYID-QITGQDGFRPAYLVNPSTADLVAD 351 (559) Q Consensus 312 ~~~~-~~-~~pg-~~~~~~-~~~~~~~~~p~~~~~~~~~~~~~~ 351 (559) .+.. .+ +.-. ...+.+ ...+. 
+...+++ T Consensus 220 ~~~e~~~~~~~~~~~~~~g~~n~g~------------~~~gm~~ 251 (251) T protein:vir:46 220 DNKKARDRAREEFPKVLVELNKLGK------------LSYSMNQ 251 (251) T ss_pred CCHHHHHHHHHHHHHHhcCcccccc------------cccccCC Confidence 2221 11 1100 000000 00000 0000110 No 141 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=92.16 E-value=0.013 Score=31.00 Aligned_cols=330 Identities=9% Similarity=0.007 Sum_probs=146.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CCCCC-CCCCCccc--ccCCCCcchHHHHHHHHHHHHHHhh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRG-SRFLT-SEVNRNDR--RNTRIIDSTGTMAARTLASGMMSGI 76 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~~~~-~~~~~~~~--~~~~~~~s~~~~a~~~Las~l~~~l 76 (559) |.=.. ++.++..|.+-.. .....+..+-. ..+.. -....+.. ...-+-.++--.|++.+|+.+.+. T Consensus 1 m~m~~-------f~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l- 70 (392) T protein:vir:10 1 MILPI-------LNFINQTNDPPEV--GSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIV- 70 (392) T ss_pred Ccchh-------hhhhhcccccccc--cccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccC- Confidence 22222 1122222221110 00011110000 00000 00000100 000111234455666666655441 Q ss_pred cCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccc----hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEE Q lcl|NC_019445. 77 TSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNL----YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTM 151 (559) Q Consensus 77 ~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~ 151 (559) |+ ++. +... ...+.+-|- +.=+..++.++.++|||.+++..+. +.++.+. 
T Consensus 71 -----p~-~~~--~~~~-----------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~ 125 (392) T protein:vir:10 71 -----KI-NAE--KKKN-----------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWE 125 (392) T ss_pred -----ce-eec--cchh-----------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEE Confidence 22 221 1110 012223333 3345566779999999999887654 3455666 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +++..++-+..+.+|..- +.++... . ..... T Consensus 126 ~l~~~~v~~~~~~~~~~~--~y~~~~~-----------~-----------~~~~~------------------------- 156 (392) T protein:vir:10 126 YLRPSQVNTYYFEYENGM--YYNITFD-----------D-----------PKIEP------------------------- 156 (392) T ss_pred EEcCceeEEEEcCCCceE--EEEEEec-----------C-----------cccce------------------------- Confidence 666666655555443211 1100000 0 00000 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecC Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APT 309 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~ 309 (559) ..+ |..--+++.++...+|..||.| |...+...+.......+.......-...|..+ +++ T Consensus 157 --~~~---------------~~~~eiih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 218 (392) T protein:vir:10 157 --ILQ---------------APQSDLIHMKLLSIDGGKTGIS-PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG 218 (392) T ss_pred --eEE---------------EccccEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC Confidence 000 0011255666666778899999 89988888888888888888888888888755 444 Q ss_pred CCcccc---------c--eecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
310 SLKNQR---------A--SLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 310 ~~~~~~---------~--~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +..... + ....|++...+ ++..++|+.. ++....+.+..+..+..|-++|-...... +.. .. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~g~~~vl~---~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~--~~ 291 (392) T protein:vir:10 219 GGLLSDKDKASRSRSFMKRSRSGGPVVLD---DLEEFTALEI-KSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQ--GD 291 (392) T ss_pred CCCchHHHHHHHHHHHhccccCCCeeecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC--CC Confidence 421110 1 11234444332 2234566543 23334445666778888999997754433 222 22 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH------------------H---------------HHHHhcCCCC Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRA------------------F---------------SMMVRKNMLP 425 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~------------------~---------------~il~r~g~lp 425 (559) -|.. +.+...=....|.|.+.++++|+-.-|+..+ + .++.+.|..| T Consensus 292 ~~~~-~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p 370 (392) T protein:vir:10 292 QQSS-IQQISGMYASALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIP 370 (392) T ss_pred cccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCc Confidence 2221 1222333456788888888887754432210 0 1122233322 Q ss_pred -------CCchhhCCcceEEEe Q lcl|NC_019445. 
426 -------PPPDAMEGMPLKVEY 440 (559) Q Consensus 426 -------~~p~~l~g~~v~~~~ 440 (559) .+|..-.|+.=++.+ T Consensus 371 ~e~r~~e~l~~~~~Gd~~~p~p 392 (392) T protein:vir:10 371 KDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred cccchhcCCCCCCCCCCCCCCC Confidence 111111111111111 No 142 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=92.16 E-value=0.013 Score=31.00 Aligned_cols=330 Identities=9% Similarity=0.007 Sum_probs=146.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CCCCC-CCCCCccc--ccCCCCcchHHHHHHHHHHHHHHhh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRG-SRFLT-SEVNRNDR--RNTRIIDSTGTMAARTLASGMMSGI 76 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~~~~-~~~~~~~~--~~~~~~~s~~~~a~~~Las~l~~~l 76 (559) |.=.. ++.++..|.+-.. .....+..+-. ..+.. -....+.. ...-+-.++--.|++.+|+.+.+. T Consensus 1 m~m~~-------f~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l- 70 (392) T protein:vir:39 1 MILPI-------LNFINQTNDPPEV--GSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIV- 70 (392) T ss_pred Ccchh-------hhhhhcccccccc--cccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccC- Confidence 22222 1122222221110 00011110000 00000 00000100 000111234455666666655441 Q ss_pred cCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccc----hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEE Q lcl|NC_019445. 77 TSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNL----YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTM 151 (559) Q Consensus 77 ~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~ 151 (559) |+ ++. +... ...+.+-|- +.=+..++.++.++|||.+++..+. +.++.+. 
T Consensus 71 -----p~-~~~--~~~~-----------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~ 125 (392) T protein:vir:39 71 -----KI-NAE--KKKN-----------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWE 125 (392) T ss_pred -----ce-eec--cchh-----------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEE Confidence 22 221 1110 012223333 3345566779999999999887654 3455666 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +++..++-+..+.+|..- +.++... . ..... T Consensus 126 ~l~~~~v~~~~~~~~~~~--~y~~~~~-----------~-----------~~~~~------------------------- 156 (392) T protein:vir:39 126 YLRPSQVNTYYFEYENGM--YYNITFD-----------D-----------PKIEP------------------------- 156 (392) T ss_pred EEcCceeEEEEcCCCceE--EEEEEec-----------C-----------cccce------------------------- Confidence 666666655555443211 1100000 0 00000 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecC Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APT 309 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~ 309 (559) ..+ |..--+++.++...+|..||.| |...+...+.......+.......-...|..+ +++ T Consensus 157 --~~~---------------~~~~eiih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 218 (392) T protein:vir:39 157 --ILQ---------------APQSDLIHMKLLSIDGGKTGIS-PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKG 218 (392) T ss_pred --eEE---------------EccccEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC Confidence 000 0011255666666778899999 89988888888888888888888888888755 444 Q ss_pred CCcccc---------c--eecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
310 SLKNQR---------A--SLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 310 ~~~~~~---------~--~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +..... + ....|++...+ ++..++|+.. ++....+.+..+..+..|-++|-...... +.. .. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~g~~~vl~---~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~--~~ 291 (392) T protein:vir:39 219 GGLLSDKDKASRSRSFMKRSRSGGPVVLD---DLEEFTALEI-KSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQ--GD 291 (392) T ss_pred CCCchHHHHHHHHHHHhccccCCCeeecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC--CC Confidence 421110 1 11234444332 2234566543 23334445666778888999997754433 222 22 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH------------------H---------------HHHHhcCCCC Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRA------------------F---------------SMMVRKNMLP 425 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~------------------~---------------~il~r~g~lp 425 (559) -|.. +.+...=....|.|.+.++++|+-.-|+..+ + .++.+.|..| T Consensus 292 ~~~~-~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p 370 (392) T protein:vir:39 292 QQSS-IQQISGMYASALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIP 370 (392) T ss_pred cccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCc Confidence 2221 1222333456788888888887754432210 0 1122233322 Q ss_pred -------CCchhhCCcceEEEe Q lcl|NC_019445. 
426 -------PPPDAMEGMPLKVEY 440 (559) Q Consensus 426 -------~~p~~l~g~~v~~~~ 440 (559) .+|..-.|+.=++.+ T Consensus 371 ~e~r~~e~l~~~~~Gd~~~p~p 392 (392) T protein:vir:39 371 KDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred cccchhcCCCCCCCCCCCCCCC Confidence 111111111111111 No 143 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=91.18 E-value=0.017 Score=30.27 Aligned_cols=386 Identities=11% Similarity=0.014 Sum_probs=155.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-CCCCCCCCCCccccc--CCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRG-SRFLTSEVNRNDRRN--TRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~~~--~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |++++.+-+.+.... .|+.....-......+-|.. ..|.+..+..+..-+ +-+=.++--.|++.+|+.+-+ T Consensus 1 ~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~--- 74 (434) T protein:vir:43 1 MSKSLGKVLSSATSA---PRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAG--- 74 (434) T ss_pred Cccchhhhhhhcccc---cchhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhh--- Confidence 999876644433332 22221110011111111100 011111111111100 111122334556666665443 Q ss_pred CCCCcceeccCC-ccchhhHHHHHHHHHHHHHHHHHHHH-hccch----HHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 78 SPARPWFRLATP-DPEMMDYGPVKLWLEAVQNRMNDMFN-KSNLY----QSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 78 pp~~~Wf~l~~~-d~~~~~~~~v~~~l~~ve~~~~~~l~-~snf~----~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) -||.-..-. +....+ ..+..+...|+ +-|-+ .=...++.+|..+||+.+++..+.+.++.+. 
T Consensus 75 ---lp~~~~~~~~~g~~~~---------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~G~~~~L~ 142 (434) T protein:vir:43 75 ---LPLGVYERKADGSRVD---------ARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAAGRPAALD 142 (434) T ss_pred ---CceEEEEEcCCCcccc---------ccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEE Confidence 355322211 111000 01222344443 23333 3345567899999999999988777777777 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +++...+-+..|.+|++- |+.+. .+ ...+++ + T Consensus 143 ~l~p~~v~~~~~~~g~~~--y~~~~-------------------------~~-g~~~~~-----~--------------- 174 (434) T protein:vir:43 143 FLLPSRVDLECDENGRLK--YFYTT-------------------------KK-GARREI-----E--------------- 174 (434) T ss_pred EEcCcceEEEEcCCCeEE--EEEEe-------------------------cC-ceEEEE-----c--------------- Confidence 777777777777666432 11100 00 001100 0 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cC Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PT 309 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~ 309 (559) .. -+++.+....+| .||.| |...+...+.......+.......-...|..++ +. T Consensus 175 ----------~~------------eVih~~~~~~dg-~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~ 230 (434) T protein:vir:43 175 ----------RT------------NMLHIPAFTLDG-RIGLS-AIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR 230 (434) T ss_pred ----------cc------------cEEEecCcCCCC-ccccC-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC Confidence 00 122333333344 79999 898887777777777777666666667776543 44 Q ss_pred CCcccc-------ceec-----CCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 
310 SLKNQR-------ASLL-----PGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 310 ~~~~~~-------~~~~-----pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) .+.... ++-. .|++..+ +++..++|+.. ++.-..+.+........|-++|-.... .++..+.. T Consensus 231 ~l~~e~~~~~r~~~~~~~g~~nag~~~vl---~~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~ 305 (434) T protein:vir:43 231 ILQPAQREEFREYVKSVSGAMNSGRSPVL---EQGITPETIGI-NPVDAQLLETREHGVIEICRWFGVPPW-MIGQTDKG 305 (434) T ss_pred CCCHHHHHHHHHHHHHhcCccccCCcccc---CCCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCcCC Confidence 332211 1101 2232222 22234555532 333334455566678889999977533 33333333 Q ss_pred CcCHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeec-----HHHHHHH-- Q lcl|NC_019445. 378 SMPVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYIS-----VMAQAQK-- 449 (559) Q Consensus 378 ~~TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is-----~La~a~r-- 449 (559) ..+..-+.+.... ....|.|.+.+++.++-.-| +++ .+..+--|++.... ..+++.- T Consensus 306 ~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL-------------~~~--~~~~~~~~~fd~~~llr~d~~~r~~~~~ 370 (434) T protein:vir:43 306 SNWGTGLEQQMLAFLTFSISSITNQIQQCVNKRL-------------LTA--PERIRYYAEFSLEGFLKADSAGRAAWYS 370 (434) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------------CCh--hhhcCceEEEechhhhccCHHHHHHHHH Confidence 2222222222222 12234555555544443222 222 11112223332221 2222211 Q ss_pred -HHHHH--HHHHHHHHHHHHhcc-Chh-h---HhcCCHHHHH---------HHHHHHcCCCccccCCHHH Q lcl|NC_019445. 450 -SIGLS--SLASTVNFIGQLAQA-KPE-A---LDKLNVDQAI---------DAFADMSGVSPTVIVPQEQ 502 (559) Q Consensus 450 -~~~~~--~l~~~~~~~~~la~~-~P~-~---~~~id~d~~~---------~~~a~~~Gvp~~~~rs~~e 502 (559) ..+.. .+...-..+ .+..+ +-+ . 
+..+..|.+- ...-...|-| +++| T Consensus 371 ~~~~~G~~T~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~ 434 (434) T protein:vir:43 371 TMAQNGFMTRNEGRRKE-NLPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQP-----EPQE 434 (434) T ss_pred HHHhCCCcCHHHHHHHh-CCCCCCCCCeEeeccCccchhhhhccCCCcchhhhhhccCCCC-----CCCC Confidence 11110 122222221 12222 001 0 0111111111 1111112222 1222 No 144 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=90.83 E-value=0.019 Score=30.04 Aligned_cols=445 Identities=10% Similarity=0.010 Sum_probs=189.4 Q ss_pred CChhhHHHH-----------HHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCC-----CCCCcccccCCCCcchHHHH Q lcl|NC_019445. 1 MAETTKERL-----------NKQFAQLESERQSFEPHWRELSDYINPRGSRFLTS-----EVNRNDRRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l-----------~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~-----~~~~~~~~~~~~~~s~~~~a 64 (559) |.-+....+ ...|....+-+......|. |........ .....+.+.----++.+..+ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~-------p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 73 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWN-------PPSESVDAALLPNFTRGNARADDLVRNNGYAANA 73 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccc-------cCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 544422211 1122222111111112221 111000000 00000111111246788999 Q ss_pred HHHHHHHHHHh-hcCCCCccee-ccCCccchhhHHHHHHHHHHHHHHHHHH----------HHhccchHHHHHHHHHHHh Q lcl|NC_019445. 65 ARTLASGMMSG-ITSPARPWFR-LATPDPEMMDYGPVKLWLEAVQNRMNDM----------FNKSNLYQSLPQLYGSLGT 132 (559) Q Consensus 65 ~~~Las~l~~~-l~pp~~~Wf~-l~~~d~~~~~~~~v~~~l~~ve~~~~~~----------l~~snf~~~~~~~~~dl~~ 132 (559) ++.+++.+++. |+|..+|=.+ |...++ ..++|-+.+++.-... 
=.+.+||.....++..+++ T Consensus 74 v~~~~~nvVG~Gi~~~~~p~~~~lg~~~~------~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~ 147 (533) T protein:vir:34 74 IQLHQDHIVGSFFRLSHRPSWRYLGIGEE------EARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAF 147 (533) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCChh------HHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHh Confidence 99999888764 7776655433 332221 1233333444433222 2245899999999999999 Q ss_pred hCcEEEEEeecCCce----EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceE Q lcl|NC_019445. 133 YSTGAMAVLEDDEDI----IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWI 208 (559) Q Consensus 133 ~G~~~l~v~~~~~~~----~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v 208 (559) -|-+++.+..+...+ +.++.+....+--..+ .+... T Consensus 148 dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~--------------------------------------~~~~~-- 187 (533) T protein:vir:34 148 NGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNN--------------------------------------TGDSR-- 187 (533) T ss_pred CCceEEEeeeccCCCCccceEEEEechhhcCCCCC--------------------------------------CCCCC-- Confidence 999887655443332 2222222222110000 00011 Q ss_pred EEEEEEeecCcccccccccccccEEEEEEEecCCCceeee----ecCcccCC---eEEEEeeecCCCcccccchHHHHHH Q lcl|NC_019445. 209 EVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLR----ESGFDEFP---IMAPRWEVNGEDVYGSSCPGMLALG 281 (559) Q Consensus 209 ~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~----esg~~~~P---~~~~rw~~~~g~~YGrG~P~~~~l~ 281 (559) .|+..|+-+... .|.+ ||+.....+..... ..-+...| +++.-....+|..=|.+ ..-.+|. T Consensus 188 ~i~~GIe~d~~G---------r~~a-Y~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis-~lapvl~ 256 (533) T protein:vir:34 188 NCRAGVQINDSG---------AALG-YYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGAN-VFYSVME 256 (533) T ss_pred ceEeeeEECCCC---------CeEE-EEEeecCCCCccccccceeeeeeccChhHeeeeccccCCCcccCCc-hHHHHHH Confidence 244555432221 2221 22221110000000 00011112 34444445688899998 5889999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeecCCCc--------------------------------cccceecCCceeecCCc Q lcl|NC_019445. 
282 PVKALQLLQKRKSQLIDKATNPPMVAPTSLK--------------------------------NQRASLLPGDITYIDQI 329 (559) Q Consensus 282 d~~~L~~l~~~~~~~~~~~~~p~~~~p~~~~--------------------------------~~~~~~~pg~~~~~~~~ 329 (559) .++.|+....+.+.++..++.....+..+.- ...+.+.||.+...... T Consensus 257 ~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG 336 (533) T protein:vir:34 257 QMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG 336 (533) T ss_pred HHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC Confidence 9999999999999999988887765532210 01124677776655332 Q ss_pred CCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_019445. 330 TGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNP 409 (559) Q Consensus 330 ~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~P 409 (559) .....+.|-. .+.++..+ +..+...|-.++=..+ .++ ..|-+.++-.-+++-..|.-+.+-..=..|..-|..| T Consensus 337 e~i~~~~~~~-p~~~~~~f---~~~~lr~iAaglGi~y-e~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~p 410 (533) T protein:vir:34 337 DSLNLQTAQD-TDNGYSVF---EQSLLRYIAAGLGVSY-EQL-SRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQ 410 (533) T ss_pred CeeeecCCCC-CCCCHHHH---HHHHHHHHHhhcCCCH-HHH-hhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222233221 12233322 2333444544442211 222 2344556666666666666666666666677778899 Q ss_pred HHHHHHHHHHhcCCCCCCch---hh-C--CcceEEEeecH-------HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhc Q lcl|NC_019445. 410 LIDRAFSMMVRKNMLPPPPD---AM-E--GMPLKVEYISV-------MAQAQKSIGLSSLASTVNFIGQLAQAKPEALDK 476 (559) Q Consensus 410 li~r~~~il~r~g~lp~~p~---~l-~--g~~v~~~~is~-------La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~ 476 (559) +..+++..+.-.|.+|-|.. +. . ..-.+++++.| +--++ +....|..-+... 
T Consensus 411 i~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~--a~~~~i~~G~~s~------------- 475 (533) T protein:vir:34 411 MFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQ--EAVMLIEAGLSTY------------- 475 (533) T ss_pred HHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHH--HHHHHHHcCCCCH------------- Confidence 99999999999999874321 00 0 01134444443 21110 1111111111111 Q ss_pred CCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHhhhhhhcCCChhHHHHHHHHhh Q lcl|NC_019445. 477 LNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAM----GMAAAQGAKTLSEAKTSDPSVLSAMANAVS 552 (559) Q Consensus 477 id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~----~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 552 (559) ..++...|.+. +|+.+. ++...+.....-. ........+...+......+. T Consensus 476 -------~~~~a~~G~D~------~ev~~q---~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~--------- 530 (533) T protein:vir:34 476 -------EKECAKRGDDY------QEIFAQ---QVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDS--------- 530 (533) T ss_pred -------HHHHHHcCCCH------HHHHHH---HHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccC--------- Confidence 12222234332 122111 1111111100000 000000000000000000000 Q ss_pred cCCCCC Q lcl|NC_019445. 553 GQGGQS 558 (559) Q Consensus 553 ~~~~~~ 558 (559) .++ T Consensus 531 ---~~~ 533 (533) T protein:vir:34 531 ---RAA 533 (533) T ss_pred ---CCC Confidence 000 No 145 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=89.34 E-value=0.027 Score=29.19 Aligned_cols=258 Identities=11% Similarity=0.068 Sum_probs=120.4 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-h----ccchHHHHHHHHHHHhhCcEEEEEeec-CCceEEEE Q lcl|NC_019445. 
78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-K----SNLYQSLPQLYGSLGTYSTGAMAVLED-DEDIIRTM 151 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~v~~~-~~~~~~~~ 151 (559) =++-||--.. .++. . +.-+...|+ + .+.+.=+...+.++.++|||++++..+ .+.++.+. T Consensus 1 ia~l~~~~~~-~~~~------~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~ 66 (278) T protein:vir:78 1 MASLPLKMYE-DYKV------V-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLF 66 (278) T ss_pred CccceeEEEe-cCcc------c-------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEE Confidence 1133332211 1111 0 111223333 2 234445677888999999999988764 34455666 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) +++...+-+..+.+|.. ++..+. ..+ ...++ + T Consensus 67 ~l~~~~v~v~~~~~~~~--~~y~~~------------------------~~~-g~~~~-----~---------------- 98 (278) T protein:vir:78 67 LLNPDVVEMLIENQSRE--LYYSIH------------------------AAT-GNKLI-----V---------------- 98 (278) T ss_pred EECCceeEEEEcCCCce--EEEEEE------------------------cCC-ceEEE-----E---------------- Confidence 66667666666555432 121111 000 00000 0 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc-Cceeec-- Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATN-PPMVAP-- 308 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~-p~~~~p-- 308 (559) ...-.++.|.....+..||.| |...+...+.......+..+ ....+ |..++. 
T Consensus 99 ---------------------~~~evih~~~~~~~~~~~G~s-~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~i~~~~ 153 (278) T protein:vir:78 99 ---------------------HNMDMLHFKHIVASNMVQGIS-PIDVLKNTTDFDNAVRTFNL---TEMQKPDSFMLKYG 153 (278) T ss_pred ---------------------ccccEEEECCCCCCCCeeecc-HHHHHHHHHHHHHHHHHHHH---HHhcCCCcEEEEeC Confidence 001133444433456689999 88877776766666655533 33333 444432 Q ss_pred CCCcccc----------ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 309 TSLKNQR----------ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 309 ~~~~~~~----------~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) ..+.... ..-..|++.+.+ + +..++++.. ++.-..+.+..+...+.|-.+|-..........++.. T Consensus 154 ~~l~~e~~~~~~~~~~~~~~~~g~~~vl~--~-g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~ 229 (278) T protein:vir:78 154 SNVGKEKRQQVLEDFKQYYEENGGILFQE--P-GVEIEPLPK-KYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNF 229 (278) T ss_pred CCCCHHHHHHHHHHHHHHhccCCCceecC--C-CceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 2221110 111244555443 2 223555532 2233344555677788899999775333322222333 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh-CCcceEEEeecHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM-EGMPLKVEYISVM 444 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l-~g~~v~~~~is~L 444 (559) -|+++.. ..+...-+.|++++.-..|.+. ++|+ .+. 
.|.-|++...+ | T Consensus 230 sn~~~~~--------------~~~~~~~l~P~~~~i~~~ln~~-L~~~--~e~~~g~~~~f~~~~-l 278 (278) T protein:vir:78 230 AKNEELN--------------RFYLQHTLLPIVKQYEEEFNRK-LLTK--TDREKIGILNLTLNL-I 278 (278) T ss_pred ccHHHHH--------------HHHHHHHHHHHHHHHHHHHHhh-cCCh--hHhcCCceEEEeccc-C Confidence 3443322 1234445666666655555443 3443 232 34445554333 3 No 146 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=88.22 E-value=0.034 Score=28.66 Aligned_cols=426 Identities=9% Similarity=0.066 Sum_probs=158.2 Q ss_pred CChhhHH----------------------HHHHHHHHHHHHh--hhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCC Q lcl|NC_019445. 1 MAETTKE----------------------RLNKQFAQLESER--QSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRI 56 (559) Q Consensus 1 M~~~~~~----------------------~l~~r~~~l~~~R--~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~ 56 (559) |..+++. .+..++.+++..= -+....-++ -.|+-|......+.. +. .+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~p~~~~~~~~~---~~--~~~p 74 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQ-QAYAEPFLEVMDTNP---EF--RTKR 74 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCcc-chhhcceeeeeecCC---Cc--cccC Confidence 3333221 1222232332100 000000111 112223211111111 11 1111 Q ss_pred Cc-----------------chHHHHHHHHHHHHHHhhcCC-----CCcc-eeccCCccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 57 ID-----------------STGTMAARTLASGMMSGITSP-----ARPW-FRLATPDPEMMDYGPVKLWLEAVQNRMNDM 113 (559) Q Consensus 57 ~~-----------------s~~~~a~~~Las~l~~~l~pp-----~~~W-f~l~~~d~~~~~~~~v~~~l~~ve~~~~~~ 113 (559) +. ++.-.|++.+|+.+...-.|. .-.| +++.-.+-...+. -..-+..+++.+... 
T Consensus 75 ~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~--~~~~~~~l~~~l~~~ 152 (576) T protein:vir:96 75 SYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKK--EKEEIKRIENFILNT 152 (576) T ss_pred cchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchh--hhHhhhhHHhhHhhc Confidence 11 123355666665554321221 1112 3333222221111 111122233333332 Q ss_pred HH-----hccchHHHHHHHHHHHhhCcEEEEEeec---CCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHh Q lcl|NC_019445. 114 FN-----KSNLYQSLPQLYGSLGTYSTGAMAVLED---DEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQE 185 (559) Q Consensus 114 l~-----~snf~~~~~~~~~dl~~~G~~~l~v~~~---~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~ 185 (559) +. ..+|+.-+..++.|+.++|||.+++..+ .+.++.+.+++...+.+..+.+|.+-.-...|.... T Consensus 153 ~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~------ 226 (576) T protein:vir:96 153 GRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVI------ 226 (576) T ss_pred cCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEec------ Confidence 22 1345666677889999999999987632 234567777777777777777775422111110000 Q ss_pred cCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeec Q lcl|NC_019445. 186 FGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN 265 (559) Q Consensus 186 fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~ 265 (559) +....+ ... ..+ .++.+.... T Consensus 227 -----------------~~~~~~-----------------------------~~~-~~d------------ii~~~~~~~ 247 (576) T protein:vir:96 227 -----------------NKKVVA-----------------------------SFT-SRE------------MAMGIRNPR 247 (576) T ss_pred -----------------CCceEE-----------------------------Eec-ccc------------eEEEeecCC Confidence 000000 000 000 111111111 Q ss_pred C---CCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce--eecCCCcccc---------ce-ec-----CCceee Q lcl|NC_019445. 
266 G---EDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM--VAPTSLKNQR---------AS-LL-----PGDITY 325 (559) Q Consensus 266 ~---g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~--~~p~~~~~~~---------~~-~~-----pg~~~~ 325 (559) . ...||.+ |...+...+.......+.......-...|.. .++.+..... +. .. .|++.. T Consensus 248 ~d~~~~~~G~S-pi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~ 326 (576) T protein:vir:96 248 TELSSSGYGLS-EVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPV 326 (576) T ss_pred CCcccCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccccee Confidence 1 2469999 8988877777777777777777776777774 3454432110 11 11 122211 Q ss_pred cCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC---------cCHHHHHHHHHHHHHHhh Q lcl|NC_019445. 326 IDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS---------MPVEAVIEMKEEKLLMLG 396 (559) Q Consensus 326 ~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~---------~TA~Ei~~r~~e~~~~LG 396 (559) + .+++-.++|+. .++.-..+.+..+.....|-++|-.+.... +..+... .|=.-+.+... T Consensus 327 v--l~~G~~~~~ls-~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~g~~~~~s~t~sn~e~~~~------- 395 (576) T protein:vir:96 327 V--MADDIKFVNMT-PTANDMQFEKWLTYLINIISALYGIDPAEI-GFPNRGGATGGKGGNTLNEADPGKKQQ------- 395 (576) T ss_pred e--cCCCceEEecc-CChhhHHHHHHHHHhHHHHHHHhCCCHHHc-cccccccccccccccccccccHHHHHH------- Confidence 1 12223466663 234444556667788899999998875544 3222211 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhc Q lcl|NC_019445. 397 PVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDK 476 (559) Q Consensus 397 ~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~ 476 (559) .+...-|.|++.++-..|.+ .++|. .+..+.+++..+-. ............+. . 
.- T Consensus 396 ----~f~~~tL~P~~~~ie~~ln~-~Ll~~-----~~~~~~~~f~r~d~--------~~~~e~~~~~~~~~---~---G~ 451 (576) T protein:vir:96 396 ----QSQNKGLQPLLRFIEDLINT-HIISE-----YSDKYVFQFVGGDT--------KSELDKIKILQEEV---K---TY 451 (576) T ss_pred ----HHHHHHHHHHHHHHHHHHHh-hhchh-----ccCceEEEeccCCH--------HHHHHHHHHHHHHh---c---Cc Confidence 12333445554444443332 12222 13346666643311 11111111111110 0 01 Q ss_pred CCHHHHHHHHHHHcCCCc----cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhh Q lcl|NC_019445. 477 LNVDQAIDAFADMSGVSP----TVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVS 552 (559) Q Consensus 477 id~d~~~~~~a~~~Gvp~----~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 552 (559) +..++ +-+.+|.|+ +.+...--+ +...+.....+......++.......... T Consensus 452 lT~NE----~R~~~gl~piegGD~~~~~~~~--------------------~~~~~~~~~~~~e~~~~~~~~~~~~~~~~ 507 (576) T protein:vir:96 452 KTVNE----ARKEKGLKPIEGGDVLLDGSFI--------------------QSMSLNTQKEQYEDTKQKERFDMIQQFLN 507 (576) T ss_pred cCHHH----HHHHhCCCCCCCcceecccccc--------------------ccccccccCCCCCCccccccccccccccC Confidence 22222 223456653 111110000 00000000000000001111111111111 Q ss_pred cCCCC--CC Q lcl|NC_019445. 553 GQGGQ--SQ 559 (559) Q Consensus 553 ~~~~~--~~ 559 (559) ++... .. T Consensus 508 ~~~~~~~~~ 516 (576) T protein:vir:96 508 SPDDEEPQQ 516 (576) T ss_pred CCCCCCCCC Confidence 11111 11 No 147 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=88.20 E-value=0.034 Score=28.65 Aligned_cols=449 Identities=10% Similarity=0.017 Sum_probs=179.0 Q ss_pred CChhhHHHHHHHHHH-HHHHhhhHHHHHHHHHHHhccccCCCCCCCCCC-cc--------------cccCCCCcchHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQ-LESERQSFEPHWRELSDYINPRGSRFLTSEVNR-ND--------------RRNTRIIDSTGTMA 64 (559) Q Consensus 1 M~~~~~~~l~~r~~~-l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~-~~--------------~~~~~~~~s~~~~a 64 (559) |+=-+ +++.-++= ....|..... .+..|--...++..+..... +. .+.--.-++.+..+ T Consensus 1 mn~~d--r~i~~~sP~~~~~R~~ar~---~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 75 (502) T protein:vir:79 1 MAILD--DVIGVFSPGWKAARLRSRA---VIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGV 75 (502) T ss_pred CchHh--hHHhhcChHHHHHHHhhHH---HHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 43221 11111110 0011111010 11122211111111110000 00 01111347788999 Q ss_pred HHHHHHHHHH--hhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHH------HHHhccchHHHHHHHHHHHhhCcE Q lcl|NC_019445. 65 ARTLASGMMS--GITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMND------MFNKSNLYQSLPQLYGSLGTYSTG 136 (559) Q Consensus 65 ~~~Las~l~~--~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~------~l~~snf~~~~~~~~~dl~~~G~~ 136 (559) ++.+++.+++ +++|..++ ...+.... ++|-..+++.-.. .=.+.+||.....++...++-|-+ T Consensus 76 v~~~~~nvVG~ggi~~~~~~----~~~~~~~~-----~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~ 146 (502) T protein:vir:79 76 FDKLEERVVGKNGIIVEPHP----VLRNGAIA-----RDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEV 146 (502) T ss_pred HHHHHHhhccCCceeeeecc----CCCChhHH-----HHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCce Confidence 9999999996 56554333 11111111 1222222222111 112468999999999999999999 Q ss_pred EEEEeecCC------c--eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceE Q lcl|NC_019445. 137 AMAVLEDDE------D--IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWI 208 (559) Q Consensus 137 ~l~v~~~~~------~--~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v 208 (559) ++.+..+.. . ++.++.+....+-...+. 
+ - T Consensus 147 f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~~~-------------------------------------~-----~ 184 (502) T protein:vir:79 147 FAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDE-------------------------------------S-----N 184 (502) T ss_pred EEEEeecccCccCCCcccceEEEEecchhcCCCCCC-------------------------------------C-----C Confidence 887644321 1 123333333332111000 0 1 Q ss_pred EEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCC---eEEEEeeecCCCcccccchHHHHHHHHHH Q lcl|NC_019445. 209 EVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFP---IMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 (559) Q Consensus 209 ~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P---~~~~rw~~~~g~~YGrG~P~~~~l~d~~~ 285 (559) .|...|+-+.. ++|.+ ||+......+... ..+...| +++.-....+|..=|.+ ..-.+|..++. T Consensus 185 ~i~~GVe~d~~---------Gr~~a-Y~i~~~hPgd~~~--~~~~rvpA~~vlH~f~~~r~gQ~RGis-~lapvl~~l~~ 251 (502) T protein:vir:79 185 RLNQGVFVDDW---------GRPEK-YLVYKSRPVSGRQ--METKEVDAERMLHLKFVRRLHQMRGTS-LLSGVLIRLSA 251 (502) T ss_pred eeEeeeEECCC---------CceEE-EEEeecCCCCCcc--cceeEechhheEEeecccCCccccCCc-hHHHHHHHHHH Confidence 23444432221 12221 2222111111000 0011122 44444556788888998 58899999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeecCCC-------------ccccceecCCceee-cCCcCCchhhhhhhhccccHHHHHHH Q lcl|NC_019445. 286 LQLLQKRKSQLIDKATNPPMVAPTSL-------------KNQRASLLPGDITY-IDQITGQDGFRPAYLVNPSTADLVAD 351 (559) Q Consensus 286 L~~l~~~~~~~~~~~~~p~~~~p~~~-------------~~~~~~~~pg~~~~-~~~~~~~~~~~p~~~~~~~~~~~~~~ 351 (559) |+.+..+.+.++..++.....+..+. ......+.||+++. .........+.|-. 
.+.++..+ T Consensus 252 l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~-p~~~~~~f--- 327 (502) T protein:vir:79 252 LKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDR-PNPNLETF--- 327 (502) T ss_pred HhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCC-CCCCHHHH--- Confidence 99999999999999888776553211 11224577887653 32222222222221 12233222 Q ss_pred HHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh Q lcl|NC_019445. 352 IQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM 431 (559) Q Consensus 352 i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l 431 (559) +..+...|..++=..+- ++. .|-.. +-.-+++-..|.-+.+--.=..|...|+.|+.++++..+...|.+|-| ... T Consensus 328 ~~~~lr~iaaglGi~ye-~lt-~D~s~-nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p-~~~ 403 (502) T protein:vir:79 328 RNGQLRAVAAGSRLSFS-STA-RNYNG-TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLP-RDL 403 (502) T ss_pred HHHHHHHHHhhcCCCHH-HHh-ccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC-CCC Confidence 22333334444422111 111 12222 444455555555555555555666789999999999999999998743 221 Q ss_pred CC-cceEEEeecHHH-HHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHH Q lcl|NC_019445. 432 EG-MPLKVEYISVMA-QAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQ 509 (559) Q Consensus 432 ~g-~~v~~~~is~La-~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~ 509 (559) .- .-++++++.|-- ..=-..++++. +..+.. -+.. ...++...|.+. ++|. 
.| T Consensus 404 ~~~~~~~~~W~~p~~~~iDP~Ke~~a~------~~~i~~-------Gl~t---~~~~~a~~G~D~------~~v~---~q 458 (502) T protein:vir:79 404 DRSSLYTAVYSGPVMPWIDPVKEAEAW------KIQIRG-------GAAT---ESDWVRAGGRNP------DDVK---RR 458 (502) T ss_pred CchhhcceeeecCCccccChHHHHHHH------HHHHHc-------CCCC---HHHHHHHcCCCH------HHHH---HH Confidence 11 112344433310 00000011000 000000 0000 112233344443 2221 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhcCC-ChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 510 RAQQQQQQQMMAMGMAAAQGAKTLSEAKTS-DPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 510 r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 559 (559) ++...+.. .... ..-...++.+. +.++-..- .....++++.| T Consensus 459 ~a~e~~~~---~~~G----l~~~~~~~~~~~~~~~~~~~-~e~~~~~~~~e 501 (502) T protein:vir:79 459 RKAEIDEN---RKLD----LVFDTDPASDKGGSSAATKR-QEPQHTDDQSE 501 (502) T ss_pred HHHHHHHH---HHcC----CCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCC Confidence 11111111 0000 00000000000 00000000 00011111222 No 148 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=87.68 E-value=0.037 Score=28.42 Aligned_cols=432 Identities=10% Similarity=-0.005 Sum_probs=172.3 Q ss_pred CChhhH-----------HHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCC------cccccCCCCcchHHH Q lcl|NC_019445. 1 MAETTK-----------ERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNR------NDRRNTRIIDSTGTM 63 (559) Q Consensus 1 M~~~~~-----------~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~------~~~~~~~~~~s~~~~ 63 (559) |+=..+ +...+.|+....-|. |+- .|..+ ....... .+.+.--.-++.+.. 
T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~-----~~~-----~~~~s--~d~~~~~~~~~lr~RaRdl~rNn~~a~~ 68 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHR-----WQD-----IGDYG--PDTAVASGIQTLRARSHHNVRNNPWATN 68 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcc-----cCC-----CCCCC--hhHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 332211 111122222222111 100 00000 0000000 001111134667888 Q ss_pred HHHHHHHHHHH-hhcCCCCcceeccCCccchhhH--HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEE Q lcl|NC_019445. 64 AARTLASGMMS-GITSPARPWFRLATPDPEMMDY--GPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAV 140 (559) Q Consensus 64 a~~~Las~l~~-~l~pp~~~Wf~l~~~d~~~~~~--~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v 140 (559) +++.+.+.+++ +++|..++ .++++.+. ..-+.|.+.| ++=.+.+||.....++..+++-|-+++.+ T Consensus 69 av~~~~~~vVG~Gi~p~~~~------~~~~~~~~ie~~w~~wa~~~-----D~~g~~~f~~lq~l~~r~~~~dGE~f~~~ 137 (495) T protein:vir:10 69 AVATWVAAAVGNGLTPRWRM------KEQELRQELQELWGDWVNEA-----DFDEVQSFYGLQALVVRTVINSGEAFVIK 137 (495) T ss_pred HHHHHHHhhcCCCcccccCC------chHHHHHHHHHHHHHhhcCc-----ccccccCHHHHHHHHHHHHHhCCceEEEE Confidence 88888887754 55554432 23222221 1112332221 12225689999999999999999988755 Q ss_pred eecCC---c--eEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEe Q lcl|NC_019445. 141 LEDDE---D--IIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVY 215 (559) Q Consensus 141 ~~~~~---~--~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~ 215 (559) ...+. . ++.++.+....+-...+.. . .++. -.|+..|+ T Consensus 138 ~~~~~~~g~~~~~~lqliepd~l~~~~~~~------------------------~------------~~~g-~~i~~GIe 180 (495) T protein:vir:10 138 KPRPLSEGLSVPLQLQIIEPDMLASDIPDE------------------------T------------LPSG-GYVKGGIR 180 (495) T ss_pred eecccCCCCccceEEEEechhhcCCCCCCC------------------------C------------CCCC-CEEEeceE Confidence 43221 1 2344444433321111100 0 0000 11333333 Q ss_pred ecCcccccccccccccEEEEEEEecCCCceeeeecC--cccCC--eEEEEeeecCCCcccccchHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
216 PNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESG--FDEFP--IMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQK 291 (559) Q Consensus 216 p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg--~~~~P--~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~ 291 (559) -+.. ++|.. ||+.....++......+ +.-.| -++.-|.+.+|..=|.+. . -.+-.++.|+.... T Consensus 181 ~d~~---------Gr~va-Y~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis~-l-a~i~~l~~l~~y~d 248 (495) T protein:vir:10 181 FSNG---------GKRKA-YCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAPW-F-QLLLRLNELDQYED 248 (495) T ss_pred ECCC---------CceEE-EEEeecCCCcccccccccceeeechhheEeccccCCCcccCcch-h-HHHHHHHHhhHHHH Confidence 2211 11111 11111111110000000 01112 122335667888877663 4 34557899999999 Q ss_pred HHHHHHHHHhcCceeecCC----C-------------ccccceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHH Q lcl|NC_019445. 292 RKSQLIDKATNPPMVAPTS----L-------------KNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQD 354 (559) Q Consensus 292 ~~~~~~~~~~~p~~~~p~~----~-------------~~~~~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~ 354 (559) +.+.++..++.....+..+ . ......+.||.+.+.........+.|-.. +.++..+ +.. T Consensus 249 ael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p-~~~~~~f---~~~ 324 (495) T protein:vir:10 249 AELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADV-GTTYEPW---LRY 324 (495) T ss_pred HHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCC-CCCHHHH---HHH Confidence 9999999888876554211 0 00123477888776643333333333211 1233322 223 Q ss_pred HHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCC Q lcl|NC_019445. 355 TRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE-RLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEG 433 (559) Q Consensus 355 ~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~-~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g 433 (559) +...|-.++=..+ .++ ..|-..++-.-+++-..|.-+.+-..=. .+...|..|+..+++..+...|.++-| .-... 
T Consensus 325 ~lr~iaaglGi~Y-e~l-tgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p-~~~~~ 401 (495) T protein:vir:10 325 QLLSIAKGYGITY-EML-TGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIP-DYLQR 401 (495) T ss_pred HHHHHHhhcCCCH-HHH-hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC-Cchhh Confidence 3344555542221 122 2355556666666666666655554433 345668899999999999999998743 22211 Q ss_pred c--ceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHH--------------HHHHHHcCCCcccc Q lcl|NC_019445. 434 M--PLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAI--------------DAFADMSGVSPTVI 497 (559) Q Consensus 434 ~--~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~--------------~~~a~~~Gvp~~~~ 497 (559) . -.+++++.|- ...||+-+-+ ..+....|.+. T Consensus 402 ~~~~~~~~w~~p~-----------------------------~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~--- 449 (495) T protein:vir:10 402 RRYYNRVSWRTPR-----------------------------WEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDM--- 449 (495) T ss_pred hHhhhccccccCC-----------------------------ccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCH--- Confidence 1 1233333331 1111111111 11222234332 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 498 VPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 498 rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) +||. .|++...+. +.... 
..-...+..+++..+.+ ........+.+ T Consensus 450 ---~~v~---~q~a~e~~~---~~~~G----l~~~~~p~~~~~~~~~~---~~~~~~~~~~e 495 (495) T protein:vir:10 450 ---EELF---DMISDANQL---IDEYD----LRLDSDPRYVNGSGAEQ---KSVMEAALNNE 495 (495) T ss_pred ---HHHH---HHHHHHHHH---HHHcC----CCCCCCCCcCCCccCCC---CCCCCCCCCCC Confidence 2221 111111111 00000 00000000000000000 00000000000 No 149 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=87.10 E-value=0.041 Score=28.19 Aligned_cols=353 Identities=13% Similarity=0.137 Sum_probs=143.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCccccc--CCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRN--TRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |-= .+.| +..+..-...|..+... +.+.....|..-+ +-+-.++--.|++.+|+.+.+ T Consensus 1 m~~------~~~~---~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~---- 60 (419) T protein:vir:57 1 MFI------PQFW---KGRPSENRVNWQVVPGG-------MRSSSSQAGVIITPETALALSAVRACVTLLAESVAQ---- 60 (419) T ss_pred Ccc------hhhh---ccCCccccccccccccc-------cccccccCCceechHHhhccHHHHHHHHHHHHhhcc---- Confidence 211 1111 11111111222221111 1111111111100 112223444556666654443 Q ss_pred CCCcceeccCCcc---chhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCcEEEEEeecC-CceEE Q lcl|NC_019445. 79 PARPWFRLATPDP---EMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIR 149 (559) Q Consensus 79 p~~~Wf~l~~~d~---~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~ 149 (559) -||--+...+. .... +..+...|+ +- +.+.-+...+.+|.++||+++++..+. +.++. 
T Consensus 61 --lp~~~~~~~~~g~~~~~~-----------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~ 127 (419) T protein:vir:57 61 --LPCVLYRRTENGGREIAF-----------DHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITE 127 (419) T ss_pred --CceEEEEEcCCCceeccc-----------cchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE Confidence 24532222111 1111 122233343 22 334445667789999999999987654 45566 Q ss_pred EEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccc Q lcl|NC_019445. 150 TMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 150 ~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~ 229 (559) ..+++...+.+..+.+|.+- |+ +. ...+ ++| T Consensus 128 L~pl~~~~v~v~~~~~g~~~--y~---~~------------------------~~~~-------~~~------------- 158 (419) T protein:vir:57 128 LIPINPHKVIVLKGPDGMPY--YD---IP------------------------SIGE-------ILP------------- 158 (419) T ss_pred EEEEcCcceEEEECCCceEE--EE---Ec------------------------CCce-------EEc------------- Confidence 66677777777666665421 11 00 0000 000 Q ss_pred ccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-- Q lcl|NC_019445. 230 KPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA-- 307 (559) Q Consensus 230 ~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~-- 307 (559) ... +++.|....++ .||.| |...+...+.....+.+.......-...|..++ T Consensus 159 --~~~----------------------vih~r~~~~d~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 212 (419) T protein:vir:57 159 --MRM----------------------VHHIKSFSLDG-YIGTS-PIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIER 212 (419) T ss_pred --hhh----------------------EEEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEe Confidence 000 12222222234 89999 888888888888888888888888878887554 Q ss_pred cCCC----cccc-------c-ee-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 
308 PTSL----KNQR-------A-SL-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 308 p~~~----~~~~-------~-~~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) |.+. .... + +. ..|++...+ ++-.++|+.. ++.-..+.+..+..+..|-++|-...... T Consensus 213 ~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 288 (419) T protein:vir:57 213 PFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQ---EGMTYKQLSQ-DNEKAQLLQSRQYTVNEVCRLYKVPPHMI 288 (419) T ss_pred cCcCCcccCHHHHHHHHHHHHHHhccccccccceecC---CCceEEEcCC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 3221 1110 0 00 123333332 2234555542 33333445556677788999998764443 Q ss_pred ccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC-- Q lcl|NC_019445. 371 LQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDR-------------------------AFSMMVRKNM-- 423 (559) Q Consensus 371 ~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r-------------------------~~~il~r~g~-- 423 (559) ....++..-++++... .=....|.|....++.|+-.-++.. .+..+.+.|. T Consensus 289 g~~~~~t~sn~e~~~~--~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T 366 (419) T protein:vir:57 289 QDLQKSTNNNIEHQGL--QYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLS 366 (419) T ss_pred CCCCCCccccHHHHHH--HHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 2222222233333221 1123445666666655554433211 1111222222 Q ss_pred ---------CCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHH Q lcl|NC_019445. 424 ---------LPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDA 485 (559) Q Consensus 424 ---------lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~ 485 (559) +||+| .|+.+-+ ++... . ++.+...++..|+-....+.....+. 
T Consensus 367 ~NE~R~~~gl~p~~---ggD~~~~----~~n~~----~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 367 VNDIRRMENLTPIP---GGDKYLT----PLNMV----D-------SKALTGIGKATPQQLKDIEAILCTRN 419 (419) T ss_pred HHHHHHHhCCCCCC---CcCeeee----ccccc----c-------ccccccccCCCcccCcchhhhhhccC Confidence 12221 1111110 00000 0 00001111112222222222222222 No 150 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=86.61 E-value=0.044 Score=28.00 Aligned_cols=403 Identities=14% Similarity=0.131 Sum_probs=150.1 Q ss_pred HHHHhccccCCCCCCCCCC------------------cccccCCCCcchHHH-------HHHHHHHHHHHhhcCCCCcce Q lcl|NC_019445. 30 LSDYINPRGSRFLTSEVNR------------------NDRRNTRIIDSTGTM-------AARTLASGMMSGITSPARPWF 84 (559) Q Consensus 30 ~~~~~~P~~~~~~~~~~~~------------------~~~~~~~~~~s~~~~-------a~~~Las~l~~~l~pp~~~Wf 84 (559) +..|-+ ...+... .......+++..+.. .+..+.+-+...+. +.|| T Consensus 1 ~~~~~~------~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA--~l~~- 71 (542) T protein:vir:41 1 MFNYHL------SIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDII--RTGY- 71 (542) T ss_pred Cccccc------cccccccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHh--hCce- Confidence 222111 1111000 000011122222111 01222222222221 2222 Q ss_pred eccCCccchhhHHHHHHHHHHHHHHHHHHHH--hccchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEe Q lcl|NC_019445. 85 RLATPDPEMMDYGPVKLWLEAVQNRMNDMFN--KSNLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLA 161 (559) Q Consensus 85 ~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~--~snf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~ 161 (559) ++...+. .+ +...+. .-+++.-+...+.++.++|||.+++..+. +.+..+.+++...+-+. 
T Consensus 72 ~~~~~~~---------~~-------l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~ 135 (542) T protein:vir:41 72 ILEGDDE---------GV-------VDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVH 135 (542) T ss_pred eeecccc---------hh-------hhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEE Confidence 2211100 00 111111 12345556778889999999999887654 56677777777766666 Q ss_pred eCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecC Q lcl|NC_019445. 162 NSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGG 241 (559) Q Consensus 162 ~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~ 241 (559) .|..+.+.... .. ..+.+..-.++ +. ++...+. T Consensus 136 ~d~~~~~~~~~-----------------------------~~--~~~~~~~y~~~---~~-------------~~~~~g~ 168 (542) T protein:vir:41 136 KDGSRYRQTWD-----------------------------GV--NITHFKDYRYE---GE-------------INPETGE 168 (542) T ss_pred EcCCeeEeeec-----------------------------CC--cceeEEeeccc---cc-------------ccccccc Confidence 65443211100 00 11111000000 00 0000000 Q ss_pred CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCccc----- Q lcl|NC_019445. 242 DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQ----- 314 (559) Q Consensus 242 ~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~----- 314 (559) ... .+...-.++.|+....+..||.| |...++..+.......+.......-...|..++ ++.+... T Consensus 169 ~~~------~~~~~eIiHir~~~~~~~~~Gls-pi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~ 241 (542) T protein:vir:41 169 DQD------SVGANELVFIHIPSPVCSYYGVP-RYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDP 241 (542) T ss_pred ccc------ccCcccEEEecCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCcccccccccc Confidence 000 01112356777776667799999 898887776666666666666555566676543 4322110 Q ss_pred ----------------cc---eecCCceeecCCcC---CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhcc Q lcl|NC_019445. 
315 ----------------RA---SLLPGDITYIDQIT---GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ 372 (559) Q Consensus 315 ----------------~~---~~~pg~~~~~~~~~---~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~ 372 (559) .+ .-..|...+...++ +.-.++|+.. ++.-..+.+..+.....|-.+|-.+.... + T Consensus 242 ~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~-~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G 319 (542) T protein:vir:41 242 DGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNT-SQKELSFREYAAEKKYDIAAAHMIDPYRL-G 319 (542) T ss_pred ccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-C Confidence 00 01233333332221 1223555533 33334455666777889999998765443 3 Q ss_pred CCCCC---CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeec-HHHHHH Q lcl|NC_019445. 373 NINTR---SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYIS-VMAQAQ 448 (559) Q Consensus 373 ~~~~~---~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is-~La~a~ 448 (559) ..++. .-++++.. ..+....+.|++.+.-..+.+ .++|+. +..+.++|.. .+.+.. T Consensus 320 ~~~~~t~n~sn~Eq~~--------------~~f~~~tL~P~~~~ie~~ln~-~L~~~~-----~~~~~~~f~~~~ll~~d 379 (542) T protein:vir:41 320 IADTGPLGGNFAEVTR--------------RTYYESVVRPQQNIISSILTD-FFQVKF-----NPKTRFKFNDETLLESD 379 (542) T ss_pred cCCCcccccccHHHHH--------------HHHHHHHHHHHHHHHHHHHHh-hccccc-----CCceEEEecchhhcchH Confidence 22221 12333322 224455566776666555554 233322 2234555532 222211 Q ss_pred HHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc---ccCC-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 449 KSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT---VIVP-QEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 449 r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~---~~rs-~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ....+...+ +.+ .+.++++-. ...|+|+. ++.+ .-..+++..+....+ . . 
T Consensus 380 ---~~~~~~~~v-------~~G-----ilT~NE~Re---~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~-~-------~ 433 (542) T protein:vir:41 380 ---SVRNCALLV-------QSG-----VLTPAEARE---RLFGLDGGPDIFMVPSKGAAKSVKRQERNYE-K-------N 433 (542) T ss_pred ---HHHHHHHHH-------hCC-----CCCHHHHHH---hhCCCCCCCccccccccccccccccCCcCCC-C-------C Confidence 111111111 111 234444421 12355531 1111 000011111000000 0 0 Q ss_pred HHHHHHhhhhhhcC----------------C--ChhHHHHHHHHhhcCC-----CCC--------C Q lcl|NC_019445. 525 AAAQGAKTLSEAKT----------------S--DPSVLSAMANAVSGQG-----GQS--------Q 559 (559) Q Consensus 525 ~~~~~a~~~~~~~~----------------~--~~~~~~~~~~~~~~~~-----~~~--------~ 559 (559) +..+..|..+.... . ..+.++-+..+...|- |+. | T Consensus 434 ~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (542) T protein:vir:41 434 QIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGKKMLIIGGDMGSMSALNQ 499 (542) T ss_pred chhhhhhcccccCccccccccccccchhhcccccchhhhhHHhHHhcCceEEEeecCchhhhhhhc Confidence 11111111111110 0 0111111111111110 000 0 No 151 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=86.39 E-value=0.046 Score=27.92 Aligned_cols=375 Identities=13% Similarity=0.085 Sum_probs=151.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |-=- +.+.+.+. .|++-.. .+.|....+.+...... ..-+-.++--.|++.+|+.+. 
+ T Consensus 1 MG~~--~~~~~~~~----~~~~~~~-------~~~~~~~~~~g~~~~~~---~~al~~~~V~~~v~~Ia~~iA------~ 58 (411) T protein:vir:81 1 MGWW--SRLTRFFR----PRNETVD-------MTNPLLLQWLGVDPDTP---RNQLSEATYFACLKILSESLG------K 58 (411) T ss_pred CchH--HHHHhhcc----Ccccccc-------cchHHHHHHhcCcccCh---hhhhccHHHHHHHHHHHHhHh------h Confidence 3221 11211111 1111110 11111111111110000 011122334455665555433 2 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeec Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPI 155 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l 155 (559) .||--..-.+....+ + .+..+...|+ +- +.+.=+...+.+|.++|||.+++..+.+++..+.++|. T Consensus 59 lp~~~~~~~~~~~~~---~------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~ 129 (411) T protein:vir:81 59 LPLKMYQKTERGIVK---S------DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPS 129 (411) T ss_pred CceeEEEecCCceee---e------cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECC Confidence 244322222111000 0 1122333343 22 23344566778899999999999888777777888888 Q ss_pred cEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 156 GSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 156 ~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) +.+.+..|..|.+.. .. . + +++...+. ++. ++ T Consensus 130 ~~v~~~~~~~~~~~~--------------------------------~~-~-~-~~~~~~~~--------~g~-----~~ 161 (411) T protein:vir:81 130 QYVTIVVDDRGLLGE--------------------------------KN-A-I-WYRYNDPY--------DGK-----MY 161 (411) T ss_pred ceEEEEEcCcccccc--------------------------------cc-e-E-EEEEEecC--------Cce-----EE Confidence 888777776654210 00 0 0 11111000 000 00 Q ss_pred EEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCCCcc Q lcl|NC_019445. 
236 YYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTSLKN 313 (559) Q Consensus 236 ~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~~~~ 313 (559) . |..--++++|+....+..||.| |..-+...+.......+.......-...|..+ ++.++.. T Consensus 162 ----------~-----~~~~eiih~k~~~~~~~~~G~s-~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~ 225 (411) T protein:vir:81 162 ----------V-----FRNDEILHFKTSVTFDGITGLS-VRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQ 225 (411) T ss_pred ----------E-----EccccEEEEcCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCH Confidence 0 0111256666555556689999 89888787888888888777777777778754 4444422 Q ss_pred cc-------c----ee--cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcC Q lcl|NC_019445. 314 QR-------A----SL--LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMP 380 (559) Q Consensus 314 ~~-------~----~~--~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~T 380 (559) .. + .. ..|++... +++-.++|+.. ++.-..+.+..+..+..|-.+|-..........++..-+ T Consensus 226 e~~~~~~~~~~~~~~g~~n~g~~~vl---~~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n 301 (411) T protein:vir:81 226 EARDRLVKGFEQFANGSKNAGKIIPV---PLGMKLVPLDI-KLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYAS 301 (411) T ss_pred HHHHHHHHHHHHHhcCccccCCceec---CCCceEEEccC-CHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchh Confidence 11 1 00 11233322 22234666642 233334445566778899999987644432222222223 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh-CCcceEEEeecH-----HHH---HHHHH Q lcl|NC_019445. 381 VEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM-EGMPLKVEYISV-----MAQ---AQKSI 451 (559) Q Consensus 381 A~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l-~g~~v~~~~is~-----La~---a~r~~ 451 (559) +++. ...=....|.|.+.+++.++-.-| |++ .++ .|..|++..... .++ +.+.. 
T Consensus 302 ~e~~--~~~f~~~~l~P~~~~ie~~l~~~l-------------l~~--~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~ 364 (411) T protein:vir:81 302 AEAQ--NLAFYVDTLLYVLKQYEEEITYKI-------------LSN--DLISQGHYFKFNVNVILRADIKTQMDSLSTAV 364 (411) T ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHHhhc-------------CCh--hhcCCCcEEEeechhhhccCHHHHHHHHHHHH Confidence 3222 112223335555555544443322 221 111 111122222111 111 11111 Q ss_pred HHH--HHHHHHHHHHHHhccC-hh-h---HhcCCHHHHHHHHHHHcCCC Q lcl|NC_019445. 452 GLS--SLASTVNFIGQLAQAK-PE-A---LDKLNVDQAIDAFADMSGVS 493 (559) Q Consensus 452 ~~~--~l~~~~~~~~~la~~~-P~-~---~~~id~d~~~~~~a~~~Gvp 493 (559) +.. .+...-..++ +..++ -+ . ..++-.+.+-+... .-|-. T Consensus 365 ~~g~~t~NE~R~~~g-l~p~~ggD~~~~~~n~~pl~~~~~~~~-kgGd~ 411 (411) T protein:vir:81 365 QNGIMTPNEARDYLD-MPADDYGNNLMANGNYIPLSMLGANYG-KGGDS 411 (411) T ss_pred hCCCcCHHHHHHHhC-CCCCCCCCeeeeccCccchhhhhhhhc-cCCCC Confidence 111 1122222111 11110 00 0 01122222222221 11211 No 152 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=86.17 E-value=0.047 Score=27.84 Aligned_cols=355 Identities=10% Similarity=0.039 Sum_probs=144.9 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccc--cCCCCcchHHHHHHHHHHHHHHhhcCCCCcceecc Q lcl|NC_019445. 10 NKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRR--NTRIIDSTGTMAARTLASGMMSGITSPARPWFRLA 87 (559) Q Consensus 10 ~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~--~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~ 87 (559) -..|+.+...|..-...-..+..+..|.. .+.. ..+... .+-+-.++--.|++.+|+.+. 
.+ || ++ T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~v~~~~al~~~~V~~~i~~Ia~~ia-~l-----~~-~~- 68 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEF---LDAL-NGSEWVSAETALKNSDLFSIISQLSNDLA-TA-----KI-TT- 68 (384) T ss_pred CccccccccCcccccccchhhccccchhh---cccc-cCCceechhhhhccHHHHHHHHHHHHHHh-hC-----ce-ee- Confidence 12222222222111100011112222211 1100 001100 011112333445555555433 22 22 11 Q ss_pred CCccchhhHHHHHHHHHHHHHHHHHHHHhc----cchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEee Q lcl|NC_019445. 88 TPDPEMMDYGPVKLWLEAVQNRMNDMFNKS----NLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLAN 162 (559) Q Consensus 88 ~~d~~~~~~~~v~~~l~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~ 162 (559) .+... . ..+.+- +.+.=+...+.++.++|||.+++..+. +.++.+.+++...+-+.. T Consensus 69 -~~~~~--------------~---~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~ 130 (384) T protein:vir:49 69 -SRKQL--------------Q---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNR 130 (384) T ss_pred -ecchh--------------h---hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEE Confidence 11110 0 012222 234445567788999999999988654 445666666666665555 Q ss_pred CCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCC Q lcl|NC_019445. 163 SPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGD 242 (559) Q Consensus 163 d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~ 242 (559) +.++.. ++.++... . ...... +.+. . T Consensus 131 ~~~~~~--~~y~~~~~-----------~-----------~~~~~~---------------------------~~~~---~ 156 (384) T protein:vir:49 131 LDNQNG--LYYNITFD-----------D-----------PRIPPK---------------------------QHVP---Q 156 (384) T ss_pred cCCCce--EEEEEEec-----------C-----------ccccce---------------------------eEec---C Confidence 433221 11111000 0 000000 0000 0 Q ss_pred CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcccc----- Q lcl|NC_019445. 
243 NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR----- 315 (559) Q Consensus 243 ~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~~----- 315 (559) + =+++.|+...++..||.| |...+...+.......+.......-...|..++ ++...... T Consensus 157 ~------------eVih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~ 223 (384) T protein:vir:49 157 G------------DILHFRLLSVDGGLTSVS-PLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQS 223 (384) T ss_pred c------------cEEEecCCCCCCceeecc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHH Confidence 0 145556555667799999 899888888888888888888888788887554 44332110 Q ss_pred -----ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 316 -----ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 316 -----~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) ..-..|++...+ ++..++++.. ++....+.+..+..+..|-++|-..... ++......-|++.+.+.... T Consensus 224 ~~~~~~~~n~~~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVp~~~-lg~~~~~~~~~~~~~~~~~~ 298 (384) T protein:vir:49 224 RSRQAMKQMQGGPLVLD---DLEDFTPLEI-KSNVAQLLSQADWTTGQFAKVYGIPESV-VGGEGDKQSSLEMIYNIYFK 298 (384) T ss_pred HHHHhcccCCccceecC---CCceEEEccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHH-hCCCCCccccHHHHHHHHHH Confidence 112234444442 2233455532 3344445566777889999999775433 33223333455554443332 Q ss_pred -HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhC---CcceEEEeecHHHHHHHHHHH----HHHHHHHHH Q lcl|NC_019445. 391 -KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAME---GMPLKVEYISVMAQAQKSIGL----SSLASTVNF 462 (559) Q Consensus 391 -~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~---g~~v~~~~is~La~a~r~~~~----~~l~~~~~~ 462 (559) ....|-|+.++++.+|..-+.. ++..... ++ +..+. ..-++-...+.-......... ..+.. +.. 
T Consensus 299 ~i~~~l~pi~~~i~~~l~~~l~~---~~~~~~~--~~-~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~-~~~ 371 (384) T protein:vir:49 299 AVSRFLRPFVSELSKKLSCEVDA---DILPAVD--PT-GSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPE-GET 371 (384) T ss_pred HHHHHHHHHHHHHHHHhchhhhh---hhhhhhh--cc-chHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHH-HcC Confidence 3345677777777765432200 0000000 00 00000 000111111111111100000 00111 111 Q ss_pred HHHHhcc--Chhh Q lcl|NC_019445. 463 IGQLAQA--KPEA 473 (559) Q Consensus 463 ~~~la~~--~P~~ 473 (559) +..+..- +.+- T Consensus 372 ~~p~~gGd~~~~~ 384 (384) T protein:vir:49 372 DSTLKGGETNEQY 384 (384) T ss_pred CCCCCCCCCCCCC Confidence 2222211 1111 No 153 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=85.81 E-value=0.05 Score=27.71 Aligned_cols=416 Identities=14% Similarity=0.081 Sum_probs=153.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHH-HHH--HHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEP-HWR--ELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~-~w~--e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.= .+.+.+|...-.. ...+. .|. +-+.+.+-... ..+...+. ..-+=.++--.|++.+|+.+.+. T Consensus 1 Mg~--~~~l~~r~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~g~~V~~----~~al~~~~V~~~v~~Ia~~iA~l-- 69 (457) T protein:vir:13 1 MGF--WSALFGRGHSPAL--DGIEARAWEPYDPSIYNLGAVA-ASGETVTP----HDALQVSAVFASVRLLSETIATL-- 69 (457) T ss_pred Cch--hhhhhcccccccc--cccccccccccchHHHhhcccc-cCCceech----HHhhccHHHHHHHHHHHHhhccC-- Confidence 332 1112111111000 00010 010 00111100000 00011110 01111233345566666654432 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhc----cchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEe Q lcl|NC_019445. 
78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKS----NLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPF 153 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~ 153 (559) ||--..-.+.. .+ ++ ....+...++.. +.+.-+..++.++..+||+.+++..+.+.++.+.++ T Consensus 70 ----p~~~~~~~~~~-~~--~~------~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l~~l 136 (457) T protein:vir:13 70 ----PLSTYSKRGGS-RK--EI------VTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGLDVL 136 (457) T ss_pred ----ceEEEEecCCc-cc--cc------ccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEE Confidence 33222211111 00 11 122334445432 234456667788899999999997776666666555 Q ss_pred eccEEEEeeCCCCCE-EEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 154 PIGSYYLANSPRGSV-DICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 154 ~l~~~~v~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) +...+.+.++..+.. ..+|+.|..+. T Consensus 137 ~p~~v~v~~~~~~~~~~~~~~~y~~~~----------------------------------------------------- 163 (457) T protein:vir:13 137 DPTKIHVHMVMVDGLRRKVFEAYDIDA----------------------------------------------------- 163 (457) T ss_pred ccCceEEEEecCCCccceeEEEEEEec----------------------------------------------------- Confidence 555555544432211 11121111110 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~ 310 (559) . .......+ |..--++++++...++..||.| |...+...+.....+.+.......-...|..++ +.. 
T Consensus 164 -------~-~~~~~~~~--~~~~diih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 232 (457) T protein:vir:13 164 -------D-GNEVLLGW--FTPRDVLHIPGMMLPGDFVGCS-PISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGT 232 (457) T ss_pred -------C-CceeeEEe--eCccceEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC Confidence 0 00001111 1111255566666667789999 898888878888888877777777777887554 333 Q ss_pred Ccccc-------ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 311 LKNQR-------AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 311 ~~~~~-------~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) +.... +. . ..|++.+++ ++-.++|+.. ++.-..+.+..+..+..|-++|-+... +++..+.. T Consensus 233 ls~e~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~ 307 (457) T protein:vir:13 233 MSEEGLARAREAWRAANSGVDNAHRVALLT---EGAKFSKVAM-SPDEAQFLQTRQFQVPEIARIFGVPPH-LISDATNS 307 (457) T ss_pred CCHHHHHHHHHHHHHHhcCccccCcceecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHH-HcCCCCCc Confidence 32211 10 0 123344442 2233555532 233334445556677889999987544 33333322 Q ss_pred CcCHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHH Q lcl|NC_019445. 378 SMPVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSL 456 (559) Q Consensus 378 ~~TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l 456 (559) ..+..-+.+.... ....|.|.+.+++.+| .+ .++++. +..+.-+++...+.+. .+.... 
T Consensus 308 ~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l------------n~-~L~~~~--~~~~~~i~fd~~~l~~-----~D~~~r 367 (457) T protein:vir:13 308 TSWGSGLAEQNIAFTMFSLRPWLERIEAGF------------NR-LLFAET--ADRFRFVKFNLDEIKR-----GAPKER 367 (457) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHH------------HH-hhcCcc--ccCceeEEeechhhhc-----cCHHHH Confidence 2222323333322 1233444444444443 32 223322 2233335554433321 121111 Q ss_pred HHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc------cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 457 ASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP------TVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA 530 (559) Q Consensus 457 ~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~------~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a 530 (559) ..++.. +.+.+ -+.++++ -..+|.|+ ..+.-+--...+-+. ...+.+ T Consensus 368 ~~~~~~---~~~~G-----~~T~NE~----R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~---------------~~~~~~ 420 (457) T protein:vir:13 368 MELWSL---GLQNG-----IYSIDEV----RAAEDMTPLPDGLGEKYRVPLNLGEVGEE---------------PEPEPA 420 (457) T ss_pred HHHHHH---HHhCC-----CcCHHHH----HHHhCCCCCCCCcccceeecccccccccc---------------cccccc Confidence 111211 11111 1232332 22334432 100000000000000 000000 Q ss_pred hhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 531 KTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) .. .++.. +...+.--+.-..+....+ T Consensus 421 ~~-~~~~~--~~~~~~~~~~~~~g~~d~~ 446 (457) T protein:vir:13 421 PA-PPAIE--PPAEEPDEEPEPEGKPDDE 446 (457) T ss_pred CC-CCCCC--CCccccCCCCCCCCCCccc Confidence 00 00000 0000000000000000000 No 154 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=84.48 E-value=0.06 Score=27.28 Aligned_cols=310 Identities=12% Similarity=0.055 Sum_probs=118.8 Q ss_pred EEEEEeecCcc--ccccccccc-ccEEEEEEEecCCCce-eeee---cCcc-----cCCeEEEEeeecCCCcccccchHH Q lcl|NC_019445. 
210 VMHSVYPNIDR--DTSKLDSKN-KPFKSVYYEVGGDNDK-LLRE---SGFD-----EFPIMAPRWEVNGEDVYGSSCPGM 277 (559) Q Consensus 210 v~~~v~p~~~~--~~~~~~~~~-~~~~sv~~~~~~~~~~-il~e---sg~~-----~~P~~~~rw~~~~g~~YGrG~P~~ 277 (559) |...||...+. .+.++..+. ..|. ||....++.. .++. .|.+ .+=|++.|....+|+.||.| ... T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~--~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~g-Llr 77 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTIS--RFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQS-LLR 77 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCcccee--eeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchh-hHH Confidence 33333322221 111111111 1111 2222212211 1111 1222 12288899999999999999 488 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC-c-eeecCCCccc-----------------cce----ecCC--ceeecCCcCCc Q lcl|NC_019445. 278 LALGPVKALQLLQKRKSQLIDKATNP-P-MVAPTSLKNQ-----------------RAS----LLPG--DITYIDQITGQ 332 (559) Q Consensus 278 ~~l~d~~~L~~l~~~~~~~~~~~~~p-~-~~~p~~~~~~-----------------~~~----~~pg--~~~~~~~~~~~ 332 (559) .+..-..--+...+..+..+++-..| | .++|.+.... -.+ +.-| +...++..... T Consensus 78 ~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~i 157 (355) T protein:vir:78 78 QAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANF 157 (355) T ss_pred HHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceE Confidence 88888887888888888899886555 3 3445331100 000 0112 12222222222 Q ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHH-HHHHHhhhH-HHHHHHHHHHHH Q lcl|NC_019445. 333 DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKE-EKLLMLGPV-LERLNDECLNPL 410 (559) Q Consensus 333 ~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~-e~~~~LG~v-~~~l~~E~l~Pl 410 (559) ..+.. . .........|+.+.+.|+.+++...+.+-...+++.....|++.... .+...-.-. 
-+.|+..++.|+ T Consensus 158 e~~ea-~---g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l 233 (355) T protein:vir:78 158 TLTGV-Q---GKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDL 233 (355) T ss_pred EEeec-C---CCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221 1 12223445688899999999988754442222333334456543221 111111111 111222233333 Q ss_pred HHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHc Q lcl|NC_019445. 411 IDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMS 490 (559) Q Consensus 411 i~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~ 490 (559) +. +- .|--.+.| +++|...-. +...+ ...+..|..++= .+..+....++.+.+ T Consensus 234 ~~----lN--~~~~~~~P--------~~~~~~~~~------~~~~~---a~~~~~l~~~G~----~~~~~~~~~~~~e~~ 286 (355) T protein:vir:78 234 VD----QN--WGPEEPAP--------RLVPAQLGK------EQPVT---AEAIRALVECGA----FTADPELEKDLRARY 286 (355) T ss_pred HH----hc--CCCCCCCC--------EEEecCcCh------hHHHH---HHHHHHHHhCCC----ccccHHHHHHHHHHh Confidence 22 11 12111122 233321100 11111 222233333321 244455677888999 Q ss_pred CCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-HhhhhhhcCCChh----HHHHHHHHhhcCCCCCC Q lcl|NC_019445. 491 GVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAA-AQG-AKTLSEAKTSDPS----VLSAMANAVSGQGGQSQ 559 (559) Q Consensus 491 Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~-~~~-a~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 559 (559) |+|..--..+ ++....+ ..-..+ .+.+... .++ ..+.......++. ++-..+.-+.--.+++. 
T Consensus 287 gip~p~~~~~-~~~~~~~----~~~~~~-~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 287 GLPAPAERDD-GADAAAA----KAAGRR-RAKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred CCCCCCCCCc-ccCCccc----cccccc-cccccCCccccccccccCCCCCChhhhHHHHHHhhccccCCCCCCC Confidence 9985322211 1100000 000000 0000000 000 0000000111222 22222222222223333 No 155 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=84.39 E-value=0.061 Score=27.25 Aligned_cols=323 Identities=7% Similarity=0.006 Sum_probs=140.2 Q ss_pred cCCCCCCCCCCcccccCCCCc---------------chHHH-----HHHHHHHHHHHhhcCCCCcceeccCCccchhhHH Q lcl|NC_019445. 38 GSRFLTSEVNRNDRRNTRIID---------------STGTM-----AARTLASGMMSGITSPARPWFRLATPDPEMMDYG 97 (559) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~---------------s~~~~-----a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~ 97 (559) ++-|..-...+.... ...++ -+... ++....+.+...+- +.||--.... T Consensus 1 Mg~f~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia--~~~~~~~~~~-------- 69 (382) T protein:vir:48 1 MPIFNLATESPPDNQ-GGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLA--TVKLITSRKK-------- 69 (382) T ss_pred CccccccccCCcccc-cccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhc--cCceeeecch-------- Confidence 332221111110000 00000 00000 12222222222221 2222111000 Q ss_pred HHHHHHHHHHHHHHHHHHhcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEEEE Q lcl|NC_019445. 98 PVKLWLEAVQNRMNDMFNKSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDICF 172 (559) Q Consensus 98 ~v~~~l~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~ 172 (559) -...+.+-| .+.=+..++.+|.++|||.+++..+. 
+.++.+.+++...+-+..+.+|..- + T Consensus 70 ------------~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~--~ 135 (382) T protein:vir:48 70 ------------LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGI--Y 135 (382) T ss_pred ------------hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeE--E Confidence 001122222 24445567788999999999987764 4566777777777766666554311 1 Q ss_pred EEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCc Q lcl|NC_019445. 173 RKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGF 252 (559) Q Consensus 173 r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~ 252 (559) .++... +... ...+. | T Consensus 136 y~~~~~------------------------~~~~-----------------------------------~~~~~-----~ 151 (382) T protein:vir:48 136 YNITFD------------------------DPRI-----------------------------------PPKQH-----V 151 (382) T ss_pred EEEEec------------------------Cccc-----------------------------------cceeE-----E Confidence 110000 0000 00000 0 Q ss_pred ccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcccc----------ceecC Q lcl|NC_019445. 253 DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR----------ASLLP 320 (559) Q Consensus 253 ~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~~----------~~~~p 320 (559) ..--+++.|+...++..||.| |...+...+...+...+.......-...|.+++ +..+.... ..-.+ T Consensus 152 ~~~evih~~~~~~~~~~~G~s-~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~ 230 (382) T protein:vir:48 152 PQNDVLHFRLLSVDGGMTSVS-PLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQ 230 (382) T ss_pred cCccEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCC Confidence 011255666666778899999 899998889888888888888888888887654 43332211 11124 Q ss_pred CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHH Q lcl|NC_019445. 
321 GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLE 400 (559) Q Consensus 321 g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~ 400 (559) |++.+.+. +..++|+.. ++....+.+..+..+..|-++|-....... ..+.. |..+ .+...-....|-|... T Consensus 231 g~~~vl~~---g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg-~~~~~--~~~~-~~~~~~~~~~l~p~~~ 302 (382) T protein:vir:48 231 GGPLVLDD---LEDFTPLEI-KSNVSQLLKQADWTTGQFAKVYGIPDNVVG-GQGDQ--QSSL-EMSSDLYSKAVSRYLR 302 (382) T ss_pred CCeeEcCC---CceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCc--ccHH-HHHHHHHHHHHHHHHH Confidence 55554432 233555542 233344456667788999999987644432 22211 2222 2223445566777777 Q ss_pred HHHHHHHHHHHHHH------------------HHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHH----HHHHHH Q lcl|NC_019445. 401 RLNDECLNPLIDRA------------------FSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIG----LSSLAS 458 (559) Q Consensus 401 ~l~~E~l~Pli~r~------------------~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~----~~~l~~ 458 (559) .+..|+-.-++... +.-|.+.| +.++-........ ...+.. T Consensus 303 ~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g-----------------~~t~~e~r~~l~~~g~~~~~~~~ 365 (382) T protein:vir:48 303 PFLSELSQKLSCDVDADIFPAVDPTGSNYISRINSLVKTG-----------------TLAQNQGLYILQQAEILPKELPN 365 (382) T ss_pred HHHHHHHHHhcChhhhhhhhhhccchhHHHHHHHHHhhcC-----------------ccCHHHHHHHHhhCCCCCcchhh Confidence 77766543332110 00111111 1111111110000 000111 Q ss_pred HHHHHHHHhccChhhHh Q lcl|NC_019445. 
459 TVNFIGQLAQAKPEALD 475 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~ 475 (559) ..+....+-+-+.+--+ T Consensus 366 ~~~~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 366 GENPNSTLKGGEEDGQD 382 (382) T ss_pred hhcCCCCCCCCCCCCCC Confidence 11101111111111111 No 156 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=82.25 E-value=0.079 Score=26.64 Aligned_cols=342 Identities=15% Similarity=0.062 Sum_probs=130.4 Q ss_pred HHHHHHHHHH-------------------------hhhHHHHHHHHHHHhccccCC-CC-CCCCCCcccccCC--CCcch Q lcl|NC_019445. 10 NKQFAQLESE-------------------------RQSFEPHWRELSDYINPRGSR-FL-TSEVNRNDRRNTR--IIDST 60 (559) Q Consensus 10 ~~r~~~l~~~-------------------------R~~~~~~w~e~~~~~~P~~~~-~~-~~~~~~~~~~~~~--~~~s~ 60 (559) -.-|++|++. |.+-++.-.....+..|.... ++ ......+.....+ .-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 3445555552 222222222222222222110 00 0000011111111 11233 Q ss_pred HHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCc Q lcl|NC_019445. 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYST 135 (559) Q Consensus 61 ~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~ 135 (559) .-.|++.+|+.+- +-|+....-. ... +.. .+ .++ += +.+.-+...+.+|+. 
|| T Consensus 81 v~acV~~Ia~~iA------~lpl~~~~~~-~~~---~~~-~~----------ll~~~PN~~~t~~~f~~~l~~~lll-Gn 138 (409) T protein:vir:83 81 AWACIDLNASVLS------SMPIYRMRNG-RII---DSV-AW----------MSNPDPEVYTSWQEFAKQLFWDFQL-GE 138 (409) T ss_pred HHHHHHHHHHhhc------cCceEEeeCC-ccc---cch-hh----------hcccCCCCCCCHHHHHHHHHHHHhh-CC Confidence 3345555555332 2245433211 100 000 00 111 11 122223445677765 99 Q ss_pred EEEEE-eecC-CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE Q lcl|NC_019445. 136 GAMAV-LEDD-EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS 213 (559) Q Consensus 136 ~~l~v-~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~ 213 (559) +..++ ..+. +.++...+++...+.+..+.+|... |+ T Consensus 139 ay~~~i~r~~~G~~~~L~pl~p~~v~v~~~~~g~~~--y~---------------------------------------- 176 (409) T protein:vir:83 139 AFVLPMAHGSDGYPIRFRVVPPWLVNVELKKGARRE--YR---------------------------------------- 176 (409) T ss_pred cEEEEEEECCCCcEEEEEEECCcceEEEEcCCceEE--EE---------------------------------------- Confidence 87653 3332 3344444455444444444333211 10 Q ss_pred EeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 214 VYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRK 293 (559) Q Consensus 214 v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~ 293 (559) ..+. +..-..++.|+....+..||.| |.+.+...+...+..++.. T Consensus 177 -------------------------~~~~---------~~~~eiiHir~~~~~~~~~G~s-pi~~~~~~i~~~~a~~~~~ 221 (409) T protein:vir:83 177 -------------------------IGGL---------NVTDEILHIRYQGNTADAHGHG-PLESAAPRQVVIGLLQKYV 221 (409) T ss_pred -------------------------Eccc---------cCccceEEeCCCCCCCCccccc-HHHHHHHHHHHHHHHHHHH Confidence 0000 0001245556555566789999 8987776666666666655 Q ss_pred HHHHHHHhcCceee--cCCCccccc-----ee------cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
294 SQLIDKATNPPMVA--PTSLKNQRA-----SL------LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIIN 360 (559) Q Consensus 294 ~~~~~~~~~p~~~~--p~~~~~~~~-----~~------~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~ 360 (559) .....-.+.|..++ +..+..... .+ ..|+...+ .++...-+|+. .++.-..+.+..+.....|- T Consensus 222 ~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il--~~g~~~~~~~~-~s~~d~q~le~r~~~~~eIa 298 (409) T protein:vir:83 222 QNLAETGGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALV--TGGATLNQAKS-MSAQDLSLMELTQFNEARIA 298 (409) T ss_pred HHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCcccee--cCCcccccccC-CCHHHHHHHHHHHhhHHHHH Confidence 55555566676443 333322111 11 11221212 11212123322 22332334444455677799 Q ss_pred HHhhcchhhhccCCCCCCcCHHHHHHHHHHHH-HHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEE Q lcl|NC_019445. 361 SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL-LMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVE 439 (559) Q Consensus 361 ~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~-~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~ 439 (559) ++|-+.........+..+-|-.-+.+...... ..|.|.+.+++.++-.-|+.+ +..+++. T Consensus 299 ~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~-------------------~~~~~f~ 359 (409) T protein:vir:83 299 ILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS-------------------PQHLELN 359 (409) T ss_pred HHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-------------------CcEEEee Confidence 99988755443333444455333333333333 367777777777765444211 0112211 Q ss_pred eecH-----HHH---HHHHHHH-----HHHHHHHHHHHHHhc---c-Chhh Q lcl|NC_019445. 440 YISV-----MAQ---AQKSIGL-----SSLASTVNFIGQLAQ---A-KPEA 473 (559) Q Consensus 440 ~is~-----La~---a~r~~~~-----~~l~~~~~~~~~la~---~-~P~~ 473 (559) +..- .++ .++..+. +-+.... .+-.+.. 
+ .-.| T Consensus 360 ~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~-glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 360 RDDYTRPSLVERATAYKIMIEAGVMEPNEARAME-RLHSEAAAVRLSGGGV 409 (409) T ss_pred hhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCcccCCCCC Confidence 1110 000 0000000 0001111 1111100 0 0111 No 157 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=81.97 E-value=0.081 Score=26.57 Aligned_cols=419 Identities=13% Similarity=0.119 Sum_probs=149.6 Q ss_pred HHHHhccc--cCCCCC--CCCCCcc---cc----cCCCCcchH-HHHH------HHHHHHHHHhhcCCCCcceeccCCcc Q lcl|NC_019445. 30 LSDYINPR--GSRFLT--SEVNRND---RR----NTRIIDSTG-TMAA------RTLASGMMSGITSPARPWFRLATPDP 91 (559) Q Consensus 30 ~~~~~~P~--~~~~~~--~~~~~~~---~~----~~~~~~s~~-~~a~------~~Las~l~~~l~pp~~~Wf~l~~~d~ 91 (559) +.+|-++. +.++.. .....-. .+ ...+|+..+ ++++ ..+..-+...+ .+.+|. +...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~i--a~~~~~-i~~~~~ 77 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDI--LRTGYL-IDGDDG 77 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHH--hcCCce-EecCcc Confidence 22222221 111110 0000000 00 011222111 1111 11111111111 133332 222221 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEE Q lcl|NC_019445. 92 EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDI 170 (559) Q Consensus 92 ~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~ 170 (559) .+ ..++-. ..-+++.-+...+.|+.++|||.+++..+. +.++...+++...+-+.++..+.+.. 
T Consensus 78 ~~------~~~lpN---------~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~ 142 (540) T protein:vir:41 78 GV------EELLRA---------CRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRYMQT 142 (540) T ss_pred ch------hhhccC---------CCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCceeEee Confidence 11 111100 011344456677889999999999887654 45566666666666555554432110 Q ss_pred EEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeec Q lcl|NC_019445. 171 CFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRES 250 (559) Q Consensus 171 i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~es 250 (559) . .+. ..++-..+ +. ... + ....+.... T Consensus 143 ~-----------------------------d~~----~~~~~~~~-~~---~~~----------~-~~~~g~~~~----- 169 (540) T protein:vir:41 143 W-----------------------------DGI----HVTYFKDY-RY---EGE----------V-NPDNGEDQD----- 169 (540) T ss_pred e-----------------------------cCc----eeeeeecc-cc---cce----------e-eccccccce----- Confidence 0 000 00000000 00 000 0 000000000 Q ss_pred CcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCCCcccc------------- Q lcl|NC_019445. 251 GFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTSLKNQR------------- 315 (559) Q Consensus 251 g~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~~~~~~------------- 315 (559) .|..--.+++|+....+..||.| |...++..+.......+.......-...|..+ +++.+.... T Consensus 170 ~~~~~eViHir~~~~~~~~~G~S-pi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~ 248 (540) T protein:vir:41 170 GVGANEIIFIHLPSPICSYYGVP-RYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTV 248 (540) T ss_pred eecccceEEecCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHH Confidence 01122366677666667799999 89888777777666666666666556677654 343332110 Q ss_pred -----------ceecCCceeecCCcC---CchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC--- Q lcl|NC_019445. 
316 -----------ASLLPGDITYIDQIT---GQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS--- 378 (559) Q Consensus 316 -----------~~~~pg~~~~~~~~~---~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~--- 378 (559) ..-.+|+......++ ++-.++|+.. ++.-..+.+..+..++.|-.+|-...-. ++..++.. T Consensus 249 ~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~-~~~d~qfle~~~~~~~eIa~afgVPp~~-lG~~~~~~~n~ 326 (540) T protein:vir:41 249 LQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNT-SQKELSFREYAAEKKHDIAAAHMIDPYR-LGITDVGPLGG 326 (540) T ss_pred HHHHHHHHhccccccccceEEEecCCCcccceeEEeccc-chhHHHHHHHHHHHHHHHHHHhCCCHHH-cCcccCCCCCc Confidence 011344444433221 1223555543 2333445566777889999999886443 33322221 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) -++++.... +....|.|++.++-..+.+ .++++.+ ..+.++|...- +.+..-...+ T Consensus 327 sn~eq~~~~--------------f~~~tL~P~~~~ie~~ln~-~L~~~~~-----~~~~i~f~~~~--ll~~D~~~~~-- 382 (540) T protein:vir:41 327 NFAEVARRT--------------YYESVVRPQQEIVSSVLTD-FIQLKLD-----PGARFVFNEEI--LMESEFVHNY-- 382 (540) T ss_pred ccHHHHHHH--------------HHHHHHHHHHHHHHHHHHH-hhhhccC-----CceEEEecchh--hcchHHHHHH-- Confidence 133332221 2233345555555444443 2333322 12455554321 1122111111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc---ccCCHHH-HHHHHHHH-HHHH----HHHHHHHHHHHHHHH Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT---VIVPQEQ-VDQARQQR-AQQQ----QQQQMMAMGMAAAQG 529 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~---~~rs~~e-v~~~rq~r-~q~~----q~~~~~~~~~~~~~~ 529 (559) + .+.+.+ -+.++++-.. ..|+|+. .+.+..- ...+..+. .++. ..+.......+..+. 
T Consensus 383 --~---~lv~~G-----~lT~NE~Re~---L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~ 449 (540) T protein:vir:41 383 --A---LLVQCG-----VLTPSEVREK---LFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQE 449 (540) T ss_pred --H---HHHhCC-----CCCHHHHHHH---hCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccC Confidence 1 111111 1333343211 2344431 1111000 00000000 0000 000000001111000 Q ss_pred -----------Hhh----hhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 530 -----------AKT----LSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 530 -----------a~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) -+. +........+.++.++.-.+-.|.-+- T Consensus 450 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (540) T protein:vir:41 450 IISSESPLEDKKKKIDEVLSDFRAEAYENGKKMLSIAGDMGTMSA 494 (540) T ss_pred ccccccccccccccccccccccCCccccchhHHHHHhhhhhhhhh Confidence 000 000111111223333222211111111 No 158 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=76.44 E-value=0.14 Score=25.34 Aligned_cols=343 Identities=9% Similarity=0.004 Sum_probs=153.0 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccc--cCCCCcchHHHHHHHHHHHHHHhhcCCCCcceecc Q lcl|NC_019445. 10 NKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRR--NTRIIDSTGTMAARTLASGMMSGITSPARPWFRLA 87 (559) Q Consensus 10 ~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~--~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~ 87 (559) -..|+.++..++.-...-.+...++.|. +.+.. ..+..- ..-.-.++--.|++.+|+.+.+. 
|+ ++ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~------p~-~~- 68 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPD---FLSTL-NGSEWVSAESALRNSDLFSIINQLSNDLATV------KL-TA- 68 (386) T ss_pred Ccccccccccccccccccccccccccch---hcccc-cCCceechhhhhcchHHHHHHHHHHHhhccC------ce-ee- Confidence 3334444444443222222222222221 11110 111100 01122344445666666655442 22 11 Q ss_pred CCccchhhHHHHHHHHHHHHHHHHHHHHhccch----HHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEee Q lcl|NC_019445. 88 TPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLY----QSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLAN 162 (559) Q Consensus 88 ~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~----~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~ 162 (559) -+.. ....+.+-|.+ .-+..++.++.++||+.+++..+. +.++.+.++|...+-+.+ T Consensus 69 -~~~~-----------------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~ 130 (386) T protein:vir:48 69 -SRKQ-----------------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNR 130 (386) T ss_pred -ccch-----------------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEE Confidence 1110 11133344433 334456778899999999987764 355677778888877777 Q ss_pred CCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCC Q lcl|NC_019445. 163 SPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGD 242 (559) Q Consensus 163 d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~ 242 (559) +.+|... .|+ +... ..... ...... T Consensus 131 ~~~~~~~-~y~-~~~~------------------------~~~~~---------------------------~~~~~~-- 155 (386) T protein:vir:48 131 LDNKDGI-YYN-ITFD------------------------DPRIP---------------------------PKQHVP-- 155 (386) T ss_pred cCCCceE-EEE-EEec------------------------Ccccc---------------------------ceeEec-- Confidence 6655321 111 1000 00000 000000 Q ss_pred CceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcccc----- Q lcl|NC_019445. 
243 NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR----- 315 (559) Q Consensus 243 ~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~~----- 315 (559) .--.++.|....++..||.| |...+...+.....+.+.......-...|..++ ++...... T Consensus 156 -----------~~evih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~ 223 (386) T protein:vir:48 156 -----------QGDVLHFKLLSVDGGLTSVS-PLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLS 223 (386) T ss_pred -----------CccEEEecCCCCCCceeecc-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHH Confidence 01134445555667789999 899888888888888888888888777887654 33332210 Q ss_pred -----ceecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 316 -----ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 316 -----~~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) ..-..|++...+ ++..++|+.. ++.-..+.+..+..+..|-.+|-......... +..-+++|- ...- T Consensus 224 ~~~~~~~~n~g~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~~e~~--~~~~ 295 (386) T protein:vir:48 224 RSRQAMKQMQGGPLVLD---DLEEFTPLEI-KSNVSQLLKQADWTTGQFAKVYGIPENVVGGQ--GDQQSSLEM--SLDL 295 (386) T ss_pred HHHHHhhcCCCCceecC---CCceEEEcCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCcccHHHH--HHHH Confidence 111234433332 2233555532 23333445666777889999998765543221 211233322 1223 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHH------------------HHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHH Q lcl|NC_019445. 391 KLLMLGPVLERLNDECLNPLIDRA------------------FSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIG 452 (559) Q Consensus 391 ~~~~LG~v~~~l~~E~l~Pli~r~------------------~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~ 452 (559) ....|.|.+..++.++-.-|+.++ +.-+.+.|. .|+-..-.. 
.+ T Consensus 296 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~-----------------~t~nE~r~~-lg 357 (386) T protein:vir:48 296 YNKAVSRYLRPFLSELSQKLSCDVDADILPAVDPTGSNSVSRINSMVKSGT-----------------LAQNQGLYI-LQ 357 (386) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhcchhhhhccChHHHHHHHHHHHhCCC-----------------cCHHHHHHH-hh Confidence 455678888888777654443211 111222221 111110000 00 Q ss_pred HHHH----HHHHHH--HHHHhccChhhHh Q lcl|NC_019445. 453 LSSL----ASTVNF--IGQLAQAKPEALD 475 (559) Q Consensus 453 ~~~l----~~~~~~--~~~la~~~P~~~~ 475 (559) ...+ ...... ...+.+-+++--+ T Consensus 358 ~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 358 QAEILPKELPEGENPNKTTLKGGEINGED 386 (386) T ss_pred cCCCCCccchhhcCCCCCccCCCCCCCCC Confidence 0000 000110 1122222222222 No 159 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=76.39 E-value=0.14 Score=25.33 Aligned_cols=394 Identities=14% Similarity=0.146 Sum_probs=163.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCC---CCCCCCCcccccCCC--CcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRF---LTSEVNRNDRRNTRI--IDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~~~--~~s~~~~a~~~Las~l~~~ 75 (559) |...+ -+...++= .+.+ ...-.+........+ ....+..+|++.|..+ T Consensus 1 ~~~~D-----------------------~~~n~~~g-g~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~--- 53 (422) T protein:vir:10 1 MVKTD-----------------------SYANIFLG-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETA--- 53 (422) T ss_pred Cccch-----------------------hhHHHHcC-CCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHH--- Confidence 32211 01111110 0000 000000000000001 1333444555555444 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeec Q lcl|NC_019445. 
76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPI 155 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l 155 (559) | +.|+.++-.+.. .. +.+.+++-++...+.++++.--+||.|++++.-+..+.+ .-|+ T Consensus 54 -~---r~g~~i~~~~~~----~~-----------~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~---~~Pl 111 (422) T protein:vir:10 54 -L---AAGFHIDGIDDE----PA-----------FWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL---TSPV 111 (422) T ss_pred -h---cCCccccCCCHH----HH-----------HHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCc---cccc Confidence 2 688888632211 11 112334457789999999999999999998875333322 1233 Q ss_pred cEEEEeeCCCCCEEE--EEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEE Q lcl|NC_019445. 156 GSYYLANSPRGSVDI--CFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFK 233 (559) Q Consensus 156 ~~~~v~~d~~G~vd~--i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~ 233 (559) . ..|.+-. ++-+..+++.. +..+.++ .++. +.+.|+ |.++.. ..+ T Consensus 112 ~-------~~g~~~~l~v~d~~~i~~~~----~~~dp~s---------~~fg-~P~~y~-v~~~~~---------~~~-- 158 (422) T protein:vir:10 112 R-------EGAELETVRVYDRTQVKVQT----REENPRN---------ARFG-EPLTYR-ITTNES---------DMF-- 158 (422) T ss_pred c-------ccCceeeEEeeccccccchh----cccCccc---------cccC-cceEEE-EecCCC---------Ccc-- Confidence 2 1343322 22233333211 1111110 1111 112221 211110 001 Q ss_pred EEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecC--- Q lcl|NC_019445. 234 SVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGML-ALGPVKALQLLQKRKSQLIDKATNPPMVAPT--- 309 (559) Q Consensus 234 sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~-~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~--- 309 (559) +.+|. .+++.-.|. ..|+ +.+.....||+| |... +++.++..+..+....+.+..+....+.++. 
T Consensus 159 -~~iH~----SRli~~~g~-~~p~----~~~~~~~~~G~S-~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~ 227 (422) T protein:vir:10 159 -YDVHY----SRIHIIDGE-RIPN----VMRRQNDGWGRS-VLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAE 227 (422) T ss_pred -eeecc----ceeEEeCCC-Cchh----hhcccCCcccch-hHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHH Confidence 11121 122222111 1233 445567778998 7765 5688888888888888877766655554432 Q ss_pred ----CCcc----cccee------cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC Q lcl|NC_019445. 310 ----SLKN----QRASL------LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN 375 (559) Q Consensus 310 ----~~~~----~~~~~------~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~ 375 (559) +... .++.. .-|++. +. +..+.+.++. .++..+-..+....+.|.-+.=-.+--..++ . T Consensus 228 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~-l~--~~~e~~e~~~---~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~-s 300 (422) T protein:vir:10 228 LCDDSEGFGAARLRLAQVDNNSGVGQAIG-ID--AESEEYSVLN---SDIGGIDAFLDKKFDRIVALSGIHEIILKNK-N 300 (422) T ss_pred hcCCccchHHHHHHHHHHHHhcCCcccee-Ee--cCCcceEEEe---cccCChHHHHHHHHHHHHhhhCCCeeeeccC-C Confidence 1100 00110 111211 11 1122344332 2344445556666777766653321112222 3 Q ss_pred CCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHH Q lcl|NC_019445. 376 TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSS 455 (559) Q Consensus 376 ~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~ 455 (559) +...+||.= +-....---+..++...+.|++++.+.++.+.. ++.+++- ||-+......++. 
T Consensus 301 ~~Glnatgd-----~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s~------------~~~~~f~-pL~~~sekekaei 362 (422) T protein:vir:10 301 VGGVSSSQN-----TALETFHKLVDRKRNAELLPILEFLIPFIVNAE------------EWSVEFN-PLAQESSKDKAEI 362 (422) T ss_pred cccccccch-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC------------CcEEEeC-CCCCCCHHHHHHH Confidence 445544311 111222223344566678999999999987632 3566654 3333332222222 Q ss_pred HHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHH---cCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019445. 456 LASTVNFIGQLAQAKPEALDKLNVDQAIDAFADM---SGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKT 532 (559) Q Consensus 456 l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~---~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~ 532 (559) .....+.+..+.+.+ .++.+++.+.+... .|+...+...+.+.+.. + .. T Consensus 363 ~~~~a~a~~~~~~~g-----~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~--~---------------------~~ 414 (422) T protein:vir:10 363 LEKNVNSIAALIAAG-----AMDIDEARDTLRTIAPEVKINDGSVETEVTISET--S---------------------ND 414 (422) T ss_pred HHHHHHHHHHHHhcC-----CCCHHHHHHHhhhhcccccCCCCCCccccchhhc--C---------------------CC Confidence 223333333333322 47777777766543 44444433322211000 0 11 Q ss_pred hhhhcCCC Q lcl|NC_019445. 533 LSEAKTSD 540 (559) Q Consensus 533 ~~~~~~~~ 540 (559) -.+.+.++ T Consensus 415 ~~~~~~~d 422 (422) T protein:vir:10 415 PLEVPTDD 422 (422) T ss_pred CCCCCCCC Confidence 11112222 No 160 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=75.93 E-value=0.14 Score=25.25 Aligned_cols=417 Identities=11% Similarity=0.043 Sum_probs=142.5 Q ss_pred cccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCc-c-----chhhHHHHHHHHHHH--HHHHHH-HHHhccchH Q lcl|NC_019445. 
51 RRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPD-P-----EMMDYGPVKLWLEAV--QNRMND-MFNKSNLYQ 121 (559) Q Consensus 51 ~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d-~-----~~~~~~~v~~~l~~v--e~~~~~-~l~~snf~~ 121 (559) .+.=.-.+++...|++.+|..+. +.||- +...+ . .......+..++... ...+.. .+....+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia------~~p~~-i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~ 73 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVA------GFGIN-IIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATN 73 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhh------cCCeE-EEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHH Confidence 11111124566677777777664 23342 21111 1 111111111111110 000100 011223455 Q ss_pred HHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHh Q lcl|NC_019445. 122 SLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWE 200 (559) Q Consensus 122 ~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~ 200 (559) -+..++.|+.++|||.+++..+. +.++.+.+++...+.+..|..+.+...- T Consensus 74 ~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~---------------------------- 125 (467) T protein:vir:31 74 VLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLE---------------------------- 125 (467) T ss_pred HHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecC---------------------------- Confidence 66778899999999999988754 5566676676666666666543321110 Q ss_pred cCCCCceEEEEEEEeecCcccccccccccccEEEEE-EEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHH Q lcl|NC_019445. 201 SGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVY-YEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLA 279 (559) Q Consensus 201 ~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~-~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~ 279 (559) .....+.++ ...+.. 
+..+..+..++ +........+ .+..-=.+++|.....+..||.+ |..-+ T Consensus 126 --~~~~~~~~~-----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----~~~~~diih~r~~~~~~~~~G~s-~~~~~ 190 (467) T protein:vir:31 126 --EKEKYFGVA-----GDRYQT---NGNGDLDPVFVDADDGSTGTSV----SNPANELIFKRNHSPLYPHYGAP-DIIPA 190 (467) T ss_pred --CceeeEEec-----ccccee---ecccceeeeeeeecccccccee----EeccccEEEecCCCCCCCccccc-HHHHH Confidence 000011100 000000 00000000000 0100000000 11122356677666677899999 88888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceee--cCC-Ccccc-------c-eecC-----------Ccee---ecCCcCCch- Q lcl|NC_019445. 280 LGPVKALQLLQKRKSQLIDKATNPPMVA--PTS-LKNQR-------A-SLLP-----------GDIT---YIDQITGQD- 333 (559) Q Consensus 280 l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~-~~~~~-------~-~~~p-----------g~~~---~~~~~~~~~- 333 (559) +..+......++.......-...|..++ ++. +.... + +... |..+ +.-..++.. T Consensus 191 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~ 270 (467) T protein:vir:31 191 VKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADR 270 (467) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcc Confidence 7766666666555555555556666443 432 11110 0 0000 1000 000001100 Q ss_pred -----hhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHH-HHHHHHHhhhHHHHHHHHHH Q lcl|NC_019445. 334 -----GFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEM-KEEKLLMLGPVLERLNDECL 407 (559) Q Consensus 334 -----~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r-~~e~~~~LG~v~~~l~~E~l 407 (559) .+.|+....+.-..+.+........|.++|-..... ++..+.. -+++-+.+. 
..-....|.|.+.+++.++- T Consensus 271 ~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~-lG~~~~~-~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln 348 (467) T protein:vir:31 271 SDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVI-AGVVESG-AFSTDAEEQRKEFAEETIQPKQHDFGELLY 348 (467) T ss_pred cccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHH-cccCCCC-CcccCHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123332222222334455566778899999876433 3322211 122222111 12123334454444444433 Q ss_pred HHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHH Q lcl|NC_019445. 408 NPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFA 487 (559) Q Consensus 408 ~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a 487 (559) ..+ ++.- ....+..|++.+...+. .......+.+...++ .+ -+..++ +- T Consensus 349 ~~l-------------~~~~-~~~~~~~i~f~~~~l~~-~d~~~~~~~~~~~~~-------~G-----~~T~NE----~R 397 (467) T protein:vir:31 349 ELV-------------HKQG-LDAPDWTIEFELAKPDT-KLQDVEIASQRVQAM-------QG-----LLTVNE----LR 397 (467) T ss_pred Hhh-------------cchh-hccCCceEEEecchhhc-cCHHHHHHHHHHHHh-------CC-----CcCHHH----HH Confidence 222 2210 01122235554444431 111111111111111 00 011111 11 Q ss_pred HHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC-hhHHHHHHHHh-----hcCCCCCC Q lcl|NC_019445. 488 DMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD-PSVLSAMANAV-----SGQGGQSQ 559 (559) Q Consensus 488 ~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~-----~~~~~~~~ 559 (559) ..+|.|+- .+++.-....-..+.. ....+..+..+.-.+..... .+.+..+-+.. 
.+-|.++- T Consensus 398 ~~~Gl~pi---~d~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (467) T protein:vir:31 398 DEFGFEPF---PEEHVYGGETLVAEVT------GGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANAD 466 (467) T ss_pred HHhCCCCC---CcccccCCcccccccc------cccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccccC Confidence 22333320 0000000000000000 00000000000000000000 00000000000 00000000 No 161 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=75.49 E-value=0.15 Score=25.16 Aligned_cols=403 Identities=11% Similarity=0.093 Sum_probs=157.6 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCC-CCCCCCCCccc--ccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSR-FLTSEVNRNDR--RNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~-~~~~~~~~~~~--~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |.+.- + +++++++.....|... ..+ ..-|.... +.+..+..+.. ..+-+=.++--.|++.+|+.+.+ T Consensus 1 ~~~~~-~---~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~--- 70 (437) T protein:vir:10 1 MKQGK-Q---RALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIAT--- 70 (437) T ss_pred CCcch-h---hhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhh--- Confidence 76643 3 3444444443333211 000 00000000 00111111110 00111123344456666665433 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMP 152 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~ 152 (559) -||.-....... .+. 
.+ .+..+...|+ +- +.+.=.+..+.++.++||+.+++..+.+.+..+.+ T Consensus 71 ---lp~~~~~~~~~g-~~~-~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~~L~~ 139 (437) T protein:vir:10 71 ---LPLNLYQTKPDG-TRV-LA------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLIGLEL 139 (437) T ss_pred ---CceeEEEEcCCC-cee-ec------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEE Confidence 255433221111 000 00 1222333443 22 33444666788999999999999888776666777 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ++...+-+.++.+|.+- |+. . ..+ .... .++ T Consensus 140 l~p~~v~i~~~~~g~~~--y~~-~------------------------~~~-g~~~-----~~~---------------- 170 (437) T protein:vir:10 140 MLPQRTTVKRLTSGALQ--YTY-R------------------------NVD-GTVS-----TLA---------------- 170 (437) T ss_pred EcCcceEEEECCCCeEE--EEE-E------------------------ecC-ceEE-----EEc---------------- Confidence 77777666666555421 110 0 000 0000 000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~ 310 (559) . .=++++|....+| .||.| |...+...+.....+.+.......-...|..++ +.. 
T Consensus 171 ---------~------------~dIih~r~~~~d~-~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 227 (437) T protein:vir:10 171 ---------E------------DDVFHVRGFSLDG-LMGLT-PIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQI 227 (437) T ss_pred ---------c------------ccEEEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC Confidence 0 0123334333334 89999 898887777777777777777777777786554 333 Q ss_pred Ccccc-------c-ee-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 311 LKNQR-------A-SL-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 311 ~~~~~-------~-~~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) +.... + +. ..|++.+.+ ++-.++|+.. ++....+.+.....+..|-.+|-..... ++..+.. T Consensus 228 l~~e~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~ 302 (437) T protein:vir:10 228 LQKEKRAEIRTDLAEQFGGAMQAGKTMVLE---AGMKYQAITM-NPGDVQLLETRAFNIEEICRWYRVPPFM-VGHSEKS 302 (437) T ss_pred CCHHHHHHHHHHHHHHhcCccccCcceecc---CCceEEeccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHH-hCCCCCc Confidence 32211 1 11 123333332 2234555532 2333344555566678899999875433 3333332 Q ss_pred CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHH Q lcl|NC_019445. 378 SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 (559) Q Consensus 378 ~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~ 457 (559) ..+...+.+.... |...-|.|++.+.-..|.+. +|++ .+..+.-|++.+.+.+. .--....+.+. 
T Consensus 303 t~~~sn~e~~~~~-----------f~~~tl~P~~~~ie~~l~~k-ll~~--~e~~~~~~~fd~~~ll~-~d~~~r~~~~~ 367 (437) T protein:vir:10 303 TSWGTGIEQQTLG-----------FLTFTLRPWLTRIEQAARRS-LLRP--GERDQFYAEFSVEGLLR-ADSAGRAAFYS 367 (437) T ss_pred ccccchHHHHHHH-----------HHHHHHHHHHHHHHHHHHhh-ccCc--cccCceEEEEechhhhc-cCHHHHHHHHH Confidence 2233333333322 33444566655554444432 2332 12222235554443321 11112222222 Q ss_pred HHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc---c--cc------CCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 458 STVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP---T--VI------VPQEQVDQARQQRAQQQQQQQMMAMGMAA 526 (559) Q Consensus 458 ~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~---~--~~------rs~~ev~~~rq~r~q~~q~~~~~~~~~~~ 526 (559) ..++. + -+.++++- +.+|.|+ . ++ .+-+... ++ ....+.+ + +.. T Consensus 368 ~~~~~-----G-------~~T~NE~R----~~~gl~pi~gg~~~~~~~~~~~~~~~~~---~~-~~~~~~~---~--~~~ 422 (437) T protein:vir:10 368 TMTQN-----G-------LMTRDECR----AKENLPPMGGNAAVLTVQSALLPIDKLG---EH-TTATAAQ---D--ALK 422 (437) T ss_pred HHHhC-----C-------CcCHHHHH----HHhCCCCCCCCcceEeecCcccchhhcc---Cc-CCCcchh---c--ccc Confidence 22210 0 01111111 1112111 0 00 0111100 00 0000000 0 000 Q ss_pred HHHHhhhhhhcCCChhH Q lcl|NC_019445. 527 AQGAKTLSEAKTSDPSV 543 (559) Q Consensus 527 ~~~a~~~~~~~~~~~~~ 543 (559) ...... +...+.++- T Consensus 423 ~~~~~~--~~~~~~~e~ 437 (437) T protein:vir:10 423 AWLYQE--EKTRATQER 437 (437) T ss_pred ccCCCC--CCCCccccC Confidence 000000 000001111 No 162 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=75.48 E-value=0.15 Score=25.16 Aligned_cols=444 Identities=11% Similarity=0.039 Sum_probs=168.0 Q ss_pred CChhhHHHHHHHHHHHH-HHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLE-SERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~-~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |+-.........+...- ..|.. -+..|..+.. |.+..-. ... -....+..+|++.|..+ T Consensus 68 ~a~d~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~--~~~~~l~--a~Y---~~~~l~r~iVd~~A~d~------- 127 (537) T protein:vir:10 68 MAMDGLDVEGGTFSAYANPNLSE------GLVLWYAQQA--FIGHQMC--ALI---ATHWLVNKACSQMPRDA------- 127 (537) T ss_pred hhccccccchhhhhhhccccccc------hhhhhccccC--CccHHHH--HHH---HhCchhhhhhhhhhHHh------- Confidence 32211000001111100 00000 0111222111 2111100 000 01234555566665533 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEE Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYY 159 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~ 159 (559) -+.|+.+...+.+..+...++. +.+.+++-+++..+.+++...-+||.+++++.-+...+... .-|+.--. T Consensus 128 ~r~~~~i~~~~~~~~~~~~~~~--------l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~-~~Pl~~~~ 198 (537) T protein:vir:10 128 MRKGYKIISDDGNELDPKDAKF--------IDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYY-EKPFNIDG 198 (537) T ss_pred hcCCceeecCCcccccHHHHHH--------HHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCccc-cccccccc Confidence 3679998876543333333333 33344455788899999999888999988876432211000 11221100 Q ss_pred EeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceE---EEEEEEeecCcccccccccccccEEEEE Q lcl|NC_019445. 160 LANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWI---EVMHSVYPNIDRDTSKLDSKNKPFKSVY 236 (559) Q Consensus 160 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v---~v~~~v~p~~~~~~~~~~~~~~~~~sv~ 236 (559) | ..|.+..+.. +.... ++........+++....| +.|. |. ... 
T Consensus 199 i---~kg~~k~l~v---idp~~---------~~~~~~~~~~~dp~sp~fg~P~~y~-v~------------------g~~ 244 (537) T protein:vir:10 199 V---MPGAYKGIVQ---IDPYW---------CAPLLDAQASSNPVSMHFYEPTYWL-IN------------------GKK 244 (537) T ss_pred c---cccceeEEEE---echhh---------cccccchhhhccCCccccCCceeee-ec------------------CeE Confidence 1 1111111110 00000 000000011111111111 1111 10 011 Q ss_pred EEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC--CCccc Q lcl|NC_019445. 237 YEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPT--SLKNQ 314 (559) Q Consensus 237 ~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~--~~~~~ 314 (559) +| ..+++.-.|.. .|++ .+....-||++ ..+.++..++..+.......+.+....-..+.+.. .+.+. T Consensus 245 iH----~SRli~f~g~~-~p~~----~~~~~~~~G~S-vlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~ 314 (537) T protein:vir:10 245 YH----RSHLAIYINDE-VVDF----LKPSYIYGGVP-LPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANK 314 (537) T ss_pred ec----ceeEEEecCCC-Cchh----hhcccCccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCH Confidence 11 11233322222 3443 23334457999 48889899999888888888777777666665432 11111 Q ss_pred -----cce----e-cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCH--H Q lcl|NC_019445. 315 -----RAS----L-LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPV--E 382 (559) Q Consensus 315 -----~~~----~-~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA--~ 382 (559) .+. . .-.+...++.. .+.+.... .++..+-..+....+.|.-++=..+.-.+++. 
+....| + T Consensus 315 ~~~~~r~~~~~~~r~n~g~~~id~e--~e~~e~~~---~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~s-p~GlnatGe 388 (537) T protein:vir:10 315 QQFDETMSWWTATRDNYQVRVVDKD--NEDVVQID---TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTV-PTGFNSTGD 388 (537) T ss_pred HHHHHHHHHHHhhcCCcceeEecCC--CceeEEEe---ccCCCHHHHHHHHHHHHHhhhCCCceeeccCC-ccccccchh Confidence 110 0 11233444321 12233222 13334444556666667666533222222222 122222 2 Q ss_pred -HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 -AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVN 461 (559) Q Consensus 383 -Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~ 461 (559) ++.. .---+..++.+ +.|++++++.+|++....+++ ++++++- ||.+......++......+ T Consensus 389 ~D~~~--------yyd~I~~~Qe~-l~p~l~~l~~ll~~~~~~~~~-------~~~i~f~-pL~~~s~kEkAei~~~~a~ 451 (537) T protein:vir:10 389 YEEAS--------YHEECESTQDD-MRPLIDRHHQLVCRSHLRKRI-------RVKVEFP-PMDAPKESERADTFLKKMQ 451 (537) T ss_pred HHHHH--------HHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCc-------ceEEEeC-CCCCCCHHHHHHHHHHHHH Confidence 2222 22223445544 789999999999887655432 3666655 3332222222222222223 Q ss_pred HHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc-cccC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC Q lcl|NC_019445. 462 FIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP-TVIV--PQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKT 538 (559) Q Consensus 462 ~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~-~~~r--s~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~ 538 (559) .+..+.+.+ .|+.+++-+.+...-..+. .+.- +.++.+....+ ..........+...+... T Consensus 452 a~~~~~~~G-----~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~-----------~~~~~~~~~~~~~~~~~~ 515 (537) T protein:vir:10 452 AAKLAFEMG-----AVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVD-----------DEGKPVRIIEDQPAPSEM 515 (537) T ss_pred HHHHHHHcC-----CCCHHHHHHHHhccCccccccccCCCChhhhhcccCC-----------ccCCcCCCCCCCCCcccc Confidence 333333322 4888888888876422221 1111 11221111000 000000000000000000 Q ss_pred CCh-hHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 
539 SDP-SVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 539 ~~~-~~~~~~~~~~~~~~~~~~ 559 (559) .+. +.... .--...+++- T Consensus 516 ~~~~~~~~~---~~~~~~~~a~ 534 (537) T protein:vir:10 516 FGATSSGES---ANDPRDSGAA 534 (537) T ss_pred CCCCccccc---cCCCccCccc Confidence 000 00000 0000000000 No 163 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=75.04 E-value=0.15 Score=25.08 Aligned_cols=369 Identities=13% Similarity=0.092 Sum_probs=149.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHH--HHHHHHHHhccccC-CCCCCCCCCccccc--CCCCcchHHHHHHHHHHHHHHh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEP--HWRELSDYINPRGS-RFLTSEVNRNDRRN--TRIIDSTGTMAARTLASGMMSG 75 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~--~w~e~~~~~~P~~~-~~~~~~~~~~~~~~--~~~~~s~~~~a~~~Las~l~~~ 75 (559) |-+. .....++++++--.+-++ .+.....--.+... .+....+..+..-+ .-+=.++--.|++.+|+.+. . T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia-~ 76 (432) T protein:vir:10 1 MPDE---KKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIA-A 76 (432) T ss_pred CCCC---cccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhh-h Confidence 4332 122333333222211110 00000000000000 00000000111000 00112333445555555443 2 Q ss_pred hcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecCCceEEE Q lcl|NC_019445. 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRT 150 (559) Q Consensus 76 l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~ 150 (559) -||.-..-.+....+ ..++-+...|+ +-| .+.=.+..+.++.++|||.+++..+.+++... 
T Consensus 77 -----lp~~~y~~~~~g~~~---------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L 142 (432) T protein:vir:10 77 -----MPLTMYMRTPDGRKE---------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESL 142 (432) T ss_pred -----CceeEEEecCCCccc---------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEE Confidence 255322111111000 11233344443 222 33335567788899999999888776666667 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) .+++...+-+..|.+|++ +|+... .+ .+.++ ++ T Consensus 143 ~~l~~~~v~v~~~~~g~~--~y~~~~-------------------------~~-g~~~~-----~~-------------- 175 (432) T protein:vir:10 143 QYLANDRLTITTDTKGNT--AYRYRR-------------------------TD-GQMID-----IP-------------- 175 (432) T ss_pred EEEcCCceEEEEcCCCcE--EEEEEe-------------------------cC-ceEEE-----Ec-------------- Confidence 778888888877777753 222100 00 00010 00 Q ss_pred cEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--c Q lcl|NC_019445. 231 PFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--P 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p 308 (559) .++ +++.|....+| .||.| |...+...+.......+.......-...|..++ + T Consensus 176 -----------~~~------------iih~~~~~~dg-~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~ 230 (432) T protein:vir:10 176 -----------KQQ------------IWKIMGYSLDG-ENGLS-AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID 230 (432) T ss_pred -----------Ccc------------EEEecCCCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC Confidence 000 22333333344 78999 888776666655555555555555555665443 3 Q ss_pred CCCcccc-------ce--ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC-CCC Q lcl|NC_019445. 
309 TSLKNQR-------AS--LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN-TRS 378 (559) Q Consensus 309 ~~~~~~~-------~~--~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~-~~~ 378 (559) ..+.... +. ...|++...+ ++-.++++.. ++.-..+.+..+..+..|-++|-..... ++..+ +.. T Consensus 231 ~~l~~e~~~~~~~~~~~~~nag~~~vl~---~g~~~~~l~~-~~~d~q~le~~~~~~~~Ia~afgVPp~~-lg~~~~~t~ 305 (432) T protein:vir:10 231 RFLTDDQYDSFAKKVSGSVEAGRAPLLE---GGMDVKSLGL-NPVDAQLLQSRQYSVESICRFFGVPPSM-IGHSSAGTT 305 (432) T ss_pred CCCCHHHHHHHHHHHhhhhhCCCceecC---CCceEEEccC-ChHHHHHHHHHHHHHHHHHHHhCCCHHH-cCCccCCcc Confidence 3332211 11 1234444442 2233555532 3333445566677888899999775433 33322 222 Q ss_pred cCHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC--------- Q lcl|NC_019445. 379 MPVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDR-------------------------AFSMMVRKNM--------- 423 (559) Q Consensus 379 ~TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r-------------------------~~~il~r~g~--------- 423 (559) -+..-+.+.... ....|.|.+.+++.|+-.-|+.. .+..+...|. T Consensus 306 ~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~ 385 (432) T protein:vir:10 306 SWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREI 385 (432) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 222333333332 23467777777777765444211 2222333343 Q ss_pred --CCCCchhhCCcceEEEeec---HHHHHHHHHHHHHHHHHHHHHHHHhccChh-hHhcCCHHHHHH Q lcl|NC_019445. 424 --LPPPPDAMEGMPLKVEYIS---VMAQAQKSIGLSSLASTVNFIGQLAQAKPE-ALDKLNVDQAID 484 (559) Q Consensus 424 --lp~~p~~l~g~~v~~~~is---~La~a~r~~~~~~l~~~~~~~~~la~~~P~-~~~~id~d~~~~ 484 (559) +||+| |.+..+...+ ||..+ +.-..-.|. 
...+-+-++.-+ T Consensus 386 ~glppi~----g~~~~~~~~~~~~pl~~~----------------~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 386 EGLPKLG----GNAAVLTVQSAMVPLDSI----------------GLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred hCCCCCC----CCcceEeecCcccchhhh----------------cccCCCCCCCCCCCcccccccC Confidence 23222 2111111111 22211 111111111 111222222222 No 164 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=72.67 E-value=0.18 Score=24.67 Aligned_cols=448 Identities=10% Similarity=-0.002 Sum_probs=187.5 Q ss_pred CChhhH-----HHHH---HHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCC-----CCCCcccccCCCCcchHHHHHHH Q lcl|NC_019445. 1 MAETTK-----ERLN---KQFAQLESERQSFEPHWRELSDYINPRGSRFLTS-----EVNRNDRRNTRIIDSTGTMAART 67 (559) Q Consensus 1 M~~~~~-----~~l~---~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~-----~~~~~~~~~~~~~~s~~~~a~~~ 67 (559) |.-++. +... ..|..-...+. +..+.| .|........ .....+.+.--.-++.+..+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~~w-~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFG------GQLRGW-NPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCC------Cccccc-ccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 433321 1111 11211111111 111111 1111000000 00000111111246688999999 Q ss_pred HHHHHHHh-hcCCCCccee-ccCCccchhhHHHHHHHHHHHHHHHHHHHH----------hccchHHHHHHHHHHHhhCc Q lcl|NC_019445. 68 LASGMMSG-ITSPARPWFR-LATPDPEMMDYGPVKLWLEAVQNRMNDMFN----------KSNLYQSLPQLYGSLGTYST 135 (559) Q Consensus 68 Las~l~~~-l~pp~~~Wf~-l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~----------~snf~~~~~~~~~dl~~~G~ 135 (559) +++.+++. 
++|..+|=++ |...++ ..+.|-+.|++.-...-+ ..+||.....++...++-|- T Consensus 74 ~~~nvVG~Gi~~~~~p~~~~l~~~~~------~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE 147 (530) T protein:vir:38 74 HQDHIVGSFFRLSYRPSWRYLGINEE------DSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGE 147 (530) T ss_pred HHHHhhCCCceeeeccchhhcCCCHh------HHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCc Confidence 98888875 6765544333 322211 123333344443322211 34799999999999999999 Q ss_pred EEEEEeecCCce----EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE Q lcl|NC_019445. 136 GAMAVLEDDEDI----IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM 211 (559) Q Consensus 136 ~~l~v~~~~~~~----~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~ 211 (559) +++.+..++..+ +.++.+....+--..+ .+... .|+ T Consensus 148 ~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~--------------------------------------~~~~~--~i~ 187 (530) T protein:vir:38 148 LCVQATWDSDSTRLFRTQFKMVSPKRVSNPNN--------------------------------------IGDTR--NCR 187 (530) T ss_pred eEEEeeeccCCCCccceEEEEechhhcCCCCC--------------------------------------CCCCC--eeE Confidence 988766544433 2222222222110000 01011 244 Q ss_pred EEEeecCcccccccccccccEEEEEEEecCCC-ce---eeeecCcccCC---eEEEEeeecCCCcccccchHHHHHHHHH Q lcl|NC_019445. 212 HSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDN-DK---LLRESGFDEFP---IMAPRWEVNGEDVYGSSCPGMLALGPVK 284 (559) Q Consensus 212 ~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~-~~---il~esg~~~~P---~~~~rw~~~~g~~YGrG~P~~~~l~d~~ 284 (559) ..|+-+... .|.+ ||+.....+ .. 
..+...+...| +++.-....+|..=|.+ ..-.+|..++ T Consensus 188 ~GIe~d~~G---------r~~a-Y~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis-~lapvl~~l~ 256 (530) T protein:vir:38 188 AGVKINDSG---------AALG-YYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGAN-AFYSVMEQMK 256 (530) T ss_pred eeeEECCCC---------ceEE-EEEeeccCCCccccccceeeeeeccChhHeEeeccccCCCcccCCc-hHHHHHHHHH Confidence 555432221 1221 222211000 00 00000011112 33343445578888998 5889999999 Q ss_pred HHHHHHHHHHHHHHHHhcCceeecCCCc--------------------------------cccceecCCceeecCCcCCc Q lcl|NC_019445. 285 ALQLLQKRKSQLIDKATNPPMVAPTSLK--------------------------------NQRASLLPGDITYIDQITGQ 332 (559) Q Consensus 285 ~L~~l~~~~~~~~~~~~~p~~~~p~~~~--------------------------------~~~~~~~pg~~~~~~~~~~~ 332 (559) .|+....+.+.++..++.....+..+.. .....+.||.+.+....... T Consensus 257 ~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i 336 (530) T protein:vir:38 257 MLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSL 336 (530) T ss_pred HHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCee Confidence 9999999999999988887655432110 01124677776655332222 Q ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_019445. 333 DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLID 412 (559) Q Consensus 333 ~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~ 412 (559) ..+.|-.. +.++..+ ...+...|-.++=..+ .++. .|-..++-.-+++-..|.-+.+-..=..|..-|+.|+.. 
T Consensus 337 ~~~~p~~p-~~~~~~f---~~~~lr~iaaglGi~y-e~lt-~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~ 410 (530) T protein:vir:38 337 NLQSAQDT-DNGYSTF---EQSLLRYIAAGLGVSY-EQLS-RNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFL 410 (530) T ss_pred eeeCCCCC-CCCHHHH---HHHHHHHHHhhcCCCH-HHHh-cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHH Confidence 33333211 2233222 2333444555542211 1221 344456666667766777776666666777788999999 Q ss_pred HHHHHHHhcCCCCCCchh----hC--CcceEEEeecH-------HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCH Q lcl|NC_019445. 413 RAFSMMVRKNMLPPPPDA----ME--GMPLKVEYISV-------MAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNV 479 (559) Q Consensus 413 r~~~il~r~g~lp~~p~~----l~--g~~v~~~~is~-------La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~ 479 (559) +++..+...|.+|-|... .. -.-++++++.| +--++ +....|..-+. . T Consensus 411 ~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~--a~~~~i~~G~~----------------s- 471 (530) T protein:vir:38 411 CWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQ--EAVMLIEAGLS----------------T- 471 (530) T ss_pred HHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHH--HHHHHHHcCCC----------------C- Confidence 999999999998843210 00 01134444443 21100 00000111110 0 Q ss_pred HHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhhhhcCCChhHHHHHHHHhhcCCCCC Q lcl|NC_019445. 480 DQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGA-KTLSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) Q Consensus 480 d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (559) ...++...|.+. +||.+. ++...+.....-. 
..-...+ .+.+.......+ -..+++++ T Consensus 472 ---~~~~~a~~G~D~------~~v~~q---~a~e~~~~~~~Gl-~~~~~~~~~~~~~~~~~~~~--------~~d~~~~a 530 (530) T protein:vir:38 472 ---YEKECAKRGDDY------QEIFAQ---QVRESMERRAAGL-NPPAWAAAAFEAGVKKSNEE--------EQDGARAA 530 (530) T ss_pred ---HHHHHHHcCCCH------HHHHHH---HHHHHHHHHHcCC-CCCCCcccccCCCCCCCCCC--------CCCCCCCC Confidence 112222344433 222111 1111111100000 0000000 000000000000 00111111 No 165 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=71.90 E-value=0.19 Score=24.54 Aligned_cols=427 Identities=10% Similarity=0.085 Sum_probs=155.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccC--CCC-CCCCCCccc------ccCCCC--cchHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGS--RFL-TSEVNRNDR------RNTRII--DSTGTMAARTLA 69 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~--~~~-~~~~~~~~~------~~~~~~--~s~~~~a~~~La 69 (559) |.......+. ..+..... .|..|..+ +|. +....+..+ ..-+.| ++....|++..| T Consensus 31 ~~~~~~~~~~----k~~~~~~~---------~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a 97 (547) T protein:vir:63 31 IQQREQEQIS----KAMNNKEV---------AYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRS 97 (547) T ss_pred hhhhhHHHHH----Hhhcccch---------hhhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHH Confidence 2222211111 11111110 12333321 111 101111000 001122 234456666666 Q ss_pred HHHHHhhcC-----CCCcceeccCCcc--chhhHHHHHHHHHHHHHHHHHHHHhcc---------chHHHHHHHHHHHhh Q lcl|NC_019445. 70 SGMMSGITS-----PARPWFRLATPDP--EMMDYGPVKLWLEAVQNRMNDMFNKSN---------LYQSLPQLYGSLGTY 133 (559) Q Consensus 70 s~l~~~l~p-----p~~~Wf~l~~~d~--~~~~~~~v~~~l~~ve~~~~~~l~~sn---------f~~~~~~~~~dl~~~ 133 (559) ..+.+...| .+-. |.+.+.+. ...+.... 
-...++ ..|++-| |..-+...+.++.++ T Consensus 98 ~~ia~~~~~~~~~~~~~~-~~ir~k~~~~~~~~~~~~--~~~~l~----~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~ 170 (547) T protein:vir:63 98 NQVSMYCKPARHSEKGVG-FEVRLKDLDKKPTSHDEA--TIKRIE----SFIEKTGVDNDINRDSFSSFVKKIVRDTYMY 170 (547) T ss_pred HHHhhhhhhhhhhccCCC-ceeEecccccccChhhHH--HHHHHH----HHHHhhCCCCCCccchHHHHHHHHHHHHHhh Confidence 655432222 2222 33333332 21111111 111122 2333332 333455577888999 Q ss_pred CcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE Q lcl|NC_019445. 134 STGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMH 212 (559) Q Consensus 134 G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~ 212 (559) ||+++++..+. +.++.+.+++...+.+..+.+|.+..- .+.++. T Consensus 171 Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~-----------------------------------~~~y~~ 215 (547) T protein:vir:63 171 DQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDN-----------------------------------GNRFVQ 215 (547) T ss_pred CCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccC-----------------------------------ceEEEE Confidence 99998887653 456677777777766666666543110 000000 Q ss_pred EEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecC---CCcccccchHHHHHHHHHHHHHH Q lcl|NC_019445. 213 SVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNG---EDVYGSSCPGMLALGPVKALQLL 289 (559) Q Consensus 213 ~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~---g~~YGrG~P~~~~l~d~~~L~~l 289 (559) .+ .. + +.+.... ++ +++.|.+... ...||.| |...+...+...... T Consensus 216 ~~---~~----~----------~~~~~~~-----------~e--iih~r~n~~~~~~~~~~G~S-pi~~~~~~i~~~~~a 264 (547) T protein:vir:63 216 VI---DQ----K----------IVATFNA-----------RE--MAFAVRNPRSDIYATGYGYP-ELEIALKQFIAHENT 264 (547) T ss_pred Ec---CC----c----------EEEEecc-----------cc--EEEecccCCCCccccccccc-HHHHHHHHHHHHHHH Confidence 00 00 0 0000000 00 2333332222 2469999 898888777777777 Q ss_pred HHHHHHHHHHHhcCce--eecCCCc--ccc-------c-eecCC-----ceeecCCcCCchhhhhhhhccccHHHHHHHH Q lcl|NC_019445. 
290 QKRKSQLIDKATNPPM--VAPTSLK--NQR-------A-SLLPG-----DITYIDQITGQDGFRPAYLVNPSTADLVADI 352 (559) Q Consensus 290 ~~~~~~~~~~~~~p~~--~~p~~~~--~~~-------~-~~~pg-----~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i 352 (559) ++.......-...|.. .++.+.. ... + ...-| ++..+ .++.-.++|+. .++.-..+.+.. T Consensus 265 ~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl--~~~g~~~~~l~-~~~~d~qfle~~ 341 (547) T protein:vir:63 265 EAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVV--SAEDVKFVNMT-PSARDMEFEKWL 341 (547) T ss_pred HHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccccc--cCCCceEEEcC-CChhHHHHHHHH Confidence 7777777776677764 3454421 110 1 11111 11111 12223355554 233334455666 Q ss_pred HHHHHHHHHHhhcchhhhccCCCC-------CCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_019445. 353 QDTRQIINSAYFVDLFMMLQNINT-------RSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLP 425 (559) Q Consensus 353 ~~~~~rI~~af~~dl~~~~~~~~~-------~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp 425 (559) +.....|-++|-..+.......++ ..+|-.-+.+... .+....|.|++.+.-..|.+. ++| T Consensus 342 ~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~-----------~~~~~tL~P~~~~ie~~ln~~-L~~ 409 (547) T protein:vir:63 342 NYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQ-----------ASKNKGLQPLLGFIEDFINKH-IVA 409 (547) T ss_pred HHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHH-----------HHHHHHHHHHHHHHHHHHHhh-ccc Confidence 778889999998876554322211 1122111211111 123344555555554444432 333 Q ss_pred CCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc-----cccCCH Q lcl|NC_019445. 426 PPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP-----TVIVPQ 500 (559) Q Consensus 426 ~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~-----~~~rs~ 500 (559) .. +..+.+++..... ... .+...+...+. +. -+..++ +-+.+|.|+ +.+... 
T Consensus 410 ~~-----~~~~~~~f~~~~~-~~~-~~~~~~~~~~~-----~g-------~lT~NE----~R~~~gl~P~~egGD~~~~~ 466 (547) T protein:vir:63 410 EF-----GDKYTFQFVGGDI-KSE-LESVKILAEKA-----KV-------AMTVNE----VRKELNLPGDVIGGDIPLNG 466 (547) T ss_pred cc-----CCceEEEeecccc-ccH-HHHHHHHHHHh-----CC-------CcCHHH----HHHHhCCCCCCCCCceeecc Confidence 21 2346666654321 111 11111111110 01 122222 222334432 111110 Q ss_pred HHHH----HHHHHHHHHHHHHHHHHHHHHHH-HHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 501 EQVD----QARQQRAQQQQQQQMMAMGMAAA-QGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 501 ~ev~----~~rq~r~q~~q~~~~~~~~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) --+. .+.++.-+-+. +.+...+.. +.++.....+...++... ..+.-+.++ T Consensus 467 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~d~ 522 (547) T protein:vir:63 467 VIVQRIGQLMQQEQFEHEK---QQSNLQMLQEQTGNRVSTDVEDIPDGKD-----TTGDIGKDG 522 (547) T ss_pred cccccccccccccCCcccc---chhhccccccccCCCCCCCCCCCCCCcc-----cCCCcCccc Confidence 0000 00000000000 000000000 011111111111110000 000000000 No 166 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=71.66 E-value=0.19 Score=24.50 Aligned_cols=398 Identities=11% Similarity=0.078 Sum_probs=145.8 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCC-c-chHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRII-D-STGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~-~-s~~~~a~~~Las~l~~~l~p 78 (559) -+.++- .++.+|.. +.+.+ -|.. 
+..........-.++ + ++--.|++.+|+.+- T Consensus 9 ~~~p~~-----------~e~~~~~~---~~~~~-~~~~----~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA----- 64 (518) T protein:vir:10 9 LSAPAM-----------AELSPQMQ---DSYYY-APAV----GMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA----- 64 (518) T ss_pred ecCchh-----------hhhhhhhh---ccccc-cccc----ceecccccchhhHHHhhhHHHHHHHHHHHHhhc----- Confidence 001100 00011100 00000 0000 000000000000001 1 122344555555443 Q ss_pred CCCcc--eeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccc----hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEE Q lcl|NC_019445. 79 PARPW--FRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNL----YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTM 151 (559) Q Consensus 79 p~~~W--f~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~ 151 (559) +-|| ++-........ ....+...+.+=|- +.-+...+.++.++||+++++..+. +.++.+. T Consensus 65 -~lpl~l~~~~~~~~~~~-----------~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~ 132 (518) T protein:vir:10 65 -RLPVKCMFTSGDTETEE-----------SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLM 132 (518) T ss_pred -cCceEEEEEcCCCceec-----------cchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE Confidence 2234 33222111100 11222333333332 3334566778889999999988754 4456666 Q ss_pred EeeccEEEEeeCCCC-CEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRG-SVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 152 ~~~l~~~~v~~d~~G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) +++.+.+.+..+..+ .+.-.|. .. .......+ T Consensus 133 ~l~p~~v~v~~~~~~~~~~y~~~---~~----------------------~~~~~~~~---------------------- 165 (518) T protein:vir:10 133 PMHPSRVAIKRNSRTGRYEYYFQ---AG----------------------AGVGTQLV---------------------- 165 (518) T ss_pred EECCCceEEEEcCCCCEEEEEEE---ec----------------------CCccceEE---------------------- Confidence 677777766665432 2111110 00 00000000 Q ss_pred cEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--c Q lcl|NC_019445. 
231 PFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--P 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p 308 (559) ... . -=+++.|+...+|..||.| |..-+...+.....+.+.......-...|..++ + T Consensus 166 -------~~~-~------------~eViHir~~s~dg~~~G~s-pi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~ 224 (518) T protein:vir:10 166 -------SFA-D------------DEVVPIRFFNPDGLERGLS-LMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE 224 (518) T ss_pred -------Eec-C------------CcEEEecCCCCCccccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC Confidence 000 0 0134455555566678999 888777777777777777777777777786543 4 Q ss_pred CCCcccc-------ce------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC Q lcl|NC_019445. 309 TSLKNQR-------AS------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN 375 (559) Q Consensus 309 ~~~~~~~-------~~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~ 375 (559) ..+.... +. -..|++...+ ++..++|+.. ++.-..+.+..+..+..|-++|-..... ++..+ T Consensus 225 ~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~---~G~~~~~l~~-s~~D~q~le~r~~~~~eIa~afgVPp~~-lg~~~ 299 (518) T protein:vir:10 225 KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE---EGMEPIPLQL-TAVEMQFIEARQLNREEVCGVYDIAPPI-VHILD 299 (518) T ss_pred CCCCHHHHHHHHHHHHHHhcCccccCcceEcC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHH-hccCC Confidence 3332210 11 1123333332 2233555542 2333334555666778899999875433 33222 Q ss_pred CCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHH Q lcl|NC_019445. 376 TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSS 455 (559) Q Consensus 376 ~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~ 455 (559) .. |-.-+.+... .+...-+.|++.+.-..+.+. ++++.. .+.-+++.+ +.|-+. +... 
T Consensus 300 ~~--t~sn~eq~~~-----------~f~~~tL~P~l~~ie~~ln~~-L~~~~~---~~~~~~fd~-~~llr~----D~~~ 357 (518) T protein:vir:10 300 RA--TFSNISAQMR-----------AFYRDTMAIPIARIQSAMDKY-VGQYWV---RKNRMKFDI-DDVIQP----DWEA 357 (518) T ss_pred CC--CchhHHHHHH-----------HHHHHHHHHHHHHHHHHHHHh-hccccc---CCceEEEec-hhhhcc----CHHH Confidence 21 2221211111 133445667766666655543 344322 122344433 233111 2211 Q ss_pred HHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc-------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 456 LASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT-------VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQ 528 (559) Q Consensus 456 l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-------~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~ 528 (559) ....+. .+.+.+ -+.+++ +-+.+|.|+- ++.+..- .-+-.. .. T Consensus 358 r~~~~~---~~~~~G-----~lT~NE----~R~~~Gl~pie~~~gD~~~~~~n~-~pl~~~-----------------~~ 407 (518) T protein:vir:10 358 KSESTQ---KMVNSG-----VATPNE----GREIMGLPRSDDPKADELYANSAL-QPLGAT-----------------PD 407 (518) T ss_pred HHHHHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCCCeeeecccc-eecccc-----------------cc Confidence 111111 111111 123333 2234454321 1111000 000000 00 Q ss_pred HHhhhhhhcCCChhHHHHHHHHhh----cCCCCCC Q lcl|NC_019445. 529 GAKTLSEAKTSDPSVLSAMANAVS----GQGGQSQ 559 (559) Q Consensus 529 ~a~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 559 (559) .....++++.....+.+...+.-. +.++... 
T Consensus 408 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) T protein:vir:10 408 GAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSP 442 (518) T ss_pred cccCCCCCCCCCCCCccccccccccccccCCCCCc Confidence 000000000000000000000000 0000000 No 167 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=71.10 E-value=0.2 Score=24.41 Aligned_cols=401 Identities=10% Similarity=0.075 Sum_probs=146.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCC--cchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRII--DSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~--~s~~~~a~~~Las~l~~~l~p 78 (559) -+.|... ++++|... ... .-|..+ ...+......-..| +++--.|++.+|+.+-+ T Consensus 9 ~~~p~~~-----------~~~~~~~~---~~~-~~~~~g----~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~---- 65 (518) T protein:vir:78 9 LSAPAMA-----------ELSPQMQD---SYY-YAPAVG----MQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR---- 65 (518) T ss_pred eccchhh-----------hhhhhhhh---ccc-ccceec----eecccccchhhHHhhhhHHHHHHHHHHHHhhcc---- Confidence 1111110 11111100 000 001110 00000000000001 11223455555554432 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEe Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPF 153 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~ 153 (559) .||--+...+....+ .....+...+.+=| .+.=+..++.+|.++||+.+++..+. 
+.++.+.++ T Consensus 66 --lp~~l~~~~~~~~~~---------~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l 134 (518) T protein:vir:78 66 --LPVKCMFTSGDTETE---------EHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPM 134 (518) T ss_pred --CceEEEEEcCCcccc---------ccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEE Confidence 244322221111000 01112222333333 22335567788889999999988754 345566666 Q ss_pred eccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEE Q lcl|NC_019445. 154 PIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFK 233 (559) Q Consensus 154 ~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~ 233 (559) +.+.+.+..+.++....++ |... .......++ +| T Consensus 135 ~p~~Vtv~~~~~~~~~~y~--~~~~----------------------~~~~~~~~~-----~~----------------- 168 (518) T protein:vir:78 135 HPSRVAIKRNSRTGRYEYY--FQAG----------------------AGVGTQLVS-----FA----------------- 168 (518) T ss_pred CCCceEEEEcCCCCEEEEE--EEec----------------------CCccceeEE-----ec----------------- Confidence 6666666555432211100 0000 000000000 00 Q ss_pred EEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCC Q lcl|NC_019445. 234 SVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSL 311 (559) Q Consensus 234 sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~ 311 (559) .-=++++|+...+|..||.| |...+...+.......+.......-...|..++ +..+ T Consensus 169 --------------------~~eIiHir~~~~dg~~~G~S-pi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~l 227 (518) T protein:vir:78 169 --------------------DDEVVPIRFFNPDGLERGLS-LMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL 227 (518) T ss_pred --------------------CCcEEEecCCCCCccccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCC Confidence 01144555555567679999 898777777777777777777677777786554 4333 Q ss_pred cccc-------ce------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
312 KNQR-------AS------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 312 ~~~~-------~~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) .... +. -..|++.+.+ ++-.++|+.. ++.-..+.+..+.....|-++|-..... ++..+.. T Consensus 228 s~e~~~~~k~~~~~~~~G~~nag~~~vL~---~G~~~~~l~~-~~~d~q~le~r~~~~~eIa~afgVPp~~-lg~~~~s- 301 (518) T protein:vir:78 228 SPEAQQRLREQFDRAHAGSSNTGKTMVVE---EGMEPIPLQL-TAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRA- 301 (518) T ss_pred CHHHHHHHHHHHHHHhcCcccCCceeEcC---CCceEEeccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHH-hccCCCC- Confidence 2211 10 0123334332 2233555543 2333334555566778899999775433 3322221 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) |..-+.+... .+...-+.|++.+.-..+.+ .++++.. .+.-+++.. +.|-+. +...... T Consensus 302 -t~sn~e~~~~-----------~f~~~tL~P~~~~ie~eln~-~L~~~~~---~~~~~~fd~-~~Llr~----D~~~r~~ 360 (518) T protein:vir:78 302 -TFSNISAQMR-----------AFYRDTMAIPIARIQSAMDK-YVGQYWV---RKNRMKFDI-DDVIQP----DWEAKSE 360 (518) T ss_pred -CchhHHHHHH-----------HHHHHHHHHHHHHHHHHHHH-hhccccc---CcceEEeec-hhhhcc----CHHHHHH Confidence 2222211111 13344566666666555543 2343322 122344432 233211 2221222 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc-------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT-------VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAK 531 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-------~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~ 531 (559) .+. .+.+.+ -+.+++ +-+.+|.|+- ++.+..-+ -+- ....+.. 
T Consensus 361 ~~~---~~~~~G-----~lT~NE----~R~~~gl~pie~~~gD~~~v~~n~~-pl~-----------------~~~~~~~ 410 (518) T protein:vir:78 361 STQ---KMVNSG-----VATPNE----GREIMGLPRSDDPKADELYANSALQ-PLG-----------------ATPDGAV 410 (518) T ss_pred HHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCCceeeecccce-ecc-----------------ccccccc Confidence 222 111111 123223 2233454321 11110000 000 0000000 Q ss_pred hhhhhcCCChhHHHHHHHHh----hcCCC-------------------------CCC Q lcl|NC_019445. 532 TLSEAKTSDPSVLSAMANAV----SGQGG-------------------------QSQ 559 (559) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~----~~~~~-------------------------~~~ 559 (559) ...+++.....+.+...+.. .+.++ ++. T Consensus 411 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (518) T protein:vir:78 411 EGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKE 467 (518) T ss_pred CCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccchhcccCCCCccc Confidence 00011110000000000000 00000 000 No 168 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=69.74 E-value=0.22 Score=24.20 Aligned_cols=369 Identities=12% Similarity=0.051 Sum_probs=137.2 Q ss_pred CChhhHHHHHHHHHHHHHHhhh-HHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCc-chHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQS-FEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIID-STGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-s~~~~a~~~Las~l~~~l~p 78 (559) |- |......|+. .-..+..+..+.= ....+...+ ....+. ++--.|++.+|+.+. 
.+ T Consensus 1 m~----------f~~~~~~~~~~~~~~~~~~~~~~g---~~~~~~~v~-----~~~al~~~~v~~~i~~ia~~ia-~l-- 59 (409) T protein:vir:10 1 ML----------FRKGFKNQSQEISIDDKKILEWLG---INPSETYVN-----GKSCLKQATVFGCIRILSDNIS-KL-- 59 (409) T ss_pred Cc----------ccccccCcCCCCCCChHHHHHHhc---CCcCcceec-----hhhhhccHHHHHHHHHHHHhhh-hC-- Confidence 21 1111111110 0001111111110 000000000 011122 233445555555433 22 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCcEEEEEeecCC-ceEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYSTGAMAVLEDDE-DIIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~~-~~~~~~~ 152 (559) ||-=..-.+.. .+ + .+..+...|+ +- +.+.-+...+.++.++|||.+++..+.. ....+.+ T Consensus 60 ---p~~~~~~~~~~-~~---~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~ 126 (409) T protein:vir:10 60 ---PIKIYQKKDGI-KR---V------PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYP 126 (409) T ss_pred ---ceEEEEecCCe-ee---c------cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEE Confidence 34211211111 00 0 1112233343 22 2333456677889999999999876543 3455666 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) +|....-+..|.+|....- ..+. |. +. T Consensus 127 i~~~~V~v~~~~~~~~~~~----------------------------------~~~~-y~-~~----------------- 153 (409) T protein:vir:10 127 LKSDGMKIFVDDTGLLNSE----------------------------------NNVW-YL-YT----------------- 153 (409) T ss_pred EcCCceEEEEcCCcccccc----------------------------------ceEE-EE-EE----------------- Confidence 6666655555554432110 0110 00 00 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC Q lcl|NC_019445. 
233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~ 310 (559) ...+.. .. |...=+++.|....+ ..||.| |.+.+...+.......+.......-...|..++ +.. T Consensus 154 -----~~~g~~-~~-----~~~~evih~r~~~~d-~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~ 220 (409) T protein:vir:10 154 -----DDLGQR-HK-----FMSDEILHFKGLTAD-GLAGLS-VIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGD 220 (409) T ss_pred -----eCCcee-EE-----eccccEEEecCcCCC-Cccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC Confidence 000000 00 001114455544333 489999 888777777777777777777777777787654 333 Q ss_pred Ccccc-------ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 311 LKNQR-------AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 311 ~~~~~-------~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) +.... ++ . ..|++.+.+ ++-.++|+.. ++.-..+.+..+.....|-++|-..........++. T Consensus 221 l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 296 (409) T protein:vir:10 221 LNPEAEEVFKENFERMSSGLKNAHRIAMLP---IGYKFEPISQ-KLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT 296 (409) T ss_pred CCHHHHHHHHHHHHHHhccccccCCceecC---CCceEEEccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc Confidence 32211 10 1 133344332 2234666543 233334455566778889999987544332222233 Q ss_pred CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh-CCcceEEEee-----cHHHHH---H Q lcl|NC_019445. 378 SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAM-EGMPLKVEYI-----SVMAQA---Q 448 (559) Q Consensus 378 ~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l-~g~~v~~~~i-----s~La~a---~ 448 (559) .-++++.. ..=....|.|....++.|+-.-| +++ .++ .|..+++... ...+++ . 
T Consensus 297 ~~~~e~~~--~~f~~~~l~P~~~~ie~~ln~kL-------------~~~--~~~~~~~~~~fd~~~ll~~d~~~~~~~~~ 359 (409) T protein:vir:10 297 HSNITEQN--REFYIDTLQSILNMYELEINYKL-------------FLI--SEIKNGFYSKFNVDTILRADIKTRYESYK 359 (409) T ss_pred cccHHHHH--HHHHHHHHHHHHHHHHHHHHHhh-------------cCc--hhccCCcEEEEechhhhccCHHHHHHHHH Confidence 33443322 11122334444444444433222 111 111 1111222211 111111 1 Q ss_pred HHHHH-----HHHHHHHHHHHHHhccChhhHh---cCCHHHHHHHHHHHcCCC Q lcl|NC_019445. 449 KSIGL-----SSLASTVNFIGQLAQAKPEALD---KLNVDQAIDAFADMSGVS 493 (559) Q Consensus 449 r~~~~-----~~l~~~~~~~~~la~~~P~~~~---~id~d~~~~~~a~~~Gvp 493 (559) +..+. +-+...++ +-.+-..+ .... .+-.+.+-+....+ |-- T Consensus 360 ~~~~~G~~T~NE~R~~lg-l~p~~ggD-~~~~~~n~~~~~~~~~~~~kg-Ge~ 409 (409) T protein:vir:10 360 EAIQNGFKTPNEIRELEE-DEPLEGGD-VLLINGNMIPVKMAGEQYSKG-GEK 409 (409) T ss_pred HHHhCCCcCHHHHHHHhC-CCCCCCcC-eeeeccCccchhhcccccccc-CCC Confidence 11110 11111111 11111100 0000 00000000000000 110 No 169 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=69.48 E-value=0.22 Score=24.16 Aligned_cols=314 Identities=9% Similarity=0.050 Sum_probs=127.4 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~ 80 (559) |.++..+...+ ...+ ...-|+=. 
++.+...-..+- ..+...+-.+ T Consensus 1 m~~~~~~~~~~------~~~~---------------~~~~~~~~-------------~p~~~~~~~~~~-~~~~~~~~~~ 45 (337) T protein:vir:78 1 MTKRQQQPAQA------AASS---------------PRPSVVFS-------------MPEAIDPTAWMT-DYTGVFYNPY 45 (337) T ss_pred CCCcccCcccc------cccC---------------ceeEEEec-------------CcccccCcchhH-hhhhhhhccC Confidence 55543221100 0000 00001000 111110000011 1333344445 Q ss_pred CcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH--hccc---hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEee Q lcl|NC_019445. 81 RPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN--KSNL---YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFP 154 (559) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~--~snf---~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~ 154 (559) ..|+.--++-..+++.-.+..+...+ +....+ .+.| +..+..+..|+.+||||.+++..+. +.++.+.++| T Consensus 46 ~~~~~pP~~~~~La~l~~~~~~h~~~---L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~ 122 (337) T protein:vir:78 46 GEYYQPPIDRKGLAKVARANAHHGAI---LMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLS 122 (337) T ss_pred cceecCCCCHHHHHHHhhcchhhhhH---HHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeC Confidence 66764333223344443333332211 111111 2233 3467778899999999999887753 5566665554 Q ss_pred ccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 155 IGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 155 l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) .. ++.+..+|+. +|.. .. ...+. + T Consensus 123 ~~--~v~~~~d~~~--~~~~----------~~------------------~~~~~-----~------------------- 146 (337) T protein:vir:78 123 SV--YLRRREDGCF--VYLQ----------QG------------------KPNLI-----Y------------------- 146 (337) T ss_pred Cc--eeEeeeCCeE--EEEE----------cC------------------CceEE-----E------------------- Confidence 43 3443333321 1100 00 00000 0 Q ss_pred EEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCC-C Q lcl|NC_019445. 
235 VYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTS-L 311 (559) Q Consensus 235 v~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~-~ 311 (559) ..--+++.|.....+..||.+ |..-++..+-.-+..++-..+...-...|..+ +++. + T Consensus 147 ------------------~~~eIiHik~~~~~~~~~Gls-~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l 207 (337) T protein:vir:78 147 ------------------RPDDVIWLAQYDPEQQVYGMP-DYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNM 207 (337) T ss_pred ------------------CCccEEEECCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Confidence 001133444333345699998 77766655544444444334444444566654 3442 2 Q ss_pred cccc-------ce--ecC--CceeecCCcCC---chhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Q lcl|NC_019445. 312 KNQR-------AS--LLP--GDITYIDQITG---QDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTR 377 (559) Q Consensus 312 ~~~~-------~~--~~p--g~~~~~~~~~~---~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~ 377 (559) .... +. ..+ ++...+..+++ +-.+.|+.....+. .+.+.-+-.++.|-++|-+.+..+....+.. T Consensus 208 ~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~-qfle~k~~s~~eIa~a~~VPp~llGi~~~~~ 286 (337) T protein:vir:78 208 DDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKD-EFAAIKGITAQDVLTAHRYPPALAGIIPTNG 286 (337) T ss_pred CHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHH-HHHHHHHHhHHHHHHHhCCCHHHcccccCCC Confidence 2211 11 011 11122222221 12345554332233 3344445567789899987654433222222 Q ss_pred CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHH Q lcl|NC_019445. 378 SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVM 444 (559) Q Consensus 378 ~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~L 444 (559) .-|-..+.+... .+...-|.|+++++...+.+.++ |+ .. 
-..++...-+.| T Consensus 287 ~~~~~n~e~~~~-----------~f~~~~L~P~~~~ie~~~n~~ll-~~---~~-~~~f~~~~~~~~ 337 (337) T protein:vir:78 287 GGGLGDPEKYDA-----------TYARNEVLPLCELVQDAINSAGL-PR---AL-WVTFRETIGAAV 337 (337) T ss_pred cCccccHHHHHH-----------HHHHHHHHHHHHHHHHHHhhhcC-Ch---hh-ceeccccccccC Confidence 222112222111 14455577777777777765442 21 11 012333333333 No 170 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=68.59 E-value=0.24 Score=24.03 Aligned_cols=393 Identities=12% Similarity=0.164 Sum_probs=155.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcc-cccCCCC--------cchHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRND-RRNTRII--------DSTGTMAARTLASG 71 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~~~~--------~s~~~~a~~~Las~ 71 (559) |.--.+. .+ .++.. + +...+... ......| ...+..+|++.|.. T Consensus 1 ~~~~~~d--------------~~-------~~~~~---~---~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed 53 (427) T protein:vir:10 1 MKIVKHD--------------GY-------NDIFN---G---GADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEE 53 (427) T ss_pred CCccccc--------------hH-------HHHhh---c---CCCCcccCccccCchHHHHHHHHcCchhhhhhccchHH Confidence 1111110 00 01100 0 00000000 0000111 22233333333333 Q ss_pred HHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEE Q lcl|NC_019445. 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTM 151 (559) Q Consensus 72 l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~ 151 (559) ++ +.|+.++-.+ + .. 
.+.+.+++-++...+.++++.--+||.+++++.-+..+.+.- T Consensus 54 ~~-------r~g~~i~g~~-~---~~-----------~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~- 110 (427) T protein:vir:10 54 MV-------TAGFKMSGVK-D---EK-----------EFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTS- 110 (427) T ss_pred hh-------cCCccccCcc-H---HH-----------HHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCcccc- Confidence 22 6899886422 1 11 122334445788899999999999999999887655443321 Q ss_pred EeeccEEEEeeCCCCCEEE--EEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDI--CFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKN 229 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~--i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~ 229 (559) |+ +..|.+-. ++-+..+|+.. +-.+-+ +.++. +.+.|+ |.++.. T Consensus 111 --p~-------~~~g~l~~l~v~d~~~~~~~~----~~~dp~---------s~~fg-~P~~y~-v~~~~~---------- 156 (427) T protein:vir:10 111 --QA-------KPGAKLEGVRVYDRFAITVEK----RVTNAR---------SPRYG-EPEIYK-VSPGDN---------- 156 (427) T ss_pred --cc-------CCCcceeEEEEechhcccccc----cccCcc---------ccccC-cceEEE-EecCCC---------- Confidence 11 23333322 22222333211 100100 01111 122222 111110 Q ss_pred ccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_019445. 230 KPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLA-LGPVKALQLLQKRKSQLIDKATNPPMVAP 308 (559) Q Consensus 230 ~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~-l~d~~~L~~l~~~~~~~~~~~~~p~~~~p 308 (559) .+ .+.+|. .+++.-.|+. 
.| -+.+..+..||.| |...+ .+.++..+.......+.+.++.-..+.++ T Consensus 157 ~~--~~~iH~----SRli~~~g~~-~p----~~~~~~~~~~G~S-~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~ 224 (427) T protein:vir:10 157 MQ--PYLIHH----SRVFIADGER-VA----QQARKQNQGWGAS-VLNKSLIDAICDYDYCESLATQILRRKQQAVWKVK 224 (427) T ss_pred Cc--ceEEcc----ccEEEecCCC-ch----hhhcccCCcccch-hhhHHHHHHHHHHHHHHHHHHHHHHHhccccccch Confidence 00 011221 1232222211 23 3455677788998 67554 46677777777777776666555444432 Q ss_pred C-------CCcc----ccce----ecC-CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhcc Q lcl|NC_019445. 309 T-------SLKN----QRAS----LLP-GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQ 372 (559) Q Consensus 309 ~-------~~~~----~~~~----~~p-g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~ 372 (559) + +... .++. ... ++...+. +..+.+..+. .++..+-..+....+.|.-+.=-.+--..+ T Consensus 225 ~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~--~~~e~~e~~~---~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G 299 (427) T protein:vir:10 225 GLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGID--AETEEYDVLN---SDISGVPEFLSSKMDRIVSLSGIHEIIIKN 299 (427) T ss_pred hHHHHhcCccchHHHHHHHHHHHHhcCcccceeee--cCCCceeEEe---cccCChHHHHHHHHHHHHhhhCCCeeeecc Confidence 1 1100 0111 011 1112221 1112233332 234444455666667776666332111222 Q ss_pred CCCCCCcCHH---HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHH Q lcl|NC_019445. 373 NINTRSMPVE---AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQK 449 (559) Q Consensus 373 ~~~~~~~TA~---Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r 449 (559) ..+....+| +++. .---+..++...+.|++++.+.++.+.. ++++++- ||-+... 
T Consensus 300 -~sp~Glnstgd~D~~n--------yyd~i~~~Qe~~l~p~l~~l~~~i~~s~------------~~~~~f~-pL~~~s~ 357 (427) T protein:vir:10 300 -KNVGGVSASQNTALET--------FYKLVDRKREEDYRPLLEFLLPFIVDEE------------EWSIEFE-PLSVPSK 357 (427) T ss_pred -CCccccccchhHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHhhcCC------------CcEEEeC-CCCCCCH Confidence 234445553 2222 2222333556678999999999987641 3566654 3332222 Q ss_pred HHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHH---cCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 450 SIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADM---SGVSPTVIVPQEQVDQARQQRAQQQQQQQMMAMGMAA 526 (559) Q Consensus 450 ~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~---~Gvp~~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~ 526 (559) ...++--....+.+..+.+.+ .++++++-+.+... .|++...-.+.++.++. .. T Consensus 358 kEkaei~~~~a~a~~~~~~~g-----vi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~--------------~e---- 414 (427) T protein:vir:10 358 KEESEITKNNVESVTKAITEQ-----IIDLEEARDTLRSIAPEFKLKDGNNINIREPEET--------------TE---- 414 (427) T ss_pred HHHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHhhhccccCCCCccccccccchh--------------cC---- Confidence 222221122222233332222 36666666655433 23322111112221100 00 Q ss_pred HHHHhhhhhhcCCChh Q lcl|NC_019445. 527 AQGAKTLSEAKTSDPS 542 (559) Q Consensus 527 ~~~a~~~~~~~~~~~~ 542 (559) +....+....+.+ T Consensus 415 ---~~p~~~e~~~d~~ 427 (427) T protein:vir:10 415 ---PEPGLGEKLEDEN 427 (427) T ss_pred ---CCCCCCCCCCCCC Confidence 0000011111112 No 171 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=64.95 E-value=0.29 Score=23.52 Aligned_cols=444 Identities=10% Similarity=0.040 Sum_probs=147.4 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCC--CCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTS--EVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~--~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |.+...|.+.+.=....+-.......-+.++. -|....+..+ .... ... ..++-..++...-+.++.+-.-.+- T Consensus 42 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~l~-~~~-~n~i~~~~I~t~~~~vA~~~~~~~~ 117 (563) T protein:vir:99 42 EYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRD--KRSYMKNEHNLHDVLK-KFG-NNPILNAIILTRSNQVAMYCQPARY 117 (563) T ss_pred hHHHHHhhhccCCCcchhhhHhhhcccccccc--cccCCCCcccHHHHHH-Hhh-cchHHHHHHHHHHHHHHHHhhhhhh Confidence 33322221100000000000011111111111 0110000000 0000 000 0112222223222222222111111 Q ss_pred --CCCcc-eeccCCccchhhHHHHHHHHHHHHHHHHHHHH-----hccchHHHHHHHHHHHhhCcEEEEEe--ec-CCce Q lcl|NC_019445. 79 --PARPW-FRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-----KSNLYQSLPQLYGSLGTYSTGAMAVL--ED-DEDI 147 (559) Q Consensus 79 --p~~~W-f~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~G~~~l~v~--~~-~~~~ 147 (559) ....| ++|.-.+....+.. ......+++.+..... ..+|..-+..++.|+.++|||.+|+. .+ .+.+ T Consensus 118 ~~~~~~~~i~l~~~~~~~~~~~--~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~ 195 (563) T protein:vir:99 118 SEKGLGFEVRLRDLDAEPGRKE--KEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKL 195 (563) T ss_pred hcccccceeEEeecCCCcchhh--hhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCce Confidence 11222 33322222111110 1111122222222111 23455666678899999999988754 33 3456 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) +.+.+++...+.+..+.+|.+-.-..+|..+.. 
+ ..+ T Consensus 196 ~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~---------------------g---~~~------------------- 232 (563) T protein:vir:99 196 EKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVD---------------------K---RVV------------------- 232 (563) T ss_pred EEEEEeCCceeEEEECCCCceeccceeEEEEeC---------------------C---cee------------------- Confidence 677777778877777777654221111111000 0 000 Q ss_pred ccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeec---CCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 228 KNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN---GEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~---~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .....+ + .+..+.... ....||.| |...+...+.....+++.......-...|. T Consensus 233 ---------~~~~~~-e------------vI~~~~~~~~d~~~~~~G~S-pi~~a~~~i~~~~~~~~~~~~~f~ng~~p~ 289 (563) T protein:vir:99 233 ---------ASFTSR-E------------LAMGIRNPRTELSSSGYGLS-EVEIAMKEFIAYNNTESFNDRFFSHGGTTR 289 (563) T ss_pred ---------EEecCc-c------------eEEEeccCCCCcccCcccch-HHHHHHHHHHHHHHHHHHHHHHHHccCCCc Confidence 000000 0 011111111 12469999 898888888877788887777777777787 Q ss_pred e--eecCCCc--cc-------cce-ecC-----CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_019445. 305 M--VAPTSLK--NQ-------RAS-LLP-----GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDL 367 (559) Q Consensus 305 ~--~~p~~~~--~~-------~~~-~~p-----g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl 367 (559) . .++++.. .. .+. ..- |++..+ . +++-.++|+.. ++.-..+.+..+..+..|-++|-.++ T Consensus 290 giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~v-l-~~G~~~~~l~~-~~~d~qfle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:99 290 GILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVV-M-ADDIKFVNMTP-TANDMQFEKWLNYLINIISALYGIDP 366 (563) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEE-c-CCCceEEeccC-ChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 4 4454421 11 011 111 221111 1 22233555532 33334455666778899999998875 Q ss_pred hhhccCCCCC-----------CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcce Q lcl|NC_019445. 
368 FMMLQNINTR-----------SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPL 436 (559) Q Consensus 368 ~~~~~~~~~~-----------~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v 436 (559) ........+. +-++++.. ..=....|.|.+.+++.+|-.-| +|+. +..+ T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~--~~f~~~tL~P~l~~ie~~ln~~L-------------~~~~-----~~~~ 426 (563) T protein:vir:99 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQ--QQSQNKGLQPLLRFIEDLVNRHI-------------ISEY-----GDKY 426 (563) T ss_pred HHccccccccccccccccchhhccHHHHH--HHHHHHHHHHHHHHHHHHHHhhh-------------chhc-----cccc Confidence 5442211111 11222111 12223345555555544443322 2221 2345 Q ss_pred EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc----cccCCHHHHHHH---HHH Q lcl|NC_019445. 437 KVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP----TVIVPQEQVDQA---RQQ 509 (559) Q Consensus 437 ~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~----~~~rs~~ev~~~---rq~ 509 (559) .+++..+=. ..+.... .+... +. +. -+.++ ++-+.+|.|+ +.+...--+... .+. T Consensus 427 ~~~f~r~D~-~~~~e~~-~~~~~---~~--~G-------~lT~N----E~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~ 488 (563) T protein:vir:99 427 TFQFVGGDT-KSATDKL-NILKL---ET--QI-------FKTVN----EAREEQGKKPIEGGDIILDASFLQGTAQLQQD 488 (563) T ss_pred EEEeccCCH-HHHHHHH-HHHHH---hc--CC-------ccCHH----HHHHHhCCCCCCCcceeecccccccccccccc Confidence 566543311 1111111 11110 00 01 01212 2222334432 122111100000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhh-----hhcCCChh---------HHHHHHHHhhcC-CCCCC Q lcl|NC_019445. 510 RAQQQQQQQMMAMGMAAAQGAKTLS-----EAKTSDPS---------VLSAMANAVSGQ-GGQSQ 559 (559) Q Consensus 510 r~q~~q~~~~~~~~~~~~~~a~~~~-----~~~~~~~~---------~~~~~~~~~~~~-~~~~~ 559 (559) .....+.+ ........++...-. 
+..+.+.+ .+.+.=+.-..+ +.-.| T Consensus 489 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (563) T protein:vir:99 489 KQYNDGKQ--KERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQ 551 (563) T ss_pred cCCCcccc--chhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccc Confidence 00000000 000001110000000 00000000 000000000000 00000 No 172 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=64.95 E-value=0.29 Score=23.52 Aligned_cols=444 Identities=10% Similarity=0.040 Sum_probs=147.4 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCC--CCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTS--EVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~--~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |.+...|.+.+.=....+-.......-+.++. -|....+..+ .... ... ..++-..++...-+.++.+-.-.+- T Consensus 42 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~l~-~~~-~n~i~~~~I~t~~~~vA~~~~~~~~ 117 (563) T protein:vir:95 42 EYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRD--KRSYMKNEHNLHDVLK-KFG-NNPILNAIILTRSNQVAMYCQPARY 117 (563) T ss_pred hHHHHHhhhccCCCcchhhhHhhhcccccccc--cccCCCCcccHHHHHH-Hhh-cchHHHHHHHHHHHHHHHHhhhhhh Confidence 33322221100000000000011111111111 0110000000 0000 000 0112222223222222222111111 Q ss_pred --CCCcc-eeccCCccchhhHHHHHHHHHHHHHHHHHHHH-----hccchHHHHHHHHHHHhhCcEEEEEe--ec-CCce Q lcl|NC_019445. 79 --PARPW-FRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-----KSNLYQSLPQLYGSLGTYSTGAMAVL--ED-DEDI 147 (559) Q Consensus 79 --p~~~W-f~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~G~~~l~v~--~~-~~~~ 147 (559) ....| ++|.-.+....+.. ......+++.+..... ..+|..-+..++.|+.++|||.+|+. 
.+ .+.+ T Consensus 118 ~~~~~~~~i~l~~~~~~~~~~~--~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~ 195 (563) T protein:vir:95 118 SEKGLGFEVRLRDLDAEPGRKE--KEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKL 195 (563) T ss_pred hcccccceeEEeecCCCcchhh--hhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCce Confidence 11222 33322222111110 1111122222222111 23455666678899999999988754 33 3456 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) +.+.+++...+.+..+.+|.+-.-..+|..+.. + ..+ T Consensus 196 ~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~---------------------g---~~~------------------- 232 (563) T protein:vir:95 196 EKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVD---------------------K---RVV------------------- 232 (563) T ss_pred EEEEEeCCceeEEEECCCCceeccceeEEEEeC---------------------C---cee------------------- Confidence 677777778877777777654221111111000 0 000 Q ss_pred ccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeec---CCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019445. 228 KNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVN---GEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~---~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~ 304 (559) .....+ + .+..+.... ....||.| |...+...+.....+++.......-...|. T Consensus 233 ---------~~~~~~-e------------vI~~~~~~~~d~~~~~~G~S-pi~~a~~~i~~~~~~~~~~~~~f~ng~~p~ 289 (563) T protein:vir:95 233 ---------ASFTSR-E------------LAMGIRNPRTELSSSGYGLS-EVEIAMKEFIAYNNTESFNDRFFSHGGTTR 289 (563) T ss_pred ---------EEecCc-c------------eEEEeccCCCCcccCcccch-HHHHHHHHHHHHHHHHHHHHHHHHccCCCc Confidence 000000 0 011111111 12469999 898888888877788887777777777787 Q ss_pred e--eecCCCc--cc-------cce-ecC-----CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_019445. 
305 M--VAPTSLK--NQ-------RAS-LLP-----GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDL 367 (559) Q Consensus 305 ~--~~p~~~~--~~-------~~~-~~p-----g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl 367 (559) . .++++.. .. .+. ..- |++..+ . +++-.++|+.. ++.-..+.+..+..+..|-++|-.++ T Consensus 290 giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~v-l-~~G~~~~~l~~-~~~d~qfle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:95 290 GILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVV-M-ADDIKFVNMTP-TANDMQFEKWLNYLINIISALYGIDP 366 (563) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEE-c-CCCceEEeccC-ChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 4 4454421 11 011 111 221111 1 22233555532 33334455666778899999998875 Q ss_pred hhhccCCCCC-----------CcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcce Q lcl|NC_019445. 368 FMMLQNINTR-----------SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPL 436 (559) Q Consensus 368 ~~~~~~~~~~-----------~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v 436 (559) ........+. +-++++.. ..=....|.|.+.+++.+|-.-| +|+. +..+ T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~--~~f~~~tL~P~l~~ie~~ln~~L-------------~~~~-----~~~~ 426 (563) T protein:vir:95 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQ--QQSQNKGLQPLLRFIEDLVNRHI-------------ISEY-----GDKY 426 (563) T ss_pred HHccccccccccccccccchhhccHHHHH--HHHHHHHHHHHHHHHHHHHHhhh-------------chhc-----cccc Confidence 5442211111 11222111 12223345555555544443322 2221 2345 Q ss_pred EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc----cccCCHHHHHHH---HHH Q lcl|NC_019445. 437 KVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP----TVIVPQEQVDQA---RQQ 509 (559) Q Consensus 437 ~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~----~~~rs~~ev~~~---rq~ 509 (559) .+++..+=. ..+.... .+... +. +. -+.++ ++-+.+|.|+ +.+...--+... .+. 
T Consensus 427 ~~~f~r~D~-~~~~e~~-~~~~~---~~--~G-------~lT~N----E~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~ 488 (563) T protein:vir:95 427 TFQFVGGDT-KSATDKL-NILKL---ET--QI-------FKTVN----EAREEQGKKPIEGGDIILDASFLQGTAQLQQD 488 (563) T ss_pred EEEeccCCH-HHHHHHH-HHHHH---hc--CC-------ccCHH----HHHHHhCCCCCCCcceeecccccccccccccc Confidence 566543311 1111111 11110 00 01 01212 2222334432 122111100000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhh-----hhcCCChh---------HHHHHHHHhhcC-CCCCC Q lcl|NC_019445. 510 RAQQQQQQQMMAMGMAAAQGAKTLS-----EAKTSDPS---------VLSAMANAVSGQ-GGQSQ 559 (559) Q Consensus 510 r~q~~q~~~~~~~~~~~~~~a~~~~-----~~~~~~~~---------~~~~~~~~~~~~-~~~~~ 559 (559) .....+.+ ........++...-. +..+.+.+ .+.+.=+.-..+ +.-.| T Consensus 489 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (563) T protein:vir:95 489 KQYNDGKQ--KERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQ 551 (563) T ss_pred cCCCcccc--chhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccc Confidence 00000000 000001110000000 00000000 000000000000 00000 No 173 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=64.76 E-value=0.29 Score=23.49 Aligned_cols=393 Identities=13% Similarity=0.123 Sum_probs=154.1 Q ss_pred CChhhHHHHHH------HHHHHHHHhhh-HHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 1 MAETTKERLNK------QFAQLESERQS-FEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~~~~~~l~~------r~~~l~~~R~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.=..+|+-++ .|... ..|+. .-..+....--++|....-.....+. ..-+=.++--.|++.+|+.+. 
T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~-e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~al~~~~V~~cv~~Ia~~iA 85 (441) T protein:vir:79 11 VDFKSRKQSRKELVVVGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLA 85 (441) T ss_pred ccccccccchhhhhcccccccc-ccccccCCCcchHHHHHHhcccCcccccccch----hhhhccHHHHHHHHHHHHhhc Confidence 32222222211 12111 11210 00111111111122111000000000 001112344456666666554 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-Cce Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDI 147 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~ 147 (559) + -|| ++.- +.... .++.+...|+ +-| .+.-....+.++..+|||.+++..+. +.+ T Consensus 86 ~------lp~-~~~~-~~~~~-----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:79 86 R------MPI-RVTV-NGQIN-----------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred c------Cce-eeec-Ccccc-----------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 4 233 3321 11111 1122333443 222 23345667788899999999987654 456 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) +.+.+++...+-+..|.+|++--.+.. ++ .+. ..+. ..+ T Consensus 147 ~~L~~i~~~~v~v~~d~~g~~~~~~~~-----------~~--------------~~~-~~~~---~~~------------ 185 (441) T protein:vir:79 147 MNLTFRKTSEIELKSDARGRLYYFHQR-----------ID--------------SNG-NNIE---RNV------------ 185 (441) T ss_pred EEEEEEcCceeEEEECCCccEEEEEEE-----------ec--------------cCC-ceeE---EEE------------ Confidence 677778888888888887764221110 00 000 0000 000 Q ss_pred ccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee- Q lcl|NC_019445. 
228 KNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV- 306 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~- 306 (559) ..--++++|+...+| .||.| |.+.+...+.......+.......-...|..+ T Consensus 186 -------------------------~~~dvih~k~~~~dg-~~G~s-pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 238 (441) T protein:vir:79 186 -------------------------KFEDMLDIKFYSLDG-INGLS-LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGIL 238 (441) T ss_pred -------------------------ccccEEEeccCCCCC-ccccC-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEE Confidence 001134455544455 79999 88877777776677777777777777778755 Q ss_pred -ecCCCcccc--------ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 307 -APTSLKNQR--------AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 307 -~p~~~~~~~--------~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +++.+.... ++ . ..|++..++ ++-.++|+.. ++....+.+........|-++|-..+.. + T Consensus 239 ~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~---~G~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-l 313 (441) T protein:vir:79 239 KMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD---ESMTFDQLEV-DTEVLKLIRENKSSTREIAGVFGIPLHK-F 313 (441) T ss_pred EcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecC---CCceEEEccC-ChhHHHHHHHHHHhHHHHHHHhCCCHHH-c Confidence 444432211 10 1 123333332 2234566542 3344445566667778899999875443 3 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHH Q lcl|NC_019445. 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSI 451 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~ 451 (559) +. +....+.+|. ....... |.|++.+.-..+.+. +++ ...+..+++.... |-+. 
T Consensus 314 g~-~~~~~s~~q~---~~~~~~t------------l~P~~~~ie~eln~k-l~~----~~~~~~~~fd~~~-llr~---- 367 (441) T protein:vir:79 314 GI-ETANMSITDA---NLDYLST------------LKPYITCVCAELNFK-FND----EYVNREFKFDTTE-IRVV---- 367 (441) T ss_pred CC-CCCCccHHHH---HHHHHHH------------HHHHHHHHHHHHhhh-ccc----cccCceEEeechh-hhcc---- Confidence 32 2222222221 1112223 445555444444332 122 1234445554333 3211 Q ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc------cc-CCHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 452 GLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT------VI-VPQEQ--VDQARQQRAQQQQQQQMMAM 522 (559) Q Consensus 452 ~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~------~~-rs~~e--v~~~rq~r~q~~q~~~~~~~ 522 (559) +......+++ .+.+.+ -+.+++ +-+.+|.|+- ++ .+-.- ++..- +-|.. ... T Consensus 368 D~~~~~~~~~---~~i~~G-----~~T~NE----~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~-----~~~~~--~~~ 428 (441) T protein:vir:79 368 DEKTQAEIDK---INIDSG-----KMNIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVD-----EYQMN--KSR 428 (441) T ss_pred CHHHHHHHHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCcceEeeccccccccccc-----ccccc--ccc Confidence 1111111111 111111 122222 2334555431 11 11000 00000 00000 000 Q ss_pred HHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 523 GMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 523 ~~~~~~~a~~~~~~~~~~~~ 542 (559) +.... .| .++. + + T Consensus 429 ~~~~~--~k-gGe~---~-e 441 (441) T protein:vir:79 429 ATDKK--LK-GGEE---N-E 441 (441) T ss_pred ccccc--cC-CCCC---C-C Confidence 00000 00 0000 1 1 No 174 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=64.76 E-value=0.29 Score=23.49 Aligned_cols=393 Identities=13% Similarity=0.123 Sum_probs=154.1 Q ss_pred CChhhHHHHHH------HHHHHHHHhhh-HHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHH Q lcl|NC_019445. 
1 MAETTKERLNK------QFAQLESERQS-FEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) Q Consensus 1 M~~~~~~~l~~------r~~~l~~~R~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~ 73 (559) |.=..+|+-++ .|... ..|+. .-..+....--++|....-.....+. ..-+=.++--.|++.+|+.+. T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~-e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~al~~~~V~~cv~~Ia~~iA 85 (441) T protein:vir:94 11 VDFKSRKQSRKELVVVGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLA 85 (441) T ss_pred ccccccccchhhhhcccccccc-ccccccCCCcchHHHHHHhcccCcccccccch----hhhhccHHHHHHHHHHHHhhc Confidence 32222222211 12111 11210 00111111111122111000000000 001112344456666666554 Q ss_pred HhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-Cce Q lcl|NC_019445. 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDI 147 (559) Q Consensus 74 ~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~ 147 (559) + -|| ++.- +.... .++.+...|+ +-| .+.-....+.++..+|||.+++..+. +.+ T Consensus 86 ~------lp~-~~~~-~~~~~-----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:94 86 R------MPI-RVTV-NGQIN-----------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred c------Cce-eeec-Ccccc-----------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 4 233 3321 11111 1122333443 222 23345667788899999999987654 456 Q ss_pred EEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccc Q lcl|NC_019445. 148 IRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) Q Consensus 148 ~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~ 227 (559) +.+.+++...+-+..|.+|++--.+.. ++ .+. ..+. 
..+ T Consensus 147 ~~L~~i~~~~v~v~~d~~g~~~~~~~~-----------~~--------------~~~-~~~~---~~~------------ 185 (441) T protein:vir:94 147 MNLTFRKTSEIELKSDARGRLYYFHQR-----------ID--------------SNG-NNIE---RNV------------ 185 (441) T ss_pred EEEEEEcCceeEEEECCCccEEEEEEE-----------ec--------------cCC-ceeE---EEE------------ Confidence 677778888888888887764221110 00 000 0000 000 Q ss_pred ccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee- Q lcl|NC_019445. 228 KNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV- 306 (559) Q Consensus 228 ~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~- 306 (559) ..--++++|+...+| .||.| |.+.+...+.......+.......-...|..+ T Consensus 186 -------------------------~~~dvih~k~~~~dg-~~G~s-pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 238 (441) T protein:vir:94 186 -------------------------KFEDMLDIKFYSLDG-INGLS-LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGIL 238 (441) T ss_pred -------------------------ccccEEEeccCCCCC-ccccC-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEE Confidence 001134455544455 79999 88877777776677777777777777778755 Q ss_pred -ecCCCcccc--------ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhc Q lcl|NC_019445. 307 -APTSLKNQR--------AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMML 371 (559) Q Consensus 307 -~p~~~~~~~--------~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~ 371 (559) +++.+.... ++ . ..|++..++ ++-.++|+.. ++....+.+........|-++|-..+.. + T Consensus 239 ~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~---~G~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-l 313 (441) T protein:vir:94 239 KMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD---ESMTFDQLEV-DTEVLKLIRENKSSTREIAGVFGIPLHK-F 313 (441) T ss_pred EcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecC---CCceEEEccC-ChhHHHHHHHHHHhHHHHHHHhCCCHHH-c Confidence 444432211 10 1 123333332 2234566542 3344445566667778899999875443 3 Q ss_pred cCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHH Q lcl|NC_019445. 
372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSI 451 (559) Q Consensus 372 ~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~ 451 (559) +. +....+.+|. ....... |.|++.+.-..+.+. +++ ...+..+++.... |-+. T Consensus 314 g~-~~~~~s~~q~---~~~~~~t------------l~P~~~~ie~eln~k-l~~----~~~~~~~~fd~~~-llr~---- 367 (441) T protein:vir:94 314 GI-ETANMSITDA---NLDYLST------------LKPYITCVCAELNFK-FND----EYVNREFKFDTTE-IRVV---- 367 (441) T ss_pred CC-CCCCccHHHH---HHHHHHH------------HHHHHHHHHHHHhhh-ccc----cccCceEEeechh-hhcc---- Confidence 32 2222222221 1112223 445555444444332 122 1234445554333 3211 Q ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc------cc-CCHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 452 GLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT------VI-VPQEQ--VDQARQQRAQQQQQQQMMAM 522 (559) Q Consensus 452 ~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~------~~-rs~~e--v~~~rq~r~q~~q~~~~~~~ 522 (559) +......+++ .+.+.+ -+.+++ +-+.+|.|+- ++ .+-.- ++..- +-|.. ... T Consensus 368 D~~~~~~~~~---~~i~~G-----~~T~NE----~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~-----~~~~~--~~~ 428 (441) T protein:vir:94 368 DEKTQAEIDK---INIDSG-----KMNIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVD-----EYQMN--KSR 428 (441) T ss_pred CHHHHHHHHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCcceEeeccccccccccc-----ccccc--ccc Confidence 1111111111 111111 122222 2334555431 11 11000 00000 00000 000 Q ss_pred HHHHHHHHhhhhhhcCCChh Q lcl|NC_019445. 523 GMAAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 523 ~~~~~~~a~~~~~~~~~~~~ 542 (559) +.... .| .++. 
+ + T Consensus 429 ~~~~~--~k-gGe~---~-e 441 (441) T protein:vir:94 429 ATDKK--LK-GGEE---N-E 441 (441) T ss_pred ccccc--cC-CCCC---C-C Confidence 00000 00 0000 1 1 No 175 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=60.92 E-value=0.36 Score=23.00 Aligned_cols=465 Identities=13% Similarity=0.064 Sum_probs=160.3 Q ss_pred CChhh-HHHHHHHHHHHHHHhhhHHHHHH-----HHHHHhccccCCCCCCCC-CCccccc---CCCCcc---hHHHHHHH Q lcl|NC_019445. 1 MAETT-KERLNKQFAQLESERQSFEPHWR-----ELSDYINPRGSRFLTSEV-NRNDRRN---TRIIDS---TGTMAART 67 (559) Q Consensus 1 M~~~~-~~~l~~r~~~l~~~R~~~~~~w~-----e~~~~~~P~~~~~~~~~~-~~~~~~~---~~~~~s---~~~~a~~~ 67 (559) |++.. +..+...--...+...+....+. -+..+..+.......... .+..... ...|.. ....+.+- T Consensus 68 ~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~ 147 (862) T protein:vir:99 68 ISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHW 147 (862) T ss_pred ccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCc Confidence 22211 11111100011111111111111 133444433221110000 0000000 000111 11122222 Q ss_pred HHHHHHHhhcC-CCCcceeccCCcc-chhhHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHhhCcEEEEEee--- Q lcl|NC_019445. 
68 LASGMMSGITS-PARPWFRLATPDP-EMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLE--- 142 (559) Q Consensus 68 Las~l~~~l~p-p~~~Wf~l~~~d~-~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~v~~--- 142 (559) |+.++...+-= .-+.|+.+...++ +..+...+ +.+.+.+.+-+....+.++++.--.||.+++++.- T Consensus 148 larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~--------~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~ 219 (862) T protein:vir:99 148 LVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESL--------EKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSE 219 (862) T ss_pred hhhhhhhhhhHHHhhCCceEeecCcccccCHHHH--------HHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCc Confidence 22222222111 2467999886432 21121112 22344455567788888999988889988776542 Q ss_pred cCC---ceEEEEEeeccEEEEeeCCCCCEEEE--EEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Q lcl|NC_019445. 143 DDE---DIIRTMPFPIGSYYLANSPRGSVDIC--FRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPN 217 (559) Q Consensus 143 ~~~---~~~~~~~~~l~~~~v~~d~~G~vd~i--~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~ 217 (559) |+. ++++...+ ..|.+-.| +-.+..+. ..+.++..|-++ .++.+ .+.|. |-. T Consensus 220 D~~~LsqPLn~e~I----------~kG~lkgl~vlDp~w~~p-~~v~~~~~Dp~s---------p~yGk-P~~y~-I~g- 276 (862) T protein:vir:99 220 DPDYYEKPFNPDGI----------TPGSYRGISQIDPYWMMP-MLTAESTADPSS---------QFFYE-PEFWI-ISG- 276 (862) T ss_pred CchhhhcCcCcccc----------cccceeEEEEechhhhcc-cccccccccccc---------cccCC-ceeee-ecC- Confidence 221 22222111 11222111 11110000 000001111111 01111 11111 100 Q ss_pred CcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 218 IDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLI 297 (559) Q Consensus 218 ~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~ 297 (559) ..+|. 
.+++.-.| +..|++ .+....-||+| ..+.++..++..+.......+.+ T Consensus 277 -----------------~~IH~----SRliif~g-~~vpd~----lk~ay~f~G~S-vLe~iyd~L~~~d~t~~saa~Ll 329 (862) T protein:vir:99 277 -----------------QKYHR----SHLIIARG-PQPADI----LKPTYIFGGIP-LVQRIYERVYAAERTANEAPLLA 329 (862) T ss_pred -----------------eeecc----ceeEEecC-CCchhh----hhccCCccCcc-HHHHHHHHHHHHHHHHHHHHHHH Confidence 01111 12222222 233442 22334457999 47778777777777666666655 Q ss_pred HHHhcCceeecCC--Ccc-----ccc---ee-cC-CceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 298 DKATNPPMVAPTS--LKN-----QRA---SL-LP-GDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV 365 (559) Q Consensus 298 ~~~~~p~~~~p~~--~~~-----~~~---~~-~p-g~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~ 365 (559) ..+....+.+..- +.. ..+ +. .- .++..++.. +.+..+. .++..+-..+....+.|.-++=- T Consensus 330 ~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~e---Ee~e~ls---~slSGL~dll~~~~q~IAaas~I 403 (862) T protein:vir:99 330 MNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTD---ETMEQFD---TSLADFDAVIMGQYQLVASIAKT 403 (862) T ss_pred HHhccceeechhHhhhccHHHHHHHHHHHHhccCcceeEEecCC---CceeEEe---cccCChHHHHHHHHHHHHhhhCC Confidence 5544444433211 110 011 11 11 233333321 2233322 23444455566666777766633 Q ss_pred chhhhccCC-CCCCcCHH-HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecH Q lcl|NC_019445. 366 DLFMMLQNI-NTRSMPVE-AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISV 443 (559) Q Consensus 366 dl~~~~~~~-~~~~~TA~-Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~ 443 (559) .+--.+++. .+-.=|.+ +++. .---+..++...+.|+|+|.+.++....-+ |. 
++.+++ .| T Consensus 404 P~tiLfGqspaGlnATGE~D~~n--------YyD~I~s~QE~~L~P~LerL~~li~~~lg~---~~-----d~~ieF-np 466 (862) T protein:vir:99 404 PATKLLGTAPKGFNSTGEFETIS--------YHEELESIQEHVYMPFLQRHYLISRLSLGI---QH-----EIDVVM-EP 466 (862) T ss_pred CceeecccCcccccCchHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---CC-----cceEEe-CC Confidence 221122222 22222333 3222 222233344567889999998887654222 22 366766 35 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHc--CC---CccccC-----CHHHHHHHHHHHHHH Q lcl|NC_019445. 444 MAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMS--GV---SPTVIV-----PQEQVDQARQQRAQQ 513 (559) Q Consensus 444 La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~--Gv---p~~~~r-----s~~ev~~~rq~r~q~ 513 (559) |.+......++......+.+..+.+.+ .|+.+++.+.++..- |. +...+- .+++.++.....+.. T Consensus 467 L~~~sekEkAEi~kk~Aea~~~lv~sG-----vispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~ 541 (862) T protein:vir:99 467 VASMTAQQQADLNKTKAEGGKVLIDGG-----VISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQ 541 (862) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCccc Confidence 544433333332233333333333322 478888888776421 11 111100 011111110000000 Q ss_pred HHHHHHHHHHHH-HHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCC----C Q lcl|NC_019445. 514 QQQQQMMAMGMA-AAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQS----Q 559 (559) Q Consensus 514 ~q~~~~~~~~~~-~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 559 (559) .++.. ..+.++..+.+. +.++.-....+...|++-+. . 
T Consensus 542 -------~~ap~de~~aga~~~~~e-~d~~~~p~~~~~~~g~~~~~t~~~~ 584 (862) T protein:vir:99 542 -------ETASAKETQAGAAVTTAE-GDQPNVQMVPSMKPGQMVGPEVGIT 584 (862) T ss_pred -------ccccccccccccCCcccc-CCcccccccCCCCCCCccccccccc Confidence 00000 001111111111 11111000100011110000 0 No 176 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=60.33 E-value=0.37 Score=22.92 Aligned_cols=378 Identities=10% Similarity=0.049 Sum_probs=146.9 Q ss_pred hhHHHHHHHH---HHHhccc-cCCCCC--CCCCCccccc--CCCCcchHHHHHHHHHHHHHHhhcCCCCcc--eeccCCc Q lcl|NC_019445. 21 QSFEPHWREL---SDYINPR-GSRFLT--SEVNRNDRRN--TRIIDSTGTMAARTLASGMMSGITSPARPW--FRLATPD 90 (559) Q Consensus 21 ~~~~~~w~e~---~~~~~P~-~~~~~~--~~~~~~~~~~--~~~~~s~~~~a~~~Las~l~~~l~pp~~~W--f~l~~~d 90 (559) =.+...|-.- ...+-|. .....+ .....+..-. .-+=.++--.|++.+|+.+. +-|| ++..-.. T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia------~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVG------MLPCNLYHLNGSL 74 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhc------cCceEEEEecCCc Confidence 1111111000 0000000 000000 0000000000 00111222334444444322 3333 2222111 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHH-hc----cchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCCC Q lcl|NC_019445. 91 PEMMDYGPVKLWLEAVQNRMNDMFN-KS----NLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSPR 165 (559) Q Consensus 91 ~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~ 165 (559) ..... +..+...|+ +- +.+.-+..++.++.++|||.+++..+.+.+..+.+++.+.+.+..+.. 
T Consensus 75 ~~~~~-----------~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~~~v~~~~~~~ 143 (414) T protein:vir:44 75 KQRAT-----------GERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGCVVPKLNSS 143 (414) T ss_pred eeecc-----------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCceEEEEECCC Confidence 11111 111222332 22 334445667788889999999988776666667667777766666655 Q ss_pred CCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCce Q lcl|NC_019445. 166 GSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDK 245 (559) Q Consensus 166 G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~ 245 (559) |++ +|+. .. .+. ... ++. T Consensus 144 ~~~--~y~~-~~------------------------~~g-~~~---------------------------~~~------- 161 (414) T protein:vir:44 144 WEP--VYQV-TF------------------------PDG-STD---------------------------VLS------- 161 (414) T ss_pred CcE--EEEE-Ee------------------------cCc-eEE---------------------------EEc------- Confidence 542 1210 00 000 000 000 Q ss_pred eeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCccc-------cc Q lcl|NC_019445. 246 LLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQ-------RA 316 (559) Q Consensus 246 il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~-------~~ 316 (559) ..-++++|....++ .||.| |...+...+.....+.+.......-...|..++ +..+... .+ T Consensus 162 --------~~evih~~~~~~d~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~ 231 (414) T protein:vir:44 162 --------QEDIWHVRTLTLDG-LVGLN-PIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDF 231 (414) T ss_pred --------cccEEEecCCCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHH Confidence 00123333222233 79999 888777777777777777777777777787554 3333221 01 Q ss_pred e-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHH Q lcl|NC_019445. 
317 S-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEE 390 (559) Q Consensus 317 ~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e 390 (559) . . ..|++.+.+ ++-.++|+.. ++.-..+.+..+..+..|-++|-..........++..-++++.. T Consensus 232 ~~~~~g~~n~~~~~vl~---~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~----- 302 (414) T protein:vir:44 232 EERHTGLGNAHRPMILE---MGLDWKSMAL-NAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG----- 302 (414) T ss_pred HHHhcCccccCcceecC---CCceEEEccC-ChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----- Confidence 1 1 123333332 2223555542 23333344556667788999998754332221122222332222 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019445. 391 KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAK 470 (559) Q Consensus 391 ~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~ 470 (559) ..+...-+.|++.+.-..|.+. ++++. +-.+--|++.....+ +. +......+++ .+.+.+ T Consensus 303 ---------~~~~~~~l~P~~~~ie~~ln~~-L~~~~--~~~~~~i~fd~~~ll-~~----d~~~~~~~~~---~~~~~G 362 (414) T protein:vir:44 303 ---------LGFINYSLVPYLTRIEQRINTG-LVRKS--KQGVFYAKFNAGALL-RG----DMKSRFEAYA---TGINWG 362 (414) T ss_pred ---------HHHHHHHHHHHHHHHHHHHHhh-cCCcc--ccCceEEEEechhhh-cc----CHHHHHHHHH---HHHhCC Confidence 2244556777777665555442 34432 212222444333322 11 1111111111 111111 Q ss_pred hhhHhcCCHHHHHHHHHHHcCCCcc-----ccCCHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhhhhhcCCChhH Q lcl|NC_019445. 471 PEALDKLNVDQAIDAFADMSGVSPT-----VIVPQEQVDQARQQRAQQQQQQQMMAMGM-AAAQGAKTLSEAKTSDPSV 543 (559) Q Consensus 471 P~~~~~id~d~~~~~~a~~~Gvp~~-----~~rs~~ev~~~rq~r~q~~q~~~~~~~~~-~~~~~a~~~~~~~~~~~~~ 543 (559) -+.++++- +.+|.|+- ++.+.. . ...+ ...+..+...++....++. 
T Consensus 363 -----~~t~NE~R----~~~gl~p~~ggD~~~~~~n-~-----------------~~~~~~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 363 -----IYSPNDCR----DLEDMNPRPGGDVYLTPMN-M-----------------TTKPSDGSKAGKQKDNANADETTS 414 (414) T ss_pred -----CcCHHHHH----HHhCCCCCCCcceeccccc-c-----------------cccCCccccCCCCCCCCCCCCCCC Confidence 13333332 34566541 111100 0 0000 0001111111111111111 No 177 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=60.21 E-value=0.38 Score=22.91 Aligned_cols=392 Identities=13% Similarity=0.105 Sum_probs=151.7 Q ss_pred hhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCccc--ccCCCCcchHHHHHHHHHHHHHHhhcCCCC Q lcl|NC_019445. 4 TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDR--RNTRIIDSTGTMAARTLASGMMSGITSPAR 81 (559) Q Consensus 4 ~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~~~~~s~~~~a~~~Las~l~~~l~pp~~ 81 (559) +..+++-++-+..+.....+...+.+ +. .+..+..+.. ...-+-.++--.|++.+|+.+.+ - T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~---~~-------~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~------l 64 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLN---MF-------GGRKTASGERVSESNSLVQPDIFACVNVLSDDIAK------L 64 (416) T ss_pred CccchhcccccCccccCccchhHHHH---hh-------cCcccccCceechhhhhccHHHHHHHHHHHHhhhh------C Confidence 23222222212222222222222222 11 1111111111 11223344555677777665543 2 Q ss_pred cceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeec Q lcl|NC_019445. 82 PWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPI 155 (559) Q Consensus 82 ~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l 155 (559) ||--+...+....+. .+.-+...|. +-| .+.=+...+.++.++|||.+++..+. +.+....+++. 
T Consensus 65 ~~~~~~~~~~~~~~~---------~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~ 135 (416) T protein:vir:12 65 PIHTYKRTDGGIERK---------PEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRP 135 (416) T ss_pred ceEEEEecCCccccc---------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC Confidence 442222222111110 0111222232 222 23345667789999999999887653 33334444444 Q ss_pred cEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEE Q lcl|NC_019445. 156 GSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) Q Consensus 156 ~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv 235 (559) ..+-+..+.++.. +|.++ T Consensus 136 ~~v~v~~~~~~~~--~~~~~------------------------------------------------------------ 153 (416) T protein:vir:12 136 DYTNAYVHPTTGM--LWYQT------------------------------------------------------------ 153 (416) T ss_pred cceEEEEeCCCcE--EEEEE------------------------------------------------------------ Confidence 4433333222210 01000 Q ss_pred EEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcc Q lcl|NC_019445. 236 YYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKN 313 (559) Q Consensus 236 ~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~ 313 (559) ...+ ..+ .|...-++++|+...++ .||.| |..-+...+.......+.......-...|..++ +..+.. T Consensus 154 --~~~g--~~~----~~~~~eiih~~~~~~~~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~ 223 (416) T protein:vir:12 154 --VLNG--KAI----ELYDYEVLHFKGLSTDG-IHGKS-PIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDE 223 (416) T ss_pred --ecCC--eEE----EecCccEEEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCH Confidence 0000 000 01112355556554444 89999 898887777777777777777777777786654 333221 Q ss_pred cc-------ce--ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHH Q lcl|NC_019445. 
314 QR-------AS--LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) Q Consensus 314 ~~-------~~--~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei 384 (559) .. +. ...|++.+++ ++-.++|+.. ++.-..+.+........|-++|-..........++..-++++. T Consensus 224 e~~~~~~~~~~~~~~~~~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~ 299 (416) T protein:vir:12 224 KPKENVRKEWKRVNKVENIAIID---YGLEYQSISM-PLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQ 299 (416) T ss_pred HHHHHHHHHHHHHhcCCCeeecC---CCceEEEccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHH Confidence 11 11 1234444332 2234555543 2333334455667788899999775433322222222233222 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) Q Consensus 385 ~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~ 464 (559) . .. +...-|.|++.+.-..+.+. ++++.. ...|..|++.+. .|- +. +......... T Consensus 300 ~--~~------------f~~~~l~P~~~~ie~~l~~~-l~~~~~-~~~g~~i~fd~~-~l~---~~-d~~~~~~~~~--- 355 (416) T protein:vir:12 300 S--IE------------YVRNTLQPWIVNFEQELNVK-LFLDHD-QKSGHYVKFNID-SEL---RG-DSKTQAEYLK--- 355 (416) T ss_pred H--HH------------HHHHHHHHHHHHHHHHHHHh-hcCchh-hcCCceEEeech-hhh---cc-CHHHHHHHHH--- Confidence 2 11 33445666655555554332 233221 112333444333 332 11 1111111111 Q ss_pred HHhccChhhHhcCCHHHHHHHHHHHcCCCcc-----ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC Q lcl|NC_019445. 465 QLAQAKPEALDKLNVDQAIDAFADMSGVSPT-----VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTS 539 (559) Q Consensus 465 ~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-----~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~ 539 (559) .+...+ -+.. +++-+.+|.|+- ++.+..-+. +- ....++ .. . +.++...++.... 
T Consensus 356 ~~~~~G-----~~T~----NE~R~~~gl~Pi~ggd~~~~~~n~~~-~~--~~~~~~---~~----~-~~~~~~gge~~~~ 415 (416) T protein:vir:12 356 TLHETG-----VLNK----DEIRELLERNPIENGDKYISSLNYVF-LD--FLEEYQ---RL----K-AGGAMKGGDNKNE 415 (416) T ss_pred HHHhCC-----CcCH----HHHHHHhCCCCCCCcceeeecccccc-cc--ccchhh---cc----c-cccccCCCCCcCC Confidence 111111 1232 233334566541 111110000 00 000000 00 0 0000111111111 Q ss_pred C Q lcl|NC_019445. 540 D 540 (559) Q Consensus 540 ~ 540 (559) | T Consensus 416 g 416 (416) T protein:vir:12 416 G 416 (416) T ss_pred C Confidence 1 No 178 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=55.68 E-value=0.47 Score=22.36 Aligned_cols=398 Identities=11% Similarity=0.096 Sum_probs=157.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHH---HHHHHHHhccccCCCCCCCCCCcccccCCCCc-chHHHHHHHHHHHHHHhh Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPH---WRELSDYINPRGSRFLTSEVNRNDRRNTRIID-STGTMAARTLASGMMSGI 76 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~---w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-s~~~~a~~~Las~l~~~l 76 (559) |.-- .+-|.-.+....+.... -..+..+.--+. .+-..+ ....+. ++--.|++.+|+.+. T Consensus 1 M~~~-----~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~---~~~~v~-----~~~al~~~~v~~~i~~ia~~ia--- 64 (429) T protein:vir:10 1 MDSV-----KKFFNFEKRQTSQVIELNKDDEKLLEWLGISP---STISVK-----GKNALKVATVFACIKILSESVS--- 64 (429) T ss_pred Cchh-----hhhhcccccCcccccccCCChHHHHHHhcCCC---Ccceec-----hhhhhccHHHHHHHHHHHHhhc--- Confidence 4321 12222111111111110 011122211000 000000 011122 233344554444333 Q ss_pred cCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHh-c----cchHHHHHHHHHHHhhCcEEEEEeecC-CceEEE Q lcl|NC_019445. 
77 TSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNK-S----NLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRT 150 (559) Q Consensus 77 ~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~-s----nf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~ 150 (559) +-||--..-.+....+ ..+..+...|+. - +.+.-+..++.++.++||+.+++..+. ++++.+ T Consensus 65 ---~l~~~~~~~~~~~~~~---------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 132 (429) T protein:vir:10 65 ---KLPLKIYQEDEYGIQR---------GTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQAL 132 (429) T ss_pred ---cCceEEEEecCCceee---------ccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 2344322211111000 011223334432 1 233446677889999999999987654 445666 Q ss_pred EEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccc Q lcl|NC_019445. 151 MPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) Q Consensus 151 ~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 230 (559) .++|...+.+..|..|.+..-++ ++.. T Consensus 133 ~~i~~~~v~v~~~~~~~~~~~~~------------------------------------~~~~----------------- 159 (429) T protein:vir:10 133 WPIDASKVTVYIDDVGLLNSKTK------------------------------------MWYV----------------- 159 (429) T ss_pred EEEcCceeEEEEcCcccccccce------------------------------------EEEE----------------- Confidence 67777777766666554321111 0000 Q ss_pred cEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--c Q lcl|NC_019445. 231 PFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--P 308 (559) Q Consensus 231 ~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p 308 (559) +..++ ..+. 
|..--++++|.....+..||.| |...+...+.......+.......-...|..++ + T Consensus 160 ------~~~~g-~~~~-----~~~~evih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~ 226 (429) T protein:vir:10 160 ------VNTGG-QQRV-----LKPEEILHFKNGITLDGLVGVP-TMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV 226 (429) T ss_pred ------EccCC-eEEE-----EccccEEEecCCCCCCCccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC Confidence 00000 0011 1111255566555556689999 898888888888888888888888888887654 3 Q ss_pred CCCccc-------cce------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC Q lcl|NC_019445. 309 TSLKNQ-------RAS------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN 375 (559) Q Consensus 309 ~~~~~~-------~~~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~ 375 (559) ..+... .+. -..|++.+.+ ++-.++|+.. ++.-..+.+.....++.|-.+|-..........+ T Consensus 227 ~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~ 302 (429) T protein:vir:10 227 GDLNEDAKKVFRENFESMSSGLQNSHRIALMP---VGYQFQPISL-NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSK 302 (429) T ss_pred CCCCHHHHHHHHHHHHHHhccccccCceeecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 333221 111 0123434332 2233566543 2333334455566788899999876444322222 Q ss_pred CCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe-ecHHHHHHHHHHHH Q lcl|NC_019445. 376 TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY-ISVMAQAQKSIGLS 454 (559) Q Consensus 376 ~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~-is~La~a~r~~~~~ 454 (559) +..-++++... .+...-|.|++...-..+.+. ++++. ++ +..+.++| ++.|-+ .+.. 
T Consensus 303 ~~~sn~e~~~~--------------~f~~~~l~P~~~~ie~~ln~k-l~~~~--~~-~~g~~~~fd~~~ll~----~d~~ 360 (429) T protein:vir:10 303 ATLNNIEQQQQ--------------QFYTDTLQATLTMYEQEMTYK-LFLDS--EL-DKGFYSKFNVDAILR----ADIK 360 (429) T ss_pred CCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHh-hcChh--hc-CCCcEEEeechhhhc----CCHH Confidence 22223333211 133445566655555544432 22221 22 22233444 223311 1111 Q ss_pred HHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc-----ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 455 SLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT-----VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQG 529 (559) Q Consensus 455 ~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~-----~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~ 529 (559) .....+ ..+.+.+ -+.++++- +.+|.|+- ++.+-.-+ .+ ....+. +.. ..+- T Consensus 361 ~~~~~~---~~~~~~G-----~~T~NE~R----~~~gl~p~~ggD~~~~~~n~~-~~-----d~~~~~----~~k-~g~~ 417 (429) T protein:vir:10 361 TRYEAY---RTGIQGG-----FLKPNEAR----SKEDLPPEAGGDRLLVNGNML-PI-----DMAGQA----YLK-GGDT 417 (429) T ss_pred HHHHHH---HHHHhCC-----CcCHHHHH----HHhCCCCCCCcCeeeeccccc-ch-----hhcccc----ccC-CCCC Confidence 111111 1111111 13333332 33465531 11111000 00 000000 000 0000 Q ss_pred HhhhhhhcCCCh Q lcl|NC_019445. 530 AKTLSEAKTSDP 541 (559) Q Consensus 530 a~~~~~~~~~~~ 541 (559) .......+..+. T Consensus 418 ~~~~~~~~~e~~ 429 (429) T protein:vir:10 418 NGEVSKEGNEGN 429 (429) T ss_pred CCCCCCCCCCCC Confidence 000000000000 No 179 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=53.43 E-value=0.53 Score=22.10 Aligned_cols=380 Identities=15% Similarity=0.140 Sum_probs=152.6 Q ss_pred CChhhHHHHHHHHHHHHHHhh-hHHHH-HHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQ-SFEPH-WRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~-~~~~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |-= |...+ +|. ..-.. +.... -++|......+...+. ..-+=.++--.|++.+|+.+.+ T Consensus 1 Mg~---------f~~~~-~r~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----~~al~~~~v~~cv~~Ia~~iA~---- 61 (416) T protein:vir:45 1 MGI---------FYKNE-KRDLQYNEDDLQMMV-QTLPGFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLAR---- 61 (416) T ss_pred CCc---------ccccc-cccccCCCcchhHHH-HHhccccccCccccch----hhhhcchHHHHHHHHHHHhhcc---- Confidence 221 11111 111 11001 11111 1122211100100000 0001123334466666665543 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~ 152 (559) .|| ++.-.. ... .++.+...|+ +=| .+.-....+.++..+|||.+++..+. +.+..+.+ T Consensus 62 --~p~-~~~~~~-~~~-----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~ 126 (416) T protein:vir:45 62 --MPI-RVTVNG-QIN-----------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTF 126 (416) T ss_pred --Cce-EEecCc-ccc-----------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 233 343211 111 1222333443 222 23345667788899999999988754 45667778 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ++...+.+..|.+|++--.|.. +. .+. ..+. 
..++ T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~-----------~~--------------~~~-~~~~---~~~~---------------- 161 (416) T protein:vir:45 127 RKTSEIELKSDARGRLYYFHQR-----------ID--------------SNG-NNIE---RNVK---------------- 161 (416) T ss_pred EcCceeEEEECCCccEEEEEEE-----------ec--------------CCC-ceeE---EEEc---------------- Confidence 8888888888888764322210 00 000 0000 0000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~ 310 (559) .--++++|+...+| .||.| |.+.+...+.......+.......-...|..+ +++. T Consensus 162 ---------------------~~evihir~~~~d~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~ 218 (416) T protein:vir:45 162 ---------------------FEDMLDIKFYSLDG-INGLS-LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV 218 (416) T ss_pred ---------------------cccEEEeccCCCCC-ccccC-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC Confidence 00134445444444 79999 89888777777777777777666667777654 4444 Q ss_pred Ccccc--------ce------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC Q lcl|NC_019445. 311 LKNQR--------AS------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINT 376 (559) Q Consensus 311 ~~~~~--------~~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~ 376 (559) +.... +. -..|++.+++ ++..++|+.. ++....+.+.....+..|-.+|-..... +.. +. T Consensus 219 ~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~-~~ 292 (416) T protein:vir:45 219 LDNKKARDRAREEFHKSFSGTKQAGKVVVLD---ESMTFDQLEV-DTEVLKLIRENKSSTREIAGVFGIPLHK-FGI-ET 292 (416) T ss_pred CCCHHHHHHHHHHHHHHhcCccccCceeecC---CCceeEeccC-CHHHHHHHHHHHHHHHHHHHHhCCCHHH-cCC-CC Confidence 32211 10 1123333332 2234555532 3344445565666778899999875433 322 22 Q ss_pred CCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHH Q lcl|NC_019445. 
377 RSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSL 456 (559) Q Consensus 377 ~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l 456 (559) ...+.++. ....... +.|++...-..+.+. ++++ ..+..+++.... |-.. +.... T Consensus 293 ~~~~~~~~---~~~~~~~------------l~P~~~~ie~~ln~~-l~~~----~~~~~~~f~~~~-l~~~----D~~~~ 347 (416) T protein:vir:45 293 ANMSITDA---NLDYLST------------LKPYITCVCAELNFK-FNDE----YVNREFKFDTTE-IRVV----DEKTQ 347 (416) T ss_pred CCccHHHH---HHHHHHH------------HHHHHHHHHHHHhhh-cccc----ccCceEEEechh-hhcc----CHHHH Confidence 21222221 1112223 455555444444332 1221 234455554433 2211 11111 Q ss_pred HHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc------ccC-C-----HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 457 ASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT------VIV-P-----QEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 457 ~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~------~~r-s-----~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ..+++ .+.+.+ .+..++ +-+.+|.|+- ++. + -+.+.+ .+..... T Consensus 348 ~~~~~---~~~~~G-----~~T~NE----~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~------------~~~~~~~ 403 (416) T protein:vir:45 348 AEIDK---INIDSG-----KMNIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVDE------------YQMNKSR 403 (416) T ss_pred HHHHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCcceEeecccccccccccc------------cCccccc Confidence 11111 111111 233333 2333466531 111 1 011100 0000000 Q ss_pred HHHHHHhhhhhhcCCChh Q lcl|NC_019445. 
525 AAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 525 ~~~~~a~~~~~~~~~~~~ 542 (559) ......|- ++ ..+ T Consensus 404 ~~~~~~kg-Ge----~n~ 416 (416) T protein:vir:45 404 ATDKKLKG-GE----ENE 416 (416) T ss_pred ccccccCC-CC----CCC Confidence 00000000 00 001 No 180 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=53.43 E-value=0.53 Score=22.10 Aligned_cols=380 Identities=15% Similarity=0.140 Sum_probs=152.6 Q ss_pred CChhhHHHHHHHHHHHHHHhh-hHHHH-HHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQ-SFEPH-WRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~-~~~~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~p 78 (559) |-= |...+ +|. ..-.. +.... -++|......+...+. ..-+=.++--.|++.+|+.+.+ T Consensus 1 Mg~---------f~~~~-~r~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----~~al~~~~v~~cv~~Ia~~iA~---- 61 (416) T protein:vir:81 1 MGI---------FYKNE-KRDLQYNEDDLQMMV-QTLPGFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLAR---- 61 (416) T ss_pred CCc---------ccccc-cccccCCCcchhHHH-HHhccccccCccccch----hhhhcchHHHHHHHHHHHhhcc---- Confidence 221 11111 111 11001 11111 1122211100100000 0001123334466666665543 Q ss_pred CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEE Q lcl|NC_019445. 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMP 152 (559) Q Consensus 79 p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~ 152 (559) .|| ++.-.. ... .++.+...|+ +=| .+.-....+.++..+|||.+++..+. 
+.+..+.+ T Consensus 62 --~p~-~~~~~~-~~~-----------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~ 126 (416) T protein:vir:81 62 --MPI-RVTVNG-QIN-----------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTF 126 (416) T ss_pred --Cce-EEecCc-ccc-----------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 233 343211 111 1222333443 222 23345667788899999999988754 45667778 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ++...+.+..|.+|++--.|.. +. .+. ..+. ..++ T Consensus 127 i~~~~v~v~~~~~g~~~~~~~~-----------~~--------------~~~-~~~~---~~~~---------------- 161 (416) T protein:vir:81 127 RKTSEIELKSDARGRLYYFHQR-----------ID--------------SNG-NNIE---RNVK---------------- 161 (416) T ss_pred EcCceeEEEECCCccEEEEEEE-----------ec--------------CCC-ceeE---EEEc---------------- Confidence 8888888888888764322210 00 000 0000 0000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~ 310 (559) .--++++|+...+| .||.| |.+.+...+.......+.......-...|..+ +++. T Consensus 162 ---------------------~~evihir~~~~d~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~ 218 (416) T protein:vir:81 162 ---------------------FEDMLDIKFYSLDG-INGLS-LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV 218 (416) T ss_pred ---------------------cccEEEeccCCCCC-ccccC-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC Confidence 00134445444444 79999 89888777777777777777666667777654 4444 Q ss_pred Ccccc--------ce------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC Q lcl|NC_019445. 
311 LKNQR--------AS------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINT 376 (559) Q Consensus 311 ~~~~~--------~~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~ 376 (559) +.... +. -..|++.+++ ++..++|+.. ++....+.+.....+..|-.+|-..... +.. +. T Consensus 219 ~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~-~~ 292 (416) T protein:vir:81 219 LDNKKARDRAREEFHKSFSGTKQAGKVVVLD---ESMTFDQLEV-DTEVLKLIRENKSSTREIAGVFGIPLHK-FGI-ET 292 (416) T ss_pred CCCHHHHHHHHHHHHHHhcCccccCceeecC---CCceeEeccC-CHHHHHHHHHHHHHHHHHHHHhCCCHHH-cCC-CC Confidence 32211 10 1123333332 2234555532 3344445565666778899999875433 322 22 Q ss_pred CCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHH Q lcl|NC_019445. 377 RSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSL 456 (559) Q Consensus 377 ~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l 456 (559) ...+.++. ....... +.|++...-..+.+. ++++ ..+..+++.... |-.. +.... T Consensus 293 ~~~~~~~~---~~~~~~~------------l~P~~~~ie~~ln~~-l~~~----~~~~~~~f~~~~-l~~~----D~~~~ 347 (416) T protein:vir:81 293 ANMSITDA---NLDYLST------------LKPYITCVCAELNFK-FNDE----YVNREFKFDTTE-IRVV----DEKTQ 347 (416) T ss_pred CCccHHHH---HHHHHHH------------HHHHHHHHHHHHhhh-cccc----ccCceEEEechh-hhcc----CHHHH Confidence 21222221 1112223 455555444444332 1221 234455554433 2211 11111 Q ss_pred HHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCcc------ccC-C-----HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 457 ASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSPT------VIV-P-----QEQVDQARQQRAQQQQQQQMMAMGM 524 (559) Q Consensus 457 ~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~~------~~r-s-----~~ev~~~rq~r~q~~q~~~~~~~~~ 524 (559) ..+++ .+.+.+ .+..++ +-+.+|.|+- ++. + -+.+.+ .+..... 
T Consensus 348 ~~~~~---~~~~~G-----~~T~NE----~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~------------~~~~~~~ 403 (416) T protein:vir:81 348 AEIDK---INIDSG-----KMNIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVDE------------YQMNKSR 403 (416) T ss_pred HHHHH---HHHhCC-----CcCHHH----HHHHhCCCCCCCCCcceEeecccccccccccc------------cCccccc Confidence 11111 111111 233333 2333466531 111 1 011100 0000000 Q ss_pred HHHHHHhhhhhhcCCChh Q lcl|NC_019445. 525 AAAQGAKTLSEAKTSDPS 542 (559) Q Consensus 525 ~~~~~a~~~~~~~~~~~~ 542 (559) ......|- ++ ..+ T Consensus 404 ~~~~~~kg-Ge----~n~ 416 (416) T protein:vir:81 404 ATDKKLKG-GE----ENE 416 (416) T ss_pred ccccccCC-CC----CCC Confidence 00000000 00 001 No 181 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=48.89 E-value=0.66 Score=21.59 Aligned_cols=391 Identities=13% Similarity=0.073 Sum_probs=144.7 Q ss_pred cCCCC---CCCCCC----cccccCCCCc------------------------chHHHHHHHHHHHHHHhhcCCCCcceec Q lcl|NC_019445. 38 GSRFL---TSEVNR----NDRRNTRIID------------------------STGTMAARTLASGMMSGITSPARPWFRL 86 (559) Q Consensus 38 ~~~~~---~~~~~~----~~~~~~~~~~------------------------s~~~~a~~~Las~l~~~l~pp~~~Wf~l 86 (559) ++-|. +....+ +..+....++ ++--.|++.+|..+.+ .||-=. T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~------lp~~~~ 74 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIAT------LPLSTY 74 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhh------CceEEE Confidence 33221 111110 0000000010 1111244444443322 233211 Q ss_pred cCCccchhhHHHHHHHHHHHHHHHHHHHHh----ccchHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEee Q lcl|NC_019445. 
87 ATPDPEMMDYGPVKLWLEAVQNRMNDMFNK----SNLYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLAN 162 (559) Q Consensus 87 ~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~ 162 (559) .-.+.. .+ .++ ...+...+.+ -+.+.-+..++.++..+|||++++..+.+.+..+.+++...+.+.+ T Consensus 75 ~~~~~~-~~--~~~------~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~l~p~~v~v~~ 145 (457) T protein:vir:62 75 SKRGGT-RK--EID------TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDVLDPTKIHVHM 145 (457) T ss_pred EecCCc-cc--ccc------chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCcceEEEE Confidence 111110 00 010 1111222222 2345556677889999999999997776666566555555555544 Q ss_pred CCCCCE-EEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecC Q lcl|NC_019445. 163 SPRGSV-DICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGG 241 (559) Q Consensus 163 d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~ 241 (559) +..+.. ..+|+.|..+ .. T Consensus 146 ~~~~~~~~~~~~~y~~~------------------------------------------------------------~~- 164 (457) T protein:vir:62 146 VMVDGLRRKVFEAYDID------------------------------------------------------------AD- 164 (457) T ss_pred eccCCccceeEEEEEEc------------------------------------------------------------cC- Confidence 432211 1111111110 00 Q ss_pred CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcccc---- Q lcl|NC_019445. 242 DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR---- 315 (559) Q Consensus 242 ~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~~---- 315 (559) .....+.. |...=+|++|....+|..||.| |...+...+.....+.+.......-...|..++ +..+.... 
T Consensus 165 g~~~~~~~--~~~~eiih~r~~~~~~~~~G~s-p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~ 241 (457) T protein:vir:62 165 GNEVLLGW--FTPRDVLHIPGMMLPGDFVGCS-PISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARA 241 (457) T ss_pred CceeEEEe--eCccceEEecCCCCCCceeccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHH Confidence 00001110 0001145556555667789999 898887777777777777777777667776543 44432211 Q ss_pred ---ce-e-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHH Q lcl|NC_019445. 316 ---AS-L-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIE 386 (559) Q Consensus 316 ---~~-~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~ 386 (559) +. . ..|++..++ ++-.++|+.. ++.-..+.+..+..+..|-++|-+... +++..+....+..-+.+ T Consensus 242 ~~~~~~~~~G~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~sn~eq 316 (457) T protein:vir:62 242 REAWRAANSGVDNAHRVALLT---EGAKFSKVAM-SPDEAQFLQTRQFQVPEIARIFGVPPH-LISDATNSTSWGSGLAE 316 (457) T ss_pred HHHHHHHhcCccccCcceecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHH-HcCCCCCcccccchHHH Confidence 10 0 123333332 2234555542 333334455566778889999987543 34333332222222322 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 387 MKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 (559) Q Consensus 387 r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~l 466 (559) .... |...-|.|++.++-..+.+ .++++. +-.+..|++.+...+ +. +......++..+.. 
T Consensus 317 ~~~~-----------f~~~~l~P~~~~ie~~ln~-~L~~~~--~~~~~~i~fd~~~l~-~~----d~~~r~~~~~~~~~- 376 (457) T protein:vir:62 317 QNIA-----------FTMFSLRPWLERIEAGFNR-LLFAET--ADRFRFVKFNLDEIK-RG----APKERMELWSLGLQ- 376 (457) T ss_pred HHHH-----------HHHHHHHHHHHHHHHHHHh-hhcCcc--ccCceEEEeechhhh-cc----CHHHHHHHHHHHHh- Confidence 2222 2233345555444433332 233322 222223444333322 11 11111111111111 Q ss_pred hccChhhHhcCCHHHHHHHHHHHcCCCcc------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCC Q lcl|NC_019445. 467 AQAKPEALDKLNVDQAIDAFADMSGVSPT------VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLSEAKTSD 540 (559) Q Consensus 467 a~~~P~~~~~id~d~~~~~~a~~~Gvp~~------~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~~~~~~~ 540 (559) .+ -+.+ +++-+.+|.|+- .+...--+...-. +-+.+ .+..+. ....-.+.+..+ T Consensus 377 --~G-----~~T~----NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~----~~~~~--~~~~~~---~~~~~~~~~~~~ 436 (457) T protein:vir:62 377 --NG-----IYSI----DEVRAAEDMTPLPDGLGEKYRVPLNLGEIGE----EPEPE--PAPAPP---AIDPPAEEPADD 436 (457) T ss_pred --CC-----CcCH----HHHHHHhCCCCCCCCCcceeeeccccccccc----ccccc--ccCCCc---cCCCCccCCCCC Confidence 11 1222 222333444321 1110000000000 00000 000000 000000000000 Q ss_pred hhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 541 PSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 541 ~~~~~~~~~~~~~~~~~~~ 559 (559) ++ .....+.+.. T Consensus 437 ~~-------~~~~~~~~d~ 448 (457) T protein:vir:62 437 EE-------PDNAEGDPDE 448 (457) T ss_pred CC-------CCCCCCCCcc Confidence 00 0001111111 No 182 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=47.98 E-value=0.68 Score=21.49 Aligned_cols=360 Identities=14% Similarity=0.125 Sum_probs=147.4 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHH--------------HHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHH Q lcl|NC_019445. 
1 MAETTKERLNKQFAQLESERQSFEP--------------HWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAAR 66 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~--------------~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~ 66 (559) |-+. .....|+.+++--.+.++ .|++. +..++. .+...+. .+-+=.++--.|++ T Consensus 1 ~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~g~~v~~----~~a~~~~aV~~~v~ 68 (432) T protein:vir:97 1 MPDE---KKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDL--GIIISD---TGAAVNA----DAIMRLDAVAACVK 68 (432) T ss_pred CCCc---ccCchhhhhHhhcCCccccccccccccccCchhhhhh--cccccc---cCcccch----HhhhcchHHHHHHH Confidence 4332 233334443333222211 11111 111110 0000000 01111223334444 Q ss_pred HHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_019445. 67 TLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVL 141 (559) Q Consensus 67 ~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~ 141 (559) .+|+.+ +. -||.-..-......+ ..+.-+...|+ +-| .+.=....+.++.++|||.+++. T Consensus 69 ~Ia~~i-a~-----lp~~~y~~~~~g~~~---------~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~ 133 (432) T protein:vir:97 69 LVSQAV-AA-----MPLMMYMRTPDGRKE---------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKV 133 (432) T ss_pred HHHHhh-cc-----CceEEEEecCCCccc---------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 444433 32 255322211111000 11222334443 222 23344556778999999999888 Q ss_pred ecCCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccc Q lcl|NC_019445. 142 EDDEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRD 221 (559) Q Consensus 142 ~~~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~ 221 (559) .+.+++....+++...+.+..|.+|++ +|+... 
.+ .+.++ ++ T Consensus 134 ~~~g~~~~L~~l~p~~v~v~~~~~g~~--~y~~~~-------------------------~~-g~~~~-----~~----- 175 (432) T protein:vir:97 134 VTDGRIESLQYLANDRLTITTDTKGNT--AYRYRR-------------------------TD-GQMID-----IP----- 175 (432) T ss_pred ecCCcEEEEEEEcCcceEEEEcCCCcE--EEEEEe-------------------------cC-ceEEE-----Ec----- Confidence 776666666677777777777776653 222100 00 00000 00 Q ss_pred ccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 222 TSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKAT 301 (559) Q Consensus 222 ~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~ 301 (559) ..+ +++.|....+| .||.| |...+...+.......+.......-.. T Consensus 176 --------------------~~~------------iih~r~~~~dg-~~G~s-pi~~~~~~i~~~~a~~~~~~~~f~ng~ 221 (432) T protein:vir:97 176 --------------------RQQ------------IWKIMGYSLDG-ENGLS-AIRYGAQIFGTAIAAEAQAARAFRNGQ 221 (432) T ss_pred --------------------ccc------------EEEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 000 23334333445 79999 888775555555555555555555555 Q ss_pred cCcee--ecCCCcccc---c------eecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_019445. 302 NPPMV--APTSLKNQR---A------SLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMM 370 (559) Q Consensus 302 ~p~~~--~p~~~~~~~---~------~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~ 370 (559) .|..+ ++..+.... + ....|++.+.+ ++-.++++.. ++.-..+.+........|-++|-..... T Consensus 222 ~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~- 296 (432) T protein:vir:97 222 LQSVYYQIDRFLTDDQYDSFSKKVSGSVEAGRAPLLE---GGMDVKSLGL-NPVDAQLLQSRQYSVESICRFFGVPPSM- 296 (432) T ss_pred CcceeEecCCCCCHHHHHHHHHHHhhhhcCCCceecC---CCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHH- Confidence 66544 344332211 1 01234444432 2233555532 3334444566667788899999875433 Q ss_pred ccCCCC-CCcCHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC Q lcl|NC_019445. 
371 LQNINT-RSMPVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDR-------------------------AFSMMVRKNM 423 (559) Q Consensus 371 ~~~~~~-~~~TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r-------------------------~~~il~r~g~ 423 (559) ++..+. ..-+..-+.+.... ....|.|.+.+++.|+-.-|+.. .+..+.+.|. T Consensus 297 lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 376 (432) T protein:vir:97 297 IGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGL 376 (432) T ss_pred cCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 332221 11222333333222 23467777777777765444211 2222333333 Q ss_pred -----------CCCCchhhCCcceEEEeec---HHHHHHHHHHHHHHHHHHHHHHHHhccChh-hHhcCCHHHHHH Q lcl|NC_019445. 424 -----------LPPPPDAMEGMPLKVEYIS---VMAQAQKSIGLSSLASTVNFIGQLAQAKPE-ALDKLNVDQAID 484 (559) Q Consensus 424 -----------lp~~p~~l~g~~v~~~~is---~La~a~r~~~~~~l~~~~~~~~~la~~~P~-~~~~id~d~~~~ 484 (559) +|++| |.+..+...+ ||..+. .-..-.|. -...-+-++.-+ T Consensus 377 ~T~NE~R~~~glpp~~----g~~~~~~~~~~~~pl~~~~----------------~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 377 MTRDEAREIEGLPKLG----GNAAVLTVQSAMVPLDSIG----------------LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCHHHHHHHhCCCCCC----CCcceEeecccccchhhhc----------------ccCCCCCCCCCCCcccccccC Confidence 23322 2222121111 222111 10000111 111111112222 No 183 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=45.71 E-value=0.76 Score=21.24 Aligned_cols=439 Identities=9% Similarity=0.040 Sum_probs=159.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCCCCCCCCCCcccccCCCC--cchHHHHHHHHHHHHHHhhcC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRII--DSTGTMAARTLASGMMSGITS 78 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~--~s~~~~a~~~Las~l~~~l~p 78 (559) |.+.. ++.+.--........--...+.+.+. ..|.--++. +-. 
..-+.| .++...|++..|.-+.+.-.+ T Consensus 39 ~~~~~-~k~~~~~~~a~~~~~~~~~~~~~~~~-~r~~~~~~~--~l~----~~~~~~~~npiv~~~I~~ia~~IA~~~~~ 110 (551) T protein:vir:80 39 EQEQI-SKAMNNKEVAYSQPVIGSMSANPGFK-TKPSIRNNQ--DLH----GVLKKFGGNIILNAIINTRSNQVSMYCKP 110 (551) T ss_pred cHHHH-HHhhccCcceeecccccceecCcccc-cCccccChh--HHH----HHHHHhhcCHHHHHHHHHHHHHHhhhhhh Confidence 55543 22211111110000000111111111 111111110 000 000011 224456666666655432221 Q ss_pred -----CCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhcc---------chHHHHHHHHHHHhhCcEEEEEeecC Q lcl|NC_019445. 79 -----PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSN---------LYQSLPQLYGSLGTYSTGAMAVLEDD 144 (559) Q Consensus 79 -----p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~sn---------f~~~~~~~~~dl~~~G~~~l~v~~~~ 144 (559) .+.+ |.+.+.+.+.........-.+.++ ..|++-| |..-+...+.|+.++|||.+++..+. T Consensus 111 ~~~~~~g~~-~~i~~kd~~~~~~~~~~~~~~~i~----~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~ 185 (551) T protein:vir:80 111 ARHSEKGVG-FEVRLKDLDKKPTSHDEATIKRIE----SFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNR 185 (551) T ss_pred hhhhcCCCC-ceEEecccCcccChhHHHHHHHHH----HHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECC Confidence 1122 334433322111111111111222 2333333 33445567788999999988877643 Q ss_pred -CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccc Q lcl|NC_019445. 145 -EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTS 223 (559) Q Consensus 145 -~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~ 223 (559) +.+..+.+++...+.+..+.+|.+..-..+|..+. 
+....++ T Consensus 186 ~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~-----------------------~g~~~~~-------------- 228 (551) T protein:vir:80 186 NQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVI-----------------------DQKIVAT-------------- 228 (551) T ss_pred CCcEEEEEEeCCceeEEEECCccccccCceEEEEEe-----------------------CCcEEEE-------------- Confidence 45677777777777777777765421000000000 0000000 Q ss_pred ccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeee---cCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 224 KLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEV---NGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKA 300 (559) Q Consensus 224 ~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~---~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~ 300 (559) +. .+ + .++++.+. ..+.+||.| |..-+...+......++.......-. T Consensus 229 -------------~~--~~-----------e--iiH~~~n~~~~~~~~~~G~s-pi~~a~~~i~~~~a~~~~~~~~f~Ng 279 (551) T protein:vir:80 229 -------------FN--AR-----------E--MAFAVRNPRSDIYATGYGYP-ELEIALKQFIAHENTEAFNDRFFSHG 279 (551) T ss_pred -------------Ec--cc-----------c--eEEecccCCCCccccccccc-HHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 00 00 0 12222111 123479999 89888888888888887777777777 Q ss_pred hcCcee--ecCCC--ccccc--------eecCCceeec--CCc-CCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 301 TNPPMV--APTSL--KNQRA--------SLLPGDITYI--DQI-TGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFV 365 (559) Q Consensus 301 ~~p~~~--~p~~~--~~~~~--------~~~pg~~~~~--~~~-~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~ 365 (559) ..|..+ ++.+. ..... +..-|..+.. .-. +++-.++|+. .++.-..+.+..+.....|-++|-+ T Consensus 280 ~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~-~~~~D~qfle~~~~~~~~Ia~aFgV 358 (551) T protein:vir:80 280 GTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMT-PSARDMEFEKWLNYLINVISALYGI 358 (551) T ss_pred CCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEcc-CChhHHHHHHHHHHHHHHHHHHhcC Confidence 778744 45442 11110 0111111110 111 2223455654 2334444556667788899999988 Q ss_pred chhhhccCCCC-------CCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEE Q lcl|NC_019445. 
366 DLFMMLQNINT-------RSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKV 438 (559) Q Consensus 366 dl~~~~~~~~~-------~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~ 438 (559) .+.......++ +.+|-.-+.+.. ..+....|.|++.++-..|.+ .++|+. +..+.+ T Consensus 359 Pp~~lG~~~~~~~~~~~~~s~t~sn~e~~~-----------~~f~~~tL~P~~~~ie~~ln~-~L~~~~-----~~~~~f 421 (551) T protein:vir:80 359 DPAEINIPNNGGATGSKGGSLNEGNSAEKN-----------QASKNKGLQPLLGFIEDFINK-HIVAEF-----GDKYTF 421 (551) T ss_pred CHHHcCcccccccccccccccchhhHHHHH-----------HHHHHHHHHHHHHHHHHHHHh-hhcccc-----CCceEE Confidence 75554322111 112211111111 123344555655555544444 233322 334667 Q ss_pred EeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc-----cccCCHHHHHHHHH-HHHH Q lcl|NC_019445. 439 EYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP-----TVIVPQEQVDQARQ-QRAQ 512 (559) Q Consensus 439 ~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~-----~~~rs~~ev~~~rq-~r~q 512 (559) ++...... ... +...+. ..+. +. -+.+++ +-+.+|.|+ +.+...--+..+-+ .+.+ T Consensus 422 ~f~~~~~~-~~~-~~~~~~---~~~~--~g-------~lT~NE----~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~ 483 (551) T protein:vir:80 422 QFVGGDIK-SEL-ESVKIL---AEKA--KV-------AMTVNE----VRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQE 483 (551) T ss_pred EeeccChh-hHH-HHHHHH---HHHh--cC-------CcCHHH----HHHHhCCCCCCCCCceeeccccccccccccccc Confidence 77654321 111 111111 1110 01 022222 223334432 11111000000000 0000 Q ss_pred HHHHHHHHHHHHH-HHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCCC Q lcl|NC_019445. 513 QQQQQQMMAMGMA-AAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) Q Consensus 513 ~~q~~~~~~~~~~-~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (559) +.+.+.+.+.... ....++...+.+...++..+. .+.-+.+. 
T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~-----~~~~~~~~ 526 (551) T protein:vir:80 484 QFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT-----TGDIGKDG 526 (551) T ss_pred CcchhhhhhccccccCcCCCCCCCCCCCCCCcccc-----CCCccccc Confidence 0000000000000 000111111111111110000 00000000 No 184 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=45.40 E-value=0.77 Score=21.20 Aligned_cols=289 Identities=12% Similarity=0.110 Sum_probs=117.7 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTM 151 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~ 151 (559) =++.||.-.. .+.. +..-+...|+ +-| .+.=+...+.+|.++|||++++..+. +.+..+. T Consensus 1 ia~lp~~~~~-~~~~-------------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~ 66 (348) T protein:vir:93 1 MASLPLKMYE-DYKV-------------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLF 66 (348) T ss_pred CcccceEeEe-cCcC-------------cccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE Confidence 1233443211 1111 1222334554 333 22234566788899999999987643 4445565 Q ss_pred EeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCccccccccccccc Q lcl|NC_019445. 152 PFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKP 231 (559) Q Consensus 152 ~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~ 231 (559) ++|.+.+-+..+.+|... .|+ +.. 
.+ ...++ T Consensus 67 ~l~~~~v~~~~~~~~~~~-~y~-~~~------------------------~~-g~~~~---------------------- 97 (348) T protein:vir:93 67 LLNPDVVEMLIENQSREL-YYS-IHA------------------------AT-GNKLI---------------------- 97 (348) T ss_pred EEcCCceEEEEeCCCcEE-EEE-EEc------------------------CC-CeEEE---------------------- Confidence 566665555554443311 111 000 00 00000 Q ss_pred EEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-eee--c Q lcl|NC_019445. 232 FKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP-MVA--P 308 (559) Q Consensus 232 ~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~-~~~--p 308 (559) + ...-++++|-....+..||.| |.+.+...+...+...+.. ......++ ++. + T Consensus 98 -----~---------------~~~eiih~r~~~~~~~~~G~s-~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~i~~~~ 153 (348) T protein:vir:93 98 -----V---------------HNMDMLHFKHIVASNMVQGIS-PIDVLKNTTDFDNAVRTFN---LTEMQKPDSFMLKYG 153 (348) T ss_pred -----E---------------ccccEEEecCCCCCCceeecc-HHHHHHHHHHHHHHHHHHH---HHhcCCCceeEEecC Confidence 0 000133333333446689999 8876655555544444443 22333343 332 2 Q ss_pred CCCccc-------cc---eecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 309 TSLKNQ-------RA---SLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 309 ~~~~~~-------~~---~~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) ..+... .+ .-..|++.+.+ ++..++|+.. ++.-..+.+........|-++|-..........++.. T Consensus 154 ~~l~~e~~~~~~~~~~~~~~n~~~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~ 229 (348) T protein:vir:93 154 SNVSTEKRQQVLEDFKQYYEENGGILFQE---PGVEIEPLPK-KYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNF 229 (348) T ss_pred CCCCHHHHHHHHHHHHHHhhcCCCeeecC---CCceEEEcCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 222111 01 11234444332 2233565532 2333344455566778899999875443322222222 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH--------------------------HHHHHHhcCC--------- Q lcl|NC_019445. 
379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDR--------------------------AFSMMVRKNM--------- 423 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r--------------------------~~~il~r~g~--------- 423 (559) -++++. ...-....|.|...+++++|-..|+.. .+..+.+.|. T Consensus 230 ~~~e~~--~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~ 307 (348) T protein:vir:93 230 AKNEEL--NRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREW 307 (348) T ss_pred ccHHHH--HHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 333332 223344567777777777765544311 1122333332 Q ss_pred --CCCCchhhCCcceE--EEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh-HhcC Q lcl|NC_019445. 424 --LPPPPDAMEGMPLK--VEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEA-LDKL 477 (559) Q Consensus 424 --lp~~p~~l~g~~v~--~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~-~~~i 477 (559) +||+|. |+.+- -.++ |+......+ ... + +.+. .+.= T Consensus 308 ~g~~p~~g---gD~~~~~~n~~-~~~~~~~~~------------~~~-~-gg~~n~~~~ 348 (348) T protein:vir:93 308 EDLPPVEG---GDKPLISGDLY-PIDTPLELR------------KSL-K-GGDKNVNES 348 (348) T ss_pred hCCCCCCC---cCeEeeccccc-ccccchhhc------------ccc-c-CCCCCcCCC Confidence 222221 22111 1111 111100000 000 1 1110 0000 No 185 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=39.71 E-value=1 Score=20.57 Aligned_cols=382 Identities=15% Similarity=0.121 Sum_probs=150.5 Q ss_pred HHHHHhhhHHHHHHHHHHHhccccCC----CCCCC-CCCccccc-C-CCCcchHHHHHHHHHHHHHHhhcCCCCcceecc Q lcl|NC_019445. 15 QLESERQSFEPHWRELSDYINPRGSR----FLTSE-VNRNDRRN-T-RIIDSTGTMAARTLASGMMSGITSPARPWFRLA 87 (559) Q Consensus 15 ~l~~~R~~~~~~w~e~~~~~~P~~~~----~~~~~-~~~~~~~~-~-~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~ 87 (559) .+.+. .+..-...+.+.... +.+.. +..+..-+ . -+=.++--.|++.+|+.+- +-||.-.. 
T Consensus 1 ~~~~r------~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA------~lp~~~~~ 68 (419) T protein:vir:14 1 MFFSR------QLLSNLGQTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIA------QLPIELYE 68 (419) T ss_pred Ccccc------cccccccccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhc------cCceEEEE Confidence 11111 111111111111111 11111 11111100 0 1112333334444444332 34553333 Q ss_pred CCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEe Q lcl|NC_019445. 88 TPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLA 161 (559) Q Consensus 88 ~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~ 161 (559) ..+....+ + .+.-+...|+ +-| .+.-+...+.++.++||+++++..+. +.+..+.+++.+.+.+. T Consensus 69 ~~~~~~~~---~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~ 139 (419) T protein:vir:14 69 RSGEDRKP---A------TDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVM 139 (419) T ss_pred ecCCcccc---c------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEE Confidence 22211111 1 1122233333 222 23334556788899999999998764 44566666777777777 Q ss_pred eCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecC Q lcl|NC_019445. 162 NSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGG 241 (559) Q Consensus 162 ~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~ 241 (559) .+.+|++ +|+ ++ +.+.+ |. T Consensus 140 ~~~~~~~--~y~---~~--------~~~~~---------------------------------------~~--------- 158 (419) T protein:vir:14 140 RGSDLKP--VYR---VR--------GSDPM---------------------------------------PQ--------- 158 (419) T ss_pred ECCCceE--EEE---Ec--------cCccc---------------------------------------ch--------- Confidence 7666542 121 00 00000 00 Q ss_pred CCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcc---cc- Q lcl|NC_019445. 
242 DNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKN---QR- 315 (559) Q Consensus 242 ~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~---~~- 315 (559) .+ +++.++...+| .||.| |...+...+.....+.+.......-...|..++ +.++.. .. T Consensus 159 -~~------------i~h~~~~~~dg-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~ 223 (419) T protein:vir:14 159 -RL------------VHHVRWMSING-YTGLS-PVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQAS 223 (419) T ss_pred -hh------------eeEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHH Confidence 00 22233333344 79999 898887778777778877777777777887554 433311 10 Q ss_pred ---ce----------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH Q lcl|NC_019445. 316 ---AS----------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) Q Consensus 316 ---~~----------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~ 382 (559) +. -..|++.+.+ ++..++|+.. ++.-..+.+.....+..|-++|-..........++..-+++ T Consensus 224 ~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E 299 (419) T protein:vir:14 224 VDRITDGWNAKFGGSGNAKKVALLQ---EGMTFRPLSM-TNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIE 299 (419) T ss_pred HHHHHHHHHHHhcCccccCCceecC---CCceEEEccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHH Confidence 10 0123444442 2234566543 23333344555666788999998754433221122222222 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) Q Consensus 383 Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~ 462 (559) +.. ..|...-|.|++.+.-..+.+. ++++ .+..+.-|++.. +.|-+. 
+.......++ T Consensus 300 ~~~--------------~~f~~~~L~P~~~~ie~~l~~k-ll~~--~~~~~~~i~fd~-~~l~r~----d~~~~~~~~~- 356 (419) T protein:vir:14 300 HQS--------------LQFVIYTLLPWVKRHEQAKTRD-LLLP--SERKQYFIEYNL-AGLLRG----DQSSRYAAYA- 356 (419) T ss_pred HHH--------------HHHHHHHHHHHHHHHHHHHhhh-ccCc--cccCCeEEEEec-hhhhcc----CHHHHHHHHH- Confidence 211 1244556777766665555543 3332 222233344433 333221 1111111111 Q ss_pred HHHHhccChhhHhcCCHHHHHHHHHHHcCCCc----c-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019445. 463 IGQLAQAKPEALDKLNVDQAIDAFADMSGVSP----T-VIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTLS 534 (559) Q Consensus 463 ~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~----~-~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~~ 534 (559) .+.+.+ -+..+++ -..+|.|+ + ++.+-.-+. . ....+.+........+...+..+.|+ T Consensus 357 --~~~~~G-----~~T~NE~----R~~~gl~p~~gGD~~~~~~n~~~-~--~~~~~~~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 357 --VGRQWG-----WLSINDI----RRLENMPPVKGGDIYLSPMNMVD-A--SKPQQLPVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred --HHHhCC-----CcCHHHH----HHHhCCCCCCCcCeeeecccccc-c--cccccccCCCCCCccccccchhcccC Confidence 111111 1333332 23455543 1 111100000 0 00000000000000011111122222 No 186 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=35.39 E-value=1.2 Score=20.08 Aligned_cols=200 Identities=8% Similarity=-0.059 Sum_probs=77.2 Q ss_pred HhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHH Q lcl|NC_019445. 199 WESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGML 278 (559) Q Consensus 199 ~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~ 278 (559) .+.+.. -.+.+ . +.... ...++. .+. 
|..-=.+++|.....+..||.+ |..- T Consensus 1 ~r~~~d-g~~~y-~--~~~~~-----------------~~~~g~-~~~-----~~~~eilH~r~~~~~~~~~Gls-pi~~ 52 (219) T protein:vir:98 1 MRVCKD-GNYKY-L--MKKSL-----------------YDTKSE-IYE-----YNKNDVIFIKLYDPMQQVYGSP-DYVG 52 (219) T ss_pred Cceeec-CeEEE-E--Eecce-----------------ecCCce-eEE-----eccccEEEecCCCCCCCcceec-HHHH Confidence 111111 11100 0 00000 000010 111 1111145555433345589999 8887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCC-Ccccc-------ceecCCcee----ecCCc---CCchhhhhhhhc Q lcl|NC_019445. 279 ALGPVKALQLLQKRKSQLIDKATNPPMV--APTS-LKNQR-------ASLLPGDIT----YIDQI---TGQDGFRPAYLV 341 (559) Q Consensus 279 ~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~-~~~~~-------~~~~pg~~~----~~~~~---~~~~~~~p~~~~ 341 (559) ++..+..-+...+-...-..--..|..+ +|+. +.... +.-.-|+.+ .+..+ +++-.++|+... T Consensus 53 a~~~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~ 132 (219) T protein:vir:98 53 GITSALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDT 132 (219) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCC Confidence 7666654444444333334445566643 3542 22110 111112111 11111 112234554322 Q ss_pred cccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC--CCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 342 NPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN--TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMV 419 (559) Q Consensus 342 ~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~--~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~ 419 (559) ..+.+ +.+.-+..+..|-++|-+.+..+....+ +..-++++... . +...-|.|++.+....+. T Consensus 133 ~~d~q-fle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~--~------------f~~~tL~P~~~~ie~~ln 197 (219) T protein:vir:98 133 GQKDE-FANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIRE--A------------YQADEVLPLQEIIAESIN 197 (219) T ss_pred HHHHH-HHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHH--H------------HHHHHHHHHHHHHHHHhh Confidence 22333 4455566678899999887665432222 12233333222 2 334445566555555554 Q ss_pred hcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHH Q lcl|NC_019445. 
420 RKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLS 454 (559) Q Consensus 420 r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~ 454 (559) +.=.+| .+ ++++|..+-- .|.. T Consensus 198 ~~~~~~---~~-----~~~~F~~~~~-----~d~~ 219 (219) T protein:vir:98 198 SDYEIK---SA-----LKVNFKQPEK-----RDKN 219 (219) T ss_pred hhhcCC---Cc-----cEEeecCccc-----ccCC Confidence 322232 22 2333333210 0111 No 187 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=33.72 E-value=1.3 Score=19.89 Aligned_cols=469 Identities=11% Similarity=0.089 Sum_probs=164.5 Q ss_pred CChhhHH---HH----HHHHHHHHHHhhhHHHHH--H----HHHHHhccccCC----CCC----CCCCCccccc-----C Q lcl|NC_019445. 1 MAETTKE---RL----NKQFAQLESERQSFEPHW--R----ELSDYINPRGSR----FLT----SEVNRNDRRN-----T 54 (559) Q Consensus 1 M~~~~~~---~l----~~r~~~l~~~R~~~~~~w--~----e~~~~~~P~~~~----~~~----~~~~~~~~~~-----~ 54 (559) |++.+.. ++ ...-++..++|+.+-..- + +-..+ .|...+ ... ......++.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAY-SPYERNAAQNAMAMDYGLQTGRNGRNALSFVEA 79 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhccccc-ccccccccccccccccccCcccccccccccccc Confidence 7774311 11 112233333333322111 1 11111 222111 010 0000001111 1 Q ss_pred CCCcch-----------HHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchHHH Q lcl|NC_019445. 55 RIIDST-----------GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSL 123 (559) Q Consensus 55 ~~~~s~-----------~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~ 123 (559) ..|... +..+|++.|. - ..+.|+.+...+.+..+ .+...+++. 
.+.+-++...+ T Consensus 80 ~~~~~~~l~a~Y~~~~l~r~~Vd~~ae----d---~~r~~~~i~~~~~~~~~-~~~~~~i~~-------~~~~l~v~~~l 144 (532) T protein:vir:94 80 TSWPGFPTLALLAQLPEYRTMHETPAD----E---CVRAWGKITCSSKDELA-ADKATRITQ-------KLEQYNVRTLV 144 (532) T ss_pred cccchHHHHHHHHcCchhhhhhccchH----H---HhhCCceEeeCCccccc-hHHHHHHHH-------HHHhhhHHHHH Confidence 111111 1122222222 1 24578888654332222 233333332 33334677889 Q ss_pred HHHHHHHHhhCcEEEEEeecCCceEEEEEeeccE-EEEeeC--CCCCEEEE--EEEEeecHHHHHHhcCc-ccCCHHHHH Q lcl|NC_019445. 124 PQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGS-YYLANS--PRGSVDIC--FRKFSMTVRQLVQEFGL-NNVSESVKS 197 (559) Q Consensus 124 ~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~-~~v~~d--~~G~vd~i--~r~~~~t~~ql~~~fg~-~~l~~~v~~ 197 (559) .++++.--+||.+++++.-+....- .|+.. +.++.. ..|.+..+ +-.+.+++. .+.. +-++ T Consensus 145 ~~a~~~~rlyG~a~i~i~v~~~~~~----~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~----~~~~~dp~s----- 211 (532) T protein:vir:94 145 RTVVIHDQAYGGAHVFPHLKMDGDS----VPADAPLLLSPSFVQRGCLIGFATIEPMWLSPN----AYNATDPTL----- 211 (532) T ss_pred HHHHHhhhcccceEEEEEeccCCcc----ccccccccccccccccceeeEEEeechheeccc----ccccccccc----- Confidence 9999988899999888754321110 01110 011111 12222111 111222211 0000 0000 Q ss_pred HHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHH Q lcl|NC_019445. 198 MWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGM 277 (559) Q Consensus 198 ~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~ 277 (559) .++. +.+.|.. .... .+| ..+++.-.|. ..|.+ .+....-||++. 
.+ T Consensus 212 ----p~fg-~P~~y~v-~~g~-----------------~iH----~SRli~f~g~-~~p~~----~~~~~~~~G~Sv-lq 258 (532) T protein:vir:94 212 ----PSFY-KPDSWIA-TSGK-----------------KIH----SSRIHTVVGR-PVGDM----LKAAYSFRGVSI-SQ 258 (532) T ss_pred ----cccC-CceeEEE-ccCe-----------------eec----cceEEEecCC-Cchhh----hccccccccccH-HH Confidence 0111 1111111 1100 011 1122222111 23322 222333469984 78 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeecC-C-----Ccc---ccce----ecC-CceeecCCcCCchhhhhhhhccc Q lcl|NC_019445. 278 LALGPVKALQLLQKRKSQLIDKATNPPMVAPT-S-----LKN---QRAS----LLP-GDITYIDQITGQDGFRPAYLVNP 343 (559) Q Consensus 278 ~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~p~-~-----~~~---~~~~----~~p-g~~~~~~~~~~~~~~~p~~~~~~ 343 (559) .++..++..+.......+.+..+.-..+.... + ... .++. ..- .++..++. +.+.++.+. . T Consensus 259 ~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~--~~e~~e~~~---~ 333 (532) T protein:vir:94 259 LAMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDK--GTEEIQQTN---T 333 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcC--CCceeEEEe---c Confidence 88888888888888777766655544443310 0 000 0111 011 12222321 112233332 2 Q ss_pred cHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHH---HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019445. 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE---AVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVR 420 (559) Q Consensus 344 ~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~---Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r 420 (559) ++..+-..+....+.|.-+.=..+.-.+++. +....++ ++.. 
.---+..++...+.|++++.+.++.+ T Consensus 334 ~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~s-p~GlnstGe~D~~~--------yyd~I~s~Qe~~l~p~le~l~~~l~~ 404 (532) T protein:vir:94 334 PLSGLDSLQAQSQEQMAAVSHIPLVKLLGIT-PNGLNASSDGEIRV--------WYDFIAGYQATNLTPLMEWIIDLIQL 404 (532) T ss_pred ccCCHHHHHHHHHHHHHhHhCCCeeeeecCC-cccccccchHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444445556666777666533222222222 2233232 2222 22233345556789999999999987 Q ss_pred cCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHc--CCCccccC Q lcl|NC_019445. 421 KNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQAIDAFADMS--GVSPTVIV 498 (559) Q Consensus 421 ~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~--Gvp~~~~r 498 (559) .... .+|+ ++.+++-. |-+......++......+....+.+.+ .|+.+++-+++...- |+. ..+. T Consensus 405 s~~g-~~~~-----d~~~~f~p-L~~~s~kEkAei~~~~a~a~~~~~~~G-----vi~~~Evr~~l~~~~~~~~~-~~~~ 471 (532) T protein:vir:94 405 SEYG-QIDP-----GLAWEWSP-LMELDDKELAEVRQLNASTDSTLMELG-----VIDAKMVQQRLAADPTSGYA-GALG 471 (532) T ss_pred HhcC-CCCC-----CceEEeCC-CCCCCHHHHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHhcCCccccc-cccc Confidence 5322 2222 35666653 332222222222222222222332222 477777777664321 111 1222 Q ss_pred CHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhhhcCCChhHHHHHHHHhhcCCCCC Q lcl|NC_019445. 499 PQEQVDQARQQRAQQQQQQQMM-AMGMAAAQGAKTLSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) Q Consensus 499 s~~ev~~~rq~r~q~~q~~~~~-~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (559) +.++.+....+...-+...... 
+.+...........+-.+..+++.+.-.+....+-|-- T Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 472 ERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQPVGNR 532 (532) T ss_pred cccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCCCcCCC Confidence 2232221111000000000000 00000000001111111111122222111111111111 No 188 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=33.54 E-value=1.3 Score=19.87 Aligned_cols=387 Identities=13% Similarity=0.080 Sum_probs=144.9 Q ss_pred CChhhHHHHHHHHHHHHHHhhh---HHHHHHHHHHHhcccc---CC-CCCCCCC-------Cccccc-----CC-CCcch Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQS---FEPHWRELSDYINPRG---SR-FLTSEVN-------RNDRRN-----TR-IIDST 60 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~---~~~~w~e~~~~~~P~~---~~-~~~~~~~-------~~~~~~-----~~-~~~s~ 60 (559) |- .++.++...++ -..+.+.-....-|.. ++ +.+.+.. .+.... .. +=.++ T Consensus 1 Mg---------l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~ 71 (431) T protein:vir:10 1 MG---------LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMA 71 (431) T ss_pred Cc---------chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHH Confidence 22 12222221111 1111111111111110 00 0000000 000000 00 00122 Q ss_pred HHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCc Q lcl|NC_019445. 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYST 135 (559) Q Consensus 61 ~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~ 135 (559) --.|++.+|+.+- +-||-=..- +... 
+ ...++.+...|+ +-| -+.-...++.++.++|| T Consensus 72 V~~ci~~Ia~~iA------~lp~~v~~~-~~~~-~--------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gn 135 (431) T protein:vir:10 72 VLRCVTLISGTIG------MLPMNLISS-DDSK-Q--------VLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGE 135 (431) T ss_pred HHHHHHHHHHhhc------cCceEEEEe-cCce-e--------eeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCC Confidence 2344444444332 224421121 1110 0 001122333443 222 22234566788899999 Q ss_pred EEEEEeecCCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEe Q lcl|NC_019445. 136 GAMAVLEDDEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVY 215 (559) Q Consensus 136 ~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~ 215 (559) |++++..+.+.++...+++...+.+..+.+|.+ +|+. .. .+ ...++ + T Consensus 136 a~~~i~r~~g~~~~L~pl~~~~v~~~~~~~~~~--~y~~-~~------------------------~~-g~~~~-----~ 182 (431) T protein:vir:10 136 SMARIVWSGNRPIRLIPMDRGSAKGRLTSTWQI--VYDY-TT------------------------PT-GDKIE-----L 182 (431) T ss_pred eEEEEEEcCCceEEEEEEcCceeEEEEcCCCeE--EEEE-Ee------------------------CC-ceEEE-----E Confidence 999998887666666666666666666655543 1210 00 00 00000 0 Q ss_pred ecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 216 PNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQ 295 (559) Q Consensus 216 p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~ 295 (559) + ..+ .+++|....+| .||.| |...+...+......++.... T Consensus 183 ~-------------------------~~d------------ViHir~~~~dg-~~G~s-pi~~~~~~i~~~~~~~~~~~~ 223 (431) T protein:vir:10 183 P-------------------------ARE------------VFHLRDLSIDG-VSGVS-RVKLSGNALELAEQAERAASR 223 (431) T ss_pred c-------------------------hhh------------EEEecCcCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHH Confidence 0 000 22333322234 89999 898887777777777777777 Q ss_pred HHHHHhcCceee--cCCCccccc--------ee-----cCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHH Q lcl|NC_019445. 
296 LIDKATNPPMVA--PTSLKNQRA--------SL-----LPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIIN 360 (559) Q Consensus 296 ~~~~~~~p~~~~--p~~~~~~~~--------~~-----~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~ 360 (559) ...-...|..++ +..+..... .. ..|++...+ ++-.++|+.. ++.-..+.+..+..+..|- T Consensus 224 ~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~---~g~~~~~l~~-~~~d~q~le~r~~~~~~Ia 299 (431) T protein:vir:10 224 TFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLE---EGATAKQFSN-TAASAQQIENRNHQIEEVA 299 (431) T ss_pred HHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCceecC---CCceEEEccC-ChhHHHHHHHHHHhHHHHH Confidence 777777786543 443322111 11 123333332 2234556543 3333344455555677898 Q ss_pred HHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEe Q lcl|NC_019445. 361 SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEY 440 (559) Q Consensus 361 ~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~ 440 (559) ++|-+.... ++..+ +-|..-+.+... .|...-|.|++.+.-..+.+. +|+ +.+..+..+++.+ T Consensus 300 ~~fgVPp~~-lg~~~--~~t~sn~eq~~~-----------~f~~~tL~P~~~~ie~~ln~~-Ll~--~~~~~~~~~~fd~ 362 (431) T protein:vir:10 300 RMYGVPRPL-LMMDD--TSWGSGIEQLAI-----------FFIQYGLSHWFVSWEQAAARA-FLP--EKMLGQRQFKFNE 362 (431) T ss_pred HHhCCCHHH-hCCCC--CCccccHHHHHH-----------HHHHHHHHHHHHHHHHHHHhh-ccC--hhhcCCceEEEec Confidence 999775433 33222 223222222222 233445666666555555432 333 2333444455543 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHhc---cCh-hhHhcCCHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHH Q lcl|NC_019445. 441 ISVMAQAQKSIGLSSLASTVNFIGQLAQ---AKP-EALDKLNVDQAIDAFADMSGVSPTVIVPQEQVDQARQQRAQQQQQ 516 (559) Q Consensus 441 is~La~a~r~~~~~~l~~~~~~~~~la~---~~P-~~~~~id~d~~~~~~a~~~Gvp~~~~rs~~ev~~~rq~r~q~~q~ 516 (559) ...| +.--....+.+...++ .-.+ +-| |+...++.+-+=...++.+-+|....... 
..+ T Consensus 363 ~~ll-r~d~~~r~~~~~~~~~---~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~~~~------------~~~- 425 (431) T protein:vir:10 363 GALL-RGTLNDQAAFFSKALG---AGGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQKQKG------------SGD- 425 (431) T ss_pred hhhh-ccCHHHHHHHHHHHHh---cccccCccCHHHHHHHhCCCCCCCccccceecccccccCC------------CCC- Confidence 3322 1111111111111111 0000 001 11111111111000111111121100000 000 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019445. 517 QQMMAMGMAAAQGA 530 (559) Q Consensus 517 ~~~~~~~~~~~~~a 530 (559) + +++. + T Consensus 426 -----~-~p~~--~ 431 (431) T protein:vir:10 426 -----E-PPAT--T 431 (431) T ss_pred -----C-CCCC--C Confidence 0 0000 0 No 189 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=32.82 E-value=1.4 Score=19.79 Aligned_cols=450 Identities=13% Similarity=0.084 Sum_probs=149.1 Q ss_pred CC----------hhhHHHHHHHHHH---HHHH----------h-hhHHHHHHHH-HHHhccccC--CCCCCCCCCccccc Q lcl|NC_019445. 1 MA----------ETTKERLNKQFAQ---LESE----------R-QSFEPHWREL-SDYINPRGS--RFLTSEVNRNDRRN 53 (559) Q Consensus 1 M~----------~~~~~~l~~r~~~---l~~~----------R-~~~~~~w~e~-~~~~~P~~~--~~~~~~~~~~~~~~ 53 (559) |- +++.++..+-..- ++.. + +.....+... -.+.-|..+ ++......+...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 11 1111111110000 0000 0 0011111110 000000000 00000000010111 Q ss_pred C----CC---C--cchHHHHHHHHHHHHHHh-----hcCCCCcce-eccCCccchhhHHHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_019445. 
54 T----RI---I--DSTGTMAARTLASGMMSG-----ITSPARPWF-RLATPDPEMMDYGPVKLWLEAVQNRMNDMFNK-- 116 (559) Q Consensus 54 ~----~~---~--~s~~~~a~~~Las~l~~~-----l~pp~~~Wf-~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~-- 116 (559) . .+ | ..+...|++.-++.+.+. -+-.+-||. ++.-.+....+.. ... ...+...|+. T Consensus 81 ~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~--~~~----~~~l~~ll~~~~ 154 (574) T protein:vir:80 81 SQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHD--IAN----IKRIESFLENTA 154 (574) T ss_pred cccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchh--hhh----hhHHHHHHhccC Confidence 0 00 0 011122333333222211 123466774 2222222211111 111 1122333332 Q ss_pred -------ccchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCc Q lcl|NC_019445. 117 -------SNLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGL 188 (559) Q Consensus 117 -------snf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~ 188 (559) ..|..-+..++.|+.++||+.+++..+. +.++.+.+++...+.+..|.+|.+..-- T Consensus 155 ~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~---------------- 218 (574) T protein:vir:80 155 QFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNG---------------- 218 (574) T ss_pred CCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCc---------------- Confidence 2344456667889999999998877653 4566777777777777777766432100 Q ss_pred ccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEeeecCC- Q lcl|NC_019445. 189 NNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGE- 267 (559) Q Consensus 189 ~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g- 267 (559) .. ++..+ .+.....+. 
.-=++++|.+..++ T Consensus 219 -----------------~~--y~~~~-------------------------~g~~~~~~~-----~~eiih~~~~~~~~~ 249 (574) T protein:vir:80 219 -----------------ER--FVQVI-------------------------DNRIVAKFN-----ERELAFAVRNPRADI 249 (574) T ss_pred -----------------eE--EEEEe-------------------------CCceEEEEc-----cccEEEEeccCCCCc Confidence 00 00000 000000000 00123333332222 Q ss_pred --CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee--ecCCC--cccc-------c-ee-----cCCceeecCC Q lcl|NC_019445. 268 --DVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMV--APTSL--KNQR-------A-SL-----LPGDITYIDQ 328 (559) Q Consensus 268 --~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~--~p~~~--~~~~-------~-~~-----~pg~~~~~~~ 328 (559) ..||.| |...+...+.......+.......-...|..+ ++.+. .... + +. ..|++..+ . T Consensus 250 ~~~~~G~s-pi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl-~ 327 (574) T protein:vir:80 250 EVGQYGYP-ELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVV-S 327 (574) T ss_pred cccccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee-c Confidence 469999 89888777777777777777777777778743 44332 1110 0 11 11121111 1 Q ss_pred cCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHH----HHHHHHHHhhhHHHHHHH Q lcl|NC_019445. 329 ITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIE----MKEEKLLMLGPVLERLND 404 (559) Q Consensus 329 ~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~----r~~e~~~~LG~v~~~l~~ 404 (559) +++-.++|+.. ++.-..+.+..+.....|-++|-....... ..+....+.+.+.. -.++. ...+.. 
T Consensus 328 -~~G~~~~~l~~-s~~D~qfle~~~~~~~~Ia~afgVPp~~lG-~~~~~t~~gs~~~~~n~sn~E~~-------~~~f~~ 397 (574) T protein:vir:80 328 -AEDVKFVNMTP-SANDMQFEKWLNYLINVISALYGIDPAEIN-FPNNGGATGSKGGSLNEGNSKEK-------MQASQN 397 (574) T ss_pred -CCCceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhc-ccccccccccccccccchhHHHH-------HHHHHH Confidence 12233555532 333344456667788899999987654432 22211111111000 00000 112344 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecH--HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHH Q lcl|NC_019445. 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISV--MAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQA 482 (559) Q Consensus 405 E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~--La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~ 482 (559) .-|.|++.+.-..+.+. ++++. +..+.+++..+ +.++. ...+.. .+. +. -+.+++ T Consensus 398 ~tL~P~~~~ie~~ln~~-Ll~~~-----~~~~~~~f~~~d~~~~~~----~~~~~~---~~~--~G-------~lT~NE- 454 (574) T protein:vir:80 398 KGLQPLLRFIEDTVNTY-IVAEF-----GEKYQFQFRGGDLSAQLD----KLKIIE---QEG--KV-------FRTVNE- 454 (574) T ss_pred HHHHHHHHHHHHHHHhh-hhhhc-----CCceEEEecccchhhHHH----HHHHHH---HHh--CC-------ccCHHH- Confidence 45555555554444442 23322 12344555432 22211 111111 110 01 122222 Q ss_pred HHHHHHHcCCCc----cccCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhhh----hhcCCC-hhHHHHHHHHh Q lcl|NC_019445. 483 IDAFADMSGVSP----TVIVPQEQVDQARQQRAQQ--QQQQQMMAMGMAAAQGAKTLS----EAKTSD-PSVLSAMANAV 551 (559) Q Consensus 483 ~~~~a~~~Gvp~----~~~rs~~ev~~~rq~r~q~--~q~~~~~~~~~~~~~~a~~~~----~~~~~~-~~~~~~~~~~~ 551 (559) +-..+|.|+ +.+...--+..+-+.-++. ..+.+.....+...+.+..-. +.+... .+.....-..- T Consensus 455 ---~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~ 531 (574) T protein:vir:80 455 ---IRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQ 531 (574) T ss_pred ---HHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhh Confidence 222234332 1111110000000000000 000000000000111111100 010000 00000111111 Q ss_pred hcCCCCCC Q lcl|NC_019445. 
552 SGQGGQSQ 559 (559) Q Consensus 552 ~~~~~~~~ 559 (559) .+..|..- T Consensus 532 ~~~~~~~~ 539 (574) T protein:vir:80 532 QGLNGKSK 539 (574) T ss_pred hhhccchh Confidence 11111110 No 190 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=32.53 E-value=1.4 Score=19.75 Aligned_cols=358 Identities=13% Similarity=0.132 Sum_probs=140.8 Q ss_pred hHHHHH-HHHHHHhccccC----CCCCC-CCCCccccc--CCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccch Q lcl|NC_019445. 22 SFEPHW-RELSDYINPRGS----RFLTS-EVNRNDRRN--TRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEM 93 (559) Q Consensus 22 ~~~~~w-~e~~~~~~P~~~----~~~~~-~~~~~~~~~--~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~ 93 (559) -|..+| ..-..-.-|... .+.+. .+..+..-+ +-+=.++--.|++.+|+.+.+ -||--....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~------lp~~~~~~~~~~- 73 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQ------LPVELYERSGDD- 73 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhcc------CceEEEEecCCC- Confidence 111111 000000011000 00000 000010000 001123333345555553332 245222211111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCC Q lcl|NC_019445. 94 MDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGS 167 (559) Q Consensus 94 ~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~ 167 (559) .+ .+ .+..+...|+ +-| .+.-.+..+.++.++|||++++..+. +.+..+.+++.+.+-+..+.+|. 
T Consensus 74 ~~--~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~ 145 (419) T protein:vir:80 74 RK--PA------TDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLK 145 (419) T ss_pred cc--cc------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCce Confidence 10 00 1122333343 222 23334566788999999999987654 34455666666776666655543 Q ss_pred EEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceee Q lcl|NC_019445. 168 VDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLL 247 (559) Q Consensus 168 vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il 247 (559) + +|+ ++ |.+ .+ T Consensus 146 ~--~y~---~~--------~~~-------------------------------------------------------~~- 156 (419) T protein:vir:80 146 P--MYR---VA--------GAD-------------------------------------------------------PL- 156 (419) T ss_pred E--EEE---Ec--------Ccc-------------------------------------------------------cc- Confidence 2 111 00 000 00 Q ss_pred eecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCc---cc-c---c-- Q lcl|NC_019445. 248 RESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLK---NQ-R---A-- 316 (559) Q Consensus 248 ~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~---~~-~---~-- 316 (559) ..-=+++.|+...+| .||.| |..-+...+.....+.+.......-...|..++ +.+.. .. . + T Consensus 157 -----~~~~i~h~~~~~~d~-~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~ 229 (419) T protein:vir:80 157 -----PQRLVHHVRWMSING-YTGLS-PVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITD 229 (419) T ss_pred -----chhheEEecCCCCCC-ccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHH Confidence 000034455555555 89999 888777777777777777777777777786554 33221 11 0 0 Q ss_pred --e------ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHH Q lcl|NC_019445. 
317 --S------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) Q Consensus 317 --~------~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~ 388 (559) . -..|++.+.+ ++-.++|+.. ++.-..+.+..+...+.|-.+|-..........++..-++++... T Consensus 230 ~~~~~~~g~~n~g~~~vl~---~g~~~~~l~~-s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~-- 303 (419) T protein:vir:80 230 GWNAKFGGSGNAKKVALLQ---EGMKFKPLSM-TNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL-- 303 (419) T ss_pred HHHHHhcCccccCCceecC---CCceEEeccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH-- Confidence 0 0123344442 2234566543 233334455566677889999987544432222222223333221 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeec-----HHHH---HHHHHH-----HHH Q lcl|NC_019445. 389 EEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYIS-----VMAQ---AQKSIG-----LSS 455 (559) Q Consensus 389 ~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is-----~La~---a~r~~~-----~~~ 455 (559) .=....|.|...+++.++-.-|+ ++ .+-.+-.+++.+.. ..++ ..+..+ .+- T Consensus 304 ~f~~~~l~P~~~~ie~~l~~kll-------------~~--~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE 368 (419) T protein:vir:80 304 QFVIYTLLPWVKRHEQAKTRDLL-------------LP--SERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSIND 368 (419) T ss_pred HHHHHHHHHHHHHHHHHHhhhcc-------------Cc--cccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 11223455655555555433222 11 01111112221111 1111 111110 000 Q ss_pred HHHHHH--------------HHHHHhccChhhH-hcCCHHHHHHHHHHHcC Q lcl|NC_019445. 456 LASTVN--------------FIGQLAQAKPEAL-DKLNVDQAIDAFADMSG 491 (559) Q Consensus 456 l~~~~~--------------~~~~la~~~P~~~-~~id~d~~~~~~a~~~G 491 (559) +...++ .+..+.+..|... +.=+.+..++.+-+.+. 
T Consensus 369 ~R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 419 (419) T protein:vir:80 369 IRRLENMPPVKGGDIYLSPMNMVDASKPQPIPMGKTEPTKAALDEIGRILS 419 (419) T ss_pred HHHHhCCCCCCCcceeeeccccccccccccccCCCCCchhhhHHHHHhhcC Confidence 111110 0111112222211 12233444444444444 No 191 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=28.61 E-value=1.7 Score=19.27 Aligned_cols=311 Identities=10% Similarity=0.030 Sum_probs=115.0 Q ss_pred ccCCCCCC---CCCCcccccCCCC---cchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHHHHH Q lcl|NC_019445. 37 RGSRFLTS---EVNRNDRRNTRII---DSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRM 110 (559) Q Consensus 37 ~~~~~~~~---~~~~~~~~~~~~~---~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~ 110 (559) .+...... ..+.+..+. ..| ++++...++ ++....-.+..|+.--++-..+++...+......+=..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-----y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k 74 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPIND-RTFSLSEITASPALD-----YVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSR 74 (345) T ss_pred CCccccccchhhhcCCCceE-EEeecCCcccchhhc-----ccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhh Confidence 11110000 001110000 001 223221111 111111123445543222223333322222221110000 Q ss_pred HHHHHhccc-------hHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecHHHH Q lcl|NC_019445. 111 NDMFNKSNL-------YQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTVRQL 182 (559) Q Consensus 111 ~~~l~~snf-------~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql 182 (559) ...+ .++| ...+.++..|+.+||||.+++..+. +.++.+.++|. .++.+..+|...-.++.+.. 
T Consensus 75 ~n~l-~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~--~~vr~~~d~~~~~~~~~~~~----- 146 (345) T protein:vir:37 75 ANMV-SATYEGGKALSKMEMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSS--LYLRVHKDGGYSYLMKKSLY----- 146 (345) T ss_pred hhHH-hhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecC--ceeEEeecCCeeEEEeeeee----- Confidence 0011 1222 2345677789999999999987764 45555544443 33333322221111110000 Q ss_pred HHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEEEEe Q lcl|NC_019445. 183 VQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRW 262 (559) Q Consensus 183 ~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~~rw 262 (559) ...+ ..+. |..--++++|. T Consensus 147 -------------------------------------------------------~~~g-~~~~-----~~~~eViHir~ 165 (345) T protein:vir:37 147 -------------------------------------------------------DTAQ-EIYR-----YDAKDIIFIKL 165 (345) T ss_pred -------------------------------------------------------ccCc-eEEE-----EccccEEEEcC Confidence 0000 0000 00011333443 Q ss_pred eecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC-Cccccce---------ecCC--ceeecCC Q lcl|NC_019445. 263 EVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS-LKNQRAS---------LLPG--DITYIDQ 328 (559) Q Consensus 263 ~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~-~~~~~~~---------~~pg--~~~~~~~ 328 (559) ....+..||.+ |..-++-.+-.-+..++-..+...-.+.|..++ ++. +.....+ -.+| +..++.. T Consensus 166 ~~~~~~~~Gl~-~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~ 244 (345) T protein:vir:37 166 YDPMQQVYGSP-DYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNI 244 (345) T ss_pred CCCCCCcccch-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEec Confidence 22335689987 655443333222222222222233345566543 332 2211111 1111 1111211 Q ss_pred cCC---chhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCC--CCCcCHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_019445. 
329 ITG---QDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNIN--TRSMPVEAVIEMKEEKLLMLGPVLERLN 403 (559) Q Consensus 329 ~~~---~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~--~~~~TA~Ei~~r~~e~~~~LG~v~~~l~ 403 (559) +++ +-.+.|+.....+.+ ..+..+..++.|-.+|-.....+....+ +.--++++... .+. T Consensus 245 ~~g~~~G~~~~pl~~~~~d~q-f~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~--------------~f~ 309 (345) T protein:vir:37 245 AGGHPDGLKVIPIGDTGTKDE-FANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE--------------VYH 309 (345) T ss_pred CCCCccceeEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHH--------------HHH Confidence 221 223556544332333 4455566778899999876444322111 11122222221 134 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeec-HHHH Q lcl|NC_019445. 404 DECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYIS-VMAQ 446 (559) Q Consensus 404 ~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is-~La~ 446 (559) ..-+.|++.++...+.+ +|+.+.. ..++|.- -|.+ T Consensus 310 ~~~l~P~~~~ie~~ln~---~~e~~~~-----~~i~F~~~~l~k 345 (345) T protein:vir:37 310 YDEVMPLQEIIAETINQ---DPEIKNL-----LKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHhhh---hhccCCc-----ceEEECchhhcC Confidence 45577888877777765 3333322 3333432 2222 No 192 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=25.52 E-value=2 Score=18.88 Aligned_cols=330 Identities=9% Similarity=0.040 Sum_probs=124.2 Q ss_pred cCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCCCCcceeccCCccchhhHHHHHHHHHHHH---------- Q lcl|NC_019445. 38 GSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQ---------- 107 (559) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve---------- 107 (559) .+-|..-.....+ ......+.... ....+.+ + |=.-.+......+...|..-.+.+. 
T Consensus 1 Mg~~~~~~~~~~~--~~~~~~~~~~~--------~~~~~~~-~--~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v 67 (385) T protein:vir:10 1 MGLLTPRNFNKRK--AKNMVYPSNPA--------FFTTTVG-G--MQLSYVSALSALQNTNVYSVINRIASDVASAHFKT 67 (385) T ss_pred Cccccchhccccc--ccccccccchh--------hhhhhcc-c--cCccccCHHHhhccHHHHHHHHHHHHHHhhCceee Confidence 3322211111111 11111111000 0111110 0 0000111111122223332222221 Q ss_pred --HHHHHHHHhccch----HHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeC--CCCCEEEEEEEEeecH Q lcl|NC_019445. 108 --NRMNDMFNKSNLY----QSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANS--PRGSVDICFRKFSMTV 179 (559) Q Consensus 108 --~~~~~~l~~snf~----~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d--~~G~vd~i~r~~~~t~ 179 (559) +.....|++-|-+ .=...++.++..+|||.+++..+. ...+|+....|... ..|. T Consensus 68 ~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~-----~~~~p~~~~~v~~~~~~~~~------------ 130 (385) T protein:vir:10 68 ENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN-----LEHIPNSDVQINYLPGNMGI------------ 130 (385) T ss_pred eccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc-----eeEeecCCceEEEEEcCCce------------ Confidence 1222233333322 224456678889999999987553 22244443332221 1110 Q ss_pred HHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEE Q lcl|NC_019445. 180 RQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMA 259 (559) Q Consensus 180 ~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~ 259 (559) ++..... .+..... |..--+++ T Consensus 131 ------------------------------~~~~~~~-----------------------~~~~~~~-----~~~~eiih 152 (385) T protein:vir:10 131 ------------------------------VYTVLES-----------------------NDRPQMV-----LRQDQMLH 152 (385) T ss_pred ------------------------------EEEEEEc-----------------------CCceEEE-----EccccEEE Confidence 0110000 0000000 00111344 Q ss_pred EEeeecC--CCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCcccc--------ce-e----cCCc Q lcl|NC_019445. 
260 PRWEVNG--EDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQR--------AS-L----LPGD 322 (559) Q Consensus 260 ~rw~~~~--g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~~--------~~-~----~pg~ 322 (559) +|....+ +..||.| |...+...+.......+.......-...|..++ ++.+.... ++ . ..|+ T Consensus 153 ik~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~ 231 (385) T protein:vir:10 153 FRLMPDPQYRYLIGRS-PLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGR 231 (385) T ss_pred eccCCCCccccccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 4432222 2468999 899998888888888888888888888887654 43332211 11 1 1222 Q ss_pred eeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_019445. 323 ITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERL 402 (559) Q Consensus 323 ~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l 402 (559) +.+. +++..++|+.....+.+.+.+..+.....|-++|-.... +++..+.+.-|...+.+........|.|.+.++ T Consensus 232 ~~vl---~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~-~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~i 307 (385) T protein:vir:10 232 LMVL---PDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSD-ILGGGTSTESQHSNIDQIKATYLANLNSYVNPI 307 (385) T ss_pred cccc---CCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHH-HcCCccCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 3322 223345655432223443334556667889999977433 333333333333333333333334556666666 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEee-----cH---HHHHHHHHHHH--HHHHHHHHH--HHHhcc- Q lcl|NC_019445. 403 NDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYI-----SV---MAQAQKSIGLS--SLASTVNFI--GQLAQA- 469 (559) Q Consensus 403 ~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~i-----s~---La~a~r~~~~~--~l~~~~~~~--~~la~~- 469 (559) .+|+-.-+ +. ..+++... +. ...+.+..+.. 
.....-..+ -.+-.- T Consensus 308 e~~l~~~l-------------~~--------~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~ 366 (385) T protein:vir:10 308 VDELRLKM-------------NA--------PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDN 366 (385) T ss_pred HHHHHHhh-------------CC--------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCC Confidence 66543222 11 01222221 11 22222222211 111111111 122100 Q ss_pred ChhhHhcC-------CHHH Q lcl|NC_019445. 470 KPEALDKL-------NVDQ 481 (559) Q Consensus 470 ~P~~~~~i-------d~d~ 481 (559) .+++.... +-|+ T Consensus 367 ~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 367 LPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred CccccCcccccCCCCCCCC Confidence 00100000 1111 No 193 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=23.59 E-value=2.3 Score=18.61 Aligned_cols=356 Identities=14% Similarity=0.111 Sum_probs=141.3 Q ss_pred CChhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccCC-CCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSR-FLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~~~~w~e~~~~~~P~~~~-~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.=- .++ |..-...|.. .....+..+.... ..+...+. ..-+=.++--.|++.+|+.+.+ T Consensus 1 Mgl~--~~~---f~~~~~~~~~-----~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~v~~ia~~iA~----- 61 (409) T protein:vir:84 1 MSLF--TRI---FSGPSEERTL-----TKISGIPSPAEDWAMHGDRPGA----NSAMTLGAFYACVTLLADTVAS----- 61 (409) T ss_pred Cchh--hhh---hcCCCccccc-----ccccccccccchhhccCcccch----hhhhccHHHHHHHHHHHHhhhh----- Confidence 4321 111 1111111111 0001111110000 00111000 0101123444556666665543 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHH-hcc----chHHHHHHHHHHHhhCcEEEEEee--cCCceEEEEE Q lcl|NC_019445. 
80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLE--DDEDIIRTMP 152 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~v~~--~~~~~~~~~~ 152 (559) -||.-+...+....+. +.+...|+ +-| .+.-+...+.++.++||+.+|+.. ..+.+..+.+ T Consensus 62 -lp~~~~~~~~~~~~~~-----------~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~ 129 (409) T protein:vir:84 62 -LSIDAYRKKDNVRIPV-----------SPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMP 129 (409) T ss_pred -CceEEEEecCCccccc-----------chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEE Confidence 2454333222211111 12233343 222 333455667788999999988753 2344455555 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ++...+.|....++....++ .++ T Consensus 130 l~p~~v~v~~~~~~~~~~~~----------------------------------------~~~----------------- 152 (409) T protein:vir:84 130 IHPDCIHVTDAKDEDGDWIE----------------------------------------PVY----------------- 152 (409) T ss_pred EcCceeEEEEcCCCcceEEE----------------------------------------EEe----------------- Confidence 65555544433222111000 000 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~ 310 (559) .. ++ +.+. .--+++.++....|..||.| |...+...+.......+.......-...|..++ +.. 
T Consensus 153 -----~~--~g-~~~~-----~~dvih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 218 (409) T protein:vir:84 153 -----RI--DG-KVVP-----NHRIMHIKRYPVAGCALGMS-PIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDAD 218 (409) T ss_pred -----cC--Cc-eEEc-----hhhEEEecCCCCCccccccc-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC Confidence 00 00 0000 01145555555667789999 898887777777777777777777777776554 443 Q ss_pred Ccccc--------ce--ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcC Q lcl|NC_019445. 311 LKNQR--------AS--LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMP 380 (559) Q Consensus 311 ~~~~~--------~~--~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~T 380 (559) +.... .. -..|++.+.+ + +..++++.. ++.-..+.+.....+..|-++|-...- .++..+....+ T Consensus 219 l~~e~~~~~~~~~~~~~~n~g~~~vl~--~-g~~~~~~~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~ 293 (409) T protein:vir:84 219 LTPDQVKQTQKQWIQSHHNRRLPAVMS--A-GIKWQSVSI-TPNESQFLETRSFQRSEIAMWFRIPPH-MIGDVEKSTSW 293 (409) T ss_pred CCHHHHHHHHHHHHHHhccCCCeeecC--C-CceEEEccC-ChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCcccc Confidence 32211 11 1233334332 2 223555542 222233445556777889999977543 33332222222 Q ss_pred HHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHH------------------H--HHHHHHhcCC-----------CCCCc Q lcl|NC_019445. 381 VEAVIEMKEE-KLLMLGPVLERLNDECLNPLID------------------R--AFSMMVRKNM-----------LPPPP 428 (559) Q Consensus 381 A~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~------------------r--~~~il~r~g~-----------lp~~p 428 (559) +.-+.+.... ....|.|.+..++++|-.-|.. | .+..+.+.|. +||+| T Consensus 294 ~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ 373 (409) T protein:vir:84 294 GTGIEEQGINFVRHTLLPWLRCIEQALDTFLPRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIP 373 (409) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 2223333222 3455778877777765322100 0 0122222222 12222 Q ss_pred hhhCCcce-EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHH Q lcl|NC_019445. 
429 DAMEGMPL-KVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQ 481 (559) Q Consensus 429 ~~l~g~~v-~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~ 481 (559) . |+.+ ...-.+++..+...+ . .-.|+--..-|..+ T Consensus 374 g---gD~~~~~~n~~~~~~~~~~~-------------~--~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 374 E---GDIHLQPMNFVPLGYVPPEE-------------P--AQEPQPNSATEGNK 409 (409) T ss_pred C---cceeeecccccccccCCccc-------------c--CcCCCCCCccCCCC Confidence 1 1110 000001111000000 0 00111111123333 No 194 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=21.35 E-value=2.6 Score=18.29 Aligned_cols=337 Identities=11% Similarity=0.090 Sum_probs=128.2 Q ss_pred cCCCCCCCCCCcccccCC-CCcchHHHHHHHHHHHHHHhhcC-CCCcceeccCCccchhhHHHHHHHHHHHHH------- Q lcl|NC_019445. 38 GSRFLTSEVNRNDRRNTR-IIDSTGTMAARTLASGMMSGITS-PARPWFRLATPDPEMMDYGPVKLWLEAVQN------- 108 (559) Q Consensus 38 ~~~~~~~~~~~~~~~~~~-~~~s~~~~a~~~Las~l~~~l~p-p~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~------- 108 (559) .+-|...... ++..+ ...+.... ....+++ -+..|. ......+...|..-.+.+.. T Consensus 1 Mg~~~~~~~~---k~~~~~~~~~~~~~--------~~~~~~~~~~~~~v----~~~~~l~~~~v~~~i~~ia~~ia~~~~ 65 (383) T protein:vir:10 1 MGLLTPKNFS---KRNAKNMVYPSNPA--------FFTTTVGGMQLSYV----SALSALQNTNVYSVINRIASDVSSAHF 65 (383) T ss_pred CCcccccccc---cccccccccccchh--------hhhhhccCcccccc----chhHhhcchHHHHHHHHHHHhhccCce Confidence 3333211100 10111 00010000 0011111 011121 11111122223222222211 Q ss_pred -----HHHHHHHhcc----chHHHHHHHHHHHhhCcEEEEEeecCCceEEEEEeeccEEEEeeCCCCCEEEEEEEEeecH Q lcl|NC_019445. 109 -----RMNDMFNKSN----LYQSLPQLYGSLGTYSTGAMAVLEDDEDIIRTMPFPIGSYYLANSPRGSVDICFRKFSMTV 179 (559) Q Consensus 109 -----~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~v~~~~~~~~~~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~ 179 (559) .....|++=| .+.-+..++.++..+|||.+++..+. 
...+|+....|....++ T Consensus 66 ~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~-----~~~~p~~~~~v~~~~~~------------- 127 (383) T protein:vir:10 66 KTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN-----LEHIPNSDVQINYLPGN------------- 127 (383) T ss_pred eecccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc-----eeEeecCcceEEEEEcC------------- Confidence 1222233222 23335667788889999999886542 12244444332211100 Q ss_pred HHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEEEEEEecCCCceeeeecCcccCCeEE Q lcl|NC_019445. 180 RQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMA 259 (559) Q Consensus 180 ~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~sv~~~~~~~~~~il~esg~~~~P~~~ 259 (559) +.. ++...+ ..++ ..+. |..--+++ T Consensus 128 -------------------------~~~--~~~~~~----------------------~~~~-~~~~-----~~~~evih 152 (383) T protein:vir:10 128 -------------------------MGI--VYTVLE----------------------SNDR-PKMV-----LRQDQMLH 152 (383) T ss_pred -------------------------Cce--EEEEEE----------------------cCCc-eEEE-----EcccceEE Confidence 000 010000 0000 0000 11122444 Q ss_pred EEeeecCC--CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCCccc-c-------ce-----ecCCc Q lcl|NC_019445. 260 PRWEVNGE--DVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSLKNQ-R-------AS-----LLPGD 322 (559) Q Consensus 260 ~rw~~~~g--~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~~~~-~-------~~-----~~pg~ 322 (559) +|....++ ..||.| |..-+...+.......+.......-...|..++ +++.... . ++ ...|+ T Consensus 153 ~r~~~~~~~~~~~G~s-~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~ 231 (383) T protein:vir:10 153 FRLMPDPQYRYLIGRS-PLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGR 231 (383) T ss_pred eccCCCCccccccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 55333333 368999 898888888888888888888888888887543 4333211 0 11 01223 Q ss_pred eeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCcCHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_019445. 
323 ITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERL 402 (559) Q Consensus 323 ~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~TA~Ei~~r~~e~~~~LG~v~~~l 402 (559) +.+.+ ++..++|+.....+.+.+.+..+..+..|-.+|-.... .++..+....|...+.+........|.|....+ T Consensus 232 ~~vl~---~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~-~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~i 307 (383) T protein:vir:10 232 LMVLP---DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSD-ILGGGTSTESQHSNIDQIKATYLANLNSYVNPI 307 (383) T ss_pred ccccC---CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHH-HcCCccCCCCccccHHHHHHHHHHHHHHHHHHH Confidence 33332 23345665432223343345556778889999977533 333333333343333333333334455555555 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHH Q lcl|NC_019445. 403 NDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQAKPEALDKLNVDQA 482 (559) Q Consensus 403 ~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~~~~~~~~la~~~P~~~~~id~d~~ 482 (559) +.++- +. ++ +..+++.+...+. .-.....+.+...+ +.+ -+..+++ T Consensus 308 e~~l~------------~~-l~--------~~~~~f~~~~l~~-~d~~~~~~~~~~~~-------~~G-----~~t~nE~ 353 (383) T protein:vir:10 308 VDELR------------LK-MN--------APDLELDIKDMLD-VDDSILINQVSNLA-------KSG-----VLGAEQA 353 (383) T ss_pred HHHHH------------Hh-hC--------CceEEeechhhhc-cCHHHHHHHHHHHH-------hCC-----CcCHHHH Confidence 55432 21 11 1235554443321 11111111111111 111 1222222 Q ss_pred HHHHHHHcCCCc-cccC----------CHHH Q lcl|NC_019445. 483 IDAFADMSGVSP-TVIV----------PQEQ 502 (559) Q Consensus 483 ~~~~a~~~Gvp~-~~~r----------s~~e 502 (559) -+. ....++|. .... 
-++| T Consensus 354 R~~-lg~~p~~~~d~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 354 QFI-LTRSGFLPDNLPEFKPLTNETKGGDDK 383 (383) T ss_pred HHH-hCCCcccCCcccccCCCcccCCCCCCC Confidence 221 11112221 1111 0122 No 195 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=21.19 E-value=2.6 Score=18.27 Aligned_cols=333 Identities=13% Similarity=0.156 Sum_probs=125.7 Q ss_pred CChhhHHHHHHHHHHHHHHhhh-HHHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhcCC Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQS-FEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSP 79 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~pp 79 (559) |.=-. .+.+ |+. -...|. .+.........+...+. .+-+-.++--.|++.+|+.+-+ T Consensus 1 M~~~~--~f~~--------r~~~~~~~~~---~~~~~~~~~~~~~~v~~----~~al~~~av~~cv~~ia~~ia~----- 58 (359) T protein:vir:10 1 MSILN--PFER--------RSSITPNNYY---PFMVQNGSIVPNSLVDA----TEALKNSDLYAVTSLISSDIAG----- 58 (359) T ss_pred Ccccc--hhhc--------cccCCCCcch---hhhhccccccCCcccCH----HHhhcchHHHHHHHHHHHhhhc----- Confidence 33211 0111 110 001111 11110000000111110 0111123333456666554432 Q ss_pred CCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhccchH----HHHHHHHHHHhhCcEEEEEeecC-CceEEEEEee Q lcl|NC_019445. 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQ----SLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMPFP 154 (559) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~----~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~~~ 154 (559) .|+- + .......+.+=|-+. =....+.++..+|||.+++..+. 
+.+..+.++| T Consensus 59 -~p~~----------~-----------~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~ 116 (359) T protein:vir:10 59 -TRFI----------G-----------NQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIP 116 (359) T ss_pred -Cccc----------c-----------chHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeC Confidence 2321 0 001111222323222 23456678888999999887654 3345555566 Q ss_pred ccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccEEE Q lcl|NC_019445. 155 IGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKS 234 (559) Q Consensus 155 l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~s 234 (559) ...+.+..+.++ ++.++... .+. ...+ ++ T Consensus 117 ~~~v~i~~~~~~----~~y~~~~~-----------------------~~~-~~~~-----~~------------------ 145 (359) T protein:vir:10 117 SNAITIDLTDDT----LTYEVNQF-----------------------DDY-PSAK-----YN------------------ 145 (359) T ss_pred CceEEEEEcCCe----EEEEEEec-----------------------CCc-eEEE-----Ec------------------ Confidence 565555544321 11110000 000 0000 00 Q ss_pred EEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCCC- Q lcl|NC_019445. 235 VYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTSL- 311 (559) Q Consensus 235 v~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~~- 311 (559) .++ |++ +.+........+ ..||.| |...+...+.......+.......-...|..++ |.+. T Consensus 146 -------~~e-vih------~~~~~~~~~~~d-g~~G~s-pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l 209 (359) T protein:vir:10 146 -------ASE-MIH------VKIMAYGVDTLH-NLVGHS-PLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTL 209 (359) T ss_pred -------ccc-eEE------eccCCCCCCccC-cccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC Confidence 001 111 111111111223 368999 898877777777777777777776677776543 4332 Q ss_pred ccc-------cceec-----CCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCCc Q lcl|NC_019445. 
312 KNQ-------RASLL-----PGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSM 379 (559) Q Consensus 312 ~~~-------~~~~~-----pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~~ 379 (559) ... .++-. .|++... +++-.++|+.. ++....+.+..+.....|-++|-..+.. ++..+...- T Consensus 210 ~~e~~~~~~~~~~~~~~~~n~g~~~vl---~~g~~~~~l~~-~~~d~q~le~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~ 284 (359) T protein:vir:10 210 SSEAKDSIRKEFEKANGGNNSGRVMVL---DQSADFSTVSI-NADVANYLNSMNWGRTQIAKAFGVSDSY-LNGTGDQQS 284 (359) T ss_pred CHHHHHHHHHHHHHHhCccccCCceec---CCCcceeeecC-CHHHHHHHHHHHHHHHHHHHHhCCCHHH-hCCCCcccc Confidence 111 11111 1222222 22334566532 3333444566667788899999876444 333333444 Q ss_pred CHHHHHHHHHH-HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHH--HHHHHHHHH-- Q lcl|NC_019445. 380 PVEAVIEMKEE-KLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMA--QAQKSIGLS-- 454 (559) Q Consensus 380 TA~Ei~~r~~e-~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La--~a~r~~~~~-- 454 (559) |...+.+...+ ....|.|..++|+..|...+ ....+.+ +++.+.+- .+.+..... T Consensus 285 ~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~-------~~~~~~~-------------~~~d~~~~~~~~~~~~~~G~~ 344 (359) T protein:vir:10 285 SLDQIKDLYVNALNRFIEPLISELRIKCDSSI-------GVDMSPI-------------TDYSNSVFKADILNWVKEGII 344 (359) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-------cccchhh-------------hhcCHHHHHHHHHHHHhCCCc Confidence 65555443322 22234444444433332211 0000000 11111110 011111110 Q ss_pred HHHHHHHHHHHHhccC Q lcl|NC_019445. 
455 SLASTVNFIGQLAQAK 470 (559) Q Consensus 455 ~l~~~~~~~~~la~~~ 470 (559) .....-..+ .+.++- T Consensus 345 t~NE~R~~l-~~~pv~ 359 (359) T protein:vir:10 345 EPTEAKTLL-ESKGII 359 (359) T ss_pred CHHHHHHHh-CCCCCC Confidence 011111111 222333 No 196 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=20.94 E-value=2.7 Score=18.23 Aligned_cols=370 Identities=13% Similarity=0.090 Sum_probs=148.0 Q ss_pred CChhhHHHHHHHHHHHHHHhhhH---HHHHHHHHHHhccccCCCCCCCCCCcccccCCCCcchHHHHHHHHHHHHHHhhc Q lcl|NC_019445. 1 MAETTKERLNKQFAQLESERQSF---EPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) Q Consensus 1 M~~~~~~~l~~r~~~l~~~R~~~---~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~~~~~s~~~~a~~~Las~l~~~l~ 77 (559) |. .|+.++...+.. .+.|-+. ..... .+...+. ..-+=.++-..|++.+|+.+.+ . T Consensus 1 M~---------~f~~~~~~~~~~~~~~~~~~~~---~~~~~---~~~~v~~----~~al~~~~V~~~v~~ia~~ia~--~ 59 (397) T protein:vir:38 1 MP---------LLKLNKSHSQGFSLNDPDWVNF---LTGGE---AQKYVSA----DTALKNSDIFSLIMQLSGDLAM--V 59 (397) T ss_pred Cc---------chhhhhcccCcccCCchhhhhh---hcCCc---CCceech----HHhhccHHHHHHHHHHHHHHhh--C Confidence 32 122222211111 1222211 10000 0000110 0111234455567666665532 2 Q ss_pred CCCCcceeccCCccchhhHHHHHHHHHHHHHHHHHHHHhc----cchHHHHHHHHHHHhhCcEEEEEeecC-CceEEEEE Q lcl|NC_019445. 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKS----NLYQSLPQLYGSLGTYSTGAMAVLEDD-EDIIRTMP 152 (559) Q Consensus 78 pp~~~Wf~l~~~d~~~~~~~~v~~~l~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~v~~~~-~~~~~~~~ 152 (559) || ...++ .. ...+.+- +.+.-+..++.++.++|||.+++..+. 
+.++.+.+ T Consensus 60 ----p~---~~~~~------~~-----------~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~ 115 (397) T protein:vir:38 60 ----RY---TSESD------RS-----------QSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEY 115 (397) T ss_pred ----cc---ccccc------HH-----------HHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEE Confidence 23 11111 11 1122222 344456677889999999999887654 45567777 Q ss_pred eeccEEEEeeCCCCCEEEEEEEEeecHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeecCcccccccccccccE Q lcl|NC_019445. 153 FPIGSYYLANSPRGSVDICFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) Q Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~ 232 (559) ++...+-+..+.+|.. ++.+++.. . ......++ ++ T Consensus 116 l~~~~v~i~~~~~~~~--~~y~~~~~---------~-------------~~~~~~~~-----~~---------------- 150 (397) T protein:vir:38 116 LRPSQVQPMLLQDGSG--LIYNINFD---------E-------------PAIGYMEN-----VP---------------- 150 (397) T ss_pred EcCceeEEEEcCCCce--EEEEEEec---------c-------------ccccceeE-----ec---------------- Confidence 8888877777666542 11111100 0 00000000 00 Q ss_pred EEEEEEecCCCceeeeecCcccCCeEEEEeeecCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCC Q lcl|NC_019445. 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVA--PTS 310 (559) Q Consensus 233 ~sv~~~~~~~~~~il~esg~~~~P~~~~rw~~~~g~~YGrG~P~~~~l~d~~~L~~l~~~~~~~~~~~~~p~~~~--p~~ 310 (559) .. =+++.|.....+..||.| |...+...+.......+.......-...|..++ +.. T Consensus 151 ---------~~------------eiih~~~~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~ 208 (397) T protein:vir:38 151 ---------AA------------DVIHIRLLSKNGGKTGIS-PLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG 208 (397) T ss_pred ---------Cc------------cEEEecCCCCCCcccccc-HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC Confidence 00 033444444566689999 899888888888888888888777777787554 333 Q ss_pred Ccccc-------ce-----ecCCceeecCCcCCchhhhhhhhccccHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCC Q lcl|NC_019445. 
311 LKNQR-------AS-----LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) Q Consensus 311 ~~~~~-------~~-----~~pg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~i~~~~~rI~~af~~dl~~~~~~~~~~~ 378 (559) +.... ++ -..|+....+ ++..++++.. ++....+.+..+..+..|-.+|-....... ..... T Consensus 209 ~~~e~~~~~~~~~~~~~~~~n~~~~~vl~---~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg-~~~~~- 282 (397) T protein:vir:38 209 GLLDAETRIARSKEISKQIHNSDGPVVID---ALEDYKPLEV-KGNIASLLNQVDWTRDQIAKVYGVPDSYLN-GQGDQ- 282 (397) T ss_pred CCHHHHHHHHHHHHHHhcccccCCceecC---CCceEEecCC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCc- Confidence 32211 11 1133333332 2233555542 234444556677888999999987544432 22221 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhCCcceEEEeecHHHHHHHHHHHHHHHH Q lcl|NC_019445. 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRAFSMMVRKNMLPPPPDAMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) Q Consensus 379 ~TA~Ei~~r~~e~~~~LG~v~~~l~~E~l~Pli~r~~~il~r~g~lp~~p~~l~g~~v~~~~is~La~a~r~~~~~~l~~ 458 (559) .+..| +...-....|-|.+..++.| +.+. ++++. + +++++. +. + +.... T Consensus 283 ~~~~e--~~~~~~~~~l~P~~~~ie~~------------ln~~-l~~~~--~-----~~~~~~--~~-~----d~~~~-- 331 (397) T protein:vir:38 283 QSSIT--QISGQYAKSLNRYVQAIVGE------------LNDK-LHANI--S-----ANIRFA--ID-A----MGDQY-- 331 (397) T ss_pred ccHHH--HHHHHHHHHHHHHHHHHHHH------------HHHh-ccChh--c-----cccccc--cc-C----CHHHH-- Confidence 22222 11222223344444444444 3331 22221 1 112111 00 0 01111 Q ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCCc---cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-- Q lcl|NC_019445. 459 TVNFIGQLAQAKPEALDKLNVDQAIDAFADMSGVSP---TVIVPQEQVDQARQQRAQQQQQQQMMAMGMAAAQGAKTL-- 533 (559) Q Consensus 459 ~~~~~~~la~~~P~~~~~id~d~~~~~~a~~~Gvp~---~~~rs~~ev~~~rq~r~q~~q~~~~~~~~~~~~~~a~~~-- 533 (559) .+.+..+.+.+ -+.++++-. .+|.|+ .=+...+.. ... ........++.... 
T Consensus 332 -~~~~~~~~~~G-----~~t~nE~R~----~lg~~p~~~~d~~~~~~~---------~~~----~~~~~~~~~g~~~~~~ 388 (397) T protein:vir:38 332 -ASTISSSVKGG-----TIAGNQARF----ILQNSGYLAKDLPDPEKE---------PQQ----AIQLIQQEGGENDGNN 388 (397) T ss_pred -HHHHHHHHhCC-----CcCHHHHHH----HhCCCCCCCCcccccccc---------ccc----cccccccccCCCCCCC Confidence 11111111111 133333332 234432 101100000 000 00000000000000 Q ss_pred hhhcCCChh Q lcl|NC_019445. 534 SEAKTSDPS 542 (559) Q Consensus 534 ~~~~~~~~~ 542 (559) ......+|+ T Consensus 389 ~~e~~~~~~ 397 (397) T protein:vir:38 389 SDERGSDPE 397 (397) T ss_pred CCCCCCCCC Confidence 001111111 Done!